CA1320162C - Method for stabilizing heterologous protein expression and vectors for use therein - Google Patents
Method for stabilizing heterologous protein expression and vectors for use thereinInfo
- Publication number
- CA1320162C CA1320162C CA000607908A CA607908A CA1320162C CA 1320162 C CA1320162 C CA 1320162C CA 000607908 A CA000607908 A CA 000607908A CA 607908 A CA607908 A CA 607908A CA 1320162 C CA1320162 C CA 1320162C
- Authority
- CA
- Canada
- Prior art keywords
- cat
- protein
- gene
- expression
- gene sequence
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Expired - Fee Related
Links
- 108090000623 proteins and genes Proteins 0.000 title claims abstract description 244
- 102000004169 proteins and genes Human genes 0.000 title claims abstract description 158
- 230000014509 gene expression Effects 0.000 title claims abstract description 75
- 239000013598 vector Substances 0.000 title claims abstract description 46
- 238000000034 method Methods 0.000 title claims abstract description 32
- 230000000087 stabilizing effect Effects 0.000 title claims abstract description 8
- 108010035563 Chloramphenicol O-acetyltransferase Proteins 0.000 claims abstract description 105
- 238000003776 cleavage reaction Methods 0.000 claims abstract description 48
- 230000007017 scission Effects 0.000 claims abstract description 48
- 230000001580 bacterial effect Effects 0.000 claims abstract description 31
- 108020001507 fusion proteins Proteins 0.000 claims abstract description 30
- 102000037865 fusion proteins Human genes 0.000 claims abstract description 30
- 235000018102 proteins Nutrition 0.000 claims description 141
- 235000001014 amino acid Nutrition 0.000 claims description 71
- 229940024606 amino acid Drugs 0.000 claims description 69
- 150000001413 amino acids Chemical class 0.000 claims description 69
- 101150055766 cat gene Proteins 0.000 claims description 49
- 108090000765 processed proteins & peptides Proteins 0.000 claims description 49
- 102000004196 processed proteins & peptides Human genes 0.000 claims description 38
- 229920001184 polypeptide Polymers 0.000 claims description 34
- 241000588724 Escherichia coli Species 0.000 claims description 33
- 102000007620 Pulmonary Surfactant-Associated Protein C Human genes 0.000 claims description 29
- 239000013604 expression vector Substances 0.000 claims description 29
- 108020004414 DNA Proteins 0.000 claims description 28
- 108010007125 Pulmonary Surfactant-Associated Protein C Proteins 0.000 claims description 28
- 108010007131 Pulmonary Surfactant-Associated Protein B Proteins 0.000 claims description 21
- QIVBCDIJIAJPQS-UHFFFAOYSA-N Tryptophan Natural products C1=CC=C2C(CC(N)C(O)=O)=CNC2=C1 QIVBCDIJIAJPQS-UHFFFAOYSA-N 0.000 claims description 14
- QIVBCDIJIAJPQS-VIFPVBQESA-N L-tryptophane Chemical compound C1=CC=C2C(C[C@H](N)C(O)=O)=CNC2=C1 QIVBCDIJIAJPQS-VIFPVBQESA-N 0.000 claims description 13
- 108090000059 Complement factor D Proteins 0.000 claims description 12
- 102000003706 Complement factor D Human genes 0.000 claims description 12
- 108020004705 Codon Proteins 0.000 claims description 10
- 108091028043 Nucleic acid sequence Proteins 0.000 claims description 9
- 239000000126 substance Substances 0.000 claims description 9
- 102000009091 Amyloidogenic Proteins Human genes 0.000 claims description 8
- 108010048112 Amyloidogenic Proteins Proteins 0.000 claims description 8
- 230000017854 proteolysis Effects 0.000 claims description 8
- WHUUTDBJXJRKMK-UHFFFAOYSA-N Glutamic acid Natural products OC(=O)C(N)CCC(O)=O WHUUTDBJXJRKMK-UHFFFAOYSA-N 0.000 claims description 7
- 101500016415 Lophius americanus Glucagon-like peptide 1 Proteins 0.000 claims description 6
- 125000001360 methionine group Chemical group N[C@@H](CCSC)C(=O)* 0.000 claims description 6
- 238000013519 translation Methods 0.000 claims description 5
- 239000004471 Glycine Substances 0.000 claims description 3
- MTCFGRXMJLQNBG-UHFFFAOYSA-N Serine Natural products OCC(N)C(O)=O MTCFGRXMJLQNBG-UHFFFAOYSA-N 0.000 claims description 3
- 230000004075 alteration Effects 0.000 claims description 3
- 230000001413 cellular effect Effects 0.000 claims description 3
- 238000012258 culturing Methods 0.000 claims description 3
- 230000002708 enhancing effect Effects 0.000 claims description 3
- 238000009877 rendering Methods 0.000 claims description 3
- 125000000151 cysteine group Chemical group N[C@@H](CS)C(=O)* 0.000 claims description 2
- 230000006872 improvement Effects 0.000 claims description 2
- 125000000430 tryptophan group Chemical group [H]N([H])C(C(=O)O*)C([H])([H])C1=C([H])N([H])C2=C([H])C([H])=C([H])C([H])=C12 0.000 claims description 2
- 229930182817 methionine Natural products 0.000 claims 2
- 101800000224 Glucagon-like peptide 1 Proteins 0.000 claims 1
- 102400000322 Glucagon-like peptide 1 Human genes 0.000 claims 1
- DTHNMHAUYICORS-KTKZVXAJSA-N Glucagon-like peptide 1 Chemical compound C([C@@H](C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C)C(=O)N[C@@H](CC=1C2=CC=CC=C2NC=1)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCCCN)C(=O)NCC(=O)N[C@@H](CCCNC(N)=N)C(N)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](C)NC(=O)[C@H](C)NC(=O)[C@H](CCC(N)=O)NC(=O)CNC(=O)[C@H](CCC(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@H](CC=1C=CC(O)=CC=1)NC(=O)[C@H](CO)NC(=O)[C@H](CO)NC(=O)[C@@H](NC(=O)[C@H](CC(O)=O)NC(=O)[C@H](CO)NC(=O)[C@@H](NC(=O)[C@H](CC=1C=CC=CC=1)NC(=O)[C@@H](NC(=O)CNC(=O)[C@H](CCC(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CC=1N=CNC=1)[C@@H](C)O)[C@@H](C)O)C(C)C)C1=CC=CC=C1 DTHNMHAUYICORS-KTKZVXAJSA-N 0.000 claims 1
- WHUUTDBJXJRKMK-VKHMYHEASA-N L-glutamic acid Chemical compound OC(=O)[C@@H](N)CCC(O)=O WHUUTDBJXJRKMK-VKHMYHEASA-N 0.000 claims 1
- AGPKZVBTJJNPAG-WHFBIAKZSA-N L-isoleucine Chemical compound CC[C@H](C)[C@H](N)C(O)=O AGPKZVBTJJNPAG-WHFBIAKZSA-N 0.000 claims 1
- ROHFNLRQFUQHCH-YFKPBYRVSA-N L-leucine Chemical compound CC(C)C[C@H](N)C(O)=O ROHFNLRQFUQHCH-YFKPBYRVSA-N 0.000 claims 1
- FFEARJCKVFRZRR-BYPYZUCNSA-N L-methionine Chemical compound CSCC[C@H](N)C(O)=O FFEARJCKVFRZRR-BYPYZUCNSA-N 0.000 claims 1
- OUYCCCASQSFEME-QMMMGPOBSA-N L-tyrosine Chemical compound OC(=O)[C@@H](N)CC1=CC=C(O)C=C1 OUYCCCASQSFEME-QMMMGPOBSA-N 0.000 claims 1
- ROHFNLRQFUQHCH-UHFFFAOYSA-N Leucine Natural products CC(C)CC(N)C(O)=O ROHFNLRQFUQHCH-UHFFFAOYSA-N 0.000 claims 1
- 235000018417 cysteine Nutrition 0.000 claims 1
- XUJNEKJLAYXESH-UHFFFAOYSA-N cysteine Natural products SCC(N)C(O)=O XUJNEKJLAYXESH-UHFFFAOYSA-N 0.000 claims 1
- 235000013922 glutamic acid Nutrition 0.000 claims 1
- 239000004220 glutamic acid Substances 0.000 claims 1
- 229960000310 isoleucine Drugs 0.000 claims 1
- AGPKZVBTJJNPAG-UHFFFAOYSA-N isoleucine Natural products CCC(C)C(N)C(O)=O AGPKZVBTJJNPAG-UHFFFAOYSA-N 0.000 claims 1
- OUYCCCASQSFEME-UHFFFAOYSA-N tyrosine Natural products OC(=O)C(N)CC1=CC=C(O)C=C1 OUYCCCASQSFEME-UHFFFAOYSA-N 0.000 claims 1
- 230000004927 fusion Effects 0.000 abstract description 25
- 238000000746 purification Methods 0.000 abstract description 6
- 241000894006 Bacteria Species 0.000 abstract description 5
- 238000002955 isolation Methods 0.000 abstract 1
- 239000013612 plasmid Substances 0.000 description 104
- 210000004027 cell Anatomy 0.000 description 56
- 239000012634 fragment Substances 0.000 description 54
- 229960000723 ampicillin Drugs 0.000 description 27
- AVKUERGKIZMTKX-NJBDSQKTSA-N ampicillin Chemical compound C1([C@@H](N)C(=O)N[C@H]2[C@H]3SC([C@@H](N3C2=O)C(O)=O)(C)C)=CC=CC=C1 AVKUERGKIZMTKX-NJBDSQKTSA-N 0.000 description 27
- 239000000499 gel Substances 0.000 description 22
- 125000002924 primary amino group Chemical group [H]N([H])* 0.000 description 19
- 102000012410 DNA Ligases Human genes 0.000 description 17
- 108010061982 DNA Ligases Proteins 0.000 description 17
- 102100032617 Pulmonary surfactant-associated protein B Human genes 0.000 description 17
- 238000000246 agarose gel electrophoresis Methods 0.000 description 16
- 229920002401 polyacrylamide Polymers 0.000 description 15
- 239000004098 Tetracycline Substances 0.000 description 13
- 239000002773 nucleotide Substances 0.000 description 13
- 125000003729 nucleotide group Chemical group 0.000 description 13
- 229960002180 tetracycline Drugs 0.000 description 13
- 229930101283 tetracycline Natural products 0.000 description 13
- 235000019364 tetracycline Nutrition 0.000 description 13
- 150000003522 tetracyclines Chemical class 0.000 description 13
- 238000010276 construction Methods 0.000 description 12
- 210000003000 inclusion body Anatomy 0.000 description 12
- 229960004799 tryptophan Drugs 0.000 description 12
- 125000003275 alpha amino acid group Chemical group 0.000 description 11
- 238000003780 insertion Methods 0.000 description 11
- 230000037431 insertion Effects 0.000 description 11
- 241001288713 Escherichia coli MC1061 Species 0.000 description 10
- AVXURJPOCDRRFD-UHFFFAOYSA-N Hydroxylamine Chemical compound ON AVXURJPOCDRRFD-UHFFFAOYSA-N 0.000 description 10
- 108091034117 Oligonucleotide Proteins 0.000 description 10
- 238000010367 cloning Methods 0.000 description 10
- QFVHZQCOUORWEI-UHFFFAOYSA-N 4-[(4-anilino-5-sulfonaphthalen-1-yl)diazenyl]-5-hydroxynaphthalene-2,7-disulfonic acid Chemical compound C=12C(O)=CC(S(O)(=O)=O)=CC2=CC(S(O)(=O)=O)=CC=1N=NC(C1=CC=CC(=C11)S(O)(=O)=O)=CC=C1NC1=CC=CC=C1 QFVHZQCOUORWEI-UHFFFAOYSA-N 0.000 description 7
- 102000002260 Alkaline Phosphatase Human genes 0.000 description 7
- 108020004774 Alkaline Phosphatase Proteins 0.000 description 7
- 108010051815 Glutamyl endopeptidase Proteins 0.000 description 7
- 230000000295 complement effect Effects 0.000 description 7
- 238000001962 electrophoresis Methods 0.000 description 7
- 238000002135 phase contrast microscopy Methods 0.000 description 7
- 230000006337 proteolytic cleavage Effects 0.000 description 7
- 238000010186 staining Methods 0.000 description 7
- 241001302584 Escherichia coli str. K-12 substr. W3110 Species 0.000 description 6
- 101001032756 Rattus norvegicus Granzyme-like protein 1 Proteins 0.000 description 6
- 239000000872 buffer Substances 0.000 description 6
- 230000006698 induction Effects 0.000 description 6
- PLVPPLCLBIEYEA-AATRIKPKSA-N (E)-3-(indol-3-yl)acrylic acid Chemical compound C1=CC=C2C(/C=C/C(=O)O)=CNC2=C1 PLVPPLCLBIEYEA-AATRIKPKSA-N 0.000 description 5
- 101000612671 Homo sapiens Pulmonary surfactant-associated protein C Proteins 0.000 description 5
- 108700005078 Synthetic Genes Proteins 0.000 description 5
- JLCPHMBAVCMARE-UHFFFAOYSA-N [3-[[3-[[3-[[3-[[3-[[3-[[3-[[3-[[3-[[3-[[3-[[5-(2-amino-6-oxo-1H-purin-9-yl)-3-[[3-[[3-[[3-[[3-[[3-[[5-(2-amino-6-oxo-1H-purin-9-yl)-3-[[5-(2-amino-6-oxo-1H-purin-9-yl)-3-hydroxyoxolan-2-yl]methoxy-hydroxyphosphoryl]oxyoxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(5-methyl-2,4-dioxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxyoxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(5-methyl-2,4-dioxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(5-methyl-2,4-dioxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(5-methyl-2,4-dioxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methyl [5-(6-aminopurin-9-yl)-2-(hydroxymethyl)oxolan-3-yl] hydrogen phosphate Polymers Cc1cn(C2CC(OP(O)(=O)OCC3OC(CC3OP(O)(=O)OCC3OC(CC3O)n3cnc4c3nc(N)[nH]c4=O)n3cnc4c3nc(N)[nH]c4=O)C(COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3CO)n3cnc4c(N)ncnc34)n3ccc(N)nc3=O)n3cnc4c(N)ncnc34)n3ccc(N)nc3=O)n3ccc(N)nc3=O)n3ccc(N)nc3=O)n3cnc4c(N)ncnc34)n3cnc4c(N)ncnc34)n3cc(C)c(=O)[nH]c3=O)n3cc(C)c(=O)[nH]c3=O)n3ccc(N)nc3=O)n3cc(C)c(=O)[nH]c3=O)n3cnc4c3nc(N)[nH]c4=O)n3cnc4c(N)ncnc34)n3cnc4c(N)ncnc34)n3cnc4c(N)ncnc34)n3cnc4c(N)ncnc34)O2)c(=O)[nH]c1=O JLCPHMBAVCMARE-UHFFFAOYSA-N 0.000 description 5
- 238000009835 boiling Methods 0.000 description 5
- 239000002299 complementary DNA Substances 0.000 description 5
- 230000002209 hydrophobic effect Effects 0.000 description 5
- 229940124276 oligodeoxyribonucleotide Drugs 0.000 description 5
- 239000008188 pellet Substances 0.000 description 5
- 108091026890 Coding region Proteins 0.000 description 4
- 102000004594 DNA Polymerase I Human genes 0.000 description 4
- 108010017826 DNA Polymerase I Proteins 0.000 description 4
- DCXYFEDJOCDNAF-REOHCLBHSA-N L-asparagine Chemical compound OC(=O)[C@@H](N)CC(N)=O DCXYFEDJOCDNAF-REOHCLBHSA-N 0.000 description 4
- DBMJMQXJHONAFJ-UHFFFAOYSA-M Sodium laurylsulphate Chemical compound [Na+].CCCCCCCCCCCCOS([O-])(=O)=O DBMJMQXJHONAFJ-UHFFFAOYSA-M 0.000 description 4
- 229960005091 chloramphenicol Drugs 0.000 description 4
- WIIZWVCIJKGZOK-RKDXNWHRSA-N chloramphenicol Chemical compound ClC(Cl)C(=O)N[C@H](CO)[C@H](O)C1=CC=C([N+]([O-])=O)C=C1 WIIZWVCIJKGZOK-RKDXNWHRSA-N 0.000 description 4
- 230000000875 corresponding effect Effects 0.000 description 4
- ATDGTVJJHBUTRL-UHFFFAOYSA-N cyanogen bromide Chemical compound BrC#N ATDGTVJJHBUTRL-UHFFFAOYSA-N 0.000 description 4
- 230000029087 digestion Effects 0.000 description 4
- 102000047087 human SFTPC Human genes 0.000 description 4
- PLVPPLCLBIEYEA-UHFFFAOYSA-N indoleacrylic acid Natural products C1=CC=C2C(C=CC(=O)O)=CNC2=C1 PLVPPLCLBIEYEA-UHFFFAOYSA-N 0.000 description 4
- 229940066294 lung surfactant Drugs 0.000 description 4
- 239000003580 lung surfactant Substances 0.000 description 4
- 230000004048 modification Effects 0.000 description 4
- 238000012986 modification Methods 0.000 description 4
- 238000002741 site-directed mutagenesis Methods 0.000 description 4
- 241000725101 Clea Species 0.000 description 3
- 241001455191 Dasyurus albopunctatus Species 0.000 description 3
- 101100364969 Dictyostelium discoideum scai gene Proteins 0.000 description 3
- 101000737554 Homo sapiens Complement factor D Proteins 0.000 description 3
- 101001086862 Homo sapiens Pulmonary surfactant-associated protein B Proteins 0.000 description 3
- 102000003960 Ligases Human genes 0.000 description 3
- 108090000364 Ligases Proteins 0.000 description 3
- 101100364971 Mus musculus Scai gene Proteins 0.000 description 3
- 125000003178 carboxy group Chemical group [H]OC(*)=O 0.000 description 3
- 239000000835 fiber Substances 0.000 description 3
- 102000045507 human CFD Human genes 0.000 description 3
- 230000035772 mutation Effects 0.000 description 3
- 238000011084 recovery Methods 0.000 description 3
- 210000003705 ribosome Anatomy 0.000 description 3
- 241000894007 species Species 0.000 description 3
- 238000006467 substitution reaction Methods 0.000 description 3
- 239000004094 surface-active agent Substances 0.000 description 3
- KLKHFFMNGWULBN-VKHMYHEASA-N Asn-Gly Chemical compound NC(=O)C[C@H](N)C(=O)NCC(O)=O KLKHFFMNGWULBN-VKHMYHEASA-N 0.000 description 2
- 241000283690 Bos taurus Species 0.000 description 2
- 108090000204 Dipeptidase 1 Proteins 0.000 description 2
- 241000282412 Homo Species 0.000 description 2
- 102000016943 Muramidase Human genes 0.000 description 2
- 108010014251 Muramidase Proteins 0.000 description 2
- 108010062010 N-Acetylmuramoyl-L-alanine Amidase Proteins 0.000 description 2
- JRNVZBWKYDBUCA-UHFFFAOYSA-N N-chlorosuccinimide Chemical compound ClN1C(=O)CCC1=O JRNVZBWKYDBUCA-UHFFFAOYSA-N 0.000 description 2
- 125000000729 N-terminal amino-acid group Chemical group 0.000 description 2
- 102000000528 Pulmonary Surfactant-Associated Proteins Human genes 0.000 description 2
- 108010041520 Pulmonary Surfactant-Associated Proteins Proteins 0.000 description 2
- 239000012722 SDS sample buffer Substances 0.000 description 2
- XSQUKJJJFZCRTK-UHFFFAOYSA-N Urea Chemical compound NC(N)=O XSQUKJJJFZCRTK-UHFFFAOYSA-N 0.000 description 2
- 239000002253 acid Substances 0.000 description 2
- 239000011543 agarose gel Substances 0.000 description 2
- 238000004458 analytical method Methods 0.000 description 2
- 210000004899 c-terminal region Anatomy 0.000 description 2
- 238000005119 centrifugation Methods 0.000 description 2
- 230000008859 change Effects 0.000 description 2
- 238000004587 chromatography analysis Methods 0.000 description 2
- 230000000694 effects Effects 0.000 description 2
- 125000003630 glycyl group Chemical group [H]N([H])C([H])([H])C(*)=O 0.000 description 2
- 229940088597 hormone Drugs 0.000 description 2
- 239000005556 hormone Substances 0.000 description 2
- 229960000274 lysozyme Drugs 0.000 description 2
- 239000004325 lysozyme Substances 0.000 description 2
- 235000010335 lysozyme Nutrition 0.000 description 2
- 238000012423 maintenance Methods 0.000 description 2
- 230000009467 reduction Effects 0.000 description 2
- 230000001105 regulatory effect Effects 0.000 description 2
- 108091008146 restriction endonucleases Proteins 0.000 description 2
- 238000001262 western blot Methods 0.000 description 2
- MTCFGRXMJLQNBG-REOHCLBHSA-N (2S)-2-Amino-3-hydroxypropansäure Chemical compound OC[C@H](N)C(O)=O MTCFGRXMJLQNBG-REOHCLBHSA-N 0.000 description 1
- OCUSNPIJIZCRSZ-ZTZWCFDHSA-N (2s)-2-amino-3-methylbutanoic acid;(2s)-2-amino-4-methylpentanoic acid;(2s,3s)-2-amino-3-methylpentanoic acid Chemical compound CC(C)[C@H](N)C(O)=O.CC[C@H](C)[C@H](N)C(O)=O.CC(C)C[C@H](N)C(O)=O OCUSNPIJIZCRSZ-ZTZWCFDHSA-N 0.000 description 1
- QYETVTFBDRINOR-PHGYPNQBSA-N (2s,5r,6r)-6-[[(2r)-2-amino-2-phenylacetyl]amino]-3,3-dimethyl-7-oxo-4-thia-1-azabicyclo[3.2.0]heptane-2-carboxylic acid;sulfuric acid Chemical compound OS(O)(=O)=O.C1([C@@H](N)C(=O)N[C@H]2[C@H]3SC([C@@H](N3C2=O)C(O)=O)(C)C)=CC=CC=C1.C1([C@@H](N)C(=O)N[C@H]2[C@H]3SC([C@@H](N3C2=O)C(O)=O)(C)C)=CC=CC=C1 QYETVTFBDRINOR-PHGYPNQBSA-N 0.000 description 1
- NJZHEQOUHLZCOX-ZENOOKHLSA-N (3aR,4R,9bS)-golgicide A Chemical compound C1([C@@H]2NC3=C(F)C=C(C=C3[C@H]3C=CC[C@H]32)F)=CC=CN=C1 NJZHEQOUHLZCOX-ZENOOKHLSA-N 0.000 description 1
- 229930024421 Adenine Natural products 0.000 description 1
- GFFGJBXGBJISGV-UHFFFAOYSA-N Adenine Chemical compound NC1=NC=NC2=C1N=CN2 GFFGJBXGBJISGV-UHFFFAOYSA-N 0.000 description 1
- 241001244729 Apalis Species 0.000 description 1
- DCXYFEDJOCDNAF-UHFFFAOYSA-N Asparagine Natural products OC(=O)C(N)CC(N)=O DCXYFEDJOCDNAF-UHFFFAOYSA-N 0.000 description 1
- 108010077805 Bacterial Proteins Proteins 0.000 description 1
- 241000908115 Bolivar Species 0.000 description 1
- 108010037936 CCCGGG-specific type II deoxyribonucleases Proteins 0.000 description 1
- 101100337060 Caenorhabditis elegans glp-1 gene Proteins 0.000 description 1
- 101100042371 Caenorhabditis elegans set-3 gene Proteins 0.000 description 1
- 241000282472 Canis lupus familiaris Species 0.000 description 1
- 238000001712 DNA sequencing Methods 0.000 description 1
- 108010054576 Deoxyribonuclease EcoRI Proteins 0.000 description 1
- 102000004190 Enzymes Human genes 0.000 description 1
- 108090000790 Enzymes Proteins 0.000 description 1
- 241000282326 Felis catus Species 0.000 description 1
- 230000005526 G1 to G0 transition Effects 0.000 description 1
- 108700039691 Genetic Promoter Regions Proteins 0.000 description 1
- XMBSYZWANAQXEV-QWRGUYRKSA-N Glu-Phe Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 XMBSYZWANAQXEV-QWRGUYRKSA-N 0.000 description 1
- 108060003199 Glucagon Proteins 0.000 description 1
- WQZGKKKJIJFFOK-GASJEMHNSA-N Glucose Natural products OC[C@H]1OC(O)[C@H](O)[C@@H](O)[C@@H]1O WQZGKKKJIJFFOK-GASJEMHNSA-N 0.000 description 1
- 102000014150 Interferons Human genes 0.000 description 1
- 108010050904 Interferons Proteins 0.000 description 1
- QNAYBMKLOCPYGJ-REOHCLBHSA-N L-alanine Chemical compound C[C@H](N)C(O)=O QNAYBMKLOCPYGJ-REOHCLBHSA-N 0.000 description 1
- 102000008192 Lactoglobulins Human genes 0.000 description 1
- 108010060630 Lactoglobulins Proteins 0.000 description 1
- GUBGYTABKSRVRQ-QKKXKWKRSA-N Lactose Natural products OC[C@H]1O[C@@H](O[C@H]2[C@H](O)[C@@H](O)C(O)O[C@@H]2CO)[C@H](O)[C@@H](O)[C@H]1O GUBGYTABKSRVRQ-QKKXKWKRSA-N 0.000 description 1
- 241001024304 Mino Species 0.000 description 1
- 101001067830 Mus musculus Peptidyl-prolyl cis-trans isomerase A Proteins 0.000 description 1
- 241000699670 Mus sp. Species 0.000 description 1
- XMBSYZWANAQXEV-UHFFFAOYSA-N N-alpha-L-glutamyl-L-phenylalanine Natural products OC(=O)CCC(N)C(=O)NC(C(O)=O)CC1=CC=CC=C1 XMBSYZWANAQXEV-UHFFFAOYSA-N 0.000 description 1
- 239000000020 Nitrocellulose Substances 0.000 description 1
- 241000283973 Oryctolagus cuniculus Species 0.000 description 1
- 108010058846 Ovalbumin Proteins 0.000 description 1
- 108010087702 Penicillinase Proteins 0.000 description 1
- 102000045595 Phosphoprotein Phosphatases Human genes 0.000 description 1
- 108700019535 Phosphoprotein Phosphatases Proteins 0.000 description 1
- 241000288906 Primates Species 0.000 description 1
- 108020004511 Recombinant DNA Proteins 0.000 description 1
- 102000012479 Serine Proteases Human genes 0.000 description 1
- 108010022999 Serine Proteases Proteins 0.000 description 1
- 101150050863 T gene Proteins 0.000 description 1
- 101710162629 Trypsin inhibitor Proteins 0.000 description 1
- 229940122618 Trypsin inhibitor Drugs 0.000 description 1
- 150000007513 acids Chemical class 0.000 description 1
- 229960000643 adenine Drugs 0.000 description 1
- 210000001789 adipocyte Anatomy 0.000 description 1
- 230000002411 adverse Effects 0.000 description 1
- 230000002776 aggregation Effects 0.000 description 1
- 238000004220 aggregation Methods 0.000 description 1
- 235000004279 alanine Nutrition 0.000 description 1
- 125000000539 amino acid group Chemical group 0.000 description 1
- -1 amino- Chemical class 0.000 description 1
- 238000000137 annealing Methods 0.000 description 1
- 239000003242 anti bacterial agent Substances 0.000 description 1
- 229940088710 antibiotic agent Drugs 0.000 description 1
- 235000009582 asparagine Nutrition 0.000 description 1
- 229960001230 asparagine Drugs 0.000 description 1
- 102000005936 beta-Galactosidase Human genes 0.000 description 1
- 108010005774 beta-Galactosidase Proteins 0.000 description 1
- 102000006635 beta-lactamase Human genes 0.000 description 1
- 230000004071 biological effect Effects 0.000 description 1
- 244000309466 calf Species 0.000 description 1
- 239000004202 carbamide Substances 0.000 description 1
- 229950008486 carperitide Drugs 0.000 description 1
- 230000015556 catabolic process Effects 0.000 description 1
- 230000003196 chaotropic effect Effects 0.000 description 1
- 239000003795 chemical substances by application Substances 0.000 description 1
- 208000037516 chromosome inversion disease Diseases 0.000 description 1
- 238000006731 degradation reaction Methods 0.000 description 1
- 201000010099 disease Diseases 0.000 description 1
- 208000037265 diseases, disorders, signs and symptoms Diseases 0.000 description 1
- 229940088598 enzyme Drugs 0.000 description 1
- 239000013613 expression plasmid Substances 0.000 description 1
- 238000000605 extraction Methods 0.000 description 1
- 238000002523 gelfiltration Methods 0.000 description 1
- 238000007429 general method Methods 0.000 description 1
- 101150102822 glp-1 gene Proteins 0.000 description 1
- 239000008103 glucose Substances 0.000 description 1
- 125000000291 glutamic acid group Chemical group N[C@@H](CCC(O)=O)C(=O)* 0.000 description 1
- 230000003450 growing effect Effects 0.000 description 1
- 238000010348 incorporation Methods 0.000 description 1
- 238000011534 incubation Methods 0.000 description 1
- 230000002401 inhibitory effect Effects 0.000 description 1
- 230000000977 initiatory effect Effects 0.000 description 1
- 230000003914 insulin secretion Effects 0.000 description 1
- 230000002473 insulinotropic effect Effects 0.000 description 1
- 229940079322 interferon Drugs 0.000 description 1
- 230000000968 intestinal effect Effects 0.000 description 1
- 230000007326 intracellular aggregation Effects 0.000 description 1
- 238000004255 ion exchange chromatography Methods 0.000 description 1
- 238000001155 isoelectric focusing Methods 0.000 description 1
- 101150066555 lacZ gene Proteins 0.000 description 1
- 239000008101 lactose Substances 0.000 description 1
- 210000004072 lung Anatomy 0.000 description 1
- 125000003588 lysine group Chemical group [H]N([H])C([H])([H])C([H])([H])C([H])([H])C([H])([H])C([H])(N([H])[H])C(*)=O 0.000 description 1
- 229910052943 magnesium sulfate Inorganic materials 0.000 description 1
- 238000004519 manufacturing process Methods 0.000 description 1
- 230000000813 microbial effect Effects 0.000 description 1
- 239000000203 mixture Substances 0.000 description 1
- 238000010369 molecular cloning Methods 0.000 description 1
- 231100000219 mutagenic Toxicity 0.000 description 1
- 230000003505 mutagenic effect Effects 0.000 description 1
- 229920001220 nitrocellulos Polymers 0.000 description 1
- 229940092253 ovalbumin Drugs 0.000 description 1
- UOJMTSCORVQOHS-UHFFFAOYSA-N pachypodol Natural products COc1cc(ccc1O)C2=C(C)C(=O)c3c(O)cc(C)cc3O2 UOJMTSCORVQOHS-UHFFFAOYSA-N 0.000 description 1
- 230000036961 partial effect Effects 0.000 description 1
- 229950009506 penicillinase Drugs 0.000 description 1
- COLNVLDHVKWLRT-QMMMGPOBSA-N phenylalanine group Chemical group N[C@@H](CC1=CC=CC=C1)C(=O)O COLNVLDHVKWLRT-QMMMGPOBSA-N 0.000 description 1
- 150000003904 phospholipids Chemical class 0.000 description 1
- 239000013600 plasmid vector Substances 0.000 description 1
- 238000002264 polyacrylamide gel electrophoresis Methods 0.000 description 1
- 229920000136 polysorbate Polymers 0.000 description 1
- 230000003389 potentiating effect Effects 0.000 description 1
- 239000002243 precursor Substances 0.000 description 1
- 230000009465 prokaryotic expression Effects 0.000 description 1
- 238000001742 protein purification Methods 0.000 description 1
- 238000000734 protein sequencing Methods 0.000 description 1
- 101150079601 recA gene Proteins 0.000 description 1
- 230000002829 reductive effect Effects 0.000 description 1
- 230000010076 replication Effects 0.000 description 1
- 230000000717 retained effect Effects 0.000 description 1
- 150000003839 salts Chemical class 0.000 description 1
- 238000012163 sequencing technique Methods 0.000 description 1
- 239000003001 serine protease inhibitor Substances 0.000 description 1
- 210000002966 serum Anatomy 0.000 description 1
- 230000003381 solubilizing effect Effects 0.000 description 1
- 230000007480 spreading Effects 0.000 description 1
- 238000003892 spreading Methods 0.000 description 1
- 238000012360 testing method Methods 0.000 description 1
- DPJRMOMPQZCRJU-UHFFFAOYSA-M thiamine hydrochloride Chemical compound Cl.[Cl-].CC1=C(CCO)SC=[N+]1CC1=CN=C(C)N=C1N DPJRMOMPQZCRJU-UHFFFAOYSA-M 0.000 description 1
- 229960000344 thiamine hydrochloride Drugs 0.000 description 1
- 235000019190 thiamine hydrochloride Nutrition 0.000 description 1
- 239000011747 thiamine hydrochloride Substances 0.000 description 1
- 230000005026 transcription initiation Effects 0.000 description 1
- 230000005030 transcription termination Effects 0.000 description 1
- IEDVJHCEMCRBQM-UHFFFAOYSA-N trimethoprim Chemical compound COC1=C(OC)C(OC)=CC(CC=2C(=NC(N)=NC=2)N)=C1 IEDVJHCEMCRBQM-UHFFFAOYSA-N 0.000 description 1
- 239000002753 trypsin inhibitor Substances 0.000 description 1
- 238000011144 upstream manufacturing Methods 0.000 description 1
- XLYOFNOQVPJJNP-UHFFFAOYSA-N water Substances O XLYOFNOQVPJJNP-UHFFFAOYSA-N 0.000 description 1
Classifications
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N9/00—Enzymes; Proenzymes; Compositions thereof; Processes for preparing, activating, inhibiting, separating or purifying enzymes
- C12N9/10—Transferases (2.)
- C12N9/1025—Acyltransferases (2.3)
- C12N9/1029—Acyltransferases (2.3) transferring groups other than amino-acyl groups (2.3.1)
- C12N9/1033—Chloramphenicol O-acetyltransferase (2.3.1.28)
-
- C—CHEMISTRY; METALLURGY
- C07—ORGANIC CHEMISTRY
- C07K—PEPTIDES
- C07K14/00—Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof
- C07K14/435—Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof from animals; from humans
- C07K14/46—Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof from animals; from humans from vertebrates
- C07K14/47—Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof from animals; from humans from vertebrates from mammals
- C07K14/4701—Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof from animals; from humans from vertebrates from mammals not used
- C07K14/4711—Alzheimer's disease; Amyloid plaque core protein
-
- C—CHEMISTRY; METALLURGY
- C07—ORGANIC CHEMISTRY
- C07K—PEPTIDES
- C07K14/00—Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof
- C07K14/435—Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof from animals; from humans
- C07K14/575—Hormones
- C07K14/58—Atrial natriuretic factor complex; Atriopeptin; Atrial natriuretic peptide [ANP]; Cardionatrin; Cardiodilatin
-
- C—CHEMISTRY; METALLURGY
- C07—ORGANIC CHEMISTRY
- C07K—PEPTIDES
- C07K14/00—Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof
- C07K14/435—Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof from animals; from humans
- C07K14/575—Hormones
- C07K14/605—Glucagons
-
- C—CHEMISTRY; METALLURGY
- C07—ORGANIC CHEMISTRY
- C07K—PEPTIDES
- C07K14/00—Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof
- C07K14/435—Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof from animals; from humans
- C07K14/785—Alveolar surfactant peptides; Pulmonary surfactant peptides
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N9/00—Enzymes; Proenzymes; Compositions thereof; Processes for preparing, activating, inhibiting, separating or purifying enzymes
- C12N9/14—Hydrolases (3)
- C12N9/48—Hydrolases (3) acting on peptide bonds (3.4)
- C12N9/50—Proteinases, e.g. Endopeptidases (3.4.21-3.4.25)
- C12N9/64—Proteinases, e.g. Endopeptidases (3.4.21-3.4.25) derived from animal tissue
- C12N9/6421—Proteinases, e.g. Endopeptidases (3.4.21-3.4.25) derived from animal tissue from mammals
- C12N9/6424—Serine endopeptidases (3.4.21)
-
- C—CHEMISTRY; METALLURGY
- C07—ORGANIC CHEMISTRY
- C07K—PEPTIDES
- C07K2319/00—Fusion polypeptide
-
- C—CHEMISTRY; METALLURGY
- C07—ORGANIC CHEMISTRY
- C07K—PEPTIDES
- C07K2319/00—Fusion polypeptide
- C07K2319/61—Fusion polypeptide containing an enzyme fusion for detection (lacZ, luciferase)
-
- C—CHEMISTRY; METALLURGY
- C07—ORGANIC CHEMISTRY
- C07K—PEPTIDES
- C07K2319/00—Fusion polypeptide
- C07K2319/70—Fusion polypeptide containing domain for protein-protein interaction
- C07K2319/74—Fusion polypeptide containing domain for protein-protein interaction containing a fusion for binding to a cell surface receptor
- C07K2319/75—Fusion polypeptide containing domain for protein-protein interaction containing a fusion for binding to a cell surface receptor containing a fusion for activation of a cell surface receptor, e.g. thrombopoeitin, NPY and other peptide hormones
Landscapes
- Health & Medical Sciences (AREA)
- Chemical & Material Sciences (AREA)
- Life Sciences & Earth Sciences (AREA)
- Organic Chemistry (AREA)
- Zoology (AREA)
- Genetics & Genomics (AREA)
- Medicinal Chemistry (AREA)
- Molecular Biology (AREA)
- Engineering & Computer Science (AREA)
- General Health & Medical Sciences (AREA)
- Biochemistry (AREA)
- Bioinformatics & Cheminformatics (AREA)
- Proteomics, Peptides & Aminoacids (AREA)
- Wood Science & Technology (AREA)
- Biophysics (AREA)
- Gastroenterology & Hepatology (AREA)
- Biomedical Technology (AREA)
- Toxicology (AREA)
- Biotechnology (AREA)
- General Engineering & Computer Science (AREA)
- Microbiology (AREA)
- Endocrinology (AREA)
- Pulmonology (AREA)
- Cardiology (AREA)
- Neurology (AREA)
- Preparation Of Compounds By Using Micro-Organisms (AREA)
- Micro-Organisms Or Cultivation Processes Thereof (AREA)
Abstract
METHOD FOR STABILIZING HETEROLOGOUS PROTEIN
EXPRESSION AND VECTORS FOR USE THEREIN
Abstract of the Invention The present invention provides a method for stabilizing heterologous protein expression in bacteria by using a 3' truncated chloramphenicol acetyltransferase (CAT) gene fused in frame with a gene encoding a heterolo-gous protein. When expressed in a bacterial host, the resulting hybrid gene produces a fusion protein in re-coverable yield. Cleavage sites separating the CAT and heterologous protein are also provided to facilitate isolation and purification of the desired heterologous protein The invention further provides bacterial vectors containing the hybrid gene fusions for expression of the fusion protein comprising the desired heterologous protein.
EXPRESSION AND VECTORS FOR USE THEREIN
Abstract of the Invention The present invention provides a method for stabilizing heterologous protein expression in bacteria by using a 3' truncated chloramphenicol acetyltransferase (CAT) gene fused in frame with a gene encoding a heterolo-gous protein. When expressed in a bacterial host, the resulting hybrid gene produces a fusion protein in re-coverable yield. Cleavage sites separating the CAT and heterologous protein are also provided to facilitate isolation and purification of the desired heterologous protein The invention further provides bacterial vectors containing the hybrid gene fusions for expression of the fusion protein comprising the desired heterologous protein.
Description
132~162 METHOD FOR STABILIZING HETEROLOGOVS PROTEIN
EXPRESSION AND VECTORS FOR VSE TH~REIN
Technical Field of the Invention The present invention relates generally to the field of biotechnology. More particularly, the invention relates to the fields of protein expression and recombi-nant DNA ~echnology to improve the yield of poorly ex-pressed mammalian polypeptides in bacterial hosts.
Backqround of the Invention Many eukaryotic proteins axe not capable of be-ing expressed in Escherichia coli in any measurable yield,or even if detectable, are not capable of being expressed at such commercially recoverable levels due to proteolysis of the foreign protein by the host. Small proteins (e.g., peptide~hormones of less than 100 amino acids) appear to be especially sensitive to degradation. The degree of proteolysis varies from host to host and protein to protein. Possibly the highest level of expression of a eukaryotic protein in E. coli has been observed with yamma interferon, which was expressed at approximately 60% of total cellular protein. The high lev~l of expression of a few eukaryotic proteins has been achieved because they reach a~concentration in the cell where they can aggregate into insolubIe masses called inclusion or refractile bvd-es (e.g., bovine growth honnone; Schoner et al (1985), ~ 3:151-154). In this~form, the eukaryotic protein is less susceptible to proteolysis.
:~:: :
~:
:.. ~, : . .:
, `:
~ 32~162 Proteins which do not become insoluble on their o~n do in some cases form inclusion bodies if joined to another protein such as a procaryotic protein. A small number of prokaryotic proteins have been used in this man-ner: E. coli lacZ, ~E, and recA genes and the lambdacII gene, for example.
Chloramphenicol acetyltransferase (CAT) has been used as a selectable maxker (resistance to chloramphenicol), as an easily assayed enzyme to monitor the efficiency of both eukaryotic and prokaryotic expres-sion from different promoters (Delegeane, A.~., et al (lg87) Mol Cell Biol 7:3994-4002), regulatory sequences, and/or ribosome binding sites, and for gene fusions which join sequences encoding a eukaryotic protein to the nucleotide sequence encoding mature, native CAT (Buckley and Hayashi (1986) Mol Gen Genet 204:120-125; European Patent Publication 161,937, published 21 November 1985) or to the carboxy terminal fragment of CAT (usually retaining CAT activity)~
While the literature establishes that fusion proteins are useful to express heterologous proteins in bacteria and that the native CAT gene sequence has been used for such a purpose, efforts to use a truncated form of CAT to express or to increase the recoverable yield of heterologous, mammalian proteins such as amyloid protein A4-7Sl insert sequence, glucagon-like peptide I, adipsin/D, and lung surfactant SP-B and 5P-C, have not been reported. In light of the fact that many important protein~ cannot be successfully expressed in bacteria in any commercially recoverable yield, there is a need to ~ develop systems for the bacterial expression and recovery : of such proteins.
: 35 ' : ~ :
:
~, ,,, - -- ~ i ~al~
Disclo~ure of the Invention One aspect of the invention concerns a method of stabilizing heterologous protein expression in a prokaryotic host comprising:
(a) constructing a hybrid gene comprising in sequential order, a 3' truncated chloramphenicol acetyltransferase (C~T) gene sequence fused in frame with a heterologous gene sequence encoding a mammalian polypeptide selec-ted from the group consisting of amyloid protein A4-751 insert sequence, glucagon-like peptide I, adipsin/D, lung surfac~ant protein SP-B and lung surfactant protein SP-C; wherein said polypeptide is normally not recoverable in bacterial expression systems, and wherein said hybrid gene, upon translation, produces a fusion protein in a recoverable yield;
(b) providing a vector for expression of said hybrid gene;
(c) culturing the prokaryotic host transformed with the expression vector; and (d) recovering the fusion protein.
A second aspect of the invention concerns a bacterial expression vector capable of enhancing the level of expression of non-stable, bacterially produced heterologous polypeptides comprising a hybrid gene having, in sequential order, a 3' CAT truncated gene sequence fused in frame to a heterologous gene sequence encoding a mammalian polypeptide selected from the group consisting of amyloid protein A4-751 insert sequence, glucagon-like peptide I, adipsin/D, lung surfactant protein SP-B and lung surfactant protein SP~C, wherein said polypeptide is normally not recoverable in bacterial expression systems;
whereby said truncated CAT gene sequence is capable of rendering the resulting fusion protein resistant to proteolytic degradation.
A preferred embodiment ~or both the method and vectox of the present invention employs a CAT coding . .
~4~ ~3?,~162 sequence of less than or equal to 180 amino acids, preferably between 73 and 180 amino acids. Although the resulting CAT protein is substantially reduced as compzred to the native CAT protein, surprisingly, it has been found that the truncated CAT protein substantially contributes to the stability of the expressed protein and therefore, permits recovery of an increased yield of the desired heterologous protein.
Yet another aspect of the invention provides an improved bacterial expression vector capable of Pnhancing the level of expression of non-stable, bacterially produced heterologous polypeptides wherein said vector contains a hybrid gene having in sequential order, a modified 3' truncated CAT gene sequence linked to a heterologous gene sequence. The improvement comprises altering one or more DNA codons of the truncated CAT gene to eliminate potential chemical cleavage sites within the CAT protein.
Other aspects of the invention will be readily apparent to those of skill in the art from the description and examples which follow.
Brief Description of the Drawinqs Figure 1 set3 forth the amino acid and cor-responding nucleotide sequences for a 241 amino acid (aa)CAT-hANP hybrid protein containing an endoproteinase Glu-C
proteolytic cleavage site. The amino terminal portion of this hybrid protein encodes the first 210 amino acids of CAT, which sequence is extensively referred to throughout the present invention.
Figure 2 illustrates a series of vectors and synthetic fragments used for cloning and expression of the CAT-hANF hybrid proteins of the invention Figure 2A
depicts an EcoRI-PstI synthetic fragment containing the E.
coli trp promoter-operator sequence, a ribosomal binding site, and downstxeam cloning sites. Figure 23 is a restriction site and function map of plasmid pTrp233.
Figure 2C is a restriction site and function map of plasmid pCAT21. Figure 2D is an EcoRI-HindIII synthetic fragment encoding the hANP (102-126) gene preceded by an endoproteinase Glu-C cleavage site. Figures 2E through G
are restriction site and function maps of plasmids phNF75, pChNF109, and pChNF121, respectively. Figure 2H depicts a synthetic 1-73 aa CAT gene sequence contained withi~ NdeI-HindIII fragment. Figure 2I is a restriction site and function map of plasmid pChNF142 wherein site-specific mutagenesis was used to substitute Tyr and Ser codons for residues 16 and 31, respectively, of the CAT gene.
Figure 3 illustrates two different preparative SDS-polyacrylamide gels. Figure 3A is an SDS-lS polyacrylamide gel of the CAT-A4-7Sli hybrid protein.
Lane 1 = molecular size standards; Lane 2 = induced W3110 (pCAPil32); Lane 3 - induced W3110 (pTrp83) vector control; Lane 4 = uninduced W3110 (pCAPil36); and Lane 5 =
induced W3110 (pCAPil36). Figure 3B is an SDS-polyacrylamide gel of the CAT-GLP-I hybrid protein. Lane 1 = molecular size standard; Lane 2 = uninduced W3110 (pCGLP139); Lane 3 = induced W3110 (pCGLP139); and Lane 4 = induced W3110 (pTrp83~ vector control.
Figure 4 illustrates the amino acid and cor-responding nucleotide sequences for a CAT-A4-751i hybrid protein and a CAT-GLP-I hybrid protein of the invention.
Figure 4A depicts the first 73 codons encoding the amino terminus of the CAT protein joined in-frame to the synthetic A4-751i gene preceded by a chemical cleavage and site encoded by Asn-Gly. Figure 4B depicts the first 73 codons encoding the amino terminus of the CAT protein joined in-frame to the synthetic GLP-l gene preceded by a Met codon.
Figure 5 illustrates two plasmids, pCAT73 and pCAT210,-in which the gene for tetracycline resistance is restored in these CAT expression vectors.
~ ~2~1 62 Figure 6 is the nucleotide sequence and cor-responding amino acid sequence of the SP-B expression construct pC210SP-B from the EcoRI site preceding the trP
promoter region through the HindIII site containing the translation stop codon. The CAT, linker, and SP-B regions are identified therein, respectively, by the arrows.
Figure 7 is a preparative SDS-polyacrylamide gel of the CAT:SP-B fusion protein. Lane A = molecular size standards; Lane B = induced W3110 cells containing pTrp233 vector control; and Lane C = induced pC210SP-B/W3110 cells.
Figure 8 illustrates the nucleotide sequence and corresponding amino acid sequence of the 251 residue CAT:SP-C fusion protein from plasmid pC210SP-C. The CAT
qene, linker sequence and SP-B gene are sequentially identified therein by the arrows.
Figure 9 provides the molecular wsight determinations for Pach of the CAT:SP-C fusion proteins.
Lane A = molecular size standards; Lane B = induced W3110 cells containing pTrp233 vector control; Lane C = induced pC106SP-C; Lane D = pC149SP-C; Lane E = pC179SP-C; and Lane F = pC210SP-C.
Figure 10 provides the cDNA and amino acid sequences for human adipsin/D.
Modes for CarrYing Out the Invention A. Definitions As used herein the term "stabilizing protein expression~ refers to a property of a fusion protein responsible for inhibiting proteolysis of a foreign protein by a recombinant host cell.
~ Insoluble~ as referred to proteins intends a condition wherein a protein may be recovered only by extraction with dstergents or chaotropic agents. Usually, .
-1 ~ 2 ~
insoluble proteins are formed as a consequence of intracellular aggregation of the cloned gene products.
~ High protein expression" or "enhanced protein expxession~ refers to a level of expression wherein the fused protein can comprise 10% or more of the total protein produced by each cell. A preferred range for high protein expression levels is from 10-20~ of total cell protein.
As used herein, ~'non-recoverable" refers to a level of expression wherein the desired protein may be detected using sensitive techniques, e.g., Western blbt analysis, yet the protein is not commercially recoverable using conventional purification techniques such as SDS
polyacrylamide gel el~ctrophoresis, gel filtration, ion exchange chromatography, hydrophobic chromatography, af-finity chromatogrAphy, or isoelectric focusing.
IlMammalian'' refers to any mammalian species, and includes rabbits, mice, dogs, cats, primates and humans, preferably humans.
As used herein, the term "heterologous~ proteins refers to proteins which are foreign to the host cell transformed to produce them. Thus, the host cell does not generally produce such proteins on its own.
B. CAT Fusions CAT encodes a 219 amino acid mature protein andthe gene contains a number of convenient restriction endonuclease sites (5'-PvuII, EcoRI, DdeI, NcoI, and ScaI-3l) ~hroughout its length to test gene fusions for high level expression. These restriction sites may be used for ease of convenience in constructing the hybrid gene sequences of the invention or other sites within the gene sequence may ~e generated using t~chniques commonly known to those of skill in the art. Any of the resulting CAT
sequences are considered useful so long as the resulting .
- : :
132~1 62 CAT fusion retains the ability to enhance the expression of the desired heterologous peptide.
The expression constructs of the invention can employ most of the CAT-encoding gene sequence or a substantially truncated portion of the sequence encoding an N-terminal portion of the CAT protein linked to the gene encoding the desired heterologous polypeptide. In one embodiment of the invention, the CAT portion of the fusion codes for about the N-terminal one-third of the CAT
sequence.
The expression constructs exemplified herein, which demonstrated enhanced levels of expression for a ~ variety of heterologous proteins, utilize a number of varying lengths of the CAT protein ranging in size from 73 to 210 amino acids. The 73 ~mino acid CAT fusion component is conveniently form~d by dige~ting the C~T
nucleotide sequence at the EcoRI restriction site.
Similarly, the 210 amino acid CAT fusion component is formed by digesting the CAT nucleotide sequence with ScaI.
These, as well as other CAT restriction fragmen~s, may then be ligated to any nucleotide sequence encoding a desired protein to enhance expression of the desired protein.
Significantly, although the expression level of fusion protein ~approximately 15-20~ of total cell protein) was similar for the CAT (106 amino acid) - SP-C
fusion and the CAT (210 amino acid) - SP-C fusion, it can be een that the former case actually represents a significant increase in expression level for the desired SP-C polypeptide, since the SP-C polypeptide constitutes a substantially larger proportion of the total fusion protein in the former case. The ability to increase expression level for the desired polypeptide by reducing the size of the fused CAT protein sequence was quite an unexpected finding in view of the experience of the prior art. In general, the prior art experience has been that . .
~32~62 reduction in size of the bacterial leader sequence does not result in increased production of the fused heterologous polypeptide due to a concomitant larger reduction in the expression level of the fusion protein.
With one exception, the various CAT-heterologous fusion proteins exemplified herein were found to be expressed in the range of approximately 10-20% of the total cell protein. Thus, the versatility of the CAT fu-sions, that is, the ability to use a variety of CAT coding sequences having the ability to enhance the expression of a desired protein, allows great flexibility of choice when construc~ing CAT hybrid genes.
The reading frame for translating ths nucleotide sequence into a protein begins with a portion of the amino terminus of CAT, the length of which varies, continuing in-frame with or without a linker sequence into the protein to be e~pressed, and terminating at the carboxy terminus of the protein. An enzyma~ic or chemical cleav-age site may be introduced downstream of the CAT sequence to permit recovery of the cleaved product from the hybrid protein. Such cleavage sequences are known in the art as are the conditions under which cleavage can be effected.
Following cleavage, the desired heterologous polypeptide can be recovered using known techniques of protein purification. Suitable cleavage sequences include, without limitation, cleavage following methionine residues (cyanogen bromide), glutamic acid residues (endoproteinase Glu-C), tryptophan residues ~N-chlorosuccinimide with urea or with sodium dodecyl sulfate (SDS)) and cleavage between asparagine and lysine residues (hydroxylamine).
To avoid internal cleavage within the CAT
sequence, amino acid substitutions can be made using conventional site specific mutagenesis techniques (Zollerr M.J., and Smith, M. (1982), Nuc Acids Res 10:6487-6500, and Adelman, J.P., et al (1983), DNA 2:183-193). This is conducted using a synthetic oligonucleotide primer com-.
1 ~'2~1 62 plementary to a single-stranded phage DNA to be mutagenized except for limited mismatching, representing the desired mutation. Of course, these substitutions would only be performed when expression of CAT is not significantly affected. Where there is only one internal cysteine residue, as in the short CAT sequence, this residue may be replaced to help reduce multimerization through disulfide bridges.
C. CAT Fusion Vectors Procaryotic systems may be used to express the CAT fusion sequence; procaryotic hosts are, of course, the most convenient for cloning procedures. Procaryotes most frequently are represented by various strains of E. coli;
however, other microbial strains may also be used.
Plasmid vectors which contain replication sites, select-able markers and control sequences derived from a species compatible with the hos~ are used; for example, E. coli is typically transformed using derivatives of pBR322, a plasmid derived from an E coli species by Bolivar et al, Gene 2:95 (1977). pBR322 contains genes for ampicillin and tetracycline resistancer and thus provides multiple selectable markers which can be either retained or destroyed in constructi.ng the desired vector.
In addition to the modifications described above which would facilitate cleavage and purification of the product polypeptide, the gene conferring tetracycline resistance may be restored to the exemplified CAT fusion vectorq for an alternative method of plasmid selection and maintenance.
Although the E. coli tryptophan promoter-operator sequences have been exemplified in the present CAT vec~ors, different control sequences can be substituted for the ~ regulatory sequences and are considered to be within the scope of the invention. Com-monly used procaryotic control sequences which are defined 132~1~2 herein to include promoters for transcription initiation, optionally with an operator, along with ribosome binding site sequence, include such commonly used promoters as the beta-lactamase (penicillinase) and lactose (lac) promoter systems (Chang et al, Nature 198:1056), the lambda-derived PL promoter (Shimatake et al, Nature 292:128 (1981)~ and N-gene rihosome binding site, and the trp-lac ~trc) promoter system (Amann and Brosius, Gene 40:183 (1985)).
Since the general utility of these CAT vectors have been established with very different mammalian peptides (ranging in protein size, the presence or absence of disulfide bonds, and being hydrophobic or hydrophilic in nature) vec~ors with unique restriction site~ may be created or substituted for the pBR322-derived vector il-lustrated in the examples.
D. Heteroloqous Protein Expression Amino terminal DNA sequences of CAT have beenfused to DNA sequences encoding human polypeptides for high lev~l expression in the bacterial host E. coli. The polypeptides described herein are relatively small mam-malian polypeptides ranging in size from about 30 to 76 amino acid residues. Atkempts to directly express, e.g., in a non-fused form, each of these polypeptides in bacteria have been unsuccessful, most likely due to the proteolytic degradation which occurs upon translation of the m~NA product. In the case of extremely hydrophobic polypeptides, even attempts to express such polypeptides using beta-galactosidase fusions produced detectable but very low level amounts of protein.
Examples of polypeptides that have been success-fully expressed to high level in bacteria using the truncated CAT fusions include a variety of mammalian polypeptides including amyloid protein A4-751 insert sequence, glucagon-like peptide I, adipsin/D, lung surfactant protein SP5 (SP-C), and lung surfactant SP18 ~.3~162 (SP-B). Preferably, the mammalian protein is of human origin, although other sources are also contemplated to be within the scope of this invention. A4-751 is a 57 amino acid sequence identified within the precursor for the A4 amyloid protein associated with Alzheimex's disease and shares homology with the Kunitz family of serine proteinase inhibitors (Ponte, P., et al (1988) Nature 331:525-527; Tanzi, R.E., et al (1988) Nature 331:528-530). Glucagon-like paptide I (GLP-I, 7-31) is a 31 amino acid hormone co-encoded in the glucagon gene which is a potent stimulator of insulin release (Mojsov, S., et al (1987) J Clin Inves 79:616-619j. Adipsin/D is a serine protease synthesized in and secreted from adipocytes (Zusalak, K.M., et al (1985) J Mol Cell Biol 5:419). Lung surfactant SP-B is a 76 amino acid hydrophobic protein.
Lung surfactant SP-C is a 35 amino acid hydrophobic protein. Both SP-B and SP-C greatly enhance spreading of surfactant phospholipids at an air:water interface.
E. Hosts Exemplified Host strains used in cloning and procaryotic expression herein are as follows:
For cloning an~ sequencing, and for expression of construction under control of mos~ bacterial pxomoters, E. coli strains such as MC1061, DHl, RRl, W3110, MM294, B, C600hfl, K803, HB101, JA221, and JM101 may be used.
F. General Methods ~ecombinant DNA methods are described in Maniatis et al (1982), Molecular Cloning, Cold Spring Harbor Laboratory, Cold Spring Harbor, New York, when not specifically cited in the following examples. Methods are also described in the literature for visualizing inclusion bodies, isolating them from cells, then solubilizing, purifying, and cleaving the hybrid protein (e.g., Itakura, X., et al ~1977) Science 19~.1056-1063; Shine, J., et al -13- ~ 3 2 a 162 (1980) Nature_285:455-461). Methods are also available, if necessary, for refolding ~he protein product (Creighton, T.E., Proceedinqs of Genex-UCLA Symposium, 1985, Kingstones (in press).
Examples I. Expression of Chloramphenicol AcetYltransferase-Human Atrial Natriuretic Peptide Hybrid Proteins in Escherichia coli.
A. Expression vector pChNF109.
Expression vector pChNF109 encodes a 241 amino acid CAT-hANP hybrid protein containing an endoproteinase Glu-C proteolytic cleavage site (Fig. 1). Most of the CAT
gene (amino acids 1-210) has been joined in-frame to the hANP(102-126) gene and cleavage site (26 amino acids) through a linker sequence (5 amino acids). The hANP
polypeptide comprises abou~ 10~ of the hybrid protein.
This vector was constructed from plasmids pTrp233, pCAT21, and phNF75 which supplied the plasmid backbone and trp promoter-operator, the CAT gene, and the h~NP(102-126) gene and cleavage site, respectively.
1. Construction of ~ChNF109.
Plasmid pTrp233 was constructed by insertion of a synthetic EcoRI-PstI fragment containing the E. coli trp promoter-operator sequence, a ribosomal binding site, and downstream cloning sites into plasmid pKK~33-2-NdeI which contains strong transcription termination signals, TlT2, `and the beta-lactamase gene. The synthetic fragment (see Fig. 2A) was assembled using the method of Vlasuk et al (1986), J. Biol Chem 261: 4789-4796 and its sequence confirmed by the method of Sanger et al (1977), Proc Natl Acad Sci USA 74:5463-5467 in M13mp8 and M13mp9. Plasmid ;~ :
:, .. .,: . -~23162 pKK233-2-NdeI (disclosed in U.S. patent 4,764,504 dated 16 August 1988) was digested with EcoRI and PstI, its termini dephosphorylated using calf intestinal phosphatase, and ligated with the synthetic EcoRI-PstI fragment. Plasmid pTrp233 was isolated (Fig. 2B) from E. Coli JA221 transformed to ampicillin resistance.
Plasmid pCAT21 was constructed by insertion of the CAT gene (from transposon Tn9, Alton and Vapnek, (1979) Nature 282:864-869) into plasmid pTrp233 under the control of the ~ promoter-operator. Plasmid pAL13ATCAT (a plasmid disclosed in Canadian application Serial No. 576,975 filed 8 September 1988), was digested with NdeI and HindIII and the approximately 750 bp NdeI-HindIII fragment containing the CAT
gene (with the initiating Met residue encoded at the NdeI
site) was purified using agarose gel electrophoresis. The CAT gene was ligated with NdeI and HindIII digested pTrp233 using T4 DNA ligase. From E. Coli MC1061 (Casadaban et al (1980), I Mo] Biol 138: 179-209) ampicillin-resistant .
transformants, plasmid pCAT21 was isolated (Fig. 2C).
Plasmid phNF75 was constructed by insertion of a synthetic hANP gene preceded by a proteolytic cleavage site into plasmid pBgal (Shine et al (1980), Nature 285:456).
Eight oligodeoxyribonucleotides (Fig. 2D) were assembled into a synthetic hANP(102-126) gene preceded by an endoproteinase Glu-C cleavage site (method of Vlasuk et al (1986), supra).
The synthetic DNA fragment (with a 5' EcoRI tail and a 3' blunt end) was ligated with EcoRI and SmaI restriction endonuclease digested M13mpl9 using T4 DNA ligase for the purpose of DNA sequencing (method of Sanger e~ al (1977), ~ ) A clone with the correct sequence, M13-hNF7, was digested with BamHI and BglII, the fragment containing the hANP gene purified by agarose gel electrophoresis, and the fragment ligated with BamHI-~' -15- 132~16~
digested and bacterial alkaline phosphatase dephosphorylated pTrp233 using ~4 DNA ligase. A plasmid with the insert in the orientation which gives adjacent HindIII, BamHI and EcoRI sites at the 3' end of the hANP
gene, phNF73, was identified by the siæe of the fragments generated ~y digestion with HindIII and PvuII. Plasmid phNF73 was digested with EcoRI, the hANP gene purified using polyacrylamide gel electxophoresis, and the gene ligated with EcoRI-digested and bacterial alkaline phosphatase dephosphorylated plasmid pBgal. From E coli MC1061 ampicillin-resistant transformants, plasmid phNF75 ~Fig. 2E) was identified by the size of the DNA fragments generated by digestion with PstI and HindIII.
Expression vector pChNF109 was constructed by insertion of DNA fragments containing CAT, hANP and the proteolytic cleavage site, and a linker sequence into plasmid pTrp233. Plasmid phNF75 was digested with EcoRI
and HindIII, the approximately 80 bp EcoRI-HindIII frag-ment containing hANP was purified by polyacrylamide gel electrophoresis, and ligated with EcoRI- and HindIII-digested pTrp233 using T4 DMA ligase. From E. coli MC1061 ampicillin-resistant transformants, plasmid phNF87 was isolated and digested with BamHI and the fragments were dephosphorylated using bacterial alkaline phosphatase. A
BamHI cassette containing the trp promoter-operator, ribosomal binding site, and large amino terminal fragment of the CAT gene was generated by digesting pCAT21 with ScaI, attaching BamHI synthetic linkers (5'-CGGATCCG-3') to the blunt termini using T4 DNA ligase, digesting the ligation with BamHI and purification of the approximately 740 bp BamHI fragment by agarose gel electrophoresis. The BamHI cassette and plasmid phNF87 were ligated using T4 ligase and ampicillin-resistant transformants of E. coli MC161 obtained. Plasmid pChNF109 (Fig. 2F), with the BamHI cassette in the orientation such that ~he C~T gene is fused in-frame ~o the endoproteinase Glu-C cleavage ~3~0~&~, site followed by the hANP gene, was selected on the basis of DNA fragment size in an EcoRI digest of the plasmid.
EXPRESSION AND VECTORS FOR VSE TH~REIN
Technical Field of the Invention The present invention relates generally to the field of biotechnology. More particularly, the invention relates to the fields of protein expression and recombi-nant DNA ~echnology to improve the yield of poorly ex-pressed mammalian polypeptides in bacterial hosts.
Backqround of the Invention Many eukaryotic proteins axe not capable of be-ing expressed in Escherichia coli in any measurable yield,or even if detectable, are not capable of being expressed at such commercially recoverable levels due to proteolysis of the foreign protein by the host. Small proteins (e.g., peptide~hormones of less than 100 amino acids) appear to be especially sensitive to degradation. The degree of proteolysis varies from host to host and protein to protein. Possibly the highest level of expression of a eukaryotic protein in E. coli has been observed with yamma interferon, which was expressed at approximately 60% of total cellular protein. The high lev~l of expression of a few eukaryotic proteins has been achieved because they reach a~concentration in the cell where they can aggregate into insolubIe masses called inclusion or refractile bvd-es (e.g., bovine growth honnone; Schoner et al (1985), ~ 3:151-154). In this~form, the eukaryotic protein is less susceptible to proteolysis.
:~:: :
~:
:.. ~, : . .:
, `:
~ 32~162 Proteins which do not become insoluble on their o~n do in some cases form inclusion bodies if joined to another protein such as a procaryotic protein. A small number of prokaryotic proteins have been used in this man-ner: E. coli lacZ, ~E, and recA genes and the lambdacII gene, for example.
Chloramphenicol acetyltransferase (CAT) has been used as a selectable maxker (resistance to chloramphenicol), as an easily assayed enzyme to monitor the efficiency of both eukaryotic and prokaryotic expres-sion from different promoters (Delegeane, A.~., et al (lg87) Mol Cell Biol 7:3994-4002), regulatory sequences, and/or ribosome binding sites, and for gene fusions which join sequences encoding a eukaryotic protein to the nucleotide sequence encoding mature, native CAT (Buckley and Hayashi (1986) Mol Gen Genet 204:120-125; European Patent Publication 161,937, published 21 November 1985) or to the carboxy terminal fragment of CAT (usually retaining CAT activity)~
While the literature establishes that fusion proteins are useful to express heterologous proteins in bacteria and that the native CAT gene sequence has been used for such a purpose, efforts to use a truncated form of CAT to express or to increase the recoverable yield of heterologous, mammalian proteins such as amyloid protein A4-7Sl insert sequence, glucagon-like peptide I, adipsin/D, and lung surfactant SP-B and 5P-C, have not been reported. In light of the fact that many important protein~ cannot be successfully expressed in bacteria in any commercially recoverable yield, there is a need to ~ develop systems for the bacterial expression and recovery : of such proteins.
: 35 ' : ~ :
:
~, ,,, - -- ~ i ~al~
Disclo~ure of the Invention One aspect of the invention concerns a method of stabilizing heterologous protein expression in a prokaryotic host comprising:
(a) constructing a hybrid gene comprising in sequential order, a 3' truncated chloramphenicol acetyltransferase (C~T) gene sequence fused in frame with a heterologous gene sequence encoding a mammalian polypeptide selec-ted from the group consisting of amyloid protein A4-751 insert sequence, glucagon-like peptide I, adipsin/D, lung surfac~ant protein SP-B and lung surfactant protein SP-C; wherein said polypeptide is normally not recoverable in bacterial expression systems, and wherein said hybrid gene, upon translation, produces a fusion protein in a recoverable yield;
(b) providing a vector for expression of said hybrid gene;
(c) culturing the prokaryotic host transformed with the expression vector; and (d) recovering the fusion protein.
A second aspect of the invention concerns a bacterial expression vector capable of enhancing the level of expression of non-stable, bacterially produced heterologous polypeptides comprising a hybrid gene having, in sequential order, a 3' CAT truncated gene sequence fused in frame to a heterologous gene sequence encoding a mammalian polypeptide selected from the group consisting of amyloid protein A4-751 insert sequence, glucagon-like peptide I, adipsin/D, lung surfactant protein SP-B and lung surfactant protein SP~C, wherein said polypeptide is normally not recoverable in bacterial expression systems;
whereby said truncated CAT gene sequence is capable of rendering the resulting fusion protein resistant to proteolytic degradation.
A preferred embodiment ~or both the method and vectox of the present invention employs a CAT coding . .
~4~ ~3?,~162 sequence of less than or equal to 180 amino acids, preferably between 73 and 180 amino acids. Although the resulting CAT protein is substantially reduced as compzred to the native CAT protein, surprisingly, it has been found that the truncated CAT protein substantially contributes to the stability of the expressed protein and therefore, permits recovery of an increased yield of the desired heterologous protein.
Yet another aspect of the invention provides an improved bacterial expression vector capable of Pnhancing the level of expression of non-stable, bacterially produced heterologous polypeptides wherein said vector contains a hybrid gene having in sequential order, a modified 3' truncated CAT gene sequence linked to a heterologous gene sequence. The improvement comprises altering one or more DNA codons of the truncated CAT gene to eliminate potential chemical cleavage sites within the CAT protein.
Other aspects of the invention will be readily apparent to those of skill in the art from the description and examples which follow.
Brief Description of the Drawinqs Figure 1 set3 forth the amino acid and cor-responding nucleotide sequences for a 241 amino acid (aa)CAT-hANP hybrid protein containing an endoproteinase Glu-C
proteolytic cleavage site. The amino terminal portion of this hybrid protein encodes the first 210 amino acids of CAT, which sequence is extensively referred to throughout the present invention.
Figure 2 illustrates a series of vectors and synthetic fragments used for cloning and expression of the CAT-hANF hybrid proteins of the invention Figure 2A
depicts an EcoRI-PstI synthetic fragment containing the E.
coli trp promoter-operator sequence, a ribosomal binding site, and downstxeam cloning sites. Figure 23 is a restriction site and function map of plasmid pTrp233.
Figure 2C is a restriction site and function map of plasmid pCAT21. Figure 2D is an EcoRI-HindIII synthetic fragment encoding the hANP (102-126) gene preceded by an endoproteinase Glu-C cleavage site. Figures 2E through G
are restriction site and function maps of plasmids phNF75, pChNF109, and pChNF121, respectively. Figure 2H depicts a synthetic 1-73 aa CAT gene sequence contained withi~ NdeI-HindIII fragment. Figure 2I is a restriction site and function map of plasmid pChNF142 wherein site-specific mutagenesis was used to substitute Tyr and Ser codons for residues 16 and 31, respectively, of the CAT gene.
Figure 3 illustrates two different preparative SDS-polyacrylamide gels. Figure 3A is an SDS-lS polyacrylamide gel of the CAT-A4-7Sli hybrid protein.
Lane 1 = molecular size standards; Lane 2 = induced W3110 (pCAPil32); Lane 3 - induced W3110 (pTrp83) vector control; Lane 4 = uninduced W3110 (pCAPil36); and Lane 5 =
induced W3110 (pCAPil36). Figure 3B is an SDS-polyacrylamide gel of the CAT-GLP-I hybrid protein. Lane 1 = molecular size standard; Lane 2 = uninduced W3110 (pCGLP139); Lane 3 = induced W3110 (pCGLP139); and Lane 4 = induced W3110 (pTrp83~ vector control.
Figure 4 illustrates the amino acid and cor-responding nucleotide sequences for a CAT-A4-751i hybrid protein and a CAT-GLP-I hybrid protein of the invention.
Figure 4A depicts the first 73 codons encoding the amino terminus of the CAT protein joined in-frame to the synthetic A4-751i gene preceded by a chemical cleavage and site encoded by Asn-Gly. Figure 4B depicts the first 73 codons encoding the amino terminus of the CAT protein joined in-frame to the synthetic GLP-l gene preceded by a Met codon.
Figure 5 illustrates two plasmids, pCAT73 and pCAT210,-in which the gene for tetracycline resistance is restored in these CAT expression vectors.
~ ~2~1 62 Figure 6 is the nucleotide sequence and cor-responding amino acid sequence of the SP-B expression construct pC210SP-B from the EcoRI site preceding the trP
promoter region through the HindIII site containing the translation stop codon. The CAT, linker, and SP-B regions are identified therein, respectively, by the arrows.
Figure 7 is a preparative SDS-polyacrylamide gel of the CAT:SP-B fusion protein. Lane A = molecular size standards; Lane B = induced W3110 cells containing pTrp233 vector control; and Lane C = induced pC210SP-B/W3110 cells.
Figure 8 illustrates the nucleotide sequence and corresponding amino acid sequence of the 251 residue CAT:SP-C fusion protein from plasmid pC210SP-C. The CAT
qene, linker sequence and SP-B gene are sequentially identified therein by the arrows.
Figure 9 provides the molecular wsight determinations for Pach of the CAT:SP-C fusion proteins.
Lane A = molecular size standards; Lane B = induced W3110 cells containing pTrp233 vector control; Lane C = induced pC106SP-C; Lane D = pC149SP-C; Lane E = pC179SP-C; and Lane F = pC210SP-C.
Figure 10 provides the cDNA and amino acid sequences for human adipsin/D.
Modes for CarrYing Out the Invention A. Definitions As used herein the term "stabilizing protein expression~ refers to a property of a fusion protein responsible for inhibiting proteolysis of a foreign protein by a recombinant host cell.
~ Insoluble~ as referred to proteins intends a condition wherein a protein may be recovered only by extraction with dstergents or chaotropic agents. Usually, .
-1 ~ 2 ~
insoluble proteins are formed as a consequence of intracellular aggregation of the cloned gene products.
~ High protein expression" or "enhanced protein expxession~ refers to a level of expression wherein the fused protein can comprise 10% or more of the total protein produced by each cell. A preferred range for high protein expression levels is from 10-20~ of total cell protein.
As used herein, ~'non-recoverable" refers to a level of expression wherein the desired protein may be detected using sensitive techniques, e.g., Western blbt analysis, yet the protein is not commercially recoverable using conventional purification techniques such as SDS
polyacrylamide gel el~ctrophoresis, gel filtration, ion exchange chromatography, hydrophobic chromatography, af-finity chromatogrAphy, or isoelectric focusing.
IlMammalian'' refers to any mammalian species, and includes rabbits, mice, dogs, cats, primates and humans, preferably humans.
As used herein, the term "heterologous~ proteins refers to proteins which are foreign to the host cell transformed to produce them. Thus, the host cell does not generally produce such proteins on its own.
B. CAT Fusions CAT encodes a 219 amino acid mature protein andthe gene contains a number of convenient restriction endonuclease sites (5'-PvuII, EcoRI, DdeI, NcoI, and ScaI-3l) ~hroughout its length to test gene fusions for high level expression. These restriction sites may be used for ease of convenience in constructing the hybrid gene sequences of the invention or other sites within the gene sequence may ~e generated using t~chniques commonly known to those of skill in the art. Any of the resulting CAT
sequences are considered useful so long as the resulting .
- : :
132~1 62 CAT fusion retains the ability to enhance the expression of the desired heterologous peptide.
The expression constructs of the invention can employ most of the CAT-encoding gene sequence or a substantially truncated portion of the sequence encoding an N-terminal portion of the CAT protein linked to the gene encoding the desired heterologous polypeptide. In one embodiment of the invention, the CAT portion of the fusion codes for about the N-terminal one-third of the CAT
sequence.
The expression constructs exemplified herein, which demonstrated enhanced levels of expression for a ~ variety of heterologous proteins, utilize a number of varying lengths of the CAT protein ranging in size from 73 to 210 amino acids. The 73 ~mino acid CAT fusion component is conveniently form~d by dige~ting the C~T
nucleotide sequence at the EcoRI restriction site.
Similarly, the 210 amino acid CAT fusion component is formed by digesting the CAT nucleotide sequence with ScaI.
These, as well as other CAT restriction fragmen~s, may then be ligated to any nucleotide sequence encoding a desired protein to enhance expression of the desired protein.
Significantly, although the expression level of fusion protein ~approximately 15-20~ of total cell protein) was similar for the CAT (106 amino acid) - SP-C
fusion and the CAT (210 amino acid) - SP-C fusion, it can be een that the former case actually represents a significant increase in expression level for the desired SP-C polypeptide, since the SP-C polypeptide constitutes a substantially larger proportion of the total fusion protein in the former case. The ability to increase expression level for the desired polypeptide by reducing the size of the fused CAT protein sequence was quite an unexpected finding in view of the experience of the prior art. In general, the prior art experience has been that . .
~32~62 reduction in size of the bacterial leader sequence does not result in increased production of the fused heterologous polypeptide due to a concomitant larger reduction in the expression level of the fusion protein.
With one exception, the various CAT-heterologous fusion proteins exemplified herein were found to be expressed in the range of approximately 10-20% of the total cell protein. Thus, the versatility of the CAT fu-sions, that is, the ability to use a variety of CAT coding sequences having the ability to enhance the expression of a desired protein, allows great flexibility of choice when construc~ing CAT hybrid genes.
The reading frame for translating ths nucleotide sequence into a protein begins with a portion of the amino terminus of CAT, the length of which varies, continuing in-frame with or without a linker sequence into the protein to be e~pressed, and terminating at the carboxy terminus of the protein. An enzyma~ic or chemical cleav-age site may be introduced downstream of the CAT sequence to permit recovery of the cleaved product from the hybrid protein. Such cleavage sequences are known in the art as are the conditions under which cleavage can be effected.
Following cleavage, the desired heterologous polypeptide can be recovered using known techniques of protein purification. Suitable cleavage sequences include, without limitation, cleavage following methionine residues (cyanogen bromide), glutamic acid residues (endoproteinase Glu-C), tryptophan residues ~N-chlorosuccinimide with urea or with sodium dodecyl sulfate (SDS)) and cleavage between asparagine and lysine residues (hydroxylamine).
To avoid internal cleavage within the CAT
sequence, amino acid substitutions can be made using conventional site specific mutagenesis techniques (Zollerr M.J., and Smith, M. (1982), Nuc Acids Res 10:6487-6500, and Adelman, J.P., et al (1983), DNA 2:183-193). This is conducted using a synthetic oligonucleotide primer com-.
1 ~'2~1 62 plementary to a single-stranded phage DNA to be mutagenized except for limited mismatching, representing the desired mutation. Of course, these substitutions would only be performed when expression of CAT is not significantly affected. Where there is only one internal cysteine residue, as in the short CAT sequence, this residue may be replaced to help reduce multimerization through disulfide bridges.
C. CAT Fusion Vectors Procaryotic systems may be used to express the CAT fusion sequence; procaryotic hosts are, of course, the most convenient for cloning procedures. Procaryotes most frequently are represented by various strains of E. coli;
however, other microbial strains may also be used.
Plasmid vectors which contain replication sites, select-able markers and control sequences derived from a species compatible with the hos~ are used; for example, E. coli is typically transformed using derivatives of pBR322, a plasmid derived from an E coli species by Bolivar et al, Gene 2:95 (1977). pBR322 contains genes for ampicillin and tetracycline resistancer and thus provides multiple selectable markers which can be either retained or destroyed in constructi.ng the desired vector.
In addition to the modifications described above which would facilitate cleavage and purification of the product polypeptide, the gene conferring tetracycline resistance may be restored to the exemplified CAT fusion vectorq for an alternative method of plasmid selection and maintenance.
Although the E. coli tryptophan promoter-operator sequences have been exemplified in the present CAT vec~ors, different control sequences can be substituted for the ~ regulatory sequences and are considered to be within the scope of the invention. Com-monly used procaryotic control sequences which are defined 132~1~2 herein to include promoters for transcription initiation, optionally with an operator, along with ribosome binding site sequence, include such commonly used promoters as the beta-lactamase (penicillinase) and lactose (lac) promoter systems (Chang et al, Nature 198:1056), the lambda-derived PL promoter (Shimatake et al, Nature 292:128 (1981)~ and N-gene rihosome binding site, and the trp-lac ~trc) promoter system (Amann and Brosius, Gene 40:183 (1985)).
Since the general utility of these CAT vectors have been established with very different mammalian peptides (ranging in protein size, the presence or absence of disulfide bonds, and being hydrophobic or hydrophilic in nature) vec~ors with unique restriction site~ may be created or substituted for the pBR322-derived vector il-lustrated in the examples.
D. Heteroloqous Protein Expression Amino terminal DNA sequences of CAT have beenfused to DNA sequences encoding human polypeptides for high lev~l expression in the bacterial host E. coli. The polypeptides described herein are relatively small mam-malian polypeptides ranging in size from about 30 to 76 amino acid residues. Atkempts to directly express, e.g., in a non-fused form, each of these polypeptides in bacteria have been unsuccessful, most likely due to the proteolytic degradation which occurs upon translation of the m~NA product. In the case of extremely hydrophobic polypeptides, even attempts to express such polypeptides using beta-galactosidase fusions produced detectable but very low level amounts of protein.
Examples of polypeptides that have been success-fully expressed to high level in bacteria using the truncated CAT fusions include a variety of mammalian polypeptides including amyloid protein A4-751 insert sequence, glucagon-like peptide I, adipsin/D, lung surfactant protein SP5 (SP-C), and lung surfactant SP18 ~.3~162 (SP-B). Preferably, the mammalian protein is of human origin, although other sources are also contemplated to be within the scope of this invention. A4-751 is a 57 amino acid sequence identified within the precursor for the A4 amyloid protein associated with Alzheimex's disease and shares homology with the Kunitz family of serine proteinase inhibitors (Ponte, P., et al (1988) Nature 331:525-527; Tanzi, R.E., et al (1988) Nature 331:528-530). Glucagon-like paptide I (GLP-I, 7-31) is a 31 amino acid hormone co-encoded in the glucagon gene which is a potent stimulator of insulin release (Mojsov, S., et al (1987) J Clin Inves 79:616-619j. Adipsin/D is a serine protease synthesized in and secreted from adipocytes (Zusalak, K.M., et al (1985) J Mol Cell Biol 5:419). Lung surfactant SP-B is a 76 amino acid hydrophobic protein.
Lung surfactant SP-C is a 35 amino acid hydrophobic protein. Both SP-B and SP-C greatly enhance spreading of surfactant phospholipids at an air:water interface.
E. Hosts Exemplified Host strains used in cloning and procaryotic expression herein are as follows:
For cloning an~ sequencing, and for expression of construction under control of mos~ bacterial pxomoters, E. coli strains such as MC1061, DHl, RRl, W3110, MM294, B, C600hfl, K803, HB101, JA221, and JM101 may be used.
F. General Methods ~ecombinant DNA methods are described in Maniatis et al (1982), Molecular Cloning, Cold Spring Harbor Laboratory, Cold Spring Harbor, New York, when not specifically cited in the following examples. Methods are also described in the literature for visualizing inclusion bodies, isolating them from cells, then solubilizing, purifying, and cleaving the hybrid protein (e.g., Itakura, X., et al ~1977) Science 19~.1056-1063; Shine, J., et al -13- ~ 3 2 a 162 (1980) Nature_285:455-461). Methods are also available, if necessary, for refolding ~he protein product (Creighton, T.E., Proceedinqs of Genex-UCLA Symposium, 1985, Kingstones (in press).
Examples I. Expression of Chloramphenicol AcetYltransferase-Human Atrial Natriuretic Peptide Hybrid Proteins in Escherichia coli.
A. Expression vector pChNF109.
Expression vector pChNF109 encodes a 241 amino acid CAT-hANP hybrid protein containing an endoproteinase Glu-C proteolytic cleavage site (Fig. 1). Most of the CAT
gene (amino acids 1-210) has been joined in-frame to the hANP(102-126) gene and cleavage site (26 amino acids) through a linker sequence (5 amino acids). The hANP
polypeptide comprises abou~ 10~ of the hybrid protein.
This vector was constructed from plasmids pTrp233, pCAT21, and phNF75 which supplied the plasmid backbone and trp promoter-operator, the CAT gene, and the h~NP(102-126) gene and cleavage site, respectively.
1. Construction of ~ChNF109.
Plasmid pTrp233 was constructed by insertion of a synthetic EcoRI-PstI fragment containing the E. coli trp promoter-operator sequence, a ribosomal binding site, and downstream cloning sites into plasmid pKK~33-2-NdeI which contains strong transcription termination signals, TlT2, `and the beta-lactamase gene. The synthetic fragment (see Fig. 2A) was assembled using the method of Vlasuk et al (1986), J. Biol Chem 261: 4789-4796 and its sequence confirmed by the method of Sanger et al (1977), Proc Natl Acad Sci USA 74:5463-5467 in M13mp8 and M13mp9. Plasmid ;~ :
:, .. .,: . -~23162 pKK233-2-NdeI (disclosed in U.S. patent 4,764,504 dated 16 August 1988) was digested with EcoRI and PstI, its termini dephosphorylated using calf intestinal phosphatase, and ligated with the synthetic EcoRI-PstI fragment. Plasmid pTrp233 was isolated (Fig. 2B) from E. Coli JA221 transformed to ampicillin resistance.
Plasmid pCAT21 was constructed by insertion of the CAT gene (from transposon Tn9, Alton and Vapnek, (1979) Nature 282:864-869) into plasmid pTrp233 under the control of the ~ promoter-operator. Plasmid pAL13ATCAT (a plasmid disclosed in Canadian application Serial No. 576,975 filed 8 September 1988), was digested with NdeI and HindIII and the approximately 750 bp NdeI-HindIII fragment containing the CAT
gene (with the initiating Met residue encoded at the NdeI
site) was purified using agarose gel electrophoresis. The CAT gene was ligated with NdeI and HindIII digested pTrp233 using T4 DNA ligase. From E. Coli MC1061 (Casadaban et al (1980), I Mo] Biol 138: 179-209) ampicillin-resistant .
transformants, plasmid pCAT21 was isolated (Fig. 2C).
Plasmid phNF75 was constructed by insertion of a synthetic hANP gene preceded by a proteolytic cleavage site into plasmid pBgal (Shine et al (1980), Nature 285:456).
Eight oligodeoxyribonucleotides (Fig. 2D) were assembled into a synthetic hANP(102-126) gene preceded by an endoproteinase Glu-C cleavage site (method of Vlasuk et al (1986), supra).
The synthetic DNA fragment (with a 5' EcoRI tail and a 3' blunt end) was ligated with EcoRI and SmaI restriction endonuclease digested M13mpl9 using T4 DNA ligase for the purpose of DNA sequencing (method of Sanger e~ al (1977), ~ ) A clone with the correct sequence, M13-hNF7, was digested with BamHI and BglII, the fragment containing the hANP gene purified by agarose gel electrophoresis, and the fragment ligated with BamHI-~' -15- 132~16~
digested and bacterial alkaline phosphatase dephosphorylated pTrp233 using ~4 DNA ligase. A plasmid with the insert in the orientation which gives adjacent HindIII, BamHI and EcoRI sites at the 3' end of the hANP
gene, phNF73, was identified by the siæe of the fragments generated ~y digestion with HindIII and PvuII. Plasmid phNF73 was digested with EcoRI, the hANP gene purified using polyacrylamide gel electxophoresis, and the gene ligated with EcoRI-digested and bacterial alkaline phosphatase dephosphorylated plasmid pBgal. From E coli MC1061 ampicillin-resistant transformants, plasmid phNF75 ~Fig. 2E) was identified by the size of the DNA fragments generated by digestion with PstI and HindIII.
Expression vector pChNF109 was constructed by insertion of DNA fragments containing CAT, hANP and the proteolytic cleavage site, and a linker sequence into plasmid pTrp233. Plasmid phNF75 was digested with EcoRI
and HindIII, the approximately 80 bp EcoRI-HindIII frag-ment containing hANP was purified by polyacrylamide gel electrophoresis, and ligated with EcoRI- and HindIII-digested pTrp233 using T4 DMA ligase. From E. coli MC1061 ampicillin-resistant transformants, plasmid phNF87 was isolated and digested with BamHI and the fragments were dephosphorylated using bacterial alkaline phosphatase. A
BamHI cassette containing the trp promoter-operator, ribosomal binding site, and large amino terminal fragment of the CAT gene was generated by digesting pCAT21 with ScaI, attaching BamHI synthetic linkers (5'-CGGATCCG-3') to the blunt termini using T4 DNA ligase, digesting the ligation with BamHI and purification of the approximately 740 bp BamHI fragment by agarose gel electrophoresis. The BamHI cassette and plasmid phNF87 were ligated using T4 ligase and ampicillin-resistant transformants of E. coli MC161 obtained. Plasmid pChNF109 (Fig. 2F), with the BamHI cassette in the orientation such that ~he C~T gene is fused in-frame ~o the endoproteinase Glu-C cleavage ~3~0~&~, site followed by the hANP gene, was selected on the basis of DNA fragment size in an EcoRI digest of the plasmid.
2. Expression of C~T(1-210)-hANP(102-126) Hybrid Protein From Plasmid pC NF103.
Plasmid pChNF109 expresses a CAT-hANP(102-126) hybrid protein under the control of the E. coli ~
promoter-operator. The plasmid was used to transform E.
coli W3110 (ATCC Accession No. 27325) to ampicillin resistance and one colony was grown in culture overnight at 37C in complete M9 medium containing M9 salts, 2 mM
MgS04, 0.1 mM CaC12, 0.4% glucose, 0.5~ casamino acids, 40 ug/ml tryptophan, 2 ug/ml thiamine hydrochloride, and 100 ug/ml ampicillin sulfate. The overnight culture was diluted 100-fold into the same M9 medium described above (uninduced culture) and into M9 medium in which the tryptophan had been replaced by 25 ug/ml of 3 beta-indoleacrylic acid (induced culture).
Expression was assessed after shaking the cultures for 6 hr at 37C. The uninduced culture had reached a high cell density (stationary phase) and the induced culture was still at a low cell density (exponential phase). Phase-contrast microscopy revealed cells of normal morphology in the uninduced culture and elongated cells containing several refractile inclusion bodies in the induced culture. Total cell protein samples were prepared by boiling cell pellets in Laemmli buffer for 5 min and were analyzed by electrophoresis through a 12% SDS-polyacrylamide gel followed by staining of the protein with Coomassie Blue.
B. ExPression Vector pChNF121.
Expression vector pChNF121 encodes a 99 amino acid CAT-hANP hybrid protein containing an endoproteinase Glu-C proteolytic cleavage site (Fig. 4A). Approximately one-third of the CAT gene (amino acids 1-73) has been .
-17- ~ ~ r~ ~ ~ g ~
fused to the hANP(102~126) gene and proteolytic cleavage site (26 amino acids) without an intervening linker. The hANP polypeptide comprises 25% of the hybrid protein.
This vector was constructed from plasmids pChNF109 and phNF87 which supplied the amino terminal fragment of the CAT gene and the hANP gene and proteolytic cleavage site, respectively.
1. Construction of pChNF121.
Plasmid phNF87 was digèsted with EcoRI, its termini dephosphorylated with bacterial alkaline phosphatase, and ligated with an approximately 320 bp EcoRI fragment containing the trp promoter-operator, ribosome binding site, and amino-~erminus of the CAT gene. -This EcoRI cassette was purified from an EcoRI digest of PChNF109 using agarose gel electrophoresis. Plasmid pChNFl21 (Fig. 2G) was isolated from the ampicillin-resistant transformants of E. coli MC1061. On the basis of the si~e of the DNA fragments from a PvuII digest of the plasmid, the CAT and hANP genes were inferred to be fused in-frame to produce a hybrid protein.
2. Expression of CAT(1-73)-hANP(102-126) Hybrid Protein From Plasmid pChNF121.
Plasmid pChNF121 expresses a CAT-hANP(102-126) hybrid protein under the control of the E. coli trp promoter-operator. The plasmid was used to transform E.
coli W3110 (prototroph, 'rrpR~) to ampicillin resistance and one colony was grown in culture overnight at 37C in complete M9 medium (see Section A.2.). The overnight culture was diluted 100-fold into complete M9 medium (uninduced culture) and into M9 medium with 25 ug/ml 3-be~a-indole-acrylic acid xeplacing the 40 ug/ml tryp-tophan (induced culture).
Expression was assessed after shaking ~he cultures for 6 hr at 37C. The uninduced culture had ,~ ~ ' ' , .
-18- ~ 3 2 ~1 g2 reached a high cell density whereas the induced culture reached about one-third this density. Phase contrast microscopy revealed cells of normal morphology in the uninduced culture and elongated cells with several refractile inclusion bodies in the induced culture. Total cell protein samples were prepared by boiling cell pellets in Laemmli buffer for 5 min. and were analyzed by electrophoresis through a 12% SDS-polyacrylamide gel fol-lowed by staining of the protein with Coomassie Blue.
C. Ex~ression Vector pChNF142.
~ xpression ~ector pChNF142 encodes a 99 amino acid CAT-hANP hybrid protein containing a unique Trp residue following amino acid xesidue 73 of the CAT
protein, as a si~e for chemical cleavage. Approximately one-third of the CAT gene ~amino ac,ids 1-73) has been fused to the hANP(102-126) gene and chemical cleavage site t26 amino acids). This amino terminal fragment of CAT has been modified to substitute a Tyr residue for Trp~16] and a Ser residue for Cys[31] to remove the additional chemical cleavage site and reduce the multimerization of the hybrid protein through disulfide bridges. A synthetic hANP gene preceded by sequence encoding a Trp residue has been a~sembled for this vector.
1. Construction of pChNF142.
Plasmid pTrp233 was digested with EcoRI, its termini filled in with E. coli DNA polymerase I, Klenow fragment, and ligated with T4 DNA ligase (to remove the EcoRI restriction endonuclease cleavage site). From ampicillin-resistant transformants of ~. coli MC1061, plasmid pTrp81 was isolated and shown to resist cleavage by EcoRI. Plasmid pTrp81 was digested with NdeI and HindIII, purified by agarose gel electrophoresis, and ligated with a synthstic CAT gene ~ragment using T4 DNA
ligase. The synthetic NdeI-HindIII CAT gene fragment .~
, ~ --19--15~2~162 (Fig. 2H) was assembled from three pairs of oligo-deoxyribonucleotides as previously described. From ampicillin-resistant transformants of E. coli MC1061, plasmid pCAT127 was isolated and shown to contain the synthetic CAT fragment by digestion with EcoRI and AvaI.
The plasmid was digested with BamHI and HindIII, the BamHI-HindIII fragment containing CAT was purified by agarose gel electrophoresis, sequenced by the method of Sanger et al ~1977), supra, and the correct DNA sequence confirmed.
Plasmid pC~T127 was digested with EcoRI and HindIII and ligated using T4 DNA ligase with a pair of annealed synthetic oligodeoxyribonucleotides encoding hANP(102-126) preceded by a Trp residue on an EcoRI-HindIII DNA fragment. Plasmid pChNF142 (Fig. 2I) wasisolated from ampicillin-resistant transformants of E. coli MC1061. Insertion of the hANP gene was confirmed by the size of the DNA fragments in a BamHI and HindIII
digest of the plasmid. The sequence of -the hANP gene was confirmed from an EcoRI-ScaI agarose gel purified fragment from pChNF142.
2. Expression of CAT(1~73), T~r~l6l Ser[311-hANP(102-126) pChNF142.
~he expression of a modified CAT-hANP(102-126) hybrid protein is conducted in substantial accordance with the teaching of the previous examples A.2 and B.2.
II. E~ression of Chloramphenicol Acetyltransferase--Am~loid A4 Protein Insert (A4-751i~ Hybrid Proteins in Escherichla coli.
In the following examples high level expression of the 57 amino acid insert within the amyloid A4-751 protein was achieved by fusing a synthetic A~-751i gene to DN~ sequences encoding amino terminal fra~ments of CAT
under the control of the E. coli tryptophan promoter--20- ~32a~ 62 operator on a pBR322-derived plasmid. The synthetic A4-751i gene encodes amino acids 289-345 from amyloid A4-751 protein (Ponte et al (1988), Nature 331:525-527) preceded by a chemical cleavage site, Asn-Gly. Hydroxylamine cleavage of the hybrid protein between these two residues will yield the insert protein with a Gly residue at its amino terminus.
A. Expression Vector pCAPi132.
Expression vector pCAPil32 encodes a 132 amino acid CAT-A4751i hybrid protein containing a hydroxylamine cleavage si~e (Fig. 4A). Approximately the amino terminal third of the CAT gene (amino acids 1-73~ ha~ been joined in-frame to the A4-751i gene and clea~age ~ite (59 amino acids). The A4-751i protein comprises about 43~ of the hybrid protein. This vector was constructed from plasmids pTrp233 and pChNF121 and the synthetic A4-751i gene and cleavage site.
1. Construction of PcApil32.
Plasmid pTrp233 was digested with EcoRI and HindIII, purified by agarose gel electrophoresis, and ligated with the synthetic gene ~ncoding the A4-751i protein and cleavage site using T4 DNA ligase. The gene had been assembled from six oligodeoxyribonucleotides using previously described techniques and its sequence (Fig. 4A) co~firmed. Plasmid pAPil31 was isolated from ampicillin-resistant transformants of E. coli MCl061.
Insertion of the synthetic gene was confirmed by the size of the DNA fragments from a PvuI and BamHI digest of plasmid mini-prep DNA.
Plasmid pAPil31 was digested with EcoRI to lineari~e the vector and its termini dephosphorylated using bacterial alkaline phosphatase. Plasmid pChNF121 was digested with EcoRI and the approximately 320 bp ~coRI
fragmen~ containing the trp promoter-operator, ribosome ~ 3~0~
binding site, and amino terminus of the CAT gene ~amino acids 1-73) was purified by agarose gel electrophoresis.
This EcoRI cassette was ligated with the pAPil31 plasmid using T4 DNA ligase and ampicillin-resistant transformants s of MC1061 were obtained. On the basis of ~NA fxagment size in a PvuII digest of mini-prep plasmid DNA, plasmid pCAPil32 was isolated with an in-frame fusion of CAT and A4-751i sequences.
2. Expression of CAT( 1-73)-A4-751i Hybrid Protein From Plasmid pCAPil32.
Plasmid pCAPil32 expresses a CAT--A4-751i hybrid protein under the control of the E. coli ~ promoter-operator. The plasmid was used to transform E. coli W3110 to ampicillin resistance and one colony was grown in culture overnight at 37C in complete M9 medium. The overnight culture was diluted 100~fold into complete M9 medium which contains 40 ug/ml tryptophan (uninduced culture) and into complete M9 medium containing 25 ug/ml 3-beta-indoleacrylic acid instead of tryptophan (induced culture).
Expression was assessed after shaking the cultures for 6 hr at 37C. The uninduced culture had reached a high cell density whereas the induced culture was at a lower cell density. Phase contrast microscopy revealed cells of normal morphology in the uninduced culture and cells with ~pre-inclusion bodies~ in the induced culture. As used herein, "pre-inclusion bodies"
are defined as less refractile bodies which appear to convert in time to the more refractile "inclusion hodies~
as the hybrid protein accumulates in the cells. Total cell protein samples were prepared by boiling cell pellets in Laemmli buffer for 5 min and then analyæed by electrophoresis through a 12% SDS-polyacrylamide gel fol-lowed by staining with Coomassie Blue (Fig. 3~). ThisCAT(1-73)-A4-751i hybrid protein migrates between the : :
- --22~ ~ 32~6~
lysozyme (14,300 MW) and beta-lactoglobulin (18,400 M~7) protein standards on this gel. Vsing a Kontes fiber optic scanner and Hewlett-Packard Integrator to scan the gel, the hybrid protein was estimated to comprise about 7% of the total cell protein. This is a moderate expression le~el of E. coli but A4-751i comprises almost half of the hybrid protein.
To confirm the presence of A4-751i in the hybrid protein, Western blot analysis was carried out on an unstained 12% SDS-polyacrylamide gel o~ these protein samples. Protein was blotted to nitrocellulose and incubated with anti-A4-751i serum (prepared against a 16 amino acid synthetic pep~ide containing amino acids 11-26 of the 57 amino acid insert protein). After incubation with 125I-protein A (Amersham) the blot was placed on X-ray film at -70C for several days. The synthetic peptide anti-serum detected the hybrid protein as well as several other E. coli proteins.
B. Expression vector pCAPil36.
Expression vector pCAPil36 encodes a 274 amino acid CAT-A4-751i hybrid protein containing a hydroxylamine cleavage site. Most of the CAT gene (amino acids 1-210) has been joined in-frame to the A4-751i gene and cleavage site (59 amino acids) through a linker sequence (5 amino acids). The A4-751i polypeptide comprises about 21% of the hybrid protein. This vector was constructed from plasmids pAPil31 and pChNF109.
1. Construction of pCAPil36.
Plasmid pAPil31 was digested with EcoRI to linearize the vector and its termini dephosphorylated using bacterial alkaline phosphatase. From a partial EcoRI digest of pChNF109 an approximately 740 bp EcoRI
fragment containing the trp promoter-operator, the CAT
gene (amino acids 1 210), and linker sequence was purified .
.
' -23~
by agarose gel electrophoresis. This EcoRI cassette and vector pAPil31 were ligated using T4 DNA ligase and ampicillin-resistant transformants of E. coli MC1061 were isolated. From the size of DNA fragments in plasmid mini-preps digested with BamHI, plasmid pCAPil36 was isolatedwith the CAT gene and the synthetic A4-751i gene in-frame.
2. Expression of CAT(1-210)-A4-751i Hybrid Protein From Plasmid pCAPil36.
Plasmid pCAPil36 expresses a CAT-A4-751i hybrid protein under the control of the E. coli trp promoter-operator. The plasmid was used to transform E. coli W3110 to ampicillin resistance and one colony was grown in culture overnight at 37C in complete Mg medium. The overnight culture was diluted 100-fold into the same M9 medium (uninduced culture) and into M9 complete medium containing 25 ug/ml 3-beta-indoleacrylic acid instead of tryptophan (induced culture).
Expression was assessed after shaking the cultures for 5 hr at 37C. Both the uninduced and inducqd cultures reached high cell densities. Phase contrast microscopy revealed cells of noxmal morphology in the uninduced cultures and cells containing inclusion bodies or pre-inclusion bodies (50:50) in the induced cultures.
Total cell protein samples were prepared by boiling cell pellets in Laemmli buffer for 5 min and were analyzed by electrophoresis through a 12% SDS~polyacrylamide gel fol-lowed by staining with Coomassie Blue (Fig. 3A). This CAT-A4-751i hybrid protein migrates between the alpha-ch~motrypsinogen (25,700 MW) and ovalbumin (43,000 MW)protein standards on this gel. Using a Kontes fiber optic scanner and Hewlett-Packar~ In~egrator to scan the gel, the hybrid protein was estimated to comprises about 15% of total cell protein. This is moderately high level expres-sion for E. coli.
(*) Trademark .
,~:''"
.~
- -24- ~ 2 To confirm the presence of A4-751i ln the hybrid protein, Western blot analysis was carried out on an unstained 12% SDS-polyacrylamide gel of these protein samples. Using the method described above ~section II.
A.2.), the synthetic peptide anti-serum detected the hybrid protein as well as se~eral other E. coli proteins.
ssion of Chloramphenicol Acetyltransferase--Glucaqon-L e PePtide I ~-37) Hybrid Protein in Escherichia coli.
In the following example, high level expression of the 31 amino acid GLP-I(7-37) was achie~ed by fusing a synthetic GLP-I gene to DNA sequences encoding an amino terminal fragment of CAT under the control of the E. coli tryptophan promoter-operator on a pBR322-derived plasmid.
The synthetic gene encodes amino acids 7-37 of GLP-l (Mojsov et al (1987), J. Clin Invest 79:616-619) preceded by a Met residue. Treatment with cyanogen bromide releases the insulinotropic peptide.
A. Expression Vector PcGLpl39.
Expression vector pCGLP139 encodes a 105 amino acid CAT-GLP-I hybrid protein containing a cyanogen bromide clea~age site (Fig. 4B). Approximately the amino terminal third of the CAT gene (amino acids 1-73) has been joined in-frame to the GLP-l gene and cleavage site (32 amino acids). The GLP-I peptide comprises about 30~ of the hybrid protein. This vector was constructed from plasmids pTrp233 and pChNF109 and the synthetic GLP-I gene and cleavage site.
1. Construction_of pCGLP139.
Plasmid pTrp233 was digested with EcoRI and HindIII, purified by agarose gel electrophoresis, and ligated with the synthetic gene using T4 DNA ligase. The gene had been assembled from four oligodeoxyribo-25- 1~2~2 nucleotides and its sequence (Fig. 4B) confirmed. From ampicillin-resistant transformants of E. coli MC1061, plasmid pGLP138 was isolated. Insertion of the synthetic gene was confirmed by the failure of plasmid mini-prep DNA
to be cut by PstI.
Plasmid pGLP138 was digested with EcoRI to linearize the vector, its termini dephosphorylated using bacterial alkaline phosphatase, and ligated with the EcoRI
cassette from plasmid pChNF109 using T4 DNA ligase Plasmid pChNF109 had been digested with EcoRI and the ap-proximately 320 bp EcoRI fragment containing the trp promoter-oper~tor, ribosome binding site, and an amino terminal fragment of the CAT gene purified by agarose gel electrophoresis. Plasmid pCGLP139 was isolated from ampicillin-resistant transformants of MC1061. On the basis of DNA ragment size in an AvaI and PvuII digest of plasmid mini-prep DNA, the fusion of CAT and GLP-I
sequences was confirmed to be in-frame.
2. Expression of_CAT(1-73)-GLP-I(7-37) Hybrid Protei.n From Plasmid pCGLP139.
Plasmid pCGLP139 expresses a CAT-GLP-I hybrid protein under the control of the E. coli trp promoter-operator. The plasmid was used to transform E. coli W3110 to ampicillin resistance and one colony was grown in culture overnight at 37C in complete M9 medium. The overnight culture was diluted 100-fold into complete M9 medium which contains 40 ugtml tryptophan (uninduced culture) and into complete M9 medium in which 25 ug/ml 3-beta-indoleacrylic acid has been substituted for the tryptophan tinduced culture).
Expression was assessed after shaking the cultures for 6 hr at 37C. The uninduced culture had reached a high cell density whereas the induced cul~ure was at a lower cell density. Phase contrast microscopy revealed cells of normal morphology in the uninduced -26- 132~162 culture and elongated cells with three or more refractile inclusion bodies in the induced culture. Total cell protein samples were prepared by boiling cell pellets in Laemmli buffer for 5 min and were analyzed by electrophoresis through a 12% SDS-polyacrylamide gel fol-lowed by staining with Coomassie Blue (Fig. 3B). This CAT(1-73~-GLP-I(7-37) hybrid protein migrates between the bovine trypsin inhibitor (6,200 MW) and lysozyme (14,300 MW) protein standards. Using a Kontes fiber optic scanner and Hewlett-Packard Integrator to scan the gel, the hybrid protein was estimated to comprise about 20~ of the total cell protein. (Considering the number of inclusion bodies observed per cell, all of the hybrid protein may not have been solubilized in the Laemmli buffer, and this estimate may be low.) This is high level expression for E. coli.
The molecular weight of the hybrid protein is as predicted for this gene fusion. Amino acid composition analysis of the purified hybrid protein or protein sequencing of the peptide after cyanogen bromide cleavage can be performed to confirm its expression.
IV. CAT Fusion With Human SP-B and SP-C.
The mature forms of both human SP-C and SP-B are expressed as fusions with portions of bacterial CAT. The surfactant peptides are ~oined to the carboxy terminus of the CAT sequences through a hydroxylamine-sensitive asparagine-glycine linkage. The CAT-surfactant fusions are expressed from the tryptophan promoter of the bacte-rial vector pTrp233.
A. Expression Vector pC210SP-B.
SP-B expression vector pC210SP-B encodes a fu-sion protein of 293 residues in which 210 amino acids of CAT are joined to the 76 amino acids of SP-B through a linker of 7 amino acids containing ~he hydroxylamlne-sensitive cleavage site. Cleavage of the fusion with -27- ~ 3~162 hydroxylamine releases a 77 amino acid SP-B product containing the 76 residue mature form of SP-B, plus an amino-terminal glycine residue.
To construct pC210SP-B, the short EcoRI-HindIII
segment containing ANF sequences was removed from pChNF109, and replaced by a portion of human SP-B cDNA #3 extending from the PstI site at nucleotide (nt) 643 (Fig.
6) to the SphI site at nt 804. The EcoRI site wa~ joined at the PstI site through two complementary oligonucleotides encoding the hydroxylamine sensitive cleavage site as well as the amino-terminal residues of mature SP-B (oligo ~2307: 5 ' -AAT TCA ACG GTT TCC CCA TTC
CTC TCC CCT ATT GCT GGC TCT GCA-3' and oligo #2308: 5'-GAC
CCA GCA ATA GGG GAG AGG AAT GGG GAA ACC GTT G-3'). The lS ~I site was joined to the HindIII site of PTrp233 through a second se~ of complementary nucleotides encoding the carboxy~terminal residues of ma~ure SP-B (oligo #3313:
5'-ACC TTA CCG GAG GAC GAG GCG GCA GAC CAG CTG GGG CAG CAT
G-3' and oligo #3314: 5'-CTG CCC CAG CTG GTC TGC CGC CTC
GTC CTC CGG TA-3').
The expression plasmid was used to trans~orm E.
c _ stain W3110 to ampicillin resistance. Rapidly grow-ing cultures of pC210SP-B/W3110 in M9 medium were made 25 ug/ml IAA (3-beta indoleacrylate, Sigma I-1625) to induce the Trp promoter. By 1 hr after induction, refractile cytoplasmic inclusion bodies were seen by phase contrast microscopy inside the still-growing cells. 5 hr after induction, the equivalent of 1 O.D.550 of cells were pelleted by centrifugation, then boiled for 5 min in SDS
sample buffer for electrophoresis in a 12~ SDS-polyacrylamide gel followed by staining with Coomassie Blue (Fig. 7). Lane A = molecular size standards; Lane B
= induced W3110 cells containing pTrp233 vector control;
and Lane C = induced pC210SP-BJW3110. The predicted molecular weight of the CAT:SP-B ~usion protein is 45,000 daltons. The hybrid CAT:SP-B protein was es~imated to - -28- ~3~62 comprise 15-20~ of the total cell protein in the induced cultures.
B. CAT Fusions with SP-C.
A series of vectors wer0 constructed encoding fusion proteins in which mature human SP-C was fused to the carboxy termini of different portions of CAT through a hydroxylamine-sensitive asparagine-glycine linkage.
~Iydroxylamine cleavage of the fusion protein produced by each cons~ruct releases a mature SP-C of 35 amino acids which lacks the amino-terminal phenylalanine residue seen in a portion of natural human SP-C.
1. pC210SP-C.
The amino acid sequence of the 251 residue fu-sion protein encoded plasmid pC210SP-C. The 210 amino acids of CAT are joined to 35 amino acids of mature SP-C
through a linker of 6 amino acids. The mature SP-C
portion of the total fusion protein comprises 14%.
In Fig. 8 is shown the nucleotide sequence of pC210$P-C, in which th~ EcoRI-HindIII fragment of pC210SP-B containing SP-B sequences has been replaced by a segment of human SP-C cDNA #18 extending from the A~LI site at nucleotide 123 to the AvaII site at nucleotide 161. The EcoRI site of the CAT vector was joined to the SP5 ApaLI
site through two complementary oligonucleotides encoding the hydroxylamine sensitive cleavage site as well as the amino-terminal residues of mature SP-C (oligo #2462: 5'-AAT TCA ACG GCA TTC CCT GCT GCC CAG-3' and oligo #2463:
5'-TGC ACT GGG CAG CAG GGA ATG CCG TTG-3'). The A~aII
site of SP-C was joined to the HindIII site of pC210SP-B
through a second set of complementary nucleo~ides encoding the carboxy-terminal residues of mature SP-C and a stop codon (oligo #2871: S'-AGC TTA GTG GAG ACC CAT GA& CAG GGC
3S TCG CAC AAT CAC C~C GAC ~AT GAG-3' and oliqo #2372: 5'-GTC
.
, '' ' ' . -29-~3~al62 CTC ATC GTC GTG GTG ATT GTG GGA GCC CTG CTC ATG GGT CTC
CAC TA-3').
2~ ~C179SP-C.
The amino acid sequence of the 217 residue fu-sion protein encoded by pC179SP-C is a slight modification of the sequence shown in Fig. 8. In pC179SP-C, the 179 amino acids of CAT are joined to 35 amino acids of mature SP-C through a linker of 3 amino acids (Glu, Phe, Asn).
SP-C portion of the total fusion protein comprises 16%.
To construct pC179SP-C, a portion of the CA~
sequence was removed from pC210SP-C. S~arting with pC210SP-C, a DNA fragment extending from the Ncol site at nt 603 (Fig. 8) to the EcoRI site at nt 728 was removed, and the NcoI and EcoRI cohesi~e ends were rejoined with two complementary oligonucleotides (oligo #3083s 5'~CAT
GGG CAA ATA T~A TAC GCA AG-3' and oligo #3084: 5'-AAT TCT
TGC GTA TAA TAT TTG CC-3'). In effect, 31 residues of CAT, and 3 residues of the linker polypeptide are missing in the new fusion protein encoded by vector pC179SP-C.
3. pC14gSP-C.
The amino acid sequence of the 187 residue fu-sion protein encoded by pC149S~-C is a slight modification of the sequence shown in Fig. 8. In plasmid pC149SP-C, the 149 amino acids of CAT are ~oined to 35 amino acids of mature SP~C through a linker of 3 amino acids (Glu, Phe, Asn). The SP-C portion of the total fusion protein comprises 18.7%.
To construct pC1495P-C, a portion of the CAT
segment of pC210SP-C extending from the DdeI site at nt 523 (Fig. 8) to the EcoRI site at nt 728 was removed and replaced by a~set of two complementary oligonucleotides (oligo #3082: 5' TCA GCC:AAT CCC:G-3' oliqo #3081: 5'-AAT
35~ TCG GGA TTG GC-3').
:: :
,. .
, .
_30_ ~3 233 1 ~2 4. pC106SP-C.
The amino acid sequence of the 144 residue fu-sion protein encoded by pC106SP-C is a slight modification of the sequence shown in Fig. 8. In plasmid pC106SP-C, the 106 amino acids of CAT are joined to 35 amino acids of mature SP-C through a linker of 3 amino acids (Glu, Phe, Asn). The SP-C portion of the total fusion protein comprises 24%.
pCl06SP-C was constructed by replacing the EcoRI
fragment of pC210SP-C (nt 302 to nt 728, Fig. 8) with two sets of complementary oligos which were annealed, then ligated together through a region of homology (oligo #3079: 5 '-AAT TCC GTA TGG CAA TGA AAG ACG GTG AGC TGG TGA
TAT GGG ATA GTG TTC ACC CTT GT-3' was annealed with oligo 15 #3085: 5'-ACA CTA TCC CAT ATC ACC P,GC TCA CCG TCT TTC ATT
GCC ATA CGG-3'; oligo #3080: 5 ' -TAC ACC GTT TTC CAT GAG
CAA ACT GAA ACG TTT TCA TCG CTC TGG G-3' was annealed with oligo #3078: 5~-AAT TCC CAG AGC GAT GAA AAC GTT TCA GTT
TGC TCA TGG AAA ACG GTG TAA CAA GG& TGA-3').
Plasmid pChNF109 expresses a CAT-hANP(102-126) hybrid protein under the control of the E. coli ~
promoter-operator. The plasmid was used to transform E.
coli W3110 (ATCC Accession No. 27325) to ampicillin resistance and one colony was grown in culture overnight at 37C in complete M9 medium containing M9 salts, 2 mM
MgS04, 0.1 mM CaC12, 0.4% glucose, 0.5~ casamino acids, 40 ug/ml tryptophan, 2 ug/ml thiamine hydrochloride, and 100 ug/ml ampicillin sulfate. The overnight culture was diluted 100-fold into the same M9 medium described above (uninduced culture) and into M9 medium in which the tryptophan had been replaced by 25 ug/ml of 3 beta-indoleacrylic acid (induced culture).
Expression was assessed after shaking the cultures for 6 hr at 37C. The uninduced culture had reached a high cell density (stationary phase) and the induced culture was still at a low cell density (exponential phase). Phase-contrast microscopy revealed cells of normal morphology in the uninduced culture and elongated cells containing several refractile inclusion bodies in the induced culture. Total cell protein samples were prepared by boiling cell pellets in Laemmli buffer for 5 min and were analyzed by electrophoresis through a 12% SDS-polyacrylamide gel followed by staining of the protein with Coomassie Blue.
B. ExPression Vector pChNF121.
Expression vector pChNF121 encodes a 99 amino acid CAT-hANP hybrid protein containing an endoproteinase Glu-C proteolytic cleavage site (Fig. 4A). Approximately one-third of the CAT gene (amino acids 1-73) has been .
-17- ~ ~ r~ ~ ~ g ~
fused to the hANP(102~126) gene and proteolytic cleavage site (26 amino acids) without an intervening linker. The hANP polypeptide comprises 25% of the hybrid protein.
This vector was constructed from plasmids pChNF109 and phNF87 which supplied the amino terminal fragment of the CAT gene and the hANP gene and proteolytic cleavage site, respectively.
1. Construction of pChNF121.
Plasmid phNF87 was digèsted with EcoRI, its termini dephosphorylated with bacterial alkaline phosphatase, and ligated with an approximately 320 bp EcoRI fragment containing the trp promoter-operator, ribosome binding site, and amino-~erminus of the CAT gene. -This EcoRI cassette was purified from an EcoRI digest of PChNF109 using agarose gel electrophoresis. Plasmid pChNFl21 (Fig. 2G) was isolated from the ampicillin-resistant transformants of E. coli MC1061. On the basis of the si~e of the DNA fragments from a PvuII digest of the plasmid, the CAT and hANP genes were inferred to be fused in-frame to produce a hybrid protein.
2. Expression of CAT(1-73)-hANP(102-126) Hybrid Protein From Plasmid pChNF121.
Plasmid pChNF121 expresses a CAT-hANP(102-126) hybrid protein under the control of the E. coli trp promoter-operator. The plasmid was used to transform E.
coli W3110 (prototroph, 'rrpR~) to ampicillin resistance and one colony was grown in culture overnight at 37C in complete M9 medium (see Section A.2.). The overnight culture was diluted 100-fold into complete M9 medium (uninduced culture) and into M9 medium with 25 ug/ml 3-be~a-indole-acrylic acid xeplacing the 40 ug/ml tryp-tophan (induced culture).
Expression was assessed after shaking ~he cultures for 6 hr at 37C. The uninduced culture had ,~ ~ ' ' , .
-18- ~ 3 2 ~1 g2 reached a high cell density whereas the induced culture reached about one-third this density. Phase contrast microscopy revealed cells of normal morphology in the uninduced culture and elongated cells with several refractile inclusion bodies in the induced culture. Total cell protein samples were prepared by boiling cell pellets in Laemmli buffer for 5 min. and were analyzed by electrophoresis through a 12% SDS-polyacrylamide gel fol-lowed by staining of the protein with Coomassie Blue.
C. Ex~ression Vector pChNF142.
~ xpression ~ector pChNF142 encodes a 99 amino acid CAT-hANP hybrid protein containing a unique Trp residue following amino acid xesidue 73 of the CAT
protein, as a si~e for chemical cleavage. Approximately one-third of the CAT gene ~amino ac,ids 1-73) has been fused to the hANP(102-126) gene and chemical cleavage site t26 amino acids). This amino terminal fragment of CAT has been modified to substitute a Tyr residue for Trp~16] and a Ser residue for Cys[31] to remove the additional chemical cleavage site and reduce the multimerization of the hybrid protein through disulfide bridges. A synthetic hANP gene preceded by sequence encoding a Trp residue has been a~sembled for this vector.
1. Construction of pChNF142.
Plasmid pTrp233 was digested with EcoRI, its termini filled in with E. coli DNA polymerase I, Klenow fragment, and ligated with T4 DNA ligase (to remove the EcoRI restriction endonuclease cleavage site). From ampicillin-resistant transformants of ~. coli MC1061, plasmid pTrp81 was isolated and shown to resist cleavage by EcoRI. Plasmid pTrp81 was digested with NdeI and HindIII, purified by agarose gel electrophoresis, and ligated with a synthstic CAT gene ~ragment using T4 DNA
ligase. The synthetic NdeI-HindIII CAT gene fragment .~
, ~ --19--15~2~162 (Fig. 2H) was assembled from three pairs of oligo-deoxyribonucleotides as previously described. From ampicillin-resistant transformants of E. coli MC1061, plasmid pCAT127 was isolated and shown to contain the synthetic CAT fragment by digestion with EcoRI and AvaI.
The plasmid was digested with BamHI and HindIII, the BamHI-HindIII fragment containing CAT was purified by agarose gel electrophoresis, sequenced by the method of Sanger et al ~1977), supra, and the correct DNA sequence confirmed.
Plasmid pC~T127 was digested with EcoRI and HindIII and ligated using T4 DNA ligase with a pair of annealed synthetic oligodeoxyribonucleotides encoding hANP(102-126) preceded by a Trp residue on an EcoRI-HindIII DNA fragment. Plasmid pChNF142 (Fig. 2I) wasisolated from ampicillin-resistant transformants of E. coli MC1061. Insertion of the hANP gene was confirmed by the size of the DNA fragments in a BamHI and HindIII
digest of the plasmid. The sequence of -the hANP gene was confirmed from an EcoRI-ScaI agarose gel purified fragment from pChNF142.
2. Expression of CAT(1~73), T~r~l6l Ser[311-hANP(102-126) pChNF142.
~he expression of a modified CAT-hANP(102-126) hybrid protein is conducted in substantial accordance with the teaching of the previous examples A.2 and B.2.
II. E~ression of Chloramphenicol Acetyltransferase--Am~loid A4 Protein Insert (A4-751i~ Hybrid Proteins in Escherichla coli.
In the following examples high level expression of the 57 amino acid insert within the amyloid A4-751 protein was achieved by fusing a synthetic A~-751i gene to DN~ sequences encoding amino terminal fra~ments of CAT
under the control of the E. coli tryptophan promoter--20- ~32a~ 62 operator on a pBR322-derived plasmid. The synthetic A4-751i gene encodes amino acids 289-345 from amyloid A4-751 protein (Ponte et al (1988), Nature 331:525-527) preceded by a chemical cleavage site, Asn-Gly. Hydroxylamine cleavage of the hybrid protein between these two residues will yield the insert protein with a Gly residue at its amino terminus.
A. Expression Vector pCAPi132.
Expression vector pCAPil32 encodes a 132 amino acid CAT-A4751i hybrid protein containing a hydroxylamine cleavage si~e (Fig. 4A). Approximately the amino terminal third of the CAT gene (amino acids 1-73~ ha~ been joined in-frame to the A4-751i gene and clea~age ~ite (59 amino acids). The A4-751i protein comprises about 43~ of the hybrid protein. This vector was constructed from plasmids pTrp233 and pChNF121 and the synthetic A4-751i gene and cleavage site.
1. Construction of PcApil32.
Plasmid pTrp233 was digested with EcoRI and HindIII, purified by agarose gel electrophoresis, and ligated with the synthetic gene ~ncoding the A4-751i protein and cleavage site using T4 DNA ligase. The gene had been assembled from six oligodeoxyribonucleotides using previously described techniques and its sequence (Fig. 4A) co~firmed. Plasmid pAPil31 was isolated from ampicillin-resistant transformants of E. coli MCl061.
Insertion of the synthetic gene was confirmed by the size of the DNA fragments from a PvuI and BamHI digest of plasmid mini-prep DNA.
Plasmid pAPil31 was digested with EcoRI to lineari~e the vector and its termini dephosphorylated using bacterial alkaline phosphatase. Plasmid pChNF121 was digested with EcoRI and the approximately 320 bp ~coRI
fragmen~ containing the trp promoter-operator, ribosome ~ 3~0~
binding site, and amino terminus of the CAT gene ~amino acids 1-73) was purified by agarose gel electrophoresis.
This EcoRI cassette was ligated with the pAPil31 plasmid using T4 DNA ligase and ampicillin-resistant transformants s of MC1061 were obtained. On the basis of ~NA fxagment size in a PvuII digest of mini-prep plasmid DNA, plasmid pCAPil32 was isolated with an in-frame fusion of CAT and A4-751i sequences.
2. Expression of CAT( 1-73)-A4-751i Hybrid Protein From Plasmid pCAPil32.
Plasmid pCAPil32 expresses a CAT--A4-751i hybrid protein under the control of the E. coli ~ promoter-operator. The plasmid was used to transform E. coli W3110 to ampicillin resistance and one colony was grown in culture overnight at 37C in complete M9 medium. The overnight culture was diluted 100~fold into complete M9 medium which contains 40 ug/ml tryptophan (uninduced culture) and into complete M9 medium containing 25 ug/ml 3-beta-indoleacrylic acid instead of tryptophan (induced culture).
Expression was assessed after shaking the cultures for 6 hr at 37C. The uninduced culture had reached a high cell density whereas the induced culture was at a lower cell density. Phase contrast microscopy revealed cells of normal morphology in the uninduced culture and cells with ~pre-inclusion bodies~ in the induced culture. As used herein, "pre-inclusion bodies"
are defined as less refractile bodies which appear to convert in time to the more refractile "inclusion hodies~
as the hybrid protein accumulates in the cells. Total cell protein samples were prepared by boiling cell pellets in Laemmli buffer for 5 min and then analyæed by electrophoresis through a 12% SDS-polyacrylamide gel fol-lowed by staining with Coomassie Blue (Fig. 3~). ThisCAT(1-73)-A4-751i hybrid protein migrates between the : :
- --22~ ~ 32~6~
lysozyme (14,300 MW) and beta-lactoglobulin (18,400 M~7) protein standards on this gel. Vsing a Kontes fiber optic scanner and Hewlett-Packard Integrator to scan the gel, the hybrid protein was estimated to comprise about 7% of the total cell protein. This is a moderate expression le~el of E. coli but A4-751i comprises almost half of the hybrid protein.
To confirm the presence of A4-751i in the hybrid protein, Western blot analysis was carried out on an unstained 12% SDS-polyacrylamide gel o~ these protein samples. Protein was blotted to nitrocellulose and incubated with anti-A4-751i serum (prepared against a 16 amino acid synthetic pep~ide containing amino acids 11-26 of the 57 amino acid insert protein). After incubation with 125I-protein A (Amersham) the blot was placed on X-ray film at -70C for several days. The synthetic peptide anti-serum detected the hybrid protein as well as several other E. coli proteins.
B. Expression vector pCAPil36.
Expression vector pCAPil36 encodes a 274 amino acid CAT-A4-751i hybrid protein containing a hydroxylamine cleavage site. Most of the CAT gene (amino acids 1-210) has been joined in-frame to the A4-751i gene and cleavage site (59 amino acids) through a linker sequence (5 amino acids). The A4-751i polypeptide comprises about 21% of the hybrid protein. This vector was constructed from plasmids pAPil31 and pChNF109.
1. Construction of pCAPil36.
Plasmid pAPil31 was digested with EcoRI to linearize the vector and its termini dephosphorylated using bacterial alkaline phosphatase. From a partial EcoRI digest of pChNF109 an approximately 740 bp EcoRI
fragment containing the trp promoter-operator, the CAT
gene (amino acids 1 210), and linker sequence was purified .
.
' -23~
by agarose gel electrophoresis. This EcoRI cassette and vector pAPil31 were ligated using T4 DNA ligase and ampicillin-resistant transformants of E. coli MC1061 were isolated. From the size of DNA fragments in plasmid mini-preps digested with BamHI, plasmid pCAPil36 was isolatedwith the CAT gene and the synthetic A4-751i gene in-frame.
2. Expression of CAT(1-210)-A4-751i Hybrid Protein From Plasmid pCAPil36.
Plasmid pCAPil36 expresses a CAT-A4-751i hybrid protein under the control of the E. coli trp promoter-operator. The plasmid was used to transform E. coli W3110 to ampicillin resistance and one colony was grown in culture overnight at 37C in complete Mg medium. The overnight culture was diluted 100-fold into the same M9 medium (uninduced culture) and into M9 complete medium containing 25 ug/ml 3-beta-indoleacrylic acid instead of tryptophan (induced culture).
Expression was assessed after shaking the cultures for 5 hr at 37C. Both the uninduced and inducqd cultures reached high cell densities. Phase contrast microscopy revealed cells of noxmal morphology in the uninduced cultures and cells containing inclusion bodies or pre-inclusion bodies (50:50) in the induced cultures.
Total cell protein samples were prepared by boiling cell pellets in Laemmli buffer for 5 min and were analyzed by electrophoresis through a 12% SDS~polyacrylamide gel fol-lowed by staining with Coomassie Blue (Fig. 3A). This CAT-A4-751i hybrid protein migrates between the alpha-ch~motrypsinogen (25,700 MW) and ovalbumin (43,000 MW)protein standards on this gel. Using a Kontes fiber optic scanner and Hewlett-Packar~ In~egrator to scan the gel, the hybrid protein was estimated to comprises about 15% of total cell protein. This is moderately high level expres-sion for E. coli.
(*) Trademark .
,~:''"
.~
- -24- ~ 2 To confirm the presence of A4-751i ln the hybrid protein, Western blot analysis was carried out on an unstained 12% SDS-polyacrylamide gel of these protein samples. Using the method described above ~section II.
A.2.), the synthetic peptide anti-serum detected the hybrid protein as well as se~eral other E. coli proteins.
ssion of Chloramphenicol Acetyltransferase--Glucaqon-L e PePtide I ~-37) Hybrid Protein in Escherichia coli.
In the following example, high level expression of the 31 amino acid GLP-I(7-37) was achie~ed by fusing a synthetic GLP-I gene to DNA sequences encoding an amino terminal fragment of CAT under the control of the E. coli tryptophan promoter-operator on a pBR322-derived plasmid.
The synthetic gene encodes amino acids 7-37 of GLP-l (Mojsov et al (1987), J. Clin Invest 79:616-619) preceded by a Met residue. Treatment with cyanogen bromide releases the insulinotropic peptide.
A. Expression Vector PcGLpl39.
Expression vector pCGLP139 encodes a 105 amino acid CAT-GLP-I hybrid protein containing a cyanogen bromide clea~age site (Fig. 4B). Approximately the amino terminal third of the CAT gene (amino acids 1-73) has been joined in-frame to the GLP-l gene and cleavage site (32 amino acids). The GLP-I peptide comprises about 30~ of the hybrid protein. This vector was constructed from plasmids pTrp233 and pChNF109 and the synthetic GLP-I gene and cleavage site.
1. Construction_of pCGLP139.
Plasmid pTrp233 was digested with EcoRI and HindIII, purified by agarose gel electrophoresis, and ligated with the synthetic gene using T4 DNA ligase. The gene had been assembled from four oligodeoxyribo-25- 1~2~2 nucleotides and its sequence (Fig. 4B) confirmed. From ampicillin-resistant transformants of E. coli MC1061, plasmid pGLP138 was isolated. Insertion of the synthetic gene was confirmed by the failure of plasmid mini-prep DNA
to be cut by PstI.
Plasmid pGLP138 was digested with EcoRI to linearize the vector, its termini dephosphorylated using bacterial alkaline phosphatase, and ligated with the EcoRI
cassette from plasmid pChNF109 using T4 DNA ligase Plasmid pChNF109 had been digested with EcoRI and the ap-proximately 320 bp EcoRI fragment containing the trp promoter-oper~tor, ribosome binding site, and an amino terminal fragment of the CAT gene purified by agarose gel electrophoresis. Plasmid pCGLP139 was isolated from ampicillin-resistant transformants of MC1061. On the basis of DNA ragment size in an AvaI and PvuII digest of plasmid mini-prep DNA, the fusion of CAT and GLP-I
sequences was confirmed to be in-frame.
2. Expression of_CAT(1-73)-GLP-I(7-37) Hybrid Protei.n From Plasmid pCGLP139.
Plasmid pCGLP139 expresses a CAT-GLP-I hybrid protein under the control of the E. coli trp promoter-operator. The plasmid was used to transform E. coli W3110 to ampicillin resistance and one colony was grown in culture overnight at 37C in complete M9 medium. The overnight culture was diluted 100-fold into complete M9 medium which contains 40 ugtml tryptophan (uninduced culture) and into complete M9 medium in which 25 ug/ml 3-beta-indoleacrylic acid has been substituted for the tryptophan tinduced culture).
Expression was assessed after shaking the cultures for 6 hr at 37C. The uninduced culture had reached a high cell density whereas the induced cul~ure was at a lower cell density. Phase contrast microscopy revealed cells of normal morphology in the uninduced -26- 132~162 culture and elongated cells with three or more refractile inclusion bodies in the induced culture. Total cell protein samples were prepared by boiling cell pellets in Laemmli buffer for 5 min and were analyzed by electrophoresis through a 12% SDS-polyacrylamide gel fol-lowed by staining with Coomassie Blue (Fig. 3B). This CAT(1-73~-GLP-I(7-37) hybrid protein migrates between the bovine trypsin inhibitor (6,200 MW) and lysozyme (14,300 MW) protein standards. Using a Kontes fiber optic scanner and Hewlett-Packard Integrator to scan the gel, the hybrid protein was estimated to comprise about 20~ of the total cell protein. (Considering the number of inclusion bodies observed per cell, all of the hybrid protein may not have been solubilized in the Laemmli buffer, and this estimate may be low.) This is high level expression for E. coli.
The molecular weight of the hybrid protein is as predicted for this gene fusion. Amino acid composition analysis of the purified hybrid protein or protein sequencing of the peptide after cyanogen bromide cleavage can be performed to confirm its expression.
IV. CAT Fusion With Human SP-B and SP-C.
The mature forms of both human SP-C and SP-B are expressed as fusions with portions of bacterial CAT. The surfactant peptides are ~oined to the carboxy terminus of the CAT sequences through a hydroxylamine-sensitive asparagine-glycine linkage. The CAT-surfactant fusions are expressed from the tryptophan promoter of the bacte-rial vector pTrp233.
A. Expression Vector pC210SP-B.
SP-B expression vector pC210SP-B encodes a fu-sion protein of 293 residues in which 210 amino acids of CAT are joined to the 76 amino acids of SP-B through a linker of 7 amino acids containing ~he hydroxylamlne-sensitive cleavage site. Cleavage of the fusion with -27- ~ 3~162 hydroxylamine releases a 77 amino acid SP-B product containing the 76 residue mature form of SP-B, plus an amino-terminal glycine residue.
To construct pC210SP-B, the short EcoRI-HindIII
segment containing ANF sequences was removed from pChNF109, and replaced by a portion of human SP-B cDNA #3 extending from the PstI site at nucleotide (nt) 643 (Fig.
6) to the SphI site at nt 804. The EcoRI site wa~ joined at the PstI site through two complementary oligonucleotides encoding the hydroxylamine sensitive cleavage site as well as the amino-terminal residues of mature SP-B (oligo ~2307: 5 ' -AAT TCA ACG GTT TCC CCA TTC
CTC TCC CCT ATT GCT GGC TCT GCA-3' and oligo #2308: 5'-GAC
CCA GCA ATA GGG GAG AGG AAT GGG GAA ACC GTT G-3'). The lS ~I site was joined to the HindIII site of PTrp233 through a second se~ of complementary nucleotides encoding the carboxy~terminal residues of ma~ure SP-B (oligo #3313:
5'-ACC TTA CCG GAG GAC GAG GCG GCA GAC CAG CTG GGG CAG CAT
G-3' and oligo #3314: 5'-CTG CCC CAG CTG GTC TGC CGC CTC
GTC CTC CGG TA-3').
The expression plasmid was used to trans~orm E.
c _ stain W3110 to ampicillin resistance. Rapidly grow-ing cultures of pC210SP-B/W3110 in M9 medium were made 25 ug/ml IAA (3-beta indoleacrylate, Sigma I-1625) to induce the Trp promoter. By 1 hr after induction, refractile cytoplasmic inclusion bodies were seen by phase contrast microscopy inside the still-growing cells. 5 hr after induction, the equivalent of 1 O.D.550 of cells were pelleted by centrifugation, then boiled for 5 min in SDS
sample buffer for electrophoresis in a 12~ SDS-polyacrylamide gel followed by staining with Coomassie Blue (Fig. 7). Lane A = molecular size standards; Lane B
= induced W3110 cells containing pTrp233 vector control;
and Lane C = induced pC210SP-BJW3110. The predicted molecular weight of the CAT:SP-B ~usion protein is 45,000 daltons. The hybrid CAT:SP-B protein was es~imated to - -28- ~3~62 comprise 15-20~ of the total cell protein in the induced cultures.
B. CAT Fusions with SP-C.
A series of vectors wer0 constructed encoding fusion proteins in which mature human SP-C was fused to the carboxy termini of different portions of CAT through a hydroxylamine-sensitive asparagine-glycine linkage.
~Iydroxylamine cleavage of the fusion protein produced by each cons~ruct releases a mature SP-C of 35 amino acids which lacks the amino-terminal phenylalanine residue seen in a portion of natural human SP-C.
1. pC210SP-C.
The amino acid sequence of the 251 residue fu-sion protein encoded plasmid pC210SP-C. The 210 amino acids of CAT are joined to 35 amino acids of mature SP-C
through a linker of 6 amino acids. The mature SP-C
portion of the total fusion protein comprises 14%.
In Fig. 8 is shown the nucleotide sequence of pC210$P-C, in which th~ EcoRI-HindIII fragment of pC210SP-B containing SP-B sequences has been replaced by a segment of human SP-C cDNA #18 extending from the A~LI site at nucleotide 123 to the AvaII site at nucleotide 161. The EcoRI site of the CAT vector was joined to the SP5 ApaLI
site through two complementary oligonucleotides encoding the hydroxylamine sensitive cleavage site as well as the amino-terminal residues of mature SP-C (oligo #2462: 5'-AAT TCA ACG GCA TTC CCT GCT GCC CAG-3' and oligo #2463:
5'-TGC ACT GGG CAG CAG GGA ATG CCG TTG-3'). The A~aII
site of SP-C was joined to the HindIII site of pC210SP-B
through a second set of complementary nucleo~ides encoding the carboxy-terminal residues of mature SP-C and a stop codon (oligo #2871: S'-AGC TTA GTG GAG ACC CAT GA& CAG GGC
3S TCG CAC AAT CAC C~C GAC ~AT GAG-3' and oliqo #2372: 5'-GTC
.
, '' ' ' . -29-~3~al62 CTC ATC GTC GTG GTG ATT GTG GGA GCC CTG CTC ATG GGT CTC
CAC TA-3').
2~ ~C179SP-C.
The amino acid sequence of the 217 residue fu-sion protein encoded by pC179SP-C is a slight modification of the sequence shown in Fig. 8. In pC179SP-C, the 179 amino acids of CAT are joined to 35 amino acids of mature SP-C through a linker of 3 amino acids (Glu, Phe, Asn).
SP-C portion of the total fusion protein comprises 16%.
To construct pC179SP-C, a portion of the CA~
sequence was removed from pC210SP-C. S~arting with pC210SP-C, a DNA fragment extending from the Ncol site at nt 603 (Fig. 8) to the EcoRI site at nt 728 was removed, and the NcoI and EcoRI cohesi~e ends were rejoined with two complementary oligonucleotides (oligo #3083s 5'~CAT
GGG CAA ATA T~A TAC GCA AG-3' and oligo #3084: 5'-AAT TCT
TGC GTA TAA TAT TTG CC-3'). In effect, 31 residues of CAT, and 3 residues of the linker polypeptide are missing in the new fusion protein encoded by vector pC179SP-C.
3. pC14gSP-C.
The amino acid sequence of the 187 residue fu-sion protein encoded by pC149S~-C is a slight modification of the sequence shown in Fig. 8. In plasmid pC149SP-C, the 149 amino acids of CAT are ~oined to 35 amino acids of mature SP~C through a linker of 3 amino acids (Glu, Phe, Asn). The SP-C portion of the total fusion protein comprises 18.7%.
To construct pC1495P-C, a portion of the CAT
segment of pC210SP-C extending from the DdeI site at nt 523 (Fig. 8) to the EcoRI site at nt 728 was removed and replaced by a~set of two complementary oligonucleotides (oligo #3082: 5' TCA GCC:AAT CCC:G-3' oliqo #3081: 5'-AAT
35~ TCG GGA TTG GC-3').
:: :
,. .
, .
_30_ ~3 233 1 ~2 4. pC106SP-C.
The amino acid sequence of the 144 residue fu-sion protein encoded by pC106SP-C is a slight modification of the sequence shown in Fig. 8. In plasmid pC106SP-C, the 106 amino acids of CAT are joined to 35 amino acids of mature SP-C through a linker of 3 amino acids (Glu, Phe, Asn). The SP-C portion of the total fusion protein comprises 24%.
pCl06SP-C was constructed by replacing the EcoRI
fragment of pC210SP-C (nt 302 to nt 728, Fig. 8) with two sets of complementary oligos which were annealed, then ligated together through a region of homology (oligo #3079: 5 '-AAT TCC GTA TGG CAA TGA AAG ACG GTG AGC TGG TGA
TAT GGG ATA GTG TTC ACC CTT GT-3' was annealed with oligo 15 #3085: 5'-ACA CTA TCC CAT ATC ACC P,GC TCA CCG TCT TTC ATT
GCC ATA CGG-3'; oligo #3080: 5 ' -TAC ACC GTT TTC CAT GAG
CAA ACT GAA ACG TTT TCA TCG CTC TGG G-3' was annealed with oligo #3078: 5~-AAT TCC CAG AGC GAT GAA AAC GTT TCA GTT
TGC TCA TGG AAA ACG GTG TAA CAA GG& TGA-3').
5. Expression From SP-C Vectors.
Each SP-C expression vectox was used to transform E. coli strain W3110 to ampicillin resistance.
Rapidly growing cultures of expression strains were induced as described above. By 1 hr after induction, refractile cytoplasmic inclusion bodies were seen by phase contrast microscopy inside the still-growing cells. 5 hr after induction, the equivalent of 1 O.DL550 of cell~ were pelleted by centrifugation, then boiled for 5 min in SDS
sample buffer for electrophoresis in a 12% SDS-polyacrylamide gel followed by staining with Coomassie Blue. The results are provided in Fig. g wherein Lane ~ =
molecular size standards, Lane B = induced W3110 cells containing pTrp233 vector control; Lane C = induced pC106SP-C; Lane D = pC149SP-C; Lane E = pC179SP-C; Lane F
= pC210SPWC. The hybrid CAT:SP-C protein produced by each . , . , ~ .
i ~J~16~
vector is estimated to comprise 15-20~ of the total cell protein in the induced cultures.
v. Improved CAT Vectors for Expression of ~ybrid Proteins in Escherichia Coli.
In the following examples, the basic CAT gene fusion vector has been improved in several ways: (1) unique cloning sites are created for insertion of the gene to be expressed, (2) the CAT gene is modified to optimize cleavage andtor purification of the peptides, and (3) the gene conferring resistance to tetracycline is restored to provide an alternatlve method for plasmid selection and maintenance.
A. Expression Vectors ~CAT73 and PCAT210.
Expression vector pCAT73 contains genes confer-ring resistance to both ampicillin and tetracycline, unique EcoRI and HindIII cloning sites for insertion of genes to be expressed, and the amino terminal fragment (1-73) of the CAT gene. The cleavage site, included with the inserted ~ene, may not be unique. This plasmid is constructed from plasmids pBR322, pTrp233, pCAT21, and oli~odeoxyribonucleotides. Expression vector pCAT210 dif-fers from pCAT73 in that it contains the larger amino terminal fragment (1-210) of the CAT gene from which the EcoRI site at the sequence encoding residues 72 and 73 (Glu-Phe) ha-c been removed. ~An alternative codon choice preserves the Glu and permits the use of unique EcoRI and HindIII cloning sites.) Other DNA fragments encoding the amino terminus of the CAT gene, smaller than 73 amino acids or between 73 and 210 amino acids may also be constructed by insertion of an EcoRI site at the desired fusion point.
132~6~
l. Construction of pCAT73.
Restoration of the gene for tetracycline resist-ance requires restoring the BamHI-HindIII-EcoRI fragment of pBR322 to the CAT expression vector. Since the unique cloning sites desired for this vector are EcoRI and HindIII, this must be done in a manner which removes these sites but retains resistance to tetracycline. Since insertion of DNA at the HindIlI site upstream of the cod-ing region often prevents gene expression, this site i5 removed by creating a point mutation at the HindIII site.
Plasmid pBR322, was digested with EcoRI and HindIII and the vector backbone gel purified. The backbone was ligated with synthetic EcoRI-HindII fragments, which are formed by annealing pairs of oligonucleotides using T4 DNA
lS ligase. The fragments contain the normal EcoRI-HindIII
sequence with the exception of point mutations (G or C) at the first adenine of the recognition sequence 5'-AAGCTT-3'. An intermediate plasmid was isolated from ampicillin-resistant and tetracycline-resistant E. coli MC1061 transformants whose plasmid mini-prep DNA was not digested by HindIII.
A BamHI-EcoRI fragment no longer containing a HindIII site was purified from agarose gel electrophoresis from a BamHI and EcoRI digest of plasmid pTetHl. The fragment was ligated using T4 DNA ligase with plasmid pTrp233 which was also digested with ~amHI and EcoRI and agarose gel purified. Transformed wi~h the ligation, colonies of E. coli MC1061 were selected for ampicillin and/or tetracycline resistance. Plasmid pTrpT233 was resistant to both antibiotics.
In an alternate embodiment, digestion of pTrpT233 with EcoRI, blunting of the termini with DNA
polymerase I, Klenow fragment, and ligation with T4 DNA
ligase will eliminate the EcoRI site (which does not affect resistance to tetracycline). Tetracycline-resistant plasmid pTrpT234 which has lost undesirable ~3201 6~
HlndIII and EcoRI sites is isolated from colonies of E.
coli MC1061 transformed with this ligation.
The CAT gene is obtained as an NdeI-HindIII
fragment purified by agarose gel electrophoresis of an NdeI-HindIII digest of pCAT21. Plasmid pTrpdeltaHind was digested wikh NdeI and HlndIII, purified by agarose gel electrophoresis, and ligated with the CAT gene using T4 DNA ligase. From ampicillin (or tetracycline) resistant transformants of E. coli MC10Sl digested with EcoRI and HindIII to verify incorporation of the CAT gene, plasmid pCAT73 (Fig. 5A) is isolated.
2. Construction of PCAT210.
The BamHI-HindIII fragment containing the trp promoter-operator, ribosome binding site, and CAT gene is purified by agarose gel electrophoresis from a BamHI and HindIII digest of plasmid pCAT21. Site specific mutagenesis is carried out on the fragment using M13 and mutagenic oligodeoxyribonucleotides to convert the GAA
codon for Glu to GAG (also to Glu) within the EcoRI site, 5'-GAATTC-3'. One such plasmid, M13-CATdR, is digested with ScaI to linearize the vector and ligated with an EcoRI linker (for the same reading frame as in pCAT73) using T4 DNA ligase. From the transfectants, M13-CATR1, is isolated and digested with NdeI and HindIII. The new CAT gene is purified by agarose gel electrophoresis and ligated using T4 DNA ligase with NdeI~HindIII-digested plasmid pTrpT234. Plasmid pCAT210 (Fig. SB) is isolated from ampicillin (or tetracycline) resistant transformants f E coli MC1061.
B. Expression Vectors pCAT73-T and ~CAT73-M.
Expression ~ectors pCAT73-T and pCAT73-M are examples in which the amino acid sequence of CAT has been altered using site specific mutagenesis techniques to facilitate purification of the product protein. In these , .
:~ 3 2 ~
cases, the Trp residue at position 16 may be substituted with Tyr and the Met residue at position 67 may be substituted by Ile or Leu to eliminate potential chemical cleavage sites within CAT. In addition, the Cys at posi-tion 31 may also be subs~ituted using a conservative aminoacid alteration, that is, substitution with an amino acid which does not adversely affect biological activity.
Preferred residues include alanine, serine, leucine, isoleucine and valine, most preferred is serine. These la~ter alterations are intended to reduce multimerization through disulfide bridges.
C. Expression of Modified CAT-GLP-1 Plasmid pTrpdeltaHind contains the restored TetR
gene from pTrp233 (although the HindIII site has been eliminated), the Trpl6 to Tyr, Cys3l to Ser, and Met67 to Leu substltutions in the CAT gene sequence, and the GLP 1 gene (taught in Example III) fused in-frame to the modified CAT gene -through a methione residue. The vector was used to transform several E. coli strains including ~3110, MC1061, DHl, MM294 and RR1.
E. coli RRl transformants were more stable and appeared to have better induction/repression control of the Trp promoter than any of the other hosts. An alternative construction for this vector includes reversing the TetR gene (to avoid the back-to-b~ck placement of the Tet~ and Trp promoters in the present construct) to alleviate the stability problems observed using bacterial hosts other than RRI transformants.
VI. Construction of pTrpCAT72:Adipsin/D.
The coding sequence for mature human adipsin/D
was fused to pCAT72 to produce a fusion protein suitable, for example, to generate antisera agains~ human adipsin/D.
,, ,~
:L~2~162 A. Construction of pTrpCAT7? Q3Sl Plasmid pCAT72 Q3Sl was cons~ructed to eliminate Asn residues at which secondary cleavages can occur during hydroxylamine release of peptides fused to CAT. The A~n residues at amino acid positions 26, 51 and 78 of CAT were changed to Gln residues. At the same time, the single Cys at position 31 was changed to Ser to decrease the amount of aggregation seen with many CAT fusion proteins.
The vector pCAT72 Q3Sl was constructed as fol-lows: Oligos CAT72-1 through 6 (below) were annealed and ligated into pUC-9 which had been cleaved with NdeI ~nd EcoRI. In this way, the mutated CAT72 was joined to the polylinker region of the pUC plasmid. CAT72 Q3Sl with the polylinker was then removed from pUC by cleavage with NdeI
and HindIII, and inserted into pTrp233 b~tween NdeI and HindIII to yield pTrpCAT72 Q3Sl.
TATGGAGAAA AAAATCACTG GATATACCAC CGTTGATATA TCCCAATGGC
~60 70 ATCGTAAAGA ACATTTTGAG GCATTTCA
CAAAATGTTC TTTACGATGC CATTGGGATA TATCAACGGT GGTATATCCA
TGATTTTTT TCTCCA
TCAGTTGCT CAATCTACCT ATCAGCAGAC CGTTCAGCTG GATATTACGG
CCTTTTTAAA GACCGTAAAG AAACAGAAGC
CTTTACGGTC TTTAAAAAGG CCGTAATATC CAGCTGAACG GTCTGCTGAT
AGGTAGATTG AGCAACTGAC TGAAATGCCT
1320~
ACAAGTTTTA TCCGGCCTTT ATTCACATTC TTGCCCGCCT GATGCAGGCT
CATCCGG
AATTCCGGAT GAGCCTGCAT CAGGCGGGCA AGAATGTGAA TAAAGGCCGG
ATAAAACTTG TGCTTCTGTT T
B. Construction of pTrpCAT72 Q6S3 Starting with pCAT72 Q3Sl, pCAT153 Q6S3 was constructed to change the Asn residues at positions 130, 141 and 148 of CAT to Gln residues, and to change the Cys residues at 91 and 126 to Ser residues.
Pla~mid CAT72 Q3Sl in pUC-9 was clea~ed with EcoRI. Oligos CAT153-1 through 6 (below) were annealed and ligated into pCAT72 to ~ive pCAT153 Q6S3. The modified pCAT153 was then removed from pUC by cleavage with NdéI and HindIII, and the resulting fragment inserted into pTrp233 to give pTrpCAT153 Q6S3.
AATTTCGTAT GGCAATGAAA GACGGTGAGC TGGTGATATG GGATAGTGTT
CACCCTTCTT ACACCGTTTT CCATGAGCAA
~ 10 20 30 40 50 AAA~CGGTGT AAGAAGGGTG AACACTATCC CATATCACCA GCTCACCGTC
TTTCATTGCC ATACGA
~ 10 20 30 40 50 ACTGAAACGT TTTCATCGCT CTGGAGTGAA TACCACG~CG ATTTCCGGCA
~ ~0 70 80 GTTTCTACAC ATATATTCGC AAGATGTGGC
~37~ 132~62 AGCGATGAAA ACGTTTCAGT TTGCTCATG~
-10 ~0 30 40 50 GTCTTACGGT GAACAGCTGG CCTATTTCCC TAAAGGGTTT ATTGAGCAGA
TGTTTTTCG~ CTCAGCCCAG CCCG
CAT153~6 - ~ 10 20 30 4~ 5) AATTCGGGCT GGGCTGAGAC ~A~AAACATC TGCTC~AT~ ACCCTTTA~G
7~ 80 GAAATA~GCC AGCTGTTCAC CGTAAGACGC CACATCTT
~, ~
Next, the human adip~in/D.cDNA hg31-40 (Figure 10) wa~ constructed. The BamHI-~yI fragment containing th~ mature coding region was gel puri~iod and in3erted into pUC-9 which had been cleaved with BamHI and HlndIII.
The StyI end of the cDNA ~a~ ~oined to the HindIII end of pUC using two oligos (#3886 S'-CATGGGTGCCGGGGCCTGA-3~ and #3887 5'-AGCTTCAGGCCCCGGCACC-3'). 8y inserting the BamHI-~yI fragment of adipsin/D into pUC in this way, the cod-ing sequence of adipsin/D wa~ placed in fra~e with the EcoRI site of pVC~9. 1~he EcoRI-HindIII fragment of thi~
construct w~s removed from pUC-9 and in~ertsd into pTrpCAT72 between the EcoRI ~ite and the HindIII sites to yield pTrpCRT72:Adipsin/D.
This con~truct gave 10-15~ lavel~ of fusion protein upon induction in ~3110 c~llY.
~odifications of the above de~cribed mode~ for carrying out the invention that are obvious to thoso of skill in the ar~ of molecular biol~gy, protein chemi~try, cell biology, or rela~ed field~ are in~ended to be within the scope of the following claim~.
~1
Each SP-C expression vectox was used to transform E. coli strain W3110 to ampicillin resistance.
Rapidly growing cultures of expression strains were induced as described above. By 1 hr after induction, refractile cytoplasmic inclusion bodies were seen by phase contrast microscopy inside the still-growing cells. 5 hr after induction, the equivalent of 1 O.DL550 of cell~ were pelleted by centrifugation, then boiled for 5 min in SDS
sample buffer for electrophoresis in a 12% SDS-polyacrylamide gel followed by staining with Coomassie Blue. The results are provided in Fig. g wherein Lane ~ =
molecular size standards, Lane B = induced W3110 cells containing pTrp233 vector control; Lane C = induced pC106SP-C; Lane D = pC149SP-C; Lane E = pC179SP-C; Lane F
= pC210SPWC. The hybrid CAT:SP-C protein produced by each . , . , ~ .
i ~J~16~
vector is estimated to comprise 15-20~ of the total cell protein in the induced cultures.
v. Improved CAT Vectors for Expression of ~ybrid Proteins in Escherichia Coli.
In the following examples, the basic CAT gene fusion vector has been improved in several ways: (1) unique cloning sites are created for insertion of the gene to be expressed, (2) the CAT gene is modified to optimize cleavage andtor purification of the peptides, and (3) the gene conferring resistance to tetracycline is restored to provide an alternatlve method for plasmid selection and maintenance.
A. Expression Vectors ~CAT73 and PCAT210.
Expression vector pCAT73 contains genes confer-ring resistance to both ampicillin and tetracycline, unique EcoRI and HindIII cloning sites for insertion of genes to be expressed, and the amino terminal fragment (1-73) of the CAT gene. The cleavage site, included with the inserted ~ene, may not be unique. This plasmid is constructed from plasmids pBR322, pTrp233, pCAT21, and oli~odeoxyribonucleotides. Expression vector pCAT210 dif-fers from pCAT73 in that it contains the larger amino terminal fragment (1-210) of the CAT gene from which the EcoRI site at the sequence encoding residues 72 and 73 (Glu-Phe) ha-c been removed. ~An alternative codon choice preserves the Glu and permits the use of unique EcoRI and HindIII cloning sites.) Other DNA fragments encoding the amino terminus of the CAT gene, smaller than 73 amino acids or between 73 and 210 amino acids may also be constructed by insertion of an EcoRI site at the desired fusion point.
132~6~
l. Construction of pCAT73.
Restoration of the gene for tetracycline resist-ance requires restoring the BamHI-HindIII-EcoRI fragment of pBR322 to the CAT expression vector. Since the unique cloning sites desired for this vector are EcoRI and HindIII, this must be done in a manner which removes these sites but retains resistance to tetracycline. Since insertion of DNA at the HindIlI site upstream of the cod-ing region often prevents gene expression, this site i5 removed by creating a point mutation at the HindIII site.
Plasmid pBR322, was digested with EcoRI and HindIII and the vector backbone gel purified. The backbone was ligated with synthetic EcoRI-HindII fragments, which are formed by annealing pairs of oligonucleotides using T4 DNA
lS ligase. The fragments contain the normal EcoRI-HindIII
sequence with the exception of point mutations (G or C) at the first adenine of the recognition sequence 5'-AAGCTT-3'. An intermediate plasmid was isolated from ampicillin-resistant and tetracycline-resistant E. coli MC1061 transformants whose plasmid mini-prep DNA was not digested by HindIII.
A BamHI-EcoRI fragment no longer containing a HindIII site was purified from agarose gel electrophoresis from a BamHI and EcoRI digest of plasmid pTetHl. The fragment was ligated using T4 DNA ligase with plasmid pTrp233 which was also digested with ~amHI and EcoRI and agarose gel purified. Transformed wi~h the ligation, colonies of E. coli MC1061 were selected for ampicillin and/or tetracycline resistance. Plasmid pTrpT233 was resistant to both antibiotics.
In an alternate embodiment, digestion of pTrpT233 with EcoRI, blunting of the termini with DNA
polymerase I, Klenow fragment, and ligation with T4 DNA
ligase will eliminate the EcoRI site (which does not affect resistance to tetracycline). Tetracycline-resistant plasmid pTrpT234 which has lost undesirable ~3201 6~
HlndIII and EcoRI sites is isolated from colonies of E.
coli MC1061 transformed with this ligation.
The CAT gene is obtained as an NdeI-HindIII
fragment purified by agarose gel electrophoresis of an NdeI-HindIII digest of pCAT21. Plasmid pTrpdeltaHind was digested wikh NdeI and HlndIII, purified by agarose gel electrophoresis, and ligated with the CAT gene using T4 DNA ligase. From ampicillin (or tetracycline) resistant transformants of E. coli MC10Sl digested with EcoRI and HindIII to verify incorporation of the CAT gene, plasmid pCAT73 (Fig. 5A) is isolated.
2. Construction of PCAT210.
The BamHI-HindIII fragment containing the trp promoter-operator, ribosome binding site, and CAT gene is purified by agarose gel electrophoresis from a BamHI and HindIII digest of plasmid pCAT21. Site specific mutagenesis is carried out on the fragment using M13 and mutagenic oligodeoxyribonucleotides to convert the GAA
codon for Glu to GAG (also to Glu) within the EcoRI site, 5'-GAATTC-3'. One such plasmid, M13-CATdR, is digested with ScaI to linearize the vector and ligated with an EcoRI linker (for the same reading frame as in pCAT73) using T4 DNA ligase. From the transfectants, M13-CATR1, is isolated and digested with NdeI and HindIII. The new CAT gene is purified by agarose gel electrophoresis and ligated using T4 DNA ligase with NdeI~HindIII-digested plasmid pTrpT234. Plasmid pCAT210 (Fig. SB) is isolated from ampicillin (or tetracycline) resistant transformants f E coli MC1061.
B. Expression Vectors pCAT73-T and ~CAT73-M.
Expression ~ectors pCAT73-T and pCAT73-M are examples in which the amino acid sequence of CAT has been altered using site specific mutagenesis techniques to facilitate purification of the product protein. In these , .
:~ 3 2 ~
cases, the Trp residue at position 16 may be substituted with Tyr and the Met residue at position 67 may be substituted by Ile or Leu to eliminate potential chemical cleavage sites within CAT. In addition, the Cys at posi-tion 31 may also be subs~ituted using a conservative aminoacid alteration, that is, substitution with an amino acid which does not adversely affect biological activity.
Preferred residues include alanine, serine, leucine, isoleucine and valine, most preferred is serine. These la~ter alterations are intended to reduce multimerization through disulfide bridges.
C. Expression of Modified CAT-GLP-1 Plasmid pTrpdeltaHind contains the restored TetR
gene from pTrp233 (although the HindIII site has been eliminated), the Trpl6 to Tyr, Cys3l to Ser, and Met67 to Leu substltutions in the CAT gene sequence, and the GLP 1 gene (taught in Example III) fused in-frame to the modified CAT gene -through a methione residue. The vector was used to transform several E. coli strains including ~3110, MC1061, DHl, MM294 and RR1.
E. coli RRl transformants were more stable and appeared to have better induction/repression control of the Trp promoter than any of the other hosts. An alternative construction for this vector includes reversing the TetR gene (to avoid the back-to-b~ck placement of the Tet~ and Trp promoters in the present construct) to alleviate the stability problems observed using bacterial hosts other than RRI transformants.
VI. Construction of pTrpCAT72:Adipsin/D.
The coding sequence for mature human adipsin/D
was fused to pCAT72 to produce a fusion protein suitable, for example, to generate antisera agains~ human adipsin/D.
,, ,~
:L~2~162 A. Construction of pTrpCAT7? Q3Sl Plasmid pCAT72 Q3Sl was cons~ructed to eliminate Asn residues at which secondary cleavages can occur during hydroxylamine release of peptides fused to CAT. The A~n residues at amino acid positions 26, 51 and 78 of CAT were changed to Gln residues. At the same time, the single Cys at position 31 was changed to Ser to decrease the amount of aggregation seen with many CAT fusion proteins.
The vector pCAT72 Q3Sl was constructed as fol-lows: Oligos CAT72-1 through 6 (below) were annealed and ligated into pUC-9 which had been cleaved with NdeI ~nd EcoRI. In this way, the mutated CAT72 was joined to the polylinker region of the pUC plasmid. CAT72 Q3Sl with the polylinker was then removed from pUC by cleavage with NdeI
and HindIII, and inserted into pTrp233 b~tween NdeI and HindIII to yield pTrpCAT72 Q3Sl.
TATGGAGAAA AAAATCACTG GATATACCAC CGTTGATATA TCCCAATGGC
~60 70 ATCGTAAAGA ACATTTTGAG GCATTTCA
CAAAATGTTC TTTACGATGC CATTGGGATA TATCAACGGT GGTATATCCA
TGATTTTTT TCTCCA
TCAGTTGCT CAATCTACCT ATCAGCAGAC CGTTCAGCTG GATATTACGG
CCTTTTTAAA GACCGTAAAG AAACAGAAGC
CTTTACGGTC TTTAAAAAGG CCGTAATATC CAGCTGAACG GTCTGCTGAT
AGGTAGATTG AGCAACTGAC TGAAATGCCT
1320~
ACAAGTTTTA TCCGGCCTTT ATTCACATTC TTGCCCGCCT GATGCAGGCT
CATCCGG
AATTCCGGAT GAGCCTGCAT CAGGCGGGCA AGAATGTGAA TAAAGGCCGG
ATAAAACTTG TGCTTCTGTT T
B. Construction of pTrpCAT72 Q6S3 Starting with pCAT72 Q3Sl, pCAT153 Q6S3 was constructed to change the Asn residues at positions 130, 141 and 148 of CAT to Gln residues, and to change the Cys residues at 91 and 126 to Ser residues.
Pla~mid CAT72 Q3Sl in pUC-9 was clea~ed with EcoRI. Oligos CAT153-1 through 6 (below) were annealed and ligated into pCAT72 to ~ive pCAT153 Q6S3. The modified pCAT153 was then removed from pUC by cleavage with NdéI and HindIII, and the resulting fragment inserted into pTrp233 to give pTrpCAT153 Q6S3.
AATTTCGTAT GGCAATGAAA GACGGTGAGC TGGTGATATG GGATAGTGTT
CACCCTTCTT ACACCGTTTT CCATGAGCAA
~ 10 20 30 40 50 AAA~CGGTGT AAGAAGGGTG AACACTATCC CATATCACCA GCTCACCGTC
TTTCATTGCC ATACGA
~ 10 20 30 40 50 ACTGAAACGT TTTCATCGCT CTGGAGTGAA TACCACG~CG ATTTCCGGCA
~ ~0 70 80 GTTTCTACAC ATATATTCGC AAGATGTGGC
~37~ 132~62 AGCGATGAAA ACGTTTCAGT TTGCTCATG~
-10 ~0 30 40 50 GTCTTACGGT GAACAGCTGG CCTATTTCCC TAAAGGGTTT ATTGAGCAGA
TGTTTTTCG~ CTCAGCCCAG CCCG
CAT153~6 - ~ 10 20 30 4~ 5) AATTCGGGCT GGGCTGAGAC ~A~AAACATC TGCTC~AT~ ACCCTTTA~G
7~ 80 GAAATA~GCC AGCTGTTCAC CGTAAGACGC CACATCTT
~, ~
Next, the human adip~in/D.cDNA hg31-40 (Figure 10) wa~ constructed. The BamHI-~yI fragment containing th~ mature coding region was gel puri~iod and in3erted into pUC-9 which had been cleaved with BamHI and HlndIII.
The StyI end of the cDNA ~a~ ~oined to the HindIII end of pUC using two oligos (#3886 S'-CATGGGTGCCGGGGCCTGA-3~ and #3887 5'-AGCTTCAGGCCCCGGCACC-3'). 8y inserting the BamHI-~yI fragment of adipsin/D into pUC in this way, the cod-ing sequence of adipsin/D wa~ placed in fra~e with the EcoRI site of pVC~9. 1~he EcoRI-HindIII fragment of thi~
construct w~s removed from pUC-9 and in~ertsd into pTrpCAT72 between the EcoRI ~ite and the HindIII sites to yield pTrpCRT72:Adipsin/D.
This con~truct gave 10-15~ lavel~ of fusion protein upon induction in ~3110 c~llY.
~odifications of the above de~cribed mode~ for carrying out the invention that are obvious to thoso of skill in the ar~ of molecular biol~gy, protein chemi~try, cell biology, or rela~ed field~ are in~ended to be within the scope of the following claim~.
~1
Claims (15)
1. A method of stabilizing heterologous protein expression in a prokaryotic host comprising:
(a) constructing a hybrid gene comprising in sequential order, a 3' truncated chloramphenicol acetyltransferase (CAT) gene sequence fused in frame with a heterologous gene sequence encoding a mammalian polypeptide selected from the group consisting of amyloid protein A4-751 insert sequence, glucagon-like peptide 1, adipsin/D, lung surfactant protein SP-B and lung surfactant protein SP-C, wherein said polypeptide is normally not recoverable in bacterial expression systems, and wherein said hybrid gene, upon translation, produces a fusion protein in a recoverable yield;
(b) providing a vector for expression of said hybrid gene;
(c) culturing the prokaryotic host transformed with the expression vector; and (d) recovering the fusion protein.
(a) constructing a hybrid gene comprising in sequential order, a 3' truncated chloramphenicol acetyltransferase (CAT) gene sequence fused in frame with a heterologous gene sequence encoding a mammalian polypeptide selected from the group consisting of amyloid protein A4-751 insert sequence, glucagon-like peptide 1, adipsin/D, lung surfactant protein SP-B and lung surfactant protein SP-C, wherein said polypeptide is normally not recoverable in bacterial expression systems, and wherein said hybrid gene, upon translation, produces a fusion protein in a recoverable yield;
(b) providing a vector for expression of said hybrid gene;
(c) culturing the prokaryotic host transformed with the expression vector; and (d) recovering the fusion protein.
2. The method of claim 1 wherein said prokaryotic host is a bacterial cell.
3. The method of claim 2 wherein said bacterial cell is E. coli.
4. The method of claim 1 wherein said 3' truncated CAT gene sequence enhances the level of heterologous protein present in the total cellular protein.
5. The method of claim 1 wherein the length of the truncated CAT gene sequence encodes a CAT peptide of about 73 to about 210 amino acids.
6. The method of claims 1 or 5 wherein said hybrid gene further comprises a DNA sequence encoding a selective cleavage site located between the CAT gene sequence and the heterologous gene sequence.
7. The method of claim 6 wherein said selective cleavage site is composed of tryptophan, methionine, asparagine-glycine, or glutamic acid.
8. A method of stabilizing heterologous protein expression in a prokaryotic host comprising:
(a) constructing a hybrid gene comprising in sequential order, a 3' truncated chloramphenicol acetyltransferase (CAT) gene sequence encoding a CAT
peptide of about 73 to about 180 amino acids, fused in-frame with a heterologous gene sequence encoding a mammalian polypeptide selected from the group consisting of amyloid protein A4-751 insert sequence, glucagon-like peptide I, adipsin/D, lung surfactant protein SP-B and lung surfactant protein SP-C, wherein said heterologous protein is normally not recoverable in bacterial expression systems, and wherein said hybrid gene, upon translation, produces a fusion protein in a recoverable yield;
(b) providing a vector for expression of said hybrid gene;
(c) culturing the prokaryotic host transformed with the expression vector; and (d) recovering the fusion protein.
(a) constructing a hybrid gene comprising in sequential order, a 3' truncated chloramphenicol acetyltransferase (CAT) gene sequence encoding a CAT
peptide of about 73 to about 180 amino acids, fused in-frame with a heterologous gene sequence encoding a mammalian polypeptide selected from the group consisting of amyloid protein A4-751 insert sequence, glucagon-like peptide I, adipsin/D, lung surfactant protein SP-B and lung surfactant protein SP-C, wherein said heterologous protein is normally not recoverable in bacterial expression systems, and wherein said hybrid gene, upon translation, produces a fusion protein in a recoverable yield;
(b) providing a vector for expression of said hybrid gene;
(c) culturing the prokaryotic host transformed with the expression vector; and (d) recovering the fusion protein.
9. The method of claim 8 wherein said hybrid gene further comprises a DNA sequence encoding a selective cleavage site located between the CAT gene sequence and the heterologous gene sequence.
10. A bacterial expression vector capable of enhancing the level of expression of non-stable, bacteri-ally produced heterologous polypeptides comprising:
a hybrid gene having in sequential order, a 3' truncated CAT gene sequence linked to a heterologous gene sequence encoding a mammalian polypeptide selected from the group consisting of amyloid protein A4-751 insert sequence, glucagon-like peptide I, adipsin/D, lung surfactant protein SP-B and lung surfactant protein SP-C, wherein said polypeptide is normally not recoverable in bacterial expression systems, whereby said truncated CAT
gene sequence is capable of rendering the resulting fusion protein resistant to proteolytic degradation.
a hybrid gene having in sequential order, a 3' truncated CAT gene sequence linked to a heterologous gene sequence encoding a mammalian polypeptide selected from the group consisting of amyloid protein A4-751 insert sequence, glucagon-like peptide I, adipsin/D, lung surfactant protein SP-B and lung surfactant protein SP-C, wherein said polypeptide is normally not recoverable in bacterial expression systems, whereby said truncated CAT
gene sequence is capable of rendering the resulting fusion protein resistant to proteolytic degradation.
11. The expression vector of claim 10 wherein the length of the truncated CAT gene sequence encodes a CAT
peptide of about 73 to about 210 amino acids.
peptide of about 73 to about 210 amino acids.
12. The bacterial expression vector of claims 10 or 11 wherein said hybrid gene further comprises a DNA
sequence encoding a selective cleavage site located between the CAT gene sequence and the heterologous gene sequence.
sequence encoding a selective cleavage site located between the CAT gene sequence and the heterologous gene sequence.
13. The vector of claim 12 wherein the hybrid gene having said 3' truncated CAT gene sequence, upon expression, enhances the level of the heterologous protein present in the total cellular protein.
14. In a bacterial expression vector capable of enhancing the level of expression of non-stable, bacteri-ally produced heterologous polypeptides wherein the vector comprises a hybrid gene having in sequential order, a 3' truncated CAT gene sequence linked to a heterologous gene sequence encoding a polypeptide normally not recoverable in bacterial expression systems, said truncated CAT gene sequence being capable of rendering the resulting fusion protein resistant to proteolytic degradation, the improvement comprising altering one or more DNA codons of the truncated CAT gene to eliminate potential chemical cleavage sites within the CAT protein.
15. The improved bacterial expression vector of claim 14 wherein the alterations include substituting the DNA encoding a) methionine at position 67 of CAT with DNA
encoding isoleucine or leucine; (b) cysteine at position 31 of CAT with DNA encoding serine; or (c) tryptophan at position 16 of CAT with DNA encoding tyrosine.
encoding isoleucine or leucine; (b) cysteine at position 31 of CAT with DNA encoding serine; or (c) tryptophan at position 16 of CAT with DNA encoding tyrosine.
Applications Claiming Priority (2)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| US23122488A | 1988-08-11 | 1988-08-11 | |
| US231,224 | 1988-08-11 |
Publications (1)
| Publication Number | Publication Date |
|---|---|
| CA1320162C true CA1320162C (en) | 1993-07-13 |
Family
ID=22868276
Family Applications (1)
| Application Number | Title | Priority Date | Filing Date |
|---|---|---|---|
| CA000607908A Expired - Fee Related CA1320162C (en) | 1988-08-11 | 1989-08-09 | Method for stabilizing heterologous protein expression and vectors for use therein |
Country Status (6)
| Country | Link |
|---|---|
| EP (1) | EP0428615A4 (en) |
| JP (1) | JPH04501204A (en) |
| AU (1) | AU4216689A (en) |
| CA (1) | CA1320162C (en) |
| IL (1) | IL91272A0 (en) |
| WO (1) | WO1990001540A1 (en) |
Families Citing this family (13)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| FR2686899B1 (en) | 1992-01-31 | 1995-09-01 | Rhone Poulenc Rorer Sa | NOVEL BIOLOGICALLY ACTIVE POLYPEPTIDES, THEIR PREPARATION AND PHARMACEUTICAL COMPOSITIONS CONTAINING THEM. |
| TW327194B (en) * | 1992-05-01 | 1998-02-21 | American Cyanamid Co | Novel amyloid precursor proteins and methods of using same |
| NL9201173A (en) * | 1992-07-01 | 1994-02-01 | Leuven K U Res & Dev | Vector for positive selection of recombinants |
| DK144093D0 (en) * | 1993-12-23 | 1993-12-23 | Novo Nordisk As | |
| US6031075A (en) * | 1996-02-05 | 2000-02-29 | Children's Hospital Medical Center | Mature alveolar SP-B and a process for producing the same |
| CN1242062C (en) * | 1998-07-10 | 2006-02-15 | 赛欧斯公司 | Method for producing a peptide with a pi above 8 or below 5 |
| EP2206720A1 (en) | 2000-04-12 | 2010-07-14 | Human Genome Sciences, Inc. | Albumin fusion proteins |
| AT410444B (en) * | 2001-03-02 | 2003-04-25 | Oesterr Forsch Seibersdorf | METHOD FOR DETECTING NUCLEIC ACID MOLECULES |
| EP2261250B1 (en) | 2001-12-21 | 2015-07-01 | Human Genome Sciences, Inc. | GCSF-Albumin fusion proteins |
| CA2513213C (en) | 2003-01-22 | 2013-07-30 | Human Genome Sciences, Inc. | Albumin fusion proteins |
| ES2567634T3 (en) | 2004-02-09 | 2016-04-25 | Human Genome Sciences, Inc. | Albumin Fusion Proteins |
| WO2010068754A2 (en) | 2008-12-10 | 2010-06-17 | Paka Pulmonary Pharmaceuticals, Inc. | Methods and compositions for delivery of medicaments to the lungs |
| CN117343200B (en) * | 2023-12-04 | 2024-01-30 | 北京质肽生物医药科技有限公司 | Fusion protein containing amyloid core polypeptide, preparation method and application thereof |
Family Cites Families (8)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| GB8314362D0 (en) * | 1983-05-24 | 1983-06-29 | Celltech Ltd | Polypeptide and protein products |
| US4933280A (en) * | 1984-12-11 | 1990-06-12 | California Biotechnology Inc. | Recombinant DNA sequence encoding Alveolar Surfactant Protein |
| DK175380B1 (en) * | 1985-04-22 | 2004-09-20 | Genentech Inc | Mutant human tissue plasminogen activator, DNA sequence encoding it, replicable expression vector to express this DNA sequence, microorganism transformed with this vector, cell cultures transformed with this ... |
| DE3523701A1 (en) * | 1985-07-03 | 1987-01-08 | Bayer Ag | PROCESS FOR PRODUCING PROTEINS AND POLYPEPTIDES |
| GB8518753D0 (en) * | 1985-07-24 | 1985-08-29 | Glaxo Group Ltd | Microbiological products |
| EP0290516A4 (en) * | 1986-10-24 | 1989-11-14 | Jeffrey A Whitsett | Pulmonary hydrophobic surfactant-associated proteins. |
| JPH0779702B2 (en) * | 1986-11-17 | 1995-08-30 | サイオス ノバ インコーポレイテッド | Recombinant Alzheimer amyloid protein |
| EP0383837B1 (en) * | 1987-11-04 | 1998-02-04 | Byk Gulden Lomberg Chemische Fabrik GmbH | Alveolar surfactant proteins |
-
1989
- 1989-08-09 CA CA000607908A patent/CA1320162C/en not_active Expired - Fee Related
- 1989-08-09 AU AU42166/89A patent/AU4216689A/en not_active Abandoned
- 1989-08-09 JP JP1509538A patent/JPH04501204A/en active Pending
- 1989-08-09 WO PCT/US1989/003417 patent/WO1990001540A1/en not_active Ceased
- 1989-08-09 EP EP19890910335 patent/EP0428615A4/en not_active Withdrawn
- 1989-08-10 IL IL91272A patent/IL91272A0/en unknown
Also Published As
| Publication number | Publication date |
|---|---|
| JPH04501204A (en) | 1992-03-05 |
| EP0428615A4 (en) | 1991-11-13 |
| WO1990001540A1 (en) | 1990-02-22 |
| IL91272A0 (en) | 1990-03-19 |
| EP0428615A1 (en) | 1991-05-29 |
| AU4216689A (en) | 1990-03-05 |
Similar Documents
| Publication | Publication Date | Title |
|---|---|---|
| US6500645B1 (en) | N-terminally extended proteins expressed in yeast | |
| CA1320162C (en) | Method for stabilizing heterologous protein expression and vectors for use therein | |
| KR100227167B1 (en) | Fusion protein containing n-terminal fragments of human serum albumin | |
| JP3369168B2 (en) | Expression of recombinant polypeptides with improved purification | |
| Kaster et al. | Analysis of a bacterial hygromycin B resistance gene by transcriptional and translational fusions and by DNA sequencing | |
| JP3730255B2 (en) | N-terminal extended protein expressed in yeast | |
| JPH05268962A (en) | Cleavage of double stranded dna at arbitrary site | |
| JP4624495B2 (en) | Production of human insulin | |
| JPH0759193B2 (en) | Cloning vector for foreign protein expression | |
| JP3305781B2 (en) | Novel DNA molecules and hosts | |
| JP2970811B2 (en) | Expression systems for preparing polypeptides in prokaryotic cells | |
| US6962796B1 (en) | Production of human parathyroid hormone from microorganisms | |
| US5420242A (en) | Production of human parathyroid hormone from microorganisms | |
| EP0946735B1 (en) | N-terminally extended proteins expressed in yeast | |
| JPH10511845A (en) | Methods for producing extracellular proteins in bacteria | |
| HU216335B (en) | Process for producing peptides | |
| PT87211B (en) | PROCESS FOR THE PREPARATION OF POLIPEPTIDEES BY GENETIC MANIPULATION | |
| EP0225860B1 (en) | A method to export gene products to the growth medium of gram negative bacteria | |
| JP4180112B2 (en) | Vector for expression of N-terminally extended protein in yeast cells | |
| US5378613A (en) | Method for increased expression of low molecular weight recombinant polypeptides | |
| JPH05500615A (en) | Hirudin and new recombinant production method of hirudin | |
| KR900001014B1 (en) | Method for preparing bovine growth hormone derivative expression vector | |
| JP2769541B2 (en) | Equilibrium-type inducible transcription system | |
| KR900001015B1 (en) | Method for properation of expression vector | |
| PT607080E (en) | NEW PRO SEQUENCE THAT ALLOWS SECRECY OF HETEROLOGICAL PROTEINS FROM A YEAST CELL |
Legal Events
| Date | Code | Title | Description |
|---|---|---|---|
| MKLA | Lapsed |