EP1421102A2 - Verfahren zur herstellung acylierter polypeptide - Google Patents
Verfahren zur herstellung acylierter polypeptideInfo
- Publication number
- EP1421102A2 EP1421102A2 EP02750841A EP02750841A EP1421102A2 EP 1421102 A2 EP1421102 A2 EP 1421102A2 EP 02750841 A EP02750841 A EP 02750841A EP 02750841 A EP02750841 A EP 02750841A EP 1421102 A2 EP1421102 A2 EP 1421102A2
- Authority
- EP
- European Patent Office
- Prior art keywords
- glu
- arg
- ala
- polypeptide
- seq
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Ceased
Links
- 108090000765 processed proteins & peptides Proteins 0.000 title claims abstract description 147
- 102000004196 processed proteins & peptides Human genes 0.000 title claims abstract description 119
- 229920001184 polypeptide Polymers 0.000 title claims abstract description 108
- 238000000034 method Methods 0.000 title claims abstract description 50
- 238000003776 cleavage reaction Methods 0.000 claims abstract description 52
- 230000007017 scission Effects 0.000 claims abstract description 52
- 239000002243 precursor Substances 0.000 claims abstract description 45
- 230000010933 acylation Effects 0.000 claims abstract description 26
- 238000005917 acylation reaction Methods 0.000 claims abstract description 26
- KDXKERNSBIXSRK-UHFFFAOYSA-N Lysine Natural products NCCCCC(N)C(O)=O KDXKERNSBIXSRK-UHFFFAOYSA-N 0.000 claims abstract description 9
- 210000004027 cell Anatomy 0.000 claims description 40
- 125000000539 amino acid group Chemical group 0.000 claims description 33
- 240000004808 Saccharomyces cerevisiae Species 0.000 claims description 28
- 235000014680 Saccharomyces cerevisiae Nutrition 0.000 claims description 27
- 101710198884 GATA-type zinc finger protein 1 Proteins 0.000 claims description 16
- DTHNMHAUYICORS-KTKZVXAJSA-N Glucagon-like peptide 1 Chemical group C([C@@H](C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C)C(=O)N[C@@H](CC=1C2=CC=CC=C2NC=1)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCCCN)C(=O)NCC(=O)N[C@@H](CCCNC(N)=N)C(N)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](C)NC(=O)[C@H](C)NC(=O)[C@H](CCC(N)=O)NC(=O)CNC(=O)[C@H](CCC(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@H](CC=1C=CC(O)=CC=1)NC(=O)[C@H](CO)NC(=O)[C@H](CO)NC(=O)[C@@H](NC(=O)[C@H](CC(O)=O)NC(=O)[C@H](CO)NC(=O)[C@@H](NC(=O)[C@H](CC=1C=CC=CC=1)NC(=O)[C@@H](NC(=O)CNC(=O)[C@H](CCC(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CC=1N=CNC=1)[C@@H](C)O)[C@@H](C)O)C(C)C)C1=CC=CC=C1 DTHNMHAUYICORS-KTKZVXAJSA-N 0.000 claims description 16
- 102100040918 Pro-glucagon Human genes 0.000 claims description 15
- 210000004899 c-terminal region Anatomy 0.000 claims description 13
- 108091033319 polynucleotide Proteins 0.000 claims description 12
- 102000040430 polynucleotide Human genes 0.000 claims description 12
- 239000002157 polynucleotide Substances 0.000 claims description 12
- 125000003588 lysine group Chemical group [H]N([H])C([H])([H])C([H])([H])C([H])([H])C([H])([H])C([H])(N([H])[H])C(*)=O 0.000 claims description 11
- OMLWNBVRVJYMBQ-YUMQZZPRSA-N Arg-Arg Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O OMLWNBVRVJYMBQ-YUMQZZPRSA-N 0.000 claims description 9
- 108010068380 arginylarginine Proteins 0.000 claims description 9
- 210000005253 yeast cell Anatomy 0.000 claims description 9
- XUJNEKJLAYXESH-REOHCLBHSA-N L-Cysteine Chemical compound SC[C@H](N)C(O)=O XUJNEKJLAYXESH-REOHCLBHSA-N 0.000 claims description 8
- TWSALRJGPBVBQU-PKQQPRCHSA-N glucagon-like peptide 2 Chemical class C([C@@H](C(=O)N[C@H](C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC=1C2=CC=CC=C2NC=1)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(O)=O)C(O)=O)[C@@H](C)CC)NC(=O)[C@H](CC(O)=O)NC(=O)[C@H](CCCNC(N)=N)NC(=O)[C@H](C)NC(=O)[C@H](C)NC(=O)[C@H](CC(C)C)NC(=O)[C@H](CC(N)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](NC(=O)[C@@H](NC(=O)[C@H](CC(N)=O)NC(=O)[C@H](CCSC)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@H](CO)NC(=O)[C@H](CC=1C=CC=CC=1)NC(=O)[C@H](CO)NC(=O)CNC(=O)[C@H](CC(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CC=1NC=NC=1)[C@@H](C)O)[C@@H](C)CC)C1=CC=CC=C1 TWSALRJGPBVBQU-PKQQPRCHSA-N 0.000 claims description 7
- 230000017854 proteolysis Effects 0.000 claims description 7
- 238000000746 purification Methods 0.000 claims description 7
- 108010027845 thymosin alpha(1) (24-28) Proteins 0.000 claims description 7
- 239000000126 substance Substances 0.000 claims description 6
- 102400000326 Glucagon-like peptide 2 Human genes 0.000 claims description 5
- 101800000221 Glucagon-like peptide 2 Proteins 0.000 claims description 5
- 206010061592 cardiac fibrillation Diseases 0.000 claims description 5
- 238000000855 fermentation Methods 0.000 claims description 5
- 230000004151 fermentation Effects 0.000 claims description 5
- 230000002600 fibrillogenic effect Effects 0.000 claims description 5
- 238000001556 precipitation Methods 0.000 claims description 5
- 239000000095 Growth Hormone-Releasing Hormone Substances 0.000 claims description 4
- 102100022831 Somatoliberin Human genes 0.000 claims description 4
- 101710142969 Somatoliberin Proteins 0.000 claims description 4
- 230000003241 endoproteolytic effect Effects 0.000 claims description 4
- 238000000926 separation method Methods 0.000 claims description 4
- 230000002255 enzymatic effect Effects 0.000 claims description 3
- OGMQXTXGLDNBSS-FXQIFTODSA-N Glu-Ala-Gln Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(N)=O)C(O)=O OGMQXTXGLDNBSS-FXQIFTODSA-N 0.000 claims description 2
- KASDBWKLWJKTLJ-GUBZILKMSA-N Glu-Glu-Met Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCSC)C(O)=O KASDBWKLWJKTLJ-GUBZILKMSA-N 0.000 claims description 2
- AAJHGGDRKHYSDH-GUBZILKMSA-N Glu-Pro-Gln Chemical compound C1C[C@H](N(C1)C(=O)[C@H](CCC(=O)O)N)C(=O)N[C@@H](CCC(=O)N)C(=O)O AAJHGGDRKHYSDH-GUBZILKMSA-N 0.000 claims description 2
- FRJIAZKQGSCKPQ-FSPLSTOPSA-N His-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@@H](N)CC1=CN=CN1 FRJIAZKQGSCKPQ-FSPLSTOPSA-N 0.000 claims description 2
- KRBMQYPTDYSENE-BQBZGAKWSA-N His-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@@H](N)CC1=CNC=N1 KRBMQYPTDYSENE-BQBZGAKWSA-N 0.000 claims description 2
- 238000012258 culturing Methods 0.000 claims description 2
- 108010040030 histidinoalanine Proteins 0.000 claims description 2
- 108010018006 histidylserine Proteins 0.000 claims description 2
- 102400000322 Glucagon-like peptide 1 Human genes 0.000 claims 4
- 229940121355 glucagon like peptide 2 (glp-2) analogues Drugs 0.000 claims 2
- KOSRFJWDECSPRO-WDSKDSINSA-N Glu-Glu Chemical group OC(=O)CC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(O)=O KOSRFJWDECSPRO-WDSKDSINSA-N 0.000 claims 1
- LYCVKHSJGDMDLM-LURJTMIESA-N His-Gly Chemical compound OC(=O)CNC(=O)[C@@H](N)CC1=CN=CN1 LYCVKHSJGDMDLM-LURJTMIESA-N 0.000 claims 1
- 108091016366 Histone-lysine N-methyltransferase EHMT1 Proteins 0.000 claims 1
- 108010036413 histidylglycine Proteins 0.000 claims 1
- 239000013598 vector Substances 0.000 abstract description 22
- 239000004472 Lysine Substances 0.000 abstract description 5
- 108091028043 Nucleic acid sequence Proteins 0.000 abstract description 5
- 108060003199 Glucagon Proteins 0.000 abstract description 4
- MASNOZXLGMXCHN-ZLPAWPGGSA-N glucagon Chemical compound C([C@@H](C(=O)N[C@H](C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC=1C2=CC=CC=C2NC=1)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O)C(C)C)NC(=O)[C@H](CC(O)=O)NC(=O)[C@H](CCC(N)=O)NC(=O)[C@H](C)NC(=O)[C@H](CCCNC(N)=N)NC(=O)[C@H](CCCNC(N)=N)NC(=O)[C@H](CO)NC(=O)[C@H](CC(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@H](CC=1C=CC(O)=CC=1)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CO)NC(=O)[C@H](CC=1C=CC(O)=CC=1)NC(=O)[C@H](CC(O)=O)NC(=O)[C@H](CO)NC(=O)[C@@H](NC(=O)[C@H](CC=1C=CC=CC=1)NC(=O)[C@@H](NC(=O)CNC(=O)[C@H](CCC(N)=O)NC(=O)[C@H](CO)NC(=O)[C@@H](N)CC=1NC=NC=1)[C@@H](C)O)[C@@H](C)O)C1=CC=CC=C1 MASNOZXLGMXCHN-ZLPAWPGGSA-N 0.000 abstract description 4
- 229960004666 glucagon Drugs 0.000 abstract description 4
- 101800004266 Glucagon-like peptide 1(7-37) Proteins 0.000 abstract description 2
- 238000000338 in vitro Methods 0.000 abstract description 2
- 102400000321 Glucagon Human genes 0.000 abstract 1
- 102400000324 Glucagon-like peptide 1(7-37) Human genes 0.000 abstract 1
- 101000668058 Infectious salmon anemia virus (isolate Atlantic salmon/Norway/810/9/99) RNA-directed RNA polymerase catalytic subunit Proteins 0.000 abstract 1
- 239000000047 product Substances 0.000 description 15
- 108020004414 DNA Proteins 0.000 description 12
- 108090000623 proteins and genes Proteins 0.000 description 12
- DCXYFEDJOCDNAF-REOHCLBHSA-N L-asparagine Chemical compound OC(=O)[C@@H](N)CC(N)=O DCXYFEDJOCDNAF-REOHCLBHSA-N 0.000 description 11
- 239000013612 plasmid Substances 0.000 description 11
- 108010076504 Protein Sorting Signals Proteins 0.000 description 9
- AVXURJPOCDRRFD-UHFFFAOYSA-N Hydroxylamine Chemical compound ON AVXURJPOCDRRFD-UHFFFAOYSA-N 0.000 description 8
- 125000000729 N-terminal amino-acid group Chemical group 0.000 description 8
- 102000035195 Peptidases Human genes 0.000 description 8
- 108091005804 Peptidases Proteins 0.000 description 8
- 239000012634 fragment Substances 0.000 description 8
- 229940024606 amino acid Drugs 0.000 description 7
- 235000001014 amino acid Nutrition 0.000 description 7
- 230000010076 replication Effects 0.000 description 7
- 108700015934 Triose-phosphate isomerases Proteins 0.000 description 6
- 150000001413 amino acids Chemical class 0.000 description 6
- 230000015572 biosynthetic process Effects 0.000 description 6
- 239000002609 medium Substances 0.000 description 6
- 102000004169 proteins and genes Human genes 0.000 description 6
- 102100033598 Triosephosphate isomerase Human genes 0.000 description 5
- 125000003275 alpha amino acid group Chemical group 0.000 description 5
- 239000000203 mixture Substances 0.000 description 5
- 230000008569 process Effects 0.000 description 5
- 238000012545 processing Methods 0.000 description 5
- 235000018102 proteins Nutrition 0.000 description 5
- 102000004190 Enzymes Human genes 0.000 description 4
- 108090000790 Enzymes Proteins 0.000 description 4
- 241000588724 Escherichia coli Species 0.000 description 4
- 108091034117 Oligonucleotide Proteins 0.000 description 4
- 239000004365 Protease Substances 0.000 description 4
- 229940088598 enzyme Drugs 0.000 description 4
- 238000004128 high performance liquid chromatography Methods 0.000 description 4
- NOESYZHRGYRDHS-UHFFFAOYSA-N insulin Chemical compound N1C(=O)C(NC(=O)C(CCC(N)=O)NC(=O)C(CCC(O)=O)NC(=O)C(C(C)C)NC(=O)C(NC(=O)CN)C(C)CC)CSSCC(C(NC(CO)C(=O)NC(CC(C)C)C(=O)NC(CC=2C=CC(O)=CC=2)C(=O)NC(CCC(N)=O)C(=O)NC(CC(C)C)C(=O)NC(CCC(O)=O)C(=O)NC(CC(N)=O)C(=O)NC(CC=2C=CC(O)=CC=2)C(=O)NC(CSSCC(NC(=O)C(C(C)C)NC(=O)C(CC(C)C)NC(=O)C(CC=2C=CC(O)=CC=2)NC(=O)C(CC(C)C)NC(=O)C(C)NC(=O)C(CCC(O)=O)NC(=O)C(C(C)C)NC(=O)C(CC(C)C)NC(=O)C(CC=2NC=NC=2)NC(=O)C(CO)NC(=O)CNC2=O)C(=O)NCC(=O)NC(CCC(O)=O)C(=O)NC(CCCNC(N)=N)C(=O)NCC(=O)NC(CC=3C=CC=CC=3)C(=O)NC(CC=3C=CC=CC=3)C(=O)NC(CC=3C=CC(O)=CC=3)C(=O)NC(C(C)O)C(=O)N3C(CCC3)C(=O)NC(CCCCN)C(=O)NC(C)C(O)=O)C(=O)NC(CC(N)=O)C(O)=O)=O)NC(=O)C(C(C)CC)NC(=O)C(CO)NC(=O)C(C(C)O)NC(=O)C1CSSCC2NC(=O)C(CC(C)C)NC(=O)C(NC(=O)C(CCC(N)=O)NC(=O)C(CC(N)=O)NC(=O)C(NC(=O)C(N)CC=1C=CC=CC=1)C(C)C)CC1=CN=CN1 NOESYZHRGYRDHS-UHFFFAOYSA-N 0.000 description 4
- GCYXWQUSHADNBF-AAEALURTSA-N preproglucagon 78-108 Chemical compound C([C@@H](C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C)C(=O)N[C@@H](CC=1C2=CC=CC=C2NC=1)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCCCN)C(=O)NCC(=O)N[C@@H](CCCNC(N)=N)C(=O)NCC(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](C)NC(=O)[C@H](C)NC(=O)[C@H](CCC(N)=O)NC(=O)CNC(=O)[C@H](CCC(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@H](CC=1C=CC(O)=CC=1)NC(=O)[C@H](CO)NC(=O)[C@H](CO)NC(=O)[C@@H](NC(=O)[C@H](CC(O)=O)NC(=O)[C@H](CO)NC(=O)[C@@H](NC(=O)[C@H](CC=1C=CC=CC=1)NC(=O)[C@@H](NC(=O)CNC(=O)[C@H](CCC(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CC=1N=CNC=1)[C@@H](C)O)[C@@H](C)O)C(C)C)C1=CC=CC=C1 GCYXWQUSHADNBF-AAEALURTSA-N 0.000 description 4
- -1 β-endorphine Chemical class 0.000 description 4
- 241000894006 Bacteria Species 0.000 description 3
- 102000051325 Glucagon Human genes 0.000 description 3
- 102000056251 Prolyl Oligopeptidases Human genes 0.000 description 3
- 101710178372 Prolyl endopeptidase Proteins 0.000 description 3
- JLCPHMBAVCMARE-UHFFFAOYSA-N [3-[[3-[[3-[[3-[[3-[[3-[[3-[[3-[[3-[[3-[[3-[[5-(2-amino-6-oxo-1H-purin-9-yl)-3-[[3-[[3-[[3-[[3-[[3-[[5-(2-amino-6-oxo-1H-purin-9-yl)-3-[[5-(2-amino-6-oxo-1H-purin-9-yl)-3-hydroxyoxolan-2-yl]methoxy-hydroxyphosphoryl]oxyoxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(5-methyl-2,4-dioxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxyoxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(5-methyl-2,4-dioxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(5-methyl-2,4-dioxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(5-methyl-2,4-dioxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methyl [5-(6-aminopurin-9-yl)-2-(hydroxymethyl)oxolan-3-yl] hydrogen phosphate Polymers Cc1cn(C2CC(OP(O)(=O)OCC3OC(CC3OP(O)(=O)OCC3OC(CC3O)n3cnc4c3nc(N)[nH]c4=O)n3cnc4c3nc(N)[nH]c4=O)C(COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3CO)n3cnc4c(N)ncnc34)n3ccc(N)nc3=O)n3cnc4c(N)ncnc34)n3ccc(N)nc3=O)n3ccc(N)nc3=O)n3ccc(N)nc3=O)n3cnc4c(N)ncnc34)n3cnc4c(N)ncnc34)n3cc(C)c(=O)[nH]c3=O)n3cc(C)c(=O)[nH]c3=O)n3ccc(N)nc3=O)n3cc(C)c(=O)[nH]c3=O)n3cnc4c3nc(N)[nH]c4=O)n3cnc4c(N)ncnc34)n3cnc4c(N)ncnc34)n3cnc4c(N)ncnc34)n3cnc4c(N)ncnc34)O2)c(=O)[nH]c1=O JLCPHMBAVCMARE-UHFFFAOYSA-N 0.000 description 3
- 230000001580 bacterial effect Effects 0.000 description 3
- 239000002299 complementary DNA Substances 0.000 description 3
- 239000013604 expression vector Substances 0.000 description 3
- 230000002538 fungal effect Effects 0.000 description 3
- 230000003834 intracellular effect Effects 0.000 description 3
- 239000011159 matrix material Substances 0.000 description 3
- 238000003752 polymerase chain reaction Methods 0.000 description 3
- 229940024999 proteolytic enzymes for treatment of wounds and ulcers Drugs 0.000 description 3
- 230000003362 replicative effect Effects 0.000 description 3
- 230000009466 transformation Effects 0.000 description 3
- XLYOFNOQVPJJNP-UHFFFAOYSA-N water Substances O XLYOFNOQVPJJNP-UHFFFAOYSA-N 0.000 description 3
- DDYAPMZTJAYBOF-ZMYDTDHYSA-N (3S)-4-[[(2S)-1-[[(2S)-1-[[(2S)-5-amino-1-[[(2S)-1-[[(2S)-1-[[(2S)-1-[[(2S)-4-amino-1-[[(2S,3R)-1-[[(2S)-6-amino-1-[[(2S)-1-[[(2S)-4-amino-1-[[(2S)-1-[[(2S)-4-amino-1-[[(2S)-4-amino-1-[[(2S,3S)-1-[[(1S)-1-carboxyethyl]amino]-3-methyl-1-oxopentan-2-yl]amino]-1,4-dioxobutan-2-yl]amino]-1,4-dioxobutan-2-yl]amino]-5-carbamimidamido-1-oxopentan-2-yl]amino]-1,4-dioxobutan-2-yl]amino]-5-carbamimidamido-1-oxopentan-2-yl]amino]-1-oxohexan-2-yl]amino]-3-hydroxy-1-oxobutan-2-yl]amino]-1,4-dioxobutan-2-yl]amino]-4-methylsulfanyl-1-oxobutan-2-yl]amino]-4-methyl-1-oxopentan-2-yl]amino]-3-(1H-indol-3-yl)-1-oxopropan-2-yl]amino]-1,5-dioxopentan-2-yl]amino]-3-methyl-1-oxobutan-2-yl]amino]-1-oxo-3-phenylpropan-2-yl]amino]-3-[[(2S)-5-amino-2-[[(2S)-2-[[(2S)-2-[[(2S)-2-[[(2S)-2-[[(2S)-2-[[(2S)-2-[[(2S)-2-[[(2S)-6-amino-2-[[(2S)-2-[[(2S)-2-[[(2S)-2-[[(2S)-2-[[(2S,3R)-2-[[(2S)-2-[[(2S,3R)-2-[[2-[[(2S)-5-amino-2-[[(2S)-2-[[(2S)-2-amino-3-(1H-imidazol-4-yl)propanoyl]amino]-3-hydroxypropanoyl]amino]-5-oxopentanoyl]amino]acetyl]amino]-3-hydroxybutanoyl]amino]-3-phenylpropanoyl]amino]-3-hydroxybutanoyl]amino]-3-hydroxypropanoyl]amino]-3-carboxypropanoyl]amino]-3-(4-hydroxyphenyl)propanoyl]amino]-3-hydroxypropanoyl]amino]hexanoyl]amino]-3-(4-hydroxyphenyl)propanoyl]amino]-4-methylpentanoyl]amino]-3-carboxypropanoyl]amino]-3-hydroxypropanoyl]amino]-5-carbamimidamidopentanoyl]amino]-5-carbamimidamidopentanoyl]amino]propanoyl]amino]-5-oxopentanoyl]amino]-4-oxobutanoic acid Chemical class [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(N)=O)C(=O)NCC(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C)C(O)=O DDYAPMZTJAYBOF-ZMYDTDHYSA-N 0.000 description 2
- ODHCTXKNWHHXJC-VKHMYHEASA-N 5-oxo-L-proline Chemical group OC(=O)[C@@H]1CCC(=O)N1 ODHCTXKNWHHXJC-VKHMYHEASA-N 0.000 description 2
- FFMIYIMKQIMDPK-BQBZGAKWSA-N Asn-His Chemical compound NC(=O)C[C@H](N)C(=O)N[C@H](C(O)=O)CC1=CN=CN1 FFMIYIMKQIMDPK-BQBZGAKWSA-N 0.000 description 2
- 241000228245 Aspergillus niger Species 0.000 description 2
- 101100166113 Candida albicans (strain SC5314 / ATCC MYA-2876) SAP9 gene Proteins 0.000 description 2
- 108010059378 Endopeptidases Proteins 0.000 description 2
- 102000005593 Endopeptidases Human genes 0.000 description 2
- 241000206602 Eukaryota Species 0.000 description 2
- 108010088406 Glucagon-Like Peptides Proteins 0.000 description 2
- WQZGKKKJIJFFOK-GASJEMHNSA-N Glucose Natural products OC[C@H]1OC(O)[C@H](O)[C@@H](O)[C@@H]1O WQZGKKKJIJFFOK-GASJEMHNSA-N 0.000 description 2
- 102000004877 Insulin Human genes 0.000 description 2
- 108090001061 Insulin Proteins 0.000 description 2
- 101710172072 Kexin Proteins 0.000 description 2
- 101100395023 Neurospora crassa (strain ATCC 24698 / 74-OR23-1A / CBS 708.71 / DSM 1257 / FGSC 987) his-7 gene Proteins 0.000 description 2
- 108010058003 Proglucagon Proteins 0.000 description 2
- 101710118538 Protease Chemical group 0.000 description 2
- 101100319895 Saccharomyces cerevisiae (strain ATCC 204508 / S288c) YAP3 gene Proteins 0.000 description 2
- 101100160515 Saccharomyces cerevisiae (strain ATCC 204508 / S288c) YPS1 gene Proteins 0.000 description 2
- 239000007983 Tris buffer Substances 0.000 description 2
- IXKSXJFAGXLQOQ-XISFHERQSA-N WHWLQLKPGQPMY Chemical group C([C@@H](C(=O)N[C@@H](CC=1C2=CC=CC=C2NC=1)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N1CCC[C@H]1C(=O)NCC(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(O)=O)NC(=O)[C@@H](N)CC=1C2=CC=CC=C2NC=1)C1=CNC=N1 IXKSXJFAGXLQOQ-XISFHERQSA-N 0.000 description 2
- 101150008621 YPS1 gene Proteins 0.000 description 2
- 108010048241 acetamidase Proteins 0.000 description 2
- 125000002252 acyl group Chemical group 0.000 description 2
- 108090000637 alpha-Amylases Proteins 0.000 description 2
- 102000004139 alpha-Amylases Human genes 0.000 description 2
- 229940024171 alpha-amylase Drugs 0.000 description 2
- 230000003115 biocidal effect Effects 0.000 description 2
- 230000015556 catabolic process Effects 0.000 description 2
- 230000002759 chromosomal effect Effects 0.000 description 2
- 238000006731 degradation reaction Methods 0.000 description 2
- 210000002472 endoplasmic reticulum Anatomy 0.000 description 2
- 238000005516 engineering process Methods 0.000 description 2
- 239000013613 expression plasmid Substances 0.000 description 2
- 230000002068 genetic effect Effects 0.000 description 2
- 239000008103 glucose Substances 0.000 description 2
- 239000001963 growth medium Substances 0.000 description 2
- 238000011534 incubation Methods 0.000 description 2
- 229940125396 insulin Drugs 0.000 description 2
- 238000011031 large-scale manufacturing process Methods 0.000 description 2
- 239000003550 marker Substances 0.000 description 2
- 235000019419 proteases Nutrition 0.000 description 2
- 239000000243 solution Substances 0.000 description 2
- 241000894007 species Species 0.000 description 2
- 238000010561 standard procedure Methods 0.000 description 2
- 238000013518 transcription Methods 0.000 description 2
- 230000035897 transcription Effects 0.000 description 2
- LENZDBCJOHFCAS-UHFFFAOYSA-N tris Chemical compound OCC(N)(CO)CO LENZDBCJOHFCAS-UHFFFAOYSA-N 0.000 description 2
- 238000011144 upstream manufacturing Methods 0.000 description 2
- GZCWLCBFPRFLKL-UHFFFAOYSA-N 1-prop-2-ynoxypropan-2-ol Chemical compound CC(O)COCC#C GZCWLCBFPRFLKL-UHFFFAOYSA-N 0.000 description 1
- 101710163881 5,6-dihydroxyindole-2-carboxylic acid oxidase Proteins 0.000 description 1
- RZVAJINKPMORJF-UHFFFAOYSA-N Acetaminophen Chemical compound CC(=O)NC1=CC=C(O)C=C1 RZVAJINKPMORJF-UHFFFAOYSA-N 0.000 description 1
- 101000793340 Achromobacter lyticus Protease 1 Proteins 0.000 description 1
- 229920000936 Agarose Polymers 0.000 description 1
- 102000002260 Alkaline Phosphatase Human genes 0.000 description 1
- 108020004774 Alkaline Phosphatase Proteins 0.000 description 1
- 239000004382 Amylase Substances 0.000 description 1
- 108010065511 Amylases Proteins 0.000 description 1
- 102000013142 Amylases Human genes 0.000 description 1
- 241000534414 Anotopterus nikparini Species 0.000 description 1
- 108010037870 Anthranilate Synthase Proteins 0.000 description 1
- KLKHFFMNGWULBN-VKHMYHEASA-N Asn-Gly Chemical group NC(=O)C[C@H](N)C(=O)NCC(O)=O KLKHFFMNGWULBN-VKHMYHEASA-N 0.000 description 1
- DCXYFEDJOCDNAF-UHFFFAOYSA-N Asparagine Natural products OC(=O)C(N)CC(N)=O DCXYFEDJOCDNAF-UHFFFAOYSA-N 0.000 description 1
- 102000004580 Aspartic Acid Proteases Human genes 0.000 description 1
- 108010017640 Aspartic Acid Proteases Proteins 0.000 description 1
- 240000006439 Aspergillus oryzae Species 0.000 description 1
- 235000002247 Aspergillus oryzae Nutrition 0.000 description 1
- 241000193830 Bacillus <bacterium> Species 0.000 description 1
- 101000775727 Bacillus amyloliquefaciens Alpha-amylase Proteins 0.000 description 1
- 241000194108 Bacillus licheniformis Species 0.000 description 1
- 101000695691 Bacillus licheniformis Beta-lactamase Proteins 0.000 description 1
- 108010029675 Bacillus licheniformis alpha-amylase Proteins 0.000 description 1
- 244000063299 Bacillus subtilis Species 0.000 description 1
- 235000014469 Bacillus subtilis Nutrition 0.000 description 1
- 101900040182 Bacillus subtilis Levansucrase Proteins 0.000 description 1
- 102100030981 Beta-alanine-activating enzyme Human genes 0.000 description 1
- 125000001433 C-terminal amino-acid group Chemical group 0.000 description 1
- 241000222120 Candida <Saccharomycetales> Species 0.000 description 1
- 102000005367 Carboxypeptidases Human genes 0.000 description 1
- 108010006303 Carboxypeptidases Proteins 0.000 description 1
- 102000005572 Cathepsin A Human genes 0.000 description 1
- 108010059081 Cathepsin A Proteins 0.000 description 1
- 241000235646 Cyberlindnera jadinii Species 0.000 description 1
- 150000008574 D-amino acids Chemical group 0.000 description 1
- 101100342470 Dictyostelium discoideum pkbA gene Proteins 0.000 description 1
- BWGNESOTFCXPMA-UHFFFAOYSA-N Dihydrogen disulfide Chemical compound SS BWGNESOTFCXPMA-UHFFFAOYSA-N 0.000 description 1
- 108010016626 Dipeptides Proteins 0.000 description 1
- 241000196324 Embryophyta Species 0.000 description 1
- 101100385973 Escherichia coli (strain K12) cycA gene Proteins 0.000 description 1
- 102000018389 Exopeptidases Human genes 0.000 description 1
- 108010091443 Exopeptidases Proteins 0.000 description 1
- 108010074860 Factor Xa Proteins 0.000 description 1
- 241000192125 Firmicutes Species 0.000 description 1
- 108010058643 Fungal Proteins Proteins 0.000 description 1
- 241000193385 Geobacillus stearothermophilus Species 0.000 description 1
- 101100001650 Geobacillus stearothermophilus amyM gene Proteins 0.000 description 1
- 241000178290 Geotrichum fermentans Species 0.000 description 1
- 101150009006 HIS3 gene Proteins 0.000 description 1
- 101100295959 Halobacterium salinarum (strain ATCC 700922 / JCM 11081 / NRC-1) arcB gene Proteins 0.000 description 1
- 101100246753 Halobacterium salinarum (strain ATCC 700922 / JCM 11081 / NRC-1) pyrF gene Proteins 0.000 description 1
- 241000238631 Hexapoda Species 0.000 description 1
- LNCFUHAPNTYMJB-IUCAKERBSA-N His-Pro Chemical compound C([C@H](N)C(=O)N1[C@@H](CCC1)C(O)=O)C1=CN=CN1 LNCFUHAPNTYMJB-IUCAKERBSA-N 0.000 description 1
- 101000773364 Homo sapiens Beta-alanine-activating enzyme Proteins 0.000 description 1
- 101000788682 Homo sapiens GATA-type zinc finger protein 1 Proteins 0.000 description 1
- 101000801742 Homo sapiens Triosephosphate isomerase Proteins 0.000 description 1
- 241001138401 Kluyveromyces lactis Species 0.000 description 1
- 241000235058 Komagataella pastoris Species 0.000 description 1
- 241000235087 Lachancea kluyveri Species 0.000 description 1
- NPBGTPKLVJEOBE-IUCAKERBSA-N Lys-Arg Chemical group NCCCC[C@H](N)C(=O)N[C@H](C(O)=O)CCCNC(N)=N NPBGTPKLVJEOBE-IUCAKERBSA-N 0.000 description 1
- 101150068888 MET3 gene Proteins 0.000 description 1
- 241000235042 Millerozyma farinosa Species 0.000 description 1
- 101100243377 Mus musculus Pepd gene Proteins 0.000 description 1
- 229910017912 NH2OH Inorganic materials 0.000 description 1
- 101100022915 Neurospora crassa (strain ATCC 24698 / 74-OR23-1A / CBS 708.71 / DSM 1257 / FGSC 987) cys-11 gene Proteins 0.000 description 1
- 241000736107 Novosphingobium capsulatum Species 0.000 description 1
- 241000320412 Ogataea angusta Species 0.000 description 1
- 241001452677 Ogataea methanolica Species 0.000 description 1
- 102000007981 Ornithine carbamoyltransferase Human genes 0.000 description 1
- 101710113020 Ornithine transcarbamylase, mitochondrial Proteins 0.000 description 1
- 102100037214 Orotidine 5'-phosphate decarboxylase Human genes 0.000 description 1
- 108010055012 Orotidine-5'-phosphate decarboxylase Proteins 0.000 description 1
- 101150029183 PEP4 gene Proteins 0.000 description 1
- 206010034133 Pathogen resistance Diseases 0.000 description 1
- 241001489192 Pichia kluyveri Species 0.000 description 1
- 241000589774 Pseudomonas sp. Species 0.000 description 1
- ODHCTXKNWHHXJC-GSVOUGTGSA-N Pyroglutamic acid Natural products OC(=O)[C@H]1CCC(=O)N1 ODHCTXKNWHHXJC-GSVOUGTGSA-N 0.000 description 1
- 108020004511 Recombinant DNA Proteins 0.000 description 1
- 241000235403 Rhizomucor miehei Species 0.000 description 1
- 101100394989 Rhodopseudomonas palustris (strain ATCC BAA-98 / CGA009) hisI gene Proteins 0.000 description 1
- 102000006382 Ribonucleases Human genes 0.000 description 1
- 108010083644 Ribonucleases Proteins 0.000 description 1
- 241000235346 Schizosaccharomyces Species 0.000 description 1
- 241000235347 Schizosaccharomyces pombe Species 0.000 description 1
- 101000733770 Schizosaccharomyces pombe (strain 972 / ATCC 24843) Aminopeptidase 1 Proteins 0.000 description 1
- 101100022918 Schizosaccharomyces pombe (strain 972 / ATCC 24843) sua1 gene Proteins 0.000 description 1
- 101900104102 Schizosaccharomyces pombe Triosephosphate isomerase Proteins 0.000 description 1
- 101100309436 Streptococcus mutans serotype c (strain ATCC 700610 / UA159) ftf gene Proteins 0.000 description 1
- 241000187747 Streptomyces Species 0.000 description 1
- 241000187432 Streptomyces coelicolor Species 0.000 description 1
- 101100370749 Streptomyces coelicolor (strain ATCC BAA-471 / A3(2) / M145) trpC1 gene Proteins 0.000 description 1
- 101150033985 TPI gene Proteins 0.000 description 1
- 239000004098 Tetracycline Substances 0.000 description 1
- 101150050575 URA3 gene Proteins 0.000 description 1
- 101710103041 Vacuolar protease A Proteins 0.000 description 1
- 241000235015 Yarrowia lipolytica Species 0.000 description 1
- 230000001594 aberrant effect Effects 0.000 description 1
- 238000010521 absorption reaction Methods 0.000 description 1
- 239000002253 acid Substances 0.000 description 1
- ODHCTXKNWHHXJC-UHFFFAOYSA-N acide pyroglutamique Natural products OC(=O)C1CCC(=O)N1 ODHCTXKNWHHXJC-UHFFFAOYSA-N 0.000 description 1
- 230000004913 activation Effects 0.000 description 1
- 108091005647 acylated proteins Proteins 0.000 description 1
- 238000001042 affinity chromatography Methods 0.000 description 1
- 108010045649 agarase Proteins 0.000 description 1
- 230000002776 aggregation Effects 0.000 description 1
- 238000004220 aggregation Methods 0.000 description 1
- 230000029936 alkylation Effects 0.000 description 1
- 238000005804 alkylation reaction Methods 0.000 description 1
- 238000010640 amide synthesis reaction Methods 0.000 description 1
- 150000001412 amines Chemical class 0.000 description 1
- 125000003277 amino group Chemical group 0.000 description 1
- BFNBIHQBYMNNAN-UHFFFAOYSA-N ammonium sulfate Chemical compound N.N.OS(O)(=O)=O BFNBIHQBYMNNAN-UHFFFAOYSA-N 0.000 description 1
- 229910052921 ammonium sulfate Inorganic materials 0.000 description 1
- 235000011130 ammonium sulphate Nutrition 0.000 description 1
- 239000001166 ammonium sulphate Substances 0.000 description 1
- 229960000723 ampicillin Drugs 0.000 description 1
- AVKUERGKIZMTKX-NJBDSQKTSA-N ampicillin Chemical compound C1([C@@H](N)C(=O)N[C@H]2[C@H]3SC([C@@H](N3C2=O)C(O)=O)(C)C)=CC=CC=C1 AVKUERGKIZMTKX-NJBDSQKTSA-N 0.000 description 1
- 235000019418 amylase Nutrition 0.000 description 1
- 101150008194 argB gene Proteins 0.000 description 1
- 210000004507 artificial chromosome Anatomy 0.000 description 1
- 229960001230 asparagine Drugs 0.000 description 1
- 235000009582 asparagine Nutrition 0.000 description 1
- 239000003139 biocide Substances 0.000 description 1
- 210000004556 brain Anatomy 0.000 description 1
- 239000006227 byproduct Substances 0.000 description 1
- 210000000170 cell membrane Anatomy 0.000 description 1
- 210000002421 cell wall Anatomy 0.000 description 1
- 230000001413 cellular effect Effects 0.000 description 1
- 238000005119 centrifugation Methods 0.000 description 1
- 238000006243 chemical reaction Methods 0.000 description 1
- 229960005091 chloramphenicol Drugs 0.000 description 1
- WIIZWVCIJKGZOK-RKDXNWHRSA-N chloramphenicol Chemical compound ClC(Cl)C(=O)N[C@H](CO)[C@H](O)C1=CC=C([N+]([O-])=O)C=C1 WIIZWVCIJKGZOK-RKDXNWHRSA-N 0.000 description 1
- 210000000349 chromosome Anatomy 0.000 description 1
- 239000012228 culture supernatant Substances 0.000 description 1
- ATDGTVJJHBUTRL-UHFFFAOYSA-N cyanogen bromide Chemical compound BrC#N ATDGTVJJHBUTRL-UHFFFAOYSA-N 0.000 description 1
- 101150005799 dagA gene Proteins 0.000 description 1
- 230000003247 decreasing effect Effects 0.000 description 1
- 239000007857 degradation product Substances 0.000 description 1
- 238000003795 desorption Methods 0.000 description 1
- 230000029087 digestion Effects 0.000 description 1
- 238000010790 dilution Methods 0.000 description 1
- 239000012895 dilution Substances 0.000 description 1
- 238000011143 downstream manufacturing Methods 0.000 description 1
- 230000000694 effects Effects 0.000 description 1
- 238000001962 electrophoresis Methods 0.000 description 1
- 229940066758 endopeptidases Drugs 0.000 description 1
- 238000006911 enzymatic reaction Methods 0.000 description 1
- 150000002148 esters Chemical class 0.000 description 1
- 108091023030 exopolyphosphatase Proteins 0.000 description 1
- 239000000706 filtrate Substances 0.000 description 1
- 238000001914 filtration Methods 0.000 description 1
- 230000004927 fusion Effects 0.000 description 1
- UKVFVQPAANCXIL-FJVFSOETSA-N glp-1 (1-37) amide Chemical compound C([C@@H](C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C)C(=O)N[C@@H](CC=1C2=CC=CC=C2NC=1)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCCCN)C(=O)NCC(=O)N[C@@H](CCCNC(N)=N)C(=O)NCC(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](C)NC(=O)[C@H](C)NC(=O)[C@H](CCC(N)=O)NC(=O)CNC(=O)[C@H](CCC(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@H](CC=1C=CC(O)=CC=1)NC(=O)[C@H](CO)NC(=O)[C@H](CO)NC(=O)[C@@H](NC(=O)[C@H](CC(O)=O)NC(=O)[C@H](CO)NC(=O)[C@@H](NC(=O)[C@H](CC=1C=CC=CC=1)NC(=O)[C@@H](NC(=O)CNC(=O)[C@H](CCC(O)=O)NC(=O)[C@H](C)NC(=O)[C@H](CC=1NC=NC=1)NC(=O)[C@H](CCCNC(N)=N)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@H](CC=1C=CC=CC=1)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CC=1NC=NC=1)[C@@H](C)O)[C@@H](C)O)C(C)C)C1=CC=CC=C1 UKVFVQPAANCXIL-FJVFSOETSA-N 0.000 description 1
- 108010061330 glucan 1,4-alpha-maltohydrolase Proteins 0.000 description 1
- 210000002288 golgi apparatus Anatomy 0.000 description 1
- 229910001385 heavy metal Inorganic materials 0.000 description 1
- 239000008241 heterogeneous mixture Substances 0.000 description 1
- 238000000589 high-performance liquid chromatography-mass spectrometry Methods 0.000 description 1
- 108010085325 histidylproline Proteins 0.000 description 1
- 230000006801 homologous recombination Effects 0.000 description 1
- 238000002744 homologous recombination Methods 0.000 description 1
- 210000003405 ileum Anatomy 0.000 description 1
- 238000001727 in vivo Methods 0.000 description 1
- 230000010354 integration Effects 0.000 description 1
- 238000005342 ion exchange Methods 0.000 description 1
- 238000004255 ion exchange chromatography Methods 0.000 description 1
- 229960000318 kanamycin Drugs 0.000 description 1
- 229930027917 kanamycin Natural products 0.000 description 1
- SBUJHOSQTJFQJX-NOAMYHISSA-N kanamycin Chemical compound O[C@@H]1[C@@H](O)[C@H](O)[C@@H](CN)O[C@@H]1O[C@H]1[C@H](O)[C@@H](O[C@@H]2[C@@H]([C@@H](N)[C@H](O)[C@@H](CO)O2)O)[C@H](N)C[C@@H]1N SBUJHOSQTJFQJX-NOAMYHISSA-N 0.000 description 1
- 229930182823 kanamycin A Natural products 0.000 description 1
- 101150039489 lysZ gene Proteins 0.000 description 1
- 238000004519 manufacturing process Methods 0.000 description 1
- 238000004949 mass spectrometry Methods 0.000 description 1
- 238000001840 matrix-assisted laser desorption--ionisation time-of-flight mass spectrometry Methods 0.000 description 1
- 239000012528 membrane Substances 0.000 description 1
- 239000000693 micelle Substances 0.000 description 1
- 230000000813 microbial effect Effects 0.000 description 1
- 244000005700 microbiome Species 0.000 description 1
- 230000035772 mutation Effects 0.000 description 1
- 230000007935 neutral effect Effects 0.000 description 1
- 150000007523 nucleic acids Chemical group 0.000 description 1
- 210000000496 pancreas Anatomy 0.000 description 1
- 101150019841 penP gene Proteins 0.000 description 1
- 210000001322 periplasm Anatomy 0.000 description 1
- 230000000144 pharmacologic effect Effects 0.000 description 1
- 230000001376 precipitating effect Effects 0.000 description 1
- 230000002797 proteolythic effect Effects 0.000 description 1
- 210000001938 protoplast Anatomy 0.000 description 1
- 101150054232 pyrG gene Proteins 0.000 description 1
- 230000006340 racemization Effects 0.000 description 1
- 238000003259 recombinant expression Methods 0.000 description 1
- 238000011084 recovery Methods 0.000 description 1
- 230000001105 regulatory effect Effects 0.000 description 1
- 101150025220 sacB gene Proteins 0.000 description 1
- 229940081969 saccharomyces cerevisiae Drugs 0.000 description 1
- 150000003839 salts Chemical class 0.000 description 1
- 238000012216 screening Methods 0.000 description 1
- 230000028327 secretion Effects 0.000 description 1
- 210000004739 secretory vesicle Anatomy 0.000 description 1
- 125000006850 spacer group Chemical group 0.000 description 1
- 239000006228 supernatant Substances 0.000 description 1
- 125000000999 tert-butyl group Chemical group [H]C([H])([H])C(*)(C([H])([H])[H])C([H])([H])[H] 0.000 description 1
- 229960002180 tetracycline Drugs 0.000 description 1
- 229930101283 tetracycline Natural products 0.000 description 1
- 235000019364 tetracycline Nutrition 0.000 description 1
- 150000003522 tetracyclines Chemical class 0.000 description 1
- 230000002103 transcriptional effect Effects 0.000 description 1
- 230000005945 translocation Effects 0.000 description 1
- 230000032258 transport Effects 0.000 description 1
- 101150016309 trpC gene Proteins 0.000 description 1
Classifications
-
- C—CHEMISTRY; METALLURGY
- C07—ORGANIC CHEMISTRY
- C07K—PEPTIDES
- C07K1/00—General methods for the preparation of peptides, i.e. processes for the organic chemical preparation of peptides or proteins of any length
- C07K1/006—General methods for the preparation of peptides, i.e. processes for the organic chemical preparation of peptides or proteins of any length of peptides containing derivatised side chain amino acids
-
- C—CHEMISTRY; METALLURGY
- C07—ORGANIC CHEMISTRY
- C07K—PEPTIDES
- C07K1/00—General methods for the preparation of peptides, i.e. processes for the organic chemical preparation of peptides or proteins of any length
- C07K1/107—General methods for the preparation of peptides, i.e. processes for the organic chemical preparation of peptides or proteins of any length by chemical modification of precursor peptides
- C07K1/1072—General methods for the preparation of peptides, i.e. processes for the organic chemical preparation of peptides or proteins of any length by chemical modification of precursor peptides by covalent attachment of residues or functional groups
- C07K1/1077—General methods for the preparation of peptides, i.e. processes for the organic chemical preparation of peptides or proteins of any length by chemical modification of precursor peptides by covalent attachment of residues or functional groups by covalent attachment of residues other than amino acids or peptide residues, e.g. sugars, polyols, fatty acids
-
- C—CHEMISTRY; METALLURGY
- C07—ORGANIC CHEMISTRY
- C07K—PEPTIDES
- C07K14/00—Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof
- C07K14/435—Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof from animals; from humans
- C07K14/575—Hormones
- C07K14/605—Glucagons
Definitions
- the present invention is related to a method of producing acylated proteins or poly- peptides by expressing certain precursors of the desired polypeptide that protects the expressed polypeptide against proteolytic degradation within the host cell.
- the invention is also related to DNA-sequences, vectors and transformed host cells for use in the claimed method.
- the present invention is related to certain precursors of the desired polypeptides and an acylation method for acylation method for acylation of one or more lysine resi- dues in the desired polypeptide.
- Recombinant DNA technology has enabled expression of foreign (heterologous) polypeptides in microbial and other host cells.
- yeast expression of heterologous polypeptides after transformation of yeast cells with suitable expression vectors comprising DNA se- quences coding for said polypeptides has been successful for many species of polypeptides, such as insulin and insulin precursors, glucagon, glucagon like peptides and analogues thereof.
- a common problem with expression of proteins or polypeptides of a limited size in a recombinant host is, however, proteolytic degradation of the expressed product by proteolytic enzymes produced by the host organism.
- the isolated product may be a heterogeneous mixture of species of the desired polypeptide having different amino acid chain lengths.
- Yeast contains a number of proteases used for processing yeast proteins e.g. Kex2p and Ypsl p which cleave at the C-terminal side of a dibasic amino acid sequence, and the carboxypeptidase Kexlp which digests remaining basic amino acids after the endoproteolytic digestion by Kex2p, and Ste13p or Dap2p which cleave at X-Ala or X-Pro.
- proteases used for processing yeast proteins e.g. Kex2p and Ypsl p which cleave at the C-terminal side of a dibasic amino acid sequence
- carboxypeptidase Kexlp which digests remaining basic amino acids after the endoproteolytic digestion by Kex2p
- Ste13p or Dap2p which cleave at X-Ala or X-Pro.
- polypeptides e.g. polypeptides having from about 10 to about 100 amino acids chains and none or a few disulphide bonds and/or are rich in basic amino acids, such as ⁇ -endorphine, glucagon and glucagon like peptides may be especially susceptible to intracellular and extracellular proteolytic degradation when expressed in a transformed host cell due to their short-chain open and non-disulfide stabilized structure resulting in an inhomogeneous product which may be proteolytically degraded in the N- and C-terminal ends as well as endoproteolytically degraded.
- N-terminal cleavage of expressed polypeptides by host cell produced enzymes may cause decreased yield of a desired product with correct N-terminal if the N- terminal of the expressed product constitutes a cleavage site for endogenous enzymes.
- yeast for example the enzyme Ste13p cleaves at X-Ala or X-Pro, where X can be any amino acid residue.
- polypeptides with an Ala or Pro residue as the second residue from the N-terminal end may be cleaved at the N-terminal end and the recovered polypeptide may be a mixture of different degradation products complicating the recovery process and reducing the overall yield.
- small polypeptides with little tertiary structure and low content of ⁇ - helixes may have a higher tendency to form ⁇ -sheets that stack on each other and form fibrils during fermentation and down stream separation and purification steps in large scale production. Formation of fibrils may cause unwanted precipitation with loss of the desired product. Fibrillation may be prevented by treatment at high pH. However, such alkaline treatment is pretty harsh to the product and may cause unwanted formation of D-amino acids residues.
- Human GLP-1 is a 37 amino acid residue peptide originating from preproglucagon which is synthesised in the L-cells in the distal ileum, in the pancreas and in the brain. Processing of preproglucagon to give GLP-1 (7-36) amide, GLP-1 (7-37) and GLP-2 occurs mainly in the L- cells. Both GLP-1 and GLP-2 have an Ala as the second amino acid residue from the N- terminal end and are thus prone for N-terminal cleavage when expressed in a host organism such as yeast.
- the present invention is related to a method for making a polypeptide comprising at least one lysine residue being acylated in its ⁇ -amino group, said method comprising the following steps:
- the order of steps (ii) to (iv) may be changed.
- the acylating step (iii) is conducted after the removal step (iv) so that the N-terminal extension is removed before the polypeptide is acylated in the desired position.
- the desired polypeptide may contain more than one lysine residue as a potential target for acylation but will typically only contain one lysine residue. Thus, in one embodiment, the desired polypeptide is monoacylated.
- the N-terminal extension will typically be of up to 15 amino acids in length and may be from 1-15; 2-15; 3-15; 3-12; 3-10; 3-9; 3-8; 3-7; 3-6; or 3-5 amino acids in length.
- the amino acids in the N-terminal extension are selected with a multiple purpose: 1) to protect the expressed precursor molecule against endoproteolytic degradation; 2) to avoid acylation at the N-terminal amino acid residue of the desired polypeptide, i.e. to ensure that acylation takes place preferentially or only at the wanted position in the desired polypeptide; and 3) to prevent precipitation caused by fibrillation during fermentation and down stream processing steps such as separation and purification in large scale production.
- the amino acid residues at both ends of the N-terminal extension should be selected so as to ensure efficient cleavage of the N-terminal extension from the desired polypeptide at the C-terminal end and at the N-terminal end from possible upstream sequences such as pre- or pre-pro peptides which have the purpose of ensuring transport of the expressed precursor molecule out of the host cell and into the culture medium.
- the N-terminal extension may serve as a tag for purification purposes.
- the N-terminal extension is found to be stably attached to the precursor molecule of the invention during fermentation, protecting the N-terminal end of the precursor molecule against the proteolytic enzymes such as Ste13p and/or Dap2p.
- the N-terminal extension is removably attached to the N-terminal end of the desired polypeptide.
- the C-terminal end of the N-terminal extension will constitute a cleavage site or will together with amino acid residues at the N-terminal end of the desired polypeptide constitute a suitable cleavage site.
- This cleavage site is different from lysine to avoid acyla- tion of the precursor molecule at this position.
- Cleavage may be conducted by means of chemicals like cyanogen bromide (E. Gross: Methods in Enzymlogy XI, 1967, 238-255, Editor: CHW Hirs, and JP Whitelegge et al, Protein Science 2000, 9, 1618-1630) or hydroxylamine cleaving at the C-terminal side of Met or Asn.
- Asn cleavage is enhanced by the presence of a Gly N-terminal to the cleavage site (Asn IGly).
- the cleavage can also be effected by specific exoproteases such as a suitable proteolytic enzyme which is specific for the chosen cleavage site, cleaving at an N-terminal pyroglutamic acid or endoproteases like praline endopeptidases (EC 3.4.22.26) cleaving at the C-terminal side of Pro in polypeptides.
- endoproteases as trypsins cleave at the C-terminal side of many single Arg-residues, while other like Factor Xa is more specific and often cleaves after the sequence lle-Glu-Gly-Arg.
- Kex2p or PC1 and similar enzymes cleave at a dibasic cleavage site such as Arg-Arg.
- a mixture of chemical and enzymatic methods can be used if a Cys residue is placed N-terminally to the polypeptide.
- the SH-group can easily be acylated with an amine that is generating a pseudo-lysine amino acid which can then be cleaved by Achromobacter lyticus protease I after acylation of the lysine residue in the desired polypeptide, thereby protecting this site from cleavage.
- the cleavage site is chosen from the group consisting of Met, Asn, Pro, Gin, Cys and Arg-Arg.
- the desired polypeptide will comprise a Pro, Ala or Ser residue as the second amino acid residue from the N-terminal end.
- Such polypeptides will be especially vulnerable to degradation by proteolytic enzymes such as Ste13p.
- the desired polypeptide will have a His-Ala or a His-Pro, His-Ser or a Tyr as the N-terminal sequence.
- the N-terminal extension has the formula wherein X n X-i is a peptide sequence of from 1 - 14 amino acid residues in length and Y is Met, Asn, Pro, Gin, Cys or Arg-Arg, the function of X n Xi - Y being a) to protect the expressed polypeptide from endoproteolytic cleavage, b) to prevent acylation at the N- terminal end of the desired polypeptide and c) to prevent precipitation caused by fibrillation during fermentation and down stream separation and purification steps.
- the amino acid residues in X n X - Y are furthermore selected to obtain optimal in vitro cleavage of the N- terminal extension at its C-terminal end (at Y) and optimal in vivo cleavage at its N-terminal end (at X n ) at a KEX site from upstream signal-leader sequences.
- the amino acid residues in X n Xi may in principle be any amino acid residue except Lys as long as the peptide sequence fulfils at least one of the required purposes.
- number two amino acid residue from the N-terminal end of the extension is preferably not Ala or Pro.
- X n X ⁇ is a peptide sequence of from 1-15; 2-15; 3-15; 3-12;
- N-terminal amino acid residues in X n Xi are preferably chosen from Glu and Asp. Glu and/or Asp positioned at the N-terminal end of the N-terminal extension will also protect the expressed molecule against proteolytic degradation in the yeast cell.
- N-terminal extensions are Glu-Glu-Met; Glu-Glu-Ala-Glu-Met(SEQ ID NO:1); Glu-Glu-Ala-Glu-Asn(SEQ ID NO:2); Glu-Glu-Ala-Glu-Arg-Arg(SEQ ID NO:3); Gin; Glu-Pro-Gln(SEQ ID NO:4); Glu-Ala-Gln; Glu-Ala-Glu-Ala-Gln(SEQ ID NO:5); Glu-Ala-Glu- Ala-Glu-Ala-Gln(SEQ ID NO:6); Glu-Glu-Gly-Cys-Thr-Ser-lle-Cys(SEQ ID NO:7); Glu-His- Gly-Cys-Thr-Ser-lle-Cys(SEQ ID NO:8); Glu-Glu-Ala-Arg-Met(SEQ ID NO:9); Glu-Glu-Arg- Asn(SEQ ID NO:10); Glu-
- the present invention is related to a polypeptide precursor for a desired polypeptide said polypeptide precursor having the formula N-terminal extension-Y-i - * polypeptide *
- Y. is Met, Asn, Pro, Gin, Cys or Arg-Arg; the N-terminal extension has 1-14 amino acid residues as described above and * polypeptide * is the remaining part of thede- sired polypeptide.
- the present invention is related to a polypeptide precursor for a desired polypeptide said polypeptide precursor having the formula
- Y ⁇ is Met, Asn, Pro, Gin, Cys or Arg-Arg; Y 2 is His or Tyr, Y 3 is Ala, Ser or Gly, the N-terminal extension has 1-14 amino acid residues as described above and *polypeptide* is the remaining part of desired polypeptide.
- Y 2 is N-terminal amino acid residue in the desired polypeptide and Y 3 is the second amino acid residue from the N-terminal end of the desired polypeptide
- the present invention is related to polynucleotides encoding the claimed polypeptide precursors and vectors and transformed host cells containing such polynucleotides.
- acylated peptides which have a protracted profile relative to the native peptide or unmodified analogues. This phenomenon is disclosed and demonstrated in WO 98/08871 which discloses acylation of GLP-1 and analogues thereof and in WO 98/08872 which discloses acylation of GLP-2 and analogues thereof.
- the lipophilic group may be introduced by means of mono- or dipeptide spacers as disclosed in WO 98/08871.
- the lipophilic group may be introduced by means of ⁇ -amino- ⁇ , ⁇ -dicarboxylic acid groups as disclosed in WO 00/55119.
- the present polypeptide precursor will contain at least one lysine group with a free ⁇ -amino group to be acylated.
- acylating one or more free amino acid groups in a polypeptide acylation of the free amino group in the N-terminal amino acid residue is more or less avoidable.
- Certain methods have been developed to avoid acylation at the N-terminal amino acid residue, vide US patent No. 5,905,140.
- the present invention offers an alternative solution to the problem, i.e. to express an N-terminally extended precursor of the desired polypeptide.
- the precursor molecule can thus be preferentially acylated in the desired lysine residue which in the case of GLP-1 is the lysine in position 26. After acylation the acylated precursor molecule is cleaved by suitable chemical or enzymatic means as described above and the desired acylated polypeptide can be isolated.
- the acylation step (iii) may be conducted at a pH between 7 and 12.
- the pH will be between 8 and 11.5 or between 9.0 and 10.5 and a pH value of about 9.5 to 10.5 has proven to be efficient.
- the temperature will be between minus 5 and 35°C and will typically be between 0 and 20°C or between 15 and 30°C.
- preferential acylating is meant to include and acylation process where acylation takes place at one or more preferred positions in the molecule in a higher degree than at other positions in the same molecule.
- the acylation at the preferred positions is preferably at least 50, more preferred at least 80 and most preferred 90-100% of the total acylation.
- N-terminal extension is meant a polypeptide sequence removably attached to the N-terminal amino acid residue in the desired polypeptide.
- the N-terminal extension may be 1-15 amino acid residues in length and will not comprise a Lys residue.
- the N-terminal extension will protect the expressed fusion polypeptide against proteolytic degradation within the host cell as described above.
- N-terminal extension includes the cleavage site for cleavage of the N-terminal extension from the desired polypeptide's N- terminal end. It will be understood that whenever a specific N-terminal extension or a sequence being comprised in the N-terminal extension is shown, then the given C-terminal amino acid residue will be the cleavage site directly linked to the N-terminal amino acid residue of the desired polypeptide.
- GLP-1 An example of a desired polypeptide is GLP-1.
- the amino acid sequence of GLP-1 is given i.a. by Schmidt et al. (Diabetologia 28 704-707 (1985). Although the interesting pharmacological properties of GLP-1 (7-37) and analogues thereof have attracted much attention in recent years only little is known about the structure of these molecules.
- the secondary structure of GLP-1 in micelles has been described by Thorton et al. ⁇ Biochemistry 33 3532- 3539 (1994)), but in normal solution,GLP-1 is considered a very flexible molecule.
- Gly 8 GLP-1 (7-37) designates a fragment of GLP-1 derived from GLP-1 (1-37) by deleting the amino acid residues Nos. 1 to 6 and substituting the naturally occurring amino acid residue in position 8 (Ala) by Gly.
- Lys 26 (N ⁇ -tetradecanoyl)-GLP-1 (7-37) designates GLP-1 (7-37) wherein the ⁇ -amino group of the Lys residue in position 26 has been tetradecanoylated.
- GLP-2 and glucagon both belonging to the GRF (growth hormone releasing factor) family of peptides having a His or Tyr in the N-terminal position and Ser, Ala or Gly in the next position, vide Adelhorts K. et al., The Journal of Biological Chemistry (1994) p 6275 -6278).
- POT is the Schizosaccharomyces pombe triose phosphate isomerase gene
- TPI1 is the S. cerevisiae triose phosphate isomerase gene
- Fibrillation is meant a process where so called “fibrils” are formed.
- Fi- brils is a well recognized and described phenomenon and may be composed of antiparallel ⁇ -sheets. Molecules like GLP's with little ⁇ -helical structure and a very flexible and little tertiary structure are very prone to aggregation that leads to precipitation and loss of yield if very crude chemical conditions are not taken in use such as alkaline treatment at pH -12.
- leader an amino acid sequence consisting of a pre-peptide (the signal peptide) and a pro-peptide.
- signal peptide is understood to mean a pre-peptide which is present as an N-terminal sequence on the precursor form of a protein.
- the function of the signal peptide is to allow the heterologous protein to facilitate translocation into the endoplasmic reticulum.
- the signal peptide is normally cleaved off in the course of this process.
- the signal peptide may be heterologous or homologous to the yeast organism producing the protein.
- a number of signal peptides may be used with the DNA construct of the invention including the YPS1 signal peptide (formally called the YAP3 signal peptide) or any functional analogue thereof (Egel-Mitani et al.
- pro-peptide means a polypeptide sequence whose function is to allow the expressed polypeptide to be directed from the endoplasmic reticulum to the Golgi apparatus and further to a secretory vesicle for secretion into the culture medium (i.e. exportation of the polypeptide across the cell wall or at least through the cellular membrane into the periplasmic space of the yeast cell).
- the pro-peptide may be the yeast ⁇ -factor pro-peptide, vide US 4,546,082 and 4,870,008.
- the pro-peptide may be a synthetic pro-peptide, which is to say a pro-peptide not found in nature.
- Suitable synthetic pro-peptides are those disclosed in US 5,395,922; 5,795,746; 5,162,498 and WO 98/32867.
- the pro- peptide will preferably contain an endopeptidase processing site at the C-terminal end, such as a Lys- Arg sequence or any functional analog thereof.
- the polynucleotide sequence of the invention may be prepared synthetically by estab- lished standard methods, e.g. the phosphoamidite method described by Beaucage et al. (1981) Tetrahedron Letters 22:1859-1869, or the method described by Matthes et al. (1984) EMBO Journal 3:801-805.
- oligonucleotides are synthesized, for example, in an automatic DNA synthesizer, purified, duplexed and ligated to form the synthetic DNA construct.
- a currently preferred way of preparing the DNA construct is by polymerase chain reaction (PCR).
- the polynucleotide sequence of the invention may also be of mixed genomic, cDNA, and synthetic origin.
- a genomic or cDNA sequence encoding a leader peptide may be joined to a genomic or cDNA sequence encoding the precursor molecule of the invention, after which the DNA sequence may be modified at a site by inserting synthetic oli- gonucleotides encoding the desired amino acid sequence for homologous recombination in accordance with well-known procedures or preferably generating the desired sequence by PCR using suitable oligonucleotides.
- the invention encompasses a vector which is capable of replicating in the selected microorganism or host cell and which carries a polynucleotide sequence encoding the precur- sor molecule of the invention.
- the recombinant vector may be an autonomously replicating vector, i.e., a vector which exists as an extra-chromosomal entity, the replication of which is independent of chromosomal replication, e.g., a plasmid, an extra-chromosomal element, a mini-chromosome, or an artificial chromosome.
- the vector may contain any means for assuring self-replication.
- the vector may be one which, when introduced into the host cell, is integrated into the genome and replicated together with the chromosome(s) into which it has been integrated.
- a single vector or plasmid or two or more vectors or plasmids which together contain the total DNA to be introduced into the genome of the host cell, or a transposon may be used.
- the vector may be linear or closed circular plasmids and will preferably contain an element(s) that permits stable integration of the vector into the host cell's genome or autonomous replication of the vector in the cell independent of the genome.
- the recombinant expression vector is capable of replicating in yeast. Examples of sequences which enable the vector to replicate in yeast are the yeast plasmid 2 ⁇ m replication genes REP 1-3 and origin of replication.
- the vectors of the present invention preferably contain one or more selectable markers which permit easy selection of transformed cells.
- a selectable marker is a gene the product of which provides for biocide or viral resistance, resistance to heavy metals, prototrophy to auxotrophs, and the like.
- Examples of bacterial selectable markers are the dal genes from Bacillus subtilis or Bacillus licheniformis, or markers which confer antibiotic resistance such as ampicillin, kanamycin, chloramphenicol or tetracycline resistance.
- Selectable markers for use in a filamentous fungal host cell include amdS (acetamidase), argB (ornithine carbamoyl- transferase), pyrG (orotidine-5'-phosphate decarboxylase) and trpC (anthranilate synthase.
- Suitable markers for yeast host cells are ADE2, HIS3, LEU2, LYS2, MET3, TRP1, and URA3.
- a preferred selectable marker for yeast is the Schizosaccharomyces pompe TPI gene (Russell (1985) Gene 40:125-130).
- the polynucleotide sequence is operably connected to a suitable promoter sequence.
- the promoter may be any nucleic acid sequence which shows transcrip- tional activity in the host cell of choice including mutant, truncated, and hybrid promoters, and may be obtained from genes encoding extra-cellular or intra-cellular polypeptides either homologous or heterologous to the host cell.
- suitable promoters for directing the transcription in a bacterial host cell are the promoters obtained from the E. coli lac operon, Streptomyces coelicolor agarase gene (dagA), Bacillus subtilis levansucrase gene (sacB), Bacillus licheniformis alpha- amylase gene (amyL), Bacillus stearothermophilus maltogenic amylase gene (amyM), Bacillus amyloliquefaciens alpha-amylase gene (amyQ), and Bacillus licheniformis penicillinase gene (penP).
- dagA Streptomyces coelicolor agarase gene
- sacB Bacillus subtilis levansucrase gene
- amyL Bacillus stearothermophilus maltogenic amylase gene
- amyQ Bacillus amyloliquefaciens alpha-amylase gene
- penP Bacillus lichen
- promoters for directing the transcription in a filamentous fungal host cell are promoters obtained from the genes for Aspergillus oryzae TAKA amylase, Rhizomucor miehei aspartic proteinase, Aspergillus niger neutral alpha-amylase, and Aspergillus niger acid stable alpha-amylase.
- useful promoters are the Sac- charomyces cerevisiae MF ⁇ 1 , TPI, ADHm Gal or PGK promoters.
- the polynucleotide construct of the invention will also typically be operably connected to a suitable terminator.
- a suitable terminator is the TPI terminator (Alber et al. (1982) J. Mol. Appl. Genet. 1 :419-434) or the CYC1 terminator.
- TPI terminator Alber et al. (1982) J. Mol. Appl. Genet. 1 :419-4314
- CYC1 terminator the procedures used to ligate the polynucleotide sequence of the invention, the promoter and the terminator, respectively, and to insert them into a suitable vector containing the information necessary for replication in the selected host, are well known to persons skilled in the art.
- the vector may be constructed either by first pre- paring a DNA construct containing the entire DNA sequence encoding the precursor molecule of the invention, and subsequently inserting this fragment into a suitable expression vector, or by sequentially inserting DNA fragments containing genetic information for the individual elements followed by ligation.
- the present invention also relates to recombinant host cells, comprising a polynucleo- tide sequence encoding the precursor molecule of the invention.
- a vector comprising such polynucleotide sequence is introduced into the host cell so that the vector is maintained as a chromosomal integrant or as a self-replicating extra-chromosomal vector as described earlier.
- the term "host cell” encompasses any progeny of a parent cell that is not identical to the parent cell due to mutations that occur during replication.
- the host cell may be a pro- karyote or a eukaryote cell.
- Useful prokaryotes are bacterial cells such as gram positive bacteria including Bacillus and Streptomyces cells, or gram negative bacteria such as E. coli and Pseudomonas sp. Cells.
- Eukaryote cells may be mammalian, insect, plant, or fungal cells.
- the host cell is a yeast cell.
- the yeast organism used in the process of the invention may be any suitable yeast organism which, on cultivation, produces large amounts of the precursor molecule.
- yeast organisms are strains selected from the yeast species Saccharomyces cerevisiae, Saccharomyces kluyveri, Schizosaccharomyces pombe, Sacchoromyces uvarum, Kluyveromyces lactis, Hansenula polymorpha, Pichia pastoris, Pichia methanolica, Pichia kluyveri, Yarrowia lipolytica, Candida sp., Candida utilis, Candida cacaoi, Geot chum sp., and Geotrichum fermentans.
- the transformation of the yeast cells may for instance be effected by protoplast formation followed by transformation in a manner known per se.
- the medium used to cultivate the cells may be any conventional medium suitable for growing yeast organisms.
- the secreted precursor of the invention may then be recovered from the medium by conventional procedures including separating the yeast cells from the medium by centrifugation, filtration or catching the precursor by an ion exchange matrix or by a reverse phase absorption matrix, precipitating the proteinaceous components of the supernatant or filtrate by means of a salt, e.g. ammonium sulphate, followed by purification by a variety of chromatographic procedures, e.g. ion exchange chromatography, affinity chromatography, or the like.
- a salt e.g. ammonium sulphate
- an analogue is used to designate a peptide wherein one or more amino acid residues of the parent peptide have been substituted by another amino acid residue and/or wherein one or more amino acid residues of the parent peptide have been deleted and/or wherein one or more amino acid residues have been added to the parent peptide.
- Such addition can take place either at the N-terminal end or at the C- terminal end of the parent peptide or both.
- derivative is used in the present text to designate a peptide in which one or more of the amino acid residues of the parent peptide have been chemically modified, e.g. by alkylation, acylation, ester formation or amide formation.
- Fig. 1 shows the plasmid pKV304 which contain DNA encoding Arg 34 GLP-1 (1-37) under regulatory control of the TPI promoter and -terminator and the MFalpha prepro sequence.
- This plasmid is the starting plasmid for making expression plasmids for the precur- sor molecules according to the present invention.
- the host strain ME1719 is a diploid strain and has a phenotype which lacks two as- partyl protease activities, i.e. YPS1 (previously called YAP3) which cleaves C-terminal side of mono- or dibasic amino acid residues (Egel-Mitani, et al., YEAST 6: 127-137, 1990) and PEP4 a vacuolar protease A responsible for activation of other proteases such as protease B, carboxypeptidase Y, aminopeptidase I, RNase, alkaline phosphatase, acid threhalase and exopolyphosphatase.
- YPS1 previously called YAP3
- PEP4 a vacuolar protease A responsible for activation of other proteases such as protease B, carboxypeptidase Y, aminopeptidase I, RNase, alkaline phosphatase, acid threhalase and exo
- triose phosphate isomerase gene has been dis- rupted which phenotype makes it possible to utilize glucose in transformants grown on glucose containing medium.
- the genetic background of ME1719 is MATa/ ⁇ ⁇ yps1 ::ura3/ ⁇ yps1 ::URA3 pep4-3/pep4-3 ⁇ tpi::LEU2/ ⁇ tpi::LEU2 Ieu2/leu2 ⁇ ura3/ ⁇ ura3.
- Expression plasmids containing the N-terminally extended Arg 34 GLP-1 (7 . 37) were made as follows: Plasmid pKV304 containing DNA encoding Arg 34 GLP-1 (7 . 37) without an N- terminal extension was digested with either Eagl+Ncol or Eagl+Asp718. After agarose electrophoresis and GeneCleanTM III purification, fragments of 1.4 kb and 10 kb, respectively were isolated. Oligonucleotide adaptors corresponding to various N-terminal extensions of Arg GLP-1 (7 . 37) containing Ncol and Asp718 cleavage sites were likewise purified as described above.
- the 1.4 kb fragment (Eagl+Ncol), 10 kb fragment (Eagl+Asp718) and the adaptor fragment designed for the N-terminal extension of Arg 34 GLP-1 (7 ⁇ 7) (Ncol+Asp718) were ligated and transformed in E. coli strain MT172 and plasmid DNA was sequenced to verify the correct N-terminally extended Arg 34 GLP-1 (7 . 37) .
- Plasmid DNA was then transformed into yeast strain ME1719 and yeast transformants were isolated twice on MUPD selective plates. Yeast cells were cultured in 5 ml MUPD medium for 3 days at 30°C and culture supernatants were analyzed by HPLC and MALDI-MS (Matrix Assisted Laser Desorption/lnonisation Mass Spectrometry).
- Table 1 shows the different GLP-1 precursors and the yield compared to a control with no N-terminal extension.
- Glu-Glu-Ala-Glu-Asn(SEQ ID NO:2)-GLP-1 was expressed in yeast and purified as previously described. Afterwards the peptide was acylated by use of N ⁇ -palmitoyl-Glu- ⁇ - succinimidyl- ⁇ -tert-butyl ester and deprotected by use of TFA. Reverse-phase HPLC-MS identified the desired product, Glu-Glu-Ala-Glu-Asn(SEQ ID NO:2)-NN2211 acylated in position Lys-26 in a yield of 80 %. 20% of the acylation mixture was acylated both in Lys-26 and N-terminally. Since cleavage of both these peptides at Asn-His should yield the same product Arg 34 GLP-1 (7 . 37) it was decided to do the exploratory studies with the 80/20 mixture.
- FXa Glu-Glu-lle-Glu-Gly-Arg(SEQ ID NO: 16)- Arg 34 GLP-1 (7-37) was expressed and recovered as previously described. 1 mg lyophilized product was dissolved in 10 ml 0,02 M Tris and 2 mM CaCI 2 , pH 7,5. At time zero 40 units of FXa (Amersham Pharmacia Biotech, Product number: 27-0849-01) was added and the incubation was stopped after 2 hours at 30°C in a water bath by dilution 1 :1 with 1 M HAc. Determination by HPLC and MALDI-TOF showed 82 % correctly cleaved Arg 34 GLP-1 (7-37) and 18 % non-cleaved product Glu-Glu-lle-Glu-Gly-
- Arg(SEQ ID NO:16)- Arg 34 GLP-1 (7-3 7) - Arg 34 GLP-1 (7-37) can then be acylated by well established known methods.
- Glu-Glu-Ala-Glu-Arg-Arg(SEQ ID NO:3)- Arg 34 GLP-1 (7-37) was expressed and recovered as previously described.
- 0,37 mg lyophilized product was dissolved in 900 ⁇ l 0,1 M NaAc and 5 mM CaCI 2 , pH 6,0.
- 100 ⁇ l of soluble C-terminal truncated kexin was added and the mixture was incubated in a water bath at 30°C. After 1 hour the reaction was stopped by addition of 2 M HAc 1 :1 and cleavage was determined by HPLC and MALDI-MS.
- Arg 34 GLP-1 (7-37) 66.9% was cleaved to Arg 34 GLP-1 (7-37) and the rest was uncleaved Glu-Glu-Ala-Glu-Arg-Arg(SEQ ID NO:3)- Arg 34 GLP-1 (7-37) .
- Arg 34 GLP-1 (7-37) can then be acylated by well established known methods.
- Glu-Glu-Ala-Glu-Pro(SEQ ID NO:13 )-Arg 3 GLP-1 (7-37) was expressed and recovered as pre- viously described. 11 ,4 mg lyophilized product was dissolved in 50 ml 0,02 M Tris, HCI, pH 7,5. At time zero 500 ⁇ l was added of prolylendopeptidase (Sphingomonas capsulata prolylendopeptidase expressed in E.coli according to Kanatani et al, Archives of Biochemistry and Biophysics, 358, 141-148 (1998). Cleavage and formation of Arg 34 GLP-1 (7-37) are detected after 30 min incubation in a water bath at 30°C. Arg 34 GLP-1 (7-37) can now be acylated by well established, known methods.
Landscapes
- Chemical & Material Sciences (AREA)
- Organic Chemistry (AREA)
- Health & Medical Sciences (AREA)
- Life Sciences & Earth Sciences (AREA)
- Molecular Biology (AREA)
- Proteomics, Peptides & Aminoacids (AREA)
- Biochemistry (AREA)
- Biophysics (AREA)
- General Health & Medical Sciences (AREA)
- Genetics & Genomics (AREA)
- Medicinal Chemistry (AREA)
- Analytical Chemistry (AREA)
- Gastroenterology & Hepatology (AREA)
- Zoology (AREA)
- Toxicology (AREA)
- Chemical Kinetics & Catalysis (AREA)
- General Chemical & Material Sciences (AREA)
- Endocrinology (AREA)
- Preparation Of Compounds By Using Micro-Organisms (AREA)
- Micro-Organisms Or Cultivation Processes Thereof (AREA)
- Peptides Or Proteins (AREA)
Applications Claiming Priority (3)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| DKPA200101140 | 2001-07-24 | ||
| DK200101140 | 2001-07-24 | ||
| PCT/DK2002/000501 WO2003010185A2 (en) | 2001-07-24 | 2002-07-18 | Method for making acylated polypeptides |
Publications (1)
| Publication Number | Publication Date |
|---|---|
| EP1421102A2 true EP1421102A2 (de) | 2004-05-26 |
Family
ID=8160636
Family Applications (1)
| Application Number | Title | Priority Date | Filing Date |
|---|---|---|---|
| EP02750841A Ceased EP1421102A2 (de) | 2001-07-24 | 2002-07-18 | Verfahren zur herstellung acylierter polypeptide |
Country Status (4)
| Country | Link |
|---|---|
| EP (1) | EP1421102A2 (de) |
| JP (1) | JP2005503141A (de) |
| AU (1) | AU2002355160A1 (de) |
| WO (1) | WO2003010185A2 (de) |
Families Citing this family (4)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| US8158586B2 (en) | 2005-04-11 | 2012-04-17 | Pharmagap Inc. | Inhibitors of protein kinases and uses thereof |
| WO2007016763A1 (en) * | 2005-08-05 | 2007-02-15 | Pharmagap Inc. | Peptides targeted to protein kinase c isoforms and uses thereof |
| EP1872790A1 (de) | 2006-06-26 | 2008-01-02 | DeveloGen Aktiengesellschaft | Neue Zusammensetzung zur Erhöhung der Bioverfügbarkeit von Neurturin |
| US20100317057A1 (en) * | 2007-12-28 | 2010-12-16 | Novo Nordisk A/S | Semi-recombinant preparation of glp-1 analogues |
Family Cites Families (5)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| US6011007A (en) * | 1993-09-17 | 2000-01-04 | Novo Nordisk A/S | Acylated insulin |
| ZA954983B (en) * | 1994-06-17 | 1996-02-14 | Novo Nordisk As | N-terminally extended proteins expressed in yeast |
| US5646242A (en) * | 1994-11-17 | 1997-07-08 | Eli Lilly And Company | Selective acylation of epsilon-amino groups |
| US5905140A (en) * | 1996-07-11 | 1999-05-18 | Novo Nordisk A/S, Novo Alle | Selective acylation method |
| EP1956000B1 (de) * | 1999-03-17 | 2016-10-05 | Novo Nordisk A/S | Acylierungsmittel zur acylierung von peptiden |
-
2002
- 2002-07-18 AU AU2002355160A patent/AU2002355160A1/en not_active Abandoned
- 2002-07-18 EP EP02750841A patent/EP1421102A2/de not_active Ceased
- 2002-07-18 WO PCT/DK2002/000501 patent/WO2003010185A2/en not_active Ceased
- 2002-07-18 JP JP2003515544A patent/JP2005503141A/ja not_active Withdrawn
Non-Patent Citations (1)
| Title |
|---|
| See references of WO03010185A2 * |
Also Published As
| Publication number | Publication date |
|---|---|
| WO2003010185A3 (en) | 2004-03-25 |
| AU2002355160A1 (en) | 2003-02-17 |
| JP2005503141A (ja) | 2005-02-03 |
| WO2003010185A2 (en) | 2003-02-06 |
Similar Documents
| Publication | Publication Date | Title |
|---|---|---|
| EP2069502B1 (de) | Verfahren zur herstellung gereifter insulinpolypeptide | |
| US7402565B2 (en) | Processes for making acylated insulin | |
| US8835132B2 (en) | Method for making acylated polypeptides | |
| EP1377608B1 (de) | Insulinvorstufen und verfahren zu deren herstellung | |
| US7105314B2 (en) | Method for making human insulin precursors | |
| EP2084179B1 (de) | Verfahren zur herstellung aktivierter carboxypeptidasen | |
| US6861237B2 (en) | Production of heterologous polypeptides in yeast | |
| EP1917363B1 (de) | Verfahren zur herstellung reifer insulinpolypeptide | |
| US7572884B2 (en) | Method for making acylated polypeptides | |
| US20030040601A1 (en) | Method for making insulin precursors and insulin analog precursors | |
| EP1421102A2 (de) | Verfahren zur herstellung acylierter polypeptide | |
| JP4386722B2 (ja) | アシル化されたポリペプチドの製造方法 | |
| WO2002079251A2 (en) | Method for making human insulin precursors | |
| WO2004085472A1 (en) | Method for making human insulin precursors and human insulin | |
| WO2002100887A2 (en) | Method for making insulin precursors and insulin precursor analogs | |
| EP1339856A1 (de) | Produktion heterologer polypeptide in hefe |
Legal Events
| Date | Code | Title | Description |
|---|---|---|---|
| PUAI | Public reference made under article 153(3) epc to a published international application that has entered the european phase |
Free format text: ORIGINAL CODE: 0009012 |
|
| AK | Designated contracting states |
Kind code of ref document: A2 Designated state(s): AT BE BG CH CY CZ DE DK EE ES FI FR GB GR IE IT LI LU MC NL PT SE SK TR |
|
| AX | Request for extension of the european patent |
Extension state: AL LT LV MK RO SI |
|
| 17P | Request for examination filed |
Effective date: 20040927 |
|
| 17Q | First examination report despatched |
Effective date: 20051104 |
|
| STAA | Information on the status of an ep patent application or granted ep patent |
Free format text: STATUS: THE APPLICATION HAS BEEN REFUSED |
|
| 18R | Application refused |
Effective date: 20081204 |