EP0854882A1 - Polypeptid-akzeptor für n-acetylgalactosaminyltransferase - Google Patents

Polypeptid-akzeptor für n-acetylgalactosaminyltransferase

Info

Publication number: EP0854882A1
Authority: EP; European Patent Office
Prior art keywords: enzyme; seq; ser; pro; sequence
Prior art date: 1995-10-09
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.): Pending

Application number

EP96930677A

Other languages

English (en)

French (fr)

Inventor

Ake P. Elhammer

Akira Kurosaka

Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)

Pharmacia and Upjohn Co

Original Assignee

Pharmacia and Upjohn Co

Upjohn Co

Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)

1995-10-09

Filing date

1996-09-09

Publication date

1998-07-29

1996-09-09 Application filed by Pharmacia and Upjohn Co, Upjohn Co filed Critical Pharmacia and Upjohn Co

1998-07-29 Publication of EP0854882A1 publication Critical patent/EP0854882A1/de

Status Pending legal-status Critical Current

Links

108090000765 processed proteins & peptides Proteins 0.000 title claims abstract description 115
108010046220 N-Acetylgalactosaminyltransferases Proteins 0.000 title claims abstract description 25
102000007524 N-Acetylgalactosaminyltransferases Human genes 0.000 title claims abstract description 25
102000004196 processed proteins & peptides Human genes 0.000 title claims description 47
229920001184 polypeptide Polymers 0.000 title claims description 10
108090000623 proteins and genes Proteins 0.000 claims abstract description 103
102000004169 proteins and genes Human genes 0.000 claims abstract description 82
230000013595 glycosylation Effects 0.000 claims abstract description 35
238000006206 glycosylation reaction Methods 0.000 claims abstract description 35
235000018102 proteins Nutrition 0.000 claims description 77
150000001413 amino acids Chemical group 0.000 claims description 70
238000000034 method Methods 0.000 claims description 38
AYFVYJQAPQTCCC-UHFFFAOYSA-N Threonine Natural products CC(O)C(N)C(O)=O AYFVYJQAPQTCCC-UHFFFAOYSA-N 0.000 claims description 13
239000004473 Threonine Substances 0.000 claims description 13
108091028043 Nucleic acid sequence Proteins 0.000 claims description 7
230000008569 process Effects 0.000 claims description 6
CKLJMWTZIZZHCS-REOHCLBHSA-N L-aspartic acid Chemical compound OC(=O)[C@@H](N)CC(O)=O CKLJMWTZIZZHCS-REOHCLBHSA-N 0.000 claims description 5
ONIBWKKTOPOVIA-UHFFFAOYSA-N Proline Natural products OC(=O)C1CCCN1 ONIBWKKTOPOVIA-UHFFFAOYSA-N 0.000 claims description 5
QNAYBMKLOCPYGJ-REOHCLBHSA-N L-alanine Chemical compound C[C@H](N)C(O)=O QNAYBMKLOCPYGJ-REOHCLBHSA-N 0.000 claims 4
ROHFNLRQFUQHCH-YFKPBYRVSA-N L-leucine Chemical compound CC(C)C[C@H](N)C(O)=O ROHFNLRQFUQHCH-YFKPBYRVSA-N 0.000 claims 4
ROHFNLRQFUQHCH-UHFFFAOYSA-N Leucine Natural products CC(C)CC(N)C(O)=O ROHFNLRQFUQHCH-UHFFFAOYSA-N 0.000 claims 4
235000004279 alanine Nutrition 0.000 claims 4
235000003704 aspartic acid Nutrition 0.000 claims 4
OQFSQFPPLPISGP-UHFFFAOYSA-N beta-carboxyaspartic acid Natural products OC(=O)C(N)C(C(O)=O)C(O)=O OQFSQFPPLPISGP-UHFFFAOYSA-N 0.000 claims 4
102000004190 Enzymes Human genes 0.000 description 125
108090000790 Enzymes Proteins 0.000 description 125
239000000370 acceptor Substances 0.000 description 96
210000004027 cell Anatomy 0.000 description 89
108010066816 Polypeptide N-acetylgalactosaminyltransferase Proteins 0.000 description 73
235000001014 amino acid Nutrition 0.000 description 69
241000283690 Bos taurus Species 0.000 description 62
210000003022 colostrum Anatomy 0.000 description 49
235000021277 colostrum Nutrition 0.000 description 49
230000000694 effects Effects 0.000 description 42
MTCFGRXMJLQNBG-REOHCLBHSA-N (2S)-2-Amino-3-hydroxypropansäure Chemical compound OC[C@H](N)C(O)=O MTCFGRXMJLQNBG-REOHCLBHSA-N 0.000 description 37
239000000872 buffer Substances 0.000 description 35
RAXXELZNTBOGNW-UHFFFAOYSA-N imidazole Natural products C1=CNC=N1 RAXXELZNTBOGNW-UHFFFAOYSA-N 0.000 description 33
FAPWRFPIFSIZLT-UHFFFAOYSA-M Sodium chloride Chemical compound [Na+].[Cl-] FAPWRFPIFSIZLT-UHFFFAOYSA-M 0.000 description 30
125000003275 alpha amino acid group Chemical group 0.000 description 30
238000003556 assay Methods 0.000 description 27
238000012546 transfer Methods 0.000 description 27
239000002299 complementary DNA Substances 0.000 description 24
238000002474 experimental method Methods 0.000 description 24
235000004400 serine Nutrition 0.000 description 24
239000002773 nucleotide Substances 0.000 description 23
125000003729 nucleotide group Chemical group 0.000 description 23
108020004414 DNA Proteins 0.000 description 21
AYFVYJQAPQTCCC-GBXIJSLDSA-N L-threonine Chemical compound C[C@@H](O)[C@H](N)C(O)=O AYFVYJQAPQTCCC-GBXIJSLDSA-N 0.000 description 21
MTCFGRXMJLQNBG-UHFFFAOYSA-N Serine Natural products OCC(N)C(O)=O MTCFGRXMJLQNBG-UHFFFAOYSA-N 0.000 description 21
229920001542 oligosaccharide Polymers 0.000 description 20
150000002482 oligosaccharides Chemical class 0.000 description 20
238000003752 polymerase chain reaction Methods 0.000 description 20
241000701447 unidentified baculovirus Species 0.000 description 20
239000013598 vector Substances 0.000 description 20
239000000463 material Substances 0.000 description 19
MBLBDJOUHNCFQT-UHFFFAOYSA-N N-acetyl-D-galactosamine Natural products CC(=O)NC(C=O)C(O)C(O)C(O)CO MBLBDJOUHNCFQT-UHFFFAOYSA-N 0.000 description 17
LFTYTUAZOPRMMI-NESSUJCYSA-N UDP-N-acetyl-alpha-D-galactosamine Chemical compound O1[C@H](CO)[C@H](O)[C@H](O)[C@@H](NC(=O)C)[C@H]1O[P@](O)(=O)O[P@](O)(=O)OC[C@@H]1[C@@H](O)[C@@H](O)[C@H](N2C(NC(=O)C=C2)=O)O1 LFTYTUAZOPRMMI-NESSUJCYSA-N 0.000 description 17
LFTYTUAZOPRMMI-UHFFFAOYSA-N UNPD164450 Natural products O1C(CO)C(O)C(O)C(NC(=O)C)C1OP(O)(=O)OP(O)(=O)OCC1C(O)C(O)C(N2C(NC(=O)C=C2)=O)O1 LFTYTUAZOPRMMI-UHFFFAOYSA-N 0.000 description 17
239000012634 fragment Substances 0.000 description 17
239000000523 sample Substances 0.000 description 17
238000006243 chemical reaction Methods 0.000 description 16
239000000499 gel Substances 0.000 description 16
102000004357 Transferases Human genes 0.000 description 15
108090000992 Transferases Proteins 0.000 description 15
150000001720 carbohydrates Chemical class 0.000 description 15
238000000338 in vitro Methods 0.000 description 15
239000011780 sodium chloride Substances 0.000 description 15
102100039847 Globoside alpha-1,3-N-acetylgalactosaminyltransferase 1 Human genes 0.000 description 14
101000887519 Homo sapiens Globoside alpha-1,3-N-acetylgalactosaminyltransferase 1 Proteins 0.000 description 14
108091034117 Oligonucleotide Proteins 0.000 description 14
229920002684 Sepharose Polymers 0.000 description 14
239000012528 membrane Substances 0.000 description 14
238000012163 sequencing technique Methods 0.000 description 14
238000002415 sodium dodecyl sulfate polyacrylamide gel electrophoresis Methods 0.000 description 14
OVRNDRQMDRJTHS-CBQIKETKSA-N N-Acetyl-D-Galactosamine Chemical group CC(=O)N[C@H]1[C@@H](O)O[C@H](CO)[C@H](O)[C@@H]1O OVRNDRQMDRJTHS-CBQIKETKSA-N 0.000 description 13
235000014633 carbohydrates Nutrition 0.000 description 13
108020004999 messenger RNA Proteins 0.000 description 13
238000002360 preparation method Methods 0.000 description 13
239000000047 product Substances 0.000 description 13
235000008521 threonine Nutrition 0.000 description 13
XLYOFNOQVPJJNP-UHFFFAOYSA-N water Substances O XLYOFNOQVPJJNP-UHFFFAOYSA-N 0.000 description 13
229910001868 water Inorganic materials 0.000 description 13
108010076504 Protein Sorting Signals Proteins 0.000 description 12
238000004458 analytical method Methods 0.000 description 12
239000001963 growth medium Substances 0.000 description 12
150000007523 nucleic acids Chemical group 0.000 description 12
238000000746 purification Methods 0.000 description 12
OVRNDRQMDRJTHS-KEWYIRBNSA-N N-acetyl-D-galactosamine Chemical compound CC(=O)N[C@H]1C(O)O[C@H](CO)[C@H](O)[C@@H]1O OVRNDRQMDRJTHS-KEWYIRBNSA-N 0.000 description 11
108010002471 apomucin Proteins 0.000 description 11
108020004707 nucleic acids Proteins 0.000 description 11
102000039446 nucleic acids Human genes 0.000 description 11
125000003607 serino group Chemical class [H]N([H])[C@]([H])(C(=O)[*])C(O[H])([H])[H] 0.000 description 11
102000051366 Glycosyltransferases Human genes 0.000 description 10
108700023372 Glycosyltransferases Proteins 0.000 description 10
241000238631 Hexapoda Species 0.000 description 10
KISWVXRQTGLFGD-UHFFFAOYSA-N 2-[[2-[[6-amino-2-[[2-[[2-[[5-amino-2-[[2-[[1-[2-[[6-amino-2-[(2,5-diamino-5-oxopentanoyl)amino]hexanoyl]amino]-5-(diaminomethylideneamino)pentanoyl]pyrrolidine-2-carbonyl]amino]-3-hydroxypropanoyl]amino]-5-oxopentanoyl]amino]-5-(diaminomethylideneamino)p Chemical compound C1CCN(C(=O)C(CCCN=C(N)N)NC(=O)C(CCCCN)NC(=O)C(N)CCC(N)=O)C1C(=O)NC(CO)C(=O)NC(CCC(N)=O)C(=O)NC(CCCN=C(N)N)C(=O)NC(CO)C(=O)NC(CCCCN)C(=O)NC(C(=O)NC(CC(C)C)C(O)=O)CC1=CC=C(O)C=C1 KISWVXRQTGLFGD-UHFFFAOYSA-N 0.000 description 9
WEVYAHXRMPXWCK-UHFFFAOYSA-N Acetonitrile Chemical compound CC#N WEVYAHXRMPXWCK-UHFFFAOYSA-N 0.000 description 9
PEDCQBHIVMGVHV-UHFFFAOYSA-N Glycerine Chemical compound OCC(O)CO PEDCQBHIVMGVHV-UHFFFAOYSA-N 0.000 description 9
229910021380 Manganese Chloride Inorganic materials 0.000 description 9
GLFNIEUTAYBVOC-UHFFFAOYSA-L Manganese chloride Chemical compound Cl[Mn]Cl GLFNIEUTAYBVOC-UHFFFAOYSA-L 0.000 description 9
102000047918 Myelin Basic Human genes 0.000 description 9
101710107068 Myelin basic protein Proteins 0.000 description 9
229920004890 Triton X-100 Polymers 0.000 description 9
239000013504 Triton X-100 Substances 0.000 description 9
230000015572 biosynthetic process Effects 0.000 description 9
238000004587 chromatography analysis Methods 0.000 description 9
238000001727 in vivo Methods 0.000 description 9
239000011565 manganese chloride Substances 0.000 description 9
239000002609 medium Substances 0.000 description 9
239000013615 primer Substances 0.000 description 9
210000001519 tissue Anatomy 0.000 description 9
230000000875 corresponding effect Effects 0.000 description 8
230000029087 digestion Effects 0.000 description 8
230000002829 reductive effect Effects 0.000 description 8
230000004989 O-glycosylation Effects 0.000 description 7
241000700605 Viruses Species 0.000 description 7
JLCPHMBAVCMARE-UHFFFAOYSA-N [3-[[3-[[3-[[3-[[3-[[3-[[3-[[3-[[3-[[3-[[3-[[5-(2-amino-6-oxo-1H-purin-9-yl)-3-[[3-[[3-[[3-[[3-[[3-[[5-(2-amino-6-oxo-1H-purin-9-yl)-3-[[5-(2-amino-6-oxo-1H-purin-9-yl)-3-hydroxyoxolan-2-yl]methoxy-hydroxyphosphoryl]oxyoxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(5-methyl-2,4-dioxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxyoxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(5-methyl-2,4-dioxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(5-methyl-2,4-dioxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(5-methyl-2,4-dioxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methyl [5-(6-aminopurin-9-yl)-2-(hydroxymethyl)oxolan-3-yl] hydrogen phosphate Polymers Cc1cn(C2CC(OP(O)(=O)OCC3OC(CC3OP(O)(=O)OCC3OC(CC3O)n3cnc4c3nc(N)[nH]c4=O)n3cnc4c3nc(N)[nH]c4=O)C(COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3CO)n3cnc4c(N)ncnc34)n3ccc(N)nc3=O)n3cnc4c(N)ncnc34)n3ccc(N)nc3=O)n3ccc(N)nc3=O)n3ccc(N)nc3=O)n3cnc4c(N)ncnc34)n3cnc4c(N)ncnc34)n3cc(C)c(=O)[nH]c3=O)n3cc(C)c(=O)[nH]c3=O)n3ccc(N)nc3=O)n3cc(C)c(=O)[nH]c3=O)n3cnc4c3nc(N)[nH]c4=O)n3cnc4c(N)ncnc34)n3cnc4c(N)ncnc34)n3cnc4c(N)ncnc34)n3cnc4c(N)ncnc34)O2)c(=O)[nH]c1=O JLCPHMBAVCMARE-UHFFFAOYSA-N 0.000 description 7
230000001086 cytosolic effect Effects 0.000 description 7
238000009826 distribution Methods 0.000 description 7
230000002255 enzymatic effect Effects 0.000 description 7
238000002955 isolation Methods 0.000 description 7
238000011068 loading method Methods 0.000 description 7
239000006166 lysate Substances 0.000 description 7
238000000926 separation method Methods 0.000 description 7
239000000758 substrate Substances 0.000 description 7
238000003786 synthesis reaction Methods 0.000 description 7
125000000341 threoninyl group Chemical group [H]OC([H])(C([H])([H])[H])C([H])(N([H])[H])C(*)=O 0.000 description 7
241000287828 Gallus gallus Species 0.000 description 6
108700026244 Open Reading Frames Proteins 0.000 description 6
239000013592 cell lysate Substances 0.000 description 6
239000007795 chemical reaction product Substances 0.000 description 6
239000013604 expression vector Substances 0.000 description 6
238000001114 immunoprecipitation Methods 0.000 description 6
VDXZNPDIRNWWCW-JFTDCZMZSA-N melittin Chemical group NCC(=O)N[C@@H]([C@@H](C)CC)C(=O)NCC(=O)N[C@@H](C)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(=O)NCC(=O)N[C@@H](CC(C)C)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CO)C(=O)N[C@H](C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(N)=O)CC1=CNC2=CC=CC=C12 VDXZNPDIRNWWCW-JFTDCZMZSA-N 0.000 description 6
239000007787 solid Substances 0.000 description 6
238000001890 transfection Methods 0.000 description 6
208000002109 Argyria Diseases 0.000 description 5
102000003886 Glycoproteins Human genes 0.000 description 5
108090000288 Glycoproteins Proteins 0.000 description 5
241000829100 Macaca mulatta polyomavirus 1 Species 0.000 description 5
108010063954 Mucins Proteins 0.000 description 5
238000000636 Northern blotting Methods 0.000 description 5
239000002253 acid Substances 0.000 description 5
238000001042 affinity chromatography Methods 0.000 description 5
239000000427 antigen Substances 0.000 description 5
108091007433 antigens Proteins 0.000 description 5
102000036639 antigens Human genes 0.000 description 5
230000003197 catalytic effect Effects 0.000 description 5
230000006870 function Effects 0.000 description 5
108700014210 glycosyltransferase activity proteins Proteins 0.000 description 5
238000011534 incubation Methods 0.000 description 5
150000002632 lipids Chemical class 0.000 description 5
230000004048 modification Effects 0.000 description 5
238000012986 modification Methods 0.000 description 5
239000013612 plasmid Substances 0.000 description 5
230000010076 replication Effects 0.000 description 5
241000894007 species Species 0.000 description 5
239000006228 supernatant Substances 0.000 description 5
AWDRATDZQPNJFN-VAYUFCLWSA-N taurodeoxycholic acid Chemical compound C([C@H]1CC2)[C@H](O)CC[C@]1(C)[C@@H]1[C@@H]2[C@@H]2CC[C@H]([C@@H](CCC(=O)NCCS(O)(=O)=O)C)[C@@]2(C)[C@@H](O)C1 AWDRATDZQPNJFN-VAYUFCLWSA-N 0.000 description 5
238000013519 translation Methods 0.000 description 5
101000921522 Bos taurus Cytochrome c Proteins 0.000 description 4
101000753793 Bos taurus Thiosulfate sulfurtransferase Proteins 0.000 description 4
108020004705 Codon Proteins 0.000 description 4
241000237981 Patella vulgata Species 0.000 description 4
238000012300 Sequence Analysis Methods 0.000 description 4
238000002105 Southern blotting Methods 0.000 description 4
238000004422 calculation algorithm Methods 0.000 description 4
239000013599 cloning vector Substances 0.000 description 4
238000010276 construction Methods 0.000 description 4
SUYVUBYJARFZHO-RRKCRQDMSA-N dATP Chemical compound C1=NC=2C(N)=NC=NC=2N1[C@H]1C[C@H](O)[C@@H](COP(O)(=O)OP(O)(=O)OP(O)(O)=O)O1 SUYVUBYJARFZHO-RRKCRQDMSA-N 0.000 description 4
SUYVUBYJARFZHO-UHFFFAOYSA-N dATP Natural products C1=NC=2C(N)=NC=NC=2N1C1CC(O)C(COP(O)(=O)OP(O)(=O)OP(O)(O)=O)O1 SUYVUBYJARFZHO-UHFFFAOYSA-N 0.000 description 4
239000003599 detergent Substances 0.000 description 4
238000001962 electrophoresis Methods 0.000 description 4
238000010828 elution Methods 0.000 description 4
210000004907 gland Anatomy 0.000 description 4
238000004128 high performance liquid chromatography Methods 0.000 description 4
238000009396 hybridization Methods 0.000 description 4
-1 linker amino acids Chemical class 0.000 description 4
230000037361 pathway Effects 0.000 description 4
229920002401 polyacrylamide Polymers 0.000 description 4
235000000346 sugar Nutrition 0.000 description 4
238000012360 testing method Methods 0.000 description 4
150000003588 threonines Chemical class 0.000 description 4
108091032973 (ribonucleotides)n+m Proteins 0.000 description 3
QTBSBXVTEAMEQO-UHFFFAOYSA-N Acetic acid Chemical compound CC(O)=O QTBSBXVTEAMEQO-UHFFFAOYSA-N 0.000 description 3
241000894006 Bacteria Species 0.000 description 3
BLGNLNRBABWDST-CIUDSAMLSA-N Cys-Leu-Asp Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](CS)N BLGNLNRBABWDST-CIUDSAMLSA-N 0.000 description 3
239000003155 DNA primer Substances 0.000 description 3
QIQABBIDHGQXGA-ZPFDUUQYSA-N Glu-Ile-Arg Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O QIQABBIDHGQXGA-ZPFDUUQYSA-N 0.000 description 3
UQJNXZSSGQIPIQ-FBCQKBJTSA-N Gly-Gly-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)CNC(=O)CN UQJNXZSSGQIPIQ-FBCQKBJTSA-N 0.000 description 3
DHMQDGOQFOQNFH-UHFFFAOYSA-N Glycine Chemical compound NCC(O)=O DHMQDGOQFOQNFH-UHFFFAOYSA-N 0.000 description 3
125000000174 L-prolyl group Chemical group [H]N1C([H])([H])C([H])([H])C([H])([H])[C@@]1([H])C(*)=O 0.000 description 3
HPBCTWSUJOGJSH-MNXVOIDGSA-N Leu-Glu-Ile Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O HPBCTWSUJOGJSH-MNXVOIDGSA-N 0.000 description 3
LLSUNJYOSCOOEB-GUBZILKMSA-N Lys-Glu-Asp Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O LLSUNJYOSCOOEB-GUBZILKMSA-N 0.000 description 3
MSSABBQOBUZFKZ-IHRRRGAJSA-N Lys-Pro-His Chemical compound C1C[C@H](N(C1)C(=O)[C@H](CCCCN)N)C(=O)N[C@@H](CC2=CN=CN2)C(=O)O MSSABBQOBUZFKZ-IHRRRGAJSA-N 0.000 description 3
108010090665 Mannosyl-Glycoprotein Endo-beta-N-Acetylglucosaminidase Proteins 0.000 description 3
OKKJLVBELUTLKV-UHFFFAOYSA-N Methanol Chemical compound OC OKKJLVBELUTLKV-UHFFFAOYSA-N 0.000 description 3
241000283973 Oryctolagus cuniculus Species 0.000 description 3
229920005654 Sephadex Polymers 0.000 description 3
239000012507 Sephadex™ Substances 0.000 description 3
108090000787 Subtilisin Proteins 0.000 description 3
NCXVJIQMWSGRHY-KXNHARMFSA-N Thr-Leu-Pro Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N1CCC[C@@H]1C(=O)O)N)O NCXVJIQMWSGRHY-KXNHARMFSA-N 0.000 description 3
JZWZACGUZVCQPS-RNJOBUHISA-N Val-Ile-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](C(C)C)N JZWZACGUZVCQPS-RNJOBUHISA-N 0.000 description 3
125000003295 alanine group Chemical group N[C@@H](C)C(=O)* 0.000 description 3
108010015684 alpha-N-Acetylgalactosaminidase Proteins 0.000 description 3
102000002014 alpha-N-Acetylgalactosaminidase Human genes 0.000 description 3
238000004873 anchoring Methods 0.000 description 3
238000013459 approach Methods 0.000 description 3
108010040443 aspartyl-aspartic acid Proteins 0.000 description 3
108010064886 beta-D-galactoside alpha 2-6-sialyltransferase Proteins 0.000 description 3
230000004071 biological effect Effects 0.000 description 3
238000005119 centrifugation Methods 0.000 description 3
238000012512 characterization method Methods 0.000 description 3
238000010367 cloning Methods 0.000 description 3
235000013601 eggs Nutrition 0.000 description 3
102000034238 globular proteins Human genes 0.000 description 3
108091005896 globular proteins Proteins 0.000 description 3
210000002288 golgi apparatus Anatomy 0.000 description 3
208000015181 infectious disease Diseases 0.000 description 3
238000007689 inspection Methods 0.000 description 3
108010034529 leucyl-lysine Proteins 0.000 description 3
108010044348 lysyl-glutamyl-aspartic acid Proteins 0.000 description 3
108010017391 lysylvaline Proteins 0.000 description 3
230000005012 migration Effects 0.000 description 3
238000013508 migration Methods 0.000 description 3
239000000203 mixture Substances 0.000 description 3
230000002797 proteolythic effect Effects 0.000 description 3
230000006337 proteolytic cleavage Effects 0.000 description 3
230000002285 radioactive effect Effects 0.000 description 3
230000009257 reactivity Effects 0.000 description 3
230000009467 reduction Effects 0.000 description 3
238000011160 research Methods 0.000 description 3
238000012216 screening Methods 0.000 description 3
210000000813 small intestine Anatomy 0.000 description 3
229910000033 sodium borohydride Inorganic materials 0.000 description 3
239000012279 sodium borohydride Substances 0.000 description 3
238000012289 standard assay Methods 0.000 description 3
238000007619 statistical method Methods 0.000 description 3
238000013518 transcription Methods 0.000 description 3
230000035897 transcription Effects 0.000 description 3
230000002103 transcriptional effect Effects 0.000 description 3
230000003612 virological effect Effects 0.000 description 3
239000011534 wash buffer Substances 0.000 description 3
MRXDGVXSWIXTQL-HYHFHBMOSA-N (2s)-2-[[(1s)-1-(2-amino-1,4,5,6-tetrahydropyrimidin-6-yl)-2-[[(2s)-4-methyl-1-oxo-1-[[(2s)-1-oxo-3-phenylpropan-2-yl]amino]pentan-2-yl]amino]-2-oxoethyl]carbamoylamino]-3-phenylpropanoic acid Chemical compound C([C@H](NC(=O)N[C@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC=1C=CC=CC=1)C=O)C1NC(N)=NCC1)C(O)=O)C1=CC=CC=C1 MRXDGVXSWIXTQL-HYHFHBMOSA-N 0.000 description 2
VGPWRRFOPXVGOH-BYPYZUCNSA-N Ala-Gly-Gly Chemical compound C[C@H](N)C(=O)NCC(=O)NCC(O)=O VGPWRRFOPXVGOH-BYPYZUCNSA-N 0.000 description 2
108010087765 Antipain Proteins 0.000 description 2
108010039627 Aprotinin Proteins 0.000 description 2
HJVGMOYJDDXLMI-AVGNSLFASA-N Arg-Arg-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CCCNC(N)=N)NC(=O)[C@@H](N)CCCNC(N)=N HJVGMOYJDDXLMI-AVGNSLFASA-N 0.000 description 2
ITVINTQUZMQWJR-QXEWZRGKSA-N Arg-Asn-Val Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](C(C)C)C(O)=O ITVINTQUZMQWJR-QXEWZRGKSA-N 0.000 description 2
DJAIOAKQIOGULM-DCAQKATOSA-N Arg-Glu-Met Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCSC)C(O)=O DJAIOAKQIOGULM-DCAQKATOSA-N 0.000 description 2
GFMWTFHOZGLTLC-AVGNSLFASA-N Arg-His-Met Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CCSC)C(O)=O GFMWTFHOZGLTLC-AVGNSLFASA-N 0.000 description 2
ICRHGPYYXMWHIE-LPEHRKFASA-N Arg-Ser-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CO)NC(=O)[C@H](CCCN=C(N)N)N)C(=O)O ICRHGPYYXMWHIE-LPEHRKFASA-N 0.000 description 2
JQBCANGGAVVERB-CFMVVWHZSA-N Asn-Ile-Tyr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O)NC(=O)[C@H](CC(=O)N)N JQBCANGGAVVERB-CFMVVWHZSA-N 0.000 description 2
MYRLSKYSMXNLLA-LAEOZQHASA-N Asn-Val-Glu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O MYRLSKYSMXNLLA-LAEOZQHASA-N 0.000 description 2
UGKZHCBLMLSANF-CIUDSAMLSA-N Asp-Asn-Leu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O UGKZHCBLMLSANF-CIUDSAMLSA-N 0.000 description 2
XAJRHVUUVUPFQL-ACZMJKKPSA-N Asp-Glu-Asp Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O XAJRHVUUVUPFQL-ACZMJKKPSA-N 0.000 description 2
OVPHVTCDVYYTHN-AVGNSLFASA-N Asp-Glu-Phe Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 OVPHVTCDVYYTHN-AVGNSLFASA-N 0.000 description 2
LTCKTLYKRMCFOC-KKUMJFAQSA-N Asp-Phe-Leu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(C)C)C(O)=O LTCKTLYKRMCFOC-KKUMJFAQSA-N 0.000 description 2
GFYOIYJJMSHLSN-QXEWZRGKSA-N Asp-Val-Arg Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O GFYOIYJJMSHLSN-QXEWZRGKSA-N 0.000 description 2
QOJJMJKTMKNFEF-ZKWXMUAHSA-N Asp-Val-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CC(O)=O QOJJMJKTMKNFEF-ZKWXMUAHSA-N 0.000 description 2
108010077805 Bacterial Proteins Proteins 0.000 description 2
108700043183 Bos taurus BSM1 Proteins 0.000 description 2
OLVPQBGMUGIKIW-UHFFFAOYSA-N Chymostatin Natural products C=1C=CC=CC=1CC(C=O)NC(=O)C(C(C)CC)NC(=O)C(C1NC(N)=NCC1)NC(=O)NC(C(O)=O)CC1=CC=CC=C1 OLVPQBGMUGIKIW-UHFFFAOYSA-N 0.000 description 2
108091026890 Coding region Proteins 0.000 description 2
238000011537 Coomassie blue staining Methods 0.000 description 2
QLCPDGRAEJSYQM-LPEHRKFASA-N Cys-Arg-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](CS)N)C(=O)O QLCPDGRAEJSYQM-LPEHRKFASA-N 0.000 description 2
OWAFTBLVZNSIFO-SRVKXCTJSA-N Cys-His-His Chemical compound N[C@@H](CS)C(=O)N[C@@H](Cc1cnc[nH]1)C(=O)N[C@@H](Cc1cnc[nH]1)C(O)=O OWAFTBLVZNSIFO-SRVKXCTJSA-N 0.000 description 2
102000018832 Cytochromes Human genes 0.000 description 2
108010052832 Cytochromes Proteins 0.000 description 2
WQZGKKKJIJFFOK-QTVWNMPRSA-N D-mannopyranose Chemical compound OC[C@H]1OC(O)[C@@H](O)[C@@H](O)[C@@H]1O WQZGKKKJIJFFOK-QTVWNMPRSA-N 0.000 description 2
108010014303 DNA-directed DNA polymerase Proteins 0.000 description 2
KCXVZYZYPLLWCC-UHFFFAOYSA-N EDTA Chemical compound OC(=O)CN(CC(O)=O)CCN(CC(O)=O)CC(O)=O KCXVZYZYPLLWCC-UHFFFAOYSA-N 0.000 description 2
102000002322 Egg Proteins Human genes 0.000 description 2
108010000912 Egg Proteins Proteins 0.000 description 2
ZHNUHDYFZUAESO-UHFFFAOYSA-N Formamide Chemical compound NC=O ZHNUHDYFZUAESO-UHFFFAOYSA-N 0.000 description 2
108060003306 Galactosyltransferase Proteins 0.000 description 2
102000030902 Galactosyltransferase Human genes 0.000 description 2
OPAINBJQDQTGJY-JGVFFNPUSA-N Glu-Gly-Pro Chemical compound C1C[C@@H](N(C1)C(=O)CNC(=O)[C@H](CCC(=O)O)N)C(=O)O OPAINBJQDQTGJY-JGVFFNPUSA-N 0.000 description 2
BBBXWRGITSUJPB-YUMQZZPRSA-N Glu-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H](N)CCC(O)=O BBBXWRGITSUJPB-YUMQZZPRSA-N 0.000 description 2
NNQDRRUXFJYCCJ-NHCYSSNCSA-N Glu-Pro-Val Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C(C)C)C(O)=O NNQDRRUXFJYCCJ-NHCYSSNCSA-N 0.000 description 2
IWAXHBCACVWNHT-BQBZGAKWSA-N Gly-Asp-Arg Chemical compound NCC(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CCCN=C(N)N IWAXHBCACVWNHT-BQBZGAKWSA-N 0.000 description 2
NTOWAXLMQFKJPT-YUMQZZPRSA-N Gly-Glu-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)CN NTOWAXLMQFKJPT-YUMQZZPRSA-N 0.000 description 2
ALOBJFDJTMQQPW-ONGXEEELSA-N Gly-His-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CC1=CN=CN1)NC(=O)CN ALOBJFDJTMQQPW-ONGXEEELSA-N 0.000 description 2
NSTUFLGQJCOCDL-UWVGGRQHSA-N Gly-Leu-Arg Chemical compound NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CCCN=C(N)N NSTUFLGQJCOCDL-UWVGGRQHSA-N 0.000 description 2
WDEHMRNSGHVNOH-VHSXEESVSA-N Gly-Lys-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCCCN)NC(=O)CN)C(=O)O WDEHMRNSGHVNOH-VHSXEESVSA-N 0.000 description 2
108010031186 Glycoside Hydrolases Proteins 0.000 description 2
102000005744 Glycoside Hydrolases Human genes 0.000 description 2
UXSATKFPUVZVDK-KKUMJFAQSA-N His-Lys-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CC1=CN=CN1)N UXSATKFPUVZVDK-KKUMJFAQSA-N 0.000 description 2
MRVZCDSYLJXKKX-ACRUOGEOSA-N His-Tyr-Phe Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)O)NC(=O)[C@H](CC2=CC=C(C=C2)O)NC(=O)[C@H](CC3=CN=CN3)N MRVZCDSYLJXKKX-ACRUOGEOSA-N 0.000 description 2
GYXDQXPCPASCNR-NHCYSSNCSA-N His-Val-Asn Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CC1=CN=CN1)N GYXDQXPCPASCNR-NHCYSSNCSA-N 0.000 description 2
101100005713 Homo sapiens CD4 gene Proteins 0.000 description 2
101000746373 Homo sapiens Granulocyte-macrophage colony-stimulating factor Proteins 0.000 description 2
SVZFKLBRCYCIIY-CYDGBPFRSA-N Ile-Pro-Arg Chemical compound CC[C@H](C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCCNC(N)=N)C(O)=O SVZFKLBRCYCIIY-CYDGBPFRSA-N 0.000 description 2
KFZMGEQAYNKOFK-UHFFFAOYSA-N Isopropanol Chemical compound CC(C)O KFZMGEQAYNKOFK-UHFFFAOYSA-N 0.000 description 2
HGCNKOLVKRAVHD-UHFFFAOYSA-N L-Met-L-Phe Natural products CSCCC(N)C(=O)NC(C(O)=O)CC1=CC=CC=C1 HGCNKOLVKRAVHD-UHFFFAOYSA-N 0.000 description 2
UGTHTQWIQKEDEH-BQBZGAKWSA-N L-alanyl-L-prolylglycine zwitterion Chemical compound C[C@H](N)C(=O)N1CCC[C@H]1C(=O)NCC(O)=O UGTHTQWIQKEDEH-BQBZGAKWSA-N 0.000 description 2
SENJXOPIZNYLHU-UHFFFAOYSA-N L-leucyl-L-arginine Natural products CC(C)CC(N)C(=O)NC(C(O)=O)CCCN=C(N)N SENJXOPIZNYLHU-UHFFFAOYSA-N 0.000 description 2
FFEARJCKVFRZRR-BYPYZUCNSA-N L-methionine Chemical compound CSCC[C@H](N)C(O)=O FFEARJCKVFRZRR-BYPYZUCNSA-N 0.000 description 2
TYYLDKGBCJGJGW-UHFFFAOYSA-N L-tryptophan-L-tyrosine Natural products C=1NC2=CC=CC=C2C=1CC(N)C(=O)NC(C(O)=O)CC1=CC=C(O)C=C1 TYYLDKGBCJGJGW-UHFFFAOYSA-N 0.000 description 2
CQQGCWPXDHTTNF-GUBZILKMSA-N Leu-Ala-Glu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCC(O)=O CQQGCWPXDHTTNF-GUBZILKMSA-N 0.000 description 2
PPBKJAQJAUHZKX-SRVKXCTJSA-N Leu-Cys-Leu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CS)C(=O)N[C@H](C(O)=O)CC(C)C PPBKJAQJAUHZKX-SRVKXCTJSA-N 0.000 description 2
WIDZHJTYKYBLSR-DCAQKATOSA-N Leu-Glu-Glu Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O WIDZHJTYKYBLSR-DCAQKATOSA-N 0.000 description 2
NEEOBPIXKWSBRF-IUCAKERBSA-N Leu-Glu-Gly Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)NCC(O)=O NEEOBPIXKWSBRF-IUCAKERBSA-N 0.000 description 2
WQWSMEOYXJTFRU-GUBZILKMSA-N Leu-Glu-Ser Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(O)=O WQWSMEOYXJTFRU-GUBZILKMSA-N 0.000 description 2
DSFYPIUSAMSERP-IHRRRGAJSA-N Leu-Leu-Arg Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CCCN=C(N)N DSFYPIUSAMSERP-IHRRRGAJSA-N 0.000 description 2
LVTJJOJKDCVZGP-QWRGUYRKSA-N Leu-Lys-Gly Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)NCC(O)=O LVTJJOJKDCVZGP-QWRGUYRKSA-N 0.000 description 2
PTRKPHUGYULXPU-KKUMJFAQSA-N Leu-Phe-Ser Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(O)=O PTRKPHUGYULXPU-KKUMJFAQSA-N 0.000 description 2
QWWPYKKLXWOITQ-VOAKCMCISA-N Leu-Thr-Leu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CC(C)C QWWPYKKLXWOITQ-VOAKCMCISA-N 0.000 description 2
GDBQQVLCIARPGH-UHFFFAOYSA-N Leupeptin Natural products CC(C)CC(NC(C)=O)C(=O)NC(CC(C)C)C(=O)NC(C=O)CCCN=C(N)N GDBQQVLCIARPGH-UHFFFAOYSA-N 0.000 description 2
KNKHAVVBVXKOGX-JXUBOQSCSA-N Lys-Ala-Thr Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O KNKHAVVBVXKOGX-JXUBOQSCSA-N 0.000 description 2
FUKDBQGFSJUXGX-RWMBFGLXSA-N Lys-Arg-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](CCCCN)N)C(=O)O FUKDBQGFSJUXGX-RWMBFGLXSA-N 0.000 description 2
NJNRBRKHOWSGMN-SRVKXCTJSA-N Lys-Leu-Asn Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O NJNRBRKHOWSGMN-SRVKXCTJSA-N 0.000 description 2
TWRXJAOTZQYOKJ-UHFFFAOYSA-L Magnesium chloride Chemical compound [Mg+2].[Cl-].[Cl-] TWRXJAOTZQYOKJ-UHFFFAOYSA-L 0.000 description 2
WXHHTBVYQOSYSL-FXQIFTODSA-N Met-Ala-Ser Chemical compound CSCC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CO)C(O)=O WXHHTBVYQOSYSL-FXQIFTODSA-N 0.000 description 2
SITLTJHOQZFJGG-UHFFFAOYSA-N N-L-alpha-glutamyl-L-valine Natural products CC(C)C(C(O)=O)NC(=O)C(N)CCC(O)=O SITLTJHOQZFJGG-UHFFFAOYSA-N 0.000 description 2
DWAICOVNOFPYLS-OSMVPFSASA-N N-acetyl-D-galactosaminitol Chemical compound CC(=O)N[C@@H](CO)[C@@H](O)[C@@H](O)[C@H](O)CO DWAICOVNOFPYLS-OSMVPFSASA-N 0.000 description 2
230000004988 N-glycosylation Effects 0.000 description 2
125000001429 N-terminal alpha-amino-acid group Chemical group 0.000 description 2
108091005804 Peptidases Proteins 0.000 description 2
108010055817 Peptide-N4-(N-acetyl-beta-glucosaminyl) Asparagine Amidase Proteins 0.000 description 2
102000000447 Peptide-N4-(N-acetyl-beta-glucosaminyl) Asparagine Amidase Human genes 0.000 description 2
MECSIDWUTYRHRJ-KKUMJFAQSA-N Phe-Asn-Leu Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O MECSIDWUTYRHRJ-KKUMJFAQSA-N 0.000 description 2
MRWOVVNKSXXLRP-IHPCNDPISA-N Phe-Ser-Trp Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(O)=O MRWOVVNKSXXLRP-IHPCNDPISA-N 0.000 description 2
ZCXQTRXYZOSGJR-FXQIFTODSA-N Pro-Asp-Ser Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(O)=O ZCXQTRXYZOSGJR-FXQIFTODSA-N 0.000 description 2
KDBHVPXBQADZKY-GUBZILKMSA-N Pro-Pro-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@@H]1CCCN1C(=O)[C@H]1NCCC1 KDBHVPXBQADZKY-GUBZILKMSA-N 0.000 description 2
GXWRTSIVLSQACD-RCWTZXSCSA-N Pro-Thr-Met Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCSC)C(=O)O)NC(=O)[C@@H]1CCCN1)O GXWRTSIVLSQACD-RCWTZXSCSA-N 0.000 description 2
GZNYIXWOIUFLGO-ZJDVBMNYSA-N Pro-Thr-Thr Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O GZNYIXWOIUFLGO-ZJDVBMNYSA-N 0.000 description 2
239000004365 Protease Substances 0.000 description 2
108010067787 Proteoglycans Proteins 0.000 description 2
102000016611 Proteoglycans Human genes 0.000 description 2
101000762949 Pseudomonas aeruginosa (strain ATCC 15692 / DSM 22644 / CIP 104116 / JCM 14847 / LMG 12228 / 1C / PRS 101 / PAO1) Exotoxin A Proteins 0.000 description 2
102100037486 Reverse transcriptase/ribonuclease H Human genes 0.000 description 2
ZUDXUJSYCCNZQJ-DCAQKATOSA-N Ser-His-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CC1=CN=CN1)NC(=O)[C@H](CO)N ZUDXUJSYCCNZQJ-DCAQKATOSA-N 0.000 description 2
XNCUYZKGQOCOQH-YUMQZZPRSA-N Ser-Leu-Gly Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O XNCUYZKGQOCOQH-YUMQZZPRSA-N 0.000 description 2
KCGIREHVWRXNDH-GARJFASQSA-N Ser-Leu-Pro Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CO)N KCGIREHVWRXNDH-GARJFASQSA-N 0.000 description 2
JAWGSPUJAXYXJA-IHRRRGAJSA-N Ser-Phe-Arg Chemical compound NC(N)=NCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@H](CO)N)CC1=CC=CC=C1 JAWGSPUJAXYXJA-IHRRRGAJSA-N 0.000 description 2
RHAPJNVNWDBFQI-BQBZGAKWSA-N Ser-Pro-Gly Chemical compound OC[C@H](N)C(=O)N1CCC[C@H]1C(=O)NCC(O)=O RHAPJNVNWDBFQI-BQBZGAKWSA-N 0.000 description 2
HSWXBJCBYSWBPT-GUBZILKMSA-N Ser-Val-Val Chemical compound CC(C)[C@H](NC(=O)[C@@H](NC(=O)[C@@H](N)CO)C(C)C)C(O)=O HSWXBJCBYSWBPT-GUBZILKMSA-N 0.000 description 2
CDBYLPFSWZWCQE-UHFFFAOYSA-L Sodium Carbonate Chemical compound [Na+].[Na+].[O-]C([O-])=O CDBYLPFSWZWCQE-UHFFFAOYSA-L 0.000 description 2
108010056079 Subtilisins Proteins 0.000 description 2
102000005158 Subtilisins Human genes 0.000 description 2
101100388071 Thermococcus sp. (strain GE8) pol gene Proteins 0.000 description 2
JMBRNXUOLJFURW-BEAPCOKYSA-N Thr-Phe-Pro Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N2CCC[C@@H]2C(=O)O)N)O JMBRNXUOLJFURW-BEAPCOKYSA-N 0.000 description 2
CURFABYITJVKEW-QTKMDUPCSA-N Thr-Val-His Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N)O CURFABYITJVKEW-QTKMDUPCSA-N 0.000 description 2
CCZXBOFIBYQLEV-IHPCNDPISA-N Trp-Leu-Leu Chemical compound CC(C)C[C@H](NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)Cc1c[nH]c2ccccc12)C(O)=O CCZXBOFIBYQLEV-IHPCNDPISA-N 0.000 description 2
GSCPHMSPGQSZJT-JYBASQMISA-N Trp-Ser-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)N)O GSCPHMSPGQSZJT-JYBASQMISA-N 0.000 description 2
KSCVLGXNQXKUAR-JYJNAYRXSA-N Tyr-Leu-Glu Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O KSCVLGXNQXKUAR-JYJNAYRXSA-N 0.000 description 2
XUIOBCQESNDTDE-FQPOAREZSA-N Tyr-Thr-Ala Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](C)C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)N)O XUIOBCQESNDTDE-FQPOAREZSA-N 0.000 description 2
108010065282 UDP xylose-protein xylosyltransferase Proteins 0.000 description 2
DNOOLPROHJWCSQ-RCWTZXSCSA-N Val-Arg-Thr Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H]([C@@H](C)O)C(O)=O DNOOLPROHJWCSQ-RCWTZXSCSA-N 0.000 description 2
COSLEEOIYRPTHD-YDHLFZDLSA-N Val-Asp-Tyr Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 COSLEEOIYRPTHD-YDHLFZDLSA-N 0.000 description 2
KISFXYYRKKNLOP-IHRRRGAJSA-N Val-Phe-Ser Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(=O)O)N KISFXYYRKKNLOP-IHRRRGAJSA-N 0.000 description 2
SSYBNWFXCFNRFN-GUBZILKMSA-N Val-Pro-Ser Chemical compound CC(C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CO)C(O)=O SSYBNWFXCFNRFN-GUBZILKMSA-N 0.000 description 2
BZDGLJPROOOUOZ-XGEHTFHBSA-N Val-Thr-Cys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](C(C)C)N)O BZDGLJPROOOUOZ-XGEHTFHBSA-N 0.000 description 2
GVNLOVJNNDZUHS-RHYQMDGZSA-N Val-Thr-Lys Chemical compound [H]N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCCN)C(O)=O GVNLOVJNNDZUHS-RHYQMDGZSA-N 0.000 description 2
RFZFBOQPPFCOKG-BZSNNMDCSA-N Val-Trp-Met Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)N[C@@H](CCSC)C(=O)O)N RFZFBOQPPFCOKG-BZSNNMDCSA-N 0.000 description 2
241000251539 Vertebrata <Metazoa> Species 0.000 description 2
102000010199 Xylosyltransferases Human genes 0.000 description 2
PQLVXDKIJBQVDF-UHFFFAOYSA-N acetic acid;hydrate Chemical compound O.CC(O)=O PQLVXDKIJBQVDF-UHFFFAOYSA-N 0.000 description 2
230000002378 acidificating effect Effects 0.000 description 2
239000002671 adjuvant Substances 0.000 description 2
108010039538 alanyl-glycyl-aspartyl-valine Proteins 0.000 description 2
108010076324 alanyl-glycyl-glycine Proteins 0.000 description 2
108010069020 alanyl-prolyl-glycine Proteins 0.000 description 2
RDOXTESZEPMUJZ-UHFFFAOYSA-N anisole Chemical compound COC1=CC=CC=C1 RDOXTESZEPMUJZ-UHFFFAOYSA-N 0.000 description 2
SDNYTAYICBFYFH-TUFLPTIASA-N antipain Chemical compound NC(N)=NCCC[C@@H](C=O)NC(=O)[C@H](C(C)C)NC(=O)[C@H](CCCN=C(N)N)NC(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 SDNYTAYICBFYFH-TUFLPTIASA-N 0.000 description 2
229960004405 aprotinin Drugs 0.000 description 2
108010092854 aspartyllysine Proteins 0.000 description 2
238000000376 autoradiography Methods 0.000 description 2
108010058966 bacteriophage T7 induced DNA polymerase Proteins 0.000 description 2
230000008901 benefit Effects 0.000 description 2
230000008827 biological function Effects 0.000 description 2
238000004364 calculation method Methods 0.000 description 2
230000015556 catabolic process Effects 0.000 description 2
238000004113 cell culture Methods 0.000 description 2
239000006143 cell culture medium Substances 0.000 description 2
230000008859 change Effects 0.000 description 2
108010086192 chymostatin Proteins 0.000 description 2
238000003776 cleavage reaction Methods 0.000 description 2
150000001875 compounds Chemical class 0.000 description 2
230000001143 conditioned effect Effects 0.000 description 2
230000001186 cumulative effect Effects 0.000 description 2
238000006731 degradation reaction Methods 0.000 description 2
238000004925 denaturation Methods 0.000 description 2
230000036425 denaturation Effects 0.000 description 2
235000013345 egg yolk Nutrition 0.000 description 2
210000002969 egg yolk Anatomy 0.000 description 2
210000002472 endoplasmic reticulum Anatomy 0.000 description 2
230000004927 fusion Effects 0.000 description 2
108020001507 fusion proteins Proteins 0.000 description 2
102000037865 fusion proteins Human genes 0.000 description 2
102000035122 glycosylated proteins Human genes 0.000 description 2
108091005608 glycosylated proteins Proteins 0.000 description 2
230000001279 glycosylating effect Effects 0.000 description 2
108010089804 glycyl-threonine Proteins 0.000 description 2
108010037850 glycylvaline Proteins 0.000 description 2
108010018006 histidylserine Proteins 0.000 description 2
230000006801 homologous recombination Effects 0.000 description 2
238000002744 homologous recombination Methods 0.000 description 2
230000036571 hydration Effects 0.000 description 2
238000006703 hydration reaction Methods 0.000 description 2
230000002209 hydrophobic effect Effects 0.000 description 2
125000002887 hydroxy group Chemical group [H]O* 0.000 description 2
239000003547 immunosorbent Substances 0.000 description 2
238000011065 in-situ storage Methods 0.000 description 2
ZPNFWUPYTFPOJU-LPYSRVMUSA-N iniprol Chemical compound C([C@H]1C(=O)NCC(=O)NCC(=O)N[C@H]2CSSC[C@H]3C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@H](C(N[C@H](C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC=4C=CC(O)=CC=4)C(=O)N[C@@H](CC=4C=CC=CC=4)C(=O)N[C@@H](CC=4C=CC(O)=CC=4)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C)C(=O)NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CSSC[C@H](NC(=O)[C@H](CC(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@H](C)NC(=O)[C@H](CO)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CC=4C=CC=CC=4)NC(=O)[C@H](CC(N)=O)NC(=O)[C@H](CC(N)=O)NC(=O)[C@H](CCCNC(N)=N)NC(=O)[C@H](CCCCN)NC(=O)[C@H](C)NC(=O)[C@H](CCCNC(N)=N)NC2=O)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CSSC[C@H](NC(=O)[C@H](CC=2C=CC=CC=2)NC(=O)[C@H](CC(O)=O)NC(=O)[C@H]2N(CCC2)C(=O)[C@@H](N)CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N2[C@@H](CCC2)C(=O)N2[C@@H](CCC2)C(=O)N[C@@H](CC=2C=CC(O)=CC=2)C(=O)N[C@@H]([C@@H](C)O)C(=O)NCC(=O)N2[C@@H](CCC2)C(=O)N3)C(=O)NCC(=O)NCC(=O)N[C@@H](C)C(O)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@H](C(=O)N[C@@H](CC=2C=CC=CC=2)C(=O)N[C@H](C(=O)N1)C(C)C)[C@@H](C)O)[C@@H](C)CC)=O)[C@@H](C)CC)C1=CC=C(O)C=C1 ZPNFWUPYTFPOJU-LPYSRVMUSA-N 0.000 description 2
230000003993 interaction Effects 0.000 description 2
210000003734 kidney Anatomy 0.000 description 2
108010083708 leucyl-aspartyl-valine Proteins 0.000 description 2
108010000761 leucylarginine Proteins 0.000 description 2
108010091871 leucylmethionine Proteins 0.000 description 2
108010057821 leucylproline Proteins 0.000 description 2
108010052968 leupeptin Proteins 0.000 description 2
GDBQQVLCIARPGH-ULQDDVLXSA-N leupeptin Chemical compound CC(C)C[C@H](NC(C)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C=O)CCCN=C(N)N GDBQQVLCIARPGH-ULQDDVLXSA-N 0.000 description 2
239000003446 ligand Substances 0.000 description 2
238000004519 manufacturing process Methods 0.000 description 2
230000001404 mediated effect Effects 0.000 description 2
229930182817 methionine Natural products 0.000 description 2
108010005942 methionylglycine Proteins 0.000 description 2
108010068488 methionylphenylalanine Proteins 0.000 description 2
238000004816 paper chromatography Methods 0.000 description 2
108010091212 pepstatin Proteins 0.000 description 2
229950000964 pepstatin Drugs 0.000 description 2
FAXGPCHRFPCXOO-LXTPJMTPSA-N pepstatin A Chemical compound OC(=O)C[C@H](O)[C@H](CC(C)C)NC(=O)[C@H](C)NC(=O)C[C@H](O)[C@H](CC(C)C)NC(=O)[C@H](C(C)C)NC(=O)[C@H](C(C)C)NC(=O)CC(C)C FAXGPCHRFPCXOO-LXTPJMTPSA-N 0.000 description 2
239000012071 phase Substances 0.000 description 2
108010051242 phenylalanylserine Proteins 0.000 description 2
YBYRMVIVWMBXKQ-UHFFFAOYSA-N phenylmethanesulfonyl fluoride Chemical compound FS(=O)(=O)CC1=CC=CC=C1 YBYRMVIVWMBXKQ-UHFFFAOYSA-N 0.000 description 2
230000008488 polyadenylation Effects 0.000 description 2
230000001323 posttranslational effect Effects 0.000 description 2
239000002244 precipitate Substances 0.000 description 2
125000002924 primary amino group Chemical group [H]N([H])* 0.000 description 2
239000002987 primer (paints) Substances 0.000 description 2
108010020755 prolyl-glycyl-glycine Proteins 0.000 description 2
108010077112 prolyl-proline Proteins 0.000 description 2
108010093296 prolyl-prolyl-alanine Proteins 0.000 description 2
108010053725 prolylvaline Proteins 0.000 description 2
235000019419 proteases Nutrition 0.000 description 2
230000017854 proteolysis Effects 0.000 description 2
239000011541 reaction mixture Substances 0.000 description 2
238000011084 recovery Methods 0.000 description 2
239000011347 resin Substances 0.000 description 2
229920005989 resin Polymers 0.000 description 2
230000002441 reversible effect Effects 0.000 description 2
238000009738 saturating Methods 0.000 description 2
230000007017 scission Effects 0.000 description 2
230000003248 secreting effect Effects 0.000 description 2
230000028327 secretion Effects 0.000 description 2
238000004062 sedimentation Methods 0.000 description 2
238000011451 sequencing strategy Methods 0.000 description 2
239000012679 serum free medium Substances 0.000 description 2
108010048397 seryl-lysyl-leucine Proteins 0.000 description 2
108010026333 seryl-proline Proteins 0.000 description 2
210000002027 skeletal muscle Anatomy 0.000 description 2
239000000243 solution Substances 0.000 description 2
238000001179 sorption measurement Methods 0.000 description 2
108010061238 threonyl-glycine Proteins 0.000 description 2
238000005820 transferase reaction Methods 0.000 description 2
108010058119 tryptophyl-glycyl-glycine Proteins 0.000 description 2
108010044292 tryptophyltyrosine Proteins 0.000 description 2
238000005406 washing Methods 0.000 description 2
FJQZXCPWAGYPSD-UHFFFAOYSA-N 1,3,4,6-tetrachloro-3a,6a-diphenylimidazo[4,5-d]imidazole-2,5-dione Chemical compound ClN1C(=O)N(Cl)C2(C=3C=CC=CC=3)N(Cl)C(=O)N(Cl)C12C1=CC=CC=C1 FJQZXCPWAGYPSD-UHFFFAOYSA-N 0.000 description 1
QKNYBSVHEMOAJP-UHFFFAOYSA-N 2-amino-2-(hydroxymethyl)propane-1,3-diol;hydron;chloride Chemical compound Cl.OCC(N)(CO)CO QKNYBSVHEMOAJP-UHFFFAOYSA-N 0.000 description 1
FWMNVWWHGCHHJJ-SKKKGAJSSA-N 4-amino-1-[(2r)-6-amino-2-[[(2r)-2-[[(2r)-2-[[(2r)-2-amino-3-phenylpropanoyl]amino]-3-phenylpropanoyl]amino]-4-methylpentanoyl]amino]hexanoyl]piperidine-4-carboxylic acid Chemical compound C([C@H](C(=O)N[C@H](CC(C)C)C(=O)N[C@H](CCCCN)C(=O)N1CCC(N)(CC1)C(O)=O)NC(=O)[C@H](N)CC=1C=CC=CC=1)C1=CC=CC=C1 FWMNVWWHGCHHJJ-SKKKGAJSSA-N 0.000 description 1
CXRCVCURMBFFOL-FXQIFTODSA-N Ala-Ala-Pro Chemical compound C[C@H](N)C(=O)N[C@@H](C)C(=O)N1CCC[C@H]1C(O)=O CXRCVCURMBFFOL-FXQIFTODSA-N 0.000 description 1
JBVSSSZFNTXJDX-YTLHQDLWSA-N Ala-Ala-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@H](C)N JBVSSSZFNTXJDX-YTLHQDLWSA-N 0.000 description 1
LWUWMHIOBPTZBA-DCAQKATOSA-N Ala-Arg-Lys Chemical compound NC(=N)NCCC[C@H](NC(=O)[C@@H](N)C)C(=O)N[C@@H](CCCCN)C(O)=O LWUWMHIOBPTZBA-DCAQKATOSA-N 0.000 description 1
CWEAKSWWKHGTRJ-BQBZGAKWSA-N Ala-Gly-Met Chemical compound [H]N[C@@H](C)C(=O)NCC(=O)N[C@@H](CCSC)C(O)=O CWEAKSWWKHGTRJ-BQBZGAKWSA-N 0.000 description 1
NBTGEURICRTMGL-WHFBIAKZSA-N Ala-Gly-Ser Chemical compound C[C@H](N)C(=O)NCC(=O)N[C@@H](CO)C(O)=O NBTGEURICRTMGL-WHFBIAKZSA-N 0.000 description 1
ANGAOPNEPIDLPO-XVYDVKMFSA-N Ala-His-Cys Chemical compound C[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)N[C@@H](CS)C(=O)O)N ANGAOPNEPIDLPO-XVYDVKMFSA-N 0.000 description 1
HHRAXZAYZFFRAM-CIUDSAMLSA-N Ala-Leu-Asn Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O HHRAXZAYZFFRAM-CIUDSAMLSA-N 0.000 description 1
QOIGKCBMXUCDQU-KDXUFGMBSA-N Ala-Thr-Pro Chemical compound C[C@H]([C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](C)N)O QOIGKCBMXUCDQU-KDXUFGMBSA-N 0.000 description 1
108010079054 Amyloid beta-Protein Precursor Proteins 0.000 description 1
102000014303 Amyloid beta-Protein Precursor Human genes 0.000 description 1
241000256844 Apis mellifera Species 0.000 description 1
LKDHUGLXOHYINY-XUXIUFHCSA-N Arg-Ile-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N LKDHUGLXOHYINY-XUXIUFHCSA-N 0.000 description 1
UZGFHWIJWPUPOH-IHRRRGAJSA-N Arg-Leu-Lys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N UZGFHWIJWPUPOH-IHRRRGAJSA-N 0.000 description 1
YVTHEZNOKSAWRW-DCAQKATOSA-N Arg-Lys-Ala Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C)C(O)=O YVTHEZNOKSAWRW-DCAQKATOSA-N 0.000 description 1
WCZXPVPHUMYLMS-VEVYYDQMSA-N Arg-Thr-Asp Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(O)=O)C(O)=O WCZXPVPHUMYLMS-VEVYYDQMSA-N 0.000 description 1
ZJBUILVYSXQNSW-YTWAJWBKSA-N Arg-Thr-Pro Chemical compound C[C@H]([C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N)O ZJBUILVYSXQNSW-YTWAJWBKSA-N 0.000 description 1
VJIQPOJMISSUPO-BVSLBCMMSA-N Arg-Trp-Tyr Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O VJIQPOJMISSUPO-BVSLBCMMSA-N 0.000 description 1
239000004475 Arginine Substances 0.000 description 1
GMRGSBAMMMVDGG-GUBZILKMSA-N Asn-Arg-Arg Chemical compound C(C[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)NC(=O)[C@H](CC(=O)N)N)CN=C(N)N GMRGSBAMMMVDGG-GUBZILKMSA-N 0.000 description 1
POOCJCRBHHMAOS-FXQIFTODSA-N Asn-Arg-Ser Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(O)=O POOCJCRBHHMAOS-FXQIFTODSA-N 0.000 description 1
KGCUOPPQTPZILL-CIUDSAMLSA-N Asn-Cys-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CS)NC(=O)[C@H](CC(=O)N)N KGCUOPPQTPZILL-CIUDSAMLSA-N 0.000 description 1
ULRPXVNMIIYDDJ-ACZMJKKPSA-N Asn-Glu-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CC(=O)N)N ULRPXVNMIIYDDJ-ACZMJKKPSA-N 0.000 description 1
KHCNTVRVAYCPQE-CIUDSAMLSA-N Asn-Lys-Asn Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(O)=O KHCNTVRVAYCPQE-CIUDSAMLSA-N 0.000 description 1
FBODFHMLALOPHP-GUBZILKMSA-N Asn-Lys-Glu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(O)=O FBODFHMLALOPHP-GUBZILKMSA-N 0.000 description 1
QDXQWFBLUVTOFL-FXQIFTODSA-N Asn-Met-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CCSC)NC(=O)[C@H](CC(=O)N)N QDXQWFBLUVTOFL-FXQIFTODSA-N 0.000 description 1
OROMFUQQTSWUTI-IHRRRGAJSA-N Asn-Phe-Arg Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)NC(=O)[C@H](CC(=O)N)N OROMFUQQTSWUTI-IHRRRGAJSA-N 0.000 description 1
FTNRWCPWDWRPAV-BZSNNMDCSA-N Asn-Phe-Phe Chemical compound C([C@H](NC(=O)[C@H](CC(N)=O)N)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CC=CC=C1 FTNRWCPWDWRPAV-BZSNNMDCSA-N 0.000 description 1
ATHZHGQSAIJHQU-XIRDDKMYSA-N Asn-Trp-Lys Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC(=O)N)N ATHZHGQSAIJHQU-XIRDDKMYSA-N 0.000 description 1
HPNDBHLITCHRSO-WHFBIAKZSA-N Asp-Ala-Gly Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(=O)NCC(O)=O HPNDBHLITCHRSO-WHFBIAKZSA-N 0.000 description 1
SLHOOKXYTYAJGQ-XVYDVKMFSA-N Asp-Ala-His Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC1=CNC=N1 SLHOOKXYTYAJGQ-XVYDVKMFSA-N 0.000 description 1
NJIKKGUVGUBICV-ZLUOBGJFSA-N Asp-Ala-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CC(O)=O NJIKKGUVGUBICV-ZLUOBGJFSA-N 0.000 description 1
ZLGKHJHFYSRUBH-FXQIFTODSA-N Asp-Arg-Asp Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(O)=O)C(O)=O ZLGKHJHFYSRUBH-FXQIFTODSA-N 0.000 description 1
SDHFVYLZFBDSQT-DCAQKATOSA-N Asp-Arg-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](CC(=O)O)N SDHFVYLZFBDSQT-DCAQKATOSA-N 0.000 description 1
NAPNAGZWHQHZLG-ZLUOBGJFSA-N Asp-Asp-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CC(=O)O)N NAPNAGZWHQHZLG-ZLUOBGJFSA-N 0.000 description 1
NURJSGZGBVJFAD-ZLUOBGJFSA-N Asp-Cys-Ser Chemical compound C([C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CO)C(=O)O)N)C(=O)O NURJSGZGBVJFAD-ZLUOBGJFSA-N 0.000 description 1
KLYPOCBLKMPBIQ-GHCJXIJMSA-N Asp-Ile-Ser Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CC(=O)O)N KLYPOCBLKMPBIQ-GHCJXIJMSA-N 0.000 description 1
LKVKODXGSAFOFY-VEVYYDQMSA-N Asp-Met-Thr Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCSC)C(=O)N[C@@H]([C@@H](C)O)C(O)=O LKVKODXGSAFOFY-VEVYYDQMSA-N 0.000 description 1
FAUPLTGRUBTXNU-FXQIFTODSA-N Asp-Pro-Ser Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CO)C(O)=O FAUPLTGRUBTXNU-FXQIFTODSA-N 0.000 description 1
GXHDGYOXPNQCKM-XVSYOHENSA-N Asp-Thr-Phe Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)NC(=O)[C@H](CC(=O)O)N)O GXHDGYOXPNQCKM-XVSYOHENSA-N 0.000 description 1
SQIARYGNVQWOSB-BZSNNMDCSA-N Asp-Tyr-Phe Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O SQIARYGNVQWOSB-BZSNNMDCSA-N 0.000 description 1
UXRVDHVARNBOIO-QSFUFRPTSA-N Asp-Val-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](C(C)C)NC(=O)[C@H](CC(=O)O)N UXRVDHVARNBOIO-QSFUFRPTSA-N 0.000 description 1
241000972773 Aulopiformes Species 0.000 description 1
101001120790 Caenorhabditis elegans UDP-N-acetylglucosamine-peptide N-acetylglucosaminyltransferase Proteins 0.000 description 1
OYPRJOBELJOOCE-UHFFFAOYSA-N Calcium Chemical compound [Ca] OYPRJOBELJOOCE-UHFFFAOYSA-N 0.000 description 1
201000009030 Carcinoma Diseases 0.000 description 1
102000016289 Cell Adhesion Molecules Human genes 0.000 description 1
108010067225 Cell Adhesion Molecules Proteins 0.000 description 1
102000011022 Chorionic Gonadotropin Human genes 0.000 description 1
108010062540 Chorionic Gonadotropin Proteins 0.000 description 1
241000699802 Cricetulus griseus Species 0.000 description 1
BCSYBBMFGLHCOA-ACZMJKKPSA-N Cys-Glu-Cys Chemical compound SC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CS)C(O)=O BCSYBBMFGLHCOA-ACZMJKKPSA-N 0.000 description 1
DZLQXIFVQFTFJY-BYPYZUCNSA-N Cys-Gly-Gly Chemical compound SC[C@H](N)C(=O)NCC(=O)NCC(O)=O DZLQXIFVQFTFJY-BYPYZUCNSA-N 0.000 description 1
SBDVXRYCOIEYNV-YUMQZZPRSA-N Cys-His-Gly Chemical compound C1=C(NC=N1)C[C@@H](C(=O)NCC(=O)O)NC(=O)[C@H](CS)N SBDVXRYCOIEYNV-YUMQZZPRSA-N 0.000 description 1
MXZYQNJCBVJHSR-KATARQTJSA-N Cys-Lys-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CS)N)O MXZYQNJCBVJHSR-KATARQTJSA-N 0.000 description 1
XCDDSPYIMNXECQ-NAKRPEOUSA-N Cys-Pro-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CS XCDDSPYIMNXECQ-NAKRPEOUSA-N 0.000 description 1
NXQCSPVUPLUTJH-WHFBIAKZSA-N Cys-Ser-Gly Chemical compound SC[C@H](N)C(=O)N[C@@H](CO)C(=O)NCC(O)=O NXQCSPVUPLUTJH-WHFBIAKZSA-N 0.000 description 1
MTCFGRXMJLQNBG-UWTATZPHSA-N D-Serine Chemical compound OC[C@@H](N)C(O)=O MTCFGRXMJLQNBG-UWTATZPHSA-N 0.000 description 1
150000008574 D-amino acids Chemical class 0.000 description 1
102000053602 DNA Human genes 0.000 description 1
238000001712 DNA sequencing Methods 0.000 description 1
102000016928 DNA-directed DNA polymerase Human genes 0.000 description 1
108010024212 E-Selectin Proteins 0.000 description 1
102100023471 E-selectin Human genes 0.000 description 1
102000016359 Fibronectins Human genes 0.000 description 1
108010067306 Fibronectins Proteins 0.000 description 1
102000005915 GABA Receptors Human genes 0.000 description 1
108010005551 GABA Receptors Proteins 0.000 description 1
NLKVNZUFDPWPNL-YUMQZZPRSA-N Glu-Arg-Gly Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)NCC(O)=O NLKVNZUFDPWPNL-YUMQZZPRSA-N 0.000 description 1
YYOBUPFZLKQUAX-FXQIFTODSA-N Glu-Asn-Glu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O YYOBUPFZLKQUAX-FXQIFTODSA-N 0.000 description 1
UENPHLAAKDPZQY-XKBZYTNZSA-N Glu-Cys-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CS)NC(=O)[C@H](CCC(=O)O)N)O UENPHLAAKDPZQY-XKBZYTNZSA-N 0.000 description 1
ITBHUUMCJJQUSC-LAEOZQHASA-N Glu-Ile-Gly Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)NCC(O)=O ITBHUUMCJJQUSC-LAEOZQHASA-N 0.000 description 1
WTMZXOPHTIVFCP-QEWYBTABSA-N Glu-Ile-Phe Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 WTMZXOPHTIVFCP-QEWYBTABSA-N 0.000 description 1
HRBYTAIBKPNZKQ-AVGNSLFASA-N Glu-Lys-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CCC(O)=O HRBYTAIBKPNZKQ-AVGNSLFASA-N 0.000 description 1
ZGEJRLJEAMPEDV-SRVKXCTJSA-N Glu-Lys-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CCC(=O)O)N ZGEJRLJEAMPEDV-SRVKXCTJSA-N 0.000 description 1
SUIAHERNFYRBDZ-GVXVVHGQSA-N Glu-Lys-Val Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(O)=O SUIAHERNFYRBDZ-GVXVVHGQSA-N 0.000 description 1
QMOSCLNJVKSHHU-YUMQZZPRSA-N Glu-Met-Gly Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCSC)C(=O)NCC(O)=O QMOSCLNJVKSHHU-YUMQZZPRSA-N 0.000 description 1
LKOAAMXDJGEYMS-ZPFDUUQYSA-N Glu-Met-Ile Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCSC)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O LKOAAMXDJGEYMS-ZPFDUUQYSA-N 0.000 description 1
HHSKZJZWQFPSKN-AVGNSLFASA-N Glu-Tyr-Asp Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(O)=O)C(O)=O HHSKZJZWQFPSKN-AVGNSLFASA-N 0.000 description 1
QEJKKJNDDDPSMU-KKUMJFAQSA-N Glu-Tyr-Met Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCSC)C(O)=O QEJKKJNDDDPSMU-KKUMJFAQSA-N 0.000 description 1
PYTZFYUXZZHOAD-WHFBIAKZSA-N Gly-Ala-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](C)NC(=O)CN PYTZFYUXZZHOAD-WHFBIAKZSA-N 0.000 description 1
XBWMTPAIUQIWKA-BYULHYEWSA-N Gly-Asp-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)CN XBWMTPAIUQIWKA-BYULHYEWSA-N 0.000 description 1
HDNXXTBKOJKWNN-WDSKDSINSA-N Gly-Glu-Asn Chemical compound NCC(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O HDNXXTBKOJKWNN-WDSKDSINSA-N 0.000 description 1
KAJAOGBVWCYGHZ-JTQLQIEISA-N Gly-Gly-Phe Chemical compound [NH3+]CC(=O)NCC(=O)N[C@H](C([O-])=O)CC1=CC=CC=C1 KAJAOGBVWCYGHZ-JTQLQIEISA-N 0.000 description 1
FCKPEGOCSVZPNC-WHOFXGATSA-N Gly-Ile-Phe Chemical compound NCC(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 FCKPEGOCSVZPNC-WHOFXGATSA-N 0.000 description 1
UUYBFNKHOCJCHT-VHSXEESVSA-N Gly-Leu-Pro Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)CN UUYBFNKHOCJCHT-VHSXEESVSA-N 0.000 description 1
HHRODZSXDXMUHS-LURJTMIESA-N Gly-Met-Gly Chemical compound CSCC[C@H](NC(=O)C[NH3+])C(=O)NCC([O-])=O HHRODZSXDXMUHS-LURJTMIESA-N 0.000 description 1
MTBIKIMYHUWBRX-QWRGUYRKSA-N Gly-Phe-Asn Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)CN MTBIKIMYHUWBRX-QWRGUYRKSA-N 0.000 description 1
NSVOVKWEKGEOQB-LURJTMIESA-N Gly-Pro-Gly Chemical compound NCC(=O)N1CCC[C@H]1C(=O)NCC(O)=O NSVOVKWEKGEOQB-LURJTMIESA-N 0.000 description 1
BMWFDYIYBAFROD-WPRPVWTQSA-N Gly-Pro-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)CN BMWFDYIYBAFROD-WPRPVWTQSA-N 0.000 description 1
YOBGUCWZPXJHTN-BQBZGAKWSA-N Gly-Ser-Arg Chemical compound NCC(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCCN=C(N)N YOBGUCWZPXJHTN-BQBZGAKWSA-N 0.000 description 1
OHUKZZYSJBKFRR-WHFBIAKZSA-N Gly-Ser-Asp Chemical compound [H]NCC(=O)N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(O)=O OHUKZZYSJBKFRR-WHFBIAKZSA-N 0.000 description 1
FOKISINOENBSDM-WLTAIBSBSA-N Gly-Thr-Tyr Chemical compound [H]NCC(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O FOKISINOENBSDM-WLTAIBSBSA-N 0.000 description 1
239000004471 Glycine Substances 0.000 description 1
229920002683 Glycosaminoglycan Polymers 0.000 description 1
102000001554 Hemoglobins Human genes 0.000 description 1
108010054147 Hemoglobins Proteins 0.000 description 1
229920000209 Hexadimethrine bromide Polymers 0.000 description 1
KYMUEAZVLPRVAE-GUBZILKMSA-N His-Asn-Glu Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O KYMUEAZVLPRVAE-GUBZILKMSA-N 0.000 description 1
LYSMQLXUCAKELQ-DCAQKATOSA-N His-Asp-Arg Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N LYSMQLXUCAKELQ-DCAQKATOSA-N 0.000 description 1
101000987586 Homo sapiens Eosinophil peroxidase Proteins 0.000 description 1
101000920686 Homo sapiens Erythropoietin Proteins 0.000 description 1
101001051093 Homo sapiens Low-density lipoprotein receptor Proteins 0.000 description 1
241000701109 Human adenovirus 2 Species 0.000 description 1
241000701024 Human betaherpesvirus 5 Species 0.000 description 1
TZCGZYWNIDZZMR-NAKRPEOUSA-N Ile-Arg-Ala Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](C)C(=O)O)N TZCGZYWNIDZZMR-NAKRPEOUSA-N 0.000 description 1
TZCGZYWNIDZZMR-UHFFFAOYSA-N Ile-Arg-Ala Natural products CCC(C)C(N)C(=O)NC(C(=O)NC(C)C(O)=O)CCCN=C(N)N TZCGZYWNIDZZMR-UHFFFAOYSA-N 0.000 description 1
SACHLUOUHCVIKI-GMOBBJLQSA-N Ile-Arg-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CC(=O)O)C(=O)O)N SACHLUOUHCVIKI-GMOBBJLQSA-N 0.000 description 1
HZYHBDVRCBDJJV-HAFWLYHUSA-N Ile-Asn Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@H](C(O)=O)CC(N)=O HZYHBDVRCBDJJV-HAFWLYHUSA-N 0.000 description 1
UMYZBHKAVTXWIW-GMOBBJLQSA-N Ile-Asp-Arg Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N UMYZBHKAVTXWIW-GMOBBJLQSA-N 0.000 description 1
BCVIOZZGJNOEQS-XKNYDFJKSA-N Ile-Ile Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@H](C(O)=O)[C@@H](C)CC BCVIOZZGJNOEQS-XKNYDFJKSA-N 0.000 description 1
SVBAHOMTJRFSIC-SXTJYALSSA-N Ile-Ile-Asn Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(=O)N)C(=O)O)N SVBAHOMTJRFSIC-SXTJYALSSA-N 0.000 description 1
PWDSHAAAFXISLE-SXTJYALSSA-N Ile-Ile-Asp Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(O)=O)C(O)=O PWDSHAAAFXISLE-SXTJYALSSA-N 0.000 description 1
XDUVMJCBYUKNFJ-MXAVVETBSA-N Ile-Lys-His Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N XDUVMJCBYUKNFJ-MXAVVETBSA-N 0.000 description 1
OTSVBELRDMSPKY-PCBIJLKTSA-N Ile-Phe-Asn Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(=O)N)C(=O)O)N OTSVBELRDMSPKY-PCBIJLKTSA-N 0.000 description 1
JZNVOBUNTWNZPW-GHCJXIJMSA-N Ile-Ser-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(=O)O)C(=O)O)N JZNVOBUNTWNZPW-GHCJXIJMSA-N 0.000 description 1
WKSHBPRUIRGWRZ-KCTSRDHCSA-N Ile-Trp-Gly Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)NCC(=O)O)N WKSHBPRUIRGWRZ-KCTSRDHCSA-N 0.000 description 1
UYODHPPSCXBNCS-XUXIUFHCSA-N Ile-Val-Leu Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CC(C)C UYODHPPSCXBNCS-XUXIUFHCSA-N 0.000 description 1
WIYDLTIBHZSPKY-HJWJTTGWSA-N Ile-Val-Phe Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 WIYDLTIBHZSPKY-HJWJTTGWSA-N 0.000 description 1
ONIBWKKTOPOVIA-BYPYZUCNSA-N L-Proline Chemical compound OC(=O)[C@@H]1CCCN1 ONIBWKKTOPOVIA-BYPYZUCNSA-N 0.000 description 1
RCFDOSNHHZGBOY-UHFFFAOYSA-N L-isoleucyl-L-alanine Natural products CCC(C)C(N)C(=O)NC(C)C(O)=O RCFDOSNHHZGBOY-UHFFFAOYSA-N 0.000 description 1
125000000393 L-methionino group Chemical group [H]OC(=O)[C@@]([H])(N([H])[*])C([H])([H])C(SC([H])([H])[H])([H])[H] 0.000 description 1
108010001831 LDL receptors Proteins 0.000 description 1
108090001090 Lectins Proteins 0.000 description 1
102000004856 Lectins Human genes 0.000 description 1
LJHGALIOHLRRQN-DCAQKATOSA-N Leu-Ala-Arg Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCCN=C(N)N LJHGALIOHLRRQN-DCAQKATOSA-N 0.000 description 1
IGUOAYLTQJLPPD-DCAQKATOSA-N Leu-Asn-Arg Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@H](C(O)=O)CCCN=C(N)N IGUOAYLTQJLPPD-DCAQKATOSA-N 0.000 description 1
MDVZJYGNAGLPGJ-KKUMJFAQSA-N Leu-Asn-Phe Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 MDVZJYGNAGLPGJ-KKUMJFAQSA-N 0.000 description 1
ZURHXHNAEJJRNU-CIUDSAMLSA-N Leu-Asp-Asn Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O ZURHXHNAEJJRNU-CIUDSAMLSA-N 0.000 description 1
XVSJMWYYLHPDKY-DCAQKATOSA-N Leu-Asp-Met Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCSC)C(O)=O XVSJMWYYLHPDKY-DCAQKATOSA-N 0.000 description 1
OGUUKPXUTHOIAV-SDDRHHMPSA-N Leu-Glu-Pro Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N1CCC[C@@H]1C(=O)O)N OGUUKPXUTHOIAV-SDDRHHMPSA-N 0.000 description 1
UCNNZELZXFXXJQ-BZSNNMDCSA-N Leu-Leu-Tyr Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 UCNNZELZXFXXJQ-BZSNNMDCSA-N 0.000 description 1
LZHJZLHSRGWBBE-IHRRRGAJSA-N Leu-Lys-Val Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(O)=O LZHJZLHSRGWBBE-IHRRRGAJSA-N 0.000 description 1
WMIOEVKKYIMVKI-DCAQKATOSA-N Leu-Pro-Ala Chemical compound [H]N[C@@H](CC(C)C)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C)C(O)=O WMIOEVKKYIMVKI-DCAQKATOSA-N 0.000 description 1
VULJUQZPSOASBZ-SRVKXCTJSA-N Leu-Pro-Glu Chemical compound [H]N[C@@H](CC(C)C)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(O)=O VULJUQZPSOASBZ-SRVKXCTJSA-N 0.000 description 1
BQVUABVGYYSDCJ-ZFWWWQNUSA-N Leu-Trp Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@@H](N)CC(C)C)C(O)=O)=CNC2=C1 BQVUABVGYYSDCJ-ZFWWWQNUSA-N 0.000 description 1
WGAZVKFCPHXZLO-SZMVWBNQSA-N Leu-Trp-Glu Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N WGAZVKFCPHXZLO-SZMVWBNQSA-N 0.000 description 1
CGHXMODRYJISSK-NHCYSSNCSA-N Leu-Val-Asp Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CC(O)=O CGHXMODRYJISSK-NHCYSSNCSA-N 0.000 description 1
102100024640 Low-density lipoprotein receptor Human genes 0.000 description 1
ABHIXYDMILIUKV-CIUDSAMLSA-N Lys-Asn-Asn Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O ABHIXYDMILIUKV-CIUDSAMLSA-N 0.000 description 1
YVSHZSUKQHNDHD-KKUMJFAQSA-N Lys-Asn-Phe Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CCCCN)N YVSHZSUKQHNDHD-KKUMJFAQSA-N 0.000 description 1
RDIILCRAWOSDOQ-CIUDSAMLSA-N Lys-Cys-Asp Chemical compound C(CCN)C[C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CC(=O)O)C(=O)O)N RDIILCRAWOSDOQ-CIUDSAMLSA-N 0.000 description 1
ZXEUFAVXODIPHC-GUBZILKMSA-N Lys-Glu-Asn Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O ZXEUFAVXODIPHC-GUBZILKMSA-N 0.000 description 1
VQXAVLQBQJMENB-SRVKXCTJSA-N Lys-Glu-Met Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCSC)C(O)=O VQXAVLQBQJMENB-SRVKXCTJSA-N 0.000 description 1
RBEATVHTWHTHTJ-KKUMJFAQSA-N Lys-Leu-Lys Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(O)=O RBEATVHTWHTHTJ-KKUMJFAQSA-N 0.000 description 1
YSPZCHGIWAQVKQ-AVGNSLFASA-N Lys-Pro-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CCCCN YSPZCHGIWAQVKQ-AVGNSLFASA-N 0.000 description 1
YKBSXQFZWFXFIB-VOAKCMCISA-N Lys-Thr-Lys Chemical compound NCCCC[C@H](N)C(=O)N[C@@H]([C@H](O)C)C(=O)N[C@@H](CCCCN)C(O)=O YKBSXQFZWFXFIB-VOAKCMCISA-N 0.000 description 1
VHTOGMKQXXJOHG-RHYQMDGZSA-N Lys-Thr-Val Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(O)=O VHTOGMKQXXJOHG-RHYQMDGZSA-N 0.000 description 1
VKCPHIOZDWUFSW-ONGXEEELSA-N Lys-Val-Gly Chemical compound OC(=O)CNC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CCCCN VKCPHIOZDWUFSW-ONGXEEELSA-N 0.000 description 1
HMZPYMSEAALNAE-ULQDDVLXSA-N Lys-Val-Tyr Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O HMZPYMSEAALNAE-ULQDDVLXSA-N 0.000 description 1
108010036176 Melitten Proteins 0.000 description 1
102000018697 Membrane Proteins Human genes 0.000 description 1
108010052285 Membrane Proteins Proteins 0.000 description 1
XMMWDTUFTZMQFD-GMOBBJLQSA-N Met-Asp-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CCSC XMMWDTUFTZMQFD-GMOBBJLQSA-N 0.000 description 1
UZWMJZSOXGOVIN-LURJTMIESA-N Met-Gly-Gly Chemical compound CSCC[C@H](N)C(=O)NCC(=O)NCC(O)=O UZWMJZSOXGOVIN-LURJTMIESA-N 0.000 description 1
FZUNSVYYPYJYAP-NAKRPEOUSA-N Met-Ile-Ala Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C)C(O)=O FZUNSVYYPYJYAP-NAKRPEOUSA-N 0.000 description 1
KMSMNUFBNCHMII-IHRRRGAJSA-N Met-Leu-Lys Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CCCCN KMSMNUFBNCHMII-IHRRRGAJSA-N 0.000 description 1
WPTHAGXMYDRPFD-SRVKXCTJSA-N Met-Lys-Glu Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(O)=O WPTHAGXMYDRPFD-SRVKXCTJSA-N 0.000 description 1
NLDXSXDCNZIQCN-ULQDDVLXSA-N Met-Phe-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)CCSC)CC1=CC=CC=C1 NLDXSXDCNZIQCN-ULQDDVLXSA-N 0.000 description 1
NSMXRFMGZYTFEX-KJEVXHAQSA-N Met-Thr-Tyr Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O)NC(=O)[C@H](CCSC)N)O NSMXRFMGZYTFEX-KJEVXHAQSA-N 0.000 description 1
WUGMRIBZSVSJNP-UHFFFAOYSA-N N-L-alanyl-L-tryptophan Natural products C1=CC=C2C(CC(NC(=O)C(N)C)C(O)=O)=CNC2=C1 WUGMRIBZSVSJNP-UHFFFAOYSA-N 0.000 description 1
102100023315 N-acetyllactosaminide beta-1,6-N-acetylglucosaminyl-transferase Human genes 0.000 description 1
108010056664 N-acetyllactosaminide beta-1,6-N-acetylglucosaminyltransferase Proteins 0.000 description 1
AJHCSUXXECOXOY-UHFFFAOYSA-N N-glycyl-L-tryptophan Natural products C1=CC=C2C(CC(NC(=O)CN)C(O)=O)=CNC2=C1 AJHCSUXXECOXOY-UHFFFAOYSA-N 0.000 description 1
108010002311 N-glycylglutamic acid Proteins 0.000 description 1
BQVUABVGYYSDCJ-UHFFFAOYSA-N Nalpha-L-Leucyl-L-tryptophan Natural products C1=CC=C2C(CC(NC(=O)C(N)CC(C)C)C(O)=O)=CNC2=C1 BQVUABVGYYSDCJ-UHFFFAOYSA-N 0.000 description 1
206010028980 Neoplasm Diseases 0.000 description 1
244000061176 Nicotiana tabacum Species 0.000 description 1
108091006033 O-glycosylated proteins Proteins 0.000 description 1
108010035766 P-Selectin Proteins 0.000 description 1
102100023472 P-selectin Human genes 0.000 description 1
YYRCPTVAPLQRNC-ULQDDVLXSA-N Phe-Arg-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CCCNC(N)=N)NC(=O)[C@@H](N)CC1=CC=CC=C1 YYRCPTVAPLQRNC-ULQDDVLXSA-N 0.000 description 1
YKUGPVXSDOOANW-KKUMJFAQSA-N Phe-Leu-Asp Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O YKUGPVXSDOOANW-KKUMJFAQSA-N 0.000 description 1
BONHGTUEEPIMPM-AVGNSLFASA-N Phe-Ser-Glu Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(O)=O BONHGTUEEPIMPM-AVGNSLFASA-N 0.000 description 1
MMPBPRXOFJNCCN-ZEWNOJEFSA-N Phe-Tyr-Ile Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O MMPBPRXOFJNCCN-ZEWNOJEFSA-N 0.000 description 1
OAICVXFJPJFONN-UHFFFAOYSA-N Phosphorus Chemical compound [P] OAICVXFJPJFONN-UHFFFAOYSA-N 0.000 description 1
108010001014 Plasminogen Activators Proteins 0.000 description 1
102000001938 Plasminogen Activators Human genes 0.000 description 1
IFMDQWDAJUMMJC-DCAQKATOSA-N Pro-Ala-Leu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(O)=O IFMDQWDAJUMMJC-DCAQKATOSA-N 0.000 description 1
XYSXOCIWCPFOCG-IHRRRGAJSA-N Pro-Leu-Leu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O XYSXOCIWCPFOCG-IHRRRGAJSA-N 0.000 description 1
FHZJRBVMLGOHBX-GUBZILKMSA-N Pro-Pro-Asp Chemical compound OC(=O)C[C@H](NC(=O)[C@@H]1CCCN1C(=O)[C@@H]1CCCN1)C(O)=O FHZJRBVMLGOHBX-GUBZILKMSA-N 0.000 description 1
DGDCSVGVWWAJRS-AVGNSLFASA-N Pro-Val-His Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@@H]2CCCN2 DGDCSVGVWWAJRS-AVGNSLFASA-N 0.000 description 1
IIRBTQHFVNGPMQ-AVGNSLFASA-N Pro-Val-Lys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@@H]1CCCN1 IIRBTQHFVNGPMQ-AVGNSLFASA-N 0.000 description 1
ZMLRZBWCXPQADC-TUAOUCFPSA-N Pro-Val-Pro Chemical compound CC(C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@@H]2CCCN2 ZMLRZBWCXPQADC-TUAOUCFPSA-N 0.000 description 1
YDTUEBLEAVANFH-RCWTZXSCSA-N Pro-Val-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H]1CCCN1 YDTUEBLEAVANFH-RCWTZXSCSA-N 0.000 description 1
102000009609 Pyrophosphatases Human genes 0.000 description 1
108010009413 Pyrophosphatases Proteins 0.000 description 1
108020005067 RNA Splice Sites Proteins 0.000 description 1
238000002123 RNA extraction Methods 0.000 description 1
108020004511 Recombinant DNA Proteins 0.000 description 1
108091058545 Secretory proteins Proteins 0.000 description 1
102000040739 Secretory proteins Human genes 0.000 description 1
WDXYVIIVDIDOSX-DCAQKATOSA-N Ser-Arg-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)CO)CCCN=C(N)N WDXYVIIVDIDOSX-DCAQKATOSA-N 0.000 description 1
MESDJCNHLZBMEP-ZLUOBGJFSA-N Ser-Asp-Asp Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O MESDJCNHLZBMEP-ZLUOBGJFSA-N 0.000 description 1
PVDTYLHUWAEYGY-CIUDSAMLSA-N Ser-Glu-Arg Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O PVDTYLHUWAEYGY-CIUDSAMLSA-N 0.000 description 1
GZFAWAQTEYDKII-YUMQZZPRSA-N Ser-Gly-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CO GZFAWAQTEYDKII-YUMQZZPRSA-N 0.000 description 1
KQNDIKOYWZTZIX-FXQIFTODSA-N Ser-Ser-Arg Chemical compound OC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCCNC(N)=N KQNDIKOYWZTZIX-FXQIFTODSA-N 0.000 description 1
XQJCEKXQUJQNNK-ZLUOBGJFSA-N Ser-Ser-Ser Chemical compound OC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O XQJCEKXQUJQNNK-ZLUOBGJFSA-N 0.000 description 1
LDEBVRIURYMKQS-WISUUJSJSA-N Ser-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@@H](N)CO LDEBVRIURYMKQS-WISUUJSJSA-N 0.000 description 1
SNXUIBACCONSOH-BWBBJGPYSA-N Ser-Thr-Ser Chemical compound OC[C@H](N)C(=O)N[C@@H]([C@H](O)C)C(=O)N[C@@H](CO)C(O)=O SNXUIBACCONSOH-BWBBJGPYSA-N 0.000 description 1
MFQMZDPAZRZAPV-NAKRPEOUSA-N Ser-Val-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](C(C)C)NC(=O)[C@H](CO)N MFQMZDPAZRZAPV-NAKRPEOUSA-N 0.000 description 1
241000256251 Spodoptera frugiperda Species 0.000 description 1
108091081024 Start codon Proteins 0.000 description 1
108010006785 Taq Polymerase Proteins 0.000 description 1
YOSLMIPKOUAHKI-OLHMAJIHSA-N Thr-Asp-Asp Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O YOSLMIPKOUAHKI-OLHMAJIHSA-N 0.000 description 1
VTVVYQOXJCZVEB-WDCWCFNPSA-N Thr-Leu-Glu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O VTVVYQOXJCZVEB-WDCWCFNPSA-N 0.000 description 1
XNTVWRJTUIOGQO-RHYQMDGZSA-N Thr-Met-Leu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC(C)C)C(O)=O XNTVWRJTUIOGQO-RHYQMDGZSA-N 0.000 description 1
NZRUWPIYECBYRK-HTUGSXCWSA-N Thr-Phe-Glu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(O)=O)C(O)=O NZRUWPIYECBYRK-HTUGSXCWSA-N 0.000 description 1
YGZWVPBHYABGLT-KJEVXHAQSA-N Thr-Pro-Tyr Chemical compound C[C@@H](O)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 YGZWVPBHYABGLT-KJEVXHAQSA-N 0.000 description 1
BZTSQFWJNJYZSX-JRQIVUDYSA-N Thr-Tyr-Asp Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(O)=O)C(O)=O BZTSQFWJNJYZSX-JRQIVUDYSA-N 0.000 description 1
AKHDFZHUPGVFEJ-YEPSODPASA-N Thr-Val-Gly Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(=O)NCC(O)=O AKHDFZHUPGVFEJ-YEPSODPASA-N 0.000 description 1
VYVBSMCZNHOZGD-RCWTZXSCSA-N Thr-Val-Val Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](C(C)C)C(O)=O VYVBSMCZNHOZGD-RCWTZXSCSA-N 0.000 description 1
108091036066 Three prime untranslated region Proteins 0.000 description 1
JVTHMUDOKPQBOT-NSHDSACASA-N Trp-Gly-Gly Chemical compound C1=CC=C2C(C[C@H]([NH3+])C(=O)NCC(=O)NCC([O-])=O)=CNC2=C1 JVTHMUDOKPQBOT-NSHDSACASA-N 0.000 description 1
VPRHDRKAPYZMHL-SZMVWBNQSA-N Trp-Leu-Glu Chemical compound C1=CC=C2C(C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O)=CNC2=C1 VPRHDRKAPYZMHL-SZMVWBNQSA-N 0.000 description 1
NWQCKAPDGQMZQN-IHPCNDPISA-N Trp-Lys-Leu Chemical compound [H]N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(O)=O NWQCKAPDGQMZQN-IHPCNDPISA-N 0.000 description 1
ICPRIGUXAFULPH-ILWGZMRPSA-N Trp-Tyr-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CC=C(C=C2)O)NC(=O)[C@H](CC3=CNC4=CC=CC=C43)N)C(=O)O ICPRIGUXAFULPH-ILWGZMRPSA-N 0.000 description 1
NMOIRIIIUVELLY-WDSOQIARSA-N Trp-Val-Leu Chemical compound C1=CC=C2C(C[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CC(C)C)C(O)=O)C(C)C)=CNC2=C1 NMOIRIIIUVELLY-WDSOQIARSA-N 0.000 description 1
NRFTYDWKWGJLAR-MELADBBJSA-N Tyr-Asp-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC(=O)O)NC(=O)[C@H](CC2=CC=C(C=C2)O)N)C(=O)O NRFTYDWKWGJLAR-MELADBBJSA-N 0.000 description 1
HHFMNAVFGBYSAT-IGISWZIWSA-N Tyr-Ile-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)CC)C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)N HHFMNAVFGBYSAT-IGISWZIWSA-N 0.000 description 1
SBLZVFCEOCWRLS-BPNCWPANSA-N Tyr-Met-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CCSC)NC(=O)[C@H](CC1=CC=C(C=C1)O)N SBLZVFCEOCWRLS-BPNCWPANSA-N 0.000 description 1
GOPQNCQSXBJAII-ULQDDVLXSA-N Tyr-Val-Lys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)N GOPQNCQSXBJAII-ULQDDVLXSA-N 0.000 description 1
XCCTYIAWTASOJW-UHFFFAOYSA-N UDP-Glc Natural products OC1C(O)C(COP(O)(=O)OP(O)(O)=O)OC1N1C(=O)NC(=O)C=C1 XCCTYIAWTASOJW-UHFFFAOYSA-N 0.000 description 1
XCCTYIAWTASOJW-XVFCMESISA-N Uridine-5'-Diphosphate Chemical compound O[C@@H]1[C@H](O)[C@@H](COP(O)(=O)OP(O)(O)=O)O[C@H]1N1C(=O)NC(=O)C=C1 XCCTYIAWTASOJW-XVFCMESISA-N 0.000 description 1
DLYOEFGPYTZVSP-AEJSXWLSSA-N Val-Cys-Pro Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CS)C(=O)N1CCC[C@@H]1C(=O)O)N DLYOEFGPYTZVSP-AEJSXWLSSA-N 0.000 description 1
JVYIGCARISMLMV-HOCLYGCPSA-N Val-Gly-Trp Chemical compound CC(C)[C@@H](C(=O)NCC(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)O)N JVYIGCARISMLMV-HOCLYGCPSA-N 0.000 description 1
KDKLLPMFFGYQJD-CYDGBPFRSA-N Val-Ile-Arg Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)NC(=O)[C@H](C(C)C)N KDKLLPMFFGYQJD-CYDGBPFRSA-N 0.000 description 1
BZMIYHIJVVJPCK-QSFUFRPTSA-N Val-Ile-Asn Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](C(C)C)N BZMIYHIJVVJPCK-QSFUFRPTSA-N 0.000 description 1
APQIVBCUIUDSMB-OSUNSFLBSA-N Val-Ile-Thr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)O)NC(=O)[C@H](C(C)C)N APQIVBCUIUDSMB-OSUNSFLBSA-N 0.000 description 1
LYERIXUFCYVFFX-GVXVVHGQSA-N Val-Leu-Glu Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](C(C)C)N LYERIXUFCYVFFX-GVXVVHGQSA-N 0.000 description 1
YMTOEGGOCHVGEH-IHRRRGAJSA-N Val-Lys-Lys Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(O)=O YMTOEGGOCHVGEH-IHRRRGAJSA-N 0.000 description 1
UXODSMTVPWXHBT-ULQDDVLXSA-N Val-Phe-His Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC2=CN=CN2)C(=O)O)N UXODSMTVPWXHBT-ULQDDVLXSA-N 0.000 description 1
QTPQHINADBYBNA-DCAQKATOSA-N Val-Ser-Lys Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCCCN QTPQHINADBYBNA-DCAQKATOSA-N 0.000 description 1
PMKQKNBISAOSRI-XHSDSOJGSA-N Val-Tyr-Pro Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N2CCC[C@@H]2C(=O)O)N PMKQKNBISAOSRI-XHSDSOJGSA-N 0.000 description 1
KZSNJWFQEVHDMF-UHFFFAOYSA-N Valine Chemical compound CC(C)C(N)C(O)=O KZSNJWFQEVHDMF-UHFFFAOYSA-N 0.000 description 1
SRBFZHDQGSBBOR-IOVATXLUSA-N Xylose Natural products O[C@@H]1COC(O)[C@H](O)[C@H]1O SRBFZHDQGSBBOR-IOVATXLUSA-N 0.000 description 1
230000006978 adaptation Effects 0.000 description 1
239000011543 agarose gel Substances 0.000 description 1
230000004520 agglutination Effects 0.000 description 1
108010024078 alanyl-glycyl-serine Proteins 0.000 description 1
125000000539 amino acid group Chemical group 0.000 description 1
230000003321 amplification Effects 0.000 description 1
239000003242 anti bacterial agent Substances 0.000 description 1
230000001407 anti-thrombic effect Effects 0.000 description 1
229940088710 antibiotic agent Drugs 0.000 description 1
230000010100 anticoagulation Effects 0.000 description 1
239000007864 aqueous solution Substances 0.000 description 1
PYMYPHUHKUWMLA-UHFFFAOYSA-N arabinose Natural products OCC(O)C(O)C(O)C=O PYMYPHUHKUWMLA-UHFFFAOYSA-N 0.000 description 1
ODKSFYDXXFIFQN-UHFFFAOYSA-N arginine Natural products OC(=O)C(N)CCCNC(N)=N ODKSFYDXXFIFQN-UHFFFAOYSA-N 0.000 description 1
108010043240 arginyl-leucyl-glycine Proteins 0.000 description 1
108010068380 arginylarginine Proteins 0.000 description 1
108010062796 arginyllysine Proteins 0.000 description 1
235000009582 asparagine Nutrition 0.000 description 1
125000000613 asparagine group Chemical class N[C@@H](CC(N)=O)C(=O)* 0.000 description 1
108010093581 aspartyl-proline Proteins 0.000 description 1
108010038633 aspartylglutamate Proteins 0.000 description 1
108010068265 aspartyltyrosine Proteins 0.000 description 1
239000011324 bead Substances 0.000 description 1
SRBFZHDQGSBBOR-UHFFFAOYSA-N beta-D-Pyranose-Lyxose Natural products OC1COC(O)C(O)C1O SRBFZHDQGSBBOR-UHFFFAOYSA-N 0.000 description 1
210000004556 brain Anatomy 0.000 description 1
239000006227 byproduct Substances 0.000 description 1
210000004899 c-terminal region Anatomy 0.000 description 1
239000011575 calcium Substances 0.000 description 1
229910052791 calcium Inorganic materials 0.000 description 1
239000001506 calcium phosphate Substances 0.000 description 1
229910000389 calcium phosphate Inorganic materials 0.000 description 1
235000011010 calcium phosphates Nutrition 0.000 description 1
238000005251 capillar electrophoresis Methods 0.000 description 1
238000007036 catalytic synthesis reaction Methods 0.000 description 1
150000001768 cations Chemical class 0.000 description 1
230000024245 cell differentiation Effects 0.000 description 1
210000000170 cell membrane Anatomy 0.000 description 1
230000033383 cell-cell recognition Effects 0.000 description 1
239000003153 chemical reaction reagent Substances 0.000 description 1
239000003795 chemical substances by application Substances 0.000 description 1
229940121538 choriogonadotropin beta Drugs 0.000 description 1
230000002759 chromosomal effect Effects 0.000 description 1
210000000349 chromosome Anatomy 0.000 description 1
239000007979 citrate buffer Substances 0.000 description 1
230000000052 comparative effect Effects 0.000 description 1
239000000470 constituent Substances 0.000 description 1
239000000356 contaminant Substances 0.000 description 1
230000002079 cooperative effect Effects 0.000 description 1
230000008878 coupling Effects 0.000 description 1
238000010168 coupling process Methods 0.000 description 1
238000005859 coupling reaction Methods 0.000 description 1
ATDGTVJJHBUTRL-UHFFFAOYSA-N cyanogen bromide Chemical compound BrC#N ATDGTVJJHBUTRL-UHFFFAOYSA-N 0.000 description 1
230000007423 decrease Effects 0.000 description 1
238000010511 deprotection reaction Methods 0.000 description 1
239000000645 desinfectant Substances 0.000 description 1
238000003745 diagnosis Methods 0.000 description 1
FSXRLASFHBWESK-UHFFFAOYSA-N dipeptide phenylalanyl-tyrosine Natural products C=1C=C(O)C=CC=1CC(C(O)=O)NC(=O)C(N)CC1=CC=CC=C1 FSXRLASFHBWESK-UHFFFAOYSA-N 0.000 description 1
239000012149 elution buffer Substances 0.000 description 1
238000011067 equilibration Methods 0.000 description 1
239000006167 equilibration buffer Substances 0.000 description 1
210000003743 erythrocyte Anatomy 0.000 description 1
ZMMJGEGLRURXTF-UHFFFAOYSA-N ethidium bromide Chemical compound [Br-].C12=CC(N)=CC=C2C2=CC=C(N)C=C2[N+](CC)=C1C1=CC=CC=C1 ZMMJGEGLRURXTF-UHFFFAOYSA-N 0.000 description 1
229960005542 ethidium bromide Drugs 0.000 description 1
210000003527 eukaryotic cell Anatomy 0.000 description 1
102000013361 fetuin Human genes 0.000 description 1
108060002885 fetuin Proteins 0.000 description 1
239000012847 fine chemical Substances 0.000 description 1
239000012530 fluid Substances 0.000 description 1
229930182830 galactose Natural products 0.000 description 1
238000001641 gel filtration chromatography Methods 0.000 description 1
238000001879 gelation Methods 0.000 description 1
238000002523 gelfiltration Methods 0.000 description 1
238000010353 genetic engineering Methods 0.000 description 1
108010049041 glutamylalanine Proteins 0.000 description 1
150000004676 glycans Chemical class 0.000 description 1
125000000267 glycino group Chemical group [H]N([*])C([H])([H])C(=O)O[H] 0.000 description 1
239000000937 glycosyl acceptor Substances 0.000 description 1
108010050848 glycylleucine Proteins 0.000 description 1
108010084389 glycyltryptophan Proteins 0.000 description 1
239000006451 grace's insect medium Substances 0.000 description 1
230000005484 gravity Effects 0.000 description 1
238000003306 harvesting Methods 0.000 description 1
210000002216 heart Anatomy 0.000 description 1
238000000703 high-speed centrifugation Methods 0.000 description 1
108010036413 histidylglycine Proteins 0.000 description 1
108010092114 histidylphenylalanine Proteins 0.000 description 1
102000044890 human EPO Human genes 0.000 description 1
125000002349 hydroxyamino group Chemical group [H]ON([H])[*] 0.000 description 1
238000002513 implantation Methods 0.000 description 1
239000012194 insect media Substances 0.000 description 1
230000008611 intercellular interaction Effects 0.000 description 1
239000000543 intermediate Substances 0.000 description 1
210000000936 intestine Anatomy 0.000 description 1
230000003834 intracellular effect Effects 0.000 description 1
210000004020 intracellular membrane Anatomy 0.000 description 1
238000004255 ion exchange chromatography Methods 0.000 description 1
210000003292 kidney cell Anatomy 0.000 description 1
239000002523 lectin Substances 0.000 description 1
231100000518 lethal Toxicity 0.000 description 1
230000001665 lethal effect Effects 0.000 description 1
230000000670 limiting effect Effects 0.000 description 1
210000004185 liver Anatomy 0.000 description 1
239000012160 loading buffer Substances 0.000 description 1
210000004072 lung Anatomy 0.000 description 1
210000004698 lymphocyte Anatomy 0.000 description 1
108010064235 lysylglycine Proteins 0.000 description 1
229910001629 magnesium chloride Inorganic materials 0.000 description 1
210000004962 mammalian cell Anatomy 0.000 description 1
239000011159 matrix material Substances 0.000 description 1
230000007246 mechanism Effects 0.000 description 1
108010056582 methionylglutamic acid Proteins 0.000 description 1
UZKWTJUDCOPSNM-UHFFFAOYSA-N methoxybenzene Substances CCCCOC=C UZKWTJUDCOPSNM-UHFFFAOYSA-N 0.000 description 1
230000000813 microbial effect Effects 0.000 description 1
235000013336 milk Nutrition 0.000 description 1
239000008267 milk Substances 0.000 description 1
210000004080 milk Anatomy 0.000 description 1
230000003278 mimic effect Effects 0.000 description 1
239000002480 mineral oil Substances 0.000 description 1
235000010446 mineral oil Nutrition 0.000 description 1
238000010369 molecular cloning Methods 0.000 description 1
150000002772 monosaccharides Chemical class 0.000 description 1
NKAAEMMYHLFEFN-UHFFFAOYSA-M monosodium tartrate Chemical compound [Na+].OC(=O)C(O)C(O)C([O-])=O NKAAEMMYHLFEFN-UHFFFAOYSA-M 0.000 description 1
230000035772 mutation Effects 0.000 description 1
238000003199 nucleic acid amplification method Methods 0.000 description 1
210000001672 ovary Anatomy 0.000 description 1
238000012856 packing Methods 0.000 description 1
210000000496 pancreas Anatomy 0.000 description 1
230000036961 partial effect Effects 0.000 description 1
239000002245 particle Substances 0.000 description 1
230000007110 pathogen host interaction Effects 0.000 description 1
239000008188 pellet Substances 0.000 description 1
108010012581 phenylalanylglutamate Proteins 0.000 description 1
108010073025 phenylalanylphenylalanine Proteins 0.000 description 1
210000002826 placenta Anatomy 0.000 description 1
229940127126 plasminogen activator Drugs 0.000 description 1
229920001282 polysaccharide Polymers 0.000 description 1
239000005017 polysaccharide Substances 0.000 description 1
239000013641 positive control Substances 0.000 description 1
230000003389 potentiating effect Effects 0.000 description 1
238000001556 precipitation Methods 0.000 description 1
239000002243 precursor Substances 0.000 description 1
238000012545 processing Methods 0.000 description 1
125000001500 prolyl group Chemical group [H]N1C([H])(C(=O)[*])C([H])([H])C([H])([H])C1([H])[H] 0.000 description 1
108010079317 prolyl-tyrosine Proteins 0.000 description 1
108010090894 prolylleucine Proteins 0.000 description 1
238000000734 protein sequencing Methods 0.000 description 1
239000002516 radical scavenger Substances 0.000 description 1
108020003175 receptors Proteins 0.000 description 1
102000005962 receptors Human genes 0.000 description 1
230000008929 regeneration Effects 0.000 description 1
239000012557 regeneration buffer Substances 0.000 description 1
238000011069 regeneration method Methods 0.000 description 1
108091008146 restriction endonucleases Proteins 0.000 description 1
230000000717 retained effect Effects 0.000 description 1
238000004366 reverse phase liquid chromatography Methods 0.000 description 1
210000003705 ribosome Anatomy 0.000 description 1
210000003935 rough endoplasmic reticulum Anatomy 0.000 description 1
235000019515 salmon Nutrition 0.000 description 1
150000003839 salts Chemical class 0.000 description 1
239000012723 sample buffer Substances 0.000 description 1
238000003345 scintillation counting Methods 0.000 description 1
239000013049 sediment Substances 0.000 description 1
238000012764 semi-quantitative analysis Methods 0.000 description 1
210000002966 serum Anatomy 0.000 description 1
239000011734 sodium Substances 0.000 description 1
229910000029 sodium carbonate Inorganic materials 0.000 description 1
239000007790 solid phase Substances 0.000 description 1
239000002904 solvent Substances 0.000 description 1
108010005652 splenotritin Proteins 0.000 description 1
238000010186 staining Methods 0.000 description 1
238000010561 standard procedure Methods 0.000 description 1
238000006467 substitution reaction Methods 0.000 description 1
150000008163 sugars Chemical class 0.000 description 1
230000031068 symbiosis, encompassing mutualism through parasitism Effects 0.000 description 1
125000005931 tert-butyloxycarbonyl group Chemical group [H]C([H])([H])C(OC(*)=O)(C([H])([H])[H])C([H])([H])[H] 0.000 description 1
210000001550 testis Anatomy 0.000 description 1
238000011426 transformation method Methods 0.000 description 1
230000001131 transforming effect Effects 0.000 description 1
QORWJWZARLRLPR-UHFFFAOYSA-H tricalcium bis(phosphate) Chemical compound [Ca+2].[Ca+2].[Ca+2].[O-]P([O-])([O-])=O.[O-]P([O-])([O-])=O QORWJWZARLRLPR-UHFFFAOYSA-H 0.000 description 1
108010087967 type I signal peptidase Proteins 0.000 description 1
108010012567 tyrosyl-glycyl-glycyl-phenylalanyl Proteins 0.000 description 1
108010020532 tyrosyl-proline Proteins 0.000 description 1
238000000108 ultra-filtration Methods 0.000 description 1
241001430294 unidentified retrovirus Species 0.000 description 1
238000011144 upstream manufacturing Methods 0.000 description 1
108010073969 valyllysine Proteins 0.000 description 1
210000003462 vein Anatomy 0.000 description 1
125000000969 xylosyl group Chemical group C1([C@H](O)[C@@H](O)[C@H](O)CO1)* 0.000 description 1

Classifications

- C—CHEMISTRY; METALLURGY
- C07—ORGANIC CHEMISTRY
- C07K—PEPTIDES
- C07K7/00—Peptides having 5 to 20 amino acids in a fully defined sequence; Derivatives thereof
- C07K7/04—Linear peptides containing only normal peptide links
- C07K7/06—Linear peptides containing only normal peptide links having 5 to 11 amino acids
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N9/00—Enzymes; Proenzymes; Compositions thereof; Processes for preparing, activating, inhibiting, separating or purifying enzymes
- C12N9/10—Transferases (2.)
- C12N9/1048—Glycosyltransferases (2.4)
- C12N9/1051—Hexosyltransferases (2.4.1)
- A—HUMAN NECESSITIES
- A61—MEDICAL OR VETERINARY SCIENCE; HYGIENE
- A61K—PREPARATIONS FOR MEDICAL, DENTAL OR TOILETRY PURPOSES
- A61K38/00—Medicinal preparations containing peptides

Definitions

the present invention relates to glycosyltransferase enzymes and the genes corresponding to such enzymes.
the present invention relates to the enzyme N-acetylgalactosaminyltransferase.
the invention relates to the isolation and sequencing of the enzyme N-acetylgalactosaminyltransferase.
the invention also relates to the construction of proteins capable of expressing the acceptor peptide for the enzyme N-acetylgalactosaminyltransferase.
Carbohydrates are an important class of biological compounds.
carbohydrates function as structural components where they regulate viscosity, store energy, or are key components of cell surfaces. Nearly all site specific intercellular interactions involve cell surface carbohydrates. For example, union of sperm and egg as well as the implantation of fertilized egg are both mediated by cell surface carbohydrates.
a number of proteins that function as cell adhesion molecules including GMP-140, ELAM-1, and lymphocyte adhesion molecules like Mel- 14, exhibit structural features that mimic lectins, and are thought to bind specific cell surface carbohydrate structures (Stoolman, Cell (1989) 56:907-910). Glycosylated proteins as tumor-associated antigens are now being used to identify the presence of numerous carcinomas. Even isolated oligosaccharides have been found to exhibit biological activity on their own.
oligosaccharides have an influence on the protein or lipid to which they are conjugated (Rademacher et al., Ann. Rev..).
oligosaccharides have been shown to influence proteins, stability, rate of proteolysis, rate of in vivo clearance from the bloodstream, thermal stability and solubility. Changes in the oligosaccharide portion of cell surface carbohydrates have been noted in cells which have become cancerous. Other oligosaccharide changes have been detected during cell differentiation (Toone et al., Tetrahedron Report (1989) 45(17):5365-5422). As such, the significance of oligosaccharides to biological function cannot be understated.
O-glycosidically linked (mucin type) oligosaccharides have been reported on a number of different types of glycoproteins (Sadler, (1984) Biology of Carbohydrates. (Ginsburg and Robbins, eds.) pp. 199-213, Vol. 2, John Wiley and Sons, New York). These structures have been assigned a diverse array of functions, ranging from quite specific such as being involved in cell-cell recognition and host-pathogen interaction, to more general such as providing protection from proteolytic degradation or supplying the appropriate charge and water binding properties to mucous secretions (Sadler (1984) Biology of Carbohydrates (supra); Paulson (1989) Trends Biochem. Sc , 14:272-275; and Jentoft (1990) Trends Biochem. Sci.. 15:291-294).
the initial reaction in O-linked oligosaccharide biosynthesis is the transfer of an N-acetylgalactosamine residue from the nucleotide sugar UDP-N- acetylgalactosamine to a serine or threonine residue on the protein acceptor.
This reaction which can occur post-translationally, is catalyzed by UDP- GalNAc olypeptide, N-acetylgalactosaminyltransferase (hereinafter referred to as GalNAc-transferase or GalNAcT) an intracellular membrane bound enzyme believed to be localized in the secretory pathway.
GalNAc-transferase The exact location(s) of GalNAc-transferase is still controversial. It has been reported that the initial addition of N-acetylgalactosamine to the acceptor protein can take place early (even co-translationally) in the rough endoplasmic reticulum (ER). Other authors have suggested that this reaction is a post-translational event occurring in later ER compartments and/or in the cis region of the Golgi complex (e.g. Hanover et al. (1982) J. Biol. Chem. 257:10172-10177; Roth (1984) J. Cell Biol. 98:399-406; Elhammer and Kornfeld (1984) J. Cell Biol. 98:327-331; Tooze et al. (1988) J. Cell Biol. 106:1475-1487; Deschuyteneer et al. (1988) J. Biol. Chem.
Enzyme-mediated catalytic synthesis would offer dramatic advantages over the classical synthetic organic pathways, producing very high yields of carbohydrates (e.g., oligosaccharides and/or polysaccharides) economically, under mild conditions in aqueous solutions, and without generating notable amounts of undesired side products.
carbohydrates e.g., oligosaccharides and/or polysaccharides
Such enzymes which include glycosyltransferase, are however difficult to isolate, especially from eukaryotic, e.g., mammalian sources, because these proteins are only found in low concentrations, and tend to be membrane- bound.
the acceptor (peptide) specificity of GalNAc-transferase is poorly understood.
acceptor site for this enzyme consists of acidic amino acids closely followed by the tetrapeptide Ser-Gly-Xaa-Gly, where Xaa may be any amino acid (Bourdon et al. 1987).
the present invention is based upon the discoveries of the gene coding for the enzyme N-acetylgalactosaminyltransferase, the amino acid sequence of the enzyme N-acetylgalactosaminyltransferase, and the polypeptide sequence ofthe acceptor peptide for the enzyme N-acetylgalactosaminyltransferase. These discoveries allow for the control of glycosylation of a protein.
the present invention involves controlling the glycosylation of a protein, either within a cell or in vitro, by introducing into the DNA sequence encoding the protein at least one gene which is capable of expressing the acceptor peptide for the enzyme N-acetylgalactosaminyltransferase, expressing a protein having an acceptor cite for that enzyme, and exposing the expressed protein to that enzyme.
the present invention involves introducing into the DNA sequence encoding the protein a DNA sequence encoding an N- acetylgalactosaminyltransferase enzyme acceptor peptide having an amino acid sequence as follows: PPDAATAAPL [SEQ ID NO:20] wherein Proline is P, Aspartic Acid is D, Alanine is A, Threonine is T, and Leucine is L.
the present invention also involves expressing a protein having a PPDAATAAPL [SEQ ID NO:20] acceptor cite for that enzyme, and exposing the expressed protein to that enzyme.
the present invention also provides a process for altering the glycosylation of a protein produced by a cell where the process involves introducing into the cell at least one gene which is capable of expressing the enzyme N- acetylgalactosaminyltransferase followed by expressing a sufficient amount of the enzyme in the cell to thereby alter the glycosylation of the protein in the cell.
N-terminal Requence of bovine colostrum GalNAc-transferase sequence of oligonucleotide nrimers. restriction map for cDNA clones (pCRl000-91B and PCR1000-52A) containing the GalNAc-transferase and the sequencing strategy.
A N-terminal amino acid sequence (34 amino acids) [SEQ ID NO:l] obtained from purified bovine colostrum GalNAc-transferase.
oligonucleotides A, B and C are 512, 64 and 64, respectively.
B Nucleotide sequence of the region surrounding the EcoRI cloning site of the ⁇ gt 10 vector. Oligonucleotides F and G [SEQ ID NOS: 7 and 8, respectively] were synthesized and used in PCR reactions with the bovine small intestine cDNA library cloned in ⁇ gtlO [SEQ. ID NO: 16] (see text).
C Restriction map of cDNA clones pCR1000-91B and pCR1000-52A.
the protein coding region of the GalNAc- transferase protein is represented by the open box, the noncoding regions by the straight solid line and vector sequences by a solid box.
the arrows beneath the 9 IB clone and above the 52A clone indicate the direction and extent of sequencing of the clones.
Figure 3 Amino acid sequence [SEQ ID NO:91 of the cloned GalNAc-transferase inferred from the nucleotide sequence of cDNA clones 91B ISEQ ID NO:101 and 52A.
the proposed transmembrane sequence is indicated by the solid boxed residues.
N-linked glycosylation Potential sites for N-linked glycosylation are indicated by the dashed boxed residues and predicted sites for O-linked glycosylation are marked with a dot under the appropriate amino acid.
the N-terminus of the soluble bovine GalNAc-transferase (determined by N-terminal sequencing) is indicated by the arrow.
the consensus poly A+ sequence (AATAAA) is indicated with a solid box and the sequence of the 93 bp insert of pCR1000-93I and the 621 bp insert of pCRl000-600 are indicated by the dashed underline (931) or solid underline (600).
the numbering of the nucleotide (upper) [SEQ ID NO: 10] or amino acid sequence (lower) [SEQ ID NO:9] is indicated to the right of the sequence.
the first ATG codon obtained from the 9 IB clone [SEQ ID NO: 10] represents the beginning of the 1680 base pair nucleotide sequence for GalNAc-transferase [SEQ ID NO:ll].
Figure 4 Predicted transmembrane domain and O-linked glvcosvlation sites for the cloned GalNAc-transferase.
FIG. 6 Immunoprecipitation of in vivo 35 S-methionine labeled GalNAc-transferase expressed in baculovirus infected Sf9 cells.
the cloned GalNAc-transferase DNA was expressed in Sf9 cells using a baculovirus vector.
the infected cells were switched to culture medium containing 35 S-methionine 24 hours post-infection and harvested after another 24 hours.
the cells were lysed in a detergent containing buffer and the labeled transferase was immunoprecipitated from the cell lysates and the corresponding culture media.
the washed irnmunoprecipitates were separated by SDS-PAGE on a 10% polyacrylamide gel.
Lanes 1, 3 and 5 contain radioactivity precipitated from cell lysates of cells infected with virus containing the constructs GalNAcT 2-l.A, GalNAcT 2-l.B and CMV Pol-1, respectively.
Lanes 2, 4 and 6 contains radioactivity immunoprecipitated from the corresponding culture media. The two molecular mass forms of the immunoprecipitated protein is indicated by the arrow heads. The migration of molecular weight markers is indicated to the right.
A Human granulocyte-macrophage colony-stimulating factor.
B Human choriogonadotropin ⁇ -chain.
C Subtilisin BPN'.
D Bovine cytochrome C
A Bovine rhodanese.
B Chimeric protein constructed from the first two domains of human CD4 and the last three domains oi Pseudomonas exotoxin.
C Human LDL receptor protein.
D Human Alzheimer amyloid protein precursor.
Figure 9. Lineweaver-Burk plots of GalNAc-transferase reaction velocities.
FIG. 12 The domain structure of bovine UDP-GalNAc:polvpeptide. N- acetvlgalactosaminvltransferase: construction of the secreted, soluble enzvme.
GalNAcT denotes the full-length transferase; the domain structure of the molecule is high-lighted by the symbols described in the key.
GalNAcTs denotes the soluble fusion molecule; the melittin signal sequence and 5 amino acids forming the linkage between the signal sequence and the GalNAc-transferase sequence, are represented by the solid bar. The arrow indicates the signal peptidase cleavage site.
Figure 13 The domain structure of bovine UDP-GalNAc:polvpeptide. N- acetvlgalactosaminvltransferase: construction of the secreted, soluble enzvme.
GalNAcT denotes the full-length transferase; the domain structure of the molecule
sequences coding for the cytoplasmic and membrane spanning domains of the full-length cDNA were replaced with sequences that code for the honeybee melittin signal peptide and five linker amino acids (78 nucleotides) [SEQ ID NO: 18].
the honeybee melittin signal sequence was chosen since the intended expression system for the construct was baculovirus/Sf9 cells.
Figure 14 Separation of soluble GalNAc-transferase on SDS-polvacrvlamide electrophoresis. Silver staining detected only one protein band on the 10% polyacrylamide gel. A molecular mass of approximately 61 kDa could be detected by Coomassie Blue staining.
Figure 15 The nucleotide seouence of UDP-GalNAc:polvpeptide. N- acetylgalactosaminyl-transferase.
the depicted nucleotide sequence [SEQ ID NO: 11] codes for the enzyme N-acetylgalactosaminyltransferase.
Figure 16 The aminp acid sequence pf UDP-Ga_NAc;pQl ⁇ peptide.
N- acetylgalactosaminyl-transferase The amino acid sequence of the enzyme N- acetylgalactosaminyltransferase [SEQ ID NO:9] is depicted.
Figure 17. An amino acid sequence of a soluble form UDP-GalNAc:polvpeptide. N- acetylgalactosaminyl-transferase. The amino acid sequence of a secreted form of the enzyme N-acetylgalactosaminyltransferase [SEQ ID NO: 19] is depicted.
Figure 18 GalNAc-transferase reaction velocity plot. The transfer of 3 H-acetylgalactosamine to the acceptor peptides by soluble GalNAc-transferase was assayed as outlined in Materials and Methods.
the synthetic acceptor peptide of the instant experiment was Pro-Pro-Asp-Ala-Ala- Thr-Ala-Ala-Pro-Leu (PPDAATAAPL) [SEQ ID NO:20].
N-acetylgalactosaminyl transferase GalNAcT
GalNAcT N-acetylgalactosaminyl transferase
Fig. 15 [SEQ ID NO: 11] and the amino acid sequence depicted in Fig. 16 [SEQ ID NO:9]. This definition is intended to encompass natural allelic variations in the
GalNAct sequence and all references to GalNAcT, and nucleotide and amino acid sequences thereof are intended to encompass such allelic variations, both naturally- occurring and man-made.
Cloned genes of the present invention may code for the GalNAcT enzyme of any species of origin, but preferably code for enzymes of mammalian, most preferably bovine, origin.
Probes may be labeled with a detectable group such as a fluorescent group, a radioactive atom or a chemiluminescent group in accordance with known procedures and used in conventional hybridization assays.
a detectable group such as a fluorescent group, a radioactive atom or a chemiluminescent group in accordance with known procedures and used in conventional hybridization assays.
GalNAcT gene sequences may be obtained by use of the polymerase chain reaction (PCR) procedure, with the PCR oligonucleotide primers being produced from the GalNAcT gene sequence provided herein. See U.S. Patent Nos. 4,683,195 to Mullis et al. and 4,683,202 to Mullis.
the GalNAcT enzyme may be synthesized in host cells transformed with vectors containing DNA encoding the GalNAcT enzyme.
a vector is a replicable DNA construct. Vectors are used herein either to amplify DNA encoding the GalNAcT enzyme and/or to express DNA which encodes the GalNAcT enzyme.
An expression vector is a replicable DNA construct in which a DNA sequence encoding the GalNAcT enzyme is operably linked to suitable control sequences capable of effecting the expression of the GalNAcT enzyme in a suitable host. The need for such control sequences will vary depending upon the host selected and the transformation method chosen.
control sequences include a transcriptional promoter, an optional operator sequence to control transcription, a sequence encoding suitable mRNA ribosomal binding sites, and sequences which control the termination of transcription and translation.
Amplification vectors do not require expression control domains. All that is needed is the ability to replicate in a host, usually conferred by an origin of replication, and a selection gene to facilitate recognition of transformants.
Vectors useful for practicing the present invention include plasmids, viruses
a useful vector is a baculovirus expression vector.
the vector replicates and functions independently of the host genome, or may, in some instances, integrate into the genome itself. Suitable vectors will contain replicon and control sequences which are derived from species compatible with the intended expression host.
Transformed host cells are cells which have been transformed or transfected with the GalNAcT enzyme constructed using recombinant DNA techniques. Transformed host cells ordinarily express the GalNAcT enzyme, but host cells transformed for purposes of cloning or amplifying the GalNAcT enzyme DNA need not express the GalNAcT enzyme. When expressed, the GalNAcT enzyme will typically be located in the host cell membrane.
DNA regions are operably linked when they are functionally related to each other.
a promoter is operably linked to a coding sequence if it controls the transcription of the sequence.
a ribosome binding site is operably linked to a coding sequence if it is positioned so as to permit translation.
operably linked means contiguous and, in the case of leader sequences, contiguous and in the same translational reading frame.
Expression vectors for such cells ordinarily include (if necessary) an origin of replication, a promoter located upstream from the gene to be expressed, along with a ribosome binding site, RNA splice site (if intron-containing genomic DNA is used), a polyadenylation site, and a transcriptional termination sequence.
the transcriptional and translation control sequences in expression vectors to be used in transforming vertebrate cells are often provided by viral sources.
promoters are derived from polyoma, Adenovirus 2, and Simian Virus 40 (SV40).
SV40 Simian Virus 40
the early and late promoters of SV40 are useful because both are obtained easily from the virus as a fragment which also contains the SV40 viral origin of replication.
the GalNAcT enzyme promoter, control and/or signal sequences may also be used, provided such control sequences are compatible with the host cell chosen.
An crigin of replication may be provided either by construction of the vector to include an exogenous origin, such as may be derived from SV40 or other viral source, or may be provided by the host cell chromosomal replication mechanism. If the vector is integrated into the host cell chromosome, the latter may be sufficient.
GalNAcT enzyme made from cloned genes in accordance with the present invention may be used for designing new compounds containing oligosaccharides for a variety of healthcare and industrial applications.
host cells may be transformed with a vector of the present invention, GalNAcT enzyme expressed in that host, the cells lysed, and the enzyme isolated from the lyzed cells.
the enzyme can then be used in vitro to begin the initial reaction in the O-linked oligosaccharide biosynthesis of the transfer of an N-acetylgalactosamine residue from the nucleotide sugar UDP-N-acetylgalactosamine to a serine or threonine residue on the protein acceptor.
Cloned genes and vectors of the present invention are useful in molecular biology to transform cells which do not ordinarily express the GalNAcT enzyme to thereafter express this enzyme. Such cells are useful as intermediates for producing the enzyme. Such cells are also useful for the in vivo biosynthesis of an O-linked oligosaccharide to a protein acceptor.
Milk (and colostrum) contains a number of glycosyltransferase activities (e.g. Prieels et al, 1975; Paulson et al, 1977; Bushway et al, 1979; Parodi et al, 1984).
bovine colostrum contains what appears to be a soluble form of N-acetylgalactosaminyl transferase (GalNAcT) (Elhammer and Kornfeld, 1986) but did not provide a procedure for the purification of sufficient amounts of GalNAcT for N-terminal sequencing. The following procedure describes the purification of GalNAcT from bovine colostrum.
the amino acid sequence of the enzyme is determined by N-terminal sequencing. This information is then used to isolate a cDNA clone encoding a full- length (membrane bound) transferase which upon expression in the insect cell line Sf9 resulted in the synthesis of a fully active enzyme.
the acceptor specificity of the enzyme is then determined using a semiquantitative analysis of the amino acids surrounding known glycosylation sites in 16 different proteins followed by in vitro glycosylation studies of synthetic peptides.
[ ⁇ - 32 P]dATP 300 Ci/mmol
UDP-[lJH]N-acetylgalactosamine 8.3 Ci/mmol
Na[ 125 I] (15.2 mCi/ ⁇ g)
[ ⁇ - 33 P]dATP is from NEN/Dupont and 35 S- methionine is from ICN (Trans S-35 label, 1 mCi ml),.
Bovine colostrum is obtained from a local farmer.
UDP-N-acetylgalactosamine UDP, PMSF, chymostatin, leupeptin, antipain, pepstatin, aprotinin, bovine submaxillary mucin, Nonidet P-40 (NP-40), Triton X-100, taurodeoxycholate, Sephadex G-100 Superfine, rabbit anti- chicken IgG antibodies, ATP, myelin basic protein, subtilisin, rhodanese and cytochrome C (reduced and carboxymethylated as described by Heinrikson, R.L., 1973) are from Sigma. DEAE-Sephacel, Sepharose 6B and Protein A-Sepharose are from Pharmacia. IODOGEN is from Pierce.
N-glycosidase F is from Oxford Glycosystems. Geneamp Kit (for PCR) is obtained from Perkin Elmer/Cetus. A bovine small intestine cDNA library cloned in a ⁇ gt 10 vector is purchased from Clontech (catalog # BLIOlOa). The TA cloning vector pCRlOOO is from Invitrogen. Sequenase version 2.0 is from U.S. Biochemical Corp. The baculoGold transfection kit is from PharMingen. 1 cc Bond Elut C lg columns were from Varian. Serum-free Grace's insect medium, Insect Express, was from BioWhitaker.
the vector pVt-Bac was a gift from Dr. Thierry Vernet at the Biotechnology Research Institute, National Research Council of Canada. Patella vulgata ⁇ -N-acetylgalactosaminidase is from V-Labs, Inc. Restriction enzymes and all other reagents are from standard sources. In addition, the following buffers are used. Buffer A: 25 mM Imidazole, pH 8.
buffer B 25 mM imidazole, pH 7.2, 1 M NaCl, 1% Triton X-100, 20 mM EDTA; buffer C: 25 mM Imidazole, pH 7.2, 30 mM MnCl 2 , 20 - mM NaCl; buffer D: 25 mM Imidazole pH 7.2, 0.5 M NaCl, 20 mM EDTA; buffer E: 25 mM Ir- ' ..
Example 1 Isolation of N-acetvlgalactosaminvltransferase from Bovine Colostrum
the first four steps in the purification of the transferase are identical to the procedure described by Elhammer and Kornfeld (1986) (which is herein incorporated by reference) except that the samples loaded on the affinity columns are adjusted to 1 mM ATP (in addition to the reported buffer, salt and UDP concentrations) to compensate for an apparently higher pyrophosphatase activitydes) in the colostrum used. Equilibration, loading, washing and elution buffer volumes are adjusted (scaled up) for the larger columns used. All steps in the purification procedure are performed at +4°C and enzyme activity is assayed with the following standard assay throughout the purification.
the standard assay for UDP-GalNAc:polypeptide, N-acetylgalactosaminyl- transferase activity during purification contained the following components in a final volume of 80 ⁇ l: 50 mM Imidazole pH 7.2, 10 mM MnCl 2 , 0.5% Triton X-100, 15 ⁇ M UDP-GalNAc, UDP[1- 3 H-] GalNAc (27,000 cpm/assay), 0.15 mg/ml apomucin and varying amounts of enzyme (see individual experiments).
the reaction mixture is incubated at 37°C for 5-10 minutes (see individual experiments) and the reaction product is TCA precipitated and radioactivity measured as described.
the supernatant from the 100,000 g centrifugation is loaded directly on a DEAE- Sephacel column equilibrated in buffer A.
the bed volume of this column should be approximately equal to the amount of 100,000 g supernatant loaded
Apomucin deglycosylated mucin
bovine submaxillary mucin by the method of Hagopian and Eylar with minor modifications.
the carbohydrate content ofthe apomucin preparation is determined by the method of Reinhold.
CNBr-activated Sepharose is prepared from Sepharose 6B essentially as described by Cautrecasas.
the apomucin is coupled to the activated Sepharose in 0.1 M sodium carbonate buffer pH 9.2 at 4°C overnight. The protein concentration during the reaction is 2.5 mg/ml.
the columns are run by gravity at a pressure of -30 cm H 2 0 during loading and -60 cm H 2 O during washing, elution and regeneration.
the column Before loading, the column is washed with 400 ml buffer B (regeneration buffer) followed by 500 ml buffer C and 150 ml buffer C containing 0.25 mM UDP.
the sample Prior to loading the column the sample ( -200U enzyme activity per 50 ml column in the first affinity step) is supplemented with MnCl 2 and UDP to final concentrations of 30 mM and 1.25 mM, respectively.
the column is washed with 4 column volumes of buffer C containing 0.25 mM UDP and six 40 ml fractions are collected. The column is then eluted with buffer D.
fractions 1 and 2 25 ml each, normally contains no, or very little activity
fractions 3 and 4 50 ml each contains the bulk of the activity
fractions 3 and 4 50 ml each, contains the bulk of the activity
fractions 5 through 7 25 ml each, contains in some cases smaller amounts of activity.
the individual fractions are dialyzed against 4 liters of buffer E (2 changes) immediately after elution, and assayed for enzyme activity. Typically only fractions 3 and 4 are used in the subsequent purification.
Step 4 Apomucin affinity chromatography II
the same type column is used as in the previous one.
the column is first washed with 400 ml buffer B followed by 500 ml buffer F and 150 ml buffer F, containing 0.25 mM UDP.
dialyzed fractions 3 and 4 from step 3 are supplemented with 1 M MnCl 2 , 4 M NaCl and UDP to achieve final concentrations of 30 mM, 100 mM and 1.25 mM respectively.
Step 5 Gel filtration chromatography on Sephadex G-100 superfine
the dialyzed fractions from three step 4 runs are pooled, 1/50 volume 5% taurodeoxycholate is added, and the material is concentrated to 2.5 ml on an Amicon YM-10 filter under 40 psi pressure.
Half of this material, 1.25 ml is loaded on a Sephadex G-100 Superfine column (20-50 ⁇ m bead size; 1.5 x 100 cm) equilibrated in buffer G having 300 mM NaCl.
the column is run at a pressure of 30 cm H 2 O, which resulted in a flow of approx. 2.3 ml/hour and fractions (100 total) are collected at 40 min.
the fractions comprising the activity peak are pooled and concentrated as described above but without any further addition of detergent.
Analytical gel filtration to determine the molecular weight ofthe transferase is carried out using the same procedure but with a smaller column (0.9 x 100 cm) and collecting 1.06 ml fractions. The recoveries from this step using the conditions described above typically ranged from 80-90%.
the purified GalNAcT preparation contains only one polypeptide, with a molecular mass of approximately 70 kDa, detectable with silver staining (Figure LA). A portion ofthe purified preparation is labeled in vitro with 125 I and separated on SDS- PAGE before and after digestion with peptide N-glycosidase F. Figure IB shows that this treatment results in an approximately 6 kDa shift in the apparent molecular mass of the protein.
N-terminal sequencing of the purified bovine colostrum GalNAcT is done by automated Edman degradation in an Applied Biosystems Sequencer (Model 470) fitted with an on-line HPLC analyzer (Model 120-A) for phenylthiohydantoins. Quantitation of the latter is afforded by the Nelson Analytical Turbochrom chromatography data system connected in parallel with the recorder to the output from the HPLC system. The 34 amino acid sequence is shown in Figure 2A [SEQ ID NO:l].
Example 3 Isolation and Characterization of cDNA Clones Encoding Bovine GalNAc- Transferase
Oligonucleotide primers are synthesized based on the partial N-terminal amino acid sequence of the purified bovine colostrum enzyme with an Applied Biosystems
oligonucleotide (oligos A-E) [SEQ ID NOS: 2-6, respectively,] sequence of the primers and probes used in the Polymerase Chain Reaction (hereinafter referred to as PCR) and later in a Southern Blot analysis are shown in Fig. 2A below the GalNAcT amino acid sequence.
PCR Polymerase Chain Reaction
Fig. 2A The degeneracy of oligonucleotides A, B and C are 512, 64 and 64, respectively.
the PCR is carried out in 0.1 ml of solution containing 50 mM KCl, 10 mM Tris-HCL pH 8.3, 1.5 mM MgCl 2 , 0.2 mM each of the four dNTP's, 1 ⁇ M of each oligonucleotide, either 5 ⁇ l of the bovine intestine cDNA library or 10 ng of plasmid or ⁇ DNA and 2.5 units of Taq polymerase.
the reaction is covered with 0.1 ml of mineral oil and subjected to a temperature step cycle. When degenerate oligonucleotides are used the steps are 94°C (1 min), 37°C (2 min), 72°C (3 min) for a total of 35 cycles.
oligonucleotides For nondegenerate oligonucleotides the steps are 94°C (1 min), 55°C (2 min), 72°C (3 min) for a total of 25 cycles. Standard DNA manipulations are performed as described in Sambrook, J., Fritsch, E.F., and Maniatis, T. (1989) Molecular Cloning.
the cDNA encoding the GalNAcT gene is cloned using the following approach. Oligonucleotides A [SEQ ID NO:2] and C [SEQ ID NO:4] are used as opposing primers in a PCR reaction. A bovine small intestine cDNA library cloned into a ⁇ gtlO vector is used as the template for the reaction. On the basis of the amino acid sequence, the predicted size ofthe amplified PCR product is 93 bp. The products ofthe PCR reaction are analyzed by Southern blot analysis using ohgonucleotide B [SEQ ID NO:3] as a probe ( Figure 2A).
oligonucleotide primers D-G [SEQ ID NOS: 5-8, respectively] are synthesized. Oligonucleotides D [SEQ ID NO:5] and E [SEQ ID NO:6] are derived from the sequence ofthe pCRlOOO-931 insert and F [SEQ ID NO:7] and G [SEQ ID NO:8] are primers that directly flank either side of the EcoRI cloning site of ⁇ gtlO ( Figure 2B).
PCR reactions are run using the bovine cDNA library as template with oUgonucleotides D+F or D+G as primers.
the resulting PCR products are analyzed by Southern blot analysis using oligonucleotide E [SEQ ID NO:6] as a probe.
the 621 bp insert contains a 207 amino acid open reading frame with the first 23 amino acids of that open reading frame being a perfect match to amino acids 12-34 of the purified protein ( Figure 2A) [SEQ ID NO:l].
the 621 bp fragment contains a portion ofthe GalNAcT gene
this fragment is labeled with [ ⁇ - 32 P]dATP by nick translation (Goldin et al, 1981) and is used as a probe to screen the bovine cDNA library.
the cDNA library (containing 2.5 x IO 6 independent clones) is screened by plaque hybridization using the above labeled DNA fragment as a probe. Seven positive plaques are obtained from the primary screen and each isolate is plaque purified three times. Five of the seven isolates are found to contain inserts of 600 bp or smaller while the two remaining isolates contain inserts of approximately 1600 and 2300 bp.
the two larger inserts are PCR amplified and cloned (using oUgonucleotides F and G as primers) into the TA cloning vector to yield pCR1000-52A (1600 bp insert) and pCR1000-91B (2300 bp insert).
the size ofthe ⁇ inserts are analyzed on 1% agarose gels following restriction digest with EcoRI or by PCR using oUgonucleotides F and G as primers.
Example 4 DNA Sequence Analysis of PCR Inserts and Predicted Amino Acid Sequence
the inserts in pCR1000-93I, pCR1000-600, pCR1000-91B (2294 bp) and pCRl000-52A (1582 bp) are sequenced by the dideoxy chain termination method (Sanger et al. , 1977) using Sequenase version 2.0 with [ ⁇ - 33 P]dATP. Double stranded DNA sequencing (Ausubel et al. 1987) is done with 20-mer ohgonucleotide primers, synthesized according to the sequence of the cDNA insert. The sequencing strategy is shown in Figure 2C. Sequence analysis is performed using the Sequence Analysis software package of the University of Wisconsin Genetics Computer Group (Devereux et al, 1984).
the first ATG codon ofthe sequence obtained from the 9 IB clone is present at nucleotide 53.
the sequence of the 52A clone demonstrated that it is a truncated version of the 9 IB clone in that the sequence of this clone starts at nucleotide 162 and ends at nucleotide
the 52A insert covers nearly all of the open reading frame sequences (missing codons for the first 37 amino acids) found in the 9 IB clone.
nucleotide sequence of the 52A clone is identical to the 9 IB clone with the exception that nucleotide 358 is a G in the 52A clone instead of an A. This base change is in the wobble position (AGA. to AGGJ of codon 102 so it does not alter the arginine at that position.
the 3'-untranslated region of the 9 IB clone is 562 bp in length, contains a consensus polyadenylation signal (nucleotides 2176-2182) and a track of 25 A residues at the end ofthe clone (Fig. 3), indicating that the 91B clone contains aU the 3' terminal sequences of the GalNAcT mRNA [SEQ ID NO: 10].
MDBK cells bovine mammary tissue and various human tissues is analyzed by Northern blot analysis using the 600 bp insert of pCR 1000-600 as a hybridization probe.
Total RNA and poly A* RNA is prepared from bovine mammary tissue and from MDBK using the Invitrogen Fastrack kit, following the manufacturers procedure. Two ⁇ g of poly A * RNA are denatured by glyoxylation and Northern blot analysis is performed as previously described (Homa et al. , 1986).
a human multiple tissue Northern blot (Clontech (Cat # 7760-1)) is prehybridized in 50% formamide, 5 x SSC, 1 x Denhart's, 1% SDS, 100 ⁇ g per ml denatured salmon testes DNA, at 42°C for 2 h and then hybridized overnight at 42°C with the 32 P-labeled 600 bp insert isolated from the pCRlOOO-600. Filters are washed three times for 15 min in 0.1 X SSC, 0.1% SDS at 55°C As shown in Figure 5, at least two different sized
GalNAcT mRNA's are detected from all the samples.
the size of the bovine messages are approximately 4.1 and 3.2 kb, while all the human tissues express messages of 4.8 and 3.9 kb.
a third mRNA of approximately 1.5 kb is detected in the skeletal muscle sample.
the putative GalNAcT coding region, pCR1000-9lB is digested with Sstll and Hindlll (both enzymes cut only in pCRlOOO sequences that flank the insert; Figure 2C) and these sites are blunted using T4 DNA polymerase so that it can be cloned into a baculovirus expression vector.
BamHI linkers are then ligated onto the blunted ends and the resulting sample is ligated into the BamHI site of the baculovirus expression vector pAC373 (Summers and Smith, 1986).
the resulting isolates are screened for proper orientation of the GalNAcT open reading frame with respect to the baculovirus polyhedron promoter, to yield pAC373-GalNAcT.
Cotransfection of Sf9 cells with pAC373-GalNAcT and linearized baculovirus DNA from PharMingen's baculoGold transfection kit is performed using calcium phosphate precipitation (Summers & Smith, 1986).
the baculovirus DNA provided in the PharMingen transfection kit contains a lethal mutation that can be corrected by homologous recombination with sequences contained in the pAC373 vector. Therefore, following transfection, only recombinant viruses wiU grow on Sf9 cells.
GalNAcT 2-1A and GalNAcT 2-1B Transfections are done in duplicate and the resulting virus samples are referred to as GalNAcT 2-1A and GalNAcT 2-1B.
Cells are harvested 48 hours post infection and lysed in a detergent containing buffer. Following sedimentation of undissolved material, the cleared lysates are assayed for GalNAcT activity. Lysates from uninfected cells or from cells infected with either a baculovirus containing an unrelated gene, CMV-POL (human cytomegalovirus DNA polymerase gene), or two separate baculovirus isolates of the GalNAcT gene, GalNAcT 2-1A and GalNAcT 2- IB, are assayed.
CMV-POL human cytomegalovirus DNA polymerase gene
the baculovirus expressed protein is further examined by immunoprecipitation and SDS-PAGE analysis.
Baculovirus infected cells are labeled from 24 to 48 hours postinfection with [ 35 S]methionine.
GalNAcT is immunoprecipitated from lysates and culture media ofthe labeled cells using a chicken polyclonal antibody raised against the purified bovine colostrum enzyme.
a chicken is injected with 100 ⁇ g purified enzyme axillary, intramuscularly (with Freund's complete adjuvant).
One month later the chicken is boosted with another 50 ⁇ g antigen subcutaneously (with Freund's incomplete adjuvant); a second booster, 50 ⁇ g enzyme axillary, intra-muscularly, is administered after an additional 21 days.
Test bleeds are done two weeks after each booster. After the second test bleed (which upon analysis is found to contain anti- GalNAcT antibodies) eggs are coUected each day and used as a source for antibodies.
IgG is isolated from egg yolk as described by Jensenius et al., 1981.
Immunoprecipitation ofthe in vivo 35 S-methionine labeled enzyme is done from crude cell lysates. Infected cells are labeled between 24-48 hours postinfection with 50 ⁇ Ci/ml 35 S-methionine in medium that contains one tenth the normal methionine concentration. Approximately 1.5 X 10° labeled, infected cells are dissolved in 670 ⁇ l PBS containing 0.5% Triton X-100, 0.5% taurodeoxycholate, 0.05% SDS, 0.1 TlU/ml of Aprotinin and 10 ⁇ g/ml each of leupeptin, antipain, chymostatin and pepstatin.
any undissolved debris is sedimented at 10,000 x g for 20 minutes and the supernatant is collected.
Immunoprecipitation is carried out by the addition of 4 ⁇ l (approximately 20 ⁇ g chicken IgG) of chicken anti GalNacT antibodies; purified IgG isolated from egg yolk is used for all immunoprecipitation experiments.
the antigen- antibody complexes are isolated by over night adsorption to 22 ⁇ l (volume of sedimented gel) of protein A-Sepharose coated with rabbit anti-chicken IgG antibodies.
the coated protein A-Sepharose is prepared by incubating 330 ⁇ l sedimented protein A-Sepharose with 2.3 mg rabbit anti-chicken IgG antibodies (an affinity purified IgG fraction) in 1 ml of PBS over night; the coated protein A-Sepharose is washed three times with 1 ml PBS containing 0.5% Triton X-100, 0.5% taurodeoxycholate, 0.05% SDS. Following adsorption of the antigen, the immunosorbent is sedimented by centrifugation and washed extensively essentially as described by Dunphy et al. (1985).
the washed antigen-antibody-immunosorbent complexes are suspended in 50 ⁇ l SDS-PAGE sample buffer (Laemmli, 1970) and heated for five minutes on a boiUng water bath to release the bound antigen. Following sedimentation of the protein A-Sepharose the antigen containing supernatants are aspirated and loaded on SDS-PAGE. SDS-PAGE, and fluorography of the dried gels is done as described previously (Davis et al. , 1986) (Fig. 6).
the soluble bovine colostrum enzyme is the result of proteolytic cleavage of a membrane bound molecule or if it represents a bona fide secretory protein.
Soluble, enzymaticaUy active forms of a ⁇ l-4 galactosyltransferase and a ⁇ 2-6 sialyltransferase have been reported, both of which appear to be the result of proteolytic cleavage of membrane bound proteins (Paulson and CoUey, 1989 and references therein).
the translation products from the different mRNA species related to both these molecules appears in most tissues to be membrane bound molecules (Joziasse, 1992).
the larger sizes ofthe two GalNAcT messages are presumably related to untranslated sequences larger than those recovered in the isolated clones, in the 5' and/or 3' ends of the native molecules.
Messenger RNA molecules from previously characterized cloned glycosyltransferases frequently contain extensive 5' and 3' untranslated sequences (e.g. Weinstein et al, 1987; Larsen et al, 1989; Russo et al, 1990; Scocca et al, 1990; Sarkar et al, 1991; Nagata et al, 1992).
these two proteins are not known at present; they may represent different glycoforms of the enzyme or, perhaps more likely, the lower molecular mass form may be a proteolytic fragment, similar to the enzyme purified from bovine colostrum.
the latter possibiUty is supported by two observations: 1), the mass difference between the two molecules is roughly equal to that ofthe sequence (40 amino acids) missing in bovine colostrum enzyme and 2), while the irnmunoprecipitates from cell lysates contains predominantly the higher molecular mass form ofthe enzyme, the culture medium appears to be enriched in the lower mass form. High-speed centrifugation of the culture medium failed to sediment more than approximately 30% ofthe enzymatic activity (Data not shown).
the smaller molecular mass of the insect cell produced molecule as compared to the predicted mass of a membrane bound bovine enzyme may be the result of differences in glycosylation of the two molecules.
Insect cells typically synthesize truncated, non-sialylated N- and O-linked oUgosaccharides (e.g. Hsieh and Robbins, 1984; Domingo and Throwbridge, 1988; Kuroda et al, 1990; Thomsen et al, 1990; Wathen et al, 1991; Chen and Bahl, 1991); this results in a reduced molecular mass of insect cell produced glycoproteins on SDS- PAGE. The identity of higher molecular mass bands, approximately 120-180 kDa, on the gel is not clear.
Example 8 Construction and Expression in Sf9 cells of a Soluble GalNAc-transferase (GalNAcTs)
GalNAcTs Soluble GalNAc-transferase
the sequences coding for the cytoplasmic and membrane spanning domains of the full- length cDNA were replaced with sequences that code for the honeybee melittin signal peptide (Fig. 12).
the honeybee melittin signal sequence was chosen since the intended expression system for the construct was baculovirus/Sf9 cells.
the plasmid pAC373-GalNAcT (Homa et al., 1993) which contains the full length GalNAc-transferase gene under the control ofthe baculovirus polyhedron promoter was digested with Xbal and Bglll, which generated a 150 bp fragment, and with Bglll and Xhol, which generated a 9700 bp vector fragment. Both fragments were gel purified.
the Xbal site used is located 7 amino acids from the N-terminus of the soluble colostrum enzyme, in a portion of the molecule corresponding to what is referred to as the "stem region" in other glycosyltransferases (reviewed by Shaper & Shaper, 1992).
2100 bp fragment generated by this digest was gel purified.
the three gel purified fragments were added to the same tube and ligated.
the resulting plasmid contains a GalNAc-transferase gene under the control of the baculovirus polyhedron promoter in which the first 47 amino acids (141 nucleotides) have been replaced with 21 amino acids (63 nucleotides) ofthe honeybee meUttin signal peptide plus five (5) amino acids (15 nucleotides) that link the two domains together (Fig. 13) [SEQ ID NO: 18].
GalNAcTs-Mel in Sf9 cells resulted in 130-fold increase in GalNAc-transferase activity in the culture medium, as compared to uninfected cells (Table 2) or cells infected with an unrelated molecule ( ⁇ 6-3). This is more than 35 times the amount recovered in the medium of cells expressing the full length molecule (Homa & Elhammer, unpublished observations). A significant portion (36%) ofthe total enzymatic activity resulting from expression of the soluble molecule was, however, retained inside the cells; the reason for this is not clear at present.
Example 9 Isolation and Characterization of GalNAcTs
the bound enzyme was eluted from the column with 720 ml Buffer D and collected in seven 80 ml fractions. Run-through, wash, and eluted fractions were all dialyzed against Buffer E containing 300 mM NaCl (three changes) prior to assay for GalNAc- transferase activity. The recovery of enzyme activity on the column was invariably over 90%. The following concentration of the eluted enzyme, however, led to significant losses in activity. In fact this step accounted for the largest losses in the preparation procedure. Dialyzing the enzyme into a buffer containing 300 mM NaCl prior to concentration was an absolute necessity to avoid even higher losses in this step. The enzyme isolated from bovine colostrum shows a similar behavior in this regard (Elhammer & Kornfeld, 1986). The purified preparation was concentrated by ultrafiltration on a YM-10 membrane at 45 psi pressure.
the crude medium sample was precipitated as described by Wessel and Flugge ( 1984) prior to electrophoresis; precipitate corresponding to approximately 250 ⁇ l medium was loaded.
NH 2 -terminal sequencing ofthe purified molecule was done as described in Example 2. Interestingly, this purification procedure yielded a homogenous preparation only if expression of the molecule was carried out in serum- free medium.
the efficient production ofthe cloned molecule using the baculovirus expression system facilitated preparation of GalNAc-transferase in amounts sufficient for detailed biochemical and enzymatic studies to determine the acceptor substrate specificity of GalNAc-transferase from a database of in vivo substrates and from the in vitro glycosylation of proteins and peptides. These studies have been facilitated by the avaUabiUty of information regarding the presence of glycosylated serine and threonine residues in proteins obtained during protein sequencing. This information is registered in the NBRF protein sequence repository.
a search of the NBRF protein database yields several hundred definite or probable Thr and Ser O-glycosylation sites. From these, only those with reasonably unambiguous assignments are chosen and all proteoglycans are excluded since they contain primarily glycosaminoglycan chains where the anchoring sugar is xylose and not GalNAc. Also included into the reference set are the O-glycosylation sites identified
glycosylation sites themselves show no homology.
the complete reference set consists of the 196 glycosylated peptide segments (shown in Table 1 in Elhammer et al., 1993).
the glycosylated peptides are listed as enneapeptide (ennea Greek, nine) segments, with the reactive Ser or Thr in the central position, designated as PO. Accordingly, the amino acid side chains toward the N-terminus are designated as the subsites Pl to P4 and those toward the C-terminus as subsites Pl' to P4'.
a length of nine residues is chosen as a starting point, with the option that, depending on the results on the selectivity of the subsites, the portion of the peptides subject to analysis may be extended or truncated.
the sequences show that besides the obvious need for Ser or Thr in PO, no other subsite has an absolute requirement for any given amino acid. This then suggests that specificity of the enzyme may be the result of the cooperation of several subsites, none of them essential, but all of them contributing to catalytic efficiency.
PS is the abundance of glycosylatable peptides in all proteins and RP is the cumulative probability calculated as the product of all relevant s ⁇ values:
a Ser or Thr-containing peptide may be predicted to be a substrate for the enzyme if the probabUity, h, is higher than a certain cutoff value, h c .
a given peptide is predicted to be a glycosyl acceptor, if
the probability pattern of these two proteins is shown in Figure 7. It is interesting to note that the calculated probabilities for these two proteins are not distributed uniformly between the two extremes. Rather, a small number of residues are associated with very high probabilities whereas the rest of the sequence indicates uniformly low probabilities. Furthermore, the residues with high probabilities are clustered into one or two distinct segments where the clustering of Ser and Thr residues may perhaps be a necessary but certainly not a sufficient criterion for creating a highly glycosylated protein segment.
subtilisin BPN' (subsn.aa, Figure 7C) which is produced by a microbial system incapable of O-linked oUgosaccharide biosynthesis, contains a number of randomly distributed potential glycosylation sites, while very few nonglycosylated mammalian proteins contain any potential glycosylation sites. It is perhaps more typical to find no potential glycosylation sites at all, as in the case of horse hemoglobin (hbho.aa) or that of bovine cytochrome C (ccpg.aa, Figure 7D).
this region of the protein consists of a fully exposed segment linking the two homologous domains ofthe enzyme, exposure of native or mildly denatured rhodanese to GalNAc-transferase should result in glycosylation ofthe molecule.
the chimeric protein, CD4PE40 constructed from two domains of the human CD4 protein and three domains of the Pseudomonas exotoxin, shows two prominent potential glycosylation sites, both at regions linking individual domains, see Figure 8B. In the same vein, one would predict that subtiUsin could also be extensively glycosylated, if not in the native form then at least after mild denaturation.
the potential of the predictive method is perhaps best illustrated by its application to the LDL receptor (ldlrec.aa, Figure 8C) and the Alzheimer precursor protein (alz.aa, Figure 8D), which have both been shown to be extensively O-glycosylated, each in a known, narrow segment of the polypeptide.
the present method not only correctly identifies these regions of glycosylation but also specifically predicts which Ser and Thr residues may be modified.
the above analysis allows one to hypothesize about the saUent features of the enzyme active site responsible for the specificity of glycosylation. Table 4 indicates that high selectivity is expressed at aU subsites, but only toward Ser, Thr, and Pro.
the selectivity of a given subsite depends on how many times more frequent are at that site the surabundant residues than all the other amino acids. Also, selectivity is higher when the surabundant residue is one which occurs with low frequency in globular proteins.
S j a specificity parameter for the subsite i, S j , as the number of surabundant residues found at that site, divided by the number of these same residues expected at that site from random distribution. This ratio is then multiplied by the fraction of surabundant residues at that site.
S j The values of S it reported in Table 3, suggest that the binding site extends at least from P3 to P4' and perhaps even P4 is included in the substrate-enzyme interactions.
the preferred conformations appear to be a random coil, a sharp bend, or a ⁇ -strand from P4 to PO followed by a turn.
the enzyme does not require a preformed secondary structure but imposes one upon binding of the substrate.
the hydration index of the amino acids in the potential glycosylation sites also shown in Table 6, indicates that most peptides are reasonably exposed to the aqueous environment.
reaction products are characterized using alkaline sodium borohydride treatment essentially as described by Carlson (1968).
Digestion with Patella vulgata ⁇ -N-acetylgalactosaminidase (approximately 1 unit ml) is done in 25 mM citrate buffer pH 4.0 in a final volume of 30 ⁇ l for 24 hours.
Released radioactive sugars are separated on descending paper chromatography in pyridine-ethyl acetate-glacial acetic acid-water (5:5:1:3; v:v:v:v).
Table 7 shows that, as predicted, both bovine rhodanese and, to a lesser extent, the bacterial protein subtihsin do indeed function as acceptors for the enzyme, although neither of them reacts unless reduced and carboxymethylated prior to exposure to the enzyme.
bovine cytochrome C which contains one Ser and eight Thr residues but no predicted potential sites, is not an acceptor for the enzyme, whether in the native, or in the reduced and carboxymethylated state.
Myelin basic protein a molecule which previously has been shown to be an efficient acceptor for GalNAc-transferase (Hagopian et al., 1971) is included as a positive control in this experiment.
Rhodanese contains two additional predicted acceptor sites, Ser 142 and Ser 6 , (Fig. 8). However, due to the low rates of transfer to serine residues under our standard assay conditions, transfer to these sites should not contribute significantly to the total transfer in the assay.
the lower rate of transfer to reduced and carboxymethylated rhodanese compared to that of myelin basic protein, may be related to incomplete exposure of the acceptor sites even by the reduction and carboxymethylation procedure, and/or differences in rate constants between the acceptor sequences on the two molecules.
Myelin basic protein contains one site predicted with high probability and three additional low probability sites. The molecule can reportedly be glycosylated with 1.2 to 1.5 N-acetylgalactosamines per molecule (Cruz and MoscareUo, 1983).
the bacterial protein subtilisin contains four predicted serine sites with probabilities higher than 0.6 (Fig. 8). Three of the serines have a high exposure index in the native protein (Kabsch and Sander, 1983), but the three-dimensional structure of the protein indicates that the hydroxyls are located in a restrained environment. Again, this could account for the need for reduction and carboxymethylation for acceptor activity.
the 35 times slower transfer rate to denatured subtUisin, as compared to myelin basic protein indicates again a slower transfer to serines than to threonines, under the conditions used. Factors such as those discussed for rhodanese may also contribute to the low levels of transfer.
cytochrome C which does not contain any predicted acceptor site, is completely inactive as an acceptor, whether in the native or in the reduced and carboxymethylated form.
the ability of GalNAc-transferase to glycosylate a series of synthetic acceptor peptides is shown in Figure 9 and Table 8.
PPAdSTdSAPG Pro-Pro-Ala-D-Ser-Thr-D-Ser-Ala-Pro-Gly
the t-Boc-amino acids and the PAM resin solid supports are supplied by AB
the completed peptides are removed from the supporting resin, concurrently with the side chain-protecting groups, by a standard HF cleavage procedure using anisole as a cation scavenger (10% v/v).
the crude peptides are purified by preparative reverse phase chromatography on a C18 Vydac column (2.5 x 30 cm) using a water/acetonitrile gradient, each phase containing 0.1% TFA.
Each purified peptide is characterized by FAB MS and shows a single symmetrical peak on analytical HPLC.
glycosylated amino acids in the peptides PPASTSAPG [SEQ ID NO: 14] and PPASSSAPG [SEQ ID NO : 15] are identified by sequencing ofthe reaction products from the corresponding assay.
Fig. 11 shows that for both glycosylated peptides the majority of the sugar-Unked radioactivity is associated with residue #5, the central amino acid, be it threonine, as in PPASTSAPG [SEQ ID NO: 14], or serine, as in PPASSSAPG [SEQ ID NO: 15].
the measurable amounts of radioactivity associated with the residues following residue 5 are presumably due to the large load of peptide in the sequencer necessitated by the low specific radioactivity of the sample. Nevertheless, since the radioactivities associated with residues 7 and 8 extrapolate smoothly to that of residue 6, it is most likely that, within our experimental error, residue 6 is not labelled.
N-acetylgalactosamine to peptide acceptors is assayed by two different assays.
concentration of UDP-GalNAc is saturating in all assays; a !__, of 8 ⁇ M is reported for bovine colostrum GalNAc-transferase (Elhammer and Kornfeld,
reaction mixture contains 50
reaction product glycosylated peptide
UDP-GalNAc a product that is separated from unreacted UDP-GalNAc by chromatography on Dowex-2 columns (0.5 ml bed volume) equilibrated in water; the run-through fraction (2.5 ml) containing the glycosylated peptide is collected, supplemented with scintillation fluid and counted for radioactivity.
the assay conditions are as follows: 50 mM Imidazole, pH 7.2, 10 mM MnCl 2 , 0.5% Triton X-100, 150 ⁇ M UDP-GalNAc, approximately 260,000 cpm UDP-[ 3 H]-GalNAc and 3.2 mM acceptor peptide (the concentration of RSPPP is 3.7 mM).
the assays are incubated for 20 minutes (PPASTSAPG) [SEQ ID NO: 14] or 8 hours (PPASSSAPG [SEQ ID NO: 15], PPAdSSdSAPG and RSPPP [SEQ ID NO: 13]).
the enzyme is inactivated by placing the samples on a boiUng water bath for 1.5 minutes. The samples are then allowed to cool and the reaction products are separated from unreacted UDP-GalNAc and free GalNAc by chromatography on a Biogel P-2 column (1 X 50 cm) equiUbrated in 7% isopropanol; thirty 1.3 ml fractions are coUected.
the peptide PPASTSAPG [SEQ ID NO: 14] is designed to contain a single Thr, at PO.
the proline residues at P4, P3 and P3 1 provide maximum probabiUties at those positions; serine residues at Pl and Pl' result in good probabilities without much steric constraint.
FinaUy, the alanine residues at P2 and P2' and the glycine at P4 are indifferent as to the probabUity of glycosylation but allow for flexibUity of the peptide backbone.
Tables 8 and 9 show that this peptide is the most efficient of the acceptors tested and comparative assays show that its reactivity is very close to that of bovine apomucin (data not shown). Furthermore, the kinetic parameters for the two peptides, determined under our conditions, are quite comparable to those of the purified porcine submaxiUary GalNAc-transferase-catalyzed glycosylation of peptides whose structure is derived from sites identified in porcine submaxiUary mucin (Wang et al., 1992).
the peptide RTPPP [SEQ ID NO: 12] derived from the major acceptor sequence in myelin basic protein (Hagopian et al.,1971), has a ___, lower than that of PPASTSAPG [SEQ ID NO: 14] but also a much lower V majl and, hence, its catalytic efficiency is only half of that of PPASTSAPG [SEQ ID NO: 14].
the activities of the two corresponding peptides containing serine instead of threonines are measurable but too low for
bovine colostrum GalNAc-transferase is capable of transferring GalNAc to the serine of these peptides, albeit — under our in vitro conditions — approximately 35 times slower than to threonine (Table 9).
bovine colostrum GalNAc-transferase In contrast to the enzyme recently purified from porcine submaxiUary gland (Wang et al., 1992), however, the bovine colostrum GalNAc-transferase is definitely capable of glycosylating both threonine and serine residues.
bovine colostrum GalNAc-transferase in experiments reported by O'Connel et al. (1992), bovine colostrum GalNAc-transferase faUed to glycosylate serine in a peptide derived from human erythropoietin. This phenomenon may be related to the specific acceptor peptide used, even though a serine in this position is glycosylated in vivo.
Example 13 Transfer of N-acetvlgalactosamine to Synthetic Acceptor Peptides bv Soluble GalNAc-Transferase
the abiUty of GalNAcTs to glycosylate a series of synthetic acceptor peptides was also studied.
Assays for the determination of kinetic parameters for peptide acceptors were carried out as described by Elhammer et al. (1993) but using a modification ofthe method described by O'Connel and Tabak (1993) for isolation ofthe acceptor peptides: One ml Bond Elut columns containing 100 mg packing material were used. Before loading the assay samples, the columns were washed with 2 ml methanol followed by 2 ml 0.1% TFA (in water).
the assay samples (40 ⁇ l) were diluted to 1 ml with 0.1% TFA and loaded on the columns. Unbound radioactivity was than washed out with 4 ml 0.1% TFA, after which the glycosylated acceptor peptides were eluted with 1.5 ml 35% acetonitrile, 0.1% TFA (in water), directly into scintillation vials. Calculation of kinetic parameters was done from double reciprocal plots (Uv versus 1 S) using standard procedures.
the Km for UDP-GalNAc is approximately 1.7 ⁇ M and the Km:s for the threonine containing acceptor peptide PPASTSAPG [SEQ ID NO: 14] and the serine containing acceptor peptide PPDAASAAPLR [SEQ ID NO: 17] are approximately 6.5 and 3.6 mM, respectively.
Transfer by GalNAcTs to another serine containing acceptor peptide PPASSSAPG [SEQ ID NO: 15] is approximately 70 times slower than to PPASTSAPG [SEQ ID NO: 14] (Data not shown).
the specific activity ofthe purified enzyme preparation, using bovine apomucin as acceptor, is approximately 2,160 U/mg protein (Table 4).
the enzymatic properties of the purified GalNacTs appear to be similar to the those determined for the enzymes purified from bovine colostrum and porcine submaxiUary gland (Elhammer and Kornfeld, 1986; Elhammer et al., 1993; Wang et al., 1992; Wang et al., 1993).
the Km for the acceptor peptide PPASTSAPG [SEQ ID NO: 14] is almost identical for the colostrum enzyme and the baculo expressed molecule, 6.0 vs. 6.5 mM.
the serine containing acceptor peptide PPDAASAAPLR [SEQ ID NO: 17] (O'Connel et al.
GalNAcTs and the bovine colostrum enzyme glycosylate the peptide PPASSSAGP [SEQ ID NO: 15] are at least 35 times slower than PPASTSAPG [SEQ ID NO: 14] (Elhammer et al., 1993).
the Km for UDP-GalNAc in assays using GalNAcTs is lower than those determined for the bovine colostrum and the porcine submaxiUary gland enzymes, 1.7 uM vs. 8 ⁇ M and 6 ⁇ M, respectively (Table V; Elhammer & Kornfeld, 1986; Wang et al., 1992). The reason for this is not clear at present.
the amino acid sequence of the bovine colostrum and the cloned molecules should be identical except for five amino acids in the NH 2 -terminal end of the molecule; the colostrum enzyme sequence also contains an additional two amino acids at the NH 2 -terminus.
differences in post-translational processing, in particular glycosylation, of the Sf9 produced vs. the bovine molecule may, to some extent, influence the kinetic characteristics of the two molecules.
the oUgosaccharide structures on the insect produced molecule are most likely of high mannose and/or truncated high mannose type, while results from endoglycosidase digestion experiments suggest that the colostrum enzyme contains complex type oligosaccharides (Elhammer & Kornfeld, 1986). It is likely that the N-linked oUgosaccharide structures on the porcine enzyme also would be of the types normally synthesized by mammalian cells; peptide: N-glycosidase F digestion experiments suggest that this molecule contains 9 kDa of N-linked oUgosaccharides (Wang et al., 1992). Further experiments will however be needed to clarify this question.
the ability of soluble GalNAc-transferase to glycosylate another synthetic acceptor peptide is shown in Figure 18.
the synthetic acceptor peptide Pro-Pro-Asp- Ala-Ala-Thr-Ala-Ala-Pro-Leu (PPDAATAAPL) [SEQ ID NO:20] was synthesized by solid phase methodology as described in Example 12. Sequence analysis for the identification of the glycosylated amino acid(s) in the acceptor peptide PPDAATAAPL [SEQ ID NO:20] was also performed as described in Example 12. Experiments (data not shown) demonstrated that the incorporated radioactivity in the acceptor peptide PPDAATAAPL [SEQ ID NO:20] is in the form of N-acetylgalactosamine. Further, the glycosylated amino acids in the peptide PPDAATAAPL [SEQ ID NO:20] were identified as decribed in Example 12.
the peptide PPDAATAAPL [SEQ ID NO:20] has a lower K_, but a similar V mll _.
the synthetic acceptor peptide PPDAATAAPL [SEQ ID NO:20] has a much higher catalytic efficiency than that of PPASTSAPG [SEQ ID NO: 14].
CeUs (1X10 6 ) were infected with recombinant virus containing GalNAcTs-Mel (5 pfu/ceU). The ceUs were harvested 65 hours post infection, lysed in a detergent containing buffer and the GalNac-transferase activity was determined in the cell lysates and the corresponding culture media; lysate and culture medium from uninfected ceUs were assayed as control. The numbers have been adjusted for differences in protein content in the cell lysates; the volume ofthe culture media was 5 ml.
*1 unit equals one mole N-acetylgalactosamine transferred to apomucin per minute, under assay conditions.
Surabundant amino acids surrounding the reactive Ser or Thr Surabundance at a given subsite for a given amino acid is expressed as the number of that amino acid found at the site in excess to that expected from random distribution, divided by the S.D. of the expected distribution.
the excess of surabundant residues is equal to or higher than twice the S.D. of the expected residue
N-acetylgalactosamine to protein acceptors was assayed under standard conditions (see Material and Methods).
the acceptor concentration was 65 ⁇ M
the enzyme concentration approximately 65 mU/ml
assay time was 60 minutes.
the transfer to both native and reduced-carboxymethylated acceptors was assayed.
Bovine cytochrome C ⁇ 0.1 ⁇ 0.1
PPASSSAPG [SEQ n.a. n.a. *8.5 ID NO:15]
RSPPP [SEQ ID n.a. n.a. »0.4 b NO: 13]
Assays were done as described in Materials and Methods. The products were separated by Biogel P-2 chromatography. Assay times were 20 minutes for PPASTSAPG [SEQ ID NO: 14] and 8 hours for PPASSSAPG [SEQ ID NO: 15], PPAdSTdSAPG and RSPPP [SEQ ID NO: 13].
Glu lie Gly Thr Tyr Asp Ala Gly Met Asp Ile Trp Gly Gly Glu Asn 305 310 315 320
GCACAAACTC CAATGCAGAC CATTCTCTTG GTACCTAGAG AATATTTATC CTGATTCTCA 1320
GATTCCTCGT CACTATTTCT CTTTGGGAGA GATACGAAAT GTGGAAACAA ATCAGTGTCT 1380
AAAAAAAAAA AAAA 2294 INFORMATION FOR SEQ ID NO: 11:
GATTACTTTC AGGAAATTGG AACATATGAT GCTGGAATGG ATATTTGGGG AGGAGAAAAC 960
AAGCTCAACT TTCGCTGGTA TCCTGTTCCC CAAAGAGAAA TGGACAGAAG GAAAGGTGAT 780
Cys Pro Ile lie Asp Val Ile Ser Asp Asp Thr Phe Glu Tyr Met Ala 195 200 205

Landscapes

Chemical & Material Sciences (AREA)
Life Sciences & Earth Sciences (AREA)
Organic Chemistry (AREA)
Health & Medical Sciences (AREA)
Genetics & Genomics (AREA)
General Health & Medical Sciences (AREA)
Biochemistry (AREA)
Medicinal Chemistry (AREA)
Molecular Biology (AREA)
Engineering & Computer Science (AREA)
Bioinformatics & Cheminformatics (AREA)
Wood Science & Technology (AREA)
Zoology (AREA)
General Engineering & Computer Science (AREA)
Microbiology (AREA)
Biotechnology (AREA)
Biomedical Technology (AREA)
Biophysics (AREA)
Proteomics, Peptides & Aminoacids (AREA)
Measuring Or Testing Involving Enzymes Or Micro-Organisms (AREA)
Peptides Or Proteins (AREA)
Preparation Of Compounds By Using Micro-Organisms (AREA)

EP96930677A 1995-10-09 1996-09-09 Polypeptid-akzeptor für n-acetylgalactosaminyltransferase Pending EP0854882A1 (de)

Applications Claiming Priority (3)

Application Number	Priority Date	Filing Date	Title
US500695P	1995-10-09	1995-10-09
US5006P		1995-10-10
PCT/US1996/014136 WO1997013783A1 (en)	1995-10-09	1996-09-09	An acceptor polypeptide for an n-acetylgalactosaminyltransferase

Publications (1)

Publication Number	Publication Date
EP0854882A1 true EP0854882A1 (de)	1998-07-29

Family

ID=21713649

Family Applications (1)

Application Number	Title	Priority Date	Filing Date
EP96930677A Pending EP0854882A1 (de)	1995-10-09	1996-09-09	Polypeptid-akzeptor für n-acetylgalactosaminyltransferase

Country Status (4)

Country	Link
EP (1)	EP0854882A1 (de)
JP (1)	JPH11514232A (de)
AU (1)	AU6964196A (de)
WO (1)	WO1997013783A1 (de)

Families Citing this family (2)

* Cited by examiner, † Cited by third party
Publication number	Priority date	Publication date	Assignee	Title
JP2003199573A (ja) *	2001-12-28	2003-07-15	Jgs:Kk	新規ｕｄｐ−ｎ−アセチル−ｄ−ガラクトサミン：ポリペプチドｎ−アセチルガラクトサミン転移酵素及びこれをコードする核酸
US9045514B2 (en) *	2010-01-22	2015-06-02	Dupont Nutrition Biosciences Aps	Methods for producing amino-substituted glycolipid compounds

Family Cites Families (1)

* Cited by examiner, † Cited by third party
Publication number	Priority date	Publication date	Assignee	Title
CN1057534C (zh) *	1993-08-17	2000-10-18	柯瑞英-艾格公司	促红细胞生成素类似物

1996
- 1996-09-09 WO PCT/US1996/014136 patent/WO1997013783A1/en not_active Ceased
- 1996-09-09 AU AU69641/96A patent/AU6964196A/en not_active Abandoned
- 1996-09-09 JP JP9515030A patent/JPH11514232A/ja active Pending
- 1996-09-09 EP EP96930677A patent/EP0854882A1/de active Pending

Non-Patent Citations (1)

* Cited by examiner, † Cited by third party
Title
See references of WO9713783A1 *

Also Published As

Publication number	Publication date
WO1997013783A1 (en)	1997-04-17
JPH11514232A (ja)	1999-12-07
AU6964196A (en)	1997-04-30

Legal Events

Date	Code	Title	Description
1998-06-12	PUAI	Public reference made under article 153(3) epc to a published international application that has entered the european phase	Free format text: ORIGINAL CODE: 0009012
1998-06-12	STAA	Information on the status of an ep patent application or granted ep patent	Free format text: STATUS: REQUEST FOR EXAMINATION WAS MADE
1998-07-29	17P	Request for examination filed	Effective date: 19980501
1998-07-29	AK	Designated contracting states	Kind code of ref document: A1 Designated state(s): AT BE CH DE DK ES FI FR GB GR IE IT LI LU MC NL PT SE
1998-07-29	AX	Request for extension of the european patent	Free format text: AL PAYMENT 980501;LT PAYMENT 980501;LV PAYMENT 980501;SI PAYMENT 980501

Publication	Publication Date	Title
US6096512A (en)	2000-08-01	Cloned DNA encoding a UDP-GalNAc: Polypeptide, N-acetylgalactosaminyltransferase
Homa et al.	1993	Isolation and expression of a cDNA clone encoding a bovine UDP-GalNAc: polypeptide N-acetylgalactosaminyltransferase
Webster et al.	1993	The adenovirus protease is activated by a virus-coded disulphide-linked peptide
Aeed et al.	1994	Glycosylation of recombinant prorenin in insect cells: the insect cell line Sf9 does not express the mannose 6-phosphate recognition signal
US5962243A (en)	1999-10-05	Methods for the identification of farnesyltransferase inhibitors
EP0632831B1 (de)	2002-11-27	Nukleinsäure, expressionsvektor und zusammensetzungen zur identifizierung und herstellung von rekombinanten sialyltransferasen
WO1995004816A1 (en)	1995-02-16	Compositions and methods for producing sialyltransferases
CA2142990A1 (en)	1994-03-03	Methods and compositions for the identification, characterization, and inhibition of farnesyltransferase
US5910570A (en)	1999-06-08	Cloned DNA encoding a UDP-GalNAc: polypeptide N-acetylgalactosaminy-ltransferase
CA2114631C (en)	2005-03-01	N-acetylglucosaminyltransferase v coding sequences
WO1997013783A1 (en)	1997-04-17	An acceptor polypeptide for an n-acetylgalactosaminyltransferase
US5976851A (en)	1999-11-02	Methods and compositions for the identification, characterization, and inhibition of farnesyl protein transferase
AU705919B2 (en)	1999-06-03	Protein modifying enzyme
US6764844B1 (en)	2004-07-20	DNA sequence encoding a novel glucuronyl C5-epimerase
Homa et al.	1995	Conversion of a Bovine UDP-GalNAc: polypeptide, N-acetylgalactosaminyltransferase, to a Soluble, Secreted Enzyme, and Expression in Sf9 Cells
Fred et al.	0	UDP-Ga1NAc: Polypeptide N-Acetylgalactosaminyltransferase
CN101736008B (zh)	2012-02-08	一种制备基因工程N-乙酰化胸腺素α1的方法
GB2288401A (en)	1995-10-18	N-acetylgalactosaminyltransferase
Kravchenko et al.	2005	Alternative transcripts of POLRMT gene coding for nuclear RNA polymerase IV
NZ500268A (en)	2001-07-27	DNA sequence coding for a mammalian glucuronyl C5-epimerase and a process for its production