CN106987578B - A kind of terpenoid synthase for producing koraiol and its application - Google Patents

A kind of terpenoid synthase for producing koraiol and its application Download PDF

Info

Publication number
CN106987578B
CN106987578B CN201710229084.XA CN201710229084A CN106987578B CN 106987578 B CN106987578 B CN 106987578B CN 201710229084 A CN201710229084 A CN 201710229084A CN 106987578 B CN106987578 B CN 106987578B
Authority
CN
China
Prior art keywords
koraiol
plasmid
producing
gene
fgj09920
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Expired - Fee Related
Application number
CN201710229084.XA
Other languages
Chinese (zh)
Other versions
CN106987578A (en
Inventor
胡晓瑜
宋慧芳
张可頔
付景峰
黄毓婷
孙文琦
刘永波
鲁江峰
方呈祥
刘天罡
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Wuhan University WHU
Original Assignee
Wuhan University WHU
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Wuhan University WHU filed Critical Wuhan University WHU
Priority to CN201710229084.XA priority Critical patent/CN106987578B/en
Publication of CN106987578A publication Critical patent/CN106987578A/en
Application granted granted Critical
Publication of CN106987578B publication Critical patent/CN106987578B/en
Expired - Fee Related legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Images

Classifications

    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12NMICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
    • C12N9/00Enzymes; Proenzymes; Compositions thereof; Processes for preparing, activating, inhibiting, separating or purifying enzymes
    • C12N9/88Lyases (4.)
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12NMICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
    • C12N9/00Enzymes; Proenzymes; Compositions thereof; Processes for preparing, activating, inhibiting, separating or purifying enzymes
    • C12N9/0004Oxidoreductases (1.)
    • C12N9/0006Oxidoreductases (1.) acting on CH-OH groups as donors (1.1)
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12NMICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
    • C12N9/00Enzymes; Proenzymes; Compositions thereof; Processes for preparing, activating, inhibiting, separating or purifying enzymes
    • C12N9/10Transferases (2.)
    • C12N9/1025Acyltransferases (2.3)
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12NMICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
    • C12N9/00Enzymes; Proenzymes; Compositions thereof; Processes for preparing, activating, inhibiting, separating or purifying enzymes
    • C12N9/10Transferases (2.)
    • C12N9/12Transferases (2.) transferring phosphorus containing groups, e.g. kinases (2.7)
    • C12N9/1205Phosphotransferases with an alcohol group as acceptor (2.7.1), e.g. protein kinases
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12NMICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
    • C12N9/00Enzymes; Proenzymes; Compositions thereof; Processes for preparing, activating, inhibiting, separating or purifying enzymes
    • C12N9/10Transferases (2.)
    • C12N9/12Transferases (2.) transferring phosphorus containing groups, e.g. kinases (2.7)
    • C12N9/1229Phosphotransferases with a phosphate group as acceptor (2.7.4)
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12NMICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
    • C12N9/00Enzymes; Proenzymes; Compositions thereof; Processes for preparing, activating, inhibiting, separating or purifying enzymes
    • C12N9/14Hydrolases (3)
    • C12N9/16Hydrolases (3) acting on ester bonds (3.1)
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12NMICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
    • C12N9/00Enzymes; Proenzymes; Compositions thereof; Processes for preparing, activating, inhibiting, separating or purifying enzymes
    • C12N9/90Isomerases (5.)
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12PFERMENTATION OR ENZYME-USING PROCESSES TO SYNTHESISE A DESIRED CHEMICAL COMPOUND OR COMPOSITION OR TO SEPARATE OPTICAL ISOMERS FROM A RACEMIC MIXTURE
    • C12P15/00Preparation of compounds containing at least three condensed carbocyclic rings
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12YENZYMES
    • C12Y101/00Oxidoreductases acting on the CH-OH group of donors (1.1)
    • C12Y101/01Oxidoreductases acting on the CH-OH group of donors (1.1) with NAD+ or NADP+ as acceptor (1.1.1)
    • C12Y101/01088Hydroxymethylglutaryl-CoA reductase (1.1.1.88)
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12YENZYMES
    • C12Y203/00Acyltransferases (2.3)
    • C12Y203/03Acyl groups converted into alkyl on transfer (2.3.3)
    • C12Y203/0301Hydroxymethylglutaryl-CoA synthase (2.3.3.10)
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12YENZYMES
    • C12Y207/00Transferases transferring phosphorus-containing groups (2.7)
    • C12Y207/01Phosphotransferases with an alcohol group as acceptor (2.7.1)
    • C12Y207/01036Mevalonate kinase (2.7.1.36)
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12YENZYMES
    • C12Y207/00Transferases transferring phosphorus-containing groups (2.7)
    • C12Y207/04Phosphotransferases with a phosphate group as acceptor (2.7.4)
    • C12Y207/04002Phosphomevalonate kinase (2.7.4.2)
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12YENZYMES
    • C12Y401/00Carbon-carbon lyases (4.1)
    • C12Y401/01Carboxy-lyases (4.1.1)
    • C12Y401/01033Diphosphomevalonate decarboxylase (4.1.1.33), i.e. mevalonate-pyrophosphate decarboxylase
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12YENZYMES
    • C12Y402/00Carbon-oxygen lyases (4.2)
    • C12Y402/03Carbon-oxygen lyases (4.2) acting on phosphates (4.2.3)
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12YENZYMES
    • C12Y503/00Intramolecular oxidoreductases (5.3)
    • C12Y503/03Intramolecular oxidoreductases (5.3) transposing C=C bonds (5.3.3)
    • C12Y503/03002Isopentenyl-diphosphate DELTA-isomerase (5.3.3.2)

Landscapes

  • Chemical & Material Sciences (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • Organic Chemistry (AREA)
  • Health & Medical Sciences (AREA)
  • Engineering & Computer Science (AREA)
  • Zoology (AREA)
  • Wood Science & Technology (AREA)
  • Genetics & Genomics (AREA)
  • Bioinformatics & Cheminformatics (AREA)
  • General Health & Medical Sciences (AREA)
  • Biochemistry (AREA)
  • General Engineering & Computer Science (AREA)
  • Microbiology (AREA)
  • Biotechnology (AREA)
  • Molecular Biology (AREA)
  • Biomedical Technology (AREA)
  • Medicinal Chemistry (AREA)
  • Chemical Kinetics & Catalysis (AREA)
  • General Chemical & Material Sciences (AREA)
  • Preparation Of Compounds By Using Micro-Organisms (AREA)
  • Enzymes And Modification Thereof (AREA)

Abstract

本发明公开了一种生产koraiol的萜类合酶及其应用,属于合成生物学领域。通过提供一种合成koraiol的萜类合酶FGJ09920,其核苷酸序列如SEQ ID NO:2所示,与含有甲羟戊酸途径相关基因一起构建用于生产koraiol的菌株。对甲羟戊酸途径的来源于大肠杆菌XL1‑blue的atoB基因或idi基因过表达,合成大量催化底物法尼基焦磷酸FPP,可以进一步促进生产koraiol。本发明的萜类合酶具有专一性和高效性,可以提高koraiol的产量,极大地克服了原料投入量大而koraiol产率低的弊端,降低了研究成本,并且保证绿色环保。

Figure 201710229084

The invention discloses a terpenoid synthase for producing koraiol and its application, and belongs to the field of synthetic biology. By providing a terpenoid synthase FGJ09920 for synthesizing koraiol, the nucleotide sequence of which is shown in SEQ ID NO: 2, together with the mevalonate pathway-related genes, a strain for producing koraiol is constructed. Overexpression of the atoB gene or idi gene derived from Escherichia coli XL1‑blue in the p-mevalonate pathway can synthesize a large amount of the catalytic substrate farnesyl pyrophosphate FPP, which can further promote the production of koraiol. The terpenoid synthase of the invention has specificity and high efficiency, can increase the yield of koraiol, greatly overcomes the drawbacks of large input of raw materials and low yield of koraiol, reduces research cost, and ensures green environmental protection.

Figure 201710229084

Description

一种生产koraiol的萜类合酶及其应用A kind of terpenoid synthase for producing koraiol and its application

技术领域technical field

本发明属于合成生物学领域,涉及一种生产倍半萜化合物—koraiol的萜类合酶及其应用。The invention belongs to the field of synthetic biology, and relates to a terpenoid synthase for producing a sesquiterpene compound-koraiol and its application.

背景技术Background technique

萜类化合物是含有异戊二烯单元的化合物的总称。它在自然界广泛存在,迄今为止,人们已从动物、植物以及微生物体内发现大约76 000种萜类化合物。该化合物具有许多生理活性,所以被广泛应用于香水生产行业、保健品行业、农业生产领域以及医疗行业。Terpenoids are a general term for compounds containing isoprene units. It exists widely in nature. So far, about 76,000 terpenoids have been found in animals, plants and microorganisms. The compound has many physiological activities, so it is widely used in the perfume production industry, the health care product industry, the agricultural production field and the medical industry.

萜类化合物的生产方法有天然提取、化学合成法和发酵法。但由于天然提取法生产成本居高不下,化学合成法可能的毒害作用和国际上对天然产物的日益推崇,人们开始大量研究利用微生物发酵法生产萜类化合物。The production methods of terpenoids include natural extraction, chemical synthesis and fermentation. However, due to the high production cost of natural extraction method, the possible toxic effects of chemical synthesis method and the increasing respect of natural products in the world, people have begun to study a lot of terpenoids by microbial fermentation.

生物体利用2-甲基-赤藓糖醇磷酸(MEP途径)或者甲羟戊酸途径(MVA途径)的异戊烯焦磷酸异构酶Idi合成IPP以及DMAPP,随后IPP以及DMAPP在异戊烯转移酶催化下合成GPP、FPP,GGPP以及GFPP等不同链长的异戊二烯单元。随后萜类合酶(terpene synthase,TS)可以这些不同链长的异戊二烯单元为底物,合成单萜(C10)、倍半萜(C15)、二萜(C120)、二倍半萜(C25)、三萜(C30)、四萜(C40)以及多萜。Organisms use the isopentenyl pyrophosphate isomerase Idi of the 2-methyl-erythritol phosphate (MEP pathway) or mevalonate pathway (MVA pathway) to synthesize IPP and DMAPP, and then IPP and DMAPP are synthesized in isopentenyl. Transferase catalyzes the synthesis of isoprene units with different chain lengths such as GPP, FPP, GGPP and GFPP. Then terpene synthase (TS) can use these isoprene units of different chain lengths as substrates to synthesize monoterpenes (C10), sesquiterpenes (C15), diterpenes (C120), and disesquiterpenes (C25), triterpenes (C30), tetraterpenes (C40) and polyterpenes.

这其中,单萜和倍半萜为香水及香料化合物的主要来源,对有关这些化合物生物合成基因的挖掘及在萜类化合物微生物合成高产平台中进行代谢工程改造,将使得我们能够高效经济地实现这些高附加值产物的合成。Among them, monoterpenes and sesquiterpenes are the main sources of perfume and fragrance compounds. The mining of biosynthetic genes related to these compounds and the metabolic engineering of terpenoid microbial synthesis high-yield platforms will enable us to efficiently and economically realize Synthesis of these high value-added products.

发明内容SUMMARY OF THE INVENTION

为了克服现有技术的缺点与不足,本发明提供一种新型的萜类合酶FgJ09920及含有该萜类合酶基因的生产koraiol的菌株,以实现倍半萜化合物koraiol微生物合成,提高产量、降低成本,为调制薰衣草、柑桔等香型香精的主要来源,减少毒害作用。In order to overcome the shortcomings and deficiencies of the prior art, the present invention provides a novel terpenoid synthase FgJ09920 and a koraiol-producing strain containing the terpenoid synthase gene, so as to realize the sesquiterpene compound koraiol microbial synthesis, improve yield, reduce The cost is the main source for the preparation of lavender, citrus and other fragrance flavors, and the toxic effect is reduced.

本发明的第一方面,提供一种萜类合酶FgJ09920,其具有SEQ ID NO:1所示的氨基酸序列。The first aspect of the present invention provides a terpenoid synthase FgJ09920, which has the amino acid sequence shown in SEQ ID NO: 1.

进一步地,本发明提供一种核酸分子,其编码上述萜类合酶FgJ09920,核苷酸序列如SEQ ID NO:2所示。Further, the present invention provides a nucleic acid molecule encoding the above-mentioned terpenoid synthase FgJ09920, and the nucleotide sequence is shown in SEQ ID NO: 2.

本发明的第二方面,提供萜类合酶FgJ09920的用途,用于生产koraiol。The second aspect of the present invention provides the use of the terpenoid synthase FgJ09920 for the production of koraiol.

本发明的第三方面,提供一种生产koraiol的菌株,该菌株含有甲羟戊酸途径(MVA途径)和koraiol合成的相关基因;所述的甲羟戊酸途径的相关基因包括(1)将乙酰辅酶A缩合为乙酰乙酰辅酶A的基因atoB、(2)将乙酰辅酶A和乙酰乙酰辅酶A缩合为HMG-CoA的基因erg13、(3)将HMG-CoA还原为甲羟戊酸的基因thmg1、(4)将甲羟戊酸磷酸化为甲羟戊酸-5-磷酸的基因erg12、(5)将甲羟戊酸-5-磷酸磷酸化为甲羟戊酸-5-焦磷酸的基因erg8、(6)将甲羟戊酸-5-焦磷酸脱羧生成异戊烯焦磷酸的基因mvd1和(7)将异戊烯焦磷酸异构为二甲基烯丙基焦磷酸的基因idi;所述的koraiol合成的相关基因包括ispA和萜类合酶FgJ09920基因,所述的FgJ09920基因,其序列如SEQ ID NO:2所示。合成的目标萜类化合物koraiol具有下列式(I)的结构:The third aspect of the present invention provides a strain for producing koraiol, the strain contains genes related to mevalonate pathway (MVA pathway) and koraiol synthesis; the genes related to the mevalonate pathway include (1) The gene atoB that condenses acetyl-CoA to acetoacetyl-CoA, (2) the gene erg13 that condenses acetyl-CoA and acetoacetyl-CoA to HMG-CoA, (3) the gene thmg1 that reduces HMG-CoA to mevalonate , (4) the gene erg12 that phosphorylates mevalonate to mevalonate-5-phosphate, (5) the gene that phosphorylates mevalonate-5-phosphate to mevalonate-5-pyrophosphate erg8, (6) the gene mvd1 that decarboxylates mevalonate-5-pyrophosphate to generate isopentenyl pyrophosphate and (7) the gene idi that isomerizes isopentenyl pyrophosphate to dimethylallyl pyrophosphate; The genes related to the synthesis of koraiol include ispA and terpenoid synthase FgJ09920 gene, and the sequence of the FgJ09920 gene is shown in SEQ ID NO: 2. The synthesized target terpenoid koraiol has the structure of the following formula (I):

所述的甲羟戊酸途径的相关基因优选为来源于大肠杆菌XL1-blue的atoB基因(AM946981.2)、idi(CP010152.1)和来源于酿酒酵母INVSC1的erg13基因(CP005477.2)、tHMG1(CP005464.2)、erg12(CP008027.1)、erg8(CP005426.1)、mvd1(CP005554.2)。The genes related to the mevalonate pathway are preferably atoB gene (AM946981.2), idi (CP010152.1) derived from Escherichia coli XL1-blue, and erg13 gene (CP005477.2) derived from Saccharomyces cerevisiae INVSC1, tHMG1 (CP005464.2), erg12 (CP008027.1), erg8 (CP005426.1), mvd1 (CP005554.2).

所述的萜类合酶FgJ09920基因来源于真菌禾谷镰刀菌J1-012(Fusariumgraminearum J1-012)的FgJ09920,其核苷酸序列如SEQ ID NO:2所示,其氨基酸序列为SEQID NO:1。The terpenoid synthase FgJ09920 gene is derived from FgJ09920 of the fungus Fusarium graminearum J1-012 (Fusarium graminearum J1-012), its nucleotide sequence is shown in SEQ ID NO: 2, and its amino acid sequence is SEQ ID NO: 1 .

优选的,所述的生产koraiol的菌株为含有质粒pMH1、质粒pFZ81和质粒pGB235的大肠杆菌;所述的质粒pMH1以pBBR1MCS为骨架载体、启动子为lac启动子,复制子替换为p15A复制子,包含atoB、erg13和thmg1基因,pMH1的序列(不含骨架载体序列)如SEQ IDNO.3所示;所述的质粒pFZ81以pBBR1MCS-2为骨架载体、启动子为lac启动子、复制子为质粒自带的pBBR1MCS复制子,包含erg12、erg8、mvd1和idi基因,pFZ81的序列(不含骨架载体序列)如SEQ ID NO.4所示;所述的质粒pGB235以pET21为骨架载体、启动子为T7启动子、复制子为质粒自带的高拷贝的pBR322复制子,包含FgJ09920、ispA、idi基因,pGB235的序列(不含骨架载体序列)如SEQ ID NO.5所示。Preferably, the strain for producing koraiol is Escherichia coli containing plasmid pMH1, plasmid pFZ81 and plasmid pGB235; the plasmid pMH1 uses pBBR1MCS as the backbone vector, the promoter is the lac promoter, and the replicon is replaced by the p15A replicon, Comprising atoB, erg13 and thmg1 genes, the sequence of pMH1 (without skeleton vector sequence) is as shown in SEQ ID NO.3; described plasmid pFZ81 uses pBBR1MCS-2 as skeleton vector, promoter as lac promoter, and replicon as plasmid The self-contained pBBR1MCS replicon includes erg12, erg8, mvd1 and idi genes, and the sequence of pFZ81 (without the backbone vector sequence) is shown in SEQ ID NO.4; the plasmid pGB235 takes pET21 as the backbone vector, and the promoter is The T7 promoter and replicon are the high-copy pBR322 replicons that come with the plasmid, including FgJ09920, ispA, and idi genes. The sequence of pGB235 (without the backbone vector sequence) is shown in SEQ ID NO.5.

优选的,所述的生产koraiol的菌株原核表达时过表达来源于大肠杆菌XL1-blue的atoB基因或idi基因,合成大量催化底物法尼基焦磷酸FPP。Preferably, the prokaryotic expression of the koraiol-producing strain overexpresses the atoB gene or the idi gene derived from Escherichia coli XL1-blue to synthesize a large amount of the catalytic substrate farnesyl pyrophosphate FPP.

本发明的第四方面是提供上述生产koraiol的菌株在生产koraiol中的应用。The fourth aspect of the present invention is to provide the use of the above-mentioned koraiol-producing strain in the production of koraiol.

本发明相对于现有技术具有如下优点和效果:本发明首次利用萜类合酶FgJ09920基因结合外源的甲羟戊酸途径获得了稳定生产koraiol的大肠杆菌。本发明的萜类合酶具有专一性和高效性,可以提高koraiol的产量,极大地克服了原料投入量大而koraiol产率低的弊端,降低了研究成本,并且保证绿色环保。Compared with the prior art, the present invention has the following advantages and effects: for the first time, the present invention utilizes the terpenoid synthase FgJ09920 gene to combine with the exogenous mevalonate pathway to obtain Escherichia coli that stably produces koraiol. The terpenoid synthase of the invention has specificity and high efficiency, can increase the yield of koraiol, greatly overcomes the disadvantages of large input of raw materials and low yield of koraiol, reduces research cost, and ensures green environmental protection.

附图说明Description of drawings

图1为GC-MS检测FgJ09920-FPP-体外反应和FgJ09920发酵(pGB235),E.coli K1体外反应色谱图;Figure 1 is the GC-MS detection of FgJ09920-FPP-in vitro reaction and FgJ09920 fermentation (pGB235), E.coli K1 in vitro reaction chromatogram;

图2为质粒pMH1结构示意图;Fig. 2 is a schematic diagram of the structure of plasmid pMH1;

图3为质粒pFZ81结构示意图;Fig. 3 is a schematic diagram of the structure of plasmid pFZ81;

图4为质粒pET28a-FgJ09920结构示意图;Fig. 4 is a schematic diagram of the structure of plasmid pET28a-FgJ09920;

图5为质粒pGB234结构示意图;Fig. 5 is a schematic diagram of the structure of plasmid pGB234;

图6为质粒pGB235结构示意图;Fig. 6 is a schematic diagram of the structure of plasmid pGB235;

图7为菌株发酵产物的MS图;Fig. 7 is the MS figure of bacterial strain fermentation product;

图8为化合物koraiol的谱图;Fig. 8 is the spectrogram of compound koraiol;

其中a为koraiol的结构示意图;b为碳谱图(13C NMR,CDCl3,101MHz);c为氢谱图(1H NMR,CDCl3,400MHz)。a is the structural schematic diagram of koraiol; b is the carbon spectrum ( 13 C NMR, CDCl 3 , 101 MHz); c is the hydrogen spectrum ( 1 H NMR, CDCl 3 , 400 MHz).

具体实施方式Detailed ways

通过以下详细说明结合附图可以进一步理解本发明的特点和优点。所提供的实施例仅是对本发明方法的说明,而不以任何方式限制本发明揭示的其余内容。The features and advantages of the present invention can be further understood from the following detailed description in conjunction with the accompanying drawings. The examples provided are merely illustrative of the methods of the present invention, and are not intended to limit the remainder of the present disclosure in any way.

实施例中未注明具体技术或条件的,按照本领域内的文献所描述的技术或条件或者按照产品说明书进行。所用试剂或仪器未注明生产厂商者,均为可以通过市购获得的常规产品。If no specific technique or condition is indicated in the examples, the technique or condition described in the literature in the field or the product specification is used. The reagents or instruments used without the manufacturer's indication are conventional products that can be obtained from the market.

【实施例1】体外验证萜类合酶的功能[Example 1] Verification of the function of terpenoid synthase in vitro

1、质粒pGB28a-FgJ09920的构建1. Construction of plasmid pGB28a-FgJ09920

以反转录的Fusarium graminearum J1-012的cDNA为模板,用引物P27/P29(见表1)扩增获得的编码区FgJ09920,并连接到质粒pET28a上获得质粒pGB28a-FgJ09920(图4),用于纯化蛋白。Using the reverse transcribed cDNA of Fusarium graminearum J1-012 as a template, the coding region FgJ09920 obtained by amplification with primers P27/P29 (see Table 1) was ligated to the plasmid pET28a to obtain the plasmid pGB28a-FgJ09920 (Figure 4). for purified protein.

2、蛋白的纯化2. Protein purification

将含有目的基因(SEQ ID NO:2所示的核苷酸序列)的表达载体pET28a-FgJ9920转化表达宿主E.coli BL21(DE3),转化后挑取单克隆至含相应抗生素的LB培养基中,37℃,220rpm过夜培养。按1%接种量转接至1L含相应抗生素的新鲜的LB培养基中,37℃,220rpm培养至OD600约为0.6-0.8,降温至16℃,加入终浓度为0.1mM的IPTG,16℃,220rpm培养16-18h。8 000rpm离心5min收集细胞,之后用30-40mL蛋白纯化缓冲液A(Buffer A:50mM Tris-HCl,300mM NaCl,4mMβ-巯基乙醇,pH 7.6)彻底重悬细胞,超声破碎(脉冲5s,停顿8s,超声破碎5min)。4℃,12,000g离心30min以上,收集上清,4℃,20,000rpm离心1h,收集上清,用0.45μm滤膜进行过滤,加入6%buffer B(Buffer:500mM咪唑加入到Buffer A中)使咪唑约为30mM,混匀备用。The expression vector pET28a-FgJ9920 containing the gene of interest (nucleotide sequence shown in SEQ ID NO: 2) was transformed into the expression host E. coli BL21 (DE3), and after the transformation, a single clone was picked into the LB medium containing the corresponding antibiotics , 37 ℃, 220rpm overnight culture. Transfer to 1L of fresh LB medium containing the corresponding antibiotics at 1% inoculum, cultivate at 37°C, 220rpm to an OD 600 of about 0.6-0.8, cool down to 16°C, add IPTG with a final concentration of 0.1mM, 16°C , 220rpm cultured for 16-18h. Cells were collected by centrifugation at 8 000 rpm for 5 min, and then thoroughly resuspended with 30-40 mL of protein purification buffer A (Buffer A: 50 mM Tris-HCl, 300 mM NaCl, 4 mM β-mercaptoethanol, pH 7.6), and sonicated (pulse 5 s, pause 8 s). , sonicated for 5 min). 4°C, centrifuge at 12,000g for more than 30min, collect the supernatant, centrifuge at 4°C, 20,000rpm for 1 h, collect the supernatant, filter with a 0.45μm filter, add 6% buffer B (Buffer: 500mM imidazole is added to Buffer A) to make Imidazole is about 30 mM, mix well for use.

使用Bio-Rad的Biologic DuoFlow Chromatography System纯化组氨酸标签蛋白。蛋白分离柱加载至FPLC上加以控制,FPLC的流速始终为1.5mL/min,而样品自动上样的流速为2mL/min。所得到的上清样品用5mL Hitrap HP Ni-NTA柱子经Biorad做第一步纯化,该镍离子螯合柱先经过30mL(6个柱体积)缓冲液A(Buffer A:50mM Tris-HCl,300mM NaCl,4mMβ-巯基乙醇,pH7.6)的平衡,然后通过自动进样器将准备好的30mL上清液加载到柱子上,再用20mL的缓冲液A(4个柱体积)对柱子进行清洗,这时启动缓冲液B(50mM Tris-HCl,150mM NaCl,250mM Imidazole pH 7.6)的线性梯度,在100mL(20个柱体积)的流量内,缓冲液B由0%增长为100%,再用20mL(4个柱体积)100%的缓冲液B清洗柱子。根据紫外吸收收集并通过SDS-PAGE检测带有组氨酸标签的目的蛋白。挑选比较纯的组分收集起来,通过Millipore公司的离心浓缩管Amicon Centricon-10(分子量在10,000以下的会被滤出)来离心浓缩至2.5mL,然后通过Pharmacia公司的PD-10柱子脱盐并交换到缓冲液C(含有10%甘油的50mM磷酸缓冲液,pH 7.6)中,分装后液氮速冻并保存于-80℃冰箱。Histidine-tagged proteins were purified using Bio-Rad's Biologic DuoFlow Chromatography System. The protein separation column was loaded onto the FPLC for control, and the flow rate of the FPLC was always 1.5 mL/min, while the flow rate of the automatic sample loading was 2 mL/min. The obtained supernatant sample was purified by Biorad with 5mL Hitrap HP Ni-NTA column. The nickel ion chelating column was first passed through 30mL (6 column volumes) buffer A (Buffer A: 50mM Tris-HCl, 300mM NaCl, 4 mM β-mercaptoethanol, pH 7.6), then 30 mL of the prepared supernatant was loaded onto the column by an autosampler, and the column was washed with 20 mL of buffer A (4 column volumes) , then start the linear gradient of buffer B (50mM Tris-HCl, 150mM NaCl, 250mM Imidazole pH 7.6), in the flow of 100mL (20 column volumes), buffer B increases from 0% to 100%, and then use The column was washed with 20 mL (4 column volumes) of 100% buffer B. Histidine-tagged target proteins were collected according to UV absorption and detected by SDS-PAGE. The relatively pure fractions were collected and concentrated to 2.5mL by centrifugation through Millipore's centrifugal concentrator Amicon Centricon-10 (the molecular weight below 10,000 will be filtered out), and then desalted and exchanged through Pharmacia's PD-10 column. into buffer C (50 mM phosphate buffer containing 10% glycerol, pH 7.6), quick-frozen in liquid nitrogen after aliquoting and stored in a -80°C refrigerator.

3、体外催化反应3. In vitro catalytic reaction

我们设立了以下体外酶促反应体系:向200μL终浓度为50mM含有10%甘油的PBbuffer(pH 7.6)缓冲液中添加10μM纯化的蛋白,100μM的底物GPP、FPP或GGPP,以及2mM的Mg2+,30℃过夜反应。随后用等体积的正己烷萃取2次,合并有机相并用GC-MS检测生成的产物。We set up the following in vitro enzymatic reaction system: 10 μM purified protein, 100 μM substrate GPP, FPP or GGPP, and 2 mM Mg were added to 200 μL final concentration of 50 mM PBbuffer (pH 7.6) containing 10% glycerol + , 30℃ overnight reaction. It was then extracted twice with an equal volume of n-hexane, the organic phases were combined and the resulting product was detected by GC-MS.

萜类化合物检测所用的GC-MS为Thermo TRACE GC ULTRA气相色谱配备TSQQUANTUM XLS MS,气相色谱柱为TRACE TR-5MS(30m×0.25mm×0.25um)。每次分析进样1μL,以高纯的氦气为载气,设置流速为1mL/min。GC条件为80℃维持1min,随后以10℃/min的速率升温到220℃,再在220℃维持15min。进样器和传输线温度分别设定为230℃和240℃。The GC-MS used for the detection of terpenoids was a Thermo TRACE GC ULTRA gas chromatography equipped with a TSQQUANTUM XLS MS, and the gas chromatography column was a TRACE TR-5MS (30m×0.25mm×0.25um). 1 μL was injected for each analysis, high-purity helium was used as the carrier gas, and the flow rate was set to 1 mL/min. The GC conditions were maintained at 80°C for 1 min, then ramped to 220°C at a rate of 10°C/min, and maintained at 220°C for 15 min. The injector and transfer line temperatures were set at 230°C and 240°C, respectively.

结果显示,FgJ09920表达的萜类合酶能够以FPP为底物,合成倍半萜产物(图1)。The results showed that the terpenoid synthase expressed by FgJ09920 could use FPP as a substrate to synthesize sesquiterpene products (Figure 1).

【实施例2】构建表达载体[Example 2] Construction of expression vector

用Qiagen公司的Blood and Cell Culture DNA Mini Kit纯化获得大肠杆菌XL1-blue基因组DNA和酿酒酵母INVSC1基因组DNA。Escherichia coli XL1-blue genomic DNA and Saccharomyces cerevisiae INVSC1 genomic DNA were purified with the Blood and Cell Culture DNA Mini Kit of Qiagen Company.

质粒pMH1含有甲羟戊酸途径前三个基因:来源于大肠杆菌XL1-blue的atoB基因(乙酰乙酰辅酶A硫酯酶,AM946981.2),来源于酿酒酵母INVSC1的erg13(HMG-CoAsynthase,CP005477.2)和tHMG1(HMG-CoA还原酶,删除了HMG1的跨膜区域,CP005464.2)。Plasmid pMH1 contains the first three genes of the mevalonate pathway: atoB gene (acetoacetyl-CoA thioesterase, AM946981.2) from Escherichia coli XL1-blue, erg13 from Saccharomyces cerevisiae INVSC1 (HMG-CoAsynthase, CP005477 .2) and tHMG1 (HMG-CoA reductase, deleted transmembrane region of HMG1, CP005464.2).

质粒pFZ81含有甲羟戊酸途径后四个基因:来源于酿酒酵母INVSC1的erg12(甲羟戊酸激酶,CP008027.1),erg8(甲羟戊酸-5-磷酸激酶,CP005426.1)和mvd1(甲羟戊酸-5-焦磷酸激酶,CP005554.2),来源于大肠杆菌XL1-blue的idi(异戊烯焦磷酸异构酶,CP010152.1)基因。Plasmid pFZ81 contains the last four genes of the mevalonate pathway: erg12 (mevalonate kinase, CP008027.1), erg8 (mevalonate-5-phosphate kinase, CP005426.1) and mvd1 from Saccharomyces cerevisiae INVSC1 (Mevalonate-5-pyrophosphate kinase, CP005554.2), derived from the idi (isoamyl pyrophosphate isomerase, CP010152.1) gene of Escherichia coli XL1-blue.

质粒pGB235含有合成倍半萜化合物的三个基因,分别是来源于真菌禾谷镰刀菌J1-012(Fusarium graminearum J1-012)的FgJ09920(SEQ ID NO:2),其氨基酸序列为SEQID NO:1;来源于大肠杆菌XL1-blue的Idi;来源于大肠杆菌XL1-blue的ispA,能够以甲羟戊酸产物异戊二烯焦磷酸(IPP)和二甲烯丙基焦磷酸(DMAPP)为底物合成法尼基焦磷酸,用于倍半萜的合成。Plasmid pGB235 contains three genes for synthesizing sesquiterpene compounds, which are FgJ09920 (SEQ ID NO: 2) derived from the fungus Fusarium graminearum J1-012 (Fusarium graminearum J1-012), and its amino acid sequence is SEQ ID NO: 1 ; Idi derived from Escherichia coli XL1-blue; ispA derived from Escherichia coli XL1-blue, able to base on the mevalonate products isoprene pyrophosphate (IPP) and dimethylallyl pyrophosphate (DMAPP) Synthesis of farnesyl pyrophosphate for the synthesis of sesquiterpenes.

所有基因均通过PCR扩增获得,所用引物见表1。All genes were amplified by PCR, and the primers used are shown in Table 1.

表1引物序列表Table 1 Primer sequence list

Figure BDA0001266106850000061
Figure BDA0001266106850000061

Figure BDA0001266106850000071
Figure BDA0001266106850000071

具体构建方法如下:The specific construction method is as follows:

①质粒pMH1的构建①Construction of plasmid pMH1

首先将pBBR1MCS质粒的复制子替换为来源于pMSD15质粒的p15A复制子。以质粒pBBR1MCS为模板用引物P1/P2进行扩增,同时p15A复制子用引物P3/P4扩增(引物序列见表1),经PCR产物纯化后用Nanodrop测定DNA浓度,然后将20ng pCR扩增的p15A片段和等摩尔的pBBR1MCS片段混合,经过一轮PCR扩增,扩增条件为:98℃,2min预变性,然后30个PCR循环98℃,20s;60℃,20s;72℃,6min,最后72℃充分延伸10min。随后转化大肠杆菌XL1-blue获得质粒pBBR1MCS/p15A。The replicon of the pBBR1MCS plasmid was first replaced with the p15A replicon derived from the pMSD15 plasmid. The plasmid pBBR1MCS was used as a template for amplification with primers P1/P2, and the p15A replicon was amplified with primers P3/P4 (see Table 1 for primer sequences). After the PCR product was purified, the DNA concentration was measured with Nanodrop, and then 20ng of pCR was amplified The p15A fragment and equimolar pBBR1MCS fragment were mixed, and after one round of PCR amplification, the amplification conditions were: 98°C, 2min pre-denaturation, and then 30 PCR cycles of 98°C, 20s; 60°C, 20s; 72°C, 6min, The final extension was fully extended at 72°C for 10 min. Subsequently, E. coli XL1-blue was transformed to obtain plasmid pBBR1MCS/p15A.

用引物P5/P6以pBBR1MCS/p15A为模板扩增pMH1质粒骨架,同时用P7/P8、P9/P10、P11/P12为引物扩增相应基因。经PCR产物纯化之后,取50ngpBBR1MCS/p15A扩增产物和等摩尔的各基因扩增产物混合,并用去离子水调整体积到5μL,随后加入至15μL的Gibson缓冲液中混匀,50℃反应1h后转化大肠杆菌XL1-blue,挑取克隆,并将阳性克隆测序获得质粒pMH1(图2)。The pMH1 plasmid backbone was amplified with primers P5/P6 and pBBR1MCS/p15A as the template, while the corresponding genes were amplified with primers P7/P8, P9/P10 and P11/P12. After the PCR product was purified, 50ng of pBBR1MCS/p15A amplification product was mixed with equimolar amplification products of each gene, and the volume was adjusted to 5 μL with deionized water, then added to 15 μL of Gibson buffer and mixed, and reacted at 50°C for 1 h. E. coli XL1-blue was transformed, clones were picked, and positive clones were sequenced to obtain plasmid pMH1 (Figure 2).

②质粒pFZ81的构建②Construction of plasmid pFZ81

用引物P13/P14以pBBR1MCS-2为模板扩增pFZ81质粒骨架,同时用P15/P16、P17/P18、P19/P20、P21/P22为引物扩增相应基因。经PCR产物纯化之后,取50ng pBBR1MCS-2扩增产物和等摩尔的各基因扩增产物混合,并用去离子水调整体积到5μL,随后加入至15μL的Gibson缓冲液中混匀,50℃反应1h后转化大肠杆菌XL1-blue,挑取克隆,并将阳性克隆测序获得质粒pFZ81(图3)。The pFZ81 plasmid backbone was amplified with primers P13/P14 and pBBR1MCS-2 as the template, while the corresponding genes were amplified with primers P15/P16, P17/P18, P19/P20 and P21/P22. After the PCR product was purified, 50ng of pBBR1MCS-2 amplification product was mixed with equimolar amplification products of each gene, and the volume was adjusted to 5 μL with deionized water, then added to 15 μL of Gibson buffer and mixed, and reacted at 50°C for 1 h. After transformation of E. coli XL1-blue, clones were picked, and positive clones were sequenced to obtain plasmid pFZ81 (Figure 3).

③质粒pGB234的构建③ Construction of plasmid pGB234

以反转录的Fusarium graminearum J1-012的cDNA为模板,用引物P27/P28扩增获得FgJ09920的编码区并连接到质粒pET21a上获得质粒pGB234(图5)。Using the reverse transcribed cDNA of Fusarium graminearum J1-012 as a template, the coding region of FgJ09920 was obtained by amplification with primers P27/P28 and ligated to plasmid pET21a to obtain plasmid pGB234 (Figure 5).

④质粒pGB235的构建④Construction of plasmid pGB235

用引物P23/P24以及P25/P26分别从E.coli BL21(DE3)基因组上扩增ispA以及idi基因,随后将ispA克隆至pET21a获得质粒pGB305;将idi克隆至pET21a(+)获得质粒pGB306。分别用XbaI/XhoI,SpeI/XhoI酶切质粒pGB305和pGB306,随后借助同尾酶将pGB306上酶切下来的idi片段连接至质粒pGB305从而获得质粒pGB308。随后用XbaI/XhoI从pGB308上酶切下来ispA-idi片段并借助同尾酶分别将其连接至质粒pGB234,获得质粒pGB235(图6)。The ispA and idi genes were amplified from E. coli BL21 (DE3) genome with primers P23/P24 and P25/P26, respectively, and then ispA was cloned into pET21a to obtain plasmid pGB305; idi was cloned into pET21a(+) to obtain plasmid pGB306. Plasmids pGB305 and pGB306 were digested with XbaI/XhoI and SpeI/XhoI, respectively, and then the idi fragment from pGB306 was ligated to plasmid pGB305 by isocaudal enzyme to obtain plasmid pGB308. The ispA-idi fragments were subsequently cleaved from pGB308 with XbaI/XhoI and ligated into plasmid pGB234 by isocaudal enzymes to obtain plasmid pGB235 (Fig. 6).

【实施例3】大肠杆菌体内合成FgJ09920来源的倍半萜化合物[Example 3] In vivo synthesis of sesquiterpenoids derived from FgJ09920 in Escherichia coli

为了生产倍半萜化合物,将甲羟戊酸途径的两个质粒pMH1和pFZ81同时转入大肠杆菌BL21(DE3)中获得BL21(DE3)/pMH1/pFZ81,命名为PS,随后将pGB235转化进入菌株PS中,获得菌株K1,随后分别挑取单克隆至10mL的LB培养基中(同时含有100μg/mL氨苄青霉素,50μg/mL卡那霉素和34μg/mL氯霉素),37℃,220rpm过夜培养,随后按1%接种量接种到新鲜的同一培养基中37℃,220rpm继续培养至OD600约为0.6~0.8时,降温至16℃并加入终浓度为0.1mM的IPTG进行诱导表达,诱导表达18h后升温至28℃发酵72h,随后对其进行发酵及产物萃取,收集菌体及发酵液用等体积的正己烷萃取2次,减压蒸馏后甲醇复溶(加入甲醇前先加入少量DMSO助溶),用于产物纯化。In order to produce sesquiterpenoids, two plasmids of the mevalonate pathway, pMH1 and pFZ81, were simultaneously transformed into E. coli BL21(DE3) to obtain BL21(DE3)/pMH1/pFZ81, named PS, and then pGB235 was transformed into the strain In PS, strain K1 was obtained, and then single clones were picked into 10 mL of LB medium (containing 100 μg/mL ampicillin, 50 μg/mL kanamycin and 34 μg/mL chloramphenicol at the same time), 37 ° C, 220 rpm overnight Cultivated, then inoculated into the same fresh medium at 1% inoculum at 37°C, continued to culture at 220rpm until the OD 600 was about 0.6 to 0.8, cooled to 16°C and added IPTG with a final concentration of 0.1mM to induce expression. After expression for 18h, the temperature was raised to 28°C for fermentation for 72h, followed by fermentation and product extraction. The bacteria and fermentation broth were collected and extracted twice with an equal volume of n-hexane. After distillation under reduced pressure, methanol was reconstituted (add a small amount of DMSO before adding methanol). co-solubilization) for product purification.

结果显示,含有FgJ09920的突变株E.coli K1能够以FPP为底物合成保留时间为12.06min的化合物koraiol(图1、7)。The results showed that the mutant strain E. coli K1 containing FgJ09920 could synthesize the compound koraiol with a retention time of 12.06 min using FPP as a substrate (Figures 1 and 7).

【实施例4】化合物鉴定[Example 4] Compound identification

1H NMR和13C NMR结果显示(图8)GC-MS保留时间为12.06min的化合物为一种倍半萜醇koraiol。1H NMR(400MHz,CDCl3)δ2.58(q,J=9.5Hz,1H),2.07–1.99(m,1H),1.98(dd,J=11.5,7.8Hz,1H),1.93(ddd,J=12.9,6.2,1.8Hz,1H),1.83–1.65(m,3H),1.56–1.46(m,3H),1.46–1.37(m,2H),1.35–1.27(m,1H),1.24(d,J=0.9Hz,3H),1.20(d,J=0.8Hz,3H),1.12(s,3H),0.85(s,3H)。13C NMR(101MHz,CDCl3)δ73.99,53.22,50.37,46.27,42.77,42.33,37.62,34.01,32.30,31.04,24.18,23.40,22.28,22.16,18.66。倍半萜醇koraiol的理论分子量为223.2056,高分辨质谱检测结果显示其实际分子量为223.2048。The results of 1 H NMR and 13 C NMR showed (Fig. 8) that the compound with a retention time of 12.06 min by GC-MS was a sesquiterpene alcohol, koraiol. 1 H NMR (400MHz, CDCl 3 ) δ 2.58 (q, J=9.5Hz, 1H), 2.07-1.99 (m, 1H), 1.98 (dd, J=11.5, 7.8Hz, 1H), 1.93 (ddd, J=12.9, 6.2, 1.8Hz, 1H), 1.83–1.65 (m, 3H), 1.56–1.46 (m, 3H), 1.46–1.37 (m, 2H), 1.35–1.27 (m, 1H), 1.24 ( d, J=0.9Hz, 3H), 1.20 (d, J=0.8Hz, 3H), 1.12 (s, 3H), 0.85 (s, 3H). 13 C NMR (101 MHz, CDCl3) δ 73.99, 53.22, 50.37, 46.27, 42.77, 42.33, 37.62, 34.01, 32.30, 31.04, 24.18, 23.40, 22.28, 22.16, 18.66. The theoretical molecular weight of the sesquiterpene alcohol koraiol is 223.2056, and the high-resolution mass spectrometry results show that its actual molecular weight is 223.2048.

SEQUENCE LISTINGSEQUENCE LISTING

<110> 武汉大学<110> Wuhan University

<120> 一种生产koraiol的萜类合酶及其应用<120> A kind of terpenoid synthase for producing koraiol and its application

<160> 34<160> 34

<170> PatentIn version 3.3<170> PatentIn version 3.3

<210> 1<210> 1

<211> 366<211> 366

<212> PRT<212> PRT

<213> 禾谷镰刀菌J1-012<213> Fusarium graminearum J1-012

<400> 1<400> 1

Met Val Pro Ser Ile Ile Thr Pro Pro Pro Ser Cys Ala Gly Gly ProMet Val Pro Ser Ile Ile Thr Pro Pro Pro Ser Cys Ala Gly Gly Pro

1 5 10 151 5 10 15

Ile Ser Pro Ser Ser Ile Cys Ser Asp His Val Asp Ile Pro Glu LeuIle Ser Pro Ser Ser Ile Cys Ser Asp His Val Asp Ile Pro Glu Leu

20 25 30 20 25 30

Ala Asp Gly Arg Trp Ile Arg Leu Pro Glu Ala Leu Phe Ser Ser IleAla Asp Gly Arg Trp Ile Arg Leu Pro Glu Ala Leu Phe Ser Ser Ile

35 40 45 35 40 45

Met Ala Val Glu Pro Asp Val Asn Pro Met Tyr Lys Thr Ser Lys AlaMet Ala Val Glu Pro Asp Val Asn Pro Met Tyr Lys Thr Ser Lys Ala

50 55 60 50 55 60

Leu Ser Asp Ala Trp Leu Lys Asp Ala Leu Arg Met Asn Asp Lys ThrLeu Ser Asp Ala Trp Leu Lys Asp Ala Leu Arg Met Asn Asp Lys Thr

65 70 75 8065 70 75 80

Ala Ser Ile Trp Ser Arg Leu Asp Ile Ala Tyr Met Ser Ala Ile CysAla Ser Ile Trp Ser Arg Leu Asp Ile Ala Tyr Met Ser Ala Ile Cys

85 90 95 85 90 95

Ala Pro Asn Ala Asp Leu Glu Thr Leu Lys Leu Met Asn Asp Trp AsnAla Pro Asn Ala Asp Leu Glu Thr Leu Lys Leu Met Asn Asp Trp Asn

100 105 110 100 105 110

Gly Trp Val Phe Ala Phe Asp Asp Pro Phe Asp Glu Gly Ser Phe AlaGly Trp Val Phe Ala Phe Asp Asp Pro Phe Asp Glu Gly Ser Phe Ala

115 120 125 115 120 125

Asn Asn Pro Ile Lys Ala Ala Glu Glu Val Ile Tyr Thr Leu Ala ThrAsn Asn Pro Ile Lys Ala Ala Glu Glu Val Ile Tyr Thr Leu Ala Thr

130 135 140 130 135 140

Leu Asp Asn Ile His Pro Val Val Ser Pro Asp Gln Asn Pro Leu ArgLeu Asp Asn Ile His Pro Val Val Ser Pro Asp Gln Asn Pro Leu Arg

145 150 155 160145 150 155 160

His Thr Leu Gln Ser Cys Trp Asn Arg Phe Arg Gln Arg Ala Ser ProHis Thr Leu Gln Ser Cys Trp Asn Arg Phe Arg Gln Arg Ala Ser Pro

165 170 175 165 170 175

Ala Leu Gln Tyr Arg Trp Lys Lys His Leu Thr Met Tyr Cys Ile GlyAla Leu Gln Tyr Arg Trp Lys Lys His Leu Thr Met Tyr Cys Ile Gly

180 185 190 180 185 190

Val Leu Gln Gln Val Gly Val Gln Asn Thr Ala Ser Arg Leu Ser ValVal Leu Gln Gln Val Gly Val Gln Asn Thr Ala Ser Arg Leu Ser Val

195 200 205 195 200 205

Glu Glu Tyr Met Asp Met Arg Ala Gly Cys Val Gly Ala Tyr Pro CysGlu Glu Tyr Met Asp Met Arg Ala Gly Cys Val Gly Ala Tyr Pro Cys

210 215 220 210 215 220

Ile Gly Leu Met Glu Phe Ala Glu Gly Ile Asp Leu Pro Gln Asp ValIle Gly Leu Met Glu Phe Ala Glu Gly Ile Asp Leu Pro Gln Asp Val

225 230 235 240225 230 235 240

Met Asp His Pro Ser Leu Glu Ala Ile Ser Arg Ile Thr Cys Asp LeuMet Asp His Pro Ser Leu Glu Ala Ile Ser Arg Ile Thr Cys Asp Leu

245 250 255 245 250 255

Val Thr Leu Gln Asn Asp Leu Cys Ser Tyr Arg Lys Asp Leu Ile GlnVal Thr Leu Gln Asn Asp Leu Cys Ser Tyr Arg Lys Asp Leu Ile Gln

260 265 270 260 265 270

Gly Glu Asp Ser Asn Ile Met Phe Ile Leu Arg Asp Gln Gly Met ThrGly Glu Asp Ser Asn Ile Met Phe Ile Leu Arg Asp Gln Gly Met Thr

275 280 285 275 280 285

Asp Gln Glu Ala Ala Asp Glu Ile Gly Glu Met Leu Tyr Asp Cys TyrAsp Gln Glu Ala Ala Asp Glu Ile Gly Glu Met Leu Tyr Asp Cys Tyr

290 295 300 290 295 300

Arg Arg Trp His Thr Ala Met Ala Asn Leu Pro Phe Trp Gly Glu GlyArg Arg Trp His Thr Ala Met Ala Asn Leu Pro Phe Trp Gly Glu Gly

305 310 315 320305 310 315 320

Val Asp Arg Asp Val Ile Lys Phe Val Asn Gly Cys Arg Asn Ile AlaVal Asp Arg Asp Val Ile Lys Phe Val Asn Gly Cys Arg Asn Ile Ala

325 330 335 325 330 335

Leu Gly Asn Leu His Trp Ser Leu Tyr Thr Phe Arg Tyr Leu Gly AspLeu Gly Asn Leu His Trp Ser Leu Tyr Thr Phe Arg Tyr Leu Gly Asp

340 345 350 340 345 350

Glu Gly Pro Gln Val Lys Lys Thr Arg Met Met Arg Leu ProGlu Gly Pro Gln Val Lys Lys Thr Arg Met Met Arg Leu Pro

355 360 365 355 360 365

<210> 2<210> 2

<211> 1098<211> 1098

<212> DNA<212> DNA

<213> 禾谷镰刀菌J1-012<213> Fusarium graminearum J1-012

<400> 2<400> 2

atggttccga gcatcattac cccgccgccg agctgcgcgg gtggcccgat cagcccgagc 60atggttccga gcatcattac cccgccgccg agctgcgcgg gtggcccgat cagcccgagc 60

agcatttgca gcgaccacgt ggatattccg gagctggcgg atggtcgttg gattcgtctg 120agcatttgca gcgaccacgt ggatattccg gagctggcgg atggtcgttg gattcgtctg 120

ccggaggcgc tgttcagcag catcatggcg gtggaaccgg acgttaaccc gatgtacaag 180ccggaggcgc tgttcagcag catcatggcg gtggaaccgg acgttaaccc gatgtacaag 180

accagcaaag cgctgagcga cgcgtggctg aaggatgcgc tgcgtatgaa cgataaaacc 240accagcaaag cgctgagcga cgcgtggctg aaggatgcgc tgcgtatgaa cgataaaacc 240

gcgagcatct ggagccgtct ggacattgcg tatatgagcg cgatctgcgc gccgaacgcg 300gcgagcatct ggagccgtct ggacattgcg tatatgagcg cgatctgcgc gccgaacgcg 300

gatctggaga ccctgaagct gatgaacgac tggaacggtt gggttttcgc gtttgacgat 360gatctggaga ccctgaagct gatgaacgac tggaacggtt gggttttcgc gtttgacgat 360

ccgttcgacg aaggcagctt tgcgaacaac ccgattaaag cggcggagga agtgatctac 420ccgttcgacg aaggcagctt tgcgaacaac ccgattaaag cggcggagga agtgatctac 420

accctggcga ccctggataa cattcacccg gtggttagcc cggaccagaa cccgctgcgt 480accctggcga ccctggataa cattcacccg gtggttagcc cggaccagaa cccgctgcgt 480

cacaccctgc aaagctgctg gaaccgtttt cgtcagcgtg cgagcccggc gctgcaatac 540cacaccctgc aaagctgctg gaaccgtttt cgtcagcgtg cgagcccggc gctgcaatac 540

cgttggaaga aacacctgac catgtattgc atcggtgttc tgcagcaagt gggcgttcag 600cgttggaaga aacacctgac catgtattgc atcggtgttc tgcagcaagt gggcgttcag 600

aacaccgcga gccgtctgag cgttgaggaa tacatggata tgcgtgcggg ttgcgtgggt 660aacaccgcga gccgtctgag cgttgaggaa tacatggata tgcgtgcggg ttgcgtgggt 660

gcgtatccgt gcattggtct gatggagttc gcggaaggca tcgacctgcc gcaagacgtt 720gcgtatccgt gcattggtct gatggagttc gcggaaggca tcgacctgcc gcaagacgtt 720

atggatcacc cgagcctgga agcgatcagc cgtattacct gcgatctggt gacgctgcag 780atggatcacc cgagcctgga agcgatcagc cgtattacct gcgatctggt gacgctgcag 780

aacgacctgt gcagctatcg taaggatctg attcaaggcg aggacagcaa catcatgttt 840aacgacctgt gcagctatcg taaggatctg attcaaggcg aggacagcaa catcatgttt 840

attctgcgtg accagggtat gaccgatcaa gaagcggcgg acgagatcgg cgaaatgctg 900attctgcgtg accagggtat gaccgatcaa gaagcggcgg acgagatcgg cgaaatgctg 900

tacgattgct atcgtcgttg gcacaccgcg atggcgaacc tgccgttctg gggcgagggc 960tacgattgct atcgtcgttg gcacaccgcg atggcgaacc tgccgttctg gggcgagggc 960

gtggaccgtg atgttattaa atttgtgaac ggttgccgta acatcgcgct gggcaacctg 1020gtggaccgtg atgttattaa atttgtgaac ggttgccgta acatcgcgct gggcaacctg 1020

cactggagcc tgtacacctt ccgttatctg ggtgacgaag gcccgcaggt gaagaaaacc 1080cactggagcc tgtacacctt ccgttatctg ggtgacgaag gcccgcaggt gaagaaaacc 1080

cgtatgatgc gtctgccg 1098cgtatgatgc gtctgccg 1098

<210> 3<210> 3

<211> 4515<211> 4515

<212> DNA<212> DNA

<213> 人工序列<213> Artificial sequences

<400> 3<400> 3

gcgcgtaata cgactcacta tagggcgaat tggagctcca ccgcggtggc ggccgctcta 60gcgcgtaata cgactcacta tagggcgaat tggagctcca ccgcggtggc ggccgctcta 60

gaactagtgg atccttagga tttaatgcag gtgacggacc catctttcaa acgatttata 120gaactagtgg atccttagga tttaatgcag gtgacggacc catctttcaa acgatttata 120

tcagtggcgt ccaaattgtt aggttttgtt ggttcagcag gtttcctgtt gtgggtcata 180tcagtggcgt ccaaattgtt aggttttgtt ggttcagcag gtttcctgtt gtgggtcata 180

tgactttgaa ccaaatggcc ggctgctagg gcagcacata aggataattc acctgccaag 240tgactttgaa ccaaatggcc ggctgctagg gcagcacata aggataattc acctgccaag 240

acggcacagg caactattct tgctaattga cgtgcgttgg taccaggagc ggtagcatgc 300acggcacagg caactattct tgctaattga cgtgcgttgg taccaggagc ggtagcatgc 300

gggcctctta cacctaataa gtccaacatg gcaccttgtg gttctagaac agtaccacca 360gggcctctta cacctaataa gtccaacatg gcaccttgtg gttctagaac agtaccacca 360

ccgatggtac ctacttcgat ggatggcatg gatacggaaa ttctcaaatc accgtccact 420ccgatggtac ctacttcgat ggatggcatg gatacggaaa ttctcaaatc accgtccact 420

tctttcatca atgttataca gttggaactt tcaacatttt gtgcaggatc ttgtcctaat 480tctttcatca atgttataca gttggaactt tcaacatttt gtgcaggatc ttgtcctaat 480

gccaagaaaa cagctgtcac taaattagct gcatgtgcgt taaatccacc aacagaccca 540gccaagaaaa cagctgtcac taaattagct gcatgtgcgt taaatccacc aacagaccca 540

gccattgcag atccaaccaa attcttagca atgttcaact caaccaatgc ggaaacatca 600gccattgcag atccaaccaa attcttagca atgttcaact caaccaatgc ggaaacatca 600

ctttttaaca cttttctgac aacatcacca ggaatagtag cttctgcgac gacactctta 660ctttttaaca cttttctgac aacatcacca ggaatagtag cttctgcgac gacactctta 660

ccacgacctt cgatccagtt gatggcagct ggtttttttg tcggtacagt agttaccaga 720ccacgacctt cgatccagtt gatggcagct ggttttttttg tcggtacagt agttaccaga 720

aacggagaca acctccatat cttcccagcc atactcttct accatttgct ttaatgagta 780aacggagaca acctccatat cttcccagcc atactcttct accatttgct ttaatgagta 780

ttcgacacct ttagaaatca tattcatacc cattgcgtca ccagtagttg ttctaaatct 840ttcgacacct ttagaaatca tattcatacc cattgcgtca ccagtagttg ttctaaatct 840

catgaagagt aaatctcctg ctagacaagt ttgaatatgt tgcagacgtg caaatcttga 900catgaagagt aaatctcctg ctagacaagt ttgaatatgt tgcagacgtg caaatcttga 900

tgtagagtta aaagcttttt taattgcgtt ttgtccctct tctgagtcta accatatctt 960tgtagagtta aaagcttttt taattgcgtt ttgtccctct tctgagtcta accatatctt 960

acaggcacca gatcttttca aagttgggaa acggactact gggcctcttg tcataccatc 1020acaggcacca gatcttttca aagttgggaa acggactact gggcctcttg tcataccatc 1020

cttagttaaa acagttgttg caccaccgcc agcattgatt gccttacagc cacgcatggc 1080cttagttaaa acagttgttg caccaccgcc agcattgatt gccttacagc cacgcatggc 1080

agaagctacc aaacaaccct ctgtagttgc cattggtata tgataagatg taccatcgat 1140agaagctacc aaacaaccct ctgtagttgc cattggtata tgataagatg taccatcgat 1140

aaccaagggg cctataacac caacgggcaa aggcatgtaa cctataacat tttcacaaca 1200aaccaagggg cctataacac caacgggcaa aggcatgtaa cctataacat tttcacaaca 1200

agcgccaaat acgcggtcgt agtcataatt tttatatggt aaacgatcag atgctaatac 1260agcgccaaat acgcggtcgt agtcataatt tttatatggt aaacgatcag atgctaatac 1260

aggagcttct gccaaaattg aaagagcctt cctacgtacc gcaaccgctc tcgtagtatc 1320aggagcttct gccaaaattg aaagagcctt cctacgtacc gcaaccgctc tcgtagtatc 1320

acctaatttt ttctccaaag cgtacaaagg taacttaccg tgaataacca aggcagcgac 1380acctaatttt ttctccaaag cgtacaaagg taacttaccg tgaataacca aggcagcgac 1380

ctctttgttc ttcaattgtt ttgtatttcc actacttaat aatgcttcta attcttctaa 1440ctctttgttc ttcaattgtt ttgtatttcc actacttaat aatgcttcta attcttctaa 1440

aggacgtatt ttcttatcca agctttcaat atcgcgggaa tcatcttcct cactagatga 1500aggacgtatt ttcttatcca agctttcaat atcgcgggaa tcatcttcct cactagatga 1500

tgaaggtcct gatgagctcg attgcgcaga tgataaactt ttgactttcg atccagaaat 1560tgaaggtcct gatgagctcg attgcgcaga tgataaactt ttgactttcg atccagaaat 1560

gactgtttta ttggttaaaa cgaattcgga tccgcgaccc atttgctgtc caccagtcat 1620gactgtttta ttggttaaaa cgaattcgga tccgcgaccc atttgctgtc caccagtcat 1620

gctagccata tggctgccgc gcggcaccag gccgctgctg tgatgatgat gatgatggct 1680gctagccata tggctgccgc gcggcaccag gccgctgctg tgatgatgat gatgatggct 1680

gctgcccata gtgtaatcct ccttattttt taacatcgta agatcttcta aatttgtcat 1740gctgcccata gtgtaatcct ccttattttt taacatcgta agatcttcta aatttgtcat 1740

cgatgttggt caagtagtaa acaccacttt gcaaatgctc aatggaacct tgaggtttga 1800cgatgttggt caagtagtaa acaccacttt gcaaatgctc aatggaacct tgaggtttga 1800

agttcttctt caaatgggca ttttctctca attcgatggc agcttcgtaa tcctttggag 1860agttcttctt caaatgggca ttttctctca attcgatggc agcttcgtaa tcctttggag 1860

tttcggtgat tctcttggct aatttgttag taatatctaa ttccttgata atatgttgga 1920tttcggtgat tctcttggct aatttgttag taatatctaa ttccttgata atatgttgga 1920

cgtcaccaac aattttgcaa gaatatagag atgcagctaa accggaaccg taagaaaata 1980cgtcaccaac aattttgcaa gaatatagag atgcagctaa accggaaccg taagaaaata 1980

aaccaacacg cttgccttgt aagtcgtcag atccaacata gtttaataga gatgcaaagg 2040aaccaacacg cttgccttgt aagtcgtcag atccaacata gtttaataga gatgcaaagg 2040

cggcataaac agatgcggtg tacatgttac ctgtgtttgt tggaacaatc aaagattggg 2100cggcataaac agatgcggtg tacatgttac ctgtgtttgt tggaacaatc aaagattggg 2100

caactctctc tttgtggaat ggcttagcaa cattaacaaa agttttttca atgttcttat 2160caactctctc tttgtggaat ggcttagcaa cattaacaaa agttttttca atgttcttat 2160

cggttaaaga ttcgtcataa tcgcgagtag ctaattcggc gtcaacttct gggaacaatt 2220cggttaaaga ttcgtcataa tcgcgagtag ctaattcggc gtcaacttct gggaacaatt 2220

gaggattggc tctgaaatcg ttatatagta atctaccgta tgattttgtg accaatttac 2280gaggattggc tctgaaatcg ttatatagta atctaccgta tgattttgtg accaatttac 2280

aggttggaac atggaaaacg ttgtagtcga aatatttcaa aacgttcaaa gcatccgaac 2340aggttggaac atggaaaacg ttgtagtcga aatatttcaa aacgttcaaa gcatccgaac 2340

cagcgggatc gctaaccaac cctttagaaa tagccttctt ggaataactc ttgtaaactt 2400cagcgggatc gctaaccaac cctttagaaa tagccttctt ggaataactc ttgtaaactt 2400

gatcaagagc cttgacgtaa caagttaatg aaaaatgacc atcgacgtaa ggatattcgc 2460gatcaagagc cttgacgtaa caagttaatg aaaaatgacc atcgacgtaa ggatattcgc 2460

tggtgaaatc tggcttgtaa aaatcgtagg cgtgttccat gtaagaagct cttacagagt 2520tggtgaaatc tggcttgtaa aaatcgtagg cgtgttccat gtaagaagct cttacagagt 2520

caaatacaat tggagcatca ggaccgatcc acatagcaac agtaccggca ccaccggttg 2580caaatacaat tggagcatca ggaccgatcc acatagcaac agtaccggca ccaccggttg 2580

gtcttgcggc acccttatcg tagatggcaa tatcaccgca aactacaatg gcgtctctac 2640gtcttgcggc acccttatcg tagatggcaa tatcaccgca aactacaatg gcgtctctac 2640

catcccatgc gttagattca atccagttca aagagttgaa caacgcgttg gtaccaccgt 2700catcccatgc gttagattca atccagttca aagagttgaa caacgcgttg gtaccaccgt 2700

aacaggcatt aagcgtgtca ataccttcga cgtcagtgtt ttcaccaaac aattgcatca 2760aacaggcatt aagcgtgtca ataccttcga cgtcagtgtt ttcaccaaac aattgcatca 2760

agacagactt gacagacttg gacttgtcaa tcagagtttc agtaccgact tctaatctac 2820agacagactt gacagacttg gacttgtcaa tcagagtttc agtaccgact tctaatctac 2820

caattttgtt ggtgtcgatg ttgtaactct tgatcaactt agacaaaaca gttagggaca 2880caattttgtt ggtgtcgatg ttgtaactct tgatcaactt agacaaaaca gttagggaca 2880

tcgagtagat atcttctctg tcattgacaa aagacatgtt ggtttggccc agaccaattg 2940tcgagtagat atcttctctg tcattgacaa aagacatgtt ggtttggccc agaccaattg 2940

tgtatttacc ttgagaaacg ccatcaaatt tctctagctc agattggttg acacattgag 3000tgtatttacc ttgagaaacg ccatcaaatt tctctagctc agattggttg acacattgag 3000

ttgggatgta aatttggata cctttaatac cgacattttg aggtctggtt ttttgttcag 3060ttgggatgta aatttggata cctttaatac cgacattttg aggtctggtt ttttgttcag 3060

cggtcttttg tttttttagt tcagtcattt gcaagtttgt attgtgtaat tgttgttgct 3120cggtcttttg tttttttagt tcagtcattt gcaagtttgt attgtgtaat tgttgttgct 3120

tttgcggcct aagtcttcct ttaataccac accaacaaag tttagttgag agtttcattt 3180tttgcggcct aagtcttcct ttaataccac accaacaaag tttagttgag agtttcattt 3180

agctgtcctc cttaattcaa ccgttcaatc accatcgcaa ttccctgacc gccgccaatg 3240agctgtcctc cttaattcaa ccgttcaatc accatcgcaa ttccctgacc gccgccaatg 3240

cacagtgttg ccagccccag cgttttatcg cgtgcctgca tggcatgtaa tagtgtgacc 3300cacagtgttg ccagccccag cgttttatcg cgtgcctgca tggcatgtaa tagtgtgacc 3300

agaatacgag caccactggc accgatagga tgcccgagcg cgatggcccc gccgttgaca 3360agaatacgag caccactggc accgatagga tgcccgagcg cgatggcccc gccgttgaca 3360

ttcactttct cagaatcaaa gcccaggttt ttcccaacgg caaggaactg tgcagcaaat 3420ttcactttct cagaatcaaa gcccaggttt ttcccaacgg caaggaactg tgcagcaaat 3420

gcttcattag cctcaatgag atcaatatcc gccagttgca gccccgccag ttgtaacgct 3480gcttcattag cctcaatgag atcaatatcc gccagttgca gccccgccag ttgtaacgct 3480

ttttgcgtgg caggtactgg ccccataccc atcaatgcgg ggggcacgcc accgctggca 3540ttttgcgtgg caggtactgg ccccataccc atcaatgcgg ggggcacgcc accgctggca 3540

taacttttaa tgcgagccag gggggtaagg cctgctgcca gcgccgcaga ttcttccata 3600taacttttaa tgcgagccag gggggtaagg cctgctgcca gcgccgcaga ttcttccata 3600

atcaccagag cggcagcacc gtcgttaata ccagacgcgt tcccagcggt gactgttcct 3660atcaccagag cggcagcacc gtcgttaata ccagacgcgt tcccagcggt gactgttcct 3660

gctttatcga aggccgggcg caatgcacct aacgcttcag ccgttgaatt cgctttcggg 3720gctttatcga aggccgggcg caatgcacct aacgcttcag ccgttgaatt cgctttcggg 3720

aattcgtctt gactgaagac gaaggttttc tttcgagtga caacatttac cgggacgatt 3780aattcgtctt gactgaagac gaaggttttc tttcgagtga caacatttac cgggacgatt 3780

tcggctgtaa aagcaccgga ctcaattgcg gctgccgctt tacgctgtga atgtagcgcc 3840tcggctgtaa aagcaccgga ctcaattgcg gctgccgctt tacgctgtga atgtagcgcc 3840

agttcatcct gcatttcacg ggtaattccg tactctttag ccacgttttc ggcggtaatc 3900agttcatcct gcatttcacg ggtaattccg tactctttag ccacgttttc ggcggtaatc 3900

cccatatgat aaccatgggt ggcgcacatc aggccatcgc gcaggattac gtcataaacc 3960cccatatgat aaccatgggt ggcgcacatc aggccatcgc gcaggattac gtcataaacc 3960

tgtccgtctc caagacgata accagagcgt gcttttgcat cgagtaagta gggggctaaa 4020tgtccgtctc caagacgata accagagcgt gcttttgcat cgagtaagta gggggctaaa 4020

ctcatatttt ccataccccc cgccacaatg ctctgcgcct gacctgcctg aatggcctgg 4080ctcatatttt ccataccccc cgccacaatg ctctgcgcct gacctgcctg aatggcctgg 4080

gcggcaagcg ccacactttt aagacccgaa ccacatactt tattgaccgt gaatccgcac 4140gcggcaagcg ccacactttt aagacccgaa ccacatactt tattgaccgt gaatccgcac 4140

accgtttctg ccagcccgct ttttaacagt gcctgacgcg ccggattttg ccccagcccg 4200accgtttctg ccagcccgct ttttaacagt gcctgacgcg ccggattttg ccccagcccg 4200

gcttgtaaca cgttacccat aatcacttca tcaacgtgtt gtgaatcgat ttttgcacgt 4260gcttgtaaca cgttacccat aatcacttca tcaacgtgtt gtgaatcgat ttttgcacgt 4260

tcaatggcgg ctttaattac tgtcgccccc aggtcgatgg cgctggtgga agcgagtgaa 4320tcaatggcgg ctttaattac tgtcgccccc aggtcgatgg cgctggtgga agcgagtgaa 4320

ccgttaaaac taccgatagc agtacgtacc gcactgacga tgacacaatt tttcatttta 4380ccgttaaaac taccgatagc agtacgtacc gcactgacga tgacacaatt tttcatttta 4380

tattcctcct agtcgactct agaggatccc cgggctgcag gaattcgata tcaagcttat 4440tattcctcct agtcgactct agaggatccc cgggctgcag gaattcgata tcaagcttat 4440

cgataccgtc gacctcgagg gggggcccgg tacccagctt ttgttccctt tagtgagggt 4500cgataccgtc gacctcgagg gggggcccgg tacccagctt ttgttccctt tagtgagggt 4500

taattgcgcg ctggg 4515taattgcgcg ctggg 4515

<210> 4<210> 4

<211> 4667<211> 4667

<212> DNA<212> DNA

<213> 人工序列<213> Artificial sequences

<400> 4<400> 4

cgcgcgtaat acgactcact atagggcgaa ttggagctct tatttaagct gggtaaatgc 60cgcgcgtaat acgactcact atagggcgaa ttggagctct tatttaagct gggtaaatgc 60

agataatcgt tttctggctt cgcgatttgt cgcctgcatc accatccacg gactgaacgc 120agataatcgt tttctggctt cgcgatttgt cgcctgcatc accatccacg gactgaacgc 120

ccacggcgtg gcatcaatac cgtgtaatac atctgctaaa tcacaccatt gataatccat 180ccacggcgtg gcatcaatac cgtgtaatac atctgctaaa tcacaccatt gataatccat 180

cacttcatca tcattgatct gtaacgcact agtggtgcgt gcggcaaata ccggacacac 240cacttcatca tcattgatct gtaacgcact agtggtgcgt gcggcaaata ccggacacac 240

ttcattttcc acaatgccac tcggatcggt ggcgcggtag cgaaagtcag gatagataga 300ttcattttcc acaatgccac tcggatcggt ggcgcggtag cgaaagtcag gatagataga 300

ttcaggaggc gtaatttcca cgccaagctc ataacggcaa cggcggatca ctgcgtcttc 360ttcaggaggc gtaatttcca cgccaagctc ataacggcaa cggcggatca ctgcgtcttc 360

gttgctttct cccagttgtg ggtgcccaca aaccgagtta gtccacacgc caggccatgc 420gttgctttct cccagttgtg ggtgcccaca aaccgagtta gtccacacgc caggccatgc 420

ttttttgctc agtgcgcggc gggtaactaa taattgtcct ttggcattaa acagccaact 480ttttttgctc agtgcgcggc gggtaactaa taattgtcct ttggcattaa acagccaact 480

ggagaacgcg agatgtaagc gggtgtctgc cgtgtgtgcg gcatactttt ccagcgtacc 540ggagaacgcg agatgtaagc gggtgtctgc cgtgtgtgcg gcatactttt ccagcgtacc 540

cgtgggaact ccctgtgcat tcaataaaat gacgtgttcc gtttgcatat ggctgccgcg 600cgtgggaact ccctgtgcat tcaataaaat gacgtgttcc gtttgcatat ggctgccgcg 600

cggcaccagg ccgctgctgt gatgatgatg atgatggctg ctgcccatat agtaatcctc 660cggcaccagg ccgctgctgt gatgatgatg atgatggctg ctgcccatat agtaatcctc 660

ctcccgggct gcagttattc ctttggtaga ccagtctttg cgtcaatcaa agattcgttt 720ctcccgggct gcagttattc ctttggtaga ccagtctttg cgtcaatcaa agattcgttt 720

gtttcttgtg ggcctgaacc gacttgagtt aaaatcactc tggcaacatc cttttgcaac 780gtttcttgtg ggcctgaacc gacttgagtt aaaatcactc tggcaacatc cttttgcaac 780

tcaagatcca attcacgtgc agtaaagtta gatgattcaa attgatggtt gaaagcctca 840tcaagatcca attcacgtgc agtaaagtta gatgattcaa attgatggtt gaaagcctca 840

agctgctcag tagtaaattt cttgtcccat ccaggaacag agccaaacaa tttatagata 900agctgctcag tagtaaattt cttgtcccat ccaggaacag agccaaacaa tttatagata 900

aatgcaaaga gtttcgactc attttcagct aagtagtaca acacagcatt tggacctgca 960aatgcaaaga gtttcgactc attttcagct aagtagtaca acacagcatt tggacctgca 960

tcaaacgtgt atgcaacgat tgtttctccg taaaactgat taatggtgtg gcaccaactg 1020tcaaacgtgt atgcaacgat tgtttctccg taaaactgat taatggtgtg gcaccaactg 1020

atgatacgct tggaagtgtc attcatgtag aatattggag ggaaagagtc caaacatgtg 1080atgatacgct tggaagtgtc attcatgtag aatattggag ggaaagagtc caaacatgtg 1080

gcatggaaag agttggaatc catcattgtt tcctttgcaa aggtggcgaa atctttttca 1140gcatggaaag agttggaatc catcattgtt tcctttgcaa aggtggcgaa atctttttca 1140

acaatggctt tacgcatgac ttcaaatctc tttggtacga catgttcaat tctttcttta 1200acaatggctt tacgcatgac ttcaaatctc tttggtacga catgttcaat tctttcttta 1200

aatagttcgg aggttgccac ggtcaattgc ataccctgag tggaactcac atccttttta 1260aatagttcgg aggttgccac ggtcaattgc ataccctgag tggaactcac atccttttta 1260

atatcgctga caactaggac acaagctttc atctgaggcc agtcagagct gtctgcgatt 1320atatcgctga caactaggac acaagctttc atctgaggcc agtcagagct gtctgcgatt 1320

tgtactgcca tggaatcatg accatcttca gcttttccca tttcccaggc cacgtatccg 1380tgtactgcca tggaatcatg accatcttca gcttttccca tttcccaggc cacgtatccg 1380

ccaaacaacg atctacaagc tgaaccagac ccctttcttg ctattctaga tatttctgaa 1440ccaaacaacg atctacaagc tgaaccagac ccctttcttg ctattctaga tatttctgaa 1440

gttgactgtg gtaattggta taacttagca attgcagaga ccaatgcagc aaagccagca 1500gttgactgtg gtaattggta taacttagca attgcagaga ccaatgcagc aaagccagca 1500

gcggaggaag ctaaaccagc tgctgtagga aagttatttt cggagacaat gtggagtttc 1560gcggaggaag ctaaaccagc tgctgtagga aagttatttt cggagacaat gtggagtttc 1560

cattgagata atgtgggcaa tgaggcgtcc ttcgattcca tttcctttct taattggcgt 1620cattgagata atgtgggcaa tgaggcgtcc ttcgattcca tttcctttct taattggcgt 1620

aggtcgcgca gacaattttg agttctttca ttgtcgatgc tgtgtggttc tccatttaac 1680aggtcgcgca gacaattttg agttctttca ttgtcgatgc tgtgtggttc tccatttaac 1680

cacaaagtgt cgcgttcaaa ctcaggtgca gtagccgcag aggtcaacgt tctgaggtca 1740cacaaagtgt cgcgttcaaa ctcaggtgca gtagccgcag aggtcaacgt tctgaggtca 1740

tcttgcgata aagtcactga tatggacgaa ttggtgggca gattcaactt cgtgtccctt 1800tcttgcgata aagtcactga tatggacgaa ttggtgggca gattcaactt cgtgtccctt 1800

ttcccccaat acttaagggt tgcgatgttg acgggtgcgg taacggatgc tgtgtaaacg 1860ttcccccaat acttaagggt tgcgatgttg acgggtgcgg taacggatgc tgtgtaaacg 1860

gtcatgagta ttacctccta tttatcaaga taagtttccg gatctttttc tttcctaaca 1920gtcatgagta ttacctccta tttatcaaga taagtttccg gatctttttc tttcctaaca 1920

ccccagtcag cctgagttac atccagccat tgaaccttag aaaatctttt gtcattagcg 1980ccccagtcag cctgagttac atccagccat tgaaccttag aaaatctttt gtcattagcg 1980

gtttgagccc taagatcaac atcttgctta gtaatcactg caatggcgtc ataaccacca 2040gtttgagccc taagatcaac atcttgctta gtaatcactg caatggcgtc ataaccacca 2040

gcaccaggta ttaagcaagt aagaactcct tttaaggtct ggcaatcatc caataagcta 2100gcaccaggta ttaagcaagt aagaactcct tttaaggtct ggcaatcatc caataagcta 2100

gtttgtacgg gaggttcgat atcggcacca gattctttag ttatttttct aaaggaacgt 2160gtttgtacgg gaggttcgat atcggcacca gattctttag ttatttttct aaaggaacgt 2160

ctaattgtgg caactgcatc tctaacttct gtgatttcag gatacttttg acaggtacag 2220ctaattgtgg caactgcatc tctaacttct gtgatttcag gatacttttg acaggtacag 2220

tcattcctct caagagactc aaatatctga tcgctgtaat cgtcatgagt ctcgtgtaag 2280tcattcctct caagagactc aaatatctga tcgctgtaat cgtcatgagt ctcgtgtaag 2280

cgatctagtt tagatagtcc atccataaat ctagaatttg catgatcgag ttctgtatat 2340cgatctagtt tagatagtcc atccataaat ctagaatttg catgatcgag ttctgtatat 2340

attttcaagc tttctggcat atgcgaatca taccaatttt ttaccttctg gaccagtttt 2400attttcaagc tttctggcat atgcgaatca taccaatttt ttaccttctg gaccagtttt 2400

actgtttctg aaccattctt aatatcgccc atccataaag ttaatcccga aggtaaatgg 2460actgtttctg aaccattctt aatatcgccc atccataaag ttaatcccga aggtaaatgg 2460

ttacttttaa tcgtaatatt ccagtcttct tcatcaacca aatgcgccag tttactgccg 2520ttacttttaa tcgtaatatt ccagtcttct tcatcaacca aatgcgccag tttactgccg 2520

taagtagcac ttccaatatc tggcaaatta gagattaatg cgggtgggaa tcttctatat 2580taagtagcac ttccaatatc tggcaaatta gagattaatg cgggtgggaa tcttctatat 2580

ctgatagatc catatgctgc cgccgctaca tcaaacccgc ttccaatttt accctgagct 2640ctgatagatc catatgctgc cgccgctaca tcaaacccgc ttccaatttt accctgagct 2640

tgacaatgag caacttgtgc taaattatga ataacttctc tatatttgtc tacattattt 2700tgacaatgag caacttgtgc taaattatga ataacttctc tatatttgtc tacattattt 2700

tccaggtccg atacaaaaaa ggaggccaaa gctgtagtta aaactgtgac taaacctgcc 2760tccaggtccg atacaaaaaa ggaggccaaa gctgtagtta aaactgtgac taaacctgcc 2760

gaggagccca gccctgtttt gggaacttct tcaattctgt gcgaatgaaa actcaatctt 2820gaggagccca gccctgtttt gggaacttct tcaattctgt gcgaatgaaa actcaatctt 2820

ctgttgccac gatgttcggt aacgctatcc tcctgagaat ggtaggcatc atcagagaaa 2880ctgttgccac gatgttcggt aacgctatcc tcctgagaat ggtaggcatc atcagagaaa 2880

atatcaataa cgaacaagtt tctattgcag tagtcgtcca tgttaggttt aaagtagcta 2940atatcaataa cgaacaagtt tctattgcag tagtcgtcca tgttaggttt aaagtagcta 2940

aatacgttag cgataacttt ttcaatgaaa gggttcttag atccgcctat cgaaacagga 3000aatacgttag cgataacttt ttcaatgaaa gggttcttag atccgcctat cgaaacagga 3000

atgaagccac ttttaggact tatatggtac agccactccc catctttaaa ttgtttactt 3060atgaagccac ttttaggact tatatggtac agccactccc catctttaaa ttgtttactt 3060

ttcacacgca cttcaaactt atcagaccct tgcaatgaac cgtaaggatg ggctacagca 3120ttcacacgca cttcaaactt atcagaccct tgcaatgaac cgtaaggatg ggctacagca 3120

tgcattcttg ccgataatcc gactacaaat gcttcatatt ttgtatctaa aactaaatat 3180tgcattcttg ccgataatcc gactacaaat gcttcatatt ttgtatctaa aactaaatat 3180

ccaccagcta gtaacgcttt ccctggggca ctgaaggctc tcaactctga catttgatct 3240ccaccagcta gtaacgcttt ccctggggca ctgaaggctc tcaactctga catttgatct 3240

gcctcctatg aagtccatgg taaattcgtg tttcctggca ataatagatc gtcaatttgt 3300gcctcctatg aagtccatgg taaattcgtg tttcctggca ataatagatc gtcaatttgt 3300

tgctttgtgg tagttttatt ttcaaataat tggaatacta gggatttgat tttaggatct 3360tgctttgtgg tagttttatt ttcaaataat tggaatacta gggatttgat tttaggatct 3360

ttattcaaat tttttgcgct taacaaacag cagccagtcc cacccaagtc tgtttcaaat 3420ttattcaaat tttttgcgct taacaaacag cagccagtcc cacccaagtc tgtttcaaat 3420

gtctcgtaac taaaatcatc ttgcaatttc tttttgaaac tgtcaatttg ctcttgagta 3480gtctcgtaac taaaatcatc ttgcaatttc tttttgaaac tgtcaatttg ctcttgagta 3480

atgtctcttc gtaacaaagt caaagagcaa ccgccgccac cagcaccggt aagttttgtg 3540atgtctcttc gtaacaaagt caaagagcaa ccgccgccac cagcaccggt aagttttgtg 3540

gagccaattc tcaaatcatc gctcagattt ttaataagtt ctaatccagg atgagaaaca 3600gagccaattc tcaaatcatc gctcagattt ttaataagtt ctaatccagg atgagaaaca 3600

ccgattgaga caagcagtcc atgatttatt cttatcaatt ccaatagttg ttcatacagt 3660ccgattgaga caagcagtcc atgatttatt cttatcaatt ccaatagttg ttcatacagt 3660

tcattattag tttctacggc ctcgtcatcg gtgcctttac atttacttaa ctttgtcatg 3720tcattattag tttctacggc ctcgtcatcg gtgcctttac atttacttaa ctttgtcatg 3720

atctctaagc cttgtagggc acattcaccc atggcatcta gaattggctt cataacttca 3780atctctaagc cttgtagggc acattcaccc atggcatcta gaattggctt cataacttca 3780

ggaaatttct cggtgaccaa cacacgaacg cgagcaacaa gatcttttgt agaccttgga 3840ggaaatttct cggtgaccaa cacacgaacg cgagcaacaa gatcttttgt agaccttgga 3840

attctagtat aggttaggat cattggaatg gctgggaaat catctaagaa cttaaaattg 3900attctagtat aggttaggat cattggaatg gctgggaaat catctaagaa cttaaaattg 3900

tttgtattta ttgttccatt atgtgagtct ttttcaaata gcagggcatt accataagtg 3960tttgtattta ttgttccatt atgtgagtct ttttcaaata gcagggcatt accataagtg 3960

gccacagcgt tatctattcc tgaaggggta ccgtgaatac acttttcacc tatgaaggcc 4020gccacagcgt tatctattcc tgaaggggta ccgtgaatac acttttcacc tatgaaggcc 4020

cattgattca ctatatgctt atcgttttct gacagctttt ccaagtcatt agatcctatt 4080cattgattca ctatatgctt atcgttttct gacagctttt ccaagtcatt agatcctatt 4080

aaccccccca agtaggccat agctaaggcc agtgatacag aaatagaggc gcttgagccc 4140aaccccccca agtaggccat agctaaggcc agtgatacag aaatagaggc gcttgagccc 4140

aacccagcac cgatgggtaa agtagacttt aaagaaaact taatattctt ggcatggggg 4200aacccagcac cgatgggtaa agtagacttt aaagaaaact taatattctt ggcatggggg 4200

cataggcaaa caaacatata caggaaacaa aacgctgcat ggtagtggaa ggattcggat 4260cataggcaaa caaacatata caggaaacaa aacgctgcat ggtagtggaa ggattcggat 4260

agttgagcta acaacggatc caaaagacta acgagttcct gagacaagcc atcggtggct 4320agttgagcta acaacggatc caaaagacta acgagttcct gagacaagcc atcggtggct 4320

tgttgagcct tggccaattt ttgggagttt acttgatcct cggtgatggc attgaaatca 4380tgttgagcct tggccaattt ttgggagttt acttgatcct cggtgatggc attgaaatca 4380

ttgatggacc acttatgatt aaagctaatg tccgggaagt ccaattcaat agtatctggt 4440ttgatggacc acttatgatt aaagctaatg tccgggaagt ccaattcaat agtatctggt 4440

gcagatgact cgcttattag caggtaggtt ctcaacgcag acacactagc agcgacggca 4500gcagatgact cgcttattag caggtaggtt ctcaacgcag acacactagc agcgacggca 4500

ggcttgttgt acacagcaga gtgttcacca aaaataataa cctttcccgg tgcagaagtt 4560ggcttgttgt acacagcaga gtgttcacca aaaataataa cctttcccgg tgcagaagtt 4560

aagaacggta atgacatggt taattcctcc tactgcagga attcgatatc aagcttatcg 4620aagaacggta atgacatggt taattcctcc tactgcagga attcgatatc aagcttatcg 4620

ataccgtcga cctcgagggg gggcccggta cccagctttt gttcccc 4667ataccgtcga cctcgagggg gggcccggta cccagctttt gttcccc 4667

<210> 5<210> 5

<211> 3052<211> 3052

<212> DNA<212> DNA

<213> 人工序列<213> Artificial sequences

<400> 5<400> 5

caaaaaaccc ctcaagaccc gtttagaggc cccaaggggt tatgctagtt attgctcagc 60caaaaaaccc ctcaagaccc gtttagaggc cccaaggggt tatgctagtt attgctcagc 60

ggtggcagca gccaactcag cttcctttcg ggctttgtta gcagccggat ctcagtggtg 120ggtggcagca gccaactcag cttcctttcg ggctttgtta gcagccggat ctcagtggtg 120

gtggtggtgg tgctcgagtg cggccgcaag cttgtcgacg gagctcgaat tcttatttaa 180gtggtggtgg tgctcgagtg cggccgcaag cttgtcgacg gagctcgaat tcttatttaa 180

gctgggtaaa tgcagataat cgttttctgg cttcgcgatt tgtcgcctgc atcaccatcc 240gctgggtaaa tgcagataat cgttttctgg cttcgcgatt tgtcgcctgc atcaccatcc 240

acggactgaa cgcccacggc gtggcatcaa taccgtgtaa tacatctgct aaatcacacc 300acggactgaa cgcccacggc gtggcatcaa taccgtgtaa tacatctgct aaatcacacc 300

attgataatc catcacttca tcatcattga tctgtaacgc actagtggtg cgtgcggcaa 360attgataatc catcacttca tcatcattga tctgtaacgc actagtggtg cgtgcggcaa 360

ataccggaca cacttcattt tccacaatgc cactcggatc ggtggcgcgg tagcgaaagt 420ataccggaca cacttcattt tccacaatgc cactcggatc ggtggcgcgg tagcgaaagt 420

caggatagat agattcagga ggcgtaattt ccacgccaag ctcataacgg caacggcgga 480caggatagat agattcagga ggcgtaattt ccacgccaag ctcataacgg caacggcgga 480

tcactgcgtc ttcgttgctt tctcccagtt gtgggtgccc acaaaccgag ttagtccaca 540tcactgcgtc ttcgttgctt tctcccagtt gtgggtgccc acaaaccgag ttagtccaca 540

cgccaggcca tgcttttttg ctcagtgcgc ggcgggtaac taataattgt cctttggcat 600cgccaggcca tgctttttttg ctcagtgcgc ggcgggtaac taataattgt cctttggcat 600

taaacagcca actggagaac gcgagatgta agcgggtgtc tgccgtgtgt gcggcatact 660taaacagcca actggagaac gcgagatgta agcgggtgtc tgccgtgtgt gcggcatact 660

tttccagcgt acccgtggga actccctgtg cattcaataa aatgacgtgt tccgtttgca 720tttccagcgt acccgtggga actccctgtg cattcaataa aatgacgtgt tccgtttgca 720

tatggctgcc gcgcggcacc aggccgctgc tgtgatgatg atgatgatgg ctgctgccca 780tatggctgcc gcgcggcacc aggccgctgc tgtgatgatg atgatgatgg ctgctgccca 780

tggtatatct ccttcttaaa gttaaacaaa attatttcta gttatttatt acgctggatg 840tggtatatct ccttcttaaa gttaaacaaa attatttcta gttatttatt acgctggatg 840

atgtagtccg ctagcgcttc cagtgccgag gtatcgagtg actgttcagc cagttgtttc 900atgtagtccg ctagcgcttc cagtgccgag gtatcgagtg actgttcagc cagttgtttc 900

agcgactgac gggcatcgtc gatcagatcc cgggctttct tccgggcttg ctcaagaccc 960agcgactgac gggcatcgtc gatcagatcc cgggctttct tccgggcttg ctcaagaccc 960

agaagtgcag ggtaggtact tttaccaagt tgctggtcgg caccctggcg ttttcccaac 1020agaagtgcag ggtaggtact ttaccaagt tgctggtcgg caccctggcg ttttcccaac 1020

gttgcagtat ctcccaccac atccaggatg tcatcctgaa cctggaaggc aaggccgatg 1080gttgcagtat ctcccaccac atccaggatg tcatcctgaa cctggaaggc aaggccgatg 1080

ctctctgcat acttgtcgag taccggcaga gcacgacgtc ctttatctcc ggcgcttaat 1140ctctctgcat acttgtcgag taccggcaga gcacgacgtc ctttatctcc ggcgcttaat 1140

gcaccaaggc gaacggcggc gcgaatcaat gcgccggttt tatgacgatg aatacgctca 1200gcaccaaggc gaacggcggc gcgaatcaat gcgccggttt tatgacgatg aatacgctca 1200

agcgcgtcca gaggtacgtg tttgccttcc gcgtctaaat ctaatgcctg accaccgcac 1260agcgcgtcca gaggtacgtg tttgccttcc gcgtctaaat ctaatgcctg accaccgcac 1260

attccggcaa taccactggc gctcgccagt tcagaaatca tcgaaattct gtcgcggtcc 1320attccggcaa taccactggc gctcgccagt tcagaaatca tcgaaattct gtcgcggtcc 1320

gacacttccg gcatatcggc atcgcttaaa atcgagaacg ccagcgtttg taaagcgtcg 1380gacacttccg gcatatcggc atcgcttaaa atcgagaacg ccagcgtttg taaagcgtcg 1380

ccagcgagaa tcgcgtttgc ttcgccaaac ttcacatggc aggttggcaa accgcgacgc 1440ccagcgagaa tcgcgtttgc ttcgccaaac ttcacatggc aggttggcaa accgcgacgc 1440

agatcgtcat catccattgc cggtaaatca tcatgaatta atgagtaagc gtggatacac 1500agatcgtcat catccattgc cggtaaatca tcatgaatta atgagtaagc gtggatacac 1500

tcaacggcgg cagcgggtgc gtccagcgtg tttgtgctaa cgccgaacat atgaccggtg 1560tcaacggcgg cagcgggtgc gtccagcgtg tttgtgctaa cgccgaacat atgaccggtg 1560

gcataaacca ggaaaggtcg caggcgctta ccacctaata atgcgccata ctgcatggtt 1620gcataaacca ggaaaggtcg caggcgctta ccacctaata atgcgccata ctgcatggtt 1620

tcgaccacgg gagtgttctg aaagggcagt ggggcgataa aacggctcag cgcctggttg 1680tcgaccacgg gagtgttctg aaagggcagt ggggcgataa aacggctcag cgcctggttg 1680

gcctgcttaa cgcaggcttc gagttgctgc ggaaagtcca tggatccgcg acccatttgc 1740gcctgcttaa cgcaggcttc gagttgctgc ggaaagtcca tggatccgcg acccatttgc 1740

tgtccaccag tcatgctagc catatggctg ccgcgcggca ccaggccgct gctgtgatga 1800tgtccaccag tcatgctagc catatggctg ccgcgcggca ccaggccgct gctgtgatga 1800

tgatgatgat ggctgctgcc catggtatat ctccttctta aagttaaaca aaattatttc 1860tgatgatgat ggctgctgcc catggtatat ctccttctta aagttaaaca aaattatttc 1860

tagttacggc agacgcatca tacgggtttt cttcacctgc gggccttcgt cacccagata 1920tagttacggc agacgcatca tacgggtttt cttcacctgc gggccttcgt cacccagata 1920

acggaaggtg tacaggctcc agtgcaggtt gcccagcgcg atgttacggc aaccgttcac 1980acggaaggtg tacaggctcc agtgcaggtt gcccagcgcg atgttacggc aaccgttcac 1980

aaatttaata acatcacggt ccacgccctc gccccagaac ggcaggttcg ccatcgcggt 2040aaatttaata acatcacggt ccacgccctc gccccagaac ggcaggttcg ccatcgcggt 2040

gtgccaacga cgatagcaat cgtacagcat ttcgccgatc tcgtccgccg cttcttgatc 2100gtgccaacga cgatagcaat cgtacagcat ttcgccgatc tcgtccgccg cttcttgatc 2100

ggtcataccc tggtcacgca gaataaacat gatgttgctg tcctcgcctt gaatcagatc 2160ggtcataccc tggtcacgca gaataaacat gatgttgctg tcctcgcctt gaatcagatc 2160

cttacgatag ctgcacaggt cgttctgcag cgtcaccaga tcgcaggtaa tacggctgat 2220cttacgatag ctgcacaggt cgttctgcag cgtcaccaga tcgcaggtaa tacggctgat 2220

cgcttccagg ctcgggtgat ccataacgtc ttgcggcagg tcgatgcctt ccgcgaactc 2280cgcttccagg ctcgggtgat ccataacgtc ttgcggcagg tcgatgcctt ccgcgaactc 2280

catcagacca atgcacggat acgcacccac gcaacccgca cgcatatcca tgtattcctc 2340catcagacca atgcacggat acgcacccac gcaacccgca cgcatatcca tgtattcctc 2340

aacgctcaga cggctcgcgg tgttctgaac gcccacttgc tgcagaacac cgatgcaata 2400aacgctcaga cggctcgcgg tgttctgaac gcccacttgc tgcagaacac cgatgcaata 2400

catggtcagg tgtttcttcc aacggtattg cagcgccggg ctcgcacgct gacgaaaacg 2460catggtcagg tgtttcttcc aacggtattg cagcgccggg ctcgcacgct gacgaaaacg 2460

gttccagcag ctttgcaggg tgtgacgcag cgggttctgg tccgggctaa ccaccgggtg 2520gttccagcag ctttgcaggg tgtgacgcag cgggttctgg tccgggctaa ccaccgggtg 2520

aatgttatcc agggtcgcca gggtgtagat cacttcctcc gccgctttaa tcgggttgtt 2580aatgttatcc agggtcgcca gggtgtagat cacttcctcc gccgctttaa tcgggttgtt 2580

cgcaaagctg ccttcgtcga acggatcgtc aaacgcgaaa acccaaccgt tccagtcgtt 2640cgcaaagctg ccttcgtcga acggatcgtc aaacgcgaaa acccaaccgt tccagtcgtt 2640

catcagcttc agggtctcca gatccgcgtt cggcgcgcag atcgcgctca tatacgcaat 2700catcagcttc agggtctcca gatccgcgtt cggcgcgcag atcgcgctca tatacgcaat 2700

gtccagacgg ctccagatgc tcgcggtttt atcgttcata cgcagcgcat ccttcagcca 2760gtccagacgg ctccagatgc tcgcggtttt atcgttcata cgcagcgcat ccttcagcca 2760

cgcgtcgctc agcgctttgc tggtcttgta catcgggtta acgtccggtt ccaccgccat 2820cgcgtcgctc agcgctttgc tggtcttgta catcgggtta acgtccggtt ccaccgccat 2820

gatgctgctg aacagcgcct ccggcagacg aatccaacga ccatccgcca gctccggaat 2880gatgctgctg aacagcgcct ccggcagacg aatccaacga ccatccgcca gctccggaat 2880

atccacgtgg tcgctgcaaa tgctgctcgg gctgatcggg ccacccgcgc agctcggcgg 2940atccacgtgg tcgctgcaaa tgctgctcgg gctgatcggg ccacccgcgc agctcggcgg 2940

cggggtaatg atgctcggaa ccatatgtat atctccttct taaagttaaa caaaattatt 3000cggggtaatg atgctcggaa ccatatgtat atctccttct taaagttaaa caaaattatt 3000

tctagagggg aattgttatc cgctcacaat tcccctatag tgagtcgtat ta 3052tctagagggg aattgttatc cgctcacaat tcccctatag tgagtcgtat ta 3052

<210> 6<210> 6

<211> 52<211> 52

<212> DNA<212> DNA

<213> 人工序列<213> Artificial sequences

<400> 6<400> 6

catcttatta atcagataaa atatttctcg agctccggca aaaagtggcc cc 52catcttatta atcagataaa atatttctcg agctccggca aaaagtggcc cc 52

<210> 7<210> 7

<211> 50<211> 50

<212> DNA<212> DNA

<213> 人工序列<213> Artificial sequences

<400> 7<400> 7

catcttccag gaaatctccg ccccgctcga gaaacccacg gcggcaatgc 50catcttccag gaaatctccg ccccgctcga gaaacccacg gcggcaatgc 50

<210> 8<210> 8

<211> 50<211> 50

<212> DNA<212> DNA

<213> 人工序列<213> Artificial sequences

<400> 8<400> 8

gcattgccgc cgtgggtttc tcgagcgggg cggagatttc ctggaagatg 50gcattgccgc cgtgggtttc tcgagcgggg cggagatttc ctggaagatg 50

<210> 9<210> 9

<211> 52<211> 52

<212> DNA<212> DNA

<213> 人工序列<213> Artificial sequences

<400> 9<400> 9

ggggccactt tttgccggag ctcgagaaat attttatctg attaataaga tg 52ggggccactt tttgccggag ctcgagaaat attttatctg attaataaga tg 52

<210> 10<210> 10

<211> 55<211> 55

<212> DNA<212> DNA

<213> 人工序列<213> Artificial sequences

<400> 10<400> 10

tttgaaagat gggtccgtca cctgcattaa atcctaagga tccactagtt ctaga 55tttgaaagat gggtccgtca cctgcattaa atcctaagga tccactagtt ctaga 55

<210> 11<210> 11

<211> 59<211> 59

<212> DNA<212> DNA

<213> 人工序列<213> Artificial sequences

<400> 11<400> 11

ttttatattc ctcctagtcg actctagagg atccccgggc tgcaggaatt cgatatcaa 59ttttatattc ctcctagtcg actctagagg atccccgggc tgcaggaatt cgatatcaa 59

<210> 12<210> 12

<211> 59<211> 59

<212> DNA<212> DNA

<213> 人工序列<213> Artificial sequences

<400> 12<400> 12

cccggggatc ctctagagtc gactaggagg aatataaaat gaaaaattgt gtcatcgtc 59cccggggatc ctctagagtc gactaggagg aatataaaat gaaaaattgt gtcatcgtc 59

<210> 13<210> 13

<211> 49<211> 49

<212> DNA<212> DNA

<213> 人工序列<213> Artificial sequences

<400> 13<400> 13

gttgagagtt tcatttagct gtcctcctta attcaaccgt tcaatcacc 49gttgagagtt tcatttagct gtcctcctta attcaaccgt tcaatcacc 49

<210> 14<210> 14

<211> 46<211> 46

<212> DNA<212> DNA

<213> 人工序列<213> Artificial sequences

<400> 14<400> 14

acggttgaat taaggaggac agctaaatga aactctcaac taaact 46acggttgaat taaggaggac agctaaatga aactctcaac taaact 46

<210> 15<210> 15

<211> 49<211> 49

<212> DNA<212> DNA

<213> 人工序列<213> Artificial sequences

<400> 15<400> 15

tggctgctgc ccatagtgta atcctcctta ttttttaaca tcgtaagat 49tggctgctgc ccatagtgta atcctcctta ttttttaaca tcgtaagat 49

<210> 16<210> 16

<211> 47<211> 47

<212> DNA<212> DNA

<213> 人工序列<213> Artificial sequences

<400> 16<400> 16

atgttaaaaa ataaggagga ttacactatg ggcagcagcc atcatca 47atgttaaaaa ataaggagga ttacactatg ggcagcagcc atcatca 47

<210> 17<210> 17

<211> 23<211> 23

<212> DNA<212> DNA

<213> 人工序列<213> Artificial sequences

<400> 17<400> 17

ttaggattta atgcaggtga cgg 23ttaggattta atgcaggtga cgg 23

<210> 18<210> 18

<211> 59<211> 59

<212> DNA<212> DNA

<213> 人工序列<213> Artificial sequences

<400> 18<400> 18

cagaaaacga ttatctgcat ttacccagct taaataagag ctccaattcg ccctatagt 59cagaaaacga ttatctgcat ttacccagct taaataagag ctccaattcg ccctatagt 59

<210> 19<210> 19

<211> 55<211> 55

<212> DNA<212> DNA

<213> 人工序列<213> Artificial sequences

<400> 19<400> 19

ttaagaacgg taatgacatg gttaattcct cctactgcag gaattcgata tcaag 55ttaagaacgg taatgacatg gttaattcct cctactgcag gaattcgata tcaag 55

<210> 20<210> 20

<211> 42<211> 42

<212> DNA<212> DNA

<213> 人工序列<213> Artificial sequences

<400> 20<400> 20

ctgcagtagg aggaattaac catgtcatta ccgttcttaa ct 42ctgcagtagg aggaattaac catgtcatta ccgttcttaa ct 42

<210> 21<210> 21

<211> 44<211> 44

<212> DNA<212> DNA

<213> 人工序列<213> Artificial sequences

<400> 21<400> 21

ctcaactctg acatttgatc tgcctcctat gaagtccatg gtaa 44ctcaactctg acatttgatc tgcctcctat gaagtccatg gtaa 44

<210> 22<210> 22

<211> 51<211> 51

<212> DNA<212> DNA

<213> 人工序列<213> Artificial sequences

<400> 22<400> 22

ttaccatgga cttcatagga ggcagatcaa atgtcagagt tgagagcctt c 51ttaccatgga cttcatagga ggcagatcaa atgtcagagt tgagagcctt c 51

<210> 23<210> 23

<211> 53<211> 53

<212> DNA<212> DNA

<213> 人工序列<213> Artificial sequences

<400> 23<400> 23

gatgctgtgt aaacggtcat gagtattacc tcctatttat caagataagt ttc 53gatgctgtgt aaacggtcat gagtattacc tcctatttat caagataagt ttc 53

<210> 24<210> 24

<211> 50<211> 50

<212> DNA<212> DNA

<213> 人工序列<213> Artificial sequences

<400> 24<400> 24

atcttgataa ataggaggta atactcatga ccgtttacac agcatccgtt 50atcttgataa ataggaggta atactcatga ccgtttacac agcatccgtt 50

<210> 25<210> 25

<211> 55<211> 55

<212> DNA<212> DNA

<213> 人工序列<213> Artificial sequences

<400> 25<400> 25

tgcccatata gtaatcctcc tcccgggctg cagttattcc tttggtagac cagtc 55tgcccatata gtaatcctcc tcccgggctg cagttattcc tttggtagac cagtc 55

<210> 26<210> 26

<211> 51<211> 51

<212> DNA<212> DNA

<213> 人工序列<213> Artificial sequences

<400> 26<400> 26

ggaataactg cagcccggga ggaggattac tatatgggca gcagccatca t 51ggaataactg cagcccggga ggaggattac tatatgggca gcagccatca t 51

<210> 27<210> 27

<211> 24<211> 24

<212> DNA<212> DNA

<213> 人工序列<213> Artificial sequences

<400> 27<400> 27

ttatttaagc tgggtaaatg caga 24ttatttaagc tgggtaaatg caga 24

<210> 28<210> 28

<211> 31<211> 31

<212> DNA<212> DNA

<213> 人工序列<213> Artificial sequences

<400> 28<400> 28

atatggatcc atggactttc cgcagcaact c 31atatggatcc atggactttc cgcagcaact c 31

<210> 29<210> 29

<211> 38<211> 38

<212> DNA<212> DNA

<213> 人工序列<213> Artificial sequences

<400> 29<400> 29

atatgaattc actagtttat ttattacgct ggatgatg 38atatgaattc actagtttat ttattacgct ggatgatg 38

<210> 30<210> 30

<211> 34<211> 34

<212> DNA<212> DNA

<213> 人工序列<213> Artificial sequences

<400> 30<400> 30

atcgcatatg caaacggaac acgtcatttt attg 34atcgcatatg caaacggaac acgtcatttt attg 34

<210> 31<210> 31

<211> 36<211> 36

<212> DNA<212> DNA

<213> 人工序列<213> Artificial sequences

<400> 31<400> 31

atatctcgag actagttatt taagctgggt aaatgc 36atatctcgag actagttatt taagctgggt aaatgc 36

<210> 32<210> 32

<211> 27<211> 27

<212> DNA<212> DNA

<213> 人工序列<213> Artificial sequences

<400> 32<400> 32

atatcatatg gttccgagca tcattac 27atatcatatg gttccgagca tcattac 27

<210> 33<210> 33

<211> 30<211> 30

<212> DNA<212> DNA

<213> 人工序列<213> Artificial sequences

<400> 33<400> 33

atatgaattc ttacggcaga cgcatcatac 30atatgaattc ttacggcaga cgcatcatac 30

<210> 34<210> 34

<211> 35<211> 35

<212> DNA<212> DNA

<213> 人工序列<213> Artificial sequences

<400> 34<400> 34

atatctcgag actagttacg gcagacgcat catac 35atatctcgag actagttacg gcagacgcat catac 35

Claims (5)

1.一种萜类合酶FgJ09920的用途,所述萜类合酶具有SEQ ID NO:1所示的氨基酸序列,其特征在于,用于生产koraiol。1. Use of a terpenoid synthase FgJ09920, the terpenoid synthase has the amino acid sequence shown in SEQ ID NO: 1, and is characterized in that, it is used for the production of koraiol. 2.一种生产koraiol的菌株,其特征在于,该菌株含有甲羟戊酸途径和koraiol合成的相关基因;所述的甲羟戊酸途径的相关基因包括atoBerg13、thmg1、erg12、erg8、mvd1 idi基因;所述的koraiol合成的相关基因包括ispA和萜类合酶FgJ09920基因,所述的FgJ09920基因,其序列如SEQ ID NO:2所示。2. a bacterial strain for producing koraiol, is characterized in that, this bacterial strain contains the related gene of mevalonate approach and koraiol synthesis; The related gene of described mevalonate approach comprises atoB , erg13, thmg1, erg12, erg8, mvd1 and idi genes; the genes related to koraiol synthesis include ispA and terpenoid synthase FgJ09920 gene, the sequence of which is shown in SEQ ID NO:2. 3.根据权利要求2所述的生产koraiol的菌株,其特征在于,生产koraiol的菌株为含有pMH1、质粒pFZ81 和质粒pGB235的大肠杆菌;3. the bacterial strain of producing koraiol according to claim 2, is characterized in that, the bacterial strain of producing koraiol is the Escherichia coli containing pMH1, plasmid pFZ81 and plasmid pGB235; 所述的质粒pMH1 以pBBR1MCS 为骨架载体、启动子为lac 启动子,复制子为p15A,包含atoB、erg13 和thmg1 基因;The plasmid pMH1 uses pBBR1MCS as the backbone vector, the promoter is the lac promoter, the replicon is p15A, and contains atoB, erg13 and thmg1 genes; 所述的质粒pFZ81 以pBBR1MCS-2 为骨架载体、启动子为lac 启动子、复制子为pBBR1Rep复制子,包含erg12、erg8、mvd1idi 基因;The plasmid pFZ81 uses pBBR1MCS-2 as the backbone vector, the promoter is the lac promoter, and the replicon is the pBBR1Rep replicon, including erg12, erg8, mvd1 and idi genes; 所述的质粒pGB235以pET21为骨架载体、启动子为T7 启动子、复制子为pBR322 高拷贝复制子,包含FgJ09920、ispAidi基因。The plasmid pGB235 uses pET21 as the backbone vector, the promoter is the T7 promoter, and the replicon is the pBR322 high-copy replicon, and contains FgJ09920, ispA and idi genes. 4.根据权利要求2或3所述的生产koraiol的菌株,其特征在于,该菌株过表达来源于大肠杆菌XL1-blue的atoB基因或idi基因,合成大量催化底物法尼基焦磷酸FPP。4. The strain for producing koraiol according to claim 2 or 3, wherein the strain overexpresses the atoB gene or the idi gene derived from Escherichia coli XL1-blue to synthesize a large amount of the catalytic substrate farnesyl pyrophosphate FPP. 5.权利要求4所述的生产koraiol的菌株在生产koraiol中的应用。5. The application of the strain for producing koraiol according to claim 4 in the production of koraiol.
CN201710229084.XA 2017-04-10 2017-04-10 A kind of terpenoid synthase for producing koraiol and its application Expired - Fee Related CN106987578B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201710229084.XA CN106987578B (en) 2017-04-10 2017-04-10 A kind of terpenoid synthase for producing koraiol and its application

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201710229084.XA CN106987578B (en) 2017-04-10 2017-04-10 A kind of terpenoid synthase for producing koraiol and its application

Publications (2)

Publication Number Publication Date
CN106987578A CN106987578A (en) 2017-07-28
CN106987578B true CN106987578B (en) 2020-01-07

Family

ID=59414946

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201710229084.XA Expired - Fee Related CN106987578B (en) 2017-04-10 2017-04-10 A kind of terpenoid synthase for producing koraiol and its application

Country Status (1)

Country Link
CN (1) CN106987578B (en)

Families Citing this family (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN108239630B (en) * 2016-12-27 2021-07-16 武汉臻智生物科技有限公司 Method for modifying terpene synthase
WO2018120337A1 (en) * 2016-12-27 2018-07-05 武汉臻智生物科技有限公司 Terpene synthase and use thereof
CN110093329B (en) * 2018-01-30 2023-07-04 深圳艾格鑫科技有限公司 Terpene synthase and application thereof

Citations (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN103282484A (en) * 2010-11-01 2013-09-04 诺维信公司 Filamentous fungi and methods for producing isoprenoids

Patent Citations (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN103282484A (en) * 2010-11-01 2013-09-04 诺维信公司 Filamentous fungi and methods for producing isoprenoids

Non-Patent Citations (2)

* Cited by examiner, † Cited by third party
Title
Daniel Buchvaldt Amby et al..Role of the Colletotrichum acutatum sesquiterpene synthase CaTPS in the biosynthesis of sesquiterpenoids.《Microbiology》.2016,(第162期),第1773–1783页. *
无.NCBI Reference Sequence:XP_009257076.1.《NCBI》.2016,第1-2页. *

Also Published As

Publication number Publication date
CN106987578A (en) 2017-07-28

Similar Documents

Publication Publication Date Title
CN106906201B (en) Terpenoid synthase for producing nerolidol and application thereof
CN103243065B (en) Bacterial strain for producing farnesene and application of bacterial strain
US8703454B2 (en) Method for producing (+)-zizaene
CN101978062B (en) Method for producing alpha-santalene
CN108239631B (en) Terpene synthase and application thereof
CN103243066B (en) Bacterial strain for producing lycopene and application of bacterial strain
CN103571835A (en) System and microorganism for producing taxadiene, and application of system or microorganism
CN104120141A (en) A method of synthesizing beta-caryophyllene by microbial catalysis and a reconstituted cell capable of synthesizing the beta-caryophyllene
CN106987578B (en) A kind of terpenoid synthase for producing koraiol and its application
CN111004763B (en) A kind of engineering bacteria for producing β-caryophyllene and its construction method and application
CN107083378A (en) A kind of Terpene synthase for producing Longiborneol and its application
CN110317765B (en) A kind of Escherichia coli expression strain with high production of geraniol glucoside and its application
ES2933968T3 (en) manool production
JP6748108B2 (en) Production of aromatic compounds
CN102732499B (en) Artemisia annua amorpha-4,11-diene synthase mutant with improved enzyme activity and application thereof
JP6735461B2 (en) Recombinant E. coli and its use
CN114921428A (en) Baeyer-Villiger monooxygenase and application thereof
CN113621633A (en) Mangifera indica terpene synthase gene TPS1 and application thereof
KR102346076B1 (en) Transformed methanotrophs for producing α-bisabolene production from methane and uses thereof
JP6440109B2 (en) Recombinant E. coli and its use
CN107531763B (en) An enzyme for biosynthesis of isoprene and isoprene and its mutant
CN117721125B (en) Matricaria chamomilla beta-elemene synthetase gene McTPS and application thereof
CN117363602A (en) Tobacco sesquiterpene synthase NtTPS126 and application thereof
CN117402865A (en) Tobacco sesquiterpene synthase NtTPS122 and application thereof
WO2024251059A1 (en) Cephalotaxane diterpene synthase, preparation method therefor, and use thereof

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant
CF01 Termination of patent right due to non-payment of annual fee

Granted publication date: 20200107

CF01 Termination of patent right due to non-payment of annual fee