OA21191A - Method for creating new gene in organism and use thereof. - Google Patents
Method for creating new gene in organism and use thereof. Download PDFInfo
- Publication number
- OA21191A OA21191A OA1202200178 OA21191A OA 21191 A OA21191 A OA 21191A OA 1202200178 OA1202200178 OA 1202200178 OA 21191 A OA21191 A OA 21191A
- Authority
- OA
- OAPI
- Prior art keywords
- gene
- different
- promoter
- région
- endogenous
- Prior art date
Links
Abstract
The present invention relates to the technical fields of genetic engineering and bioinformatics, in particular, to a method for creating a new gene in an organism in the absence of an artificial DNA template, and a use thereof. The method comprises simultaneously generating DNA breaks at two or more different specific sites in the organism's genome, wherein the specific sites are genomic sites capable of separating different genetic elements or different protein domains, and the DNA breaks are ligated to each other through non-homologous end joining (NHEJ) or homologous repair to generate a new combination of the different gene elements or different protein domains that is different from the original genome sequence, thereby creating a new gene. The new gene of the invention can change the growth, development, resistance, yield and other traits of the organism, and has great value in application. <img file="OA21191A_A0001.tif"/>Fig. 16
Description
Method For Creating New Gene In Organism And Use Thereof
Technical Field
The présent invention relates to the technical fields of genetic engineering and bioinformatics, and in particular, a method for creating a new gene in an organism in the absence of an artificial DNA template, and use thereof.
Background Art
Generally speaking, a complété gene expression cassette in an organism comprises a promoter, 5' untranslated région (5' UTR), coding région (CDS) or non-coding RNA région (Non-coding RNA), 3' untranslated région (3' UTR), a terminator and many other éléments. Non-coding RNA can perform its biological functions at the RNA level, including rRNA, tRNA, snRNA, snoRNA and microRNA. The CDS région contains exons and introns. After the transcribed RNA is translated into a protein, the amino acids of different segments usually form different domains. The spécifie domains détermine the intracellular localization and function of the protein (such as nuclear localization signal, chloroplast leading peptide, mitochondrial leading peptide, DNA binding domain, transcription activation domain, enzyme catalytic center, etc.). For non-coding RNA, different segments also hâve different functions. When one or several éléments of a gene change, a new gene will be formed, which may hâve new functions. For example, an inversion event of a 1.7Mb chromosome fragment occurred upstream of the PpOFPl gene of fiat peach may resuit in a new promoter, which will significantly increase the expression of PpOFPl in peach fruit with fiat shape in the S2 stage of fruit development as compared to that in peach fruit with round shape, thereby inhibit the vertical development of peach fruit and resuit in the fiat shape phenotype in fiat peach (Zhou et al. 2018. A 1.7-Mb chromosomal inversion downstream of a PpOFPl gene is responsible for fiat fruit shape in peach. Plant Biotechnol. J. DOI: 10.1111/pbi. 13455).
The naturel génération of new genes in biological genomes requires a long evolutionary process. According to the research work, the molecular mechanisms for the génération of new genes include exon rearrangement, gene duplication, retrotransposition, and intégration of movable éléments (transposons, retrotransposons), horizontal gene transfer, gene fusion splitting, de novo origination, and many other mechanisms, and new genes may be retained in species under the action of naturel sélection through the dérivation and functional évolution. The relatively young new genes that hâve been identified in fruit Aies, Arabidopsis thaliana, and primates hâve a history of hundreds of thousands to millions of years according to a calculation (Long et al. 2012. The origin and évolution of new genes. Methods Mol Biol. DOI: 10.1007/978-l-61779-585-5_7). Therefore, in the field of genetic engineering and biological breeding, taking plants as an example, if it is desired to introduce a new gene into a plant (even if ail the genetic éléments of the new gene are derived from different genes of the species itself), it can only be achieved through the transgenic technology. That is, the éléments from different genes are assembled together in vitro to form a new gene, which is then transferred into the plant through transgenic technology. It is characterized in that the assembly of new gene needs to be carried out in vitro, resulting in transgenic crops.
The gene editing tools represented by CRISPR/Cas9 and the like can effïciently and accurately generate double-strand breaks (DSB) at spécifie sites in the genome of an organism, and then the double-strand breaks (DSB) are repaired through the cell’s own non-homologous end repair or homologous recombination mechanisms, thereby generating site-specific mutations. The current applications of the gene editing technique mainly focus on the editing of the internai éléments of a single gene, mostly the editing of a CDS exon regio. Editing an exon usually results in frameshift mutations in the gene, leading to the function loss of the gene. For this reason, the gene editing tools such as CRISPR/Cas9 are also known as gene knockout (i.e., gene destruction) tools. In addition to the CDS région, the promoter, 5'UTR and other régions can also be knocked out to affect the expression level of a gene. These methods ail mutate existing genes without generating new genes, so it is difficult to meet some needs in production. For example, for most genes, the existing gene editing technology is difficult to achieve the up-regulation of gene expression, and it is also difficult to change the subcellular localization of a protein or change the functional domain of protein. There are also reports in the literature of inserting a promoter or enhancer sequence upstream of an existing gene to change the expression pattern of the gene so as to produce new traits (Lu et al. 2020. Targeted, efficient sequence insertion and replacement in rice. Nat Biotechnol. DOI: 10.1038/s41587-020-0581-5), but this method requires the provision of foreign DNA templates, so strict regulatory procedures similar to genetically modified crops apply, and the application is restricted.
Summarv of the Invention
In order to solve the above-mentioned problems in the prior art, the présent invention provides a method for creating a new gene in an organism in the absence of an artificial DNA template by simultaneously generating two or more DNA double-strand breaks at a combination of spécifie sites in the organism's genome, and use thereof.
In one aspect, the présent invention provides a method for creating a new gene in an organism, comprising the following steps:
simultaneously generating DNA breaks at two or more different spécifie sites in the organism’s genome, wherein the spécifie sites are genomic sites capable of separating different genetic éléments or different protein domains, ligating the DNA breaks to each other by a non-homologous end joining (NHEJ) or homologous repair, generating a new combination of the different gene éléments or different protein domains that is different from the original genome sequence, thereby creating the new gene.
In one spécifie embodiment, the two or more different spécifie sites may be located on the same chromosome or on different chromosomes. When they locate on the same chromosome, the chromosome fragment resulting from the DNA breaks simultaneously occurring at two spécifie sites may be deleted, inversed or replicating doubled after repair; when they locate on different chromosomes, the DNA breaks generated at two spécifie sites may be ligated to each other after repair to produce a crossover event of the chromosome arms. These events can be identified and screened by PCR sequencing with specifically designed primers.
In a spécifie embodiment, the two or more different spécifie sites may be spécifie sites on at least two different genes, or may be at least two different spécifie sites on the same gene.
In a spécifie embodiment, the transcription directions of the at least two different genes may be the same or different (opposite or toward each other).
In a spécifie embodiment, the DNA breaks are produced by delivering a nuclease with targeting property into a cell of the organism to contact with the spécifie sites of the genomic DNA. There is no essential différence between this type of DNA breaks and the DNA breaks produced by traditional techniques (such as radiation or Chemical mutagenesis).
In a spécifie embodiment, the nuclease with targeting property is selected from Meganuclease, Zinc finger nuclease (ZFN), TALEN, and the CRISPR/Cas System.
Among them, the CRISPR/Cas System can generate two or more DNA double-strand breaks at different sites in the genome through two or more leading RNAs targeting different sequences; by separately designing the ZFN protein or TALEN protein in two or more spécifie site sequences, the Zinc finger nuclease and TALEN Systems can simultaneously generate DNA double-strand breaks at two or more sites. When two breaks are located on the same chromosome, repair results such as délétion, inversion and doubling may occur; and when two breaks are located on two different chromosomes, crossover of chromosomal arms may occur. The délétion, inversion, doubling and exchange of chromosome segments at two DNA breaks can recombine different gene éléments or protein domains, thereby creating a new functional gene.
In a spécifie embodiment, the nuclease with targeting property exists in the form of DNA.
In another spécifie embodiment, the nuclease with targeting property exists in the form of mRNA or protein, rather than the form of DNA.
In a spécifie embodiment, the method for delivering the nucleases with targeting property into the cell is selected from a group consisting of: 1) PEG-mediated cell transfection; 2) liposome-mediated cell transfection; 3) electric shock transformation; 4) microinjection; 5) gene gun bombardment; or 6) Jgro&acterznm-mediated transformation.
The gene éléments comprise a promoter, a 5' untranslated région (5'UTR), a coding région (CDS) or non-coding RNA région (Non-coding RNA), a 3' untranslated région (3'UTR) and a terminator of the gene.
In a spécifie embodiment, the combination of different gene éléments refers to a combination of the promoter of one of the two genes with different expression patterns and the CDS or non-coding RNA région of the other gene.
In a spécifie embodiment, one of the combinations of different gene éléments refers to a strong endogenous promoter in the organism, and the other is a coding région of the HPPD, EPSPS, PPG or GH1 gene.
In another spécifie embodiment, the combination of different gene éléments refers to a combination of a région from the promoter to the 5'UTR of one of two genes with different expression patterns and the CDS or non-coding RNA région of the other gene.
In a spécifie embodiment, the different expression patterns refer to different levels of gene expression.
In another spécifie embodiment, the different expression patterns refer to different tissue-specificities of gene expression.
In another spécifie embodiment, the different expression patterns refer to different developmental stage-specificities of gene expression.
In another spécifie embodiment, the combination of different gene éléments is a combination of adjacent gene éléments within the same gene.
The protein domains refer to a DNA fragment corresponding to a spécifie functional domain of a protein; it includes but is not limited to nuclear localization signal, chloroplast leading peptide, mitochondrial leading peptide, phosphorylation site, méthylation site, transmembrane domain, DNA binding domain, transcription activation domain, receptor activation domain, enzyme catalytic center, etc.
In a spécifie embodiment, the combination of different protein domains refers to a combination of a localization signal région of one of two protein coding genes with different subcellular localizations and a mature protein coding région of the other gene.
In a spécifie embodiment, the different subcellular locations include, but are not limited to, a nuclear location, a cytoplasmic location, a cell membrane location, a chloroplast location, a mitochondrial location, or an endoplasmic réticulum membrane location.
In another spécifie embodiment, the combination of different protein domains refers to a combination of two protein domains with different biological fiinctions.
In a spécifie embodiment, the different biological fiinctions include, but are not limited to, récognition of spécifie DNA or RNA conserved sequence, activation of gene expression, binding to protein ligand, binding to small molécule signal, ion binding, or spécifie enzymatic reaction.
In another spécifie embodiment, the combination of different protein domains refers to a combination of adjacent protein domains in the same gene.
In another spécifie embodiment, the combination of gene éléments and protein domains refers to a combination of protein domains and adjacent promoters, 5'UTR, 3'UTR or terminators in the same gene.
Specifically, the exchange of promoters of different genes can be achieved by inversion of chromosome fragments: when two genes located on the same chromosome hâve different directions, DNA breaks can be generated at spécifie sites between the promoter and CDS of each of the two genes, the région between the breaks can be inverted, thereby the promoters of these two genes would be exchanged, and two new genes would be generated at both ends of the inverted chromosome segment. The different directions of the two genes may be that their 5' ends are internai, namely both genes are in opposite directions, or their 5' ends are extemal, namely both genes are towards each other. Where the genes are in opposite directions, the promoters of the genes would be inverted, as shown in Scheme 1 of Figure 2; where the genes are towards each other, the CDS régions of the genes would be inverted, as shown in Scheme 1 of Figure 4. The inverted région can be as short as less than lOkb in length, with no other genes therebetween; or the inverted région can be very long, reaching up to 300kb-3Mb, andcontaining hundreds of genes.
It is also possible to create a new gene by doubling a chromosome fragment: where two genes located on the same chromosome are in the same direction, DNA breaks can be generated in spécifie sites between the promoter and CDS of each of the two genes, the région between the breaks can be doubled by duplication, and a new gene would be created at the junction of the doubled segment by fusing the promoter of the downstream gene to the the CDS région of the upstream gene, as shown in Figure 1 Scheme 1 and Figure 3. The length of the doubled région can be in the range of 500bp to 5Mb, which can be very short with no other genes therebetween, or can be very long to contain hundreds of genes. Although this method will induce point mutations in the régions between the promoters and the CDS région of the original two genes, such small-scale point mutations generally hâve little effect on the properties of the gene expression, while the new genes created by promoter replacement will hâve new properties of expression. Or altematively, DNA breaks can be generated at spécifie positions on both sides of a protein domain of a same gene, and the région between the breaks can be doubled by duplication, thereby creating a new gene with doubled spécifie functional domains.
In another aspect, the présent invention provides a method for creating a new gene in an organism, comprising the following steps:
generating DNA breaks at spécifie sites on at least two different genes at the level of genome or chromosome of the organism, inducing transfer, doubling, inversion or délétion of DNA, so that a spécifie gene element of one endogenous gene and a gene element on another endogenous gene would be ligated together through non-homologous end joining (NHEJ) or homologous repair, thereby creating a new gene.
The présent invention also provides a new gene obtainable by the présent method.
Compared with the original genes, the new gene may hâve different promoter and therefore hâve expression characteristics in terms of tissues or intensities or developmental stages, or hâve new amino acid sequences.
The new amino acid sequence can either be a fusion of the whole or partial coding régions of two or more gene, or a doubling of a partial protein coding région of the same gene.
In a spécifie embodiment, the new gene is a highly expressing endogenous HPPD, EPSPS, PPO or GH1 gene in an organism.
The présent invention also provides a DNA containing the gene.
The présent invention also provides a protein encoded by the gene, or biologically active fragment thereof.
The présent invention also provides a recombinant expression vector, which comprises the gene and a promoter operably linked thereto.
The présent invention also provides an expression cassette containing the gene.
The présent invention also provides a host cell, which comprises the expression cassette. Preferably, the host cell is a plant cell, an animal cell or a fungal cell.
The présent invention further provides an organism regenerated from the host cell.
The présent invention further provides use of the gene in conferring or improving a resistance/tolerance trait or growth advantage trait in an organism.
The présent invention further provides a composition, which comprises:
(a) the promoter of one of two genes with different expression patterns and a coding région or non-coding RNA région of the other gene;
(b) a région between the promoter and the 5' untranslated région of one of two genes with different expression patterns and a coding région or non-coding RNA région of the other gene;
(c) adjacent gene éléments within the same gene;
(d) a localization signal région of one of the two protein coding genes with different subcellular localizations and a mature protein coding région of the other gene;
(e) two protein domains with different biological functions;
(f) adjacent protein domains in the same gene; or, · (g) a protein domain and an adjacent promoter, 5' untranslated région, 3' non-coding région or terminator in the same gene.
In a spécifie embodiment, the different expression patterns refers to different levels of gene expression.
In another spécifie embodiment, the different expression patterns refers to different tissue-specificities of gene expression.
In another spécifie embodiment, the different expression patterns refers to different developmental stage-specifîcities of gene expression.
In a spécifie embodiment, the different subcellular locations include, but are not limited to, nuclear location, cytoplasmic location, cell membrane location, chloroplast location, mitochondrial location, or endoplasmic réticulum membrane location.
In a spécifie embodiment, the different biological functions include, but are not limited to, récognition of spécifie DNA or RNA conserved sequence, activation of gene expression, binding to protein ligand, binding to small molécule signal, ion binding, or spécifie enzymatic reaction.
In a spécifie embodiment, the composition is fused in vivo.
In particular, the présent invention also provides an editing method of increasing the expression level of a target endogenous gene in an organism independent of an exogenous DNA donor fragment, which comprises the following steps: simultaneously generating DNA breaks at spécifie sites between the promoter and the CDS of each of the target endogenous gene and an optional endogenous highly-expressing gene; ligating the DNA breaks to each other via non-homologous end joining (NHEJ) or homologous repair to form an in vivo fusion of the coding région of the target endogenous gene and the optional strong endogenous promoter, thereby creating a new highly-expressing endogenous gene. This method is named as an editing method for knocking-up an endogenous gene.
In a spécifie embodiment, the target endogenous gene and the optional highly-expressing endogenous gene are located on the same chromosome.
In another spécifie embodiment, the target endogenous gene and the optional highly-expressing endogenous gene are located on different chromosomes.
In another aspect, the présent invention provides an editing method for knocking up the expression of the endogenous HPPD gene in a plant, comprising fusing the coding région of the HPPD gene with a strong plant endogenous promoter in vivo to form a new highly-expressing plant endogenous HPPD gene. That is, simultaneously generating DNA breaks at spécifie sites between the promoter and the CDS of each of the HPPD gene and an optional endogenous highly-expressing gene, ligating the DNA breaks to each other through an intracellular repair pathway to form an in vivo fusion of the coding région of the HPPD gene and the optional endogenous strong promoter, thereby creating a new highly-expressing HPPD gene. In rice, the strong promoter is preferably a promoter of the ubiquitin2 gene.
The présent invention also provides a highly-expressing plant endogenous HPPD gene obtainable by the above editing method.
The présent invention also provides a highly-expressing rice endogenous HPPD gene which has a sequence selected from the group consisting of:
(l)a nucleic acid sequence as shown in SEQ ID NO: 9, SEQ ID NO: 10, SEQ ID NO: 18 or SEQ ID NO: 19 or a partial sequence thereof, or a complementary sequence thereof;
(2) a sequence having an identity of at least 75%, at least 80%, at least 85%, at least 90%, at least 95%, at least 98% or at least 99% to any one of the sequences as defined in (1); or (3) a nucleic acid sequence capable of hybridizing to the sequence as shown in (1) or (2) under a stringent condition.
In another aspect, the présent invention provides an editing method for knocking up the expression of an endogenous EPSPS gene in a plant, which comprises fusing the coding région of an EPSPS gene with a strong plant endogenous promoter in vivo to form a new highly-expressing plant endogenous EPSPS gene. That is, simultaneously generating DNA breaks at spécifie sites between the promoter and the CDS of each of the EPSPS gene and an optional highly-expressing endogenous gene, ligating the DNA breaks to each other through an intracellular repair pathway to form an in vivo fusion of the coding région of the EPSPS gene and the optional strong endogenous promoter, thereby creating a new highly-expressing EPSPS gene. In rice, the strong promoter is preferably a promoter of the TKT gene.
The présent invention also provides a highly-expressing plant endogenous EPSPS gene obtainable by the above editing method.
The présent invention also provides a highly-expressing rice endogenous EPSPS gene which has a sequence selected from the group consisting of:
(1) the nucleic acid sequence as shown in SEQ ID NO: 11, SEQ ID NO: 12, SEQ ID NO: 13 or SEQ ID NO: 14 or a partial sequence thereof, or a complementary sequence thereof;
(2) a sequence having an identity of at least 75%, at least 80%, at least 85%, at least 90%, at least 95%, at least 98% or at least 99% to any one of the sequences as defined in (1); or (3) a nucleic acid sequence capable of hybridizing to the sequence as shown in (1) or (2) under a stringent condition.
In another aspect, the présent invention provides an editing method for knocking up the expression of an endogenous PPO (PPOX) gene in a plant, which comprises fusing the coding région of the PPO gene with a strong plant endogenous promoter in vivo to form a new highly-expressing plant endogenous PPO gene. That is, simultaneously generating DNA breaks at spécifie sites between the promoter and the CDS of each of the PPO gene and an optional highly-expressing endogenous gene , ligating the DNA breaks to each other through an intracellular repair pathway to form an in vivo fusion of the coding région of the PPO gene and the optional strong endogenous promoter, thereby creating a new highly-expressing PPO gene. In rice, the strong promoter is preferably a promoter of the CP 12 gene. In Arabidopsis thaliana, the strong promoter is preferably a promoter of the ubiquitinl 0 gene.
The présent invention also provides a highly-expressing plant endogenous PPO gene obtainable by the above editing method.
The présent invention also provides a highly-expressing rice endogenous PPO gene having a sequence selected from the group consisting of:
(1) the nucleic acid sequence as shown in SEQ ID NO: 15, SEQ ID NO: 16, SEQ ID NO: 17, SEQ ID NO: 20, SEQ ID NO: 21, SEQ ID NO: 22, SEQ ID NO: 23, SEQ ID NO: 24, SEQ ID NO: 25, or SEQ ID NO: 26 or a partial sequence thereof, or a complementary sequence thereof;
(2) a sequence having an identity of at least 75%, at least 80%, at least 85%, at least 90%, at least 95%, at least 98% or at least 99% to any one of the sequences as defined in (1); or (3) a nucleic acid sequence capable of hybridizing to the sequence as shown in (1) or (2) under a stringent condition.
The présent invention also provides a DNA containing the HPPD, EPSPS or PPO gene.
The présent invention also provides a protein encoded by the HPPD, EPSPS or PPO gene or a biologically active fragment thereof.
The présent invention also provides a recombinant expression vector, which comprises the HPPD, EPSPS or PPO gene, and a promoter operably linked thereto.
The présent invention further provides an expression cassette comprising the HPPD, EPSPS or PPO gene.
The présent invention further provides a plant host cell, which comprises the expression cassette.
The présent invention further provides a plant regenerated from the plant host cell.
The présent invention also provides a method for producing a plant with an increased résistance or tolérance to an herbicide, which comprises regenerating the plant host cell into a plant and/or progeny thereof.
In a spécifie embodiment, the plant with increased herbicide résistance or tolérance is a non-transgenic line obtainable by Crossing a plant regenerated from the plant host cell of the invention with a wild-type plant to remove the exogenous transgenic component through genetic ségrégation.
The présent invention also provides a herbicide-resistant rice, which comprises any of the above-mentioned highly-expressing rice endogenous HPPD gene, highly-expressing rice endogenous EPSPS gene, highly-expressing rice endogenous PPO gene or any combination thereof.
In a spécifie embodiment, the herbicide-resistant rice is non-transgenic.
The présent invention fùrther provides use of the highly-expressing plant endogenous HPPD, EPSPS or PPO gene in improving the résistance or tolérance to an inhibitory herbicide in a plant cell, a plant tissue, a plant part or a plant.
In another aspect, the présent invention provides a method for controlling a weed in a plant cultivation site, wherein the plant comprises the above-mentioned plant or a plant prepared by the above-mentioned method, which comprises applying to the cultivation site one or more HPPD, EPSPS or PPO inhibitory herbicides in an amount for effectively controlling the weed.
In the research work of the inventors, it was found that in cells simultaneously undergoing dual-target or multi-target gene editing, a certain proportion of the ends of DNA double-strand breaks at different targets were spontaneously ligated to each other, resulting in events of délétion, inversion or duplication-doubling of the fragments between the targets on the same chromosome, and/or the exchange of chromosome fragments between targets on different chromosomes. It has been reported in the literature that this phenomenon commonly exists in plants and animais (Puchta et al. 2020. Changing local recombination patterns in Arabidopsis by CRISPR/Cas mediated chromosome engineering. Nat Commun. DOI: 10.1038/s41467-02018277-z; Li et al. 2015. Efficient inversions and duplications of mammalian regulatory DNA éléments and gene clusters by CRISPR/Cas9. J Mol Cell Biol. DOI: 10.1093/jmcb/mjv016).
The présent inventors surprisingly discovered that, by inducing DNA double-strand breaks in a combination of gene editing targets near spécifie éléments of a gene of interest, causing spontaneous repair ligation, directed combination of different gene éléments can be achieved at the genome level without the need to provide a foreign DNA template, it is possible to produce therefrom a new functional gene. This strategy greatly accelerates the création of new genes and has great potential in animal and plant breeding and gene function research.
n
Detailed description of invention
In the présent invention, unless otherwise specified, the scientific and technical terms used herein hâve the meanings commonly understood by those skilled in the art. In addition, protein and nucleic acid chemistry, molecular biology, cell and tissue culture, microbiology, immunology related terms and laboratory procedures used herein are ail terms and routine procedures widely used in the corresponding fields. For example, the standard recombinant DNA and molecular cloning techniques used in the présent invention are well known to those skilled in the art and are fully described in the following documents: Sambrook, J., Fritsch, EF and Maniatis, T., Molecular Cloning: A Laboratory Manual; Cold Spring Harbor Laboratory Press: Cold Spring Harbor, 1989. For a better understanding of the présent invention, définitions and explanations of related terms are provided below.
The term genome as used herein refers to ail compléments of genetic material (genes and non-coding sequences) présent in each cell or virus or organelle of an organism, and/or complété genome inherited from a parent as a unit (haploid).
The term gene editing refers to strategies and techniques for targeted spécifie modification of any genetic information or genome of living organisms. Therefore, the term includes editing of gene coding régions, but also includes editing of régions other than gene coding régions of the genome. It also includes editing or modifying other genetic information of nuclei (if présent) and cells.
The term CRISPR/Cas nuclease may be a CRISPR-based nuclease or a nucleic acid sequence encoding the same, including but not limited to: 1) Cas9, including SpCas9, ScCas9, SaCas9, xCas9, VRER-Cas9, EQR-Cas9, SpG-Cas9, SpRY-Cas9, SpCas9-NG, NG-Cas9, NGA-Cas9 (VQR), etc.; 2) Casl2, including LbCpfl, FnCpfl, AsCpfl, MAD7, etc., or any variant or dérivative of the aforementioned CRISPR-based nuclease; preferably, wherein the at least one CRISPR-based nuclease comprises a mutation compared to the corresponding wild-type sequence, so that the obtained CRISPR-based nuclease recognizes a different PAM sequence. As used herein, CRISPR-based nuclease is any nuclease that has been identified in a naturally occurring CRISPR System, which is subsequently isolated from its naturel background, and has preferably been modified or combined into a recombinant construct of interest, suitable as a tool for targeted genome engineering. As long as the original wild-type CRISPR-based nuclease provides DNA récognition, i.e., binding properties, any CRISPR-based nuclease can be used and optionally reprogrammed or otherwise mutated so as to be suitable for varions embodiments of the invention.
The term CRISPR refers to a sequence-specific genetic manipulation technique that relies on clustered regularly interspaced short palindromie repeats, which is different from RNA interférence that régulâtes gene expression at the transcriptional level.
Cas9 nuclease and Cas9 are used interchangeably herein, and refer to RNA-guided nuclease comprising Cas9 protein or fragment thereof (for example, a protein containing the active DNA cleavage domain of Cas9 and/or the gRNA binding domain of Cas9). Cas9 is a component of the CRISPR/Cas (clustered regularly interspaced short palindrome repeats and associated Systems) genome editing System. It can target and eut DNA target sequences under the guidance of guide RNA to form DNA double-strand breaks (DSB).
Cas protein or Cas polypeptide refers to a polypeptide encoded by Cas (CRISPR-associated) gene. Cas protein includes Cas endonuclease. Cas protein can be a bacterial or archaeal protein. For example, the types I to III CRISPR Cas proteins herein generally originate from prokaryotes; the type I and type III Cas proteins can be derived from bacteria or archaea species, and the type II Cas protein (i.e., Cas9) can be derived from bacterial species. “Cas proteins” include Cas9 protein, Cpfl protein, C2cl protein, C2c2 protein, C2c3 protein, Cas3, Cas3-HD, Cas5, Cas7, Cas8, CaslO, Casl2a, Casl2b, or a combination or complex thereof.
Cas9 variant or Cas9 endonuclease variant refers to a variant of the parent Cas9 endonuclease, wherein when associated with crRNA and tracRNA or with sgRNA, the Cas9 endonuclease variant retains the abilities of recognizing, binding to ail or part of a DNA target sequence and optionally unwinding ail or part of a DNA target sequence, nicking ail or part of a DNA target sequence, or cutting ail or part of a DNA target sequence. The Cas9 endonuclease variants include the Cas9 endonuclease variants described herein, wherein the Cas9 endonuclease variants are different from the parent Cas9 endonuclease in the following manner: the Cas9 endonuclease variants (when complexed with gRNA to form a polynucleotide-directed endonuclease complex capable of modifying a target site) hâve at least one improved property, such as, but not limited to, increased transformation efficiency, increased DNA editing effîciency, decreased off-target cutting, or any combination thereof, as compared to the parent Cas9 endonuclease (complexed with the same gRNA to form a polynucleotide-guided endonuclease complex capable of modifying the same target site).
The Cas9 endonuclease variants described herein include variants that can bind to and nick double-stranded DNA target sites when associated with crRNA and tracrRNA or with sgRNA, while the parent Cas endonuclease can bind to the target site and resuit in double strand break (cleavage) when associated with crRNA and tracrRNA or with sgRNA.
Guide RNA and gRNA are used interchangeably herein, and refer to a guide RNA sequence used to target a spécifie gene for correction using CRISPR technology, which usually consists of crRNA and tracrRNA molécules that are partially complementary to form a complex, wherein crRNA contains a sequence that has sufficient complementarity with the target sequence so to hybridize with the target sequence and direct the CRISPR complex (Cas9+crRNA+tracrRNA) to specifically bind to the target sequence. However, it is known in the art that a single guide RNA (sgRNA) can be designed, which contains both the properties of crRNA and tracrRNA.
The terms single guide RNA and sgRNA are used interchangeably herein, and refer to the synthetic fusion of two RNA molécules, which comprises a fusion of a crRNA (CRISPR RNA) of a variable targeting domain (linked to a tracr pairing sequence hybridized to tracrRNA) and a tracrRNA (trans-activating CRISPR RNA). The sgRNA may comprise crRNA or crRNA fragments and tracrRNA or tracrRNA fragments of the type II CRISPR/Cas System that can form a complex with the type II Cas endonuclease, wherein the guide RNA/Cas endonuclease complex can guide the Cas endonuclease to a DNA target site so that the Cas endonuclease can recognize, optionally bind to the DNA target site, and optionally nick the DNA target site or eut (introduce a single-strand or double-strand break) the DNA target site.
In certain embodiments, the guide RNA(s) and Cas9 can be delivered to a cell as a ribonucleoprotein (RNP) complex. RNP is composed of purified Cas9 protein complexed with gRNA, and it is well known in the art that RNP can be effectively delivered to many types of cells, including but not limited to stem cells and immune cells (Addgene, Cambridge, MA, Mirus Bio LLC, Madison, WI).
The protospacer adjacent motif (PAM) herein refers to a short nucléotide sequence adjacent to a (targeted) target sequence (prespacer) recognized by the gRNA/Cas endonuclease System. If the target DNA sequence is not adjacent to an appropriate PAM sequence, the Cas endonuclease may not be able to successfully recognize the target DNA sequence. The sequence and length of PAM herein can be different depending on the Cas protein or Cas protein complex in use. The PAM sequence can be of any length, but is typically in length of 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19 or 20 nucléotides.
As used herein, the term organism includes animais, plants, fungi, bacteria, and the like.
As used herein, the term host cell includes plant cells, animal cells, fungal cells, bacterial cells, and the like.
In the présent invention, the “plant” should be understood to mean any differentiated multicellular organism capable of performing photosynthesis, in particular monocotyledonous or dicotyledonous plants, for example, (1) food crops: Oryza spp., like Oryza sativa, Oryza latifolia, Oryza sativa, Oryza glaberrima; Triticum spp., like Triticum aestivum, T. Turgidumssp. durum; Hordeum spp., like Hordeum vulgare, Hordeum arizonicum; Secale cereale; Avena spp., like Avena sativa, Avena fatua, Avena byzantine, Avena fatua var.sativa, Avena hybrida; Echinochloa spp., like Pennisetum glaucum, Sorghum, Sorghum bicolor, Sorghum vulgare, Triticale, Zea mays or Maize, Millet, Rice, Foxtail millet, Proso millet, Sorghum bicolor, Panicum, Fagopyrum spp., Panicum miliaceum, Setaria italica, Zizania palustris, Eragrostis tef, Panicum miliaceum, Eleusine coracana; (2) legume crops: Glycine spp. like Glycine max, Soja hispida, Soja max, Vicia spp., Vigna spp., Pisum spp., field bean, Lupinus spp., Vicia, Tamarindus indica, Lens culinaris, Lathyrus spp., Lablab, broad bean, mung bean, red bean, chickpea; (3) oil crops: Arachis hypogaea, Arachis spp, Sesamum spp., Helianthus spp. like Helianthus annuus, Elaeis like Eiaeis guineensis, Elaeis oleifera, soybean, Brassicanapus, Brassica oleracea, Sesamum orientale, Brassica juncea, Oilseed râpe, Camellia oleifera, oil palm, olive, castor-oil plant, Brassica napus L., canola; (4) fiber crops: Agave sisalana, Gossypium spp. like Gossypium, Gossypium barbadense, Gossypium hirsutum, Hibiscus cannabinus, Agave sisalana, Musa textilis Nee, Linum usitatissimum, Corchorus capsularis L, Boehmeria nivea (L.), Cannabis sativa, Cannabis sativa; (5) fruit crops: Ziziphus spp., Cucumis spp., Passiflore edulis, Vitis spp., Vaccinium spp., Pyrus communis, Prunus spp., Psidium spp., Punica granatum, Malus spp., Citrullus lanatus, Citrus spp., Ficus carica, Fortunella spp., Fragaria spp., Crataegus spp., Diospyros spp., Eugenia unifora, Eriobotrya japonica, Dimocarpus longan, Carica papaya, Cocos spp., Averrhoa carambola, Actinidia spp., Prunus amygdalus, Musa spp. (musa acuminate), Persea spp. (Persea Americana), Psidium guajava, Mammea Americana, Mangifera indica, Canarium album (Oleaeuropaea), Caricapapaya, Cocos nucifera, Malpighia emarginata, Manilkara zapota, Ananas comosus, Annona spp., Citrus reticulate (Citrus spp.), Artocarpus spp., Litchi chinensis, Ribes spp., Rubus spp., pear, peach, apricot, plum, red bayberry, lemon, kumquat, durian, orange, strawberry, blueberry, hami melon, muskmelon, date palm, walnut tree, cherry tree; (6) rhizome crops: Manihot spp., Ipomoea batatas, Colocasia esculenta, tuber mustard, Allium cepa (onion), eleocharis tuberose (water chestnut), Cyperus rotundus, Rhizoma dioscoreae; (7) vegetable crops: Spinacia spp., Phaseolus spp., Lactuca sativa, Momordica spp, Petroselinum crispum, Capsicum spp., Solanum spp. (such as Solanum tuberosum, Solanum integrifolium, Solanum lycopersicum), Lycopersicon spp. (such as Lycopersicon esculentum, Lycopersicon lycopersicum, Lycopersicon pyriforme), Macrotyloma spp., Kale, Luffa acutangula, lentil, okra, onion, potato, artichoke, asparagus, broccoli, Brussels sprouts, cabbage, carrot, cauliflower, celery, collard greens, squash, Benincasa hispida, Asparagus officinalis, Apium graveolens, Amaranthus spp., Allium spp., Abelmoschus spp., Cichorium endivia, Cucurbita spp., Coriandrum sativum, B.carinata, Rapbanus sativus, Brassica spp. (such as Brassica napus, Brassica rapa ssp., canola, oilseed râpe, tumip râpe, tumip râpe, leaf mustard, cabbage, black mustard, canola (rapeseed), Brussels sprout, Solanaceae (eggplant), Capsicum annuum (sweet pepper), cucumber, luffa, Chinese cabbage, râpe, cabbage, calabash, Chinese chives, lotus, lotus root, lettuce; (8) flower crops: Tropaeolum minus, Tropaeolum majus, Canna indica, Opuntia spp., Tagetes spp., Cymbidium (orchid), Crinum asiaticum L., Clivia, Hippeastrum rutilum, Rosa rugosa, Rosa Chinensis, Jasminum sambac, Tulipa gesneriana L., Cerasus sp., Pharbitis nil (L.) Choisy, Calendula officinalis L., Nelumbo sp., Bellis perennis L., Dianthus caryophyllus, Pétunia hybrida, Tulipa gesneriana L., Lilium brownie, Prunus mume, Narcissus tazetta L., Jasminum nudiflorum Lindl., Primula malacoides, Daphné odora, Camellia japonica, Michelia alba, Magnolia liliiflora, Vibumum macrocephalum, Clivia miniata, Malus spectabilis, Paeonia suffruticosa, Paeonia lactiflora, Syzygium aromaticum, Rhododendron simsii, Rhododendron hybridum, Michelia figo (Lour.) Spreng., Cercis chinensis, Kerria japonica, Weigela florida, Fructus forsythiae, Jasminum mesnyi, Parochetus communis, Cyclamen persicum Mill., Phalaenophsis hybrid, Dendrobium nobile, Hyacinthus orientalis, Iris tectorum Maxim, Zantedeschia aethiopica, Calendula officinalis, Hippeastrum rutilum, Bégonia semperflorenshybr, Fuchsia hybrida, Bégonia maculataRaddi, Géranium, Epipremnum aureum; (9) médicinal crops: Carthamus tinctorius, Mentha spp., Rheum rhabarbarum, Crocus sativus, Lycium chinense, Polygonatum odoratum, Polygonatum Kingianum, Anemarrhena asphodeloides Bunge, Radix ophiopogonis, Fritillaria cirrhosa, Curcuma aromatica, Amomum villosum Lour., Polygonum multiflorum, Rheum officinale, Glycyrrhiza uralensis Fisch, Astragalus membranaceus, Panax ginseng, Panax notoginseng, Acanthopanax gracilistylus, Angelica sinensis, Ligusticum wallichii, Bupleurum sinenses DC., Datura stramonium Linn., Datura metel L., Mentha haplocalyx, Leonurus sibiricus L., Agastache rugosus, Scutellaria baicalensis, Prunella vulgaris L., Pyrethrum cameum, Ginkgo biloba L., Cinchona ledgeriana, Hevea brasiliensis (wild), Medicago sativa Linn, Piper Nigrum L., Radix Isatidis, Atractylodes macrocephala Koidz; (10) raw material crops: Hevea brasiliensis, Ricinus communis, Vemicia fordii, Morus alba L., Hops Humulus lupulus, Betula, Alnus cremastogyne Burk., Rhus vemiciflua stokes; (11) pasture crops: Agropyron spp., Trifolium spp., Miscanthus sinensis, Pennisetum sp., Phalaris arundinacea, Panicum virgatum, prairiegrasses, Indiangrass, Big bluestem grass, Phleum pratense, turf, cyperaceae (Kobresia pygmaea, Carex pediformis, Carex humilis), Medicago sativa Linn, Phleum pratense L., Medicago sativa, Melilotus suavcolen,
Astragalus sinicus, Crotalaria juncea, Sesbania cannabina, Azolla imbircata, Eichhomia crassipes, Amorpha fruticosa, Lupinus micranthus, Trifolium, Astragalus adsurgens pall, Pistia stratiotes linn, Altemanthera philoxeroides, Lolium; (12) sugar crops: Saccharum spp., Beta vulgaris; (13) beverage crops: Camellia sinensis, Camellia Sinensis, tea, Coffee (Coffea spp.), Theobroma cacao, Humulus lupulus Linn.; (14) lawn plants: Ammophila arenaria, Poa spp.(Poa pratensis (bluegrass)), Agrostis spp. (Agrostis matsumurae, Agrostis palustris), Lolium spp. (Lolium), Festuca spp. (Festuca ovina L.), Zoysia spp. (Zoysiajaponica), Cynodon spp. (Cynodon dactylon/bermudagrass), Stenotaphrum secunda tum (Stenotaphrum secundatum), Paspalum spp., Eremochloa ophiuroides (centipedegrass), Axonopus spp. (carpetweed), Bouteloua dactyloides (buffalograss), Bouteloua var. spp. (Bouteloua gracilis), Digitaria sanguinalis, Cyperusrotundus, Kyllingabrevifolia, Cyperusamuricus, Erigeron canadensis, Hydrocotylesibthorpioides, Kummerowiastriata, Euphorbia humifusa, Viola arvensis, Carex rigescens, Carex heterostachya, turf; (15) tree crops: Pinus spp., Salix spp., Acer spp., Hibiscus spp., Eucalyptus spp., Ginkgo biloba, Bambusa sp., Populus spp., Prosopis spp., Quercus spp., Phoenix spp., Fagus spp., Ceiba pentandra, Cinnamomum spp., Corchorus spp., Phragmites australis, Physalis spp., Desmodium spp., Populus, Hedera hélix, Populus tomentosa Carr, Vibumum odoratissinum, Ginkgo biloba L., Quercus, Ailanthus altissima, Schima superba, Ilex pur-purea, Platanus acerifolia, ligustrum lucidum, Buxus megistophylla Levl., Dahurian larch, Acacia meamsii, Pinus massoniana, Pinus khasys, Pinus yunnanensis, Pinus fïnlaysoniana, Pinus tabuliformis, Pinus koraiensis, Juglans nigra, Citrus limon, Platanus acerifolia, Syzygium jambos, Davidia involucrate, Bombax malabarica L., Ceiba pentandra (L.), Bauhinia blakeana, Albizia saman, Albizzia julibrissin, Erythrina corallodendron, Erythrina indica, Magnolia gradiflora, Cycas revolute, Lagerstroemia indica, coniferous, macrophanerophytes, Frutex; (16) nut crops: Bertholletia excelsea, Castanea spp., Corylus spp., Carya spp., Juglans spp., Pistacia vera, Anacardium occidentale, Macadamia (Macadamia integrifolia), Carya illinoensis Koch, Macadamia, Pistachio, Badam, other plants that produce nuts; (17) others: arabidopsis thaliana, Bra chiaria eruciformis, Cenchrus echinatus, Setaria faberi, eleusine indica, Cadaba farinose, algae, Carex elata, omamental plants, Carissa macrocarpa, Cynara spp., Daucus carota, Dioscorea spp., Erianthus sp., Festuca arundinacea, Hemerocallis fulva, Lotus spp., Luzula sylvatica, Medicago sativa, Melilotus spp., Morus nigra, Nicotiana spp., Olea spp., Omithopus spp., Pastinaca sativa, Sambucus spp., Sinapis sp., Syzygium spp., Tripsacum dactyloides, Triticosecale rimpaui, Viola odorata, and the like.
In a spécifie embodiment, the plant is selected from rice, corn, wheat, soybean, sunflower, sorghum, râpe, alfalfa, cotton, barley, millet, sugarcane, tomato, tobacco, cassava, potato, sweet potato, Chinese cabbage, cabbage, cucumber, Chinese rose, Scindapsus aureiis, watermelon, melon, strawberry, blueberry, grape, apple, citrus, peach, pear, banana, etc.
As used herein, the term plant includes a whole plant and any progeny, cell, tissue or part of plant. The term plant part includes any part of a plant, including, for example, but not limited to: seed (including mature seed, immature embryo without seed coat, and immature seed); plant cutting; plant cell; plant cell culture; plant organ (e.g., pollen, embryo, flower, fruit, bud, leaf, root, stem, and related expiant). Plant tissue or plant organ can be seed, callus tissue, or any other plant cell population organized into a structural or fiinctional unit. Some plant cells or tissue cultures can regenerate a plant that has the physiological and morphological characteristics of the plant from which the cell or tissue is derived, and can regenerate a plant that has substantially the same génotype as the plant. In contrast, some plant cells cannot regenerate plants. The regenerable cells in plant cells or tissue cultures can be embryos, protoplasts, meristematic cells, callus, pollen, leaves, anthers, roots, root tips, silks, flowers, kernels, ears, cobs, husks, or stems.
The plant parts comprise harvestable parts and parts that can be used to propagate offspring plants. The plant parts that can be used for propagation include, for example, but not limited to: seeds; fruits; cuttings; seedlings; tubers; and rootstocks. The harvestable parts of plants can be any of useful parts of plants, including, for example, but not limited to: flowers; pollen; seedlings; tubers; leaves; stems; fruits; seeds; and roots.
The plant cells are the structural and physiological units of plants. As used herein, the plant cells include protoplasts and protoplasts with partial cell walls. The plant cells may be in a form of isolated single cells or cell aggregates (e.g., loose callus and cultured cells), and may be part of higher order tissue units (e.g., plant tissues, plant organs, and intact plants). Therefore, the plant cells can be protoplasts, gamete-producing cells, or cells or collection of cells capable of regenerating a whole plant. Therefore, in the embodiments herein, a seed containing a plurality of plant cells and capable of regenerating into a whole plant is considered as a plant part.
As used herein, the term protoplast refers to a plant cell whose cell wall is completely or partially removed and whose lipid bilayer membrane is exposed. Typically, the protoplast is an isolated plant cell without cell wall, which has the potential to regenerate a cell culture or a whole plant.
The plant offspring includes any subséquent générations of the plant.
The terms inhibitory herbicide tolérance and inhibitory herbicide résistance can be used interchangeably, and both refer to tolérance and résistance to an inhibitory herbicide .
Improvement in tolérance to inhibitory herbicide and improvement in résistance to inhibitory herbicide mean that the tolérance or résistance to the inhibitory herbicide is improved as compared to a plant containing the wild-type gene.
The term wild-type refers to a nucleic acid molécule or protein that can be found in nature.
In the présent invention, the term cultivation site comprises a site where the plant of the présent invention is cultivated, such as soil, and also comprises, for example, plant seeds, plant seedlings and grown plants. The term weed-controlling effective amount refers to an amount of herbicide that is sufficient to affect the growth or development of the target weed, for example, to prevent or inhibit the growth or development of the target weed, or to kill the weed. Advantageously, the weed-controlling effective amount does not significantly affect the growth and/or development of the plant seeds, plant seedlings or plants of the présent invention. Those skilled in the art can détermine such weed-controlling effective amount through routine experiments.
The term gene comprises a nucleic acid fragment expressing a functional molécule (such as, but not limited to, spécifie protein), including regulatory sequences before (5' non-coding sequences) and after (3' non-coding sequences) a coding sequence.
The DNA sequence that encodes a spécifie RNA is a DNA nucleic acid sequence that can be transcribed into RNA. The DNA polynucleotides can encode a RNA (mRNA) that can be translated into a protein, or the DNA polynucleotides can encode a RNA that cannot be translated into a protein (for example, tRNA, rRNA, or DNA-targeting RNA; which are also known as non-coding RNA or ncRNA).
The terms polypeptide, peptide and protein are used interchangeably in the présent invention, and refer to a polymer of amino acid residues. The terms are applied to amino acid polymers in which one or more amino acid residues are artificially Chemical analogs of corresponding and naturally occurring amino acids, as well as to naturally occurring amino acid polymers. The terms polypeptide, peptide, amino acid sequence and protein may also include their modification forms, including but not limited to glycosylation, lipid linkage, sulfation, γ-carboxylation of glutamic acid residue, hydroxylation and ADP-ribosylation.
The term biologically active fragment refers to a fragment that has one or more amino acid residues deleted from the N and/or C-terminus of a protein while still retaining its functional activity.
The terms polynucleotide and nucleic acid are used interchangeably and comprise DNA, RNA or hybrids thereof, which may be double-stranded or single-stranded.
The terms nucléotide sequence and nucleic acid sequence both refer to the sequence of bases in DNA or RNA.
Those of ordinary skill in the art can easily use known methods, such as directed évolution and point mutation methods, to mutate the DNA fragments as shown in SEQ ID No. 9 to SEQ ID No. 17 of the présent invention. Those artificially modified nucléotide sequences that hâve at least 75% identity to any one of the foregoing sequences of the présent invention and exhibit the same function are considered as dérivatives of the nucléotide sequence of the présent invention and équivalent to the sequences of the présent invention.
The term identity refers to the sequence similarity to a naturel nucleic acid sequence. Sequence identity can be evaluated by observation or computer software. Using a computer sequence alignment software, the identity between two or more sequences can be expressed as a percentage (%), which can be used to evaluate the identity between related sequences. Partial sequence means at least 5%, 10%, 20%, 30%, 40%, 50%, 60%, 70%, 80%, 90% or 95% of a given sequence.
The stringent condition may be as follows: hybridizing at 50 LI in a mixed solution of 7% sodium dodecyl sulfate (SDS), 0.5M NaPC>4, and 1 mM EDTA, and washing at 50Ξ1 in 2x SSC and 0.1% SDS; or altematively: hybridizing at 50ü in a mixed solution of 7% SDS, 0.5M NaPÛ4 and ImM EDTA, and washing at 503 in lx SSC and 0.1% SDS; or altematively: hybridizing at 50D in a mixed solution of 7% SDS, 0.5M NaPC>4 and ImM EDTA, and washing at 50□ in 0.5x SSC and 0.1% SDS; or altematively: hybridizing at 50ü in a mixed solution of 7% SDS, 0.5M NaPC>4 and ImM EDTA, and washing at 50Π in 0.1 x SSC and 0.1% SDS; or altematively: hybridizing at 50D in a mixed solution of 7% SDS, 0.5M NaPO4 and ImM EDTA, and washing at 65□ in O.lx SSC and 0.1% SDS; or altematively: hybridizing at 65□ in a solution of 6x SSC, 0.5% SDS, and then membrane washing with 2x SSC, 0.1% SDS and lx SSC, 0.1% SDS each once; or altematively: hybridizing and membrane washing twice in a solution of 2x SSC, 0.1% SDS at 68 □, 5 min each time, and then hybridizing and membrane washing twice in a solution of 0.5x SSC, 0.1% SDS at 68 □, 15min each time; or altematively: hybridizing and membrane washing in a solution of O.lx SSPE (or O.lx SSC), 0.1% SDS at 65□.
As used in the présent invention, expression cassette, expression vector and expression construct refer to a vector such as a recombinant vector suitable for expression of a nucléotide sequence of interest in a plant. The term expression refers to the production of a functional product. For example, the expression of a nucléotide sequence may refer to the transcription of the nucléotide sequence (such as transcription to generate mRNA or functional RNA) and/or the translation of RNA into a precursor or mature protein.
The expression construct of the présent invention can be a linear nucleic acid fragment, a circular plasmid, a viral vector, or, in some embodiments, can be an RNA (such as mRNA) that can be translated.
The expression construct of the présent invention may comprise regulatory sequences and nucléotide sequences of interest from different sources, or regulatory sequences and nucléotide sequences of interest from the same source but arranged in a way different from those normally occurring in nature.
The highly-expressing gene in the présent invention refers to a gene whose expression level is higher than that of a common gene in a spécifie tissue.
The terms recombinant expression vector or DNA construct are used interchangeably herein and refer to a DNA molécule comprising a vector and at least one insert. Recombinant expression vectors are usually produced for the purpose of expression and/or propagation of the insert or for the construction of other recombinant nucléotide sequences. The insert may be operably or may be inoperably linked to a promoter sequence and may be operably or may be inoperably linked to a DNA regulatory sequence.
The terms regulatory sequence and regulatory element can be used interchangeably and refer to a nucléotide sequence that is located at the upstream (5' non-coding sequence), middle or downstream (3' non-coding sequence) of a coding sequence, and affects the transcription, RNA Processing, stability or translation of a related coding sequence. Plant expression regulatory éléments refer to nucléotide sequences that can control the transcription, RNA processing or stability or translation of a nucléotide sequence of interest in plants.
The regulatory sequences may include, but are not limited to, promoters, translation leader sequences, introns, and polyA récognition sequences.
The term promoter refers to a nucleic acid fragment capable of controlling the transcription of another nucleic acid fragment. In some embodiments of the présent invention, the promoter is a promoter capable of controlling gene transcription in plant cells, regardless of whether it is derived from plant cells. The promoter can be a constitutive promoter or a tissue-specific promoter or a developmentally regulated promoter or an inducible promoter.
The term strong promoter is a well-known and widely used term in the art. Many strong promoters are known in the art or can be identified by routine experiments. The activity of the strong promoter is higher than the activity of the promoter operatively linked to the nucleic acid molécule to be overexpressed in a wild-type organism, for example, a promoter with an activity higher than the promoter of an endogenous gene. Preferably, the activity of the strong promoter is higher by about 2%, 5%, 8%, 10%, 20%, 30%, 40%, 50%, 60%, 70%, 80%, 90%, 100%, 150%, 200%, 250%, 300%, 350%, 400%, 450%, 500%, 600%, 700%, 800% , 900%, 1000% or more than 1000% than the activity of the promoter operably linked to the nucleic acid molécule to be overexpressed in the wild-type organism. Those skilled in the art know how to measure the activity of a promoter and compare the activities of different promoters.
The term constitutive promoter refers to a promoter that will generally cause gene expression in most cell types in most cases. Tissue-specific promoter and tissue-preferred promoter are used interchangeably, and refer to a promoter that is mainly but not necessarily exclusively expressed in a tissue or organ, and also expressed in a spécifie cell or cell type. Developmentally regulated promoter refers to a promoter whose activity is determined by a developmental event. Inducible promoter responds to an endogenous or exogenous stimulus (environment, hormone, Chemical signal, etc.) to selectively express an operably linked DNA sequence.
As used herein, the term operably linked refers to a connection of a regulatory element (for example, but not limited to, promoter sequence, transcription termination sequence, etc.) to a nucleic acid sequence (for example, a coding sequence or open reading frame) such that the transcription of the nucléotide sequence is controlled and regulated by the transcription regulatory element. The techniques for operably linking regulatory element région to nucleic acid molécule are known in the art.
The introducing a nucleic acid molécule (such as a plasmid, linear nucleic acid fragment, RNA, etc.) or protein into a plant refers to transforming a cell of the plant with the nucleic acid or protein so that the nucleic acid or protein can fimetion in the plant cell. The term transformation used in the présent invention comprises stable transformation and transient transformation.
The term stable transformation refers to that the introduction of an exogenous nucléotide sequence into a plant genome results in a stable inheritance of the exogenous gene. Once stably transformed, the exogenous nucleic acid sequence is stably integrated into the genome of the plant and any successive générations thereof.
The term transient transformation refers to that the introduction of a nucleic acid molécule or protein into a plant cell to perform fiinction does not resuit in a stable inheritance of the foreign gene. In transient transformation, the exogenous nucleic acid sequence is not integrated into the genome of the plant.
Changing the expression of endogenous genes in organisms includes two aspects: intensity and spatial-temporal characteristics. The change of intensity includes the increase (knock-up), decrease (knock-down) and/or shut off the expression of the gene (knock-out); the spatial-temporal specificity includes temporal (growth and development stage) specificity and spatial (tissue) specificity, as well as inducibility. In addition, it includes changing the targeting of a protein, for example, changing the feature of cytoplasmic localization of a protein into a feature of chloroplast localization or nuclear localization.
Unless otherwise defined, ail technical and scientific terms used herein hâve the same meanings as commonly understood by those of ordinary skill in the art to which the présent invention pertains. Although any methods and materials similar or équivalent to those described herein can also be used in the practice or testing of the présent invention, the preferred methods and materials are now described.
Ail publications and patents cited in this description are incorporated herein by reference as if each individual publication or patent is exactly and individually indicated to be incorporated by reference, and is incorporated herein by reference to disclose and describe methods and/or materials related to the publications cited. The citation of any publication which it was published before the filing date should not be interpreted as an admission that the présent invention is not eligible to précédé the publication of the existing invention. In addition, the publication date provided may be different from the actual publication date, which may require independent vérification.
Unless specifically stated or implied, as used herein, the terms a, a/an and the mean at least one. Ail patents, patent applications, and publications mentioned or cited herein are incorporated herein by reference in their entirety, with the same degree of citation as if they were individually cited.
The présent invention has the following advantageous technical effects:
The présent invention comprehensively uses the information of the following two different professional fields to develop a method for directly creating new genes in organisms, completely changing the conventional use of the original gene editing tools (i.e., knocking out genes), realizing a new use thereof for creating new genes, in particular, realizing an editing method for knocking up endogenous genes by using gene editing technology to increase the expression of target genes. The first is the information in the field of gene editing, that is, when two or more different target sites and Cas9 simultaneously target the genome or organism, different situations such as délétion, inversion, doubling or inversion-doubling may occur. The second is the information in the field of genomics, that is, the information about location and distance of different genes in the genome, and spécifie locations, directions and functions of different éléments (promoter, 5'UTR, coding région (CDS), different domain régions, terminator, etc.) in genes, and expression specificity of different genes, etc. By combining the information in these two different fields, DNA breaks are induced at spécifie sites of two or more different genes or at two or more spécifie sites within a single gene (spécifie sites can be determined in the field of genomics), a new combination of different gene éléments or functional domains can be formed through délétion, inversion, doubling, and inversion-doubling or chromosome arm exchange, etc. (the spécifie situations would be provided in the field of gene editing), thereby specifically creating a new gene in the organism.
The new genes created by the présent invention are formed by the fusion or recombination of different éléments of two or more genes under the action of the spontaneous DNA repair mechanism in the organism to change the expression intensity, spatial-temporal specificity, spécial functional domains and the like of the original gene without an exogenous transgene or synthetic gene éléments. Because the new gene has the fusion of two or more different gene éléments, this greatly expands the scope of gene mutation, and will produce more abundant and diverse functions, thus it has a wide range of application prospects. At the same time, these new genes are not linked to the gene editing vectors, so the vector éléments can be removed through genetic ségrégation, and thereby resulting in non-transgenic biological materials containing the new genes for animal and plant breeding. Altematively, non-integrated transient editing can be performed by delivery of mRNA or ribonucleic acid protein complex (RNP) to create non-genetically modified biological materials containing the new genes. This process is non-transgenic and the résultant edited materials would contain no transgene as well. In theory and in fact, these new genes can also be obtained through traditional breeding techniques (such as radiation or Chemical mutagenesis). The différence is that the screening with traditional techniques requires the création of libraries containing a huge number of random mutants and thus it is time-consuming and costly to screen new functional genes. While in the présent invention, new functional genes can be created through bioinformatics analysis combined with gene editing technology, the breeding duration can be greatly shortened. The method of the présent invention is not obliged to the current régulations on gene editing organisms in many countries.
In addition, the new gene création technology of the présent invention can be used to change many traits in organisms, including the growth, development, résistance, yield, etc., and has great application value. The new genes created may hâve new regulatory éléments (such as promoters), which will change the expression intensity and and/or spatial-temporal characteristics of the original genes, or will hâve new amino acid sequences and thus hâve new functions. Taking crops as an example, changing the expression of spécifie genes can increase the résistance of crops to noxious organisms such as pests and weeds and abiotic stresses such as drought, waterlogging, and salinity, and can also increase yield and improve quality. Taking fish as an example, changing the expression characteristics of growth hormone in fish can significantly change its growth and development speed.
Brief Description of Drawings
Figure 1 shows a schematic diagram of creating a new HPPD gene in rice.
Figure 2 shows a schematic diagram of creating a new EPSPS gene in rice.
Figure 3 shows a schematic diagram of creating a new PPOX gene in Arabidopsis thaliana.
Figure 4 shows a schematic diagram of creating a new PPOX gene in rice.
Figure 5 shows the sequencing results for the HPPD-duplication Scheme tested with rice protoplast.
Figure 6 shows the map of the Agrobacterium transformation vector pQY2091 for rice.
Figure 7 shows the electrophoresis results of the PCR products for the détection of new gene fragments in pQY2091 transformed hygromycin résistant rice callus. The arrow indicates the PCR band of the new gene created by the fusion of the promoter of the UBI2 gene with the coding région of the HPPD. The numbers are the numbers of the different callus samples. M represents DNA Marker, and the band sizes are lOObp, 250bp, 500bp, 750bp, lOOObp, 2000bp, 2500bp, 5000bp, 7500bp in order.
Figure 8 shows the electrophoresis results of the PCR products for the détection of new gene fragments in pQY2091 transformed rice TO seedlings. The arrow indicates the PCR band of the new gene created by the fusion of the promoter of the UBI2 gene with the coding région of the HPPD. The numbers are the serial numbers of the different TO seedlings. M represents DNA Marker, and the band sizes are sequentially lOObp, 250bp, 500bp, 750bp, lOOObp, 2000bp, 2500bp,5000bp, 7500bp.
Figure 9 shows the test results for the résistance to Shuangzuocaotong of the QY2091 TO génération of the HPPD gene doubling strain. In the same flowerpot, the wild-type Jinjing 818 is on the left, and the HPPD doubling strain is on the right.
Figure 10 shows the relative expression levels of the HPPD and UBI2 genes in the QY2091 TO génération of the HPPD gene doubling strain. 818CK1 and 818CK3 represent two control plants of the wild-type Jinjing 818; 13M and 20M represent the primary tiller leaf samples of the QY2091-13 and the QY2091-20 TO plants; 13L and 20L represent the secondary tiller leaf samples of the QY2091-13 and the QY2091-20 TO plants used in the herbicide résistance test.
Figure 11 shows a schematic diagram of the possible génotypes of QY2091 Tl génération and the binding sites of the molecular détection primers.
Figure 12 shows the comparison of the sequencing results detecting the HPPD doubling and the predicted doubled sequences for QY2091-13 and QY2091-20.
Figure 13 shows the results of the herbicide résistance test for the Tl génération of the QY2091 HPPD doubling strain at the seedling stage.
Figure 14 shows a schematic diagram of the types of the possible editing event of rice PPO1 gene chromosome fragment inversion and the binding sites of molecular détection primers.
Figure 15 shows the sequencing results of the EPSPS-inversion détection.
Figure 16 shows the map of the rice Agrobacterium transformation vector pQY2234.
Figure 17 shows the electrophoresis results of the PCR products for the détection of new gene fragments of hygromycin résistant rice callus transformed with pQY2234. The arrow indicates the PCR band of the new gene created by the fusion of the promoter of the CP 12 gene with the coding région of the PPO1. The numbers are the serial numbers of different callus samples. M represents DNA Marker, and the band sizes are sequentially lOObp, 250bp, 500bp, 750bp, lOOObp, 2000bp, 2500bp, 5000bp, 7500bp.
Figure 18 shows the résistance test results of the PPO1 gene inversion strain 2081 of the QY2234 T0 génération. Under the same treatment dose, the left flowerpot is the wild-type Huaidao No.5 control, and the right is the PPO1 inversion strain.
Figure 19 shows the relative expression levels of PPO1 and CP12 genes in the QY2234 T0 génération PPO1 inversion strain. H5CK1 and H5CK2 represent two wild-type Huaidao No.5 control plants; 252M, 304M and 329M represent the primary tiller leaf samples of QY2234-252, QY2234-304 and QY2234-329 T0 plants; 252L, 304L and 329L represent secondary tiller leaf samples.
Figure 20 shows the comparison of the sequencing resuit of the PPO1 inversion with the predicted inversion sequence in the Huaidao 5 background.
Figure 21 shows the comparison of the sequencing resuit of the PPO1 inversion with the predicted inversion sequence in the Jinjing 818 background.
Figure 22 shows the herbicide résistance test results for the Tl génération of the QY2234 PPO1 inversion strain at seedling stage.
Spécifie Models for Carrying Out the Invention
The présent invention is further described in conjunction with the examples as follows. The following description is just illustrative, and the protection scope of the présent invention should not be limited to this.
Example 1: An editing method for knocking up the expression of the endogenous HPPD gene by inducing doubling of chromosome fragment in plant - rice protoplast test
HPPD was a key enzyme in the pathway of chlorophyll synthesis in plants, and the inhibition of the activity of the HPPD would eventually lead to albino chlorosis and death of plants. Many herbicides, such as mesotrione and topramezone, were inhibitors with the HPPD as the target protein, and thus increasing the expression level of the endogenous HPPD gene in plants could improve the tolérance of the plants to these herbicides. The rice HPPD gene (as shown in SEQ ID NO: 6, in which l-1067bp is the promoter, and the rest is the expression région) locates on rice chromosome 2. Through bioinformatic analysis, it was found that rice Ubiquitin2 (hereinafter referred to as UBI2) gene (as shown in SEQ ID NO: 5, in which l-2107bp was the promoter, and the rest was the expression région) locates about 338 kb downstream of HPPD gene, and the UBI2 gene and the HPPD gene were in the same direction on the chromosome. According to the rice gene expression profile data provided by the International Rice Genome Sequencing Project (http://rice.plantbiology.msu.edu/index.shtml), the expression intensity of the UBI2 gene in rice leaves was 3 to 10 times higher than that of the HPPD gene, and the UBI2 gene promoter was a strong constitutively expressed promoter.
As shown in Figure 1, Scheme 1 shows that double-strand breaks were simultaneously generated at the sites between the promoters and the CDS région of the HPPD and UBI2 genes respectively, the event of doubling the région between the two breaks were obtained after screening and identification, and a new gene could be formed by fiising the promoter of UBI2 and the coding région of HPPD together. In addition, according to Scheme 2 as shown in Figure 1, a new gene in which the promoter of UBI2 and the coding région of HPPD were fused could also be formed by two consecutive inversions. First, the schemes as shown in Figure 1 were tested in the rice protoplast System as follows:
1. Firstly, the genomic DNA sequences of the rice HPPD and UBI2 genes were input into the CRISPOR online tool (http://crispor.tefor.net/) to search for available editing target sites. After online scoring, the following target sites between the promoters and the CDS régions of HPPD and UBI2 genes were selected for testing:
| OsHPPD-guide RNA1 | GTGCTGGTTGCCTTGGCTGC |
| OsHPPD-guide RNA2 | CACAAATTCACCAGCAGCCA |
| OsHPPD-guide RNA3 | TAAGAACTAGCACAAGATTA |
| OsHPPD-guide RNA4 | GAAATAATCACCAAACAGAT |
The guide RNA1 and guide RNA2 located between the promoter and the CDS région of the HPPD gene, close to the start codon of the HPPD protein, and the guide RNA3 and guide RNA4 located between the promoter and CDS région of the UBI2 gene, close to the UBI2 protein initiation codon.
pHUE411 vector (https://www.addgene.org/62203/) is used as the backbone, and the following primers were designed for the above-mentioned target sites to perform vector construction as described in “Xing HL, Dong L, Wang ZP, Zhang HY, Han CY, Liu B, Wang XC, Chen QJ. A CRISPR/Cas9 Toolkit for multiplex genome editing in plants. BMC Plant Biol. 2014Nov29; 14(1): 327,.
| Primer No. | DNA sequence (5' to 3') |
| OsHPPD-sgRN | ATATGGTCTCGGGCGGTGCTGGTTGCCTTGGCTGCGTTTTAGAG |
| Al-F | CTAGAAATAGCAAG |
| OsHPPD-sgRN | ATATGGTCTCGGGCGCACAAATTCACCAGCAGCCAGTTTTAGAG |
| A2-F | CTAGAAATAGCAAG |
| OsHPPD-sgRN | TATTGGTCTCTAAACTAATCTTGTGCTAGTTCTTAGCTTCTTGGT |
| A3-R | GCCGCGC |
| OsHPPD-sgRN | TATTGGTCTCTAAACATCTGTTTGGTGATTATTTCGCTTCTTGGT |
| A4-R | GCCGCGC |
gene editing vectors for the following dual-target combination were constructed following the method provided in the above-mentioned literature. Specifically, with the pCBC-MTlT2 plasmid (https://www.addgene.org/50593/) as the template, sgRNAl+3, sgRNAl+4, sgRNA2+3 and sgRNA2+4 double target fragments were amplified respectively for constructing the sgRNA expression cassettes. The vector backbone of pHUE411 was digested with Bsal, and recovered from the gel, and the target fragment was digested and directly used for the ligation reaction. T4 DNA ligase was used to ligate the vector backbone and the target fragment, and the ligation product was transformed into Trans5a competent cells. Different monoclones were picked and sequenced The Sparkjade High Purity Plasmid Mini Extraction Kit was used to extract plasmids from the clones with correct sequences, thereby obtaining recombinant plasmids, respectively named as pQY002065, pQY002066, pQY002067, and pQY002068, as follows:
pQY002065 pHUE41 l-HPPD-sgRNAl+3 combination of OsHPPD-guide RNA1, guide RNA3
PQY002066 pHUE41 l-HPPD-sgRNAl+4 combination of OsHPPD-guide RNA1, guide RNA4 pQY002067 pHUE41 l-HPPD-sgRNA2+3 combination of OsHPPD-guide RNA2, guide RNA3 pQY002068 pHUE41 l-HPPD-sgRNA2+4 combination of OsHPPD-guide RNA2, guide RNA4
2. Plasmids of high-purity and high-concentration were prepared for the above-mentioned pQY002065-002068 vectors as follows:
Plasmids were extracted with the Promega Medium Plasmid Extraction Kit (Midipreps DNA Purification System, Promega, A7640) according to the instructions. The spécifie steps were:
(1) Adding 5 ml of Escherichia coli to 300 ml of liquid LB medium containing kanamycin, and shaking at 200 rpm, 37°C for 12 to 16 hours;
(2) Placing the above bacteria solution in a 500 ml centrifuge tube, and centrifuging at 5,000 g for 10 minutes, discarding the supematant;
(3) Adding 3 ml of Cell Resuspension Solution (CRS) to resuspend the cell pellet and vortexing for thorough mixing;
(4) Adding 3 ml of Cell Lysis Solution (CLS) and mixing up and down slowly for no more than 5 minutes;
(5) Adding 3 ml of Neutralization Solution and mixed well by overtuming until the color become clear and transparent;
(6) Centrifuging at 14,000g for 15 minutes, and further centrifuging for 15 minutes if precipitate was not formed compact;
(7) Transferring the supematant to a new 50 ml centrifuge tube, avoiding to suck in white precipitate into the centrifuge tube;
(8) Adding 10 ml of DNA purification resin (Purification Resin, shaken vigorously before use) and mixing well;
(9) Pouring the Resin/DNA mixture was poured into a filter column, and treating by the vacuum pump négative pressure method (0.05 MPa);
(10) Adding 15 ml of Column Wash Solution (CWS) to the fdter column, and vacuuming.
(11) Adding 15 ml of CWS, and repeating vacuuming once; vacuuming was extended for 30 s after the whole solution passed through the filter column;
(12) Cutting off the filter column, transferring to a 1.5 ml centrifuge tube, centrifuging at 12,000 g for 2 minutes, removing residual liquid, and transferring the filter column to a new 1.5 ml centrifuge tube;
(13) Adding 200 pL of sterilized water preheated to 70°C, and keeping rest for 2 minutes;
(14) Centrifuging at 12,000 g for 2 minutes to elute the plasmid DNA; and the concentration was generally about 1 pg/pL.
3. Preparing rice protoplasts and performing PEG-mediated transformation:
First, rice seedlings for protoplasts were prepared, which is of the variety Nipponbare. The seeds were provided by the Weeds Department of the School of Plant Protection, China Agricultural University, and expanded in house. The rice seeds were hulled first, and the hulled seeds were rinsed with 75% éthanol for 1 minute, treated with 5% (v/v) sodium hypochlorite for 20 minutes, then washed with stérile water for more than 5 times. After blow-drying in an ultra-clean table, they were placed in a tissue culture bottle containing 1/2 MS medium, 20 seeds for each bottle. Protoplasts were prepared by incubating at 26°C for about 10 days with 12 hours light.
The methods for rice protoplast préparation and PEG-mediated transformation were conducted according to Lin et al., 2018 Application of protoplast technology to CRISPR/Cas9 mutagenesis: from single-cell mutation détection to mutant plant régénération. Plant Biotechnology Journal https://doi.org /10.1111/pbi.12870. The steps were as follows:
(1) the leaf sheath of the seedlings was selected, eut into pièces of about 1 mm with a sharp Geely razor blade, and placed in 0.6 M mannitol and MES culture medium (formulation: 0.6 M mannitol, 0.4 M MES, pH 5.7) for later use. Ail materials were eut and transferred to 20 ml of enzymatic hydrolysis solution (formulation: 1.5% Cellulase R10/RS (YaKult Honsha), 0.5% Mecerozyme RIO (YaKult Honsha), 0.5M mannitol, 20mM KC1, 20mM MES, pH 5.7, lOmM CaCh, 0.1% BSA, 5 mM β-mercaptoethanol), wrapped in tin foil and placed in a 28°C shaker, enzymatically hydrolyzed at 50 rpm in the dark for about 4 hours, and the speed was increased to 100 rpm in the last 2 minutes;
(2) after the enzymatic lysis, an equal volume of W5 solution (formulation: 154mM NaCl, 125mM CaCh, 5mM KC1, 15mM MES) was added, shaken horizontally for 10 seconds to release the protoplasts. The cells after enzymatic lysis were filtered through a 300-mesh sieve and centrifuged at 150 g for 5 minutes to collect protoplasts;
(3) the cells were rinsed twice with the W5 solution, and the protoplasts were collected by centrifugation at 150 g for 5 minutes;
(4) the protoplasts were resuspended with an appropriate amount of MMG solution (formulation: 3.05g/L MgClj, Ig/L MES, 91.2g/L mannitol), and the concentration of the protoplasts was about 2χ 106 cells/mL.
The transformation of protoplasts was carried out as follows:
(1) to 200 pL of the aforementioned MMG resuspended protoplasts, endotoxin-free plasmid DNA of high quality (10-20 pg) was added and tapped to mix well;
(2) an equal volume of 40% (w/v) PEG solution (formulation: 40% (w/v) PEG, 0.5M mannitol, lOOmM CaCh) was added, tapped to mix well, and kept rest at 28°C in the dark for 15 minutes;
(3) after the induction of transformation, 1.5 ml of W5 solution was added slowly, tapped to mix the cells well. The cells were collected by centrifugation at 150 g for 3 minutes. This step was repeated once;
(4) 1.5 ml of W5 solution was added to resuspend the cells, and placed in a 28°C incubator and cultured in the dark for 12-16 hours. For extracting protoplast genomic DNA, the cultivation should be carried out for 48-60 hours.
4. Genome targeting and detecting new gene:
(1) First, protoplast DNAs were extracted by the CT AB method with some modifications. The spécifie method was as follows: the protoplasts were centrifuged, then the supematant was discarded. 500 pL of DNA extracting solution (formulation: CT AB 20g/L, NaCl 81.82g/L, lOOmM Tris-HCl (pH 8.0), 20 mM EDTA, 0.2% β-mercaptoethanol) was added, shaken to mix well, and incubated in a 65°C water bath for 1 hour; when the incubated sample was cooled, 500 pL of chloroform was added and mixed upside down and centrifuged at 10,000 rpm for 10 minutes; 400 pL of the supematant was transferred to a new 1.5 ml centrifuge tube, 1 ml of 70% (v/v) éthanol was added and the mixture was kept at -20°C for precipitating for 20 minutes; the 5 mixture was centrifuged at 12,000 rpm for 15 minutes to precipitate the DNA; after the precipitate was air dried, 50 pL of ultrapure water was added and stored at -20°C for later use.
(2) The détection primers in the following table were used to amplify the fragments containing the target sites on both sides or the predicted fragments resulting from the fusion of the UBI2 promoter and the HPPD coding région. The lengths of the PCR products were between 10 300-1000 bp, in which the primer8-F + primer6-R combination was used to detect the fusion fragment at the middle joint after the doubling of the chromosome fragment, and the product length was expected to be 630bp.
Primer Sequence (5' to 3')
CACTACCATCCATCCATTTGC
GAGTTCCCCGTGGAGAGGT
TCCATTACTACTCTCCCCGATT
GTGTGGGGGAGTGGATGAC
TGTAGCTTGTGCGTTTCGAT
GGGATGCCCTCTTTGTCC
TCTGTGTGAAGATTATTGCCACT
GGGATGCCCTCCTTATCTTG
The PCR reaction System was as follows:
Components Volume x 15 buffer solution5 pL
Forward primer (10 pM) 2pL
Reverse primer (10 pM) 2pL
Template DNA 2pL
Ultrapure water Added to 50 pL (3) A PCR reaction was conducted under the following general reaction conditions:
Step TempératureTime
Dénaturation 98 °C30 s
OsHPPDduphcated-pnmerl -F OsHPPDduplicated-primer6-R OsHPPDduplicated-primer3 -F OsHPPDduplicated-primer7-R OsHPPDduplicated-primer5-F OsHPPDduplicated-primer2-R OsHPPDduplicated-primer8-F OsHPPDduplicated-primer4-R
| 98 °C | 15 s | |
| Amplification for 30-35 cycles | 58 °C | 15 s |
| 72 °C | 30 s | |
| Final extension | 72 °C | 3 min |
| Finished | 16 °C | 5min |
(4) The PCR reaction products were detected by 1% agarose gel electrophoresis. The results showed that the 630 bp positive band for the predicted fusion fragment of the UBI2 promoter and the HPPD coding région could be detected in the pQY002066 and pQY002068 transformed samples.
5. The positive samples of the fusion fragment of the UBI2 promoter and the HPPD coding région were sequenced for vérification, and the OsHPPDduplicated-primer8-F and OsHPPDduplicated-primer6-R primers were used to sequence from both ends. As shown in Figure 5, the promoter of the UBI2 gene and the expression région of the HPPD gene could be directly ligated, and the editing event of the fusion of the promoter of rice UBI2 gene and the expression région of the HPPD gene could be detected in the protoplast genomic DNA of the rice transformed with pQY002066 and pQY002068 plasmids, indicating that the scheme of doubling the chromosome fragments to form a new HPPD gene was feasible, a new HPPD gene which expression was driven by a strong promoter could be created, and this was defined as an HPPD doubling event. The sequencing resuit of the pQY002066 vector transformed protoplast for testing HPPD doubling event was shown in SEQ ID NO: 9; and the sequencing resuit of the pQY002068 vector transformed protoplast for testing HPPD doubling event was shown in SEQ ID NO: 10.
Example 2: Création of herbicide-resistant rice with knock-up expression of endogenous HPPD gene by chromosome fragment doubling through Agrobacterïum-mediated transformation
1. Construction of knock-up editing vector: Based on the results of the protoplast test in Example 1, the dual-target combination OsHPPD-guide RNA1: 5'GTGCTGGTTGCCTTGGCTGC3' and OsHPPD-guide RNA4:
5'GAAATAATCACCAAACAGAT3' with a high editing efficiency was selected. The Agrobacterium transformation vector pQY2091 was constructed according to Example 1. pHUE411 was used as the vector backbone and subjected to rice codon optimization. The map of the vector was shown in FIG. 6.
2. Agrobacterium transformation of rice callus:
1) Agrobacterium transformation: Ipg of the rice knock-up editing vector pQY2091 plasmid was added to 10μ1 of Agrobacterium EHA105 heat-shock competent cells (Angyu Biotech, Catalog No. G6040), placed on ice for 5 minutes, immersed in liquid nitrogen for quick freezing for 5 minutes, then removed and heated at 37°C for 5 minutes, and fïnally placed on ice for 5 minutes. 500μ1 of YEB liquid medium (formulation: yeast extract Ig/L, peptone 5g/L, beef extract 5g/L, sucrose 5g/L, magnésium sulfate 0.5g/L) was added. The mixture was placed in a shaker and incubated at 28°C, 200 rpm for 2~3 hours; the bacteria were collected by centrifugation at 3500 rpm for 30 seconds, the collected bacteria were spread on YEB (kanamycin 50 mg/L + rifampicin 25 mg/L) plate, and incubated for 2 days in an incubator at 28°C; the single colonies were picked and placed into liquid culture medium, and the bacteria were stored at -80 □.
2) Cultivation of Agrobacterium: The single colonies of the transformed Agrobacterium on the YEB plate was picked, added into 20 ml of YEB liquid medium (kanamycin 50 mg/L + rifampicin 25 mg/L), and cultured while stirring at 28°C until the QD600 was 0.5, then the bacteria cells were collected by centrifugation at 5000 rpm for 10 minutes, 20-40 ml of AAM (Solarbio, lot number LA8580) liquid medium was added to resuspend the bacterial cells to reach OD600 of 0.2-0.3, and then acetosyringone (Solarbio, article number A8110) was added to reach the final concentration of 200μΜ for infecting the callus.
3) Induction of rice callus: The varieties of the transformation récipient rice were Huaidao 5 and Jinjing 818, purchased from the seed market in Huai’an, Jiangsu, and expanded in house. 800-2000 clean rice seeds were hulled, then washed with stérile water until the water was clear after washing. Then the seeds were disinfected with 70% alcohol for 30 seconds, then 30 ml of 5% sodium hypochlorite was added and the mixture was placed on a horizontal shaker and shaken at 50 rpm for 20 minutes, then washed with stérile water for 5 times. The seeds were placed on stérile absorbent paper, air-dried to remove the water on the surface of the seeds, inoculated on an induction medium and cultivated at 28°C to obtain callus.
The formulation of the induction medium: 4.1g/L N6 powder + 0.3 g/L hydrolyzed casein + 2.878 g/L proline + 2 mg/L 2,4-D + 3% sucrose + O.lg/L inositol + 0.5 g glutamine + 0.45% phytagel, pH 5.8.
4) Infection of rice callus with Agrobacterium: The callus of Huaidao No. 5 or Jinjing 818 subcultured for 10 days with a diameter of 3 mm was selected and collected into a 50 ml centrifuge tube; the resuspension solution of the Agrobacterium AAM with the OD600 adjusted to 0.2-0.3 was poured into the centrifuge tube containing the callus, placed in a shaker at 28°C at a speed of 200 rpm to perform infection for 20 minutes; when the infection was completed, the bacteria solution was discarded, the callus was placed on stérile filter paper and air-dried for about 20 minutes, then placed on a plate containing co-cultivation medium to perform co-cultivation, on which the plate was covered with a stérile filter paper soaked with AAM liquid medium containing 100 μΜ acetosyringone; after 3 days of co-cultivation, the Agrobacterium was removed by washing (firstly washing with stérile water for 5 times, then washing with 500mg/L cephalosporin antibiotic for 20 minutes), and sélective cultured on 50mg/L hygromycin sélection medium.
The formulation of the co-cultivation medium: 4.1g/L N6 powder + 0.3 g/L hydrolyzed casein + 0.5 g/L proline + 2 mg/L 2,4-D + 200 μΜ AS + 10 g/L glucose + 3% Sucrose + 0.45% phytagel, pH 5.5.
3. Molecular identification and différentiation into seedlings of hygromycin résistant callus:
Different from the sélection process of conventional rice transformation, with spécifie primers of the fusion fragments generated after the chromosome fragment doubling, hygromycin résistant callus could be molecularly identified during the callus sélection and culture stage in the présent invention, positive doubling events could be determined, and callus containing new genes resulting from fusion of different genetic éléments was selected for différentiation cultivation and induced to emerge seedlings. The spécifie steps were as follows:
1) The co-cultured callus was transferred to the sélection medium for the first round of sélection (2 weeks). The formulation of the sélection medium is: 4.1 g/L N6 powder + 0.3 g/L hydrolyzed casein + 2.878 g/L proline + 2 mg/L 2,4-D + 3% sucrose + 0.5g glutamine + 30 mg/L hygromycin (HYG) + 500 mg/L cephalosporin (cef) + 0.1 g/L inositol + 0.45% phytagel, pH 5.8.
2) After the first round of sélection was completed, the newly grown callus was transferred into a new sélection medium for the second round of sélection (2 weeks). At this stage, the newly grown callus with a diameter greater than 3 mm was clamped by tweezers to take a small amount of sample, the DNA thereof was extracted with the CTAB method described in Example 1 for the first round of molecular identification. In this example, the primer pair of OsHPPDduplicated-primer8-F (8F) and OsHPPDduplicated-primer6-R (6R) was selected to perform PCR identification for the callus transformed with the pQY2091 vector, in which the reaction System and reaction conditions were similar to those of Example 1. Among the total of 350 calli tested, no positive sample was detected in the calli of Huaidao 5, while 28 positive samples were detected in the calli of Jinjing 818. The PCR détection results of some calli were shown in Figure 7.
3) The calli identified as positive by PCR were transferred to a new sélection medium for the third round of sélection and expanding cultivation; after the diameter of the calli was greater than 5 mm, the callus in the expanding cultivation was subjected to the second round of molecular identification using 8F+6R primer pair, the yellow-white callus at good growth status that was identified as positive in the second round was transferred to a différentiation medium to perform différentiation, and the seedlings of about 1 cm could be obtained after 3 to 4 weeks; the differentiated seedlings were transferred to a rooting medium for rooting cultivation; after the seedlings of the rooting cultivation were subjected to hardening off, they were transferred to a flowerpot with soil and placed in a greenhouse for cultivation. The formulation of the différentiation medium is: 4.42g/L MS powder + 0.5 g/L hydrolyzed casein + 0.2 mg/L NAA + 2 mg/L KT + 3% sucrose+ 3% sorbitol+30 mg/L hygromycin + 0.1 g/L inositol + 0.45% phytagel, pH 5.8. The formulation of the rooting medium is: 2.3g/L MS powder + 3% sucrose + 0.45% phytagel.
4. Molecular détection of HPPD doubling seedlings (T0 génération):
After the second round of molecular identification, 29 doubling event-positive calli were co-differentiated to obtain 403 seedlings of T0 génération, and the 8F+6R primer pair was used for the third round of molecular identification of the 403 seedlings, among which 56 had positive bands. The positive seedlings were moved into a greenhouse for cultivation. The PCR détection results of some T0 seedlings were shown in Figure 8.
5. HPPD inhibitory herbicide résistance test for HPPD doubled seedlings (T0 génération):
The transformation seedlings of T0 génération identified as doubling event positive were transplanted into large plastic buckets in the greenhouse for expanding propagation to obtain seeds of Tl génération. After the seedlings began to tiller, the tillers were taken from vigorously growing strains, and planted in the same pots with the tillers of the wild-type control varieties at the same growth period. After the plant height reached about 20 cm, the herbicide résistance test was conducted. The herbicide used was Shuangzuocaotong (CAS No. 1622908-18-2) produced by our company, and its field dosage was usually 4 grams of active ingrédients per mu (4g a.i./mu). In this experiment, Shuangzuocaotong was applied at a dosage gradient of 2g a.i./mu, 4g a.i./mu, 8g a.i./mu and 32g a.i./muwith a walk-in spray tower.
The résistance test results were shown in Figure 9. After 5-7 days of the application, the wild-type control rice seedlings began to show albino, while the strains of the HPPD doubling events ali remained normally green. After 4 weeks of the application, the wild-type rice seedlings were close to death, while the strains of the doubling events ail continued to remain green and grew normally. The test results showed that the HPPD gene-doubled strains had a significantly improved tolérance to Shuangzuocaotong.
6. Quantitative détection of the relative expression of the HPPD gene in the HPPD doubled seedlings (TO génération):
It was speculated that the improved résistance of the HPPD gene doubled strain to Shuangzuocaotong was due to the fusion of the strong promoter of UBI2 and the HPPD gene CDS that increased the expression of HPPD, so the TO génération strains QY2091-13 and QY2091-20 were used to take samples from the primary tillers and the secondary tillers used for herbicide résistance test to detect the expression levels of the HPPD and UBI2 genes, respectively, with the wild-type Jinjing 818 as the control. The spécifie steps were as follows:
1) Extraction of total RNA (Trizol method):
0.1-0.3g of fresh leaves were taken and ground into powder in liquid nitrogen. 1ml of Trizol reagent was added for every 50-1 OOmg of tissue for lysis; the Trizol lysate of the above tissue was transferred into a 1.5ml centrifuge tube, stood at room température (15- 30°C) for 5 minutes; chloroform was added in an amount of 0.2ml per 1ml of Trizol; the centrifuge tube was capped, shaken vigorously in hand for 15 seconds, stood at room température (15-30°C) for 2-3 minutes, then centrifuged at 12000g (4Q) for 15 minutes; the upper aqueous phase was removed and placed in a new centrifuge tube, isopropanol was added in an amount of 0.5 ml per 1 ml of Trizol, the mixture was kept at room température (15-30°C) for 10 minutes, then centrifuged at 12000g (2-8°C) for 10 minutes; the supematant was discarded, and 75% éthanol was added to the pellet in an amount of 1ml per 1ml of Trizol for washing. The mixture was vortexed, and centrifuged at 7500g (2-8°C) for 5 minutes. The supematant was discarded; the precipitated RNA was dried naturally at room température for 30 minutes; the RNA precipitate was dissolved by 50 μΐ of RNase-free water, and stored in the refrigerator at -80°C after electrophoresis analysis and concentration détermination.
2) RNA electrophoresis analysis:
An agarose gel at a concentration of 1% was prepared, then 1 μΐ of the RNA was taken and mixed with 1 μΐ of 2X Loading Buffer. The mixture was loaded on the gel. The voltage was set to 180V and the time for electrophoresis was 12 minutes. After the electrophoresis was completed, the agarose gel was taken out, and the locations and brightness of fragments were observed with a UV gel imaging System.
3) RNA purity détection:
The RNA concentration was measured with a microprotein nucleic acid analyzer. RNA with a good purity had an OD260/OD280 value between 1.8-2.1. The value lower than 1.8 indicated 5 serions protein contamination, and higher than 2.1 indicated serions RNA dégradation.
4) Real-time fluorescence quantitative PCR
The extracted total RNA was reverse transcribed into cDNA with a spécial reverse transcription kit. The main procedure comprised: fïrst determining the concentration of the extracted total RNA, and a portion of 1-4 pg of RNA was used for synthesizing cDNA by 10 reverse transcriptase synthesis. The resulting cDNA was stored at -20°C.
(î) A solution of the RNA template was prepared on ice as set forth in the following table and subjected to dénaturation and annealing reaction in a PCR instrument. This process was conducive to the dénaturation of the RNA template and the spécifie annealing of primers and templates, thereby improving the efficiency of reverse transcription.
Table 1 : Reverse transcription, dénaturation and annealing reaction System
| Component | Amounts (μΐ) |
| Oligo dT primer (50μΜ) | Ιμΐ |
| dNTP mixture (lOmM each) | Ιμΐ |
| RNA Template | l-4pg |
| RNase free water | Added to 10 pL |
Reaction conditions for dénaturation and annealing:
□ 5 min
4ü 5 min @ The reverse transcription reaction System was prepared as set forth in Table 2for synthesizing cDNA :
Table 2: Reverse transcription reaction System
| Component | Amount (μΐ) |
| Reaction solution after the above dénaturation and | 10 pl |
| annealing |
5X RTase Plus Reaction Buffer
RNase Inhibitor
Evo M-MLV Plus RTase (200 U/μΙ )
RNase free water μΐ
0.5 μΐ l μΐ
Added to 20 μΕ
Reaction conditions for cDNA synthesis:
min min (3) The UBQ5 gene of rice was selected as the internai reference gene, and the synthesized cDNA was used as the template to perform fluorescence quantitative PCR. The primers listed in Table 3 were used to préparé the reaction solution according to Table 4.
Table 3: Sequence 5’-3 ’ of the primer for Fluorescence quantitative PCR
| UBQ5-F | ACCACTTCGACCGCCACTACT |
| UBQ5-R | ACGCCTAAGCCTGCTGGTT |
| RT-OsHPPD-F | CAGATCTTCACCAAGCCAGTAG |
| RT-OsHPPD-R | GAGAAGTTGCCCTTCCCAAA |
| RT-OsUbi2-F | CCTCCGTGGTGGTCAGTAAT |
| RT-OsUbi2-R | GAACAGAGGCTCGGGACG |
Table 4: Reaction solution for real-time quantitative PCR (Real Time PCR)
| Component of mixture | Amount (μΐ) |
| SYBR Premix ExTaq II Forward primer ( 1 ΟμΜ) Reverse primer (10μΜ) cDNA | 5 μΐ 0.2 μΐ 0.2 μΐ Ιμΐ |
| Rox II | 0.2μ1 |
| Ultrapure water In total | 3.4 μΐ 10 μΐ |
(î) The reaction was performed following the real-time quantitative PCR reaction steps in Table 5. The reaction was conducted for 40 cycles.
Table 5: Real-time quantitative PCR reaction steps
| Température (J) | Time |
| 50J | 2 min |
| 95 ZJ | 10 min |
| 95J | 15 s |
| 60J | 20 s |
| 95 □ | 15 s |
| 60Σ1 | 20 s |
| 95J | 15 s |
5) Data processing and experimental results
As shown in Table 6, UBQ5 was used as an internai reference, ACt was calculated by subtracting the Ct value of UBQ5 from the Ct value of the target gene , and then 2'ACt was calculated, which represented the relative expression level of the target gene. The 818CK1 and 5 818CK3 were two wild-type Jinjing 818 control plants; 13M and 20M represented the primary tiller leaf samples of QY2091-13 and QY2091-20 TO plants; 13L and 20L represented the secondary tiller leaf samples of QY2091-13 and QY2091-20 TO plants used for herbicide résistance testing.
Table 6: Ct values and relative expression folds of different genes
| UBQ5 | Mean | UBI2 | ACt | 2-acî | Mean | HPPD | ACt | 2-ûcî | Mean | |
| 23.27 | 17.56 | -5.88 | 58.95 | 20.81 | -2.63 | 6.20 | ||||
| 23.55 | 17.71 | -5.73 | 53.09 | 21.01 | -2.43 | 5.40 | ||||
| 818CK1 | 23.51 | 23.44 | 17.66 | -5.78 | 55.06 | 55.70 | 20.98 | -2.47 | 5.52 | 5.71 |
| 23.45 | 17.88 | -5.50 | 45.20 | 20.93 | -2.44 | 5.43 | ||||
| 23.19 | 17.94 | -5.44 | 43.41 | 21.13 | -2.24 | 4.74 | ||||
| 818CK3 | 23.49 | 23.37 | 17.72 | -5.65 | 50.26 | 46.29 | 21.14 | -2.24 | 4.72 | 4.96 |
| 24.61 | 19.56 | -4.92 | 30.32 | 20.23 | -4.25 | 19.07 | ||||
| 24.27 | 19.52 | -4.96 | 31.05 | 20.29 | -4.19 | 18.28 | ||||
| 13M | 24.56 | 24.48 | 19.16 | -5.32 | 39.97 | 33.78 | 20.48 | -4.00 | 15.99 | 17.78 |
| 23.98 | 18.76 | -5.20 | 36.70 | 19.02 | -4.94 | 30.64 | ||||
| 23.89 | 18.52 | -5.43 | 43.19 | 19.07 | -4.89 | 29.56 | ||||
| 13L | 24.00 | 23.96 | 18.81 | -5.14 | 35.34 | 38.41 | 19.07 | -4.88 | 29.45 | 29.88 |
| 24.34 | 19.01 | -5.40 | 42.30 | 19.37 | -5.04 | 32.98 | ||||
| 24.41 | 19.07 | -5.34 | 40.64 | 19.33 | -5.09 | 34.05 |
| 20M | 24.49 | 24.41 | 19.29 | -5.13 | 35.00 | 39.32 | 19.26 | -5.16 | 35.65 | 34.22 |
| 24.63 | 19.46 | -5.11 | 34.52 | 19.88 | -4.69 | 25.83 | ||||
| 24.67 | 19.38 | -5.19 | 36.48 | 19.91 | -4.66 | 25.31 | ||||
| 20L | 24.41 | 24.57 | 19.42 | -5.15 | 35.61 | 35.54 | 19.86 | -4.71 | 26.16 | 25.77 |
The results were shown in Figure 10. The rice UBQ5 was used as an internai reference gene to calculate the relative expression levels of the OsHPPD and UBI2 genes. The results showed that the HPPD expression level of the HPPD doubled strain was significantly higher than that of the wild type, indicating that the fused UBI2 strong promoter did increase the expression level of HPPD, thereby creating a highly-expressing HPPD gene, with the HPPD gene knocked up. The slight decrease in the expression level of UBI2 could be due to the small-scale mutations resulting from the édition of the promoter région, and we had indeed detected base insertions, délétions or small fragment délétions at the UBI2 target site. Compared with the wild type, the expression levels of UBI2 and HPPD significantly tended to be consistent and met the theoretical expectations; among them, the HPPD expression level of the 20M sample was about 6 times higher than that of the wild type CK3 group.
The above results proved that, following the effective chromosome fragment doubling program as tested in protoplasts, calli and transformed seedlings with doubling events could be selected by multiple rounds of molecular identification during the Agrobacterium transformation and tissue culturing, and the UBI2 strong promoter in the new HPPD gene fusion generated in the transformed seedlings did increase the expression level of HPPD gene, rendering the plants to get résistance to HPPD inhibitory herbicide Shuangzuocaotong, up to 8 times the field dose, and thus a herbicide-resistant rice with knock-up endogenous HPPD gene was created. Taking this as an example, using the chromosome fragment doubling technical solution of Example 1 and Example 2, a desired promoter could also be introduced into an endogenous gene which gene expression pattern should be changed to create a new gene, and a new variety of plants with desired gene expression pattern could be created through Agrobacterium-meàwteà. transformation.
Example 3: Molecular détection and herbicide résistance test of Tl génération of herbicide-resistant rice strain with knock-up expression of the endogenous HPPD gene caused by chromosome fragment doubling
The physical distance between the HPPD gene and the UBI2 gene in the wild-type rice genome was 338 kb, as shown in Scheme 1 in Figure 1. The length of the chromosome was increased by 338 kb after the chromosome fragment between them was doubled by duplication, and a highly-expressing new HPPD gene was generated with a UBI2 promoter at the joint of the duplicated fragment to drive the expression of the HPPD CDS région. In order to détermine whether the new gene could be inherited stably and the effect of the doubling chromosome fragment on the genetic stability, molecular détection and herbicide résistance test was conducted for the Tl génération of the HPPD doubled strains.
First of ail, it was observed that the doubling event had no significant effect on the fertility of TO génération plants, as ail positive TO strains were able to produce normal seeds. Planting test of Tl génération seedlings were fùrther conducted for the QY2091-13 and QY2091-20 strains.
1. Sample préparation:
For QY2091-13, a total of 36 Tl seedlings were planted, among which 27 grew normally and 9 were albino. 32 were selected for DNA extraction and détection, where No. 1-24 were normal seedlings, and No.25-32 were albino seedlings.
For QY2091-20, a total of 44 Tl seedlings were planted, among which 33 grew normally and 11 were albino. 40 were selected for DNA extraction and détection, where No. 1-32 were normal seedlings, and No.33-40 were albino seedlings.
Albino seedlings were observed in the Tl génération plants. It was speculated that, since HPPD was a key enzyme in the chlorophyll synthesis pathway of plants, and the TO génération plants resulting from the dual-target édition possibly could be chimeras of many génotypes such as doubling, délétion, inversion of chromosome fragments, or small fragment mutation at the edited target site. The albino phenotype could be generated in the plants where the HPPD gene was destroyed, for example, the HPPD CDS région was deleted. Different primer pairs were designed for PCR to détermine possible génotypes.
2. PCR molecular identification:
1) Sequences of détection primers: sequence 5’-3’
Primer 8F: TCTGTGTGAAGATTATTGCCACTAGTTC
Primer 6R: GAGTTCCCCGTGGAGAGGT
Test 141-F: CCCCTTCCCTCTAAAAATCAGAACAG
Primer 4R: GGGATGCCCTCCTTATCTTGGATC
Primer 3F: CCTCCATTACTACTCTCCCCGATTC
Primer 7R: GTGTGGGGGAGTGGATGACAG pg-Hyg-Rl: TCGTCCATCACAGTTTGCCA pg-35S-F: TGACGTAAGGGATGACGCAC
2) The binding sites of the above primers were shown in Figure 11. Among them, the Primer 8F + Primer 6R were used to detect the fusion fragment of the UBI2 promoter and the HPPD CDS after the chromosome fragment doubling, and the length of the product was 630 bp; the Test 141-F + Primer 4R were used to detect chromosome fragment délétion event, and the length of the product was 222bp; and the pg-Hyg-RH- pg-35S-F were used to detect the T-DNA fragment of the editing vector, and the length of the product was 660bp.
3) PCR reaction System, reaction procedure and gel electrophoresis détection were performed according to Example 1.
3. Molecular détection results:
The détection results of doubling and délétion events were shown in Table 7. It could be noted that the chromosome fragment doubling events and délétion events were observed in the Tl génération plants, with different rations among different lines. The doubling events in the QY2091-13 (29/32) were higher than that in the QY2091-20 (21/40), possibly due to the different chimeric ratios in the TO génération plants. The test results indicated that the fusion gene generated by the doubling was heritable.
Table 7: Détection results of doubling and délétion events
| QY2091-20 | 1 | 2 | 3 | 4 | 5 | 6 | 7 | 8 | 9 | 10 | 11 | 12 | 13 | 14 | 15 | 16 | 17 | 18 | 19 | 20 |
| Doubling | + | - | - | - | + | + | + | - | - | - | + | - | - | - | - | - | + | + | + | - |
| Délétion | - | - | - | - | - | + | - | - | + | - | - | + | - | - | - | - | - | - | - | - |
| 21 | 22 | 23 | 24 | 25 | 26 | 27 | 28 | 29 | 30 | 31 | 32 | 33 | 34 | 35 | 36 | 37 | 38 | 39 | 40 | |
| Doubling | - | + | - | - | + | - | + | + | + | - | + | + | + | - | + | + | + | + | + | - |
| Délétion | - | - | + | + | - | + | - | - | - | + | + | - | + | - | + | - | - | - | - | + |
| QY2091-13 | 1 | 2 | 3 | 4 | 5 | 6 | 7 | 8 | 9 | 10 | 11 | 12 | 13 | 14 | 15 | 16 | 17 | 18 | 19 | 20 |
| Doubling | + | - | + | + | + | + | + | - | + | + | + | + | + | + | + | + | + | + | + | + |
| Délétion | - | + | - | + | + | + | - | + | - | - | + | - | - | - | - | - | - | - | - | + |
| 21 | 22 | 23 | 24 | 25 | 26 | 27 | 28 | 29 | 30 | 31 | 32 | |||||||||
| Doubling | - | + | + | + | + | + | + | + | + | + | + | + | ||||||||
| Délétion | - | - | + | - | - | + | - | - | + | + | - | + |
The pg-Hyg-Rl+ pg-35S-F primers were used to detect the T -DNA fragment of the editing vector for the above Tl seedlings. The electrophoresis results of the PCR products of
QY2091-20-17 and QY2091-13-7 were négative for the T-DNA fragment, indicating that it was a homozygous doubling. It could be seen that doubling-homozygous non-transgenic strains could be segregated from the Tl génération of the doubling events.
4. Détection of editing events by sequencing:
The doubling fusion fragments were sequenced for the doubling-homozygous positive Tl génération samples 1, 5, 7, 11, 18 and 19 for QY2091-20 and for the doubling-homozygous positive Tl samples 1, 3, 7, 9, 10 and 12 for QY2091-13. The left target site of the HPPD gene and the right target site of the UBI2 were amplified at the same time for sequencing to detect the editing events at the target sites. Among them, the Primer 3F + Primer 7R were used to detect the editing event of the left HPPD target site, where the wild-type control product was 481 bp in length; the Primer 8F+Primer 4R were used to detect the editing event of the right UBI2 target site, where the wild-type control product was 329bp in length.
1) Génotype of the doubling events:
The sequencing resuit of the HPPD doubling in QY2091-13 was shown in SEQ ID NO: 18, and the sequencing resuit of the HPPD doubling in QY2091-20 was shown in SEQ ID NO: 19, see Figure 12. Compared with the predicted linker sequences of the doubling, one T base was inserted at the linker in QY2091-13, 19 bases were deleted from the linker in QY2091-20, and both of the insertion and délétion occurred in the promoter région of UBI2 and had no effect on the coding région of the HPPD protein. From the détection results on the expression levels of the HPPD gene in Example 2, it can be seen that the expression levels of these new HPPD genes where the UBI2 promoters were fused to the HPPD CDS région was significantly increased.
2) Editing events at the original HPPD and UBI2 target sites on both sides:
There were more types of editing events at the target sites on both sides. In two lines, three editing types occurred in the HPPD promoter région, namely insertion of single base, délétion of 17 bases, and délétion of 16 bases; and two editing types occured in the UBI2 promoter région, namely insertion of 7 bases and délétion of 3 bases. The Tl plants used for testing and sampling were ail green seedlings and grew normally, indicating that small-scale mutations in these promoter régions had no significant effect on gene function, and herbicide-resistant rice varieties could be selected from their offspring.
5. Herbicide résistance test on seedlings of Tl génération:
The herbicide résistance of the Tl génération of the QY2091 HPPD doubled strain was tested at the seedling stage. After the Tl génération seeds were subjected to surface disinfection, they germinated on 1/2 MS medium containing 1.2μΜ Shuangzuocaotong, and cultivated at 28 J, 16 hours light/8 hours dark, in which wild-type Jinjing 818 was used as a control.
The test results of résistance were shown in Figure 13. After 10 days of cultivation in light, the wild-type control rice seedlings showed phenotypes of albinism and were almost ail albino, while the lines of the HPPD doubling events QY2091-7, 13, 20, 22 showed phenotype ségrégation of chlorosis and green seedlings. According to the aforementioned molecular détection results, there was génotype ségrégation in the Tl génération. Albino seedlings appeared in the absence of herbicide treatment, while green seedlings continued to remain green and grew normally after the addition of 1.2 μΜ Shuangzuocaotong. The test results indicated that the high résistance to Shuangzuocaotong of the HPPD gene-doubled lines could be stably inherited to the Tl génération.
Example 4: An editing method for knocking up the expression of the endogenous PPO gene by inducing chromosome fragment inversion — rice protoplast test
The rice PPO1 (also known as PPOX1) gene (as shown in SEQ ID NO: 7, in which l-1065bp was the promoter, the rest was the coding région) was located on chromosome 1, and the calvin cycle protein CP12 gene (as shown in SEQ ID NO: ID NO: 8, in which l-2088bp was the promoter, and the rest was the coding région) was located 911kb downstream of the PPO1 gene with opposite directions. According to the rice gene expression profile data provided by the International Rice Genome Sequencing Project (http://rice.plantbiology.msu.edu/index.shtml), the expression intensity of the CP12 gene in rice leaves was 50 times that of the PPO1 gene, and the CP 12 gene promoter was a strong promoter highly expressing in leaves.
As shown in Scheme 1 of Figure 4, by simultaneously inducing double-strand breaks between the respective promoters and the CDS région of the two genes and screening, the région between the two breaks could be reversed, with the promoter of PPO1 gene replaced with the promoter of CP 12 gene, increasing the expression level of the PPO1 gene and achieving the résistance to PPO inhibitory herbicides, thereby herbicide-resistant lines could be selected. In addition, as shown in Scheme 2 of Figure 4, a new gene of PPO1 driven by the promoter of CP 12 gene could also be created by first inversion and then doubling.
1. First, the rice PPO1 and CP 12 genomic DNA sequences were input into the CRISPOR online tool (http://crispor.tefor.net/) to search for available editing target sites. After online scoring, the following target sites were selected between the promoters and the CDS régions of the PPO1 and CP 12 genes for testing:
| Name of target sgRNA | Sequence(5' to 3) |
| OsPPO-guide RNA1 | CCATGTCCGTCGCTGACGAG |
| OsPPO-guide RNA2 | CCGCTCGTCAGCGACGGACA |
| OsPPO-guide RNA3 | GCCATGGCTGGCTGTTGATG |
| OsPPO-guide RNA4 | CGGATTTCTGCGTGTGATGT |
The guide RNA1 and guide RNA2 located between the promoter and the CDS région of the PPO1 gene, close to the PPOl start codon, and the guide RNA3 and guide RNA4 located between the promoter and the CDS région of the CPI2 gene, close to the CPI2 start codon.
As described in Example l, primers were designed for the above target sites to construct dual-target vectors, with pHUE4l l as the backbone:
| Primer No. | DNA sequence (5* to 3') |
| OsPPOl-sgRN | ATATGGTCTCGGGCGCCATGTCCGTCGCTGACGAGGTTTTAGAG |
| Al-F | CTAGAAATAGCAAG |
| OsPPOl-sgRN | ATATGGTCTCGGGCGCCGCTCGTCAGCGACGGACAGTTTTAGAG |
| A2-F | CTAGAAATAGCAAG |
| OsPPOl-sgRN | TATTGGTCTCTAAACCATCAACAGCCAGCCATGGCGCTTCTTGGT |
| A3-R | GCCGCGCCTC |
| OsPPOl-sgRN | TATTGGTCTCTAAACACATCACACGCAGAAATCCGGCTTCTTGG |
A4-R TGCCGCGCCTC
Specifically, the pCBC-MTlT2 plasmid (https://www.addgene.org/50593/) was used as the template to amplify the sgRNAl+3, sgRNAl+4, sgRNA2+3, sgRNA2+4 dual-target fragments and construct sgRNA expression cassettes, respectively. The pHUE411 vector backbone was digested with Bsal and recovered from gel, and the target fragment was directly used for the 10 ligation reaction after digestion. T4 DNA ligase was used to ligate the vector backbone and the target fragment, the ligation product was transformed into Trans5a competent cells, different monoclones were selected and sequenced. The Sparkjade High Purity Plasmid Mini Extraction Kit was used to extract plasmids with correct sequencing results, thereby obtaining recombinant plasmids, respectively named as pQY002095, pQY002096, pQY002097, pQY002098, as shown 15 below:
pQY002095 pHUE411-PPO-sgRNAl+3 containing OsPPO-guide RNA1, guide RNA3 combination
PQY002096 pHUE411-PPO-sgRNA2+3 containing OsPPO-guide RNA2, guide RNA3 combination pQY002097 pHUE41 l-PPO-sgRNAl+4 containing OsPPO-guide RNAl, guide RNA4 combination pQY002098 pHUE41 l-PPO-sgRNA2+4 containing OsPPO-guide RNA2, guide RNA4 combination
2. Plasmids of high-purity and high-concentration were prepared for the above-mentioned pQY002095-002098 vectors as described in the step 2 of Example 1.
3. Rice protoplasts were prepared and subjected to PEG-mediated transformation with the above-mentioned vectors as described in step 3 of Example 1.
4. Genomic targeting and détection of new gene with the détection primers shown in the table below for the PCR détection as described in the step 4 of Example 1.
Primer
OsPPOinversion-checkFl (PPO-F1)
OsPPOinversion-checkF2 (PPO-F2)
OsPPOinversion-checkRl (PPO-R1)
OsPPOinversion-checkR2 (PPO-R2)
OsCPinversion-checkFl (CP-F1)
OsCPinversion-checkF2 (CP-F2)
OsCPinversion-checkRl (CP-R1)
OsCPinversion-checkR2 ( CP-R2 )
Sequence (5' to 3')
GCTATGCCGTCGCTCTTTCTC
CGGACTTATTCCCACCAGAA
GAGAAGGGGAGCAAGAAGACGT
AAGGCTGGAAGCTGTTGGG
CATTCCACCAAACTCCCCTCTG
AGGTCTCCTTGAGCTTGTCG
GTCATCTGCTCATGTTTTCACGGTC
CTGAGGAGGCGATAAGAAACGA
Among them, the combination of PPO-R2 and CP-R2 was used to amplify the CP 12 promoter-driven PPO1 CDS new gene fragment that was generated on the right side after chromosome fragment inversion, and the combination of PPO-F2 and CP-F2 was used to 15 amplify the PPO1 promoter-driven CP 12 CDS new gene fragment that was generated on the left side after inversion. The possible génotypes resulting from the dual-target editing and the binding sites of the molecular détection primers were shown in Figure 14.
5. The PCR and sequencing results showed that the expected new gene in which the CP 12 promoter drove the expression of PPO1 was created from the transformation of rice protoplasts. The editing event where the rice CP 12 gene promoter was fused to the PPO1 gene expression région could be detected in the genomic DNA of the transformed rice protoplasts. This indicated that the scheme to form a new PPO gene through chromosome fragment inversion was feasible, and a new PPO gene driven by a strong promoter could be created, which was defined as a PPO1 inversion event. The sequencing results for the chromosome fragment inversion in protoplasts transformed with the pQY002095 vector were shown in SEQ ID NO: 15; the sequencing results for the chromosome fragment délétion in protoplasts transformed with the pQY002095 vector were shown in SEQ ID NO: 16; and the sequencing results for the chromosome fragment inversion in protoplasts transformed with the pQY002098 vector were shown in SEQ ID NO: 17.
Example 5: Création of herbicide-resistant rice with knock-up expression of the endogenous PPO gene caused by chromosome fragment inversion through Agrobacterium-mediated transformation
1. Construction of knock-up editing vector: Based on the results of the protoplast testing, the dual-target combination of OsPPO-guide RNA1: 5'CCATGTCCGTCGCTGACGAG3' and OsPPO-guide RNA4: 5'CGGATTTCTGCGT-GTGATGT3' with high editing efficiency was selected to construct the Agrobacterium transformation vector pQY2234. pHUE411 was used as the vector backbone and the rice codon optimization was performed. The vector map was shown in Figure 16.
2. Agrobacterium transformed rice callus and two rounds of molecular identification:
The pQY2234 plasmid was used to transform rice callus according to the method described in step 2 of Example 2. The récipient varieties were Huaidao No.5 and Jinjing 818. In the callus sélection stage, two rounds of molecular identification were performed on hygromycin-resistant callus, and the calli positive in inversion event were differentiated. During the molecular détection of callus, the amplification of the CP12 promoter-driven PPO1 CDS new gene fragment generated on the right side after chromosome fragment inversion by the combination of PPO-R2 and CP-R2 was deemed as the positive standard for the inversion event, while the CP 12 new gene generated on the left side after inversion was considered after différentiation and seedling emergence of the callus. A total of 734 calli from Huaidao No.5 were tested, in which 24 calli were positive for the inversion event, and 259 calli from Jinjing 818 were tested, in which 29 calli were positive for the inversion event. Figure 17 showed the PCR détection results of Jinjing 818 calli No. 192-259.
3. A total of 53 inversion event-positive calli were subjected to two rounds of molecular identification and then co-differentiated, and 9 doubling event-positive calli were identified, which were subjected to two rounds of molecular identification and then co-differentiated to produce 1,875 TO seedlings, in which 768 strains were from Huaidao No.5 background, and 1107 strains were from Jinjing 818 background. These 1875 seedlings were further subjected to the third round of molecular identification with the PPO-R2 and CP-R2 primer pair, in which 184 fines from Huaidao No.5 background showed inversion-positive bands, 350 strains from Jinjing 818 background showed inversion-positive bands. The positive seedlings were moved to the greenhouse for cultivation.
4. PPO inhibitory herbicide résistance test of PPO1 inversion seedlings (TO génération):
Transformation seedlings of QY2234 TO génération identified as inversion event-positive were transplanted into large plastic buckets in the greenhouse to grow seeds of Tl génération. There were a large number of positive seedlings, so some TO seedlings and wild-type control fines with similar growth period and status were selected. When the plant height reached about cm, the herbicide résistance test was directly carried out. The herbicide used was a high-efficiency PPO inhibitory herbicide
produced by the company (code 2081, see Chinse Patent Application for Invention No.202010281666.4). In this experiment, the herbicide was applied at the gradients of three levels, namely 0.18, 0.4, and 0.6 g ai/mu, by a walk-in type spray tower.
The résistance test results were shown in Figure 18. 3-5 days after the application, the wild-type control rice seedlings began to wither from tip of leaf, necrotic spots appeared on the leaves, and the plants gradually withered, while most of the fines of the PPO1 inversion event maintained normal growth, the leaves had no obvious phytotoxicity. In addition, some fines showed phytotoxicity, probably due to the polygenotypic mosaicism of editing events and the low expression level of PPO1 in the T0 génération fines. Two weeks after the application, the wild-type rice seedlings died, and most of the inversion event strains continued to remain green and grew normally. The test results showed that the PPO1 inversion fines could significantly improve the tolérance of plants to 2081.
5. Quantitative détection of relative expression level of PPO1 gene in PPO1 inversion seedlings (T0 génération):
It was speculated that the increased résistance of the PPOl gene inversion lines to 2081 was due to the fusion of the strong CPI2 promoter and the CDS of the PPOl gene which would increase the expression level of PPOl. Therefore, the lines of TO génération QY2234-252, QY2234-304 and QY2234-329 from Huaidao No.5 background were selected, their primary 5 tillers and secondary tillers were sampled and subjected to the détection of expression levels of PPOl and CP 12 genes. The wild-type Huaidao No.5 was used as the control. The spécifie protocols followed step 6 of Example 2, with the rice UBQ5 gene as the internai reference gene. the fluorescence quantitative primers were as follows: 5’-3’
| UBQ5-F | ACCACTTCGACCGCCACTACT |
| UBQ5-R | ACGCCTAAGCCTGCTGGTT |
| RT-OsPPOl-F | GCAGCAGATGCTCTGTCAATA |
| RT-OsPPOl-R | CTGGAGCTCTCCGTCAATTAAG |
| RT-OsCP12-Fl | CCGGACATCTCGGACAA |
| RT-OsCP12-Rl | CTCAGCTCCTCCACCTC |
The UBQ5 was used as an internai reference. ACt was calculated by subtracting the Ct 10 value of UBQ5 from the Ct value of the target gene. Then 2‘ACt was calculated, which represented the relative expression level of the target gene. The H5CK1 and H5CK2 were two wild-type control plants of Huaidao No.5, the 252M, 304M and 329M represented the primary tiller leaf samples of QY2234-252, QY2234-304 and QY2234-329 TO plants, and the 252L, 304L, and 329L represented their secondary tiller leaf samples. The results were shown in Table 15 8 below:
Table 8: Ct values and relative expression folds of different genes
| UBQ5 | Mean | PPOl | ACt | 2-ACt | Mean | CP12 | ACt | 2-ACt | Mean | |
| 28.18 | 25.83 | -2.43 | 5.39 | 22.28 | -3.98 | 15.77 | ||||
| 28.37 | 25.98 | -2.28 | 4.85 | 22.06 | -4.20 | 18.44 | ||||
| H5CK1 | 28.23 | 28.26 | 25.93 | -2.33 | 5.03 | 5.09 | 22.11 | -4.15 | 17.76 | 17.32 |
| 28.23 | 25.73 | -2.36 | 5.15 | 21.63 | -6.47 | 88.58 | ||||
| 27.98 | 26.02 | -2.07 | 4.20 | 21.53 | -6.57 | 94.87 | ||||
| H5CK2 | 28.07 | 28.09 | 25.92 | -2.18 | 4.52 | 4.62 | 21.54 | -6.55 | 93.83 | 92.43 |
| 25.51 | 25.17 | -0.54 | 1.45 | 22.26 | -3.45 | 10.95 | ||||
| 25.82 | 25.22 | -0.49 | 1.41 | 22.36 | -3.36 | 10.23 | ||||
| 252M | 25.80 | 25.71 | 25.22 | -0.49 | 1.41 | 1.42 | 22.43 | -3.29 | 9.76 | 10.31 |
| 26.41 | 23.36 | -3.14 | 8.84 | 22.30 | -4.21 | 18.49 |
| 26.64 | 23.41 | -3.10 | 8.56 | 21.95 | -4.56 | 23.55 | ||||
| 252L | 26.47 | 26.51 | 23.46 | -3.05 | 8.28 | 8.56 | 21.78 | -4.73 | 26.47 | 22.84 |
| 25.74 | 24.55 | -1.29 | 2.44 | 22.51 | -3.32 | 10.02 | ||||
| 25.99 | 24.53 | -1.31 | 2.48 | 22.45 | -3.39 | 10.47 | ||||
| 304M | 25.78 | 25.84 | 24.48 | -1.36 | 2.57 | 2.50 | 22.56 | -3.28 | 9.71 | 10.07 |
| 25.97 | 23.63 | -2.36 | 5.14 | 21.60 | -4.39 | 20.97 | ||||
| 26.00 | 23.75 | -2.25 | 4.74 | 21.43 | -4.56 | 23.55 | ||||
| 304L | 26.00 | 25.99 | 23.56 | -2.43 | 5.39 | 5.09 | 22.32 | -3.68 | 12.78 | 19.10 |
| 26.94 | 23.11 | -3.89 | 14.84 | 22.23 | -4.76 | 27.16 | ||||
| 26.99 | 23.25 | -3.75 | 13.42 | 21.85 | -5.15 | 35.39 | ||||
| 329M | 27.07 | 27.00 | 23.22 | -3.78 | 13.71 | 13.99 | 21.82 | -5.18 | 36.29 | 32.95 |
| 26.50 | 23.64 | -2.63 | 6.19 | 22.00 | -4.27 | 19.30 | ||||
| 26.52 | 23.74 | -2.53 | 5.79 | 21.97 | -4.30 | 19.71 | ||||
| 329L | 25.79 | 26.27 | 23.77 | -2.50 | 5.65 | 5.87 | 22.15 | -4.12 | 17.42 | 18.81 |
The relative expression levels of PPO1 and CP12 in different strains were shown in Figure 19. As the results showed, unlike the doubling event in Example 2, the gene expression levels of these inversion event strains were significantly different. The expression levels of CP 12 are very different between the two Huaidao No.5 CK groups, possibly because of the different growth rates of the seedlings. Compared with the H5CK2 control group, the expression levels of CP 12 in the experimental groups ail showed a tendency of decrease, while the expression levels of PPO1 for 252L and 329M increased significantly, and the expression levels of PPO1 for 304L and 329L modestly increased, and the expression levels of PPO1 for 252M and 304M decreased. Different from the doubling of chromosome fragments which mainly increased the gene expression level, the inversion of chromosome fragments generated new genes on both sides, so varions editing events might occur at the targets on both sides, and the changes in the transcription direction might also affect gene expression level at the same time. That is to say, the TO génération plants were complex chimeras. There might also be significant différences in gene expression levels between primary and secondary tillers of the same plant. It could be seen from the results of quantitative PCR that the PPO1 inversion events showed a higher likelihood of increasing the PPO1 gene expression level, and thus herbicide-resistant strains with high expression level of PPO1 could be selected out by herbicide résistance sélection for the inversion events.
The above results proved that, following the scheme of detecting effective chromosome fragment inversion in protoplasts, calli and transformed seedlings with inversion events could be selected through the multiple rounds of molecular identification during the Agrobacterium transformation and tissue culturing, and the CP 12 strong promoter fused with the new PPO1 gene generated in the transformant seedlings could indeed increase the expression level of the PPO1 gene, which could confer the plants with résistance to the PPO inhibitory herbicide 2081, thereby herbicide-resistant rice with knock-up endogenous PPO gene was created. Taking this as an example, the chromosome fragment inversion protocol of Example 4 and Example 5 also applied to other endogenous genes which gene expression pattern needed to be changed by introducing and fusing with a required promoter, thereby a new gene can be created, and new varieties with a desired gene expression pattem could be created through Agrobacteriïim-mediated transformation in plants.
Example 6: Molecular détection and herbicide résistance test of the Tl génération plants of the herbicide-resistant rice lines with knock-up expression of the endogenous PPO1 gene through chromosome fragment inversion
The physical distance between the wild-type rice genome PPO1 gene and CP 12 gene was 911 kb. As shown in Figure 14, a highly-expressing PPO1 gene with a CP 12 promoter-driven PPO1 CDS région was generated on the right side after the inversion of the chromosome fragment between the two genes. A délétion of chromosome fragment could also occur. In order to test whether the new gene could be inherited stably and the influence of the chromosome fragment inversion on genetic stability, molecular détection and herbicide résistance test was carried out on the Tl génération of the PPO1 inversion strain.
First of ail, it was observed that the inversion event had no significant effect on the fertility of the TO génération plants, as ail positive TO strains were able to produce seeds normally. The Tl générations of QY2234/H5-851 strains with the Huaidao No.5 background were selected for détection.
1. Sample préparation:
For QY2234/H5-851, a total of 48 Tl seedlings were planted. Ail the plants grew normally.
2. PCR molecular identification:
1) Détection primer sequence: 5’-3’
PPO-R2: AAGGCTGGAAGCTGTTGGG
CP-R2: CTGAGGAGGCGATAAGAAACGA
PPO-F2: CGGACTTÀTTTCCCACCAGAA
CP-F2: AGGTCTCCTTGAGCTTGTCG pg-Hyg-Rl : TCGTCCATCACAGTTTGCCA pg-35S-F: TGACGTAAGGGATGACGCAC
2) The binding sites of the above primers were shown in Figure 14, wherein the PPO-R2 + 5 CP-R2 was used to detect the fusion fragment of the right CPI2 promoter and the PPO1 coding région after the inversion of the chromosome fragment, and the length of the product was 507bp; the PPO-F2 + CP-F2 was used to detect the fusion fragment of the left PPOl promoter and the CP 12 coding région after the inversion of the chromosome fragment, and the length of the product was 560bp; the PPO-F2 + PPO-R2 was used to detect the left PPO target site before the 10 inversion, and the length of the product in the wild-type control was 586bp; the CP-F2 + CP-R2 was used to detect the right CP 12 target site before the inversion, and the length of the product in the wild-type control was 481 bp. The pg-Hyg-Rl + pg-35S-F was used to detect the T-DNA fragment of the editing vector, and the length of the product was 660bp.
3) PCR reaction System and reaction conditions:
Reaction System (10pL System):
| 2*KOD buffer | 5pL | |
| 2mM dNTPs | 2pL | |
| KOD enzyme | 0.2pL | |
| Primer F | 0.2pL | |
| Primer R | 0.2pL | |
| Water | 2.1pL | |
| Sample | 0.3pL | |
| Reaction conditions: | ||
| 94°C | 2 minutes | |
| 98°C | 20 seconds - | |
| 60°C | 20 seconds | 40 cycles |
| 68°C | 20 seconds | |
| 68°C | 2 minutes | |
| 12°C | 5 minutes | |
| The PCR products were subjected to electrophoresis on a 1% agarose gel with a voltage of | ||
| 180V for 10 minutes. |
3. Molecular détection results:
The détection results were shown in Table 9. A total of 48 plants were detected, of which 12 plants (2/7/11/16/26/36/37/40/41/44/46/47) were homozygous in inversion, 21 plants (1/3/4/5/6/8/9/15/17/20/22/23/24/27/30/31/33/34/39/42/43) were heterozygous in inversion, 5 and 15 plants (10/12/13/14/18/19/21/25/28/29/32/35/38/45/48) were homozygous in non-inversion. The ratio of homozygous inversion: heterozygous inversion: homozygous non-inversion was 1:1.75:1.25, approximately 1:2:1. So the détection results met the Mendel’s law of inheritance, indicating that the new PPO1 gene generated by inversion was heritable.
Table 9: Results of molecular détection
| QY2234-851 | 1 | 2 | 3 | 4 | 5 | 6 | 7 | 8 | 9 | 10 | 11 | 12 | 13 | 14 | 15 | 16 | 17 | 18 | 19 | 20 |
| Right side of inversion | + | + | + | + | + | + | + | + | + | - | + | - | - | - | + | + | + | - | - | + |
| Left side of inversion | + | + | + | + | + | + | + | + | + | - | + | - | - | - | + | + | + | - | - | + |
| PPO WT | + | - | + | + | + | + | - | + | + | + | - | + | + | + | + | - | + | + | + | + |
| CP 12 WT | + | - | + | + | + | + | - | + | + | + | - | + | + | + | + | - | + | + | + | + |
| QY2234-851 | 21 | 22 | 23 | 24 | 25 | 26 | 27 | 28 | 29 | 30 | 31 | 32 | 33 | 34 | 35 | 36 | 37 | 38 | 39 | 40 |
| Right side of inversion | - | + | + | + | - | + | + | - | - | + | + | - | + | + | - | + | + | - | + | + |
| Left side of inversion | - | + | + | + | - | + | + | - | - | + | + | - | + | + | - | + | + | - | + | + |
| PPO WT | + | + | + | + | + | - | + | + | + | + | + | + | + | + | + | - | - | + | + | - |
| CP 12 WT | + | + | + | + | + | - | + | + | + | + | + | + | + | + | + | - | - | + | + | - |
| QY2234-851 | 41 | 42 | 43 | 44 | 45 | 46 | 47 | 48 | ||||||||||||
| Right side of inversion | + | + | + | + | - | + | + | - | ||||||||||||
| Left side of inversion | + | + | + | + | - | + | + | - | ||||||||||||
| PPO WT | - | + | + | - | + | - | - | + | ||||||||||||
| CP12 WT | - | + | + | - | + | - | - | + |
For the above Tl seedlings, the Pg-Hyg-Rl + pg-35S-F primers were used to detect the T-DNA fragment of the editing vector. The electrophoresis results of 16 and 41 were négative for T-DNA fragment, indicating homozygous inversion. It could be seen that non-transgenic strains of homozygous inversion could be segregated from the Tl génération of the inversion event.
4. Sequencing détection of the editing events:
The génotype détection of the inversion events focused on the editing events of the new 5 PPO gene on the right side. The mutation events with the complété protein coding trame of the PPO1 gene were retained. The CP 12 site editing events on the left side that did not affect the normal growth of plants through the phenotype observation in the greenhouse and field were retained. The génotypes of the editing events detected in the inversion event-positive lines were listed below, in which seamless indicated identical to the predicted fusion fragment sequence 10 after inversion. The génotypes of the successful QY2234 inversion events in Huaidao No.5 background were as follows:
| No. | Génotype | No. | Génotype |
| 2234/H5-295 | Right side -Ibp; left side -32bp | 2234/H5-650 | Right side seamless; left side +lbp (G) |
| 2234/H5-381 | Right side +18bp | 2234/H5-263 | Right side seamless; left side seamless |
| 2234/H5-410 | Right side -Ibp; left side + lbp | 2234/H5-555 | Right side -23bp |
| 2234/H5-159 | Right side -16bp | 2234/H5-645 | Right side -5bp, +20bp, |
| 2234/H5-232 | Right side -4bp |
Some of the sequencing peak maps and sequence comparison results were shown in Figure 20.
The génotypes of the successful QY2234 inversion in the Jinjing818 background were as follows:
| No. | Right side PPO génotype | No. | Right side PPO génotype |
| 2234/818-5 | Right side seamless | 2234/818-144 | Right side +lbp |
| 2234/818-42 | Right side -16bp | 2234/818-151 | Sight side +2bp, -26bp, pure peak |
| 2234/818-108 | Right side -15bp | 2234/818-257 | Sight side +lbp |
| 2234/818-134 | Right side +5bp, -15bp |
Some of the sequencing peak maps and sequence comparison results were shown in Figure
21.
The sequencing results of the above different new PPO1 genes with the CP 12 promoter fused to the PPO1 coding région were shown in SEQ ID NO: 20, SEQ ID NO: 21, SEQ ID NO: 22, SEQ ID NO: 23, SEQ ID NO: 24, SEQ ID NO : 25, and SEQ ID NO: 26.
5. Herbicide résistance test of Tl génération seedlings:
The herbicide résistance test was performed on the Tl génération of the QY2234/H5-851 PPO1 inversion lines at seedling stage. The wild-type Huaidao No.5 was used as a control, and planted simultaneously with the Tl génération seeds of the inversion lines. When the seedlings reached a plant height of 15 cm, 2081 was applied by spraying at four levels of 0.3, 0.6, 0.9 and 1.2 g a.i./mu. The culture conditions were 28°C, with 16 hours of light and 8 hours of darkness.
The résistance test results were shown in Figure 22. After 5 days of the application, the wild-type control rice seedlings showed obvious phytotoxicity at a dose of 0.3 g a.i./mu. They began to wither from the tip of leaf, and necrotic spots appeared on the leaves; at a dose of 0.6 g a.i./mu, the plants died quickly. However, QY2234/H5-851 Tl seedlings could maintain normal growth at a dose of 0.3 g a.i./mu, and no obvious phytotoxicity could be observed on the leaves; at doses of 0.6 and 0.9 g ai/mu, some Tl seedlings showed dry leaf tips, but most Tl seedlings could keep green and continue to grow, while the control substantially died off. At a dose of 1.2 g a.i./mu, the control plants were ail dead, while some of the Tl seedlings could keep green and continue to grow. The test results indicated that the résistance of the PPO1 gene inversionlines to 2081 couldbestablyinheritedtotheirTl génération.
Example 7: An editing method for knocking up the expression of theendogenous EPSPS gene in plant
EPSPS was a key enzyme in the pathway of aromatic amino acid synthesis in plants and the target site of the biocidal herbicide glyphosate. The high expression level of EPSPS gene could endow plants with résistance to glyphosate. The EPSPS gene (as shown in SEQ ID NO: 4, in which l-1897bp was the promoter, and the rest was the expression région) was located on chromosome 6 in rice. The gene upstream was transketolase (TKT, as shown in SEQ ID NO: 3, in which 1-2091 bp was the promoter, and the rest was the expression région) with an opposite direction. The expression intensity of TKT gene in leaves was 20-50 times that of the EPSPS gene. As shown in Figure 2, by simultaneously inducing double-strand breaks between the promoter and the CDS région of the two genes respectively, the inversion (Scheme 1) or inversion doubling (Scheme 2) of the région between the two breaks could be obtained after screening. In both cases, the promoter of the EPSPS gene would be replaced with the promoter of the TKT gene, thereby increasing the expression level of the EPSPS gene and obtaining the résistance to glyphosate. In addition, the Schemes 3, 4 and 5 as shown in Figure 2 could also create new EPSPS genes driven by the TKT gene promoter. The gene structure of EPSPS 5 adjacent to and opposite in direction relative to TKT was conserved in monocotyledonous plants (Table 10). While in dicotyledonous plants, both genes were also adjacent yet in the same direction; therefore, this method was universal in plants.
Table 10: Distance between the EPSPS gene and the adjacent TKT gene in different plants
| Species | Location (chromosome) | Distance from CD région start site (kb) | Direction |
| Rice | 6 | 4 | Reverse <TKT-EPSPS> |
| Wheat | 7A | 35 | Reverse <TKT-EPSPS> |
| 7D | 15 | Reverse <TKT-EPSPS> | |
| 4A? | 50 | Reverse <TKT-EPSPS> | |
| Maize | 9 | 22 | Reverse <TKT-EPSPS> |
| Brachypodium distachyon | 1 | 5 | Reverse <TKT-EPSPS> |
| Sorghum | 10 | 15 | Reverse <TKT-EPSPS> |
| Millet | 4 | 5 | Reverse <TKT-EPSPS> |
| Soybean | 3 | 6 | Forward TKT>EPSPS> |
| Tomato | 5 | 6 | Forward TKT>EPSPS> |
| Peanut | 2 | 6 | Forward TKT>EPSPS> |
| 12 | 5 | Forward TKT>EPSPS> | |
| Cotton | 9 | 22 | Forward TKT>EPSPS> |
| Alfalfa | 4 | 8 | Forward TKT>EPSPS> |
| Arabidopsis | 2 | 5 | Forward TKT>EPSPS> |
| Grape | 15 | 17 | Forward TKT>EPSPS> |
To this end, pHUE411 was used as the backbone, and the following as targets:
Name of target sgRNA OsEPSPS-guide RNA1 OsEPSPS-guide RNA2 OsEPSPS-guide RNA3
Sequence(5' to 3)
CCACACCACTCCTCTCGCCA
CCATGGCGAGAGGAGTGGTG
ATGGTCGCCGCCATTGCCGG
OsEPSPS-guide RNA4
OsEPSPS-guide RNA5
OsEPSPS-guide RNA6
GACCTCCACGCCGCCGGCAA
TAGTCATGTGACCATCCCTG
TTGACTCTTTGGTTCATGCT
Several different dual-target vectors had been constructed:
PQY002061 pHUE41 l-EPSPS-sgRNAl+3
PQY002062 pHUE4l l-EPSPS-sgRNA2+3
PQY002063 pHUE4l l-EPSPS-sgRNAl+4
PQY002064 pHUE4l l-EPSPS-sgRNA2+4
PQY002093 pHUE41l-EPSPS-sgRNA2+5
PQY002094 pHUE41 l-EPSPS-sgRNA2+6 (2) With the relevant détection primers shown in the following table, the fragments containing the target sites on both sides or the predicated fragments generated by the fusion of 10 the UBI2 promoter and the HPPD coding région were amplified, and the length of the products is between 300-1000 bp.
| Primer | Sequence (5' to 3') |
| EPSPSinversion checkFl | ATCCAAGTTACCCCCTCTGC |
| EPSPSinversion checkRl | CACAAACACAGCCACCTCAC |
| EPSPSinversion check-nestF2 | ATGTCCACGTCCACACCATA |
| EPSPSinversion check-nestR2 | AATGGAATTCACGCAAGAGG |
| EPSPSinversion checkF3 | GTAGGGGTTCTTGGGGTTGT |
| EPSPSinversion checkR3 | CGCATGCTAACTTGAGACGA |
| EPSPSinversion check-nestF4 | GGATCGTGTTCACCGACTTC |
| EPSPSinversion check-nestR4 | CCGGTACAACGCACGAGTAT |
| EPSPSinversion checkF5 | GGCGTCATTCCATGGTTGATTGT |
| EPSPSinversion checknestFô | GATAGACCCAGATGGGCATAGAATC |
| EPSPSinversion checkR5 | TGCATGCATTGATGGTTGGTGC |
| EPSPSinversion checknestRô | CCGGCCCTTAGAATAAAGGTAGTAG |
After protoplast transformation, the détection results showed that the expected inversion events were obtained. As shown in Figure 15, the sequencing resuit of the inversion détection of pQY002062 vector transformed protoplast was shown in SEQ ID NO: 11; the sequencing resuit of the délétion détection of pQY002062 vector transformed protoplast was shown in SEQ ID No: 12; the sequencing resuit of the inversion détection of the pQY002093 vector transformed protoplast was shown in SEQ ID NO: 13; and the sequencing resuit of the délétion détection of pQY002093 vector transformed protoplast was shown in SEQ ID NO: 14.
These vectors were transferred into Agrobacterium for transforming calli of rice. Plants containing the new EPSPS gene were obtained. The herbicide bioassay results showed that the plants had obvious résistance to glyphosate herbicide.
Example 8: An editing method for knocking up the expression of the endogenous PPO gene in Arabidopsîs
Protoporphyrinogen oxidase (PPO) was one of the main targets of herbicides. By highly expressing plant endogenous PPO, the résistance to PPO inhibitory herbicides could be significantly increased. The Arabidopsîs PPO gene (as shown in SEQ ID NO: 1, in which l-2058bp was the promoter, and the rest was the expression région) located on chromosome 4, and the ubiquitinlO gene (as shown in SEQ ID NO: 2, in which l-2078bp was the promoter, and the rest was the expression région) located 1.9M downstream with the same direction as the PPO gene.
As shown in the Scheme as shown in Figure 3, simultaneously generating double-strand breaks at the sites between the promoter and the CDS région of the PPO and the ubiquitinlO genes respectively. Doubling events of the région between the two breaks could be obtained by screening, namely a new gene generated by fusing the ubiquitinlO promoter and the PPO coding région. In addition, following Scheme 2 as shown in Figure 1, a new gene in which the ubiquitinlO promoter and the PPO coding région were fused together could also be created.
To this end, pHEE401E was used as the backbone (https://www.addgene.Org/71287/). and the following locations were used as target sites:
| Name of target sgRNA | Sequence(5' to 3) |
| AtPPO-guide RNA1 | CAAACCAAAGAAAAAGTATA |
| AtPPO-guide RNA2 | GGTAATCTTCTTCAGAAGAA |
| AtPPO-guide RNA3 | ATCATCTTAATTCTCGATTA |
| AtPPO-guide RNA4 | TTGTGATTTCTATCTAGATC |
The dual-target vectors were constructed following the method described by Wang ZP, Xing HL, Dong L, Zhang HY, Han CY, Wang XC, Chen QJ. Egg cell-specific promoter-controlled CRISPR/Cas9 efficiently générâtes homozygous mutants for multiple target genes in Arabidopsis in a single génération. Genome Biol .2015 Jul 21 ; 16:144.:
PQY002076
PQY002077
PQY002078
PQY002079 pHEE401E- AtPPO-sgRN A1 +3 pHEE401 E-AtPPO-sgRN A1 +4 pHEE401E-AtPPO-sgRN A2+3 pHEE401 E-AtPPO-sgRNA2+4
Arabidopsis was transformed according to the method as follows:
(Y) Agrobacterium transformation
Agrobacterium GV3101 competent cells were transformed with the recombinant plasmids to obtain recombinant Agrobacterium. .
(2) Préparation of Agrobacterium infection solution
1) Activated Agrobacterium was inoculated in 30ml of YEP liquid medium (containing 25mg/L Rif and 50mg/L Kan), cultured at 28°C under shaking at 200 rpm ovemight until the OD600 value was about 1.0-1.5.
2) The bacteria were collected by centrifugation at 6000 rpm for 10 minutes, and the supematant was discarded.
3) The bacteria was resuspended in the infection solution (no need to adjust the pH) to reach QD600=0.8 for later use.
(3) Transformation of Arabidopsis
1) Before the plant transformation, the plants should grow well with luxuriant inflorescence and no stress response. The first transformation could be carried out as long as the plant height reached 20cm. When the soil was dry, watering was carried out as appropriate. On the day before the transformation, the grown siliques were eut with scissors.
2) The inflorescence of the plant to be transformed was immersed in the above solution for 30 seconds to 1 minute with gentle stirring. The infiltrated plant should hâve a layer of liquid film thereon.
3) After transformation, the plant was cultured in the dark for 24 hours, and then removed to a normal light environment for growth.
4) After one week, the second transformation was carried out in the same way.
(4) Seed harvest
Seeds were harvested when they were mature. The harvested seeds were dried in an oven at 37°C for about one week.
(5) Sélection of transgenic plants
The seeds were treated with disinfectant for 5 minutes, washed with ddHiO for 5 times, and then evenly spread on MS sélection medium (containing 30pg/ml Hyg, 100pg/ml Cef). Then the medium was placed in a light incubator (at a température of 22□, 16 hours of light and 8 hours of darkness, light intensity 100-150 pmol/m2/s, and a humidity of 75%) for cultivation. The positive seedlings were selected and transplanted to the soil after one week.
(6) Détection of Tl mutant plants (6.1) Genomic DNA extraction
1) About 200 mg of Arabidopsis leaves was eut and placed into a 2 ml centrifuge tube. Steel balls were added, and the leaves were ground with a high-throughput tissue disruptor.
2) After thorough grinding, 400pL of SDS extraction buffer was addedand mixed upside down. The mixture was incubated in a 65 □ water bath for 15 minutes, and mixed upside down every 5 minutes during the period.
3) The mixture was centriftiged at 13000rpm for 5 minutes.
4) 300pL of supematant was removed and transferred to a new 1.5ml centrifuge tube, an equal volume of isopropanol pre-cooled at -20°C was added into the centrifuge tube, and then the centrifuge tube was kept at -20°C for 1 hour or ovemight.
5) The mixture was centriftiged at 13000rpm for 10 minutes, and the supematant was discarded.
6) 500pL of 70% éthanol was added to the centrifuge tube to wash the precipitate, the washing solution was discarded after centrifugation (careftilly not discarding the precipitate). After the precipitate was dried at room température, 30pL of ddH2Ü was added to dissolve the DNA, and then stored at -20 □.
(6.2) PCR amplification
With the extracted genome of the Tl plant as template, the target fragment was amplified with the détection primers. 5 pL of the amplification product was taken and detected by 1% agarose gel electrophoresis, and then imaged by a gel imager. The remaining product was directly sequenced by a sequencing company.
The sequencing results showed that the AtPPOl gene doubling was successfully achieved in Arabidopsis, and the herbicide résistance test showed that the doubling plant had résistance to PPO herbicides.
Example 9: Création of GH1 gene with new expression characteristics in zebrafish
The growth hormone (GH) genes in fishes controlled their growth and development speed. At présent, highly expressing the GH gene in Atlantic salmons through the transgenic technology could significantly increase their growth rates. The technique was of great economical value, but only approved for marketing after décades. The GH1 gene was the growth hormone gene in zebrafish. In the présent invention, suitable promoters in zebrafish (suitable in terms of continuous expression, strength, and tissue specificity) were fused together in vivo through délétion, inversion, doubling, inversion doubling, chromosome transfer, etc., to create a fast-growing fish variety.
Ail publications and patent applications mentioned in the description are incorporated herein by reference, as if each publication or patent application is individually and specifically incorporated herein by reference.
Although the foregoing invention has been described in more detail by way of examples and embodiments for clear understanding, it is obvious that certain changes and modifications can be implemented within the scope of the appended daims, such changes and modifications are ail within the scope of the présent invention.
Claims (44)
- (1) the nucleic acid sequence as shown in SEQ ID NO: 15, SEQ ID NO: 16, SEQ ID NO:(1) the nucleic acid sequence as shown in SEQ ID NO: 11, SEQ ID NO: 12, SEQ ID NO: 13 or SEQ ID NO: 14 or a portion thereof or a complementary sequence thereof;(1) the nucleic acid sequence as shown in SEQ ID NO: 9, SEQ ID NO: 10, SEQ ID NO: 18 or SEQ ID NO: 19 or a portion thereof or a complementary sequence thereof;1. A method for creating a new gene in an organism, characterized by comprising the following steps:simultaneously generating DNA breaks at two or more different spécifie sites in the organism’s genome, wherein the spécifie sites are genomic sites capable of separating different genetic éléments or different protein domains, and the DNA breaks are ligated to each other by a non-homologous end joining (NHEJ) or homologous repair, generating a new combination of the different genetic éléments or different protein domains different from the original genomic sequence, thereby creating the new gene.
- (2) a sequence having an identity of at least 75%, at least 80%, at least 85%, at least 90%, at least 95%, at least 98% or at least 99% to any one of the sequences as defined in (1); or (3) a nucleic acid sequence capable of hybridizing to the sequence as shown in (1) or (2) under a stringent condition.(2) a sequence having an identity of at least 75%, at least 80%, at least 85%, at least 90%, at least 95%, at least 98% or at least 99% to any one of the sequences as defmed in (1); or (3) a nucleic acid sequence capable of hybridizing to the sequence as shown in (1) or (2) under a stringent condition.(2) a sequence having an identity of at least 75%, at least 80%, at least 85%, at least 90%, at least 95%, at least 98% or at least 99% to any one of the sequences as defmed in (1); or (3) a nucleic acid sequence capable of hybridizing to the sequence as shown in (1) or (2) under a stringent condition.2. The method according to claim 1, characterized in that said two or more different spécifie sites locate on the same chromosome or on different chromosomes, or may be spécifie sites on at least two different genes, or may be at least two different spécifie sites on the same gene, wherein said at least two different genes may hâve the same or different transcription directions.
- 3. The method according to claim 1 or 2, characterized in that said DNA breaks are achieved by delivering a nuclease with targeting property into a cell of the organism to contact with the spécifie sites of the genomic DNA.
- 4. The method according to claim 3, characterized in that said nuclease with targeting property is selected from the group consisting of Meganuclease, Zinc finger nuclease, TALEN, and CRISPR/Cas System.
- 5. The method according to claim 3 or 4, characterized in that the nuclease with targeting property is in the form of DNA, or exists in the form of mRNA or protein, but not DNA.
- 6. The method according to any one of claims 3-5, characterized in that the nucleases with targeting property are delivered into the cell by: 1) a PEG-mediated cell transfection method; 2) a liposome-mediated cell transfection method; 3) an electric shock transformation method; 4) a microinjection; 5) a gene gun bombardment; or 6) an Jgro&acterzw/M-mediated transformation method.
- 7. The method according to any one of claims 1-6, characterized in that said gene éléments are selected from the group consisting of a promoter, a 5' untranslated région, a coding région or non-coding RNA région, a 3' untranslated région, a terminator of the gene, or any combination thereof.
- 8. The method according to any one of claims 1-7, characterized in that the combination of different gene éléments is a combination of the promoter of one of the two genes with different expression patterns and the coding région or the non-coding RNA région of the other gene, a combination of the région from the promoter to 5'UTR of one of the two genes with different expression patterns and the CDS or non-coding RNA région of the other gene, or a combination of adjacent gene éléments of the same gene.
- 9. The method of claim 8, characterized in that, in the combination of different gene éléments, one element is a strong endogenous promoter of the organism, and the other is the coding région of HPPD, EPSPS, PPO or GH1 gene.
- 10. The method according to claim 8 or 9, characterized in that the different expression patterns are different levels of gene expression, different tissue-specificities of gene expression, or different developmental stage-specificities of gene expression.
- 11. The method according to any one of claims 1-10, characterized in that the protein domain is a DNA fragment corresponding to a spécifie functional domain of a protein.
- 12. The method according to claim 11, characterized in that the protein domain is selected from the group being consisted of: a nuclear localization signal, a chloroplast leading peptide, a mitochondrial leading peptide, a phosphorylation site, a méthylation site, a transmembrane domain, a DNA binding domain, a transcription activation domain, a receptor activation domain, or an enzyme catalytic center.
- 13. The method according to any one of claims 1-12, characterized in that the combination of different protein domains is a combination of the localization signal région of one of two proteins with different subcellular localizations and the mature protein coding région of the other gene, a combination of two protein domains with different biological functions, or a combination of adjacent protein domains of the same gene.
- 14. The method of claim 13, characterized in that the different subcellular locations are selected from the group consisting of nuclear location, cytoplasmic location, cell membrane location, chloroplast location, mitochondrial location, and endoplasmic réticulum membrane location, or the different biological functions are selected from the group consisting of récognition of spécifie DNA or RNA conserved sequence, activation of gene expression, binding to a protein ligand, binding to small molecular signal, binding to an ion, spécifie enzymatic reaction, and any combination thereof.
- 15. The method according to any one of daims 1-7 and 12, characterized in that the combination of gene éléments and protein domains are a combination of protein domains and adjacent promoters, 5'UTR, 3'UTR or terminators of the same gene.
- 16. A new gene obtainable by the method according to any one of daims 1-15.
- 17, SEQ ID NO: 20, SEQ ID NO: 21, SEQ ID NO: 22, SEQ ID NO: 23, SEQ ID NO: 24, SEQ ID NO: 25, or SEQ ID NO: 26 or a portion thereof or a complementary sequence thereof;17. The new gene according to daim 16, characterized in that compared with the original gene, the new gene either has a different promoter and therefore is expressed with a different spatial-temporal characteristics or a different intensity characteristics or a different developmental stage characteristics, or has a new amino acid sequence.
- 18. The new gene according to daim 7, characterized in that the new amino acid sequence may either be an intégral fusion of two or more gene coding régions, or a partial fusion of coding régions or a doubling of a part of protein coding région of the same gene.
- 19. The new gene according to daim 16 or 17, characterized in that the new gene is a highly-expressed endogenous HPPD, EPSPS, PPO or GH1 gene in the organism.
- 20. A DNA containing the gene according to any one of daims 16-19.
- 21. A protein encoded by the gene according to any one of daims 16-19, or a biologically active fragment thereof.
- 22. A recombinant expression vector, which comprises the gene according to any one of daims 16-19 and a promoter operably linked thereto.
- 23. An expression cassette comprising the gene according to any one of daims 16-19.
- 24. A host cell, which comprises the expression cassette according to daim 23.
- 25. An organism regenerated from the host cell according to daim 24.
- 26. Use of the gene according to any one of daims 16-19 in conferring or improving a resistance/tolerance trait or a growth advantage trait in an organism.
- 27. A composition, which comprises:(a) a promoter of one of two genes with different expression patterns and a coding région or non-coding RNA région of the other gene;(b) a promoter to a 5' untranslated région of one of two genes with different expression patterns and a coding région or non-coding RNA région of the other gene;(c) adjacent gene éléments within the same gene;(d) a localization signal région of one of the two protein coding genes with different subcellular localizations and a mature protein coding région of the other gene;(e) two protein domains with different biological fûnctions;(f) adjacent protein domains in the same gene; or, (g) a protein domain and an adjacent promoter, 5' untranslated région, 3' non-coding région or terminator in the same gene.
- 28. An editing method for increasing the gene expression level of a target endogenous gene in an organism, which is independent of an exogenous DNA donor fragment, which comprises the following steps:simultaneously generating DNA breaks separately at selected sites between the promoter and the coding région of each of the target endogenous gene and an optional endogenous highly-expressing gene; ligating the DNA breaks to each other by means of non-homologous end joining (NHEJ) or homologous repair, thereby generating an in vivo fusion of the coding région of the target endogenous gene and the optional strong endogenous promoter to form a new highly-expressing endogenous gene, wherein the target endogenous gene and the optional endogenous highly-expressing gene are located on the same chromosome or different chromosomes.
- 29. An editing method for knocking up the expression of an endogenous HPPD, EPSPS or PPO gene in a plant, characterized in that it comprises fusing the coding région of the HPPD, EPSPS or PPO gene with a strong endogenous promoter of a plant in vivo to form a new highly-expressing plant endogenous HPPD, EPSPS or PPO gene, respectively.
- 30. The editing method according to claim 29, characterized in that it comprises the following steps: simultaneously generating DNA breaks respectively in selected spécifie sites between the promoter and the coding région of each of the HPPD, EPSPS or PPO gene and an optional endogenous highly-expressing gene , ligating the DNA breaks to each other through an intracellular repair pathway, generating in vivo a fusion of the coding région of the HPPD, EPSPS or PPO gene and the optional strong endogenous promoter to form a new highly-expressing HPPD, EPSPS or PPO gene, respectively.
- 31. The editing method according to claim 29 or 30, characterized in that it comprises fusing the coding région of the HPPD gene with a strong endogenous promoter of a rice, wherein the strong promoter is a promoter of ubiquitin2 gene, it comprises fusing the coding région of the EPSPS gene with a strong endogenous promoter of a rice, wherein the strong promoter is a promoter of TKT gene, or it comprises fusing the coding région of the PPO gene with a strong endogenous promoter of a rice or an Arabidopsis, wherein in rice, the strong promoter is a promoter of CP 12 gene, and in Arabidopsis, the strong promoter is a promoter of ubiquitinlO gene.
- 32. A highly-expressing plant endogenous HPPD, EPSPS or PPO gene obtainable by the editing method according to any one of daims 29-31.
- 33. A highly-expressing rice endogenous HPPD gene, which has a sequence selected from the group consisting of:
- 34. A highly-expressing rice endogenous EPSPS gene, which has a sequence selected from the group consisting of:
- 35. A highly-expressing rice endogenous PPO gene, which has a sequence selected from the following:
- 36. A DNA comprising the gene according to any one of daims 32-35.
- 37. A protein or a biologically active fragment thereof encoded by the gene according to any one of claims 32-35.
- 38. A recombinant expression vector, which comprises the gene according to any one of claims 32-35, and a promoter operably linked thereto.
- 39. An expression cassette comprising the gene according to any one of claims 32-35.
- 40. A plant host cell, which comprises the expression cassette according to daim 39.
- 41. A plant regenerated from the plant host cell according to daim 40.
- 42. A method for producing a plant with an increased résistance or tolérance to an herbicide, which comprises regenerating the plant host cell according to daim 40 into a plant and a progeny derived therefrom.
- 43. A rice résistant to a herbicide, which comprises one or a combination of two or more of the highly-expressing rice endogenous HPPD gene according to daim 33, the highly-expressing rice endogenous EPSPS gene according to daim 34, and the highly-expressing rice endogenous PPO gene according to daim 35.
- 44. A method for controlling a weed in a cultivation site of a plant, wherein the plant is selected from the group consisting of the plant according to daim 41, a plant prepared by the method according to daim 42, or the rice according to daim 43, wherein the method comprises applying to the cultivation site one or more of HPPD, EPSPS or PPO inhibitory herbicides in an amount for effectively controlling the weed.
Applications Claiming Priority (2)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| CN201911073406.1 | 2019-11-06 | ||
| CN202011190279.6 | 2020-10-30 |
Publications (1)
| Publication Number | Publication Date |
|---|---|
| OA21191A true OA21191A (en) | 2024-02-29 |
Family
ID=
Similar Documents
| Publication | Publication Date | Title |
|---|---|---|
| JP7559047B2 (en) | Methods for producing novel genes in organisms and uses thereof | |
| JP7822931B2 (en) | Methods for generating novel mutations in organisms and their applications | |
| WO2022233271A1 (en) | Method for generating new gene in organism and use thereof | |
| US12428627B2 (en) | Mutant p-hydroxyphenylpyruvate dioxygenase, and coding nucleic acid and use thereof | |
| JP2018503392A (en) | Method for performing site-specific modification in complete plants by gene transient expression | |
| CN121443140A (en) | Allele combinations for crop yield enhancement | |
| OA21191A (en) | Method for creating new gene in organism and use thereof. | |
| HK40072923A (en) | Method for creating new gene in organism and use thereof | |
| RU2832668C1 (en) | Method of creating new gene in organism and application thereof | |
| CN118510388A (en) | Improved screening methods for genome editing events | |
| RU2833881C1 (en) | Method of creating new mutations in organisms and application thereof | |
| HK40050322A (en) | Method for generating new mutations in organisms, and application thereof | |
| RU2781830C2 (en) | Mutant p-hydroxyphenylpyruvatedioxygenase, nucleic acid encoding it, and their use | |
| WO2024133272A1 (en) | Increased editing efficiency by co-delivery of rnp with nucleic acid |