WO2024179544A1 - 编码rna的工程化dna分子 - Google Patents
编码rna的工程化dna分子 Download PDFInfo
- Publication number
- WO2024179544A1 WO2024179544A1 PCT/CN2024/079346 CN2024079346W WO2024179544A1 WO 2024179544 A1 WO2024179544 A1 WO 2024179544A1 CN 2024079346 W CN2024079346 W CN 2024079346W WO 2024179544 A1 WO2024179544 A1 WO 2024179544A1
- Authority
- WO
- WIPO (PCT)
- Prior art keywords
- poly
- tail
- dna molecule
- coding sequence
- rna
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Ceased
Links
Classifications
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N15/00—Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
- C12N15/09—Recombinant DNA-technology
- C12N15/63—Introduction of foreign genetic material using vectors; Vectors; Use of hosts therefor; Regulation of expression
- C12N15/67—General methods for enhancing the expression
-
- C—CHEMISTRY; METALLURGY
- C07—ORGANIC CHEMISTRY
- C07K—PEPTIDES
- C07K14/00—Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof
- C07K14/005—Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof from viruses
-
- C—CHEMISTRY; METALLURGY
- C07—ORGANIC CHEMISTRY
- C07K—PEPTIDES
- C07K14/00—Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof
- C07K14/005—Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof from viruses
- C07K14/01—DNA viruses
- C07K14/025—Papovaviridae, e.g. papillomavirus, polyomavirus, SV40, BK virus, JC virus
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N1/00—Microorganisms; Compositions thereof; Processes of propagating, maintaining or preserving microorganisms or compositions thereof; Processes of preparing or isolating a composition containing a microorganism; Culture media therefor
- C12N1/20—Bacteria; Culture media therefor
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N15/00—Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
- C12N15/09—Recombinant DNA-technology
- C12N15/11—DNA or RNA fragments; Modified forms thereof; Non-coding nucleic acids having a biological activity
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N15/00—Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
- C12N15/09—Recombinant DNA-technology
- C12N15/11—DNA or RNA fragments; Modified forms thereof; Non-coding nucleic acids having a biological activity
- C12N15/113—Non-coding nucleic acids modulating the expression of genes, e.g. antisense oligonucleotides; Antisense DNA or RNA; Triplex- forming oligonucleotides; Catalytic nucleic acids, e.g. ribozymes; Nucleic acids used in co-suppression or gene silencing
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N15/00—Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
- C12N15/09—Recombinant DNA-technology
- C12N15/11—DNA or RNA fragments; Modified forms thereof; Non-coding nucleic acids having a biological activity
- C12N15/62—DNA sequences coding for fusion proteins
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N15/00—Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
- C12N15/09—Recombinant DNA-technology
- C12N15/63—Introduction of foreign genetic material using vectors; Vectors; Use of hosts therefor; Regulation of expression
- C12N15/67—General methods for enhancing the expression
- C12N15/68—Stabilisation of the vector
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N15/00—Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
- C12N15/09—Recombinant DNA-technology
- C12N15/63—Introduction of foreign genetic material using vectors; Vectors; Use of hosts therefor; Regulation of expression
- C12N15/70—Vectors or expression systems specially adapted for E. coli
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N15/00—Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
- C12N15/09—Recombinant DNA-technology
- C12N15/63—Introduction of foreign genetic material using vectors; Vectors; Use of hosts therefor; Regulation of expression
- C12N15/74—Vectors or expression systems specially adapted for prokaryotic hosts other than E. coli, e.g. Lactobacillus, Micromonospora
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N15/00—Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
- C12N15/09—Recombinant DNA-technology
- C12N15/63—Introduction of foreign genetic material using vectors; Vectors; Use of hosts therefor; Regulation of expression
- C12N15/79—Vectors or expression systems specially adapted for eukaryotic hosts
- C12N15/85—Vectors or expression systems specially adapted for eukaryotic hosts for animal cells
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N15/00—Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
- C12N15/09—Recombinant DNA-technology
- C12N15/63—Introduction of foreign genetic material using vectors; Vectors; Use of hosts therefor; Regulation of expression
- C12N15/79—Vectors or expression systems specially adapted for eukaryotic hosts
- C12N15/85—Vectors or expression systems specially adapted for eukaryotic hosts for animal cells
- C12N15/86—Viral vectors
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N5/00—Undifferentiated human, animal or plant cells, e.g. cell lines; Tissues; Cultivation or maintenance thereof; Culture media therefor
- C12N5/10—Cells modified by introduction of foreign genetic material
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12P—FERMENTATION OR ENZYME-USING PROCESSES TO SYNTHESISE A DESIRED CHEMICAL COMPOUND OR COMPOSITION OR TO SEPARATE OPTICAL ISOMERS FROM A RACEMIC MIXTURE
- C12P19/00—Preparation of compounds containing saccharide radicals
- C12P19/26—Preparation of nitrogen-containing carbohydrates
- C12P19/28—N-glycosides
- C12P19/30—Nucleotides
- C12P19/34—Polynucleotides, e.g. nucleic acids, oligoribonucleotides
-
- C—CHEMISTRY; METALLURGY
- C40—COMBINATORIAL TECHNOLOGY
- C40B—COMBINATORIAL CHEMISTRY; LIBRARIES, e.g. CHEMICAL LIBRARIES
- C40B40/00—Libraries per se, e.g. arrays, mixtures
- C40B40/04—Libraries containing only organic compounds
- C40B40/06—Libraries containing nucleotides or polynucleotides, or derivatives thereof
- C40B40/08—Libraries containing RNA or DNA which encodes proteins, e.g. gene libraries
-
- C—CHEMISTRY; METALLURGY
- C07—ORGANIC CHEMISTRY
- C07K—PEPTIDES
- C07K2319/00—Fusion polypeptide
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N2710/00—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA dsDNA viruses
- C12N2710/00011—Details
- C12N2710/20011—Papillomaviridae
- C12N2710/20022—New viral proteins or individual genes, new structural or functional aspects of known viral proteins or genes
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N2710/00—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA dsDNA viruses
- C12N2710/00011—Details
- C12N2710/20011—Papillomaviridae
- C12N2710/20051—Methods of production or purification of viral material
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N2800/00—Nucleic acids vectors
- C12N2800/10—Plasmid DNA
- C12N2800/106—Plasmid DNA for vertebrates
- C12N2800/107—Plasmid DNA for vertebrates for mammalian
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N2830/00—Vector systems having a special element relevant for transcription
- C12N2830/50—Vector systems having a special element relevant for transcription regulating RNA stability, not being an intron, e.g. poly A signal
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12R—INDEXING SCHEME ASSOCIATED WITH SUBCLASSES C12C - C12Q, RELATING TO MICROORGANISMS
- C12R2001/00—Microorganisms ; Processes using microorganisms
- C12R2001/01—Bacteria or Actinomycetales ; using bacteria or Actinomycetales
- C12R2001/185—Escherichia
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12R—INDEXING SCHEME ASSOCIATED WITH SUBCLASSES C12C - C12Q, RELATING TO MICROORGANISMS
- C12R2001/00—Microorganisms ; Processes using microorganisms
- C12R2001/01—Bacteria or Actinomycetales ; using bacteria or Actinomycetales
- C12R2001/185—Escherichia
- C12R2001/19—Escherichia coli
Definitions
- the present application relates to the field of biotechnology, and in particular to an RNA comprising a poly(A) tail.
- the poly(A) tail enables the replication of the DNA encoding the RNA in a prokaryotic system to have a higher stability, and can be used to regulate the expression level of the RNA in a eukaryotic cell.
- the primary structure of a translatable mRNA drug molecule consists of a 5' cap structure, a 5' untranslated region (5'UTR), a coding region, a 3' untranslated region, and a polyadenylic acid tail (poly(A) tail).
- the known functions of the poly(A) tail include maintaining the in vivo stability of mRNA molecules and participating in the initiation of protein translation, which is achieved through the interaction between poly(A) tail binding protein (PABP) and the translation initiation complex.
- PABP poly(A) tail binding protein
- the poly(A) tail is synthesized by a typical poly(A) polymerase as a post-transcriptional modification.
- the first step in the in vitro preparation of mRNA drugs is to synthesize them through in vitro transcription (IVT) using a linearized plasmid containing a designed product sequence as a template, and the Poly(A) tail is usually added downstream of the 3’UTR in a co-transcriptional manner.
- IVT in vitro transcription
- the poly(dA:dT) repeat sequence in the plasmid is unstable during replication in Escherichia coli, and deletion mutations often occur, resulting in shortening of poly(dA:dT). This phenomenon is not conducive to the preparation process of in vitro transcription template plasmids produced on a large scale through fermentation, and Poly(A) truncation has a significant effect on the in vivo stability and biological activity of mRNA.
- the element a is composed of a plurality of consecutive adenine (A) nucleotides, and the length of the element a ranges from ⁇ 20 nt;
- the element b is composed of a plurality of consecutive A nucleotides, and the length of the element b is in the range of 3nt ⁇ b ⁇ 20nt;
- the element c consists of a non-A nucleotide, and the nucleotide is selected from T, C, and G nucleotides;
- Poly(A) tail coding sequence does not contain elements b adjacent to each other, does not contain elements c adjacent to each other, and does not contain elements d adjacent to each other.
- the Poly (A) tail coding sequence further comprises a unique element e, wherein the element e consists of one or two consecutive A's; and is located at the 3' end of the Poly (A) tail coding sequence and is adjacent to element d or element c.
- the number of the elements b is 2-10, for example 3, 4, 5, 6, 7, 8, or 9.
- the number of the elements c is 0 to 10, for example, 1, 2, 3, 4, 5, 6, 7, 8, or 9.
- the number of the elements d is 0 to 5, such as 1, 2, 3, or 4.
- the ratio of element c and element d is The total number is 2-15, for example 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13 or 14.
- the number of the element a is 1, the number of the element b is 3, the number of the element c is 2, and the number of the element d is 1.
- the number of the element a is 1, the number of the element b is 4, the number of the element c is 4, and the number of the element d is 1.
- the number of the element a is 1, the number of the element b is 5, the number of the element c is 4, and the number of the element d is 1.
- the number of the element a is 1, the number of the element b is 3, the number of the element c is 3, and the number of the element d is 1.
- the element a ⁇ 80 nt. In some embodiments, the element a is 21nt, 22nt, 23nt, 24nt, 25nt, 26nt, 27nt, 28nt, 29nt, 30nt, 31nt, 32nt, 33nt, 34nt, 35nt, 36nt, 37nt, 38nt, 39nt, 40nt, 41nt, 42nt, 43nt, 44nt, 45nt, 46nt, 47nt, 48nt, 49nt, 50nt, 51nt, 52nt, 53nt, 54nt, 55nt, 56nt, 57nt, 58nt, 59nt, 60nt, 61nt, 62nt, 63nt, 64nt, 65nt, 66nt, 67nt, 68nt, 69nt, 70nt, 71nt, 72nt, 73nt, 74nt, 75
- the element b is 3nt, 4nt, 5nt, 6nt, 7nt, 8nt, 9nt, 10nt, 11nt, 12nt, 13nt, 14nt, 15nt, 16nt, 17nt, 18nt or 19nt.
- the element d is 2nt ⁇ d ⁇ 20nt, 3 to 18nt, 5 to 16nt, 4 to 10nt, or 6 to 12nt, for example 2nt, 3nt, 4nt, 5nt, 6nt, 7nt, 8nt, 9nt, 10nt, 11nt, 12nt, 13nt, 14nt, 15nt, 16nt, 17nt, 18nt, 19nt or 20nt, preferably 6nt.
- the length of the Poly (A) tail encoding sequence is greater than 40nt, for example, 41nt, 42nt, 43nt, 44nt, 45nt, 46nt, 47nt, 48nt, 49nt, 50nt, 51nt, 52nt, 53nt, 54nt, 55nt, 56nt, 57nt, 58nt, 59nt, 60nt, 61nt, 62nt, 63nt, 64nt, 65nt, 66nt, 67nt, 68nt, 69nt, 70nt, 71nt, 72nt, 73nt, 74nt, 75nt, 76nt, 77nt, 78nt, 79nt, 80nt ,81nt,82nt,83nt,84nt,85nt,86nt,87nt,88nt,89nt,90nt,91nt,92nt,93
- 50% or more of the polynucleotides of element a are located in the 5' portion or 3' portion of the poly(A) tail coding sequence. In some embodiments, more than 50% of the polynucleotides of element a are located in the 5' portion of the poly(A) tail coding sequence. More than 50% of the polynucleotides of element a are located at the 3' portion of the poly(A) tail coding sequence. In some embodiments, the number of nucleotides of element a located at the 3' portion of the poly(A) tail coding sequence is equal to the number of nucleotides located at the 5' portion of the poly(A) tail coding sequence.
- the element c is G, C or T.
- the element d comprises a palindromic sequence.
- Element d is a palindromic sequence.
- the element d comprises a sequence selected from the following: GATATC (SEQ ID NO: 15), GTATAC (SEQ ID NO: 16), GAATCT (SEQ ID NO: 17), GCATATGACT (SEQ ID NO: 18) and GATATCGTATAC (SEQ ID NO: 19).
- the element d is a sequence selected from the following: GATATC (SEQ ID NO: 15), GTATAC (SEQ ID NO: 16), GAATCT (SEQ ID NO: 17), GCATATGACT (SEQ ID NO: 18) and GATATCGTATAC (SEQ ID NO: 19).
- the element d comprises a polynucleotide sequence as shown in SEQ ID NO: 15.
- the polynucleotide sequence of the element d is as shown in SEQ ID NO: 15.
- the nucleotide at the 3' end of the poly (A) tail encoding sequence is A. In some embodiments, the nucleotide at the 3' end of the poly (A) tail encoding sequence is G. In some embodiments, the nucleotide at the 3' end of the poly (A) tail encoding sequence is C. In some embodiments, the nucleotide at the 3' end of the poly (A) tail encoding sequence is T.
- the 3' portion of the Poly (A) tail coding sequence comprises one or more non-A nucleotides. In some embodiments, the Poly (A) tail coding sequence comprises one or more non-A nucleotides at 1/2 near the 3' end. In some embodiments, the Poly (A) tail coding sequence comprises one or more non-A nucleotides at 1/3 near the 3' end. In some embodiments, the Poly (A) tail coding sequence comprises one or more non-A nucleotides at 1/4 near the 3' end.
- the structure of the Poly(A) tail encoding sequence is:
- the structure of the Poly(A) tail encoding sequence is:
- the structure of the Poly(A) tail encoding sequence is:
- element d consists of 6 or 12 nucleotides.
- the Poly(A) tail encoding sequence is as shown in any one of SEQ ID NO: 1 to 10.
- the Poly (A) tail coding sequence is shown as SEQ ID NO: 3 or SEQ ID NO: 4.
- the engineered DNA molecule is further connected to a target gene fragment on the 5' end side of its Poly (A) tail coding sequence, and the target gene fragment and the Poly (A) tail coding sequence jointly encode RNA.
- the engineered DNA molecule is further connected to a target gene fragment on the 5' end side of its Poly (A) tail coding sequence, and the target gene fragment and the Poly (A) tail coding sequence jointly encode mRNA.
- the target gene fragment comprises a protein coding sequence or a non-protein coding sequence such as a functional RNA coding sequence.
- the target gene fragment further comprises a 5'UTR coding sequence on the 5' end side of the protein coding sequence or the functional RNA coding sequence.
- the target gene fragment further comprises a 3'UTR coding sequence on the 3' end side of the protein coding sequence or the functional RNA coding sequence. In some embodiments, the target gene fragment further comprises a 3'UTR coding sequence on the 3' end side of the protein coding sequence or the functional RNA coding sequence, and further comprises a 5'UTR coding sequence on the 5' end side of the protein coding sequence or the functional RNA coding sequence. In some embodiments, the engineered DNA molecule further comprises a replicon, such as a replication initiation site, such as an ORI.
- the engineered DNA molecule further comprises a marker gene to facilitate screening of cells comprising the engineered DNA molecule, wherein the marker gene is selected from, for example, a resistance gene for antibiotics, a fluorescent protein, etc.
- the DNA molecule further comprises Containing a promoter, the promoter starts the transcription of the target gene fragment and the poly (A) tail coding sequence co-encoding RNA.
- the promoter is a prokaryotic promoter.
- the promoter is a eukaryotic promoter.
- the DNA molecule further comprises a replicon such as a replication initiation site, a promoter, a 5'UTR coding sequence, a protein coding sequence, and a 3'UTR coding sequence. In some embodiments, the DNA molecule further comprises a replicon such as a replication initiation site, a resistance gene, a 5'UTR coding sequence, a protein coding sequence, and a 3'UTR coding sequence. In some embodiments, the DNA molecule further comprises a replicon such as a replication initiation site, a resistance gene, a promoter, a 5'UTR coding sequence, a protein coding sequence, and a 3'UTR coding sequence.
- the protein coding sequence encodes an HPV virus antigen protein. In some embodiments, the HPV protein is derived from type 16 and/or type 18 HPV. In some embodiments, the protein coding sequence encodes the E2, E6 or E7 protein of HPV. In some embodiments, the protein coding sequence encodes a fusion protein of HPV E6 and E7 proteins. In some embodiments, the protein coding sequence encodes a fusion protein of HPV E2, E6 and E7 proteins. In some embodiments, the polypeptide fragment of the fusion protein is derived from HPV type 16 and/or type 18. In some embodiments, the polypeptide fragment of the fusion protein is derived from HPV type 16 and/or type 18 E2, E6 and E7 proteins. In some embodiments, the protein coding sequence encodes a polypeptide as shown in SEQ NO: 26 or a conservatively substituted variant thereof.
- the engineered DNA molecule comprises a polynucleotide sequence as shown in any one of SEQ ID NOs: 22-25, or a synonymous mutant of the polynucleotide sequence as shown in any one of SEQ ID NOs: 22-25, or a polynucleotide sequence having more than 85% sequence identity with the polynucleotide sequence shown in any one of SEQ ID NOs: 22-25 or its synonymous mutant.
- the DNA molecule is a DNA plasmid. In some embodiments, the DNA molecule is a linear plasmid or a circular plasmid. In some embodiments, the DNA molecule is single-stranded or double-stranded. In some embodiments, the plasmid is a plasmid based on pUC, pTZ, pMB1 or pCoIE1. In some embodiments, the plasmid is a plasmid based on a pUC57 vector.
- the cell is a prokaryotic cell. In some embodiments, the cell is a recA-bacterium. In some embodiments, the cell is Escherichia coli. In some embodiments, the Escherichia coli is selected from: K-12 and its derivatives, and B strain and its derivatives.
- the Escherichia coli is selected from: MG1655, DH5 or DH5 ⁇ , DH10B, BL21, DB3.1, HB101, JM109, JM110, MC1061, MG1655, Pir1, Stbl2, Stbl3, Top10, XL1Blue, XL10Gold, BLR, HMS174, Tuner, Rostetta2, Lemo21, T7Express, Origami2.
- the second aspect of the present application discloses a cell comprising the DNA molecule of the aforementioned first aspect.
- the DNA molecule of the aforementioned first aspect can be replicated and/or transcribed in the cell.
- the cell is a prokaryotic cell, and the DNA molecule of the aforementioned first aspect can be replicated in the prokaryotic cell.
- the cell is a recA-bacterium.
- the cell is Escherichia coli.
- the prokaryotic cell is a competent cell.
- the prokaryotic cell is an engineered cell.
- the prokaryotic cell is an engineered prokaryotic cell.
- the cell is Escherichia coli
- the Escherichia coli is selected from: K-12 and its derivative strains, and B strain and its derivative strains.
- the Escherichia coli is selected from: MG1655, DH5 or DH5 ⁇ , DH10B, BL21, DB3.1, HB101, JM109, JM110, MC1061, MG1655, Pir1, Stbl2, Stbl3, Top10, XL1Blue, XL10Gold, BLR, HMS174, Tuner, Rostetta2, Lemo21, T7Express, Origami2.
- the third aspect of the present application provides a Poly(A) tail.
- the Poly(A) tail :
- the further modification comprises replacing one or more ribonucleotides in the Poly (A) tail obtained by (1) or (2) with one or more deoxyribonucleotides.
- the one or more ribonucleotides are replaced with deoxyribonucleotides corresponding to the one or more ribonucleotides, for example, one or more ribonucleotides A in the Poly (A) tail are replaced with deoxyribonucleotides A, one or more ribonucleotides U are replaced with deoxyribonucleotides T, one or more ribonucleotides C are replaced with deoxyribonucleotides C, one or more ribonucleotides G are replaced with deoxyribonucleotides G, or one or more ribonucleotides G are replaced with ribonucleotides I (inosine) or deoxyribonucleotides I.
- the modification is a chemical modification. In some embodiments, the modification is a base editing. In some embodiments, the modification is a deamination treatment of one or more ribonucleotides in the Poly (A) tail obtained by (1) or (2).
- the Poly(A) tail comprises a polynucleotide sequence selected from any one of SEQ ID NOs: 1-10. In some embodiments, the polynucleotide sequence of the Poly(A) tail is as shown in any one of SEQ ID NOs: 1-10.
- the Poly (A) tail is used to make the RNA molecule more stable, wherein the Poly (A) tail is located at the 3' end of the RNA, and the more stable is more stable outside the cell, inside the cell, or in the animal body relative to the RNA molecule containing another Poly (A) tail.
- the Poly (A) tail is used to reduce the stability of the RNA molecule, wherein the Poly (A) tail is located at the 3' end of the RNA, and the reduction in stability is relative to the RNA molecule containing another Poly (A) tail, and the stability is reduced outside the cell, inside the cell, or in the animal body.
- the Poly (A) tail is used to increase the expression of the RNA molecule in the same time, and the increased expression is relative to the RNA molecule containing another Poly (A) tail, and the expression is increased outside the cell, inside the cell, or in the animal body.
- the Poly (A) tail is used to reduce the expression of an RNA molecule in the same period of time, and the increased expression is relative to the RNA molecule containing another Poly (A) tail, and the expression is reduced outside the cell, inside the cell, or in the animal body.
- the Poly (A) tail is used to extend the expression time of an RNA molecule, and the extended expression time is relative to the RNA molecule containing another Poly (A) tail, and the expression time is extended outside the cell, inside the cell, or in the animal body.
- the Poly (A) tail is used to shorten the expression time of an RNA molecule, and the extended expression time is relative to the RNA molecule containing another Poly (A) tail, and the expression time is shortened outside the cell, inside the cell, or in the animal body.
- the Poly (A) tail is used to extend the half-life of an RNA molecule, and the half-life extension is relative to the half-life extension of an RNA molecule containing another Poly (A) tail, outside the cell, inside the cell, or in the animal body.
- the Poly(A) tail is used to shorten the half-life of an RNA molecule, wherein the shortened half-life is shortened outside a cell, inside a cell, or in an animal body relative to an RNA molecule comprising another Poly(A) tail.
- the RNA molecule is an mRNA molecule.
- the Poly(A) tail and the other Poly(A) tail are two different Poly(A) tails belonging to the Poly(A) tail of the third aspect of the present application.
- the Poly(A) tail is a Poly(A) tail belonging to the third aspect of the present application, and the other Poly(A) tail is a Poly(A) tail other than the Poly(A) tail of the third aspect of the present application.
- the intracellular is a host cell, and the host cell is a eukaryotic cell.
- the host cell is Mammalian cells.
- the host cell is a human cell.
- the present application also provides a DNA fragment or a hybrid molecule fragment of DNA and RNA encoding the poly (A) tail of the third aspect of the present application, and the use of the DNA fragment or the hybrid molecule fragment of DNA and RNA to make the replication of the DNA molecule encoding RNA or the hybrid molecule of DNA and RNA in the host cell more conservative, in which the DNA fragment or the hybrid molecule fragment of DNA and RNA encoding the poly (A) tail of the third aspect of the present application is located on the 3' end of the RNA coding sequence in the DNA molecule or the hybrid molecule of DNA and RNA.
- the host cell is a prokaryotic cell. In some embodiments, the host cell is a recA-bacterium.
- the host cell is Escherichia coli.
- the Escherichia coli is selected from: K-12 and its derivative strains, and B strain and its derivative strains.
- the Escherichia coli is selected from: MG1655, DH5 or DH5 ⁇ , DH10B, BL21, DB3.1, HB101, JM109, JM110, MC1061, MG1655, Pir1, Stbl2, Stbl3, Top10, XL1Blue, XL10Gold, BLR, HMS174, Tuner, Rostetta2, Lemo21, T7Express, Origami2.
- the fourth aspect of the present application also provides an RNA molecule, wherein the RNA molecule comprises the Poly (A) tail of the third aspect.
- the RNA molecule is an mRNA molecule.
- the RNA molecule :
- the further modification comprises replacing one or more ribonucleotides in the RNA molecule obtained by (1) or (2) with one or more deoxyribonucleotides.
- the one or more ribonucleotides are replaced with deoxyribonucleotides corresponding to the one or more ribonucleotides, for example, one or more ribonucleotide A of the RNA molecule is replaced with deoxyribonucleotide A, one or more ribonucleotide U is replaced with deoxyribonucleotide T, one or more ribonucleotide C is replaced with deoxyribonucleotide C, one or more ribonucleotide G is replaced with deoxyribonucleotide G, or one or more ribonucleotide G is replaced with ribonucleotide I (inosine) or deoxyribonucleotide I.
- the modification is a chemical modification. In some embodiments, the modification is a base editing. In some embodiments, the modification is to one or more of the RNA molecules obtained by (1) or (2). In some embodiments, the further modification is a post-transcriptional modification. In some embodiments, the further modification comprises a capping process. In some embodiments, the further modification comprises splicing. In some embodiments, the further modification comprises splicing and capping.
- the RNA molecule includes coding RNA or non-coding RNA (ncRNA). In some embodiments, the RNA molecule is Pre-mRNA. In some embodiments, the RNA is mature mRNA. In some embodiments, the RNA molecule is long noncoding RNA (lncRNA). In some embodiments, the RNA molecule further comprises a 5'-cap structure. In some embodiments, the polynucleotide sequence of the RNA molecule is as shown in any one of SEQ ID NO: 22-25. In some embodiments, the RNA molecule comprises a polynucleotide sequence as shown in any one of SEQ ID NO: 22-25.
- the present application also provides a hybrid molecule of DNA and RNA, which carries the same genetic information as the engineered DNA molecule of the first aspect, the same genetic information as the Poly (A) tail of the third aspect, or the same genetic information as the RNA molecule of the fourth aspect.
- the present application also provides a nucleic acid molecule library.
- the nucleic acid molecule library comprises the engineered DNA molecule of the aforementioned first aspect, the Poly (A) tail of the aforementioned third aspect, a DNA fragment encoding the Poly (A) tail of the third aspect of the present application or a hybrid molecule fragment of DNA and RNA, or the RNA molecule of the aforementioned fourth aspect.
- the present application also provides a method for regulating protein expression, comprising: introducing a plurality of nucleic acid molecules in the aforementioned nucleic acid molecule library into a target cell at different times and/or in different quantity ratios.
- the nucleic acid molecule is the engineered DNA molecule of the aforementioned first aspect.
- the nucleic acid molecule is the RNA molecule of the aforementioned fourth aspect.
- FIG. 1 shows the complexation of 10 poly(A)s in this application in E. coli DH5 ⁇ in Example 2. Control stability;
- FIG2 shows the base deletion statistics of 10 poly(A)s in the present application in Example 2 in Escherichia coli DH5 ⁇ ;
- Figure 3 shows the replication stability of poly(A) P1, P2, P3, P4 and poly(A) controls C1 and C2 in Example 3 in Escherichia coli DH5 ⁇ under a plasmid system including HPV antigen sequences;
- FIG4 shows the base deletion statistics of poly(A) P3, P4 and poly(A) controls C1 and C2 in Example 3 in E. coli DH5 ⁇ under the HPV antigen sequence plasmid system;
- FIG5 shows the comparison of the replication stability of poly(A) P1, P2, P3 and poly(A) control C1 in Example 3 under two temperature conditions of 30° C. and 37° C. in E. coli DH5 ⁇ in a plasmid system including an HPV antigen sequence;
- FIG. 6 shows an example of a map of a universal vector plasmid DNA in the Examples.
- FIG7 shows the animal imaging results of the expression levels of luciferase in mice with poly(A) P3, P4, P5, P8, P9 and control C2 in Example 4;
- Figure 8 shows the quantitative results of fluorescence intensity after animal imaging of the expression levels of luciferase carrying poly(A) P3, P4, P5, P8, P9 and control C2 in mice in Example 4 (ns, no significant difference; ⁇ , significant difference, p ⁇ 0.01).
- the present application first provides a method for stably amplifying a poly (A) tail transcription template DNA in vitro to reduce the mutation frequency of the poly (A) tail transcription template sequence when the DNA is replicated in large quantities in cells.
- a large number of RNAs containing poly (A) tails with determined sequences are obtained based on the DNA.
- RNAs, such as mRNAs, with poly (A) tails having certain specific functions that have been engineered and designed can be mass-produced by in vitro fermentation.
- the present application also provides uses of the aforementioned Poly(A) tail, RNA, DNA, and library.
- Element a is composed of a plurality of consecutive adenine (A) nucleotides, and the length of element a is ⁇ 20nt; element b is composed of a plurality of consecutive A nucleotides, and the length of element b is 3nt ⁇ b ⁇ 20nt; element c is composed of a non-A nucleotide, and the nucleotide is selected from T, C, and G nucleotides; element d is composed of any two or more consecutive nucleotides, and the nucleotides are selected from A, T, C, and G nucleotides, wherein the nucleotides at the 5' and 3' ends of element d are not A nucleotides, and element d does not contain more than 3 consecutive A nucleotides, and the
- the two or more elements when describing the positional relationship of two or more elements as "non-adjacent", it means that the two or more elements are not adjacent to each other. In other words, the two or more elements contain at least one or more other nucleotides or bases other than the nucleotides of the two elements.
- nucleotides in all polynucleotide sequences are numbered from the 5' end to the 3' end, that is, the nucleotide at the 5' end is the first nucleotide, and the nucleotide at the 3' end is the last nucleotide.
- “5'end” and “5'end” can be used interchangeably;
- “3'end” and “3'end” can be used interchangeably.
- “5'end” "5'end” and “3'end” focus on describing the relative position relationship between nucleotides, between nucleotide sequence segments, or between nucleotides and nucleotide sequence segments in the same nucleic acid sequence; "5'end” and “3'end” are used to describe the positions of the first and last nucleotides of a nucleic acid sequence or a segment of a nucleic acid sequence, respectively. "5' end side” is used to describe the relative position relationship between two sequences that do not overlap with each other in the same polynucleotide sequence.
- RNA molecule comprises the poly (A) tail, then relative to the coding DNA of an RNA molecule that does not comprise the poly (A) tail (e.g., an RNA molecule comprising some other poly (A) tail), the probability that the daughter DNA molecule has 100% sequence identity with the parent DNA molecule is higher; or the average sequence identity between the multiple daughter DNA molecules obtained by replication of the parent DNA molecule and the parent DNA molecule is higher.
- the “regulation” means: increasing or decreasing the total amount of protein or functional RNA expressed by the RNA molecule within the same length of time; or allowing the RNA to express protein or functional RNA within a longer or shorter time frame, and the increase, decrease, or longer or shorter time frame is compared to another RNA molecule expressing the same protein or functional RNA.
- “regulating" protein expression it refers to regulating the expression of RNA molecules containing the protein coding sequence.
- the regulatory effect described here can be achieved by connecting the Poly (A) tail of the present application to the 3' end of an RNA molecule that does not contain a Poly (A) tail, or by The Poly(A) tail is replaced with the Poly(A) tail of the present application.
- the percentage of "identity”, such as 85%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 98.5%, 99%, 99.5% identity refers to the degree of similarity between amino acid sequences or between nucleotide sequences determined by sequence alignment, which is 85%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 98.5%, 99%, 99.5%.
- the proportion of the number of positions with the same base or amino acid residue to the total number of positions determined after two sequences have the same residue at as many positions as possible by introducing spaces, etc.
- the percentage of "identity" can be determined using software programs known in the art.
- substantially complementary refers to a degree of complementarity of at least about 70%, 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98%, 99% or 100% in a region of about 40, 50, 60, 70, 80, 100, 150, 200, 250 or more nucleotides, or refers to two nucleic acids that hybridize under stringent conditions.
- a single base or a single nucleotide according to the Watson-Crick base pairing principle, when A is paired with T or U, C is paired with G or I, it is called complementary or matching, and vice versa; and other base pairings are called non-complementary.
- the "complementary polynucleotide sequence" of a polynucleotide sequence refers to a polynucleotide sequence that is completely complementary to the polynucleotide sequence.
- a “conservative substitution variant" of a protein, polypeptide or amino acid sequence is one in which one or more amino acid residues are substituted with amino acids without changing the overall conformation and function of the protein or enzyme, including but not limited to replacing the amino acids in the amino acid sequence of the parent protein in the manner described by the aforementioned "conservative substitution". Therefore, the similarity of two proteins or amino acid sequences with similar functions may be different. For example, a similarity (identity) of 70% to 99% based on the MEGALIGN algorithm.
- Constant substitution variants also include polypeptides or enzymes with more than 60% amino acid identity as determined by the BLAST or FASTA algorithms, preferably more than 75%, preferably more than 85%, or even more than 90%.
- the invention relates to a protein or enzyme which is optimal for the production of a novel polypeptide and which has the same or substantially similar properties or functions as the native or parent protein or enzyme.
- DNA and RNA refer to single-stranded or double-stranded DNA or RNA molecules.
- DNA and DNA molecules refer to double-stranded DNA molecules consisting of A, C, G and/or T nucleotides
- RNA and RNA molecules refer to single-stranded RNA molecules consisting of A, C, G and/or U nucleotides.
- A, C, G, T and U nucleotides refer to nucleotides containing adenine, guanine, cytosine, thymine and uracil as their respective nitrogenous bases.
- DNA and RNA hybrid molecule is a molecule comprising a polynucleotide sequence consisting of deoxyribonucleotides and ribonucleotides.
- the DNA and RNA hybrid molecule can be obtained by:
- hybrid molecule of DNA and RNA is not limited to the above method, and the hybrid molecule of DNA and RNA obtained by any method belongs to the category of "hybrid molecule of DNA and RNA" defined in this application.
- nucleic acid molecules are described as having "the same genetic information"
- the two nucleic acid molecules are complementary, or contain exactly the same base sequence, or one or more thymines in the base sequence of one of the nucleic acid molecules are converted to uracil to obtain a nucleic acid molecule with exactly the same base sequence as another nucleic acid molecule. Therefore, any two of DNA, RNA, and hybrid molecules of DNA and RNA can have the same genetic information.
- base sequence refers to the order of arrangement of bases in a polynucleotide molecule.
- RNA such as mRNA
- U uracil
- Poly A tail or "Poly (A) sequence” refers to an uninterrupted or uninterrupted sequence of adenylic acid residues that is typically located at the 3'-end of an RNA molecule.
- the Poly-A sequence is attached to the 3' end of the 3'-UTR.
- An uninterrupted poly-A tail is characterized by continuous adenylic acid residues.
- the Poly-A tail can be of any length.
- the Poly-A tail comprises, or consists of, at least 20, at least 30, at least 40, at least 80 or at least 100 and at most 500, at most 400, at most 300, at most 200 or at most 150 adenylic acid (A), particularly about 120 A.
- the vast majority of nucleotides in the PolyA tail are adenosine, wherein the vast majority refers to at least 75%, at least 80%, at least 85%, at least 90% of the nucleotides, etc., but the remaining nucleotides are allowed to be nucleotides other than A (non-A nucleotides), such as U (uridylic acid), G (guanylic acid) or C (cytidylic acid).
- the in vitro preparation process of the RNA is a prokaryotic fermentation process, that is, the coding nucleic acid of the RNA molecule containing the Poly (A) tail is introduced into a prokaryotic cell, and the prokaryotic cell is amplified to achieve the purpose of amplifying the coding nucleic acid, and then the amplified coding nucleic acid is transcribed into the RNA.
- the in vitro preparation process of the RNA is to connect the RNA fragment containing the protein coding sequence with the Poly (A) tail by homologous recombination, enzyme ligation, or other non-homologous recombination, and the Poly (A) tail is prepared by a prokaryotic fermentation process, and the coding nucleic acid containing the Poly (A) tail is introduced into a prokaryotic cell during the prokaryotic fermentation process, and the prokaryotic cell is amplified to achieve the purpose of amplifying the coding nucleic acid, and then the amplified coding nucleic acid is transcribed into RNA containing the Poly (A) tail.
- the aforementioned coding nucleic acid is linear.
- the aforementioned coding nucleic acid is circular. In some embodiments, the aforementioned coding nucleic acid is a plasmid. In some embodiments, the aforementioned coding nucleic acid is single-stranded or double-stranded. In some embodiments, the aforementioned coding nucleic acid is chemically modified before being introduced into a prokaryotic cell. In some embodiments, the aforementioned encoding nucleic acid before being introduced into the prokaryotic cell is chemically synthesized. In some embodiments, the encoding nucleic acid is inserted into the nucleoid/nucleoid genomic DNA of the prokaryotic cell. In some embodiments, the encoding nucleic acid is free in the cytoplasm of the prokaryotic cell or outside the nucleoid/nucleoid. In some embodiments, the prokaryotic cell is Escherichia coli.
- the present application provides a series of Poly(A) tails, which are highly conserved during the in vitro preparation of RNA.
- the Poly(A) tails contain one or more non-A nucleotides at one or more positions.
- the Poly(A) tail encoding sequence comprises:
- Poly(A) tail coding sequence does not contain elements b adjacent to each other, does not contain elements c adjacent to each other, and does not contain elements d adjacent to each other.
- the Poly (A) tail coding sequence further comprises a unique element e, wherein the element e consists of one or two consecutive A's and is located at the 3' end of the Poly (A) tail coding sequence and is adjacent to element d or element c.
- the Poly(A) tail here can be a segment of RNA or a hybrid molecule of DNA and RNA.
- the present application also provides a Poly(A) tail that can regulate protein expression levels.
- the present application also provides a Poly(A) tail that can regulate protein expression levels while being highly conservative during in vitro preparation.
- the Poly(A) tail that can regulate protein expression levels; or the structure of the Poly(A) tail that can regulate protein expression levels while being highly conservative during in vitro preparation is selected from:
- the poly (A) tail of the regulatable protein expression level; or The structure of the Poly(A) tail, which is highly conserved during the in vitro preparation process and can regulate protein expression levels, is: 60A-element d-19A-G-19A-G-17A or 19A-G-19A-G-19A-element d-60A.
- yA represents the number of consecutive A's in element a or element b, where y is a natural number, for example, 19A means that it contains 19 consecutive A's; 60A means that it contains 60 consecutive A's.
- the Poly(A) tail that can regulate protein expression levels; or the polynucleotide coding sequence of the Poly(A) tail that can regulate protein expression levels that is highly conserved during the in vitro preparation process is selected from a polynucleotide sequence as shown in any one of SEQ ID NOs: 1-10.
- the present application also provides the use of the above-mentioned Poly (A) tail for regulating protein expression, in which the Poly (A) tail is located at the 3' end of the mRNA, such as the 3' end of the 3' UTR.
- the method for regulating protein expression described below is used.
- the present application also provides an engineered DNA molecule that can replicate in a cell, which comprises the coding sequence of the aforementioned Poly A tail or its complementary sequence.
- the engineered DNA molecule should also contain structural elements necessary for the DNA molecule to replicate in a cell, or to replicate efficiently.
- the structural elements necessary for the engineered DNA molecule to replicate or replicate efficiently in a cell are known in the art, including, for example, an origin of replication (ORI).
- the engineered DNA molecule further comprises a marker gene or a fragment thereof and/or a reporter gene or a fragment thereof, and a unique restriction endonuclease site that allows the insertion of DNA elements, preferably a restriction endonuclease site in the form of a multiple cloning site (MCS).
- the marker gene is conducive to identifying cells containing a plasmid containing the marker gene, and can be selected from, for example, an antibiotic resistance gene.
- Each restriction endonuclease site in the MCS can be specifically recognized by different restriction endonucleases.
- the DNA molecule is a DNA plasmid.
- DNA plasmid refers to a plasmid composed of a double-stranded DNA molecule.
- the "plasmid” is a circular DNA molecule.
- the "plasmid” can also encompass linear DNA molecules.
- the term “plasmid” also encompasses molecules obtained by linearizing the circular plasmid, such as by cutting the circular plasmid with a restriction endonuclease, thereby converting the circular plasmid molecule into a linear molecule, and linear molecules that can replicate in prokaryotes.
- Plasmids can replicate, i.e., in cells independently of prokaryotes.
- the invention relates to amplifying the genomic genetic information stored in the nucleoid or nucleoid of a cell and can be used for cloning, i.e., for amplifying genetic information in bacterial cells.
- the DNA plasmid according to the present application is a medium copy or high copy plasmid, more preferably a high copy plasmid.
- high copy plasmids are such vectors: they are based on pUC, pTZ plasmids or any other plasmids (e.g., pMB1, pCoIE1) containing an ORI that supports high copy of the plasmid, etc.
- the DNA molecule is a DNA molecule or a fragment thereof constituting a nucleoid or a nucleoid of a prokaryotic organism, that is, the coding sequence or its complementary sequence comprising the aforementioned Poly (A) tail can be replicated along with the prokaryotic genome.
- the DNA molecule is further connected to a target gene fragment on the 5' end side of the Poly (A) tail coding sequence, and the target gene fragment and the Poly (A) tail coding sequence jointly encode RNA.
- the target gene fragment and the Poly (A) tail coding sequence jointly encode mRNA.
- the target gene fragment comprises a coding sequence of a protein, a polypeptide or a fragment thereof.
- the target gene fragment also comprises a coding sequence of an element that can be used to start or regulate the expression of the protein, polypeptide or fragment thereof after transcription, and the element includes but is not limited to 5'UTR, 3'UTR, etc.
- the target gene fragment comprises a coding sequence of at least one untranslated region (UTR). In some embodiments, the target gene fragment comprises at least a coding sequence of a 5'UTR and a coding sequence of a protein, polypeptide or a fragment thereof. In some embodiments, the target gene fragment comprises at least a coding sequence of a 5'UTR, a coding sequence of a protein, a polypeptide or a fragment thereof, and a coding sequence of a 3'UTR from 5' to 3'.
- UTR untranslated region
- the coding sequence of the protein, polypeptide or its fragment can be finally translated into one or more proteins, or one or more polypeptides, such as short peptides, oligopeptides, polypeptides, fusion proteins, proteins and fragments thereof, such as parts of known proteins, such as functional parts.
- the functional part can be, for example, a biologically active part of a protein or an antigenic part that can effectively produce antibodies, such as an antigenic epitope.
- the two ends of the coding sequence of the protein, polypeptide or its fragment respectively include a start codon (5' end) and a stop codon (3' end), which are the first three nucleotides and the last three nucleotides that can be translated of the mRNA molecule, respectively.
- the 5'UTR usually contains at least one ribosome binding site (RBS), such as the Shine-Dalgarno sequence in prokaryotes, or at least one translation start site, such as the Kozak sequence in eukaryotes.
- RBS ribosome binding site
- translation start site such as the Kozak sequence in eukaryotes.
- RBS promotes the efficient and accurate translation of mRNA molecules by recruiting ribosomes at the start of translation. Its activity can be optimized by changing the length and sequence of a given RBS or translation revelation site and the distance from the start codon.
- the 5'UTR includes an internal ribosome entry site or IRES.
- the 3'UTR may contain one or more regulatory sequences, such as binding sites for amino acid sequences that enhance the stability of the mRNA molecule, regulatory Binding sites for RNA molecules (such as miRNA molecules), and/or signal sequences involved in the intracellular trafficking of mRNA molecules.
- regulatory sequences such as binding sites for amino acid sequences that enhance the stability of the mRNA molecule, regulatory Binding sites for RNA molecules (such as miRNA molecules), and/or signal sequences involved in the intracellular trafficking of mRNA molecules.
- the target gene fragment further comprises one or more additional regulatory sequences, such as binding sites for amino acid sequences that enhance the stability of mRNA molecules, binding sites for amino acid sequences that enhance the translation of mRNA molecules, regulatory elements (such as riboswitches), binding sites for regulatory RNA molecules (such as miRNA molecules), and/or nucleotide sequences that have a positive impact on translation initiation.
- additional regulatory sequences such as binding sites for amino acid sequences that enhance the stability of mRNA molecules, binding sites for amino acid sequences that enhance the translation of mRNA molecules, regulatory elements (such as riboswitches), binding sites for regulatory RNA molecules (such as miRNA molecules), and/or nucleotide sequences that have a positive impact on translation initiation.
- regulatory elements such as riboswitches
- binding sites for regulatory RNA molecules such as miRNA molecules
- nucleotide sequences that have a positive impact on translation initiation within the 5'UTR, preferably there is no functional upstream open reading frame, an out-of
- the coding sequence of the protein, polypeptide or fragment thereof comprises codons that can be translated into an amino acid sequence. All the codons contained in the coding sequence may be naturally occurring codons encoding amino acids, or may be partially or entirely composed of artificially synthesized codons. In some embodiments, some or all of the codons are codon optimized. In some embodiments, some or all of the codons encode non-natural amino acids.
- the DNA molecule further comprises a structural element necessary for initiating or regulating transcription of the RNA on the 5' end side of the target gene fragment, and the structural element is known in the art.
- the structural element at least comprises a promoter. Promoters and their sequences are known in the art, including weak promoters, medium strength promoters, strong promoters, mini promoters or core promoters, etc. In some specific embodiments, the promoter is a strong promoter.
- the promoter can initiate transcription of the target gene fragment and/or Poly (A) tail in prokaryotes. In some embodiments, the promoter can initiate transcription of the target gene fragment and/or Poly (A) tail in eukaryotic cells.
- the "promoter” comprises at least one transcription recognition site and a subsequent transcription factor binding site.
- the recognition and binding sites can interact with an amino acid sequence that mediates or regulates transcription. Compared with the recognition site, the binding site is closer to the aforementioned target gene fragment.
- the binding site can be, for example, a Pribnow box in prokaryotes or a TATA box in eukaryotes.
- the transcription recognition site can be located about 35 bp upstream of the transcription start site
- the transcription factor binding site can be located about 10 bp upstream of the transcription start site.
- the promoter comprises at least one additional regulatory element, such as an AT-rich upstream element located about 40 and/or 60 nucleotides before the transcription start site, and/or an additional regulatory element that enhances promoter activity located between the recognition site and the binding site.
- the promoter is a strong promoter, i.e., the promoter comprises a sequence that promotes the transcription of the aforementioned RNA coding sequence. Strong promoters are known to those skilled in the art, such as OXB18, OXB19 and OXB20 promoters derived from the RecA promoter of Escherichia coli, or can be identified or synthesized by routine laboratory procedures.
- the promoter is a T7 promoter.
- the promoter also comprises other regulatory elements, such as an enhancer that is included in a DNA plasmid that promotes the transcription of the aforementioned RNA coding sequence.
- the present application also provides a library comprising the aforementioned engineered DNA molecules.
- the library comprises at least two DNA molecules having different Poly (A) tail encoding sequences.
- the present application also provides the use of the above-mentioned engineered DNA molecules in the stable amplification of the poly (A) tail coding sequence or the coding sequence of the RNA with the poly (A) tail.
- the method for amplifying the poly (A) tail coding sequence or the coding sequence of the RNA with the poly (A) tail is as shown in the method for stably amplifying the poly (A) tail transcription template DNA in vitro later.
- RNA comprising the aforementioned Poly(A) tail and a target gene fragment on the 5' end of the Poly(A) tail coding sequence.
- the RNA further comprises a 5'-cap structure.
- the RNA is mRNA.
- mRNA messenger RNA
- messenger RNA is any RNA, naturally occurring, non-naturally occurring or modified, that encodes at least one protein, polypeptide, or fragment thereof, and the mRNA has the ability to be translated to produce the encoded protein, polypeptide, or fragment thereof in vitro, in vivo, in situ, or ex vivo. Therefore, the mRNA can be a mature mRNA or a pre-mature mRNA, and the elements or structures that it must contain or selectively contain are known in the art. In some embodiments, the mRNA contains coding sequences for multiple necessary functional components to express, regulate, or enhance the expression level of the protein, polypeptide, or fragment thereof.
- the functional components include, but are not limited to, a 5' cap, a 5'UTR, a 3'UTR, and the like. Both 5'UTR and 3'UTR are typically transcribed from genomic DNA and are elements that pre-mature mRNA possesses. As a mature mRNA,
- 5'cap is located at the 5' end of the mRNA and contains methylated guanylate, which is linked to the 5' end of the mRNA via pyrophosphate to form a 5',5'-triphosphate linkage with its adjacent nucleotide.
- There are usually three types of 5' cap structures m7G5'ppp5'Np, m7G5'ppp5'NmpNp, m7G5'ppp5'NmpNmpNp), which are called type O, type I and type II, respectively.
- Type O means that the ribose of the terminal nucleotide is not methylated
- type I means that the ribose of one terminal nucleotide is methylated
- type II means that the ribose of both terminal nucleotides is methylated.
- the 5' cap can be used according to the manufacturer's protocol.
- RNA cap analogs simultaneously complete 5' capping of the polynucleotide during the in vitro transcription reaction to produce a 5'-guanosine cap structure: 3'-O-Me-m7G(5')ppp(5')G [ARCA cap], G(5')ppp(5')A, G(5')ppp(5')G, m7G(5')ppp(5')A, m7G(5')ppp(5')G (New England BioLabs, Ipswich, MA), or m7G(5')ppp(5')(2'-OMeA)pG (CleanCapAG).
- 5' capping of the modified RNA can be completed post-transcriptionally using a vaccinia virus capping enzyme to produce an O-type cap structure: m7G(5')ppp(5')G (New England BioLabs, Ipswich, MA).
- Type I cap structures can be generated using both vaccinia virus capping enzyme and 2'-O methyl-transferase to generate m7G(5')ppp(5')(2'-OMeA)pG.
- Type II cap structures can be generated from type I cap structures by 2'-O-methylation of the 5'-third to last nucleotide using 2'-O methyl-transferase.
- Type III cap structures can be generated from type II cap structures by 2'-O-methylation of the 5'-fourth to last nucleotide using 2'-O methyl-transferase.
- some or all of the uridine in the mRNA is chemically modified uridine.
- some or all of the uridine in the mRNA is pseudouridine or 1-methyl-pseudouridine.
- part or all of the uracil nucleotides of the mRNA are substituted with pseudouridine ( ⁇ ) nucleotides or N1-methylpseudouridine (m1 ⁇ ) nucleotides.
- the mRNA further comprises a stabilizing element.
- Stabilizing elements may include, for example, a histone stem-loop.
- the mRNA comprises a coding region, at least one histone stem-loop and optionally a poly(A) sequence or a polyadenylation signal.
- the poly(A) sequence or polyadenylation signal should generally enhance the expression level of the encoded protein.
- the mRNA comprises a combination of a poly(A) sequence or a polyadenylation signal and at least one histone stem-loop, although both have alternative mechanisms in nature, their synergistic effect can increase protein expression to a level that exceeds that observed for any single element.
- the synergistic effect of the combination of poly(A) and at least one histone stem-loop is independent of the order of the elements or the length of the poly(A) sequence.
- the histone stem-loop is generally derived from a histone gene and comprises two adjacent partial or complete reverse complementary sequences separated by a spacer (composed of a short sequence) that form a loop by intramolecular base pairing. Unpaired loop regions are generally unable to base pair with any of the stem-loop elements.
- the stability of the stem-loop structure generally depends on the length, the number of mismatches or bulges, and the base composition of the paired region.
- wobble base pairing non-Watson-Crick base pairing
- the at least one histone stem-loop sequence comprises 15 to 45 nucleotides in length.
- one or more AU-rich sequences of the mRNA may be removed. These sequences are sometimes referred to as AURES, which are destabilizing sequences found in the 3'UTR. AURES can be removed from the mRNA. Alternatively, AURES can be retained in the mRNA.
- the lipid nanoparticles generally include ionizable lipids, non-cationic lipids, sterols and PEG lipid components and target nucleic acids, such as the above-mentioned mRNA.
- Lipid nanoparticles of the present disclosure can be produced using components, compositions and methods as generally known in the art, see, for example, PCT/US2016/052352, PCT/US2016/068300, PCT/US2017/037551, PCT/US2015/027400, PCT/US2016/047406, PCT/US2016000129, PCT/US2016/014280, PCT/US2017/037551, PCT/US2017/037552, PCT/US2017/037551 ...
- the present application also provides a library comprising the aforementioned mRNA molecules.
- the library comprises at least two mRNA molecules with different Poly (A) tails.
- the present application also provides the use of the mRNA and the mRNA library.
- At least two or more mRNA molecules having a Poly (A) tail with different influence gradients on the expression of mRNA can be used to adjust the expression of the coding sequence of the aforementioned protein, polypeptide or fragment thereof.
- the application also provides cells comprising the aforementioned engineered DNA molecules, which can be stored and/or amplified in the cells.
- the cells are prokaryotic cells, and the DNA molecules can be replicated in the prokaryotic cells.
- the cells are prokaryotic cells, and the DNA molecules can be replicated and/or transcribed in the prokaryotic cells.
- the DNA molecules are eukaryotic cells, and the DNA molecules can be replicated in the cells.
- the DNA molecules can be transcribed and/or replicated in the DNA cells.
- the cell is a prokaryotic cell.
- the cell is a bacterium, an actinomycete, a cyanobacterium, a mycoplasma, a rickettsia, and a chlamydia.
- the cell is selected from: Bacillus subtilis, lactobacillus, acetic acid bacteria, corynebacterium, brevis, arthrobacter, pseudomonas, and micrococcus.
- the cell is a recA - bacterium.
- the cell is an Escherichia coli.
- the cell is an Escherichia coli, and the Escherichia coli is selected from: K-12 and its derivative strains, and B strains and their derivative strains.
- the E. coli is selected from: MG1655, DH5 or DH5 ⁇ , DH10B, BL21, DB3.1, HB101, JM109, JM110, MC1061, MG1655, Pir1, Stbl2, Stbl3, Top10, XL1Blue, XL10Gold, BLR, HMS174, Tuner, Rostetta2, Lemo21, T7Express, Origami2, etc.
- the cell is selected from Streptomyces, Micromonospora and Nocardia.
- the cell is a fungus.
- the cell is selected from yeast or mold.
- the present application provides a method for stably amplifying a poly(A) tail transcription template DNA in vitro to reduce the mutation frequency of the poly(A) tail transcription template sequence when the DNA is replicated in large quantities in cells.
- the method comprises: amplifying the cells containing the engineered DNA molecule.
- the engineered DNA molecules are introduced into the cells.
- the introduction may include chemical conversion or electro-conversion.
- the introduction is a natural endocytosis process of the engineered DNA molecules by the cells.
- the transduction comprises administering to the human body, and the administration is selected from: by intravenous, intraperitoneal, subcutaneous, intracranial, intrathecal, intraarterial (e.g., via the carotid artery), intramuscular and intratumoral injection or perfusion.
- the present application also provides a method for regulating protein expression, comprising:
- Two or more of the aforementioned RNA molecules are added at different times and/or in different quantitative ratios. into target cells.
- the two or more aforementioned engineered DNA molecules and the two or more aforementioned RNA molecules have different Poly(A) tails, and the Poly(A) tails have different influence gradients on the expression amount of RNA.
- the present application also provides a method for regulating protein expression, comprising introducing the aforementioned engineered DNA molecule or the aforementioned RNA molecule into a target cell.
- the coding sequence of the Poly(A) tail encoded by the DNA and the Poly(A) tail contained in the RNA comprises a structure selected from the following:
- the coding sequence of the Poly(A) tail encoded by the DNA and the Poly(A) tail contained in the RNA comprises a structure selected from the following:
- the coding sequence of the Poly(A) tail encoded by the DNA and the Poly(A) tail contained in the RNA comprises the structure: 60A-element d-19A-G-19A-G-17A or 19A-G-19A-G-19A-element d-60A.
- yA represents the number of consecutive A's in element a or element b, where y is a natural number, for example, 19A means that it contains 19 consecutive A's; 60A means that it contains 60 consecutive A's.
- the coding sequence of the Poly(A) tail encoded by the DNA and the Poly(A) tail contained in the RNA comprises a polynucleotide sequence selected from the following, or consists of any one polynucleotide sequence selected from the following: SEQ ID NO: 1-10.
- the Poly(A) tail and the DNA sequence encoding the Poly(A) tail shown in Table 1 below were constructed using conventional genetic engineering methods.
- Example 2 Testing Poly(A) function using luciferase coding sequence as an example
- luciferase as the protein coding region, the stability of different poly(A) variants in E. coli and their effects on luciferase expression in cells were investigated.
- This universal vector uses the E. coli cloning vector pUC57 as the vector backbone, and the T7 promoter sequence (5’-TAATACGACTCACTATAAGG-3’), 5’UTR, luciferase protein, 3’UTR and polyadenylic acid string poly(dA:dT) are arranged in sequence between the Xba I restriction site and the EcoR I restriction site of its multiple cloning site.
- C1C2 A60-10nt spacer-A60 (control C1), A30-10nt spacer-A70 (control C2), A60-1nt spacer-A60 (control C3) or A60-6nt spacer-A60 (control C4)
- C1C2 is derived from the patent in the literature (U.S. Patent No. US 10717982B2)).
- the vector plasmid constructed in step 2) is confirmed to be correct by sequencing, it is transformed into E. coli DH5 ⁇ , and the transformed plate is grown at 30°C to complete the plasmid extraction and sequencing.
- the stability and base deletion of different poly (A) variants are analyzed and calculated according to the sequencing results; the replication stability is expressed by the percentage of clones without any base changes, and the higher the percentage, the higher the replication stability of the plasmid in E. coli.
- the poly(A) variants designed in this application are superior to or equivalent to the prior art in terms of replication stability in E. coli cells.
- the replication stability of poly(A) variants P3, P4, and P8 is the highest, and the replication stability of P3, P4, and P8 is equivalent to that of C2, with no statistically significant difference (p>0.05, ⁇ 2 test), and the replication stability of P3, P4, and P8 is better than that of the control C1, C3, and C4, and the difference is statistically significant (p ⁇ 0.05, ⁇ 2 test).
- the HPV vector plasmid containing P1, P2, P3 and P4 constructed in step 1) was confirmed to be correct by sequencing and then transformed into E. coli DH5 ⁇ .
- the transformed plates were grown at 30°C to complete plasmid extraction and sequencing. After sequencing, the stability and base deletion of different poly(A) variants were analyzed and calculated based on the sequencing results; the stability was expressed by the percentage of clones without any base changes, and the higher the percentage, the more stable it was.
- the different poly(A) variants of the present application still maintain high replication stability; among them, P3 is comparable to C2 in stability, with no statistically significant difference (p>0.05, ⁇ 2 test), and has the best cloning stability compared with other groups of new variants of the present application.
- the above results show that the poly(A) variants designed in the present application are universal in application in the embodiments for different protein coding regions.
- Example 1 and Example 2 the plates transformed with E. coli were cultured overnight in a 30°C biochemical incubator, and the clones obtained were sequenced to evaluate replication stability.
- the culture temperature of E. coli affects the DNA replication rate, it also affects the replication stability.
- the present application also compared the sequencing detection of the plates transformed with E. coli after being cultured in a 37°C biochemical incubator. The results show ( Figure 5) that the 37°C culture greatly increased the base deletion ratio of the control C1, from 53% when cultured at 30°C to 98%.
- the production of mRNA drugs must rely on large-scale fermentation to prepare a sufficient amount of template plasmids, and the stability of the plasmids during fermentation (in this application, the stability of the plasmids specifically refers to the stability of poly dA: dT) is crucial to the production of mRNA drugs with uniform quality.
- the stability of the plasmids specifically refers to the stability of poly dA: dT
- it is necessary to establish a strain library containing the target plasmid including strain libraries of different generations such as primary libraries and secondary libraries, so it is necessary to evaluate the plasmid stability in Escherichia coli of different generations.
- the present application detected the stability of P1 and P3 between different generations of the fermentation process in Example 3.
- each poly (A) variant selected 4 correct Escherichia coli clones and cultured them through fermentation.
- the results show that in the 3rd, 5th, 7th, and 9th generations of seed passage, the plasmids of the 4 clones of P1 and P3 remain stable without any base changes.
- poly(A) tail In eukaryotic cells, a certain length of poly(A) tail is essential for protecting the 3’ end of mRNA, maintaining mRNA stability and promoting protein expression. Under the influence of physiological or environmental factors in the body, poly(A) gradually becomes shorter, thereby triggering mRNA degradation.
- This application investigates the effects of different poly(A) variants on protein expression levels.
- the specific experimental process is as follows:
- linearized DNA is obtained by digestion with type II restriction endonuclease BspQ I.
- the 3’ end of the linearized DNA is different poly(A): control C2, P3, P4, P5, P8 or P9.
- the linearized DNA is used as a template for in vitro transcription.
- the 100 ⁇ l reaction system contains 1X reaction buffer, 5mM (final concentration) of ATP, CTP, N1M-UTP, GTP, CleanCap AG 4mM (final concentration), and 5 ⁇ l of in vitro transcriptase. After the reaction mixture is fully mixed, react at 37°C for 3h. The in vitro transcribed mRNA is collected by LiCl precipitation and finally dissolved in enzyme-free water.
- LNP is lipid nanoparticles.
- concentration and particle size of the mRNA encapsulated by LNP were measured using the Ribogreen RNA quantification kit (Invitrogen, R11490) and the Darwin ZetaSizer particle size analyzer.
- the LNP is quality controlled by measuring the particle size, encapsulation efficiency, PDI and other indicators.
- the quality control results show that the prepared LNP meets the standards of particle size range 50nm-150nm, PDI ⁇ 0.3, and encapsulation efficiency >90%, which can be used for subsequent experiments.
- the mRNA content in the LNP was determined by the ribogreen method, and then the LNP was diluted to an mRNA content of 100ng/ ⁇ L.
- mice were randomly divided into groups according to their body weight and administered after 2-3 days of adaptive feeding. Five mice were injected into each of C2, P3, P4, P5, P8, and P9. Each mouse was administered intramuscularly. 100 ⁇ l (10 ⁇ g mRNA), and the control group was injected with PBS into 5 mice. Animal imaging was performed 6h after injection, and the fluorescence value was calculated. The results showed ( Figure 7-8) that compared with the control C2, the expression activity of P3 was significantly increased by 1.8 times, with a statistically significant difference (p ⁇ 0.01, student's t-test), and the expression levels of P4, P5, P8, and P9 were comparable to those of C2, with no significant difference (p>0.05, student's t-test).
- sequences used in the above examples of the present application are shown in the following sequence table. It should be understood that the following sequences are only exemplary sequences of the present application's embodiments, rather than any limitation to the present application's embodiments.
- the nucleic acid sequences in the following sequence table may represent DNA sequences or RNA sequences, and when they represent RNA sequences, "T" therein represents uridine.
Landscapes
- Health & Medical Sciences (AREA)
- Life Sciences & Earth Sciences (AREA)
- Genetics & Genomics (AREA)
- Chemical & Material Sciences (AREA)
- Engineering & Computer Science (AREA)
- Organic Chemistry (AREA)
- Biomedical Technology (AREA)
- Zoology (AREA)
- Wood Science & Technology (AREA)
- Bioinformatics & Cheminformatics (AREA)
- Biotechnology (AREA)
- General Engineering & Computer Science (AREA)
- Molecular Biology (AREA)
- General Health & Medical Sciences (AREA)
- Biochemistry (AREA)
- Biophysics (AREA)
- Microbiology (AREA)
- Physics & Mathematics (AREA)
- Plant Pathology (AREA)
- Medicinal Chemistry (AREA)
- Virology (AREA)
- Gastroenterology & Hepatology (AREA)
- Proteomics, Peptides & Aminoacids (AREA)
- General Chemical & Material Sciences (AREA)
- Chemical Kinetics & Catalysis (AREA)
- Tropical Medicine & Parasitology (AREA)
- Cell Biology (AREA)
- Micro-Organisms Or Cultivation Processes Thereof (AREA)
- Preparation Of Compounds By Using Micro-Organisms (AREA)
Abstract
Description
Claims (51)
- 一种可在细胞中复制的工程化DNA分子,其包含多聚腺苷尾(Poly A尾)编码序列,所述Poly(A)尾编码序列包含唯一的元件a和至少一个元件b,以及至少一个元件c和/或至少一个元件d:所述元件a由多个连续的腺嘌呤(A)核苷酸组成,所述元件a的长度范围为≥20nt;所述元件b由多个连续的A核苷酸组成,所述元件b的长度范围为3nt≤b<20nt;所述元件c由一个非A的核苷酸组成,所述核苷酸选自T、C、G核苷酸;所述元件d由任意两个或更多个连续的核苷酸组成,所述核苷酸选自A、T、C、G核苷酸,其中元件d的5’及3’末端的核苷酸不为A核苷酸,并且其不包含3个以上连续的A核苷酸;所述元件d的长度范围为2nt≤d≤20nt;其中所述元件a和元件b不相邻、元件c和元件d不相邻,且所述Poly(A)尾编码序列不包含彼此相邻的元件b、不包含彼此相邻的元件c、且不包含彼此相邻的元件d。
- 根据权利要求1所述的DNA分子,所述Poly(A)尾编码序列的长度为101至200nt或101-150nt或120至150nt或130-140nt或120-135nt或123-125nt。
- 根据权利要求1或2所述的DNA分子,所述Poly(A)尾编码序列3’末端的为A核苷酸或非A核苷酸。
- 根据权利要求1-3中任一项所述的DNA分子,其中所述元件a≤80nt。
- 根据权利要求1-3中任一项所述的DNA分子,其中所述元件a为30至70nt、35至65nt、40至60nt、或45至55nt,优选60nt。
- 根据权利要求1-5中任一项所述的DNA分子,其中所述元件a的50%或以上的多核苷酸位于所述Poly(A)尾编码序列的5’部分或3’部分。
- 根据权利要求1-6中任一项所述的DNA分子,其中所述元件b为3nt至10nt、10至19nt、12至15nt、14nt至17nt或16至19nt,优选19nt。
- 根据权利要求1-7中任一项所述的DNA分子,其中所述元件b的个数为2-10个,优选2-5个,进一步优选3个。
- 根据权利要求1-8中任一项所述的DNA分子,其中所述元件c为G。
- 根据权利要求1-9中任一项所述的DNA分子,其中所述元件c的个数 为2至10个、3个至8个、或4至6个或2-5个,优选2个。
- 根据权利要求1-10中任一项所述的DNA分子,其中所述元件d包含回文序列。
- 根据权利要求1-11中任一项所述的DNA分子,其中所述元件d为3至18nt、5至16nt、4至10nt、或6至12nt,优选6nt。
- 根据权利要求1-12中任一项所述的DNA分子,其中所述元件d选自以下序列中的任一种或多种:GATATC(SEQ ID NO:15)、GTATAC(SEQ ID NO:16)、GAATCT(SEQ ID NO:17)、GCATATGACT(SEQ ID NO:18)及GATATCGTATAC(SEQ ID NO:19)。
- 根据权利要求1-13中任一项所述的DNA分子,其中所述元件d选自如下核苷酸序列中的任一种或多种:SEQ ID NO:15、SEQ ID NO:16、及SEQ ID NO:17。
- 根据权利要求1-14中任一项所述的DNA分子,其中所述元件d的核苷酸序列如SEQ ID NO:15所示。
- 根据权利要求1-15中任一项所述的DNA分子,其中所述元件d的个数为0-5,优选1-3个,进一步优选1个。
- 根据权利要求1-16中任一项所述的DNA分子,其中当元件c和元件d同时存在时,所述元件c和元件d的个数总和为2-15个,优选3-5个,进一步优选3个。
- 根据权利要求1-17中任一项所述的DNA分子,其中在Poly(A)尾编码序列的3’部分,优选所述Poly(A)尾编码序列靠近3’末端的1/2部分,包含一个或多个非A核苷酸。
- 根据权利要求1-18中任一项所述的DNA分子,所述Poly(A)尾编码序列的结构为:元件a-元件c-元件b-元件c-元件b-元件c-元件b-元件c-元件b;元件b-元件c-元件b-元件c-元件a-元件d-元件b-元件c-元件b-元件c-元件b;元件b-元件c-元件b-元件c-元件b-元件d-元件a-元件c;元件a-元件d-元件b-元件c-元件b-元件c-元件b;或元件b-元件c-元件b-元件c-元件b-元件d-元件a。
- 根据权利要求1-19中任一项所述的DNA分子,所述Poly(A)尾编码序列的结构为:元件a-元件d-元件b-元件c-元件b-元件c-元件b;所述元件a为60nt长度,元件b为16至19nt长度,元件d为6nt长度。
- 根据权利要求1-20中任一项所述的DNA分子,所述Poly(A)尾编码序列的结构为:元件b-元件c-元件b-元件c-元件b-元件d-元件a;所述元件a为60nt长度,元件b为16至19nt长度,元件d为6nt长度。
- 根据权利要求1-21中任一项所述的DNA分子,所述Poly(A)尾编码序列如SEQ ID NO:1至10中任一项所示。
- 根据权利要求1-22中任一项所述的DNA分子,所述Poly(A)尾编码序列如SEQ ID NO:3或SEQ ID NO:4所示。
- 根据权利要求1-23中任一项所述的DNA分子,其在Poly(A)尾编码序列的5’端一侧进一步连接目的基因片段,所述目的基因片段与所述Poly(A)尾编码序列共同编码RNA。
- 根据权利要求1-24中任一项所述的DNA分子,其进一步包含复制子。
- 根据权利要求1-25中任一项所述的DNA分子,其进一步包含抗性基因。
- 根据权利要求1-26中任一项所述的DNA分子,其进一步包含启动所述RNA转录的启动子。
- 根据权利要求1-27中任一项所述的DNA分子,所述目的基因片段包括5’UTR编码序列。
- 根据权利要求1-28中任一项所述的DNA分子,所述目的基因片段包括蛋白编码序列或非蛋白编码序列。
- 根据权利要求1-29中任一项所述的DNA分子,所述目的基因片段包括3’UTR编码序列。
- 根据权利要求1-30中任一项所述的DNA分子包含复制子、抗生素抗性基因、启动子、5’UTR编码序列、蛋白编码序列、及3’UTR编码序列。
- 根据权利要求1-31的任一项所述的DNA分子,其中所述蛋白编码序列编码HPV(人乳头瘤病毒)蛋白,优选地所述HPV蛋白源自16型和/或18型HPV。
- 根据权利要求1-32中任一项所述的DNA分子,其中所述蛋白编码序列编码HPV的E2、E6或E7蛋白,E6和E7蛋白多肽片段的融合蛋白或E2、E6 和E7蛋白多肽片段的融合蛋白,优选地所述HPV蛋白源自16型和/或18型HPV。
- 根据权利要求1-33中任一项所述的DNA分子,其中所述蛋白编码序列编码如SEQ NO:26所示的多肽。
- 根据权利要求1-34中任一项所述的DNA分子,其包含如SEQ ID NO:22-25中任一项所示的多核苷酸序列,或如SEQ ID NO:22-25中任一项所示的多核苷酸序列的同义突变体,或与SEQ ID NO:22-25中任一项所示的多核苷酸序列或其同义突变体具有85%以上序列同一性的多核苷酸序列。
- 根据权利要求1-35中任一项所述的DNA分子,所述DNA分子为DNA质粒。
- 包含根据权利要求1-36中任一项所述的DNA分子的细胞。
- 根据权利要求37的细胞为原核细胞。
- 根据权利要求37的细胞为大肠杆菌。
- 根据权利要求1-36中任一项所述的DNA分子编码的RNA分子。
- 根据权利要求40所述的RNA分子,所述RNA还包含5'‐帽结构和/或所述RNA的部分或全部尿苷为化学修饰的尿苷;优选的所述RNA的部分或全部尿苷为假尿苷或1‐甲基‐假尿苷。
- 根据权利要求1-36中任一项所述的Poly(A)尾的DNA编码序列。
- 根据权利要求1-36中任一项所述的Poly(A)尾的DNA编码序列所编码的Poly(A)尾序列。
- 将根据权利要求42的Poly(A)尾的DNA编码序列用于使编码RNA的DNA分子在宿主细胞中的复制更加保守的用途,其中所述Poly(A)尾位于所述RNA的3’末端。
- 根据权利要求44所述的用途,所述宿主细胞为原核细胞,优选大肠杆菌。
- 根据权利要求43所述的Poly(A)尾序列用于调控RNA分子在宿主细胞中表达的用途,其中所述Poly(A)尾位于所述RNA的3’末端。
- 根据权利要求46所述的用途,所述宿主细胞为真核细胞,优选哺乳动物细胞,进一步优选人细胞。
- 包含根据权利要求1-36中任一项所述的DNA分子的文库。
- 包含根据权利要求1-36中任一项所述的DNA分子所编码的RNA分子 的文库。
- 调控蛋白表达的方法,包括:将根据权利要求48的DNA文库中的多种DNA在不同时间,和/或以不同的数量比例导入目的细胞;或将权利要求49的RNA文库中的多种RNA在不同时间,和/或以不同的数量比例导入目的细胞。
- DNA与RNA的杂合分子,其携带与根据权利要求1-36中任一项所述的DNA分子相同的遗传信息,与根据权利要求42所述的Poly(A)尾相同的遗传信息,或与根据权利要求40所述的RNA分子相同的遗传信息。
Priority Applications (6)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| AU2024229819A AU2024229819A1 (en) | 2023-03-01 | 2024-02-29 | Engineered dna molecule for coding rna |
| KR1020257029385A KR20250161542A (ko) | 2023-03-01 | 2024-02-29 | Rna를 코딩하기 위한 조작된 dna 분자 |
| JP2025549603A JP2026507667A (ja) | 2023-03-01 | 2024-02-29 | Rnaをコードするための工学的に操作されたdna分子 |
| EP24763224.3A EP4674967A1 (en) | 2023-03-01 | 2024-02-29 | Engineered dna molecule for coding rna |
| US19/083,930 US20250215441A1 (en) | 2023-03-01 | 2025-03-19 | Engineered dna molecule for coding rna |
| MX2025010281A MX2025010281A (es) | 2023-03-01 | 2025-08-29 | Molécula de adn diseñada para codificar arn |
Applications Claiming Priority (4)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| CNPCT/CN2023/079037 | 2023-03-01 | ||
| CN2023079037 | 2023-03-01 | ||
| CN202410205083 | 2024-02-23 | ||
| CN202410205083.1 | 2024-02-23 |
Related Child Applications (1)
| Application Number | Title | Priority Date | Filing Date |
|---|---|---|---|
| US19/083,930 Continuation US20250215441A1 (en) | 2023-03-01 | 2025-03-19 | Engineered dna molecule for coding rna |
Publications (1)
| Publication Number | Publication Date |
|---|---|
| WO2024179544A1 true WO2024179544A1 (zh) | 2024-09-06 |
Family
ID=92533833
Family Applications (1)
| Application Number | Title | Priority Date | Filing Date |
|---|---|---|---|
| PCT/CN2024/079346 Ceased WO2024179544A1 (zh) | 2023-03-01 | 2024-02-29 | 编码rna的工程化dna分子 |
Country Status (10)
| Country | Link |
|---|---|
| US (1) | US20250215441A1 (zh) |
| EP (1) | EP4674967A1 (zh) |
| JP (1) | JP2026507667A (zh) |
| KR (1) | KR20250161542A (zh) |
| CN (1) | CN118581123A (zh) |
| AU (1) | AU2024229819A1 (zh) |
| CL (1) | CL2025002618A1 (zh) |
| MX (1) | MX2025010281A (zh) |
| TW (1) | TW202440933A (zh) |
| WO (1) | WO2024179544A1 (zh) |
Citations (8)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| US20170166905A1 (en) * | 2014-07-11 | 2017-06-15 | Biontech Rna Pharmaceuticals Gmbh | Stabilization of poly(a) sequence encoding dna sequences |
| CN107073070A (zh) * | 2014-08-15 | 2017-08-18 | 格纳西尼有限公司 | 治疗宫颈癌的方法 |
| CN111212908A (zh) * | 2017-08-15 | 2020-05-29 | 因特利亚治疗公司 | 编码信使核糖核酸(mRNA)的稳定核酸 |
| CN112805386A (zh) * | 2018-10-11 | 2021-05-14 | 埃泽瑞斯公司 | 含有编码具有节段多聚(A)尾的mRNA的序列的质粒 |
| CN113088519A (zh) * | 2019-12-23 | 2021-07-09 | 北京原基品德生物科技有限公司 | 一种分离的核酸分子及其应用 |
| CN115698303A (zh) * | 2020-03-17 | 2023-02-03 | Abion株式会社 | 体外转录物mrna和包含其的药物组合物 |
| WO2023022490A1 (ko) * | 2021-08-17 | 2023-02-23 | 에스케이바이오사이언스 주식회사 | 폴리 a 테일을 안정적으로 유지하는 방법 |
| WO2024041641A1 (zh) * | 2022-08-26 | 2024-02-29 | 深圳瑞吉生物科技有限公司 | 在宿主细胞中稳定存在的多段化PolyA多核苷酸及其应用 |
Family Cites Families (2)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| KR20240098596A (ko) * | 2022-12-21 | 2024-06-28 | 엘지디스플레이 주식회사 | 표시 장치 및 표시 패널 |
| AU2023414262B2 (en) * | 2022-12-29 | 2025-08-14 | Rinuagene Biotechnology Co., Ltd. | Polynucleotide molecule for preventing or treating hpv infection-related diseases |
-
2024
- 2024-02-29 WO PCT/CN2024/079346 patent/WO2024179544A1/zh not_active Ceased
- 2024-02-29 TW TW113107326A patent/TW202440933A/zh unknown
- 2024-02-29 KR KR1020257029385A patent/KR20250161542A/ko active Pending
- 2024-02-29 AU AU2024229819A patent/AU2024229819A1/en active Pending
- 2024-02-29 EP EP24763224.3A patent/EP4674967A1/en active Pending
- 2024-02-29 JP JP2025549603A patent/JP2026507667A/ja active Pending
- 2024-02-29 CN CN202410231642.6A patent/CN118581123A/zh active Pending
-
2025
- 2025-03-19 US US19/083,930 patent/US20250215441A1/en active Pending
- 2025-08-28 CL CL2025002618A patent/CL2025002618A1/es unknown
- 2025-08-29 MX MX2025010281A patent/MX2025010281A/es unknown
Patent Citations (9)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| US20170166905A1 (en) * | 2014-07-11 | 2017-06-15 | Biontech Rna Pharmaceuticals Gmbh | Stabilization of poly(a) sequence encoding dna sequences |
| US10717982B2 (en) | 2014-07-11 | 2020-07-21 | Biontech Rna Pharmaceuticals Gmbh | Stabilization of poly(A) sequence encoding DNA sequences |
| CN107073070A (zh) * | 2014-08-15 | 2017-08-18 | 格纳西尼有限公司 | 治疗宫颈癌的方法 |
| CN111212908A (zh) * | 2017-08-15 | 2020-05-29 | 因特利亚治疗公司 | 编码信使核糖核酸(mRNA)的稳定核酸 |
| CN112805386A (zh) * | 2018-10-11 | 2021-05-14 | 埃泽瑞斯公司 | 含有编码具有节段多聚(A)尾的mRNA的序列的质粒 |
| CN113088519A (zh) * | 2019-12-23 | 2021-07-09 | 北京原基品德生物科技有限公司 | 一种分离的核酸分子及其应用 |
| CN115698303A (zh) * | 2020-03-17 | 2023-02-03 | Abion株式会社 | 体外转录物mrna和包含其的药物组合物 |
| WO2023022490A1 (ko) * | 2021-08-17 | 2023-02-23 | 에스케이바이오사이언스 주식회사 | 폴리 a 테일을 안정적으로 유지하는 방법 |
| WO2024041641A1 (zh) * | 2022-08-26 | 2024-02-29 | 深圳瑞吉生物科技有限公司 | 在宿主细胞中稳定存在的多段化PolyA多核苷酸及其应用 |
Non-Patent Citations (1)
| Title |
|---|
| See also references of EP4674967A1 |
Also Published As
| Publication number | Publication date |
|---|---|
| US20250215441A1 (en) | 2025-07-03 |
| CL2025002618A1 (es) | 2025-12-19 |
| JP2026507667A (ja) | 2026-03-04 |
| AU2024229819A1 (en) | 2025-09-04 |
| TW202440933A (zh) | 2024-10-16 |
| KR20250161542A (ko) | 2025-11-17 |
| CN118581123A (zh) | 2024-09-03 |
| EP4674967A1 (en) | 2026-01-07 |
| MX2025010281A (es) | 2025-10-01 |
Similar Documents
| Publication | Publication Date | Title |
|---|---|---|
| Trepotec et al. | Segmented poly (A) tails significantly reduce recombination of plasmid DNA without affecting mRNA translation efficiency or half-life | |
| JP6625521B2 (ja) | 環状rnaの細胞内翻訳 | |
| EP4359521A2 (en) | Compositions and methods for improved protein translation from recombinant circular rnas | |
| CN117568337B (zh) | 一种基于人源tRNA的自环化RNA | |
| JP2025062016A (ja) | 核酸コンカテマー由来産物の発現 | |
| CN116004696A (zh) | 可与IRES组合的3ˊUTR加茎环结构基因及其应用、mRNA表达系统 | |
| JP2023505188A (ja) | 核酸組成物 | |
| EP4642914A1 (en) | Compositions and methods for generating circular rna | |
| CN117844806B (zh) | 一种5’-utr元件及其在提高蛋白表达量中的应用 | |
| US20260092293A1 (en) | Expression-adjustable engineered rna molecule | |
| WO2024179544A1 (zh) | 编码rna的工程化dna分子 | |
| WO2025241623A1 (zh) | 一种翻译调控元件、表达载体、药物组合物、疫苗组合物、构建方法、表达目的蛋白的方法及应用 | |
| CA3284145A1 (en) | Engineered dna molecule for coding rna | |
| HK40112587A (zh) | 编码rna的工程化dna分子 | |
| CN121294433A (zh) | 一种polyA尾部元件、构建体及其应用 | |
| HK40115559A (zh) | 表达可调控的工程化rna分子 | |
| CN121801903A (zh) | 一种自组装自复制rna系统及其应用 | |
| EP4208551A1 (en) | Rna-aptamer-sensors | |
| CN121801893A (zh) | 提高mRNA的PolyA尾遗传稳定性的方法 | |
| CN119823983A (zh) | 一种提高无帽无尾线性mRNA翻译效率的元件 |
Legal Events
| Date | Code | Title | Description |
|---|---|---|---|
| 121 | Ep: the epo has been informed by wipo that ep was designated in this application |
Ref document number: 24763224 Country of ref document: EP Kind code of ref document: A1 |
|
| WWE | Wipo information: entry into national phase |
Ref document number: AU2024229819 Country of ref document: AU |
|
| ENP | Entry into the national phase |
Ref document number: 2025549603 Country of ref document: JP Kind code of ref document: A |
|
| WWE | Wipo information: entry into national phase |
Ref document number: 2025549603 Country of ref document: JP |
|
| WWE | Wipo information: entry into national phase |
Ref document number: P2025-02715 Country of ref document: AE |
|
| WWE | Wipo information: entry into national phase |
Ref document number: MX/A/2025/010281 Country of ref document: MX |
|
| ENP | Entry into the national phase |
Ref document number: 1020257029385 Country of ref document: KR Free format text: ST27 STATUS EVENT CODE: A-0-1-A10-A15-NAP-PA0105 (AS PROVIDED BY THE NATIONAL OFFICE) |
|
| ENP | Entry into the national phase |
Ref document number: 2024229819 Country of ref document: AU Date of ref document: 20240229 Kind code of ref document: A |
|
| REG | Reference to national code |
Ref country code: BR Ref legal event code: B01A Ref document number: 112025017585 Country of ref document: BR |
|
| WWE | Wipo information: entry into national phase |
Ref document number: 202592395 Country of ref document: EA |
|
| WWE | Wipo information: entry into national phase |
Ref document number: 11202505708U Country of ref document: SG |
|
| WWP | Wipo information: published in national office |
Ref document number: 11202505708U Country of ref document: SG |
|
| WWE | Wipo information: entry into national phase |
Ref document number: 2024763224 Country of ref document: EP |
|
| WWP | Wipo information: published in national office |
Ref document number: MX/A/2025/010281 Country of ref document: MX |
|
| NENP | Non-entry into the national phase |
Ref country code: DE |
|
| ENP | Entry into the national phase |
Ref document number: 2024763224 Country of ref document: EP Effective date: 20251001 |
|
| WWP | Wipo information: published in national office |
Ref document number: 1020257029385 Country of ref document: KR |
|
| ENP | Entry into the national phase |
Ref document number: 2024763224 Country of ref document: EP Effective date: 20251001 |
|
| ENP | Entry into the national phase |
Ref document number: 2024763224 Country of ref document: EP Effective date: 20251001 |
|
| WWP | Wipo information: published in national office |
Ref document number: 2024763224 Country of ref document: EP |