WO2024125637A1 - 提高rna分子翻译效率和/或稳定性的utr及其应用 - Google Patents
提高rna分子翻译效率和/或稳定性的utr及其应用 Download PDFInfo
- Publication number
- WO2024125637A1 WO2024125637A1 PCT/CN2023/139184 CN2023139184W WO2024125637A1 WO 2024125637 A1 WO2024125637 A1 WO 2024125637A1 CN 2023139184 W CN2023139184 W CN 2023139184W WO 2024125637 A1 WO2024125637 A1 WO 2024125637A1
- Authority
- WO
- WIPO (PCT)
- Prior art keywords
- seq
- polynucleotide
- sequence
- variant
- nos
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Ceased
Links
Classifications
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N15/00—Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
- C12N15/09—Recombinant DNA-technology
- C12N15/63—Introduction of foreign genetic material using vectors; Vectors; Use of hosts therefor; Regulation of expression
- C12N15/67—General methods for enhancing the expression
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N15/00—Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
- C12N15/09—Recombinant DNA-technology
- C12N15/11—DNA or RNA fragments; Modified forms thereof; Non-coding nucleic acids having a biological activity
- C12N15/113—Non-coding nucleic acids modulating the expression of genes, e.g. antisense oligonucleotides; Antisense DNA or RNA; Triplex- forming oligonucleotides; Catalytic nucleic acids, e.g. ribozymes; Nucleic acids used in co-suppression or gene silencing
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N15/00—Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
- C12N15/09—Recombinant DNA-technology
- C12N15/63—Introduction of foreign genetic material using vectors; Vectors; Use of hosts therefor; Regulation of expression
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N15/00—Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
- C12N15/09—Recombinant DNA-technology
- C12N15/63—Introduction of foreign genetic material using vectors; Vectors; Use of hosts therefor; Regulation of expression
- C12N15/79—Vectors or expression systems specially adapted for eukaryotic hosts
- C12N15/85—Vectors or expression systems specially adapted for eukaryotic hosts for animal cells
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N15/00—Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
- C12N15/09—Recombinant DNA-technology
- C12N15/87—Introduction of foreign genetic material using processes not otherwise provided for, e.g. co-transformation
- C12N15/88—Introduction of foreign genetic material using processes not otherwise provided for, e.g. co-transformation using microencapsulation, e.g. using amphiphile liposome vesicle
-
- A—HUMAN NECESSITIES
- A61—MEDICAL OR VETERINARY SCIENCE; HYGIENE
- A61K—PREPARATIONS FOR MEDICAL, DENTAL OR TOILETRY PURPOSES
- A61K31/00—Medicinal preparations containing organic active ingredients
- A61K31/70—Carbohydrates; Sugars; Derivatives thereof
- A61K31/7088—Compounds having three or more nucleosides or nucleotides
-
- A—HUMAN NECESSITIES
- A61—MEDICAL OR VETERINARY SCIENCE; HYGIENE
- A61K—PREPARATIONS FOR MEDICAL, DENTAL OR TOILETRY PURPOSES
- A61K48/00—Medicinal preparations containing genetic material which is inserted into cells of the living body to treat genetic diseases; Gene therapy
-
- A—HUMAN NECESSITIES
- A61—MEDICAL OR VETERINARY SCIENCE; HYGIENE
- A61P—SPECIFIC THERAPEUTIC ACTIVITY OF CHEMICAL COMPOUNDS OR MEDICINAL PREPARATIONS
- A61P35/00—Antineoplastic agents
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N2800/00—Nucleic acids vectors
- C12N2800/10—Plasmid DNA
- C12N2800/106—Plasmid DNA for vertebrates
- C12N2800/107—Plasmid DNA for vertebrates for mammalian
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N2830/00—Vector systems having a special element relevant for transcription
- C12N2830/50—Vector systems having a special element relevant for transcription regulating RNA stability, not being an intron, e.g. poly A signal
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N2830/00—Vector systems having a special element relevant for transcription
- C12N2830/80—Vector systems having a special element relevant for transcription from vertebrates
- C12N2830/85—Vector systems having a special element relevant for transcription from vertebrates mammalian
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N2840/00—Vectors comprising a special translation-regulating system
- C12N2840/10—Vectors comprising a special translation-regulating system regulates levels of translation
- C12N2840/105—Vectors comprising a special translation-regulating system regulates levels of translation enhancing translation
Definitions
- the present invention belongs to the field of biotechnology and relates to a 5' and/or 3' untranslated region (UTR) for improving the translation efficiency and/or stability of RNA molecules, a nucleic acid molecule comprising the UTR, a vector comprising the UTR or the nucleic acid molecule, a pharmaceutical composition comprising the UTR, the nucleic acid molecule or the vector, and the application of the UTR, the nucleic acid molecule, the vector and the pharmaceutical composition.
- UTR 5' and/or 3' untranslated region
- mRNA messenger RNA
- mRNA messenger RNA
- mRNA vaccines have been clinically used in the field of infectious disease prevention. Compared with recombinant protein subunit vaccines, inactivated vaccines or DNA vaccines, mRNA vaccines have several obvious advantages. First, since mRNA does not infect the body or integrate into genomic DNA, the safety of mRNA is greatly improved. Secondly, the use of modified bases can reduce the inherent immunogenicity of mRNA molecules and reduce their degradation by the body, further improving the safety and stability of mRNA vaccines. In addition, in vitro, the yield of mRNA synthesis is very high, so mRNA vaccines have the potential to be efficient, rapidly developed, low-cost manufactured, and easy to manage safely.
- the untranslated region (UTR) of mRNA controls the translation, degradation and localization of genes, including the stem-loop structure, upstream start codon, upstream open reading frame, internal ribosome entry site and various cis-acting elements that bind to RNA-binding proteins.
- UTRs play a crucial role in post-transcriptional regulation of gene expression, including regulation of mRNA nuclear export and translation efficiency, subcellular localization, and stability. UTRs may also play other roles, such as the specific incorporation of the modified amino acid selenocysteine at the UGA codon of mRNA encoding selenoproteins, a process mediated by a conserved stem-loop structure in the 3'-UTR.
- mRNA molecules directly affects the dosage and dosing interval of mRNA drugs (especially mRNA vaccines), which ultimately affects the bioavailability of mRNA drugs and determines the clinical application value of mRNA drugs.
- mRNA drugs especially mRNA vaccines
- UTRs untranslated regions
- the inventors have identified a variety of 5'-UTRs and/or 3'-UTRs that can improve the translation efficiency and/or stability of mRNA molecules.
- the above UTRs are universal core elements that give mRNA molecules containing the UTRs enhanced translation efficiency, significantly improve the expression of target genes and/or mRNA stability, and have broad application value in the industrialization of mRNA drugs.
- the present invention provides a recombinant RNA molecule comprising: (1) a first nucleotide sequence encoding a polypeptide and/or protein of interest; and (2) a second nucleotide sequence containing a 5'-untranslated region (5'-UTR); the 5'-UTR comprising at least one of the following polynucleotides: (a): derived from genes PPIA, HPX, FTCD, CDK5RAP3, HSPA8, HBA1, HBB, MYSM1, LENG1, TMSB4X, CASP4, IFNA1, The 5'-UTR of at least one of the genes PGLYRP1, UCHL1, CPAMD8, TTR, APOA2, GH1, DTYMK, APOC2 and CDK7; (b): a fragment of the 5'-UTR described in (a); (c): a variant of the 5'-UTR described in (a); and (d): a variant of the fragment described in (
- the gene is a human gene.
- the second nucleotide sequence comprises at least one of the following polynucleotides: (e): a 5'-UTR derived from at least one of the genes PPIA, HPX, FTCD, CDK5RAP3, HSPA8, HBA1 and HBB; (f): a fragment of the 5'-UTR described in (e); (g): a variant of the 5'-UTR described in (e); and (h): a variant of the fragment described in (f).
- the second nucleotide sequence comprises at least one of the following polynucleotides: an RNA encoded by a polynucleotide having a sequence as shown in at least one of SEQ ID NOs: 1 to 21, a fragment of an RNA encoded by a polynucleotide having a sequence as shown in at least one of SEQ ID NOs: 1 to 21, a variant of an RNA encoded by a polynucleotide having a sequence as shown in at least one of SEQ ID NOs: 1 to 21, and a variant of a fragment of an RNA encoded by a polynucleotide having a sequence as shown in at least one of SEQ ID NOs: 1 to 21; preferably, the sequence as shown in SEQ ID NO: 1 Variants of RNA encoded by the polynucleotides as shown in at least one of SEQ ID NOs: 1 to 21, fragments of RNA encoded by the polynucleotides as shown in at least one of
- the second nucleotide sequence comprises at least one of: 5'-UTRs of at least two of the genes described in (a), fragments of 5'-UTRs of at least two of the genes described in (a), variants of 5'-UTRs of at least two of the genes described in (a), and variants of fragments of 5'-UTRs of at least two of the genes described in (a).
- the second nucleotide sequence comprises at least one of: at least two copies of the 5'-UTR in (a), at least two copies of a fragment of the 5'-UTR in (a), at least two copies of a variant of the 5'-UTR in (a), and at least two copies of a variant of a fragment of the 5'-UTR in (a).
- the recombinant RNA molecule further comprises at least one of a promoter, a 5'-cap structure, a 3'-UTR and a poly (A) tail.
- the recombinant RNA molecule further comprises at least one of a 5'-cap structure, a 3'-UTR and a poly (A) tail.
- the 5'-cap structure includes at least one of m7GpppG , m27,3' - OGpppG, m7Gppp (5')N1, and m7Gppp ( m2'-O )N1.
- the 3'-UTR comprises: i) a 3'-UTR derived from at least one of an albumin gene, an ⁇ -globin gene, a ⁇ -globin gene, a tyrosine hydroxylase gene, a lipoxygenase gene, and a collagen ⁇ gene; ii) a variant of the 3'-UTR in i); iii) at least one of a 3'-UTR derived from at least one of a gene MPND, FBXW10, FBXW12, PGLYRP1, HPX, CDK7, APOC2, PFN1, RBP4, FTCD, NAAA, ALB, GSDMD, FBXL8, ORM1, CASP4, CHMP2A, LENG1, MYCBPAP, APOC1, GAPDH, HSPA8, APOA2, UCHL1, TSG101, NAE1, NFKB2, and GH1, a fragment thereof, a variant thereof, and a variant of a fragment thereof
- the nucleotides constituting the poly(A) tail include at least 20, at least 40, at least 80, at least 100 or at least 120 A nucleotides; preferably, the nucleotides constituting the poly(A) tail include at least 20, at least 40, at least 80, at least 100 or at least 120 A nucleotides in succession. In some embodiments, the nucleotides constituting the poly(A) tail include one or more nucleotides other than A nucleotides; alternatively, the nucleotides constituting the poly(A) include two or more nucleotides other than A nucleotides in succession.
- the present invention also provides another recombinant RNA molecule, which comprises: (1) a first nucleotide sequence encoding a polypeptide and/or protein of interest; and (2) a second nucleotide sequence containing a 3'-untranslated region (3'-UTR); the 3'-UTR comprises at least one of the following polynucleotides: (a): derived from genes MPND, FBXW10, FBXW12, PGLYRP1, HPX, CDK7, APOC2, PFN1, RBP4, FTCD, NAAA, ALB, GSDMD, FB The 3'-UTR of at least one gene among XL8, ORM1, CASP4, CHMP2A, LENG1, MYCBPAP, APOC1, GAPDH, HSPA8, APOA2, UCHL1, TSG101, NAE1, and NFKB2; (b): a fragment of the 3'-UTR described in (a); (c): a variant of
- the gene is a human gene.
- the second nucleotide sequence comprises at least one of the following polynucleotides: (e): 3’-UTR derived from at least one of the genes MPND, FBXW10, FBXW12, and PGLYRP1; (f): a fragment of the 3’-UTR described in (e); (g): a variant of the 3’-UTR described in (e); and (h): a variant of the fragment described in (f).
- the second nucleotide sequence comprises at least one of the following polynucleotides: an RNA encoded by a polynucleotide having a sequence as shown in at least one of SEQ ID NO: 22 to 48, a fragment of an RNA encoded by a polynucleotide having a sequence as shown in at least one of SEQ ID NO: 22 to 48, a variant of an RNA encoded by a polynucleotide having a sequence as shown in at least one of SEQ ID NO: 22 to 48, and a variant of a fragment of an RNA encoded by a polynucleotide having a sequence as shown in at least one of SEQ ID NO: 22 to 48; preferably, the sequence is as shown in SEQ ID NO: 2 2 to 48, fragments of RNA encoded by a polynucleotide whose sequence is as shown in at least one of SEQ ID NOs: 22 to 48, and variants of fragments of RNA encoded by a polynu
- the second nucleotide sequence comprises at least one of: 3’-UTRs of at least two of the genes described in (a), fragments of 3’-UTRs of at least two of the genes described in (a), variants of 3’-UTRs of at least two of the genes described in (a), and variants of fragments of 3’-UTRs of at least two of the genes described in (a).
- the second nucleotide sequence comprises at least one of: at least two copies of the 3’-UTR in (a), at least two copies of a fragment of the 3’-UTR in (a), at least two copies of a variant of the 3’-UTR in (a), and at least two copies of a variant of a fragment of the 3’-UTR in (a).
- the recombinant RNA molecule further comprises at least one of a promoter, a 5'-cap structure, a 5'-UTR and a poly (A) tail.
- the recombinant RNA molecule further comprises at least one of a 5'-cap structure, a 3'-UTR and a poly (A) tail.
- the 5'-cap structure includes at least one of m7GpppG , m27,3' - OGpppG, m7Gppp (5')N1, or m7Gppp ( m2'-O )N1.
- the 5'-UTR comprises: i) a 5'-UTR derived from genes PPIA, HPX, FTCD, CDK5RAP3, HSPA8, HBA1, HBB, MYSM1, LENG1, TMSB4X, CASP4, IFNA1, PGLYRP1, At least one of the 5'-UTR of at least one gene among UCHL1, CPAMD8, TTR, APOA2, GH1, DTYMK, APOC2 and CDK7, fragments thereof, variants and variants of fragments thereof; preferably, RNA encoded by a polynucleotide having a sequence as shown in at least one of SEQ ID NOs: 1 to 21, a fragment of an RNA encoded by a polynucleotide having a sequence as shown in at least one of SEQ ID NOs: 1 to 21, a variant of an RNA encoded by a polynucleotide having a sequence as shown in at least one of SEQ ID NOs: 1 to 21,
- the nucleotides constituting the poly(A) tail include at least 20, at least 40, at least 80, at least 100 or at least 120 A nucleotides; preferably, the nucleotides constituting the poly(A) tail include at least 20, at least 40, at least 80, at least 100 or at least 120 A nucleotides in a row. In some embodiments, the nucleotides constituting the poly(A) tail include one or more nucleotides other than A nucleotides.
- the present invention provides a DNA molecule encoding the recombinant RNA molecule of the present invention.
- the present invention provides a vector comprising the recombinant RNA molecule or DNA molecule of the present invention.
- the present invention provides a host cell comprising the recombinant RNA molecule, DNA molecule or vector of the present invention.
- the present invention provides a lipid nanoparticle comprising the recombinant RNA molecule of the present invention.
- the present invention provides a pharmaceutical composition
- a pharmaceutical composition comprising the recombinant RNA molecule of the present invention, the DNA molecule of the present invention, the vector of the present invention, the host cell of the present invention or the lipid nanoparticle of the present invention, and a pharmaceutically acceptable carrier.
- the present invention provides a vector comprising a first nucleotide sequence encoding a 5'-UTR and/or a second nucleotide sequence encoding a 3'-UTR, wherein:
- the first nucleotide sequence comprises at least one of the following polynucleotides: (a): a polynucleotide encoding a 5'-UTR derived from at least one of the genes PPIA, HPX, FTCD, CDK5RAP3, HSPA8, HBA1, HBB, MYSM1, LENG1, TMSB4X, CASP4, IFNA1, PGLYRP1, UCHL1, CPAMD8, TTR, APOA2, GH1, DTYMK, APOC2 and CDK7; (b): a polynucleotide encoding a fragment of the 5'-UTR described in (a); (c): a polynucleotide encoding a variant of the 5'-UTR described in (a); and (d): a polynucleotide encoding a variant of the fragment described in (b);
- the second nucleotide sequence comprises at least one of the following polynucleotides: (e): a polynucleotide encoding a 3'-UTR derived from at least one of the genes MPND, FBXW10, FBXW12, PGLYRP1, HPX, CDK7, APOC2, PFN1, RBP4, FTCD, NAAA, ALB, GSDMD, FBXL8, ORM1, CASP4, CHMP2A, LENG1, MYCBPAP, APOC1, GAPDH, HSPA8, APOA2, UCHL1, TSG101, NAE1 and NFKB2; (f): a polynucleotide encoding a fragment of the 3'-UTR described in (e); (g): a polynucleotide encoding a variant of the 3'-UTR described in (e); and (h): a polynucleotide encoding a variant of the fragment described in (f).
- the gene is a human gene.
- the first nucleotide sequence comprises a 5'-UTR derived from at least one of the genes PPIA, HPX, FTCD, CDK5RAP3, HSPA8, HBA1, and HBB, or a variant thereof.
- the second nucleotide sequence comprises a 3'-UTR derived from at least one of the genes MPND, FBXW10, FBXW12, and PGLYRP1, or a variant thereof.
- the vector comprises the first nucleotide sequence and the second nucleotide sequence.
- the first nucleotide sequence comprises: i) a polynucleotide sequence as shown in at least one of SEQ ID NOs: 1 to 21, a fragment of a polynucleotide sequence as shown in at least one of SEQ ID NOs: 1 to 21, a variant of a polynucleotide sequence as shown in at least one of SEQ ID NOs: 1 to 21, and a variant of a fragment of a polynucleotide sequence as shown in at least one of SEQ ID NOs: 1 to 21; preferably, the variant of the polynucleotide sequence as shown in at least one of SEQ ID NOs: 1 to 21, the variant of the polynucleotide sequence as shown in at least one of SEQ ID NOs: 1 to 21, the variant of the polynucleotide sequence as shown in Fragments of a polynucleotide as shown in at least one of SEQ ID NOs: 1 to 21 and variants of a fragment of a polyn
- the second nucleotide sequence comprises: (1): a polynucleotide encoding a 3'-UTR derived from at least one of the albumin gene, ⁇ -globin gene, ⁇ -globin gene, tyrosine hydroxylase gene, lipoxygenase gene, and collagen ⁇ gene; (2): a polynucleotide encoding a variant of the 3'-UTR in (1); (3): at least one of a polynucleotide having a sequence as shown in at least one of SEQ ID NOs: 22 to 48, a fragment of a polynucleotide having a sequence as shown in at least one of SEQ ID NOs: 22 to 48, a variant of a polynucleotide having a sequence as shown in at least one of SEQ ID NOs: 22 to 48, and a variant of a fragment of a polynucleotide having a sequence as shown in at least one of SEQ ID NOs: 22 to 48
- the vector further comprises a polynucleotide encoding a poly(A) tail.
- the nucleotides constituting the poly(A) tail comprise at least 20, at least 40, at least 80, at least 100 or at least 120 A nucleotides; preferably, the nucleotides constituting the poly(A) tail comprise at least 20, at least 40, at least 80, at least 100 or at least 120 A nucleotides in a row.
- the nucleotides constituting the poly(A) tail comprise one or more nucleotides other than A nucleotides.
- the present invention provides a use of a recombinant RNA molecule, a DNA molecule, a vector, a host cell, a lipid nanoparticle or a pharmaceutical composition of the present invention in the preparation of a drug; preferably, the drug is used for gene therapy, gene vaccination or protein replacement therapy.
- the drug is a nucleic acid drug, wherein the nucleic acid includes at least one of the following: RNA, messenger RNA (mRNA), DNA, plasmid, ribosomal RNA (rRNA), single-stranded guide RNA (sgRNA) and Cas9 mRNA.
- RNA messenger RNA
- rRNA ribosomal RNA
- sgRNA single-stranded guide RNA
- Cas9 mRNA RNA, messenger RNA (mRNA), DNA, plasmid, ribosomal RNA (rRNA), single-stranded guide RNA (sgRNA) and Cas9 mRNA.
- the drug is used for the treatment and/or prevention of a disease; preferably, the disease is selected from the group consisting of: rare diseases, infectious diseases, cancer, genetic diseases, autoimmune diseases, diabetes, neurodegenerative diseases, cardiovascular diseases, renal vascular diseases, and metabolic diseases; preferably, the cancer includes one or more of lung cancer, gastric cancer, liver cancer, esophageal cancer, colon cancer, pancreatic cancer, brain cancer, lymphoma, blood cancer or prostate cancer; the genetic disease includes one or more of hemophilia, thalassemia, and Gaucher's disease.
- Figure 1A shows a flow chart for the construction of plasmid D
- Figure 1B shows a schematic diagram of the construction and transformation of plasmid D
- Fig. 2 shows the map of plasmid luciferase-pcDNA3
- Figure 3 shows a map of plasmid B
- Figure 4 shows a map of plasmid C
- FIG. 5 shows the spectrum of plasmid D
- FIG6 shows the expression of luciferase after HEK293 cells were transfected with mRNA containing different 5'-UTRs in Example 5;
- FIG7 shows the expression of luciferase after HEK293 cells were transfected with mRNA containing different 3'-UTRs in Example 6;
- FIG8 shows the expression of mRNAs containing the same 5′-UTR but different 3′-UTRs in mice
- FIG. 9 shows the expression of mRNAs containing the same 3′-UTR but different 5′-UTRs in mice.
- the expressions “comprises,” “comprising,” “containing,” and “having” are open ended, meaning the inclusion of the listed elements, steps, or components but not the exclusion of other unlisted elements, steps, or components.
- the expression “consisting of” excludes any element, step, or component not specified.
- the expression “consisting essentially of” means that the scope is limited to the specified elements, steps, or components, plus optional elements, steps, or components that do not significantly affect the basic and novel properties of the claimed subject matter. It should be understood that the expressions “consisting essentially of” and “consisting of” are encompassed within the meaning of the expression “comprising.”
- the numerical ranges described herein should be understood to include any and all subranges contained therein.
- the range “1 to 10” should be understood to include not only the explicitly stated values of 1 and 10, but also any single value (e.g., 2, 3, 4, 5, 6, 7, 8, and 9) and subranges (e.g., 1 to 2, 1.5 to 2.5, 1 to 3, 1.5 to 3.5, 2.5 to 4, 3 to 4.5, etc.) within the range of 1 to 10.
- This principle also applies to ranges with only one value as the minimum or maximum value.
- fragment or “fragment of a nucleic acid” refers to a portion of a nucleic acid. For example, a nucleic acid shortened at the 5' and/or 3' ends. A fragment of a nucleic acid comprises at least 50%, 60%, 70% or 80% from the nucleic acid. Preferably, a fragment of a nucleic acid comprises at least 70% or 80% from the nucleic acid. Preferably, at least 90%, 95%, 96%, 97%, 98% or 99% of the nucleotide residues. Generally, it can be a shorter portion of the full length of a nucleic acid.
- variant nucleic acids refers to a nucleic acid variant, wherein at least one nucleotide of the nucleic acid variant is different from a reference nucleic acid (or "parent").
- a variant nucleic acid includes single or multiple nucleotide deletions, additions, mutations and/or insertions, wherein: a deletion includes removing one or more nucleotides from a reference nucleic acid; an addition includes replacing one or more nucleotides (e.g., 1, 2, 3, 5, 10, 20, 30, 50 or more) with a reference nucleic acid.
- nucleic acid variant used herein includes naturally occurring variants and engineered variants. Therefore, the "nucleic acid variant” defined herein can be derived from, separated from, related to, based on or homologous to a reference nucleic acid sequence.
- Nucleic acid variants optionally have at least 5%, 10%, 20%, 30%, 40%, 50%, 60%, 70%, 80%, 85%, 86%, 87%, 88%, 89%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98% or 99% sequence identity with the corresponding naturally occurring (wild type) nucleic acid or its homologues, fragments or derivatives; preferably at least 70%, more preferably at least 80%, even more preferably at least 85%, even more preferably at least 90%, most preferably at least 95% or even 97%.
- nucleic acid molecules include degenerate nucleic acid sequences, wherein the degenerate nucleic acid sequences according to the present invention are nucleic acids that differ from the reference nucleic acid in the codon sequence due to the degeneracy of the genetic code.
- the nucleic acid variant is a variant of 5'-UTR, a variant of 3'-UTR or a variant of ployA.
- the mutation introduced into the nucleic acid variant can prevent the nucleic acid variant from being recognized by nucleases and being cleaved, avoid binding to microRNA, or avoid generating complex secondary structures such as hairpin structures or G-quadruplexes.
- the nucleic acid variant is a variant of the 3'-UTR derived from the ALB gene (GenBank accession number is NM_000477.7), and the variant mutates one "A” into a "C” based on the 3'-UTR of the ALB gene to avoid being recognized by nucleases and being cleaved.
- % identity refers to the percentage of identical nucleotides or amino acids in the optimal alignment between the sequences to be compared, and the differences between the two sequences can be distributed over a local region (segment) or over the entire length of the sequences to be compared.
- identity between the two sequences is determined after the optimal alignment of the segment or "comparison window".
- the optimal alignment can be performed manually or with the aid of algorithms known in the art. Algorithms known in the art include, but are not limited to, the local homology algorithm described by Smith and Waterman, 1981, Ads App. Math. 2, 482 and Neddleman and Wunsch, 1970, J. Mol. Biol.
- % identity or % homology can be obtained by determining the number of identical positions corresponding to the sequences to be compared, dividing this number by the number of positions compared (e.g., the number of positions in the reference sequence), and multiplying this result by 100 to obtain % homology.
- the degree of homology is given for a region of at least about 50%, at least about 55%, at least about 60%, at least about 65%, at least about 70%, at least about 75%, at least about 80%, at least about 85%, at least about 90%, at least about 95%, or about 100%. In some embodiments, the degree of homology is given for the entire length of the reference sequence.
- Alignment to determine sequence identity can be performed using tools known in the art, preferably using optimal sequence alignment, for example, using Align, using standard settings, preferably EMBOSS::needle, Matrix:Blosum62, Gap Open 10.0, Gap Extend 0.5.
- a fragment or variant of a specific nucleic acid or a nucleic acid having a specific degree of identity with a specific nucleic acid preferably has at least one functional property of the specific nucleic acid and is preferably functionally equivalent to the specific nucleic acid, e.g. a nucleic acid exhibiting properties that are identical or similar to those of the specific nucleic acid.
- nucleotide includes deoxyribonucleotide, deoxyribonucleotide, deoxyribonucleotide derivative, and ribonucleotide derivative.
- ribonucleotide is a constituent substance of ribonucleic acid (RNA), composed of one molecule of base, one molecule of pentose and one molecule of phosphoric acid, which refers to a nucleotide with a hydroxyl group at the 2' position of the ⁇ -D-ribofuranosyl group
- deoxyribonucleotide is a constituent substance of deoxyribonucleic acid (DNA), also composed of one molecule of base, one molecule of pentose and one molecule of phosphoric acid, which refers to a nucleotide in which the hydroxyl group at the 2' position of the ⁇ -D-ribofuranosyl group is replaced by hydrogen, and is the main chemical component of
- Nucleotide is usually referred to by a single letter representing the base: “A” or “A nucleotide” refers to adenine deoxyribonucleotide or adenine ribonucleotide containing adenine, “C” or “C nucleotide” refers to cytosine deoxyribonucleotide or cytosine ribonucleotide containing cytosine, “G” or “G nucleotide” refers to guanine deoxyribonucleotide or guanine ribonucleotide containing guanine, “U” or “U nucleotide” refers to uracil ribonucleotide containing uracil, and “T” or “T nucleotide” refers to thymine deoxyribonucleotide containing thymine.
- nucleic acid generally refers to a polymer containing deoxyribonucleotides (deoxyribonucleic acid, referred to as DNA) or a polymer containing ribonucleotides (ribonucleic acid, referred to as RNA) or any compound of a combination thereof.
- nucleic acids herein also include derivatives of nucleic acids.
- derivatives of nucleic acids includes chemical derivatization of nucleic acids on the bases, sugars or phosphates of nucleotides, as well as nucleic acids containing non-natural nucleotides and nucleotide analogs.
- nucleic acids can be in the form of single-stranded or double-stranded linear or covalently closed circular molecules.
- DNA-encoded RNA refers to the RNA corresponding to the DNA, that is, a polynucleotide after all the T nucleotides in the DNA are replaced by U nucleotides.
- the polynucleotide may comprise one segment or multiple segments (nucleic acid fragments) (e.g., 1, 2, 3, 4, 5, 6, 7, 8 segments).
- the polynucleotide may comprise a segment encoding a polypeptide of interest (e.g., a polypeptide and polypeptide antigen described herein).
- the polynucleotide may comprise a segment encoding a polypeptide of interest and a regulatory segment (including but not limited to a segment for transcriptional regulation and translational regulation).
- the regulatory segment comprises a polynucleotide corresponding to one or more of the following regulatory elements: a promoter, a 5' untranslated region (5'-UTR), a 3' untranslated region (3'-UTR), and a poly (A) tail.
- promoter refers to a polynucleotide located upstream of the 5' end of the coding region of a gene, which contains a conserved sequence required for specific binding of RNA polymerase and transcription initiation, can activate RNA polymerase, enable RNA polymerase to accurately bind to template DNA and have the specificity of transcription initiation. Promoters can be derived from viruses, bacteria, fungi, plants, insects and animals.
- promoters include bacteriophage T7 promoter, bacteriophage T3 promoter, SP6 promoter, lac operator-promoter, tac promoter, SV40 late promoter, SV40 early promoter, RSV-LTR promoter, CMV IE promoter, SV40 early promoter or SV 40 late promoter and CMV IE promoter.
- the term "5' untranslated region" or "5'-UTR” can be an RNA sequence in mRNA that is located upstream of the coding sequence and is not translated into protein. The 5'-UTR in a gene usually starts from the transcription start site and ends at the nucleotide upstream of the translation start codon of the coding sequence.
- the 5'-UTR may contain elements that control gene expression, such as ribosome binding sites, 5'-terminal oligopyrimidine tracts, and translation initiation signals such as Kozak sequences.
- mRNA can be post-transcriptionally modified by adding a 5' cap. Therefore, the 5'-UTR in mature mRNA can also refer to the RNA sequence between the 5' cap and the start codon.
- the term "3' untranslated region" or "3'-UTR" may be located downstream of the coding sequence in mRNA and is not translated into protein.
- the 3'-UTR in mRNA is located between the stop codon and the poly (A) sequence of the coding sequence, for example, starting from the nucleotides downstream of the stop codon and ending with the nucleotides upstream of the poly (A) sequence.
- 5’ or 3’-UTR derived from gene A refers to 5’ or 3’-UTR of mRNA from gene A.
- the 5’ or 3’-UTR derived from gene A may be the entire 5’ or 3’-UTR of mRNA of gene A, or may be a partial 5’ or 3’-UTR of mRNA of gene A.
- poly (A) acid As used herein, the terms "poly (A) acid”, “poly (A) sequence” and “poly (A) tail” are used interchangeably, and the naturally occurring poly (A) sequence is usually composed of adenine ribonucleotides.
- modified poly (A) sequence refers to a poly (A) sequence comprising nucleotides or nucleotide segments other than adenine ribonucleotides.
- the poly (A) sequence is usually located at the 3' end of the mRNA, such as the 3' end (downstream) of the 3'-UTR.
- the term "5'-cap structure” the 5'-cap structure is usually located at the 5' end of the mature mRNA. In some embodiments, In the embodiment, the 5'-cap structure is connected to the 5'-end of the mRNA through a 5'-5'-triphosphate bond.
- the 5'-cap structure is usually formed by a modified (e.g., methylated) ribonucleotide (especially a guanine nucleotide derivative).
- m7GpppN (cap 0 or "cap0”, is a cap structure formed by the 5' phosphate group of hnRNA reacting with the 5'-phosphate group of m7GTP under the action of guanylyl transferase to form a 5', 5'-phosphodiester bond), wherein N is the terminal 5' nucleotide of the nucleic acid carrying the 5'-cap structure.
- the 5'-cap structure includes but is not limited to cap 0, cap 1 (a cap structure formed by further methylating the first nucleotide sugar group 2'-OH of hnRNA on the basis of cap 0, or "cap1”), cap 2 (a cap structure formed by further methylating the second nucleotide sugar group 2'-OH of hnRNA on the basis of cap 1, or "cap2”), cap 4, cap 0 analogs, cap 1 analogs, cap 2 analogs, or cap 4 analogs.
- cap 1 a cap structure formed by further methylating the first nucleotide sugar group 2'-OH of hnRNA on the basis of cap 0, or "cap1”
- cap 2 a cap structure formed by further methylating the second nucleotide sugar group 2'-OH of hnRNA on the basis of cap 1, or "cap2”
- cap 4 cap 0 analogs, cap 1 analogs, cap 2 analogs, or cap 4 analogs.
- the term "expression” includes transcription and/or translation of a nucleotide sequence. Thus, expression may involve the production of transcripts and/or polypeptides.
- transcription refers to the process of transcribing the genetic code in a DNA sequence into RNA (transcript).
- in vitro transcription refers to the in vitro synthesis of RNA, particularly mRNA, in a cell-free system (e.g., in an appropriate cell extract) (see, e.g., Pardi N., Muramatsu H., Weissman D., Karikó K. (2013). In: Rabinovich P. (eds) Synthetic Messenger RNA and Cell Metabolism Modulation.
- a vector that can be used to produce a transcript is also referred to as a "transcription vector,” which contains regulatory sequences required for transcription.
- transcription encompasses "in vitro transcription.”
- polypeptide refers to a polymer comprising two or more amino acids covalently linked by peptide bonds.
- a “protein” may comprise one or more polypeptides, wherein the polypeptides interact with each other by covalent or non-covalent means.
- the term "host cell” refers to a cell for receiving, maintaining, replicating, expressing a polynucleotide or a vector.
- the term "host cell” includes prokaryotic cells (e.g., Escherichia coli) or eukaryotic cells (e.g., yeast cells and insect cells). For example, cells from humans, mice, hamsters, pigs, goats, primates.
- the cell can be derived from a variety of tissue types and includes primary cells and cell lines. Some specific examples include keratinocytes, peripheral blood leukocytes, bone marrow stem cells, and embryonic stem cells.
- the host cell is an antigen presenting cell, particularly a dendritic cell, a monocyte, or a macrophage.
- the nucleic acid can be present in a host cell in a single copy or in several copies.
- the host cell can be a cell expressing a polypeptide of the present invention therein.
- recombinant or “recombinant” means "produced by genetic engineering”.
- "recombinant material” such as recombinant RNA molecules, is non-naturally occurring.
- naturally occurring or “naturally occurring” as used herein refers to the fact that a material can be found in nature. For example, a peptide or nucleic acid that is present in an organism (including a virus) and can be isolated from a source in nature and has not been intentionally modified by man in an experiment is naturally occurring.
- the term "plasmid” generally refers to a circular DNA molecule, but the term can also encompass linearized DNA molecules. Specifically, the term “plasmid” also encompasses molecules obtained by, for example, digesting a circular plasmid with a restriction enzyme, thereby converting the circular plasmid molecule into a linear molecule and linearizing the circular plasmid. Plasmids can replicate, i.e., amplify the genetic information stored as chromosomal DNA in a cell independently, and can be used for cloning, i.e., for amplifying genetic information in bacterial cells.
- the DNA plasmid is a medium copy or high copy plasmid, more preferably a high copy plasmid.
- high copy plasmids include, for example, pUC and pTZ plasmids or any other plasmids (e.g., pMB1, pCoIE1) comprising a replication origin that supports high copies of the plasmid.
- treatment and the like are used herein to generally mean obtaining a desired pharmacological and/or physiological effect.
- treatment according to the invention may relate to the treatment of a disease state, but may also relate to prophylactic treatment in terms of complete or partial prevention of a disease or its symptoms.
- treatment is to be understood as being therapeutic in terms of partial or complete cure of a disease and/or adverse effects and/or symptoms attributable to the disease.
- Treatment may also be prophylactic or preventive treatment, i.e. measures taken to prevent a disease, For example, to prevent the onset of infection and/or disease.
- the 5'-UTR of the recombinant nucleic acid molecule includes a polynucleotide with a sequence as shown in SEQ ID NO:1
- the 3'-UTR of the recombinant nucleic acid molecule includes a polynucleotide with a sequence as shown in SEQ ID NO:22
- the following scheme is also an embodiment of the present invention: the 5'-UTR of the recombinant nucleic acid molecule includes a polynucleotide with a sequence as shown in SEQ ID NO:1
- the 3'-UTR of the recombinant nucleic acid molecule includes a polynucleotide with a sequence as shown in SEQ ID NO:22.
- the inventors unexpectedly discovered that incorporating the 5'-UTR of gene PPIA, HPX, FTCD, CDK5RAP3, HSPA8, HBA1, HBB, MYSM1, LENG1, TMSB4X, CASP4, IFNA1, PGLYRP1, UCHL1, CPAMD8, TTR, APOA2, GH1, DTYMK, APOC2 or CDK7 into mRNA can improve the translation efficiency of the coding sequence.
- PPIA peptidylprolyl isomerase A
- HPX hemopexin
- CDK5RAP3 CDK5 regulatory subunit associated protein 3;
- HSPA8 heat shock protein family A (Hsp70)member 8;
- HBA1 hemoglobin subunit alpha 1
- HBB hemoglobin subunit beta
- MYSM1 Myb like, SWIRM and MPN domains 1;
- LENG1 leukocyte receptor cluster member 1
- TMSB4X thymosin beta 4 X-linked
- CASP4 caspase 4
- IFNA1 interferon alpha 1
- PGLYRP1 peptidoglycan recognition protein 1;
- UCHL1 ubiquitin C-terminal hydrolase L1;
- TTR transthyretin
- APOA2 apolipoprotein A2
- DTYMK deoxythymidylate kinase
- APOC2 apolipoprotein C2
- CDK7 cyclin-dependent kinase 7.
- the present invention provides a 5'-UTR comprising at least one selected from the following polynucleotides: (a): a 5'-UTR derived from at least one gene among PPIA, HPX, FTCD, CDK5RAP3, HSPA8, HBA1, HBB, MYSM1, LENG1, TMSB4X, CASP4, IFNA1, PGLYRP1, UCHL1, CPAMD8, TTR, APOA2, GH1, DTYMK, APOC2 and CDK7; (b): a fragment of the 5'-UTR described in (a); (c): a variant of the 5'-UTR described in (a); and (d): a variant of the fragment described in (b).
- the gene is a eukaryotic gene.
- the gene is a chordate gene.
- the gene is a vertebrate gene.
- the gene is a mammalian gene.
- the gene is a primate gene.
- each of the genes is independently a gene of humans (Homo sapiens), a gene of bonobos (Pan paniscus), a gene of chimpanzees (Pan troglodytes), or a gene of western lowland gorillas (Gorilla gorilla gorilla).
- the gene is a human gene.
- the gene is human PPIA gene, human HPX gene, bonobo HPX gene, human FTCD gene, human CDK5RAP3 gene, western lowland gorilla CDK5RAP3 gene, human HSPA8 gene, human HBA1 gene, human HBB gene, human MYSM1 gene, bonobo MYSM1 gene, human LENG1 gene, human TMSB4X gene, human CASP4 gene, human IFNA1 gene, human PGLYRP1 gene, western lowland gorilla PGLYRP1 gene, human UCHL1 gene, human CPAMD8 gene, chimpanzee CPAMD8 gene, human TTR gene, western lowland gorilla TTR gene, human APOA2 gene, human GH1 gene, human DTYMK gene, bonobo DTYMK gene, human APOC2 gene, chimpanzee APOC2 gene, and human CDK7 gene.
- the GenBank accession number of the PPIA gene is BC137057.1; the GenBank accession number of the HPX gene is AH002827.2; the GenBank accession number of the FTCD gene is NM_006657.3; the GenBank accession number of the CDK5RAP3 gene is AK223387.1; the GenBank accession number of the HSPA8 gene is NM_006597.6; the GenBank accession number of the HBA1 gene is NM_000558.5; the GenBank accession number of the HBB gene is NM_000518.5; the GenBank accession number of the MYSM1 gene is NM_001085487.2; the GenBank accession number of the LENG1 gene is NM_024316.3; the GenBank accession number of the TMSB4X gene is NM_021109.4; and the CAS
- the GenBank accession number of the P4 gene is NM_001225.3; the GenBank accession number of the IFNA1 gene is NM_02
- the polynucleotides encoding the 5'-UTR of human genes PPIA, HPX, FTCD, CDK5RAP3, HSPA8, HBA1, HBB, MYSM1, LENG1, TMSB4X, CASP4, IFNA1, PGLYRP1, UCHL1, CPAMD8, TTR, APOA2, GH1, DTYMK, APOC2, and CDK7 are as shown in Table 1.
- the 5'-UTR comprises at least one of the following polynucleotides: an RNA encoded by a polynucleotide having a sequence as shown in at least one of SEQ ID NOs: 1 to 21, a fragment of an RNA encoded by a polynucleotide having a sequence as shown in at least one of SEQ ID NOs: 1 to 21, a variant of an RNA encoded by a polynucleotide having a sequence as shown in at least one of SEQ ID NOs: 1 to 21, and a variant of a fragment of an RNA encoded by a polynucleotide having a sequence as shown in at least one of SEQ ID NOs: 1 to 21.
- the fragment, variant, or variant of the fragment of an RNA encoded by a polynucleotide having a sequence as shown in at least one of SEQ ID NOs: 1 to 21 has at least 40%, 50%, 60%, 70%, 80%, 85%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, or 99% identity with an RNA encoded by a polynucleotide having a sequence as shown in one of SEQ ID NOs: 1 to 21.
- a fragment, variant, or variant of a fragment of an RNA encoded by a polynucleotide having a sequence as shown in at least one of SEQ ID NOs: 1 to 21 has at least 1, 2, 3, 4, 5, 6, 7, 8, 9, 10 or more nucleotides inserted, added, deleted or substituted compared to an RNA encoded by a sequence as shown in one of SEQ ID NOs: 1 to 21.
- a fragment, variant, or variant of a fragment of A is an abbreviation for a fragment of A, a variant of A, or a variant of a fragment of A.
- a fragment, variant, or variant of a fragment of an RNA encoded by a polynucleotide having a sequence as shown in at least one of SEQ ID NOs: 1 to 21 refers to a fragment of an RNA encoded by a polynucleotide having a sequence as shown in at least one of SEQ ID NOs: 1 to 21, a variant of an RNA encoded by a polynucleotide having a sequence as shown in at least one of SEQ ID NOs: 1 to 21, or a variant of a fragment of an RNA encoded by a polynucleotide having a sequence as shown in at least one of SEQ ID NOs: 1 to 21.
- the gene is selected from at least one of PPIA, HPX, FTCD, CDK5RAP3, HSPA8, HBA1 and HBB.
- the 5'-UTR comprises at least one of the following polynucleotides: 1): RNA encoded by a polynucleotide shown in sequence SEQ ID NO: 9, 7, 18, 12, 8, 1 or 6; 2): a fragment of the RNA in 1); 3): a variant of the RNA in 1); and a variant of the fragment in 2).
- the fragment, variant, or variant of the RNA encoded by the polynucleotide of the sequence SEQ ID NO: 9, 7, 18, 12, 8, 1 or 6 has at least 40%, 50%, 60%, 70%, 80%, 85%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98% or 99% identity with the RNA encoded by the polynucleotide of the sequence SEQ ID NO: 9, 7, 18, 12, 8, 1 or 6.
- the fragment, variant, or variant of the RNA encoded by the polynucleotide of the sequence SEQ ID NO: 9, 7, 18, 12, 8, 1 or 6 has at least 1, 2, 3, 4, 5, 6, 7, 8, 9, 10 or more nucleotides inserted, added, deleted or substituted compared to the RNA encoded by the polynucleotide of the sequence SEQ ID NO: 9, 7, 18, 12, 8, 1 or 6.
- the 5'-UTR of the present invention may comprise two or more tandem 5'-UTRs from the above-mentioned genes, fragments of the 5'-UTRs of the above-mentioned genes, variants of the 5'-UTRs of the above-mentioned genes, or variants of the fragments of the 5'-UTRs of the above-mentioned genes.
- the 5'-UTR comprises at least one of the following polynucleotides: (a): 5'-UTRs derived from at least two genes of PPIA, HPX, FTCD, CDK5RAP3, HSPA8, HBA1, HBB, MYSM1, LENG1, TMSB4X, CASP4, IFNA1, PGLYRP1, UCHL1, CPAMD8, TTR, APOA2, GH1, DTYMK, APOC2 and CDK7; (b): fragments of the 5'-UTRs of at least two of the genes in (a); (c): variants of the 5'-UTRs of at least two of the genes in (a); and (d): variants of the fragments in (b).
- the 5'-UTR comprises at least one of the following polynucleotides: (e): a 5'-UTR derived from at least two genes among genes PPIA, HPX, FTCD, CDK5RAP3, HSPA8, HBA1 and HBB; (f): a fragment of the 5'-UTR described in (e); (g): a variant of the 5'-UTR described in (e); and (h): a variant of the fragment described in (f).
- the 5'-UTR comprises at least one of the following polynucleotides: RNA encoded by a polynucleotide having a sequence as shown in at least two of SEQ ID NOs: 1 to 21, a fragment of an RNA encoded by a polynucleotide having a sequence as shown in at least two of SEQ ID NOs: 1 to 21, a variant of an RNA encoded by a polynucleotide having a sequence as shown in at least two of SEQ ID NOs: 1 to 21, and a variant of a fragment of an RNA encoded by a polynucleotide having a sequence as shown in at least two of SEQ ID NOs: 1 to 21.
- the sequence is one of SEQ ID NOs: 1 to 21.
- the fragment, variant, or variant of the RNA encoded by the polynucleotide shown in the sequence has at least 40%, 50%, 60%, 70%, 80%, 85%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98% or 99% identity with the RNA encoded by the polynucleotide shown in one of SEQ ID NOs: 1 to 21.
- the fragment, variant, or variant of the RNA encoded by the polynucleotide shown in one of SEQ ID NOs: 1 to 21 has at least 1, 2, 3, 4, 5, 6, 7, 8, 9, 10 or more nucleotides inserted, added, deleted or substituted compared to the RNA encoded by the polynucleotide shown in one of SEQ ID NOs: 1 to 21.
- the 5'-UTR includes at least one of the following polynucleotides: RNA encoded by a polynucleotide sequence as shown in at least two of SEQ ID NOs: 9, 7, 18, 12, 8, 1 and 6, a fragment of RNA encoded by a polynucleotide sequence as shown in at least two of SEQ ID NOs: 9, 7, 18, 12, 8, 1 and 6, a variant of RNA encoded by a polynucleotide sequence as shown in at least two of SEQ ID NOs: 9, 7, 18, 12, 8, 1 and 6, and a variant of a fragment of RNA encoded by a polynucleotide sequence as shown in at least two of SEQ ID NOs: 9, 7, 18, 12, 8, 1 and 6.
- the fragment, variant, or variant of the RNA encoded by the polynucleotide of sequence as shown in SEQ ID NO: 9, 7, 18, 12, 8, 1 or 6 has at least 40%, 50%, 60%, 70%, 80%, 85%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98% or 99% identity with the RNA encoded by the polynucleotide of sequence as shown in SEQ ID NO: 9, 7, 18, 12, 8, 1 or 6.
- the fragment, variant, or variant of the RNA encoded by the polynucleotide of sequence as shown in SEQ ID NO: 9, 7, 18, 12, 8, 1 or 6 has at least 1, 2, 3, 4, 5, 6, 7, 8, 9, 10 or more nucleotides inserted, added, deleted or substituted compared to the RNA encoded by the polynucleotide of sequence as shown in SEQ ID NO: 9, 7, 18, 12, 8, 1 or 6.
- the 5'-UTR comprises at least one of the following polynucleotides: a): at least two copies of a 5'-UTR derived from at least one of the genes PPIA, HPX, FTCD, CDK5RAP3, HSPA8, HBA1, HBB, MYSM1, LENG1, TMSB4X, CASP4, IFNA1, PGLYRP1, UCHL1, CPAMD8, TTR, APOA2, GH1, DTYMK, APOC2, and CDK7; b): at least two copies of a fragment of a 5'-UTR of at least one of the genes in a); c): at least two copies of a variant of a 5'-UTR of at least one of the genes in a); and d): at least two copies of a variant of a fragment of a 5'-UTR of at least one of the genes in a).
- the 5'-UTR comprises at least two copies of a 5'-UTR derived from at least one of the genes PPIA, HPX, FTCD, CDK5RAP3, HSPA8, HBA1, and HBB.
- the at least two copies are two copies, three copies, four copies, five copies, six copies, seven copies, eight copies or nine copies.
- the 5'-UTR comprises at least one of the following polynucleotides: at least two copies of RNA encoded by a polynucleotide having a sequence as shown in one of SEQ ID NOs: 1 to 21, at least two copies of a fragment of RNA encoded by a polynucleotide having a sequence as shown in one of SEQ ID NOs: 1 to 21, at least two copies of a variant of RNA encoded by a polynucleotide having a sequence as shown in one of SEQ ID NOs: 1 to 21, and at least two copies of a variant of a fragment of RNA encoded by a polynucleotide having a sequence as shown in one of SEQ ID NOs: 1 to 21.
- the at least two copies are two copies, three copies, four copies, five copies, six copies, seven copies, eight copies or nine copies.
- the fragment, variant, or variant of the RNA encoded by the polynucleotide whose sequence is shown in one of SEQ ID NO: 1 to 21 has at least 40%, 50%, 60%, 70%, 80%, 85%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98% or 99% identity with the RNA encoded by the polynucleotide whose sequence is shown in one of SEQ ID NO: 1 to 21.
- the fragment, variant, or variant of the RNA encoded by the polynucleotide whose sequence is shown in one of SEQ ID NO: 1 to 21 has at least 1, 2, 3, 4, 5, 6, 7, 8, 9, 10 or more nucleotides inserted, added, deleted or substituted compared to the RNA encoded by the polynucleotide whose sequence is shown in one of SEQ ID NO: 1 to 21.
- the 5'-UTR comprises at least one of the following polynucleotides: at least two copies of an RNA encoded by a polynucleotide having a sequence as shown in one of SEQ ID NOs: 9, 7, 18, 12, 8, 1, and 6, at least two copies of an RNA fragment encoded by a polynucleotide having a sequence as shown in one of SEQ ID NOs: 9, 7, 18, 12, 8, 1, and 6, at least two copies of a polynucleotide having a sequence as shown in one of SEQ ID NOs: 9, 7, 18, 12, 8, 1, and 6.
- the fragment, variant, or variant of the fragment of RNA encoded by the polynucleotide of the sequence as shown in SEQ ID NO:9, 7, 18, 12, 8, 1 or 6 has at least 40%, 50%, 60%, 70%, 80%, 85%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98% or 99% identity with the RNA encoded by the polynucleotide of the sequence as shown in SEQ ID NO:9, 7, 18, 12, 8, 1 or 6.
- the at least two copies are two copies, three copies, four copies, five copies, six copies, seven copies, eight copies or nine copies.
- the fragment, variant, or variant of the fragment of RNA encoded by the polynucleotide of the sequence as shown in SEQ ID NO:9, 7, 18, 12, 8, 1 or 6 has at least 40%, 50%, 60%, 70%, 80%, 85%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98% or 99% identity.
- a fragment, variant, or variant of an RNA encoded by a polynucleotide having a sequence as shown in one of SEQ ID NOs: 9, 7, 18, 12, 8, 1, and 6 has an insertion, addition, deletion, or substitution of at least 1, 2, 3, 4, 5, 6, 7, 8, 9, 10 or more nucleotides compared to the RNA encoded by the polynucleotide having a sequence as shown in SEQ ID NOs: 9, 7, 18, 12, 8, 1, or 6.
- the inventors unexpectedly discovered that incorporation of a 3'-UTR from gene MPND, FBXW10, FBXW12, PGLYRP1, HPX, CDK7, APOC2, PFN1, RBP4, FTCD, NAAA, ALB, GSDMD, FBXL8, ORM1, CASP4, CHMP2A, LENG1, MYCBPAP, APOC1, GAPDH, HSPA8, APOA2, UCHL1, TSG101, NAE1 or NFKB2 into mRNA can increase the translation efficiency of the coding sequence.
- MPND MPN domain containing
- PGLYRP1 peptidoglycan recognition protein 1;
- HPX hemopexin
- CDK7 cyclin dependent kinase 7
- APOC2 apolipoprotein C2
- PFN1 profilin 1
- RBP4 retinol binding protein 4
- ALB albumin
- GSDMD gasdermin D
- ORM1 orosomucoid 1;
- CASP4 caspase 4
- CHMP2A charged multivesicular body protein 2A
- LENG1 leukocyte receptor cluster member 1
- MYCBPAP MYCBP associated protein
- APOC1 apolipoprotein C1
- GAPDH glyceraldehyde-3-phosphate dehydrogenase
- HSPA8 heat shock protein family A (Hsp70)member 8;
- APOA2 apolipoprotein A2
- UCHL1 ubiquitin C-terminal hydrolase L1;
- TSG101 tumor susceptibility 101
- NFKB2 nuclear factor kappa B subunit 2.
- the present invention provides a 3'-UTR comprising at least one selected from the following polynucleotides: (a): derived from a gene The 3'-UTR of at least one gene among MPND, FBXW10, FBXW12, PGLYRP1, HPX, CDK7, APOC2, PFN1, RBP4, FTCD, NAAA, ALB, GSDMD, FBXL8, ORM1, CASP4, CHMP2A, LENG1, MYCBPAP, APOC1, GAPDH, HSPA8, APOA2, UCHL1, TSG101, NAE1 and NFKB2; (b): a fragment of the 3'-UTR described in (a); (c): a variant of the 3'-UTR described in (a); and (d): a variant of the fragment described in (b).
- the gene is a eukaryotic gene.
- the gene is a chordate gene.
- the gene is a vertebrate gene.
- the gene is a mammalian gene.
- the gene is a primate gene.
- each of the genes is independently a gene of humans (Homo sapiens), a gene of bonobos (Pan paniscus), a gene of chimpanzees (Pan troglodytes), or a gene of western lowland gorillas (Gorilla gorilla gorilla).
- the gene is a human gene.
- the gene is at least one of a human MPND gene, a bonobo MPND gene, a human FBXW10 gene, a human FBXW12 gene, a bonobo FBXW12 gene, a human PGLYRP1 gene, a human HPX gene, a human CDK7 gene, a human APOC2 gene, a human PFN1 gene, a human RBP4 gene, a human FTCD gene, a human NAAA gene, a human ALB gene, a human GSDMD gene, a human FBXL8 gene, a bonobo FBXL8 gene, a western lowland gorilla FBXL8 gene, a human ORM1 gene, a human CASP4 gene, a human CHMP2A gene, a bonobo CHMP2A gene, a human LENG1 gene, a bonobo LENG1 gene, a human MYCBPAP gene, a human APOC1 gene,
- the GenBank accession number of the MPND gene is NM_001300862.1; the GenBank accession number of the FBXW10 gene is NM_001267586.2; the GenBank accession number of the FBXW12 gene is NM_001159929.1; the GenBank accession number of the PGLYRP1 gene is NM_005091.3; the GenBank accession number of the HPX gene is AH002827.2; and the GenBank accession number of the CDK7 gene is AY130859.
- GenBank accession number of APOC2 gene is NM_000483.5; the GenBank accession number of PFN1 gene is NM_005022.4; the GenBank accession number of RBP4 gene is NM_006744.4; the GenBank accession number of FTCD gene is NM_006657.3; the GenBank accession number of NAAA gene is NM_001363719.2; the GenBank accession number of ALB gene is NM_000477.7; the GenBank accession number of GSDMD gene is NM_024736.7; FB The GenBank accession number of XL8 gene is NM_018378.2; the GenBank accession number of ORM1 gene is NM_000607.4; the GenBank accession number of CASP4 gene is NM_001225.3; the GenBank accession number of CHMP2A gene is NM_198426.3; the GenBank accession number of LENG1 gene is NM_024316.3; the GenBank accession number of MY
- the polynucleotides encoding the 3'-UTRs of human genes MPND, FBXW10, FBXW12, PGLYRP1, HPX, CDK7, APOC2, PFN1, RBP4, FTCD, NAAA, ALB, GSDMD, FBXL8, ORM1, CASP4, CHMP2A, LENG1, MYCBPAP, APOC1, GAPDH, HSPA8, APOA2, UCHL1, TSG101, NAE1, and NFKB2 are as shown in Table 2.
- the 3'-UTR comprises at least one of the following polynucleotides: an RNA encoded by a polynucleotide having a sequence as shown in at least one of SEQ ID NO: 22 to 48, a fragment of an RNA encoded by a polynucleotide having a sequence as shown in at least one of SEQ ID NO: 22 to 48, a variant of an RNA encoded by a polynucleotide having a sequence as shown in at least one of SEQ ID NO: 22 to 48, and a variant of a fragment of an RNA encoded by a polynucleotide having a sequence as shown in at least one of SEQ ID NO: 22 to 48.
- the fragment, variant, or variant of the fragment of an RNA encoded by a polynucleotide having a sequence as shown in at least one of SEQ ID NO: 22 to 48 has at least 40%, 50%, 60%, 70%, 80%, 85%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, or 99% identity with an RNA encoded by a polynucleotide having a sequence as shown in one of SEQ ID NO: 22 to 48.
- the fragment, variant, or variant of the fragment of the RNA encoded by the polynucleotide whose sequence is shown in at least one of SEQ ID NOs: 22 to 48 has at least 1, 2, 3, 4, 5, 6, 7, 8, 9, 10 or more nucleotides inserted, added, deleted, or substituted compared to the RNA encoded by the polynucleotide whose sequence is shown in one of SEQ ID NOs: 22 to 48.
- the gene is selected from at least one of MPND, FBXW10, FBXW12 and PGLYRP1.
- the 3'-UTR comprises at least one of the following polynucleotides: RNA encoded by a polynucleotide with a sequence as shown in SEQ ID NO: 24, 22, 23 or 25, a fragment of RNA encoded by a polynucleotide with a sequence as shown in SEQ ID NO: 24, 22, 23 or 25, a variant of RNA encoded by a polynucleotide with a sequence as shown in SEQ ID NO: 24, 22, 23 or 25, and a variant of a fragment of RNA encoded by a polynucleotide with a sequence as shown in SEQ ID NO: 24, 22, 23 or 25.
- the fragment, variant, or variant of the RNA encoded by the polynucleotide of the sequence as shown in SEQ ID NO: 24, 22, 23 or 25 has at least 40%, 50%, 60%, 70%, 80%, 85%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98% or 99% identity with the RNA encoded by the polynucleotide of the sequence as shown in SEQ ID NO: 24, 22, 23 or 25.
- the fragment, variant, or variant of the RNA encoded by the polynucleotide of the sequence as shown in SEQ ID NO: 24, 22, 23 or 25 has at least 40%, 50%, 60%, 70%, 80%, 85%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98% or 99% identity with the RNA encoded by the polynucleotide of the sequence as shown in SEQ ID NO: 24, 22, 23 or 25.
- the encoded RNA has at least 1, 2, 3, 4, 5, 6, 7, 8, 9, 10 or more nucleotide insertions, additions, deletions or substitutions compared to the encoded RNA.
- the 3'-UTR sequence of the present invention may comprise two or more tandemly linked 3'-UTRs from the above-mentioned genes, fragments of the 3'-UTRs of the above-mentioned genes, variants of the 3'-UTRs of the above-mentioned genes, or variants of the 3'-UTR fragments of the above-mentioned genes.
- the 3'-UTR comprises at least one of the following polynucleotides: (a): 3'-UTRs derived from at least two genes of the genes MPND, FBXW10, FBXW12, PGLYRP1, HPX, CDK7, APOC2, PFN1, RBP4, FTCD, NAAA, ALB, GSDMD, FBXL8, ORM1, CASP4, CHMP2A, LENG1, MYCBPAP, APOC1, GAPDH, HSPA8, APOA2, UCHL1, TSG101, NAE1, and NFKB2; (b): fragments of the 3'-UTRs of at least two of the genes in (a); (c): variants of the 3'-UTRs of at least two of the genes in (a); and (d): variants of the fragments in (b).
- the 3’-UTR comprises at least one of the following polynucleotides: (e): a 3’-UTR derived from at least two genes of the genes MPND, FBXW10, FBXW12, and PGLYRP1; (f): a fragment of the 3’-UTR described in (e); (g): a variant of the 3’-UTR described in (e); and (h): a variant of the fragment described in (f).
- the 3'-UTR comprises at least one of the following polynucleotides: RNA encoded by at least two of the polynucleotides in SEQ ID NOs: 22 to 48, fragments of RNA encoded by at least two of the polynucleotides in SEQ ID NOs: 22 to 48, variants of RNA encoded by at least two of the polynucleotides in SEQ ID NOs: 22 to 48, and variants of fragments of RNA encoded by at least two of the polynucleotides in SEQ ID NOs: 22 to 48.
- the fragment, variant, or variant of the fragment of RNA encoded by one of the polynucleotides in SEQ ID NOs: 22 to 48 has at least 40%, 50%, 60%, 70%, 80%, 85%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, or 99% identity with the RNA encoded by the polynucleotide in one of the sequences as SEQ ID NOs: 22 to 48.
- a fragment, variant, or variant of an RNA encoded by a polynucleotide shown in one of sequences SEQ ID NO: 22 to 48 has at least 1, 2, 3, 4, 5, 6, 7, 8, 9, 10 or more nucleotides inserted, added, deleted or substituted compared to the RNA encoded by the polynucleotide shown in one of sequences SEQ ID NO: 22 to 48.
- the 3’-UTR comprises at least one of the following polynucleotides: RNA encoded by at least two of the polynucleotides shown in sequences SEQ ID NO: 24, 22, 23 and 25, fragments of RNA encoded by at least two of the polynucleotides shown in sequences SEQ ID NO: 24, 22, 23 and 25, variants of RNA encoded by at least two of the polynucleotides shown in sequences SEQ ID NO: 24, 22, 23 and 25, and variants of fragments of RNA encoded by at least two of the polynucleotides shown in sequences SEQ ID NO: 24, 22, 23 and 25.
- the fragment, variant, or variant of the RNA encoded by the polynucleotide of sequence SEQ ID NO: 24, 22, 23, or 25 has at least 40%, 50%, 60%, 70%, 80%, 85%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, or 99% identity with the RNA encoded by the polynucleotide of sequence SEQ ID NO: 24, 22, 23, or 25.
- the fragment, variant, or variant of the RNA encoded by the polynucleotide of sequence SEQ ID NO: 24, 22, 23, or 25 has at least 1, 2, 3, 4, 5, 6, 7, 8, 9, 10 or more nucleotides inserted, added, deleted, or substituted compared to the RNA encoded by the polynucleotide of sequence SEQ ID NO: 24, 22, 23, or 25.
- the 3'-UTR comprises at least one of the following polynucleotides: a): at least two copies of a 3'-UTR derived from one of the genes MPND, FBXW10, FBXW12, PGLYRP1, HPX, CDK7, APOC2, PFN1, RBP4, FTCD, NAAA, ALB, GSDMD, FBXL8, ORM1, CASP4, CHMP2A, LENG1, MYCBPAP, APOC1, GAPDH, HSPA8, APOA2, UCHL1, TSG101, NAE1, and NFKB2; b): at least two copies of a fragment of the 3'-UTR of at least one of the genes in a); c): at least two copies of a variant of the 3'-UTR of at least one of the genes in a); and d): at least two copies of a variant of a fragment of the 3'-UTR of at least one of the genes in a).
- the 3'-UTR comprises at least one of the following polynucleotides: e): at least two copies of a 3'-UTR derived from at least one of the genes MPND, FBXW10, FBXW12, and PGLYRP1; f): at least two copies of a fragment of a 3'-UTR derived from at least one of the genes in e); g): at least two copies of e) a variant of the 3'-UTR of at least one of the genes described in; and h) at least two copies of a variant of the fragment described in f).
- the at least two copies are two copies, three copies, four copies, five copies, six copies, seven copies, eight copies or nine copies.
- the 3'-UTR sequence comprises at least one of the following polynucleotides: at least two copies of an RNA encoded by a polynucleotide having a sequence as shown in one of SEQ ID NOs: 22 to 48, two copies of a fragment of an RNA encoded by a polynucleotide having a sequence as shown in one of SEQ ID NOs: 22 to 48, two copies of a variant of an RNA encoded by a polynucleotide having a sequence as shown in one of SEQ ID NOs: 22 to 48, and two copies of a variant of a fragment of an RNA encoded by a polynucleotide having a sequence as shown in one of SEQ ID NOs: 22 to 48.
- the at least two copies are two copies, three copies, four copies, five copies, six copies, seven copies, eight copies or nine copies.
- the fragment, variant, or variant of the RNA encoded by the polynucleotide whose sequence is shown in one of SEQ ID NOs: 22 to 48 has at least 40%, 50%, 60%, 70%, 80%, 85%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, or 99% identity with the RNA encoded by the polynucleotide whose sequence is shown in one of SEQ ID NOs: 22 to 48.
- the fragment, variant, or variant of the RNA encoded by the polynucleotide whose sequence is shown in one of SEQ ID NOs: 22 to 48 has at least 1, 2, 3, 4, 5, 6, 7, 8, 9, 10 or more nucleotides inserted, added, deleted, or substituted compared to the RNA encoded by the polynucleotide whose sequence is shown in one of SEQ ID NOs: 22 to 48.
- the 3'-UTR comprises at least one of the following polynucleotides: at least two copies of an RNA encoded by a polynucleotide having a sequence as shown in one of SEQ ID NOs: 24, 22, 23 and 25, at least two copies of a fragment of an RNA encoded by a polynucleotide having a sequence as shown in one of SEQ ID NOs: 24, 22, 23 and 25, at least two copies of a variant of an RNA encoded by a polynucleotide having a sequence as shown in one of SEQ ID NOs: 24, 22, 23 and 25, and at least two copies of a variant of a fragment of an RNA encoded by a polynucleotide having a sequence as shown in one of SEQ ID NOs: 24, 22, 23 and 25.
- the at least two copies are two copies, three copies, four copies, five copies, six copies, seven copies, eight copies or nine copies.
- the fragment, variant, or variant of the RNA encoded by the polynucleotide whose sequence is shown as one of SEQ ID NO: 24, 22, 23 and 25 has at least 40%, 50%, 60%, 70%, 80%, 85%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98% or 99% identity with the RNA encoded by the polynucleotide whose sequence is shown as SEQ ID NO: 24, 22, 23 or 25.
- the fragment, variant, or variant of the RNA encoded by the polynucleotide whose sequence is shown as one of SEQ ID NO: 24, 22, 23 and 25 has at least 1, 2, 3, 4, 5, 6, 7, 8, 9, 10 or more nucleotides inserted, added, deleted or substituted compared to the RNA encoded by the polynucleotide whose sequence is shown as SEQ ID NO: 24, 22, 23 or 25.
- RNA molecules comprising the 5' and/or 3'-UTR of the present invention
- the present invention provides a recombinant RNA molecule comprising a 5' and/or 3'-UTR identified in the present invention that improves the translation of a coding sequence.
- the recombinant RNA molecule comprises a first nucleotide sequence encoding a polypeptide and/or protein of interest and a second nucleotide sequence containing a 5'-UTR, wherein the second nucleotide sequence comprises at least one selected from the following polynucleotides: (a): a 5'-UTR derived from at least one of the genes PPIA, HPX, FTCD, CDK5RAP3, HSPA8, HBA1, HBB, MYSM1, LENG1, TMSB4X, CASP4, IFNA1, PGLYRP1, UCHL1, CPAMD8, TTR, APOA2, GH1, DTYMK, APOC2 and CDK7; (b): a fragment of the 5'-UTR described in (a); (c): a variant of the 5'-UTR described in (a); and (d): a variant of the fragment described in (b), wherein the first nucleotide sequence and the second nucleot
- the gene is a human gene.
- the first nucleotide sequence encodes at least one polypeptide of interest. For example, one, two, three, four, five, six, seven, eight, nine, or ten polypeptides of interest.
- the first nucleotide sequence encodes at least one protein of interest. For example, one, two, three, four, five, six, seven, eight, nine, or ten proteins of interest.
- the first nucleotide sequence encodes at least one polypeptide of interest and at least one protein of interest. For example, one, two, three, four, five, six, seven, eight, nine or ten polypeptides of interest and one, two, three, four, five, six, seven, eight, nine or ten proteins of interest.
- the second nucleotide sequence comprises at least one of the following polynucleotides: an RNA encoded by a polynucleotide having a sequence as shown in at least one of SEQ ID NO: 1 to 21, a fragment of an RNA encoded by a polynucleotide having a sequence as shown in at least one of SEQ ID NO: 1 to 21, a variant of an RNA encoded by a polynucleotide having a sequence as shown in at least one of SEQ ID NO: 1 to 21, and a variant of a fragment of an RNA encoded by a polynucleotide having a sequence as shown in at least one of SEQ ID NO: 1 to 21.
- the fragment, variant, or variant of the fragment of an RNA encoded by a polynucleotide having a sequence as shown in at least one of SEQ ID NO: 1 to 21 has at least 40%, 50%, 60%, 70%, 80%, 85%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, or 99% identity with an RNA encoded by a polynucleotide having a sequence as shown in one of SEQ ID NO: 1 to 21.
- the fragment, variant, or variant of the fragment of the RNA encoded by the polynucleotide with a sequence as shown in at least one of SEQ ID NOs: 1 to 21 has at least 1, 2, 3, 4, 5, 6, 7, 8, 9, 10 or more nucleotides inserted, added, deleted or substituted compared to the RNA encoded by the polynucleotide with a sequence as shown in one of SEQ ID NOs: 1 to 21.
- the gene is selected from at least one of PPIA, HPX, FTCD, CDK5RAP3, HSPA8, HBA1 and HBB.
- the second nucleotide sequence comprises at least one of the following polynucleotides: 1): RNA encoded by a polynucleotide shown in sequence SEQ ID NO: 9, 7, 18, 12, 8, 1 or 6; 2): a fragment of the RNA in 1); 3): a variant of the RNA in 1); and 4): a variant of the fragment in 2).
- the fragment, variant, or variant of the RNA encoded by the polynucleotide shown in SEQ ID NO: 9, 7, 18, 12, 8, 1 or 6 has at least 40%, 50%, 60%, 70%, 80%, 85%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98% or 99% identity with the polynucleotide shown in SEQ ID NO: 9, 7, 18, 12, 8, 1 or 6.
- the fragment, variant, or variant of the RNA encoded by the polynucleotide shown in SEQ ID NO: 9, 7, 18, 12, 8, 1 or 6 has at least 1, 2, 3, 4, 5, 6, 7, 8, 9, 10 or more nucleotides inserted, added, deleted or substituted compared to the RNA encoded by the polynucleotide shown in SEQ ID NO: 9, 7, 18, 12, 8, 1 or 6.
- the second nucleotide sequence may comprise two or more tandem 5'-UTRs from the above-mentioned gene, fragments of the 5'-UTR of the above-mentioned gene, variants of the 5'-UTR of the above-mentioned gene, or variants of the fragments of the 5'-UTR of the above-mentioned gene.
- the second nucleotide sequence comprises at least one of the following polynucleotides: (a): 5'-UTRs derived from at least two of the genes PPIA, HPX, FTCD, CDK5RAP3, HSPA8, HBA1, HBB, MYSM1, LENG1, TMSB4X, CASP4, IFNA1, PGLYRP1, UCHL1, CPAMD8, TTR, APOA2, GH1, DTYMK, APOC2 and CDK7; (b): fragments of the 5'-UTRs of at least two of the genes in (a); (c): variants of the 5'-UTRs of at least two of the genes in (a); and (d): variants of the fragments in (b).
- the second nucleotide sequence comprises at least one of the following polynucleotides: (e): a 5'-UTR derived from at least two genes among genes PPIA, HPX, FTCD, CDK5RAP3, HSPA8, HBA1 and HBB; (f): a fragment of the 5'-UTR described in (e); (g): a variant of the 5'-UTR described in (e); and (h): a variant of the fragment described in (f).
- the second nucleotide sequence comprises at least one of the following polynucleotides: RNA encoded by a polynucleotide having a sequence as shown in at least two of SEQ ID NOs: 1 to 21, a fragment of an RNA encoded by a polynucleotide having a sequence as shown in at least two of SEQ ID NOs: 1 to 21, a variant of an RNA encoded by a polynucleotide having a sequence as shown in at least two of SEQ ID NOs: 1 to 21, and a variant of a fragment of an RNA encoded by a polynucleotide having a sequence as shown in at least two of SEQ ID NOs: 1 to 21.
- the second nucleotide sequence comprises RNA encoded by a polynucleotide having a sequence as shown in at least two of SEQ ID NOs: 1 to 21, a fragment of an RNA encoded by a polynucleotide having a sequence as shown in SEQ ID NOs: 1 to 21, a variant of an RNA encoded by a polynucleotide having a sequence as shown in SEQ ID NOs: 1 to 21, and a variant of a fragment of an RNA encoded by a polynucleotide having a sequence as shown in SEQ ID NOs: 1 to 21.
- RNA encoded by a polynucleotide as shown in at least two of SEQ ID NOs: 1 to 21 a fragment of an RNA encoded by a polynucleotide as shown in at least two of SEQ ID NOs: 1 to 21, a variant of an RNA encoded by a polynucleotide as shown in at least two of SEQ ID NOs: 1 to 21, or a variant of a fragment of an RNA encoded by a polynucleotide as shown in at least two of SEQ ID NOs: 1 to 21.
- a fragment, a variant, or a variant of a fragment of an RNA encoded by a polynucleotide as shown in one of SEQ ID NOs: 1 to 21 has at least 40%, 50%, 60%, 70%, 80%, 85%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98% or 99% identity with an RNA encoded by a polynucleotide as shown in one of SEQ ID NOs: 1 to 21.
- a fragment, variant, or variant of an RNA encoded by a polynucleotide having a sequence as shown in one of SEQ ID NOs: 1 to 21 has at least 1, 2, 3, 4, 5, 6, 7, 8, 9, 10 or more nucleotides inserted, added, deleted or substituted compared to the RNA encoded by the polynucleotide having a sequence as shown in one of SEQ ID NOs: 1 to 21.
- the second nucleotide sequence comprises at least one of the following polynucleotides: RNA encoded by a polynucleotide sequence as shown in at least two of SEQ ID NOs: 9, 7, 18, 12, 8, 1 and 6, a fragment of RNA encoded by a polynucleotide sequence as shown in at least two of SEQ ID NOs: 9, 7, 18, 12, 8, 1 and 6, a variant of RNA encoded by a polynucleotide sequence as shown in at least two of SEQ ID NOs: 9, 7, 18, 12, 8, 1 and 6, and a variant of a fragment of RNA encoded by a polynucleotide sequence as shown in at least two of SEQ ID NOs: 9, 7, 18, 12, 8, 1 and 6.
- the fragment, variant or variant of the fragment of the RNA encoded by the polynucleotide whose sequence is shown in SEQ ID NO:9, 7, 18, 12, 8, 1 or 6 has at least 40%, 50%, 60%, 70%, 80%, 85%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98% or 99% identity with the RNA encoded by the polynucleotide whose sequence is shown in SEQ ID NO:9, 7, 18, 12, 8, 1 or 6.
- the fragment, variant, or variant of the fragment of the RNA encoded by the polynucleotide with a sequence as shown in SEQ ID NO:9, 7, 18, 12, 8, 1 or 6 has at least 1, 2, 3, 4, 5, 6, 7, 8, 9, 10 or more nucleotides inserted, added, deleted or substituted compared to the RNA encoded by the polynucleotide with a sequence as shown in SEQ ID NO:9, 7, 18, 12, 8, 1 or 6.
- the second nucleotide sequence comprises at least one of the following polynucleotides: a): at least two copies of a 5'-UTR derived from at least one of the genes PPIA, HPX, FTCD, CDK5RAP3, HSPA8, HBA1, HBB, MYSM1, LENG1, TMSB4X, CASP4, IFNA1, PGLYRP1, UCHL1, CPAMD8, TTR, APOA2, GH1, DTYMK, APOC2, and CDK7; b): at least two copies of a fragment of the 5'-UTR of at least one of the genes in a); c): at least two copies of a variant of the 5'-UTR of at least one of the genes in a); and d): at least two copies of a variant of a fragment of the 5'-UTR of at least one of the genes in a).
- the second nucleotide sequence comprises at least two copies of a 5'-UTR derived from one of the genes PPIA, HPX, FTCD, CDK5RAP3, HSPA8, HBA1, HBB, MYSM1, LENG1, TMSB4X, CASP4, IFNA1, PGLYRP1, UCHL1, CPAMD8, TTR, APOA2, GH1, DTYMK, APOC2, and CDK7.
- the 5'-UTR sequence comprises at least one of the following polynucleotides: a): at least two copies of a 5'-UTR derived from at least one of the genes PPIA, HPX, FTCD, CDK5RAP3, HSPA8, HBA1, and HBB; b): at least two copies of a fragment of a 5'-UTR of at least one of the genes in a); c): at least two copies of a variant of a 5'-UTR of at least one of the genes in a); and d): at least two copies of a variant of a fragment of a 5'-UTR of at least one of the genes in a).
- the 5'-UTR sequence comprises at least two copies of a 5'-UTR derived from one of the genes PPIA, HPX, FTCD, CDK5RAP3, HSPA8, HBA1 and HBB.
- the at least two copies are two copies, three copies, four copies, five copies, six copies, seven copies, eight copies or nine copies.
- the second nucleotide sequence comprises at least one of the following polynucleotides: at least two copies of RNA encoded by a polynucleotide having a sequence as shown in one of SEQ ID NOs: 1 to 21, at least two copies of a fragment of RNA encoded by a polynucleotide having a sequence as shown in one of SEQ ID NOs: 1 to 21, at least two copies of a variant of RNA encoded by a polynucleotide having a sequence as shown in one of SEQ ID NOs: 1 to 21, and at least two copies of a variant of a fragment of RNA encoded by a polynucleotide having a sequence as shown in one of SEQ ID NOs: 1 to 21.
- the at least two copies are two copies, three copies, four copies, five copies, six copies, seven copies, eight copies, or more.
- the fragment, variant, or variant of the fragment of the RNA encoded by the polynucleotide of one of SEQ ID NOs: 1 to 21 has at least 40%, 50%, 60%, 70%, 80%, 85%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98% or 99% identity with the RNA encoded by the polynucleotide of one of SEQ ID NOs: 1 to 21.
- the fragment, variant, or variant of the fragment of the RNA encoded by the polynucleotide of one of SEQ ID NOs: 1 to 21 has at least 1, 2, 3, 4, 5, 6, 7, 8, 9, 10 or more nucleotides inserted, added, deleted or substituted compared to the RNA encoded by the polynucleotide of one of SEQ ID NOs: 1 to 21.
- the second nucleotide sequence comprises at least one of the following polynucleotides: at least two copies of an RNA encoded by a polynucleotide having a sequence as shown in one of SEQ ID NO: 9, 7, 18, 12, 8, 1 and 6, two copies of a fragment of an RNA encoded by a polynucleotide having a sequence as shown in one of SEQ ID NO: 9, 7, 18, 12, 8, 1 and 6, two copies of a variant of an RNA encoded by a polynucleotide having a sequence as shown in one of SEQ ID NO: 9, 7, 18, 12, 8, 1 and 6, and two copies of a variant of a fragment of an RNA encoded by a polynucleotide having a sequence as shown in one of SEQ ID NO: 9, 7, 18, 12, 8, 1 and 6.
- the at least two copies are two copies, three copies, four copies, five copies, six copies, seven copies, eight copies or nine copies.
- the fragment, variant, or variant of the fragment of the RNA encoded by the polynucleotide whose sequence is shown in one of SEQ ID NO:9, 7, 18, 12, 8, 1 and 6 has at least 40%, 50%, 60%, 70%, 80%, 85%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98% or 99% identity with the RNA encoded by the polynucleotide whose sequence is shown in SEQ ID NO:9, 7, 18, 12, 8, 1 or 6.
- a fragment, variant, or variant of an RNA encoded by a polynucleotide having a sequence as shown in one of SEQ ID NO:9, 7, 18, 12, 8, 1 and 6 has an insertion, addition, deletion or substitution of at least 1, 2, 3, 4, 5, 6, 7, 8, 9, 10 or more nucleotides compared to the RNA encoded by the polynucleotide having a sequence as shown in SEQ ID NO:9, 7, 18, 12, 8, 1 or 6.
- the recombinant RNA molecule is an mRNA molecule. In some embodiments, the recombinant RNA molecule further comprises at least one of a promoter, a 5'-cap structure, a 3'-UTR, and a poly(A) tail. In some embodiments, the recombinant RNA molecule further comprises at least one of a 5'-cap structure, a 3'-UTR, and a poly(A) tail.
- the 5'-cap structure includes but is not limited to at least one of m7GpppG , m27,3' - OGpppG, m7Gppp (5')N1 and m7Gppp ( m2'-O )N1.
- m7G represents 7-methylguanosine cap nucleoside
- ppp represents a triphosphate bond between the 5' carbon of the cap nucleoside and the first nucleotide of the primary RNA transcript
- N1 is the 5'most nucleotide
- G represents guanosine nucleoside
- m7 represents a methyl group at the 7-position of guanine
- m2' -O represents a methyl group at the 2'-O position of the nucleotide.
- the 3'-UTR comprises: i) a nucleotide sequence derived from the 3'-UTR of at least one of the albumin gene, the ⁇ -globin gene, the ⁇ -globin gene, the tyrosine hydroxylase gene, the lipoxygenase gene, and the collagen ⁇ gene, or a variant thereof; ii) a variant of the 3'-UTR in i); iii) at least one of the 3'-UTR, fragments, variants, and variants of fragments thereof, derived from at least one of the genes MPND, FBXW10, FBXW12, PGLYRP1, HPX, CDK7, APOC2, PFN1, RBP4, FTCD, NAAA, ALB, GSDMD, FBXL8, ORM1, CASP4, CHMP2A, LENG1, MYCBPAP, APOC1, GAPDH, HSPA8, APOA2, UCHL1, TSG101, NAE1, NFKB2,
- the fragment, variant, or variant of the fragment of RNA encoded by the polynucleotides as shown in SEQ ID NO:22-49 has at least 40%, 50%, 60%, 70%, 80%, 85%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98% or 99% identity with the RNA encoded by the polynucleotides as shown in SEQ ID NO:22-49; iv) at least two copies of one of the polynucleotide sequences i), ii) or iii); or v) at least two polynucleotides in the group consisting of the polynucleotides in i) to iii).
- the at least two copies are two copies, three copies, four copies, five copies, six copies, seven copies, eight copies, or nine copies.
- the nucleotides constituting the poly(A) tail include at least 20, at least 40, at least 80, at least 100 or at least 120 A nucleotides; the nucleotides constituting the poly(A) tail include at least 20, at least 40, at least 80, at least 100 or at least 120 A nucleotides in succession. In some embodiments, the nucleotides constituting the poly(A) tail include one or more nucleotides other than A nucleotides.
- the poly(A) tail includes two or more consecutive nucleotides other than A nucleotides, wherein the first and last nucleotides in the sequence having two or more consecutive nucleotides are nucleotides other than A nucleotides.
- the poly(A) tail is truncated, i.e., m consecutive A nucleotides and n consecutive A nucleotides are reconnected by a linker sequence consisting of p non-A nucleotides, wherein m, n and p are positive integers.
- m is 30, n is 70, and p is 10.
- the recombinant RNA molecule comprises a first nucleotide sequence encoding a polypeptide and/or protein of interest and a second nucleotide sequence comprising a 3'-UTR, wherein the second nucleotide sequence comprises at least one of the following polynucleotides: (a): a 3'-UTR derived from at least one of the genes MPND, FBXW10, FBXW12, PGLYRP1, HPX, CDK7, APOC2, PFN1, RBP4, FTCD, NAAA, ALB, GSDMD, FBXL8, ORM1, CASP4, CHMP2A, LENG1, MYCBPAP, APOC1, GAPDH, HSPA8, APOA2, UCHL1, TSG101, NAE1 and NFKB2; (b): a fragment of the 3'-UTR described in (a); (c): a variant of the 3'-UTR described in (a); and (d):
- the gene is a human gene.
- the first nucleotide sequence encodes at least one polypeptide of interest. For example, one, two, three, four, five, six, seven, eight, nine or ten polypeptides of interest. In some embodiments, the first nucleotide sequence encodes at least one protein of interest. For example, one, two, three, four, five, six, seven, eight, nine or ten proteins of interest. In some embodiments, the first nucleotide sequence encodes at least one polypeptide of interest and at least one protein of interest. For example, one, two, three, four, five, six, seven, eight, nine or ten polypeptides of interest and one, two, three, four, five, six, seven, eight, nine or ten proteins of interest.
- the second nucleotide sequence comprises at least one of the following polynucleotides: an RNA encoded by a polynucleotide having a sequence as shown in at least one of SEQ ID NO: 22 to 48, a fragment of an RNA encoded by a polynucleotide having a sequence as shown in at least one of SEQ ID NO: 22 to 48, a variant of an RNA encoded by a polynucleotide having a sequence as shown in at least one of SEQ ID NO: 22 to 48, and a variant of a fragment of an RNA encoded by a polynucleotide having a sequence as shown in at least one of SEQ ID NO: 22 to 48.
- the fragment, variant, or variant of the fragment of an RNA encoded by a polynucleotide having a sequence as shown in at least one of SEQ ID NO: 22 to 48 has at least 40%, 50%, 60%, 70%, 80%, 85%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, or 99% identity with an RNA encoded by a polynucleotide having a sequence as shown in one of SEQ ID NO: 22 to 48.
- a fragment, variant, or variant of an RNA encoded by a polynucleotide having a sequence as shown in at least one of SEQ ID NOs: 22 to 48 has at least 1, 2, 3, 4, 5, 6, 7, 8, 9, 10 or more nucleotides inserted, added, deleted, or substituted with the RNA encoded by a polynucleotide having a sequence as shown in one of SEQ ID NOs: 22 to 48.
- the gene is selected from at least one of MPND, FBXW10, FBXW12 and PGLYRP1.
- the second nucleotide sequence comprises at least one of the following polynucleotides: RNA encoded by a polynucleotide with a sequence as shown in SEQ ID NO: 24, 22, 23 or 25, a fragment of RNA encoded by a polynucleotide with a sequence as shown in SEQ ID NO: 24, 22, 23 or 25, a variant of RNA encoded by a polynucleotide with a sequence as shown in SEQ ID NO: 24, 22, 23 or 25, and a variant of a fragment of RNA encoded by a polynucleotide with a sequence as shown in SEQ ID NO: 24, 22, 23 or 25.
- the fragment, variant, or variant of the fragment of the RNA encoded by the polynucleotide of the sequence as shown in SEQ ID NO: 24, 22, 23 or 25 is at least 40%, 50%, 60%, 70%, 80%, 85%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98% or 99% identical to the RNA encoded by the polynucleotide of the sequence as shown in SEQ ID NO: 24, 22, 23 or 25.
- a fragment, variant, or variant of an RNA encoded by a polynucleotide having a sequence as shown in SEQ ID NO: 24, 22, 23, or 25 has at least 1, 2, 3, 4, 5, 6, 7, 8, 9, 10 or more nucleotides inserted, added, deleted, or substituted compared to an RNA encoded by a polynucleotide having a sequence as shown in SEQ ID NO: 24, 22, 23, or 25.
- the 3'-UTR in the second nucleotide sequence in the recombinant RNA molecule of the present invention may comprise two or more tandem 3'-UTRs from the above-mentioned gene, fragments of the 3'-UTR of the above-mentioned gene, variants of the 3'-UTR of the above-mentioned gene, or variants of fragments of the 3'-UTR of the above-mentioned gene.
- the second nucleotide sequence comprises at least one of the following polynucleotides: (a): 3'-UTRs derived from at least two of the genes MPND, FBXW10, FBXW12, PGLYRP1, HPX, CDK7, APOC2, PFN1, RBP4, FTCD, NAAA, ALB, GSDMD, FBXL8, ORM1, CASP4, CHMP2A, LENG1, MYCBPAP, APOC1, GAPDH, HSPA8, APOA2, UCHL1, TSG101, NAE1, and NFKB2; (b): fragments of the 3'-UTRs of at least two of the genes in (a); (c): variants of the 3'-UTRs of at least two of the genes in (a); and (d): variants of the fragments in (b).
- the second nucleotide sequence comprises at least one of the following polynucleotides: (e): a 3’-UTR derived from at least two genes among the genes MPND, FBXW10, FBXW12 and PGLYRP1; (f): a fragment of the 3’-UTR described in (e); (g): a variant of the 3’-UTR described in (e); and (h): a variant of the fragment described in (f).
- the second nucleotide sequence comprises at least one of the following polynucleotides: RNA encoded by a polynucleotide sequence as shown in at least two of SEQ ID NOs: 22 to 48, a fragment of an RNA encoded by a polynucleotide sequence as shown in at least two of SEQ ID NOs: 22 to 48, a variant of an RNA encoded by a polynucleotide sequence as shown in at least two of SEQ ID NOs: 22 to 48, and a variant of a fragment of an RNA encoded by a polynucleotide sequence as shown in at least two of SEQ ID NOs: 22 to 48.
- the fragment, variant, or variant of the fragment of an RNA encoded by a polynucleotide sequence as shown in one of SEQ ID NOs: 22 to 48 has at least 40%, 50%, 60%, 70%, 80%, 85%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, or 99% identity with an RNA encoded by a polynucleotide sequence as shown in one of SEQ ID NOs: 22 to 48.
- a fragment, variant, or variant of an RNA encoded by a polynucleotide whose sequence is shown as one of SEQ ID NOs: 22 to 48 has at least 1, 2, 3, 4, 5, 6, 7, 8, 9, 10 or more nucleotides inserted, added, deleted, or substituted compared to the RNA encoded by a polynucleotide whose sequence is shown as one of SEQ ID NOs: 22 to 48.
- the second nucleotide sequence comprises at least one of the following polynucleotides: RNA encoded by at least two of the polynucleotides shown in sequences SEQ ID NO: 24, 22, 23 and 25, fragments of RNA encoded by at least two of the polynucleotides shown in sequences SEQ ID NO: 24, 22, 23 and 25, variants of RNA encoded by at least two of the polynucleotides shown in sequences SEQ ID NO: 24, 22, 23 and 25, and variants of fragments of RNA encoded by at least two of the polynucleotides shown in sequences SEQ ID NO: 24, 22, 23 and 25.
- the fragment, variant, or variant of the RNA encoded by the polynucleotide shown in the sequence SEQ ID NO: 24, 22, 23 or 25 has at least 40%, 50%, 60%, 70%, 80%, 85%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98% or 99% identity with the RNA encoded by the polynucleotide shown in the sequence SEQ ID NO: 24, 22, 23 or 25.
- the fragment, variant, or variant of the RNA encoded by the polynucleotide shown in the sequence SEQ ID NO: 24, 22, 23 or 25 has at least 1, 2, 3, 4, 5, 6, 7, 8, 9, 10 or more nucleotides inserted, added, deleted or substituted compared to the RNA encoded by the polynucleotide shown in the sequence SEQ ID NO: 24, 22, 23 or 25.
- the second nucleotide sequence comprises at least one of the following polynucleotides: a): at least two copies of a 3'-UTR derived from at least one of the genes MPND, FBXW10, FBXW12, PGLYRP1, HPX, CDK7, APOC2, PFN1, RBP4, FTCD, NAAA, ALB, GSDMD, FBXL8, ORM1, CASP4, CHMP2A, LENG1, MYCBPAP, APOC1, GAPDH, HSPA8, APOA2, UCHL1, TSG101, NAE1, and NFKB2; b): at least two copies of a fragment of the 3'-UTR of at least one of the genes in a); c): at least two copies of a variant of the 3'-UTR of at least one of the genes in a); and d): at least two copies of a fragment of the 3'-UTR of at least one of the genes in a).
- the second nucleotide sequence comprises at least two copies of a nucleotide sequence derived from the 3'-UTR of at least one of the genes MPND, FBXW10, FBXW12 and PGLYRP1.
- the at least two copies are two copies, three copies, four copies, five copies, six copies, seven copies, eight copies or nine copies.
- the second nucleotide sequence comprises at least one of the following polynucleotides: at least two copies of an RNA encoded by a polynucleotide having a sequence as shown in one of SEQ ID NOs: 22 to 48, at least two copies of a fragment of an RNA encoded by a polynucleotide having a sequence as shown in one of SEQ ID NOs: 22 to 48, at least two copies of a variant of an RNA encoded by a polynucleotide having a sequence as shown in one of SEQ ID NOs: 22 to 48, and at least two copies of a variant of a fragment of an RNA encoded by a polynucleotide having a sequence as shown in one of SEQ ID NOs: 22 to 48.
- the at least two copies are two copies, three copies, four copies, five copies, six copies, seven copies, eight copies or nine copies.
- the fragment, variant, or variant of the RNA encoded by the polynucleotide of one of SEQ ID NOs: 22 to 48 has at least 40%, 50%, 60%, 70%, 80%, 85%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98% or 99% identity with the RNA encoded by the polynucleotide of one of SEQ ID NOs: 22 to 48.
- the fragment, variant, or variant of the RNA encoded by the polynucleotide of one of SEQ ID NOs: 22 to 48 has at least 1, 2, 3, 4, 5, 6, 7, 8, 9, 10 or more nucleotides inserted, added, deleted or substituted compared to the RNA encoded by the polynucleotide of one of SEQ ID NOs: 22 to 48.
- the second nucleotide sequence comprises at least one of the following polynucleotides: at least two copies of an RNA encoded by a polynucleotide having a sequence as shown in one of SEQ ID NOs: 24, 22, 23 and 25, at least two copies of a fragment of an RNA encoded by a polynucleotide having a sequence as shown in one of SEQ ID NOs: 24, 22, 23 and 25, at least two copies of a variant of an RNA encoded by a polynucleotide having a sequence as shown in one of SEQ ID NOs: 24, 22, 23 and 25, and at least two copies of a variant of a fragment of an RNA encoded by a polynucleotide having a sequence as shown in one of SEQ ID NOs: 24, 22, 23 and 25.
- the at least two copies are two copies, three copies, four copies, five copies, six copies, seven copies, eight copies or nine copies.
- the fragment, variant, or variant of the RNA encoded by the polynucleotide whose sequence is shown as one of SEQ ID NO: 24, 22, 23 and 25 has at least 40%, 50%, 60%, 70%, 80%, 85%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98% or 99% identity with the RNA encoded by the polynucleotide whose sequence is shown as SEQ ID NO: 24, 22, 23 or 25.
- the fragment, variant, or variant of the RNA encoded by the polynucleotide whose sequence is shown as one of SEQ ID NO: 24, 22, 23 and 25 has at least 1, 2, 3, 4, 5, 6, 7, 8, 9, 10 or more nucleotides inserted, added, deleted or substituted compared to the RNA encoded by the polynucleotide whose sequence is shown as SEQ ID NO: 24, 22, 23 or 25.
- the recombinant RNA molecule is an mRNA molecule. In some embodiments, the recombinant RNA molecule further comprises at least one of a promoter, a 5'-cap structure, a 5'-UTR and a poly (A) tail.
- the recombinant RNA molecule further comprises at least one of a 5'-cap structure, a 5'-UTR and a poly(A) tail.
- the 5'-cap structure includes but is not limited to at least one of m7GpppG , m27,3' - OGpppG, m7Gppp (5')N1 or m7Gppp ( m2'-O )N1.
- m7G represents 7-methylguanosine cap nucleoside
- ppp represents a triphosphate bond between the 5' carbon of the cap nucleoside and the first nucleotide of the primary RNA transcript
- N1 is the 5'most nucleotide
- G represents guanosine nucleoside
- m7 represents a methyl group at the 7-position of guanine
- m2' -O represents a methyl group at the 2'-O position of the nucleotide.
- the 5'-UTR comprises: i) a 5'-UTR derived from at least one of the genes PPIA, HPX, FTCD, CDK5RAP3, HSPA8, HBA1, HBB, MYSM1, LENG1, TMSB4X, CASP4, IFNA1, PGLYRP1, UCHL1, CPAMD8, TTR, APOA2, GH1, DTYMK, APOC2 and CDK7, a fragment, a variant or a variant of a fragment thereof; preferably, at least one of an RNA encoded by a polynucleotide having a sequence as shown in at least one of SEQ ID NOs: 1 to 21, a fragment of an RNA encoded by a polynucleotide having a sequence as shown in at least one of SEQ ID NOs: 1 to 21, a variant of an RNA encoded by a polynucleotide having a sequence as shown in at least one of SEQ ID NOs: 1 to 21, and a
- RNA encoded by the polynucleotides of at least one of SEQ ID NOs: 1-21 variants of RNA encoded by the polynucleotides of at least one of SEQ ID NOs: 1-21, fragments of RNA encoded by the polynucleotides of at least one of SEQ ID NOs: 1-21, and variants of fragments of RNA encoded by the polynucleotides of at least one of SEQ ID NOs: 1-21, which have at least 40%, 50%, 60%, 70%, 80%, 85%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98% or 99% homology with the RNA encoded by the polynucleotides of at least one of SEQ ID NOs: 1-21; ii) at least two copies of at least one polynucleotide of i); or iii) at least two polynucleotides of i).
- the at least two copies are two copies, three copies,
- the nucleotides constituting the poly(A) tail comprise at least 20, at least 40, at least 80, at least 100 or at least 120 A nucleotides; preferably, the nucleotides constituting the poly(A) tail comprise at least 20, at least 40, at least 80, at least 100 or at least 120 A nucleotides in succession.
- the poly(A) tail comprises one or more nucleotides other than A nucleotides.
- the poly(A) tail comprises two or more consecutive nucleotides other than A nucleotides, wherein the first and last nucleotides in the sequence having two or more consecutive nucleotides are nucleotides other than A nucleotides.
- the poly(A) tail is a truncated polyA, i.e., m consecutive A nucleotides and n consecutive A nucleotides are reconnected by a linker sequence consisting of p non-A nucleotides, wherein m, n and p are positive integers.
- m is 30, n is 70, and p is 10.
- the DNA sequence corresponding to the poly(A) tail is shown as SEQ ID NO:53.
- the recombinant RNA molecule is no more than 50000nt.
- the recombinant RNA molecule is no more than 40000nt, 30000nt, 20000nt, 10000nt, 9000nt, 8000nt or 6000nt.
- the recombinant RNA molecule is 500nt to 50000nt.
- the recombinant RNA molecule is 1000nt to 40000nt, 1000nt to 30000nt, 1500nt to 10000nt or 1500nt to 8000nt.
- the recombinant RNA molecule comprises a first nucleotide sequence encoding a polypeptide and/or protein of interest, a second nucleotide sequence comprising a 5'-UTR, and a third nucleotide sequence comprising a 3'-UTR, wherein the second nucleotide sequence comprises a 5'-UTR derived from at least one of genes PPIA, HPX, FTCD, CDK5RAP3, HSPA8, HBA1, HBB, MYSM1, LENG1, TMSB4X, CASP4, IFNA1, PGLYRP1, UCHL1, CPAMD8, TTR, APOA2, GH1, DTYMK, APOC2, and CDK7, a fragment thereof, a variant thereof, and at least one of a variant of a fragment thereof, wherein the third nucleotide sequence comprises a 5'-UTR derived from at least one of genes PPIA, HPX, FTCD, CDK5RAP3,
- the nucleotide sequence comprises at least one of a 3'-UTR derived from at least one of the genes MPND, FBXW10, FBXW12, PGLYRP1, HPX, CDK7, APOC2, PFN1, RBP4, FTCD, NAAA, ALB, GSDMD, FBXL8, ORM1, CASP4, CHMP2A, LENG1, MYCBPAP, APOC1, GAPDH, HSPA8, APOA2, UCHL1, TSG101, NAE1, NFKB2 and GH1, a fragment, a variant or a variant of a fragment thereof, and wherein the first nucleotide sequence does not naturally occur in the same RNA molecule with at least one of the second nucleotide sequence and the third nucleotide sequence.
- a combination of a 5'-UTR, fragment, variant, or fragment variant derived from at least one of the above genes and a 3'-UTR, fragment, variant, or fragment variant derived from at least one of the above genes can further improve the translation efficiency and/or stability of the mRNA molecule, and improve the expression level of the polypeptide and/or protein of interest, for example, by 1, 2, 3, 4, 5, 6, 7, 8, 9, or 10 times, etc.
- A, a fragment, a variant or a variant of a fragment is an abbreviation of A, a fragment of A, a variant of A or a variant of a fragment of A.
- 5'-UTR derived from at least one of genes PPIA, HPX and FTCD, a fragment, a variant or a variant of a fragment refers to 5'-UTR derived from at least one of genes PPIA, HPX and FTCD, a fragment derived from 5'-UTR of at least one of genes PPIA, HPX and FTCD, a variant derived from 5'-UTR of at least one of genes PPIA, HPX and FTCD, or a variant of a fragment derived from 5'-UTR of at least one of genes PPIA, HPX and FTCD.
- At least one of A, a fragment, a variant and a variant of a fragment is A, a fragment of A, a variant of A and a variant of a fragment of A.
- at least one of the 5'-UTR, fragments, variants and variants of fragments derived from at least one of the genes PPIA, HPX and FTCD means at least one of the 5'-UTR, fragments, variants and variants of fragments derived from at least one of the genes PPIA, HPX and FTCD, variants of the 5'-UTR, and variants of fragments derived from at least one of the genes PPIA, HPX and FTCD.
- the gene is a human gene.
- the first nucleotide sequence encodes at least one polypeptide of interest. For example, one, two, three, four, five, six, seven, eight, nine or ten polypeptides of interest. In some embodiments, the first nucleotide sequence encodes at least one protein of interest. For example, one, two, three, four, five, six, seven, eight, nine or ten proteins of interest. In some embodiments, the first nucleotide sequence encodes at least one polypeptide of interest and at least one protein of interest. For example, one, two, three, four, five, six, seven, eight, nine or ten polypeptides of interest and one, two, three, four, five, six, seven, eight, nine or ten proteins of interest.
- the second nucleotide sequence comprises at least one of the following polynucleotides: an RNA encoded by a polynucleotide having a sequence as shown in at least one of SEQ ID NO: 1 to 21, a fragment of an RNA encoded by a polynucleotide having a sequence as shown in at least one of SEQ ID NO: 1 to 21, a variant of an RNA encoded by a polynucleotide having a sequence as shown in at least one of SEQ ID NO: 1 to 21, and a variant of a fragment of an RNA encoded by a polynucleotide having a sequence as shown in at least one of SEQ ID NO: 1 to 21.
- the second nucleotide sequence comprises one of the following polynucleotides: an RNA encoded by a polynucleotide having a sequence as shown in at least one of SEQ ID NO: 1 to 21, a fragment of an RNA encoded by a polynucleotide having a sequence as shown in at least one of SEQ ID NO: 1 to 21, a variant of an RNA encoded by a polynucleotide having a sequence as shown in at least one of SEQ ID NO: 1 to 21, and a variant of a fragment of an RNA encoded by a polynucleotide having a sequence as shown in at least one of SEQ ID NO: 1 to 21.
- the fragment, variant, or variant of the fragment of the RNA encoded by the polynucleotide of at least one of SEQ ID NOs: 1 to 21 has at least 40%, 50%, 60%, 70%, 80%, 85%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, or 99% identity with the RNA encoded by the polynucleotide of one of SEQ ID NOs: 1 to 21.
- the fragment, variant, or variant of the fragment of the RNA encoded by the polynucleotide of at least one of SEQ ID NOs: 1 to 21 has at least 1, 2, 3, 4, 5, 6, 7, 8, 9, 10 or more nucleotides inserted, added, deleted, or substituted compared to the RNA encoded by the polynucleotide of one of SEQ ID NOs: 1 to 21.
- the gene is selected from at least one of PPIA, HPX, FTCD, CDK5RAP3, HSPA8, HBA1 and HBB.
- the second nucleotide sequence comprises at least one of the following nucleotides: 1): RNA encoded by the polynucleotide shown in the sequence SEQ ID NO: 9, 7, 18, 12, 8, 1 or 6; 2): a fragment of the RNA in 1); 3): a variant of the RNA in 1); and 4): a variant of the fragment in 2).
- the second nucleotide sequence comprises one of the following nucleotides: 1): RNA encoded by the polynucleotide shown in the sequence SEQ ID NO: 9, 7, 18, 12, 8, 1 or 6; 2): a fragment of the RNA in 1); 3): a variant of the RNA in 1); and 4): a variant of the fragment in 2).
- the fragment, variant, or variant of the RNA encoded by the polynucleotide of the sequence SEQ ID NO: 9, 7, 18, 12, 8, 1 or 6 has at least 40%, 50%, 60%, 70%, 80%, 85%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98% or 99% identity with the RNA encoded by the polynucleotide of the sequence SEQ ID NO: 9, 7, 18, 12, 8, 1 or 6.
- the fragment, variant, or variant of the RNA encoded by the polynucleotide of the sequence SEQ ID NO: 9, 7, 18, 12, 8, 1 or 6 has at least 1, 2, 3, 4, 5, 6, 7, 8, 9, 10 or more nucleotides inserted, added, deleted or substituted compared to the RNA encoded by the polynucleotide of the sequence SEQ ID NO: 9, 7, 18, 12, 8, 1 or 6.
- the 5'-UTR in the second nucleotide sequence in the recombinant RNA molecule of the present invention may contain two or more sequences.
- the 5'-UTR of the gene, the fragment of the 5'-UTR of the gene, the variant of the 5'-UTR of the gene, or the variant of the fragment of the 5'-UTR of the gene is linked to the 5'-UTR of the gene.
- the second nucleotide sequence comprises at least one of the following polynucleotides: (a): 5'-UTRs derived from at least two of the genes PPIA, HPX, FTCD, CDK5RAP3, HSPA8, HBA1, HBB, MYSM1, LENG1, TMSB4X, CASP4, IFNA1, PGLYRP1, UCHL1, CPAMD8, TTR, APOA2, GH1, DTYMK, APOC2 and CDK7; (b): fragments of the 5'-UTRs of at least two of the genes in (a); (c): variants of the 5'-UTRs of at least two of the genes in (a); and (d): variants of the fragments in (b).
- the second nucleotide sequence comprises one of the following polynucleotides: (a): 5'-UTRs derived from at least two of the genes PPIA, HPX, FTCD, CDK5RAP3, HSPA8, HBA1, HBB, MYSM1, LENG1, TMSB4X, CASP4, IFNA1, PGLYRP1, UCHL1, CPAMD8, TTR, APOA2, GH1, DTYMK, APOC2 and CDK7; (b): fragments of the 5'-UTRs of at least two of the genes in (a); (c): variants of the 5'-UTRs of at least two of the genes in (a); and (d): variants of the fragments in (b).
- the second nucleotide sequence comprises at least one of the following polynucleotides: (e): a 5'-UTR derived from at least two of the genes PPIA, HPX, FTCD, CDK5RAP3, HSPA8, HBA1, and HBB; (f): a fragment of the 5'-UTR described in (e); (g): a variant of the 5'-UTR described in (e); and (h): a variant of the fragment described in (f).
- the second nucleotide sequence comprises one of the following polynucleotides: (e): a 5'-UTR derived from at least two of the genes PPIA, HPX, FTCD, CDK5RAP3, HSPA8, HBA1, and HBB; (f): a fragment of the 5'-UTR described in (e); (g): a variant of the 5'-UTR described in (e); and (h): a variant of the fragment described in (f).
- the second nucleotide sequence comprises at least one of the following polynucleotides: RNA encoded by a polynucleotide having a sequence as shown in at least two of SEQ ID NOs: 1 to 21, a fragment of an RNA encoded by a polynucleotide having a sequence as shown in at least two of SEQ ID NOs: 1 to 21, a variant of an RNA encoded by a polynucleotide having a sequence as shown in at least two of SEQ ID NOs: 1 to 21, and a variant of a fragment of an RNA encoded by a polynucleotide having a sequence as shown in at least two of SEQ ID NOs: 1 to 21.
- the second nucleotide sequence comprises RNA encoded by a polynucleotide having a sequence as shown in at least two of SEQ ID NOs: 1 to 21, a fragment of an RNA encoded by a polynucleotide having a sequence as shown in at least two of SEQ ID NOs: 1 to 21, a variant of an RNA encoded by a polynucleotide having a sequence as shown in at least two of SEQ ID NOs: 1 to 21, or a variant of a fragment of an RNA encoded by a polynucleotide having a sequence as shown in at least two of SEQ ID NOs: 1 to 21.
- the fragment, variant, or variant of the RNA encoded by the polynucleotide of the sequence as shown in one of SEQ ID NOs: 1 to 21 has at least 40%, 50%, 60%, 70%, 80%, 85%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, or 99% identity with the RNA encoded by the polynucleotide of the sequence as shown in one of SEQ ID NOs: 1 to 21.
- the fragment, variant, or variant of the RNA encoded by the polynucleotide of the sequence as shown in at least two of SEQ ID NOs: 1 to 21 has at least 1, 2, 3, 4, 5, 6, 7, 8, 9, 10 or more nucleotides inserted, added, deleted, or substituted compared to the RNA encoded by the polynucleotide of the sequence as shown in one of SEQ ID NOs: 1 to 21.
- the second nucleotide sequence comprises at least one of the following polynucleotides: an RNA encoded by a polynucleotide having a sequence as shown in at least two of SEQ ID NOs: 9, 7, 18, 12, 8, 1 and 6, a fragment of an RNA encoded by a polynucleotide having a sequence as shown in at least two of SEQ ID NOs: 9, 7, 18, 12, 8, 1 and 6, a variant of an RNA encoded by a polynucleotide having a sequence as shown in at least two of SEQ ID NOs: 9, 7, 18, 12, 8, 1 and 6, and a variant of a fragment of an RNA encoded by a polynucleotide having a sequence as shown in at least two of SEQ ID NOs: 9, 7, 18, 12, 8, 1 and 6.
- the fragment, variant, or variant of the RNA encoded by the polynucleotide of the sequence as shown in SEQ ID NO: 9, 7, 18, 12, 8, 1 or 6 has at least 40%, 50%, 60%, 70%, 80%, 85%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98% or 99% identity with the RNA encoded by the polynucleotide of the sequence as shown in SEQ ID NO: 9, 7, 18, 12, 8, 1 or 6.
- the sequence as shown in SEQ ID NO: 9, 7, 18, 12, 8, 1 or 6 is at least 40%, 50%, 60%, 70%, 80%, 85%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98% or 99% identity.
- the fragment, variant, or variant of the fragment of the RNA encoded by the polynucleotide shown in 6 has at least 1, 2, 3, 4, 5, 6, 7, 8, 9, 10 or more nucleotides inserted, added, deleted or substituted compared to the RNA encoded by the polynucleotide shown in SEQ ID NO:9, 7, 18, 12, 8, 1 or 6.
- the second nucleotide sequence comprises at least one of the following polynucleotides: a): at least two copies of a 5'-UTR derived from at least one of the genes PPIA, HPX, FTCD, CDK5RAP3, HSPA8, HBA1, HBB, MYSM1, LENG1, TMSB4X, CASP4, IFNA1, PGLYRP1, UCHL1, CPAMD8, TTR, APOA2, GH1, DTYMK, APOC2, and CDK7; b): at least two copies of a fragment of the 5'-UTR of at least one of the genes in a); c): at least two copies of a variant of the 5'-UTR of at least one of the genes in a); and d): at least two copies of a variant of a fragment of the 5'-UTR of at least one of the genes in a).
- the second nucleotide sequence comprises at least two copies of a 5'-UTR derived from one of the genes PPIA, HPX, FTCD, CDK5RAP3, HSPA8, HBA1, HBB, MYSM1, LENG1, TMSB4X, CASP4, IFNA1, PGLYRP1, UCHL1, CPAMD8, TTR, APOA2, GH1, DTYMK, APOC2, and CDK7.
- the 5'-UTR sequence comprises at least two copies of a 5'-UTR derived from at least one of the genes PPIA, HPX, FTCD, CDK5RAP3, HSPA8, HBA1, and HBB, at least two copies of a fragment of a 5'-UTR derived from at least one of the genes PPIA, HPX, FTCD, CDK5RAP3, HSPA8, HBA1, and HBB, at least two copies of a variant of a 5'-UTR derived from at least one of the genes PPIA, HPX, FTCD, CDK5RAP3, HSPA8, HBA1, and HBB, or at least two copies of a variant of a fragment of a 5'-UTR derived from at least one of the genes PPIA, HPX, FTCD, CDK5RAP3, HSPA8, HBA1, and HBB.
- the 5'-UTR sequence comprises at least two copies of a 5'-UTR derived from one of the genes PPIA, HPX, FTCD, CDK5RAP3, HSPA8, HBA1 and HBB, at least two copies of a fragment of a 5'-UTR derived from one of the genes PPIA, HPX, FTCD, CDK5RAP3, HSPA8, HBA1 and HBB, at least two copies of a variant of a 5'-UTR derived from one of the genes PPIA, HPX, FTCD, CDK5RAP3, HSPA8, HBA1 and HBB, or at least two copies of a variant of a fragment of a 5'-UTR derived from one of the genes PPIA, HPX, FTCD, CDK5RAP3, HSPA8, HBA1 and HBB.
- the at least two copies are two copies, three copies, four copies, five copies, six copies, seven copies, eight copies or nine copies.
- the second nucleotide sequence comprises at least one of the following polynucleotides: at least two copies of RNA encoded by a polynucleotide having a sequence as shown in one of SEQ ID NOs: 1 to 21, at least two copies of a fragment of RNA encoded by a polynucleotide having a sequence as shown in one of SEQ ID NOs: 1 to 21, at least two copies of a variant of RNA encoded by a polynucleotide having a sequence as shown in one of SEQ ID NOs: 1 to 21, and at least two copies of a variant of a fragment of RNA encoded by a polynucleotide having a sequence as shown in one of SEQ ID NOs: 1 to 21.
- the at least two copies are two copies, three copies, four copies, five copies, six copies, seven copies, eight copies or nine copies.
- the fragment, variant, or variant of the RNA encoded by the polynucleotide whose sequence is shown in one of SEQ ID NO: 1 to 21 has at least 40%, 50%, 60%, 70%, 80%, 85%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98% or 99% identity with the RNA encoded by the polynucleotide whose sequence is shown in one of SEQ ID NO: 1 to 21.
- the fragment, variant, or variant of the RNA encoded by the polynucleotide whose sequence is shown in one of SEQ ID NO: 1 to 21 has at least 1, 2, 3, 4, 5, 6, 7, 8, 9, 10 or more nucleotides inserted, added, deleted or substituted compared to the RNA encoded by the polynucleotide whose sequence is shown in one of SEQ ID NO: 1 to 21.
- the second nucleotide sequence comprises at least one of the following polynucleotides: at least two copies of an RNA encoded by a polynucleotide having a sequence as shown in one of SEQ ID NOs: 9, 7, 18, 12, 8, 1, and 6, two copies of a fragment of an RNA encoded by a polynucleotide having a sequence as shown in one of SEQ ID NOs: 9, 7, 18, 12, 8, 1, and 6, two copies of a variant of an RNA encoded by a polynucleotide having a sequence as shown in one of SEQ ID NOs: 9, 7, 18, 12, 8, 1, and 6, and two copies of a sequence as shown in one of SEQ ID NOs: 9, 7, 18, 12, 8, 1, and 6.
- the at least two copies are two copies, three copies, four copies, five copies, six copies, seven copies, eight copies or nine copies.
- the fragment, variant, or variant of the fragment of the RNA encoded by the polynucleotide of the sequence as shown in SEQ ID NO: 9, 7, 18, 12, 8, 1 and 6 has at least 40%, 50%, 60%, 70%, 80%, 85%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98% or 99% identity with the RNA encoded by the polynucleotide of the sequence as shown in SEQ ID NO: 9, 7, 18, 12, 8, 1 or 6.
- a fragment, variant, or variant of an RNA encoded by a polynucleotide having a sequence as shown in one of SEQ ID NOs: 9, 7, 18, 12, 8, 1, and 6 has an insertion, addition, deletion, or substitution of at least 1, 2, 3, 4, 5, 6, 7, 8, 9, 10 or more nucleotides compared to the RNA encoded by the polynucleotide having a sequence as shown in SEQ ID NOs: 9, 7, 18, 12, 8, 1, or 6.
- the third nucleotide sequence comprises: at least one of an RNA encoded by a polynucleotide having a sequence as shown in at least one of SEQ ID NO: 22 to 49, a fragment of an RNA encoded by a polynucleotide having a sequence as shown in at least one of SEQ ID NO: 22 to 49, a variant of an RNA encoded by a polynucleotide having a sequence as shown in at least one of SEQ ID NO: 22 to 49, and a variant of a fragment of an RNA encoded by a polynucleotide having a sequence as shown in at least one of SEQ ID NO: 22 to 49.
- the third nucleotide sequence comprises: at least one of an RNA encoded by a polynucleotide having a sequence as shown in at least one of SEQ ID NO: 22 to 48, a fragment of an RNA encoded by a polynucleotide having a sequence as shown in at least one of SEQ ID NO: 22 to 48, a variant of an RNA encoded by a polynucleotide having a sequence as shown in at least one of SEQ ID NO: 22 to 48, and a variant of a fragment of an RNA encoded by a polynucleotide having a sequence as shown in at least one of SEQ ID NO: 22 to 48.
- a fragment, variant, or variant of an RNA encoded by a polynucleotide having a sequence as shown in one of SEQ ID NOs: 22 to 49 has at least 40%, 50%, 60%, 70%, 80%, 85%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, or 99% identity with an RNA encoded by a polynucleotide having a sequence as shown in one of SEQ ID NOs: 22 to 49.
- a fragment, variant, or variant of an RNA encoded by a polynucleotide having a sequence as shown in one of SEQ ID NOs: 22 to 49 has at least 1, 2, 3, 4, 5, 6, 7, 8, 9, 10 or more nucleotides inserted, added, deleted, or substituted compared to an RNA encoded by a polynucleotide having a sequence as shown in one of SEQ ID NOs: 22 to 49.
- the gene is selected from at least one of MPND, FBXW10, FBXW12 and PGLYRP1. In some embodiments, the gene is selected from one of MPND, FBXW10, FBXW12 and PGLYRP1.
- the third nucleotide sequence comprises at least one of the following polynucleotides: RNA encoded by a polynucleotide with a sequence as shown in SEQ ID NO: 24, 22, 23 or 25, a fragment of RNA encoded by a polynucleotide with a sequence as shown in SEQ ID NO: 24, 22, 23 or 25, a variant of RNA encoded by a polynucleotide with a sequence as shown in SEQ ID NO: 24, 22, 23 or 25, and a variant of a fragment of RNA encoded by a polynucleotide with a sequence as shown in SEQ ID NO: 24, 22, 23 or 25.
- the fragment, variant, or variant of the RNA encoded by the polynucleotide of sequence as shown in SEQ ID NO: 24, 22, 23 or 25 has at least 40%, 50%, 60%, 70%, 80%, 85%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98% or 99% identity with the RNA encoded by the polynucleotide of sequence as shown in SEQ ID NO: 24, 22, 23 or 25.
- the fragment, variant, or variant of the RNA encoded by the polynucleotide of sequence as shown in SEQ ID NO: 24, 22, 23 or 25 has at least 1, 2, 3, 4, 5, 6, 7, 8, 9, 10 or more nucleotides inserted, added, deleted or substituted compared to the RNA encoded by the polynucleotide of sequence as shown in SEQ ID NO: 24, 22, 23 or 25.
- the 3'-UTR in the third nucleotide sequence of the recombinant molecule of the present invention may comprise two or more tandem 3'-UTRs from the above-mentioned gene, fragments of the 3'-UTR of the above-mentioned gene, variants of the 3'-UTR of the above-mentioned gene, or variants of the 3'-UTR fragments of the above-mentioned gene.
- the third nucleotide sequence comprises at least one of the following polynucleotides: (a): derived from the genes MPND, FBXW10, FBXW12, PGLYRP1, HPX, CDK7, APOC2, PFN1, RBP4, FTCD, NAAA, ALB, GSDMD, FBXL8, ORM1, CASP4, CHMP2A, LENG1, MYCBPAP, APOC1, GAPDH, HSPA8, APOA2, UCHL1, TSG101, NAE1, NFKB2, and GH1.
- polynucleotides comprises at least one of the following polynucleotides: (a): derived from the genes MPND, FBXW10, FBXW12, PGLYRP1, HPX, CDK7, APOC2, PFN1, RBP4, FTCD, NAAA, ALB, GSDMD, FBXL8, ORM1, CASP4, CHMP2A, LENG1, MYCBPAP, APOC
- the third nucleotide sequence comprises at least one of the following polynucleotides: (e): 3'-UTRs derived from at least two genes of the genes MPND, FBXW10, FBXW12, and PGLYRP1; (f): a fragment of the 3'-UTR of at least two genes of the genes described in (e); (g): a variant of the 3'-UTR of at least two genes of the genes described in (e); and (h): a variant of the fragment of the 3'-UTR of at least two genes of the genes described in (f).
- the third nucleotide sequence comprises at least one of the following polynucleotides: RNA encoded by a polynucleotide having a sequence as shown in at least two of SEQ ID NO: 22 to 49, a fragment of an RNA encoded by a polynucleotide having a sequence as shown in at least two of SEQ ID NO: 22 to 49, a variant of an RNA encoded by a polynucleotide having a sequence as shown in at least two of SEQ ID NO: 22 to 49, and a variant of a fragment of an RNA encoded by a polynucleotide having a sequence as shown in at least two of SEQ ID NO: 22 to 49.
- the third nucleotide sequence comprises at least one of the following polynucleotides: RNA encoded by a polynucleotide having a sequence as shown in at least two of SEQ ID NO: 22 to 48, a fragment of an RNA encoded by a polynucleotide having a sequence as shown in at least two of SEQ ID NO: 22 to 48, a variant of an RNA encoded by a polynucleotide having a sequence as shown in at least two of SEQ ID NO: 22 to 48, and a variant of a fragment of an RNA encoded by a polynucleotide having a sequence as shown in at least two of SEQ ID NO: 22 to 48.
- the fragment, variant, or variant of the RNA encoded by the polynucleotide whose sequence is shown in one of SEQ ID NOs: 22 to 49 has at least 40%, 50%, 60%, 70%, 80%, 85%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, or 99% identity with the RNA encoded by the polynucleotide whose sequence is shown in one of SEQ ID NOs: 22 to 49.
- the fragment, variant, or variant of the RNA encoded by the polynucleotide whose sequence is shown in one of SEQ ID NOs: 22 to 49 has at least 1, 2, 3, 4, 5, 6, 7, 8, 9, 10 or more nucleotides inserted, added, deleted, or substituted compared to the RNA encoded by the polynucleotide whose sequence is shown in one of SEQ ID NOs: 22 to 49.
- the third nucleotide sequence contains at least one of the following polynucleotides: RNA encoded by at least two of the polynucleotides shown in sequences SEQ ID NO: 24, 22, 23 and 25, fragments of RNA encoded by at least two of the polynucleotides shown in sequences SEQ ID NO: 24, 22, 23 and 25, variants of RNA encoded by at least two of the polynucleotides shown in sequences SEQ ID NO: 24, 22, 23 and 25, and variants of fragments of RNA encoded by at least two of the polynucleotides shown in sequences SEQ ID NO: 24, 22, 23 and 25.
- the fragment, variant, or variant of the RNA encoded by the polynucleotide shown in the sequence SEQ ID NO: 24, 22, 23 or 25 has at least 40%, 50%, 60%, 70%, 80%, 85%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98% or 99% identity with the RNA encoded by the polynucleotide shown in the sequence SEQ ID NO: 24, 22, 23 or 25.
- the fragment, variant, or variant of the RNA encoded by the polynucleotide shown in the sequence SEQ ID NO: 24, 22, 23 or 25 has at least 1, 2, 3, 4, 5, 6, 7, 8, 9, 10 or more nucleotides inserted, added, deleted or substituted compared to the RNA encoded by the polynucleotide shown in the sequence SEQ ID NO: 24, 22, 23 or 25.
- the third nucleotide sequence comprises at least one of the following polynucleotides: a): at least two copies of a 3'-UTR derived from at least one of the genes MPND, FBXW10, FBXW12, PGLYRP1, HPX, CDK7, APOC2, PFN1, RBP4, FTCD, NAAA, ALB, GSDMD, FBXL8, ORM1, CASP4, CHMP2A, LENG1, MYCBPAP, APOC1, GAPDH, HSPA8, APOA2, UCHL1, TSG101, NAE1, NFKB2, and GH1; b): at least two copies of a fragment of the 3'-UTR of at least one of the genes in a); c): at least two copies of a variant of the 3'-UTR of at least one of the genes in a); and d): at least two copies of a variant of a fragment of the 3'-UTR of at least one of the genes
- the second nucleotide sequence comprises at least two copies of the 3'-UTR of at least one of the genes MPND, FBXW10, FBXW12 and PGLYRP1.
- the at least two copies are two copies, three copies, four copies, five copies, six copies, seven copies, eight copies or nine copies.
- the third nucleotide sequence comprises at least one of the following polynucleotides: at least two copies of the RNA encoded by the polynucleotide having a sequence as shown in one of SEQ ID NOs: 22 to 49, At least two copies of a fragment of RNA encoded by a polynucleotide having a sequence as shown in one of SEQ ID NOs: 22 to 49, at least two copies of a variant of RNA encoded by a polynucleotide having a sequence as shown in one of SEQ ID NOs: 22 to 49, and at least two copies of a variant of a fragment of RNA encoded by a polynucleotide having a sequence as shown in one of SEQ ID NOs: 22 to 49.
- the third nucleotide sequence comprises at least one of the following polynucleotides: at least two copies of RNA encoded by a polynucleotide having a sequence as shown in one of SEQ ID NOs: 22 to 49, at least two copies of a fragment of RNA encoded by a polynucleotide having a sequence as shown in one of SEQ ID NOs: 22 to 49, at least two copies of a variant of RNA encoded by a polynucleotide having a sequence as shown in one of SEQ ID NOs: 22 to 49, and at least two copies of a variant of a fragment of RNA encoded by a polynucleotide having a sequence as shown in one of SEQ ID NOs: 22 to 49.
- the at least two copies are two copies, three copies, four copies, five copies, six copies, seven copies, eight copies or nine copies.
- the fragment, variant, or variant of the RNA encoded by the polynucleotide of one of SEQ ID NOs: 22 to 49 has at least 40%, 50%, 60%, 70%, 80%, 85%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98% or 99% identity with the RNA encoded by the polynucleotide of one of SEQ ID NOs: 22 to 49.
- the fragment, variant, or variant of the RNA encoded by the polynucleotide of one of SEQ ID NOs: 22 to 49 has at least 1, 2, 3, 4, 5, 6, 7, 8, 9, 10 or more nucleotides inserted, added, deleted or substituted compared to the RNA encoded by the polynucleotide of one of SEQ ID NOs: 22 to 49.
- the third nucleotide sequence comprises at least one of the following polynucleotides: at least two copies of an RNA encoded by a polynucleotide having a sequence as shown in one of SEQ ID NO: 24, 22, 23 and 25, at least two copies of a fragment of an RNA encoded by a polynucleotide having a sequence as shown in one of SEQ ID NO: 24, 22, 23 and 25, at least two copies of a variant of an RNA encoded by a polynucleotide having a sequence as shown in one of SEQ ID NO: 24, 22, 23 and 25, and at least two copies of a variant of a fragment of an RNA encoded by a polynucleotide having a sequence as shown in one of SEQ ID NO: 24, 22, 23 and 25.
- the at least two copies are two copies, three copies, four copies, five copies, six copies, seven copies, eight copies or nine copies.
- the fragment, variant, or variant of the RNA encoded by the polynucleotide whose sequence is shown as one of SEQ ID NO: 24, 22, 23 and 25 has at least 40%, 50%, 60%, 70%, 80%, 85%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98% or 99% identity with the RNA encoded by the polynucleotide whose sequence is shown as SEQ ID NO: 24, 22, 23 or 25.
- the fragment, variant, or variant of the RNA encoded by the polynucleotide whose sequence is shown as one of SEQ ID NO: 24, 22, 23 and 25 has at least 1, 2, 3, 4, 5, 6, 7, 8, 9, 10 or more nucleotides inserted, added, deleted or substituted compared to the RNA encoded by the polynucleotide whose sequence is shown as SEQ ID NO: 24, 22, 23 or 25.
- nucleotide sequence of the 5’-UTR is as shown in SEQ ID NO:55.
- the recombinant RNA molecule comprises a first nucleotide sequence encoding a polypeptide and/or protein of interest, a 3’-UTR of any of the above embodiments, and a 5’-UTR with a nucleotide sequence as shown in SEQ ID NO:55.
- the recombinant RNA molecule comprises a first nucleotide sequence encoding a polypeptide and/or protein of interest, a 3’-UTR with a sequence as shown in one of SEQ ID NOs: 22 to 48, and a 5’-UTR with a nucleotide sequence as shown in SEQ ID NO: 55.
- the recombinant RNA molecule comprises a first nucleotide sequence encoding a polypeptide and/or protein of interest, a 3’-UTR derived from at least one of genes FBXW10, FBXW12, MPND and PGLYRP1, and a 5’-UTR having a nucleotide sequence as shown in SEQ ID NO:55.
- the recombinant RNA molecule comprises a first nucleotide sequence encoding a polypeptide and/or protein of interest, a 3’-UTR with a sequence as shown in one of SEQ ID NOs: 22 to 25, and a 5’-UTR with a nucleotide sequence as shown in SEQ ID NO: 55.
- the 3’-UTR is a 3’-UTR derived from the gene COP1 (CARD only protein).
- nucleotide sequence of the 3’-UTR is as shown in SEQ ID NO:56.
- the recombinant RNA molecule comprises a first nucleotide sequence encoding a polypeptide and/or protein of interest, a 5'-UTR according to any of the above embodiments, and a 3'-UTR having a nucleotide sequence as shown in SEQ ID NO:56.
- the recombinant RNA molecule comprises a first nucleotide sequence encoding a polypeptide and/or protein of interest, a 5’-UTR as shown in one of SEQ ID NOs: 1 to 21, and a 3’-UTR of the gene COP1 (CARD only protein).
- the recombinant RNA molecule comprises a first nucleotide sequence encoding a polypeptide and/or protein of interest, a 5’-UTR with a sequence as shown in one of SEQ ID NOs: 1 to 21, and a 3’-UTR with a nucleotide sequence as shown in SEQ ID NO: 56.
- the recombinant RNA molecule comprises a first nucleotide sequence encoding a polypeptide and/or protein of interest, a 5'-UTR derived from at least one of the genes PPIA, HPX, FTCD, CDK5RAP3, HSPA8, HBA1, and HBB, and a 3'-UTR derived from the gene COP1.
- the recombinant RNA molecule comprises a first nucleotide sequence encoding a polypeptide and/or protein of interest, a 5’-UTR with a sequence as shown in one of SEQ ID NOs: 1, 6 to 9, 12 and 18, and a 3’-UTR with a nucleotide sequence as shown in SEQ ID NO: 56.
- the recombinant RNA molecule is an mRNA molecule. In some embodiments, the recombinant RNA molecule further comprises at least one of a 5'-cap structure and a poly (A) tail.
- the 5'-cap structure includes but is not limited to at least one of m7GpppG , m27,3' - OGpppG, m7Gppp (5')N1 or m7Gppp ( m2'-O )N1.
- m7G represents 7-methylguanosine cap nucleoside
- ppp represents a triphosphate bond between the 5' carbon of the cap nucleoside and the first nucleotide of the primary RNA transcript
- N1 is the 5'most nucleotide
- G represents guanosine nucleoside
- m7 represents a methyl group at the 7-position of guanine
- m2' -O represents a methyl group at the 2'-O position of the nucleotide.
- the nucleotides constituting the poly(A) tail comprise at least 20, at least 40, at least 80, at least 100 or at least 120 A nucleotides.
- the nucleotides constituting the poly(A) tail comprise at least 20, at least 40, at least 80, at least 100 or at least 120 A nucleotides in succession.
- the poly(A) tail comprises one or more nucleotides other than A nucleotides.
- the poly(A) tail comprises two or more consecutive nucleotides other than A nucleotides, wherein the first and last nucleotides in the sequence having two or more consecutive nucleotides are nucleotides other than A nucleotides.
- the poly(A) tail is truncated, i.e., m consecutive A nucleotides and n consecutive A nucleotides are reconnected by a linker sequence consisting of p non-A nucleotides, wherein m, n and p are positive integers.
- m is 30, n is 70, and p is 10.
- the DNA sequence corresponding to the poly (A) tail is shown as SEQ ID NO:53.
- the recombinant RNA molecule is no more than 50000nt.
- the recombinant RNA molecule is no more than 40000nt, 30000nt, 20000nt, 10000nt, 9000nt, 8000nt or 6000nt.
- the recombinant RNA molecule is 500nt to 50000nt.
- the recombinant RNA molecule is 1000nt to 40000nt, 1000nt to 30000nt, 1500nt to 10000nt or 1500nt to 8000nt.
- the recombinant RNA molecule of any of the above embodiments comprises modified nucleosides.
- the recombinant DNA molecule comprises at least one of modified uridine, modified cytidine, modified adenosine, and modified guanosine.
- the modified nucleoside is a modified uridine.
- 0.1% to 100% of the uridine in the recombinant RNA molecule is modified.
- 80% to 100% of the uridine is modified.
- 100% of the uridine is modified.
- Exemplary modified uridines include pseudouridine ( ⁇ ), N1-methyl pseudouridine, pyridin-4-one ribonucleoside, 5-aza-uridine, 6-aza-uridine, 2-thio-5-aza-uridine, 2-thio-uridine (s2U), 4-thio-uridine (s4U), 4-thio-pseudouridine, 2-thio-pseudouridine, 5-hydroxy-uridine (ho5U), 5-aminoallyl-uridine, 5-halo-uridine (e.g., 5-iodo-uridine or 5-bromo-uridine), 3-methyl-uridine (m3U), 5-methoxy-uridine (mo5U), uridine-5-oxyacetic acid (cmo5U), uridine-5-oxyacetic acid methyl ester (mcmo5U), 5-carboxymethyl-uridine (cm5U), 1-carboxymethyl-pseudouridine, 5-carboxyhydroxymethyl-uridine (ch
- the modified nucleoside is a modified cytidine.
- 0.1% to 100% of the cytidines in the recombinant RNA molecule are modified.
- Preferably, 80% to 100% of the cytidines are modified.
- 100% of the cytidines are modified.
- Exemplary modified cytidines include 5-aza-cytidine, 6-aza-cytidine, pseudoisocytidine, 3-methyl-cytidine (m3C), N4-acetyl-cytidine (ac4C), 5-formyl-cytidine (f5C), N4-methyl-cytidine (m4C), 5-methyl-cytidine (m5C), 5-halo-cytidine (e.g., 5-iodo-cytidine), 5-hydroxymethyl-cytidine (hm5C), 1-methyl-pseudoisocytidine, pyrrolo-cytidine, pyrrolo-pseudoisocytidine, 2-thio-cytidine (s2C), 2-thio-5-methyl-cytidine, 4-thio-pseudoisocytidine, 4-thio-1-methyl-pseudoisocytidine, 4-thio-1-methyl-1-d
- 5-Methoxy-zebularine 5-methyl-zebularine, 5-aza-2-thio-zebularine, 2-thio-zebularine, 2-methoxy-cytidine, 2-methoxy-5-methyl-cytidine, 4-methoxy-pseudoisocytidine, 4-methoxy-1-methyl-pseudoisocytidine, lysidine (k2C), ⁇ -thio-cytidine, 2'-O-methyl-cytidine (Cm), 5,2'-O-dimethyl Cm), N4,2'-O-trimethyl-cytidine (m42Cm), 1-thio-cytidine, 2'-F-ara-cytidine, 2'-F-cytidine and 2'-OH-ara-cytidine.
- the modified nucleoside is a modified adenosine.
- 0.1% to 100% of the adenosine in the recombinant RNA molecule is modified.
- 80% to 100% of the adenosine is modified.
- 100% of the adenosine is modified.
- Exemplary modified adenosines include 2-amino-purine, 2,6-diaminopurine, 2-amino-6-halo-purine (e.g., 2-amino-6-chloro-purine), 6-halo-purine (e.g., 6-chloro-purine), 2-amino-6-methyl-purine, 8-azido-adenosine, 7-deaza-adenine, 7-deaza-8-aza-adenine, 7-deaza-2-amino-purine, 7-deaza-8-aza-2-amino-purine, 7-deaza-2,6-diaminopurine, 7-deaza-8-aza-2,6-diaminopurine, 1-methyl-adenosine (m1A), 2-methyl-adenine (m2A), N6 -methyl-adenosine (m6A), 2-methylthio-N6-methyl-adenosine (ms2m6A), N6-
- the modified nucleoside is a modified guanosine.
- 0.1% to 100% of the guanosine in the recombinant RNA molecule is modified.
- 80% to 100% of the guanosine is modified.
- 100% of the guanosine is modified.
- Exemplary modified guanosines include inosine (I), 1-methyl-inosine (m1I), wyosine (imG), methyl wyosine (mimG), 4-demethyl-wyosine (imG-14), iso-wyosine (imG2), yW, peroxyyW (o2yW), hydroxyyW (OHyW), undermodified hydroxyyW (OHyW*), 7-deaza-guanosine, queuosine (Q), epoxy-queuosine (o Q), galactosyl-braided glycoside (galQ), mannosyl-braided glycoside (manQ), 7-cyano-7-deaza-guanosine (preQ0), 7-aminomethyl-7-deaza-guanosine (preQ1), archaeosine (G+), 7-deaza-8-aza-guanosine, 6-thio-guanosine, 6-thio-7-deaza-guanosine
- the polypeptide and/or protein of interest refers to a therapeutically or pharmaceutically active polypeptide or protein having a therapeutic or preventive effect, whose function in or near a cell is necessary or beneficial, for example, such a protein, whose deficiency or defective form causes a disease, providing such a protein can regulate or prevent the disease, or such a protein, which is beneficial to the body in or near a cell.
- the polypeptide or protein may comprise a complete protein or a functional variant thereof.
- the nucleotide sequence encoding the polypeptide and/or protein of interest or the expressed peptide and/or protein comprises or is one or more of the following: (a) an antigen; (b) a therapeutic protein or polypeptide, a fragment, fragment or variant thereof; and (c) other polypeptides or proteins.
- the peptide and/or protein expressed by the nucleotide sequence encoding the polypeptide and/or protein of interest comprises or is an antigen.
- the antigen expressed by the nucleotide sequence encoding the polypeptide and/or protein of interest is derived from one or more of the following: (1) pathogenic antigens, fragments, variants or variants of fragments thereof, (2) tumor antigens, fragments, variants or variants of fragments thereof, (3) allergic antigens, fragments, variants or variants of fragments thereof, (4) autoimmune self-antigens, fragments, variants or variants of fragments thereof.
- pathogenic antigens are derived from pathogenic organisms that can cause an immune response in a subject (e.g., a mammalian subject, further e.g., a human).
- pathogenic organisms include or are one or more of the following: bacteria, viruses, fungi, and protozoa (e.g., unicellular organisms, multicellular organisms).
- the pathogenic antigen comprises or is a surface antigen, a fragment, a variant, or a variant of a fragment thereof, such as a protein located on the surface of a virus, a bacterium, or a protozoan, a fragment thereof (e.g., an external portion of a surface antigen), a variant thereof or variations of the fragment.
- the pathogenic antigen comprises or is derived from a polypeptide or protein from a pathogen associated with an infectious disease.
- the pathogenic antigen is selected from but not limited to the group consisting of the antigens derived from pathogens recorded on pages 21 to 35 of WO2018/078053A1, the antigens derived from pathogens recorded on page 57, paragraph 3 to page 63, paragraph 2 of WO2019/077001A1, the antigens derived from pathogens recorded on page 32, line 26 to page 34, line 27 of WO2013/120628A1, and the antigens recorded on page 34, line 29 to page 59, line 5 of WO2013/120628A1.
- the pathogen of the pathogenic antigen is selected from but not limited to one or more of the following: scabies, Babesia, Leishmania, Gnatostoma, Ancylostoma braziliensis, Ancylostoma duodenale, Strongyloides stercoralis, Trichuris trichuris, Toxocara canis, Toxocara cati, Toxoplasma gondii, Trypanosoma brucei, Trypanosoma cruzi, Brugia malayi, Onchocerca volvulus, Bancrofti, Tapeworm, Taenia solium, Echinococcus, Ascaris lumbricoides, Dinucleate Amoeba fragilis, Naegleria fowleri (Naegleria fowleri), Necator americanus, Paragonimus (e.g., Paragonimus westermani), Clonorchis sinensis, Plasmodium
- the pathogenic antigen includes or is one or more of the following:
- SARS coronavirus 2 SARS-CoV-2
- SARS-CoV-2019 coronavirus or SARS coronavirus SARS coronavirus
- spike protein S
- envelope protein E
- membrane protein M
- nucleocapsid protein N
- MERS coronavirus spike protein (S), spike S1 fragment (S1), envelope protein (E), membrane protein (M) or nucleocapsid protein (N)
- H human papillomavirus
- HPV16 replication protein E1, regulatory protein E2, protein E3, protein E4, protein E5, protein E6, protein E7, protein E8, major capsid protein L1 and minor capsid protein L2; (4) one or more of the following proteins of human parainfluenza virus (HPIV/PIV) (e.g.
- hPIV-1, hPIV-2, hPIV-3 or hPIV-4 serotypes fusion protein (F), hemagglutinin neuraminidase , glycoprotein (G), matrix protein (M), phosphoprotein (P), nucleocapsid protein, fusion glycoprotein F0, F1 or F2, recombinant PIV3/PIV1 fusion glycoprotein, C protein, D protein, viral replicase (L) and non-structural V protein; (5) one or more of the following proteins of human metapneumovirus (hMPV): fusion (F) glycoprotein, glycoprotein (G), phosphoprotein (P), and nucleocapsid protein; (6) one or more of the following proteins of influenza virus: hemagglutinin (HA), neuraminidase (NA), nucleoprotein (NP), M1 protein, M2 protein, NS1 protein, NS2 protein (NEP protein: nuclear export protein), PA protein, PB1 protein (polymerase
- the tumor antigen is selected from but not limited to the group consisting of the tumor antigens described in WO2018/078053A1, pages 47-51.
- the antigens expressed by the nucleotide sequences encoding the polypeptides and/or proteins of interest include or are allergic antigens and autoimmune self-antigens.
- allergic antigens and autoimmune self-antigens are derived from or selected from, but not limited to, the antigen groups described on pages 59 to 73 of WO2018/078053A1.
- the antigens expressed by the nucleotide sequences encoding the polypeptides and/or proteins of interest are listed on pages 48 to 51 of WO2018/078053A1.
- the polypeptide and/or protein expressed by the nucleotide sequence encoding the polypeptide and/or protein of interest comprises or is a therapeutic protein or polypeptide.
- the therapeutic protein or polypeptide includes or is one or more of the following:
- Enzyme replacement therapy for the treatment of metabolic, endocrine or amino acid disorders or therapeutic proteins or polypeptides for replacing missing, defective or mutated proteins (2) Therapeutic proteins or polypeptides for the treatment of blood diseases, circulatory system diseases, respiratory system diseases, infectious diseases or immune deficiencies; (3) Therapeutic proteins or polypeptides for the treatment of cancer or tumor diseases; (4) Therapeutic proteins or polypeptides for hormone replacement therapy; (5) Therapeutic proteins or polypeptides for reprogramming somatic cells into pluripotent stem cells or totipotent stem cells; (6) Therapeutic proteins or polypeptides used as adjuvants or immunostimulants; (7) Therapeutic proteins or polypeptides as therapeutic antibodies; (8) Therapeutic proteins or polypeptides as gene editing agents; (9) Therapeutic proteins or polypeptides for the treatment or prevention of liver diseases selected from the group consisting of liver fibrosis, cirrhosis and liver cancer; and (10) Therapeutic proteins or polypeptides for the treatment or prevention of rare diseases.
- enzyme replacement therapy for the treatment of metabolic, endocrine or amino acid disorders or therapeutic proteins or polypeptides for replacing missing, deleted or mutated proteins include or are one or more of the following: acidic sphingomyelin Lipase, fatty acid, aglycosidase beta, leucosidase, ⁇ -galactosidase A, ⁇ -glucosidase, ⁇ -L-iduronidase, ⁇ -N-acetylglucosaminidase, amphiregulin, angiopoietin (Ang1, Ang2, Ang3, Ang4, ANGPTL2, ANGPTL3, ANGPTL4, ANGPTL5, ANGPTL6, ANGPTL7), ATPase, Cu(2+)-transporting ⁇ polypeptide (ATP7B), argininosuccinate synthetase (ASS1 ), beta-cell factor, beta-glucuronidase, bone morphogen
- the therapeutic protein or polypeptide for treating metabolic or endocrine diseases is selected from the proteins or polypeptides described in Table A (in combination with Table C) of WO2017/191274.
- the therapeutic protein or polypeptide for treating a blood disorder, a circulatory system disease, a respiratory system disease, a cancer or tumor disease, an infectious disease, or an immune deficiency comprises or is one or more of the following: alteplase (tissue plasminogen activator; tPA), anistreplase, antithrombin III (AT-III), bivalirudin, darbepoetin- ⁇ , drotrecogin- ⁇ (activated protein C), erythropoietin, epoetin alfa- ⁇ , erythropoietin, erthropoyetin, factor IX, factor VIIa, factor VIII, recombinant hirudin, protein C concentrate, reteplase (tP A deletion mutant protein), streptokinase, tenecteplase, urokinase, angiostatin, anti-CD22 immunotoxin, denileukin, immunocyanine,
- the therapeutic protein or polypeptide for treating cancer or tumor disease includes or is one or more of the following: cytokines, chemokines, suicide gene products, immunogenic proteins or peptides, apoptosis inducers, angiogenesis inhibitors, heat shock proteins, tumor antigens, ⁇ -catenin inhibitors, STING pathway activators, checkpoint regulators, innate immune activators, antibodies, dominant negative receptors and decoy receptors, myeloid-derived suppressor cells (MDSCs) inhibitors, IDO pathway inhibitors, and proteins or peptides that bind to apoptosis inhibitors;
- cytokines cytokines
- chemokines suicide gene products
- immunogenic proteins or peptides include apoptosis inducers, angiogenesis inhibitors, heat shock proteins, tumor antigens, ⁇ -catenin inhibitors, STING pathway activators, checkpoint regulators, innate immune activators, antibodies, dominant negative receptors and decoy receptors, myeloid-derived suppressor cells
- the hormones in the therapeutic protein or polypeptide for hormone replacement therapy include one or more of the following: estrogen, progesterone, progesterone, and testosterone.
- therapeutic proteins for reprogramming somatic cells into pluripotent or totipotent stem cells include one or more of the following: Oct-3/4, Sox gene family (e.g., Sox1, Sox2, Sox3, and Sox15), Klf family (e.g., Klf1, Klf2, Klf4, and Klf5), Myc family (e.g., c-Myc, L-Myc, and N-Myc), Nanog, and LIN28.
- Sox gene family e.g., Sox1, Sox2, Sox3, and Sox15
- Klf family e.g., Klf1, Klf2, Klf4, and Klf5
- Myc family e.g., c-Myc, L-Myc, and N-Myc
- Nanog LIN28.
- the therapeutic protein or polypeptide used as an adjuvant or immunostimulatory protein includes or is one or more of the following: human adjuvant proteins, in particular pattern recognition receptors TLR1, TLR2, TLR3, TLR4, TLR5, TLR6, TLR7, TLR8, TLR9, TLR10, TLR11; NOD1, NOD2, NOD3, NOD4, NOD5, NALP1, NALP2, NALP3, NALP4, NALP5, NALP6, NALP6, NALP7, NALP7, NALP8, NALP9, NALP10, NALP11, NALP12, NALP13, NALP14J IPAF, NAIP, CIITA, RIG-I, MDA5 and LGP2, TLR signaling signal transducers (including adaptor proteins (such as Trif and Cardif), components of small GTPases signals (such as RhoA, Ras, Rac1, Cdc42, Rab, etc.), components of PIP signals (such as PI3K, Src kinase, etc.), components of Myers (such as
- NF-kB NF-kB
- c-Fos c-Jun
- c-Myc CREB
- AP-1 Elk-1
- ATF2 IRF-3
- IRF-7 heat shock proteins
- HSP10 HSP60, HSP65, HSP70, HSP75 and HSP90
- gp96 fibrinogen, type III repeat extra domain of fibronectin, etc.
- components of the complement system e.g.
- the human auxiliary protein includes one or more of the following: trif, flt-3 ligand, Gp96 or fibronectin, cytokines that induce or enhance innate immune responses (e.g., IL-1 ⁇ , IL-1R1, IL1 ⁇ , IL-2, IL-6, IL-7, IL-8, IL-9, IL-12, IL-13, IL-15, IL-16, IL-17, IL-18, IL-21, IL-23, TNF ⁇ , IFN ⁇ , IFN ⁇ , IFN ⁇ , GM-CSF, G-CSF, M-CSF), chemokines (e.g., IL-8, IP-10, MCP-1, MIP-1 ⁇ , RANTES, Eotaxin, CCL21), cytokines released by macrophages (e.g., IL-1, IL-6, IL-8, IL-12, TNF- ⁇ , etc.).
- therapeutic proteins or polypeptides used as adjuvants or immunostimulators include one or more of the following: bacterial (adjuvant) proteins, protozoan (adjuvant) proteins, viral (adjuvant) proteins, fungal (adjuvant) proteins, and animal-derived proteins.
- the bacterial (adjuvant) protein includes one or more of the following: bacterial heat shock proteins or chaperones (including Hsp60, Hsp70, Hsp90, Hsp100); Gram-negative bacterial OmpA (outer membrane protein); OspA; bacterial porins (e.g., OmpF); bacterial toxins (e.g., pertussis toxin (PT) of Bordetella pertussis, pertussis toxin (PT) of Bordetella pertussis); Cough adenylate cyclase toxin CyaA and CyaC, pertussis toxin PT-9K/129G mutant, Bordetella pertussis adenylate cyclase toxin CyaA and CyaC, tetanus toxin, cholera toxin (CT), cholera toxin B subunit, cholera toxin CTK63 mutant, CTE112K mutant of CT
- LTK63, LTR72 phenol-soluble regulatory protein
- HP-NAP Helicobacter pylori neutrophil activating protein
- surfactant protein D Borrelia burgdorferi outer surface protein A lipoprotein, Ag38 (38kDa antigen) of Mycobacterium tuberculosis
- bacterial pilin proteins e.g. pilin of gram-negative bacterial pilin
- surfactant protein A and bacterial flagellar proteins e.g. pilin of gram-negative bacterial pilin
- the protozoan (adjuvant) proteins include one or more of the following: Tc52 from Trypanosoma cruzi, PFTG from Trypanosoma gondii, protozoan heat shock proteins, LeIF from Leishmania, and Spectrum-like proteins from Toxoplasma gondii.
- the viral (adjuvant) proteins include one or more of the following: respiratory syncytial virus fusion glycoprotein (F protein), MMT virus envelope protein, mouse leukemia virus protein, and wild-type measles virus hemagglutinin protein.
- F protein respiratory syncytial virus fusion glycoprotein
- MMT virus envelope protein MMT virus envelope protein
- mouse leukemia virus protein MMT virus envelope protein
- wild-type measles virus hemagglutinin protein wild-type measles virus hemagglutinin protein.
- the fungal (adjuvant) protein comprises a fungal immunomodulatory protein (FIP, eg, LZ-8).
- FIP fungal immunomodulatory protein
- the animal-derived protein includes keyhole limpet hemocyanin (KLH).
- KLH keyhole limpet hemocyanin
- the polypeptide and/or protein expressed by the nucleotide sequence encoding the polypeptide and/or protein of interest comprises or is a therapeutic protein or polypeptide as a therapeutic antibody, such as one or more of the cytokines, chemokines, suicide enzymes and gene products, apoptosis inducers, endogenous angiogenesis inhibitors, heat shock proteins, tumor antigens, innate immune activators, and antibodies against proteins associated with tumor or cancer development described in Table 1, Table 2, Table 3, Table 4, Table 5, Table 6, Table 7, Table 8, Table 9, Table 10, Table 11 and Table 12 of WO2016/170176A1.
- V DNA molecules or vectors encoding the recombinant RNA molecules of the 5' and/or 3'-UTR of the present invention
- the present invention provides a DNA molecule encoding the 5'-UTR, 3'-UTR and/or recombinant RNA molecule of the present invention.
- the present invention also provides a vector comprising the recombinant RNA molecule or DNA molecule of the present invention and a host cell comprising the recombinant RNA molecule, DNA molecule and/or vector of the present invention.
- the present invention also provides a vector comprising a first nucleotide sequence encoding a 5'-UTR and/or a second nucleotide sequence encoding a 3'-UTR, wherein: the first nucleotide sequence comprises at least one of the following polynucleotides: (a): a polynucleotide encoding a 5'-UTR derived from at least one of the genes PPIA, HPX, FTCD, CDK5RAP3, HSPA8, HBA1, HBB, MYSM1, LENG1, TMSB4X, CASP4, IFNA1, PGLYRP1, UCHL1, CPAMD8, TTR, APOA2, GH1, DTYMK, APOC2 and CDK7; (b): a polynucleotide encoding a fragment of the 5'-UTR described in (a); (c): a polynucleotide encoding a variant of the 5'-UTR described in (a
- the second nucleotide sequence comprises at least one of the following polynucleotides: (e): a polynucleotide encoding a 3'-UTR derived from at least one of the genes MPND, FBXW10, FBXW12, PGLYRP1, HPX, CDK7, APOC2, PFN1, RBP4, FTCD, NAAA, ALB, GSDMD, FBXL8, ORM1, CASP4, CHMP2A, LENG1, MYCBPAP, APOC1, GAPDH, HSPA8, APOA2, UCHL1, TSG101, NAE1, NFKB2; (f): a polynucleotide encoding a fragment of the 3'-UTR in (e); (g): a polynucleotide encoding a variant of the 3'-UTR in (e); and (h): a polynucleotide encoding a variant of the fragment in (f).
- the gene is a human gene.
- the first nucleotide sequence comprises at least one of the following polynucleotides: a polynucleotide with a sequence as shown in at least one of SEQ ID NOs: 1 to 21, a fragment of an RNA encoded by a polynucleotide with a sequence as shown in at least one of SEQ ID NOs: 1 to 21, a variant of a polynucleotide with a sequence as shown in at least one of SEQ ID NOs: 1 to 21, and a variant of a fragment of a polynucleotide with a sequence as shown in at least one of SEQ ID NOs: 1 to 21.
- the fragment, variant, or variant of the fragment of the polynucleotide with a sequence as shown in at least one of SEQ ID NOs: 1 to 21 has at least 40%, 50%, or 60% similarity with the polynucleotide with a sequence as shown in one of SEQ ID NOs: 1 to 21.
- a fragment, variant, or variant of a fragment of a polynucleotide having a sequence as shown in at least one of SEQ ID NOs: 1 to 21 has at least 1, 2, 3, 4, 5, 6, 7, 8, 9, 10 or more nucleotides inserted, added, deleted or substituted compared to a polynucleotide having a sequence as shown in one of SEQ ID NOs: 1 to 21.
- the gene is selected from at least one of PPIA, HPX, FTCD, CDK5RAP3, HSPA8, HBA1 and HBB.
- the first nucleotide sequence comprises at least one of the following polynucleotides: 1): a polynucleotide shown in sequence SEQ ID NO: 9, 7, 18, 12, 8, 1 or 6; 2): a fragment of the polynucleotide shown in 1); 3): a variant of the polynucleotide shown in 1); and 4): a variant of the fragment shown in 2).
- the fragment, variant, or variant of the polynucleotide shown in sequence SEQ ID NO: 9, 7, 18, 12, 8, 1 or 6 has at least 40%, 50%, 60%, 70%, 80%, 85%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98% or 99% identity with the polynucleotide shown in sequence SEQ ID NO: 9, 7, 18, 12, 8, 1 or 6.
- a fragment, variant, or variant of a fragment of a polynucleotide shown in sequence SEQ ID NO: 9, 7, 18, 12, 8, 1 or 6 has at least 1, 2, 3, 4, 5, 6, 7, 8, 9, 10 or more nucleotides inserted, added, deleted or substituted compared to a polynucleotide shown in sequence SEQ ID NO: 9, 7, 18, 12, 8, 1 or 6.
- the first nucleotide sequence may comprise two or more tandemly linked polynucleotides encoding the 5'-UTR from the above-mentioned gene, polynucleotides encoding fragments of the 5'-UTR from the above-mentioned gene, polynucleotides encoding variants of the 5'-UTR from the above-mentioned gene, and polynucleotides encoding variants of fragments of the 5'-UTR from the above-mentioned gene.
- the first nucleotide sequence comprises at least one of the following polynucleotides: (a): a polynucleotide encoding a 5'-UTR derived from at least two genes of the genes PPIA, HPX, FTCD, CDK5RAP3, HSPA8, HBA1, HBB, MYSM1, LENG1, TMSB4X, CASP4, IFNA1, PGLYRP1, UCHL1, CPAMD8, TTR, APOA2, GH1, DTYMK, APOC2 and CDK7; (b): a polynucleotide encoding a fragment of the 5'-UTR of at least two of the genes described in (a); (c): a polynucleotide encoding a variant of the 5'-UTR of at least two of the genes described in (a); and (d): a polynucleotide encoding a variant of the fragment of the 5'-UTR of at least two of
- the first nucleotide sequence comprises at least one of the following polynucleotides: (e): a polynucleotide encoding a 5'-UTR derived from at least two genes of genes PPIA, HPX, FTCD, CDK5RAP3, HSPA8, HBA1 and HBB; (f): a polynucleotide encoding a fragment of the 5'-UTR described in (e); (g): a polynucleotide encoding a variant of the 5'-UTR described in (e); and (h): a polynucleotide encoding a variant of the fragment described in (f).
- polynucleotides comprises at least one of the following polynucleotides: (e): a polynucleotide encoding a 5'-UTR derived from at least two genes of genes PPIA, HPX, FTCD, CDK5RAP3, HSPA8, HBA1 and HBB
- the first nucleotide sequence comprises at least one of the following polynucleotides: a polynucleotide having a sequence as shown in at least two of SEQ ID NOs: 1 to 21, a fragment of a polynucleotide having a sequence as shown in at least two of SEQ ID NOs: 1 to 21, a variant of a polynucleotide having a sequence as shown in at least two of SEQ ID NOs: 1 to 21, and a variant of a fragment of a polynucleotide having a sequence as shown in at least two of SEQ ID NOs: 1 to 21.
- a fragment, a variant, or a variant of a fragment of a polynucleotide having a sequence as shown in one of SEQ ID NOs: 1 to 21 has at least 40%, 50%, 60%, 70%, 80%, 85%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, or 99% identity with a polynucleotide having a sequence as shown in one of SEQ ID NOs: 1 to 21.
- a fragment, variant, or variant of a fragment of a polynucleotide with a sequence as shown in one of SEQ ID NOs: 1 to 21 has at least 1, 2, 3, 4, 5, 6, 7, 8, 9, 10 or more nucleotides inserted, added, deleted or substituted compared to a polynucleotide with a sequence as shown in one of SEQ ID NOs: 1 to 21.
- the first nucleotide sequence comprises at least one of the following polynucleotides: a polynucleotide with a sequence as shown in at least two of SEQ ID NOs: 9, 7, 18, 12, 8, 1, and 6, a fragment of a polynucleotide with a sequence as shown in at least two of SEQ ID NOs: 9, 7, 18, 12, 8, 1, and 6, a variant of a polynucleotide with a sequence as shown in at least two of SEQ ID NOs: 9, 7, 18, 12, 8, 1, and 6, and a variant of a fragment of a polynucleotide with a sequence as shown in at least two of SEQ ID NOs: 9, 7, 18, 12, 8, 1, and 6.
- a fragment, variant, or variant of a fragment of a polynucleotide with a sequence as shown in SEQ ID NOs: 9, 7, 18, 12, 8, 1, or 6 has at least 40%, 50%, 60%, 70%, 80%, 90%, 100%, 110%, 120%, 130%, 140%, 150%, 160%, 170%, 180%, 190%, 200%, 210%, 220%, 230%, 240%, 250%, 260%, 270%, 280%, 290%, 300%, 310%, 320%, 330%, 340%, 350%, 360%, 370%, 380%, 390%, 400%, 410%, 420%, 430%, 440%, 450%, 460%, 470%, 480%, 490%, 500%, 510%, 520%, 530%, 540%, In some embodiments, the fragment, variant, or variant of a fragment of a polynucleotide as shown in SEQ ID NO: 9, 7, 18, 12, 8, 1, or 6 has at least 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, or more
- the first nucleotide sequence comprises at least one of the following polynucleotides: a): at least two copies of a polynucleotide encoding a 5'-UTR derived from at least one of the genes PPIA, HPX, FTCD, CDK5RAP3, HSPA8, HBA1, HBB, MYSM1, LENG1, TMSB4X, CASP4, IFNA1, PGLYRP1, UCHL1, CPAMD8, TTR, APOA2, GH1, DTYMK, APOC2, and CDK7; b): at least two copies of a fragment of a polynucleotide encoding a 5'-UTR of at least one of the genes in a); c): at least two copies of a polynucleotide encoding a variant of a 5'-UTR of at least one of the genes in a); and d): at least two copies of a polynucleotide en
- the first nucleotide sequence comprises at least one of the following polynucleotides: e): at least two copies of a polynucleotide encoding a 5'-UTR derived from at least one of the genes PPIA, HPX, FTCD, CDK5RAP3, HSPA8, HBA1 and HBB; f) at least two copies of a polynucleotide encoding a fragment of a 5'-UTR derived from at least one of the genes in e); g) at least two copies of a polynucleotide encoding a variant of a 5'-UTR derived from at least one of the genes in e); h) at least two copies of a polynucleotide encoding a variant of a fragment of a 5'-UTR derived from at least one of the genes in e).
- the at least two copies are two copies, three copies, four copies, five copies, six copies, seven copies
- the first nucleotide sequence comprises at least one of the following polynucleotides: at least two copies of a polynucleotide having a sequence as shown in at least one of SEQ ID NOs: 1 to 21, at least two copies of a fragment of a polynucleotide having a sequence as shown in at least one of SEQ ID NOs: 1 to 21, at least two copies of a variant of a polynucleotide having a sequence as shown in at least one of SEQ ID NOs: 1 to 21, and at least two copies of a variant of a fragment of a polynucleotide having a sequence as shown in at least one of SEQ ID NOs: 1 to 21.
- the at least two copies are two copies, three copies, four copies, five copies, six copies, seven copies, eight copies or nine copies.
- the fragment, variant, or variant of the fragment of the polynucleotide whose sequence is shown in one of SEQ ID NOs: 1 to 21 has at least 40%, 50%, 60%, 70%, 80%, 85%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, or 99% identity with the polynucleotide whose sequence is shown in one of SEQ ID NOs: 1 to 21.
- the fragment, variant, or variant of the fragment of the polynucleotide whose sequence is shown in one of SEQ ID NOs: 1 to 21 has at least 1, 2, 3, 4, 5, 6, 7, 8, 9, 10 or more nucleotides inserted, added, deleted, or substituted compared to the polynucleotide whose sequence is shown in one of SEQ ID NOs: 1 to 21.
- the first nucleotide sequence comprises at least one of the following polynucleotides: at least two copies of a polynucleotide having a sequence as shown in one of SEQ ID NO: 9, 7, 18, 12, 8, 1 and 6, at least two copies of a fragment of a polynucleotide having a sequence as shown in one of SEQ ID NO: 9, 7, 18, 12, 8, 1 and 6, at least two copies of a variant of a polynucleotide having a sequence as shown in one of SEQ ID NO: 9, 7, 18, 12, 8, 1 and 6, and at least two copies of a variant of a fragment of a polynucleotide having a sequence as shown in one of SEQ ID NO: 9, 7, 18, 12, 8, 1 and 6.
- the at least two copies are two copies, three copies, four copies, five copies, six copies, seven copies, eight copies or nine copies.
- the fragment, variant, or variant of the fragment of a polynucleotide with a sequence as shown in one of SEQ ID NOs: 9, 7, 18, 12, 8, 1, and 6 has at least 40%, 50%, 60%, 70%, 80%, 85%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, or 99% identity with the polynucleotide with a sequence as shown in SEQ ID NOs: 9, 7, 18, 12, 8, 1, or 6.
- the homolog or variant has at least 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, or more nucleotides inserted, added, deleted, or substituted compared to SEQ ID NOs: 9, 7, 18, 12, 8, 1, or 6.
- the second nucleotide sequence comprises at least one of the following polynucleotides: A polynucleotide as set forth in at least one of SEQ ID NOs: 22 to 48, a fragment of a polynucleotide as set forth in at least one of SEQ ID NOs: 22 to 48, a variant of a polynucleotide as set forth in at least one of SEQ ID NOs: 22 to 48, and a variant of a fragment of a polynucleotide as set forth in at least one of SEQ ID NOs: 22 to 48.
- the fragment, variant, or variant of a fragment of a polynucleotide as set forth in at least one of SEQ ID NOs: 22 to 48 has at least 40%, 50%, 60%, 70%, 80%, 85%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, or 99% identity to a polynucleotide as set forth in one of SEQ ID NOs: 22 to 48.
- a fragment, variant, or variant of a fragment of a polynucleotide with a sequence as shown in at least one of SEQ ID NOs: 22 to 48 has an insertion, addition, deletion, or substitution of at least 1, 2, 3, 4, 5, 6, 7, 8, 9, 10 or more nucleotides compared to a polynucleotide with a sequence as shown in one of SEQ ID NOs: 22 to 48.
- the gene is selected from at least one of MPND, FBXW10, FBXW12 and PGLYRP1.
- the second nucleotide sequence comprises at least one of the following polynucleotides: a polynucleotide with a sequence as shown in SEQ ID NO: 24, 22, 23 or 25, a fragment of a polynucleotide with a sequence as shown in SEQ ID NO: 24, 22, 23 or 25, a variant of a polynucleotide with a sequence as shown in SEQ ID NO: 24, 22, 23 or 25, and a variant of a fragment of a polynucleotide with a sequence as shown in SEQ ID NO: 24, 22, 23 or 25.
- the fragment, variant, or variant of the fragment of the polynucleotide shown in SEQ ID NO: 24, 22, 23, or 25 has at least 40%, 50%, 60%, 70%, 80%, 85%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, or 99% identity with the polynucleotide shown in SEQ ID NO: 24, 22, 23, or 25.
- the fragment, variant, or variant of the fragment of the polynucleotide shown in SEQ ID NO: 24, 22, 23, or 25 has at least 1, 2, 3, 4, 5, 6, 7, 8, 9, 10 or more nucleotides inserted, added, deleted, or substituted compared to the polynucleotide shown in SEQ ID NO: 24, 22, 23, or 25.
- the second nucleotide sequence may comprise two or more tandemly linked polynucleotides encoding the 3'-UTR from the above gene, polynucleotides encoding variants of the 3'-UTR from the above gene, or polynucleotides encoding variants of fragments of the 3'-UTR from the above gene.
- the second nucleotide sequence comprises at least one of the following polynucleotides: (a): a polynucleotide encoding a 3'-UTR derived from at least two of the genes MPND, FBXW10, FBXW12, PGLYRP1, HPX, CDK7, APOC2, PFN1, RBP4, FTCD, NAAA, ALB, GSDMD, FBXL8, ORM1, CASP4, CHMP2A, LENG1, MYCBPAP, APOC1, GAPDH, HSPA8, APOA2, UCHL1, TSG101, NAE1 and NFKB2; (b): a polynucleotide encoding a fragment of the 3'-UTR of at least two of the genes described in (a); (c): a polynucleotide encoding a variant of the 3'-UTR of at least two of the genes described in (a); and (d): a polynucleotides
- the second nucleotide sequence comprises at least one of the following polynucleotides: (e): a polynucleotide encoding a 3’-UTR derived from at least two genes of the genes MPND, FBXW10, FBXW12, and PGLYRP1; (f): a polynucleotide encoding a fragment of the 3’-UTR described in (e); (g): a polynucleotide encoding a variant of the 3’-UTR described in (e); and (h): a polynucleotide encoding a variant of the fragment described in (f).
- the second nucleotide sequence comprises at least one of the following polynucleotides: a polynucleotide having a sequence as shown in at least two of SEQ ID NOs: 22 to 48, a fragment of a polynucleotide having a sequence as shown in at least two of SEQ ID NOs: 22 to 48, a variant of a polynucleotide having a sequence as shown in at least two of SEQ ID NOs: 22 to 48, and a variant of a fragment of a polynucleotide having a sequence as shown in at least two of SEQ ID NOs: 22 to 48.
- a fragment, a variant, or a variant of a fragment of a polynucleotide having a sequence as shown in one of SEQ ID NOs: 22 to 48 has at least 40%, 50%, 60%, 70%, 80%, 85%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, or 99% identity with a polynucleotide having a sequence as shown in one of SEQ ID NOs: 22 to 48.
- the fragment, variant or variant of the fragment of the polynucleotide with a sequence as shown in one of SEQ ID NO: 22 to 48 has at least 1, 2, 3, 4, 5, 6, 7, 8, 9, 10 or more nucleotides inserted, added, deleted or substituted compared to the polynucleotide with a sequence as shown in one of SEQ ID NO: 22 to 48.
- the second nucleotide sequence comprises at least one of the following polynucleotides: a polynucleotide with a sequence as shown in at least two of SEQ ID NOs: 24, 22, 23, and 25, a polynucleotide with a sequence as shown in SEQ ID NOs: 24, In some embodiments, the fragments, variants, or variants of the fragments of the polynucleotides as shown in at least two of SEQ ID NOs: 24, 22, 23, and 25, the variants of the polynucleotides as shown in at least two of SEQ ID NOs: 24, 22, 23, and 25, and the variants of the fragments of the polynucleotides as shown in at least two of SEQ ID NOs: 24, 22, 23, and 25.
- the fragments, variants, or variants of the fragments of the polynucleotides as shown in SEQ ID NOs: 24, 22, 23, or 25 have at least 40%, 50%, 60%, 70%, 80%, 85%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, or 99% identity with the polynucleotides as shown in SEQ ID NOs: 24, 22, 23, or 25.
- the fragment, variant, or variant of the fragment of the polynucleotide shown in SEQ ID NO: 24, 22, 23 or 25 has at least 1, 2, 3, 4, 5, 6, 7, 8, 9, 10 or more nucleotides inserted, added, deleted or substituted compared to the polynucleotide shown in SEQ ID NO: 24, 22, 23 or 25.
- the second nucleotide sequence comprises at least one of the following polynucleotides: a): at least two copies of a polynucleotide encoding a 3'-UTR derived from at least one of the genes MPND, FBXW10, FBXW12, PGLYRP1, HPX, CDK7, APOC2, PFN1, RBP4, FTCD, NAAA, ALB, GSDMD, FBXL8, ORM1, CASP4, CHMP2A, LENG1, MYCBPAP, APOC1, GAPDH, HSPA8, APOA2, UCHL1, TSG101, NAE1, and NFKB2; b): at least two copies of a polynucleotide encoding a fragment of the 3'-UTR of at least one of the genes in a); c): at least two copies of a polynucleotide encoding a variant of the 3'-UTR of at least one of the genes in
- the second nucleotide sequence comprises at least one of the following polynucleotides: e): at least 2 copies of a polynucleotide encoding a 3'-UTR derived from one of the genes MPND, FBXW10, FBXW12 and PGLYRP1; f): at least two copies of a polynucleotide encoding a fragment of the 3'-UTR of at least one of the genes in e); g): at least two copies of a polynucleotide encoding a variant of the 3'-UTR of at least one of the genes in e); and h): at least two copies of a polynucleotide encoding a variant of the fragment of the 3'-UTR of at least one of the genes in e).
- the at least two copies are two copies, three copies, four copies, five copies, six copies, seven copies, eight copies or nine copies.
- the second nucleotide sequence comprises at least one of the following polynucleotides: at least two copies of a polynucleotide having a sequence as shown in one of SEQ ID NOs: 22 to 48, at least two copies of a fragment of a polynucleotide having a sequence as shown in one of SEQ ID NOs: 22 to 48, at least two copies of a variant of a polynucleotide having a sequence as shown in one of SEQ ID NOs: 22 to 48, and at least two copies of a variant of a fragment of a polynucleotide having a sequence as shown in one of SEQ ID NOs: 22 to 48.
- the at least two copies are two copies, three copies, four copies, five copies, six copies, seven copies, eight copies or nine copies.
- the fragment, variant, or variant of the fragment of a polynucleotide having a sequence as shown in one of SEQ ID NOs: 22 to 48 has at least 40%, 50%, 60%, 70%, 80%, 85%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, or 99% identity with a polynucleotide having a sequence as shown in one of SEQ ID NOs: 22 to 48.
- the fragment, variant, or variant of the fragment of a polynucleotide having a sequence as shown in one of SEQ ID NOs: 22 to 48 has at least 1, 2, 3, 4, 5, 6, 7, 8, 9, 10 or more nucleotides inserted, added, deleted, or substituted compared to a polynucleotide having a sequence as shown in one of SEQ ID NOs: 22 to 48.
- the second nucleotide sequence comprises at least one of the following polynucleotides: at least two copies of a polynucleotide having a sequence as shown in one of SEQ ID NOs: 24, 22, 23 and 25, at least two copies of a fragment of a polynucleotide having a sequence as shown in one of SEQ ID NOs: 24, 22, 23 and 25, at least two copies of a variant of a polynucleotide having a sequence as shown in one of SEQ ID NOs: 24, 22, 23 and 25, and at least two copies of a variant of a fragment of a polynucleotide having a sequence as shown in one of SEQ ID NOs: 24, 22, 23 and 25.
- the at least two copies are two copies, three copies, four copies, five copies, six copies, seven copies, eight copies or nine copies.
- the polynucleotide sequence as shown in one of SEQ ID NOs: 24, 22, 23 and 25 has at least 40%, 50%, 60%, 70%, 80%, 85%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98% or 99% identity with the polynucleotide sequence as shown in SEQ ID NOs: 24, 22, 23 or 25.
- the sequence as shown in one of SEQ ID NOs: 24, 22, 23 and 25 The polynucleotide has at least 1, 2, 3, 4, 5, 6, 7, 8, 9, 10 or more nucleotides inserted, added, deleted or substituted compared to the polynucleotide whose sequence is shown in SEQ ID NO: 24, 22, 23 or 25.
- the vector is used to produce recombinant RNA molecules such as mRNA molecules.
- the vector comprises the first nucleotide sequence and the second nucleotide sequence.
- the vector also comprises the third nucleotide sequence encoding the polypeptide and/or protein of interest between the first nucleotide sequence and the second nucleotide sequence.
- the third nucleotide sequence encodes at least one polypeptide of interest. For example, one, two, three, four, five, six, seven, eight, nine or ten polypeptides of interest.
- the third nucleotide sequence encodes at least one protein of interest.
- the third nucleotide sequence encodes at least one polypeptide of interest and at least one protein of interest. For example, one, two, three, four, five, six, seven, eight, nine or ten polypeptides of interest and one, two, three, four, five, six, seven, eight, nine or ten proteins of interest.
- the nucleotides constituting the poly(A) tail comprise at least 20, at least 40, at least 80, at least 100 or at least 120 A nucleotides.
- the nucleotides constituting the poly(A) tail comprise at least 20, at least 40, at least 80, at least 100 or at least 120 A nucleotides in succession.
- the poly(A) tail comprises one or more nucleotides other than A nucleotides.
- the poly(A) tail comprises two or more consecutive nucleotides other than A nucleotides, wherein the first and last nucleotides in the sequence having two or more consecutive nucleotides are nucleotides other than A nucleotides.
- the poly(A) tail is truncated, i.e., m consecutive A nucleotides and n consecutive A nucleotides are reconnected by a linker sequence consisting of p non-A nucleotides, wherein m, n and p are positive integers.
- m is 30, n is 70, and p is 10.
- the DNA sequence corresponding to the poly (A) tail is shown as SEQ ID NO:53.
- the vector may also include expression control elements for correct expression of the vector in the host.
- control elements are known to those skilled in the art and may include promoters, splicing cassettes, translation initiation codons, translation and insertion sites for introducing inserts into the vector.
- the vector of the present invention may be, for example, a plasmid, a cosmid, a virus, a phage or another vector conventionally used in genetic engineering, and may contain additional genes, such as a marker gene that allows selection of the vector in a suitable host cell and under suitable conditions.
- RNA molecules and vectors of the present invention can be introduced into cells directly or via liposomes, viral vectors (eg, adenovirus, retrovirus), electroporation, ballistics (eg, gene gun) or other delivery systems.
- viral vectors eg, adenovirus, retrovirus
- electroporation e.g, electroporation
- ballistics e.g, gene gun
- the present invention also provides a method for preparing mRNA, comprising contacting the vector of the present invention with RNA polymerase.
- the method of the present invention further comprises the step of linearizing the plasmid. In some embodiments, the supercoil rate of the plasmid is at least about 90% prior to linearization. In some embodiments, the method of the present invention further comprises the step of purifying the linearized plasmid. In some embodiments, the method of the present invention further comprises the step of purifying mRNA.
- the method of the present invention further comprises the steps of capping and optionally purifying the capped product.
- the cap is a Cap1 cap.
- the capping reaction is as follows: pppN1(p)Nx-OH(3') ⁇ ppN1(pN)x-OH(3')+Pi ppN1(pN)x-OH(3')+GTP ⁇ G(5')ppp(5')N1(pN)x-OH(3')+PPi G(5')ppp(5')N1(pN)x-OH(3')+AdoMet ⁇ m7G(5')ppp(5')N1(pN)x-OH(3')+AdoHyc m7GpppN1(pN)x-OH(3')+AdoMet ⁇ m7Gppp[m2'-O]N1(pN)x-OH(3')+AdoHyc.
- the present invention also provides a host cell comprising the recombinant RNA molecule, DNA molecule and/or vector of the present invention, such as a bacterial cell.
- the vector of the present invention such as a plasmid, is stored and/or amplified in the host cell.
- Host cells of the present invention can be prepared by transforming competent host cells with the vector of the present invention.
- Competent host cells are cells with the ability to absorb free extracellular genetic material (such as DNA plasmids) independently of the sequence.
- DNA plasmids free extracellular genetic material
- Various bacterial cells known to those skilled in the art are naturally able to absorb exogenous DNA from the environment, and therefore can serve as bacterial host cells according to the present invention.
- competent bacterial host cells can be obtained from natural non-competent bacterial cells using, for example, electroporation or chemicals (such as calcium ion treatment and accompanied by high temperature exposure). After uptake, the DNA plasmid is preferably neither degraded nor integrated in the genomic information of the bacterial host cell.
- Bacterial host cells include Escherichia coli (E.coli) cells well known to those skilled in the art.
- Lipid nanoparticles containing the recombinant RNA molecules of the 5' and/or 3'-UTR of the present invention pharmaceutical compositions containing the recombinant RNA molecules of the 5' and/or 3'-UTR of the present invention, and treatment/prevention of diseases
- the present invention provides a lipid nanoparticle comprising the recombinant RNA molecule, DNA molecule or vector of the present invention.
- the lipid nanoparticles contain protonable cationic lipids.
- the lipid nanoparticles further contain one or more of helper lipids, structural lipids and PEG-lipids (polyethylene glycol-lipids). In some further embodiments, the lipid nanoparticles further contain the helper lipids, the structural lipids and the PEG-lipids.
- the auxiliary lipid is a phospholipid.
- the phospholipid is usually semi-synthetic, or it may be of natural origin or chemically modified.
- the phospholipid includes, but is not limited to, DSPC (distearylphosphatidylcholine), DOPE (dioleoylphosphatidylethanolamine), DOPC (dioleoylphosphatidylcholine), DOPS (dioleoylphosphatidylserine), DSPG (1,2-dioctadecanoyl-sn-glycerol-3-phospho-(1'-rac-glycerol)), DPPG (dipalmitoylphosphatidylglycerol), DPPC (dipalmitoylphosphatidylcholine), DGTS (1,2-dipalmitoyl-sn-glycerol-3-O-4'-(N,N,N-trimethyl)homoserine), lysophospho
- the structural lipid is a sterol substance, including but not limited to cholesterol, cholesterol ester, steroid hormones, steroid vitamins, bile acid, cholesterol, ergosterol, ⁇ -sitosterol and oxidized cholesterol derivatives.
- the structural lipid is at least one selected from cholesterol, cholesterol ester, steroid hormones, steroid vitamins and bile acid.
- the structural lipid is cholesterol, preferably high-purity cholesterol, especially injection-grade high-purity cholesterol, such as CHO-HP (produced by AVT).
- the term PEG-lipid is a conjugate of polyethylene glycol and a lipid structure.
- the PEG-lipid is selected from PEG-DMG and PEG-distearoylphosphatidylethanolamine (PEG-DSPE), preferably PEG-DMG.
- PEG-DSPE PEG-distearoylphosphatidylethanolamine
- the PEG-DMG is a polyethylene glycol (PEG) derivative of 1,2-dimyristyl glyceride.
- the average molecular weight of the PEG is about 2000 to 5000, preferably about 2000.
- the lipid nanoparticle further contains the helper lipid, the structural lipid and the PEG-lipid.
- the lipid nanoparticle comprises the following amount (molar percentage) of the protonatable cationic lipid, based on the total amount of the protonatable cationic lipid, the auxiliary lipid, the structural lipid and the PEG-lipid: about 25.0%-75.0%, such as about 25.0%-28.0%, 28.0%-32.0%, 32.0%-35.0%, 35.0%-40.0%, 40.0%-42.0%, 42.0%-45.0%, 45.0%-46.3%, 46.3%-48.0%, 48.0%-49.5%, 49.5%-50.0%, 50.0%-55.0%, 55.0%-60.0%, 60.0%-65.0%, or 65.0%-75.0%.
- the present invention provides a cationic liposome comprising the recombinant RNA molecule, DNA molecule or vector of the present invention.
- the present invention provides a cationic protein comprising the recombinant RNA molecule, DNA molecule or vector of the present invention.
- the present invention provides a cationic polymer comprising the recombinant RNA molecule, DNA molecule or vector of the present invention. thing.
- the present invention provides a pharmaceutical composition, which comprises the recombinant RNA molecule, DNA molecule, vector, host cell, cationic liposome, cationic protein, cationic polymer or lipid nanoparticle of the present invention, and a pharmaceutically acceptable carrier, diluent or excipient.
- the present invention also provides a use of the recombinant RNA molecule, DNA molecule, vector, lipid nanoparticle or pharmaceutical composition of the present invention in preparing a drug.
- the medicament is for gene therapy, genetic vaccination, protein replacement therapy, antisense therapy, or treatment by interfering RNA.
- the drug is a nucleic acid drug, wherein the nucleic acid comprises at least one of the following: RNA, messenger RNA (mRNA), antisense oligonucleotide, DNA, plasmid, ribosomal RNA (rRNA), microRNA (miRNA), transfer RNA (tRNA), small inhibitory RNA (siRNA), small nuclear RNA (snRNA), small hairpin RNA (shRNA), tRNA, single-stranded guide RNA (sgRNA) and Cas9mRNA.
- RNA messenger RNA
- rRNA ribosomal RNA
- miRNA microRNA
- tRNA transfer RNA
- small inhibitory RNA small nuclear RNA
- shRNA small hairpin RNA
- tRNA single-stranded guide RNA
- Cas9mRNA Cas9mRNA.
- the drug is used for the treatment and/or prevention of a disease.
- the disease is selected from the group consisting of rare diseases, infectious diseases, cancer, genetic diseases, autoimmune diseases, diabetes, neurodegenerative diseases, cardiovascular diseases, renal vascular diseases, and metabolic diseases;
- the cancer includes one or more of lung cancer, gastric cancer, liver cancer, esophageal cancer, colon cancer, pancreatic cancer, brain cancer, lymphoma, blood cancer, or prostate cancer;
- the genetic disease includes one or more of hemophilia, thalassemia, and Gaucher's disease.
- the medicament is a vaccine.
- the recombinant molecule is used to produce an antigen of a pathogen or a portion thereof.
- the drug is a gene therapy agent.
- the recombinant molecule is used to produce a protein associated with a genetic disease.
- the recombinant molecules are used to generate antibodies, such as scFVs or nanobodies.
- the present invention also provides a method for preventing or treating a disease, comprising administering the recombinant RNA molecule or pharmaceutical composition of the present invention to a subject in need thereof.
- the disease or condition is selected from the group consisting of rare diseases, infectious diseases, cancer, genetic diseases, autoimmune diseases, diabetes, neurodegenerative diseases, cardiovascular diseases, renal vascular diseases, and metabolic diseases.
- the cancer includes one or more of lung cancer, gastric cancer, liver cancer, esophageal cancer, colon cancer, pancreatic cancer, brain cancer, lymphoma, blood cancer or prostate cancer;
- the genetic disease includes one or more of hemophilia, thalassemia, and Gaucher's disease.
- the recombinant RNA molecule or pharmaceutical composition is used as a vaccine to prevent a disease. In some embodiments, the recombinant RNA molecule is used to produce an antigen or part thereof of a pathogen.
- the recombinant RNA molecule is used to produce the protein associated with the genetic disease.
- the recombinant RNA molecules are used to produce antibodies, such as scFVs or nanobodies.
- the present invention obtains optimized UTRs through a large number of screenings. These UTRs can improve the translation efficiency and/or stability of mRNA, increase the expression level of polypeptides and/or proteins, and have very important application value for the research and development of mRNA vaccines.
- the construction method is as follows:
- Luciferase-pcDNA3 plasmid purchased from Addgene, its plasmid number #18964
- new restriction sites HindIII and BamHI were added in front of the Kozack sequence
- new restriction sites KpnI and ApaI were added after the stop codon of luciferase to obtain plasmid B.
- the nucleotide sequence of Luciferase-pcDNA3 plasmid is shown in SEQ ID NO.50, and the plasmid map of luciferase-pcDNA3 is shown in Figure 2.
- the nucleotide sequence of plasmid B is shown in SEQ ID NO:51, and the plasmid map of plasmid B is shown in Figure 3.
- the Amp (ampicillin) resistance gene in plasmid B was replaced with the Kana (kanamycin) resistance gene, and the neo/KanR sequence from 3746 to 4540 was removed to obtain plasmid C.
- the plasmid spectrum of plasmid C is shown in Figure 4, and the nucleotide sequence of plasmid C is shown in SEQ ID NO:52.
- Plasmid C was inserted with a poly(A) tail as shown in SEQ ID NO:53. Specifically, plasmid C was digested with ApaI, the product was purified, the purified single digestion product and the poly(A) tail as shown in SEQ ID NO:53 were recombined by homologous recombination, the recombinant product was transformed into DH5 ⁇ , clones were screened on an LB plate containing 50 ⁇ g/mL kanamycin, and clones with correct sequencing were selected to extract plasmids to obtain plasmid D.
- the plasmid spectrum of plasmid D is shown in Figure 5, and the nucleotide sequence of plasmid D is shown in SEQ ID NO:54.
- FIG. 1A The construction flowchart of plasmid D is shown in Figure 1A , and the schematic diagram of the construction and transformation is shown in Figure 1B .
- the 5’-UTR sequence was synthesized by Shanghai Sangon Biotechnology Co., Ltd. According to the company’s quality analysis report, the 5’-UTR sequence was consistent with the theoretical design sequence.
- Plasmid D was double-digested with HindIII and BamHI, and the fragment A with a molecular weight of about 6 kb was obtained by gel excision recovery (Axygen); different 5'UTR sequences were introduced into the homology arm sequence by PCR, and homologous recombination reaction was carried out with fragment A, and Takara's homologous recombination enzyme system was used to react at 50°C for 15 minutes; the above reaction system was transformed into DH5 ⁇ (Takara) competent cells, plated (LB plate with 50 ⁇ g/mL kanamycin), and after culturing for 16 hours, 3-4 single clones were picked and cultured in a medium containing 50 ⁇ g/mL kanamycin for 8 hours, and the plasmids were extracted and sequenced by Sangon Biotech (Shanghai) Co., Ltd. to obtain plasmids containing different 5'UTRs.
- the 3'-UTR sequence was synthesized by commissioning Sangon Biotechnology (Shanghai) Co., Ltd. According to the company's quality analysis report, the 3'-UTR sequence was consistent with the theoretical design sequence.
- Plasmid D was double-digested with KpnI and ApaI, and the fragment A with a molecular weight of about 6 kb was obtained by gel excision recovery (Axygen); different 3'-UTR sequences were introduced into the homology arm sequence by PCR, and homologous recombination reaction was carried out with fragment A, and Takara's homologous recombination enzyme system was used to react at 50°C for 15 minutes; the above reaction system was transformed into DH5 ⁇ (Takara) competent cells, plated (LB plate with 50 ⁇ g/mL kanamycin), and after culturing for 16 hours, 3-4 single clones were picked and cultured in a medium containing 50 ⁇ g/mL kanamycin for 8 hours, and the plasmids were extracted and sequenced by Sangon Biotech (Shanghai) Co., Ltd. to obtain plasmids containing different 3'-UTRs.
- Enzyme linearization Take 20 ⁇ g of plasmid and use the corresponding enzyme (BsaI) for linearization.
- the reaction conditions are 37°C for 2h. Note: The reaction system can also be scaled up according to the required production volume.
- Cap1 cap structure and reaction principle are as follows: pppN1(p)Nx-OH(3') ⁇ ppN1(pN)x-OH(3')+Pi ppN1(pN)x-OH(3')+GTP ⁇ G(5')ppp(5')N1(pN)x-OH(3')+PPi G(5')ppp(5')N1(pN)x-OH(3')+AdoMet ⁇ m7G(5')ppp(5')N1(pN)x-OH(3')+AdoHyc m7GpppN1(pN)x-OH(3')+AdoMet ⁇ m7Gppp[m2'-O]N1(pN)x-OH(3')+AdoHyc
- the amount of capped mRNA per time should not exceed 60ug.
- the pre-heated mRNA was mixed with the above system and incubated at 37°C for 1 h.
- Characterization data Determine concentration using onedrop or Qubit.
- HEK293 cells were transfected with mRNA containing different 5'-UTRs. Specifically, HEK293 cells were added to a 96-well plate at 40,000 cells/well one day in advance, and cell transfection was performed when the cell confluence reached 70% to 90% the next day.
- mRNAs containing different 5'-UTRs were transfected into HEK293 cells, with 100 ng of mRNA per well. After incubation in a 37°C CO2 incubator for 16 h, luciferase reporter assay was performed (following the instructions of the Promega kit).
- HEK293 cells were transfected with mRNA containing different 3'-UTRs. Specifically, HEK293 cells were plated on a 96-well plate at 40,000 cells/well one day in advance, and cell transfection was performed when the cell confluence reached 70% to 90% the next day.
- mRNAs containing different 3'-UTRs were transfected into HEK293 cells, with 100 ng of mRNA per well. After incubation in a 37°C CO2 incubator for 16 h, luciferase reporter assay was performed (following the instructions of the Promega kit).
- step 2 Using the series of plasmids in step 1, a series of mRNAs containing 5UTR-NO54 and one of the different 3'UTRs were prepared with reference to Example 4, and then respective LNP-mRNA preparations were prepared. The steps of preparing the LNP-mRNA preparations included:
- LNP encapsulation The volume of the preparation solution (the total volume of the aqueous phase and its alcohol phase of each system) is 1.5 mL, wherein the mass ratio of mRNA: lipid is 1:10, and the concentration of HAc-NaOAc buffer (0.2 M, pH 5.0) in the final aqueous phase is 0.025 M.
- SM-102 lipids dissolved in ethanol
- chemical structure of SM-102 is as follows:
- SM-102 is commercially available or can be prepared according to techniques known in the art.
- 1 ⁇ PBS+8% (m/V) sucrose solution Take 2 packets of 1 ⁇ PBS pre-prepared powder into a beaker, dissolve and mix with 2L DEPC water, then add 160g sucrose and mix to obtain 1 ⁇ PBS+8% (m/V) sucrose solution.
- mice used in the experiment were: female Balb/c mice, 6 weeks old; 3 mice in each group; each mouse was injected with 12 ⁇ g of LNP-mRNA preparation via the tail vein. After 12 hours, the mice were anesthetized by isoflurane inhalation and injected with the luciferase development substrate D-Luciferin (150 mg/kg). The animals were then placed in a supine position, and the signal distribution and intensity of luciferin in the mice were observed using the IVIS live imaging system.
- step 2 Using the series of plasmids in step 1, prepare a series of mRNAs containing 3UTR-NO3 and one of the different 5'-UTRs as described above, and further prepare the corresponding LNP-mRNA preparations.
- mice used in the experiment were: female Balb/c mice, 6 weeks old; 3 mice in each group; each mouse was injected with 12 ⁇ g of LNP-mRNA preparation through the tail vein. After 12 hours, the mice were anesthetized by isoflurane inhalation and injected with luciferase development substrate D-Luciferin (150 mg/kg). The animals were then placed in a supine position, and the signal distribution and intensity of luciferin in the mice were observed using the IVIS in vivo imaging system.
Landscapes
- Health & Medical Sciences (AREA)
- Genetics & Genomics (AREA)
- Life Sciences & Earth Sciences (AREA)
- Engineering & Computer Science (AREA)
- Biomedical Technology (AREA)
- Chemical & Material Sciences (AREA)
- Organic Chemistry (AREA)
- Biotechnology (AREA)
- Zoology (AREA)
- Bioinformatics & Cheminformatics (AREA)
- Wood Science & Technology (AREA)
- General Engineering & Computer Science (AREA)
- Molecular Biology (AREA)
- General Health & Medical Sciences (AREA)
- Plant Pathology (AREA)
- Microbiology (AREA)
- Biophysics (AREA)
- Biochemistry (AREA)
- Physics & Mathematics (AREA)
- Micro-Organisms Or Cultivation Processes Thereof (AREA)
- Medicines That Contain Protein Lipid Enzymes And Other Medicines (AREA)
- Medicinal Chemistry (AREA)
- Pharmacology & Pharmacy (AREA)
- Animal Behavior & Ethology (AREA)
- Public Health (AREA)
- Veterinary Medicine (AREA)
- Epidemiology (AREA)
- Chemical Kinetics & Catalysis (AREA)
- General Chemical & Material Sciences (AREA)
- Nuclear Medicine, Radiotherapy & Molecular Imaging (AREA)
- Medicines Containing Antibodies Or Antigens For Use As Internal Diagnostic Agents (AREA)
Abstract
Description
cap G1G2=m7G-5'-ppp-5'-Gm2'-3'-p-[m7=7-CH3;m2'=2'-O-CH3;-ppp-=-
PO2H-O-PO2H-O-PO2H)-;-p-=-PO2H-]。
pppN1(p)Nx-OH(3')→ppN1(pN)x-OH(3')+Pi
ppN1(pN)x-OH(3')+GTP→G(5')ppp(5')N1(pN)x-OH(3')+PPi
G(5')ppp(5')N1(pN)x-OH(3')+AdoMet→m7G(5')ppp(5')N1(pN)x-OH(3')+AdoHyc
m7GpppN1(pN)x-OH(3')+AdoMet→m7Gppp[m2’-O]N1(pN)x-OH(3')+AdoHyc。
pppN1(p)Nx-OH(3')→ppN1(pN)x-OH(3')+Pi
ppN1(pN)x-OH(3')+GTP→G(5')ppp(5')N1(pN)x-OH(3')+PPi
G(5')ppp(5')N1(pN)x-OH(3')+AdoMet→m7G(5')ppp(5')N1(pN)x-OH(3')+AdoHyc
m7GpppN1(pN)x-OH(3')+AdoMet→m7Gppp[m2’-O]N1(pN)x-OH(3')+AdoHyc
Claims (40)
- 一种重组RNA分子,其包含:(1)编码感兴趣的多肽和/或蛋白的第一核苷酸序列;和(2)含有5’-非翻译区(5’-UTR)的第二核苷酸序列;所述5’-UTR包含选自以下多核苷酸中的至少一种:(a):源自基因PPIA、HPX、FTCD、CDK5RAP3、HSPA8、HBA1、HBB、MYSM1、LENG1、TMSB4X、CASP4、IFNA1、PGLYRP1、UCHL1、CPAMD8、TTR、APOA2、GH1、DTYMK、APOC2和CDK7中的至少一个基因的5’-UTR;(b):(a)中所述5’-UTR的片段;(c):(a)中所述5’-UTR的变体;及(d):(b)中所述片段的变体;所述第一核苷酸序列与所述第二核苷酸序列不天然出现于同一RNA分子。
- 权利要求1的重组RNA分子,其中所述基因是人基因。
- 权利要求1或2的重组RNA分子,其中所述第二核苷酸序列包含以下多核苷酸中的至少一种:(a):源自基因PPIA、HPX、FTCD、CDK5RAP3、HSPA8、HBA1和HBB中的至少一个基因的5’-UTR;(b):(a)中所述5’-UTR的片段;(c):(a)中所述5’-UTR的变体;及(d):(b)中所述片段的变体。
- 权利要求1~3任一项的重组RNA分子,其中所述第二核苷酸序列包含下述多核苷酸中的至少一种:序列如SEQ ID NO:1~21中至少一个所示的多核苷酸编码的RNA、序列如SEQ ID NO:1~21中至少一个所示的多核苷酸编码的RNA的片段、序列如SEQ ID NO:1~21中至少一个所示的多核苷酸编码的RNA的变体和序列如SEQ ID NO:1~21中至少一个所示的多核苷酸编码的RNA的片段的变体;优选地,所述序列如SEQ ID NO:1~21中至少一个所示的多核苷酸编码的RNA的变体、所述序列如SEQ ID NO:1~21中至少一个所示的多核苷酸编码的RNA的片段和所述序列如SEQ ID NO:1~21中至少一个所示的多核苷酸编码的RNA的片段的变体,与所述序列如SEQ ID NO:1~21中的至少一个所示的多核苷酸编码的RNA具有至少70%、80%、85%、90%、91%、92%、93%、94%、95%、96%、97%、98%或99%的相同性。
- 权利要求1~4任一项的重组RNA分子,其中所述第二核苷酸序列包含:(a)中所述基因中的至少两个基因的5’-UTR、(a)中所述基因中的至少两个基因的5’-UTR的片段、(a)中所述基因中的至少两个基因的5’-UTR的变体和(a)中所述基因中的至少两个基因的5’-UTR的片段的变体中的至少一种。
- 权利要求1~4任一项的重组RNA分子,其中所述第二核苷酸序列包含:至少两个拷贝的(a)中的5’-UTR、至少两个拷贝的(a)中的5’-UTR的片段、至少两个拷贝的(a)中的5’-UTR的变体和至少两个拷贝的(a)中的5’-UTR的片段的变体中的至少一种。
- 权利要求1~6任一项的重组RNA分子,其进一步还包含启动子、5’-帽子结构、3’-UTR和poly(A)尾中的至少一种。
- 权利要求7的重组RNA分子,其中所述5’-帽子结构包括m7GpppG、m2 7,3′-OGpppG、m7Gppp(5')N1和m7Gppp(m2′-O)N1中的至少一种。
- 权利要求7或8的重组RNA分子,其中所述3’-UTR包含:i)源自白蛋白基因、α-珠蛋白基因、β-珠蛋白基因、酪氨酸羟化酶基因、脂加氧酶基因和胶原蛋白α基因中的至少一种基因的3’-UTR;ii)所述i)中的所述3’-UTR的变体;iii)源自基因MPND、FBXW10、FBXW12、PGLYRP1、HPX、CDK7、APOC2、PFN1、RBP4、FTCD、NAAA、ALB、GSDMD、FBXL8、ORM1、CASP4、CHMP2A、LENG1、MYCBPAP、APOC1、GAPDH、HSPA8、APOA2、UCHL1、TSG101、NAE1、NFKB2和GH1中至少一种基因的3’-UTR、其片段、变体和片段的变体中的至少一种;优选地,序列如SEQ ID NO:22~49中至少一个所示的多核苷酸编码的RNA、序列SEQ ID NO:22~49中至少一个所示的多核苷酸编码的RNA的片段、序列SEQ ID NO:22~49中至少一个所示的多核苷酸编码的RNA的变体和序列如SEQ ID NO:22~49中至少一个所示的多核苷酸编码的RNA的片段的变体中的至少一种;优选地,所述序列如SEQ ID NO:22~49中至少一个所示的多核苷酸编码的RNA的变体、所述序列如SEQ ID NO:22~49中至少一个所示的多核苷酸编码的RNA的片段、和所述序列如SEQ ID NO:22~49中至少一个所示的多核苷酸编码的RNA的片段的变体与所述序列如SEQ ID NO:22~49中的至少一个所示的多核苷酸编码的RNA有至少70%、80%、85%、90%、91%、92%、93%、94%、95%、96%、97%、98%或99%的相同性;iv)至少两个拷贝的i)、ii)或iii)中的一种多核苷酸;或v)由i)~iii)中的多核苷酸所构成的组中的至少两种多核苷酸。
- 权利要求7~9任一项的重组RNA分子,其中构成所述poly(A)尾的核苷酸包含至少20个、至少40个、至少80个、至少100个或至少120个A核苷酸;优选地,构成所述poly(A)尾的核苷酸包含连续地至少20个、至少40个、至少80个、至少100个或至少120个A核苷酸。
- 权利要求7~10任一项的重组RNA分子,其中构成所述poly(A)尾的核苷酸包括一个或多个除A核苷酸之外的其他核苷酸;可选地,构成所述poly(A)的核苷酸包含连续两个或两个以上的除A核苷酸外的其他核苷酸。
- 一种重组RNA分子,其包含:(1)编码感兴趣的多肽和/或蛋白的第一核苷酸序列;和(2)含有3’-非翻译区(3’-UTR)的第二核苷酸序列;所述3’-UTR包含选自以下多核苷酸中的至少一种:(a):源自基因MPND、FBXW10、FBXW12、PGLYRP1、HPX、CDK7、APOC2、PFN1、RBP4、FTCD、NAAA、ALB、GSDMD、FBXL8、ORM1、CASP4、CHMP2A、LENG1、MYCBPAP、APOC1、GAPDH、HSPA8、APOA2、UCHL1、TSG101、NAE1、和NFKB2中的至少一个基因的3’-UTR;(b):(a)中所述3’-UTR的片段;(c):(a)中所述3’-UTR的变体;及(d):(b)中所述片段的变体;所述第一核苷酸序列和所述第二核苷酸序列不天然出现于同一RNA分子。
- 权利要求12的重组RNA分子,其中所述基因是人基因。
- 权利要求12或13的重组RNA分子,其中所述第二核苷酸序列包含以下多核苷酸中的至少一种:(a):源自基因MPND、FBXW10、FBXW12和PGLYRP1中的至少一个基因的3’-UTR;(b):(a)中所述3’-UTR的片段;(c):(a)中所述3’-UTR的变体;及(d):(b)中所述片段的变体。
- 权利要求12~14任一项的重组RNA分子,其中所述第二核苷酸序列包含下述多核苷酸中的至少一种:序列如SEQ ID NO:22~48中至少一个所示的多核苷酸编码的RNA、序列如SEQ ID NO:22~48中至少一个所示的多核苷酸编码的RNA的片段、序列如SEQ ID NO:22~48中至少一个所示的多核苷酸编码的RNA的变体和序列如SEQ ID NO:22~48中至少一个所示的多核苷酸编码的RNA的片段的变体;优选地,所述序列如SEQ ID NO:22~48中至少一个所示的多核苷酸编码的RNA的变体、所述序列如SEQ ID NO:22~48中至少一个所示的多核苷酸编码的RNA的片段和所述序列如SEQ ID NO:22~48中至少一个所示的多核苷酸编码的RNA的片段的变体,与所述序列如SEQ ID NO:22~48中的至少一个所示的多核苷酸编码的RNA具有至少70%、80%、85%、90%、91%、92%、93%、94%、95%、96%、97%、98%或99%的相同性。
- 权利要求12~15任一项的重组RNA分子,其中所述第二核苷酸序列包含:(a)中所述基因中的至少两个基因的3’-UTR、(a)中所述基因中的至少两个基因的3’-UTR的片段、(a)中所述基因中的至少两个基因的3’-UTR的变体和(a)中所述基因中的至少两个基因的3’-UTR的片段的变体中的至少一种。
- 权利要求12~15任一项的重组RNA分子,其中所述第二核苷酸序列包含:至少两个拷贝的(a)中的3’-UTR、至少两个拷贝的(a)中的3’-UTR的片段、至少两个拷贝的(a)中的3’-UTR的变体和至少两个拷贝的(a)中的3’-UTR的片段的变体中的至少一种。
- 权利要求12~17任一项的重组RNA分子,其进一步还包含启动子、5’-帽子结构、5’-UTR和poly(A)尾中的至少一种。
- 权利要求17的重组RNA分子,其中所述5’-帽子结构包括m7GpppG、m2 7,3′-OGpppG、m7Gppp(5')N1或m7Gppp(m2′-O)N1中的至少一种。
- 权利要求18或19的重组RNA分子,其中所述5’-UTR包含:i)源自基因PPIA、HPX、FTCD、CDK5RAP3、HSPA8、HBA1、HBB、MYSM1、LENG1、TMSB4X、CASP4、IFNA1、PGLYRP1、UCHL1、CPAMD8、TTR、APOA2、GH1、DTYMK、APOC2和CDK7中至少一个基因的5’-UTR、其片段、变体和片段的变体中的至少一种;优选地,序列如SEQ ID NO:1~21中至少一个所示的多核苷酸编码的RNA、序列如SEQ ID NO:1~21中至少一个所示的多核苷酸编码的RNA的片段、序列如SEQ ID NO:1~21中至少一个所示的多核苷酸编码的RNA的变体和序列如SEQ ID NO:1~21中至少一个所示的多核苷酸编码的RNA的片段的变体;优选地,所述序列如SEQ ID NO:1~21中至少一个所示的多核苷酸编码的RNA的变体、所述序列如SEQ ID NO:1~21中至少一个所示的多核苷酸编码的RNA的片段和所述序列如SEQ ID NO:1~21中至少一个所示的多核苷酸编码的RNA的片段的变体,与所述序列如SEQ ID NO:1~21中的至少一个所示的多核苷酸编码的RNA具有至少70%、80%、85%、90%、91%、92%、93%、94%、95%、96%、97%、98%或99%的相同性;ii)至少两个拷贝的i)中的其中一种多核苷酸;或iii)至少两种i)中的多核苷酸。
- 权利要求18~20任一项的重组RNA分子,其中构成所述poly(A)尾的核苷酸包含至少20个、至少40个、至少80个、至少100个或至少120个A核苷酸;优选地,构成所述poly(A)尾的核苷酸包含连续地至少20个、至少40个、至少80个、至少100个或至少120个A核苷酸。
- 权利要求18~21任一项的重组RNA分子,其中构成所述poly(A)尾的核苷酸包 含一个或多个除A核苷酸之外的其他核苷酸。
- 一种DNA分子,其编码权利要求1~22任一项的重组RNA分子。
- 一种载体,其包含权利要求23的DNA分子。
- 一种宿主细胞,其包含权利要求1~22任一项的重组RNA分子、权利要求23的DNA分子或权利要求24所述的载体。
- 一种脂质纳米颗粒,其包含权利要求1~22任一项的重组RNA分子。
- 一种药物组合物,其包含权利要求1~22任一项的重组RNA分子、权利要求23的DNA、权利要求24的载体、权利要求25的宿主细胞或权利要求26的脂质纳米颗粒,以及药学上可接受的载剂。
- 一种载体,其包含编码5’-UTR的第一核苷酸序列和/或编码3’-UTR的第二核苷酸序列,其中:所述第一核苷酸序列包含如下多核苷酸中的至少一种:(a):编码源自基因PPIA、HPX、FTCD、CDK5RAP3、HSPA8、HBA1、HBB、MYSM1、LENG1、TMSB4X、CASP4、IFNA1、PGLYRP1、UCHL1、CPAMD8、TTR、APOA2、GH1、DTYMK、APOC2和CDK7中的至少一个基因的5’-UTR的多核苷酸;(b):编码(a)中所述5’-UTR的片段的多核苷酸;(c):编码(a)中所述5’-UTR的变体的多核苷酸;及(d):编码(b)中所述片段的变体的多核苷酸;所述第二核苷酸序列包含如下多核苷酸中的至少一种:(e):编码源自基因MPND、FBXW10、FBXW12、PGLYRP1、HPX、CDK7、APOC2、PFN1、RBP4、FTCD、NAAA、ALB、GSDMD、FBXL8、ORM1、CASP4、CHMP2A、LENG1、MYCBPAP、APOC1、GAPDH、HSPA8、APOA2、UCHL1、TSG101、NAE1和NFKB2中的至少一个基因的3’-UTR的多核苷酸;(f):编码(e)中所述3’-UTR的片段的多核苷酸;(g):编码(e)中所述3’-UTR的变体的多核苷酸;及(h):编码(f)中所述片段的变体的多核苷酸。
- 权利要求28的载体,其中所述基因是人基因。
- 权利要求28或29的载体,其中所述第一核苷酸序列包含源自基因PPIA、HPX、FTCD、CDK5RAP3、HSPA8、HBA1和HBB中的至少一个的5’-UTR或其变体。
- 权利要求28~30任一项的载体,其中所述第二核苷酸序列包含源自基因MPND、FBXW10、FBXW12、和PGLYRP1中的至少一个的3’-UTR或其变体。
- 权利要求28~31任一项的载体,其包含所述第一核苷酸序列和第二核苷酸序列。
- 权利要求28~32任一项的载体,其中所述第一核苷酸序列包含:i)序列如SEQ ID NO:1~21中至少一个所示的多核苷酸、序列如SEQ ID NO:1~21中至少一个所示的多核苷酸的片段、序列如SEQ ID NO:1~21中至少一个所示的多核苷酸的变体和序列如SEQ ID NO:1~21中至少一个所示的多核苷酸的片段的变体;优选地,所述序列如SEQ ID NO:1~21中至少一个所示的多核苷酸的变体、所述序列如SEQ ID NO:1~21中至少一个所示的多核苷酸的片段和所述序列如SEQ ID NO:1~21中至少一个所示的多核苷酸的片段的变体,与所述序列如SEQ ID NO:1~21中的至少一个所示的多核苷酸具有至少70%、80%、85%、90%、91%、92%、93%、94%、95%、96%、97%、98%或99%的同源性;ii)至少两个拷贝的i)中的一种多核苷酸;或iii)至少两种i)中的多核苷酸。
- 权利要求28~33任一项的载体,其中所述第二核苷酸序列包含:(1):编码源自白蛋白基因、α-珠蛋白基因、β-珠蛋白基因、酪氨酸羟化酶基因、脂加氧酶基因、和胶原蛋白α基因中至少一个基因的3’-UTR的多核苷酸;(2):编码(1)中的所述3’-UTR的变体的多核苷酸;(3):序列如SEQ ID NO:22~48中至少一个所示的多核苷酸、序列如SEQ ID NO: 22~48中至少一个所示的多核苷酸的片段、序列如SEQ ID NO:22~48中至少一个所示的多核苷酸的变体和序列如SEQ ID NO:22~48中至少一个所示的多核苷酸的片段的变体中的至少一种;优选地,所述序列如SEQ ID NO:22~48中至少一个所示的多核苷酸编码的RNA、所述序列如SEQ ID NO:22~48中至少一个所示的多核苷酸编码的片段、和所述序列如SEQ ID NO:22~48中至少一个所示的多核苷酸编码的片段的变体,与所述序列如SEQ ID NO:22~48中的至少一个所示的多核苷酸有70%、80%、85%、90%、91%、92%、93%、94%、95%、96%、97%、98%或99%的相同性;(4):至少两个拷贝的(1)、(2)或(3)中的一种多核苷酸;或(5):由(1)~(3)中的多核苷酸所构成的组中的至少两种多核苷酸。
- 权利要求28~34任一项的载体,其还包含编码poly(A)尾的多核苷酸。
- 权利要求35的载体,其中组成所述poly(A)尾的核苷酸包含至少20个、至少40个、至少80个、至少100个或至少120个A核苷酸;优选地,组成所述poly(A)尾的核苷酸包含连续地至少20个、至少40个、至少80个、至少100个或至少120个A核苷酸。
- 权利要求35的载体,其中组成所述poly(A)尾的核苷酸包含一个或多个除A核苷酸之外的其他核苷酸。
- 权利要求1~22任一项所述的重组RNA分子、权利要求23所述的DNA分子、权利要求24或28所述的载体、权利要求25所述的宿主细胞、权利要求26述的脂质纳米颗粒或权利要求27所述的药物组合物在制备药物中的用途;优选地,所述药物用于基因治疗、基因疫苗接种或蛋白质替代疗法。
- 权利要求38所述的用途,所述药物为核酸药物,其中所述核酸包括下述的至少一种:RNA、信使RNA(mRNA)、DNA、质粒、核糖体RNA(rRNA)、单链向导RNA(sgRNA)和Cas9 mRNA。
- 权利要求38~39任一项所述的用途,所述药物用于疾病的治疗和/或预防;优选地,所述疾病选自由以下组成的组:罕见病、感染性疾病、癌症、遗传性疾病、自体免疫性疾病、糖尿病、神经退化性疾病、心血管疾病、肾血管疾病,以及代谢性疾病;优选地,所述癌症包括肺癌、胃癌、肝癌、食管癌、结肠癌、胰腺癌、脑癌、淋巴癌、血癌或前列腺癌中的一种或多种;所述遗传疾病包括血友病,地中海贫血、高雪氏病中的一种或多种。
Priority Applications (5)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| JP2025534775A JP2026501176A (ja) | 2022-12-16 | 2023-12-15 | Rna分子の翻訳効率及び/又は安定性を向上させるためのutr及びその使用 |
| KR1020257023528A KR20250113527A (ko) | 2022-12-16 | 2023-12-15 | Rna 분자의 번역 효율 및/또는 안정성을 개선하기 위한 utr 및 이의 용도 |
| EP23902830.1A EP4636090A1 (en) | 2022-12-16 | 2023-12-15 | Utr for improving translation efficiency and/or stability of rna molecule and use thereof |
| AU2023396545A AU2023396545A1 (en) | 2022-12-16 | 2023-12-15 | Utr for improving translation efficiency and/or stability of rna molecule and use thereof |
| CN202380086599.2A CN120435560A (zh) | 2022-12-16 | 2023-12-15 | 提高rna分子翻译效率和/或稳定性的utr及其应用 |
Applications Claiming Priority (2)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| CNPCT/CN2022/139593 | 2022-12-16 | ||
| CN2022139593 | 2022-12-16 |
Publications (1)
| Publication Number | Publication Date |
|---|---|
| WO2024125637A1 true WO2024125637A1 (zh) | 2024-06-20 |
Family
ID=91484402
Family Applications (1)
| Application Number | Title | Priority Date | Filing Date |
|---|---|---|---|
| PCT/CN2023/139184 Ceased WO2024125637A1 (zh) | 2022-12-16 | 2023-12-15 | 提高rna分子翻译效率和/或稳定性的utr及其应用 |
Country Status (6)
| Country | Link |
|---|---|
| EP (1) | EP4636090A1 (zh) |
| JP (1) | JP2026501176A (zh) |
| KR (1) | KR20250113527A (zh) |
| CN (1) | CN120435560A (zh) |
| AU (1) | AU2023396545A1 (zh) |
| WO (1) | WO2024125637A1 (zh) |
Cited By (1)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| WO2025228326A1 (zh) * | 2024-04-29 | 2025-11-06 | 深圳深信生物科技有限公司 | 重组rna分子及其应用 |
Citations (11)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| WO2013120628A1 (en) | 2012-02-15 | 2013-08-22 | Curevac Gmbh | Nucleic acid comprising or coding for a histone stem-loop and a poly(a) sequence or a polyadenylation signal for increasing the expression of an encoded pathogenic antigen |
| CN104321432A (zh) * | 2012-03-27 | 2015-01-28 | 库瑞瓦格有限责任公司 | 包含5′top utr的人工核酸分子 |
| WO2016170176A1 (en) | 2015-04-22 | 2016-10-27 | Curevac Ag | Rna containing composition for treatment of tumor diseases |
| WO2017191274A2 (en) | 2016-05-04 | 2017-11-09 | Curevac Ag | Rna encoding a therapeutic protein |
| WO2018078053A1 (en) | 2016-10-26 | 2018-05-03 | Curevac Ag | Lipid nanoparticle mrna vaccines |
| US20180195077A1 (en) * | 2015-07-16 | 2018-07-12 | Cornell University | Methods of enhancing translation ability of rna molecules, treatments, and kits |
| WO2019077001A1 (en) | 2017-10-19 | 2019-04-25 | Curevac Ag | NEW ARTIFICIAL NUCLEIC ACID MOLECULES |
| CN111405912A (zh) * | 2017-09-29 | 2020-07-10 | 因特利亚治疗公司 | 用于基因组编辑的多核苷酸、组合物及方法 |
| CN112656954A (zh) * | 2013-10-22 | 2021-04-16 | 夏尔人类遗传性治疗公司 | 用于递送信使rna的脂质制剂 |
| CN113521269A (zh) * | 2020-04-22 | 2021-10-22 | 生物技术Rna制药有限公司 | 冠状病毒疫苗 |
| CN114717230A (zh) * | 2021-01-05 | 2022-07-08 | 麦塞拿治疗(香港)有限公司 | 成纤维细胞生长因子mRNA的无细胞和无载体体外RNA转录方法和核酸分子 |
-
2023
- 2023-12-15 CN CN202380086599.2A patent/CN120435560A/zh active Pending
- 2023-12-15 EP EP23902830.1A patent/EP4636090A1/en active Pending
- 2023-12-15 JP JP2025534775A patent/JP2026501176A/ja active Pending
- 2023-12-15 KR KR1020257023528A patent/KR20250113527A/ko active Pending
- 2023-12-15 WO PCT/CN2023/139184 patent/WO2024125637A1/zh not_active Ceased
- 2023-12-15 AU AU2023396545A patent/AU2023396545A1/en active Pending
Patent Citations (11)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| WO2013120628A1 (en) | 2012-02-15 | 2013-08-22 | Curevac Gmbh | Nucleic acid comprising or coding for a histone stem-loop and a poly(a) sequence or a polyadenylation signal for increasing the expression of an encoded pathogenic antigen |
| CN104321432A (zh) * | 2012-03-27 | 2015-01-28 | 库瑞瓦格有限责任公司 | 包含5′top utr的人工核酸分子 |
| CN112656954A (zh) * | 2013-10-22 | 2021-04-16 | 夏尔人类遗传性治疗公司 | 用于递送信使rna的脂质制剂 |
| WO2016170176A1 (en) | 2015-04-22 | 2016-10-27 | Curevac Ag | Rna containing composition for treatment of tumor diseases |
| US20180195077A1 (en) * | 2015-07-16 | 2018-07-12 | Cornell University | Methods of enhancing translation ability of rna molecules, treatments, and kits |
| WO2017191274A2 (en) | 2016-05-04 | 2017-11-09 | Curevac Ag | Rna encoding a therapeutic protein |
| WO2018078053A1 (en) | 2016-10-26 | 2018-05-03 | Curevac Ag | Lipid nanoparticle mrna vaccines |
| CN111405912A (zh) * | 2017-09-29 | 2020-07-10 | 因特利亚治疗公司 | 用于基因组编辑的多核苷酸、组合物及方法 |
| WO2019077001A1 (en) | 2017-10-19 | 2019-04-25 | Curevac Ag | NEW ARTIFICIAL NUCLEIC ACID MOLECULES |
| CN113521269A (zh) * | 2020-04-22 | 2021-10-22 | 生物技术Rna制药有限公司 | 冠状病毒疫苗 |
| CN114717230A (zh) * | 2021-01-05 | 2022-07-08 | 麦塞拿治疗(香港)有限公司 | 成纤维细胞生长因子mRNA的无细胞和无载体体外RNA转录方法和核酸分子 |
Non-Patent Citations (6)
| Title |
|---|
| "Molecular Cloning: A Laboratory Manual", 1989, COLD SPRING HARBOR LABORATORY PRESS |
| NEDDLEMANWUNSCH, J. MOL. BIOL., vol. 48, 1970, pages 443 |
| PARDI N.MURAMATSU HWEISSMAN DKARIKÓ K: "Synthetic Messenger RNA and Cell Metabolism Modulation. Methods in Molecular Biology (Methods and Protocols", vol. 969, 2013, HUMANA PRESS |
| PEARSONLIPMAN, PROC. NATL ACAD. SCI. USA, vol. 88, 1988, pages 2444 |
| See also references of EP4636090A1 |
| SMITHWATERMAN, ADS APP. MATH., vol. 2, 1981, pages 482 |
Cited By (1)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| WO2025228326A1 (zh) * | 2024-04-29 | 2025-11-06 | 深圳深信生物科技有限公司 | 重组rna分子及其应用 |
Also Published As
| Publication number | Publication date |
|---|---|
| EP4636090A1 (en) | 2025-10-22 |
| KR20250113527A (ko) | 2025-07-25 |
| CN120435560A (zh) | 2025-08-05 |
| JP2026501176A (ja) | 2026-01-14 |
| AU2023396545A1 (en) | 2025-07-17 |
Similar Documents
| Publication | Publication Date | Title |
|---|---|---|
| CN108026537B (zh) | 人工核酸分子 | |
| CA3216490A1 (en) | Epstein-barr virus mrna vaccines | |
| WO2022221336A1 (en) | Respiratory syncytial virus mrna vaccines | |
| TW202305140A (zh) | 多價rna組合物中rna種類之鑑定及比率測定方法 | |
| EP4096683A1 (en) | Respiratory virus immunizing compositions | |
| CN114901360A (zh) | 用于递送核酸的新型脂质纳米颗粒 | |
| EP4219723B1 (en) | Circular rna platforms, uses thereof, and their manufacturing processes from engineered dna | |
| JP2024528447A (ja) | 非天然型5’非翻訳領域及び3’非翻訳領域、及びその用途 | |
| US20240200083A1 (en) | Plasmid system without selectable markers and production method thereof | |
| EP4636090A1 (en) | Utr for improving translation efficiency and/or stability of rna molecule and use thereof | |
| WO2023227124A1 (zh) | 一种构建mRNA体外转录模板的骨架 | |
| CN118638801A (zh) | 一种编码促红细胞生成素的mRNA分子及其应用 | |
| AU2022349697A1 (en) | Synthetic production of circular dna vectors | |
| EP4560021A1 (en) | Mrna for sars-cov-2 s protein and use thereof | |
| CN119768532A (zh) | 用于生产核酸的方法 | |
| WO2025228326A1 (zh) | 重组rna分子及其应用 | |
| CA3262348A1 (en) | Mrna for sars-cov-2 s protein and use thereof | |
| EP4123029A1 (en) | In-vitro transcript mrna and pharmaceutical composition comprising same | |
| CN121586776A (zh) | 自扩增核酸分子及其应用 | |
| WO2024199441A1 (zh) | 一种包含poly(A)的多核苷酸分子及其用途 | |
| WO2025149032A1 (en) | Respiratory syncytial virus vaccine compositions and their use | |
| WO2025177220A1 (en) | Alternative self-amplifying rna | |
| WO2025202929A1 (en) | Methods for producing nucleic acids | |
| WO2026027700A2 (en) | Modified ribonucleic acids | |
| Widada | Isolation of GAG-CA Subunit of Jembrana Virus from the Viral Genome by RT-PCR and Its Cloning in pCR21-Topoplasmid usingTopoisomerase-Based System |
Legal Events
| Date | Code | Title | Description |
|---|---|---|---|
| 121 | Ep: the epo has been informed by wipo that ep was designated in this application |
Ref document number: 23902830 Country of ref document: EP Kind code of ref document: A1 |
|
| ENP | Entry into the national phase |
Ref document number: 2025534775 Country of ref document: JP Kind code of ref document: A |
|
| WWE | Wipo information: entry into national phase |
Ref document number: 2025534775 Country of ref document: JP |
|
| WWE | Wipo information: entry into national phase |
Ref document number: 202380086599.2 Country of ref document: CN |
|
| WWE | Wipo information: entry into national phase |
Ref document number: AU2023396545 Country of ref document: AU |
|
| ENP | Entry into the national phase |
Ref document number: 1020257023528 Country of ref document: KR Free format text: ST27 STATUS EVENT CODE: A-0-1-A10-A15-NAP-PA0105 (AS PROVIDED BY THE NATIONAL OFFICE) |
|
| WWE | Wipo information: entry into national phase |
Ref document number: 1020257023528 Country of ref document: KR |
|
| WWE | Wipo information: entry into national phase |
Ref document number: 2023902830 Country of ref document: EP |
|
| ENP | Entry into the national phase |
Ref document number: 2023396545 Country of ref document: AU Date of ref document: 20231215 Kind code of ref document: A |
|
| NENP | Non-entry into the national phase |
Ref country code: DE |
|
| WWP | Wipo information: published in national office |
Ref document number: 1020257023528 Country of ref document: KR |
|
| ENP | Entry into the national phase |
Ref document number: 2023902830 Country of ref document: EP Effective date: 20250716 |
|
| ENP | Entry into the national phase |
Ref document number: 2023902830 Country of ref document: EP Effective date: 20250716 |
|
| WWP | Wipo information: published in national office |
Ref document number: 202380086599.2 Country of ref document: CN |
|
| ENP | Entry into the national phase |
Ref document number: 2023902830 Country of ref document: EP Effective date: 20250716 |
|
| WWP | Wipo information: published in national office |
Ref document number: 2023902830 Country of ref document: EP |