WO2024179544A1 - 编码rna的工程化dna分子 - Google Patents

编码rna的工程化dna分子 Download PDF

Info

Publication number
WO2024179544A1
WO2024179544A1 PCT/CN2024/079346 CN2024079346W WO2024179544A1 WO 2024179544 A1 WO2024179544 A1 WO 2024179544A1 CN 2024079346 W CN2024079346 W CN 2024079346W WO 2024179544 A1 WO2024179544 A1 WO 2024179544A1
Authority
WO
WIPO (PCT)
Prior art keywords
poly
tail
dna molecule
coding sequence
rna
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Ceased
Application number
PCT/CN2024/079346
Other languages
English (en)
French (fr)
Inventor
张卫国
董翊洁
陈华
秦尉
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
RinuaGene International HK Ltd
RinuaGene Biotechnology Co Ltd
Original Assignee
RinuaGene International HK Ltd
RinuaGene Biotechnology Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by RinuaGene International HK Ltd, RinuaGene Biotechnology Co Ltd filed Critical RinuaGene International HK Ltd
Priority to AU2024229819A priority Critical patent/AU2024229819A1/en
Priority to KR1020257029385A priority patent/KR20250161542A/ko
Priority to JP2025549603A priority patent/JP2026507667A/ja
Priority to EP24763224.3A priority patent/EP4674967A1/en
Publication of WO2024179544A1 publication Critical patent/WO2024179544A1/zh
Priority to US19/083,930 priority patent/US20250215441A1/en
Priority to MX2025010281A priority patent/MX2025010281A/es
Anticipated expiration legal-status Critical
Ceased legal-status Critical Current

Links

Classifications

    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12NMICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
    • C12N15/00Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
    • C12N15/09Recombinant DNA-technology
    • C12N15/63Introduction of foreign genetic material using vectors; Vectors; Use of hosts therefor; Regulation of expression
    • C12N15/67General methods for enhancing the expression
    • CCHEMISTRY; METALLURGY
    • C07ORGANIC CHEMISTRY
    • C07KPEPTIDES
    • C07K14/00Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof
    • C07K14/005Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof from viruses
    • CCHEMISTRY; METALLURGY
    • C07ORGANIC CHEMISTRY
    • C07KPEPTIDES
    • C07K14/00Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof
    • C07K14/005Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof from viruses
    • C07K14/01DNA viruses
    • C07K14/025Papovaviridae, e.g. papillomavirus, polyomavirus, SV40, BK virus, JC virus
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12NMICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
    • C12N1/00Microorganisms; Compositions thereof; Processes of propagating, maintaining or preserving microorganisms or compositions thereof; Processes of preparing or isolating a composition containing a microorganism; Culture media therefor
    • C12N1/20Bacteria; Culture media therefor
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12NMICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
    • C12N15/00Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
    • C12N15/09Recombinant DNA-technology
    • C12N15/11DNA or RNA fragments; Modified forms thereof; Non-coding nucleic acids having a biological activity
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12NMICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
    • C12N15/00Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
    • C12N15/09Recombinant DNA-technology
    • C12N15/11DNA or RNA fragments; Modified forms thereof; Non-coding nucleic acids having a biological activity
    • C12N15/113Non-coding nucleic acids modulating the expression of genes, e.g. antisense oligonucleotides; Antisense DNA or RNA; Triplex- forming oligonucleotides; Catalytic nucleic acids, e.g. ribozymes; Nucleic acids used in co-suppression or gene silencing
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12NMICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
    • C12N15/00Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
    • C12N15/09Recombinant DNA-technology
    • C12N15/11DNA or RNA fragments; Modified forms thereof; Non-coding nucleic acids having a biological activity
    • C12N15/62DNA sequences coding for fusion proteins
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12NMICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
    • C12N15/00Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
    • C12N15/09Recombinant DNA-technology
    • C12N15/63Introduction of foreign genetic material using vectors; Vectors; Use of hosts therefor; Regulation of expression
    • C12N15/67General methods for enhancing the expression
    • C12N15/68Stabilisation of the vector
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12NMICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
    • C12N15/00Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
    • C12N15/09Recombinant DNA-technology
    • C12N15/63Introduction of foreign genetic material using vectors; Vectors; Use of hosts therefor; Regulation of expression
    • C12N15/70Vectors or expression systems specially adapted for E. coli
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12NMICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
    • C12N15/00Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
    • C12N15/09Recombinant DNA-technology
    • C12N15/63Introduction of foreign genetic material using vectors; Vectors; Use of hosts therefor; Regulation of expression
    • C12N15/74Vectors or expression systems specially adapted for prokaryotic hosts other than E. coli, e.g. Lactobacillus, Micromonospora
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12NMICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
    • C12N15/00Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
    • C12N15/09Recombinant DNA-technology
    • C12N15/63Introduction of foreign genetic material using vectors; Vectors; Use of hosts therefor; Regulation of expression
    • C12N15/79Vectors or expression systems specially adapted for eukaryotic hosts
    • C12N15/85Vectors or expression systems specially adapted for eukaryotic hosts for animal cells
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12NMICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
    • C12N15/00Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
    • C12N15/09Recombinant DNA-technology
    • C12N15/63Introduction of foreign genetic material using vectors; Vectors; Use of hosts therefor; Regulation of expression
    • C12N15/79Vectors or expression systems specially adapted for eukaryotic hosts
    • C12N15/85Vectors or expression systems specially adapted for eukaryotic hosts for animal cells
    • C12N15/86Viral vectors
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12NMICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
    • C12N5/00Undifferentiated human, animal or plant cells, e.g. cell lines; Tissues; Cultivation or maintenance thereof; Culture media therefor
    • C12N5/10Cells modified by introduction of foreign genetic material
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12PFERMENTATION OR ENZYME-USING PROCESSES TO SYNTHESISE A DESIRED CHEMICAL COMPOUND OR COMPOSITION OR TO SEPARATE OPTICAL ISOMERS FROM A RACEMIC MIXTURE
    • C12P19/00Preparation of compounds containing saccharide radicals
    • C12P19/26Preparation of nitrogen-containing carbohydrates
    • C12P19/28N-glycosides
    • C12P19/30Nucleotides
    • C12P19/34Polynucleotides, e.g. nucleic acids, oligoribonucleotides
    • CCHEMISTRY; METALLURGY
    • C40COMBINATORIAL TECHNOLOGY
    • C40BCOMBINATORIAL CHEMISTRY; LIBRARIES, e.g. CHEMICAL LIBRARIES
    • C40B40/00Libraries per se, e.g. arrays, mixtures
    • C40B40/04Libraries containing only organic compounds
    • C40B40/06Libraries containing nucleotides or polynucleotides, or derivatives thereof
    • C40B40/08Libraries containing RNA or DNA which encodes proteins, e.g. gene libraries
    • CCHEMISTRY; METALLURGY
    • C07ORGANIC CHEMISTRY
    • C07KPEPTIDES
    • C07K2319/00Fusion polypeptide
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12NMICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
    • C12N2710/00MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA dsDNA viruses
    • C12N2710/00011Details
    • C12N2710/20011Papillomaviridae
    • C12N2710/20022New viral proteins or individual genes, new structural or functional aspects of known viral proteins or genes
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12NMICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
    • C12N2710/00MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA dsDNA viruses
    • C12N2710/00011Details
    • C12N2710/20011Papillomaviridae
    • C12N2710/20051Methods of production or purification of viral material
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12NMICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
    • C12N2800/00Nucleic acids vectors
    • C12N2800/10Plasmid DNA
    • C12N2800/106Plasmid DNA for vertebrates
    • C12N2800/107Plasmid DNA for vertebrates for mammalian
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12NMICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
    • C12N2830/00Vector systems having a special element relevant for transcription
    • C12N2830/50Vector systems having a special element relevant for transcription regulating RNA stability, not being an intron, e.g. poly A signal
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12RINDEXING SCHEME ASSOCIATED WITH SUBCLASSES C12C - C12Q, RELATING TO MICROORGANISMS
    • C12R2001/00Microorganisms ; Processes using microorganisms
    • C12R2001/01Bacteria or Actinomycetales ; using bacteria or Actinomycetales
    • C12R2001/185Escherichia
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12RINDEXING SCHEME ASSOCIATED WITH SUBCLASSES C12C - C12Q, RELATING TO MICROORGANISMS
    • C12R2001/00Microorganisms ; Processes using microorganisms
    • C12R2001/01Bacteria or Actinomycetales ; using bacteria or Actinomycetales
    • C12R2001/185Escherichia
    • C12R2001/19Escherichia coli

Definitions

  • the present application relates to the field of biotechnology, and in particular to an RNA comprising a poly(A) tail.
  • the poly(A) tail enables the replication of the DNA encoding the RNA in a prokaryotic system to have a higher stability, and can be used to regulate the expression level of the RNA in a eukaryotic cell.
  • the primary structure of a translatable mRNA drug molecule consists of a 5' cap structure, a 5' untranslated region (5'UTR), a coding region, a 3' untranslated region, and a polyadenylic acid tail (poly(A) tail).
  • the known functions of the poly(A) tail include maintaining the in vivo stability of mRNA molecules and participating in the initiation of protein translation, which is achieved through the interaction between poly(A) tail binding protein (PABP) and the translation initiation complex.
  • PABP poly(A) tail binding protein
  • the poly(A) tail is synthesized by a typical poly(A) polymerase as a post-transcriptional modification.
  • the first step in the in vitro preparation of mRNA drugs is to synthesize them through in vitro transcription (IVT) using a linearized plasmid containing a designed product sequence as a template, and the Poly(A) tail is usually added downstream of the 3’UTR in a co-transcriptional manner.
  • IVT in vitro transcription
  • the poly(dA:dT) repeat sequence in the plasmid is unstable during replication in Escherichia coli, and deletion mutations often occur, resulting in shortening of poly(dA:dT). This phenomenon is not conducive to the preparation process of in vitro transcription template plasmids produced on a large scale through fermentation, and Poly(A) truncation has a significant effect on the in vivo stability and biological activity of mRNA.
  • the element a is composed of a plurality of consecutive adenine (A) nucleotides, and the length of the element a ranges from ⁇ 20 nt;
  • the element b is composed of a plurality of consecutive A nucleotides, and the length of the element b is in the range of 3nt ⁇ b ⁇ 20nt;
  • the element c consists of a non-A nucleotide, and the nucleotide is selected from T, C, and G nucleotides;
  • Poly(A) tail coding sequence does not contain elements b adjacent to each other, does not contain elements c adjacent to each other, and does not contain elements d adjacent to each other.
  • the Poly (A) tail coding sequence further comprises a unique element e, wherein the element e consists of one or two consecutive A's; and is located at the 3' end of the Poly (A) tail coding sequence and is adjacent to element d or element c.
  • the number of the elements b is 2-10, for example 3, 4, 5, 6, 7, 8, or 9.
  • the number of the elements c is 0 to 10, for example, 1, 2, 3, 4, 5, 6, 7, 8, or 9.
  • the number of the elements d is 0 to 5, such as 1, 2, 3, or 4.
  • the ratio of element c and element d is The total number is 2-15, for example 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13 or 14.
  • the number of the element a is 1, the number of the element b is 3, the number of the element c is 2, and the number of the element d is 1.
  • the number of the element a is 1, the number of the element b is 4, the number of the element c is 4, and the number of the element d is 1.
  • the number of the element a is 1, the number of the element b is 5, the number of the element c is 4, and the number of the element d is 1.
  • the number of the element a is 1, the number of the element b is 3, the number of the element c is 3, and the number of the element d is 1.
  • the element a ⁇ 80 nt. In some embodiments, the element a is 21nt, 22nt, 23nt, 24nt, 25nt, 26nt, 27nt, 28nt, 29nt, 30nt, 31nt, 32nt, 33nt, 34nt, 35nt, 36nt, 37nt, 38nt, 39nt, 40nt, 41nt, 42nt, 43nt, 44nt, 45nt, 46nt, 47nt, 48nt, 49nt, 50nt, 51nt, 52nt, 53nt, 54nt, 55nt, 56nt, 57nt, 58nt, 59nt, 60nt, 61nt, 62nt, 63nt, 64nt, 65nt, 66nt, 67nt, 68nt, 69nt, 70nt, 71nt, 72nt, 73nt, 74nt, 75
  • the element b is 3nt, 4nt, 5nt, 6nt, 7nt, 8nt, 9nt, 10nt, 11nt, 12nt, 13nt, 14nt, 15nt, 16nt, 17nt, 18nt or 19nt.
  • the element d is 2nt ⁇ d ⁇ 20nt, 3 to 18nt, 5 to 16nt, 4 to 10nt, or 6 to 12nt, for example 2nt, 3nt, 4nt, 5nt, 6nt, 7nt, 8nt, 9nt, 10nt, 11nt, 12nt, 13nt, 14nt, 15nt, 16nt, 17nt, 18nt, 19nt or 20nt, preferably 6nt.
  • the length of the Poly (A) tail encoding sequence is greater than 40nt, for example, 41nt, 42nt, 43nt, 44nt, 45nt, 46nt, 47nt, 48nt, 49nt, 50nt, 51nt, 52nt, 53nt, 54nt, 55nt, 56nt, 57nt, 58nt, 59nt, 60nt, 61nt, 62nt, 63nt, 64nt, 65nt, 66nt, 67nt, 68nt, 69nt, 70nt, 71nt, 72nt, 73nt, 74nt, 75nt, 76nt, 77nt, 78nt, 79nt, 80nt ,81nt,82nt,83nt,84nt,85nt,86nt,87nt,88nt,89nt,90nt,91nt,92nt,93
  • 50% or more of the polynucleotides of element a are located in the 5' portion or 3' portion of the poly(A) tail coding sequence. In some embodiments, more than 50% of the polynucleotides of element a are located in the 5' portion of the poly(A) tail coding sequence. More than 50% of the polynucleotides of element a are located at the 3' portion of the poly(A) tail coding sequence. In some embodiments, the number of nucleotides of element a located at the 3' portion of the poly(A) tail coding sequence is equal to the number of nucleotides located at the 5' portion of the poly(A) tail coding sequence.
  • the element c is G, C or T.
  • the element d comprises a palindromic sequence.
  • Element d is a palindromic sequence.
  • the element d comprises a sequence selected from the following: GATATC (SEQ ID NO: 15), GTATAC (SEQ ID NO: 16), GAATCT (SEQ ID NO: 17), GCATATGACT (SEQ ID NO: 18) and GATATCGTATAC (SEQ ID NO: 19).
  • the element d is a sequence selected from the following: GATATC (SEQ ID NO: 15), GTATAC (SEQ ID NO: 16), GAATCT (SEQ ID NO: 17), GCATATGACT (SEQ ID NO: 18) and GATATCGTATAC (SEQ ID NO: 19).
  • the element d comprises a polynucleotide sequence as shown in SEQ ID NO: 15.
  • the polynucleotide sequence of the element d is as shown in SEQ ID NO: 15.
  • the nucleotide at the 3' end of the poly (A) tail encoding sequence is A. In some embodiments, the nucleotide at the 3' end of the poly (A) tail encoding sequence is G. In some embodiments, the nucleotide at the 3' end of the poly (A) tail encoding sequence is C. In some embodiments, the nucleotide at the 3' end of the poly (A) tail encoding sequence is T.
  • the 3' portion of the Poly (A) tail coding sequence comprises one or more non-A nucleotides. In some embodiments, the Poly (A) tail coding sequence comprises one or more non-A nucleotides at 1/2 near the 3' end. In some embodiments, the Poly (A) tail coding sequence comprises one or more non-A nucleotides at 1/3 near the 3' end. In some embodiments, the Poly (A) tail coding sequence comprises one or more non-A nucleotides at 1/4 near the 3' end.
  • the structure of the Poly(A) tail encoding sequence is:
  • the structure of the Poly(A) tail encoding sequence is:
  • the structure of the Poly(A) tail encoding sequence is:
  • element d consists of 6 or 12 nucleotides.
  • the Poly(A) tail encoding sequence is as shown in any one of SEQ ID NO: 1 to 10.
  • the Poly (A) tail coding sequence is shown as SEQ ID NO: 3 or SEQ ID NO: 4.
  • the engineered DNA molecule is further connected to a target gene fragment on the 5' end side of its Poly (A) tail coding sequence, and the target gene fragment and the Poly (A) tail coding sequence jointly encode RNA.
  • the engineered DNA molecule is further connected to a target gene fragment on the 5' end side of its Poly (A) tail coding sequence, and the target gene fragment and the Poly (A) tail coding sequence jointly encode mRNA.
  • the target gene fragment comprises a protein coding sequence or a non-protein coding sequence such as a functional RNA coding sequence.
  • the target gene fragment further comprises a 5'UTR coding sequence on the 5' end side of the protein coding sequence or the functional RNA coding sequence.
  • the target gene fragment further comprises a 3'UTR coding sequence on the 3' end side of the protein coding sequence or the functional RNA coding sequence. In some embodiments, the target gene fragment further comprises a 3'UTR coding sequence on the 3' end side of the protein coding sequence or the functional RNA coding sequence, and further comprises a 5'UTR coding sequence on the 5' end side of the protein coding sequence or the functional RNA coding sequence. In some embodiments, the engineered DNA molecule further comprises a replicon, such as a replication initiation site, such as an ORI.
  • the engineered DNA molecule further comprises a marker gene to facilitate screening of cells comprising the engineered DNA molecule, wherein the marker gene is selected from, for example, a resistance gene for antibiotics, a fluorescent protein, etc.
  • the DNA molecule further comprises Containing a promoter, the promoter starts the transcription of the target gene fragment and the poly (A) tail coding sequence co-encoding RNA.
  • the promoter is a prokaryotic promoter.
  • the promoter is a eukaryotic promoter.
  • the DNA molecule further comprises a replicon such as a replication initiation site, a promoter, a 5'UTR coding sequence, a protein coding sequence, and a 3'UTR coding sequence. In some embodiments, the DNA molecule further comprises a replicon such as a replication initiation site, a resistance gene, a 5'UTR coding sequence, a protein coding sequence, and a 3'UTR coding sequence. In some embodiments, the DNA molecule further comprises a replicon such as a replication initiation site, a resistance gene, a promoter, a 5'UTR coding sequence, a protein coding sequence, and a 3'UTR coding sequence.
  • the protein coding sequence encodes an HPV virus antigen protein. In some embodiments, the HPV protein is derived from type 16 and/or type 18 HPV. In some embodiments, the protein coding sequence encodes the E2, E6 or E7 protein of HPV. In some embodiments, the protein coding sequence encodes a fusion protein of HPV E6 and E7 proteins. In some embodiments, the protein coding sequence encodes a fusion protein of HPV E2, E6 and E7 proteins. In some embodiments, the polypeptide fragment of the fusion protein is derived from HPV type 16 and/or type 18. In some embodiments, the polypeptide fragment of the fusion protein is derived from HPV type 16 and/or type 18 E2, E6 and E7 proteins. In some embodiments, the protein coding sequence encodes a polypeptide as shown in SEQ NO: 26 or a conservatively substituted variant thereof.
  • the engineered DNA molecule comprises a polynucleotide sequence as shown in any one of SEQ ID NOs: 22-25, or a synonymous mutant of the polynucleotide sequence as shown in any one of SEQ ID NOs: 22-25, or a polynucleotide sequence having more than 85% sequence identity with the polynucleotide sequence shown in any one of SEQ ID NOs: 22-25 or its synonymous mutant.
  • the DNA molecule is a DNA plasmid. In some embodiments, the DNA molecule is a linear plasmid or a circular plasmid. In some embodiments, the DNA molecule is single-stranded or double-stranded. In some embodiments, the plasmid is a plasmid based on pUC, pTZ, pMB1 or pCoIE1. In some embodiments, the plasmid is a plasmid based on a pUC57 vector.
  • the cell is a prokaryotic cell. In some embodiments, the cell is a recA-bacterium. In some embodiments, the cell is Escherichia coli. In some embodiments, the Escherichia coli is selected from: K-12 and its derivatives, and B strain and its derivatives.
  • the Escherichia coli is selected from: MG1655, DH5 or DH5 ⁇ , DH10B, BL21, DB3.1, HB101, JM109, JM110, MC1061, MG1655, Pir1, Stbl2, Stbl3, Top10, XL1Blue, XL10Gold, BLR, HMS174, Tuner, Rostetta2, Lemo21, T7Express, Origami2.
  • the second aspect of the present application discloses a cell comprising the DNA molecule of the aforementioned first aspect.
  • the DNA molecule of the aforementioned first aspect can be replicated and/or transcribed in the cell.
  • the cell is a prokaryotic cell, and the DNA molecule of the aforementioned first aspect can be replicated in the prokaryotic cell.
  • the cell is a recA-bacterium.
  • the cell is Escherichia coli.
  • the prokaryotic cell is a competent cell.
  • the prokaryotic cell is an engineered cell.
  • the prokaryotic cell is an engineered prokaryotic cell.
  • the cell is Escherichia coli
  • the Escherichia coli is selected from: K-12 and its derivative strains, and B strain and its derivative strains.
  • the Escherichia coli is selected from: MG1655, DH5 or DH5 ⁇ , DH10B, BL21, DB3.1, HB101, JM109, JM110, MC1061, MG1655, Pir1, Stbl2, Stbl3, Top10, XL1Blue, XL10Gold, BLR, HMS174, Tuner, Rostetta2, Lemo21, T7Express, Origami2.
  • the third aspect of the present application provides a Poly(A) tail.
  • the Poly(A) tail :
  • the further modification comprises replacing one or more ribonucleotides in the Poly (A) tail obtained by (1) or (2) with one or more deoxyribonucleotides.
  • the one or more ribonucleotides are replaced with deoxyribonucleotides corresponding to the one or more ribonucleotides, for example, one or more ribonucleotides A in the Poly (A) tail are replaced with deoxyribonucleotides A, one or more ribonucleotides U are replaced with deoxyribonucleotides T, one or more ribonucleotides C are replaced with deoxyribonucleotides C, one or more ribonucleotides G are replaced with deoxyribonucleotides G, or one or more ribonucleotides G are replaced with ribonucleotides I (inosine) or deoxyribonucleotides I.
  • the modification is a chemical modification. In some embodiments, the modification is a base editing. In some embodiments, the modification is a deamination treatment of one or more ribonucleotides in the Poly (A) tail obtained by (1) or (2).
  • the Poly(A) tail comprises a polynucleotide sequence selected from any one of SEQ ID NOs: 1-10. In some embodiments, the polynucleotide sequence of the Poly(A) tail is as shown in any one of SEQ ID NOs: 1-10.
  • the Poly (A) tail is used to make the RNA molecule more stable, wherein the Poly (A) tail is located at the 3' end of the RNA, and the more stable is more stable outside the cell, inside the cell, or in the animal body relative to the RNA molecule containing another Poly (A) tail.
  • the Poly (A) tail is used to reduce the stability of the RNA molecule, wherein the Poly (A) tail is located at the 3' end of the RNA, and the reduction in stability is relative to the RNA molecule containing another Poly (A) tail, and the stability is reduced outside the cell, inside the cell, or in the animal body.
  • the Poly (A) tail is used to increase the expression of the RNA molecule in the same time, and the increased expression is relative to the RNA molecule containing another Poly (A) tail, and the expression is increased outside the cell, inside the cell, or in the animal body.
  • the Poly (A) tail is used to reduce the expression of an RNA molecule in the same period of time, and the increased expression is relative to the RNA molecule containing another Poly (A) tail, and the expression is reduced outside the cell, inside the cell, or in the animal body.
  • the Poly (A) tail is used to extend the expression time of an RNA molecule, and the extended expression time is relative to the RNA molecule containing another Poly (A) tail, and the expression time is extended outside the cell, inside the cell, or in the animal body.
  • the Poly (A) tail is used to shorten the expression time of an RNA molecule, and the extended expression time is relative to the RNA molecule containing another Poly (A) tail, and the expression time is shortened outside the cell, inside the cell, or in the animal body.
  • the Poly (A) tail is used to extend the half-life of an RNA molecule, and the half-life extension is relative to the half-life extension of an RNA molecule containing another Poly (A) tail, outside the cell, inside the cell, or in the animal body.
  • the Poly(A) tail is used to shorten the half-life of an RNA molecule, wherein the shortened half-life is shortened outside a cell, inside a cell, or in an animal body relative to an RNA molecule comprising another Poly(A) tail.
  • the RNA molecule is an mRNA molecule.
  • the Poly(A) tail and the other Poly(A) tail are two different Poly(A) tails belonging to the Poly(A) tail of the third aspect of the present application.
  • the Poly(A) tail is a Poly(A) tail belonging to the third aspect of the present application, and the other Poly(A) tail is a Poly(A) tail other than the Poly(A) tail of the third aspect of the present application.
  • the intracellular is a host cell, and the host cell is a eukaryotic cell.
  • the host cell is Mammalian cells.
  • the host cell is a human cell.
  • the present application also provides a DNA fragment or a hybrid molecule fragment of DNA and RNA encoding the poly (A) tail of the third aspect of the present application, and the use of the DNA fragment or the hybrid molecule fragment of DNA and RNA to make the replication of the DNA molecule encoding RNA or the hybrid molecule of DNA and RNA in the host cell more conservative, in which the DNA fragment or the hybrid molecule fragment of DNA and RNA encoding the poly (A) tail of the third aspect of the present application is located on the 3' end of the RNA coding sequence in the DNA molecule or the hybrid molecule of DNA and RNA.
  • the host cell is a prokaryotic cell. In some embodiments, the host cell is a recA-bacterium.
  • the host cell is Escherichia coli.
  • the Escherichia coli is selected from: K-12 and its derivative strains, and B strain and its derivative strains.
  • the Escherichia coli is selected from: MG1655, DH5 or DH5 ⁇ , DH10B, BL21, DB3.1, HB101, JM109, JM110, MC1061, MG1655, Pir1, Stbl2, Stbl3, Top10, XL1Blue, XL10Gold, BLR, HMS174, Tuner, Rostetta2, Lemo21, T7Express, Origami2.
  • the fourth aspect of the present application also provides an RNA molecule, wherein the RNA molecule comprises the Poly (A) tail of the third aspect.
  • the RNA molecule is an mRNA molecule.
  • the RNA molecule :
  • the further modification comprises replacing one or more ribonucleotides in the RNA molecule obtained by (1) or (2) with one or more deoxyribonucleotides.
  • the one or more ribonucleotides are replaced with deoxyribonucleotides corresponding to the one or more ribonucleotides, for example, one or more ribonucleotide A of the RNA molecule is replaced with deoxyribonucleotide A, one or more ribonucleotide U is replaced with deoxyribonucleotide T, one or more ribonucleotide C is replaced with deoxyribonucleotide C, one or more ribonucleotide G is replaced with deoxyribonucleotide G, or one or more ribonucleotide G is replaced with ribonucleotide I (inosine) or deoxyribonucleotide I.
  • the modification is a chemical modification. In some embodiments, the modification is a base editing. In some embodiments, the modification is to one or more of the RNA molecules obtained by (1) or (2). In some embodiments, the further modification is a post-transcriptional modification. In some embodiments, the further modification comprises a capping process. In some embodiments, the further modification comprises splicing. In some embodiments, the further modification comprises splicing and capping.
  • the RNA molecule includes coding RNA or non-coding RNA (ncRNA). In some embodiments, the RNA molecule is Pre-mRNA. In some embodiments, the RNA is mature mRNA. In some embodiments, the RNA molecule is long noncoding RNA (lncRNA). In some embodiments, the RNA molecule further comprises a 5'-cap structure. In some embodiments, the polynucleotide sequence of the RNA molecule is as shown in any one of SEQ ID NO: 22-25. In some embodiments, the RNA molecule comprises a polynucleotide sequence as shown in any one of SEQ ID NO: 22-25.
  • the present application also provides a hybrid molecule of DNA and RNA, which carries the same genetic information as the engineered DNA molecule of the first aspect, the same genetic information as the Poly (A) tail of the third aspect, or the same genetic information as the RNA molecule of the fourth aspect.
  • the present application also provides a nucleic acid molecule library.
  • the nucleic acid molecule library comprises the engineered DNA molecule of the aforementioned first aspect, the Poly (A) tail of the aforementioned third aspect, a DNA fragment encoding the Poly (A) tail of the third aspect of the present application or a hybrid molecule fragment of DNA and RNA, or the RNA molecule of the aforementioned fourth aspect.
  • the present application also provides a method for regulating protein expression, comprising: introducing a plurality of nucleic acid molecules in the aforementioned nucleic acid molecule library into a target cell at different times and/or in different quantity ratios.
  • the nucleic acid molecule is the engineered DNA molecule of the aforementioned first aspect.
  • the nucleic acid molecule is the RNA molecule of the aforementioned fourth aspect.
  • FIG. 1 shows the complexation of 10 poly(A)s in this application in E. coli DH5 ⁇ in Example 2. Control stability;
  • FIG2 shows the base deletion statistics of 10 poly(A)s in the present application in Example 2 in Escherichia coli DH5 ⁇ ;
  • Figure 3 shows the replication stability of poly(A) P1, P2, P3, P4 and poly(A) controls C1 and C2 in Example 3 in Escherichia coli DH5 ⁇ under a plasmid system including HPV antigen sequences;
  • FIG4 shows the base deletion statistics of poly(A) P3, P4 and poly(A) controls C1 and C2 in Example 3 in E. coli DH5 ⁇ under the HPV antigen sequence plasmid system;
  • FIG5 shows the comparison of the replication stability of poly(A) P1, P2, P3 and poly(A) control C1 in Example 3 under two temperature conditions of 30° C. and 37° C. in E. coli DH5 ⁇ in a plasmid system including an HPV antigen sequence;
  • FIG. 6 shows an example of a map of a universal vector plasmid DNA in the Examples.
  • FIG7 shows the animal imaging results of the expression levels of luciferase in mice with poly(A) P3, P4, P5, P8, P9 and control C2 in Example 4;
  • Figure 8 shows the quantitative results of fluorescence intensity after animal imaging of the expression levels of luciferase carrying poly(A) P3, P4, P5, P8, P9 and control C2 in mice in Example 4 (ns, no significant difference; ⁇ , significant difference, p ⁇ 0.01).
  • the present application first provides a method for stably amplifying a poly (A) tail transcription template DNA in vitro to reduce the mutation frequency of the poly (A) tail transcription template sequence when the DNA is replicated in large quantities in cells.
  • a large number of RNAs containing poly (A) tails with determined sequences are obtained based on the DNA.
  • RNAs, such as mRNAs, with poly (A) tails having certain specific functions that have been engineered and designed can be mass-produced by in vitro fermentation.
  • the present application also provides uses of the aforementioned Poly(A) tail, RNA, DNA, and library.
  • Element a is composed of a plurality of consecutive adenine (A) nucleotides, and the length of element a is ⁇ 20nt; element b is composed of a plurality of consecutive A nucleotides, and the length of element b is 3nt ⁇ b ⁇ 20nt; element c is composed of a non-A nucleotide, and the nucleotide is selected from T, C, and G nucleotides; element d is composed of any two or more consecutive nucleotides, and the nucleotides are selected from A, T, C, and G nucleotides, wherein the nucleotides at the 5' and 3' ends of element d are not A nucleotides, and element d does not contain more than 3 consecutive A nucleotides, and the
  • the two or more elements when describing the positional relationship of two or more elements as "non-adjacent", it means that the two or more elements are not adjacent to each other. In other words, the two or more elements contain at least one or more other nucleotides or bases other than the nucleotides of the two elements.
  • nucleotides in all polynucleotide sequences are numbered from the 5' end to the 3' end, that is, the nucleotide at the 5' end is the first nucleotide, and the nucleotide at the 3' end is the last nucleotide.
  • “5'end” and “5'end” can be used interchangeably;
  • “3'end” and “3'end” can be used interchangeably.
  • “5'end” "5'end” and “3'end” focus on describing the relative position relationship between nucleotides, between nucleotide sequence segments, or between nucleotides and nucleotide sequence segments in the same nucleic acid sequence; "5'end” and “3'end” are used to describe the positions of the first and last nucleotides of a nucleic acid sequence or a segment of a nucleic acid sequence, respectively. "5' end side” is used to describe the relative position relationship between two sequences that do not overlap with each other in the same polynucleotide sequence.
  • RNA molecule comprises the poly (A) tail, then relative to the coding DNA of an RNA molecule that does not comprise the poly (A) tail (e.g., an RNA molecule comprising some other poly (A) tail), the probability that the daughter DNA molecule has 100% sequence identity with the parent DNA molecule is higher; or the average sequence identity between the multiple daughter DNA molecules obtained by replication of the parent DNA molecule and the parent DNA molecule is higher.
  • the “regulation” means: increasing or decreasing the total amount of protein or functional RNA expressed by the RNA molecule within the same length of time; or allowing the RNA to express protein or functional RNA within a longer or shorter time frame, and the increase, decrease, or longer or shorter time frame is compared to another RNA molecule expressing the same protein or functional RNA.
  • “regulating" protein expression it refers to regulating the expression of RNA molecules containing the protein coding sequence.
  • the regulatory effect described here can be achieved by connecting the Poly (A) tail of the present application to the 3' end of an RNA molecule that does not contain a Poly (A) tail, or by The Poly(A) tail is replaced with the Poly(A) tail of the present application.
  • the percentage of "identity”, such as 85%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 98.5%, 99%, 99.5% identity refers to the degree of similarity between amino acid sequences or between nucleotide sequences determined by sequence alignment, which is 85%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 98.5%, 99%, 99.5%.
  • the proportion of the number of positions with the same base or amino acid residue to the total number of positions determined after two sequences have the same residue at as many positions as possible by introducing spaces, etc.
  • the percentage of "identity" can be determined using software programs known in the art.
  • substantially complementary refers to a degree of complementarity of at least about 70%, 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98%, 99% or 100% in a region of about 40, 50, 60, 70, 80, 100, 150, 200, 250 or more nucleotides, or refers to two nucleic acids that hybridize under stringent conditions.
  • a single base or a single nucleotide according to the Watson-Crick base pairing principle, when A is paired with T or U, C is paired with G or I, it is called complementary or matching, and vice versa; and other base pairings are called non-complementary.
  • the "complementary polynucleotide sequence" of a polynucleotide sequence refers to a polynucleotide sequence that is completely complementary to the polynucleotide sequence.
  • a “conservative substitution variant" of a protein, polypeptide or amino acid sequence is one in which one or more amino acid residues are substituted with amino acids without changing the overall conformation and function of the protein or enzyme, including but not limited to replacing the amino acids in the amino acid sequence of the parent protein in the manner described by the aforementioned "conservative substitution". Therefore, the similarity of two proteins or amino acid sequences with similar functions may be different. For example, a similarity (identity) of 70% to 99% based on the MEGALIGN algorithm.
  • Constant substitution variants also include polypeptides or enzymes with more than 60% amino acid identity as determined by the BLAST or FASTA algorithms, preferably more than 75%, preferably more than 85%, or even more than 90%.
  • the invention relates to a protein or enzyme which is optimal for the production of a novel polypeptide and which has the same or substantially similar properties or functions as the native or parent protein or enzyme.
  • DNA and RNA refer to single-stranded or double-stranded DNA or RNA molecules.
  • DNA and DNA molecules refer to double-stranded DNA molecules consisting of A, C, G and/or T nucleotides
  • RNA and RNA molecules refer to single-stranded RNA molecules consisting of A, C, G and/or U nucleotides.
  • A, C, G, T and U nucleotides refer to nucleotides containing adenine, guanine, cytosine, thymine and uracil as their respective nitrogenous bases.
  • DNA and RNA hybrid molecule is a molecule comprising a polynucleotide sequence consisting of deoxyribonucleotides and ribonucleotides.
  • the DNA and RNA hybrid molecule can be obtained by:
  • hybrid molecule of DNA and RNA is not limited to the above method, and the hybrid molecule of DNA and RNA obtained by any method belongs to the category of "hybrid molecule of DNA and RNA" defined in this application.
  • nucleic acid molecules are described as having "the same genetic information"
  • the two nucleic acid molecules are complementary, or contain exactly the same base sequence, or one or more thymines in the base sequence of one of the nucleic acid molecules are converted to uracil to obtain a nucleic acid molecule with exactly the same base sequence as another nucleic acid molecule. Therefore, any two of DNA, RNA, and hybrid molecules of DNA and RNA can have the same genetic information.
  • base sequence refers to the order of arrangement of bases in a polynucleotide molecule.
  • RNA such as mRNA
  • U uracil
  • Poly A tail or "Poly (A) sequence” refers to an uninterrupted or uninterrupted sequence of adenylic acid residues that is typically located at the 3'-end of an RNA molecule.
  • the Poly-A sequence is attached to the 3' end of the 3'-UTR.
  • An uninterrupted poly-A tail is characterized by continuous adenylic acid residues.
  • the Poly-A tail can be of any length.
  • the Poly-A tail comprises, or consists of, at least 20, at least 30, at least 40, at least 80 or at least 100 and at most 500, at most 400, at most 300, at most 200 or at most 150 adenylic acid (A), particularly about 120 A.
  • the vast majority of nucleotides in the PolyA tail are adenosine, wherein the vast majority refers to at least 75%, at least 80%, at least 85%, at least 90% of the nucleotides, etc., but the remaining nucleotides are allowed to be nucleotides other than A (non-A nucleotides), such as U (uridylic acid), G (guanylic acid) or C (cytidylic acid).
  • the in vitro preparation process of the RNA is a prokaryotic fermentation process, that is, the coding nucleic acid of the RNA molecule containing the Poly (A) tail is introduced into a prokaryotic cell, and the prokaryotic cell is amplified to achieve the purpose of amplifying the coding nucleic acid, and then the amplified coding nucleic acid is transcribed into the RNA.
  • the in vitro preparation process of the RNA is to connect the RNA fragment containing the protein coding sequence with the Poly (A) tail by homologous recombination, enzyme ligation, or other non-homologous recombination, and the Poly (A) tail is prepared by a prokaryotic fermentation process, and the coding nucleic acid containing the Poly (A) tail is introduced into a prokaryotic cell during the prokaryotic fermentation process, and the prokaryotic cell is amplified to achieve the purpose of amplifying the coding nucleic acid, and then the amplified coding nucleic acid is transcribed into RNA containing the Poly (A) tail.
  • the aforementioned coding nucleic acid is linear.
  • the aforementioned coding nucleic acid is circular. In some embodiments, the aforementioned coding nucleic acid is a plasmid. In some embodiments, the aforementioned coding nucleic acid is single-stranded or double-stranded. In some embodiments, the aforementioned coding nucleic acid is chemically modified before being introduced into a prokaryotic cell. In some embodiments, the aforementioned encoding nucleic acid before being introduced into the prokaryotic cell is chemically synthesized. In some embodiments, the encoding nucleic acid is inserted into the nucleoid/nucleoid genomic DNA of the prokaryotic cell. In some embodiments, the encoding nucleic acid is free in the cytoplasm of the prokaryotic cell or outside the nucleoid/nucleoid. In some embodiments, the prokaryotic cell is Escherichia coli.
  • the present application provides a series of Poly(A) tails, which are highly conserved during the in vitro preparation of RNA.
  • the Poly(A) tails contain one or more non-A nucleotides at one or more positions.
  • the Poly(A) tail encoding sequence comprises:
  • Poly(A) tail coding sequence does not contain elements b adjacent to each other, does not contain elements c adjacent to each other, and does not contain elements d adjacent to each other.
  • the Poly (A) tail coding sequence further comprises a unique element e, wherein the element e consists of one or two consecutive A's and is located at the 3' end of the Poly (A) tail coding sequence and is adjacent to element d or element c.
  • the Poly(A) tail here can be a segment of RNA or a hybrid molecule of DNA and RNA.
  • the present application also provides a Poly(A) tail that can regulate protein expression levels.
  • the present application also provides a Poly(A) tail that can regulate protein expression levels while being highly conservative during in vitro preparation.
  • the Poly(A) tail that can regulate protein expression levels; or the structure of the Poly(A) tail that can regulate protein expression levels while being highly conservative during in vitro preparation is selected from:
  • the poly (A) tail of the regulatable protein expression level; or The structure of the Poly(A) tail, which is highly conserved during the in vitro preparation process and can regulate protein expression levels, is: 60A-element d-19A-G-19A-G-17A or 19A-G-19A-G-19A-element d-60A.
  • yA represents the number of consecutive A's in element a or element b, where y is a natural number, for example, 19A means that it contains 19 consecutive A's; 60A means that it contains 60 consecutive A's.
  • the Poly(A) tail that can regulate protein expression levels; or the polynucleotide coding sequence of the Poly(A) tail that can regulate protein expression levels that is highly conserved during the in vitro preparation process is selected from a polynucleotide sequence as shown in any one of SEQ ID NOs: 1-10.
  • the present application also provides the use of the above-mentioned Poly (A) tail for regulating protein expression, in which the Poly (A) tail is located at the 3' end of the mRNA, such as the 3' end of the 3' UTR.
  • the method for regulating protein expression described below is used.
  • the present application also provides an engineered DNA molecule that can replicate in a cell, which comprises the coding sequence of the aforementioned Poly A tail or its complementary sequence.
  • the engineered DNA molecule should also contain structural elements necessary for the DNA molecule to replicate in a cell, or to replicate efficiently.
  • the structural elements necessary for the engineered DNA molecule to replicate or replicate efficiently in a cell are known in the art, including, for example, an origin of replication (ORI).
  • the engineered DNA molecule further comprises a marker gene or a fragment thereof and/or a reporter gene or a fragment thereof, and a unique restriction endonuclease site that allows the insertion of DNA elements, preferably a restriction endonuclease site in the form of a multiple cloning site (MCS).
  • the marker gene is conducive to identifying cells containing a plasmid containing the marker gene, and can be selected from, for example, an antibiotic resistance gene.
  • Each restriction endonuclease site in the MCS can be specifically recognized by different restriction endonucleases.
  • the DNA molecule is a DNA plasmid.
  • DNA plasmid refers to a plasmid composed of a double-stranded DNA molecule.
  • the "plasmid” is a circular DNA molecule.
  • the "plasmid” can also encompass linear DNA molecules.
  • the term “plasmid” also encompasses molecules obtained by linearizing the circular plasmid, such as by cutting the circular plasmid with a restriction endonuclease, thereby converting the circular plasmid molecule into a linear molecule, and linear molecules that can replicate in prokaryotes.
  • Plasmids can replicate, i.e., in cells independently of prokaryotes.
  • the invention relates to amplifying the genomic genetic information stored in the nucleoid or nucleoid of a cell and can be used for cloning, i.e., for amplifying genetic information in bacterial cells.
  • the DNA plasmid according to the present application is a medium copy or high copy plasmid, more preferably a high copy plasmid.
  • high copy plasmids are such vectors: they are based on pUC, pTZ plasmids or any other plasmids (e.g., pMB1, pCoIE1) containing an ORI that supports high copy of the plasmid, etc.
  • the DNA molecule is a DNA molecule or a fragment thereof constituting a nucleoid or a nucleoid of a prokaryotic organism, that is, the coding sequence or its complementary sequence comprising the aforementioned Poly (A) tail can be replicated along with the prokaryotic genome.
  • the DNA molecule is further connected to a target gene fragment on the 5' end side of the Poly (A) tail coding sequence, and the target gene fragment and the Poly (A) tail coding sequence jointly encode RNA.
  • the target gene fragment and the Poly (A) tail coding sequence jointly encode mRNA.
  • the target gene fragment comprises a coding sequence of a protein, a polypeptide or a fragment thereof.
  • the target gene fragment also comprises a coding sequence of an element that can be used to start or regulate the expression of the protein, polypeptide or fragment thereof after transcription, and the element includes but is not limited to 5'UTR, 3'UTR, etc.
  • the target gene fragment comprises a coding sequence of at least one untranslated region (UTR). In some embodiments, the target gene fragment comprises at least a coding sequence of a 5'UTR and a coding sequence of a protein, polypeptide or a fragment thereof. In some embodiments, the target gene fragment comprises at least a coding sequence of a 5'UTR, a coding sequence of a protein, a polypeptide or a fragment thereof, and a coding sequence of a 3'UTR from 5' to 3'.
  • UTR untranslated region
  • the coding sequence of the protein, polypeptide or its fragment can be finally translated into one or more proteins, or one or more polypeptides, such as short peptides, oligopeptides, polypeptides, fusion proteins, proteins and fragments thereof, such as parts of known proteins, such as functional parts.
  • the functional part can be, for example, a biologically active part of a protein or an antigenic part that can effectively produce antibodies, such as an antigenic epitope.
  • the two ends of the coding sequence of the protein, polypeptide or its fragment respectively include a start codon (5' end) and a stop codon (3' end), which are the first three nucleotides and the last three nucleotides that can be translated of the mRNA molecule, respectively.
  • the 5'UTR usually contains at least one ribosome binding site (RBS), such as the Shine-Dalgarno sequence in prokaryotes, or at least one translation start site, such as the Kozak sequence in eukaryotes.
  • RBS ribosome binding site
  • translation start site such as the Kozak sequence in eukaryotes.
  • RBS promotes the efficient and accurate translation of mRNA molecules by recruiting ribosomes at the start of translation. Its activity can be optimized by changing the length and sequence of a given RBS or translation revelation site and the distance from the start codon.
  • the 5'UTR includes an internal ribosome entry site or IRES.
  • the 3'UTR may contain one or more regulatory sequences, such as binding sites for amino acid sequences that enhance the stability of the mRNA molecule, regulatory Binding sites for RNA molecules (such as miRNA molecules), and/or signal sequences involved in the intracellular trafficking of mRNA molecules.
  • regulatory sequences such as binding sites for amino acid sequences that enhance the stability of the mRNA molecule, regulatory Binding sites for RNA molecules (such as miRNA molecules), and/or signal sequences involved in the intracellular trafficking of mRNA molecules.
  • the target gene fragment further comprises one or more additional regulatory sequences, such as binding sites for amino acid sequences that enhance the stability of mRNA molecules, binding sites for amino acid sequences that enhance the translation of mRNA molecules, regulatory elements (such as riboswitches), binding sites for regulatory RNA molecules (such as miRNA molecules), and/or nucleotide sequences that have a positive impact on translation initiation.
  • additional regulatory sequences such as binding sites for amino acid sequences that enhance the stability of mRNA molecules, binding sites for amino acid sequences that enhance the translation of mRNA molecules, regulatory elements (such as riboswitches), binding sites for regulatory RNA molecules (such as miRNA molecules), and/or nucleotide sequences that have a positive impact on translation initiation.
  • regulatory elements such as riboswitches
  • binding sites for regulatory RNA molecules such as miRNA molecules
  • nucleotide sequences that have a positive impact on translation initiation within the 5'UTR, preferably there is no functional upstream open reading frame, an out-of
  • the coding sequence of the protein, polypeptide or fragment thereof comprises codons that can be translated into an amino acid sequence. All the codons contained in the coding sequence may be naturally occurring codons encoding amino acids, or may be partially or entirely composed of artificially synthesized codons. In some embodiments, some or all of the codons are codon optimized. In some embodiments, some or all of the codons encode non-natural amino acids.
  • the DNA molecule further comprises a structural element necessary for initiating or regulating transcription of the RNA on the 5' end side of the target gene fragment, and the structural element is known in the art.
  • the structural element at least comprises a promoter. Promoters and their sequences are known in the art, including weak promoters, medium strength promoters, strong promoters, mini promoters or core promoters, etc. In some specific embodiments, the promoter is a strong promoter.
  • the promoter can initiate transcription of the target gene fragment and/or Poly (A) tail in prokaryotes. In some embodiments, the promoter can initiate transcription of the target gene fragment and/or Poly (A) tail in eukaryotic cells.
  • the "promoter” comprises at least one transcription recognition site and a subsequent transcription factor binding site.
  • the recognition and binding sites can interact with an amino acid sequence that mediates or regulates transcription. Compared with the recognition site, the binding site is closer to the aforementioned target gene fragment.
  • the binding site can be, for example, a Pribnow box in prokaryotes or a TATA box in eukaryotes.
  • the transcription recognition site can be located about 35 bp upstream of the transcription start site
  • the transcription factor binding site can be located about 10 bp upstream of the transcription start site.
  • the promoter comprises at least one additional regulatory element, such as an AT-rich upstream element located about 40 and/or 60 nucleotides before the transcription start site, and/or an additional regulatory element that enhances promoter activity located between the recognition site and the binding site.
  • the promoter is a strong promoter, i.e., the promoter comprises a sequence that promotes the transcription of the aforementioned RNA coding sequence. Strong promoters are known to those skilled in the art, such as OXB18, OXB19 and OXB20 promoters derived from the RecA promoter of Escherichia coli, or can be identified or synthesized by routine laboratory procedures.
  • the promoter is a T7 promoter.
  • the promoter also comprises other regulatory elements, such as an enhancer that is included in a DNA plasmid that promotes the transcription of the aforementioned RNA coding sequence.
  • the present application also provides a library comprising the aforementioned engineered DNA molecules.
  • the library comprises at least two DNA molecules having different Poly (A) tail encoding sequences.
  • the present application also provides the use of the above-mentioned engineered DNA molecules in the stable amplification of the poly (A) tail coding sequence or the coding sequence of the RNA with the poly (A) tail.
  • the method for amplifying the poly (A) tail coding sequence or the coding sequence of the RNA with the poly (A) tail is as shown in the method for stably amplifying the poly (A) tail transcription template DNA in vitro later.
  • RNA comprising the aforementioned Poly(A) tail and a target gene fragment on the 5' end of the Poly(A) tail coding sequence.
  • the RNA further comprises a 5'-cap structure.
  • the RNA is mRNA.
  • mRNA messenger RNA
  • messenger RNA is any RNA, naturally occurring, non-naturally occurring or modified, that encodes at least one protein, polypeptide, or fragment thereof, and the mRNA has the ability to be translated to produce the encoded protein, polypeptide, or fragment thereof in vitro, in vivo, in situ, or ex vivo. Therefore, the mRNA can be a mature mRNA or a pre-mature mRNA, and the elements or structures that it must contain or selectively contain are known in the art. In some embodiments, the mRNA contains coding sequences for multiple necessary functional components to express, regulate, or enhance the expression level of the protein, polypeptide, or fragment thereof.
  • the functional components include, but are not limited to, a 5' cap, a 5'UTR, a 3'UTR, and the like. Both 5'UTR and 3'UTR are typically transcribed from genomic DNA and are elements that pre-mature mRNA possesses. As a mature mRNA,
  • 5'cap is located at the 5' end of the mRNA and contains methylated guanylate, which is linked to the 5' end of the mRNA via pyrophosphate to form a 5',5'-triphosphate linkage with its adjacent nucleotide.
  • There are usually three types of 5' cap structures m7G5'ppp5'Np, m7G5'ppp5'NmpNp, m7G5'ppp5'NmpNmpNp), which are called type O, type I and type II, respectively.
  • Type O means that the ribose of the terminal nucleotide is not methylated
  • type I means that the ribose of one terminal nucleotide is methylated
  • type II means that the ribose of both terminal nucleotides is methylated.
  • the 5' cap can be used according to the manufacturer's protocol.
  • RNA cap analogs simultaneously complete 5' capping of the polynucleotide during the in vitro transcription reaction to produce a 5'-guanosine cap structure: 3'-O-Me-m7G(5')ppp(5')G [ARCA cap], G(5')ppp(5')A, G(5')ppp(5')G, m7G(5')ppp(5')A, m7G(5')ppp(5')G (New England BioLabs, Ipswich, MA), or m7G(5')ppp(5')(2'-OMeA)pG (CleanCapAG).
  • 5' capping of the modified RNA can be completed post-transcriptionally using a vaccinia virus capping enzyme to produce an O-type cap structure: m7G(5')ppp(5')G (New England BioLabs, Ipswich, MA).
  • Type I cap structures can be generated using both vaccinia virus capping enzyme and 2'-O methyl-transferase to generate m7G(5')ppp(5')(2'-OMeA)pG.
  • Type II cap structures can be generated from type I cap structures by 2'-O-methylation of the 5'-third to last nucleotide using 2'-O methyl-transferase.
  • Type III cap structures can be generated from type II cap structures by 2'-O-methylation of the 5'-fourth to last nucleotide using 2'-O methyl-transferase.
  • some or all of the uridine in the mRNA is chemically modified uridine.
  • some or all of the uridine in the mRNA is pseudouridine or 1-methyl-pseudouridine.
  • part or all of the uracil nucleotides of the mRNA are substituted with pseudouridine ( ⁇ ) nucleotides or N1-methylpseudouridine (m1 ⁇ ) nucleotides.
  • the mRNA further comprises a stabilizing element.
  • Stabilizing elements may include, for example, a histone stem-loop.
  • the mRNA comprises a coding region, at least one histone stem-loop and optionally a poly(A) sequence or a polyadenylation signal.
  • the poly(A) sequence or polyadenylation signal should generally enhance the expression level of the encoded protein.
  • the mRNA comprises a combination of a poly(A) sequence or a polyadenylation signal and at least one histone stem-loop, although both have alternative mechanisms in nature, their synergistic effect can increase protein expression to a level that exceeds that observed for any single element.
  • the synergistic effect of the combination of poly(A) and at least one histone stem-loop is independent of the order of the elements or the length of the poly(A) sequence.
  • the histone stem-loop is generally derived from a histone gene and comprises two adjacent partial or complete reverse complementary sequences separated by a spacer (composed of a short sequence) that form a loop by intramolecular base pairing. Unpaired loop regions are generally unable to base pair with any of the stem-loop elements.
  • the stability of the stem-loop structure generally depends on the length, the number of mismatches or bulges, and the base composition of the paired region.
  • wobble base pairing non-Watson-Crick base pairing
  • the at least one histone stem-loop sequence comprises 15 to 45 nucleotides in length.
  • one or more AU-rich sequences of the mRNA may be removed. These sequences are sometimes referred to as AURES, which are destabilizing sequences found in the 3'UTR. AURES can be removed from the mRNA. Alternatively, AURES can be retained in the mRNA.
  • the lipid nanoparticles generally include ionizable lipids, non-cationic lipids, sterols and PEG lipid components and target nucleic acids, such as the above-mentioned mRNA.
  • Lipid nanoparticles of the present disclosure can be produced using components, compositions and methods as generally known in the art, see, for example, PCT/US2016/052352, PCT/US2016/068300, PCT/US2017/037551, PCT/US2015/027400, PCT/US2016/047406, PCT/US2016000129, PCT/US2016/014280, PCT/US2017/037551, PCT/US2017/037552, PCT/US2017/037551 ...
  • the present application also provides a library comprising the aforementioned mRNA molecules.
  • the library comprises at least two mRNA molecules with different Poly (A) tails.
  • the present application also provides the use of the mRNA and the mRNA library.
  • At least two or more mRNA molecules having a Poly (A) tail with different influence gradients on the expression of mRNA can be used to adjust the expression of the coding sequence of the aforementioned protein, polypeptide or fragment thereof.
  • the application also provides cells comprising the aforementioned engineered DNA molecules, which can be stored and/or amplified in the cells.
  • the cells are prokaryotic cells, and the DNA molecules can be replicated in the prokaryotic cells.
  • the cells are prokaryotic cells, and the DNA molecules can be replicated and/or transcribed in the prokaryotic cells.
  • the DNA molecules are eukaryotic cells, and the DNA molecules can be replicated in the cells.
  • the DNA molecules can be transcribed and/or replicated in the DNA cells.
  • the cell is a prokaryotic cell.
  • the cell is a bacterium, an actinomycete, a cyanobacterium, a mycoplasma, a rickettsia, and a chlamydia.
  • the cell is selected from: Bacillus subtilis, lactobacillus, acetic acid bacteria, corynebacterium, brevis, arthrobacter, pseudomonas, and micrococcus.
  • the cell is a recA - bacterium.
  • the cell is an Escherichia coli.
  • the cell is an Escherichia coli, and the Escherichia coli is selected from: K-12 and its derivative strains, and B strains and their derivative strains.
  • the E. coli is selected from: MG1655, DH5 or DH5 ⁇ , DH10B, BL21, DB3.1, HB101, JM109, JM110, MC1061, MG1655, Pir1, Stbl2, Stbl3, Top10, XL1Blue, XL10Gold, BLR, HMS174, Tuner, Rostetta2, Lemo21, T7Express, Origami2, etc.
  • the cell is selected from Streptomyces, Micromonospora and Nocardia.
  • the cell is a fungus.
  • the cell is selected from yeast or mold.
  • the present application provides a method for stably amplifying a poly(A) tail transcription template DNA in vitro to reduce the mutation frequency of the poly(A) tail transcription template sequence when the DNA is replicated in large quantities in cells.
  • the method comprises: amplifying the cells containing the engineered DNA molecule.
  • the engineered DNA molecules are introduced into the cells.
  • the introduction may include chemical conversion or electro-conversion.
  • the introduction is a natural endocytosis process of the engineered DNA molecules by the cells.
  • the transduction comprises administering to the human body, and the administration is selected from: by intravenous, intraperitoneal, subcutaneous, intracranial, intrathecal, intraarterial (e.g., via the carotid artery), intramuscular and intratumoral injection or perfusion.
  • the present application also provides a method for regulating protein expression, comprising:
  • Two or more of the aforementioned RNA molecules are added at different times and/or in different quantitative ratios. into target cells.
  • the two or more aforementioned engineered DNA molecules and the two or more aforementioned RNA molecules have different Poly(A) tails, and the Poly(A) tails have different influence gradients on the expression amount of RNA.
  • the present application also provides a method for regulating protein expression, comprising introducing the aforementioned engineered DNA molecule or the aforementioned RNA molecule into a target cell.
  • the coding sequence of the Poly(A) tail encoded by the DNA and the Poly(A) tail contained in the RNA comprises a structure selected from the following:
  • the coding sequence of the Poly(A) tail encoded by the DNA and the Poly(A) tail contained in the RNA comprises a structure selected from the following:
  • the coding sequence of the Poly(A) tail encoded by the DNA and the Poly(A) tail contained in the RNA comprises the structure: 60A-element d-19A-G-19A-G-17A or 19A-G-19A-G-19A-element d-60A.
  • yA represents the number of consecutive A's in element a or element b, where y is a natural number, for example, 19A means that it contains 19 consecutive A's; 60A means that it contains 60 consecutive A's.
  • the coding sequence of the Poly(A) tail encoded by the DNA and the Poly(A) tail contained in the RNA comprises a polynucleotide sequence selected from the following, or consists of any one polynucleotide sequence selected from the following: SEQ ID NO: 1-10.
  • the Poly(A) tail and the DNA sequence encoding the Poly(A) tail shown in Table 1 below were constructed using conventional genetic engineering methods.
  • Example 2 Testing Poly(A) function using luciferase coding sequence as an example
  • luciferase as the protein coding region, the stability of different poly(A) variants in E. coli and their effects on luciferase expression in cells were investigated.
  • This universal vector uses the E. coli cloning vector pUC57 as the vector backbone, and the T7 promoter sequence (5’-TAATACGACTCACTATAAGG-3’), 5’UTR, luciferase protein, 3’UTR and polyadenylic acid string poly(dA:dT) are arranged in sequence between the Xba I restriction site and the EcoR I restriction site of its multiple cloning site.
  • C1C2 A60-10nt spacer-A60 (control C1), A30-10nt spacer-A70 (control C2), A60-1nt spacer-A60 (control C3) or A60-6nt spacer-A60 (control C4)
  • C1C2 is derived from the patent in the literature (U.S. Patent No. US 10717982B2)).
  • the vector plasmid constructed in step 2) is confirmed to be correct by sequencing, it is transformed into E. coli DH5 ⁇ , and the transformed plate is grown at 30°C to complete the plasmid extraction and sequencing.
  • the stability and base deletion of different poly (A) variants are analyzed and calculated according to the sequencing results; the replication stability is expressed by the percentage of clones without any base changes, and the higher the percentage, the higher the replication stability of the plasmid in E. coli.
  • the poly(A) variants designed in this application are superior to or equivalent to the prior art in terms of replication stability in E. coli cells.
  • the replication stability of poly(A) variants P3, P4, and P8 is the highest, and the replication stability of P3, P4, and P8 is equivalent to that of C2, with no statistically significant difference (p>0.05, ⁇ 2 test), and the replication stability of P3, P4, and P8 is better than that of the control C1, C3, and C4, and the difference is statistically significant (p ⁇ 0.05, ⁇ 2 test).
  • the HPV vector plasmid containing P1, P2, P3 and P4 constructed in step 1) was confirmed to be correct by sequencing and then transformed into E. coli DH5 ⁇ .
  • the transformed plates were grown at 30°C to complete plasmid extraction and sequencing. After sequencing, the stability and base deletion of different poly(A) variants were analyzed and calculated based on the sequencing results; the stability was expressed by the percentage of clones without any base changes, and the higher the percentage, the more stable it was.
  • the different poly(A) variants of the present application still maintain high replication stability; among them, P3 is comparable to C2 in stability, with no statistically significant difference (p>0.05, ⁇ 2 test), and has the best cloning stability compared with other groups of new variants of the present application.
  • the above results show that the poly(A) variants designed in the present application are universal in application in the embodiments for different protein coding regions.
  • Example 1 and Example 2 the plates transformed with E. coli were cultured overnight in a 30°C biochemical incubator, and the clones obtained were sequenced to evaluate replication stability.
  • the culture temperature of E. coli affects the DNA replication rate, it also affects the replication stability.
  • the present application also compared the sequencing detection of the plates transformed with E. coli after being cultured in a 37°C biochemical incubator. The results show ( Figure 5) that the 37°C culture greatly increased the base deletion ratio of the control C1, from 53% when cultured at 30°C to 98%.
  • the production of mRNA drugs must rely on large-scale fermentation to prepare a sufficient amount of template plasmids, and the stability of the plasmids during fermentation (in this application, the stability of the plasmids specifically refers to the stability of poly dA: dT) is crucial to the production of mRNA drugs with uniform quality.
  • the stability of the plasmids specifically refers to the stability of poly dA: dT
  • it is necessary to establish a strain library containing the target plasmid including strain libraries of different generations such as primary libraries and secondary libraries, so it is necessary to evaluate the plasmid stability in Escherichia coli of different generations.
  • the present application detected the stability of P1 and P3 between different generations of the fermentation process in Example 3.
  • each poly (A) variant selected 4 correct Escherichia coli clones and cultured them through fermentation.
  • the results show that in the 3rd, 5th, 7th, and 9th generations of seed passage, the plasmids of the 4 clones of P1 and P3 remain stable without any base changes.
  • poly(A) tail In eukaryotic cells, a certain length of poly(A) tail is essential for protecting the 3’ end of mRNA, maintaining mRNA stability and promoting protein expression. Under the influence of physiological or environmental factors in the body, poly(A) gradually becomes shorter, thereby triggering mRNA degradation.
  • This application investigates the effects of different poly(A) variants on protein expression levels.
  • the specific experimental process is as follows:
  • linearized DNA is obtained by digestion with type II restriction endonuclease BspQ I.
  • the 3’ end of the linearized DNA is different poly(A): control C2, P3, P4, P5, P8 or P9.
  • the linearized DNA is used as a template for in vitro transcription.
  • the 100 ⁇ l reaction system contains 1X reaction buffer, 5mM (final concentration) of ATP, CTP, N1M-UTP, GTP, CleanCap AG 4mM (final concentration), and 5 ⁇ l of in vitro transcriptase. After the reaction mixture is fully mixed, react at 37°C for 3h. The in vitro transcribed mRNA is collected by LiCl precipitation and finally dissolved in enzyme-free water.
  • LNP is lipid nanoparticles.
  • concentration and particle size of the mRNA encapsulated by LNP were measured using the Ribogreen RNA quantification kit (Invitrogen, R11490) and the Darwin ZetaSizer particle size analyzer.
  • the LNP is quality controlled by measuring the particle size, encapsulation efficiency, PDI and other indicators.
  • the quality control results show that the prepared LNP meets the standards of particle size range 50nm-150nm, PDI ⁇ 0.3, and encapsulation efficiency >90%, which can be used for subsequent experiments.
  • the mRNA content in the LNP was determined by the ribogreen method, and then the LNP was diluted to an mRNA content of 100ng/ ⁇ L.
  • mice were randomly divided into groups according to their body weight and administered after 2-3 days of adaptive feeding. Five mice were injected into each of C2, P3, P4, P5, P8, and P9. Each mouse was administered intramuscularly. 100 ⁇ l (10 ⁇ g mRNA), and the control group was injected with PBS into 5 mice. Animal imaging was performed 6h after injection, and the fluorescence value was calculated. The results showed ( Figure 7-8) that compared with the control C2, the expression activity of P3 was significantly increased by 1.8 times, with a statistically significant difference (p ⁇ 0.01, student's t-test), and the expression levels of P4, P5, P8, and P9 were comparable to those of C2, with no significant difference (p>0.05, student's t-test).
  • sequences used in the above examples of the present application are shown in the following sequence table. It should be understood that the following sequences are only exemplary sequences of the present application's embodiments, rather than any limitation to the present application's embodiments.
  • the nucleic acid sequences in the following sequence table may represent DNA sequences or RNA sequences, and when they represent RNA sequences, "T" therein represents uridine.

Landscapes

  • Health & Medical Sciences (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • Genetics & Genomics (AREA)
  • Chemical & Material Sciences (AREA)
  • Engineering & Computer Science (AREA)
  • Organic Chemistry (AREA)
  • Biomedical Technology (AREA)
  • Zoology (AREA)
  • Wood Science & Technology (AREA)
  • Bioinformatics & Cheminformatics (AREA)
  • Biotechnology (AREA)
  • General Engineering & Computer Science (AREA)
  • Molecular Biology (AREA)
  • General Health & Medical Sciences (AREA)
  • Biochemistry (AREA)
  • Biophysics (AREA)
  • Microbiology (AREA)
  • Physics & Mathematics (AREA)
  • Plant Pathology (AREA)
  • Medicinal Chemistry (AREA)
  • Virology (AREA)
  • Gastroenterology & Hepatology (AREA)
  • Proteomics, Peptides & Aminoacids (AREA)
  • General Chemical & Material Sciences (AREA)
  • Chemical Kinetics & Catalysis (AREA)
  • Tropical Medicine & Parasitology (AREA)
  • Cell Biology (AREA)
  • Micro-Organisms Or Cultivation Processes Thereof (AREA)
  • Preparation Of Compounds By Using Micro-Organisms (AREA)

Abstract

一种可在细胞中复制的工程化DNA分子,其包含Poly(A)尾编码序列,所述Poly(A)尾编码序列使得所述工程化DNA分子在细胞中,尤其在原核细胞中复制时更加保守,同时可调节RNA在真核细胞中表达水平。还提供了包含所述Poly(A)尾的RNA及其用途。

Description

编码RNA的工程化DNA分子 技术领域
本申请涉及生物技术领域,具体涉及一种包含Poly(A)尾的RNA。所述Poly(A)尾使得编码所述RNA的DNA在原核系统中的复制具有较高的稳定性,且可用于调节所述RNA在真核细胞中的表达水平。
背景技术
可翻译mRNA药物分子的一级结构由5’帽子结构、5’非编码区(5’UTR),编码区,3’非编码区以及多聚腺苷酸尾(poly(A)尾)组成,其中Poly(A)尾已知的作用包括维持mRNA分子的体内稳定性,参与蛋白翻译的起始过程,后者通过Poly(A)结合蛋白(poly A tail binding protein,PABP)与翻译起始复合物的相互作用实现。真核生物细胞内,Poly(A)尾以转录后修饰的方式,在典型Poly(A)聚合酶作用下合成。
体外制备mRNA药物的第一步,是以含有设计好的产品序列的线性化质粒为模板通过体外转录(IVT)合成,Poly(A)尾则通常以共转录的方式添加在3’UTR下游。为实现共转录添加Poly(A),需要在模板质粒中包含相对应的poly(dA:dT)序列。然而,质粒中poly(dA:dT)重复序列在大肠杆菌中复制过程中不稳定,经常发生缺失突变导致poly(dA:dT)变短。这种现象不利于通过发酵规模化生产体外转录模板质粒的制备工艺,Poly(A)截短对mRNA的体内稳定性和生物学活性产生显著影响。
发明概述
本申请提供一种全新的Poly(A)尾,以提升其在体外制备过程中的保守性。同时,提供一种基于所述Poly(A)尾调节RNA在真核细胞中表达水平的方法。
具体的,本申请的第一方面提供了一种可在细胞中复制的工程化DNA分子,其包含多聚腺苷尾(Poly(A)尾)编码序列,所述Poly(A)尾编码序列包含:
唯一的元件a和至少一个元件b,以及至少一个元件c;
唯一的元件a和至少一个元件b,以及至少一个元件d;或
唯一的元件a和至少一个元件b,以及至少一个元件c和至少一个元件d,
在所述Poly(A)尾编码序列中,所述元件a由多个连续的腺嘌呤(A)核苷酸组成,所述元件a的长度范围为≥20nt;
所述元件b由多个连续的A核苷酸组成,所述元件b的长度范围为3nt≤b<20nt;
所述元件c由一个非A的核苷酸组成,所述核苷酸选自T、C、G核苷酸;
所述元件d由任意两个或更多个连续的核苷酸组成,所述核苷酸选自A、T、C、G核苷酸,其中元件d的5’及3’末端的核苷酸不为A核苷酸,并且其不包含3个以上连续的A核苷酸;所述元件d的长度范围为2nt≤d≤20nt;
其中所述元件a和元件b不相邻、元件c和元件d不相邻,
且所述Poly(A)尾编码序列不包含彼此相邻的元件b、不包含彼此相邻的元件c、且不包含彼此相邻的元件d。
在一些实施方案中,所述Poly(A)尾编码序列还进一步包含唯一一个元件e,所述元件e由一个或两个连续的A组成;且其位于所述Poly(A)尾编码序列的3’末端,且与元件d或元件c相邻。
在一些实施方案中,所述Poly(A)尾编码序列不包含元件a、元件b、元件c、及元件d以外的其他元件。
在一些实施方案中,所述Poly(A)尾编码序列不包含元件a、元件b、元件c、元件d及元件e以外的其他元件。
在一些实施方案中,所述Poly(A)尾编码序列包含至少2个元件d。在一些实施方案中,所述Poly(A)尾编码序列包含至少2个元件c。在一些实施方案中,所述Poly(A)尾编码序列包含至少一个元件d和一个元件c。
在一些实施方案中,所述元件b的个数为2-10个,例如3个、4个、5个、6个、7个、8个、或9个。
在一些实施方案中,所述元件c的个数为0至10个,例如1个、2个、3个、4个、5个、6个、7个、8个、或9个。
在一些实施方案中,所述元件d的个数为0至5个,例如1个、2个、3个、或4个。
在一些实施方案中,当元件c和元件d同时存在时,所述元件c和元件d的 个数总和为2-15个,例如3个、4个、5个、6个、7个、8个、9个、10个、11个、12个、13个或14个。
在一些实施方案中,所述元件a的个数为1,所述元件b的个数为3,所述元件c的个数为2,且所述元件d的个数为1。
在一些实施方案中,所述元件a的个数为1,所述元件b的个数为4,所述元件c的个数为4,且所述元件d的个数为1。
在一些实施方案中,所述元件a的个数为1,所述元件b的个数为5,所述元件c的个数为4,且所述元件d的个数为1。
在一些实施方案中,所述元件a的个数为1,所述元件b的个数为3,所述元件c的个数为3,且所述元件d的个数为1。
在一些实施方案中,所述元件a的个数为1,所述元件b的个数为3,所述元件c的个数为2,且所述元件d的个数为1。
在一些实施方案中,所述元件a≤80nt。在一些实施方案中,所述元件a为21nt、22nt、23nt、24nt、25nt、26nt、27nt、28nt、29nt、30nt、31nt、32nt、33nt、34nt、35nt、36nt、37nt、38nt、39nt、40nt、41nt、42nt、43nt、44nt、45nt、46nt、47nt、48nt、49nt、50nt、51nt、52nt、53nt、54nt、55nt、56nt、57nt、58nt、59nt、60nt、61nt、62nt、63nt、64nt、65nt、66nt、67nt、68nt、69nt、70nt、71nt、72nt、73nt、74nt、75nt、76nt、77nt、78nt、或79nt。
在一些实施方案中,所述元件b为3nt、4nt、5nt、6nt、7nt、8nt、9nt、10nt、11nt、12nt、13nt、14nt、15nt、16nt、17nt、18nt或19nt。
在一些实施方案中,其中所述元件d为2nt≤d≤20nt、3至18nt、5至16nt、4至10nt、或6至12nt,例如2nt、3nt、4nt、5nt、6nt、7nt、8nt、9nt、10nt、11nt、12nt、13nt、14nt、15nt、16nt、17nt、18nt、19nt或20nt,优选6nt。
在一些实施方案中,所述Poly(A)尾编码序列的长度大于40nt,例如41nt、42nt、43nt、44nt、45nt、46nt、47nt、48nt、49nt、50nt、51nt、52nt、53nt、54nt、55nt、56nt、57nt、58nt、59nt、60nt、61nt、62nt、63nt、64nt、65nt、66nt、67nt、68nt、69nt、70nt、71nt、72nt、73nt、74nt、75nt、76nt、77nt、78nt、79nt、80nt、81nt、82nt、83nt、84nt、85nt、86nt、87nt、88nt、89nt、90nt、91nt、92nt、93nt、94nt、95nt、96nt、97nt、98nt、99nt、100nt、101nt、102nt、103nt、104nt、105nt、106nt、107nt、108nt、109nt、110nt、111nt、112nt、113nt、114nt、115nt、116nt、117nt、118nt、119nt、120nt、121nt、122nt、 123nt、124nt、125nt、126nt、127nt、128nt、129nt、130nt、131nt、132nt、133nt、134nt、135nt、136nt、137nt、138nt、139nt、140nt、141nt、142nt、143nt、144nt、145nt、146nt、147nt、148nt、149nt、150nt、151nt、152nt、153nt、154nt、155nt、156nt、157nt、158nt、159nt、160nt、161nt、162nt、163nt、164nt、165nt、166nt、167nt、168nt、169nt、170nt、171nt、172nt、173nt、174nt、175nt、176nt、177nt、178nt、179nt、180nt、181nt、182nt、183nt、184nt、185nt、186nt、187nt、188nt、189nt、190nt、191nt、192nt、193nt、194nt、195nt、196nt、197nt、198nt、199nt、200nt、201nt、202nt、203nt、204nt、205nt、206nt、207nt、208nt、209nt、210nt、211nt、212nt、213nt、214nt、215nt、216nt、217nt、218nt、219nt、220nt、221nt、222nt、223nt、224nt、225nt、226nt、227nt、228nt、229nt、230nt、231nt、232nt、233nt、234nt、235nt、236nt、237nt、238nt、239nt、240nt、241nt、242nt、243nt、244nt、245nt、246nt、247nt、248nt、249nt、250nt、251nt、252nt、253nt、254nt、255nt、256nt、257nt、258nt、259nt、260nt、261nt、262nt、263nt、264nt、265nt、266nt、267nt、268nt、269nt、270nt、271nt、272nt、273nt、274nt、275nt、276nt、277nt、278nt、279nt、280nt、281nt、282nt、283nt、284nt、285nt、286nt、287nt、288nt、289nt、290nt、291nt、292nt、293nt、294nt、295nt、296nt、297nt、298nt、299nt、300nt、301nt、302nt、303nt、304nt、305nt、306nt、307nt、308nt、309nt、310nt、311nt、312nt、313nt、314nt、315nt、316nt、317nt、318nt、319nt、320nt、321nt、322nt、323nt、324nt、325nt、326nt、327nt、328nt、329nt、330nt、331nt、332nt、333nt、334nt、335nt、336nt、337nt、338nt、339nt、340nt、341nt、342nt、343nt、344nt、345nt、346nt、347nt、348nt、349nt、350nt、351nt、352nt、353nt、354nt、355nt、356nt、357nt、358nt、359nt、360nt、361nt、362nt、363nt、364nt、365nt、366nt、367nt、368nt、369nt、370nt、371nt、372nt、373nt、374nt、375nt、376nt、377nt、378nt、379nt、380nt、381nt、382nt、383nt、384nt、385nt、386nt、387nt、388nt、389nt、390nt、391nt、392nt、393nt、394nt、395nt、396nt、397nt、398nt、399nt、400nt等。
在一些实施方案中,所述元件a的50%或以上的多核苷酸位于所述Poly(A)尾编码序列的5’部分或3’部分。在一些实施方案中,所述元件a的50%以上的多核苷酸位于所述Poly(A)尾编码序列的5’部分。在一些实施方案中,所述 元件a的50%以上的多核苷酸位于所述Poly(A)尾编码序列的3’部分。在一些实施方案中,所述元件a位于所述Poly(A)尾编码序列的3’部分的核苷酸数量和位于所述Poly(A)尾编码序列的5’部分的核苷酸数量相等。
在一些实施方案中,所述元件c为G、C或T。
在一些实施方案中,所述元件d包含回文序列。元件d是回文序列。在一些实施方案中,所述元件d包含选自以下序列:GATATC(SEQ ID NO:15)、GTATAC(SEQ ID NO:16)、GAATCT(SEQ ID NO:17)、GCATATGACT(SEQ ID NO:18)及GATATCGTATAC(SEQ ID NO:19)。在一些实施方案中,所述元件d为选自以下序列:GATATC(SEQ ID NO:15)、GTATAC(SEQ ID NO:16)、GAATCT(SEQ ID NO:17)、GCATATGACT(SEQ ID NO:18)及GATATCGTATAC(SEQ ID NO:19)。在一些实施方案中,所述元件d包含如SEQ ID NO:15所示的多核苷酸序列。在一些实施方案中,所述元件d的多核苷酸序列如SEQ ID NO:15所示。
在一些实施方案中,所述Poly(A)尾编码序列3’末端的核苷酸为A。在一些实施方案中,所述Poly(A)尾编码序列3’末端的核苷酸为G。在一些实施方案中,所述Poly(A)尾编码序列3’末端的核苷酸为C。在一些实施方案中,所述Poly(A)尾编码序列3’末端的核苷酸为T。
在一些实施方案中,在所述Poly(A)尾编码序列的3’部分包含一个或多个非A核苷酸。在一些实施方案中,在所述Poly(A)尾编码序列靠近3’末端的1/2处包含一个或多个非A核苷酸。在一些实施方案中,在所述Poly(A)尾编码序列靠近3’末端的1/3处包含一个或多个非A核苷酸。在一些实施方案中,在所述Poly(A)尾编码序列靠近3’末端的1/4处包含一个或多个非A核苷酸。
在一些实施方案中,所述Poly(A)尾编码序列的结构为:
元件a-元件c-元件b-元件c-元件b-元件c-元件b-元件c-元件b;
元件b-元件c-元件b-元件c-元件a-元件d-元件b-元件c-元件b-元件c-元件b;
元件b-元件c-元件b-元件c-元件b-元件d-元件a-元件c;
元件a-元件d-元件b-元件c-元件b-元件c-元件b;或
元件b-元件c-元件b-元件c-元件b-元件d-元件a。
在一些实施方案中,所述Poly(A)尾编码序列的结构为:
元件a-元件c-元件b-元件c-元件b-元件c-元件b-元件c-元件b;具体为60A-G-19A-G-19A-G-19A-G-3A。
在一些实施方案中,所述Poly(A)尾编码序列的结构为:
7A-C-18A-G-60A-GG-7A-C-18A-G-14A、
19A-G-19A-G-19A-元件d-60A-G、
60A-元件d-19A-G-19A-G-17A、
19A-G-19A-G-19A-元件d-60A、
19A-G-19A-G-19A-元件d-60A、
19A-C-19A-C-19A-元件d-60A、
19A-T-19A-T-19A-元件d-60A、
19A-G-19A-G-19A-元件d-60A、或
19A-G-19A-G-19A-元件d-60A;
且其中元件d由6个或12个核苷酸组成。
在一些实施方案中,所述Poly(A)尾编码序列如SEQ ID NO:1至10中任一项所示。
在一些实施方案中,所述Poly(A)尾编码序列如SEQ ID NO:3或SEQ ID NO:4所示。
在一些实施方案中,所述工程化DNA分子在其Poly(A)尾编码序列的5’端一侧进一步连接目的基因片段,所述目的基因片段与所述Poly(A)尾编码序列共同编码RNA。在一些实施方案中,所述工程化DNA分子在其Poly(A)尾编码序列的5’端一侧进一步连接目的基因片段,所述目的基因片段与所述Poly(A)尾编码序列共同编码mRNA。在一些实施方案中,所述目的基因片段包含蛋白编码序列或非蛋白编码序列例如功能RNA编码序列。在一些实施方案中,所述目的基因片段在所述蛋白编码序列或功能RNA编码序列的5’端一侧还进一步包含5’UTR编码序列。在一些实施方案中,目的基因片段在所述蛋白编码序列或功能RNA编码序列的3’端一侧还进一步包含3’UTR编码序列。在一些实施方案中,目的基因片段在所述蛋白编码序列或功能RNA编码序列的3’端一侧还进一步包含3’UTR编码序列,且在所述蛋白编码序列或功能RNA编码序列的5’端一侧还进一步包含5’UTR编码序列。在一些实施方案中,所述工程化DNA分子进一步包含复制子,例如复制起始位点,例如ORI。在一些实施方案中,所述工程化DNA分子中还进一步包含标记基因以方便对包含所述工程化DNA分子的细胞进行筛选,所述标记基因选自,例如针对抗生素的抗性基因、荧光蛋白等。在一些实施方案中,所述的DNA分子进一步包 含启动子,所述启动子启动所述目的基因片段与所述Poly(A)尾编码序列共同编码RNA的转录。在一些实施方案中,所述启动子为原核启动子。在一些实施方案中,所述启动子为真核启动子。在一些实施方案中,所述的DNA分子进一步包含复制子例如复制起始位点、启动子、5’UTR编码序列、蛋白编码序列、及3’UTR编码序列。在一些实施方案中,所述的DNA分子进一步包含复制子例如复制起始位点、抗性基因、5’UTR编码序列、蛋白编码序列、及3’UTR编码序列。在一些实施方案中,所述的DNA分子进一步包含复制子例如复制起始位点、抗性基因、启动子、5’UTR编码序列、蛋白编码序列、及3’UTR编码序列。在一些实施方案中,所述蛋白编码序列编码HPV病毒抗原蛋白。在一些实施方案中,所述HPV蛋白源自16型和/或18型HPV。在一些实施方案中,所述蛋白编码序列编码HPV的E2、E6或E7蛋白。在一些实施方案中,所述蛋白编码序列编码HPV的E6和E7蛋白的融合蛋白。在一些实施方案中,所述蛋白编码序列编码HPV的E2、E6和E7蛋白的融合蛋白。在一些实施方案中,所述融合蛋白的多肽片段来源于16型和/或18型HPV。在一些实施方案中,所述融合蛋白的多肽片段来源于16型和/或18型HPV E2、E6和E7蛋白。在一些实施方案中,所述蛋白编码序列编码如SEQ NO:26所示的多肽或其保守取代变体。
在一些实施方案中,所述工程化DNA分子包含如SEQ ID NO:22-25中任一项所示的多核苷酸序列,或如SEQ ID NO:22-25中任一项所示的多核苷酸序列的同义突变体,或与SEQ ID NO:22-25中任一项所示的多核苷酸序列或其同义突变体具有85%以上序列同一性的多核苷酸序列。
在一些实施方案中,所述DNA分子为DNA质粒。在一些实施方案中,所述DNA分子为线性质粒或环状质粒。在一些实施方案中,所述DNA分子为单链或双链。在一些实施方案中,所述质粒为基于pUC、pTZ、pMB1或pCoIE1的质粒。在一些实施方案中,所述质粒为基于pUC57载体的质粒。
在一些实施方案中,所述细胞为原核细胞。在一些实施方案中,所述细胞为recA‐的细菌。在一些实施方案中,所述细胞为大肠杆菌。在一些实施方案中,所述大肠杆菌选自:K-12及其衍生菌株,以及B菌株及其衍生菌株。在一些实施方案中,所述大肠杆菌选自:MG1655、DH5或DH5α、DH10B、BL21、DB3.1、HB101、JM109、JM110、MC1061、MG1655、Pir1、Stbl2、Stbl3、Top10、XL1Blue、XL10Gold、BLR、HMS174、Tuner、Rostetta2、 Lemo21、T7Express、Origami2。
本申请的第二方面公开了包含前述第一方面的DNA分子的细胞。在一些实施方案中,前述第一方面的DNA分子可在所述细胞中复制和/或转录。在一些实施方案中,所述细胞为原核细胞,且前述第一方面的DNA分子可在所述原核细胞中复制。在一些实施方案中,所述细胞为recA‐的细菌。在一些实施方案中,所述细胞为大肠杆菌。在一些实施方案中,所述原核细胞为感受态细胞。在一些实施方案中,所述原核细胞为工程化细胞。在一些实施方案中,所述原核细胞为工程化的原核细胞。在一些实施方案中,所述细胞为大肠杆菌,所述大肠杆菌选自:K-12及其衍生菌株,以及B菌株及其衍生菌株。在一些实施方案中,所述大肠杆菌选自:MG1655、DH5或DH5α、DH10B、BL21、DB3.1、HB101、JM109、JM110、MC1061、MG1655、Pir1、Stbl2、Stbl3、Top10、XL1Blue、XL10Gold、BLR、HMS174、Tuner、Rostetta2、Lemo21、T7Express、Origami2。在一些实施方案中,所述细胞为真核细胞,且前述第一方面的DNA分子可在所述真核细胞中转录。在一些实施方案中,所述真核细胞为哺乳动物细胞。在一些实施方案中,所述真核细胞选自酵母菌或霉菌。
本申请的第三方面,提供了一种Poly(A)尾。所述Poly(A)尾:
(1)由前述第一方面的工程化DNA分子转录获得;
(2)由化学合成的方法获得,且与通过所述第一方面的工程化DNA分子转录获得的Poly(A)尾具有相同的多核苷酸序列;或
(3)对前述(1)或(2)中的Poly(A)尾做进一步的修饰获得。
在一些实施方案中,所述进一步的修饰包含将由(1)或(2)获得的Poly(A)尾中的一个或多个核糖核苷酸替换为一个或多个脱氧核糖核苷酸。在一些实施方案中,所述一个或多个核糖核苷酸被替换为与所述一个或多个核糖核苷酸相应的脱氧核糖核苷酸,例如将所述Poly(A)尾的一个或多个核糖核苷酸A替换为脱氧核糖核苷酸A,将其中的一个或多个核糖核苷酸U替换为脱氧核糖核苷酸T,将其中的一个或多个核糖核苷酸C替换为脱氧核糖核苷酸C,将其中的一个或多个核糖核苷酸G替换为脱氧核糖核苷酸G,或将其中的一个或多个核糖核苷酸G替换为核糖核苷酸I(肌苷)或脱氧核糖核苷酸I。在一些实施方案中,所述修饰为化学修饰。在一些实施方案中所述修饰为碱基编辑。在一些实施方案中,所述修饰对由(1)或(2)获得的Poly(A)尾中的一个或多个核糖核苷酸进行脱氨基处理。
在一些实施方案中,所述Poly(A)尾包含选自如SEQ ID NO:1-10中任一项所示的多核苷酸序列。在一些实施方案中,所述Poly(A)尾的多核苷酸序列如SEQ ID NO:1-10中任一项所示。
本申请还提供了前述Poly(A)尾的用途。在一些实施方案中,所述Poly(A)尾用于使RNA分子更稳定的用途,其中,所述Poly(A)尾位于所述RNA的3’末端,所述更稳定是相对于包含另一Poly(A)尾的RNA分子,在细胞外、细胞内或动物体内更加稳定。在一些实施方案中,所述Poly(A)尾用于使RNA分子稳定性降低的用途,其中,所述Poly(A)尾位于所述RNA的3’末端,该稳定性的降低是相对于包含另一Poly(A)尾的RNA分子,在细胞外、细胞内或动物体内的稳定性降低。在一些实施方案中,所述Poly(A)尾用于使RNA分子在相同时间内的表达量提高的用途,所述表达量提高是相对于包含另一Poly(A)尾的RNA分子,在细胞外、细胞内或动物体内的表达量提高。在一些实施方案中,所述Poly(A)尾用于使RNA分子在相同时间内的表达量降低的用途,所述表达量提高是相对于包含另一Poly(A)尾的RNA分子,在细胞外、细胞内或动物体内的表达量降低。在一些实施方案中,所述Poly(A)尾用于使RNA分子表达时间延长的用途,所述表达时间延长是相对于包含另一Poly(A)尾的RNA分子,在细胞外、细胞内或动物体内的表达时间延长。在一些实施方案中,所述Poly(A)尾用于使RNA分子表达时间缩短的用途,所述表达时间延长是相对于包含另一Poly(A)尾的RNA分子,在细胞外、细胞内或动物体内的表达时间缩短。在一些实施方案中,所述Poly(A)尾用于使RNA分子半衰期延长的用途,所述半衰期延长是相对于包含另一Poly(A)尾的RNA分子,在细胞外、细胞内或动物体内的半衰期延长。在一些实施方案中,所述Poly(A)尾用于使RNA分子半衰期缩短的用途,所述半衰期缩短是相对于包含另一Poly(A)尾的RNA分子,在细胞外、细胞内或动物体内的半衰期缩短。
在一些实施方案中,所述RNA分子为mRNA分子。在一些实施方案中,所述Poly(A)尾及另一Poly(A)尾分别属于本申请的第三方面的Poly(A)尾的两条不同的Poly(A)尾。在一些实施方案中,所述Poly(A)尾是属于本申请的第三方面的Poly(A)尾,所述另一Poly(A)尾为本申请的第三方面的Poly(A)尾以外的Poly(A)尾。在一些实施方案中,所述细胞内为宿主细胞内,所述宿主细胞为真核细胞。在一些实施方案中,所述宿主细胞为 哺乳动物细胞。在一些实施方案中,所述宿主细胞为人细胞。
本申请还提供了编码本申请的第三方面的所述Poly(A)尾的DNA片段或DNA与RNA的杂合分子片段,以及将所述DNA片段或DNA与RNA的杂合分子片段用于使编码RNA的DNA分子或DNA与RNA的杂合分子在宿主细胞中的复制更加保守的用途,在所述用途中,编码本申请的第三方面的所述Poly(A)尾的DNA片段或DNA与RNA的杂合分子片段在所述DNA分子或DNA与RNA的杂合分子中,位于所述RNA编码序列的3’端一侧。在一些实施方案中,所述宿主细胞为原核细胞。在一些实施方案中,所述宿主细胞为recA‐的细菌。在一些实施方案中,所述宿主细胞为大肠杆菌。在一些实施方案中,所述大肠杆菌选自:K-12及其衍生菌株,以及B菌株及其衍生菌株。在一些实施方案中,所述大肠杆菌选自:MG1655、DH5或DH5α、DH10B、BL21、DB3.1、HB101、JM109、JM110、MC1061、MG1655、Pir1、Stbl2、Stbl3、Top10、XL1Blue、XL10Gold、BLR、HMS174、Tuner、Rostetta2、Lemo21、T7Express、Origami2。
本申请的第四方面还提供了一种RNA分子,所述RNA分子包含第三方面的Poly(A)尾。在一些实施方案中,所述RNA分子为mRNA分子。在一些实施方案中,所述RNA分子:
(1)由前述第一方面的工程化DNA分子转录获得;
(2)由化学合成的方法获得,且与项(1)的RNA分子具有相同的多核苷酸序列;或
(3)由对前述(1)或(2)中的RNA分子做进一步的修饰获得。
在一些实施方案中,所述进一步的修饰包含将由(1)或(2)获得的RNA分子中的一个或多个核糖核苷酸替换为一个或多个脱氧核糖核苷酸。在一些实施方案中,所述一个或多个核糖核苷酸被替换为与所述一个或多个核糖核苷酸相应的脱氧核糖核苷酸,例如将所述RNA分子的一个或多个核糖核苷酸A替换为脱氧核糖核苷酸A,将其中的一个或多个核糖核苷酸U替换为脱氧核糖核苷酸T,将其中的一个或多个核糖核苷酸C替换为脱氧核糖核苷酸C,将其中的一个或多个核糖核苷酸G替换为脱氧核糖核苷酸G,或将其中的一个或多个核糖核苷酸G替换为核糖核苷酸I(肌苷)或脱氧核糖核苷酸I。在一些实施方案中,所述修饰为化学修饰。在一些实施方案中所述修饰为碱基编辑。在一些实施方案中,所述修饰对由(1)或(2)获得的RNA分子中的一个或 多个核糖核苷酸进行脱氨基处理。在一些实施方案中,所述进一步修饰为转录后修饰。在一些实施方案中,所述进一步修饰包含加帽处理。在一些实施方案中,所述进一步修饰包含剪接。在一些实施方案中,所述进一步修饰包含剪接和加帽处理。
在一些实施方案中,所述RNA分子包括编码RNA(coding RNA)或非编码RNA(non-coding RNA,ncRNA)。在一些实施方案中,所述RNA分子为Pre-mRNA。在一些实施方案中,所述RNA为成熟mRNA。在一些实施方案中,所述RNA分子为长链非编码RNA(long noncoding RNA,lncRNA)。在一些实施方案中,所述RNA分子还进一步包含5'‐帽结构。在一些实施方案中,所述RNA分子的多核苷酸序列如SEQ ID NO:22-25中任一项所示。在一些实施方案中,所述RNA分子包含如SEQ ID NO:22-25中任一项所示的的多核苷酸序列。
此外,本申请还提供了DNA与RNA的杂合分子,其携带与前述第一方面的工程化DNA分子相同的遗传信息,与前述第三方面的Poly(A)尾相同的遗传信息,或与前述第四方面的RNA分子相同的遗传信息。
此外,本申请还提供了核酸分子文库。在一些实施方案中,所述核酸分子文库包含前述第一方面的工程化DNA分子、前述第三方面的Poly(A)尾、编码本申请的第三方面的所述Poly(A)尾的DNA片段或DNA与RNA的杂合分子片段、或前述第四方面的RNA分子。
此外,本申请还提供了调控蛋白表达的方法,包括:将根据前述核酸分子文库中的多种核酸分子在不同时间,和/或以不同的数量比例导入目的细胞。在一些实施方案中,所述核酸分子为前述第一方面的工程化DNA分子。在一些实施方案中,所述核酸分子为前述第四方面的RNA分子。
应当理解,本文描述的本申请的方面和实施方案包括“包含”,“组成”和“基本上由……组成”的方面和实施方案。以上详细描述了本申请的优选实施方案,但是,本申请并不限于此。在本申请的技术构思范围内,可以对本申请的技术方案进行多种简单变型,包括各个技术特征以任何其它的合适方式进行组合,这些简单变型和组合同样应当视为本申请所公开的内容,均属于本申请的保护范围。
附图说明
图1示出了实施例2中,本申请中的10种poly(A)在大肠杆菌DH5α中的复 制稳定性;
图2示出了实施例2中,本申请中的10种poly(A)在大肠杆菌DH5α中的碱基缺失统计;
图3示出了实施例3中poly(A)P1、P2、P3、P4及poly(A)对照C1、C2在包括HPV抗原序列质粒体系下大肠杆菌DH5α中的复制稳定性;
图4示出了实施例3中poly(A)P3、P4及poly(A)对照C1、C2在包括HPV抗原序列质粒体系下大肠杆菌DH5α中的碱基缺失统计;
图5示出了实施例3中poly(A)P1、P2、P3及poly(A)对照C1在包括HPV抗原序列质粒体系下大肠杆菌DH5α中30℃和37℃两个温度条件下的复制稳定性比较;
图6示出了实施例中通用载体质粒DNA的图谱示例。
图7示出了实施例4中,带有poly(A)P3、P4、P5、P8、P9及对照C2的荧光素酶在小鼠体内表达水平的动物成像结果;
图8示出了实施例4中,带有poly(A)P3、P4、P5、P8、P9及对照C2的荧光素酶在小鼠体内表达水平的动物成像后荧光强度定量结果(ns,无显著性差异;★★,显著性差异,p<0.01)。
发明详述
本申请首先提供了在体外稳定扩增Poly(A)尾转录模板DNA的方法,以减少所述DNA在细胞中大量复制时,其中Poly(A)尾转录模板序列的突变频率。从而,基于所述DNA获得大量包含序列确定的Poly(A)尾的RNA。在此基础上,使得经过工程化设计而具有某种特定功能的Poly(A)尾的RNA,例如mRNA,得以通过体外发酵,实现规模化生产。
此外,本申请还提供了可在体外稳定扩增的包含Poly(A)尾转录模板的DNA,以及由所述DNA转录形成的RNA。进一步的,在满足体外稳定扩增的前提下,本申请还进一步提供了对RNA稳定性和/或表达效率具有不同调节作用的一组Poly(A)尾、包含所述Poly(A)尾的RNA、包含所述Poly(A)尾编码序列的DNA、以及由所述Poly(A)尾、所述RNA或所述DNA组成的文库。
更进一步地,本申请还提供了前述Poly(A)尾、RNA、DNA、以及文库的用途。
术语
如本文所用,“元件a”、“元件b”、“元件c”、“元件d”及“元件e”是Poly(A)可包含的元件类别。元件a由多个连续的腺嘌呤(A)核苷酸组成,所述元件a的长度范围为≥20nt;元件b由多个连续的A核苷酸组成,所述元件b的长度范围为3nt≤b<20nt;元件c由一个非A的核苷酸组成,所述核苷酸选自T、C、G核苷酸;元件d由任意两个或更多个连续的核苷酸组成,所述核苷酸选自A、T、C、G核苷酸,其中元件d的5’及3’末端的核苷酸不为A核苷酸,并且元件d不包含3个以上连续的A核苷酸,所述元件d的长度范围为2nt≤d≤20nt。元件e由一个或两个连续的A组成,且当其存在时,其位于Poly(A)尾编码序列的3’末端,且与元件d或元件c相邻。当在一条Poly(A)中,含有两个或多个“元件b”、“元件c”、及“元件d”时,每两个元件b的序列可以相同或不同、每两个元件c的序列可以相同或不同,以及每两个元件d的序列可以相同或不同,只要他们各自均符合前述对元件a、b、c和d的定义即可。在本申请中,Poly(A)尾中的“元件a”、“元件b”、“元件c”、“元件d”、“元件e”等均可以术语“元件”指代。
如本文所用,当描述两个或多个元件的位置关系为“不相邻”,则是指两个或多个元件两两之间不毗邻。换言之,即所述两个或多个元件两两之间至少包含所述两个元件的核苷酸以外的一个或多个其他核苷酸或碱基。
在本文中,“编码”是指i)DNA序列中包含可被转录成RNA分子的遗传信息,和/或ii)RNA分子中包含可被翻译成氨基酸序列的遗传信息。因此,如本文所用,“编码序列”可用以指代mRNA前体或成熟mRNA中可以被翻译为蛋白质的核糖核苷酸(RNA)序列或其片段,亦可指代作为模板用以转录所述mRNA前体或成熟mRNA的脱氧核糖核苷酸(DNA)序列的互补序列或其片段。此外,本申请的“编码序列”还可以进一步包含编码蛋白、功能性核酸、或其片段,例如miRNA、shRNA、dsRNA、向导RNA、Poly(A)尾、5’UTR、3’UTR等的多核苷酸序列。其中,包含可被转录成RNA分子的遗传信息的DNA分子称为所述RNA分子的“编码核酸”;包含可被翻译成氨基酸序列的遗传信息的RNA分子称为所述氨基酸序列的“编码核酸”。
在本申请中,所有多核苷酸序列中的核苷酸由5’端向3’端编号,即5’末端的核苷酸为第一个核苷酸,3’末端的核苷酸为最后一个核苷酸。如无特别说明,“5’端”与“5’末端”可互换使用;“3’端”与“3’末端”可互换使用。“5’端” 与“3’端”着重描述同一条核酸序列中,核苷酸之间,核苷酸序列区段之间,或核苷酸与核苷酸序列区段之间的相对位置关系;“5’末端”及“3’末端”则分别用于描述一条核酸序列或一条核酸序列的某一个区段的第一个及最后一个核苷酸所在的位置。“5’端一侧”则用以描述同一条多核苷酸序列中两段相互之间无重叠部分的序列的相对位置关系,当描述一段序列位于另一段序列的5’端一侧,则是指所述一段序列相对于另一段序列更加靠近所述多核苷酸序列的“5’端”。同理,当描述一段序列位于另一段序列的3’端一侧,则是指一段序列相对于所述另一段序列更加靠近所述多核苷酸序列的“3’端”,且所述一段序列与所述另一段序列彼此不包含重叠部分。具体例如“Poly(A)尾的DNA编码序列位于RNA编码序列的3’端”是指所述Poly(A)尾的DNA编码序列作为所述RNA编码序列的组成部分,包含所述RNA编码序列的3’末端的核苷酸。此外,如本文所用,“5’部分”是指以所述多核苷酸序列“中心位置”为界,靠近所述多核苷酸序列5’末端的一半。“3’部分”则指以所述多核苷酸序列中心位置为界,靠近所述多核苷酸序列3’末端的一半。本申请所述“中心位置”至所述5’端和至所述3’端的核苷酸数量相等。
如本文所用,将术语“保守”用于描述核酸分子的复制时,是指所述复制过程中出现突变的概率低。在此语境中,“保守”是个相对的概念。例如当描述“Poly(A)尾的DNA编码序列用于使编码RNA的DNA分子在宿主细胞中的复制更加保守”时,是指编码RNA的亲代DNA分子复制为子代DNA分子后,如果所述RNA分子包含该Poly(A)尾,则相对于不包含所述Poly(A)尾(例如包含某种其他Poly(A)尾的RNA分子)的RNA分子的编码DNA,所述子代DNA分子与所述亲代DNA分子具有100%序列同一性的概率更高;或由所述亲代DNA分子复制获得的多条所述子代DNA分子与所述亲代DNA分子之间的平均序列同一性更高。
在本申请中,当描述“调控”RNA分子表达时,所述“调控”是指:在相同的时长内使所述RNA分子表达的蛋白或功能RNA的总量提高或降低;或使所述RNA可以在更长或更短的时间范围内表达蛋白或功能RNA,所述提高、降低、或更长或更短的时间范围是相较于另一表达相同蛋白或功能RNA的RNA分子而言的。当描述“调控”蛋白表达时,则指调控包含所述蛋白编码序列的RNA分子表达。这里描述的调控作用可以通过将本申请的Poly(A)尾连接在不包含Poly(A)尾的RNA分子的3’末端实现,也可以通过将RNA原有的 Poly(A)尾替换为本申请的Poly(A)尾实现。
如本文所用,“同一性”的百分比,例如85%、90%、91%、92%、93%、94%、95%、96%、97%、98%、98.5%、99%、99.5%同一性,是指氨基酸序列之间或核苷酸序列之间,通过序列比对确定的相似程度,是85%、90%、91%、92%、93%、94%、95%、96%、97%、98%、98.5%、99%、99.5%。例如,通过引入空位等方式可以使两条序列在尽可能多的位置上具有相同残基后,确定的具有相同碱基或氨基酸残基的位置数量占位置总数的比例。“同一性”的百分比可以用本领域已知的软件程序来确定。优选的是使用默认参数进行比对。一个优选的比对程序是BLAST。优选的程序是BLASTN和BLASTP。这些程序的细节可以在以下互联网地址找到:ncbi.nlm.nih.gov/cgi-bin/BLAST。
如本文所用,核酸的“互补”是指一条核酸通过传统的Watson-Crick碱基配对与另一条核酸形成氢键的能力。百分比互补性表示核酸分子中可与另一核酸分子形成氢键(即,Watson-Crick碱基配对)的残基的百分比(例如,10个中的约5、6、7、8、9、10个分别为约50%,60%,70%,80%,90%和100%互补)。“完全互补”是指核酸序列的所有连续残基与第二核酸序列中相同数量的连续残基形成氢键。如本文所用,“基本上互补”是指在约40、50、60、70、80、100、150、200、250或更多个核苷酸的区域内,至少约70%,75%,80%,85%,90%,95%,96%,97%,98%,99%或100%中的任何一个的互补程度,或指在严格条件下杂交的两条核酸。对于单个碱基或单个核苷酸,按照Watson-Crick碱基配对原则,A与T或U、C与G或I配对时,被称为互补或匹配,反之亦然;而除此以外的碱基配对都称为不互补。本申请中某多核苷酸序列的“互补多核苷酸序列”则是指与该某多核苷酸序列完全互补的多核苷酸序列。
如本文所用,某个蛋白、多肽或氨基酸序列的“保守取代变体”是指其中一个或多个氨基酸残基经过氨基酸取代而不改变蛋白质或酶的整体构象和功能,这包括但不限于以前述“保守取代”描述的方式取代亲本蛋白质中氨基酸序列中的氨基酸。因此,相似功能的两个蛋白或氨基酸序列的相似性可能会不同。例如,基于MEGALIGN算法的70%至99%的相似度(同一性)。“保守取代变体”还包括通过BLAST或FASTA算法确定具有60%以上的氨基酸同一性的多肽或酶,若能达75%以上更好,最好能达85%以上,甚至达90%以 上为最佳,并且与天然或亲本蛋白质或酶相比具有相同或基本相似的性质或功能。
在本申请的上下文中,术语“DNA”和“RNA”是指单链或双链DNA或RNA分子。除非另有说明,否则术语“DNA”和“DNA分子”是指由A、C、G和/或T核苷酸构成的双链DNA分子,而术语“RNA”和“RNA分子”是指由A、C、G和/或U核苷酸构成的单链RNA分子。在本文中,所述A、C、G、T和U核苷酸是指包含腺嘌呤、鸟嘌呤、胞嘧啶、胸腺嘧啶和尿嘧啶作为各自的含氮碱基的核苷酸。
RNA分子包括编码RNA(coding RNA)或非编码RNA(non-coding RNA,ncRNA),例如Pre-mRNA、成熟mRNA或长链非编码RNA(long noncoding RNA,lncRNA)。
如本文所用,所述“DNA与RNA的杂合分子”是一个包含由脱氧核糖核苷酸和核糖核苷酸组成的多核苷酸序列的分子。所述DNA与RNA的杂合分子可以通过以下方式获得:
将DNA中的一个或多个脱氧核糖核苷酸替换为核糖核苷酸;
将RNA中的一个或多个核糖核苷酸替换为脱氧核糖核苷酸;或
通过生物或化学合成的方式,以脱氧核糖核苷酸以及核糖核苷酸为原料从头合成。需注意,获得DNA与RNA的杂合分子的方式不限于以上方式,由任何方式获取的DNA与RNA的杂合分子均属于本申请定义的“DNA与RNA的杂合分子”的范畴。
如本文所用,如果描述两个核酸分子具有“相同的遗传信息”,则是指所述两个核酸分子互补,或包含完全相同的碱基序列,或其中一个核酸分子的碱基序列中一个或多个胸腺嘧啶转变为尿嘧啶后可以获得与另一个核酸分子碱基序列完全相同的核酸分子。因此,DNA、RNA以及DNA与RNA的杂合分子中的任意两种均可具有相同的遗传信息。其中,术语“碱基序列”指多核苷酸分子中碱基的排列顺序。本领域技术人员应当知晓,除非另有说明,否则本申请中所述的碱基序列或多核苷酸序列可在用于描述DNA序列时,以“T”代指胸腺嘧啶,但在所述碱基序列或多核苷酸序列用于描述RNA(例如mRNA)时,“T”将由“U”(尿嘧啶)取代。因此,由本文中的特定序列号(SEQ ID NO)公开的任何DNA也公开与所述DNA互补或对应的RNA(例如mRNA或Poly(A)尾)序列,其中所述DNA序列的每个“T”被“U”取代。
Poly(A)尾及其用途
如本文所用,术语"PolyA尾"或"Poly(A)序列"是指通常位于RNA分子3'-末端的不间断或不中断的腺苷酸残基序列。在RNA中,在有3'-UTR存在的情况下,Poly-A序列与3'-UTR的3’端相连。不间断的poly-A尾的特点是有连续的腺苷酸残基。Poly-A尾可以是任何长度的。在一些实施方案中,Poly-A尾包含,或由至少20、至少30、至少40、至少80或至少100和至多500、至多400、至多300、至多200或至多150个腺苷酸(A)组成,特别是约120个A。通常,PolyA尾中的绝大多数核苷酸都是腺苷,所述绝大多数是指至少75%,至少80%,至少85%,至少90%的核苷酸等,但允许剩余的核苷酸是A以外的核苷酸(非A核苷酸),例如U(尿苷酸)、G(鸟苷酸)或C(胞苷酸)。
在一些实施方案中,所述RNA的体外制备过程是原核发酵过程,即将包含所述Poly(A)尾的RNA分子的编码核酸导入原核细胞中,通过扩增所述原核细胞,达到扩增所述编码核酸的目的,随后,将扩增后的所述编码核酸转录为所述RNA。在一些实施方案中,所述RNA的体外制备过程是将包含蛋白质编码序列的RNA片段与Poly(A)尾通过同源重组、酶切连接、或其他非同源重组等方式连接,且所述Poly(A)尾通过原核发酵过程制备,所述原核发酵过程时将包含所述Poly(A)尾的编码核酸导入原核细胞中,通过扩增所述原核细胞,达到扩增所述编码核酸的目的,随后,将扩增后的所述编码核酸转录为包含所述Poly(A)尾的RNA。在一些实施方案中,前述编码核酸是线性的。在一些实施方案中,前述编码核酸是环状的。在一些实施方案中,前述编码核酸是质粒。在一些实施方案中,前述编码核酸是单链或双链。在一些实施方案中,前述编码核酸在导入原核细胞前经过了化学修饰。在一些实施方案中,导入原核细胞前的前述编码核酸通过化学合成。在一些实施方案中,所述编码核酸插入所述原核细胞的类核/拟核基因组DNA中。在一些实施方案中,所述编码核酸游离存在于所述原核细胞的细胞质中或类核/拟核以外。在一些实施方案中,所述原核细胞是大肠杆菌。
基于此,本申请提供了一系列的Poly(A)尾,所述Poly(A)尾在RNA的体外制备过程中保守性高。所述Poly(A)尾在一个或多个位置包含一个或多个非A核苷酸。
在一些实施方案中,所述Poly(A)尾编码序列包含:
唯一的元件a和至少一个元件b,以及至少一个元件c;
唯一的元件a和至少一个元件b,以及至少一个元件d;或
唯一的元件a和至少一个元件b,以及至少一个元件c和至少一个元件d,
其中所述元件a和元件b不相邻、元件c和元件d不相邻,
且所述Poly(A)尾编码序列不包含彼此相邻的元件b、不包含彼此相邻的元件c、且不包含彼此相邻的元件d。
在一些实施方案中,所述Poly(A)尾编码序列还进一步包含唯一一个元件e,所述元件e由一个或两个连续的A组成,且其位于所述Poly(A)尾编码序列的3’末端,且与元件d或元件c相邻。
这里的Poly(A)尾可以是一段RNA,也可以是DNA与RNA的杂合分子。
本申请还提供了可调控蛋白表达水平的Poly(A)尾。并且,本申请还提供了在体外制备过程中保守性高的同时,可调控蛋白表达水平的Poly(A)尾。在一些实施方案中,所述可调控蛋白表达水平的Poly(A)尾;或所述在体外制备过程中保守性高的同时,可调控蛋白表达水平的Poly(A)尾的结构选自:
元件a-元件c-元件b-元件c-元件b-元件c-元件b-元件c-元件b;
元件b-元件c-元件b-元件c-元件a-元件d-元件b-元件c-元件b-元件c-元件b;
元件b-元件c-元件b-元件c-元件b-元件d-元件a-元件c;
元件a-元件d-元件b-元件c-元件b-元件c-元件b;及
元件b-元件c-元件b-元件c-元件b-元件d-元件a。
在一些实施方案中,所述可调控蛋白表达水平的poly(A)尾;或所述在体外制备过程中保守性高的同时,可调控蛋白表达水平的poly(A)尾的结构选自:
60A-G-19A-G-19A-G-19A-G-3A;
7A-C-18A-G-60A-元件d-7A-C-18A-G-14A;
60A-元件d-19A-G-19A-G-17A;
19A-G-19A-G-19A-元件d-60A;
19A-G-19A-G-19A-元件d-60A-G;
19A-G-19A-G-19A-元件d-60A;
19A-C-19A-C-19A-元件d-60A;及
19A-T-19A-T-19A-元件d-60A。
在一些实施方案中,所述可调控蛋白表达水平的Poly(A)尾;或所述在 体外制备过程中保守性高的同时,可调控蛋白表达水平的Poly(A)尾的结构为:60A-元件d-19A-G-19A-G-17A或19A-G-19A-G-19A-元件d-60A。
其中上述“-”连接的两个元件之间直接相连,及所述两个元件之间不包含任何核苷酸。
上述Poly(A)尾的结构中以“yA”表示元件a或元件b中连续A的个数,所述y为自然数,例如19A,则表示包含19个连续的A;60A则表示包含60个连续的A。
在一些实施方案中,所述可调控蛋白表达水平的Poly(A)尾;或所述在体外制备过程中保守性高的同时,可调控蛋白表达水平的Poly(A)尾的多核苷酸编码序列选自如SEQ ID NO:1-10中任一项所示的多核苷酸序列。
此外,本申请也提供了将上述Poly(A)尾用于调控蛋白表达的用途,在所述用途中,所述Poly(A)尾位于所述mRNA的3’端,例如3’UTR的3’端。在一些实施方案中,所述调控蛋白表达使用后文中的调控蛋白表达的方法。
工程化DNA分子及文库
本申请还提供了一种可在细胞中复制的工程化DNA分子,其包含前述Poly A尾的编码序列或其互补序列。本领域技术人员应当知晓,除所述Poly A尾的编码序列以外,所述工程化DNA分子还应包含可使所述DNA分子在细胞中复制,或高效复制所必须的结构元件。使所述工程化DNA分子在细胞中复制或高效复制所述必须的结构元件是本领域已知的,包括例如复制起点(ORI)。在一些实施方案中,所述工程化DNA分子还进一步包含标记基因或其片段和/或报道基因或其片段、和允许插入DNA元件的独特的限制性内切酶位点,优选多克隆位点(MCS)形式的限制性内切酶位点。所述标记基因有利于鉴定含有包含所述标记基因的质粒的细胞,可选自,例如抗生素抗性基因。所述MCS中的每一个限制性内切酶位点均可被不同的限制性内切酶特异性识别。
在一些实施方案中,所述DNA分子是DNA质粒。如本文所用,术语“DNA质粒”是指由双链DNA分子组成的质粒。在一些实施方案中,所述“质粒”是环状DNA分子。在一些实施方案中,所述“质粒”还可以涵盖线性DNA分子。具体的,术语“质粒”还涵盖通过例如用限制性内切酶切割环状质粒,进而使该环状质粒分子转变成线性分子而使该环状质粒线性化所得到的分子,以及可在原核生物中复制的线性分子。质粒可以复制,即在细胞中独立于原核细 胞拟核或类核存储的基因组遗传信息而扩增,并且可以用于克隆,即用于在细菌细胞中扩增遗传信息。优选地,根据本申请的DNA质粒是中拷贝或高拷贝质粒,更优选地是高拷贝质粒。此类高拷贝质粒的实例是这样的载体:其基于pUC、pTZ质粒或包含支持质粒高拷贝的ORI的任意其它质粒(例如pMB1、pCoIE1)等。
在一些实施方案中,所述DNA分子是构成原核生物拟核或类核的DNA分子或其片段,即所述包含前述Poly(A)尾的编码序列或其互补序列可随原核生物基因组进行复制。
在一些实施方案中,所述DNA分子在所述Poly(A)尾编码序列的5’端一侧进一步连接目的基因片段,所述目的基因片段与所述Poly(A)尾编码序列共同编码RNA。在一些实施方案中,所述目的基因片段与所述Poly(A)尾编码序列共同编码mRNA。所述目的基因片段包含蛋白、多肽或其片段的编码序列。在一些实施方案中,所述目的基因片段还包含转录后可用于启动或调控所述蛋白、多肽或其片段表达的元件的编码序列,所述元件包括但不限于5’UTR、3’UTR等。在一些实施方案中,所述目的基因片段包含至少一个非翻译区(UTR)的编码序列。在一些实施方案中,所述目的基因片段包含至少5'UTR的编码序列和所述蛋白、多肽或其片段的编码序列。在一些实施方案中,所述目的基因片段从5’至3’至少依次包含5'UTR的编码序列,蛋白、多肽或其片段的编码序列以及3'UTR的编码序列。所述蛋白、多肽或其片段的编码序列可被最终翻译为一个或多个蛋白、或一条或多条多肽,例如短肽、寡肽、多肽、融合蛋白、蛋白质及其片段,如已知蛋白质的部分,例如功能性部分。所述功能性部分可以是例如,蛋白质的生物活性部分或可以有效地产生抗体的抗原部分,例如抗原表位。所述蛋白、多肽或其片段的编码序列的两端分别包含起始密码子(5’端)和终止密码子(3’端),其分别是所述mRNA分子的可被翻译的前三个核苷酸和后三个核苷酸。5'UTR通常包含至少一个核糖体结合位点(RBS),如原核生物中的Shine‐Dalgarno序列,或至少一个翻译起始位点,如真核生物中的Kozak序列。RBS通过在翻译起始时募集核糖体来促进mRNA分子的有效且准确的翻译。可以通过改变给定的RBS或翻译启示位点的长度和序列以及距起始密码子的距离来优化其活性。可选地或任选地,5'UTR包括内部核糖体进入位点或IRES。3'UTR可包含一个或多个调控序列,如增强mRNA分子稳定性的氨基酸序列的结合位点、调控 RNA分子(如miRNA分子)的结合位点、和/或参与mRNA分子的胞内运输的信号序列。
在前述实施方案的基础上,在一些实施方式中,所述目的基因片段还包含一个或多个另外的调控序列,如增强mRNA分子稳定性的氨基酸序列的结合位点、增强mRNA分子翻译的氨基酸序列的结合位点、调节元件(如核糖开关)、调节RNA分子(如miRNA分子)的结合位点、和/或对翻译起始产生积极影响的核苷酸序列。此外,在5'UTR内,优选地不存在功能性的上游开放阅读框、框外上游翻译起始位点、框外上游起始密码子、和/或产生减少或防止翻译的二级结构的核苷酸序列。5'UTR中此类核苷酸序列的存在可对翻译产生负面影响。
所述蛋白、多肽或其片段的编码序列包含可以翻译成氨基酸序列的密码子。所述编码序列包含的全部密码子中,可以全部是天然存在的编码氨基酸的密码子,也可以有部分或全部由人工合成的密码子组成。在一些实施方案中,所述部分或全部密码子经过了密码子优化。在一些实施方案中,所述部分或全部密码子编码非天然氨基酸。
在一些实施方案中,所述DNA分子在述目的基因片段的5’端一侧还进一步包含可启动或调控所述RNA转录所必须的结构元件,所述结构元件是本领域已知的。在一些实施方案中,所述结构元件至少包含启动子。启动子及其序列是本领域已知的,包括弱启动子、中等强度启动子、强启动子、mini启动子或核心启动子等。在一些特定的实施方案中,所述启动子为强启动子。在一些实施方案中,所述启动子可在原核细胞中启动所述目的基因片段和/或Poly(A)尾转录。在一些实施方案中,所述启动子可以在真核细胞中启动所述目的基因片段和/或Poly(A)尾转录。所述“启动子”包含至少一个转录识别位点及其后的转录因子结合位点。所述识别和结合位点可以与介导或调节转录的氨基酸序列相互作用。与识别位点相比,结合位点更靠近前述目的基因片段。结合位点可以是,例如原核生物中的Pribnow框或真核生物中的TATA框。例如,在一些实施方案中,当使用Pribnow框时,所述转录识别位点可以位于转录起始位点上游约35bp处,而转录因子结合位点可以位于转录起始位点上游约10bp处。在一些实施方案中,所述启动子包含至少一个另外的调控元件,如位于转录起始位点之前约40和/或60个核苷酸处富含AT的上游元件,和/或位于识别位点和结合位点之间的增强启动子活性的另外的调控元件。在 一些实施方式中,所述启动子是强启动子,即所述启动子包含促进前述RNA编码序列转录的序列。强启动子是本领域技术人员已知的,例如来自大肠杆菌的RecA启动子衍生的OXB18、OXB19和OXB20启动子,或者可以通过常规实验室程序鉴定或合成。在一些实施方案中,所述启动子为T7启动子。在一些实施方案中,所述启动子前还包含另外的调控元件,如包含在DNA质粒中可促进前述RNA编码序列转录的增强子。
本申请还提供了包含前述工程化DNA分子的文库。在一些实施方案中,所述文库包含至少两条具有不同Poly(A)尾编码序列的DNA分子。
此外,本申请还提供了上述工程化DNA分子在稳定扩增Poly(A)尾编码序列或带有Poly(A)尾的RNA的编码序列中的用途。在一些实施方案中,所述扩增Poly(A)尾编码序列或带有Poly(A)尾的RNA的编码序列的方法如后文在体外稳定扩增Poly(A)尾转录模板DNA的方法所示。
工程化RNA及文库
本申请提供了一种RNA,其包含前述Poly(A)尾,以及Poly(A)尾编码序列5’端一侧的目的基因片段。在一些实施方案中,所述RNA还包含5'‐帽结构。在一些实施方案中,所述RNA为mRNA。
如本文所用“mRNA”(信使RNA)是编码至少一种蛋白质、多肽或其片段的,天然存在、非天然存在或经修饰的任何RNA,所述mRNA具备经翻译以在体外、体内、原位或离体产生所编码的蛋白质、多肽或其片段的能力。因此,所述mRNA可以是成熟的mRNA或成熟前的mRNA,其必须包含或选择性包含的元件或结构是本领域已知的。在一些实施方案中,所述mRNA包含多个必要的功能组件的编码序列,以表达、调节、或增强所述蛋白、多肽或其片段的表达水平。所述功能组件包括但不限于5’帽子、5’UTR、3’UTR等。5'UTR和3'UTR两者通常都从基因组DNA转录,并且是成熟前的mRNA就具有的元件。作为成熟mRNA,
术语“5’帽子”位于mRNA的5’最末端,包含甲基化鸟苷酸,所述甲基化鸟苷酸经焦磷酸连接于mRNA的5'末端,与其相邻的核苷酸形成5',5'-三磷酸连接。5’帽子结构通常有三种类型(m7G5'ppp5'Np、m7G5'ppp5'NmpNp、m7G5'ppp5'NmpNmpNp),分别称为O型、I型和II型。O型指末端核苷酸的核糖未甲基化,I型指末端一个核苷酸的核糖甲基化,II型指末端两个核苷酸的核糖均甲基化。在一些实施方案中,所述5’帽子可根据制造商的方案,使用 以下化学RNA帽类似物在体外转录反应期间同时完成对多核苷酸的5'加帽以产生5'‐鸟苷帽结构:3'‐O‐Me‐m7G(5')ppp(5')G[ARCA帽]、G(5')ppp(5')A、G(5')ppp(5')G、m7G(5')ppp(5')A、m7G(5')ppp(5')G(NewEnglandBioLabs、Ipswich、MA)、或m7G(5’)ppp(5’)(2’‐OMeA)pG(CleanCapAG)。例如在一些实施方案中,可使用牛痘病毒加帽酶在转录后完成对经修饰RNA的5'加帽,以产生O型帽子结构:m7G(5')ppp(5')G(New England BioLabs,Ipswich,MA)。可使用牛痘病毒加帽酶和2'‐O甲基‐转移酶两者来产生I型帽子结构,以产生m7G(5’)ppp(5’)(2’‐OMeA)pG。可从I型帽子结构接着使用2'‐O甲基‐转移酶对5'‐倒数第三核苷酸进行2'‐O‐甲基化来产生II型帽子结构。可从II型帽子结构接着使用2'‐O甲基‐转移酶对5'‐倒数第四核苷酸进行2'‐O‐甲基化来产生III型帽子结构。
在一些实施方案中,所述mRNA的部分或全部尿苷为化学修饰的尿苷。
在一些实施方案中,所述mRNA的部分或全部尿苷为假尿苷或1‐甲基‐假尿苷。
在一些实施方案中,所述mRNA的部分或全部尿嘧啶核苷酸被假尿苷(ψ)核苷酸或N1-甲基假尿苷(m1ψ)核苷酸取代。
在一些实施方案中,所述mRNA还包含稳定化元件。稳定化元件可包括例如组蛋白茎环。在一些实施方案中,所述mRNA包含编码区、至少一个组蛋白茎环和任选地poly(A)序列或多聚腺苷酸化信号。所述poly(A)序列或多聚腺苷酸化信号通常应增强所编码蛋白质的表达水平。在一些实施方案中,所述mRNA包含poly(A)序列或多聚腺苷酸化信号与至少一个组蛋白茎环的组合,尽管两者在自然界中具有替代机制,但其协同作用可使蛋白质表达增加至超过任一单个元件所观察到的水平。poly(A)与至少一个组蛋白茎环的组合的协同效应不依赖于元件的次序或poly(A)序列的长度。在一些实施方案中,所述组蛋白茎环通常源自组蛋白基因,并且包括由间隔区(由短序列组成)分隔的两个相邻部分或完全反向互补序列的分子内碱基配对形成环。未配对的环区通常无法与茎环元件中的任一者进行碱基配对。茎环结构的稳定性通常取决于长度、错配或膨出部的数目以及配对区的碱基组成。在一些实施方案中,可产生摆动碱基配对(非Watson‐Crick碱基配对)。在一些实施方案中,所述至少一个组蛋白茎环序列包含15至45个核苷酸长度。
在一些实施方案中,可将所述mRNA的一个或多个富含AU的序列去除。 这些序列有时称为AURES,其是在3'UTR中发现的去稳定化序列。可将AURES从mRNA中去除。或者,可使AURES保留在mRNA中。
在一些实施方案中,所述mRNA被配置于脂质纳米颗粒(LNP)内。在一些实施方案中,脂质于所述mRNA混合形成脂质纳米颗粒。在一些实施方案中,将RNA配制在脂质纳米颗粒中。在一些实施方案中,所述脂质纳米颗粒首先形成为空的脂质纳米颗粒,并且在即将施用之前(例如在几分钟至一小时内)与疫苗的mRNA组合或包裹。
所述脂质纳米颗粒通常包含可电离的脂质、非阳离子脂质、固醇和PEG脂质组分以及目标核酸,例如上述mRNA。可使用如本领域中通常已知的组分、组合物和方法来产生本公开的脂质纳米颗粒,参见例如PCT/US2016/052352、PCT/US2016/068300、PCT/US2017/037551、PCT/US2015/027400、PCT/US2016/047406、PCT/US2016000129、PCT/US2016/014280、PCT/US2016/014280、PCT/US2017/038426、PCT/US2014/027077、PCT/US2014/055394、PCT/US2016/52117、PCT/US2012/069610、PCT/US2017/027492、PCT/US2016/059575和PCT/US2016/069491,其全部通过引用整体并入本文。
本申请还提供了包含前述mRNA分子的文库。所述文库中包含至少两条具有不同Poly(A)尾的mRNA分子。
本申请还提供了所述mRNA及所述mRNA文库的用途。使用对mRNA的表达量具有不同影响力梯度的Poly(A)尾的至少两种及以上的mRNA分子,可用于调节前述蛋白质、多肽或其片段的编码序列的表达量。例如通过调节具有所述两种及以上的mRNA分子的文库中不同mRNA分子的比例,或通过在不同时间导入具有相同或不同含量的所述两种及以上的mRNA分子中的一种或多种。
细胞
本申请还提供了包含前述工程化DNA分子的细胞,所述DNA分子可在所述细胞中储存和/或扩增。在一些实施方案中,所述细胞为原核细胞,所述DNA分子可在所述原核细胞中复制。在一些实施方案中,所述细胞为原核细胞,所述DNA分子可在所述原核细胞中复制和/或转录。在一些实施方案中,所述DNA分子为真核细胞,所述DNA分子可在所述细胞中复制。在一些实施方案中,所述DNA分子可在所述DNA细胞中转录和/或复制。
在一些实施方案中,所述细胞为原核细胞。在一些实施方案中,所述细胞为细菌、放线菌、蓝细菌、支原体、立克次氏体和衣原体。在一些实施方案中,所述细胞选自:枯草芽孢杆菌、乳酸杆菌、醋酸杆菌、棒状杆菌、短杆菌、节杆菌、假单胞菌、及小球菌。在一些实施方案中,所述细胞为recA的细菌。在一些实施方案中,所述细胞为大肠杆菌。在一些实施方案中,所述细胞为大肠杆菌,所述大肠杆菌选自:K-12及其衍生菌株,以及B菌株及其衍生菌株。在一些实施方案中,所述大肠杆菌选自:MG1655、DH5或DH5α、DH10B、BL21、DB3.1、HB101、JM109、JM110、MC1061、MG1655、Pir1、Stbl2、Stbl3、Top10、XL1Blue、XL10Gold、BLR、HMS174、Tuner、Rostetta2、Lemo21、T7Express、Origami2等。在一些实施方案中,所述细胞选自链霉菌属、小单孢菌属和诺卡氏菌属。在一些实施方案中,所述细胞为真菌。在一些实施方案中,所述细胞选自酵母菌或霉菌。
方法
本申请提供了在体外稳定扩增Poly(A)尾转录模板DNA的方法,以减少所述DNA在细胞中大量复制时,其中Poly(A)尾转录模板序列的突变频率。所述方法,包括:扩增上述包含上述工程化DNA分子的细胞。
在一些实施方案中,在扩增所述细胞之前,还包含将所述工程化DNA分子引入所述细胞。在一些实施方案中,所述引入可以包含化学转化或电转化。在一些实施方案中,所述引入是所述细胞对所述工程化DNA分子的自然胞吞过程。
在一些实施方案中,在扩增所述细胞之后,还包含提取细胞DNA并通过体外转录合成上述RNA。在一些实施方案中,在扩增所述细胞后,还包含诱导所述RNA在所述细胞中转录,然后提取并分离其中的RNA。在一些实施方案中,还包含提取细胞DNA并将其转导入可转录所述RNA的第二细胞中。在一些实施方案中,所述转导包括向人体施用,所述施用选自:通过静脉内,腹膜内,皮下,颅内,鞘内,动脉内(例如经由颈动脉),肌内及瘤内注射或灌注。
此外,本申请还提供了调控蛋白表达的方法,包括:
将前述工程化DNA分子中的两种或多种在不同时间,和/或以不同的数量比例导入目的细胞;或
将前述RNA分子中的两种或多种在不同时间,和/或以不同的数量比例 导入目的细胞。
其中,所述两种或多种的前述工程化DNA分子,以及两种或多种的前述RNA分子具有不同的Poly(A)尾,所述Poly(A)尾对RNA的表达量具有不同影响力梯度。
在一些实施方案中,本申请还提供了调控蛋白表达的方法,包括将前述工程化DNA分子或前述RNA分子导入目的细胞。在一些实施方案中,所述DNA编码的Poly(A)尾及所述RNA包含的Poly(A)尾的编码序列包含选自下述的结构:
元件a-元件c-元件b-元件c-元件b-元件c-元件b-元件c-元件b;
元件b-元件c-元件b-元件c-元件a-元件d-元件b-元件c-元件b-元件c-元件b;
元件b-元件c-元件b-元件c-元件b-元件d-元件a-元件c;
元件a-元件d-元件b-元件c-元件b-元件c-元件b;或
元件b-元件c-元件b-元件c-元件b-元件d-元件a。
在一些实施方案中,所述DNA编码的Poly(A)尾及所述RNA包含的Poly(A)尾的编码序列包含选自下述的结构:
60A-G-19A-G-19A-G-19A-G-3A;
7A-C-18A-G-60A-元件d-7A-C-18A-G-14A;
60A-元件d-19A-G-19A-G-17A;
19A-G-19A-G-19A-元件d-60A;
19A-G-19A-G-19A-元件d-60A-G;
19A-G-19A-G-19A-元件d-60A;
19A-C-19A-C-19A-元件d-60A;及
19A-T-19A-T-19A-元件d-60A。
在一些实施方案中,所述DNA编码的Poly(A)尾及所述RNA包含的Poly(A)尾的编码序列包含的结构为:60A-元件d-19A-G-19A-G-17A或19A-G-19A-G-19A-元件d-60A。
其中上述“-”连接的两个元件之间直接相连,及所述两个元件之间不包含任何核苷酸。
上述Poly(A)尾的结构中以“yA”表示元件a或元件b中连续A的个数,所述y为自然数,例如19A,则表示包含19个连续的A;60A则表示包含60个连续的A。
在一些实施方案中,所述DNA编码的Poly(A)尾及所述RNA包含的Poly(A)尾的编码序列包含选自下述的多核苷酸序列,或由选自下述的任一个多核苷酸序列组成:SEQ ID NO:1-10。
应当理解,本申请包含本文所描述的各种方面、实施方案以及所述方面和/或实施方案的组合。以上描述以及随后的实施例旨在说明而不是限制本申请的范围。在本申请范围内的其他方面、改进和修改对于本申请所属领域的技术人员将是显而易见的。因此,本领域的普通技术人员应该认识到,本申请的范围还包括对所述方面和实施方案的所述改进和修改。
实施例
实施例1:Poly(A)尾的构建
使用常规基因工程方法,构建下表1所示的Poly(A)尾及编码所述Poly(A)尾的DNA序列。
表1.

实施例2:以荧光素酶编码序列为例测试Poly(A)功能
2.1编码mRNA的DNA分子在原核细胞中的复制稳定性测试
以荧光素酶为蛋白编码区,考察不同poly(A)变体的在大肠杆菌中的稳定性和对荧光素酶在细胞内表达的影响
1)构建包含荧光素酶蛋白编码区的通用载体
该通用载体以大肠杆菌克隆载体pUC57作为载体骨架,在其多克隆位点的Xba I酶切位点和EcoR I酶切位点之间按顺序排列T7启动子序列(5’-TAATACGACTCACTATAAGG-3’)、5’UTR、荧光素酶蛋白、3’UTR以及多聚腺苷酸串poly(dA:dT)。
2)将通用载体中的多聚腺苷酸串poly(dA:dT)分别替换为本申请的P1-P10以及对照C1-C4(A60-10nt spacer-A60(对照C1)、A30-10nt spacer-A70(对照C2)、A60-1nt spacer-A60(对照C3)或A60-6nt spacer-A60(对照C4)C1C2来源于文献专利(美国专利号US 10717982B2))。
合成构建P1-P10以及对照C1-C4所需的全部引物,用2个限制性内切酶进行双酶切以便从1)中构建的通用载体中切除poly(dA:dT),之后用T4DNA连接酶1将P1-P10以及C1-C4与切除poly(dA:dT)的载体连接,完成了对通用载体中poly(dA:dT)的替换。
3)检测不同poly(A)变体在大肠杆菌中的复制稳定性
步骤2)中构建的载体质粒经测序确认无误后,转入大肠杆菌DH5α,转化后的平板在30℃生长,完成质粒提取和测序工作。测序完成后根据测序结果分析和计算不同poly(A)变体的稳定性和碱基缺失情况;复制稳定性用未发生任何碱基变化的克隆数的占比来表示,占比越高说明质粒在大肠杆菌中的复制稳定性越高。
结果表明(见图1,图2),具体实验结果如下:
对照C1共测100个克隆,15个克隆发生碱基缺失,占比15%;正确克隆数85个,占比85%。
对照C2共测50个克隆,所有克隆均正确,没有任何碱基变化或缺失,正确克隆占比为100%。
对照C3共测50个克隆,9个克隆发生碱基缺失,占比18%;正确克隆数41个,占比82%。
对照C4共测50个克隆,14个克隆发生碱基缺失,占比28%;正确克隆数36个,占比72%。
P1共测100个克隆,9个克隆发生碱基缺失,占比9%;正确克隆数91个,占比91%。
P2共测100个克隆,12个克隆发生碱基缺失,占比12%;正确克隆数88个,占比88%。
P3共测62个克隆,5个克隆发生碱基缺失,占比8%;正确克隆数57个,占比92%。
P4共测50个克隆,其中3个克隆发生碱基缺失,占比6%;正确克隆数47个,占比94%。
P5共测50个克隆,其中4个克隆发生碱基缺失,占比8%;正确克隆数46个,占比92%。
P6共测50个克隆,5个克隆发生碱基缺失,占比10%;正确克隆数45个,占比90%。
P7共测50个克隆,7个克隆发生碱基缺失,占比14%,正确克隆数43个,占比86%。
P8共测50个克隆,3个克隆发生碱基缺失,占比6%,正确克隆数47个,占比94%。
P9共测50个克隆,4个克隆发生碱基缺失,占比8%,正确克隆数46个, 占比92%。
P10共测50个克隆,5个克隆发生碱基缺失,占比10%,正确克隆数45个,占比90%。
综合正确克隆占比和缺失碱基数两个方面的结果,本申请设计的poly(A)变体在大肠杆菌细胞内的复制稳定性方面优于现有技术或与现有技术相当。其中poly(A)变体P3、P4、P8的复制稳定性最高,P3、P4、P8的复制稳定性与C2相当,没有统计学意义的差别(p>0.05,χ2test),并且P3、P4、P8的复制稳定性优于对照C1、C3、C4,且差异在统计学上意义显著(p<0.05,χ2test)。
实施例3以HPV抗原蛋白编码序列为例测试poly(A)功能
1)以HPV为蛋白编码区的质粒构建
如上,实施例1中构建了荧光素酶编码基因组合不同poly(A)的载体。在这些载体的基础上,通过常规分子克隆方法将荧光素酶编码基因替换为HPV编码基因,主要元件按顺序排列T7启动子序列(5’-TAATACGACTCACTATAAGG-3’)、5’UTR、HPV抗原蛋白编码序列、3’UTR以及poly(A)编码序列。
2)小规模细菌培养,检测HPV的4个poly(A)变体在大肠杆菌中的稳定性
步骤1)中构建的含有P1、P2、P3和P4的HPV载体质粒经测序确认无误后,转入大肠杆菌DH5α,转化后的平板在30℃生长,完成质粒提取和测序工作。测序完成后根据测序结果分析和计算不同poly(A)变体的稳定性和碱基缺失情况;稳定性用未发生任何碱基变化的克隆数的占比来表示,占比越高越稳定。
结果表明(图3,图4),当目的基因更换为HPV抗原编码序列后,
对照C1共测88个克隆,47个克隆发生碱基缺失,占比53%,正确克隆数41个,占比47%。
对照C2共测50个克隆,1个克隆发生碱基缺失,占比2%,正确克隆数49个,占比98%。
P1共测101个克隆,14个克隆发生碱基缺失,占比14%;正确克隆数87个,占比86%。
P2共测100个克隆,12个克隆发生碱基缺失,占比12%,正确克隆数88个,占比88%。
P3共测70个克隆,4个克隆发生碱基缺失,占比6%,正确克隆数66个,占比94%。
P4共测50个克隆,9个克隆发生多个碱基缺失,占比18%;正确克隆数41个,占比82%。
综合发生突变的比例及平均碱基缺失数目,当蛋白编码基因由实施例1中的荧光素酶换成HPV抗原蛋白后,本申请不同的poly(A)变体仍然保持着高复制稳定性;其中P3与C2稳定性相当,没有统计学上的显著差异(p>0.05,χ2test),与本申请的其他组新变体相比,克隆稳定性最优。且C2发生大片段缺失的几率为1/50=2%,而P3发生大片段缺失的几率为1/70=1.4%,由于poly(A)大片段缺失会影响mRNA产品的体内表达和药效,因此,P3更符合产品需求。以上结果说明本申请设计的poly(A)变体在针对不同蛋白编码区的实施例中的应用具有普遍性。
如上所述,实施例1和实施例2中,大肠杆菌转化后的平板在30℃生化培养箱中培养过夜,得到的克隆进行测序检测以评估复制稳定性。此外,由于大肠杆菌培养温度影响DNA复制速度,也会影响复制稳定性。针对实施例3,本申请还比较了大肠杆菌转化后的平板在37℃生化培养箱中培养过后进行测序检测。结果表明(图5),37℃培养大大增加了对照C1的碱基缺失比例,由30℃培养时的53%上升至98%。与对照不同,P1、P2和P3在两个温度条件下的突变率无明显差异,说明本申请设计的poly(A)变体在针对大肠杆菌不同温度培养条件的实施例中仍然具有高复制稳定性,其应用具有普遍性。
3)检测3个poly(A)变体在大规模发酵培养及不同代数之间的稳定性
mRNA药物的生产必须依赖大规模发酵来制备足量的模板质粒,而发酵中质粒的稳定性(本申请中,质粒的稳定性特指poly dA:dT的稳定性)对生产质量均一的mRNA药物至关重要。另一方面,为满足不同批次生产的稳定性需求,需要建立含有目的质粒的菌种库,包括一级库、二级库等不同代数的菌种库,因此需要评估不同代数大肠杆菌中的质粒稳定性。针对上述两个方面的问题,本申请在实施例3中检测了发酵过程的不同代数之间P1、P3的稳定性,根据测序结果,每个poly(A)变体选择4个正确的大肠杆菌克隆分别通过发酵进行传代培养。结果表明,在种子传代的第3、5、7、和第9代,P1、P3的4个克隆的质粒保持稳定,未发生任何碱基变化。
实施例4荧光素酶mRNA在小鼠体内表达水平测试
真核生物细胞中,一定长度的poly(A)尾巴对保护mRNA 3’端、维持mRNA稳定性和促进蛋白表达至关重要。受体内生理或环境因素的影响,poly(A)逐渐变短,从而引发mRNA降解。本申请考察了不同poly(A)变体对蛋白表达水平的影响。我们采取小鼠体内表达检测用于评估不同polyA变体对荧光素酶活性的影响。本申请实施过程中,含有对照C2、以及P3、P4、P5、P8、P9的荧光素酶mRNA-LNP通过肌肉注射的方式进入小鼠体内,注射6h后进行动物成像并对荧光强度进行定量比较不同poly(A)对荧光素酶体内活性的影响。具体实验过程如下:
通过体外转录合成荧光素酶mRNA:用II型限制性内切酶BspQ I酶切处理得到线性化DNA,线性化后的DNA 3’端是不同的poly(A):对照C2、P3、P4、P5、P8或P9。线性化DNA作为模板进行体外转录,100μl反应体系中包含1X反应缓冲液,ATP、CTP、N1M-UTP、GTP各5mM(终浓度),CleanCap AG 4mM(终浓度),体外转录酶5μl。反应混合物充分混匀后,在37℃反应3h。体外转录合成的mRNA用LiCl沉淀法收集,最终用无酶水溶解。
动物实验:体外合成的荧光素酶mRNA包载成LNP,将得到的mRNA原液分别分散于20mM醋酸溶液(pH 5.0),得到mRNA浓度为200μg/mL的RNA溶液。按照可电离脂:胆固醇:DSPC:DMG-PEG2000=50:38.5:10:1.5的摩尔比进行混合成混脂。控制水相和油相流速通过T混流的方式,使mRNA与脂质混合物进行混合,启动注射泵,使mRNA溶液与脂质混合物混合形成LNP。然后用稀释液进行10倍稀释,经过超滤管离心浓缩后进行三次溶液置换。将上述得到的溶液,加入Tris水溶液调pH至7.0~8.0,得到LNP包载的mRNA溶液。LNP即脂质纳米颗粒。利用Ribogreen RNA定量试剂盒(Invitrogen,R11490)和达尔文ZetaSizer粒径仪分别测定LNP包载的mRNA浓度和粒径。4组分LNP中,各个组分的摩尔比为SM102:DSPC:胆固醇:DMG-PEG2000=50:10:38.5:1.5。包载完成后通过测定粒径、包封率、PDI等指标对LNP进行质控,质控结果表明制备的LNP符合粒径范围50nm-150nm,PDI<0.3的标准,而且包封率>90%,能够用于后续实验。通过ribogreen法测定LNP中的mRNA含量,然后将LNP稀释到mRNA含量为100ng/μL。
BALB/c小鼠按照体重随机分组,适应性饲养2-3天后进行给药,对照C2以及P3、P4、P5、P8、P9各注射5只小鼠,每只小鼠以肌肉注射的方式给药 100μl(10μg mRNA),对照组用PBS注射5只小鼠。注射6h后进行动物成像,并计算荧光值。结果表明(图7-8),与对照C2相比,P3的表达活性显著提高了1.8倍,具有统计学意义的显著性差异(p<0.01,student’s t-test),P4、P5、P8、P9与C2表达水平相当,无显著性差异(p>0.05,student’s t-test)。
本申请以上实施例中使用的序列示于如下序列表中。应当理解,以下序列仅为本申请实施方案的示例性序列,而非对本申请方案的任何限制。以下序列表中的核酸序列可表示DNA序列或RNA序列,当其表示RNA序列时,其中的“T”代表尿苷。
序列表:




Claims (51)

  1. 一种可在细胞中复制的工程化DNA分子,其包含多聚腺苷尾(Poly A尾)编码序列,所述Poly(A)尾编码序列包含唯一的元件a和至少一个元件b,以及至少一个元件c和/或至少一个元件d:
    所述元件a由多个连续的腺嘌呤(A)核苷酸组成,所述元件a的长度范围为≥20nt;
    所述元件b由多个连续的A核苷酸组成,所述元件b的长度范围为3nt≤b<20nt;
    所述元件c由一个非A的核苷酸组成,所述核苷酸选自T、C、G核苷酸;
    所述元件d由任意两个或更多个连续的核苷酸组成,所述核苷酸选自A、T、C、G核苷酸,其中元件d的5’及3’末端的核苷酸不为A核苷酸,并且其不包含3个以上连续的A核苷酸;所述元件d的长度范围为2nt≤d≤20nt;
    其中所述元件a和元件b不相邻、元件c和元件d不相邻,
    且所述Poly(A)尾编码序列不包含彼此相邻的元件b、不包含彼此相邻的元件c、且不包含彼此相邻的元件d。
  2. 根据权利要求1所述的DNA分子,所述Poly(A)尾编码序列的长度为101至200nt或101-150nt或120至150nt或130-140nt或120-135nt或123-125nt。
  3. 根据权利要求1或2所述的DNA分子,所述Poly(A)尾编码序列3’末端的为A核苷酸或非A核苷酸。
  4. 根据权利要求1-3中任一项所述的DNA分子,其中所述元件a≤80nt。
  5. 根据权利要求1-3中任一项所述的DNA分子,其中所述元件a为30至70nt、35至65nt、40至60nt、或45至55nt,优选60nt。
  6. 根据权利要求1-5中任一项所述的DNA分子,其中所述元件a的50%或以上的多核苷酸位于所述Poly(A)尾编码序列的5’部分或3’部分。
  7. 根据权利要求1-6中任一项所述的DNA分子,其中所述元件b为3nt至10nt、10至19nt、12至15nt、14nt至17nt或16至19nt,优选19nt。
  8. 根据权利要求1-7中任一项所述的DNA分子,其中所述元件b的个数为2-10个,优选2-5个,进一步优选3个。
  9. 根据权利要求1-8中任一项所述的DNA分子,其中所述元件c为G。
  10. 根据权利要求1-9中任一项所述的DNA分子,其中所述元件c的个数 为2至10个、3个至8个、或4至6个或2-5个,优选2个。
  11. 根据权利要求1-10中任一项所述的DNA分子,其中所述元件d包含回文序列。
  12. 根据权利要求1-11中任一项所述的DNA分子,其中所述元件d为3至18nt、5至16nt、4至10nt、或6至12nt,优选6nt。
  13. 根据权利要求1-12中任一项所述的DNA分子,其中所述元件d选自以下序列中的任一种或多种:GATATC(SEQ ID NO:15)、GTATAC(SEQ ID NO:16)、GAATCT(SEQ ID NO:17)、GCATATGACT(SEQ ID NO:18)及GATATCGTATAC(SEQ ID NO:19)。
  14. 根据权利要求1-13中任一项所述的DNA分子,其中所述元件d选自如下核苷酸序列中的任一种或多种:SEQ ID NO:15、SEQ ID NO:16、及SEQ ID NO:17。
  15. 根据权利要求1-14中任一项所述的DNA分子,其中所述元件d的核苷酸序列如SEQ ID NO:15所示。
  16. 根据权利要求1-15中任一项所述的DNA分子,其中所述元件d的个数为0-5,优选1-3个,进一步优选1个。
  17. 根据权利要求1-16中任一项所述的DNA分子,其中当元件c和元件d同时存在时,所述元件c和元件d的个数总和为2-15个,优选3-5个,进一步优选3个。
  18. 根据权利要求1-17中任一项所述的DNA分子,其中在Poly(A)尾编码序列的3’部分,优选所述Poly(A)尾编码序列靠近3’末端的1/2部分,包含一个或多个非A核苷酸。
  19. 根据权利要求1-18中任一项所述的DNA分子,所述Poly(A)尾编码序列的结构为:
    元件a-元件c-元件b-元件c-元件b-元件c-元件b-元件c-元件b;
    元件b-元件c-元件b-元件c-元件a-元件d-元件b-元件c-元件b-元件c-元件b;
    元件b-元件c-元件b-元件c-元件b-元件d-元件a-元件c;
    元件a-元件d-元件b-元件c-元件b-元件c-元件b;或
    元件b-元件c-元件b-元件c-元件b-元件d-元件a。
  20. 根据权利要求1-19中任一项所述的DNA分子,所述Poly(A)尾编码序列的结构为:
    元件a-元件d-元件b-元件c-元件b-元件c-元件b;所述元件a为60nt长度,元件b为16至19nt长度,元件d为6nt长度。
  21. 根据权利要求1-20中任一项所述的DNA分子,所述Poly(A)尾编码序列的结构为:
    元件b-元件c-元件b-元件c-元件b-元件d-元件a;所述元件a为60nt长度,元件b为16至19nt长度,元件d为6nt长度。
  22. 根据权利要求1-21中任一项所述的DNA分子,所述Poly(A)尾编码序列如SEQ ID NO:1至10中任一项所示。
  23. 根据权利要求1-22中任一项所述的DNA分子,所述Poly(A)尾编码序列如SEQ ID NO:3或SEQ ID NO:4所示。
  24. 根据权利要求1-23中任一项所述的DNA分子,其在Poly(A)尾编码序列的5’端一侧进一步连接目的基因片段,所述目的基因片段与所述Poly(A)尾编码序列共同编码RNA。
  25. 根据权利要求1-24中任一项所述的DNA分子,其进一步包含复制子。
  26. 根据权利要求1-25中任一项所述的DNA分子,其进一步包含抗性基因。
  27. 根据权利要求1-26中任一项所述的DNA分子,其进一步包含启动所述RNA转录的启动子。
  28. 根据权利要求1-27中任一项所述的DNA分子,所述目的基因片段包括5’UTR编码序列。
  29. 根据权利要求1-28中任一项所述的DNA分子,所述目的基因片段包括蛋白编码序列或非蛋白编码序列。
  30. 根据权利要求1-29中任一项所述的DNA分子,所述目的基因片段包括3’UTR编码序列。
  31. 根据权利要求1-30中任一项所述的DNA分子包含复制子、抗生素抗性基因、启动子、5’UTR编码序列、蛋白编码序列、及3’UTR编码序列。
  32. 根据权利要求1-31的任一项所述的DNA分子,其中所述蛋白编码序列编码HPV(人乳头瘤病毒)蛋白,优选地所述HPV蛋白源自16型和/或18型HPV。
  33. 根据权利要求1-32中任一项所述的DNA分子,其中所述蛋白编码序列编码HPV的E2、E6或E7蛋白,E6和E7蛋白多肽片段的融合蛋白或E2、E6 和E7蛋白多肽片段的融合蛋白,优选地所述HPV蛋白源自16型和/或18型HPV。
  34. 根据权利要求1-33中任一项所述的DNA分子,其中所述蛋白编码序列编码如SEQ NO:26所示的多肽。
  35. 根据权利要求1-34中任一项所述的DNA分子,其包含如SEQ ID NO:22-25中任一项所示的多核苷酸序列,或如SEQ ID NO:22-25中任一项所示的多核苷酸序列的同义突变体,或与SEQ ID NO:22-25中任一项所示的多核苷酸序列或其同义突变体具有85%以上序列同一性的多核苷酸序列。
  36. 根据权利要求1-35中任一项所述的DNA分子,所述DNA分子为DNA质粒。
  37. 包含根据权利要求1-36中任一项所述的DNA分子的细胞。
  38. 根据权利要求37的细胞为原核细胞。
  39. 根据权利要求37的细胞为大肠杆菌。
  40. 根据权利要求1-36中任一项所述的DNA分子编码的RNA分子。
  41. 根据权利要求40所述的RNA分子,所述RNA还包含5'‐帽结构和/或所述RNA的部分或全部尿苷为化学修饰的尿苷;优选的所述RNA的部分或全部尿苷为假尿苷或1‐甲基‐假尿苷。
  42. 根据权利要求1-36中任一项所述的Poly(A)尾的DNA编码序列。
  43. 根据权利要求1-36中任一项所述的Poly(A)尾的DNA编码序列所编码的Poly(A)尾序列。
  44. 将根据权利要求42的Poly(A)尾的DNA编码序列用于使编码RNA的DNA分子在宿主细胞中的复制更加保守的用途,其中所述Poly(A)尾位于所述RNA的3’末端。
  45. 根据权利要求44所述的用途,所述宿主细胞为原核细胞,优选大肠杆菌。
  46. 根据权利要求43所述的Poly(A)尾序列用于调控RNA分子在宿主细胞中表达的用途,其中所述Poly(A)尾位于所述RNA的3’末端。
  47. 根据权利要求46所述的用途,所述宿主细胞为真核细胞,优选哺乳动物细胞,进一步优选人细胞。
  48. 包含根据权利要求1-36中任一项所述的DNA分子的文库。
  49. 包含根据权利要求1-36中任一项所述的DNA分子所编码的RNA分子 的文库。
  50. 调控蛋白表达的方法,包括:
    将根据权利要求48的DNA文库中的多种DNA在不同时间,和/或以不同的数量比例导入目的细胞;或
    将权利要求49的RNA文库中的多种RNA在不同时间,和/或以不同的数量比例导入目的细胞。
  51. DNA与RNA的杂合分子,其携带与根据权利要求1-36中任一项所述的DNA分子相同的遗传信息,与根据权利要求42所述的Poly(A)尾相同的遗传信息,或与根据权利要求40所述的RNA分子相同的遗传信息。
PCT/CN2024/079346 2023-03-01 2024-02-29 编码rna的工程化dna分子 Ceased WO2024179544A1 (zh)

Priority Applications (6)

Application Number Priority Date Filing Date Title
AU2024229819A AU2024229819A1 (en) 2023-03-01 2024-02-29 Engineered dna molecule for coding rna
KR1020257029385A KR20250161542A (ko) 2023-03-01 2024-02-29 Rna를 코딩하기 위한 조작된 dna 분자
JP2025549603A JP2026507667A (ja) 2023-03-01 2024-02-29 Rnaをコードするための工学的に操作されたdna分子
EP24763224.3A EP4674967A1 (en) 2023-03-01 2024-02-29 Engineered dna molecule for coding rna
US19/083,930 US20250215441A1 (en) 2023-03-01 2025-03-19 Engineered dna molecule for coding rna
MX2025010281A MX2025010281A (es) 2023-03-01 2025-08-29 Molécula de adn diseñada para codificar arn

Applications Claiming Priority (4)

Application Number Priority Date Filing Date Title
CNPCT/CN2023/079037 2023-03-01
CN2023079037 2023-03-01
CN202410205083 2024-02-23
CN202410205083.1 2024-02-23

Related Child Applications (1)

Application Number Title Priority Date Filing Date
US19/083,930 Continuation US20250215441A1 (en) 2023-03-01 2025-03-19 Engineered dna molecule for coding rna

Publications (1)

Publication Number Publication Date
WO2024179544A1 true WO2024179544A1 (zh) 2024-09-06

Family

ID=92533833

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/CN2024/079346 Ceased WO2024179544A1 (zh) 2023-03-01 2024-02-29 编码rna的工程化dna分子

Country Status (10)

Country Link
US (1) US20250215441A1 (zh)
EP (1) EP4674967A1 (zh)
JP (1) JP2026507667A (zh)
KR (1) KR20250161542A (zh)
CN (1) CN118581123A (zh)
AU (1) AU2024229819A1 (zh)
CL (1) CL2025002618A1 (zh)
MX (1) MX2025010281A (zh)
TW (1) TW202440933A (zh)
WO (1) WO2024179544A1 (zh)

Citations (8)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20170166905A1 (en) * 2014-07-11 2017-06-15 Biontech Rna Pharmaceuticals Gmbh Stabilization of poly(a) sequence encoding dna sequences
CN107073070A (zh) * 2014-08-15 2017-08-18 格纳西尼有限公司 治疗宫颈癌的方法
CN111212908A (zh) * 2017-08-15 2020-05-29 因特利亚治疗公司 编码信使核糖核酸(mRNA)的稳定核酸
CN112805386A (zh) * 2018-10-11 2021-05-14 埃泽瑞斯公司 含有编码具有节段多聚(A)尾的mRNA的序列的质粒
CN113088519A (zh) * 2019-12-23 2021-07-09 北京原基品德生物科技有限公司 一种分离的核酸分子及其应用
CN115698303A (zh) * 2020-03-17 2023-02-03 Abion株式会社 体外转录物mrna和包含其的药物组合物
WO2023022490A1 (ko) * 2021-08-17 2023-02-23 에스케이바이오사이언스 주식회사 폴리 a 테일을 안정적으로 유지하는 방법
WO2024041641A1 (zh) * 2022-08-26 2024-02-29 深圳瑞吉生物科技有限公司 在宿主细胞中稳定存在的多段化PolyA多核苷酸及其应用

Family Cites Families (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
KR20240098596A (ko) * 2022-12-21 2024-06-28 엘지디스플레이 주식회사 표시 장치 및 표시 패널
AU2023414262B2 (en) * 2022-12-29 2025-08-14 Rinuagene Biotechnology Co., Ltd. Polynucleotide molecule for preventing or treating hpv infection-related diseases

Patent Citations (9)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20170166905A1 (en) * 2014-07-11 2017-06-15 Biontech Rna Pharmaceuticals Gmbh Stabilization of poly(a) sequence encoding dna sequences
US10717982B2 (en) 2014-07-11 2020-07-21 Biontech Rna Pharmaceuticals Gmbh Stabilization of poly(A) sequence encoding DNA sequences
CN107073070A (zh) * 2014-08-15 2017-08-18 格纳西尼有限公司 治疗宫颈癌的方法
CN111212908A (zh) * 2017-08-15 2020-05-29 因特利亚治疗公司 编码信使核糖核酸(mRNA)的稳定核酸
CN112805386A (zh) * 2018-10-11 2021-05-14 埃泽瑞斯公司 含有编码具有节段多聚(A)尾的mRNA的序列的质粒
CN113088519A (zh) * 2019-12-23 2021-07-09 北京原基品德生物科技有限公司 一种分离的核酸分子及其应用
CN115698303A (zh) * 2020-03-17 2023-02-03 Abion株式会社 体外转录物mrna和包含其的药物组合物
WO2023022490A1 (ko) * 2021-08-17 2023-02-23 에스케이바이오사이언스 주식회사 폴리 a 테일을 안정적으로 유지하는 방법
WO2024041641A1 (zh) * 2022-08-26 2024-02-29 深圳瑞吉生物科技有限公司 在宿主细胞中稳定存在的多段化PolyA多核苷酸及其应用

Non-Patent Citations (1)

* Cited by examiner, † Cited by third party
Title
See also references of EP4674967A1

Also Published As

Publication number Publication date
US20250215441A1 (en) 2025-07-03
CL2025002618A1 (es) 2025-12-19
JP2026507667A (ja) 2026-03-04
AU2024229819A1 (en) 2025-09-04
TW202440933A (zh) 2024-10-16
KR20250161542A (ko) 2025-11-17
CN118581123A (zh) 2024-09-03
EP4674967A1 (en) 2026-01-07
MX2025010281A (es) 2025-10-01

Similar Documents

Publication Publication Date Title
Trepotec et al. Segmented poly (A) tails significantly reduce recombination of plasmid DNA without affecting mRNA translation efficiency or half-life
JP6625521B2 (ja) 環状rnaの細胞内翻訳
EP4359521A2 (en) Compositions and methods for improved protein translation from recombinant circular rnas
CN117568337B (zh) 一种基于人源tRNA的自环化RNA
JP2025062016A (ja) 核酸コンカテマー由来産物の発現
CN116004696A (zh) 可与IRES组合的3ˊUTR加茎环结构基因及其应用、mRNA表达系统
JP2023505188A (ja) 核酸組成物
EP4642914A1 (en) Compositions and methods for generating circular rna
CN117844806B (zh) 一种5’-utr元件及其在提高蛋白表达量中的应用
US20260092293A1 (en) Expression-adjustable engineered rna molecule
WO2024179544A1 (zh) 编码rna的工程化dna分子
WO2025241623A1 (zh) 一种翻译调控元件、表达载体、药物组合物、疫苗组合物、构建方法、表达目的蛋白的方法及应用
CA3284145A1 (en) Engineered dna molecule for coding rna
HK40112587A (zh) 编码rna的工程化dna分子
CN121294433A (zh) 一种polyA尾部元件、构建体及其应用
HK40115559A (zh) 表达可调控的工程化rna分子
CN121801903A (zh) 一种自组装自复制rna系统及其应用
EP4208551A1 (en) Rna-aptamer-sensors
CN121801893A (zh) 提高mRNA的PolyA尾遗传稳定性的方法
CN119823983A (zh) 一种提高无帽无尾线性mRNA翻译效率的元件

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 24763224

Country of ref document: EP

Kind code of ref document: A1

WWE Wipo information: entry into national phase

Ref document number: AU2024229819

Country of ref document: AU

ENP Entry into the national phase

Ref document number: 2025549603

Country of ref document: JP

Kind code of ref document: A

WWE Wipo information: entry into national phase

Ref document number: 2025549603

Country of ref document: JP

WWE Wipo information: entry into national phase

Ref document number: P2025-02715

Country of ref document: AE

WWE Wipo information: entry into national phase

Ref document number: MX/A/2025/010281

Country of ref document: MX

ENP Entry into the national phase

Ref document number: 1020257029385

Country of ref document: KR

Free format text: ST27 STATUS EVENT CODE: A-0-1-A10-A15-NAP-PA0105 (AS PROVIDED BY THE NATIONAL OFFICE)

ENP Entry into the national phase

Ref document number: 2024229819

Country of ref document: AU

Date of ref document: 20240229

Kind code of ref document: A

REG Reference to national code

Ref country code: BR

Ref legal event code: B01A

Ref document number: 112025017585

Country of ref document: BR

WWE Wipo information: entry into national phase

Ref document number: 202592395

Country of ref document: EA

WWE Wipo information: entry into national phase

Ref document number: 11202505708U

Country of ref document: SG

WWP Wipo information: published in national office

Ref document number: 11202505708U

Country of ref document: SG

WWE Wipo information: entry into national phase

Ref document number: 2024763224

Country of ref document: EP

WWP Wipo information: published in national office

Ref document number: MX/A/2025/010281

Country of ref document: MX

NENP Non-entry into the national phase

Ref country code: DE

ENP Entry into the national phase

Ref document number: 2024763224

Country of ref document: EP

Effective date: 20251001

WWP Wipo information: published in national office

Ref document number: 1020257029385

Country of ref document: KR

ENP Entry into the national phase

Ref document number: 2024763224

Country of ref document: EP

Effective date: 20251001

ENP Entry into the national phase

Ref document number: 2024763224

Country of ref document: EP

Effective date: 20251001

WWP Wipo information: published in national office

Ref document number: 2024763224

Country of ref document: EP