WO2024251229A1 - Cas酶及其系统和应用 - Google Patents
Cas酶及其系统和应用 Download PDFInfo
- Publication number
- WO2024251229A1 WO2024251229A1 PCT/CN2024/097935 CN2024097935W WO2024251229A1 WO 2024251229 A1 WO2024251229 A1 WO 2024251229A1 CN 2024097935 W CN2024097935 W CN 2024097935W WO 2024251229 A1 WO2024251229 A1 WO 2024251229A1
- Authority
- WO
- WIPO (PCT)
- Prior art keywords
- sequence
- cas enzyme
- nucleic acid
- cas
- cell
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Ceased
Links
Classifications
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N9/00—Enzymes; Proenzymes; Compositions thereof; Processes for preparing, activating, inhibiting, separating or purifying enzymes
- C12N9/14—Hydrolases (3)
- C12N9/16—Hydrolases (3) acting on ester bonds (3.1)
- C12N9/22—Ribonucleases [RNase]; Deoxyribonucleases [DNase]
- C12N9/222—Clustered regularly interspaced short palindromic repeats [CRISPR]-associated [CAS] enzymes
- C12N9/226—Class 2 CAS enzyme complex, e.g. single CAS protein
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N15/00—Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
- C12N15/09—Recombinant DNA-technology
- C12N15/11—DNA or RNA fragments; Modified forms thereof; Non-coding nucleic acids having a biological activity
- C12N15/113—Non-coding nucleic acids modulating the expression of genes, e.g. antisense oligonucleotides; Antisense DNA or RNA; Triplex- forming oligonucleotides; Catalytic nucleic acids, e.g. ribozymes; Nucleic acids used in co-suppression or gene silencing
-
- C—CHEMISTRY; METALLURGY
- C07—ORGANIC CHEMISTRY
- C07K—PEPTIDES
- C07K2319/00—Fusion polypeptide
- C07K2319/01—Fusion polypeptide containing a localisation/targetting motif
- C07K2319/09—Fusion polypeptide containing a localisation/targetting motif containing a nuclear localisation signal
-
- C—CHEMISTRY; METALLURGY
- C07—ORGANIC CHEMISTRY
- C07K—PEPTIDES
- C07K2319/00—Fusion polypeptide
- C07K2319/80—Fusion polypeptide containing a DNA binding domain, e.g. Lacl or Tet-repressor
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N2310/00—Structure or type of the nucleic acid
- C12N2310/10—Type of nucleic acid
- C12N2310/20—Type of nucleic acid involving clustered regularly interspaced short palindromic repeats [CRISPR]
Definitions
- the present application relates to the field of biomedicine, and specifically to a Cas enzyme and its system and application.
- CRISPR-Cas Clustered regularly interspaced short palindromic repeats
- Cas CRISPR-associated genes
- Components of the system involved in host defense include one or more effector proteins capable of modifying DNA or RNA and RNA guide elements responsible for targeting the activity of these proteins to specific sequences on phage DNA or RNA, which can be reprogrammed to target alternative DNA or RNA targets.
- CRISPR-Cas systems can be roughly divided into two categories: Class 1 systems are composed of multiple effector proteins, and Class 2 systems are composed of a single effector protein, which is complexed with an RNA guide to target a DNA or RNA substrate.
- Class 1 systems are composed of multiple effector proteins
- Class 2 systems are composed of a single effector protein, which is complexed with an RNA guide to target a DNA or RNA substrate.
- the single-subunit effector composition of Class 2 systems provides a simpler set of components for engineering and application conversion, and has been an important source of programmable effectors to date.
- the characterization and engineering of Class 2 CRISPR-Cas systems, such as CRISPR-Cas9 have paved the way for diverse and extensive biotechnological applications in genome editing and other aspects.
- nucleic acids and polynucleotides i.e., DNA, RNA, or any hybrid, derivative, or modification thereof.
- the present application provides an isolated Cas enzyme, the Cas enzyme comprising an amino acid sequence as shown in any one of SEQ ID NOs: 1-58 or a sequence having at least about 80% identity with the amino acid sequence as shown in any one of the SEQ ID NOs: 1-58.
- the Cas enzyme comprises an amino acid sequence as shown in any one of the SEQ ID NOs: 1-58 having about 80%, about 81%, about 82%, about 83%, about 84%, about 85%, about 86%, about 87%, about 88%, about 89%, about 90%, about 91%, about 92%, about 93%, about 94%, about 95%, about 96%, about 97%, about 98%, about 99%, or a sequence having about 100% identity.
- the present application provides a Cas enzyme comprising an amino acid sequence as shown in the following table:
- the Cas enzyme has a catalytically active domain capable of binding to a target DNA chain and/or a catalytically active domain capable of cleaving the target DNA chain.
- the catalytically active domain comprises one or more amino acid changes, so that the Cas enzyme has only the activity of binding to the target DNA chain, or has the activity of binding to the target DNA chain and the activity of cutting the target DNA single strand.
- the present application provides a fusion molecule, which comprises the Cas enzyme described in the present application and one or more heterologous functional domains.
- the one or more heterologous functional domains are capable of regulating the expression of one or more gene products.
- the one or more heterologous functional domains are fused directly or indirectly to the Cas enzyme.
- the one or more heterologous functional domains are selected from the group consisting of an autohelicase, a nuclease, a helicase-nuclease, a DNA methyltransferase, a DNA hydroxymethylase, a histone methylase, a histone demethylase, a histone acetyltransferase, a histone deacetylase, a phosphatase, a kinase, a transcription (co)activator, a transcription repressor, a DNA binding protein, a DNA structural protein, a marker protein, a reporter protein, a fluorescent protein, a ligand binding protein, a signal peptide, a subcellular localization sequence, an antibody epitope, and an affinity purification tag.
- an autohelicase a nuclease, a helicase-nuclease, a DNA methyltransferase, a DNA hydroxymethylase,
- the one or more heterologous functional domains have one or more of the following activities: methylase activity, demethylase activity, deaminase activity, transcription activation activity, transcription repression activity, transcription release factor activity, reverse transcriptase activity, histone modification activity, RNA cleavage activity and nucleic acid binding activity.
- the present application provides an engineered, programmable, non-naturally occurring CRISPR-Cas system, the system comprising the Cas enzyme described in the present application or the fusion molecule described in the present application, and one or more guide RNAs, wherein the one or more guide RNAs target the loci of nucleic acid molecules encoding one or more gene products in a cell, thereby guiding the Cas enzyme or the fusion molecule to bind to and/or cut the loci of the nucleic acid molecules encoding one or more gene products; and the Cas enzyme or the fusion molecule and the guide RNA do not exist naturally together.
- the present application provides an engineered, non-naturally occurring vector system, the vector system comprising one or more vectors, the one or more vectors comprising: a) a first regulatory element, the first regulatory element being operably linked to one or more guide RNAs, the one or more guide RNAs being capable of hybridizing with a target sequence in a locus of a nucleic acid molecule encoding one or more gene products, and b) a second regulatory element, the second regulatory element being operably linked to the Cas enzyme described in the present application or the fusion molecule described in the present application, wherein the components a) and b) are located on the same or different vectors of the vector system, and the guide RNA targets the locus of the nucleic acid molecule encoding one or more gene products in the cell, thereby guiding the Cas enzyme or the fusion molecule to bind to and/or cut the locus of the nucleic acid molecule encoding one or more gene products; and the Cas enzyme or the fusion molecule
- expression of the one or more gene products is altered.
- expression of the gene product is decreased or increased.
- the gene product is a protein.
- the cell is a eukaryotic cell.
- the eukaryotic cell is a mammalian cell.
- the mammalian cell includes, but is not limited to, cells of mice, monkeys, humans, farm animals, sports animals, and pets.
- the mammalian cell is a human cell.
- the Cas enzyme is codon-optimized for expression in eukaryotic cells.
- the guide RNA comprises a guide sequence fused to a tracr sequence.
- the guide RNA comprises a direct repeat sequence and a spacer sequence, wherein the spacer sequence binds to the nucleic acid molecule targeted by the guide RNA.
- the direct repeat sequence is 10 to 70 nucleotides in length.
- the direct repeat sequence is 31 to 36 nucleotides in length.
- the direct repeat sequence comprises a nucleotide sequence shown in any one of SEQ ID NOs: 63-88 and 90-99, or comprises a nucleotide sequence having at least 95% sequence identity with a nucleotide sequence shown in any one of SEQ ID NOs: 63-88 and 90-99.
- the spacer sequence is 16 to 24 nucleotides in length.
- the nucleic acid molecule targeted by the guide RNA comprises a nucleotide sequence that is capable of complementary pairing with the spacer sequence.
- the vector or the Cas enzyme of the system further comprises one or more nuclear localization sequences (NLS).
- NLS nuclear localization sequences
- the system is introduced into the cell via a delivery system selected from the group consisting of viral particles, liposomes, lipid nanoparticles, electroporation, microinjection, and conjugation.
- the present application provides a method for changing the expression of one or more gene products, the method comprising introducing an engineered, non-naturally occurring CRISPR-Cas system into a cell containing and expressing a nucleic acid molecule encoding the one or more gene products, the system comprising the Cas enzyme described in the present application or the fusion molecule described in the present application, and one or more guide RNAs, the one or more guide RNAs targeting the locus of the nucleic acid molecule encoding the one or more gene products, thereby guiding the Cas enzyme or the fusion molecule to bind to and/or cut the locus, thereby changing the expression of the one or more gene products; and the Cas enzyme or the fusion molecule does not naturally exist together with the guide RNA.
- the present application provides a method for changing the expression of one or more gene products, the method comprising introducing an engineered, non-naturally occurring vector system into a cell containing and expressing a nucleic acid molecule encoding the one or more gene products, the vector system comprising one or more vectors, the one or more vectors comprising: a) a first regulatory element, the first regulatory element being operably linked to one or more guide RNAs, the one or more guide RNAs being capable of hybridizing with a target sequence in the locus of the nucleic acid molecule encoding the one or more gene products, and b) a second regulatory element, the second regulatory element being operably linked to the Cas enzyme described in the present application or the fusion molecule described in the present application, wherein the components a) and b) are located on the same or different vectors of the vector system, and the guide RNA targets the locus of the nucleic acid molecule encoding the one or more gene products in the cell, thereby guiding the Cas
- expression of the gene product is decreased or increased.
- the gene product is a protein.
- the cell is a eukaryotic cell.
- the eukaryotic cell is a mammalian cell.
- the mammalian cell includes but is not limited to Limited to cells from mice, monkeys, humans, farm animals, sports animals, and pets.
- the mammalian cell is a human cell.
- the Cas enzyme is codon-optimized for expression in eukaryotic cells.
- the guide RNA comprises a guide sequence fused to a tracr sequence.
- the guide RNA comprises a direct repeat sequence and a spacer sequence, wherein the spacer sequence binds to the nucleic acid molecule targeted by the guide RNA.
- the direct repeat sequence is 10 to 70 nucleotides in length.
- the direct repeat sequence is 31 to 36 nucleotides in length.
- the direct repeat sequence comprises a nucleotide sequence shown in any one of SEQ ID NOs: 63-88 and 90-99, or comprises a nucleotide sequence having at least 95% sequence identity with a nucleotide sequence shown in any one of SEQ ID NOs: 63-88 and 90-99.
- the spacer sequence is 16 to 24 nucleotides in length.
- the nucleic acid molecule targeted by the guide RNA comprises a nucleotide sequence that is capable of complementary pairing with the spacer sequence.
- the vector or the Cas enzyme of the system further comprises one or more nuclear localization sequences (NLS).
- NLS nuclear localization sequences
- the method comprises introducing the CRISPR-Cas system or the vector system into the cell via a delivery system selected from the group consisting of viral particles, liposomes, lipid nanoparticles, electroporation, microinjection, and conjugation.
- the present application provides a nucleic acid encoding the Cas enzyme described in the present application, the fusion molecule described in the present application, or the CRISPR-Cas system described in the present application.
- the present application provides a cell, comprising the Cas enzyme described in the present application, the fusion molecule described in the present application, the CRISPR-Cas system described in the present application, the vector system described in the present application and/or the nucleic acid described in the present application.
- the present application provides a kit comprising the Cas enzyme described in the present application, the fusion molecule described in the present application, the CRISPR-Cas system described in the present application, the vector system described in the present application, the nucleic acid described in the present application and/or the cell described in the present application.
- the kit further comprises a container for placing the Cas enzyme, the fusion molecule, the CRISPR-Cas system, the vector system, the nucleic acid and/or the cell, and instructions for use.
- Figure 1A-1B shows the schematic diagram of the assay process and fluorescence results of the cleavage activity of the Cas enzyme described in the present application in eukaryotic cells, as well as the PAM motif required for the gRNA binding of the Cas enzyme.
- Figures 2A-2B show a schematic diagram of the determination process of the cleavage activity of the Cas enzyme described in the present application in eukaryotic cells and the comparison results of green fluorescent cells of different Cas enzymes.
- Figures 3A-3B show the schematic diagram of the determination process of the in vitro cleavage activity of the Cas enzyme described in the present application and its PAM sequence and the sequencing results.
- Figure 4 shows the site information and amplification detection results of detecting the Cas enzyme activity described in the present application at the endogenous site.
- Figures 5A-5C show the GFP fluorescence results of different variants of the Cas enzyme described in the present application after cleavage activity assay in eukaryotic cells.
- Figures 6A-6B show the results of the proportion of green fluorescent cells after the cutting activity of sgRNA treated with the Cas enzyme described in this application combined with different DR (Direct repeat) region optimization schemes in eukaryotic cells was measured.
- FIG. 7 shows the results of the green fluorescent cell ratio after the cleavage activity of the Cas enzyme described in the present application combined with sgRNA of different Spacer lengths in eukaryotic cells was measured.
- Figures 8A-8B show the results of green fluorescent cell ratios after different variants of the Cas enzyme described in the present application were tested for cleavage activity in eukaryotic cells.
- Figure 8C shows the results of cleavage activity tests (green fluorescent cell ratios) of different variants of the Cas enzyme described in the present application in different PAM reporter systems.
- Figures 9A-9C show a schematic diagram of the structure of a cytosine base editor comprising the Cas enzyme described in the present application, as well as the results of detecting the base editing efficiency at two endogenous sites, EMX1 and VEGFA.
- FIG. 10 shows the activation effect of the gene activation epigenetic tool comprising the Cas enzyme described in the present application on detecting the target site gene expression at the CXCR4 endogenous site.
- identity can be used interchangeably with “homology”, and it generally refers to the relationship between two or more polypeptide molecules or two or more nucleic acid molecule sequences, and the relationship is determined by comparing their sequences.
- identity also refers to the degree of nucleic acid molecules or polypeptide sequence correlation, which can be determined by the matching between two or more nucleotides or two or more amino acid sequences.
- the identity percentage (%) of an amino acid sequence is defined as the percentage of the total number of residues in the candidate sequence that the amino acid residues identical with the amino acid residues in the reference polypeptide sequence are accounted for after the alignment sequence and, if necessary, introducing a gap to reach the maximum percentage sequence identity, and any conservative substitution is not considered as a part of the sequence identity.
- the comparison for the purpose of measuring the percentage amino acid sequence identity can be achieved in a variety of ways within the art technology, for example, using publicly available computer software, such as BLAST, BLAST-2, ALIGN or Megalign (DNASTAR) software. Those skilled in the art can determine the appropriate parameters for the alignment sequence, including any algorithm required for reaching the maximum alignment in the full length of the sequence compared.
- the calculation of the percent identity (%) of a polypeptide molecule or nucleic acid molecule sequence can also determine the total number of residues based on the sequence mutation type.
- the mutation type includes insertion (extension) at either or both ends of the sequence, deletion (truncation) at either or both ends of the sequence, substitution/replacement of one or more amino acids/nucleotides, insertion within the sequence, and deletion within the sequence.
- the mutation type is one or more of the following: substitution/replacement of one or more amino acids/nucleotides, insertion within the sequence, and deletion within the sequence
- the total number of residues is calculated as the larger of the molecules being compared.
- the mutation type also includes insertion (extension) at either or both ends of the sequence or deletion (truncation) at either or both ends of the sequence or insertion within the sequence and deletion within the sequence
- the number of amino acids inserted or deleted at either or both ends or within is less than 20
- the sequences being compared can be aligned in a manner that produces the maximum match between the sequences, and the gaps (if any) in the alignment are resolved by a specific algorithm.
- catalytically active domain refers to an identifiable or determinable conserved structural entity in a Cas protein (enzyme) that exhibits significant secondary structure content, and the conserved structure is a region in which the Cas protein (enzyme) implements functions such as binding and/or cutting polynucleotides.
- An exemplary catalytically active domain can be a protease of the Cas9 family, which has two catalytically active domains, one of which is HNH-like, and its function is to cut off a single-stranded polynucleotide (target chain) paired with a guide RNA, and the other domain is RuvC-like, and its function is to cut off the complementary chain of the target chain.
- binding e.g., with respect to a target DNA binding (catalytically active) domain of a polypeptide or protease
- macromolecules e.g., between a protein and a nucleic acid
- association e.g., when molecule X is referred to as interacting with molecule Y, it means that molecule X binds to molecule Y in a non-covalent manner.
- binding interaction components need to be sequence-specific (e.g., contacts with phosphate residues in a DNA backbone), but some portions of the binding interaction may be sequence-specific.
- fusion molecule generally refers to a molecule consisting of at least two parts (bipartite molecule), which contains the enzyme (protein or peptide) of the present application, which is coupled to at least one other part to form a single entity.
- the enzyme and the at least one other part can be separated by a linker, or can be directly coupled.
- the at least one other part can be fused to the enzyme of the present application at any amino acid other than the N-terminus, C-terminus or terminal amino acid.
- the other part can be fused to the part already contained in the fusion molecule.
- Those skilled in the art are fully aware of the optimal order and/or combination of determinations for determining the parts in the fusion molecule of the present application.
- the term does not include such a fusion molecule in which the fusion produces a naturally occurring peptide.
- heterologous generally refers to a nucleotide or polypeptide sequence that is not present in a natural nucleic acid or protein, respectively.
- heterologous functional domain may refer to about or more than about 1, 2, 3, 4, 5, 6, 7, 8, 9, 10 or more domains or a part of a fusion molecule described in the present application in addition to the Cas enzyme described in the present application.
- heterologous functional domains that may be included in the fusion molecule described in the present application or that may be fused to the Cas enzyme described in the present application include, but are not limited to, epitope tags, reporter gene sequences, and one or more protein domains having the following activities: methylase activity, demethylase activity, transcriptional activation activity, transcriptional repression activity, transcriptional release factor activity, histone modification activity, RNA cleavage activity, and nucleic acid binding activity.
- epitope tags include histidine (His) tags, V5 tags, FLAG tags, influenza virus hemagglutinin (HA) tags, Myc tags, VSV-G tags, and thioredoxin (Trx) tags.
- reporter genes include, but are not limited to, glutathione-S-transferase (GST), horseradish peroxidase (HRP), chloramphenicol acetyltransferase (CAT), ⁇ -galactosidase, ⁇ -glucuronidase, luciferase, green fluorescent protein (GFP), HcRed, DsRed, cyan fluorescent protein (CFP), yellow fluorescent protein (YFP), to include the autofluorescent protein of blue fluorescent protein (BFP).
- GST glutathione-S-transferase
- HRP horseradish peroxidase
- CAT chloramphenicol acetyltransferase
- CAT chloramphenicol acetyltransferase
- CAT chloramphenicol acetyltransferase
- CAT chloramphenicol acetyltransferase
- CAT chloramphenicol acetyltransferase
- Cas enzymes can be fused to a gene sequence encoding a protein or protein fragment, which binds to a DNA molecule or binds to other cell molecules, including, but not limited to, maltose binding protein (MBP), S-tag, Lex A DNA binding domain (DBD) fusions, GAL4 DNA binding domain fusions, and herpes simplex virus (HSV) BP16 protein fusions.
- MBP maltose binding protein
- DBD Lex A DNA binding domain
- GAL4 DNA binding domain fusions GAL4 DNA binding domain fusions
- HSV herpes simplex virus
- expression generally refers to the process by which a polynucleotide is transcribed from a DNA template (e.g., into mRNA or other RNA transcripts) and/or the process by which the transcribed mRNA is subsequently translated into a peptide, polypeptide or protein.
- Transcripts and encoded polypeptides may be collectively referred to as "gene products.” If the polynucleotide is derived from genomic DNA, expression may include splicing of mRNA in eukaryotic cells.
- polynucleotide generally refer to a polymeric form of nucleotides of any length, which are deoxyribonucleotides or ribonucleotides, or analogs thereof.
- Polynucleotides can have any three-dimensional structure and can perform any function known or unknown.
- polynucleotides coding or non-coding regions of genes or gene fragments, multiple loci (one locus) defined according to connection analysis, exons, introns, messenger RNA (mRNA), transfer RNA, ribosomal RNA, short hairpin RNA (shRNA), micro-RNA (miRNA), ribozymes, cDNA, recombinant polynucleotides, branched polynucleotides, plasmids, vectors, isolated DNA of any sequence, isolated RNA of any sequence, nucleic acid probes, and primers.
- mRNA messenger RNA
- transfer RNA transfer RNA
- ribosomal RNA short hairpin RNA
- miRNA micro-RNA
- ribozymes ribozymes
- cDNA recombinant polynucleotides
- branched polynucleotides branched polynucleotides
- plasmids vectors, isolated DNA of any sequence, isolated RNA
- Polynucleotides can contain one or more modified nucleotides, such as methylated nucleotides and nucleotide analogs. If present, modification of the nucleotide structure can be performed before or after polymer assembly. The sequence of nucleotides can be interrupted by non-nucleotide components. Polynucleotides can be further modified after polymerization, such as by conjugation with labeled components.
- nucleic acid molecules, polypeptides, or combinations and systems thereof are used interchangeably.
- nucleic acid molecules, polypeptides, or combinations and systems thereof they generally mean that the nucleic acid molecules or polypeptides are at least substantially free from at least one other component with which they are associated in nature or as found in nature.
- vector generally refers to a nucleic acid molecule that can transport another nucleic acid molecule connected thereto.
- Vectors include, but are not limited to, single-stranded, double-stranded, or partially double-stranded nucleic acid molecules; nucleic acid molecules including one or more free ends, free ends (e.g., circular); nucleic acid molecules including DNA, RNA, or both; and other various polynucleotides known in the art.
- plasmid refers to a circular double-stranded DNA loop into which additional DNA fragments can be inserted, for example, by standard molecular cloning techniques.
- viral vector in which a virally derived DNA or RNA sequence is present in a vector for packaging a virus (e.g., a retrovirus, a replication-defective retrovirus, an adenovirus, a replication-defective adenovirus, and an adeno-associated virus).
- virus e.g., a retrovirus, a replication-defective retrovirus, an adenovirus, a replication-defective adenovirus, and an adeno-associated virus.
- Viral vectors also include polynucleotides carried by a virus for transfection into a host cell.
- Certain vectors e.g., bacterial vectors and episomal mammalian vectors with a bacterial origin of replication
- vectors are integrated into the genome of the host cell after being introduced into the host cell, and are thereby replicated together with the host genome. Moreover, some vectors are capable of directing the expression of genes to which they are operably linked. Such vectors are referred to herein as "expression vectors". Common expression vectors used in recombinant DNA technology are usually in the form of plasmids. Recombinant expression vectors may contain the nucleic acid of the present application in a form suitable for nucleic acid expression in a host cell, which means that these recombinant expression vectors contain the gene.
- One or more regulatory elements selected for the host cell to be used for expression which are operably linked to the nucleic acid sequence to be expressed.
- "operably linked” is intended to mean that the nucleotide sequence of interest is linked to the one or more regulatory elements in a manner that allows expression of the nucleotide sequence (e.g., in an in vitro transcription/translation system or in a host cell when the vector is introduced into the host cell).
- regulatory element is generally intended to include promoters, enhancers, internal ribosome entry sites (IRES), and other expression control elements (e.g., transcription termination signals, such as polyadenylation signals and poly-U sequences).
- promoters e.g., promoters, enhancers, internal ribosome entry sites (IRES), and other expression control elements (e.g., transcription termination signals, such as polyadenylation signals and poly-U sequences).
- IRES internal ribosome entry sites
- regulatory elements may include those sequences that direct constitutive expression of a nucleotide sequence in many types of host cells and those sequences that direct the expression of the nucleotide sequence only in certain host cells (e.g., tissue-specific regulatory sequences).
- Tissue-specific promoters may primarily direct expression in a desired tissue of interest, such as muscle, neurons, bone, skin, blood, a specific organ (e.g., liver, pancreas), or a particular cell type (e.g., lymphocytes). Regulatory elements can also direct expression in a timing-dependent manner (e.g., in a cell cycle-dependent or developmental stage-dependent manner), which may or may not be tissue- or cell-type-specific.
- tissue of interest such as muscle, neurons, bone, skin, blood, a specific organ (e.g., liver, pancreas), or a particular cell type (e.g., lymphocytes).
- Regulatory elements can also direct expression in a timing-dependent manner (e.g., in a cell cycle-dependent or developmental stage-dependent manner), which may or may not be tissue- or cell-type-specific.
- a vector may comprise one or more pol III promoters (e.g., 1, 2, 3, 4, 5, or more pol III promoters), one or more pol II promoters (e.g., 1, 2, 3, 4, 5, or more pol II promoters), one or more pol I promoters (e.g., 1, 2, 3, 4, 5, or more pol I promoters), or a combination thereof.
- pol III promoters include, but are not limited to, U6 and H1 promoters.
- pol II promoters include, but are not limited to, the retrovirus Rous sarcoma virus (RSV) LTR promoter (optionally with an RSV enhancer), the cytomegalovirus (CMV) promoter (optionally with a CMV enhancer), the SV40 promoter, the dihydrofolate reductase promoter, the ⁇ -actin promoter, the phosphoglycerol kinase (PGK) promoter, and the EF1 ⁇ promoter.
- RSV Rous sarcoma virus
- CMV cytomegalovirus
- SV40 promoter the SV40 promoter
- dihydrofolate reductase promoter the ⁇ -actin promoter
- PGK phosphoglycerol kinase
- regulatory element may also encompass enhancer elements such as WPRE, CMV enhancer, R-U5' fragment in the LTR of HTLV-I, SV40 enhancer; and the intron sequence between exons 2 and 3 of rabbit ⁇ -globin (Proc. Natl. Acad. Sci. USA., Vol. 78(3), pp. 1527-31, 1981).
- a vector may be introduced into a host cell to thereby produce transcripts, proteins, or peptides, including fusion molecules or enzymes encoded by nucleic acids as described herein (e.g., clustered regularly interspaced short palindromic repeats (CRISPR) transcripts, proteins, enzymes, mutant forms thereof, fusion molecules or fusion proteins thereof, etc.).
- Advantageous vectors include lentiviruses and adeno-associated viruses, and vectors of this type may also be selected to target specific types of cells.
- the term "codon optimization” generally refers to replacing at least one codon of a native sequence with a codon that is more frequently or most frequently used in the genes of the host cell, for example, about or more than about 1, 2, 3, 4, 5, 10, 15,20,25,50 or more codons are maintained simultaneously and a nucleic acid sequence is modified to enhance the method for expression in the host cell of interest.Different species show specific preferences for certain codons with specific amino acids.Codon preference (differences in the use of codons between organisms) is often related to the translation efficiency of messenger RNA (mRNA), and the translation efficiency is considered to depend on the properties of the codons translated (among others) and the availability of specific transfer RNA (tRNA) molecules. The advantage of the tRNA selected in the cell generally reflects the codons most frequently used for peptide synthesis.Therefore, genes can be customized to the best gene expression based on codon optimization in a given organism.Codon utilization tables can be easily obtained, for example, in codon usage
- codon optimization of specific sequences for expression in specific host cells are also available, such as Gene Forge (Aptagen, Jacobus, PA), which is also available.
- one or more codons e.g., 1, 2, 3, 4, 5, 10, 15, 20, 25, 50, or more, or all codons
- one or more codons in the sequence encoding the Cas enzyme correspond to the most frequently used codons for a particular amino acid.
- guide RNA is used interchangeably with “guide RNA” and "gRNA”, which generally refers to a group of nucleic acid molecules that facilitate the specific guidance of RNA-guided nucleases or other effector molecules (usually complexed with gRNA molecules) to target sequences.
- crRNA and tracrRNA usually exist as two independent RNA molecules, constituting gRNA.
- tracrRNA generally refers to a scaffold-type RNA that can bind to Cas nucleases
- crRNA also known as CRISPR RNA, generally refers to a nucleotide sequence that is complementary to the targeted target DNA.
- crRNA and tracrRNA can also be fused into a single strand, in which case gRNA can also be called single-stranded guide RNA (sgRNA), which has become the most common form of gRNA used by those skilled in the art in CRISPR technology, so the terms "sgRNA” and "gRNA” can have the same meaning in this article.
- sgRNA can be artificially synthesized or prepared from a DNA template in vitro or in vivo. sgRNA can bind to Cas nucleases or target target DNA, and it can guide Cas nucleases to cut DNA sites complementary to gRNA.
- crRNA generally comprises a spacer sequence that mediates target recognition and a direct repeat sequence (also referred to herein as "Direct repeat” or “DR sequence”) that forms a complex with the CRISPR-Cas effector protein.
- a direct repeat sequence also referred to herein as "Direct repeat” or "DR sequence”
- the spacer sequence is any polynucleotide sequence that has sufficient complementarity with the target sequence to hybridize with the target sequence and guide the CRISPR-Cas system complex to specifically bind to the target sequence.
- the degree of complementarity between the spacer sequence and its corresponding target sequence is at least 50%, at least 60%, at least 70%, at least 80%, at least 90%, at least 95%, at least 96%, at least 97%, at least 98% or at least 99%. Determining the optimal alignment is within the capabilities of a person of ordinary skill in the art. For example, there are publicly available and commercially available alignment algorithms and programs. programs, such as but not limited to ClustalW, the Smith-Waterman algorithm in Matlab, Bowtie, Geneious, Biopython, and SeqMan.
- the spacer sequence is at least 5, at least 10, at least 15, at least 16, at least 17, at least 18, at least 19, at least 20, at least 21, at least 22, at least 23, at least 24, at least 25, at least 26, at least 27, at least 28, at least 29, at least 30, at least 35, at least 40, at least 45, or at least 50 nucleotides in length. In certain embodiments, the spacer sequence is no more than 50, 45, 40, 35, 30, 25, 24, 23, 22, 21, 20, 15, 10 or less nucleotides in length. In certain embodiments, the spacer sequence is 10-30, 15-25, 15-22, 16-24, 19-25, or 19-22 nucleotides in length. In certain preferred embodiments, the spacer sequence is 20 nucleotides in length.
- the direct repeat sequence is at least 10, at least 15, at least 16, at least 17, at least 18, at least 19, at least 20, at least 21, at least 22, at least 23, at least 24, at least 25, at least 26, at least 27, at least 28, at least 29, at least 30, at least 31, at least 32, at least 33, at least 34, at least 35, at least 40, at least 45, at least 50, at least 55, at least 56, at least 57, at least 58, at least 59, at least 60, at least 61, at least 62, at least 63, at least 64, at least 65, or at least 70 nucleotides in length.
- the direct repeat sequence is no more than 70, 65, 64, 63, 62, 61, 60, 59, 58, 57, 56, 55, 50, 45, 40, 35, 34, 33, 32, 31, 30, 29, 28, 27, 26, 25, 24, 23, 22, 21, 20, 15, 10 or less nucleotides in length.
- the direct repeat sequence is 55-70 nucleotides, such as 55-65 nucleotides, such as 60-65 nucleotides, such as 62-65 nucleotides, such as 63-64 nucleotides in length.
- the direct repeat sequence is 15-40 nucleotides, such as 15-25 nucleotides, such as 20-30 nucleotides, such as 22-36 nucleotides, such as 31 nucleotides in length.
- the term “about” or “approximately” is generally within an acceptable error range for a particular value determined by one of ordinary skill in the art, which depends in part on how the value is measured or determined, i.e., the limitations of the measurement system. For example, “about” or “approximately” may mean within 1 or more than 1 standard deviation according to the practice in the art. Alternatively, “about” or “approximately” may mean a range of up to 10% or 20% (i.e., ⁇ 10% or ⁇ 20%).
- selected from generally refers to including selected objects and all combinations thereof.
- selected from (:) A, B and C is meant to include all combinations of A, B and C, for example, A, B, C, A+B, A+C, B+C or A+B+C.
- This example uses stably transfected 293T cells with a blue-green light reporter system to test the activity of the Cas enzyme provided in this application.
- the reporter system has a continuously expressed CMV promoter, a sequence encoding a blue fluorescent protein, a sequence encoding a green fluorescent protein, and a gRNA targeting sequence inserted in the middle; the gRNA targeting sequence has random N base sequences on both sides, which can be used to screen Cas enzymes with different protospacer adjacent motifs (PAM) preferences.
- PAM protospacer adjacent motifs
- the cleavage activity of the Cas enzyme of the present application will be determined by the fluorescence results displayed by the reporter system.
- the reporter system without cleavage will stably express blue fluorescent protein and emit blue fluorescence, while the green fluorescent protein cannot be expressed due to the presence of a stop codon before the sequence encoding the green fluorescent protein, and the reporter system will not emit green fluorescence. Only after DNA cleavage occurs near the gRNA targeting sequence, and the cell undergoes a frameshift mutation during the repair of the incision, can the green fluorescent protein be expressed while the blue fluorescent protein is stably expressed.
- the fluorescence results of Figure 1A show that after transfection with the Cas enzyme provided in the present application (SEQ ID NO: 2), the experimental group cells with cleavage activity will produce a cell population that clearly expresses green fluorescence.
- the green fluorescent cell population is enriched by flow cytometry, the target region is amplified after the genome is extracted, and the PAM motif required for the gRNA binding of the Cas enzyme provided in the present application can be determined by high-throughput sequencing (as shown in Figure 1B).
- nucleotide sequence encoding a reporter system is shown below (bold is blue and green fluorescent protein sequences, italics are linker sequences, underline is 2A cleavage peptide sequence, bold underline is an exemplary gRNA targeting sequence containing random N bases (SEQ ID NO: 59)):
- the Cas enzymes (SEQ ID NOs: 2, 47-51 and 54) provided in the present application the corresponding gRNA and the reporter plasmid with the gRNA targeting site were co-transfected into HEK293T cells.
- nucleotide sequence encoding a reporter system is shown below (bold is a blue and green fluorescent protein sequence, italics is a linker sequence, and underlined is a 2A cleavage peptide sequence; bold underlined is an exemplary gRNA targeting sequence (SEQ ID NO: 60), wherein the TTTG and TGG at both ends of the sequence will be set to different sequences according to the PAM preference of different Cas enzymes):
- the Cas enzyme provided in this application is expressed and purified by E. coli, and reacted with a plasmid library containing a PAM library together with an in vitro transcribed gRNA (the gRNA targeting sequence is SEQ ID NO: 59, and it has random N base sequences on both sides to determine the PAM preference), and the Cas enzyme with cutting activity will cut the plasmid library to form linearized DNA.
- the adapter fragment is connected to the library fragment after DNA linearization, and then the vector and the library fragment connected to the adapter are amplified by specific PCR primers, and high-throughput sequencing is performed.
- gRNA targeting sequence was SEQ ID NO: 89.
- the Cas enzyme (SEQ ID NOs: 44) provided in this application and the corresponding gRNA were co-transfected into HEK293T cells. Three days after transfection, the transfected positive cells were enriched by flow sorting, and the target region was amplified after the cells were lysed. The amplified PCR fragment was reacted with T7 endonuclease.
- the different sites of the Cas enzyme (SEQ ID NO:2) provided by the application are mutated into target amino acids (see the table below), and then the detection as described in Example 2 is performed.
- the DR (direct repeat) regional sequence and Spacer sequence of gRNA are respectively as shown in SEQ ID NO:64 and 60.
- the result shown in Fig. 5A shows that the activity of m1, m2, m4, m5, m7, m11, and m15 mutants is significantly improved compared to the wild type. After these effective mutations are superimposed, the activity detection of the same method is performed.
- FIG. 5B shows that the double mutants formed by the superposition of two mutations selected from the table below can further enhance the activity, and the activity of some double mutants is comparable to AsCas12a;
- Fig. 5C shows that the triple mutants and quadruple mutants formed by further superimposing effective single mutations and double mutations still maintain the original activity, and the activity is comparable to AsCas12a.
- the amino acid sequence of AsCas12a for comparison is as shown in SEQ ID NO:127.
- This example optimizes the editing system comprising the Cas enzyme (SEQ ID NO: 57) and gRNA provided in this application.
- the detection of Cas enzyme activity adopts the reporting system and detection method described in Example 2.
- the DR (direct repeat) region of the gRNA was changed to 36nt (SEQ ID NO: 87), 31nt (SEQ ID NO: 91), and 22nt (SEQ ID NO: 90) by molecular cloning.
- the results shown in Figure 6A show that a DR of 31nt length can maintain 100% Cas enzyme activity.
- the results shown in Figure 6B show that most of the changes have little effect on the activity, while the change of 31.8 can most preferably improve the activity.
- This embodiment also changes the length of the spacer region (Spacer) used by gRNA for targeting DNA by molecular cloning to improve its binding efficiency, thereby enhancing the activity of the Cas enzyme.
- Spacer spacer region
- the Cas enzyme provided in this application can maintain activity.
- TTTG was mutated into TTTH, TTVG, TVTG and VTTG (H represents A/C/T, V represents A/C/G, and the three plasmids were mixed in equal proportions and transfected) to construct a new PAM reporter system.
- H represents A/C/T
- V represents A/C/G
- the three plasmids were mixed in equal proportions and transfected
- the Cas enzyme provided in the present application (SEQ ID NO: 57) was inactivated and then fused with rAPOBEC1 and UGI ( FIG. 9A ) to obtain a cytosine base editor based on the Cas enzyme provided in the present application, and co-transfected with the corresponding gRNA (the DR sequence is shown in SEQ ID NO: 91, and the Spacer sequences targeting the EMX1 site and the VEGFA site are shown in SEQ ID NO: 108 and 109, respectively) into HEK293T cells. After 48 hours of transfection, the transfected cells were sorted, the genome was extracted, the target site was amplified, and the base editing efficiency was determined by Sanger sequencing. The results are shown in FIG. 9B-9C .
- amino acid sequence of the dEpiCas059m46-CBE editor is as follows (SEQ ID NO: 116, italics are NLS, italic bold is rAPOBEC1, bold is dEpiCas059, italics underline is UGI, underline is P2A cleavage peptide, and ⁇ > is mCherry marker; its plasmid sequence is SEQ ID NO: 117):
- amino acid sequence of the dAsCas12a-CBE editor is as follows (SEQ ID NO: 118, italic NLS, italic bold rAPOBEC1, bold dAsCas12a, italic underline UGI, underline P2A cleavage peptide, and ⁇ > is mCherry marker; its plasmid sequence is SEQ ID NO: 119):
- the Cas enzyme (SEQ ID NO: 55) provided in this application was inactivated and then fused with 10 ⁇ GCN4 to recruit the fusion peptide of scFV-P65-HSF1, thereby obtaining a gene activation tool based on the Cas enzyme provided in this application.
- the principle of this tool is based on the fact that GCN4 can spontaneously recognize and bind to scFV, thereby enriching the P65 and HSF1 effectors with transcriptional activation function near the target site of the Cas enzyme, and then activating the gene expression of the target site.
- the transcriptional activation tool and the gRNA targeting CXCR4 (Spacer sequence is shown in SEQ ID NOs: 120-123, and the four gRNAs were mixed in equal proportions) were co-transfected into HEK293T cells. After 48 hours of transfection, the cells were collected and stained with PE anti-human CXCR4 antibody (BioLegendg, 306506). The fluorescence intensity of the PE channel was detected by flow cytometry to reflect the expression intensity of CXCR4.
- the average fluorescence intensity of PE in the transfection-positive group was divided by the average fluorescence intensity of PE in the transfection-negative group to obtain the activation efficiency (MFI fold change), which was used to represent the activation intensity of different activation tools (Figure 10), and then represent the DNA binding effect of different tools.
- amino acid sequence of the scFV-P65-HSF1 fusion peptide is as follows (SEQ ID NO: 124, italics are NLS, italic bold are P65 and HSF1, bold is scFV, italics underline is HA tag, underline is connecting peptide, and ⁇ > is Flag marker):
- the amino acid sequence of the dEpiCas057-10 ⁇ GCN4 fusion peptide is shown below (SEQ ID NO: 125, NLS in italics, dEpiCas057 in bold, and GCN4 in ⁇ >):
- the fusion peptide scFV-P65-HSF1 and dEpiCas057-10 ⁇ GCN4 can be expressed together by the plasmid sequence shown in SEQ ID NO:126.
- the Cas enzyme provided in the present application that loses its cleavage activity through mutation is suitable for application scenarios of base editing and epigenetic modification editing, and is not limited to other application scenarios based on DNA targeting, such as gene activation, gene silencing, chromosome imaging, etc.
Landscapes
- Health & Medical Sciences (AREA)
- Life Sciences & Earth Sciences (AREA)
- Engineering & Computer Science (AREA)
- Genetics & Genomics (AREA)
- Chemical & Material Sciences (AREA)
- Molecular Biology (AREA)
- Bioinformatics & Cheminformatics (AREA)
- Organic Chemistry (AREA)
- Biomedical Technology (AREA)
- Zoology (AREA)
- Wood Science & Technology (AREA)
- General Engineering & Computer Science (AREA)
- Biotechnology (AREA)
- Microbiology (AREA)
- Biochemistry (AREA)
- General Health & Medical Sciences (AREA)
- Plant Pathology (AREA)
- Physics & Mathematics (AREA)
- Biophysics (AREA)
- Medicinal Chemistry (AREA)
- Micro-Organisms Or Cultivation Processes Thereof (AREA)
Abstract
Description
Claims (47)
- 一种分离的Cas酶,所述Cas酶包含SEQ ID NOs:1-58中任一项所示的氨基酸序列或与所述SEQ ID NOs:1-58中任一项所示的氨基酸序列具有至少约80%同一性的序列。
- 根据权利要求1所述的Cas酶,所述Cas酶具有能够结合靶DNA链的催化活性结构域和/或切割所述靶DNA链的催化活性结构域。
- 根据权利要求1-2中任一项所述的Cas酶,所述催化活性结构域包含一个或多个氨基酸的改变,从而使得所述Cas酶仅具有结合靶DNA链的活性,或者具有结合靶DNA链的活性和切割所述靶DNA单链的活性。
- 一种融合分子,所述融合分子包含权利要求1-3中任一项所述的Cas酶和一个或多个异源功能结构域。
- 根据权利要求4所述的融合分子,所述一个或多个异源功能结构域能够调控一种或多种基因产物的表达。
- 根据权利要求4-5中任一项所述的融合分子,所述一个或多个异源功能结构域直接或间接地融合在所述Cas酶上。
- 根据权利要求4-6中任一项所述的融合分子,所述一个或多个异源功能结构域选自自解旋酶、核酸酶、解旋酶-核酸酶、DNA甲基转移酶、DNA羟甲基化酶、组蛋白甲基化酶、组蛋白去甲基化酶、组蛋白乙酰转移酶、组蛋白去乙酰化酶、磷酸酶、激酶、转录(共)活化物、转录阻遏物、DNA结合蛋白、DNA结构蛋白、标志物蛋白、报告物蛋白、荧光蛋白、配体结合蛋白、信号肽、亚细胞定位序列、抗体表位和亲和纯化标签。
- 根据权利要求4-7中任一项所述的融合分子,所述一个或多个异源功能结构域具有以下活性中的一种或多种:甲基酶活性、脱甲基酶活性、脱氨酶活性、转录激活活性、转录阻抑活性、转录释放因子活性、逆转录酶活性、组蛋白修饰活性、RNA切割活性和核酸结合活性。
- 一种工程化的、可编程的、非天然存在的CRISPR-Cas系统,所述系统包含权利要求1-3中任一项所述的Cas酶或权利要求4-8中任一项所述的融合分子,和一种或多种指导RNA,所述一种或多种指导RNA在细胞中靶向编码一种或多种基因产物的核酸分子的基因座,从而指导所述Cas酶或所述融合分子结合和/或切割所述编码一种或多种基因产物的核酸分子的基因座;并且,所述Cas酶或所述融合分子与所述指导RNA不共同天然存在。
- 一种工程化的、非天然存在的载体系统,所述载体系统包含一种或多种载体,所述一种或多种载体包括:a)第一调节元件,所述第一调节元件可操作地连接到一种或多种指导RNA上,所述一种或多种指导RNA能够与编码一种或多种基因产物的核酸分子的基因座中的靶序列杂交, 和b)第二调节元件,所述第二调节元件可操作地连接到权利要求1-3中任一项所述的Cas酶或权利要求4-8中任一项所述的融合分子上,其中,所述组分a)和所述组分b)位于所述载体系统的相同或不同载体上,且所述指导RNA在细胞中靶向所述编码一种或多种基因产物的核酸分子的基因座,从而指导所述Cas酶或所述融合分子结合和/或切割所述编码一种或多种基因产物的核酸分子的基因座;并且,所述Cas酶或所述融合分子与所述指导RNA不共同天然存在。
- 根据权利要求9-10中任一项所述的系统,所述一种或多种基因产物的表达被改变。
- 根据权利要求9-11中任一项所述的系统,所述基因产物的表达被降低或者被增多。
- 根据权利要求9-12中任一项所述的系统,所述基因产物是一种蛋白质。
- 根据权利要求9-13中任一项所述的系统,所述细胞是真核细胞。
- 根据权利要求9-14中任一项所述的系统,所述真核细胞是哺乳动物细胞。
- 根据权利要求9-15中任一项所述的系统,所述哺乳动物细胞是人类细胞。
- 根据权利要求9-16中任一项所述的系统,所述Cas酶是经密码子优化的,用以在真核细胞中进行表达。
- 根据权利要求9-17中任一项所述的系统,所述指导RNA包含融合到tracr序列上的指导序列。
- 根据权利要求9-18中任一项所述的系统,所述指导RNA包含直接重复(Direct repeat)序列和间隔(Spacer)序列,其中所述间隔序列与所述指导RNA靶向的核酸分子结合。
- 根据权利要求19所述的系统,所述直接重复序列的长度为10个至70个核苷酸。
- 根据权利要求19或20所述的系统,所述直接重复序列的长度为31个至36个核苷酸。
- 根据权利要求19-21中任一项所述的系统,所述直接重复序列包含SEQ ID NO:63-88和90-99中任一项所示的核苷酸序列,或者包含与SEQ ID NO:63-88和90-99中任一项所示的核苷酸序列具有至少95%序列同一性的核苷酸序列。
- 根据权利要求19所述的系统,所述间隔序列的长度为16个至24个核苷酸。
- 根据权利要求19或23所述的系统,所述指导RNA靶向的核酸分子包含能够与所述间隔序列互补配对的核苷酸序列。
- 根据权利要求9-24中任一项所述的系统,所述系统的所述载体或所述Cas酶还包含一个或多个核定位序列(NLS)。
- 根据权利要求9-25中任一项所述的系统,所述系统通过递送系统被引入所述细胞中,所述递送系统选自病毒粒子、脂质体、脂质纳米颗粒、电穿孔、显微注射和缀合。
- 改变一种或多种基因产物的表达的方法,所述方法包括向包含和表达编码所述一种或多种基因产物的核酸分子的细胞中引入一种工程化的、非天然存在的CRISPR-Cas系统,所述系统包含权利要求1-3中任一项所述的Cas酶或权利要求4-8中任一项所述的融合分子,和一种或多种指导RNA,所述一种或多种指导RNA靶向所述编码一种或多种基因产物的核酸分子的基因座,从而指导所述Cas酶或所述融合分子结合和/或切割所述基因座,由此改变所述一种或多种基因产物的表达;并且,所述Cas酶或所述融合分子与所述指导RNA不共同天然存在。
- 改变一种或多种基因产物的表达的方法,所述方法包括向包含和表达编码所述一种或多种基因产物的核酸分子的细胞中引入一种工程化的、非天然存在的载体系统,所述载体系统包含一种或多种载体,所述一种或多种载体包括:a)第一调节元件,所述第一调节元件可操作地连接到一种或多种指导RNA上,所述一种或多种指导一种或多种指导RNA能够与所述编码一种或多种基因产物的核酸分子的基因座中的靶序列杂交,和b)第二调节元件,所述第二调节元件可操作地连接到权利要求1-3中任一项所述的Cas酶或权利要求4-8中任一项所述的融合分子上,其中,所述组分a)和所述组分b)位于所述载体系统的相同或不同载体上,且所述指导RNA在所述细胞中靶向所述编码一种或多种基因产物的核酸分子的基因座,从而指导所述Cas酶或所述融合分子结合和/或切割所述基因座,由此改变所述一种或多种基因产物的表达;并且,所述Cas酶或所述融合分子与所述指导RNA不共同天然存在。
- 根据权利要求27或28所述的方法,所述基因产物的表达被降低或者被增多。
- 根据权利要求27-29中任一项所述的方法,所述基因产物是一种蛋白质。
- 根据权利要求27-30中任一项所述的方法,所述细胞是真核细胞。
- 根据权利要求27-31中任一项所述的方法,所述真核细胞是哺乳动物细胞。
- 根据权利要求27-32中任一项所述的方法,所述哺乳动物细胞是人类细胞。
- 根据权利要求27-33中任一项所述的方法,所述Cas酶是经密码子优化的,用以在真核细胞中进行表达。
- 根据权利要求27-34中任一项所述的方法,所述指导RNA包含融合到tracr序列上的指导序列。
- 根据权利要求27-35中任一项所述的方法,所述指导RNA包含直接重复(Direct repeat)序列和间隔(Spacer)序列,其中所述间隔序列与所述指导RNA靶向的核酸分子结合。
- 根据权利要求36所述的方法,所述直接重复序列的长度为10个至70个核苷酸。
- 根据权利要求36或37所述的方法,所述直接重复序列的长度为31个至36个核苷酸。
- 根据权利要求36-38中任一项所述的方法,所述直接重复序列包含SEQ ID NO:63-88和90-99中任一项所示的核苷酸序列,或者包含与SEQ ID NO:63-88和90-99中任一项所示的核苷酸序列具有至少95%序列同一性的核苷酸序列。
- 根据权利要求36所述的方法,所述间隔序列的长度为16个至24个核苷酸。
- 根据权利要求36或40所述的方法,所述指导RNA靶向的核酸分子包含能够与所述间隔序列互补配对的核苷酸序列。
- 根据权利要求27-41中任一项所述的方法,所述系统的所述载体或所述Cas酶还包含一个或多个核定位序列(NLS)。
- 根据权利要求27-42中任一项所述的方法,所述方法包括通过递送系统将所述CRISPR-Cas系统或所述载体系统引入到所述细胞中,所述递送系统选自病毒粒子、脂质体、脂质纳米颗粒、电穿孔、显微注射和缀合。
- 编码权利要求1-3中任一项所述的Cas酶、权利要求4-8中任一项所述的融合分子或权利要求9和11-26中任一项所述的CRISPR-Cas系统的核酸。
- 一种细胞,所述细胞包含权利要求1-3中任一项所述的Cas酶、权利要求4-8中任一项所述的融合分子、权利要求9和11-26中任一项所述的CRISPR-Cas系统、权利要求10-26中任一项所述的载体系统和/或权利要求44所述的核酸。
- 一种试剂盒,所述试剂盒包含权利要求1-3中任一项所述的Cas酶、权利要求4-8中任一项所述的融合分子、权利要求9和11-26中任一项所述的CRISPR-Cas系统、权利要求10-26中任一项所述的载体系统、权利要求44所述的核酸和/或权利要求33所述的细胞。
- 根据权利要求46所述的试剂盒,所述试剂盒还包含用于放置所述Cas酶、所述融合分子、所述CRISPR-Cas系统、所述载体系统、所述核酸和/或所述细胞的容器,以及用法说明书。
Priority Applications (2)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| EP24818760.1A EP4726035A1 (en) | 2023-06-09 | 2024-06-07 | Cas enzyme and system and use thereof |
| CN202480038064.2A CN121358850A (zh) | 2023-06-09 | 2024-06-07 | Cas酶及其系统和应用 |
Applications Claiming Priority (2)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| CN202310684902.0 | 2023-06-09 | ||
| CN202310684902 | 2023-06-09 |
Publications (2)
| Publication Number | Publication Date |
|---|---|
| WO2024251229A1 true WO2024251229A1 (zh) | 2024-12-12 |
| WO2024251229A9 WO2024251229A9 (zh) | 2025-02-20 |
Family
ID=93795045
Family Applications (1)
| Application Number | Title | Priority Date | Filing Date |
|---|---|---|---|
| PCT/CN2024/097935 Ceased WO2024251229A1 (zh) | 2023-06-09 | 2024-06-07 | Cas酶及其系统和应用 |
Country Status (4)
| Country | Link |
|---|---|
| EP (1) | EP4726035A1 (zh) |
| CN (1) | CN121358850A (zh) |
| TW (1) | TW202449148A (zh) |
| WO (1) | WO2024251229A1 (zh) |
Cited By (4)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| CN119265164A (zh) * | 2024-12-06 | 2025-01-07 | 南京师范大学 | 一种Cas蛋白及其在检测中的应用 |
| CN119286824A (zh) * | 2024-09-05 | 2025-01-10 | 武汉尚睿生物科技有限公司 | SrCas12a-2蛋白及其基因编辑系统和应用 |
| US20250179534A1 (en) * | 2023-09-04 | 2025-06-05 | China Agricultural University | Novel CRISPR-Cas sigma enzyme and system |
| CN119286824B (zh) * | 2024-09-05 | 2026-05-01 | 武汉尚睿生物科技有限公司 | SrCas12a-2蛋白及其基因编辑系统和应用 |
Citations (7)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| US20110059502A1 (en) | 2009-09-07 | 2011-03-10 | Chalasani Sreekanth H | Multiple domain proteins |
| CN112041444A (zh) * | 2018-03-14 | 2020-12-04 | 阿伯生物技术公司 | 新型crispr dna靶向酶及系统 |
| US20210222140A1 (en) * | 2018-08-09 | 2021-07-22 | G+Flas Life Sciences | Compositions and methods for genome engineering with cas12a proteins |
| US11174470B2 (en) * | 2019-01-04 | 2021-11-16 | Mammoth Biosciences, Inc. | Programmable nuclease improvements and compositions and methods for nucleic acid amplification and detection |
| CN113930413A (zh) * | 2020-06-29 | 2022-01-14 | 中国农业大学 | 新型CRISPR-Cas12j.23酶和系统 |
| CN114517190A (zh) * | 2021-02-05 | 2022-05-20 | 山东舜丰生物科技有限公司 | Crispr酶和系统以及应用 |
| CN116144631A (zh) * | 2023-01-17 | 2023-05-23 | 华中农业大学 | 耐热型核酸内切酶及其介导的基因编辑系统 |
-
2024
- 2024-06-07 WO PCT/CN2024/097935 patent/WO2024251229A1/zh not_active Ceased
- 2024-06-07 TW TW113121355A patent/TW202449148A/zh unknown
- 2024-06-07 CN CN202480038064.2A patent/CN121358850A/zh active Pending
- 2024-06-07 EP EP24818760.1A patent/EP4726035A1/en active Pending
Patent Citations (7)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| US20110059502A1 (en) | 2009-09-07 | 2011-03-10 | Chalasani Sreekanth H | Multiple domain proteins |
| CN112041444A (zh) * | 2018-03-14 | 2020-12-04 | 阿伯生物技术公司 | 新型crispr dna靶向酶及系统 |
| US20210222140A1 (en) * | 2018-08-09 | 2021-07-22 | G+Flas Life Sciences | Compositions and methods for genome engineering with cas12a proteins |
| US11174470B2 (en) * | 2019-01-04 | 2021-11-16 | Mammoth Biosciences, Inc. | Programmable nuclease improvements and compositions and methods for nucleic acid amplification and detection |
| CN113930413A (zh) * | 2020-06-29 | 2022-01-14 | 中国农业大学 | 新型CRISPR-Cas12j.23酶和系统 |
| CN114517190A (zh) * | 2021-02-05 | 2022-05-20 | 山东舜丰生物科技有限公司 | Crispr酶和系统以及应用 |
| CN116144631A (zh) * | 2023-01-17 | 2023-05-23 | 华中农业大学 | 耐热型核酸内切酶及其介导的基因编辑系统 |
Non-Patent Citations (4)
| Title |
|---|
| DATABASE PROTEIN 16 May 2021 (2021-05-16), ANONYMOUS: "MAG: type V CRISPR-associated protein Cas12a/Cpf1 [Clostridium sp.]", XP093246084, Database accession no. MBS6461344.1 * |
| NAKAMURA Y. ET AL.: "Codon usage tabulated from the international DNA sequence databases: status for the year 2000", NUCL. ACIDS RES., vol. 28, 2000, pages 292 |
| PROC. NATL. ACAD. SCI. USA., vol. 78, no. 3, 1981, pages 1527 - 31 |
| TÓTH ESZTER, VARGA ÉVA, KULCSÁR PÉTER ISTVÁN, KOCSIS-JUTKA VIRÁG, KRAUSZ SARAH LAURA, NYESTE ANTAL, WELKER ZSOMBOR, HUSZÁR KRISZTI: "Improved LbCas12a variants with altered PAM specificities further broaden the genome targeting range of Cas12a nucleases", NUCLEIC ACIDS RESEARCH, OXFORD UNIVERSITY PRESS, GB, vol. 48, no. 7, 17 April 2020 (2020-04-17), GB , pages 3722 - 3733, XP055894432, ISSN: 0305-1048, DOI: 10.1093/nar/gkaa110 * |
Cited By (5)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| US20250179534A1 (en) * | 2023-09-04 | 2025-06-05 | China Agricultural University | Novel CRISPR-Cas sigma enzyme and system |
| CN119286824A (zh) * | 2024-09-05 | 2025-01-10 | 武汉尚睿生物科技有限公司 | SrCas12a-2蛋白及其基因编辑系统和应用 |
| CN119286824B (zh) * | 2024-09-05 | 2026-05-01 | 武汉尚睿生物科技有限公司 | SrCas12a-2蛋白及其基因编辑系统和应用 |
| CN119265164A (zh) * | 2024-12-06 | 2025-01-07 | 南京师范大学 | 一种Cas蛋白及其在检测中的应用 |
| CN119265164B (zh) * | 2024-12-06 | 2025-03-11 | 南京师范大学 | 一种Cas蛋白及其在检测中的应用 |
Also Published As
| Publication number | Publication date |
|---|---|
| EP4726035A1 (en) | 2026-04-15 |
| CN121358850A (zh) | 2026-01-16 |
| TW202449148A (zh) | 2024-12-16 |
| WO2024251229A9 (zh) | 2025-02-20 |
Similar Documents
| Publication | Publication Date | Title |
|---|---|---|
| WO2022253185A1 (zh) | Cas12蛋白、含有Cas12蛋白的基因编辑系统及应用 | |
| WO2024251229A1 (zh) | Cas酶及其系统和应用 | |
| GB2556648A (en) | Methods | |
| CN108823202A (zh) | 用于特异性修复人hbb基因突变的碱基编辑系统、方法、试剂盒及其应用 | |
| WO2020069029A1 (en) | Novel crispr nucleases | |
| KR20200135225A (ko) | 단일염기 치환 단백질 및 이를 포함하는 조성물 | |
| CN112159801B (zh) | SlugCas9-HF蛋白、含有SlugCas9-HF蛋白的基因编辑系统及应用 | |
| CN116656649A (zh) | 一种is200/is60s转座子iscb突变蛋白及其应用 | |
| CN116949012B (zh) | 一种融合蛋白及其应用 | |
| US12297450B2 (en) | CRISPR-Cas13 system and use thereof | |
| WO2024240138A1 (zh) | 基于perv逆转录酶的先导编辑系统 | |
| WO2024089629A1 (en) | Cas12 protein, crispr-cas system and uses thereof | |
| CN110499335B (zh) | CRISPR/SauriCas9基因编辑系统及其应用 | |
| CN120173914A (zh) | 基于IscB系统的紧凑型基因组编辑器和碱基编辑器及其应用 | |
| WO2020087631A1 (zh) | 基于C2c1核酸酶的基因组编辑系统和方法 | |
| WO2023165613A1 (zh) | 5'→3'核酸外切酶在基因编辑系统中的用途和基因编辑系统及其编辑方法 | |
| WO2023208256A1 (zh) | 经分离的Cas13蛋白、基于它的基因编辑系统及其用途 | |
| CN110551762B (zh) | CRISPR/ShaCas9基因编辑系统及其应用 | |
| WO2025010350A2 (en) | Compositions and methods for precise genome editing using retrons | |
| US20230045095A1 (en) | Compositions, Methods and Systems for the Delivery of Gene Editing Material to Cells | |
| WO2025190256A1 (en) | Type ii cas protein and uses thereof | |
| EP4684009A1 (en) | Improved methods and compositions for crispr interference and activation | |
| WO2026067648A1 (en) | Type ii cas protein and uses thereof | |
| TW202526012A (zh) | 改良整合酶 | |
| EP4662324A2 (en) | Integrase variants for gene insertion in human cell |
Legal Events
| Date | Code | Title | Description |
|---|---|---|---|
| 121 | Ep: the epo has been informed by wipo that ep was designated in this application |
Ref document number: 24818760 Country of ref document: EP Kind code of ref document: A1 |
|
| WWE | Wipo information: entry into national phase |
Ref document number: 2024818760 Country of ref document: EP |
|
| NENP | Non-entry into the national phase |
Ref country code: DE |
|
| ENP | Entry into the national phase |
Ref document number: 2024818760 Country of ref document: EP Effective date: 20260109 |
|
| ENP | Entry into the national phase |
Ref document number: 2024818760 Country of ref document: EP Effective date: 20260109 |
|
| WWP | Wipo information: published in national office |
Ref document number: 2024818760 Country of ref document: EP |