EP4490288A2 - Systèmes d'édition d'acides nucléiques, méthodes et leurs utilisations - Google Patents
Systèmes d'édition d'acides nucléiques, méthodes et leurs utilisationsInfo
- Publication number
- EP4490288A2 EP4490288A2 EP23767727.3A EP23767727A EP4490288A2 EP 4490288 A2 EP4490288 A2 EP 4490288A2 EP 23767727 A EP23767727 A EP 23767727A EP 4490288 A2 EP4490288 A2 EP 4490288A2
- Authority
- EP
- European Patent Office
- Prior art keywords
- nucleic acid
- seq
- nuclease
- sequence
- amino acid
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Withdrawn
Links
Classifications
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12Q—MEASURING OR TESTING PROCESSES INVOLVING ENZYMES, NUCLEIC ACIDS OR MICROORGANISMS; COMPOSITIONS OR TEST PAPERS THEREFOR; PROCESSES OF PREPARING SUCH COMPOSITIONS; CONDITION-RESPONSIVE CONTROL IN MICROBIOLOGICAL OR ENZYMOLOGICAL PROCESSES
- C12Q1/00—Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions
- C12Q1/68—Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions involving nucleic acids
- C12Q1/6806—Preparing nucleic acids for analysis, e.g. for polymerase chain reaction [PCR] assay
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N15/00—Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
- C12N15/09—Recombinant DNA-technology
- C12N15/10—Processes for the isolation, preparation or purification of DNA or RNA
- C12N15/102—Mutagenizing nucleic acids
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N15/00—Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
- C12N15/09—Recombinant DNA-technology
- C12N15/11—DNA or RNA fragments; Modified forms thereof; Non-coding nucleic acids having a biological activity
- C12N15/113—Non-coding nucleic acids modulating the expression of genes, e.g. antisense oligonucleotides; Antisense DNA or RNA; Triplex- forming oligonucleotides; Catalytic nucleic acids, e.g. ribozymes; Nucleic acids used in co-suppression or gene silencing
- C12N15/1138—Non-coding nucleic acids modulating the expression of genes, e.g. antisense oligonucleotides; Antisense DNA or RNA; Triplex- forming oligonucleotides; Catalytic nucleic acids, e.g. ribozymes; Nucleic acids used in co-suppression or gene silencing against receptors or cell surface proteins
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N15/00—Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
- C12N15/09—Recombinant DNA-technology
- C12N15/11—DNA or RNA fragments; Modified forms thereof; Non-coding nucleic acids having a biological activity
- C12N15/62—DNA sequences coding for fusion proteins
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N15/00—Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
- C12N15/09—Recombinant DNA-technology
- C12N15/63—Introduction of foreign genetic material using vectors; Vectors; Use of hosts therefor; Regulation of expression
- C12N15/79—Vectors or expression systems specially adapted for eukaryotic hosts
- C12N15/85—Vectors or expression systems specially adapted for eukaryotic hosts for animal cells
- C12N15/86—Viral vectors
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N15/00—Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
- C12N15/09—Recombinant DNA-technology
- C12N15/87—Introduction of foreign genetic material using processes not otherwise provided for, e.g. co-transformation
- C12N15/90—Stable introduction of foreign DNA into chromosome
- C12N15/902—Stable introduction of foreign DNA into chromosome using homologous recombination
- C12N15/907—Stable introduction of foreign DNA into chromosome using homologous recombination in mammalian cells
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N9/00—Enzymes; Proenzymes; Compositions thereof; Processes for preparing, activating, inhibiting, separating or purifying enzymes
- C12N9/14—Hydrolases (3)
- C12N9/16—Hydrolases (3) acting on ester bonds (3.1)
- C12N9/22—Ribonucleases [RNase]; Deoxyribonucleases [DNase]
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12Q—MEASURING OR TESTING PROCESSES INVOLVING ENZYMES, NUCLEIC ACIDS OR MICROORGANISMS; COMPOSITIONS OR TEST PAPERS THEREFOR; PROCESSES OF PREPARING SUCH COMPOSITIONS; CONDITION-RESPONSIVE CONTROL IN MICROBIOLOGICAL OR ENZYMOLOGICAL PROCESSES
- C12Q1/00—Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions
- C12Q1/68—Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions involving nucleic acids
- C12Q1/6813—Hybridisation assays
- C12Q1/6827—Hybridisation assays for detection of mutation or polymorphism
- C12Q1/683—Hybridisation assays for detection of mutation or polymorphism involving restriction enzymes, e.g. restriction fragment length polymorphism [RFLP]
-
- C—CHEMISTRY; METALLURGY
- C07—ORGANIC CHEMISTRY
- C07K—PEPTIDES
- C07K2319/00—Fusion polypeptide
- C07K2319/01—Fusion polypeptide containing a localisation/targetting motif
- C07K2319/09—Fusion polypeptide containing a localisation/targetting motif containing a nuclear localisation signal
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N2310/00—Structure or type of the nucleic acid
- C12N2310/10—Type of nucleic acid
- C12N2310/20—Type of nucleic acid involving clustered regularly interspaced short palindromic repeats [CRISPR]
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N2740/00—Reverse transcribing RNA viruses
- C12N2740/00011—Details
- C12N2740/10011—Retroviridae
- C12N2740/16011—Human Immunodeficiency Virus, HIV
- C12N2740/16041—Use of virus, viral particle or viral elements as a vector
- C12N2740/16043—Use of virus, viral particle or viral elements as a vector viral genome or elements thereof as genetic vector
Definitions
- CRISPR/Cas nucleic acid editing systems have emerged as an approach for the targeted treatment of mammalian diseases and as a tool for diagnostics, synthetic biology, and cell biology.
- current CRISPR/Cas nucleic acid editing systems have low integration rates of exogenous genes for gene-editing applications and some mammals develop Cas-directed immunity that reduces the efficacy of gene editing in mammalian cells.
- Alternative nucleic acid editing systems are needed for precise genome targeting, selective disruption of individual genetic elements, and reduced innate and adaptive immune responses to programmable nucleases.
- nucleic acid editing systems will advance gene editing capabilities in biotechnology, medical applications, and synthetic biology.
- BRIEF SUMMARY The compositions, methods, and kits provided herein are based, in part, on the discovery of new nuclease proteins and nucleic acid editing systems for use in selectively modifying gene expression and gene editing applications.
- nucleases Provided herein are nucleases, wherein the nucleases comprise an amino acid sequence that is at least 85% identical to an amino acid sequence of: SEQ ID NO: 1, a functional fragment, or a derivative thereof.
- nucleases wherein the nucleases comprise an amino acid sequence that is at least 85% identical to an amino acid sequence of: SEQ ID NO: 2, a functional fragment, or a derivative thereof. Further provided herein are nucleases, wherein the nucleases comprise an amino acid sequence that is at least 85% identical to an amino acid sequence of: SEQ ID NO: 10, a functional fragment, or a derivative thereof. Further provided herein are nucleases, wherein the nucleases comprise an amino acid sequence that is at least 85% identical to an amino acid sequence of: SEQ ID NO: 11, a functional fragment, or a derivative thereof.
- nucleases comprising an amino acid sequence that is at least 85% identical to an amino acid sequence of: SEQ ID NO: 77, a functional fragment, or a derivative thereof. Further provided herein are nucleases, wherein the nucleases comprise an amino acid sequence that is at least 85% identical to an amino acid sequence of: SEQ ID NO: 78, a functional fragment, or a derivative thereof.
- a nuclease provided herein comprises an amino acid sequence of SEQ ID NO: 1.
- a nuclease provided herein comprises an amino acid sequence of SEQ ID NO: 2.
- a nuclease provided herein comprises an amino acid sequence of SEQ ID NO: 10.
- a nuclease provided herein comprises an amino acid sequence of SEQ ID NO: 11. In some embodiments, a nuclease provided herein comprises an amino acid sequence of SEQ ID NO: 65. In some embodiments, a nuclease provided herein comprises an amino acid sequence of SEQ ID NO: 67. In some embodiments, a nuclease provided herein comprises an amino acid sequence of SEQ ID NO: 69. In some embodiments, a nuclease provided herein comprises an amino acid sequence of SEQ ID NO: 71. In some embodiments, a nuclease provided herein comprises an amino acid sequence of SEQ ID NO: 73.
- a nuclease provided herein comprises an amino acid sequence of SEQ ID NO: 75. In some embodiments, a nuclease provided herein comprises an amino acid sequence of SEQ ID NO: 77. In some embodiments, a nuclease provided herein comprises an amino acid sequence of SEQ ID NO: 78. In some embodiments, a nuclease provided herein is encoded by a nucleic acid comprising a sequence that is at least 85% identical to SEQ ID NO: 90.
- nucleases comprising: (a) a nuclease (NUC) lobe; and (b) a helical recognition (REC) lobe, wherein the REC lobe comprises an amino acid sequence that is at least 85% identical to an amino acid sequence of: SEQ ID NO: 14, wherein the nuclease is capable of binding to the target nucleic acid via the REC lobe in the presence of a guide nucleic acid, and wherein the nuclease is capable of cleaving the target nucleic acid via the NUC lobe.
- NUC nuclease
- REC helical recognition
- nucleases comprising: (a) a nuclease (NUC) lobe; and (b) a helical recognition (REC) lobe, wherein the REC lobe comprises an amino acid sequence that is at least 85% identical to an amino acid sequence of: SEQ ID NO: 15, wherein the nuclease is capable of binding to the target nucleic acid via the REC lobe in the presence of a guide nucleic acid, and wherein the nuclease is capable of cleaving the target nucleic acid via the NUC lobe.
- NUC nuclease
- REC helical recognition
- nucleases comprising: (a) a nuclease (NUC) lobe; and (b) a helical recognition (REC) lobe, wherein the REC lobe comprises an amino acid sequence that is at least 85% identical to an amino acid sequence of: SEQ ID NO: 64, wherein the nuclease is capable of binding to the target nucleic acid via the REC lobe in the presence of a guide nucleic acid, and wherein the nuclease is capable of cleaving the target nucleic acid via the NUC lobe.
- NUC nuclease
- REC helical recognition
- nucleases comprising: (a) a nuclease (NUC) lobe comprising a Protospacer Adjacent Motif (PAM) Interacting (PI) domain comprising an amino acid sequence that is at least 85% identical to an amino acid sequence of: SEQ ID NO: 20; and (b) a helical recognition (REC) lobe.
- NUC nuclease
- PAM Protospacer Adjacent Motif
- PI Protospacer Adjacent Motif
- REC helical recognition
- nucleases comprising: (a) a nuclease (NUC) lobe comprising a Protospacer Adjacent Motif (PAM) Interacting (PI) domain comprising an amino acid sequence that is at least 85% identical to an amino acid sequence of: SEQ ID NO: 21; and (b) a helical recognition (REC) lobe.
- NUC nuclease
- PAM Protospacer Adjacent Motif
- PI Protospacer Adjacent Motif
- REC helical recognition
- nucleases comprising: (a) a nuclease (NUC) lobe, wherein the NUC lobe comprises at least one wedge 2 (WED2) domain comprising an amino acid sequence that is at least 85% identical to an amino acid sequence of SEQ ID NO: 12, SEQ ID NO: 13, SEQ ID NO: 66, SEQ ID NO: 68, or SEQ ID NO: 74; and (b) a helical recognition (REC) lobe, wherein the REC lobe comprises an amino acid sequence that is at least 85% identical to an amino acid sequence of: SEQ ID NO: 14, SEQ ID NO: 15, or SEQ ID NO: 64, wherein the nuclease is capable of binding to the target nucleic acid via the REC lobe in the presence of a guide nucleic acid, and wherein the nuclease is capable of cleaving the target nucleic acid via the NUC lobe.
- NUC nuclease
- WED2 wedge 2
- nucleic acid sequences wherein the nucleic acid sequences encode for a nuclease provided herein.
- ribonucleoprotein (RNP) complexes wherein the RNP complexes comprises: (i) a nuclease provided herein; and (ii) a guide RNA that binds to a target nucleic acid.
- vectors wherein the vectors comprise a nucleic acid sequence encoding for a nuclease provided herein.
- viral vectors wherein the viral vectors comprise a nucleic acid sequence encoding for a nuclease provided herein.
- nucleic acid editing systems comprising a nuclease provided herein.
- nucleic acid editing systems comprising: (a) a nuclease comprising an amino acid sequence that is at least 85% identical to an amino acid sequence of: SEQ ID NO: 1, SEQ ID NO: 2, or SEQ ID NO: 77; a functional fragment, or a derivative thereof; and (b) a guide nucleic acid that binds to a target nucleic acid, wherein upon binding of said guide nucleic acid to said target nucleic acid, the nuclease or the functional fragment, or the derivative thereof; the guide nucleic acid; and the target nucleic acid form a complex.
- nucleic acid editing systems comprising: a nuclease comprising a helical recognition (REC) lobe, wherein the REC lobe comprises an amino acid sequence that is at least 85% identical to an amino acid sequence of: SEQ ID NO: 14, SEQ ID NO: 15, or SEQ ID NO: 64; wherein the nuclease is capable of binding to a target nucleic acid via the REC lobe in the presence of a guide nucleic acid; and a guide nucleic acid that binds to a target nucleic acid, wherein upon binding of said guide nucleic acid to said target nucleic acid, the nuclease; the guide nucleic acid; and the target nucleic acid form a complex.
- REC helical recognition
- nucleic acid editing systems comprising: a nuclease comprising (i) a nuclease (NUC) lobe, wherein the NUC lobe comprises at least one wedge 2 (WED2) domain comprising an amino acid sequence that is at least 85% identical to an amino acid sequence of SEQ ID NO: 12, SEQ ID NO: 13, SEQ ID NO: 66, SEQ ID NO: 68, or SEQ ID NO: 74; and (ii) a helical recognition (REC) lobe, wherein the REC lobe comprises an amino acid sequence that is at least 85% identical to an amino acid sequence of: SEQ ID NO: 14, SEQ ID NO: 15, or SEQ ID NO: 64; wherein the nuclease is capable of binding to a target nucleic acid via the REC lobe in the presence of a guide nucleic acid, and wherein the nuclease capable of cleaving the target nucleic acid via
- nucleic acid editing systems comprise: (a) a nuclease comprising an amino acid sequence that is at least 85% identical to an amino acid sequence of: SEQ ID NO: 1, a functional fragment, or a derivative thereof; and (b) a guide nucleic acid that binds to a target nucleic acid, wherein upon binding of said guide nucleic acid to said target nucleic acid, the nuclease or the functional fragment, or the derivative thereof; the guide nucleic acid; and the target nucleic acid form a complex.
- nucleic acid editing systems comprise: (a) a nuclease comprising an amino acid sequence that is at least 85% identical to an amino acid sequence of: SEQ ID NO: 2, a functional fragment, or a derivative thereof; and (b) a guide nucleic acid that binds to a target nucleic acid, wherein upon binding of said guide nucleic acid to said target nucleic acid, the nuclease or the functional fragment, or the derivative thereof; the guide nucleic acid; and the target nucleic acid form a complex.
- nucleic acid editing systems comprise: (a) a nuclease comprising an amino acid sequence that is at least 85% identical to an amino acid sequence of: SEQ ID NO: 10, a functional fragment, or a derivative thereof; and (b) a guide nucleic acid that binds to a target nucleic acid, wherein upon binding of said guide nucleic acid to said target nucleic acid, the nuclease or the functional fragment, or the derivative thereof; the guide nucleic acid; and the target nucleic acid form a complex.
- nucleic acid editing systems comprise: (a) a nuclease comprising an amino acid sequence that is at least 85% identical to an amino acid sequence of: SEQ ID NO: 11, a functional fragment, or a derivative thereof; and (b) a guide nucleic acid that binds to a target nucleic acid, wherein upon binding of said guide nucleic acid to said target nucleic acid, the nuclease or the functional fragment, or the derivative thereof; the guide nucleic acid; and the target nucleic acid form a complex.
- nucleic acid editing systems comprise: (a) a nuclease comprising an amino acid sequence that is at least 85% identical to an amino acid sequence of: SEQ ID NO: 77, a functional fragment, or a derivative thereof; and (b) a guide nucleic acid that binds to a target nucleic acid, wherein upon binding of said guide nucleic acid to said target nucleic acid, the nuclease or the functional fragment, or the derivative thereof; the guide nucleic acid; and the target nucleic acid form a complex.
- nucleic acid editing systems comprise: (a) a nuclease comprising an amino acid sequence that is at least 85% identical to an amino acid sequence of: SEQ ID NO: 78, a functional fragment, or a derivative thereof; and (b) a guide nucleic acid that binds to a target nucleic acid, wherein upon binding of said guide nucleic acid to said target nucleic acid, the nuclease or the functional fragment, or the derivative thereof; the guide nucleic acid; and the target nucleic acid form a complex.
- nucleic acid editing systems comprise: (a) a nuclease comprising an amino acid sequence that is at least 85% identical to an amino acid sequence of: SEQ ID NO: 1, a functional fragment, or a derivative thereof; and (b) a guide nucleic acid that binds to a target nucleic acid, wherein upon binding of said nucleic acid editing system to said target nucleic acid, the nucleic acid editing system cleaves the target nucleic acid.
- nucleic acid editing systems comprise: (a) a nuclease comprising an amino acid sequence that is at least 85% identical to an amino acid sequence of: SEQ ID NO: 2, a functional fragment, or a derivative thereof; and (b) a guide nucleic acid that binds to a target nucleic acid, wherein upon binding of said nucleic acid editing system to said target nucleic acid, the nucleic acid editing system cleaves the target nucleic acid.
- nucleic acid editing systems comprise: (a) a nuclease comprising an amino acid sequence that is at least 85% identical to an amino acid sequence of: SEQ ID NO: 10, a functional fragment, or a derivative thereof; and (b) a guide nucleic acid that binds to a target nucleic acid, wherein upon binding of said nucleic acid editing system to said target nucleic acid, the nucleic acid editing system cleaves the target nucleic acid.
- nucleic acid editing systems comprise: (a) a nuclease comprising an amino acid sequence that is at least 85% identical to an amino acid sequence of: SEQ ID NO: 11, a functional fragment, or a derivative thereof; and (b) a guide nucleic acid that binds to a target nucleic acid, wherein upon binding of said nucleic acid editing system to said target nucleic acid, the nucleic acid editing system cleaves the target nucleic acid.
- nucleic acid editing systems comprise: (a) a nuclease comprising an amino acid sequence that is at least 85% identical to an amino acid sequence of: SEQ ID NO: 77, a functional fragment, or a derivative thereof; and (b) a guide nucleic acid that binds to a target nucleic acid, wherein upon binding of said nucleic acid editing system to said target nucleic acid, the nucleic acid editing system cleaves the target nucleic acid.
- nucleic acid editing systems comprise: (a) a nuclease comprising an amino acid sequence that is at least 85% identical to an amino acid sequence of: SEQ ID NO: 78, a functional fragment, or a derivative thereof; and (b) a guide nucleic acid that binds to a target nucleic acid, wherein upon binding of said nucleic acid editing system to said target nucleic acid, the nucleic acid editing system cleaves the target nucleic acid.
- nucleic acid editing systems comprising: (a) a nuclease comprising an amino acid sequence that is at least 85% identical to an amino acid sequence of: SEQ ID NO: 1, a functional fragment, or a derivative thereof; and (b) a guide nucleic acid that binds to a target nucleic acid, wherein the guide nucleic acid comprises a direct repeat sequence set forth in SEQ ID NO: 5, and wherein upon binding of said nucleic acid editing system to said target nucleic acid, the nucleic acid editing system cleaves the target nucleic acid.
- nucleic acid editing systems comprising: (a) a nuclease comprising an amino acid sequence that is at least 85% identical to an amino acid sequence of: SEQ ID NO: 77, a functional fragment, or a derivative thereof; and (b) a guide nucleic acid that binds to a target nucleic acid, wherein the guide nucleic acid comprises a direct repeat sequence set forth in SEQ ID NO: 5, and wherein upon binding of said nucleic acid editing system to said target nucleic acid, the nucleic acid editing system cleaves the target nucleic acid.
- nucleic acid editing systems comprising: (a) a nuclease comprising an amino acid sequence that is at least 85% identical to an amino acid sequence of: SEQ ID NO: 78, a functional fragment, or a derivative thereof; and (b) a guide nucleic acid that binds to a target nucleic acid, wherein the guide nucleic acid comprises a direct repeat sequence set forth in SEQ ID NO: 5, and wherein upon binding of said nucleic acid editing system to said target nucleic acid, the nucleic acid editing system cleaves the target nucleic acid.
- nucleic acid editing systems comprising: (a) a nuclease comprising an amino acid sequence that is at least 85% identical to an amino acid sequence of: SEQ ID NO: 10, a functional fragment, or a derivative thereof; and (b) a guide nucleic acid that binds to a target nucleic acid, wherein the guide nucleic acid comprises a direct repeat sequence set forth in SEQ ID NO: 5, and wherein upon binding of said nucleic acid editing system to said target nucleic acid, the nucleic acid editing system cleaves the target nucleic acid.
- nucleic acid editing systems comprising: (a) a nuclease comprising an amino acid sequence that is at least 85% identical to an amino acid sequence of: SEQ ID NO: 2, a functional fragment, or a derivative thereof; and (b) a guide nucleic acid that binds to a target nucleic acid, wherein the guide nucleic acid comprises a direct repeat sequence set forth in SEQ ID NO: 6, and wherein upon binding of said nucleic acid editing system to said target nucleic acid, the nucleic acid editing system cleaves the target nucleic acid.
- nucleic acid editing systems comprising: (a) a nuclease comprising an amino acid sequence that is at least 85% identical to an amino acid sequence of: SEQ ID NO: 11, a functional fragment, or a derivative thereof; and (b) a guide nucleic acid that binds to a target nucleic acid, wherein the guide nucleic acid comprises a direct repeat sequence set forth in SEQ ID NO: 6, and wherein upon binding of said nucleic acid editing system to said target nucleic acid, the nucleic acid editing system cleaves the target nucleic acid.
- nucleic acid editing systems comprising: (a) a nuclease provided herein; and (b) a guide nucleic acid sequence comprising: a direct repeat sequence of: SEQ ID NO: 5 operably linked to a variable sequence, wherein the variable sequence is complementary to a target nucleic acid.
- nucleic acid editing systems comprising: (a) a nuclease provided herein; and (b) a guide nucleic acid sequence comprising: a direct repeat sequence of: SEQ ID NO: 6 operably linked to a variable sequence, wherein the variable sequence is complementary to a target nucleic acid.
- compositions wherein the compositions comprise a delivery vehicle; and a nuclease provided herein or a polynucleotide sequence encoding for a nuclease provided herein.
- compositions wherein the compositions comprise: (a) a delivery vehicle; and (b) a nucleic acid encoding for an amino acid sequence that is at least 85% identical to any one of SEQ ID NOS: 1, 2, 10, 11, 64-78, 182, a functional fragment, or a derivative thereof.
- the nucleic acid comprises a sequence that is at least 85% identical to any one of SEQ ID NOS: 85-90, or 183.
- viral vectors comprising: a nucleic acid encoding for an amino acid sequence that is at least 85% identical to any one of: SEQ ID NOS: 1, 2, 10, 11, 64-78, 182, a functional fragment, or a derivative thereof.
- viral vectors comprising: (a) a nuclease comprising an amino acid sequence that is at least 85% identical to an amino acid sequence of any one of: SEQ ID NOS: 1, 2, 10, 11, 64-78, 182, a functional fragment, or a derivative thereof; and (b) a guide nucleic acid.
- viral vectors wherein the viral vectors comprise: (a) a nuclease comprising an amino acid sequence that is at least 85% identical to an amino acid sequence of any one of: SEQ ID NOS: 1, 2, 10, 11, 64-78, 182, a functional fragment, or a derivative thereof; and (b) a guide nucleic acid comprising any one of SEQ ID NO: 5 or SEQ ID NO: 6.
- kits comprising a nuclease provided herein.
- kits comprising: a first container comprising a nuclease provided herein; and a second container comprising a guide nucleic acid provided herein, packaging and materials therefor. Further provided herein are kits, wherein the kits comprise: a gene editing system provided herein, packaging and materials therefor.
- a target nucleic acid molecule comprising: (a) a nuclease comprising an amino acid sequence that is at least 85% identical to an amino acid sequence of any one of: SEQ ID NOS: 1, 2, 10, 11, 64-78, or 182; and (b) a guide nucleic acid comprising a sequence that binds to the target nucleic acid, wherein upon binding of said guide nucleic acid to the target nucleic acid molecule, the nuclease, the guide nucleic acid, and the target nucleic acid molecule form a complex, wherein the nuclease cleaves a target nucleic acid molecule generating a cleavage site within the target nucleic acid molecule, thereby modifying the target nucleic acid molecule.
- methods of ex vivo modifying a cell comprising: contacting a cell with a composition provided herein under conditions that permit nuclease cleavage of a target nucleic acid molecule, thereby modifying said cell.
- the methods comprise: contacting a cell with a ribonucleoprotein (RNP) complex, wherein the RNP complex comprises: (i) a nuclease comprising an amino acid sequence that is at least 85% identical to an amino acid sequence of any one of: SEQ ID NOS: 1, 2, 10, 11, 64-78, or 182; and (ii) a guide RNA that binds to a target nucleic acid, wherein upon contacting the cell with the RNP complex, the nuclease cleaves a target nucleic acid molecule, thereby modifying said cell.
- RNP ribonucleoprotein
- the methods comprise: (a) immobilizing a guide nucleic acid onto a solid support; and (b) contacting the guide nucleic acid with: (i) a test sample; and (ii) a nuclease comprising an amino acid sequence that is at least 85% identical to any one of: SEQ ID NOS: 1, 2, 10, 11, 64-78, or 182; wherein when the test sample comprises a target nucleic acid capable of binding to the guide nucleic acid, a complex is formed between the guide nucleic acid, the nuclease, and the target nucleic acid, and wherein upon formation of the complex the nuclease cleaves the target nucleic acid; and (c) detecting a signal indicating cleavage of the target nucleic acid molecule, thereby detecting the target nucleic acid in the sample.
- nuclease comprises: (a) a nuclease (NUC) lobe, wherein the NUC lobe comprises at least one wedge 2 (WED2) domain comprising an amino acid sequence that is at least 85% identical to an amino acid sequence of SEQ ID NO: 12, SEQ ID NO: 13, SEQ ID NO: 66, SEQ ID NO: 68, or SEQ ID NO: 74; and (b) a helical recognition (REC) lobe, wherein the REC lobe comprises an amino acid sequence that is at least 85% identical to an amino acid sequence of: SEQ ID NO: 14, SEQ ID NO: 15, or SEQ ID NO: 64, wherein the nuclease is capable of binding to the target nucleic acid via the REC lobe in the presence of a guide nucleic acid, and wherein the nuclease capable of binding to the target nucleic acid via the REC lobe in the presence of a guide nucleic acid, and wherein the nuclease capable
- scaffolds comprising a nuclease or a gene editing system provided herein, wherein the nuclease or the gene editing system is immobilized to the scaffold.
- the scaffolds comprise: (a) a set of nucleases, wherein at least one nuclease comprises an amino acid sequence comprising at least 85% identity to one of SEQ ID NOS: 1, 2, 10, 11, 77, 78, or 182, a functional fragment, or a derivative thereof; and (b) a set of guide nucleic acids, wherein the set of nucleases and/or the set of guide nucleic acids are immobilized to the scaffold.
- systems and devices comprising: (a) a scaffold provided herein; (b) a reporter molecule or a phase changing agent; and (c) a detector, wherein, when a target nucleic acid forms a complex with the scaffold, the reporter molecule or phase changing agent produces a detectable signal that is detected by the detector.
- systems and devices comprising: (a) a nuclease provided herein; (b) a reporter molecule or a phase changing agent; and (c) a detector, wherein, when the nuclease cleaves a target nucleic acid the reporter molecule or phase changing agent produces a detectable signal that is detected by the detector.
- fusion proteins comprising a nuclease provided herein, a fragment, or a derivative thereof.
- fusion proteins comprising: (a) a first protein construct comprising: a first nuclease comprising a helical recognition (REC) lobe, wherein the REC lobe comprises an amino acid sequence that is at least 85% identical to an amino acid sequence of: SEQ ID NO: 14, SEQ ID NO: 15, or SEQ ID NO: 64, wherein the nuclease is capable of binding to a target nucleic acid via the REC lobe in the presence of a guide nucleic acid; and (b) a second protein construct comprising: a second nuclease, an acetylase, an acetyltransferase, an ATPase, an Argonaute protein, a base editor, a Cas polypeptide, a catalytically dead Cas poly
- fusion proteins comprise: (a) a first protein construct comprising: a first nuclease comprising: (i) a nuclease (NUC) lobe, wherein the NUC lobe comprises at least one wedge 2 (WED2) domain comprising an amino acid sequence that is at least 85% identical to an amino acid sequence of SEQ ID NO: 12, SEQ ID NO: 13, SEQ ID NO: 66, SEQ ID NO: 68, or SEQ ID NO: 74; and (ii) a helical recognition (REC) lobe, wherein the REC lobe comprises an amino acid sequence that is at least 85% identical to an amino acid sequence of: SEQ ID NO: 14, SEQ ID NO: 15, or SEQ ID NO: 64, wherein the nuclease is capable of binding to a target nucleic acid via the REC lobe in the presence of a guide nucleic acid, and wherein the nuclease capable of cleaving
- fusion proteins comprise: (a) a first protein construct comprising: (a) a first protein construct comprising: a nuclease comprising an amino acid sequence that is at least 85% identical to one of SEQ ID NOS: 1, 2, 10, 11, 64-78, or 182, a functional fragment, or a derivative thereof; and (b) a second protein construct comprising: a second nuclease, an acetylase, an acetyltransferase, an ATPase, an Argonaute protein, a base editor, a Cas polypeptide, a catalytically dead Cas polypeptide, a deacetylase, a deaminase, a decapping protein, an endonuclease, an exonuclease, a helicase, a ligase, a meganuclease, a methylase, a methyltransferas
- fusion proteins comprise: (a) a first protein construct comprising: (a) a first protein construct comprising: a first nuclease consisting of: SEQ ID NO: 10, SEQ ID NO: 11, SEQ ID NO: 78, or SEQ ID NO: 182; and (b) a second protein construct comprising: a second nuclease, an acetylase, an acetyltransferase, an ATPase, an Argonaute protein, a base editor, a Cas polypeptide, a catalytically dead Cas polypeptide, a deacetylase, a deaminase, a decapping protein, an endonuclease, an exonuclease, a helicase, a ligase, a meganuclease, a methylase, a methyltransferase, a nickase,
- isolated proteins wherein the isolated proteins comprise: an amino acid sequence that is at least 85% identical to any one of SEQ ID NOS: 1, 2, 10, 11, 77, 78, or 182. Further provided herein are isolated proteins, wherein the isolated proteins comprise any one of SEQ ID NOS: 1, 2, 10, 11, 77, 78, or 182. [0035] Provided herein are isolated nucleic acids, wherein the isolated nucleic acids comprise: a sequence that encodes a protein that comprises an amino acid sequence that is at least 85% identical to any one of SEQ ID NOS: 1, 2, 10, 11, 77, 78, 182, a fragment, or a derivative thereof.
- isolated nucleic acids comprising: a sequence that encodes a protein that comprises an amino acid sequence comprising any one of SEQ ID NOS: 1, 2, 10, 11, 77, 78, 182, a fragment, or a derivative thereof.
- engineered nucleic acids wherein the engineered nucleic acids comprise: a sequence that is at least 85% identical to SEQ ID NO: 90 or SEQ ID NO: 183.
- engineered nucleic acids wherein the engineered nucleic acids comprise SEQ ID NO: 90 or SEQ ID NO: 183.
- FIGURES 1A-1B shows a schematic representation of a gene editing system.
- FIG. 1A shows a nuclease provided herein, including a NUC lobe, a REC lobe, and associated domains.
- FIG. 1A shows a nuclease provided herein, including a NUC lobe, a REC lobe, and associated domains.
- FIG. 1B shows a gRNA provided herein including a direct repeat (DR) sequence that is a scaffold for the nuclease and a variable sequence that is at least partially complementary to a target nucleic acid.
- FIGURES 2A-2C show nucleic acid editing system targeting of CD274 (also called PD-L1).
- FIG. 2A shows a schematic representation of the protocol used to determine nuclease targeting in RKO cells.
- FIG. 2B shows flow cytometry analysis of CD274 expression in cells transfected with DNA encoding Nuclease #1 without a guide RNA (CNTRL Nuc) and Nuclease #1 (SEQ ID NO: 10) with CD274-targeting gRNA, SEQ ID NO: 53.
- CNTRL Nuc guide RNA
- SEQ ID NO: 10 Nuclease #1
- FIGURE 3 shows a T7 endonuclease assay demonstrating indel formation for control cells (not transfected with a nuclease), a CNTRL Nuc, test nucleases, and Nuclease #1 and Nuclease #2 as indicated. Cleavage products are indicated with arrows.
- FIGURE 4A-4B shows graphs demonstrating of the percentage and identity of indels produced when cells were treated with an CNTRL Nuc-gRNA; Nuclease #2-gRNA; and Nuclease #1-gRNA.
- FIG. 4A shows indel/mismatch % for nucleases and a CD274-targeting gRNA (SEQ ID NO: 53). Next generation sequencing (NGS) results for Nuclease #1 are shown.
- FIG. 4B shows indel/mismatch % for nucleases and a B2M-targeting gRNA (SEQ ID NO: 38). Next generation sequencing (NGS) results for Nuclease #1 are also shown.
- FIG. 4A shows indel/mismatch % for nucleases and a CD274-targeting gRNA (SEQ ID NO: 53). Next generation sequencing (NGS) results for Nuclease #1 are shown.
- FIG. 4B shows indel/mismatch % for nucle
- FIGURE 5 shows flow cytometry analysis of RKO cells contacted with Nuclease #1 and guide nucleic acids SEQ ID NOS: 53-56.
- FIGURE 6 shows flow cytometry analysis of RKO cells contacted with Nuclease #2 and guide nucleic acids SEQ ID NOS: 57-60.
- FIGURES 7A-7B show a graph of flow cytometry and gel electrophoresis after T7 endonuclease assay for RKO cells transfected with a guide sequence targeting CD274 and a control or mutant nuclease.
- FIG.7A is a graph showing the percentage of CD247 negative RKO cells for each of the following conditions: (1) CNTRL Nuclease (SEQ ID NO: 1), no guide; (2) CNTRL Nuclease + Guide; (3) REC Mutant # 1 + Guide (E155R, SEQ ID NO: 65); (4) WED2 Mutant #1 + Guide (S532R, SEQ ID NO: 67); (5) WED2 Mutant #2 + Guide (K538R, SEQ ID NO: 69); (6) Double Mutant #1 + Guide (E155R, S532R, SEQ ID NO: 71); (7) Double Mutant #2 + Guide (E155R, K538R, SEQ ID NO: 73); (8) Double Mutant #3 + Guide (S532R, K538R, SEQ ID NO: 75); and (9) Triple Mutant + Guide (E155R, S532R, K538R, SEQ ID NO: 77).
- FIG. 7B is a T7 endonuclease assay demonstrating indel formation for control cells (not transfected with a nuclease); a CNTRL Nuclease (SEQ ID NO: 1), no guide; CNTRL Nuclease + Guide; REC Mutant # 1 + Guide (E155R, SEQ ID NO: 65); WED2 Mutant #1 + Guide (S532R, SEQ ID NO: 67); WED2 Mutant #2 + Guide (K538R, SEQ ID NO: 69); Double Mutant #1 + Guide (E155R, S532R, SEQ ID NO: 71); Double Mutant #2 + Guide (E155R, K538R, SEQ ID NO: 73); Double Mutant #3 + Guide (S532R, K538R, SEQ ID NO: 75); and Triple Mutant + Guide (E155R, S532R, K538R, SEQ ID NO: 77).
- a CNTRL Nuclease SEQ ID NO
- FIGURE 8 shows flow cytometry results for RKO cells transfected with a guide sequence targeting CD46 and a control or mutant nuclease.
- the percentage of CD46 negative RKO cells is shown for each of the following conditions: (1) CNTRL Nuclease (SEQ ID NO: 1), no guide; (2) CNTRL Nuclease (SEQ ID NO: 1) + Guide; (3) REC Mutant # 1 (SEQ ID NO: 65) + Guide (E155R); (4) WED2 Mutant #1 (SEQ ID NO: 67) + Guide (S532R); (5) WED2 Mutant #2 (SEQ ID NO: 69) + Guide (K538R); (6) Double Mutant #1 (SEQ ID NO: 71) + Guide (E155R, S532R); (7) Double Mutant # 2 (SEQ ID NO: 73) + Guide (E155R, K538R); (8) Double Mutant #3 (SEQ ID NO: 75) + Guide (S532R, K538R); and (9) Triple
- FIGURE 9 shows flow cytometry results for RKO cells transfected with a guide sequence targeting B2M and a control or mutant nuclease.
- the percentage of B2M negative RKO cells is shown for each of the following conditions: 1) CNTRL Nuclease (SEQ ID NO: 1), no guide; (2) CNTRL Nuclease (SEQ ID NO: 1) + Guide; (3) REC Mutant # 1 (SEQ ID NO: 65) + Guide (E155R); (4) WED2 Mutant #1 (SEQ ID NO: 67) + Guide (S532R); (5) WED2 Mutant #2 (SEQ ID NO: 69) + Guide (K538R); (6) Double Mutant #1 (SEQ ID NO: 71) + Guide (E155R, S532R); (7) Double Mutant # 2 (SEQ ID NO: 73) + Guide (E155R, K538R); (8) Double Mutant #3 (SEQ ID NO: 75) + Guide (S532R, K538R); and (
- FIGURES 10A-10C show a schematic and assay results for various nuclear localization sequence configurations for the control nuclease.
- FIG. 10A shows a schematic of Configuration 1 (SEQ ID NO: 85), Configuration A (SEQ ID NO: 86), Configuration B (SEQ ID NO: 87), and Configuration C (SEQ ID NO: 88).
- FIG.10B shows the percentage of CD247 negative RKO cells when cells were contacted with a Configuration 1 construct, Configuration A construct, a Configuration B construct, or a Configuration C construct.
- FIG. 10C shows flow cytometry graphs for each sequence configuration.
- FIGURES 11A-11C show graphs of the flow cytometry assay results for various nuclear localization sequence configurations comparing the WT nuclease using the NLS configuration 1 (SEQ ID NO: 85) with the optimized Triple Mutant nuclease using the NLS configuration A (SEQ ID NO: 90).
- FIG. 11A shows a graph of the percentage of CD274 negative RKO cells.
- FIG. 11B shows the percentage of B2M negative RKO cells.
- FIG. 11C shows the percentage of CD46 negative RKO cells after co-transfection with the respective sgRNAs.
- FIGURE 12 shows the NGS results of RKO cells co-transfected with the WT (SEQ ID NO: 85) or Triple Mutant nucleases (SEQ ID NOS: 89 and 90) and sgRNAs targeting either B2M, CD274 or CD46.
- Genomic DNA was extracted from transfected RKO cells and genomic loci of B2M, CD274 and CD46 genes were PCR amplified. PCR amplicons of 200 bp, 210 bp or 233 bp were sequenced via MiSeq PE150 paired-end reads.
- FIGURES 13A-13B show the effect of various PAM sequences on nuclease targeting of CD274.
- FIG. 13A show a graph of the % of CD247- negative RKO cells treated with the control nuclease and guide sequences targeting CD247.
- FIGURES 14A-14B show graphs demonstrating Triple Mutant nuclease fitness effects relative to the starting point on day 0.
- FIG.14A shows a graph of fitness effects assessed on day 7 relative to day 0 (guide RNA library representation)
- FIG.14B shows a graph of fitness effects on day 21 (final time point) post guide RNA transduction.
- the dropout effects of the fitness screen are shown as log fold-change (LFC) values relative to day 0 (X-axis).
- PAM sites e.g., “CTTC”, “TTTC” and “TTCC” are shown on the Y-axis.
- compositions, kits, methods, and uses thereof for editing a target gene are provided herein.
- nucleases (2) guide nucleic acids; (3) target nucleic acids; (4) fusion proteins; (5) nuclease activity and efficiency; (6) delivery systems and vectors; (7) pharmaceutical compositions, dosing, and administration; (8) gene editing applications and diagnostics; (9) scaffolds and systems; and (10) kits.
- Definitions [0055] All definitions, as defined and used herein, should be understood to control over dictionary definitions, definitions in documents incorporated by reference, and/or ordinary meanings of the defined terms. [0056] All references, patents and patent applications disclosed herein are incorporated by reference with respect to the subject matter for which each is cited, which in some cases may encompass the entirety of the document.
- a reference to “A and/or B”, when used in conjunction with open-ended language such as “comprising” can refer, in one embodiment, to A only (optionally including elements other than B); in another embodiment, to B only (optionally including elements other than A); in yet another embodiment, to both A and B (optionally including other elements); etc.
- “or” should be understood to have the same meaning as “and/or” as defined above.
- the term “about” and its grammatical equivalents in relation to a reference numerical value and its grammatical equivalents as used herein can include a range of values plus or minus 10% from that value.
- the amount “about 10” includes amounts from 9 to 11.
- the term “about” in relation to a reference numerical value can also include a range of values plus or minus 10%, 9%, 8%, 7%, 6%, 5%, 4%, 3%, 2%, or 1% from that value.
- the term “cell” or “engineered cell” and their grammatical equivalents as used herein refers to a cell of human or non-human origin.
- cleavage and its grammatical equivalents as used herein refers to the breakage of the covalent backbone of a DNA molecule. Cleavage can be initiated by a variety of methods including, but not limited to, enzymatic or chemical hydrolysis of a phosphodiester bond, and introducing 5’ base pair (bp) staggered cuts. Both single-stranded cleavage and double-stranded cleavage are possible, and double-stranded cleavage can occur as a result of two distinct single-stranded cleavage events. DNA cleavage can result in the production of either blunt ends or staggered ends.
- fusion polypeptides are used for targeted single-stranded RNA or single-stranded DNA cleavage. In some embodiments, fusion polypeptides are used for targeted double-stranded DNA cleavage. In some cases, cleavage refers to processing of an array of guide nucleic acids, such that individual guide nucleic acids are created. Such processing can involve recognition of a direct repeat region of a guide sequence and cleavage upstream of the direct repeat region. [0064]
- delivery vehicle refers to a formulation when in combination with a composition provided herein (e.g., a nuclease) delivers the composition to a cell.
- the delivery vehicle can be a solid, semi-solid, a liquid, or a gas.
- a delivery vehicle provided herein is man-made or a non-naturally occurring vehicle.
- a delivery vehicle is naturally occurring.
- a delivery vehicle can comprise a vesicle, an exosome, a liposome, or a cell.
- a delivery vehicle provided herein is isolated from a biological fluid. The type of delivery vehicle used can be determined by a skilled practitioner and will depend on the use of the compositions provided herein. [0065]
- the term “effective amount” or “therapeutically effective amount” refers to an amount that is sufficient to achieve or at least partially achieve the desired effect.
- engineered and its grammatical equivalents as used herein refers to one or more alterations of a nucleic acid, e.g., the nucleic acid within an organism’s genome.
- engineered refers to alterations, additions, and/or deletion of genes.
- An engineered cell can also refer to a cell with an added, deleted and/or altered gene.
- function and its grammatical equivalents as used herein refers to the capability of operating, having, or serving an intended purpose. Functional can comprise any percent from baseline to 100% of normal function.
- functional can comprise or comprise about 5, 10, 15, 20, 25, 30, 35, 40, 45, 50,55, 60, 65, 70, 75, 80, 85, 90, 95, and/or 100% of normal function.
- the term functional can mean over or over about 100% of normal function, for example, 125, 150, 175, 200, 250, 300% and/or above normal function.
- the term “functional fragment” and its grammatical equivalents as used herein refers to a protein, polypeptide or nucleic acid whose sequence is not identical to the full- length protein, polypeptide or nucleic acid, yet retains the same or similar function as the full- length protein, polypeptide or nucleic acid.
- a functional fragment can possess more, fewer, or the same number of residues as the corresponding native molecule, and/or can contain one or more amino acid or nucleotide substitutions.
- Methods for determining the function of a nucleic acid include, but are not limited to: genomic Cleavage Detection assays, RT-PCR, protein expression assays, fluorescence in situ hybridization (FISH), reporter splice-correction assays, cellular in vitro assays and animal models.
- gene editing and its grammatical equivalents as used herein refers to genetic engineering in which one or more nucleotides are inserted, replaced, or removed from a genome. Gene editing can be performed using a nuclease (e.g., a naturally-existing nuclease or an artificially engineered nuclease) or other methods known to those of ordinary skill in the art.
- nuclease e.g., a naturally-existing nuclease or an artificially engineered nuclease
- homolog and its grammatical equivalents as used herein refers to a protein of the same organism which performs the same or a similar function as the protein it is a homologue of. Homologous proteins may but need not be structurally related, or are only partially structurally related.
- modifying refers to a process of altering a gene, e.g., by deletion, insertion, mutation, rearrangement, or any combination thereof. Modifying a gene can, for example, partially or completely suppress expression of the gene. Modifying a gene can also cause activation of a different gene, for example, a downstream gene.
- modulation and its grammatical equivalents as used herein refers to a change in the activity or level of a given parameter (e.g., gene expression). For example, modulation of gene expression can include, but is not limited to, gene activation and gene repression.
- mutants can include the substitution, deletion, and insertion of one or more nucleotides in a polynucleotide. For example, up to 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 20, 25, 30, 40, 50, or more nucleotides/amino acids in a polynucleotide (cDNA, gene) or a polypeptide sequence can be substituted, deleted, and/or inserted.
- a mutation can affect the coding sequence of a gene or its regulatory sequence.
- a mutation can also affect the structure of the genomic sequence or the structure/stability of the encoded mRNA and/or the translated polypeptide structure and function [0074]
- the term “nuclease” and its grammatical equivalents as used herein refers to a naturally occurring or engineered construct or protein that can be guided by a single guide nucleic acid molecule to specifically recognize a target nucleic acid comprising a sequence that binds the guide nucleic acid molecule.
- a nuclease has nuclease activity or the ability to cleave a phosphodiester bond in a target nucleic acid.
- ortholog and its grammatical equivalents as used herein refers to a protein of a different species which performs the same or a similar function as the protein it is an orthologue of.
- percent complementary and its grammatical equivalents as used herein refers to the percentage of complementary base-pairs in a heteroduplex molecule. Complementary base-pairs are known in the art, and their complementarity is achieved by distinct interaction between nucleobases, e.g., adenine (A) pairs with thymine (T) or uracil (U); and guanine (G) pairs with cytosine (C).
- partially complementary refers to a nucleic acid sequence (e.g., a variable sequence of a guide nucleic acid) having a substantially complementary sequence to another nucleic acid sequence (e.g., a target nucleic acid sequence) but that differs from the other nucleic acid by at least two or more nucleotides.
- a partially complementary nucleic acid specifically excludes a population containing sequences that are exactly complementary, that is, a complementary sequence that has 100% complementarity. Therefore, each member of such a partially complementary nucleic acid population differs from other members of the population by two or more nucleotides, including both strands.
- nucleases [0077] Provided herein are nucleases, nucleic acid editing systems, and methods for use in modifying or altering a target nucleic acid sequence.
- the nucleases provided herein can be used in various therapeutic, diagnostic, and biotechnological applications as described herein.
- the nucleic acid editing systems provided herein have high specificity for a target nucleic acid, high efficacy, and high safety.
- the nucleases provided herein are guided by a single guide nucleic acid (e.g., a guide RNA) that binds downstream of a protospacer adjacent motif (PAM). Binding is mediated by full or partial complementarity of the variable region of the guide RNA.
- a single guide nucleic acid e.g., a guide RNA
- PAM protospacer adjacent motif
- a PAM is a short nucleic acid sequence (usually 2-6 base pairs in length) that precedes the region targeted for cleavage by the nuclease provided herein.
- the nucleases provided herein specifically cut the target nucleic acid (e.g., DNA) at the distal end of the PAM, by introducing staggered cuts.
- nucleases and nucleic acid editing systems provided herein have several advantages over other gene editing systems including, but not limited to the following: (1) the nucleases provided herein do not require a trans-activating crRNA (tracrRNA); (2) the guide nucleic acid is shorter than guide RNAs typically used for other nucleases (e.g., Cas9); (3) the nucleases provided herein are capable of processing a multi-cistronic guide RNA that enables multiplexed gene editing; (4) the nucleases provided herein induce staggered ends when cutting the target nucleic acid that enables higher integration rates of exogenous nucleic acids, improving gene editing efficiency; and (5) the nucleases provided herein can be used as a gene therapy and can optionally be used to avoid Cas9-directed immunity in vivo.
- tracrRNA trans-activating crRNA
- the guide nucleic acid is shorter than guide RNAs typically used for other nucleases (e.g., Cas9)
- nucleases that form a complex with a guide nucleic acid and a target nucleic acid provided herein. Further provided herein is a nuclease with DNA cleavage activity. [0079] The nucleases provided herein comprise a helical recognition (REC) lobe that recognizes the target nucleic acid and a nuclease (NUC) lobe, which cuts the target nucleic acid.
- REC helical recognition
- NUC nuclease
- the NUC lobe comprises a wedge 1 (WED 1) domain; a WED 2 domain; a Protospacer Adjacent Motif (PAM) Interacting (PI) domain; a WED 3 domain; a RuvC1 domain; a bridge helix; a RuvC2 domain; a nuclease (Nuc); and a RuvC3 domain.
- the nuclease comprises a REC lobe comprising a REC1 domain and a REC2 domain.
- the RuvC domain of the nucleases provided herein comprise three split RuvC motifs: RuvC I, RuvC II, and RuvC III.
- nucleases provided herein lack a HNH endonuclease domain.
- a nuclease (NUC) lobe provided herein comprises a helical region between RuvC I and RuvC II, and a zinc finger-like domain between RuvC II and RuvC III.
- a nuclease provided herein comprises an amino acid sequence that is least 85% identical to an amino acid sequence listed in Table 1.
- a nuclease provided herein comprises an amino acid sequence that is at least 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99%, 99.5%, or 100% identical to an amino acid sequence listed in Table 1.
- a nuclease provided herein comprises an amino acid sequence selected from Table 1. In some embodiments, a nuclease provided herein comprises an amino acid sequence that is at least 85% identical to an amino acid sequence of: SEQ ID NO: 1, SEQ ID NO: 2, SEQ ID NO: 10, SEQ ID NO: 11, SEQ ID NOS: 64-78, or SEQ ID NO: 182.
- a nuclease provided herein comprises an amino acid sequence that is at least 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99%, 99.5% identical to an amino acid sequence of: SEQ ID NO: 1, SEQ ID NO: 2, SEQ ID NO: 10, SEQ ID NO: 11, SEQ ID NOS: 64-78 or SEQ ID NO: 182.
- a nuclease provided herein comprises an amino acid sequence that is 100% identical to an amino acid sequence of: SEQ ID NO: 1, SEQ ID NO: 2, SEQ ID NO: 10, SEQ ID NO: 11, SEQ ID NOS: 64-78 or SEQ ID NO: 182.
- a nuclease provided herein comprises at least one of SEQ ID NOS: 12 to 29, 40, 41, 66, 68, 70, 72, 74, 76 or any combination thereof.
- Percent (%) sequence identity for a given sequence relative to a reference sequence is defined as the percentage of identical residues identified after aligning the two sequences and introducing gaps if necessary, to achieve the maximum percent sequence identity. Percent identity can be calculated using alignment methods known in the art, for instance alignment of the sequences can be conducted using publicly available software such as BLAST, Align, ClustalW2. Table 1. Nuclease Amino Acid Sequences.
- a nuclease provided herein comprises: SEQ ID NO: 12 and SEQ ID NO: 14. In some embodiments, a nuclease provided herein comprises: SEQ ID NO: 12 and SEQ ID NO: 26. In some embodiments, a nuclease provided herein comprises: SEQ ID NO: 14 and SEQ ID NO: 26. In some embodiments, a nuclease provided herein comprises: SEQ ID NO: 12, SEQ ID NO: 14, and SEQ ID NO: 26. In some embodiments, a nuclease provided herein comprises: SEQ ID NO: 13 and SEQ ID NO: 15.
- a nuclease provided herein comprises: SEQ ID NO: 13 and SEQ ID NO: 27. In some embodiments, a nuclease provided herein comprises: SEQ ID NO: 15 and SEQ ID NO: 27. In some embodiments, a nuclease provided herein comprises: SEQ ID NO: 13, SEQ ID NO: 15, and SEQ ID NO: 27. [0082] In some embodiments, a nuclease provided herein comprises: SEQ ID NO: 66 and SEQ ID NO: 14, SEQ ID NO: 15, or SEQ ID NO: 64. In some embodiments, a nuclease provided herein comprises: SEQ ID NO: 66 and SEQ ID NO: 26.
- a nuclease provided herein comprises: SEQ ID NO: 64 and SEQ ID NO: 26. In some embodiments, a nuclease provided herein comprises: SEQ ID NO: 66, SEQ ID NO: 14, and SEQ ID NO: 26. In some embodiments, a nuclease provided herein comprises: SEQ ID NO: 68 and SEQ ID NO: 15. In some embodiments, a nuclease provided herein comprises: SEQ ID NO: 68 and SEQ ID NO: 27. In some embodiments, a nuclease provided herein comprises: SEQ ID NO: 15 and SEQ ID NO: 27.
- a nuclease provided herein comprises: SEQ ID NO: 68, SEQ ID NO: 15, and SEQ ID NO: 27.
- Additional combinations of nuclease amino acid sequences can include but are not limited to the sequences provided in Table 2.
- the amino acid sequences in Table 2 can be combined to generate a functional nuclease provided herein (e.g., a nuclease that cleaves a target nucleic acid).
- a nuclease provided herein comprises at least two amino acid sequences selected from Table 2, in any combination.
- Table 2 Nuclease Amino Acid Sequence Combinations.
- nucleases comprising an amino acid sequence that is derived from one or more bacteria.
- the nuclease is a chimeric nuclease, wherein the chimeric nuclease comprises an amino acid sequence that is derived from two or more different bacteria.
- a nuclease provided herein comprises an amino acid sequence that is derived or isolated from a bacterium of the genus Prevotella.
- a nuclease provided herein comprises an amino acid sequence that is derived from a Prevotella ruminicola bacterium.
- a nuclease provided herein comprises an amino acid sequence that is derived from a P. bryantii bacterium, a P. brevis bacterium, or a P. albensis bacterium. In some embodiments, a nuclease provided herein comprises an amino acid sequence that is derived from a Prevotella ruminicola, wherein the Prevotella ruminicola is of the strain ATCC 19189 / JCM 8958 / 23. In some embodiments, a nuclease provided herein comprises an amino acid sequence that is derived or isolated from a bacterium of the genus Rumminococcus.
- a nuclease provided herein comprises an amino acid sequence that is derived from a Rumminococcus bovis bacterium. In some embodiments, a nuclease provided herein comprises an amino acid sequence that is derived from a Rumminococcus bovis, wherein the Rumminococcus bovis is of the strain ATCC 10801. In some embodiments, a nuclease provided herein comprises an amino acid sequence that is derived from a Ruminococcus sp. AF37-3AC bacterium. In some embodiments, a nuclease provided herein comprises an amino acid sequence that is derived from a Ruminococcus bromii bacterium.
- a nuclease provided herein comprises an amino acid sequence that is derived from a Ruminococcus sp. AM36-18. In some embodiments, a nuclease provided herein comprises an amino acid sequence that is derived from a Ruminococcus sp. AM28-29LB. [0085] In some embodiments, a nuclease provided herein further comprises a nuclear localization sequence (NLS). In some embodiments, the nuclease comprises more than one nuclear localization sequence (NLS). An NLS targets a protein to the nucleus of a cell, localizing a nuclease provided herein in close proximity to a target nucleic acid within the nucleus of a cell.
- the NLS comprises an amino acid sequence that is at least 99% identical to an NLS sequence listed in Table 3, a functional fragment, or a derivative thereof. In some embodiments, the NLS comprises an amino acid sequence that is 100% identical to an NLS sequence listed in Table 3, a functional fragment, or a derivative thereof. Table 3. Nuclear Localization Sequences.
- a nuclease provided herein comprises an SV40 NLS. In some embodiments, a nuclease provided herein comprises a nucleoplasmin nuclear localization sequence. In some embodiments, a nuclease provided herein comprises an SV40 NLS and a nucleoplasmin nuclear localization sequence.
- a nuclease provided herein comprises SEQ ID NO: 3, SEQ ID NO: 4, SEQ ID NO: 51, SEQ ID NO: 63, or SEQ ID NO: 153.
- a nuclease provided herein comprises a linker.
- a linker is a molecular entity that can directly or indirectly connect at two parts of a composition, e.g., from the first protein construct to the second protein construct and the second protein construct to a third protein construct, and so on. Linkers can be configured according to a specific need, e.g., stability or length between two amino acid sequences.
- linkers can be configured to allow multimerization of at least two nucleases provided herein (e.g., to from a di-, tri-, tetra-, penta-, or higher multimeric complex) while retaining biological activity (e.g., cleavage of a target nucleic acid or set of target nucleic acids).
- linkers can be configured to facilitate expression and purification of the nuclease provided herein.
- a linker can be configured to have any length in a form of a peptide, peptidomimetic, a protein, a nucleic acid (e.g., DNA or RNA), or any combinations thereof.
- the linker can vary from about 2 to about 4 amino acids long, from about 2 to about 6 amino acids long, from about 2 to about 10 amino acids long, or from about 2 to about 100 amino acids long. Longer or shorter linker sequences can be also used for the nucleases and fusion proteins provided herein. [0089] In some embodiments, the linker can be configured to have a sequence comprising at least one of the amino acids selected from the group consisting of glycine (Gly), serine (Ser), asparagine (Asn), threonine (Thr), methionine (Met) or alanine (Ala).
- uncharged polar amino acids e.g., Gln, Cys, or Tyr
- nonpolar amino acids e.g., Val, Leu, Pro, Phe, and Trp
- the linkers can be of any shape.
- a linker can be linear.
- a linker can be folded.
- a linker can be branched.
- a linker further comprises a detectable label.
- a linker can be a chemical linker of any length.
- chemical linkers can comprise a direct bond or an atom such as oxygen or sulfur, a unit such as NH, C(O), C(O)NH, SO, SO2, SO2NH, or a chain of atoms, such as substituted or unsubstituted C1-C6 alkyl, substituted or unsubstituted C2-C6 alkenyl, substituted or unsubstituted C 2 -C 6 alkynyl, substituted or unsubstituted C 6 -C 12 aryl, substituted or unsubstituted C 5 -C 12 heteroaryl, substituted or unsubstituted C 5 -C 12 heterocyclyl, substituted or unsubstituted C 3 -C 12 cycloalkyl, where one or more methylenes can be interrupted or terminated by O, S, S(O), SO2, NH, or C(O).
- the chemical linker can be a polymer chain (branched or linear).
- the linker is a GSGSGS linker (SEQ ID NO: 52).
- a linker can comprise non-charged or charged amino acids.
- a linker can comprise alpha-helical domains.
- a linker can comprise a chemical cross linker.
- a linker can be of different lengths to adjust the function of fused domains and their physical proximity.
- a linker can comprise peptides with ligand-inducible conformational changes.
- a nuclease provided herein further comprises a tag.
- the tag is a C-terminal tag.
- the tag is an N-terminal tag.
- the tag provided herein can be used to detect the nuclease in a sample or immobilize the nuclease to a solid support (e.g, a column).
- the tag is a c-myc tag.
- the tag is a recombinant protein.
- Non-limiting examples of protein tags include: His tags (e.g., HHHHHH (SEQ ID NO: 155)), chitin binding protein (CBP), maltose binding protein (MBP), Strep-tag (WSHPQFEK) (SEQ ID NO: 156), and glutathione-S-transferase (GST).
- the tag is a solubilization tag (e.g., thioredoxin (TRX), polyNANP, MBP, and GST).
- the tag is a FLAG-tag (DYKDDDDK (SEQ ID NO: 157)).
- the tag is an epitope tag.
- Non-limiting examples of epitope tags include: an ALFA-tag (SRLEEELRRRLTE (SEQ ID NO: 158)), a V5-tag (GKPIPNPLLGLDST (SEQ ID NO: 159)), a Myc-tag (EQKLISEEDL (SEQ ID NO: 30)), an HA-tag (YPYDVPDYA (SEQ ID NO: 160)), a Spot-tag (PDRVRAVSHWSS (SEQ ID NO: 161)), a T7-tag (MASMTGGQQMG (SEQ ID NO: 162)) and a NE-tag (TKENPRSNQEESYDDNES (SEQ ID NO: 163)).
- the tag is a HiBiT- tag.
- the tag is an Fc tag, a Nus-tag, a Biotin Carboxyl Carrier Protein (BCCP) tag, an SPB tag (MDEKTTGWRGGHVVEGLAGELEQLRARLEHHPQGQREP (SEQ ID NO: 164)), a Ty tag (EVHTNQDPLD (SEQ ID NO: 165)), an Xpress tag (DLYDDDDK (SEQ ID NO: 166)).
- the tag is a horseradish peroxidase (HRP), a chloramphenicol acetyltransferase (CAT) beta-galactosidase, a beta-glucuronidase, or a luciferase.
- the tag is a fluorescent tag or a fluorophore.
- fluorescent tags include: red fluorescent protein, cjBlue, aeBlue, amilGFP, amilCP, Green fluorescent protein (GFP), mCherry, mOrange, blue fluorescent protein (BFP), yellow fluorescent protein (YFP) and cyan fluorescent protein (CFP), Venus, Cerulean, HcRed, DsRed, among many others. Any tag that facilitates the detection or immobilization of a nuclease provided herein or a guide nucleic acid provided herein can be used.
- a nuclease provided herein comprises at least one amino acid substitution or deletion.
- a nuclease provided herein comprises at least one amino acid substitution or deletion within the nuclease domain of the nuclease amino acid sequence.
- the amino acid substitution or deletion is between position 1067 and 1262 of the nuclease amino acid sequence.
- the amino acid substitution or deletion occurs at position 1165 or position 1186.
- the amino acid substitution is an R1165A substitution.
- the amino acid substitution is R1186A substitution.
- a nuclease is modified to become a catalytically dead nuclease or a partially dead nuclease.
- a nuclease comprises SEQ ID NO: 61 or SEQ ID NO: 62, a functional fragment, or a derivative thereof.
- the amino acid substitution is within the WED2 domain of a nuclease provided herein.
- the amino acid substitution is within the REC1 domain of a nuclease provided herein.
- a nuclease provided herein comprises one or more amino acid substitution.
- a nuclease provided herein comprises two or more amino acid substitutions.
- a nuclease provided herein comprises three or more amino acid substitutions.
- a nuclease provided herein comprises an amino acid substitution at glutamate 155 (Glu155, E155), serine 532 (Ser 532, S532), lysine 538 (Lys 538, K538), or any combination thereof as compared to the sequence of SEQ ID NO: 1.
- a nuclease provided herein comprises an amino acid substitution at E155, S532, and/or K538, wherein the amino acid substitution at any one of positions E155, S532, and/or K538 comprises a substitution of an amino acid residue to a different amino acid residue comprising a hydrophobic amino acid residue, a hydrophilic amino acid residue, a charged amino acid residue that is a basic amino acid residue, or an acidic amino acid residue, or an aliphatic amino acid residue.
- the different amino acid residue comprises a charged amino acid residue that is a basic amino acid residue.
- the different amino acid residue is an arginine (Arg, R) or a lysine (Lys, K), a glutamate (Glu, E), an aspartate (Asp, D), or a valine (Val, V).
- a nuclease provided herein comprises an amino acid substitution listed in Table 8.
- a nuclease provided herein comprises an amino acid sequence of any one of SEQ ID NOS: 64-78.
- a nuclease provided herein is encoded by any one of the nucleic acid sequences of SEQ ID NOS: 85-90, or 183.
- a nuclease provided herein is encoded by a nucleic acid comprising a sequence that is at least 85% identical to SEQ ID NO: 90, or 183.
- Methods of modifying a nucleic acid sequence encoding a nuclease include, e.g., direct mutagenesis or random mutagenesis.
- Guide Nucleic Acids [0097] Provided herein are nucleic acid editing systems, wherein the nucleic acid editing systems comprise a guide nucleic acid.
- a nuclease provided herein binds to a guide nucleic acid provided herein.
- a guide nucleic acid is a guide DNA (gDNA) or a guide RNA (gRNA).
- the guide nucleic acid comprises RNA.
- the guide nucleic acid comprises RNA and/or DNA.
- the guide nucleic acid comprises RNA and a non- canonical nucleobase or non-canonical nucleoside.
- Canonical nucleobases include, for example, adenine (A), cytosine (C), guanine (G), thymine (T), and uracil (U).
- Nucleoside analogues are non-canonical nucleosides which contain a nucleobase analogue and a sugar.
- nucleosides transition to nucleotides capable of incorporation into growing RNA strands. While non-canonical nucleosides are capable of incorporating into a newly synthesized nucleic acid strand, they are generally not capable of being “read” by an RNA polymerase for amplification and act as chain terminators that stop an RNA polymerase.
- Modified nucleobases which can be incorporated into modified nucleosides and nucleotides and be present in the guide nucleic acid include: m5C (5-methylcytidine), m5U (5-methyluridine), m6A (N6- methyladenosine), s2U (2-thiouridine), Um (2′-O-methyluridine), m1A (1-methyladenosine); m2A (2-methyladenosine); Am (2-1-O-methyladenosine); ms2m6A (2-methylthio-N6- methyladenosine); i6A (N6-isopentenyladenosine); ms2i6A (2-methylthio- N6isopentenyladenosine); io6A (N6-(cis-hydroxyisopentenyl)adenosine); ms2io6A (2- methylthio-N6-(cis-hydroxyisopentenyl)
- any one or any combination of these modified nucleobases may be included in the guide sequence. Many of these modified nucleobases and their corresponding ribonucleosides are available from commercial suppliers. If desired, the guide nucleic acid can contain phosphoramidate, phosphorothioate, and/or methylphosphonate linkages. [0099] Guide nucleic acids provided herein that comprise at least one modified nucleotide can be prepared using any suitable method. Several suitable methods are known in the art for producing nucleic acid molecules that contain modified nucleotides.
- a guide nucleic acid that contains modified nucleotides can be prepared by transcribing (e.g., in vitro transcription) a DNA that encodes for a guide RNA using a suitable DNA-dependent RNA polymerase, such as T7 phage RNA polymerase, SP6 phage RNA polymerase, T3 phage RNA polymerase, and the like, or mutants of these polymerases which allow efficient incorporation of modified nucleotides into RNA molecules.
- the transcription reaction can contain nucleotides and modified nucleotides, and other components that support the activity of the selected polymerase, such as a suitable buffer, and suitable salts.
- the guide nucleic acid is from about 1 base pair to about 30 base pairs in length. In some embodiments, the guide nucleic acid is at least partially complementary to a target nucleic acid sequence. In some embodiments, the guide nucleic acid binds a target nucleic acid. In some embodiments, the guide nucleic acid is at least about 75% complementary to a target polynucleotide sequence.
- the guide nucleic acid is at least about 80% , 85%, 90%, 95%, 99% complementary to a target nucleic acid sequence. In some embodiments, a guide nucleic acid is fully or 100% complementary to a target nucleic acid sequence.
- Guide nucleic acids provided herein can comprise a direct repeat sequence.
- a direct repeat sequence is generally within 20 to 30 base pairs (bps) in length.
- the direct repeat sequence is at least about 16 bps in length or more, 17 bps in length or more, 18 bps in length or more, 19 bps in length or more, 20 bps in length or more, 21 bps in length or more,, 22 bps in length or more, 23 bps in length or more, 24 bps in length or more, 25 bps in length or more, 26 bps in length or more, 27 bps in length or more, 28 bps in length or more, 29 bps in length or more, 30 bps in length or more.
- guide nucleic acids provided herein comprise one or more, two or more, three or more, or four or more direct repeat sequences.
- the length and sequence composition of the guide nucleic acid provided herein may influence the binding of the nuclease provided herein to the guide nucleic acid and the target nucleic acid.
- the direct repeat sequence comprises a short stem-loop structure.
- the direct repeat sequence comprises a sequence of SEQ ID NO: 5 or SEQ ID NO: 6, a functional fragment, or a derivative thereof.
- the guide nucleic acid comprise SEQ ID NO: 5 or SEQ ID NO: 6; and a sequence that is at least partially complementary to a target nucleic acid sequence. Exemplary guide nucleic acids and their corresponding targets and nucleases are provided in Table 4 below.
- the guide nucleic acid comprises a sequence that is at least 85% identical to a sequence set forth in any one of SEQ ID NOS: 38, 39, 53-60, 79-84, 113-152.
- the guide nucleic acid comprises a sequence that is at least 90% identical to a sequence set forth in any one of SEQ ID NOS: 38, 39, 53-60, 79-84, 113-152. In some embodiments, the guide nucleic acid comprises a sequence that is at least 95% identical to a sequence set forth in any one of SEQ ID NOS: 38, 39, 53-60, 79-84, 113-152. In some embodiments, the guide nucleic acid comprises a sequence that is at least 99% identical to a sequence set forth in any one of SEQ ID NOS: 38, 39, 53-60, 79-84, 113-152.
- the guide nucleic acid comprises a sequence set forth in any one of SEQ ID NOS: 38, 39, 53-60, 79-84, 113-152.
- the guide nucleic acid comprises a variable sequence targeting a nucleic acid encoding for CD274, wherein the variable sequence comprises any one of SEQ ID NOS: 7-9, 36, 79, 113-152.
- the guide nucleic acid comprises any one of SEQ ID NOS: 53-60, or 113-152.
- the guide nucleic acid comprises a variable sequence targeting a nucleic acid encoding for B2M, wherein the variable sequence comprises SEQ ID NO: 37 or SEQ ID NO: 81.
- the guide nucleic acid comprises any one of SEQ ID NOS: 38-39, SEQ ID NO: 84.
- the guide nucleic acid comprises a variable sequence targeting a nucleic acid encoding for cluster of differentiation 46 (CD46), wherein the variable sequence comprises SEQ ID NO: 80.
- the guide nucleic acid comprises SEQ ID NO: 83.
- the guide nucleic acid comprises a sequence of any one of: SEQ ID NOS: 5-9, 36-39, 53-60, 79-84, 113-152, or a sequence that is complementary to any one of SEQ ID NOS: 5-9, 36-39, 53-60, 79-84, 113-152.
- nucleic acid editing systems comprising: (a) a nuclease provided herein; and (b) a guide nucleic acid sequence comprising: a direct repeat sequence of SEQ ID NO: 5 or SEQ ID NO: 6 operably linked to a variable sequence, wherein the variable sequence is complementary to a target nucleic acid.
- the guide nucleic acid comprises a variable sequence.
- the variable sequence is at least partially complementary to a target nucleic acid sequence.
- guide nucleic acids provided herein comprise a direct repeat sequence and a variable sequence.
- a nucleic acid editing system comprises a nuclease comprising an amino acid sequence that is at least 85% identical to an amino acid sequence of: SEQ ID NO: 1, a functional fragment, or a derivative thereof; and (b) a guide nucleic acid that binds to a target nucleic acid, wherein the guide nucleic acid comprises a direct repeat sequence set forth in SEQ ID NO: 5.
- a nucleic acid editing system comprises a nuclease comprising an amino acid sequence that is at least 85% identical to an amino acid sequence of: SEQ ID NO: 10, a functional fragment, or a derivative thereof; and (b) a guide nucleic acid that binds to a target nucleic acid, wherein the guide nucleic acid comprises a direct repeat sequence set forth in SEQ ID NO: 5.
- a nucleic acid editing system comprises a nuclease comprising an amino acid sequence that is at least 85% identical to an amino acid sequence of: SEQ ID NO: 2, a functional fragment, or a derivative thereof; and (b) a guide nucleic acid that binds to a target nucleic acid, wherein the guide nucleic acid comprises a direct repeat sequence set forth in SEQ ID NO: 6.
- a nucleic acid editing system comprises a nuclease comprising an amino acid sequence that is at least 85% identical to an amino acid sequence of: SEQ ID NO: 11, a functional fragment, or a derivative thereof; and (b) a guide nucleic acid that binds to a target nucleic acid, wherein the guide nucleic acid comprises a direct repeat sequence set forth in SEQ ID NO: 6.
- guide nucleic acids provided herein are delivered to a cell via an expression cassette.
- An expression cassette is a nucleic acid construct, generated recombinantly or synthetically, with a series of specified nucleic acid elements that permit transcription of a nucleic acid sequence in a host cell.
- An expression cassette may be part of a plasmid, viral genome, or nucleic acid fragment.
- an expression cassette includes a nucleic acid to be transcribed, operably linked to a promoter.
- guide nucleic acids provided herein further comprise a promoter sequence.
- the promotor sequence can be any promoter that permits expression of the guide RNA in a cell.
- a promoter capable of transcribing the guide RNA in a cell includes a heat shock /heat inducible promoter operably linked to a nucleotide sequence encoding the guide RNA.
- a promoter includes necessary nucleic acid sequences near the start site of transcription, such as, in the case of a polymerase II type promoter, a TATA element.
- promotors include: a ubiquitin promoter (e.g., U6); a Pol-II promoter; a Pol-III promoter; a Banana Streak Virus Promoter (BSV promoter); a Phospholipid Transfer Protein Promoter (PLTP promoter); H1, EF-1a, and a T7 promoter.
- guide nucleic acids provided herein comprise a promoter sequence, a direct repeat sequence, and a variable sequence.
- the guide nucleic acid optionally, comprises a polynucleotide linker. In some embodiments, the guide nucleic acid comprises two direct repeat sequences and a variable sequence, wherein the variable sequence is at least 75% complementary to a target nucleic acid sequence.
- Guide nucleic acids provided herein may comprise non-naturally occurring sequences or engineered sequences.
- a guide nucleic acid sequence can comprise synthetic RNA or DNA molecules.
- guide nucleic acids can be generated by transducing or transfecting cells with a plasmid DNA that encodes for an expression cassette.
- a U6 promoter can be used to drive the expression of the guide RNA.
- the U6 promoter comprises the sequence of SEQ ID NO: 31.
- a guide nucleic acid provided herein can be at least partially complementary to a target polynucleotide sequence of interest.
- a target nucleic acid provided herein comprises a gene sequence.
- the gene sequence is a mammalian gene sequence.
- the gene sequence is a human gene sequence.
- a target nucleic acid provided herein comprises a DNA.
- a target nucleic acid provided herein comprises a double-stranded DNA.
- a target nucleic acid provided herein comprises a single-stranded DNA.
- a target nucleic acid provided herein comprises RNA.
- a target nucleic acid provided herein comprises single stranded RNA. In some embodiments, a target nucleic acid provided herein comprises double stranded RNA. In some embodiments, a target nucleic acid binds to a variable sequence of a guide nucleic acid. In some embodiments, the target nucleic acid comprises a protospacer-adjacent motif ("PAM"), wherein the PAM is a short T-rich sequence. In some embodiments, cleavage of a target nucleic acid occurs downstream from the PAM sequence. Molecules from different bacterial species can recognize different sequence motifs (e.g., PAM sequences). In some embodiments, the PAM is a T-rich PAM.
- PAM protospacer-adjacent motif
- the PAM has the nucleotide sequence (T)XN, wherein the X is the number of thymines (e.g, 1-10), and N is A, G, C or T. In certain embodiments, X is equal to 2, and thus, the PAM is TTN. In some embodiments, X is 3, and thus, the PAM is TTTN.
- a nuclease provided herein recognizes the sequence motif TTTN and directs cleavage of a target nucleic acid sequence 1-24 (e.g., 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, or 24) bp downstream from that sequence.
- the PAM site is a TTTA sequence or a TTTC sequence. In some embodiments, the PAM site is a sequence listed in Table 13.
- the target nucleic acid sequence can be from any cell or organism. Determining the appropriate sequence for the guide nucleic acid to bind to a target nucleic acid provided herein may depend on the sequence of the target and structure of the nuclease provided herein. Methods of designing a guide nucleic acid for gene editing are known in the art and include, e.g., using software and databases such as Breaking-Cas, Cas-OFFinder, CRISPR-DT, CHOPCHOP, CCTOP, CRISPick, or CRISPOR.
- the target nucleic acid sequence is a mammalian DNA sequence. In some embodiments, the target nucleic acid sequence is a human DNA sequence. In some embodiments, the target nucleic acid sequence is a sequence expressed by a tumor or a cancer cell. In some embodiments, the target nucleic acid comprises a sequence associated with a disease or condition. In some embodiments, the target nucleic acid sequence is a sequence expressed by an immune cell.
- Non-limiting examples of target nucleic acids include nucleic acid encoding: cluster of differentiation 274 (CD274, also called programmed cell death ligand 1 or PD-L1), beta-2-microglobulin (B2M), CD46, CD81, C-C chemokine receptor type 5 (CCR5), BAF Chromatin Remodeling Complex Subunit 11 (BCL11A), transthyretin (TTR), programmed cell death 1 (PD-1), programmed cell death 2 (PD-2), T cell immunoglobulin and ITIM domain (TIGIT), Lymphocyte Activating 3 (LAG3), and Cytokine-inducible SH2-containing protein (CISH).
- CD274 cluster of differentiation 274
- B2M beta-2-microglobulin
- CD46 CD81
- CD81 C-C chemokine receptor type 5
- CCR5 C-C chemokine receptor type 5
- BCL11A BAF Chromatin Remodeling Complex Subunit 11
- TTR transthyretin
- Fusion proteins comprising a nuclease provided herein.
- the nucleases provided herein can be fused to one or more additional protein constructs for a particular application (e.g., therapeutic gene editing or cellular gene editing for biotechnology applications in vitro or ex vivo ). Multimerization of an effector or protein tag by direct fusion with the nuclease provided herein can localize the effector domain to a site matched by the guide nucleic acid provided herein.
- the fusion proteins provided herein can be used to modulate transcription, expression, translation, RNA splicing, methylation status, DNA unwinding, or any other property that alters nucleic acid expression in a cell.
- nucleases provided herein are fused to at least one additional element.
- a fusion protein comprises a first protein construct comprising a nuclease provided herein.
- a fusion protein provided herein comprises a second protein construct.
- fusion proteins provided herein optionally, comprise a linker between the first protein construct and second protein construct.
- fusion proteins provided herein further comprise a third, fourth, and/or fifth protein construct.
- a fusion protein provided herein comprises one or more linkers between one or more protein constructs.
- a fusion protein provided herein comprises a cleavable or non-cleavable linker between the different protein constructs or domains of the fusion protein.
- a linker can be a polypeptide linker, such as a linker that is 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29, 30, 31, 32, 33, 34, 35, 36, 37, 38, 39, 40, 41, 42, 43, 44, 45, 46, 47, 48, 49, 50, or more amino acids long.
- two polypeptide constructs that are “fused” need not be directly adjacent to each other.
- Fused polypeptide sequences can be fused by a linker, or by an additional functional or non-functional polypeptide sequence that is fused to the polypeptide sequences.
- the number of additional elements in the nuclease-fusion protein will depend on a number of factors, including, e.g., size or molecule weight of the fusion protein for delivery to a cell, or translocation of the fusion protein desired in a cell type of interest.
- a fusion protein provided herein comprises a first protein construct comprising: a nuclease comprising an amino acid sequence that is at least 85% identical to any one of SEQ ID NOS: 1, 2, 10, 11, 77-78, 182, a functional fragment, or a derivative thereof.
- a fusion protein provided herein comprises a first protein construct comprising: a nuclease comprising an amino acid sequence that is at least 90% identical to any one of SEQ ID NOS: 1, 2, 10, 11, 77-78, 182, a functional fragment, or a derivative thereof. In some embodiments, a fusion protein provided herein comprises a first protein construct comprising: a nuclease comprising an amino acid sequence that is at least 95% identical to any one of SEQ ID NOS: 1, 2, 10, 11, 77-78, 182, a functional fragment, or a derivative thereof.
- a fusion proteins provided herein comprise a first protein construct comprising: a nuclease comprising an amino acid sequence that is at least 99% identical to any one of SEQ ID NOS: 1, 2, 10, 11, 77-78, 182, a functional fragment, or a derivative thereof.
- a fusion protein provided herein comprises a first protein construct comprising: a nuclease comprising an amino acid sequence that is 100% identical to any one of SEQ ID NOS: 1, 2, 10, 11, 77-78, 182, a functional fragment, or a derivative thereof.
- a fusion protein provided herein comprises one or more amino acid sequence selected from Table 1.
- a fusion protein comprises: a first protein construct comprising: a first nuclease comprising a helical recognition (REC) lobe, wherein the REC lobe comprises an amino acid sequence that is at least 85% identical to an amino acid sequence of: SEQ ID NO: 14; or SEQ ID NO: 15, or SEQ ID NO: 64, wherein the nuclease is capable of binding to a target nucleic acid via the REC lobe in the presence of a guide nucleic acid.
- REC helical recognition
- the first protein construct comprises a first nuclease comprising: (i) a nuclease (NUC) lobe, wherein the NUC lobe comprises at least one wedge 2 (WED2) domain comprising an amino acid sequence that is at least 85% identical to an amino acid sequence of SEQ ID NO: 12, SEQ ID NO: 13, SEQ ID NO: 66, SEQ ID NO: 68, or SEQ ID NO: 74; and (ii) a helical recognition (REC) lobe, wherein the REC lobe comprises an amino acid sequence that is at least 85% identical to an amino acid sequence of: SEQ ID NO: 14, SEQ ID NO: 15, or SEQ ID NO: 64, wherein the nuclease is capable of binding to a target nucleic acid via the REC lobe in the presence of a guide nucleic acid.
- NUC nuclease
- WED2 wedge 2
- REC helical recognition
- the first protein construct (e.g., a first nuclease) is capable of cleaving the target nucleic acid via the NUC lobe.
- a fusion protein provided herein comprises a second protein construct.
- a fusion protein provided herein comprises a protein construct selected from Table 5.
- Table 5 provides a list of proteins that can be fused to a nuclease provided herein, a functional fragment, or a derivative thereof.
- Table 5 also provides the function of various proteins and exemplary NCBI/UniProt accession numbers, which can be selected using ordinary skill in the art, for instance, based on amino acid sequence, crystal structure data, computational structure predictions, and functional activity. Table 5. Protein constructs.
- a fusion protein provided herein comprises a protein construct comprising an amino acid sequence that is at least 85% identical to an amino acid sequence listed in Table 5, a functional fragment, or a derivative thereof.
- a fusion protein provided herein comprises a protein construct comprising a protein from Table 5, a homolog, or an ortholog thereof.
- Orthologous proteins may but need not be structurally related, or are only partially structurally related. Homologs and orthologs may be identified by homology modelling, e.g., BLAST, NCBI, or UniProt.
- the additional protein construct is of eukaryotic origin, such as of human, rat or lamprey origin.
- the additional protein construct is of prokaryotic origin, such as a bacterium or archaeon.
- the additional protein construct comprises a viral protein.
- a fusion protein provided herein comprises one or more additional protein constructs, wherein the one or more additional protein constructs comprises: a second nuclease, an acetylase, an acetyltransferase, an ATPase, an Argonaute protein, a base editor, a Cas polypeptide, a catalytically dead Cas polypeptide, a deacetylase, a deaminase, a decapping protein, an endonuclease, an exonuclease, a helicase, a ligase, a meganuclease, a methylase, a methyltransferase, a nickase, a polymerase, a protease,
- a fusion protein provided herein comprise a Cas protein.
- the Cas is a catalytically dead Cas or a partially dead Cas (e.g., a nickase).
- the catalytically dead Cas or the partially dead Cas is selected from the group consisting of catalytically dead derivatives of: Cas1, Cas1B, Cas2, Cas3, Cas4, Cas5, Cas6, Cas7, Cas8, Cas9, Cas10, Csy1, Csy2, Csy3, Cse1, Cse2, Csc1, Csc2, Csa5, Csn2, Csm2, Csm3, Csm4, Csm5, Csm6, Cmr1, Cmr3, Cmr4, Cmr5, Cmr6, Csb1, Csb2, Csb3, Csx17, Csx14, Csx10, Csx16, CsaX, Csx3, C
- the nickase comprises a sequence that is at least 90% identical to SEQ ID NO: 61 or SEQ ID NO: 62. In some embodiments, the nickase comprises a sequence that is at least 95% identical to SEQ ID NO: 61 or SEQ ID NO: 62. In some embodiments, the nickase comprises a sequence comprising SEQ ID NO: 61. In some embodiments, the nickase comprises a sequence comprising SEQ ID NO: 62.
- the fusion protein provided herein comprises a nuclease deficient protein construct, wherein the nuclease deficient protein construct comprises a sequence that is at least 85% identical to SEQ ID NO: 61 or SEQ ID NO: 62. In some embodiments, the nuclease deficient protein construct comprises a sequence that is at least 90% identical to SEQ ID NO: 61 or SEQ ID NO: 62. In some embodiments, the nuclease deficient protein construct comprises a sequence that is at least 95% identical to SEQ ID NO: 61 or SEQ ID NO: 62. In some embodiments, the nuclease deficient protein construct comprises a sequence comprising SEQ ID NO: 61.
- the nuclease deficient protein construct comprises a sequence comprising SEQ ID NO: 62.
- a fusion protein provided herein comprises a protein construct that modulates transcription.
- a fusion protein provided herein comprises a transcriptional repressor protein construct.
- a fusion protein provided herein comprises a zinc finger protein construct.
- the zinc finger protein construct comprises a Krüppel-associated box (KRAB) protein or a functional fragment thereof.
- the fusion protein comprises a KRAB domain that binds to a transcriptional corepressor protein.
- a fusion protein provided herein comprises a SUMO protein construct.
- a fusion protein provided herein comprises a transcriptional activator protein construct.
- a transcriptional activator protein construct recruits transcription factors from a host cell to the target nucleic acid for regulation of target nucleic acid expression.
- Exemplary transcriptional activators include VP64, VP16, VP160, VP48, VP96, p65, Rta, VPR, hsf1, and p300.
- a fusion protein provided herein comprises one or more VP16 protein construct.
- a fusion protein provided herein comprises one or more VP64 protein construct.
- a fusion protein provided herein comprises one or more VPR protein construct.
- a fusion protein provided herein comprises one or more SunTag protein construct.
- a fusion protein provided herein comprises VP64, p65, and HSF1 (SunTag-p65-heat-shock factor 1 or SPH).
- a fusion protein provided herein comprises a CREB-binding protein (CPB).
- CBP can be used to recruit transcriptional machinery and function as a histone acetyltransferase (HAT) that alters chromatin structure.
- a nuclease or a fusion protein provided herein further comprises an aptamer.
- the aptamers are bind to one or more MS2 proteins.
- a fusion protein provided herein comprises a self-cleaving protein sequence.
- self-cleaving protein sequences include: E2A, P2A and T2A.
- the fusion protein comprises an antibiotic resistance protein construct or an antibiotic resistance selectable marker.
- Non-limiting examples of antibiotic resistance proteins and selectable markers include: aminoglycoside acetyltransferase, rifampin ADP-ribosyltransferase, dihydrofolate reductase, multidrug and toxic compound extrusion transporters, antibiotic resistance ATP-binding cassette family F (ARE ABC-F) proteins (e.g., MsrE, Erm, Vga, Lsa, Sal, OptrA), ⁇ -lactamase, blasticidin-S deaminase, penicillin-binding proteins (PBPs), and puromycin-N-acetyltransferase.
- a fusion protein provided herein comprises a base editor.
- the base editor is selected from the group consisting of: an adenosine nucleobase editor, a cytidine deaminase, an adenine base editor.
- the cytidine deaminase is activation-induced deaminase (AID), an APOBEC deaminase, APOBEC3G, APOBEC1, cytidine deaminase 1 (CDA1), a functional fragment, or a derivative thereof.
- the adenosine base editor is ecTadA, saTadA, a functional fragment, or a derivative thereof.
- fusion proteins provided herein further comprise a uracil-DNA glycosylase, a functional fragment, or a derivative thereof.
- fusion proteins comprise: (a) a first protein construct comprising: (a) a first protein construct comprising: a first nuclease consisting of SEQ ID NO: 1, 2, 10, 11, 77-78; and (b) a second protein construct comprising: a second nuclease, an acetylase, an acetyltransferase, an ATPase, an Argonaute protein, a base editor, a Cas polypeptide, a catalytically dead Cas polypeptide, a deacetylase, a deaminase, a decapping protein, an endonuclease, an exonuclease, a helicase, a ligase, a meganuclease, a methyla
- Fusion proteins provided herein can be synthesized using known technologies, for instance, recombination DNA technology where the coding sequences of various portions of the fusion proteins can be linked together at the nucleic acid level. Subsequently a fusion protein can be produced using a host cell. 5. Methods of Determining Gene Editing Efficiency [00131] Provided herein are methods of determining gene editing efficiency and selectivity of a nuclease or fusion protein provided herein for a target nucleic acid sequence.
- the target nucleic acid is cleaved by a nuclease provided herein upon binding of the guide sequence to the target nucleic acid sequence.
- the target nucleic acid is a DNA.
- the nuclease provided herein cleaves the DNA resulting in a double-stranded break (DSB). In some embodiments, the nuclease provided herein cleaves the target DNA via staggered DNA double-stranded break. In some embodiments, the staggered DNA double-stranded break generates a 5' overhang. The 5' overhang can facilitate gene insertion via non-homologous end joining (NHEJ) mechanisms. In some embodiments, the 5’ overhang is at least 3 base pairs (bps) or more, at least 4 bps or more, at least 5 bps or more, or at least 6 bps or more. In some embodiments, the 5’ overhang is about 4 bps to 5 bps in length.
- the ability of a nuclease to recognize a PAM sequence can be determined by an in vitro selection assay.
- the activity of a nucleic acid editing system may be assayed using a cell expressing a reporter protein or containing a reporter gene.
- a reporter protein may be engineered to contain an obstruction, such as a stop codon, a frameshift mutation, a spacer, a linker, or a transcriptional terminator; the nucleic acid editing system may then be used to remove the obstruction and the resultant functional reporter protein may be detected.
- the obstruction may be designed such that a specific sequence modification is required to restore functionality of the reporter protein.
- the obstruction may be designed such that any insertion or deletion which results in a frame shift of one or two bases may be sufficient to restore functionality of the reporter protein.
- reporter proteins include colorimetric enzymes, metabolic enzymes, fluorescent proteins, enzymes and transporters associated with antibiotic resistance, and luminescent enzymes.
- reporter proteins include ⁇ -galactosidase, Chloramphenicol acetyltransferase, Green fluorescent protein, Red fluorescent protein, luciferase, and renilla.
- Different detection methods may be used for different reporter proteins.
- the reporter protein may affect cell viability, cell growth, fluorescence, luminescence, or expression of a detectable product.
- the reporter protein may be detected using a colorimetric assay.
- the reporter protein may be a fluorescent protein, and DNA editing may be assayed by measuring the degree of fluorescence in treated cells, or the number of treated cells with at least a threshold level of fluorescence.
- transcript levels of a reporter gene may be assessed.
- a reporter gene may be assessed by sequencing.
- an assay for measuring DNA editing may use a split fluorescence protein system, such as the self-complementing split GFP1-10/11 systems, in which two fragments (G 1-10 and G 11 ) of the GFP protein which can associate by themselves to form a functional GFP signal are linked using a frameshifting linker.
- Insertions or deletions within the frameshifting linker can restore the frame of the G11 fragment allowing the two fragments to form a functional GFP signal.
- Integration of an exogenous nucleic acid can be measured using any technique, e.g., integration can be measured by flow cytometry, surveyor nuclease assay, tracking of indels by decomposition (TIDE), junction PCR, or any combination thereof.
- TIDE indels by decomposition
- junction PCR junction PCR
- transgene integration can be measured by PCR.
- a TIDE analysis can also be performed on engineered cells.
- Ex vivo cell transfection can also be used for diagnostics, research, or for gene therapy (e.g., via re-infusion of the transfected cells into the host organism).
- cells are isolated from the subject organism, transfected with a nucleic acid (e.g., gene or cDNA), and re-infused back into the subject organism (e.g., subject).
- a nucleic acid e.g., gene or cDNA
- the amount of nuclease-containing modified cells that can be necessary to be therapeutically effective in a subject can vary depending on the viability of the cells, and the efficiency with which the cells have been genetically modified (e.g., the efficiency with which a transgene has been integrated into one or more cells).
- the product (e.g., multiplication) of the viability of cells post genetic modification and the efficiency of integration of a transgene can correspond to the therapeutic aliquot of cells available for administration to a subject.
- an increase in the viability of cells post genetic modification can correspond to a decrease in the amount of cells that are necessary for administration to be therapeutically effective in a subject.
- an increase in the efficiency with which a transgene has been integrated into one or more cells can correspond to a decrease in the amount of cells that are necessary for administration to be therapeutically effective in a subject.
- determining an amount of cells that are necessary to be therapeutically effective can comprise determining a function corresponding to a change in the viability of cells over time.
- determining an amount of cells that are necessary to be therapeutically effective can comprise determining a function corresponding to a change in the efficiency with which a transgene can be integrated into one or more cells with respect to time dependent variables (e.g., cell culture time, electroporation time, cell stimulation time).
- time dependent variables e.g., cell culture time, electroporation time, cell stimulation time.
- a percent of NHEJ, HDR, or a combination of both can be determined by co-delivering the gene editing molecules, for example a guide nuclease acid and a nuclease or fusion protein provided herein, with a donor nucleic acid template that encodes a promoter-less tag or marker (e.g., GFP) into cells.
- a duration of time e.g., 72-96 hours
- flow cytometry can be performed to quantify the total cell number (NTotal), tag-positive cell number and tag/GFP-negative cell number.
- next-generation sequencing can be performed to identify cells without mutations and with mutations. HDR efficiency and NHEJ efficiency can be calculated from the assay.
- Additional assays for determining gene editing efficiency of a nucleic acid editing system for cleaving a nucleic acid include but are not limited to: RT-PCR, nucleic acid sequencing, T7 endonuclease 1 (T7E1) mismatch detection assays, tracking of indels by decomposition (TIDE) assays, and indel detection by amplicon analysis (IDAA) assays.
- T7E1 T7 endonuclease 1
- TIDE tracking of indels by decomposition
- IDAA indel detection by amplicon analysis
- the indel pattern that is induced at the target site of a programmable nuclease may also be determined by PCR-amplifying the respective region and subsequent next generation sequencing.
- polynucleotides encoding a nuclease provided herein comprises a promoter sequence.
- Useful promoters for nuclease encoding sequences include, e.g., CMV, EF- 1a, MSCV, PGK, and CAG control promoters.
- compositions comprising: a nucleic acid encoding for a nuclease provided herein; and a delivery vehicle. Further provided herein are compositions, wherein the compositions comprise: one or more nucleic acids encoding for a nucleic acid editing system provided herein and a delivery vehicle. Further provided herein are compositions, wherein the compositions comprise: one or more nucleic acids encoding for a fusion protein provided herein and a delivery vehicle.
- compositions comprising: a first nucleic acid encoding for a nuclease provided herein; a second nucleic acid encoding for a guide nucleic acid provided herein; and a delivery vehicle.
- compositions wherein the compositions comprise: a first nucleic acid encoding for a fusion protein provided herein; a second nucleic acid encoding for a guide nucleic acid provided herein; and a delivery vehicle.
- compositions comprising: a first nucleic acid encoding for a first protein construct provided herein; a second nucleic acid encoding for a second protein construct; a third nucleic acid encoding for a guide nucleic acid provided herein; and a delivery vehicle.
- compositions comprisin the compositions comprise: (a) a delivery vehicle; and (b) a nucleic acid encoding for an amino acid sequence that is at least 85% identical to any one of SEQ ID NOS: 1, 2, 10, 11, 77-78, or 182, a functional fragment, or a derivative thereof.
- the composition further comprises an exogenous nucleic acid for delivery to a cell.
- the exogenous nucleic acid is incorporated into a target nucleic acid sequence.
- the exogenous nucleic acid encodes for a protein construct provided in Table 5.
- the nucleases, fusion proteins, guide nucleic acids, and nucleic acid editing systems provided herein can be delivered to a target cell by any suitable means.
- the nucleases and guide nucleic acids provided herein can be admixed with a delivery vehicle that permits delivery of the system to the target nucleic acid sequence.
- Non-limiting examples of delivery vehicles include an emulsion, a suspension, a liposome, a micelle, an exosome, an endosome, a virus, a vector, a particle, a nanoparticle, a polymer, microcapsules, recombinant cells, cell culture medium, blood, or serum. Specific types of delivery vehicles that can be used in a composition provided herein are further described below. [00139] In some embodiments, the delivery vehicle is a liposome. Liposomes are formed from phospholipids that are dispersed in an aqueous medium and spontaneously form multilamellar concentric bilayer vesicles (also termed multilamellar vesicles (MLVs)).
- MLVs multilamellar vesicles
- MLVs generally have diameters of from 25 nm to 4 ⁇ m. Sonication of MLVs results in the formation of small unilamellar vesicles (SUVs) with diameters in the range of 200 to 500 angstroms containing an aqueous solution in the core.
- SUVs small unilamellar vesicles
- Liposomes interact with cells via different mechanisms: endocytosis by phagocytic cells of the reticuloendothelial system such as macrophages and neutrophils; adsorption to the cell surface, either by nonspecific weak hydrophobic or electrostatic forces, or by specific interactions with cell-surface components; fusion with the plasma cell membrane by insertion of the lipid bilayer of the liposome into the plasma membrane, with simultaneous release of liposomal contents into the cytoplasm; and by transfer of liposomal lipids to cellular or subcellular membranes, or vice versa, without any association of the liposome contents. Varying the liposome formulation can alter which mechanism is operative, although more than one can operate at the same time.
- Nanocapsules can generally entrap compounds in a stable and reproducible way. To avoid side effects due to intracellular polymeric overloading, such ultrafine particles (sized around 0.1 ⁇ m) should be designed using polymers able to be degraded in vivo. Biodegradable polyalkyl-cyanoacrylate nanoparticles can also be used as a delivery vehicle.
- the delivery vehicle is a phospholipid. Phospholipids can form a variety of structures other than liposomes when dispersed in water, depending on the molar ratio of lipid to water. At low ratios, the liposomes form. Physical characteristics of liposomes depend on pH, ionic strength and the presence of divalent cations.
- Liposomes can show low permeability to ionic and polar substances, but at elevated temperatures undergo a phase transition which markedly alters their permeability.
- the phase transition involves a change from a closely packed, ordered structure, known as the gel state, to a loosely packed, less- ordered structure, known as the fluid state. This occurs at a characteristic phase-transition temperature and results in an increase in permeability to ions, sugars and drugs.
- the delivery vehicle is a nanoparticle.
- Nanoparticle carriers that specifically target a tissue provided herein may also be used as a pharmaceutically acceptable carrier.
- the nanoparticle is a gold nanoparticle, a platinum nanoparticle, an iron-oxide nanoparticle, a lipid nanoparticle, a selenium nanoparticle, a tumor- targeting glycol chitosan nanoparticle (CNP), a cathepsin B sensitive nanoparticle, a hyaluronic acid nanoparticle, a paramagnetic nanoparticle, or a polymeric nanoparticle.
- CNP tumor- targeting glycol chitosan nanoparticle
- the nucleic acid editing systems provided herein can be delivered using vectors, for example containing sequences encoding one or more of the nucleases or fusion proteins provided herein.
- a nucleic acid editing system as described herein can be delivered absent a viral vector.
- Transgenes encoding polynucleotides can be similarly delivered. Any vector systems can be used including, but not limited to, plasmid vectors, viral vectors, and oncolytic viral vectors. Furthermore, any of these vectors can comprise one or more transcription factor, nuclease, and/or transgene. [00143] In some embodiments, vectors provided herein are a viral vector.
- Exemplary viral vectors include, but are not limited to, lentiviral vectors, retroviral vectors, adeno- associated viral vectors (AAV), adenoviral vectors, herpes simplex viral vectors, alphaviral vectors, flaviviral vectors, rhabdoviral vectors, measles viral vectors, Newcastle disease viral vectors, poxviral vectors, and picornaviral vectors.
- AAV adeno- associated viral vectors
- polynucleotides encoding an engineered protein provided herein are contained in an AAV viral vector.
- vectors provided herein are an oncolytic viral vector.
- polynucleotides encoding for a nuclease or a fusion protein provided herein are contained in an lentiviral viral vector.
- lentiviral vectors wherein the lentiviral vectors comprise a polynucleotide encoding for a nucleic acid editing system provided herein.
- Conventional viral and non-viral based gene transfer methods can be used to introduce nucleic acids encoding for a nuclease provided herein, a nucleic acid editing system, and/or transgenes in cells (e.g., mammalian cells) and target tissues.
- Non-viral vector delivery systems can include DNA plasmids, naked nucleic acid, and nucleic acid complexed with a delivery vehicle such as a liposome or poloxamer.
- Viral vector delivery systems can include DNA and RNA viruses, which have either episomal or integrated genomes after delivery to the cell.
- Methods of non-viral delivery of nucleic acids include electroporation, lipofection, nucleofection, gold nanoparticle delivery, microinjection, biolistics, virosomes, liposomes, immunoliposomes, polycation or lipid: nucleic acid conjugates, naked DNA, mRNA, artificial virions, and agent-enhanced uptake of DNA. Sonoporation using, e.g., the Sonitron 2000 system (Rich-Mar) can also be used for delivery of nucleic acids. Additional exemplary nucleic acid delivery systems include those provided by AMAXA ® Biosystems (Cologne, Germany), Life Technologies (Frederick, Md.), MAXCYTE, Inc.
- Lipofection reagents are sold commercially (e.g., TRANSFECTAM ® and LIPOFECTIN ® ). Delivery can be to cells (ex vivo administration) or target tissues (in vivo administration). Additional methods of delivery include the use of packaging the nucleic acids to be delivered into EnGeneIC delivery vehicles (EDVs). These EDVs are specifically delivered to target tissues using bispecific antibodies where one arm of the antibody has specificity for the target tissue and the other has specificity for the EDV.
- EDVs EnGeneIC delivery vehicles
- Vectors including viral and non-viral vectors containing nucleic acids encoding a nucleic editing system provided herein can also be administered directly to an organism for transduction of cells in vivo. Alternatively, naked DNA or mRNA can be administered. Administration is by any of the routes normally used for introducing a molecule into ultimate contact with blood or tissue cells including, but not limited to, injection, infusion, topical application and electroporation. More than one route can be used to administer a particular composition.
- a vector encoding for an exogenous transgene can be shuttled to a cellular nucleus.
- a vector can contain a nuclear localization sequence (NLS).
- NLS nuclear localization sequence
- An NLS can be from Simian Vacuolating Virus 40.
- a vector can also be shuttled by a protein or protein complex.
- a nuclease provided herein can be used as a means to shuttle a minicircle vector.
- the exogenous transgene comprises one or more NLS.
- a vector can be pre-complexed with a nuclease provided herein prior to electroporation into a cell.
- a nuclease that can be used for shuttling can be a nuclease-deficient or catalytically dead protein.
- a nuclease that can be used for shuttling can be a nuclease-competent protein.
- a nuclease provided herein can be pre-mixed with a guide nucleic acid and a vector or plasmid encoding an exogenous transgene.
- a cell can be transfected with a mutant or chimeric adeno-associated viral vector encoding an exogenous transgene and a nucleic acid editing system comprising a nuclease provided herein.
- An AAV vector concentration can be from 0.5 nanograms to 50 micrograms.
- nucleic acid e.g., ssDNA, dsDNA, and/or RNA
- the amount of nucleic acid (e.g., ssDNA, dsDNA, and/or RNA) that can be introduced into the cell by electroporation can be varied to optimize transfection efficiency and/or cell viability. In some embodiments, less than about 100 picograms of nucleic acid can be added to each cell sample (e.g., one or more cells being electroporated).
- dsDNA 1 microgram of dsDNA can be added to each cell sample for electroporation.
- the amount of nucleic acid (e.g., dsDNA) required for optimal transfection efficiency and/or cell viability can be specific to the cell type.
- the amount of nucleic acid (e.g., dsDNA) used for each sample can directly correspond to the transfection efficiency and/or cell viability.
- the transfection efficiency of cells with any of the nucleic acid delivery platforms described herein, for example, nucleofection or electroporation can be or can be about 20%, 25%, 30%, 35%, 40%, 45%, 50%, 55%, 60%, 65%, 70%, 75%, 80%, 85%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99%, 99.5%, 99.9%, or more than 99.9%.
- Viral particles such as AAV, can be used to deliver a viral vector comprising a gene of interest or a transgene into a cell ex vivo or in vivo.
- a mutated or chimeric adeno-associated viral vector as disclosed herein can be measured as pfu (plaque forming units).
- the pfu of recombinant virus or mutated or chimeric adeno-associated viral vector of the compositions and methods of the disclosure can be about 10 8 to about 5 ⁇ 10 10 pfu.
- recombinant viruses of this disclosure are at least about 1 ⁇ 10 8 , 2 ⁇ 10 8 , 3 ⁇ 10 8 , 4 ⁇ 10 8 , 5 ⁇ 10 8 , 6 ⁇ 10 8 , 7 ⁇ 10 8 , 8 ⁇ 10 8 , 9 ⁇ 10 8 , 1 ⁇ 10 9 , 2 ⁇ 10 9 , 3 ⁇ 10 9 , 4 ⁇ 10 9 , 5 ⁇ 10 9 , 6 ⁇ 10 9 , 7 ⁇ 10 9 , 8 ⁇ 10 9 , 9 ⁇ 10 9 , 1 ⁇ 10 10 , 2 ⁇ 10 10 , 3 ⁇ 10 10 , 4 ⁇ 10 10 , and 5 ⁇ 10 10 pfu.
- recombinant viruses of this disclosure are at most about 1 ⁇ 10 8 , 2 ⁇ 10 8 , 3 ⁇ 10 8 , 4 ⁇ 10 8 , 5 ⁇ 10 8 , 6 ⁇ 10 8 , 7 ⁇ 10 8 , 8 ⁇ 10 8 , 9 ⁇ 10 8 , 1 ⁇ 10 9 , 2 ⁇ 10 9 , 3 ⁇ 10 9 , 4 ⁇ 10 9 , 5 ⁇ 10 9 , 6 ⁇ 10 9 , 7 ⁇ 10 9 , 8 ⁇ 10 9 , 9 ⁇ 10 9 , 1 ⁇ 10 10 , 2 ⁇ 10 10 , 3 ⁇ 10 10 , 4 ⁇ 10 10 , and 5 ⁇ 10 10 pfu.
- a mutated or chimeric adeno-associated viral vector of the disclosure can be measured as vector genomes.
- recombinant viruses of this disclosure are 1 ⁇ 10 10 to 3 ⁇ 10 12 vector genomes, or 1 ⁇ 10 9 to 3 ⁇ 10 13 vector genomes, or 1 ⁇ 10 8 to 3 ⁇ 10 14 vector genomes, or at least about 1 ⁇ 10 1 , 1 ⁇ 10 2 , 1 ⁇ 10 3 , 1 ⁇ 10 4 , 1 ⁇ 10 5 , 1 ⁇ 10 6 , 1 ⁇ 10 7 , 1 ⁇ 10 8 , 1 ⁇ 10 9 , 1 ⁇ 10 10 , 1 ⁇ 10 11 , 1 ⁇ 10 12 , 1 ⁇ 10 13 , 1 ⁇ 10 14 , 1 ⁇ 10 15 , 1 ⁇ 10 16 , 1 ⁇ 10 17 , and 1 ⁇ 10 18 vector genomes, or are 1 ⁇ 10 8 to 3 ⁇ 10 14 vector genomes, or are at most about 1 ⁇ 10 1 , 1 ⁇ 10 2 , 1 ⁇ 10 3 , 1 ⁇ 10 4 , 1 ⁇ 10 5 , 1 ⁇ 10 6 , 1 ⁇ 10 7 , 1 ⁇ 10 8 , 1 ⁇ 10 9 , 1 ⁇ 10 10 , 1 ⁇ 10 11
- a mutated or chimeric adeno-associated viral vector of the disclosure can be measured using multiplicity of infection (MOI).
- MOI can refer to the ratio, or multiple of vector or viral genomes to the cells to which the nucleic can be delivered.
- the MOI can be 1 ⁇ 10 6 GC/mL.
- the MOI can be 1 ⁇ 10 5 GC/mL to 1 ⁇ 10 7 GC/mL.
- the MOI can be 1 ⁇ 10 4 GC/mL to 1 ⁇ 10 8 GC/mL.
- recombinant viruses of the disclosure are at least about 1 ⁇ 10 1 GC/mL, 1 ⁇ 10 2 GC/mL, 1 ⁇ 10 3 GC/mL, 1 ⁇ 10 4 GC/mL, 1 ⁇ 10 5 GC/mL, 1 ⁇ 10 6 GC/mL, 1 ⁇ 10 7 GC/mL, 1 ⁇ 10 8 GC/mL, 1 ⁇ 10 9 GC/mL, 1 ⁇ 10 10 GC/mL, 1 ⁇ 10 11 GC/mL, 1 ⁇ 10 12 GC/mL, 1 ⁇ 10 13 GC/mL, 1 ⁇ 10 14 GC/mL, 1 ⁇ 10 15 GC/mL, 1 ⁇ 10 16 GC/mL, 1 ⁇ 10 17 GC/mL, and 1 ⁇ 10 18 GC/mL MOI.
- a mutated or chimeric adeno-associated viruses of this disclosure are from about 1 ⁇ 10 8 GC/mL to about 3 ⁇ 10 14 GC/mL MOI, or are at most about 1 ⁇ 10 1 GC/mL, 1 ⁇ 10 2 GC/mL, 1 ⁇ 10 3 GC/mL, 1 ⁇ 10 4 GC/mL, 1 ⁇ 10 5 GC/mL, 1 ⁇ 10 6 GC/mL, 1 ⁇ 10 7 GC/mL, 1 ⁇ 10 8 GC/mL, 1 ⁇ 10 9 GC/mL, 1 ⁇ 10 10 GC/mL, 1 ⁇ 10 11 GC/mL, 1 ⁇ 10 12 GC/mL, 1 ⁇ 10 13 GC/mL, 1 ⁇ 10 14 GC/mL, 1 ⁇ 10 15 GC/mL, 1 ⁇ 10 16 GC/mL, 1 ⁇ 10 17 GC/mL, and 1 ⁇ 10 18 GC/mL MOI.
- a non-viral vector or nucleic acid can be delivered without the use of a mutated or chimeric adeno-associated viral vector and can be measured according to the quantity of nucleic acid.
- any suitable amount of nucleic acid can be used with the compositions and methods of this disclosure.
- nucleic acid can be at least about 1 pg, 10 pg, 100 pg, 1 pg, 10 pg, 100 pg, 200 pg, 300 pg, 400 pg, 500 pg, 600 pg, 700 pg, 800 pg, 900 pg, 1 ⁇ g, 10 ⁇ g, 100 ⁇ g, 200 ⁇ g, 300 ⁇ g, 400 ⁇ g, 500 ⁇ g, 600 ⁇ g, 700 ⁇ g, 800 ⁇ g, 900 ⁇ g, 1 ng, 10 ng, 100 ng, 200 ng, 300 ng, 400 ng, 500 ng, 600 ng, 700 ng, 800 ng, 900 ng, 1 mg, 10 mg, 100 mg, 200 mg, 300 mg, 400 mg, 500 mg, 600 mg, 700 mg, 800 mg, 900 mg, 1 g, 2 g, 3 g, 4 g, or 5 g.
- nucleic acid can be at most about 1 pg, 10 pg, 100 pg, 1 pg, 10 pg, 100 pg, 200 pg, 300 pg, 400 pg, 500 pg, 600 pg, 700 pg, 800 pg, 900 pg, 1 ⁇ g, 10 ⁇ g, 100 ⁇ g, 200 ⁇ g, 300 ⁇ g, 400 ⁇ g, 500 ⁇ g, 600 ⁇ g, 700 ⁇ g, 800 ⁇ g, 900 ⁇ g, 1 ng, 10 ng, 100 ng, 200 ng, 300 ng, 400 ng, 500 ng, 600 ng, 700 ng, 800 ng, 900 ng, 1 mg, 10 mg, 100 mg, 200 mg, 300 mg, 400 mg, 500 mg, 600 mg, 700 mg, 800 mg, 900 mg, 1 g, 2 g, 3 g, 4 g, or 5 g.
- Proteins, vectors, plasmids, and the nucleic acid editing systems provided herein can be delivered by any suitable method, including transfection, electroporation, liposome delivery, membrane fusion techniques, high velocity DNA-coated pellets, viral infection and protoplast fusion.
- the methods used to construct any embodiment of the compositions provided herein are known to those with skill in nucleic acid manipulation and include genetic engineering, recombinant engineering, and synthetic techniques.
- a nuclease, nucleic acid editing system, fusion protein, or a polynucleotide encoding a nuclease, nucleic acid editing system, fusion protein can be delivered to a cell by electroporation.
- Electroporation using, for example, the NEON® Transfection System (ThermoFisher Scientific) or the AMAXA® Nucleofector (AMAXA® Biosystems) can also be used for delivery of nucleic acids and proteins into a cell.
- a nuclease or fusion protein provided herein can be purified and complexed with a suitable guide nucleic acid for delivery into a cell.
- Electroporation parameters can be adjusted to optimize delivery efficiency and/or cell viability.
- Electroporation devices can have multiple electrical wave form pulse settings such as exponential decay, time constant and square wave. Every cell type has a unique optimal Field Strength (E) that is dependent on the pulse parameters applied (e.g., voltage, capacitance and resistance).
- E Field Strength
- compositions provided herein can be delivered to any suitable cell.
- suitable cells can include but are not limited to eukaryotic and prokaryotic cells and/or cell lines.
- a suitable cell can be a human primary cell.
- a primary cell can be taken directly from living tissue (i.e.
- a primary cell can be acquired from a variety of sources such as an organ, vasculature, buffy coat, whole blood, apheresis, plasma, bone marrow, tumor, cell-bank, cryopreservation bank, or a blood sample.
- a primary cell can be a stem cell.
- a suitable cells that can be edited with a nucleic acid editing system provided herein include but are not limited to: epithelial cells, fibroblast cells, neural cells, keratinocytes, hematopoietic cells, melanocytes, chondrocytes, lymphocytes (B, NK, and T), macrophages, monocytes, mononuclear cells, cardiac muscle cells, other muscle cells, granulosa cells, cumulus cells, epidermal cells, endothelial cells, pancreatic islet cells, blood cells, blood precursor cells, bone cells, bone precursor cells, neuronal stem cells, primordial stem cells, hepatocytes, keratinocytes, umbilical vein endothelial cells, aortic endothelial cells, microvascular endothelial cells, fibroblasts, liver stellate cells, aortic smooth muscle cells, cardiac myocytes, neurons, Kupffer cells, smooth muscle cells, Schwann cells, and epithelial cells, ery
- the one or more cells can be pancreatic islet cells and/or cell clusters or the like, including, but not limited to pancreatic ⁇ cells, pancreatic ⁇ cells, pancreatic ⁇ cells, pancreatic F cells (e.g., PP cells), or pancreatic ⁇ cells.
- the one or more cells can be pancreatic ⁇ cells.
- the one or more cells can be pancreatic ⁇ cells.
- a human primary cell can be an immune cell.
- An immune cell can be a T cell, B cell, NK cell, and/or a macrophage.
- Suitable cells also include stem cells such as, by way of example, embryonic stem cells, induced pluripotent stem cells, hematopoietic stem cells, neuronal stem cells and mesenchymal stem cells.
- Suitable cells can comprise any number of primary cells, such as human cells, non-human cells, and/or mouse cells. Suitable cells can be progenitor cells. Suitable cells can be derived from the subject to be treated (e.g., a subject with a disease, a subject in need of treatment, or a subject that is immunocompromised). Suitable cells can be derived from a human donor. [00158] A method of attaining suitable cells, such as human primary cells, can comprise selecting cells.
- a cell can comprise a marker that can be selected for the cell.
- a marker can comprise GFP, a resistance gene (for example, a gene conferring antibiotic resistance), a cell surface marker, an endogenous tag.
- Cells can be selected using any endogenous marker.
- Suitable cells can be selected using any technology. Such technology can comprise flow cytometry and/or magnetic columns.
- the selected cells can then be infused into a subject.
- the selected cells can also be expanded to large numbers.
- the selected cells can be expanded prior to infusion.
- Delivery vehicles for in vivo and ex vivo use can include pharmaceutically acceptable carriers. Pharmaceutically acceptable carriers are determined in part by the particular composition being administered, as well as by the particular method used to administer the composition. 7.
- compositions comprising a nuclease, nucleic acid editing system, or a fusion protein provided herein; and a pharmaceutically acceptable diluent, carrier, or excipient.
- a pharmaceutical composition comprising a vector provided herein; and a pharmaceutically acceptable diluent, carrier, or excipient.
- compositions provided herein e.g., nucleic acid editing systems, nucleases, and fusion proteins provided herein
- compositions provided herein are combined with pharmaceutically acceptable salts, excipients, and/or carriers to form a pharmaceutical composition.
- compositions may be chosen based on the route of administration, the location of the target issue, and the time course of delivery of the drug.
- a pharmaceutically acceptable carrier or excipient may include solvents, dispersion media, coatings, antibacterial and antifungal agents, isotonic and absorption delaying agents, etc., compatible with pharmaceutical administration.
- the pharmaceutical composition is in the form of a solid, semi-solid, liquid or gas (aerosol).
- injectable preparations for example, sterile injectable aqueous or oleaginous suspensions may be formulated according to the known art using suitable dispersing or wetting agents and suspending agents.
- the sterile injectable preparation may also be a sterile injectable solution, suspension, or emulsion in a nontoxic parenterally acceptable diluent or solvent.
- acceptable vehicles and solvents that may be employed are water, Ringer's solution, U.S.P., and isotonic sodium chloride solution.
- sterile, fixed oils are conventionally employed as a solvent or suspending medium.
- any bland fixed oil can be employed including synthetic mono- or diglycerides.
- fatty acids such as oleic acid are used in the preparation of injectables.
- compositions provided herein may be formulated in dosage unit form for ease of administration and uniformity of dosage.
- a dosage unit form is a physically discrete unit of a composition provided herein appropriate for a subject to be treated.
- the therapeutically effective dose can be estimated initially either in cell culture assays or in animal models, such as mice, rabbits, dogs, pigs, or non-human primates. The animal model is also used to achieve a desirable concentration range and route of administration.
- compositions provided herein can be determined by standard pharmaceutical procedures in cell cultures or experimental animals, e.g., ED50 (the dose is therapeutically effective in 50% of the population) and LD 50 (the dose is lethal to 50% of the population).
- ED50 the dose is therapeutically effective in 50% of the population
- LD 50 the dose is lethal to 50% of the population.
- the dose ratio of toxic to therapeutic effects is the therapeutic index, and it can be expressed as the ratio, LD50/ED50.
- Pharmaceutical compositions which exhibit large therapeutic indices may be useful in some embodiments.
- the data obtained from cell culture assays and animal studies may be used in formulating a range of dosage for human use.
- compositions and pharmaceutical compositions for administering to a subject in need thereof are in a form that allows for compositions provided herein to be administered to a subject.
- the pharmaceutical composition is formulated for intratumoral delivery.
- administration of a pharmaceutical composition provided herein is local administration or systemic administration.
- a pharmaceutical composition provided herein is formulated for administration / for use in administration via an intratumoral, subcutaneous, intradermal, intramuscular, inhalation, intravenous, intraperitoneal, or intracranial route.
- the administering is every 1, 2, 4, 6, 8, 12, 24, 36, or 48 hours. In some embodiments, the administering is daily, weekly, or monthly. In some embodiments, the administering is repeated at least about every 28 days or 56 days.
- Vectors can be delivered in vivo by administration to an individual subject, typically by systemic administration (e.g., intravenous, intraperitoneal, intramuscular, subdermal, or intracranial infusion) or topical application, as described below.
- vectors can be delivered to cells ex vivo, such as cells explanted from an individual subject (e.g., lymphocytes, T cells, bone marrow aspirates, or tissue biopsy), followed by reimplantation of the cells into a subject, usually after selection for cells which have incorporated the vector. Prior to or after selection, the cells can be expanded.
- cells expressing a nuclease, fusion protein, or a nucleic acid editing system provided herein.
- the cell is contacted in vitro or ex vivo with a nucleic acid encoding for a nuclease, fusion protein, or a nucleic acid editing system provided herein.
- the cell is contacted in vitro or ex vivo with a vector encoding for an engineered protein provided here nuclease, fusion protein, or a nucleic acid editing system provided herein.
- the cell is an immune cell.
- the cell is a lymphocyte, a leukocyte, a T cell, a natural killer cell, a macrophage, a neutrophil, an eosinophil, a basophil, a dendritic cell, a stem cell, a cancer cell, a stem cell, a heart cell, a pancreatic cell, a neuron, an induced pluripotent stem-cell derived cell (iPSC), or an in-vitro differentiated iPSC.
- the cell is a genetically modified cell.
- the genetically modified cell is a chimeric antigen receptor (CAR) T cell, an engineered T cell receptor (TCR) T cell, or a tumor-infiltrating lymphocyte (TIL).
- CAR chimeric antigen receptor
- TCR engineered T cell receptor
- TIL tumor-infiltrating lymphocyte
- Cells provided herein can be administered to a subject in need thereof, e.g., a subject with a disease or a condition. Therapeutic applications are discussed further below. 8. Gene Editing Applications & Diagnostics [00169] Provided herein are methods of modifying a target nucleic acid. In some embodiments, the methods comprise, contacting a target nucleic acid molecule with a nuclease, fusion protein, nucleic acid editing system or composition provided herein.
- the nuclease comprises an amino acid sequence that is at least 85% identical to an amino acid sequence of one of: SEQ ID NOS: 1, 2, 10, 11, 64-78, or 182.
- the nucleic acid editing system provided herein comprises a guide nucleic acid comprising a sequence that binds to the target nucleic acid, wherein upon binding of said guide nucleic acid to the target nucleic acid molecule, the nuclease, the guide nucleic acid, and the target nucleic acid molecule form a complex.
- the nuclease cleaves a target nucleic acid molecule generating a cleavage site within the target nucleic acid molecule, thereby modifying the target nucleic acid molecule.
- the percentage of target nucleic acid molecule modification is at least 5%, at least 10%, at least 15%, at least 20%, at least 25%, at least 30%, at least 35%, at least 40%, at least 45%, at least 50%, at least 55%, at least 60%, at least 65%, at least 70%, at least 75%, at least 80%, at least 85%, at least 90%, at least 95%, at least 98%, or at least 99%.
- the method of modifying the target nucleic acid further comprises mutation of a target nucleic acid sequence.
- the mutation comprises an insertion or deletion at said one or more nucleic acid sequences.
- the mutation comprises an indel.
- the methods further comprise delivering the composition to a cell.
- the methods further comprise delivering to the cell at least one exogenous nucleic acid molecule for insertion at the cleavage site.
- the at least one exogenous nucleic acid molecule is delivered with a recombination polypeptide.
- the recombination polypeptide is an integrase, a flippase, a transponase, or a recombinase.
- the exogenous nucleic acid molecule comprises at least one sequence having at least 75% homology to the target nucleic acid molecule.
- the method is in vitro or in vivo. [00171] Provided herein are methods of ex vivo modifying a cell.
- the methods comprise: contacting a cell with a nuclease, nucleic acid editing system, fusion protein, or a composition provided herein under conditions that permit nuclease cleavage of a target nucleic acid molecule, thereby modifying said cell.
- RNA complex comprises: (i) a nuclease provided herein; and (ii) a guide RNA that binds to a target nucleic acid, wherein upon contacting the cell with the RNP complex, the nuclease cleaves a target nucleic acid molecule, thereby modifying said cell.
- the cell is an immune cell or a stem cell.
- the immune cell is a leukocyte, a lymphocyte, a natural killer cell, a dendritic cell, a macrophage, a myeloid cell, a T-cell, a B cell, a stem cell, an induced- pluripotent derived cell, a cancer cell, or an endothelial cell.
- the stem cell is an embryonic stem cell, an induced-pluripotent stem cell (iPSC), or an adult stem cell.
- test sample comprises a target nucleic acid capable of binding to the guide nucleic acid.
- test sample comprises a target nucleic acid capable of binding to the guide nucleic acid.
- a complex is formed between the guide nucleic acid, a nuclease provided herein, and the target nucleic acid.
- the nuclease upon formation of the complex the nuclease cleaves the target nucleic acid.
- the method further comprises detecting a signal indicating cleavage of the target nucleic acid molecule, thereby detecting the target nucleic acid in the sample.
- the guide nucleic acid comprises a sequence of any one of: SEQ ID NOS: 5-9, 36-39, 53-60, 79-84, 113-152, or a sequence that is complementary to any one of SEQ ID NOS: 5-9, 36-39, 53-60, 79-84, 113-152.
- the nuclease comprises a sequence set forth in any one of SEQ ID NOS: 1, 2, 10, 11, 64-78, or 182.
- the method prior to the detecting step, the method further comprises, amplifying the target nucleic acid prior to the detecting step.
- the amplifying comprises polymerase chain reaction (PCR), nucleic acid sequence-based amplification (NASBA), recombinase polymerase amplification (RPA), loop-mediated isothermal amplification (LAMP), strand displacement amplification (SDA), helicase-dependent amplification (HDA), nicking enzyme amplification reaction (NEAR), multiple displacement amplification (MDA), rolling circle amplification (RCA), improved multiple displacement amplification (EVIDA), l simple method amplifying RNA targets (SMART), single primer isothermal amplification (SPIA), ligase chain reaction (LCR), transcription mediated amplification (TMA), ramification amplification method (RAM), or any combination thereof.
- PCR polymerase chain reaction
- NASBA nucleic acid sequence-based amplification
- RPA recombinase polymerase amplification
- LAMP loop-mediated isothermal amplification
- SDA strand displacement amplification
- HDA helicase-dependent
- the methods further comprise performing an endonuclease mismatch detection assay, an immunoassay, gel electrophoresis, a plasmid interference assay, nucleic acid sequencing, or any combination thereof.
- the detecting comprises calorimetric detection, potentiometric detection, amperometric detection, optical detection, piezo-electric detection, or any combination thereof.
- the solid support comprises a reaction chip, a paper, a quartz microfiber, mixed esters of cellulose, a porous aluminum oxide, a patterned surface, a tube, a well, or a matrix. n some embodiments, the solid support comprises a patterned surface suitable for immobilization of molecules in an ordered pattern.
- a patterned surface refers to an arrangement of different regions in or on an exposed layer of a solid support.
- the solid support comprises an array of wells or depressions in a surface. The composition and geometry of the solid support can vary with its use.
- the solid support is a planar structure such as a slide, chip, microchip and/or array. As such, the surface of the substrate can be in the form of a planar layer.
- the solid support comprises one or more surfaces of a flowcell.
- a flowcell is a type of chamber comprising a solid surface across which one or more fluid reagents can be flowed.
- the solid support or its surface is non-planar, such as the inner or outer surface of a tube or vessel.
- the solid support comprise microspheres or beads.
- Microspheres, beads, or particles can be made of various material including, but not limited to, plastics, ceramics, glass, and polystyrene.
- the microspheres are magnetic microspheres or beads.
- the beads may be porous. The bead sizes range from nanometers, e.g., about 100 nm, to millimeters, e.g., about 1 mm.
- the test sample comprises a liquid sample.
- the test sample comprises a biological sample or an environmental sample.
- the environmental sample comprises a food sample, a paper surface, a fabric, a metal surface, a wood surface, a plastic surface, a soil sample, a fresh water sample, a waste water sample, a saline water sample, or any combination thereof.
- the environmental sample is suspected of comprising a pathogenic microbial organism. Sequences specific to a pathogen of interest may be identified or selected by comparing the coding sequences from the pathogen of interest to all coding sequences in other organisms by BLAST software.
- Appropriate samples for use in the methods provided herein include any conventional biological sample obtained from an organism or a part thereof, such as a plant, animal, bacteria, and the like.
- the biological sample is obtained from an animal subject, such as a human subject.
- a biological sample is any solid or fluid sample obtained from, excreted by or secreted by any living organism, including, without limitation, single celled organisms, such as bacteria, yeast, protozoans, and amoebas among others, multicellular organisms (such as plants or animals, including samples from a healthy or apparently healthy human subject or a human patient affected by a condition or disease to be diagnosed or investigated, such as an infection with a pathogenic microorganism, such as a pathogenic bacteria or virus).
- a biological sample can be a biological fluid obtained from, for example, blood, plasma, serum, urine, stool, sputum, mucous, lymph fluid, synovial fluid, bile, ascites, pleural effusion, seroma, saliva, cerebrospinal fluid, aqueous or vitreous humor, or any bodily secretion, a transudate, an exudate (for example, fluid obtained from an abscess or any other site of infection or inflammation), or fluid obtained from a joint (for example, a normal joint or a joint affected by disease, such as rheumatoid arthritis, osteoarthritis, gout or septic arthritis), or a swab of skin or mucosal membrane surface.
- a transudate for example, an exudate (for example, fluid obtained from an abscess or any other site of infection or inflammation), or fluid obtained from a joint (for example, a normal joint or a joint affected by disease, such as rheumatoid
- a sample can also be a sample obtained from any organ or tissue (including a biopsy or autopsy specimen, such as a tumor biopsy) or can include a cell (whether a primary cell or cultured cell) or medium conditioned by any cell, tissue or organ.
- Exemplary samples include, without limitation, cells, cell lysates, blood smears, cytocentrifuge preparations, cytology smears, bodily fluids (e.g., blood, plasma, serum, saliva, sputum, urine, bronchoalveolar lavage, semen, etc.), tissue biopsies (e.g., tumor biopsies), fine-needle aspirates, and/or tissue sections (e.g., cryostat tissue sections and/or paraffin-embedded tissue sections).
- the sample includes circulating tumor cells (which can be identified by cell surface markers).
- samples are used directly (e.g., fresh or frozen), or can be manipulated prior to use, for example, by fixation (e.g., using formalin) and/or embedding in wax (such as formalin-fixed paraffin-embedded (FFPE) tissue samples).
- fixation e.g., using formalin
- FFPE formalin-fixed paraffin-embedded
- the biological sample comprises a cell extract, cell medium, blood, plasma, serum, urine, stool, sputum, mucous, lymph fluid, synovial fluid, bile, ascites, pleural effusion, seroma, saliva, cerebrospinal fluid, aqueous or vitreous humor, a bodily secretion, a transudate, an exudate, a skin swab, or a mucosal membrane swab.
- the test sample is from a subject that has, is suspected of having, or at risk of developing a disease or a disorder.
- the nucleic acid editing systems and compositions provided herein have a broad spectrum of applications in, e.g., gene therapy, drug screening, disease diagnosis, and prognosis. Further provided herein are methods of treating a disease, a disorder, or a condition.
- diseases and conditions that can be treated using the compositions and methods provided herein include cancer, autoimmune diseases, diabetes, heart disease, infectious diseases, lower respiratory infections, neonatal conditions, neurological diseases, neuromuscular diseases, muscular diseases, gastrointestinal diseases, kidney diseases, liver diseases, vascular diseases, deafness, blindness, and the like.
- the methods comprise detecting one or more target nucleic acids in a test sample obtained from a subject using the methods provided herein; identifying a subject as having a target nucleic acid; and providing an appropriate treatment for the disease, disorder, or condition.
- the method comprises using a nuclease provided herein to cleave a diseased DNA or RNA in a cell.
- the disease DNA or RNA may yield translation products at an abnormal level or in an abnormal form in cells derived from a disease-affected tissues compared with tissues or cells of a non-disease control.
- the disease- associated nucleic acid can include RNA transcribed from a gene possessing mutation(s) or genetic variation that is directly responsible or is in linkage disequilibrium with a gene(s) that is responsible for the etiology of a disease.
- the translated products may be known or unknown, and may be at a normal or abnormal level.
- the target nucleic acid of a nucleic acid editing system provided herein can be any endogenous or exogenous nucleic acid of a eukaryotic cell.
- the target nucleic acid can be a nucleic acid residing in the nucleus of the eukaryotic cell.
- the target nucleic acid can encode for a gene product (e.g., a protein) or a non-coding sequence (e.g., ncRNA, lncRNA, tRNA, or rRNA).
- a gene product e.g., a protein
- a non-coding sequence e.g., ncRNA, lncRNA, tRNA, or rRNA.
- a scaffold comprises: a nuclease provided herein; and a guide nucleic acid provided herein, wherein the nuclease and/or the guide nucleic acid are immobilized to the scaffold.
- the scaffold comprises a set of nucleases.
- the set of nucleases comprise at least one nuclease comprising an amino acid sequence comprising at least 85% identity to one of SEQ ID NOS: 1, 2, 10, 11, 77-78, or 182, a functional fragment, or a derivative thereof.
- the scaffold comprises a set of guide nucleic acids, wherein the set of guide nucleic acids comprise a direct repeat sequence of any one or more of SEQ ID NO: 5 or SEQ ID NO: 6.
- at least one guide nucleic acid is immobilized to the scaffold.
- the set of nucleases are immobilized to the scaffold.
- at least one nuclease and at least one guide nucleic acid are immobilized to the scaffold.
- at least one nuclease is immobilized to the scaffold and the at least one guide nucleic acid is in complex with the at least one nuclease.
- the set of guide nucleic acids are each immobilized to the scaffold; and the at least one nuclease is in complex with at least one guide sequence.
- at least one nuclease comprises an amino acid sequence with at least 90% identity to one of SEQ ID NOS: 1, 2, 10, 11, 64-78, or 182. In some embodiments, at least one nuclease comprises an amino acid sequence with at least 95% identity to one of SEQ ID NOS: 1, 2, 10, 11, 64-78, or 182. In some embodiments, at least one nuclease comprises an amino acid sequence of one of SEQ ID NOS: 1, 2, 10, 11, 64-78, or 182.
- the guide nucleic acid comprises a direct repeat sequence of SEQ ID NO: 5 or SEQ ID NO: 6; and a variable sequence.
- the variable sequence is at least 75% complementary to a target sequence.
- the scaffold further comprises a target sequence in complex with the at least one nuclease and at least one guide nucleic acid.
- systems comprising a scaffold provided herein.
- the systems comprise: (a) a scaffold provided herein; (b) a reporter molecule; and (c) a detector.
- the reporter molecule when a target nucleic acid forms a complex with the scaffold, the reporter molecule produces a detectable signal that is detected by the detector.
- the system further comprises reagents for nucleic acid amplification.
- the reporter molecule is selected from the group consisting of: a fluorophore, a dye, a polypeptide, an antibody, a nucleic acid, and any combination thereof.
- the detectable signal is a calorimetric signal, a potentiometric signal, an amperometric signal, an optical signal, or a piezo-electric signal.
- kits comprising: a gene editing system provided herein, packaging and materials therefor.
- a formulation of a composition described herein is prepared in a single container for administration.
- a formulation of a composition provided herein is prepared two containers for administration, separating the nucleic acids and/or the nucleases provided herein.
- “container” includes vessel, vial, ampule, tube, cup, box, bottle, flask, jar, dish, well of a single-well or multi-well apparatus, reservoir, tank, or the like, or other device in which the herein disclosed compositions may be placed, stored and/or transported, and accessed to remove the contents.
- kits comprising: a first container comprising a nuclease provided herein; and a second container comprising a guide nucleic acid provided herein, packaging and materials therefor. 11.
- nucleases comprising an amino acid sequence that is at least 85% identical to an amino acid sequence of: SEQ ID NO: 1, SEQ ID NO: 2, SEQ ID NO: 77, a functional fragment, or a derivative thereof.
- nucleases comprising an amino acid sequence that is at least 85% identical to an amino acid sequence of: SEQ ID NO: 10, SEQ ID NO: 11, SEQ ID NO: 78, or SEQ ID NO: 182, a functional fragment, or a derivative thereof.
- a nuclease provided herein comprises an amino acid sequence of SEQ ID NO: 1.
- a nuclease provided herein comprises an amino acid sequence of SEQ ID NO: 2. In some embodiments, a nuclease provided herein comprises an amino acid sequence of SEQ ID NO: 77. In some embodiments, a nuclease provided herein comprises an amino acid sequence of SEQ ID NO: 10. In some embodiments, a nuclease provided herein comprises an amino acid sequence of SEQ ID NO: 11. In some embodiments, a nuclease provided herein comprises an amino acid sequence of SEQ ID NO: 78. In some embodiments, a nuclease provided herein comprises an amino acid sequence of SEQ ID NO: 182.
- nucleases comprising: (a) a nuclease (NUC) lobe; and (b) a helical recognition (REC) lobe, wherein the REC lobe comprises an amino acid sequence that is at least 85% identical to an amino acid sequence of: SEQ ID NO: 14, SEQ ID NO: 15, or SEQ ID NO: 119, wherein the nuclease is capable of binding to the target nucleic acid via the REC lobe in the presence of a guide nucleic acid, and wherein the nuclease capable of cleaving the target nucleic acid via the NUC lobe.
- NUC nuclease
- REC helical recognition
- nucleases comprising: (a) a nuclease (NUC) lobe comprising a Protospacer Adjacent Motif (PAM) Interacting (PI) domain comprising an amino acid sequence that is at least 85% identical to an amino acid sequence of SEQ ID NO: 20; or SEQ ID NO: 21; and (b) a helical recognition (REC) lobe.
- NUC nuclease
- PAM Protospacer Adjacent Motif
- PI Interacting
- nucleases comprising: (a) a nuclease (NUC) lobe, wherein the NUC lobe comprises at least one wedge 2 (WED2) domain comprising an amino acid sequence that is at least 85% identical to an amino acid sequence of SEQ ID NO: 12, SEQ ID NO: 13, SEQ ID NO: 66, SEQ ID NO: 68, or SEQ ID NO: 74; and (b) a helical recognition (REC) lobe, wherein the REC lobe comprises an amino acid sequence that is at least 85% identical to an amino acid sequence of: SEQ ID NO: 14, SEQ ID NO: 15, or SEQ ID NO: 64, wherein the nuclease is capable of binding to the target nucleic acid via the REC lobe in the presence of a guide nucleic acid, and wherein the nuclease capable of cleaving the target nucleic acid via the NUC lobe.
- NUC nuclease
- WED2 wedge 2
- nucleic acid sequences wherein the nucleic acid sequences encode for a nuclease provided herein.
- ribonucleoprotein (RNP) complexes wherein the RNP complexes comprises: (i) a nuclease provided herein; and (ii) a guide RNA that binds to a target nucleic acid.
- vectors wherein the vectors comprise a nucleic acid sequence encoding for a nuclease provided herein.
- viral vectors wherein the viral vectors comprise a nucleic acid sequence encoding for a nuclease provided herein.
- nucleic acid editing systems comprising a nuclease provided herein.
- nucleic acid editing systems comprise: (a) a nuclease comprising an amino acid sequence that is at least 85% identical to an amino acid sequence of: SEQ ID NO: 1, SEQ ID NO: 2, SEQ ID NO: 77, a functional fragment, or a derivative thereof; and (b) a guide nucleic acid that binds to a target nucleic acid, wherein upon binding of said guide nucleic acid to said target nucleic acid, the nuclease or the functional fragment, or the derivative thereof; the guide nucleic acid; and the target nucleic acid form a complex.
- nucleic acid editing systems comprising: (a) a nuclease comprising an amino acid sequence that is at least 85% identical to an amino acid sequence of: SEQ ID NO: 1, SEQ ID NO: 2, SEQ ID NO: 77, a functional fragment, or a derivative thereof; and (b) a guide nucleic acid that binds to a target nucleic acid, wherein upon binding of said nucleic acid editing system to said target nucleic acid, the nucleic acid editing system cleaves the target nucleic acid.
- nucleic acid editing systems comprising: a nuclease comprising a helical recognition (REC) lobe, wherein the REC lobe comprises an amino acid sequence that is at least 85% identical to an amino acid sequence of: SEQ ID NO: 14, SEQ ID NO: 15, or SEQ ID NO: 64 wherein the nuclease is capable of binding to a target nucleic acid via the REC lobe in the presence of a guide nucleic acid; and a guide nucleic acid that binds to a target nucleic acid, wherein upon binding of said guide nucleic acid to said target nucleic acid, the nuclease; the guide nucleic acid; and the target nucleic acid form a complex.
- REC helical recognition
- nucleic acid editing systems comprise: a nuclease comprising (i) a nuclease (NUC) lobe, wherein the NUC lobe comprises at least one wedge 2 (WED2) domain comprising an amino acid sequence that is at least 85% identical to an amino acid sequence of SEQ ID NO: 12, SEQ ID NO: 13, SEQ ID NO: 66, SEQ ID NO: 68, or SEQ ID NO: 74; and (ii) a helical recognition (REC) lobe, wherein the REC lobe comprises an amino acid sequence that is at least 85% identical to an amino acid sequence of: SEQ ID NO: 14, SEQ ID NO: 15, or SEQ ID NO: 64, wherein the nuclease is capable of binding to a target nucleic acid via the REC lobe in the presence of a guide nucleic acid, and wherein the nuclease capable of cleaving the target nucleic acid via the NUC
- nucleic acid editing systems comprise: (a) a nuclease comprising an amino acid sequence that is at least 85% identical to an amino acid sequence of: SEQ ID NO: 10, SEQ ID NO: 11, SEQ ID NO: 78, or SEQ ID NO: 182, a functional fragment, or a derivative thereof; and (b) a guide nucleic acid that binds to a target nucleic acid, wherein upon binding of said guide nucleic acid to said target nucleic acid, the nuclease or the functional fragment, or the derivative thereof; the guide nucleic acid; and the target nucleic acid form a complex.
- nucleic acid editing systems comprising: (a) a nuclease comprising an amino acid sequence that is at least 85% identical to an amino acid sequence of: SEQ ID NO: 1, a functional fragment, or a derivative thereof; and (b) a guide nucleic acid that binds to a target nucleic acid, wherein the guide nucleic acid comprises a direct repeat sequence set forth in SEQ ID NO: 5, and wherein upon binding of said nucleic acid editing system to said target nucleic acid, the nucleic acid editing system cleaves the target nucleic acid.
- nucleic acid editing systems comprising: (a) a nuclease comprising an amino acid sequence that is at least 85% identical to an amino acid sequence of: SEQ ID NO: 10, a functional fragment, or a derivative thereof; and (b) a guide nucleic acid that binds to a target nucleic acid, wherein the guide nucleic acid comprises a direct repeat sequence set forth in SEQ ID NO: 5, and wherein upon binding of said nucleic acid editing system to said target nucleic acid, the nucleic acid editing system cleaves the target nucleic acid.
- nucleic acid editing systems comprising: (a) a nuclease comprising an amino acid sequence that is at least 85% identical to an amino acid sequence of: SEQ ID NO: 2, a functional fragment, or a derivative thereof; and (b) a guide nucleic acid that binds to a target nucleic acid, wherein the guide nucleic acid comprises a direct repeat sequence set forth in SEQ ID NO: 6, and wherein upon binding of said nucleic acid editing system to said target nucleic acid, the nucleic acid editing system cleaves the target nucleic acid.
- nucleic acid editing systems comprising: (a) a nuclease comprising an amino acid sequence that is at least 85% identical to an amino acid sequence of: SEQ ID NO: 11, a functional fragment, or a derivative thereof; and (b) a guide nucleic acid that binds to a target nucleic acid, wherein the guide nucleic acid comprises a direct repeat sequence set forth in SEQ ID NO: 6, and wherein upon binding of said nucleic acid editing system to said target nucleic acid, the nucleic acid editing system cleaves the target nucleic acid.
- nucleic acid editing systems comprising: (a) a nuclease provided herein; and (b) a guide nucleic acid sequence comprising: a direct repeat sequence of SEQ ID NO: 5 or SEQ ID NO: 6 operably linked to a variable sequence, wherein the variable sequence is complementary to a target nucleic acid.
- nucleic acid editing systems wherein upon formation of the complex, the nuclease generates a break in the target nucleic acid.
- nucleic acid editing systems wherein the nuclease generates a single-stranded break in the target nucleic acid or a double-stranded break in the target nucleic acid.
- nucleic acid editing systems wherein the nuclease generates a double-stranded break with staggered 5’ overhangs in the target nucleic acid.
- the target nucleic acid is a DNA.
- nucleic acid editing systems wherein the nuclease further comprises one or more nuclear localization sequence (NLS).
- NLS nuclear localization sequence
- nucleic acid editing systems wherein the one or more NLS is an SV40 Large NLS, a large T antigen NLS, a c-Myc NLS, a nucleoplasmin NLS, an EGL-13 NLS, or a TUS-protein NLS.
- nucleic acid editing systems wherein the NLS comprises an SV40 Large NLS, and wherein the SV40 Large NLS comprises an amino acid sequence of: MGPKKKRKV (SEQ ID NO: 3). Further provided herein are nucleic acid editing systems, wherein the NLS comprises a nucleoplasmin NLS, and wherein the nucleoplasmin NLS comprises an amino acid sequence of: KRPAATKKAGQAKKKK (SEQ ID NO: 4). Further provided herein are nucleic acid editing systems, wherein the nuclease further comprises: (a) an SV40 Large NLS; and (b) a nucleoplasmin NLS.
- nucleic acid editing systems wherein the nuclease further comprises: SEQ ID NO: 3, SEQ ID NO: 4, SEQ ID NO: 51, SEQ ID NO: 63, or SEQ ID NO: 153.
- nucleic acid editing systems wherein the nuclease further comprises one or more peptide linkers.
- nucleic acid editing systems wherein the nuclease further comprises a C-terminal tag.
- nucleic acid editing systems, wherein the C-terminal tag is a c-myc tag.
- the guide nucleic acid comprises RNA.
- nucleic acid editing systems wherein the guide nucleic acid comprises RNA and at least one modified nucleoside. Further provided herein are nucleic acid editing systems, wherein the guide nucleic acid comprises one or more direct repeat sequences. Further provided herein are nucleic acid editing systems, wherein the one or more direct repeat sequences comprises an sequence of: SEQ ID NO: 5, SEQ ID NO: 6, a functional fragment thereof; or a sequence that is complementary to any one of SEQ ID NO: 5 or SEQ ID NO: 6, or a functional fragment thereof. Further provided herein are nucleic acid editing systems, wherein the guide nucleic acid further comprises a sequence that is at least partially complementary to the target nucleic acid sequence.
- nucleic acid editing systems wherein the guide nucleic acid comprises two direct repeat sequences and a variable sequence, wherein the variable sequence is at least 75% complementary to a target nucleic acid sequence. Further provided herein are nucleic acid editing systems, wherein the guide nucleic acid further comprises a promoter sequence. Further provided herein are nucleic acid editing systems, wherein the nuclease comprises an amino acid sequence that is at least 90% identical to an amino acid sequence of any one of SEQ ID NO: 1, SEQ ID NO: 2, or SEQ ID NO: 77.
- nucleic acid editing systems wherein the nuclease comprises an amino acid sequence that is at least 95% identical to an amino acid sequence of any one of SEQ ID NO: 1, SEQ ID NO: 2, or SEQ ID NO: 77. Further provided herein are nucleic acid editing systems, wherein the nuclease comprises an amino acid sequence that is identical to SEQ ID NO: 1, SEQ ID NO: 2, or SEQ ID NO: 77. the nuclease comprises an amino acid sequence that is at least 90% identical to an amino acid sequence of SEQ ID NO: 10, SEQ ID NO: 11, or SEQ ID NO: 78.
- nucleic acid editing systems wherein the nuclease comprises an amino acid sequence that is at least 95% identical to an amino acid sequence of SEQ ID NO: 10, SEQ ID NO: 11, or SEQ ID NO: 78. Further provided herein are nucleic acid editing systems, wherein the nuclease comprises an amino acid sequence that is identical to an amino acid sequence of SEQ ID NO: 10, SEQ ID NO: 11, or SEQ ID NO: 78. Further provided herein are nucleic acid editing systems, wherein the nucleic acid editing system is delivered inside a cell or a cellular organelle via electroporation, nucleofection, conjugation, lipofection, calcium phosphate precipitation, or a delivery vehicle.
- compositions wherein the compositions comprise a delivery vehicle; and a nuclease provided herein or a polynucleotide sequence encoding for a nuclease provided herein. Further provided herein are compositions, wherein the compositions comprise: (a) a delivery vehicle; and (b) a nucleic acid encoding for an amino acid sequence that is at least 85% identical to any one of SEQ ID NOS: 1, 2, 10, 11, 77, 78, 182, a functional fragment, or a derivative thereof. Further provided herein are compositions, wherein the vector is a viral vector.
- compositions wherein the viral vector is a lentiviral vector, an adenoviral vector, an adeno-associated viral vector, or an oncolytic viral vector. Further provided herein are compositions, wherein the compositions further comprise a guide nucleic acid. Further provided herein are compositions, wherein the guide nucleic acid comprises a sequence of any one of SEQ ID NOS: 5-9, 36-39, 53-60, 79-84, 113-152, or a sequence that is complementary to any one of SEQ ID NOS: 5-9, 36-39, 53-60, 79-84, 113-152.
- compositions wherein the guide nucleic acid further comprises a sequence that is at least partially complementary to a target nucleic acid sequence.
- compositions wherein the guide nucleic acid comprises SEQ ID NO: 5 or SEQ ID NO: 6; and a sequence that is at least partially complementary to a target nucleic acid sequence.
- viral vectors wherein the viral vectors comprise: a nucleic acid encoding for an amino acid sequence that is at least 85% identical to any one of: SEQ ID NOS: 1, 2, 10, 11, 77, 78, 182, a functional fragment, or a derivative thereof.
- viral vectors wherein the viral vectors comprise a nucleic acid comprising a sequence that is at least 85% identical to SEQ ID NO: 90. Further provided herein are viral vectors, wherein the viral vectors comprise: (a) a nuclease comprising an amino acid sequence that is at least 85% identical to an amino acid sequence of any one of SEQ ID NOS: 1, 2, 10, 11, 77, 78, 182, a functional fragment, or a derivative thereof; and (b) a guide nucleic acid.
- viral vectors wherein the viral vectors comprise: (a) a nuclease comprising an amino acid sequence that is at least 85% identical to an amino acid sequence of any one of: SEQ ID NOS: 1, 2, 10, 11, 77, 78, 182, a functional fragment, or a derivative thereof; and (b) a guide nucleic acid comprising any one of SEQ ID NOS: 5-6.
- viral vectors comprising: (a) a nuclease comprising an amino acid sequence that is at least 85% identical to an amino acid sequence of any one of: SEQ ID NOS: 1, 2, 10, 11, 77, 78, 182, a functional fragment, or a derivative thereof; and (b) a guide nucleic acid comprising any one of SEQ ID NOS: 5-9, 36-39, 53-60.
- compositions wherein the compositions comprise: a viral vector provided herein; and a delivery vehicle.
- viral vectors and compositions wherein the delivery vehicle is an emulsion, a suspension, a liposome, a micelle, an exosome, an endosome, a virus, a vector, a particle, or a polymer.
- the viral vector is a lentiviral vector, an adenoviral vector, an adeno-associated viral vector, or an oncolytic viral vector.
- lentiviral vectors wherein the lentiviral vectors comprise: a nuclease provided herein, a functional fragment, or a derivative thereof.
- kits wherein the kits comprise a nuclease provided herein.
- the kits comprise: a first container comprising a nuclease provided herein; and a second container comprising a guide nucleic acid provided herein, packaging and materials therefor.
- the kits comprise: a gene editing system provided herein, packaging and materials therefor.
- a target nucleic acid molecule comprising: (a) a nuclease comprising an amino acid sequence that is at least 85% identical to an amino acid sequence of one of: SEQ ID NOS: 1, 2, 10, 11, 77, 78, or 182; and (b) a guide nucleic acid comprising a sequence that binds to the target nucleic acid, wherein upon binding of said guide nucleic acid to the target nucleic acid molecule, the nuclease, the guide nucleic acid, and the target nucleic acid molecule form a complex, wherein the nuclease cleaves a target nucleic acid molecule generating a cleavage site within the target nucleic acid molecule, thereby modifying the target nucleic acid molecule.
- the percentage of target nucleic acid molecule modification is at least 5%, at least 10%, at least 15%, at least 20%, at least 25%, at least 30%, at least 35%, at least 40%, at least 45%, at least 50%, at least 55%, at least 60%, at least 65%, at least 70%, at least 75%, at least 80%, at least 85%, at least 90%, at least 95%, at least 98%, or at least 99%.
- the methods further comprise mutation of a target nucleic acid sequence.
- the mutation comprises an insertion or deletion at said one or more nucleic acid sequences.
- the methods further comprise delivering the composition to a cell.
- the methods further comprise delivering to the cell at least one exogenous nucleic acid molecule for insertion at the cleavage site.
- the at least one exogenous nucleic acid molecule is delivered with a recombination polypeptide.
- the recombination polypeptide is an integrase, a flippase, a transponase, or a recombinase.
- the exogenous nucleic acid molecule comprises at least one sequence having at least 75% homology to the target nucleic acid molecule. Further provided herein are methods, wherein the methods are performed in vitro or in vivo.
- kits for ex vivo modifying a cell comprise: contacting a cell with a composition provided herein under conditions that permit nuclease cleavage of a target nucleic acid molecule, thereby modifying said cell.
- the methods comprise: contacting a cell with a ribonucleoprotein (RNP) complex, wherein the RNP complex comprises: (i) a nuclease comprising an amino acid sequence that is at least 85% identical to an amino acid sequence of any one of: SEQ ID NOS: 1, 2, 10, 11, 77, 78, or 182; and (ii) a guide RNA that binds to a target nucleic acid, wherein upon contacting the cell with the RNP complex, the nuclease cleaves a target nucleic acid molecule, thereby modifying said cell.
- the cell is an immune cell or a stem cell.
- the immune cell is a leukocyte, a lymphocyte, a natural killer cell, a dendritic cell, a macrophage, a myeloid cell, a T-cell, a stem cell, an induced-pluripotent derived cell, a cancer cell, or an endothelial cell.
- the stem cell is an embryonic stem cell, an induced-pluripotent stem cell (iPSC), or an adult stem cell.
- the methods comprise: (a) immobilizing a guide nucleic acid onto a solid support; and (b) contacting the guide nucleic acid with: (i) a test sample; and (ii) a nuclease comprising an amino acid sequence that is at least 85% identical to any one of: SEQ ID NOS: 1, 2, 10, 11, 77, 78, or 182, wherein when the test sample comprises a target nucleic acid capable of binding to the guide nucleic acid, a complex is formed between the guide nucleic acid, the nuclease, and the target nucleic acid, and wherein upon formation of the complex the nuclease cleaves the target nucleic acid; and (c) detecting a signal indicating cleavage of the target nucleic acid molecule, thereby detecting the target nucleic acid in the sample.
- the guide nucleic acid comprises a sequence of any one of: SEQ ID NOS: 5-9, 36-39, 53-60, 79-84, 113-152, or a sequence that is complementary to any one of SEQ ID NOS: 5-9, 36-39, 53-60, 79-84, 113-152. wherein prior to step (c), the method further comprises, amplifying the target nucleic acid.
- the amplifying comprises polymerase chain reaction (PCR), nucleic acid sequence- based amplification (NASBA), recombinase polymerase amplification (RPA), loop-mediated isothermal amplification (LAMP), strand displacement amplification (SDA), helicase-dependent amplification (HDA), nicking enzyme amplification reaction (NEAR), multiple displacement amplification (MDA), rolling circle amplification (RCA), improved multiple displacement amplification (EVIDA), l simple method amplifying RNA targets (SMART), single primer isothermal amplification (SPIA), ligase chain reaction (LCR), transcription mediated amplification (TMA), ramification amplification method (RAM), or any combination thereof.
- PCR polymerase chain reaction
- NASBA nucleic acid sequence- based amplification
- RPA recombinase polymerase amplification
- LAMP loop-mediated isothermal amplification
- SDA strand displacement amplification
- HDA helicase-
- the methods further comprise performing an endonuclease mismatch detection assay, an immunoassay, gel electrophoresis, a plasmid interference assay, nucleic acid sequencing, or any combination thereof.
- the detecting comprises calorimetric detection, potentiometric detection, amperometric detection, optical detection, piezo-electric detection, or any combination thereof.
- the solid support comprises a reaction chip, a paper, a quartz microfiber, mixed esters of cellulose, a porous aluminum oxide, a patterned surface, a tube, a well, or a matrix.
- the test sample comprises a liquid sample.
- test sample comprises a biological sample or an environmental sample.
- biological sample comprises a cell extract, cell medium, blood, plasma, serum, urine, stool, sputum, mucous, lymph fluid, synovial fluid, bile, ascites, pleural effusion, seroma, saliva, cerebrospinal fluid, aqueous or vitreous humor, a bodily secretion, a transudate, an exudate, a skin swab, or a mucosal membrane swab.
- the environmental sample comprises a food sample, a paper surface, a fabric, a metal surface, a wood surface, a plastic surface, a soil sample, a fresh water sample, a waste water sample, a saline water sample, or any combination thereof.
- the test sample is from a subject that has or is suspected of having a disease or a disorder.
- the target nucleic acid is DNA.
- nuclease comprises: (a) a nuclease (NUC) lobe, wherein the NUC lobe comprises at least one wedge 2 (WED2) domain comprising an amino acid sequence that is at least 85% identical to an amino acid sequence of SEQ ID NO: 12, SEQ ID NO: 13, SEQ ID NO: 66, SEQ ID NO: 68, or SEQ ID NO: 74; and (b) a helical recognition (REC) lobe, wherein the REC lobe comprises an amino acid sequence that is at least 85% identical to an amino acid sequence of: SEQ ID NO: 14, SEQ ID NO: 15, or SEQ ID NO: 64, wherein the nuclease is capable of binding to the target nucleic acid via the REC lobe in the presence of a guide nucleic acid, and wherein the nuclease capable of cleaving the target nucleic acid via the NUC lobe.
- NUC nuclease
- WED2 wedge 2
- NUC lobe further comprises: a wedge 1 (WED1) domain comprising an amino acid sequence that is at least 85% identical to an amino acid sequence of: SEQ ID NO: 16; or SEQ ID NO: 17; and/or a WED 3 domain comprising an amino acid sequence that is at least 85% identical to an amino acid sequence of SEQ ID NO: 18; or SEQ ID NO: 19.
- WED1 wedge 1
- SEQ ID NO: 16 amino acid sequence of: 16
- SEQ ID NO: 17 or SEQ ID NO: 17
- WED 3 domain comprising an amino acid sequence that is at least 85% identical to an amino acid sequence of SEQ ID NO: 18; or SEQ ID NO: 19.
- nuclease wherein the NUC lobe further comprises: a Protospacer Adjacent Motif (PAM) Interacting (PI) domain comprising an amino acid sequence that is at least 85% identical to an amino acid sequence of SEQ ID NO: 20; or SEQ ID NO: 21.
- PAM Protospacer Adjacent Motif
- PI Interacting
- nuclease wherein the NUC lobe further comprises: a RuvCI domain comprising an amino acid sequence that is at least 85% identical to an amino acid sequence of: SEQ ID NO: 22; or SEQ ID NO: 23.
- nuclease wherein the NUC lobe further comprises: a RuvCII domain comprising an amino acid sequence that is at least 85% identical to an amino acid sequence of: SEQ ID NO: 24; or SEQ ID NO: 25. Further provided herein are uses for a nuclease, wherein the NUC lobe further comprises a nuclease domain comprising an amino acid sequence that is at least 85% identical to an amino acid sequence of: SEQ ID NO: 26; or SEQ ID NO: 27.
- nuclease wherein the NUC lobe further comprises: a RuvCIII domain comprising an amino acid sequence that is at least 85% identical to an amino acid sequence of: SEQ ID NO: 28; or SEQ ID NO: 29. Further provided herein are uses for a nuclease, wherein the nuclease comprises an amino acid sequence that is at least 85% identical to any one of SEQ ID NOS: 1, 2, 10, 11, 77, or 78. Further provided herein are uses for a nuclease, wherein the nuclease comprises an amino acid sequence that is at least 90% identical to any one of SEQ ID NOS: 11, 2, 10, 11, 77, or 78.
- nuclease comprises an amino acid sequence that is at least 95% identical to any one of SEQ ID NOS: 1, 2, 10, 11, 77, or 78. Further provided herein are uses for a nuclease, wherein the nuclease comprises an amino acid sequence comprising one of SEQ ID NOS: 1, 2, 10, 11, 77, or 78.
- scaffolds comprising a nuclease or a gene editing system provided herein, wherein the nuclease or the gene editing system is immobilized to the scaffold.
- scaffolds comprising: (a) a set of nucleases, wherein at least one nuclease comprises an amino acid sequence comprising at least 85% identity to one of SEQ ID NOS: 1, 2, 10, 11, 77, 78, 182, a functional fragment, or a derivative thereof; and (b) a set of guide nucleic acids, wherein the set of nucleases and/or the set of guide nucleic acids are immobilized to the scaffold.
- scaffolds wherein the set of nucleases comprise at least one nuclease comprising an amino acid sequence comprising at least 85% identity to one of SEQ ID NOS: 1, 2, 10, 11, 77, 78, 182, a functional fragment, or a derivative thereof.
- the scaffolds comprise a set of guide nucleic acids, wherein the set of guide nucleic acids comprise a direct repeat sequence of SEQ ID NO: 5 or SEQ ID NO: 6.
- at least one guide nucleic acid is immobilized to the scaffold.
- the set of nucleases are immobilized to the scaffold.
- scaffolds wherein at least one nuclease and at least one guide nucleic acid are immobilized to the scaffold. Further provided herein are scaffolds, wherein at least one nuclease is immobilized to the scaffold and the at least one guide nucleic acid is in complex with the at least one nuclease. Further provided herein are scaffolds, wherein the set of guide nucleic acids are each immobilized to the scaffold; and the at least one nuclease is in complex with at least one guide sequence. Further provided herein are scaffolds, wherein at least one nuclease comprises an amino acid sequence with at least 90% identity to one of SEQ ID NOS: 1, 2, 10, 11, 77, or 78.
- scaffolds wherein at least one nuclease comprises an amino acid sequence with at least 95% identity to one of SEQ ID NOS: 1, 2, 10, 11, 77, or 78. Further provided herein are scaffolds, wherein at least one nuclease comprises an amino acid sequence of one of SEQ ID NOS: 1, 2, 10, 11, 77, or 78. Further provided herein are scaffolds, wherein the guide nucleic acid comprises a direct repeat sequence of SEQ ID NO: 5 or SEQ ID NO: 6; and a variable sequence. Further provided herein are scaffolds, wherein the variable sequence is at least 75% complementary to a target sequence.
- scaffolds wherein the scaffold further comprises a target sequence in complex with the at least one nuclease and at least one guide nucleic acid.
- systems and devices comprising: (a) a scaffold provided herein; (b) a reporter molecule or a phase changing agent; and (c) a detector, wherein, when a target nucleic acid forms a complex with the scaffold, the reporter molecule or phase changing agent produces a detectable signal that is detected by the detector.
- systems and devices comprising: (a) a nuclease provided herein; (b) a reporter molecule or a phase changing agent; and (c) a detector, wherein, when the nuclease cleaves a target nucleic acid the reporter molecule or phase changing agent produces a detectable signal that is detected by the detector.
- the system further comprises reagents for nucleic acid amplification.
- the reporter molecule is selected from the group consisting of: a fluorophore, a dye, a polypeptide, an antibody, a nucleic acid, and any combination thereof.
- the detectable signal is a calorimetric signal, a potentiometric signal, an amperometric signal, an optical signal, or a piezo-electric signal.
- the fusion proteins comprise a nuclease provided herein.
- fusion proteins comprise: (a) a first protein construct comprising: a first nuclease comprising a helical recognition (REC) lobe, wherein the REC lobe comprises an amino acid sequence that is at least 85% identical to an amino acid sequence of: SEQ ID NO: 14, SEQ ID NO: 15, or SEQ ID NO: 64, wherein the nuclease is capable of binding to a target nucleic acid via the REC lobe in the presence of a guide nucleic acid; and (b) a second protein construct comprising: a second nuclease, an acetylase, an acetyltransferase, an ATPase, an Argonaute protein, a base editor, a Cas polypeptide, a catalytically dead Cas polypeptide, a deacetylase, a deaminase, a decapping protein, an endonuclease, an exonucle
- fusion proteins comprise: (a) a first protein construct comprising: a first nuclease comprising: (i) a nuclease (NUC) lobe, wherein the NUC lobe comprises at least one wedge 2 (WED2) domain comprising an amino acid sequence that is at least 85% identical to an amino acid sequence of SEQ ID NO: 12, SEQ ID NO: 13, SEQ ID NO: 66, SEQ ID NO: 68, or SEQ ID NO: 74; and (ii) a helical recognition (REC) lobe, wherein the REC lobe comprises an amino acid sequence that is at least 85% identical to an amino acid sequence of: SEQ ID NO: 14, SEQ ID NO: 15, or SEQ ID NO: 64, wherein the nuclease is capable of binding to a target nucleic acid via the REC lobe in the presence of a guide nucleic acid, and wherein the nuclease capable of cleaving
- fusion proteins comprise: (a) a first protein construct comprising: (a) a first protein construct comprising: a nuclease comprising an amino acid sequence that is at least 85% identical to one of SEQ ID NOS: 1, 2, 10, 11, 77, 78, 182, a functional fragment, or a derivative thereof; and (b) a second protein construct comprising: a second nuclease, an acetylase, an acetyltransferase, an ATPase, an Argonaute protein, a base editor, a Cas polypeptide, a catalytically dead Cas polypeptide, a deacetylase, a deaminase, a decapping protein, an endonuclease, an exonuclease, a helicase, a ligase, a meganuclease, a methylase, a methyltransfera
- fusion proteins comprise: (a) a first protein construct comprising: (a) a first protein construct comprising: a first nuclease consisting of SEQ ID NO: 10, SEQ ID NO: 11, or SEQ ID NO: 78; and (b) a second protein construct comprising: a second nuclease, an acetylase, an acetyltransferase, an ATPase, an Argonaute protein, a base editor, a Cas polypeptide, a catalytically dead Cas polypeptide, a deacetylase, a deaminase, a decapping protein, an endonuclease, an exonuclease, a helicase, a ligase, a meganuclease, a methylase, a methyltransferase, a nickase, a polymerase, a prote
- isolated proteins wherein the isolated proteins comprise: an amino acid sequence that is at least 85% identical to any one of SEQ ID NOS: 1, 2, 10, 11, 77, 78, or 182. Further provided herein are isolated proteins, wherein the isolated proteins comprise any one of SEQ ID NOS: 1, 2, 10, 11, 77, 78, or 182. [00204] Provided herein are isolated nucleic acids, wherein the isolated nucleic acids comprise: a sequence that encodes a protein that comprises an amino acid sequence that is at least 85% identical to any one of SEQ ID NOS: 1, 2, 10, 11, 77, 78, 182, a fragment, or a derivative thereof.
- isolated nucleic acids comprising: a sequence that encodes a protein that comprises an amino acid sequence comprising any one of SEQ ID NOS: 1, 2, 10, 11, 77, 78, 182, a fragment, or a derivative thereof.
- engineered nucleic acids wherein the engineered nucleic acids comprise: a sequence that is at least 85% identical to SEQ ID NO: 90 or SEQ ID NO: 183.
- engineered nucleic acids wherein the engineered nucleic acids comprise SEQ ID NO: 90 or SEQ ID NO: 183.
- Nuclease #1 comprising a sequence of SEQ ID NO: 1
- Nuclease #2 comprising a sequence of SEQ ID NO: 2 according to guided silencing effects and homolog-specific gRNA cassettes are described herein.
- 164 proteins were identified to have homology to two known gene editing nuclease proteins. A phylogenetic tree was generated and the percent sequence identity was compared for all sequences ranging from 19% to 50% sequence identity to the known gene editing nuclease protein.
- Nuclease #1 and Nuclease #2 were generated by standard cloning techniques along with eight other test nuclease sequences.
- Nuclease #1 and Nuclease #2 were each modified to include an SV40 NLS and nucleoplasmin NLS sequence yielding SEQ ID NO: 10 and SEQ ID NO: 11.
- Co-factor RNAs were designed using CRISPick software yielding SEQ ID NOS: 5 and 6 for direct repeat sequences.
- Full length targeting sequences for CD274 and B2M were designed and incorporated into expression cassettes with an hU6 promoter (SEQ ID NO: 31) summarized in Table 6.
- T7 endonuclease (T7EN1) assays were performed.200 ng of gDNA was used to amplify the genomic locus of CD274 covering the guide targeting sequence.
- 510 bp PCR amplicons were purified and used for T7 endonuclease assays at 10 ng/ ⁇ L.
- Samples from the T7EN1 assay were separated by agarose gel electrophoresis to detect the DNA cleavage products.
- the expected cleavage products were 400 base pairs (bps) and 100 bps using CD274 Guide #1 with control nuclease-CNTRL Nuc.
- Nuclease #1 and Nuclease #2 produced the expected bands 400 base pair cleavage product (FIG.3).
- Genomic analysis of B2M and CD274 targeting was performed (FIG. 4A-4B).
- Nuclease #1 and Nuclease #2 both produced indels/mismatches as using the same CD274 guide sequence (SEQ ID NO: 35).
- Nuclease #1 produced 1.8% indels targeting CD274 (FIG.4A) and 0.4% indels targeting B2M (FIG.4B).
- EXAMPLE 2 GUIDE RNA OPTIMIZATION FOR NUCLEASE #1 AND NUCLEASE #2.
- Optimization of the guide nucleic acid sequence was performed using CRISPick software to improve CD274 targeting by Nuclease #1 (SEQ ID NO: 10) and Nuclease #2 (SEQ ID NO: 11). Different combinations of direct repeat sequences and variable sequences were assayed with their corresponding nucleases.
- Nuclease #1 and Nuclease #2 were transferred into a suitable lentiviral vector including blasticidin selection (BlastR). Suitable mutations and protein engineering of Nuclease #1 and Nuclease #2 are performed to enhance stability and cleavage efficiency of target DNA. Up to 50 different Nuclease #1 and Nuclease #2 mutants are designed and tested in RKO cells using the protocol outlined in Example 3. Lentiviral vectors comprising the Nuclease #1 or Nuclease #2 mutants are used to establish a pool of 50 to 100 nucleases for a high-throughput screening platform.
- Nuclease mutants are generated that are fully or partially nuclease-deficient.
- nuclease #1 an Arginine at position 1165 can be substituted with an Alanine (R1165A) to produce a nuclease-deficient enzyme with SEQ ID NO: 61.
- R1165A Alanine
- Nuclease-deficient mutant of Nuclease #2 is generated by a R1186A substitution as set forth in SEQ ID NO: 62.
- EXAMPLE 5 STRUCTURE-GUIDED ENGINEERING OF A NUCLEASE.
- CNTRL wild-type
- FIG. 7A-7B Flow cytometry and T7 endonuclease assays were performed (FIG. 7A-7B, FIG. 8, and FIG. 9).
- Flow cytometry of RKO cells transfected with a mutant nuclease or the WT control nuclease targeting CD274 is shown in FIG. 8A.
- Double mutant #3 and the triple mutant exhibited a 2-fold reduction in the percentage of CD274 negative RKO cells (approx 40% CD274 negative) relative to the control nuclease with the same guide sequence (FIG. 7A).
- a T7 endonuclease assay confirmed that the mutant nucleases effectively disrupted the CD274 gene (FIG.7B).
- FIG. 8 Flow cytometry of RKO cells transfected with a mutant nuclease or the WT control nuclease targeting CD46 is shown in FIG. 8.
- the WT control nuclease did not disrupt expression of CD46 in RKO cells.
- the mutant nucleases comprising single, double or triple mutants effectively reduced the percentage of CD46 RKO cells.
- the single mutant #1, the double mutant #2 and the triple mutant resulted in enhacned targeting of 40% CD46 negative RKO cells.
- RKO cells transfected with the WT control nuclease and guide RNA targeting B2M did not disrupt the expression of B2M in RKO cells (FIG. 9).
- Configuration sequences A-C increased CD247 targeting of the control nuclease by over 30% (FIG.10B-10C) and 1.36-fold higher relative to the initial sequence, Configuration 1.
- Configuration 1 When the triple mutant nuclease (E155R, S532R, K538R) was delivered using configuration A in combination with a CD247-targeting guide sequence, targeting of CD247 was greater than 70% (FIG. 11A). This was also the case for targeting of B2M (FIG. 11B) and CD46 genomic loci (FIG.11C).
- RKO cells were co-transfected with the WT (SEQ ID NO: 85) or Triple Mutant nucleases (SEQ ID NO: 89 and 90) and sgRNAs targeting either B2M, CD274 or CD46.
- Genomic DNA was extracted from transfected RKO cells and genomic loci of B2M, CD274 and CD46 genes were PCR amplified.
- Next generation sequencing after polymerase chain reaction (PCR) amplification of genomic targets was performed to analyze and quantify genomic indel formation. NGS data derived from a MiSeq PE150 run was analyzed in order to identify and quantify the indel formation as shown in Table 11 and 12. The most dominant genomic targeting patterns have been summarized including an indel-specific quantification of editing frequencies.
- RKO cells were transfected with one of the guide sequences targeting CD274 and the Triple Mutant Nuclease or the WT control nuclease. Flow cytometry was performed as previously described.
- the Triple Mutant Nuclease (SEQ ID NO: 90) efficiently targeted TTNN PAMs and NTTN PAMs, indicating that the Triple Mutant Nuclease had greater flexibility for generating double strand breaks in the CD274 gene over the WT control nuclease (SEQ ID NO: 86).
- the baseline targeting activity of the CD274 guide using a canonical TTTC PAM was used as a reference to rank alternative, non-canonical guides-PAM combinations.
- EXAMPLE 8 Drop Out Fitness Screen of Engineered Nucleases for PAM Site Flexibility. [00236] A dropout fitness screen was conducted to explore the PAM site flexibility of the engineered triple mutant endonuclease at larger scale.
- RKO cells expressing the engineered triple mutant endonuclease (E155R/S532R/K538R) in conjunction with the optimized NLS configuration and a 2A-mCherry reporter (SEQ ID NO: 182 and SEQ ID NO: 183) were used for the fitness screen and guide RNA library transduction.
- the library of 5,994 guide RNAs expressed next to GFP and an antibiotic resistance cassette was designed to target 300 essential and 33 non-essential control genes covering a total of 18 different PAM sites for each target gene (SEQ ID NO: 184 and SEQ ID NO: 185).
- the guide RNA library was introduced via lentiviral transduction at a multiplicity of infection (MOI) of 0.1 (10% GFP-positive RKO cells) to integrate single guide RNAs into endonuclease expressing RKO cells.
- MOI multiplicity of infection
- Antibiotic selection was employed for seven days to gain approximately 98% GFP-positive RKO cells expressing the endonuclease and a single guide RNA.
- Fitness effects relative to the starting point on day 0 (guide RNA library representation) were assessed on day 7 (end of antibiotic selection) and on day 21 (final time point) post guide RNA transduction (FIGS. 14A-14B). For this, guide RNA sequences integrated into the gDNA of RKO cells were PCR-amplified and evaluated via NGS.
- Counting of total guide RNAs enabled identification of the most active PAM - guide RNA combinations targeting essential genes as those combinations dropped out resulting in low guide counts (FIG. 14B).
- the dropout effects improved along the fitness screen as log fold-change (LFC) values were increased on day 21 with PAM sites “CTTC”, “TTTC” and “TTCC” identified as ideal candidates (FIG. 14B).
- LFC log fold-change
- three additional PAM sites were identified which can be categorized as “NTTN” (ATTC, ATTG, GTTC).
- sequences (SEQ ID NOS: 32-35) are formatted to signify the promoter region, the direct repeat (DR) sequence, and variable sequence. Underlined signifies the DR and bold text signifies the variable sequence. Unformatted text is the promoter region of the expression cassette. Docket No.199827-761601/PCT [00241] The following sequences (SEQ ID NOS: 38-39) are formatted to signify the direct repeat (DR) sequence and variable sequence. Bold text signifies the variable sequence. [00242] The following sequences (SEQ ID NO: 53-60) are formatted to signify the direct repeat (DR) sequence and variable sequence. Bold text signifies the variable sequence. SEQ ID NO: 54: Nuclease #1 direct repeat with CD274-targeting guide RNA (bold: Variable Sequence)
- SEQ ID NOS: 61-62 are formatted to signify the mutation that generates nickase activity. Underlined and bold text signifies the amino acid substitution as compared with reference sequences, SEQ ID NO: 1 and SEQ ID NO: 2, respectively.
- SEQ ID NO: 182 The following sequence (SEQ ID NO: 182) is formatted to signify the the amino acid substitutions made in the sequence relative to a reference sequence (e.g., SEQ ID NO: 1). Italicized and bold text signifies the 2A-self-cleaving peptide sequence (SEQ ID NO: 168). Capital letters indicate the endonuclease sequence.
- SEQ ID NO: 184 The following sequence (SEQ ID NO: 184) is formatted to signify the the nucleic acid substitutions made in the sequence relative to a reference sequence.
- SEQ ID NO: 184 DNA sequence encoding the guide RNA library scaffold with a GFP - P2A - PuroR - WPRE - hU6 - Optimized direct repeat (DR) (20 bp, bold italics) - Example guide (23 bp) [00247]
- the following sequence (SEQ ID NO: 185) is formatted to signify the the amino acid substitutions made in the sequence relative to a reference sequence.
- SEQ ID NO: 185 GFP - P2A (italics)– PuroR (bold and italics) amino acid sequence of the guide RNA library vector.
Landscapes
- Life Sciences & Earth Sciences (AREA)
- Health & Medical Sciences (AREA)
- Genetics & Genomics (AREA)
- Chemical & Material Sciences (AREA)
- Engineering & Computer Science (AREA)
- Organic Chemistry (AREA)
- Wood Science & Technology (AREA)
- Zoology (AREA)
- Bioinformatics & Cheminformatics (AREA)
- Biomedical Technology (AREA)
- Biotechnology (AREA)
- General Engineering & Computer Science (AREA)
- Molecular Biology (AREA)
- Microbiology (AREA)
- Biochemistry (AREA)
- General Health & Medical Sciences (AREA)
- Physics & Mathematics (AREA)
- Biophysics (AREA)
- Plant Pathology (AREA)
- Proteomics, Peptides & Aminoacids (AREA)
- Analytical Chemistry (AREA)
- Immunology (AREA)
- Virology (AREA)
- Chemical Kinetics & Catalysis (AREA)
- Crystallography & Structural Chemistry (AREA)
- Medicinal Chemistry (AREA)
- Cell Biology (AREA)
- Mycology (AREA)
- Enzymes And Modification Thereof (AREA)
- Measuring Or Testing Involving Enzymes Or Micro-Organisms (AREA)
Abstract
Applications Claiming Priority (3)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| US202263318906P | 2022-03-11 | 2022-03-11 | |
| US202363479274P | 2023-01-10 | 2023-01-10 | |
| PCT/US2023/064097 WO2023173062A2 (fr) | 2022-03-11 | 2023-03-10 | Systèmes d'édition d'acides nucléiques, méthodes et leurs utilisations |
Publications (1)
| Publication Number | Publication Date |
|---|---|
| EP4490288A2 true EP4490288A2 (fr) | 2025-01-15 |
Family
ID=87936073
Family Applications (1)
| Application Number | Title | Priority Date | Filing Date |
|---|---|---|---|
| EP23767727.3A Withdrawn EP4490288A2 (fr) | 2022-03-11 | 2023-03-10 | Systèmes d'édition d'acides nucléiques, méthodes et leurs utilisations |
Country Status (2)
| Country | Link |
|---|---|
| EP (1) | EP4490288A2 (fr) |
| WO (1) | WO2023173062A2 (fr) |
Family Cites Families (5)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| MX374090B (es) * | 2013-03-14 | 2025-03-05 | Caribou Biosciences Inc | Composiciones y métodos de ácidos nucleicos dirigidos al ácido nucleico. |
| US9580727B1 (en) * | 2015-08-07 | 2017-02-28 | Caribou Biosciences, Inc. | Compositions and methods of engineered CRISPR-Cas9 systems using split-nexus Cas9-associated polynucleotides |
| US20200263190A1 (en) * | 2016-04-19 | 2020-08-20 | The Broad Institute, Inc. | Novel crispr enzymes and systems |
| WO2019006418A2 (fr) * | 2017-06-30 | 2019-01-03 | Intima Bioscience, Inc. | Vecteurs viraux adéno-associés destinés à la thérapie génique |
| US20190316101A1 (en) * | 2017-07-18 | 2019-10-17 | Howard Hughes Medical Institute | Methods and compositions for genetically manipulating genes and cells |
-
2023
- 2023-03-10 WO PCT/US2023/064097 patent/WO2023173062A2/fr not_active Ceased
- 2023-03-10 EP EP23767727.3A patent/EP4490288A2/fr not_active Withdrawn
Also Published As
| Publication number | Publication date |
|---|---|
| WO2023173062A3 (fr) | 2023-11-23 |
| WO2023173062A2 (fr) | 2023-09-14 |
Similar Documents
| Publication | Publication Date | Title |
|---|---|---|
| EP3222728B1 (fr) | Procédé de régulation de l'expression génique à l'aide de la protéine cas9 exprimée par deux vecteurs | |
| US11827899B2 (en) | Vector-free delivery of gene editing proteins and compositions to cells and tissues | |
| CA3142521A1 (fr) | Compositions et procedes pour l'edition de beta-globine pour le traitement d'hemoglobinopathies | |
| AU2017260426B2 (en) | Engineered nucleases useful for treatment of hemophilia A | |
| US20230340437A1 (en) | Modified nucleases | |
| CN113039276A (zh) | 核酸酶介导的核酸修饰 | |
| US20250243516A1 (en) | Dual Cut Retron Editors for Genomic Insertions and Deletions | |
| Zhang et al. | Gene knock-in by CRISPR/Cas9 and cell sorting in macrophage and T cell lines | |
| AU2023289695A1 (en) | Compositions and methods for epigenetic regulation of b2m expression | |
| US20250127924A1 (en) | Gene editing systems comprising an rna guide targeting stathmin 2 (stmn2) and uses thereof | |
| AU2024347035A1 (en) | Dna-dependent dna polymerase compositions, methods, and uses thereof | |
| JP2024501892A (ja) | 新規の核酸誘導型ヌクレアーゼ | |
| EP4490288A2 (fr) | Systèmes d'édition d'acides nucléiques, méthodes et leurs utilisations | |
| JP2023548386A (ja) | B2mを標的とするrnaガイドを含む組成物及びその使用 | |
| TW202440913A (zh) | Cas12蛋白、CRISPR-Cas系統及其用途 | |
| AU2021329295B2 (en) | Nuclease-mediated nucleic acid modification | |
| EP4499849A1 (fr) | Production d'adn transcrit inversé (adn-rt) au moyen d'une rétron transcriptase inverse à partir d'adn exogène | |
| US12173336B2 (en) | PaCas9 nuclease | |
| CN109913452A (zh) | 用于靶向HBB RNA的gRNA及基于C2c2的HBB突变检测方法、检测试剂盒 | |
| CN109897852A (zh) | 基于C2c2的肿瘤相关突变基因的gRNA、检测方法、检测试剂盒 | |
| WO2022104344A2 (fr) | Knock-in d'adn de grande longueur pour une expression génomique élevée à long terme | |
| WO2025064825A1 (fr) | Compositions d'adn ligase, méthodes et utilisations associées | |
| US20240042029A1 (en) | Delivery of molecules to cells using trogocytosis and engineered cells | |
| WO2025106934A1 (fr) | Enzymes à domaines ruvc | |
| TW202334421A (zh) | 包含靶向ciita之rna引導之組合物及其用途 |
Legal Events
| Date | Code | Title | Description |
|---|---|---|---|
| STAA | Information on the status of an ep patent application or granted ep patent |
Free format text: STATUS: THE INTERNATIONAL PUBLICATION HAS BEEN MADE |
|
| PUAI | Public reference made under article 153(3) epc to a published international application that has entered the european phase |
Free format text: ORIGINAL CODE: 0009012 |
|
| STAA | Information on the status of an ep patent application or granted ep patent |
Free format text: STATUS: REQUEST FOR EXAMINATION WAS MADE |
|
| 17P | Request for examination filed |
Effective date: 20241001 |
|
| AK | Designated contracting states |
Kind code of ref document: A2 Designated state(s): AL AT BE BG CH CY CZ DE DK EE ES FI FR GB GR HR HU IE IS IT LI LT LU LV MC ME MK MT NL NO PL PT RO RS SE SI SK SM TR |
|
| STAA | Information on the status of an ep patent application or granted ep patent |
Free format text: STATUS: THE APPLICATION HAS BEEN WITHDRAWN |
|
| 18W | Application withdrawn |
Effective date: 20250417 |