WO2023082011A1 - Endonucleases that selectively cleave single-stranded nucleic acids and uses thereof - Google Patents
Endonucleases that selectively cleave single-stranded nucleic acids and uses thereof Download PDFInfo
- Publication number
- WO2023082011A1 WO2023082011A1 PCT/CA2022/051668 CA2022051668W WO2023082011A1 WO 2023082011 A1 WO2023082011 A1 WO 2023082011A1 CA 2022051668 W CA2022051668 W CA 2022051668W WO 2023082011 A1 WO2023082011 A1 WO 2023082011A1
- Authority
- WO
- WIPO (PCT)
- Prior art keywords
- endonuclease
- sequence
- isolated
- nucleic acid
- stranded nucleic
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Ceased
Links
Classifications
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N9/00—Enzymes; Proenzymes; Compositions thereof; Processes for preparing, activating, inhibiting, separating or purifying enzymes
- C12N9/14—Hydrolases (3)
- C12N9/16—Hydrolases (3) acting on ester bonds (3.1)
- C12N9/22—Ribonucleases [RNase]; Deoxyribonucleases [DNase]
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12P—FERMENTATION OR ENZYME-USING PROCESSES TO SYNTHESISE A DESIRED CHEMICAL COMPOUND OR COMPOSITION OR TO SEPARATE OPTICAL ISOMERS FROM A RACEMIC MIXTURE
- C12P19/00—Preparation of compounds containing saccharide radicals
- C12P19/26—Preparation of nitrogen-containing carbohydrates
- C12P19/28—N-glycosides
- C12P19/30—Nucleotides
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12P—FERMENTATION OR ENZYME-USING PROCESSES TO SYNTHESISE A DESIRED CHEMICAL COMPOUND OR COMPOSITION OR TO SEPARATE OPTICAL ISOMERS FROM A RACEMIC MIXTURE
- C12P19/00—Preparation of compounds containing saccharide radicals
- C12P19/26—Preparation of nitrogen-containing carbohydrates
- C12P19/28—N-glycosides
- C12P19/30—Nucleotides
- C12P19/34—Polynucleotides, e.g. nucleic acids, oligoribonucleotides
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12P—FERMENTATION OR ENZYME-USING PROCESSES TO SYNTHESISE A DESIRED CHEMICAL COMPOUND OR COMPOSITION OR TO SEPARATE OPTICAL ISOMERS FROM A RACEMIC MIXTURE
- C12P19/00—Preparation of compounds containing saccharide radicals
- C12P19/26—Preparation of nitrogen-containing carbohydrates
- C12P19/28—N-glycosides
- C12P19/30—Nucleotides
- C12P19/36—Dinucleotides, e.g. nicotineamide-adenine dinucleotide phosphate
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12Y—ENZYMES
- C12Y301/00—Hydrolases acting on ester bonds (3.1)
- C12Y301/21—Endodeoxyribonucleases producing 5'-phosphomonoesters (3.1.21)
- C12Y301/21001—Deoxyribonuclease I (3.1.21.1)
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12Y—ENZYMES
- C12Y301/00—Hydrolases acting on ester bonds (3.1)
- C12Y301/26—Endoribonucleases producing 5'-phosphomonoesters (3.1.26)
- C12Y301/26003—Ribonuclease III (3.1.26.3)
Definitions
- the present disclosure generally relates to the fields of enzymology and molecular biology, and more particularly to endonucleases.
- Endonucleases are enzymes that cleave the phosphodiester bond within a polynucleotide chain.
- Two types of endonucleases are restriction nucleases and homing endonucleases.
- Restriction endonucleases are enzymes that recognize a specific nucleotide sequence in a double-stranded nucleic acid called a restriction site. Upon binding to the restriction site, the restriction endonucleases cleave within or near the restriction site. These enzymes are routinely used for DNA modification in laboratories, such as for genetic engineering and molecular cloning. For example, they are used to assist insertion of genes into plasmid vectors, to distinguish gene alleles by specifically recognizing single base changes in DNA known as single-nucleotide polymorphisms (SNPs), to digest genomic DNA for gene analysis, and to insert nucleic acid molecules within the genome of an organism. Most of the known restriction enzymes recognize a restriction site, typically comprising from 4 to 8 nucleotides that are often palindromic, within a double-stranded DNA molecule, and produce a double-stranded cut in the DNA.
- SNPs single-nucleotide polymorphisms
- Homing endonucleases are double-stranded DNases that have large, asymmetric recognition sites (12-40 base pairs). However, unlike restriction endonucleases, homing endonucleases tolerate some sequence degeneracy within their recognition sequence, which means that single base changes do not abolish cleavage but reduce its efficiency to variable extents. As a result, their observed sequence specificity is typically in the range of 10-12 base pairs.
- endonucleases that cleave single-stranded nucleic acid molecules are usually non-specific, i.e., they do not recognize a specific sequence (e.g., restriction site) within the single-stranded nucleic acid molecules but rather cleave at various sequences to degrade the nucleic acid molecules in several fragments.
- a specific sequence e.g., restriction site
- the present disclosure provides the following items 1 to 73:
- An isolated endonuclease specific for single-stranded desoxyribonucleic acid molecules having a length of 60 to 150 amino acids or less and comprising a single GIY-YIG domain.
- X1 is any amino acid
- X2 is any amino acid
- X3 is Y or H
- B1 is a sequence of 8 to 12 amino acids
- X4 is Y or H
- X5 is any amino acid.
- X6 is G or D
- B2 is a sequence of 6 to 15 amino acids
- X7 is R or T
- B3 is a sequence of 30 to 40 amino acids; and X8 is E.
- X1-X2-X3-B1-X4-X5-X6-B2-X7-B4-X9-B5-X8 (II) wherein X1 , X2, X3, B1 , X4, X5, X6, B2, X7 and X8 are as defined in items 4 to 20; B4 is a sequence of 1 to 5 amino acids;
- X9 is H, Q or Y
- B5 is a sequence of 30 to 38 amino acids.
- X1 , X2, X3, B1 , X4, X5, X6, B2, X7, B4, X9, B5 and X8 are as defined in items 4 to 20;
- B6 is a sequence of 15 to 20 amino acids; and X10 is N or K.
- composition comprising (i) the isolated endonuclease of any one of items 1 to 34, and (ii) an aqueous saline solution or buffer.
- composition of item 35 wherein the aqueous saline solution or buffer comprises a metal.
- composition of item 36, wherein the metal is in the form of a metal salt.
- composition of any one of items 35 to 41 , wherein the single-stranded nucleic acid molecule comprises a nucleotide sequence having at least 50% sequence identity with the sequence: GTCATTCCCNNNNNNNNGGGAATC or GTCATTCCCGCGAAAGCGGGAATC.
- composition of any one of items 35 to 42, wherein the single-stranded nucleic acid molecule comprises the following nucleotide sequence: GTCANNCCNGNNNANNCNGGNNNC.
- composition of item 43, wherein the single-stranded nucleic acid molecule comprises the following nucleotide sequence: GTCAYBCCMGYRHAVRCKGGVRNC.
- composition of item 43 or 44, wherein the single-stranded nucleic acid molecule comprises any one of the nucleotide sequences depicted in FIG. 8 and FIG. 9C.
- 47. A method for cleaving a single-stranded nucleic acid molecule, the method comprising contacting the single-stranded nucleic acid molecule with the isolated endonuclease of any one of items 1 to 34 or the composition of any one of items 35 to 45 under conditions suitable for cleavage of the single-stranded nucleic acid molecule by the isolated endonuclease, wherein the single-stranded nucleic acid molecule comprises a recognition sequence for the isolated endonuclease.
- nucleic acid fragment comprising the nucleotide sequence defined in any one of items 42 to 45 at the 5’-end, 3’-end or within the single-stranded nucleic acid.
- a cell comprising the endonuclease defined in any one of items 1 to 34, wherein the endonuclease is heterologous to the cell.
- a method for expressing the endonuclease defined in any one of items 1 to 34 in a cell comprising introducing a nucleic acid encoding the endonuclease into the cell.
- kits comprising the endonuclease defined in any one of items 1 to 34 or the composition of any one of items 35 to 45, and instructions for cleaving single-stranded nucleic acid molecules using the endonuclease.
- kit of item 72 wherein said instructions comprise the method of any one of items 47 to 62.
- FIG. 1A depicts purification of SsnA from /V. meningitidis, N. elongate, R. fells and L. pneumophila along with three mutant proteins from N. meningitidis. Proteins were fused to a N- terminal 6xHis-tag or a N-terminal GST tag by cloning into pET15-MHL and pGEX vectors respectively, then purified to near purity by affinity chromatography.
- FIG. 1B depicts the amino acid sequence of SsnA from N. meningitidis (SEQ ID NO:2) along with its sequence features. Individually mutated residues are highlighted.
- FIG. 2A shows that SsnA is a metal-dependent endonuclease. Results of an endonuclease assay of 1 pM SsnA on a 100 nt specific ssDNA. Rows #1-3 were performed using commercial NEBuffer2.1 (containing 10 mM MgCI 2 ), supplemented with 10 mM EDTA where indicated (#3). Rows #4-5 were performed using a homemade identical buffer lacking MgCI 2 , which was supplemented with 10 mM MgCI 2 where indicated (#5).
- FIG. 2B shows that SsnA only interacts with single-stranded DNA containing a specific sequence (NTS).
- NTS specific sequence
- FIG. 2C shows enzymatic activity of SsnA on truncated ssDNA and RNA. Sequences of 75nt, 37nt and 28nt were assayed, all of them containing a complete NTS sequence. The 37nt RNA sequence is the transcribed equivalent to the 37nt DNA. The 75nt ssDNA(T- U) sequence corresponds to the 75nt ssDNA with uracils instead of thymines (but with desoxyribose sugars).
- FIG. 3 depicts the binding and cleavage of different ssDNA by SsnA. 100 nt oligonucleotides containing the full-length NTS repeat with different flanking sequences were assayed. Each sequence was taken from the /V. meningitidis genome.
- FIG. 3B depicts the assayed ssDNA sequences.
- the NTS repeat region is underlined.
- FIG. 4A depicts the cleavage site determination of SsnA on a 75nt ssDNA containing its target sequence. 5’-label led oligonucleotides of 18 to 25nt were run on a gel next to the results of an endonuclease assay with SsnA.
- FIG. 4B depicts the sequence requirements for binding of cutting of SsnA on ssDNA.
- Gel- Shift (binding activity) and endonuclease (cutting activity) assays were performed on 75 nt ssDNA containing the target sequence (NTS), with single-nucleotide mutations throughout its length. The relative activities were measured for each individually mutated sequence and illustrated using a heat-map. Cleavage activity was normalized to the binding activity since binding is a prerequisite for cutting. Arrows denote the palindrome within the repeated sequence, which forms the stem of a stem-loop structure. Scissors depict the cutting site identified in FIG. 5.
- FIG. 5 depicts the absence of binding activity of SsnA on branched DNA.
- Gel-shift assays (EMSA) of different branched DNA structures in presence of SsnA.
- H Holliday junction
- D D- loop
- F fork
- Y pseudo-Y.
- FIGs. 6A-D show the nuclease activity of SsnA.
- FIG. 6A Metal requirements for the nuclease activity of SsnA. 10mM of each metal was used as the sole metal in the reaction mix. GST-tagged SsnA was assayed with nickel to ensure that the activity seen with His-SsnA was not due to nickel interacting with the purification tag.
- FIG. 6B Magnesium and manganese requirements for the nuclease activity of SsnA.
- FIG. 6C Temperature effect on SsnA’s nuclease activity.
- 0.8 pM of 100 nt ssDNA containing the recognition sequence was used with 1 pM of His-SsnA WT unless otherwise indicated. Boxed images show the specific cleavage products that were obtained and quantified.
- FIG. 6D depicts the cleavage kinetics of SsnA with the indicated concentrations of ssDNA.
- FIG. 6E Dose-response nuclease assay of SsnA depicting its sensitivity.
- FIGs. 7A-C show the maximum likelihood mid-rooted phylogeny of SsnA and the Ssn protein family (GIY-YIG small proteins). Proteins NMV_0044 (SsnA) and NMV_0402 (SsnB, circle) from /V. meningitidis 8013 2C4.3 was blasted against all bacteria with a threshold of 50% identity. Results were screened and curated to only keep proteins of 80-120 amino acids, corresponding to single GIY-YlG-domain proteins.
- FIG. 7B shows the portion of the tree corresponding to SsnA homologs from the Neisseriaceae family.
- FIG. 7C shows the portion of the tree corresponding to SsnB homologs from the Neisseriaceae family.
- FIG. 8 shows the alignment of the putative recognition sequences of SsnA homologs, all neighbouring the SsnA gene in the corresponding species.
- the recognition sequence of SsnANm was blasted against the genomic regions (10kbp) encompassing homologous genes from other species, then aligned with AlignX from the Vector NTI software.
- FIG. 9A shows an alignment of the amino acid sequences of SsnA nucleases from N. meningitidis 80132C4.3 (SsnA(NMV0044), SEQ ID NO:2), Neisseria elongata subsp. glycolytica (SsnA(EFE49965.1), SEQ ID NO:3), Legionella pneumophila subsp. pneumophila str. Paris SsnA(WP_011213498), SEQ ID NO:4) and Rickettsia fells str.
- URRWXCal2 SsnA(WP_011271370), SEQ ID NO:5
- FIG. 9B depicts the percent identity matrix of SsnA homologs aligned above.
- FIG. 9C Sequences located near the genes encoding SsnA homologs and having similarities with the recognition sequence from N meningitidis were identified and aligned using Clustal Omega. conserveed nucleotides are highlighted, with darker tones highlighting the most conserved positions.
- FIG. 10A-C show the nuclease activity of SsnA homologs from other species. Sequences located near the genes encoding these SsnA and having similarities with the recognition sequence from N meningitidis were identified and synthesized with a 5' fluorescent tag, then tested with their respective SsnA using nuclease assays as previously described.
- FIG. 10A depicts the nuclease activity of SsnA from N elongata (EFE49965.1)
- FIG. 10B depicts the nuclease activity of SsnA from R. felis (WP_011271370)
- FIG. 10C depicts the nuclease activity of SsnA from L. pneumophila (WP_011213498).
- FIGs. 11A-11JJJJ depicts the amino acid sequences of various putative Ssn having at least 50% sequence identity with Protein NMV_0044 (SsnA) from N. meningitidis 80132C4.3 (SEQ ID NO:2).
- FIGs. 12A-B show quantitative transformation assays performed on wild-type (WT), SsnA knock-out (KO) and SsnA complemented (Compl) strains of N. meningitidis.
- FIG. 13A transformation of a plasmid that does not harbour the nuclease's recognition sequence.
- FIG. 13B transformation of a plasmid harbouring SsnA's recognition sequence.
- FIGs. 13A-B show an alignment of the amino acid sequences of 79 endonucleases comprising a GIY-YIG domain belonging to the GIY-YIG_unchar_3 conserveed Protein Domain Family (CD10448) (Lu S et al. (2020). "CDD/SPARCLE: the conserved domain database in 2020.”, Nucleic Acids Res. 48(D1):D265-D268)
- the term “about” has its ordinary meaning.
- the term “about” is used to indicate that a value includes an inherent variation of error for the device or the method being employed to determine the value, or encompass values close to the recited values, for example within 10% of the recited values (or range of values).
- the present inventors have identified a family of endonucleases that preferentially bind to and cleave single-stranded DNA. These endonucleases recognize and cleave specific nucleotide sequences in single-stranded nucleic acids (single-stranded DNA). These endonucleases may be useful, for example, in various genetic engineering and molecular biology applications. These endonucleases are short proteins (typically less than 150 amino acids, and preferably less than 140 or 130 amino acids) and comprises a conserved GIY-YIG domain.
- the GIY-YIG domain comprises two short semi-conserved motifs "GIY” and "YIG” in the N-terminal part, followed by an Arg residue in the center and a Glu residue in the C-terminal part.
- the GIY-YIG domain has an a/p-sandwich architecture with a central three-stranded antiparallel p-sheet flanked by three- helices.
- the three-stranded antiparallel p-sheet contains the GIY-YIG sequence elements.
- the present disclosure provides an isolated endonuclease specific for single-stranded nucleic acid molecules (e.g., binds to and cleaves a single-stranded nucleic acid such as singlestranded DNA), the isolated endonuclease having a length of 150 amino acids or less and comprising a GIY-YIG domain.
- the present disclosure also provides a cell comprising an endonuclease specific for singlestranded nucleic acid molecules as described herein.
- the endonuclease is heterogenous to the cell.
- “Heterogenous” as used herein means that the endonuclease is the product of a gene that is not naturally present in the cell.
- the cell is not a Neisseria elongata subsp. glycolytica cell.
- the cell may be a cell from another bacterial species or subspecies or an eucaryotic cell (mammalian cell, human cell, yeast cell, etc.), for example.
- the endonuclease of the present disclosure has a length of 80 to 130 amino acids. In another embodiment, the endonuclease of the present disclosure has a length of 85 to 120 amino acids. In an embodiment, the endonuclease of the present disclosure does not comprise any additional domain, it only consists of a single GIY-YIG domain.
- FIGs 13A-B depicts a sequence alignment of 79 representative GIY-YIG domains from short endonucleases according to the present disclosure. These 79 endonucleases belong to the GIY-YIG_unchar_3 conserveed Protein Domain Family (CD10448) (Lu S et al. (2020). "CDD/SPARCLE: the conserved domain database in 2020.”, Nucleic Acids Res. 48(D1):D265- D268).
- FIG. 2A also depicts the sequences corresponding to the GIY motif, the YIG motif and the conserved Glu residue (putative metal binding site) in SsnA from N. meningitidis (SEQ ID NO:2).
- the endonuclease of the present disclosure comprises a GIY-YIG domain of the formula I:
- X1 is any amino acid, preferably G, Y, W, V, A, F, I, C, H, R, T, S, more preferably Y X2 is any amino acid, preferably V, I, L, T, A, F, more preferably X3 is Y or H, preferably Y B1 is a sequence of 8 to 12 amino acids, preferably 9 to 11 amino acids, for example 10 amino acids
- X4 is Y or H, preferably Y
- X5 is any amino acid, preferably I, L, V, T, A, C, or K, more preferably I, T or , even more preferably I.
- X6 is G or D, preferably G;
- B2 is a sequence of 6 to 15 amino acids, preferably of 6 to 10 amino acids, more preferably of 6 to 8 amino acids, for example 7 amino acids;
- X7 is R or T, preferably R;
- B3 is a sequence of 30 to 40 amino acids, preferably of 35 to 40 amino acids, for example 35, 36 or 37 amino acids;
- X8 is E, I or A, preferably E.
- the endonuclease of the present disclosure comprises a GIY-YIG domain of the formula II:
- X1 , X2, X3, B1 , X4, X5, X6, B2, X7 and X8 are as defined above;
- B4 is a sequence of 1 to 5 amino acids, preferably of 2 to 4 amino acids, more preferably of 3 amino acids;
- X9 is H, Q or Y, preferably H.
- B5 is a sequence of 30 to 38 amino acids, preferably of 30 to 35, more preferably of 31 to 33 amino acids, for example 32 amino acids.
- the endonuclease of the present disclosure comprises a GIY-YIG domain of the formula III:
- X1 , X2, X3, B1 , X4, X5, X6, B2, X7, B4, X9, B5 and X8 are as defined above;
- B6 is a sequence of 15 to 20 amino acids, preferably of 16 to 19 amino acids, more preferably of 18 amino acids;
- X10 is N or K, preferably N.
- the isolated endonuclease of the present disclosure comprises or consists of an amino acid sequence having at least 50% similarity or identity with the any one of the sequences set forth in SEQ ID NOs:2-5, FIGs. 11A-JJJJ (SEQ ID NO:6-2811) and FIGs. 13A-B (SEQ ID NO: 2812-2891).
- nuclease refers to an enzyme having the ability to cleave a single-stranded nucleic acid molecule, such as single-stranded DNA, at or near a specific nucleotide sequence (recognition or restriction site).
- isolated refers to a molecule (endonuclease) that is in a milieu or environment that is different from the natural milieu or environment where it is found in nature (/.e., that has been subjected to human manipulation), for example a endonuclease that has isolated from the natural bacteria that normally expressed it.
- isolated does not necessarily reflect the extent to which the endonuclease has been purified, but indicates that the molecule has been separated in some way from the natural environment where it is normally found.
- An isolated endonuclease may also be produced recombinantly by cloning a nucleic acid encoding the endonuclease in a host cell capable of expressing the endonuclease, and collecting the endonuclease produced.
- Identity refers to sequence identity between two polypeptides. Percent (%) sequence identity with respect to a reference polypeptide sequence is the percentage of amino acid residues in a candidate sequence that are identical with the amino acid residues in the reference polypeptide sequence, after aligning the sequences and introducing gaps, if necessary, to achieve the maximum percent sequence identity, and not considering any conservative substitutions as part of the sequence identity. Alignment for purposes of determining percent amino acid sequence identity can be achieved in various ways that are known for instance, using publicly available computer software such as Clustal Omega, BLAST, BLAST-2, ALIGN or Megalign (DNASTAR) software. Appropriate parameters for aligning sequences are able to be determined, including algorithms needed to achieve maximal alignment over the full length of the sequences being compared.
- Similarity refers to sequence similarity between two polypeptides. Percent (%) sequence similarity with respect to a reference polypeptide sequence is the percentage of amino acid residues in a candidate sequence that are similar (identical or conserved) with the amino acid residues in the reference polypeptide sequence, after aligning the sequences and introducing gaps, if necessary, to achieve the maximum percent sequence similarity, and considering conservative substitutions as part of the sequence similarity.
- the similarity between amino acids can be defined either by their chemical properties (e.g., hydrophobic, hydrophilic, charged, polar, etc.) or based on a PAM matrix.
- Variations in the endonucleases described herein can be made, for example, using any of the techniques and guidelines for conservative and non-conservative mutations set forth, for instance, in U.S. Patent No. 5,364,934.
- Variations may be a substitution, deletion or insertion of one or more codons encoding the endonuclease that results in a change in the amino acid sequence as compared with the native sequence of the endonuclease.
- the variation is by substitution of at least one amino acid with any other amino acid in one or more of the domains of the endonuclease.
- Guidance in determining which amino acid residue may be inserted, substituted or deleted without adversely affecting the desired activity may be found by comparing the sequence of the endonuclease with that of homologous known protein molecules and minimizing the number of amino acid sequence changes made in regions of high homology.
- Amino acid substitutions can be the result of replacing one amino acid with another amino acid having similar structural and/or chemical properties, such as the replacement of a leucine with a serine, i.e., conservative amino acid replacements.
- Insertions or deletions may optionally be in the range of about 1 to 5 amino acids.
- the variation allowed may be determined by systematically making insertions, deletions or substitutions of amino acids in the sequence and testing the resulting variants for activity exhibited by the full-length or mature native sequence (e.g., ability to cleave single-stranded nucleic acid molecules).
- the isolated endonuclease comprises or consists of an amino acid sequence having at least 55% similarity or identity with the any one of the sequences set forth in SEQ ID NOs:2-5, FIGs. 11A-JJJJ (SEQ ID NO:6-2811) and FIGs. 13A-B (SEQ ID NO: 2812- 2891), preferably SEQ ID NOs:2-5.
- the isolated endonuclease comprises or consists of an amino acid sequence having at least 60% similarity or identity with the any one of the sequences set forth in SEQ ID NOs:2-5, FIGs. 11A-JJJJ (SEQ ID NO:6-2811) and FIGs.
- the isolated endonuclease comprises or consists of an amino acid sequence having at least 65% similarity or identity with the any one of the sequences set forth in SEQ ID NOs:2-5, FIGs. 11A-JJJJ (SEQ ID NO:6-2811) and FIGs. 13A-B (SEQ ID NO: 2812-2891), preferably SEQ ID NOs:2-5.
- the isolated endonuclease comprises or consists of an amino acid sequence having at least 70% identity similarity or with the any one of the sequences set forth in SEQ ID NOs:2-5, FIGs.
- the isolated endonuclease comprises or consists of an amino acid sequence having at least 75% similarity or identity with the any one of the sequences set forth in SEQ ID NOs:2-5, FIGs. 11A-JJJJ (SEQ ID NO:6-2811) and FIGs. 13A-B (SEQ ID NO: 2812-2891), preferably SEQ ID NOs:2-5.
- the isolated endonuclease comprises or consists of an amino acid sequence having at least 80% similarity or identity with the any one of the sequences set forth in SEQ ID NOs:2-5, FIGs. 11A-JJJJ (SEQ ID NO:6-2811) and FIGs. 13A-B (SEQ ID NO: 2812-2891), preferably SEQ ID NOs:2-5.
- the isolated endonuclease comprises or consists of an amino acid sequence having at least 85% similarity or identity with the any one of the sequences set forth in SEQ ID NOs:2-5, FIGs. 11A- JJJJ (SEQ ID NO:6-2811) and FIGs.
- the isolated endonuclease comprises or consists of an amino acid sequence having at least 90% similarity or identity with the any one of the sequences set forth in SEQ ID NOs:2-5, FIGs. 11A-JJJJ (SEQ ID NO:6-2811) and FIGs. 13A-B (SEQ ID NO: 2812-2891), preferably SEQ ID NOs:2-5.
- the isolated endonuclease comprises or consists of an amino acid sequence having at least 95% similarity or identity with the any one of the sequences set forth in SEQ ID NOs:2-5, FIGs.
- the isolated endonuclease comprises or consists of an amino acid sequence having at least 96% similarity or identity with the any one of the sequences set forth in SEQ ID NOs:2-5, FIGs. 11A-JJJJ (SEQ ID NO:6-2811) and FIGs. 13A-B (SEQ ID NO: 2812-2891), preferably SEQ ID NOs:2-5.
- the isolated endonuclease comprises or consists of an amino acid sequence having at least 97% similarity or identity with the any one of the sequences set forth in SEQ ID NOs:2-5, FIGs. 11A-JJJJ (SEQ ID NO:6-2811) and FIGs. 13A-B (SEQ ID NO: 2812-2891), preferably SEQ ID NOs:2-5.
- the isolated endonuclease comprises or consists of an amino acid sequence having at least 98% similarity or identity with the any one of the sequences set forth in SEQ ID NOs:2-5, FIGs. 11A-JJJJ (SEQ ID NO:6-2811) and FIGs.
- the isolated endonuclease comprises or consists of an amino acid sequence having at least 99% similarity or identity with the any one of the sequences set forth SEQ ID NOs:2-5, FIGs. 11A-JJJJ (SEQ ID NO:6-2811) and FIGs. 13A-B (SEQ ID NO: 2812-2891), preferably SEQ ID NOs:2-5.
- the isolated endonuclease comprises or consists of or consists one of the amino acid sequences set forth in any one of SEQ ID NOs:2-5, FIGs. 11A-JJJJ (SEQ ID NO:6-2811) and FIGs. 13A-B (SEQ ID NO: 2812-2891), preferably SEQ ID NOs:2-5.
- FIG. 9A depicts an alignment of the amino acid sequences of SEQ ID NOs:2-5, with the residues conserved between the sequences indicated by asterisks (*), the residues with strong similarity (PAM250 MATRIX score between amino acids of greater than 0.5) indicated by colons (:), and the residues with weak similarity (PAM250 MATRIX score between amino acids of 0.5 or less) indicated by dots (.).
- the isolated endonuclease comprises the conserved residues in SEQ ID NOs:2-5.
- the isolated endonuclease comprises the conserved residues and the residues with strong similarity in SEQ ID NOs:2-5.
- the isolated endonuclease comprises the conserved residues and the residues with strong and weak similarities in SEQ ID NOs:2-5.
- FIGs. 13A-B depict an alignment of the amino acid sequences of the GIY-YIG domains from 79 endonucleases belonging to the GIY-YIG_unchar_3 conserveed Protein Domain Family (CD 10448), with the residues conserved between the sequences indicated by a # sign above the sequences.
- the isolated endonuclease comprises the conserved residues in the sequences depicted in FIGs. 13A-B.
- the isolated endonuclease may further comprise additional amino acids at its amino- (N) and/or carboxy (C)-terminal end.
- the isolated endonuclease may be fused to a peptide or polypeptide, for example a peptide or polypeptide that may be used as an affinity tag to facilitate the detection and/or purification of the endonuclease.
- affinity tags include polyhistidine (His) tags, polyarginine tags, glutathione-S-transferase (GST) tags, FLAG tags, streptavidin-binding peptide or streptavidin-binding protein (SBP) tags, streptavidin-binding tag (Strep-tag), calmodulin-binding peptide (CBP) tags, chitin-binding tags, Maltose-binding protein (MBP) tags, and natural histidine affinity tags (HAT).
- His polyhistidine
- polyarginine tags glutathione-S-transferase (GST) tags
- FLAG tags FLAG tags
- streptavidin-binding peptide or streptavidin-binding protein (SBP) tags streptavidin-binding tag
- CBP calmodulin-binding peptide
- chitin-binding tags chitin-binding tags
- Maltose-binding protein (MBP) tags and natural
- Such a linker may be a peptide/polypeptide linker comprising one or more amino acids or another type of chemical linker (e.g., a carbohydrate linker, a lipid linker, a fatty acid linker, a polyether linker, PEG, etc.) having suitable flexibility and stability to allow the endonuclease to adopt a proper conformation.
- the linker may comprise at least 2, 3 or 4 amino acids.
- the linker may comprise about 100, 90, 80, 70, 60 or 50 amino acids or less, and preferably 20, 15 or 10 amino acids or less.
- the isolated endonuclease of the disclosure may be produced by expression in a host cell comprising a nucleic acid encoding the isolated endonuclease (recombinant expression) or by chemical synthesis (e.g., solid-phase peptide synthesis).
- Peptides and polypeptides can be readily synthesized by manual and automated solid phase procedures well known in the art. Suitable syntheses can be performed for example by utilizing "t-Boc" or "Fmoc" procedures. Techniques and procedures for solid phase synthesis are described in for example Solid Phase Peptide Synthesis: A Practical Approach, by E. Atherton and R. C. Sheppard, published by IRL, Oxford University Press, 1989.
- polypeptides may be prepared by way of segment condensation, as described, for example, in Liu et al., Tetrahedron Lett. 37: 933-936, 1996; Baca et al., J. Am. Chem. Soc. 117: 1881-1887, 1995; Tarn et al., Int. J. Peptide Protein Res. 45: 209-216, 1995; Schnolzer and Kent, Science 256: 221-225, 1992; Liu and Tarn, J. Am. Chem. Soc. 116: 4149-4153, 1994; Liu and Tarn, Proc. Natl. Acad. Sci. USA 91 : 6584-6588, 1994; and Yamashiro and Li, Int. J. Peptide Protein Res. 31 : 322-334, 1988).
- Other methods useful for synthesizing polypeptides are described in Nakagawa et al., J. Am. Chem. Soc. 107: 7087-7092, 1985.
- the isolated endonuclease may also be prepared using recombinant DNA technology using standard methods. Accordingly, in another aspect, the disclosure further provides a nucleic acid (e.g., mRNA, cDNA) encoding the above-mentioned endonuclease. The disclosure also provides a vector comprising the above-mentioned nucleic acid. In yet another aspect, the present disclosure provides a cell (e.g., a host cell) comprising the above-mentioned nucleic acid and/or vector. The disclosure further provides a recombinant expression system, vectors and host cells, such as those described above, for the expression/production of an endonuclease of the disclosure, using for example culture media, production, isolation and purification methods well known in the art.
- a nucleic acid e.g., mRNA, cDNA
- the disclosure also provides a vector comprising the above-mentioned nucleic acid.
- a cell e.g.,
- the endonuclease of the disclosure can be purified by many techniques of peptide/polypeptide purification well known in the art, such as reverse phase chromatography, high performance liquid chromatography (HPLC), ion exchange chromatography, size exclusion chromatography, affinity chromatography, gel electrophoresis, and the like.
- HPLC high performance liquid chromatography
- ion exchange chromatography size exclusion chromatography
- affinity chromatography gel electrophoresis, and the like.
- affinity chromatography purification any ligand or antibody that specifically binds the endonuclease (or to an affinity tag fused to the endonuclease) may for example be used.
- the present disclosure also provides a composition
- a composition comprising (i) an isolated endonuclease specific for single- stranded nucleic acid molecules as described herein, and (ii) an aqueous saline solution or buffer.
- composition according to the present disclosure an aqueous saline solution or buffer.
- aqueous saline solutions or buffers include ingredients that stabilize the endonuclease and provide suitable conditions for the enzymatic activity of the endonuclease (e.g., conditions that permit the cleavage of single-stranded nucleic acids).
- the aqueous saline solution or buffer present in the composition according to the present disclosure may include suitable salts, buffering agents, minerals, co-factors, stabilizing agents, anti-oxidants, redox reagent, preservatives, etc.
- the composition comprises a buffering agent.
- the buffering agent is useful to keep the composition at a desired pH.
- Buffering agents are well known in the art and include potassium, acetate, citrate, acetate, phosphate, carbonate, succinate, histidine, borate, maleate, tris(hydroxymethyl) aminomethane (Tris), BIS-Tris, piperazine-N,N'-bis(2-ethanesulfonic acid) (PIPES), 2-(N-morpholino)ethanesulfonic acid (MES), (3-(N-morpholino)propanesulfonic acid) (MOPS), N-(2-Acetamido)-2-aminoethanesulfonic acid (ACES), (4-(2-hydroxyethyl)-1- piperazineethanesulfonic acid) (HEPES), magnesium and hydrochloride buffers.
- the composition comprises a Tris buffer.
- the Tris buffer is a Tris-HCI or a Tris-acetate buffer.
- the buffering agent is at a concentration of about 0.1 mM to 1 M.
- the buffering agent is at a concentration of about 1 mM to about 500 mM, about 1 mM to about 200 mM, about 1 mM to about 100 mM, about 5 mM to about 100 mM, about 5 mM to about 75 mM, about 5 mM to about 50 mM, or about 5 mM to about 25 or 20 mM.
- the buffering agent is at a concentration of about 10 mM.
- the buffering agent has a pH of about 5 to about 10. In further embodiments, the buffering agent has a pH of about 6 to about 9, of about 6.5 to about 9, of about 7 to about 9, of about 7.5 to about 8.5, or of about 7.6 to about 8.2. In an embodiment, the buffering agent has a pH of about 7.9 or 8.0. In an embodiment, the pH is the pH at a temperature of about 20 to about 40°C. In an embodiment, the pH is the pH at a temperature of about 20 or 25°C. In an embodiment, the pH is the pH at a temperature of about 37°C.
- the composition comprises a salt, such as a metal salt.
- a salt such as a metal salt.
- Common saltforming cations include ammonium (NH 4 + ), manganese, nickel, calcium, iron, magnesium, potassium, sodium and copper.
- the metal salt is a magnesium salt, manganese salt or zinc salt.
- Common salt-forming anions include acetate, carbonate, chloride, citrate, fluoride, nitrate, nitrite, oxide, phosphate and sulfate.
- salts include magnesium chloride (MgCI 2 ), magnesium acetate, potassium acetate (KCH 3 CO 2 ), potassium chloride (KCI), sodium acetate (CH 3 COONa), sodium chloride (NaCI), calcium chloride, zinc chloride, manganese sulfate, manganese chloride, nickel chloride, nickel acetate, and sodium sulfate (Na 2 SO 4 ).
- the salt comprises a magnesium, manganese, nickel and/or sodium cation.
- the salt comprises a chloride anion.
- the composition comprises KCI.
- the composition comprises KCI and NaCI.
- the concentration of salt in the composition is about 1 mM to about 500 mM.
- the concentration of salt in the composition is about 10 mM to about 300 mM, about 20 mM to about 200 mM, about 20 mM to about 150 mM, about 30 mM to about 150 mM.
- the composition comprises a salt comprising a magnesium cation (e.g., MgCI 2 ) at a concentration of about 1 mM to about 100 mM, for example about 1 mM to about 50 mM, about 5 mM to about 20 mM, or about 5 to about 15 mM.
- the composition comprises a salt comprising a magnesium cation (e.g., MgCI 2 ) at a concentration of about 10 mM.
- the composition comprises a salt comprising a sodium cation (e.g., NaCI) at a concentration of about 1 mM to about 200 mM, about 10 mM to about 150 mM, about 10 mM to about 100 mM, about 25 mM to about 75 mM.
- the composition comprises a salt comprising a sodium cation (e.g., NaCI) at a concentration of about 50 mM.
- the composition comprises a stabilizing agent, such as a protein.
- the protein is albumin, such as bovine serum albumin (BSA).
- BSA bovine serum albumin
- the stabilizing agent is at a concentration of about 1 pg/ml to about 1 mg/ml. In an embodiment, the stabilizing agent is at a concentration of about 10 pg/ml to about 500 pg/ml. In an embodiment, the stabilizing agent is at a concentration of about 50 pg/ml to about 200 pg/ml. In an embodiment, the stabilizing agent is at a concentration of about 50 pg/ml to about 150 pg/ml. In an embodiment, the stabilizing agent is at a concentration of about 80 to about 120 pg/ml, for example about 100 pg/ml.
- the composition further comprises additional ingredients, such as detergents (e.g., non-ionic detergents like Triton® X-100 or Tween® 20) and/or redox reagents (DTT, beta-mercaptoethanol).
- detergents e.g., non-ionic detergents like Triton® X-100 or Tween® 20
- DTT redox reagents
- beta-mercaptoethanol redox reagents
- the composition may further comprise a metalchelating agent, such as ethylenediaminetetraacetic acid (EDTA).
- EDTA ethylenediaminetetraacetic acid
- Aqueous solutions/buffers for endonucleases are available commercially from several providers such as New England Biolabs Inc., Pomega and Thermo Scientific. Examples from New England Biolabs Inc. include NEBuffer 1 (10 mM Bis-Tris-Propane-HCI, 10 mM MgCI 2 , 1 mM DTT, pH 7.0@25°C); NEBuffer 1.1 (10 mM Bis-Tris-Propane-HCI, 10 mM MgCI 2 , 100 pg/ml BSA.
- NEBuffer 2.1 50 mM NaCI, 10 mM Tris-HCl, 10 mM MgCI 2 ,100 pg/ml BSA, pH 7.9@25°C
- NEBuffer 3.1 100 mM NaCI, 50 mM Tris-HCl. 10 mM MgCI 2 ,100 pg/ml BSA, pH 7.9@25°C
- NEBuffer 4 50 mM Potassium acetate, 20 mM Tris-acetate, 10 mM Magnesium Acetate, 1 mM DTT, pH 7.9@25°C
- CutSmart Buffer 50 mM Potassium Acetate, 20 mM Tris-acetate.
- Examples from Promega include Buffer A (6 mM Tris-HCl (pH 7.5 at 37°C), 6 mM MgCI 2 , 6 mM NaCI, 1 mM DTT), Buffer B (6 mM Tris-HCl (pH 7.5 at 37°C), 6 mM MgCI 2 , 50 mM NaCI, 1 mM DTT), Buffer C (10 mM Tris-HCl (pH 7.9 at 37°C), 10 mM MgCI 2 , 50 mM NaCI, 1 mM DTT), Buffer D (6 mM Tris-HCl (pH 7.9 at 37°C), 6 mM MgCI 2 , 150 mM NaCI, 1 mM DTT), Buffer E (6 mM Tris-HCl (pH 7.5 at 37°C), 6 mM MgCI 2 , 100 mM NaCI, 1 mM DTT), Buffer F (10 mM Tris-HCI (pH 8.5 at
- the composition comprises an aqueous solution/buffer comprising: from about 1 to about 100 mM of a buffering agent, such as a Tris-based buffering agent (e.g., Tris-HCI), having a pH of about 7 to about 9; from about 1 to about 100 mM of a metal salt, such as a salt comprising a magnesium and/or sodium cation (NaCI and/or KCI); and from about 10 pg/ml to about 500 pg/ml of a stabilizing agent, such as a protein (e.g., albumin).
- a buffering agent such as a Tris-based buffering agent (e.g., Tris-HCI)
- a metal salt such as a salt comprising a magnesium and/or sodium cation (NaCI and/or KCI)
- a stabilizing agent such as a protein (e.g., albumin).
- the composition comprises an aqueous solution/buffer comprising: from about 5 to about 20 mM of a buffering agent, such as a Tris-based buffering agent (e.g., Tris-HCI), having a pH of about 7.5 to about 8.5; from about 10 to about 100 mM of a metal salt, such as a metal salt comprising a magnesium and/or sodium cation (NaCI and/or KCI); and from about 50 pg/ml to about 200 pg/ml of a stabilizing agent, such as a protein (e.g., albumin).
- a buffering agent such as a Tris-based buffering agent (e.g., Tris-HCI)
- a metal salt such as a metal salt comprising a magnesium and/or sodium cation (NaCI and/or KCI)
- a stabilizing agent such as a protein (e.g., albumin).
- the composition comprises an aqueous solution/buffer comprising: about 10 mM of a buffering agent, such as a Tris-based buffering agent (e.g., Tris- HCI), having a pH of about 7.9 or 8.0; about 50 mM of NaCI; and about 100 pg/ml of albumin (e.g., BSA).
- a buffering agent such as a Tris-based buffering agent (e.g., Tris- HCI), having a pH of about 7.9 or 8.0; about 50 mM of NaCI; and about 100 pg/ml of albumin (e.g., BSA).
- the composition comprises an aqueous solution/buffer comprising: about 10 mM of a buffering agent, such as a Tris-based buffering agent (e.g., Tris- HCI), having a pH of about 7.9 or 8.0; about 50 mM of NaCI; about 10 mM of MgCI 2 ; and. about 100 pg/ml of BSA.
- a buffering agent such as a Tris-based buffering agent (e.g., Tris- HCI)
- Tris- HCI Tris- HCI
- the present disclosure also provides a mixture comprising the above-described isolated endonuclease or composition and a single-stranded nucleic acid molecule (e.g., single-stranded DNA).
- a single-stranded nucleic acid molecule e.g., single-stranded DNA
- the present disclosure also provides a method for cleaving a single-stranded nucleic acid molecule comprising contacting the single-stranded nucleic acid molecule with the isolated endonuclease or composition defined herein under conditions suitable for cleavage of the singlestranded nucleic acid molecule by the isolated endonuclease.
- the results presented in the Examples below show that the endonucleases according to the present disclosure recognize specific nucleotide sequences within the single-stranded nucleic acid molecules.
- the single-stranded nucleic acid molecule cleaved by the endonuclease comprises a nucleotide sequence having at least 50% sequence identity with the sequence: GTCATTCCCnnnnnnnnGGGAATC (SEQ ID NO:2917) or GUCAUUCCCnnnnnnnnGGGAAUC (SEQ ID NO: 2918).
- the single-stranded nucleic acid molecule cleaved by the endonuclease comprises a nucleotide sequence having at least 50% sequence identity with the sequence: GTCATTCCCGCGAAAGCGGGAATC (SEQ ID NO: 2919) or
- GUCAUUCCCGCGAAAGCGGGAAUC SEQ ID NO: 2920.
- the single-stranded nucleic acid molecule comprises the following nucleotide sequence: GTCANNCCNGNNNANNCNGGNNNC (SEQ ID NO: 2921) or GUCANNCCNGNNNANNCNGGNNNC (SEQ ID NO: 2922).
- the single-stranded nucleic acid molecule comprises the following nucleotide sequence: GTCAYBCCMGYRHAVRCKGGVRNC (SEQ ID NO: 2923) or GUCAYBCCMGYRHAVRCKGGVRNC (SEQ ID NO: 2924).
- the single-stranded nucleic acid molecule comprises a sequence having at least 50, 60, 70, 80, 90, 95 or 100% identity with one of the following nucleotide sequences: • GTCATCCCCGCGCAGGCGGGGACCC (SEQ ID NO: 2925) or
- GUCAUCCCCGCGGCAGGCGGGACCC SEQ ID NO: 2926
- GUCAUUCCCGCGCAGGCGGGAAUCC SEQ ID NO: 2928
- GUCAUUCCCGCGAAAGCGGGAAUCC SEQ ID NO: 2930
- GUCAUUCCCGCGAAGGCGGGAAUCC SEQ ID NO: 2932
- GUCAUUCCCGCGCAGGCGGGAAUCC SEQ ID NO: 2934
- GUCAUCCCCGCGCAGGCGGGGACCC SEQ ID NO: 2936
- GUCAUUCCCGCGAAAGCGGGAAGCC SEQ ID NO: 2940
- GUCAUUCCCGCGCAGGCGGGAAUCC SEQ ID NO: 2942
- GUCAUUCCCGUGCACACGGGAAUCC SEQ ID NO: 2944
- GUCAUGCCCGCAGGCGGGCAUCC SEQ ID NO: 2946
- GUCAUCCCCGCGAAGGCGGGGAUCC SEQ ID NO: 2948
- GUCAUUCCCGCGAAAGCGGGAAUCC SEQ ID NO: 2950
- GUCAUUCCCGCGAAGGCGGGAAUCC SEQ ID NO: 2952
- GUCAUUCCCGCGCAGGCGGGAAUCU SEQ ID NO: 2954
- GUCAUUCCCGCGUAGGCGGGAAUCC SEQ ID NO: 2956
- GUCACUCCCGCGAAGGCGGGAGUCC SEQ ID NO: 2958
- GUCACCCCAGCGAAAGCUGGGGUCC SEQ ID NO: 2960; • GTCATTCCCGCACAGGCGGGAATCC (SEQ ID NO: 2961) or
- GUCAUUCCCGCACAGGCGGGAAUCC SEQ ID NO: 2962
- GUCAUUCCCGCGCAGGCGGGAAUCU SEQ ID NO: 2964.
- the genes encoding the endonucleases according to the present disclosure are typically surrounded by one or several repeats of their own recognition sequences in the genome of the bacteria.
- the skilled person would be able to easily identify the recognition sequence of any endonuclease according to the present disclosure by identifying repeating nucleotide sequences/motifs located near (i.e., just before/upstream and/or after/downstream) the gene encoding the endonuclease.
- recognition sequence is expected to have some level of sequence identity with the recognition sequences disclosed herein.
- the method according to the present disclosure comprises incubating the single-stranded nucleic acid molecule with the composition defined herein for a period of time and under conditions suitable for cleavage of the single-stranded nucleic acid molecule by the endonuclease.
- the period of time is at least 5 minutes. In further embodiments, the period of time is at least 10 or 15 minutes. In yet further embodiments, the period of time is at least 20, 30 or 45 minutes. In embodiments, the period of time is from about 15 minutes to about 2 hours, from about 20 minutes to about 90 minutes, from about 30 minutes to about 60 minutes, or from about 45 to about 60 minutes.
- the conditions for incubation comprise a temperature of about 10, 15 or 20°C to about 60, 55 or 50°C. In further embodiments, the conditions for incubation comprise a temperature of about 25 or 30°C to about 40 or 45°C, such as a temperature of about 35 to about 40°C, e.g., about 36, 37 or 38°C.
- the conditions for incubation comprise the presence of a suitable amount of a metal, such as a divalent metal.
- a metal such as a divalent metal.
- divalent metals include magnesium, manganese, cadmium, calcium, cobalt, nickel, zinc, iron and copper.
- the divalent metal is magnesium, manganese or nickel, preferably magnesium. If the divalent metal is not present in the initial composition comprising the endonuclease, a suitable amount of the divalent metal is added to the reaction mixture prior to and/or during the incubation period.
- the divalent metal may be in the form of a salt, such as the salts listed above.
- the divalent metal is magnesium and is in the form of magnesium chloride (MgCI 2 ).
- the concentration of metal salt (e.g., MgCI 2 ) present during the incubation period is about 1 mM to about 100 mM, for example about 1 mM to about 50 mM, about 5 mM to about 20 mM, about 5 to about 15 mM, or about 10 mM.
- the conditions for incubation comprise a pH of about 5 to about 10.
- the conditions for incubation comprise a pH of about 6 to about 9, of about 6.5 to about 9, of about 7 to about 9, of about 7.5 to about 8.5, or of about 7.6 to about 8.2.
- the conditions for incubation comprise a pH of about 7.9 or 8.0.
- the amount of the endonuclease relative to that of the single-stranded nucleic acid molecule may be adjusted to obtain a suitable cleavage efficiency.
- the [concentration of endonuclease] I [concentration of single-stranded nucleic acid molecule] ratio is at least 0.00005, 0.0001 , 0.001 , 0.01 , 0.05, 0.1 or 0.5.
- the [concentration of endonuclease] I [concentration of single-stranded nucleic acid molecule] ratio is from 0.01 to 100, from 0.05 to 50 or from 0.1 to 10.
- the present disclosure provides a method for rendering a single-stranded nucleic acid susceptible to cleavage by the endonuclease described herein, the method comprising incorporating the nucleotide sequence defined above (recognition motif) into the single-stranded nucleic acid.
- the incorporation of the nucleotide sequence (recognition motif) may be achieved by adding the nucleotide sequence defined above (or a portion thereof) at the 5’-end, 3’-end or within the single-stranded nucleic acid, and/or by introducing one or more mutations (e.g., substitutions) within the sequence of the single-stranded nucleic acid to obtain the desired nucleotide sequence (recognition motif).
- nucleic acids are well known in the art and include, for example, cassette mutagenesis, PCR site-directed mutagenesis and genome-editing technologies using nucleases such as zinc finger nucleases (ZPNs) (Gommans et al., J.
- ZPNs zinc finger nucleases
- the present disclosure also provides a method for expressing the endonuclease defined herein in a cell, the method comprising introducing a nucleic acid encoding the endonuclease into the cell.
- the cell may be a procaryotic or eucaryotic cell.
- the nucleic acid may be an mRNA or cDNA molecule, naked or incorporated into a vector or plasmid, and it may be incorporated into the cell using any suitable methods for introducing nucleic acids into a cell (e.g., transfection, transformation, etc.).
- the cell comprises a single-stranded nucleic acid that is cleaved by the endonuclease.
- kits or commercial package comprising the endonuclease or composition described herein.
- the kit or package further comprises instructions setting forth a method for cleaving a single-stranded nucleic acid with the endonuclease, such as the method described herein.
- the kit or package may further comprise various components such as solutions or buffers (e.g., a reaction buffer) such as those described herein, containers, vials, etc.
- Example 1 Materials and methods
- FIG. 1A Gene expression and purification (FIG. 1A).
- the NMV0044 gene was amplified from N. meningitidis 8013 2C4.3 by Phusion PCR using primers containing the Nde ⁇ and Xho ⁇ restriction sites, then cloned into pET15-MHL expression vector, generating a recombinant GIY-YIG small protein A (SsnA) with a 6xHis-tag in N-terminal.
- the E64A, Y6A and Y17A mutants were also generated using site-directed mutagenesis by PCR in conserved residues expected to be critical for enzymatic activity (FIG. 1B).
- NMV_RS00225 Accession_number: WP_002216166.1 Protjocus : CAX49033. Embl_accession: FM999788.1.
- the SsnA gene is known as NMB0047 or NMB_RS00250 (CDS WP_002216166.1) in the reference strain N. meningitidis MC58, and NMA0292 or NMA_RS01540 in N. meningitidis Z2491 (CDS WP_002246543.1).
- SsnA(NMV0044) protein Nm2C4.3 (SEQ ID NO:2) MQPAVYILASQRNGTLYIGVTSDLVQRIYQHREHLIEGFTSRYNVTMLVWYELHPTMESAITREK
- NEIELOOT_01219 (EFE49965.1) was taken from Neisseria elongata subsp. glycolytica, WP_011213498 was taken from Legionella pneumophila subsp. pneumophila str Paris and WP_011271370 was taken from Rickettria fells URRWXCal2.
- Expression and purification of 6xHis-recombinant proteins were done by affinity chromatography using a nickel resin.
- Expression and purification of GST-recombinant proteins were done by affinity chromatography using a glutathione resin (FIG. 1A).
- Electrophoretic mobility shift assay (EMSA). Gel shift assays, or EMSA, were performed by diluting the proteins in Diluent A (NEB) to the indicated working concentrations. 5’carboxyfluorescin-tagged oligos corresponding to genomic regions of N. meningitidis MC58 were synthesized from Sigma. When needed, complementary oligonucleotides were annealed by mixing equimolar amounts in annealing buffer (10 mM Tris-HCI pH8, 50 mM NaCI, 1 mM EDTA), incubating them 5 minutes at 95°C and letting them slowly cool down.
- annealing buffer (10 mM Tris-HCI pH8, 50 mM NaCI, 1 mM EDTA
- proteins were mixed with the fluorescent oligonucleotides in a reaction buffer containing 50 mM NaCI, 10 mM Tris-HCI, 100 pg/ml BSA, pH7.9. The mixes were incubated at 37°C for 30 minutes before adding native loading dye. Samples were resolved on native 10% TBE-acrylamide gels and imaged with a Typhoon FLA9500 scanner. For branched DNA binding assays, a similar approach was used but the gel was stained with GelStain (Biotium) and imaged on a GelDoc (BioRad) since the oligonucleotides were not fluorescently labelled.
- GelStain Biotium
- GelDoc BioRad
- Nuclease assays were performed similarly to gel-shift assays, with the addition of 10 mM MgCI 2 in the reaction buffer. Reactions were stopped by adding formamide loading dye and incubating 3 minutes at 95°C. Samples were resolved on denaturing 17.5% TBE- Urea (8M) acrylamide gels, and imaged with a Typhoon FLA9500 scanner. The sequences of the DNA constructs used in the studies described herein are depicted in the table below.
- the NMV0044 gene from Neisseria meningitidis 8013 2C4.3 was cloned in pET15-MHL, allowing its expression with a 6xHis-tag in N-terminal. Purification was done by affinity chromatography with a nickel resin. The resulting protein was diluted in reaction buffer and used directly for in vitro assays to determine its enzymatic activity.
- SsnA is a specific single-stranded nuclease
- SsnA possesses a single functional domain, belonging to the GIY-YIG nuclease superfamily. Its nuclease activity was therefore tested on different nucleic acids (FIGs. 2B-C). Using dsDNA does not reveal significant binding nor cleavage activity, even when it contains the recognition pattern (FIG. 2B, dsDNA). On the other hand, 100 % of a 100 nt ssDNA containing the recognition pattern is cleaved at a unique specific position, meaning it is an endonuclease (FIG. 2B, ssDNA). However, SsnA has no activity at all on the exact reverse complement of the ssDNA that is efficiently cleaved (FIG.
- SsnA is a specific single-stranded endonuclease with no detectable activity on dsDNA.
- the single GIY-YIG domain of SsnA mediates both its cutting and binding activities, without the need for any co-interacting protein or complex.
- GST-tagged SsnA exhibits the same ssDNA binding and nuclease activity that the His-tagged protein (FIG. 2B), suggesting that the addition of tags or protein domains do not alter the enzymatic activity of SsnA, making it modulable.
- SsnA is a metal-dependent nuclease. SsnA cannot bind dsDNA, or ssDNA without the recognition sequence, which explains why it cannot cleave these substrates. It binds 100 % of ssDNA harboring the recognition sequence.
- the glutamic acid residue in position 64 of SsnA is well conserved within the GIY-YIG superfamily, where it often corresponds to a metal cofactor (e.g., magnesium) binding site (FIG. 1B).
- the E64A mutant was therefore expressed and purified along with the WT protein (FIG. 1A), and its cutting and binding activity was assessed against ssDNA harboring the recognition sequence (FIG. 3).
- SsnA E64A can efficiently bind ssDNA, but has completely lost its nuclease activity (FIG. 3). This conserved amino acid is therefore confirmed as the magnesium binding site allowing cleavage of single-stranded nucleic acids.
- the tyrosine residues in position 6 and 17 of SsnA are also well conserved within the GIY-YIG superfamily as part of their catalytic core (FIG. 1B).
- the Y6 and Y17 mutants were also expressed, purified (FIB. 1A), and assayed for their ssDNA binding and nuclease activities (FIG. 2B).
- mutation of amino acids Y6 orY17 completely prevents the nuclease activity of SsnA.
- both mutants show reduced binding to ssDNA, indicating that they are involved in both binding and cutting of ssDNA.
- the specific cutting site of SsnA on ssDNA was determined precisely by running the cleaved product from a 75 nt ssDNA next to fluorescent oligonucleotides of increasing length (FIG. 4A), which confirmed that SsnA cuts several nucleotides upstream of the NTS repeated sequence (FIG. 4B). Since the binding and cleavage specificities of SsnA are not identical, individual nucleotides in and around the repeated sequence from the same 75 nt ssDNA were mutated and the binding and cleavage specificities of SsnA on the mutated ssDNA were tested (FIG. 4B). Binding activity was completely lost when the ssDNA mutation occurred immediately downstream of the palindromic region.
- binding activity was significantly reduced in mutated ssDNA with mutations located within the palindromic region of the repeated sequence, suggesting that SsnA binds the stem part of the stem-loop formed by this repeated sequence.
- the cleavage activity of SsnA was highly dependent on the sequence immediately upstream of the palindromic region, but still within the conserved part of the repeated sequence. Therefore, SsnA binds to the ssDNA hairpin formed by the NTS repeated sequence, and needs to interact with the sequence immediately upstream of the hairpin to be able to cleave ssDNA a few nucleotides upstream.
- SsnA nuclease activity was determined using the 100 nt ssDNA containing its recognition sequence (FIG. 6).
- SsnA requires a metal cofactor such as magnesium, manganese or nickel to cleave ssDNA, with an optimal activity at 10 mM MgCI 2 .
- FIG. 6A-B Manganese however only allows partial nuclease activity.
- the enzyme is active in temperatures ranging 14 to 54°C, with an optimal temperature of around 37°C (FIG. 6C).
- Most of the ssDNA substrate is cleaved within the first 15 minutes of reaction (FIG. 6D).
- ssDNA cleavage is dosedependent and requires subnanomolar amounts of protein (FIG. 6E).
- Example 4 SsnA belongs to a novel family of single-stranded specific endonucleases
- SsnA shows two distinct branches or clusters of Ssn proteins. These subfamilies are referred to herein as SsnA and SsnB. Some strains or species of bacteria may encode for multiple Ssn homologs. The sequences of several representative Ssn proteins having at least 50% identity with SsnA are depicted in FIGs. 11A-JJJJ.
- SsnA In /V. meningitidis and closely related species such as N. elongate, the gene encoding SsnA is surrounded by several repeats of its own recognition sequence. Although the enzyme cannot cut genomic DNA in a double-stranded form, SsnA might also act as a mobile genetic element, similarly to transposases. It is shown here that the genes encoding SsnA homologs in unrelated species are located near highly similar sequences, which could be an indication of their nuclease specificity (FIG. 8).
- SsnA homologs from three unrelated bacterial species, Neisseria elongata, Rickettsia fells and Legionella pneumophila (sequence alignment in FIG. 9A), were expressed and purified. Apart from the N. elongata protein which shares 82% identity with the N. meningitidis protein, these SsnA homologs share relatively little (47-60%) identity with each other (FIG. 9B) and are spread across the Ssn protein phylogeny. They are therefore representative of the diversity of this novel Ssn protein family. They all contain the conserved features of the GIY-YIG domain, namely the N-terminal tyrosine (Y) residues, the central arginine (R) residue as well as the C-terminal glutamic acid (E) residue.
- Ssn proteins can indeed be grouped as a novel family of enzymes, more specifically specific single-stranded endonucleases (Ssn) with a potentially wide array of unique specificities.
- SsnA is able to cleave ssDNA in cellulo
- the transformations mixes are then serially diluted, and the appropriate dilutions are plated on non-selective media (GCB agar) and selective media containing either 5 pg/ml chloramphenicol (Cm) or 75 pg/ml spectinomycin (Sp). Colonies are counted after an overnight incubation at 37C with 5% CO 2 , and the rate of transformation is determined, corresponding to the number of resistant CFUs divided by the number of total CFUs.
- GCB agar non-selective media
- selective media containing either 5 pg/ml chloramphenicol (Cm) or 75 pg/ml spectinomycin (Sp).
- Two plasmids were assayed, each containing an antibiotic resistance gene (chloramphenicol or spectinomycin) flanked by sequences homologous to the N. meningitidis genome, which allow integration to the host genome by double recombination.
- One plasmid did not contain the recognition sequence of SsnA, while the other one contains several repeats of it in the homologous regions.
- Neisseria species are naturally competent, meaning they can readily update DNA from their environment by transformation and integrate it into their own genome if there is sufficient homology.
- ssDNA only one strand of DNA
- FIG. 12A When transforming N. meningitidis mutant strains with a plasmid that does not contain SsnA's recognition sequence, a slight decrease of transformation efficiency was observed in the SsnA KO strain (FIG. 12A). To the contrary, if the plasmid contains SsnA's recognition sequence, the knock-out strain is transformed much more efficiently than the strains expressing SsnA (FIG. 12B).
Landscapes
- Life Sciences & Earth Sciences (AREA)
- Chemical & Material Sciences (AREA)
- Health & Medical Sciences (AREA)
- Organic Chemistry (AREA)
- Engineering & Computer Science (AREA)
- Zoology (AREA)
- Wood Science & Technology (AREA)
- Bioinformatics & Cheminformatics (AREA)
- Genetics & Genomics (AREA)
- Molecular Biology (AREA)
- General Health & Medical Sciences (AREA)
- General Engineering & Computer Science (AREA)
- Biochemistry (AREA)
- Microbiology (AREA)
- Biotechnology (AREA)
- Chemical Kinetics & Catalysis (AREA)
- General Chemical & Material Sciences (AREA)
- Medicinal Chemistry (AREA)
- Biomedical Technology (AREA)
- Enzymes And Modification Thereof (AREA)
- Micro-Organisms Or Cultivation Processes Thereof (AREA)
Abstract
Description
Claims
Priority Applications (3)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| EP22891249.9A EP4430177A4 (en) | 2021-11-11 | 2022-11-11 | Endonucleases that selectively split single-strand nucleic acids, and uses thereof |
| CA3237085A CA3237085A1 (en) | 2021-11-11 | 2022-11-11 | Endonucleases that selectively cleave single-stranded nucleic acids and uses thereof |
| US18/709,349 US20250043259A1 (en) | 2021-11-11 | 2022-11-11 | Endonucleases that selectively cleave single-stranded nucleic acids and uses thereof |
Applications Claiming Priority (2)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| US202163263896P | 2021-11-11 | 2021-11-11 | |
| US63/263,896 | 2021-11-11 |
Publications (1)
| Publication Number | Publication Date |
|---|---|
| WO2023082011A1 true WO2023082011A1 (en) | 2023-05-19 |
Family
ID=86334843
Family Applications (1)
| Application Number | Title | Priority Date | Filing Date |
|---|---|---|---|
| PCT/CA2022/051668 Ceased WO2023082011A1 (en) | 2021-11-11 | 2022-11-11 | Endonucleases that selectively cleave single-stranded nucleic acids and uses thereof |
Country Status (4)
| Country | Link |
|---|---|
| US (1) | US20250043259A1 (en) |
| EP (1) | EP4430177A4 (en) |
| CA (1) | CA3237085A1 (en) |
| WO (1) | WO2023082011A1 (en) |
Citations (3)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| WO2005049642A2 (en) * | 2003-11-21 | 2005-06-02 | Institut Pasteur | Genome of legionella pneumophila paris and lens strain-diagnostic and epidemiological applications |
| US20090298099A1 (en) * | 2001-02-12 | 2009-12-03 | Novartis Vaccines And Diagnostics S.R.L. | Gonococcal Proteins and Nucleic Acids |
| US20120070457A1 (en) * | 2010-09-10 | 2012-03-22 | J. Craig Venter Institute, Inc. | Polypeptides from neisseria meningitidis |
Family Cites Families (2)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| NZ515935A (en) * | 1999-05-19 | 2004-01-30 | Chiron S | Combination Neisserial compositions and vaccines for the prevention of infections due to Neisseria bacteria such as meningococcal meningitis |
| KR101873327B1 (en) * | 2016-03-30 | 2018-07-02 | 연세대학교 산학협력단 | Novel homing endonuclease from arabidopsis thaliana |
-
2022
- 2022-11-11 EP EP22891249.9A patent/EP4430177A4/en active Pending
- 2022-11-11 US US18/709,349 patent/US20250043259A1/en active Pending
- 2022-11-11 CA CA3237085A patent/CA3237085A1/en active Pending
- 2022-11-11 WO PCT/CA2022/051668 patent/WO2023082011A1/en not_active Ceased
Patent Citations (3)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| US20090298099A1 (en) * | 2001-02-12 | 2009-12-03 | Novartis Vaccines And Diagnostics S.R.L. | Gonococcal Proteins and Nucleic Acids |
| WO2005049642A2 (en) * | 2003-11-21 | 2005-06-02 | Institut Pasteur | Genome of legionella pneumophila paris and lens strain-diagnostic and epidemiological applications |
| US20120070457A1 (en) * | 2010-09-10 | 2012-03-22 | J. Craig Venter Institute, Inc. | Polypeptides from neisseria meningitidis |
Non-Patent Citations (5)
| Title |
|---|
| CHANDLER MICHAEL, DE LA CRUZ FERNANDO, DYDA FRED, HICKMAN ALISON B., MONCALIAN GABRIEL, TON-HOANG BAO: "Breaking and joining single-stranded DNA: the HUH endonuclease superfamily", NATURE REVIEWS MICROBIOLOGY, NATURE PUBLISHING GROUP, GB, vol. 11, no. 8, 1 August 2013 (2013-08-01), GB , pages 525 - 538, XP093067449, ISSN: 1740-1526, DOI: 10.1038/nrmicro3067 * |
| DUNIN-HORKAWICZ STANISLAW; FEDER MARCIN; BUJNICKI JANUSZ M: "Phylogenomic analysis of the GIY-YIG nuclease superfamily", BMC GENOMICS, BIOMED CENTRAL LTD, LONDON, UK, vol. 7, no. 1, 28 April 2006 (2006-04-28), London, UK , pages 98, XP021014700, ISSN: 1471-2164, DOI: 10.1186/1471-2164-7-98 * |
| GUHA ET AL.: "Applications of alternative nucleases in the age of CRISPR/Cas9", INTERNATIONAL JOURNAL OF MOLECULAR SCIENCES, vol. 18, no. 12, 29 November 2017 (2017-11-29), pages 2565, XP055554937, ISSN: 1661-6596, DOI: 10.3390/ijms18122565 * |
| See also references of EP4430177A4 * |
| TOMPKINS KASSIDY J, HOUTTI MO, LITZAU LAUREN A, AIRD ERIC J, EVERETT BLAKE A, NELSON ANDREW T, PORNSCHLOEGL LELAND, LIMÓN-SWANSON : "Molecular underpinnings of ssDNA specificity by Rep HUH-endonucleases and implications for HUH-tag multiplexing and engineering", NUCLEIC ACIDS RESEARCH, OXFORD UNIVERSITY PRESS, GB, vol. 49, no. 2, 25 January 2021 (2021-01-25), GB , pages 1046 - 1064, XP093067452, ISSN: 0305-1048, DOI: 10.1093/nar/gkaa1248 * |
Also Published As
| Publication number | Publication date |
|---|---|
| EP4430177A1 (en) | 2024-09-18 |
| CA3237085A1 (en) | 2023-05-19 |
| EP4430177A4 (en) | 2025-12-31 |
| US20250043259A1 (en) | 2025-02-06 |
Similar Documents
| Publication | Publication Date | Title |
|---|---|---|
| US11427818B2 (en) | S. pyogenes CAS9 mutant genes and polypeptides encoded by same | |
| US20220340931A1 (en) | S. pyogenes cas9 mutant genes and polypeptides encoded by same | |
| EP2712931B1 (en) | Immobilized transposase complexes for DNA fragmentation and tagging | |
| CN102796728B (en) | Methods and compositions for DNA fragmentation and tagging by transposases | |
| Schleper et al. | Characterization of a DNA polymerase from the uncultivated psychrophilic archaeon Cenarchaeum symbiosum | |
| US20200172895A1 (en) | Using split deaminases to limit unwanted off-target base editor deamination | |
| Sinha et al. | AdnAB: a new DSB-resecting motor–nuclease from mycobacteria | |
| JP2020103295A (en) | Methods and compositions related to sequences that guide cas9-targeting | |
| US20170191047A1 (en) | Adenosine-specific rnase and methods of use | |
| KR20250021632A (en) | Crispr/cpf1 systems and methods | |
| US20220307009A1 (en) | Isolated nucleic acid binding domains | |
| KR20210031699A (en) | DNA polymerase mutant suitable for nucleic acid amplification reaction from RNA | |
| Babu et al. | Sinorhizobium meliloti YbeY is a zinc-dependent single-strand specific endoribonuclease that plays an important role in 16S ribosomal RNA processing | |
| Maciejewska et al. | New nuclease from extremely psychrophilic microorganism Psychromonas ingrahamii 37: identification and characterization | |
| KR20080031255A (en) | Mutant type PCNA | |
| Murray et al. | Structural and functional diversity among Type III restriction-modification systems that confer host DNA protection via methylation of the N4 atom of cytosine | |
| US20250043259A1 (en) | Endonucleases that selectively cleave single-stranded nucleic acids and uses thereof | |
| Chen et al. | Biochemical and mutational analyses of a unique clamp loader complex in the archaeon Methanosarcina acetivorans | |
| Honda et al. | Archaeal homologs of human RNase P protein pairs Pop5 with Rpp30 and Rpp21 with Rpp29 work on distinct functional domains of the RNA subunit | |
| US11932847B2 (en) | Transposase competitor control system | |
| Hoeller et al. | Random tag insertions by Transposon Integration mediated Mutagenesis (TIM) | |
| Luna-Chávez et al. | Molecular basis of inhibition of the ribonuclease activity in colicin E5 by its cognate immunity protein | |
| Sommer et al. | Activation of a chimeric Rpb5/RpoH subunit using library selection | |
| Yang et al. | Physical and functional interactions between 3-methyladenine DNA glycosylase and topoisomerase I in mycobacteria | |
| Modrusan et al. | Spermine-mediated improvement of cycling probe reaction |
Legal Events
| Date | Code | Title | Description |
|---|---|---|---|
| 121 | Ep: the epo has been informed by wipo that ep was designated in this application |
Ref document number: 22891249 Country of ref document: EP Kind code of ref document: A1 |
|
| ENP | Entry into the national phase |
Ref document number: 3237085 Country of ref document: CA |
|
| WWE | Wipo information: entry into national phase |
Ref document number: 18709349 Country of ref document: US |
|
| WWE | Wipo information: entry into national phase |
Ref document number: 2022891249 Country of ref document: EP |
|
| NENP | Non-entry into the national phase |
Ref country code: DE |
|
| ENP | Entry into the national phase |
Ref document number: 2022891249 Country of ref document: EP Effective date: 20240611 |



