EP4291677A2 - Procédés de détection de méthylation de base dans des acides nucléiques - Google Patents

Procédés de détection de méthylation de base dans des acides nucléiques

Info

Publication number
EP4291677A2
EP4291677A2 EP22705528.2A EP22705528A EP4291677A2 EP 4291677 A2 EP4291677 A2 EP 4291677A2 EP 22705528 A EP22705528 A EP 22705528A EP 4291677 A2 EP4291677 A2 EP 4291677A2
Authority
EP
European Patent Office
Prior art keywords
nucleic acid
5hmc
adduct
sequencing
malononitrile
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
EP22705528.2A
Other languages
German (de)
English (en)
Inventor
Frank Bergmann
Shwu shin CHANG
Peter CRISALLI
Abre De Beer
Dieter Heindl
Omid KHAKSHOOR
David L. PENKLER
Jo-Anne PENKLER
Martin Ranik
Meng Taing
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
F Hoffmann La Roche AG
Roche Diagnostics GmbH
Kapa Biosystems Inc
Original Assignee
F Hoffmann La Roche AG
Roche Diagnostics GmbH
Kapa Biosystems Inc
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by F Hoffmann La Roche AG, Roche Diagnostics GmbH, Kapa Biosystems Inc filed Critical F Hoffmann La Roche AG
Publication of EP4291677A2 publication Critical patent/EP4291677A2/fr
Pending legal-status Critical Current

Links

Classifications

    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12QMEASURING OR TESTING PROCESSES INVOLVING ENZYMES, NUCLEIC ACIDS OR MICROORGANISMS; COMPOSITIONS OR TEST PAPERS THEREFOR; PROCESSES OF PREPARING SUCH COMPOSITIONS; CONDITION-RESPONSIVE CONTROL IN MICROBIOLOGICAL OR ENZYMOLOGICAL PROCESSES
    • C12Q1/00Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions
    • C12Q1/68Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions involving nucleic acids
    • C12Q1/6806Preparing nucleic acids for analysis, e.g. for polymerase chain reaction [PCR] assay

Definitions

  • [001] The invention related to the field of nucleic acid-based diagnostics.
  • the invention related to a method of detecting epigenetic modification in nudeic acids, wherein the epigenetic modifications may have biological and clinical significance.
  • Detecting methylation comprises detecting a modified cytosine base (methyl and hydroxymethyl cytosine (5mC and 5hmC)) in nucleic acids.
  • a modified cytosine base methyl and hydroxymethyl cytosine (5mC and 5hmC)
  • the gold standard of detecting methylation involved treating DNA with bisulfite. The treatment would convert unmethylated cytosines (C) to uracils (U) while methylated cytosines (5mC and 5hmC) would remain intact. The change of C to U could then be detected e.g., by nucleic acid sequencing.
  • bisulfite treatment leads to degradation of large portion of sample DNA.
  • TAPS TET-assisted pyridine-borane sequencing
  • oxidation products can be reacted with malononitrile to form an adduct also read as T during sequencing, see Zhu C., et al., (2017) Single- Cell 5-Formylcytosine Landscapes of Mammalian Early Embryos and ESCs at Single- Base Resolution, Cell Stem Cell, 20:720-731. e5.
  • Malononitrile reacts exclusively with 5-formylcytosine (5fC).
  • Another method of detecting 5fC is with a Wittig reagent in an organic solvent, and then irradiating with ultraviolet light. The products of the reaction are detected using fluorescence recognition technology as described in International Patent Publication No. WO2020155742.
  • the invention is a method of detecting a 5-formyl cytosine (5fC) nucleotide in a nucleic acid, the method comprising: (i) forming a reaction ixture by contacting a sample containing a nucleic acid comprising 5fC with a composition comprising a compound of formula Ri — CH — CN capable of reacting with 5fC in the nucleic acid to form an adduct according to the reaction scheme [007]
  • 5fC 5-formyl cytosine
  • Ri is an electron-withdrawing group selected from substituted or unsubstituted cyano, nitro, formyl, carbonyl compound, wherein the substitution is selected from C1-C30 linear or branched alkyl, C1-C30 linear or branched alkenyl, C1-C30 linear or branched alkynyl, cycloalkyl, aryl, or heteroaryl, ⁇ (ii) incubating the reaction mixture for less than 3 hours wherein at least 90% of 5fC has formed the adduct; (iii) sequencing the nucleic acid from the reaction ixture to obtain a test sequence wherein the adduct is read as thymine (T) during sequencing; and (iii) comparing the test sequence with a reference sequence, wherein a transition from a cytosine (C) in the reference sequence to athymine (T) in the corresponding position in the test sequence indicates the presence of 5fC in the nucleic acid.
  • T thy
  • Rl is a cyano group (CN).
  • the composition comprising the compound of formula Ri — CH 2 — CN contains an organic acid moiety.
  • the organic acid has a formula R-COOH and R is selected from C1-C30 linear or branched alkyl, C1-C30 linear or branched alkenyl, and C1-C30 linear or branched alkynyl.
  • the organic acid is acetic acid.
  • the composition comprising the compound of formula R — CH 2 — CN is present in a non-aqueous solvent.
  • the non-aqueous solvent has a formula R-OH wherein R is selected from C1-C30 linear or branched alkyl, C1-C30 linear or branched alkenyl, C1-C30 linear or branched alkynyl, cycloalkyl, aryl, or heteroaryl.
  • R is selected from C1-C30 linear or branched alkyl, C1-C30 linear or branched alkenyl, C1-C30 linear or branched alkynyl, cycloalkyl, aryl, or heteroaryl.
  • the non- aqueous solvent is methanol or ethanol at 10-100%, e.g., 90-100%.
  • R x NH y is triethanolamine.
  • the reaction mixture is incubated for 1 hour.
  • the nucleic acid prior to sequencing in step (iii), is amplified, e.g., with a B -family polymerase efficiently incorporating an adenine (A) nudeotide opposite the adduct.
  • sequencing in step (iii) is by sequencing-by-synthesis (SBS) method, e.g., with a nanopore.
  • SBS sequencing-by-synthesis
  • the nucleic acid comprising 5fC is obtained by contacting the nucleic acid comprising methylated cytosine with a composition comprising a ten-eleven-translocation (TET) dioxygenase and 5-100 mM of a Fe(II) ion at pH 7-8.
  • the composition comprises a ten-eleven- translocation (TET) dioxygenase and 5-10 mM of a Fe(II) ion at pH 8.
  • the composition comprises a ten-eleven-translocation (TET) dioxygenase and 80-100 mM of a Fe(II) ion at pH 7.
  • the Fe(II) ion is produced by contacting the sample with a compound selected from FeS0 4 , (NH 4 ) 2 Fe(S0 4 ) 2 , FeS0 4 7H 2 0, (NH 4 ) 2 Fe(S0 4 ) 2 6H 2 0 and FeCl 2 .
  • the composition further comprises one or more of ascorbic acid, alpha-ketoglutarate and a reducing agent.
  • the nucleic acid comprising 5fC is obtained by contacting the nucleic acid comprising methylated cytosine with a composition comprising Cu(II) compound and 2, 2,6,6- tetramethylpiperidine-l-oxyl (TEMPO).
  • the nucleic acid comprising 5fC is obtained by contacting the nucleic acid comprising methylated cytosine with a potassium ruthenium salt selected from potassium ruthenate (K 2 Ru0 4 ) and potassium perruthenate (KRu0 4 ).
  • a potassium ruthenium salt selected from potassium ruthenate (K 2 Ru0 4 ) and potassium perruthenate (KRu0 4 ).
  • the invention is a method of detecting a methylated cytosine (C) nudeotide in a nucleic acid, the method comprising: (i) forming a reaction mixture by contacting a sample containing a nucleic acid comprising 5- methyl cytosine (5mC) and/or 5-hydroxymethyl cytosine (5hmC) with a composition comprising a ten-eleven-translocation (TET) dioxygenase capable of converting 5mC and 5hmC in the nucleic acid into 5-formyl cytosine (5fC) and a compound of formula Ri — CH 2 — CN capable of reacting with 5fC in the nucleic acid to form an adduct according to the reaction scheme
  • TET ten-eleven-translocation
  • Ri is an electron -withdrawing group selected from substituted or unsubstituted cyano, nitro, formyl, carbonyl compound, wherein the substitution is selected from C1-C30 linear or branched alkyl, C1-C30 linear or branched alkenyl,
  • Rl is a cyano group (CN) and the composition added in step (i) contains a non-aqueous solvent having a formula R-OH wherein R is selected from C1-C30 linear or branched alkyl, C1-C30 linear or branched alkenyl,
  • Rl is a cyano group (CN) and the composition added in step (i) contains ethanol or methanol at the concentration of at least 90% in the reaction mixture, and further comprises tri-ethanolamine.
  • the invention is a method of detecting a methylated cytosine nucleotide in a nucleic acid, the method comprising: (i) ligating adaptors to a nucleic acid comprising 5- methyl cytosine (5mC) and/or 5-hydroxymethyl cytosine (5hmC), wherein adaptors comprise amplification primer binding sites; (ii) forming a reaction mixture by contacting the sample containing adaptor-ligated nucleic acid with a ten-eleven-translocation (TET) dioxygenase capable of converting 5mC and 5hmC in the nucleic acid into 5-formyl cytosine (5fC);
  • TAT ten-eleven-translocation
  • R is an electron-withdrawing group selected from substituted or unsubstituted cyano, nitro, formyl, carbonyl compound, wherein the substitution is selected from C1-C30 linear or branched alkyl, C1-C30 linear or branched alkenyl,
  • R1 is a cyano group (CN) and the composition added in step (i) contains a non-aqueous solvent having a formula R-OH wherein R is selected from C1-C30 linear or branched alkyl, C1-C30 linear or branched alkenyl, C1-C30 linear or branched alkynyl, cycloalkyl, aryl, or heteroaryl, and the concentration of the solvent in the reaction mixture of at least 90%.
  • R1 is a cyano group (CN) and the composition added in step (i) contains ethanol or methanol at the concentration of at least 90% in the reaction mixture, and further comprises tri ethanolamine.
  • the invention is a kit for detecting 5-formyl cytosine
  • the kit comprising an ethanol solution of malononitrile.
  • the kit further comprises one or more of the following: nucleic acid sequencing reagents, nucleic acid amplification reagents, nucleic acid purification reagents, a solution of acetic acid, a solution of triethanolamine and instructions on reacting 5fC in nucleic acid with malononitrile in the presence of organic acids and alkylamines.
  • the invention is a kit for detecting methylated cytosine nucleotides in a nucleic acid in under 3 hours, the kit comprising a ten- eleven-translocation (TET) dioxygenase enzyme, an ethanol solution of malononitrile and further comprising reagents for nucleic acid purification, amplification and sequencing.
  • TET ten- eleven-translocation
  • the invention is a method of detecting a 5-formyl cytosine (5fC) and 5-carboxy cytosine (5caC) nucleotide in a nucleic acid, the method comprising: (i) forming a reaction mixture by contacting a sample containing a nucleic acid comprising 5fC and/or 5caC with a composition comprising a borane derivative; (ii) incubating the reaction mixture for less than 3 hours wherein at least 90% of 5fC and 5caC has been reduced to di hydrouracil (DHU);
  • DHU di hydrouracil
  • the borane derivative is picoline borane.
  • the reaction mixture contains an organic acid moiety.
  • the organic acid has a formula R-COOH and R is selected from C1-C30 linear or branched alkyl, C1-C30 linear or branched alkenyl, and C1-C30 linear or branched alkynyl.
  • the organic acid is acetic acid.
  • the borane derivative is present in a non-aqueous solvent.
  • the non-aqueous solvent has a formula R-OH wherein R is selected from C1-C30 linear or branched alkyl, C1-C30 linear or branched alkenyl, C1-C30 linear or branched alkynyl, cycloalkyl, aryl, or heteroaryl.
  • the non-aqueous solvent is methanol or ethanol.
  • the reaction mixture is incubated for 1 hour.
  • the nucleic acid is amplified, e.g., with a B -family polymerase efficiently incorporating an adenine (A) nucleotide opposite DHU.
  • sequencing in step (iii) is by sequencing-by-synthesis (SBS) method, e.g., with a nanopore.
  • SBS sequencing-by-synthesis
  • the nucleic acid comprising 5fC and/or 5caC is obtained by contacting the nucleic acid comprising methylated cytosine with a composition comprising a ten-eleven-translocation (TET) dioxygenase and 5-100 mM of a Fe(II) ion at pH 7-8.
  • the composition comprises a ten-eleven-translocation (TET) dioxygenase and 5-10 mM of a Fe(II) ion at pH 8.
  • the composition comprises a ten-eleven-translocation (TET) dioxygenase and 80-100 mM of a Fe(II) ion at pH 7.
  • the Fe(II) ion is produced by contacting the sample with a compound selected from FeS0 4 , (NH 4 ) 2 Fe(S0 4 ) 2 , FeS0 4 7H 2 0, (NH 4 ) 2 Fe(S0 4 ) 2 6H 2 0 and FeCl 2 .
  • the composition further comprises one or more of ascorbic acid, alpha-ketoglutarate and a reducing agent.
  • the nucleic acid comprising 5fC is obtained by contacting the nudeic acid comprising methylated cytosine with a composition comprising Cu(II) compound and 2, 2,6,6- tetramethylpiperidine-l-oxyl (TEMPO).
  • the nucleic acid comprising 5fC is obtained by contacting the nucleic acid comprising methylated cytosine with a potassium ruthenium salt selected from potassium ruthenate (K 2 RU0 4 ) and potassium perruthenate (KRu0 4 ).
  • a potassium ruthenium salt selected from potassium ruthenate (K 2 RU0 4 ) and potassium perruthenate (KRu0 4 ).
  • the invention is a method of detecting a methylated cytosine (C) nucleotide in a nucleic acid, the method comprising: (i) forming a reaction mixture by contacting a sample containing a nucleic acid comprising 5- methyl cytosine (5mC) and/or 5-hydroxymethyl cytosine (5hmC) with a composition comprising a ten-eleven-translocation (TET) dioxygenase capable of converting 5mC and 5hmC in the nucleic acid into 5-formyl cytosine (5fC) and 5- carboxy cytosine (5caC) and a borane derivative in a non-aqueous solvent;
  • TET ten-eleven-translocation
  • the borane derivative is selected from pyridine borane, 2-picoline borane (pic-BH3), borane, sodium borohydride, sodium cyanoborohydride, and sodium triacetoxyborohydride and the non-aqueous solvent is selected from methanol and methanol and the reaction mixture further comprises acetic acid.
  • the invention is a method of detecting a methylated cytosine nucleotide in a nucleic acid, the method comprising: (i) ligating adaptors to a nucleic acid comprising 5- methyl cytosine (5mC) and/or 5-hydroxymethyl cytosine (5hmC), wherein adaptors comprise amplification primer binding sites; (ii) forming a reaction mixture by contacting the sample containing adaptor-ligated nucleic acid with a ten-eleven-translocation (TET) dioxygenase capable of converting 5mC and 5hmC in the nucleic acid into 5-formyl cytosine (5fC) and 5- carboxycytosine (5caC); (iii) contacting the reaction mixture with a borane derivative in a non-aqueous solvent; (iv) incubating the reaction mixture for less than 3 hours wherein at least 90% of 5fC and 5caC has been reduced to dihydrouracil
  • the borane derivative is selected from pyridine borane, 2-picoline borane (pic-BH3), borane, sodium borohydride, sodium cyanoborohydride, and sodium triacetoxyborohydride and the non-aqueous solvent is selected from methanol and methanol and the reaction mixture further comprises acetic acid.
  • the invention is a kit for detecting 5-formyl cytosine (5fC) and 5-carboxy cytosine (5caC) in a nucleic acid in under 3 hours, the kit comprising a borane derivative in an ethanol solution.
  • the borane derivative is selected from pyridine borane, 2-picoline borane (pic-BH3), borane, sodium borohydride, sodium cyanoborohydride, and sodium triacetoxyborohydride.
  • the kit further comprises one or more of the following: nucleic acid sequencing reagents, nucleic acid amplification reagents, nucleic acid purification reagents, a solution of acetic acid, and instructions on reacting 5fC and 5caC in nucleic acid with a borane compound in the presence of organic acids.
  • the invention is a kit for detecting methylated cytosine nucleotides in a nucleic acid in under 3 hours, the kit comprising a ten- eleven-translocation (TET) dioxygenase enzyme, an ethanol solution of a borane derivative and further comprising reagents for nucleic acid purification, amplification and sequencing.
  • TET ten- eleven-translocation
  • the invention is a single-tube method of detecting a 5-formyl cytosine (5fC) nudeotide in a nudeic acid, the method comprising: (i) forming a reaction mixture by contacting a sample containing a nucleic acid comprising 5fC with a composition comprising ten-eleven-translocation (TET) dioxygenase in a solution with a compound of formula Ri — CH — CN capable of reacting with 5fC in the nucleic acid to form an adduct according to the reaction scheme
  • TET ten-eleven-translocation
  • Ri is an electron-withdrawing group selected from substituted or unsubstituted cyano, nitro, formyl, carbonyl compound, wherein the substitution is selected from C1-C30 linear or branched alkyl, C1-C30 linear or branched alkenyl, C1-C30 linear or branched alkynyl, cycloalkyl, aryl, or heteroaryl, ⁇ (ii) incubating the reaction mixture to form the adduct; [0029] (iii) sequencing the nucleic acid from the reaction mixture to obtain a test sequence wherein the adduct is read as thymine (T) during sequencing; and (iii) comparing the test sequence with a reference sequence, wherein a transition from a cytosine (C) in the reference sequence to a thymine (T) in the corresponding position in the test sequence indicates the presence of 5fC in the nucleic acid.
  • the compound of formula R is an electron-withdrawing group selected
  • the invention is a method of detecting a tissue of origin of a nucleic acid in a sample, the method comprising detecting the presence and location of methylated cytosines in the nucleic acid by the method disclosed herein, comparing the methylation pattern to the known methylation patterns of several tissues; and identifying the tissue of origin of the nucleic acid in the sample.
  • the invention is a method of detecting organ transplant rejection in a transplant recipient, the method comprising obtaining from the transplant recipient a blood sample containing cell-free nucleic acids; detecting the presence and location of methylated cytosines in the cell-free nucleic acid by the method disclosed herein; comparing the methylation pattern to the known methylation patterns of several organs; detecting transplant rejection if the cell-free nucleic acid with the transplanted organ-specific methylation pattern is detected in the sample.
  • the invention is a method of monitoring for transplant rejection by periodically sampling circulating cell-free DNA and detecting the presence and location of methylated cytosines according to the disclosed herein, measuring changes in the level of cell-free DNA with the transplanted organ-specific methylation pattern, wherein an increase in the level of such cell-free DNA indicates organ transplant rejection.
  • the invention is a method of screening for the presence of a cancerous tumor in a patient, the method comprising obtaining from the patient a blood sample containing cell-free nucleic acids; detecting the presence and location of methylated cytosines in the cell-free nucleic acids by the method disclosed herein; comparing the methylation pattern to the known methylation patterns of tumor and non-tumor tissues, and detecting the presence of a tumor if tumor-specific methylation patter is detected.
  • the invention is a method of monitoring tumor volume in a patient the method comprising periodically sampling circulating cell-free DNA and detecting the presence and location of methylated cytosines by the method disclosed herein, measuring changes in the level of cell-free DNA with the tumor-specific methylation pattern, wherein an increase in the level of such cell-free DNA indicates tumor growth, while a decrease in the level of such cell-free DNA indicates tumor shrinkage.
  • the invention is a method of monitoring the effectiveness of treatment of cancer in a patient by a method comprising periodically sampling circulating cell- free DNA and detecting the presence and location of methylated cytosines by the method disclosed herein, measuring changes in the level of cell-free DNA with the tumor-specific methylation pattern, wherein an increase in the level of such cell-free DNA indicates that treatment ineffective, while a decrease in the level of such cell- free DNA indicates treatment effectiveness.
  • the invention is a method of diagnosis or minimal residual disease (MRD) in a cancer patient the method comprising obtaining from the patient a blood sample comprising cell -free nucleic acids, detecting the presence and location of methylated cytosines in the nucleic acid by the method disclosed herein, comparing the methylation pattern to the known methylation patterns of several tissues; and identifying the tissue of origin of the nucleic acid in the sample.
  • MRD minimal residual disease
  • the invention is a method of diagnosing an autoimmune disease in a patient the method comprising from the patient a blood sample comprising cell-free nucleic acids, detecting the presence and location of methylated cytosines in the nucleic acid by the method disclosed herein, comparing the methylation pattern to the known methylation patterns of tissues damaged by the immune disease; diagnosing immune disease if such methylation pattern is found.
  • the invention is a method detecting the presence and location of methylated cytosines as disclosed herein further comprising prior to contacting the reaction mixture with the TET dioxygenase, the 5-hydroxymethyl cytosine (5hmC) in the nucleic acid is chemically blocked from reacting with TET.
  • 5hmC is blocked by contacting the reaction mixture with a glucosyltransferase and a glucose moiety.
  • the reaction mixture is contacted with a beta-glucosyltransferase and UDP glucose.
  • the invention is a method of forming 5-formyl cytosine (5fC) in a nucleic acid comprising contacting a reaction mixture containing the nucleic acid including at least one 5 -hydroxymethyl cytosine (5hmC) with laccase.
  • laccase is isolated from a species selected from Hexagonia tenuis, Pleurotis sajor caju, Pleutoris ostreatus, Xylaria polymorpha, Trametes hirsuta, Trametes versicolor and Coprinus spp.
  • laccase is isolated from a strain selected from Pleurotis sajor caju MTCC-141, Pleutoris ostreatus MTCC-1801, Xylaria polymorpha MTCC-1100, and Trametes hirsuta MTCC-1171.
  • the reaction mixture further comprises a co-factor, e.g., 2,2,6,6-tetramethylpiperidine-l-oxyl (TEMPO), acetosyringone, syringaldehyde, para-coumaric acid 2,2’-azino-bis(3-ethylbenzothiazoline-6- sulfonate (ABTS), violuric acid (VLA), N-acetyl-N-phenylhydroxylamine (NHA), N-hydroxybenzotriazole (HBT), and N-hydroxyphthalimide (HPI).
  • TEMPO 2,2,6,6-tetramethylpiperidine-l-oxyl
  • acetosyringone acetosyringone
  • syringaldehyde para-coumaric acid 2,2’-azino-bis(3-ethylbenzothiazoline-6- sulfonate
  • ABTS para-coumaric acid 2,2’-azino-bis(3-e
  • the invention is a method of detecting 5- hydroxymethyl cytosine (5hmC) in nucleic acids comprising the steps of: contacting a sample comprising the nucleic acid comprising 5hmC with laccase under conditions suitable for oxidizing 5hmC into 5-formyl cytosine (5fC); contacting the sample with a composition comprising a compound of formula R — CH 2 — CN capable of reacting with 5fC in the nucleic acid to form an adduct according to the reaction scheme wherein R is an electron-withdrawing group selected from cyano, nitro, Cl - C6 alkyl carboxylic ester, unsubstituted carboxamide, C1-C6 alkyl mono-substituted and Cl- C6 alkyl di-substituted carboxamide, substituted carbonyl moiety, substituted sulfonyl moiety, wherein the substitution is selected from C1-C6 linear or branched alkyl
  • the alkyl substitution may comprise heteroatoms like O or N, e.g. -CH -CH -O-CH .
  • the compound of formula Ri — CH 2 — CN is malononitrile.
  • the same conversion can be performed with a Wittig reagent.
  • reaction is as follows:
  • the above disclosed R1 — CH2 — CN and the Wittig reagent are defined as “fC Conversion Reagent”, since they are capable of converting a Cytosine into a Thymine equivalent, when serving as a polymerase substrate.
  • reaction product of R1 — CH2 — CN and the Wittig reagent with 5-formyl-cytosine (5fC) is defined as “5fC adduct” or “adduct” which acts a thymine equivalent when serving as a polymerase substrate.
  • the organic acid is acetic acid
  • the non-aqueous solvent is ethanol or methanol
  • the compound of formula R x NH y is tri-ethanolamine or piperidine.
  • the compound is a buffer, e.g.
  • the non-aqueous solvent is present in the reaction mixture at a concentration of 10-100%, e.g., 90-100%. In some embodiments, the reaction mixture is incubated for 1 hour. In some embodiments, prior to sequencing, the nucleic acid is amplified with a B-family polymerase efficiently incorporating an adenine (A) nucleotide opposite the Thymine equivalent.
  • the invention is a method of detecting a methylated cytosine (C) in a nucleic acid, the method comprising: contacting a sample containing a nucleic acid comprising 5- methyl cytosine (5mC) and/or 5- hydroxymethyl cytosine (5hmC) with a ten- eleven-translocation (TET) dioxygenase capable of converting 5mC into 5hmC and with a laccase capable of converting 5hmC into 5-formyl cytosine (5fC); contacting the sample with a compound of formula Ri — CH 2 — CN capable of reacting with 5fC in the nucleic acid to form an adduct according to the reaction scheme wherein Ri is an electron-withdrawing group selected from cyano, nitro, Cl - C6 alkyl carboxylic ester, unsubstituted carboxamide, C1-C6 alkyl mono-substituted and Cl- C6 alky
  • the compound of formula R — CEE — CN is malononitrile.
  • the organic acid is acetic acid
  • the non-aqueous solvent is ethanol or methanol
  • the compound of formula R x NH y is tri-ethanolamine or piperidine.
  • the non-aqueous solvent is present in the reaction mixture at a concentration of 10-100%, e.g., 90-100%. In some embodiments, the reaction mixture is incubated for 1 hour. In some embodiments, prior to sequencing, the nudeic acid is amplified with a B-family polymerase efficiently incorporating an adenine (A) nucleotide opposite the adduct. In some embodiments, TET and laccase are active in the same reaction mixture. In other embodiments, TET and laccase are not active in the same reaction mixture and are added consecutively to the sample.
  • the invention is a method of detecting 5- hydroxymethyl cytosine (5hmC) in nucleic acids comprising the steps of: contacting a sample comprising the nucleic acid comprising 5hmC with laccase under conditions suitable for oxidizing 5hmC into 5-formyl cytosine (5fC); contacting the sample with a Wittig reagent; irradiating the sample with an ultraviolet light to form a product; sequencing the nucleic acid from the reaction mixture to obtain a test sequence wherein the product is read as thymine (T) during sequencing; and comparing the test sequence with a reference sequence, wherein a transition from a cytosine (C) in the reference sequence to a thymine (T) in the corresponding position in the test sequence indicates the presence of 5hmC in the nucleic acid.
  • the invention is a method of forming 5-formyl cytosine (5fC) in a nucleic acid comprising contacting a reaction mixture containing the nucleic acid including at least one 5-methyl cytosine (5mC) and/or 5- hydroxymethyl cytosine (5hmC) with an enzyme selected from xylene monooxygenase, toluene methyl-monooxygenase (EC 1.14.15.26), P450 monooxygenase (EC 1.14.14.1), alcohol dehydrogenase, alcohol oxidase, galactose oxidase, chloroperoxidase and peroxidase.
  • an enzyme selected from xylene monooxygenase, toluene methyl-monooxygenase (EC 1.14.15.26), P450 monooxygenase (EC 1.14.14.1), alcohol dehydrogenase, alcohol oxidase, galactose oxidase, chloroperoxidas
  • the invention is a kit for detecting methylated cytosine in nucleic acids comprising laccase.
  • the laccase is isolated from a species selected from Hexagonia tenuis, Pleurotis sajor caju, Pleutoris ostreatus, Xylaria polymorpha, Trametes hirsuta, Trametes versicolor and Coprinus spp.
  • the laccase is isolated from a strain selected from Pleurotis sajor caju MTCC-141, Pleutoris ostreatus MTCC-1801, Xylaria polymorpha MTCC-1100, and Trametes hirsuta MTCC-1171.
  • the kit further comprises a laccase cofactor selected from 2,2,6,6-tetramethylpiperidine-l- oxyl (TEMPO), acetosyringone, syringaldehyde, para-coumaric acid 2,2’-azino- bis(3-ethylbenzothiazoline-6-sulfonate (ABTS), violuric acid (VLA), N-acetyl-N- phenylhydroxylamine (NHA), N-hydroxybenzotriazole (HBT), and N- hydroxyphthalimide (HPI).
  • TEMPO 2,2,6,6-tetramethylpiperidine-l- oxyl
  • ABTS para-coumaric acid 2,2’-azino- bis(3-ethylbenzothiazoline-6-sulfonate
  • VLA violuric acid
  • NHA N-acetyl-N- phenylhydroxylamine
  • HBT N-hydroxybenzotriazole
  • HPI N- hydroxy
  • the kit further comprises one or more of the following: nucleic acid sequencing reagents, nucleic acid amplification reagents, and nucleic acid purification reagents.
  • the kit further comprises a ten- eleven-translocation (TET) dioxygenase enzyme.
  • TET and laccase are present in the same tube.
  • the kit further comprises an ethanol solution of malononitrile.
  • the kit further comprises a solution of acetic acid.
  • the kit further comprises a solution of triethanolamine.
  • the invention is a kit for detecting methylated cytosine in nucleic acids comprising an enzyme capable of converting 5mC into 5hmC and/or 5fC, the enzyme selected from xylene monooxygenase, toluene methyl-monooxygenase (EC 1.14.15.26) and P450 monooxygenase (EC 1.14.14.1).
  • the invention is a kit for detecting methylated cytosine in nucleic acids comprising an enzyme capable of converting 5hmC into 5fC, the enzyme selected from alcohol dehydrogenase, alcohol oxidase, galactose oxidase, chloroperoxidase and peroxidase.
  • the invention is a method of detecting a hydroxymethylated cytosine (5hmC) nucleotide in a nucleic acid, the method comprising: (i) ligating adaptors to a nucleic acid comprising 5-hydroxymethyl cytosine (5hmC), wherein adaptors comprise amplification primer binding sites; (ii) forming a reaction mixture by contacting the sample containing adaptor-ligated nucleic acid with a laccase capable of converting 5hmC in the nucleic acid into 5- formyl cytosine (5fC); (iii) contacting the reaction mixture with malononitrile to form a 5fC adduct; (iv) amplifying the nucleic acids from step (iii) utilizing a DNA polymerase and primers capable of binding to the primer-binding sites, wherein the DNA polymerase reads the 5fC adduct as thymine (T) during amplification; (v) sequencing
  • the invention is a method of detecting a methylated cytosine (5mC) nucleotide in a nucleic acid, the method comprising: (i) ligating adaptors to a nucleic acid comprising 5-methyl cytosine (5mC), wherein adaptors comprise amplification primer binding sites; (ii) forming a reaction mixture by contacting the sample containing adaptor-ligated nucleic acid with a TET enzyme capable of converting 5mC in the nucleic acid into 5hmC and laccase capable of converting 5hmC in the nucleic acid into 5-formyl cytosine (5fC); (iii) contacting the reaction mixture with malononitrile to form a 5fC adduct; (iv) amplifying the nucleic acids from step (iii) utilizing a DNA polymerase and primers capable of binding to the primer-binding sites, wherein the DNA polymerase reads the 5fC ad
  • the invention is a method of detecting a tissue of origin of a nucleic acid in a sample, the method comprising detecting the presence and location of methylated cytosines in the nucleic acid by the method as disclosed herein comparing the methylation pattern to the known methylation patterns of several tissues; and identifying the tissue of origin of the nucleic acid in the sample.
  • the invention is a method of detecting organ transplant rejection in a transplant recipient, the method comprising obtaining from the transplant recipient a blood sample containing cell-free nudeic acids; detecting the presence and location of methylated cytosines in the cell-free nucleic acid by the method described herein, comparing the methylation pattern to the known methylation patterns of several organs; detecting transplant rejection if the cell-free nucleic acid with the transplanted organ-specific methylation pattern is detected in the sample.
  • the invention is a method of monitoring for transplant rejection by periodically sampling circulating cell-free DNA and detecting the presence and location of methylated cytosines according to the method described herein, measuring changes in the level of cell-free DNA with the transplanted organ-specific methylation pattern, wherein an increase in the level of such cell-free DNA indicates organ transplant rejection.
  • the invention is a method of screening for the presence of a cancerous tumor in a patient, the method comprising obtaining from the patient a blood sample containing cell-free nucleic acids; detecting the presence and location of methylated cytosines in the cell-free nucleic acids by the method described herein, comparing the methylation pattern to the known methylation patterns of tumor and non-tumor tissues, and detecting the presence of a tumor if tumor-specific methylation patter is detected.
  • the invention is a method of monitoring tumor volume in a patient the method comprising periodically sampling circulating cell- free DNA and detecting the presence and location of methylated cytosines according to the method described herein, measuring changes in the level of cell-free DNA with the tumor-specific methylation pattern, wherein an increase in the level of such cell-free DNA indicates tumor growth, while a decrease in the level of such cell-free DNA indicates tumor shrinkage.
  • the invention is a method of monitoring the effectiveness of treatment of cancer in a patient by a method comprising periodically sampling circulating cell-free DNA and detecting the presence and location of methylated cytosines according to the method described herein, measuring changes in the level of cell-free DNA with the tumor-specific methylation pattern, wherein an increase in the level of such cell-free DNA indicates that treatment ineffective, while a decrease in the level of such cell-free DNA indicates treatment effectiveness.
  • the invention is a method of diagnosis or minimal residual disease (MRD) in a cancer patient the method comprising obtaining from the patient a blood sample comprising cell-free nucleic acids, detecting the presence and location of methylated cytosines in the nucleic acid by the method described herein, comparing the methylation pattern to the known methylation patterns of several tissues; and identifying the tissue of origin of the nucleic acid in the sample.
  • MRD minimal residual disease
  • the invention is a method of diagnosing an autoimmune disease in a patient the method comprising from the patient a blood sample comprising cell-free nucleic acids, detecting the presence and location of methylated cytosines in the nucleic acid by the method described herein, comparing the methylation pattern to the known methylation patterns of tissues damaged by the immune disease; diagnosing immune disease if such methylation pattern is found.
  • the invention is a method of distinguishing 5- hydroxymethylcytosine (5hmC) from 5-methylcytosine (5mC) in nucleic acids in a sample, the method comprising: separating a sample into two aliquots; in the first aliquot, contacting the nucleic acid comprising 5mC and 5hmC with a ten-eleven- translocation (TET) dioxygenase under conditions where 5hmC and 5mC are converted into 5-formyl cytosine (5fC); in the second aliquot, contacting the nucleic acid comprising 5mC and 5hmC with a laccase under conditions where 5hmC is converted into 5-formyl cytosine (5fC); contacting both aliquots separately with a compound of formula Ri — CH 2 — CN capable of reacting with 5fC in the nucleic acid to form an adduct according to the reaction scheme wherein Ri is an electron-withdrawing group selected from cyano,
  • FIG. 1 shows DHU conversion using ethanol as co-solvent of picoline- borane in methylation detection assays (TAPS).
  • FIG. 2 shows DHU conversion using methanol and acetic acid as co-solvents of picoline-borane in methylation detection assays (TAPS).
  • FIG. 3 shows malononitrile adduct formation using malononitrile in a sodium acetate buffer in methylation detection assays.
  • FIG. 4 shows malononitrile adduct formation using malononitrile in an ethanol-TRIS buffer in methylation detection assays.
  • FIG. 5 shows malononitrile adduct formation using malononitrile in an ethanol-triethylamine buffer in methylation detection assays.
  • FIG. 6 shows conversion of CpG sites in a single-tube methylation detection assay with TET and malononitrile
  • FIG. 7 shows oxidation of 5hmC into 5fC in an oligonucleotide by laccase in the presence of TEMPO.
  • FIG. 8 shows that 5mC in the oligonucleotide is not oxidized by laccase under the same conditions.
  • FIG. 9A and FIG. 9B show Liquid Chromatography-Mass Spectrometry (LC-MS) data of how various amine buffer catalysts modulate TET activity to 5hmC and 5fC.
  • FIG. 9A shows the effects of 2-Amino-5-methoxybenzoic acid
  • FIG. 9B shows the effects of 2-(Aminomethyl)imidazole diydrochloride on TET oxidation of 5mC to 5hmC/ 5fC.
  • FIG. 10 shows a table depicting the amounts of 5fC and 5caC produced by 5mC oxidation via laccase with TEMPO at two different temperatures, 25°C and 37°C.
  • FIG. 11A shows LC-MS data of the effect of malononitrile on conversion of 5fC to 5fC-M adduct under various buffer conditions.
  • the top graph of FIG. 11 A shows buffer conditions of 40°C for 1 hour
  • the middle graph of FIG. 11A shows buffer conditions of 60°C for 1 hour
  • the bottom graph of FIG. 11 A shows buffer conditions of 95°C for 10 minutes.
  • FIG. 11B shows data showing the effect of pre- denaturation with NaOH on the activity of maloninitrile.
  • FIG. 12 shows LC-MS data showing the oxidation of 5hmC in as little as 22 hours in Cu 2,2,6,6-tetramethylpiperidine-l-oxyl (CuTEMPO).
  • the top graph of FIG. 12 shows the oxidation of 5hmC in CuTEMPO, and the bottom graph of FIG. 12 shows the derivatization of the products from the top graph with DMEAH.
  • FIG. 13A shows the reaction of 5fC-M adduct conversion to Thymine (T), which is mediated by the activity of polymerase enzymes.
  • FIG. 13B shows the composition of the optimized buffer (“DOE_l”) for polymerases.
  • FIG. 13C shows the conversion of 5fC-M adduct to T using standard buffer (“BufferA”) and an optimized buffer (“DOE_l”).
  • Unmethylated cytosines (C) would read as thymine (T) after reacting with bisulfite, while methylated cytosines (5mC and 5hmC) would read as C.
  • bisulfite treatment leads to degradation of large portion of sample nucleic acid making it unsuitable for applications requiring high sensitivity.
  • the method is unsuitable for latest applications analyzing cell-free nucleic acid such as cell-free DNA.
  • Recently, less harsh methods for the detection of methylated cytosines have been disclosed. The newest methods involve modification of the methylated cytosines instead of the unmethylated cytosines, as is the case with bisulfite treatment Liu, Y., et al.
  • CAPS Chemically Assisted Picoline- borane Sequencing
  • KRuCL potassium perruthenate
  • Oxidative Bisulfite Sequencing or oxBS-seq see Booth M.
  • the 5fC obtained by potassium perruthenate conversion becomes a favorable target for further processing by e.g. borane treatment or any other downstream method.
  • Yet another sequencing technique is an alternative to the reduction of
  • 5fC with borane This method involves forming an adduct of 5fC recognized as T.
  • the adduct is formed with the use of malononitrile, see Zhu C., et al., (2017) Single- Cell 5-Formylcytosine Landscapes of Mammalian Early Embryos and ESCs at Single- Base Resolution, Cell Stem Cell, 20:720-731.e5.
  • TAPS, CAPS and the malononitrile method of Zhu et al. are superior to bisulfite method in that they avoid the harsh chemical treatment and the resulting loss of sample nucleic acids.
  • the newer methods have a disadvantage of taking a very long time to complete or require high temperatures: 3h at 70°C or 16h at 37°C for borane reactions of TAPS (see Liu et al., Nature Biotech. 37, pages 424- 429(2019) or 1-2 days to form the malononitrile adduct (see U.S. Patent No. 10,519,184 and application Pub. No. US20200165661).
  • the instant disclosure comprises improved and more practical methods of detecting cytosine methylation in nucleic acids.
  • the invention is a method of detecting an epigenetic modification, specifically, cytosine methylation in nucleic acids.
  • the state of the art methods of detecting methylated cytosines in nucleic acids include the following key steps: 1) oxidation of methylated cytosine; 2) reduction of the oxidized product into a form capable of being read as thymine (T) during sequencing; 3) sequencing the nucleic acids; and 4) comparing the treated and untreated sequences wherein a change from a cytosine (T) to a thymine (T) in the sequence read indicated the presence of a methylated cytosine.
  • the instant invention comprises several useful improvements to the general scheme set forth above.
  • the invention is a method comprising an improved step 1) oxidizing methylated cytosine, and steps 2) -4) performed according to the state of the art. In some embodiments, the invention is a method comprising an improved step 2) reduction of the oxidized product, and steps 1), 3) and 4) performed according to the state of the art. In some embodiments, the invention is a method comprising an improved step 1) oxidizing methylated cytosine, an improved step 2) reduction of the oxidized product, and steps 3) and 4) performed according to the state of the art.
  • the present invention involves a method of manipulating nucleic acids from a sample.
  • the sample is derived from a subject or a patient.
  • the sample may comprise a fragment of a solid tissue or a solid tumor derived from the subject or the patient, e.g., by biopsy.
  • the sample may also comprise body fluids that may contain nucleic acids (e.g., urine, sputum, serum, blood or blood fractions, i.e., plasma, lymph, saliva, sputum, sweat, tear, cerebrospinal fluid, amniotic fluid, synovial fluid, pericardial fluid, peritoneal fluid, pleural fluid, cystic fluid, bile, gastric fluid, intestinal fluid, or fecal samples) .
  • the sample is a cultured sample, e.g., a tissue culture containing cells and fluids from which nucleic acids may be isolated.
  • the nucleic acids of interest in the sample come from infectious agents such as viruses, bacteria, protozoa or fungi.
  • the present invention involves manipulating isolated nucleic acids isolated or extracted from a sample. Methods of nucleic acid extraction are well known in the art. See J. Sambrook et al., "Molecular Cloning: A Laboratory Manual," 1989, 2nd Ed., Cold Spring Harbor Laboratory Press: New York, N.Y.).
  • kits are commercially available for extracting nucleic acids (DNA or RNA) from biological samples (e.g., KAPA Express Extract (Roche Sequencing Solutions, Pleasanton, Cal.) and other similar products from BD Biosciences Clontech (Palo Alto, Cal.), Epicentre Technologies (Madison, Wise.); Gentra Systems, (Minneapolis, Minn.); and Qiagen (Valencia, Cal.), Ambion (Austin, Tex.); BioRad Laboratories (Hercules, Cal.); and more.
  • nucleic acids are extracted, separated by size and optionally, concentrated by epitachophoresis as described e.g., in WO2019092269 and W02020074742.
  • the present invention involves detecting epigenetic modification in nucleic acids.
  • the nucleic acid sequences that are subject to conditional epigenetic modification are the target sequences analyzed by the method disclosed herein.
  • the same nucleic acid sequence may or may not have the epigenetic modification characterized by methylation of cytosines at the 5-position (5mC or 5hmC) .
  • a set or a panel of target nucleic acids are probed for the presence of methylation. For example, as shown in Patai AV, et al. (2015) Comprehensive DNA Methylation Analysis Reveals a Common Ten-Gene Methylation Signature in Colorectal Adenomas and Carcinomas.
  • the entire genome of an organism is probed for the presence of methylation.
  • the method of the instant invention includes detecting methylation in all sites throughout the genome of an organism to diagnose a disease or condition or predisposition to a disease or condition using the sequence analysis and artificial intelligence tools described e.g, in Shull AY, et al., (2015) Sequencing the cancer methylome. Methods Mol Biol. 1238:627-5. [0084] In some embodiments, it is desired to separately detect or distinguish
  • 5hmC and 5hmC in a sample are blocked from oxidation and is not converted to a compound read as T during sequencing.
  • the blocking process takes advantage of the reactive hydroxyl group present on 5hmC but not 5mC.
  • the blocking group added to 5hmC is a sugar moiety.
  • the sugar moiety is a modified or unmodified glucose moiety and 5-glucosyl-hydroxymethyl cytosine (5ghmC) is formed. 5ghmC does not undergo adduct formation or reduction with borane derivatives according to the scheme known for 5fC and 5hmC.
  • addition of the blocking group is catalyzed by a glycosyltransferase, e.g., a glucosyltransferase.
  • a glycosyltransferase e.g., a glucosyltransferase.
  • 5hmC in nucleic acid is reacted with a modified glucose in the presence of a beta-glucosyltransferase.
  • the modified glucose is UDP-glucose and the catalyst is a bacteriophage T4 beta-glucosyltransferase (T4 BGT).
  • the method includes a step of oxidizing methylated cytosines for downstream detection.
  • the method includes a step of converting 5-methyl cytosine (5mC) and/or 5-hydroxymethyl cytosine (5hmC) into 5-formyl cytosine (5fC) or 5-carboxyl cytosine (5caC) or a mixture of 5fC and 5caC.
  • the invention comprises a step of contacting a sample or a reaction ixture with a ten-eleven-translocation (TET) dioxygenase as described e.g., in the U.S. Patent No. 9,115,386 or U.S. Application Pub. No. US20200370114.
  • TET ten-eleven-translocation
  • the TET enzyme is selected from TET1, TET2, TET3 and a related protein CXXC4.
  • TET is selected from mouse TET1, TET2 or TET3 (mTETl, 2 or 3), human TET1, TET2 or TET3 (hTETl, 2 or 3), Naegleria TET (NgTET), Coprinopsis cinerea (CcTET) or any other analog or equivalent thereof with similar or equivalent enzymatic activity.
  • the invention utilizes a step of converting
  • the invention utilizes a step of converting 5mC and 5hmC in the sample into predominantly or exclusively 5fC. In some embodiments, the invention utilizes a step of converting 5mC and 5hmC in the sample into a mixture of 5fC and 5caC.
  • the invention comprises a step of converting
  • the invention comprises a step of converting 5mC and 5hmC in the sample into 5fC by contacting the sample with TET and 5-100 mM of a Fe(II) ions at pH 7-8.
  • the invention utilizes a step of converting 5mC and 5hmC in the sample into 5fC by incubation with TET and 5-10 mM of a Fe(II) ions at pH 8. In some embodiments, the invention utilizes a step of converting 5mC and 5hmC in the sample into 5fC by incubation with TET and 80-100 mM of a Fe(II) ions at pH 7. [0088] In some embodiments, the invention comprises a step of converting
  • 5mC and 5hmC in the sample into predominantly or exclusively 5fC by contacting the sample with TET and Fe(II) ions in the presence of ascorbic acid, alpha- ketoglutarate and a reducing agent.
  • the invention utilizes a step of converting
  • the invention comprises improved steps of detecting methylated cytosines in nucleic acids by detecting 5-formyl cytosine (5fC) nucleotide in a nucleic acid, wherein the 5fC is formed by one of the methods described herein above.
  • 5fC 5-formyl cytosine
  • the method involves contacting a sample containing a nucleic acid comprising 5fC with an improved composition comprising a compound of formula R — CH 2 — CN in improved solvent composition, the compound being capable of reacting with 5fC in the nucleic acid to form an adduct according to the reaction scheme wherein, Ri is an electron- withdrawing group selected from substituted or unsubstituted cyano, nitro, formyl, carbonyl compound, wherein the substitution is selected from C1-C30 linear or branched alkyl, C1-C30 linear or branched alkenyl, C1-C30 linear or branched alkynyl, cycloalkyl, aryl, or heteroaryl.
  • Ri is an electron- withdrawing group selected from substituted or unsubstituted cyano, nitro, formyl, carbonyl compound, wherein the substitution is selected from C1-C30 linear or branched alkyl, C1-C30 linear or branched alkenyl,
  • Rl is a cyano group (CN) and the reactant is malononitrile.
  • the instant invention provides an improved composition of the reaction mixture which improves on the Yi method by enabling the reaction to proceed for less than 3 hours wherein at least 90% of 5fC has formed the adduct. In some embodiments, the reaction proceeds for only 1 hour with at least 90% of 5fC forming the adduct. By contrast, the Yi reaction requires no less than 20 hours and up to 48 hours (see US20200165661, Examples).
  • the improvement over the prior art involves conducting the reaction between 5fC and the compound of formula Ri — CH 2 — CN in a solution comprising an organic acid moiety.
  • the organic acid has a formula R- COOH and R is selected from C1-C30 linear or branched alkyl, C1-C30 linear or branched alkenyl, C1-C30 linear or branched alkynyl, cycloalkyl, aryl or heteroaryl.
  • the reaction takes place in the presence of acetic acid.
  • the concentration of the organic acid in the reaction is between 1% and 30%, e.g., 1%, 2%, 3%, 4%, 5%, 6%, 7%, 8%, 9%, 10%, 11%, 12%, 13%, 14%, 15%, 16%, 17%, 18%, 18%, 19%, 20%, 21%, 22%, 23%, 24%, 25%, 26%, 27%, 28%, 29%, or 30%.
  • the improvement over the prior art involves conducting the reaction between 5fC and the compound of formula R — CH 2 — CN in a non-aqueous solvent.
  • the non-aqueous solvent has a formula R-OH wherein R is selected from C1-C30 linear or branched alkyl, C1-C30 linear or branched alkenyl, C1-C30 linear or branched alkynyl, cycloalkyl, aryl or heteroaryl.
  • the reaction takes place in methanol or ethanol.
  • the reaction takes place in 10%-100% methanol or ethanol.
  • the reaction takes place in 90% or more of methanol or ethanol.
  • the compound of formula R x NH y is a primary, secondary, or tertiary amine with aliphatic or aromatic groups.
  • reaction takes place in the presence of triethanolamine .
  • the improvement over the prior art involves conducting the reaction between 5fC and the compound of formula Ri — CH 2 — CN simultaneously with TET oxidation to enable a simplified, one-tube workflow.
  • TET and malononitrile are added simultaneously and oxidation to 5fC and 5fC-malononitrile adduct formation take place in the same tube.
  • the invention comprises improved steps of detecting methylated cytosine in nucleic acids by forming and detecting 5- carboxylcytosine (5caC) and 5-formylcytosine (5fC), wherein 5fC, 5caC or a mixture of 5fC and 5caC are formed by one of the methods described herein above.
  • the method involves contacting a sample containing a nucleic acid comprising 5fC and/or 5caC with an improved composition comprising a borane derivative in improved solvent composition, the borane derivative being capable of reacting with 5caC and with lesser efficiency with 5fC in the nucleic acid to form dihydrouracil
  • borane derivatives include 2-picoline borane (pic-borane), pyridine borane, tert-butylamine borane, ethylenediamine borane and dimethylamine borane as described by Song and Liu in WO2019136413 (TET- Assisted Picoline borane Sequencing or TAPS).
  • TAPS borane reaction as described by Liu et al, requires no less than 3 hours at 70°C or 16 hours and 37°C (see WO2019136413, Examples: Borane Reduction).
  • the improvement over the prior art involves conducting the reaction between 5fC or 5caC and the borane derivative in a solution comprising an organic acid moiety.
  • the organic acid has a formula R-COOH and R is selected from C 1 -C30 linear or branched alkyl, C 1 -C30 linear or branched alkenyl,
  • reaction takes place in the presence of acetic acid.
  • the improvement over the prior art involves conducting the reaction between 5fC or 5caC and the borane derivative in a non- aqueous solvent.
  • the non-aqueous solvent has a formula R-OH wherein R is selected from C1-C30 linear or branched alkyl, C1-C30 linear or branched alkenyl, C1-C30 linear or branched alkynyl, cycloalkyl, aryl or heteroaryl.
  • the reaction takes place in methanol or ethanol. In some embodiments, the reaction takes place in 90% or more of methanol or ethanol.
  • sequencing is by a next-generation massively parallel sequencing process. Sequencing results in a test sequence wherein the adduct or DHU are read as thymine (T), i.e., the sequencing polymerase is able to accommodate the adduct or DHU in the strand being copied, and to incorporate an adenine (A) opposite the adduct or DHU.
  • T thymine
  • A adenine
  • the method further comprises a step of comparing the test sequence with a reference sequence, wherein a change from a cytosine (C) in the reference sequence to a thymine (T) in the corresponding position in the test sequence indicates the presence of methylated cytosine in the test nucleic acid.
  • C cytosine
  • T thymine
  • the nucleic acids in the sample are amplified prior to sequencing.
  • amplification utilizes a B -family polymerase efficiently incorporating an adenine (A) nucleotide opposite the malononitrile adduct or DHU.
  • the sequencing may proceed with any polymerase suitable for the sequencing process as the adduct or DHU have already been recognized as T by the amplification polymerase.
  • the nucleic acid in the sample is ligated to adaptors, wherein adaptors comprise elements useful in amplification and sequencing.
  • An adaptor comprises at least one of the following: barcode, primer binding site and ligation site.
  • the invention is an improved method of detecting a methylated cytosine nucleotide in a nucleic acid, the method comprising: (i) ligating adaptors to a nucleic acid in a sample wherein adaptors comprise amplification primer binding sites; (ii) forming a reaction mixture by contacting the sample containing adaptor-ligated nucleic acid with TET capable of converting 5mC in the nucleic acid into 5-formyl cytosine (5fC); (iii) contacting the reaction mixture with a compound of formula Ri — CH 2 — CN capable of reacting with 5fC in the nucleic acid to form an adduct according to the reaction scheme wherein, Ri is an electron- withdrawing group selected from substituted or unsubstituted cyano, nitro, formyl, carbonyl compound, wherein the substitution is selected from C1-C30 linear or branched alkyl, C1-C30 linear or branched alkenyl, C
  • step (iii) the compound of formula Ri — CH 2 — CN is present in a non-aqueous solvent, e.g., ethanol or methanol.
  • a non-aqueous solvent e.g., ethanol or methanol.
  • the compound of formula Ri — CH 2 — CN is present in a solution comprising an organic acid such acetic acid.
  • the compound of formula Ri — CH 2 — CN is present in a solution comprising an amine such as triethanolamine .
  • the invention is an improved method of detecting a methylated cytosine nucleotide in a nucleic acid, the method comprising: (i) ligating adaptors to a nucleic acid in a sample wherein adaptors comprise amplification primer binding sites; (ii) forming a reaction mixture by contacting the sample containing adaptor-ligated nucleic acid with TET capable of converting the methylated cytosine in the nucleic acid into 5-carboxycytosine (5caC) or a mixture of 5-formyl cytosine (5fC) and 5caC; (iii) contacting the reaction mixture with a borane derivative capable of reacting with 5fC and 5caC in the nucleic acid to form DHU (iv) incubating the reaction mixture for no more than about 1 hour wherein at least 90% of 5fC and 5caC has formed DHU; (v) amplifying the adapted nucleic acids utilizing a DNA polymerase and
  • the borane derivative in steps (iii) and (iv) is present in a non-aqueous solvent, e.g., ethanol or methanol. In some embodiments, in steps (iii) and (iv) the borane derivative is present in a solution comprising an organic acid such acetic acid. [00104] In some embodiments, the invention is a single-tube method of detecting methylation in nucleic acids.
  • the method comprises: (i) ligating adaptors to a nudeic acid in a sample wherein adaptors comprise amplification primer binding sites; (ii) forming a reaction mixture by contacting the sample containing adaptor-ligated nucleic acid with TET capable of converting 5mC in the nucleic acid into 5-formyl cytosine (5fC) or a mixture of 5- carboxy cytosine (5caC) and 5fC and simultaneously, contacting the same reaction mixture with a compound of formula R — CH 2 — CN capable of reacting with 5fC in the nucleic acid to form an adduct according to the reaction scheme wherein, Ri is an electron- withdrawing group selected from substituted or unsubstituted cyano, nitro, formyl, carbonyl compound, wherein the substitution is selected from C1-C30 linear or branched alkyl, C1-C30 linear or branched alkenyl, C1-C30 linear or branched alkyny
  • the conditions are optimized to improve performance of TET.
  • the reaction mixture comprises a non-aqueous solvent, e.g., ethanol or methanol.
  • the reaction mixture comprises an organic acid such acetic acid.
  • the reaction mixture comprises an organic amine such triethanolamine.
  • the invention includes a step of amplifying nucleic acids.
  • amplification occurs prior to the sequencing step.
  • amplification occurs after the step of forming an adduct of 5fC and malononitrile.
  • amplification occurs after the step of reduction of oxidized methylated cytosine with a borane derivative.
  • amplification occurs prior to the target enrichment step.
  • the amplification utilizes an upstream primer and a downstream primer.
  • both primers are target specific primers, i.e., primers comprising a sequence complementary to the target sequence of the methylation biomarker.
  • one or both primers are universal primers.
  • universal primer binding sites are present in adaptors ligated to the target sequenced as described herein.
  • a universal primer binding site is present in the 5’-region (tail) of a target-specific primer. Accordingly, after one or more rounds of primer extension with a tailed target-specific primer, a universal primer may be used for subsequent rounds of amplification.
  • a universal primer in paired with another universal primer (of the same or different sequence). In other embodiments, a universal primer is paired with a target-specific primer.
  • the invention involves a nucleic acid polymerase.
  • Nucleic acid polymerases used in amplification and sequencing are known and commercially available from multiple sources.
  • the instant invention involves copying a strand comprising a 5fC adduct formed as described herein. Such copying requires a polymerase accommodating the 5fC adduct.
  • the polymerase is a B-family polymerase.
  • the polymerases is able to copy a strand comprising a 5fC adduct by recognizing the adduct as T (i.e., incorporating an A opposite the adduct).
  • Polymerases able to accommodate the 5fC adduct described herein include DNA polymerases known to accommodate uracil (U) in a DNA strand.
  • the polymerase may be a naturally-occurring or an engineered polymerase.
  • the polymerase is isolated from hyperthermophilic archaea e.g., genus Pyrococcus (. e.g ., Pyrococcus furious) or genus Thermus (e.g., Thermus aquaticus).
  • the polymerase is isolated from mesophilic archaea, e.g., genus Metanosarcina (e.g, Methanosarcina acetivorans).
  • engineered uracil- tolerant polymerases include KAPA HiFi Uracil+ DNA polymerase (Roche Sequencing Solutions, Desion, Cal.), Takara Terra (Takara Bio USA, Mountain View, Cal.), and EpiMark ® Hot Start Taq DNA polymerase (New England Biolabs, Waltham, Mass.).
  • the DNA polymerase is a type A DNA polymerase (DNA-dependent DNA polymerase). Some DNA polymerases possess limited terminal transferase activity (Taq polymerase adding a single dA at the 3’- end of the copy strand). Other DNA polymerases do not possess detectable terminal transferase activity. In such embodiments, a separate terminal transferase enzyme is used to add non-templated nucleotides to the 3’-end of the copy strand.
  • DNA-dependent DNA polymerase DNA-dependent DNA polymerase.
  • the DNA polymerase is a Hot Start polymerase or a similar conditionally activated polymerase.
  • a thermostable DNA polymerase is used, for example polymerase is a Taq or Taq-derived polymerase (e.g., KAPA 2G polymerase from KAPA Biosystems, Wilmington, Mass.).
  • the invention utilizes an adaptor added to one or both ends of a nucleic acid or nucleic acid strand.
  • Adaptors of various shapes and functions are known in the art (see e.g., PCT/EP2019/05515 filed on February 28, 2019, US8822150 and US8455193).
  • the function of an adaptor is to introduce desired elements into a nucleic acid.
  • the adaptor-borne elements include at least one of nucleic acid barcode, primer binding site or a ligation-enabling site.
  • the adaptor may be double-stranded, partially single stranded or single stranded.
  • a Y -shaped, a hairpin adaptor or a stem -loop adaptor is used wherein the double-stranded portion of the adaptor is ligated to the double stranded nucleic acid formed as described herein.
  • the adaptor molecules are in vitro synthesized artificial sequences. In other embodiments, the adaptor molecules are in vitro synthesized naturally-occurring sequences. In yet other embodiments, the adaptor molecules are isolated naturally occurring molecules or isolated non naturally- occurring molecules.
  • the double-stranded or partially double-stranded adaptor oligonucleotide can have overhangs or blunt ends.
  • the double-stranded DNA may comprise blunt ends to which a blunt-end ligation can be applied to ligate a blunt-ended adaptor.
  • the blunt ended DNA undergoes A-tailing where a single A nucleotide is added to the blunt ends to match an adaptor designed to have a single T nucleotide extending from the blunt end to facilitate ligation between the DNA and the adaptor.
  • kits for performing adaptor ligation include AVENIO ctDNA Library Prep Kit or KAPA HyperPrep and HyperPlus kits (Roche Sequencing Solutions, Pleasanton, CA).
  • the adaptor ligated (adapted) DNA may be separated from excess adaptors and unligated DNA.
  • the invention includes the use of a barcode.
  • the method of detecting epigenetic modifications includes sequencing.
  • the nucleic acid processed as described herein is subjected to sequencing; preferably, massively parallel single molecule sequencing. Analyzing individual molecules by massively parallel sequencing typically requires a separate level of barcoding for sample identification and error correction.
  • the use of molecular barcodes such as described in U.S. Patent Nos. 7,393,665, 8,168,385, 8,481,292, 8,685,678, and 8,722,368.
  • a unique molecular barcode is added to each molecule to be sequenced to mark molecule and its progeny (e.g., the original molecule and its amplicons generated by PCR).
  • the unique molecular barcode has multiple uses including counting the number of original target molecules in the sample and error correction (Newman, A., et al, (2014) An ultrasensitive method for quantitating circulating tumor DN A with broad patient coverage, Nature Medicine doi:10.1038/nm.3519).
  • unique molecular barcodes are used for sequencing error correction.
  • the entire progeny of a single target molecule is marked with the same barcode and forms a barcoded family.
  • a variation in the sequence not shared by all members of the barcoded family is discarded as an artefact.
  • Barcodes can also be used for positional deduplication and target quantification, as the entire family represents a single molecule in the original sample (Newman, A., et al, (2016) Integrated digital error suppression for improved detection of circulating tumor DNA, Nature Biotechnology 34:547).
  • the adaptor ligated to one or both ends of the barcoded target nucleic acid comprises one or more barcodes used in sequencing.
  • a barcode can be a UID or a multiplex sample ID (MID or SID) used to identify the source of the sample where samples are mixed (multiplexed).
  • the barcode may also be a combination of a UID and an MID.
  • a single barcode is used as both UID and MID.
  • each barcode comprises a predefined sequence. In other embodiments, the barcode comprises a random sequence.
  • the barcodes are between about 4-20 bases long so that between 96 and 384 different adaptors, each with a different pair of identical barcodes are added to a human genomic sample.
  • the number of UIDs in the reaction can be in excess of the number of molecules to be labelled. A person of ordinary skill would recognize that the number of barcodes depends on the complexity of the sample (i.e., expected number of unique target molecules) and would be able to create a suitable number of barcodes for each experiment.
  • the method involves forming a library comprising nucleic acids from a sample.
  • the library consists of a plurality of nucleic acids ready for sequencing or another type of detection method, e.g., PCR.
  • a library can be stored and used multiple times for further processing such as amplification or sequencing of the nucleic acids in the library.
  • the library is the input nucleic acid in which methylation is detected by the method described herein.
  • the library is formed from nucleic acids that have undergone the methylation detection reactions described herein.
  • the nucleic acids processed for detection of epigenetic modifications according to the method described herein are sequenced. Any of a number of sequencing technologies or sequencing assays can be utilized.
  • the term "Next Generation Sequencing (NGS)” as used herein refers to sequencing methods that allow for massively parallel sequencing of clonally amplified molecules and of single nucleic acid molecules.
  • Non-limiting examples of sequence assays that are suitable for use with the methods disclosed herein include nanopore sequencing (U.S. Pat. Publ. Nos. 2013/0244340, 2013/0264207, 2014/0134616, 2015/0119259 and 2015/0337366), Sanger sequencing, capillary array sequencing, thermal cycle sequencing (Sears et al, Biotechniques, 13:626-633 (1992)), solid-phase sequencing (Zimmerman et al, Methods Mol.
  • sequencing with mass spectrometry such as matrix-assisted laser desorption/ionization time-of-flight mass spectrometry (MALDI-TOF/MS; Fu et al, Nature Biotech., 16:381-384 (1998)), sequencing by hybridization (Drmanac et al., Nature Biotech., 16:54-58 (1998), and NGS methods, including but not limited to sequencing by synthesis (e.g., HiSeq TM , MiSeq", or Genome Analyzer, each available from Illumina), sequencing by ligation (e.g., SOLiD TM , Life Technologies), ion semiconductor sequencing (e.g., Ion Torrent TM , Life Technologies), and SMRT ® sequencing (e.g., Pacific Biosciences).
  • MALDI-TOF/MS matrix-assisted laser desorption/ionization time-of-flight mass spectrometry
  • MALDI-TOF/MS matrix-assisted laser desorption/ionization time-
  • sequencing-by-hybridization platforms from Affymetrix Inc. (Sunnyvale, Calif.), sequencing-by-synthesis platforms from Illumina/Solexa (San Diego, Calif.) and Helicos Biosciences (Cambridge, Mass.), sequencing-by-ligation platform from Applied Biosystems (Foster City, Calif.).
  • Other sequencing technologies include, but are not limited to, the Ion Torrent technology (ThermoFisher Scientific), and nanopore sequencing (Genia Technology from Roche Sequencing Solutions, Santa Clara, Cal.), and Oxford Nanopore Technologies (Oxford, UK).
  • the sequencing step involves sequence aligning.
  • aligning is used to determine a consensus sequence from a plurality of sequences, e.g., a plurality having the same unique molecular ID (UID).
  • the molecular ID is a barcode that can be added to each molecule prior to sequencing or if amplification step is included, prior to the amplification step.
  • a UID is present in the 5’-portion of the RT primer.
  • a UID can be present in the 5’ -end of the last barcode subunit to be added to the compound barcode.
  • a UID is present in an adaptor and is added to one or both ends of the target nucleic acid by ligation.
  • a consensus sequence is determined from a plurality of sequences all having an identical UID.
  • the sequenced having an identical UID are presumed to derive from the same original molecule through amplification.
  • UID is used to eliminate artifacts, i.e., variations existing in the progeny of a single molecule (characterized by a particular UID). Such artifacts resulting from PCR errors or sequencing errors can be eliminated using UIDs.
  • the number of each sequence in the sample can be quantified by quantifying relative numbers of sequences with each UID among the population having the same multiplex sample ID (MID).
  • Each UID represents a single molecule in the original sample and counting different UIDs associated with each sequence variant can determine the fraction of each sequence variant in the original sample, where all molecules share the same MID.
  • a person skilled in the art will be able to determine the number of sequence reads necessary to determine a consensus sequence. In some embodiments, the relevant number is reads per UID (“sequence depth”) necessary for an accurate quantitative result. In some embodiments, the desired depth is 5-50 reads per UID.
  • the invention is a kit including components and tools for performing an improved method of detecting DNA methylation described herein.
  • the kit includes components for detecting cytosine methylation in nucleic acids by detecting a product of in vitro oxidized 5- methyl cytosine (5mC) or 5-hyrdoxymethyl cytosine (5hmC).
  • the product is 5-formyl cytosine (5fC) or 5-carboxy cytosine (5caC).
  • the kit further includes components for performing in vitro oxidation of 5-methyl cytosine (5mC) or 5-hyrdoxymethyl cytosine (5hmC) to 5- formyl cytosine (5fC) or 5-carboxy cytosine (5caC).
  • 5-methyl cytosine (5mC) or 5-hyrdoxymethyl cytosine (5hmC) to 5- formyl cytosine (5fC) or 5-carboxy cytosine (5caC).
  • the kit includes a borane derivative and a non- aqueous solvent.
  • the borane derivative is selected from pyridine borane, 2-picoline borane (pic-BH3), borane, sodium borohydride, sodium cyanoborohydride, and sodium triacetoxyborohydride, while the non-aqueous solvent is selected from ethanol and methanol.
  • the kit instead of including the non-aqueous solvent, includes the borane derivative and instructions on using the non- aqueous solvent (such as ethanol or methanol) with the borane derivative in a method of detecting DNA methylation as described herein.
  • the kit further includes an organic acid.
  • the kid includes instructions on using the organic acid (such as acetic acid) in a method of detecting DNA methylation including borane derivatives in a non-aqueous solvent as described herein.
  • the kit further includes a buffer such as MES or TRIS.
  • the kit includes malononitrile and a non- aqueous solvent.
  • the non-aqueous solvent is selected from ethanol and methanol.
  • the kit instead of including the non-aqueous solvent, includes instructions on using the non-aqueous solvent (such as ethanol or methanol) in a method of detecting DNA methylation with malononitrile as described herein.
  • the kit further comprises an organic acid and a primary, a secondary or a tertiary amine.
  • the organic acid may be acetic acid and the amine may be triethanolanime.
  • the kit includes instructions on using the organic acid and the amine (such as acetic acid and triethanolamine) in a method of detecting DNA methylation with malononitrile as described herein.
  • the kit further includes a buffer such as MES or TRIS.
  • the kit further includes TET enzyme for in vitro oxidation of 5-methyl cytosine (5mC) or 5-hyrdoxymethyl cytosine (5hmC) to 5-carboxy cytosine (5caC).
  • TET is selected from mouse TET1, TET2 or TET3 (mTETl, 2 or 3), human TET1, TET2 or TET3 (hTETl, 2 or 3), Naegleria TET (NgTET), Coprinopsis cinerea (CcTET).
  • TET is Naegleria TET -like oxygenase (NgTETl).
  • TET is a wild-type protein.
  • TET is a mutant protein.
  • the kit further includes one or more co-factors selected from alpha- ketoglutarate and a source of Fe(II) ions.
  • the kit includes a chemical oxidative agent is included, e.g., potassium perruthenate (KRu0 4 ) or potassium ruthenate (K 2 Ru0 4 ).
  • the kit further indudes reagents for chemically blocking 5hmC from undergoing reactions that include 5mC.
  • the kit includes a glucose compound and a glucosyltransferase capable of transferring the glucose moiety to the 5-hydroxyl moiety of 5hmC.
  • the kit includes a beta-glucosyltransferase (BGT) and a UDP-glucose.
  • the BGT is T4 BGT.
  • the method further comprises assessment of a status of a subject (e.g., a patient) based on the methylation status of one or more genetic loci in the patient’s genome.
  • the method comprises determining in the patient’s sample, the genomic location and optionally, amount of methylated cytosines (5mC and/or 5hmC) in the genome.
  • genetic loci known to be biomarkers of disease are assessed for methylation.
  • the method further comprises diagnosis of disease or condition in the patient or selecting or changing a treatment based on the presence or amount of methylation in the nucleic acid isolated from the patient.
  • the method may further include identifying a tissue of origin of the methylated DNA present in the sample. In some embodiments, the method further includes identifying a tissue of origin of cell-free DNA isolated from blood. In another aspect of this embodiment, the invention includes detection of organ failure or organ injury, including organ transplant rejection in a transplant recipient using methylation patterns of cell-free DNA. The invention includes detecting circulating cell-free DNA with the organ- specific methylation pattern, wherein the presence of such cell-free DNA indicates organ transplant rejection. In some embodiments, the invention includes monitoring for transplant rejection by periodically sampling circulating cell -free DNA and measuring changes in the level of cell -free DNA with the organ-specific methylation pattern, wherein an increase in the level of such cell-free DNA indicates organ transplant rejection.
  • the invention includes a method of diagnosis or screening for the presence of a cancerous tumor in a patient or subject.
  • the invention includes detection of a tumor using methylation patterns of cell-free DNA using the methylation detection methods disclosed herein.
  • the invention includes detecting a tumor originating from a particular tissue or organ by detecting circulating cell-free DNA with the tissue or organ-specific methylation pattern detected using the methylation detection methods disclosed herein, wherein the presence of such cell-free DNA indicates the presence of a tumor originating from the tissue or organ.
  • the invention includes monitoring the growth or shrinkage of a tumor by periodically sampling circulating cell-free DNA and measuring changes in the level of cell-free DNA with the tumor-specific methylation pattern, wherein an increase in the level of such cell-free DNA indicates tumor growth, while a decrease in the level of such cell-free DNA indicates tumor shrinkage.
  • the invention includes a method of monitoring the effectiveness of treatment of cancer in a patient or subject.
  • the invention includes detection of tumor dynamics correlated with treatment using methylation patterns of cell-free DNA detected using the methylation detection methods disclosed herein.
  • the invention includes detecting effects of treatment on a tumor originating from a particular tissue or organ by periodically sampling circulating cell-free DNA and measuring changes in the level of cell-free DNA with the tissue or organ-specific methylation pattern, wherein an increase in the level of such cell-free DNA indicates tumor growth and ineffectiveness of treatment, while a decrease in the level of such cell-free DNA indicates tumor shrinkage and effectiveness of treatment, and a stable level of such cell-free DNA indicates stable disease and effectiveness of treatment.
  • the invention includes a method of diagnosis or minimal residual disease (MRD) in a cancer patient following a treatment.
  • MRD minimal residual disease
  • National Cancer Institute defines MRD as a very small number of cancer cells that remain in the body during or after treatment when the patient has no signs or symptoms of the disease.
  • the invention includes a method of detecting MRD using methylation patterns of cell-free DNA detected using the methylation detection methods disclosed herein.
  • the invention includes detecting MRD from tumor originating from a particular tissue or organ by detecting circulating cell-free DNA with the tissue or organ-specific methylation pattern, wherein the presence of such cell-free DNA indicates the presence of MRD from the tumor.
  • the invention includes a method of diagnosis or screening for the presence or status of an autoimmune disease in a patient or subject.
  • the invention includes detection of an autoimmune disease using methylation patterns of cell-free DNA detected using the methylation detection methods disclosed herein.
  • the invention includes detecting autoimmune disease characterized by damage to a particular tissue or organ by detecting circulating cell-free DNA with the tissue or organ-specific methylation pattern, wherein the presence of such cell-free DNA indicates organ damage resulting from the autoimmune disease and the presence of the autoimmune disease.
  • the invention includes monitoring for flare-ups or remission of an autoimmune disease by periodically sampling circulating cell-free DNA and measuring changes in the level of cell-free DNA with the tissue or organ-specific methylation pattern, wherein an increase in the level of such cell-free DNA indicates increased organ damage and a flare-up of the autoimmune disease, while a decrease in the level of such cell-free DNA indicates decreased organ damage and remission of the autoimmune disease.
  • 5-Methyl cytosine and 5-hydroxymethyl cytosine are important epigenetic biomarkers with many clinical applications in oncology, prenatal testing and other fields.
  • base-level detection of methylation was achieved by reacting unmethylated cytosines with bisulfite followed by PCR, array hybridization or sequencing.
  • Unmethylated cytosines (C) would read as thymine (T) after reacting with bisulfite, while methylated cytosines (5mC and 5hmC) would read as C.
  • bisulfite treatment leads to degradation of large portion of sample nucleic acid making it unsuitable for applications requiring high sensitivity.
  • the method is unsuitable for latest applications analyzing cell-free nucleic acid such as cell-free DNA.
  • CAPS Chemically Assisted Pyridine- borane Sequencing
  • KRuCh potassium perruthenate
  • Oxidative Bisulfite Sequencing or oxBS-seq see Booth M.
  • Yet another sequencing technique is an alternative to the reduction of 5fC with borane.
  • This method involves forming an adduct of 5fC recognized as T.
  • the adduct is formed with the use of malononitrile, see Zhu C., et al., (2017) Single- Cell 5-Formylcytosine Landscapes of Mammalian Early Embryos and ESCs at Single- Base Resolution, Cell Stem Cell, 20:720-731.e5.
  • All the above methods rely at least in part on oxidizing 5hmC and 5mC into 5fC with TET family enzymes.
  • One other known oxidation technique involves converting 5hmC in into predominantly or exclusively 5fC with Cu(II) compound and 2,2,6,6-tetramethylpiperidine-l-oxyl (TEMPO).
  • the invention is a method of detecting an epigenetic modification, specifically, cytosine methylation in nucleic acids.
  • the state of the art methods of detecting methylated cytosines in nucleic acids include the following key steps: 1) oxidation of methylated cytosine; 2) conversion of the oxidized product into a form capable of being read as thymine (T) during sequencing; 3) sequencing the nucleic acids; and 4) comparing the treated and untreated sequences wherein a change from a cytosine (C) to a thymine (T) in the sequence read indicated the presence of a methylated cytosine.
  • the instant invention comprises a new means of performing step 1) oxidizing methylated cytosine. Following the oxidation step, steps 2) -4) are performed according to the state of the art.
  • the present invention involves a method of manipulating nucleic acids from a sample.
  • the sample is derived from a subject or a patient.
  • the sample may comprise a fragment of a solid tissue or a solid tumor derived from the subject or the patient, e.g., by biopsy.
  • the sample may also comprise body fluids that may contain nucleic acids (e.g., urine, sputum, serum, blood or blood fractions, i.e., plasma, lymph, saliva, sputum, sweat, tear, cerebrospinal fluid, amniotic fluid, synovial fluid, pericardial fluid, peritoneal fluid, pleural fluid, cystic fluid, bile, gastric fluid, intestinal fluid, or fecal samples) .
  • the sample is a cultured sample, e.g., a tissue culture containing cells and fluids from which nucleic acids may be isolated.
  • the nucleic acids of interest in the sample come from infectious agents such as viruses, bacteria, protozoa or fungi.
  • the present invention involves manipulating isolated nucleic acids isolated or extracted from a sample. Methods of nucleic acid extraction are well known in the art. See J. Sambrook et al., "Molecular Cloning: A Laboratory Manual," 1989, 2nd Ed., Cold Spring Harbor Laboratory Press: New York, N.Y.).
  • kits are commercially available for extracting nucleic acids (DNA or RNA) from biological samples (e.g., KAPA Express Extract (Roche Sequencing Solutions, Pleasanton, Cal.) and other similar products from BD Biosciences Clontech (Palo Alto, Cal.), Epicentre Technologies (Madison, Wise.); Gentra Systems, (Minneapolis, Minn.); and Qiagen (Valencia, Cal.), Ambion (Austin, Tex.); BioRad Laboratories (Hercules, Cal.); and more.
  • KAPA Express Extract Roche Sequencing Solutions, Pleasanton, Cal.
  • Other similar products from BD Biosciences Clontech (Palo Alto, Cal.), Epicentre Technologies (Madison, Wise.); Gentra Systems, (Minneapolis, Minn.); and Qiagen (Valencia, Cal.), Ambion (Austin, Tex.); BioRad Laboratories (Hercules, Cal.); and more.
  • nucleic acids are extracted, separated by size and optionally, concentrated by epitachophoresis as described e.g., in WO2019092269 and W02020074742.
  • the present invention involves detecting epigenetic modification in nucleic acids.
  • the nucleic acid sequences that are subject to conditional epigenetic modification are the target sequences analyzed by the method disclosed herein.
  • the same nucleic acid sequence may or may not have the epigenetic modification characterized by methylation of cytosines at the 5-position (5mC or 5hmC) .
  • a set or a panel of target nucleic acids are probed for the presence of methylation. For example, as shown in Patai AV, et al. (2015) Comprehensive DNA Methylation Analysis Reveals a Common Ten-Gene Methylation Signature in Colorectal Adenomas and Carcinomas.
  • a panel of DNA methylation signature from peripheral blood may predict colorectal cancer susceptibility.
  • BMC Cancer 20, 692 methylation of biomarkers in a panel of methylation biomarkers is indicative of the presence of colorectal cancer in the patient. Accordingly, testing any known or future panels of methylation biomarkers for prognostic or diagnostic purposes is envisioned with the method disclosed herein.
  • the entire genome of an organism is probed for the presence of methylation.
  • the method of the instant invention includes detecting methylation in all sites throughout the genome of an organism to diagnose a disease or condition or predisposition to a disease or condition using the sequence analysis and artificial intelligence tools described e.g, in Shull AY, et al., (2015) Sequencing the cancer methylome. Methods Mol Biol. 1238:627-5.
  • two procedures are run in parallel on two aliquots of a sample.
  • 5hmC is blocked while 5mC is detected exclusively by converting to a T equivalent, e.g., by TET and malononitrile procedure described in U.S. Provisional Application Serial No. 63/147,307 filed on February 9, 2021.
  • the blocking of 5hmC takes advantage of the reactive hydroxyl group present on 5hmC but not 5mC.
  • the blocking group added to 5hmC is a sugar moiety.
  • the sugar moiety is a modified or unmodified glucose moiety and 5-glucosyl-hydroxymethyl cytosine (5ghmC) is formed.
  • addition of the blocking group is catalyzed by a glycosyltransferase, e.g., a glucosyltransferase.
  • a glycosyltransferase e.g., a glucosyltransferase.
  • 5hmC in nucleic acid is reacted with a modified glucose in the presence of a beta- glucosyltransferase.
  • the modified glucose is UDP-glucose and the catalyst is a bacteriophage T4 beta-glucosyltransferase (T4 BGT).
  • two procedures are run in parallel on two aliquots of a sample.
  • 5hmC is converted into 5fC using laccase as described herein, and 5fC is detected as T via the malononitrile process.
  • 5mC is not reacting and is detected as C.
  • both 5hmC and 5mC are detected as T without distinction, e.g., by TET and malononitrile procedure described in in U.S. Provisional Application Serial No. 63/147,307 filed on February 9, 2021.
  • the first of the two parallel procedures reveals 5hmC while the second of the two parallel procedures reveals 5hmC plus 5mC.
  • the invention is a method of distinguishing 5- hydroxymethylcytosine (5hmC) from 5-methylcytosine (5mC) in nucleic acids in a sample, the method comprising: (i) separating a sample into two aliquots; (ii) in the first aliquot, contacting the nucleic acid comprising 5mC and 5hmC with a ten- eleven-translocation (TET) dioxygenase under conditions where 5hmC and 5mC are converted into 5-formyl cytosine (5fC); (iii) in the second aliquot, contacting the nudeic acid comprising 5mC and 5hmC with a laccase under conditions where 5hmC is converted into 5-formyl cytosine (5fC); (iv) contacting both aliquots separately with a moiety of formula R — CEE — CN capable of reacting with 5fC in the nucleic acid to form an adduct according to the reaction scheme where
  • the method includes a step of oxidizing methylated cytosines for downstream detection.
  • the method includes a step of converting 5-hydroxymethyl cytosine (5hmC) into 5-formyl cytosine (5fC) with laccase enzyme.
  • the oxidation takes place in the presence of a co-factor.
  • the co-factor is 2, 2,6,6- tetramethylpiperidine-l-oxyl (TEMPO).
  • TEMPO 2, 2,6,6- tetramethylpiperidine-l-oxyl
  • the oxidation takes place at low pH, e.g., pH ⁇ 6. Laccase catalyzes the oxidation of phenol-containing compounds, including lignin, through the reduction of oxygen to water; the presence of mediators allows the oxidation of non-phenolic compounds like benzylic alcohols as well according to the scheme: ( See Catalysis Communications 2020, 135, 105887).
  • oxidoreductases of the laccase type are used.
  • laccase is from a fungal source.
  • the fungal source is selected from Hexagonia tenuis, Pleurotis sajor caju, Pleutoris ostreatus, Xylaria polymorpha, Trametes hirsuta, Trametes versicolor and Coprinus spp.
  • the fungal source is selected from Hexagonia tenuis, Pleurotis sajor caju MTCC-141, Pleutoris ostreatus MTCC-1801, Xylaria polymorpha MTCC-1100, Trametes hirsuta MTCC-1171, Coprinus spp. or any other analog or equivalent thereof with similar or equivalent enzymatic activity such as alcohol dehydrogenases, alcohol oxidases, galactose oxidases, chloroperoxidases and peroxidases.
  • toluene methyl- monooxygenase (EC 1.14.15.26) and P450 monooxygenase (EC 1.14.14.1) are used to convert 5mC to 5hmC.
  • the sample is contacted with laccase in the presence of cofactors.
  • the cofactor is 2, 2,6,6- tetramethylpiperidine-l-oxyl (TEMPO).
  • the cofactors are selected from, natural cofactors for laccase selected from acetosyringone, syringaldehyde, para-coumaric acid and synthetic cofactors for laccase selected from
  • the method comprises a preliminary step of converting 5mC into 5hmC with an enzyme such as TET prior to reacting 5hmC with laccase.
  • TET and laccase are present in the same convenient “one-pot” reaction.
  • TET, laccase and malononitrile are present in the same convenient “one-pot” reaction.
  • the invention further comprises a downstream step of detecting 5-formyl cytosine (5fC) nucleotide in a nucleic acid, wherein the 5fC is formed by the method described herein above.
  • the downstream step involves contacting a sample containing a nucleic acid comprising 5fC with an improved composition comprising a compound of formula R — CH 2 —
  • R is an electron-withdrawing group selected from cyano, nitro, Cl - C6 alkyl carboxylic ester, unsubstituted carboxamide, C1-C6 alkyl mono-substituted and Cl- C6 alkyl di-substituted carboxamide, substituted carbonyl moiety, substituted sulfonyl moiety, wherein the substitution is selected from C1-C6 linear or branched alkyl, C4-C6 cycloalkyl, phenyl, 5- or 6-membered heteroaryl and benzannulated 5- or 6-membered heteroaryl.
  • Rl is a cyano group (CN) and the reactant is malononitrile.
  • 1,3-indandione compounds can be used as the 5fC conversion reagents instead of RI-CH -CN ( see B. Xia et al., Nature Methods 2015, 12(11), 1047-1050). Still alternatively, it is possible to react 5fC with a Wittig reagent.
  • the downstream step involves contacting a sample containing a nucleic acid comprising 5fC with a Wittig reagent in an organic solvent, and then irradiating with ultraviolet light.
  • the products of the reaction are detected using fluorescence recognition technology as described in WO2020155742.
  • the compound of formula Ri — CH 2 — CN is provided in a reaction ixture that enables the reaction to proceed for less than 3 hours wherein at least 90% of 5fC has formed the adduct as described in the U.S. Provisional Application Serial No. 63/147,307 filed on February 9, 2021.
  • the reaction proceeds for only 1 hour with at least 90% of 5fC forming the adduct.
  • the reaction mixture comprises an organic acid moiety.
  • the organic acid has a formula R-COOH and R is selected from C1-C30 linear or branched alkyl, C2-C30 linear or branched alkenyl, C2-C30 linear or branched alkynyl (may comprise heteroatoms such as O and N), aryl or heteroaryl.
  • the reaction takes place in the presence of acetic acid.
  • the concentration of the organic acid in the reaction is between 1% and 30%, e.g., 1%, 2%, 3%, 4%, 5%, 6%, 7%, 8%, 9%, 10%, 11%, 12%, 13%, 14%, 15%, 16%, 17%, 18%, 18%, 19%, 20%, 21%, 22%, 23%, 24%, 25%, 26%, 27%, 28%, 29%, or 30%.
  • the reaction mixture comprises a non-aqueous solvent.
  • the non-aqueous solvent has a formula R-OH wherein R is selected from C1-C3 linear or branched alkyl and may comprise heteroatoms such as O and N.
  • R is selected from C1-C3 linear or branched alkyl and may comprise heteroatoms such as O and N.
  • the reaction takes place in methanol or ethanol.
  • the reaction takes place in 10%-100% methanol or ethanol.
  • the reaction takes place in 90% or more of methanol or ethanol.
  • the compound of formula R x NH y is a primary, secondary, or tertiary amine with aliphatic or aromatic groups.
  • Rx can form with N a 5- or 6-membered cyclic heteroalkyl such as piperidine.
  • reaction takes place in the presence of triethanolamine .
  • sequencing is by a next- generation massively parallel sequencing process. Sequencing results in a test sequence wherein the adduct is read as thymine (T), i.e., the sequencing polymerase is able to accommodate the adduct in the strand being copied, and to incorporate an adenine (A) opposite the adduct.
  • T thymine
  • A adenine
  • the method further comprises a step of comparing the test sequence with a reference sequence, wherein a change from a cytosine (C) in the reference sequence to a thymine (T) in the corresponding position in the test sequence indicates the presence of methylated and / or hydroxymethylated cytosine in the test nucleic acid.
  • the nucleic acids in the sample are amplified prior to sequencing.
  • amplification utilizes a B -family polymerase efficiently incorporating an adenine (A) nucleotide opposite the malononitrile adduct.
  • the sequencing may proceed with any polymerase suitable for the sequencing process as the adduct has already been recognized as T by the amplification polymerase.
  • the nucleic acid in the sample is ligated to adaptors, wherein adaptors comprise elements useful in amplification and sequencing.
  • An adaptor comprises at least one of the following: barcode, primer binding site and ligation site.
  • the invention is an improved method of detecting a methylated and/or hydroxymethylated cytosine nucleotide in a nucleic acid, the method comprising: (i) ligating adaptors to a nudeic acid in a sample wherein adaptors comprise amplification primer binding sites; (ii) forming a reaction mixture by contacting the sample containing adaptor -ligated nucleic acid with laccase capable of converting 5hmC in the nucleic acid into 5fC; (iii) contacting the reaction mixture with a compound of formula R — CH 2 — CN capable of reacting with 5fC in the nucleic acid to form an adduct according to the reaction scheme wherein Ri is an electron- withdrawing group selected from cyano, nitro, Cl - C6 alkyl carboxylic ester, unsubstituted carboxamide, C1-C6 alkyl mono-substituted and Cl- C6 alkyl di-substituted car
  • step (iii) the compound of formula Ri — CH 2 — CN is present in a non-aqueous solvent, e.g., ethanol or methanol.
  • a cofactor for laccase such as e.g., 2,2,6,6-tetramethylpiperidine-l-oxyl (TEMPO).
  • TEMPO 2,2,6,6-tetramethylpiperidine-l-oxyl
  • the sample is contacted with TET to convert 5mC in nucleic acids into 5hmC for reaction with laccase in step (ii).
  • step (iii) the compound of formula R — CH 2 — CN is present in a solution comprising an organic acid such acetic acid. In some embodiments, in step (iii) the compound of formula R — CH 2 — CN is present in a solution comprising an amine such as triethanolamine or piperidine. Alternatively, it is possible to use a Wittig reagent in step (iii).
  • the invention includes a step of amplifying nucleic acids.
  • amplification occurs prior to the sequencing step.
  • amplification occurs after the step of forming an adduct of 5fC and malononitrile.
  • amplification occurs after the step of reduction of oxidized methylated cytosine with a borane derivative.
  • amplification occurs prior to the target enrichment step.
  • the amplification utilizes an upstream primer and a downstream primer.
  • both primers are target specific primers, i.e., primers comprising a sequence complementary to the target sequence of the methylation biomarker.
  • one or both primers are universal primers.
  • universal primer binding sites are present in adaptors ligated to the target sequenced as described herein.
  • an universal primer binding site is present in the 5’-region (tail) of a target-specific primer. Accordingly, after one or more rounds of primer extension with a tailed target-specific primer, an universal primer may be used for subsequent rounds of amplification.
  • an universal primer in paired with another universal primer (of the same or different sequence). In other embodiments, an universal primer is paired with a target-specific primer.
  • the invention involves a nucleic acid polymerase.
  • Nucleic acid polymerases used in amplification and sequencing are known and commercially available from multiple sources.
  • the instant invention involves copying a strand comprising a 5fC adduct formed as described herein. Such copying requires a polymerase accommodating the 5fC adduct.
  • the polymerase is a B-family polymerase.
  • the polymerases is able to copy a strand comprising a 5fC adduct by recognizing the adduct as T (i.e., incorporating an A opposite the adduct).
  • Polymerases able to accommodate the 5fC adduct described herein include DNA polymerases known to accommodate uracil (U) in a DNA strand.
  • the polymerase may be a naturally-occurring or an engineered polymerase.
  • the polymerase is isolated from hyperthermophilic archaea e.g., genus Pyrococcus (. e.g ., Pyrococcus furious ) or genus Thermus (e.g, Thermus aquaticus).
  • the polymerase is isolated from mesophilic archaea, e.g, genus Metanosarcina (e.g, Methanosarcina acetivorans).
  • engineered uracil- tolerant polymerases include KAPA HiFi Uracil+ DNA polymerase (Roche Sequencing Solutions, Desion, Cal.), Takara Terra (Takara Bio USA, Mountain View, Cal.), and EpiMark ® Hot Start Taq DNA polymerase (New England Biolabs, Waltham, Mass.).
  • the DNA polymerase is a type A DNA polymerase (DNA -depen dent DNA polymerase). Some DNA polymerases possess limited terminal transferase activity (Taq polymerase adding a single dA at the 3’- end of the copy strand). Other DNA polymerases do not possess detectable terminal transferase activity. In such embodiments, a separate terminal transferase enzyme is used to add non-templated nucleotides to the 3’-end of the copy strand.
  • the DNA polymerase is a Hot Start polymerase or a similar conditionally activated polymerase.
  • a thermostable DNA polymerase is used, for example the polymerase is a Taq or Taq-derived polymerase (e.g., KAPA 2G polymerase from KAPA Biosystems, Wilmington, Mass.).
  • the invention utilizes an adaptor added to one or both ends of a nucleic acid or nucleic acid strand.
  • Adaptors of various shapes and functions are known in the art (see e.g., PCT/EP2019/05515 filed on February 28, 2019, US8822150 and US8455193).
  • the function of an adaptor is to introduce desired elements into a nucleic acid.
  • the adaptor-borne elements include at least one of nucleic acid barcode, primer binding site or a ligation-enabling site.
  • the adaptor may be double-stranded, partially single stranded or single stranded.
  • a Y -shaped, a hairpin adaptor or a stem -loop adaptor is used wherein the double-stranded portion of the adaptor is ligated to the double stranded nucleic acid formed as described herein.
  • the adaptor molecules are in vitro synthesized artificial sequences. In other embodiments, the adaptor molecules are in vitro synthesized naturally-occurring sequences. In yet other embodiments, the adaptor molecules are isolated naturally occurring molecules or isolated non naturally- occurring molecules.
  • the double-stranded or partially double-stranded adaptor oligonucleotide can have overhangs or blunt ends.
  • the double-stranded DNA may comprise blunt ends to which a blunt-end ligation can be applied to ligate a blunt-ended adaptor.
  • the blunt ended DNA undergoes A-tailing where a single A nucleotide is added to the blunt ends to match an adaptor designed to have a single T nucleotide extending from the blunt end to facilitate ligation between the DNA and the adaptor.
  • kits for performing adaptor ligation include AVENIO ctDNA Library Prep Kit or KAPA HyperPrep and HyperPlus kits (Roche Sequencing Solutions, Pleasanton, CA).
  • the adaptor ligated (adapted) DNA may be separated from excess adaptors and unligated DNA.
  • the invention includes the use of a barcode.
  • the method of detecting epigenetic modifications includes sequencing.
  • the nucleic acid processed as described herein is subjected to sequencing; preferably, massively parallel single molecule sequencing. Analyzing individual molecules by massively parallel sequencing typically requires a separate level of barcoding for sample identification and error correction.
  • the use of molecular barcodes such as described in U.S. Patent Nos. 7,393,665, 8,168,385, 8,481,292, 8,685,678, and 8,722,368.
  • a unique molecular barcode is added to each molecule to be sequenced to mark molecule and its progeny (e.g., the original molecule and its amplicons generated by PCR).
  • the unique molecular barcode has multiple uses including counting the number of original target molecules in the sample and error correction (Newman, A., et al, (2014) An ultrasensitive method for quantitating circulating tumor DNA with broad patient coverage, Nature Medicine doi:10.1038/nm.3519). [00176] In some embodiments, unique molecular barcodes (UIDs) are used for sequencing error correction. The entire progeny of a single target molecule is marked with the same barcode and forms a barcoded family. A variation in the sequence not shared by all members of the barcoded family is discarded as an artefact.
  • Barcodes can also be used for positional deduplication and target quantification, as the entire family represents a single molecule in the original sample (Newman, A., et al, (2016) Integrated digital error suppression for improved detection of circulating tumor DNA, Nature Biotechnology 34:547).
  • the adaptor ligated to one or both ends of the barcoded target nucleic acid comprises one or more barcodes used in sequencing.
  • a barcode can be a UID or a multiplex sample ID (MID or SID) used to identify the source of the sample where samples are mixed (multiplexed).
  • the barcode may also be a combination of a UID and an MID.
  • a single barcode is used as both UID and MID.
  • each barcode comprises a predefined sequence. In other embodiments, the barcode comprises a random sequence.
  • the barcodes are between about 4-20 bases long so that between 96 and 384 different adaptors, each with a different pair of identical barcodes are added to a human genomic sample.
  • the number of UIDs in the reaction can be in excess of the number of molecules to be labelled. A person of ordinary skill would recognize that the number of barcodes depends on the complexity of the sample (i.e., expected number of unique target molecules) and would be able to create a suitable number of barcodes for each experiment.
  • the method involves forming a library comprising nucleic acids from a sample.
  • the library consists of a plurality of nucleic acids ready for sequencing or another type of detection method, e.g., PCR.
  • a library can be stored and used multiple times for further processing such as amplification or sequencing of the nucleic acids in the library.
  • the library is the input nucleic acid in which methylation is detected by the method described herein.
  • the library is formed from nucleic acids that have undergone the methylation detection reactions described herein.
  • the nucleic acids processed for detection of epigenetic modifications according to the method described herein are sequenced. Any of a number of sequencing technologies or sequencing assays can be utilized.
  • the term "Next Generation Sequencing (NGS)” as used herein refers to sequencing methods that allow for massively parallel sequencing of clonally amplified molecules and of single nucleic acid molecules.
  • Non-limiting examples of sequence assays that are suitable for use with the methods disclosed herein include nanopore sequencing (U.S. Pat. Publ. Nos. 2013/0244340, 2013/0264207, 2014/0134616, 2015/0119259 and 2015/0337366), Sanger sequencing, capillary array sequencing, thermal cycle sequencing (Sears et al, Biotechniques, 13:626-633 (1992)), solid-phase sequencing (Zimmerman et al, Methods Mol.
  • sequencing with mass spectrometry such as matrix-assisted laser desorption/ionization time-of-flight mass spectrometry (MALDI-TOF/MS; Fu et al, Nature Biotech., 16:381-384 (1998)), sequencing by hybridization (Drmanac et al., Nature Biotech., 16:54-58 (1998), and NGS methods, including but not limited to sequencing by synthesis (e.g., HiSeq TM , MiSeq TM , or Genome Analyzer, each available from Illumina), sequencing by ligation (e.g., SOLiD TM , Life Technologies), ion semiconductor sequencing (e.g., Ion Torrent TM , Life Technologies), and SMRT sequencing (e.g., Pacific Biosciences).
  • MALDI-TOF/MS matrix-assisted laser desorption/ionization time-of-flight mass spectrometry
  • MALDI-TOF/MS matrix-assisted laser desorption/ionization time
  • CommerciaUy avaAable sequencing technologies include: sequencing-by-hybridization platforms from Affymetrix Inc. (Sunnyvale, Calif.), sequencing-by-synthesis platforms from fllumina/Solexa (San Diego, Calif.) and Helicos Biosciences (Cambridge, Mass.), sequencing-by-ligation platform from Applied Biosystems (Foster City, Calif.).
  • Other sequencing technologies include, but are not limited to, the Ion Torrent technology (ThermoFisher Scientific), and nanopore sequencing (Genia Technology from Roche Sequencing Solutions, Santa Clara, Cal.), and Oxford Nanopore Technologies (Oxford, UK).
  • the sequencing step involves sequence aligning.
  • aligning is used to determine a consensus sequence from a plurality of sequences, e.g., a plurality having the same unique molecular ID (UID).
  • the molecular ID is a barcode that can be added to each molecule prior to sequencing or if amplification step is included, prior to the amplification step.
  • a UID is present in the 5’-portion of the RT primer.
  • a UID can be present in the 5’ -end of the last barcode subunit to be added to the compound barcode.
  • a UID is present in an adaptor and is added to one or both ends of the target nucleic acid by ligation.
  • a consensus sequence is determined from a plurality of sequences all having an identical UID.
  • the sequences having an identical UID are presumed to derive from the same original molecule through amplification.
  • UID is used to eliminate artifacts, i.e., variations existing in the progeny of a single molecule (characterized by a particular UID). Such artifacts resulting from PCR errors or sequencing errors can be eliminated using UIDs.
  • the number of each sequence in the sample can be quantified by quantifying relative numbers of sequences with each UID among the population having the same multiplex sample ID (MID).
  • Each UID represents a single molecule in the original sample and counting different UIDs associated with each sequence variant can determine the fraction of each sequence variant in the original sample, where all molecules share the same MID.
  • a person skilled in the art will be able to determine the number of sequence reads necessary to determine a consensus sequence. In some embodiments, the relevant number is reads per UID (“sequence depth”) necessary for an accurate quantitative result. In some embodiments, the desired depth is 5-50 reads per UID.
  • the invention is a kit including components and tools for performing an improved method of detecting DNA methylation described herein.
  • the kit includes reagents for detecting cytosine methylation in nucleic acids by performing in vitro oxidation of 5- hydroxymethyl cytosine (5hmC) to 5-formyl cytosine (5fC).
  • the kit further comprises reagents for detecting 5-formyl cytosine (5fC) in nucleic acids.
  • the kit comprises a laccase enzyme.
  • laccase is from a fungal source.
  • the fungal source is selected from Hexagonia tenuis, Pleurotis sajor caju, Pleutoris ostreatus, Xylaria polymorpha, Trametes hirsuta, Trametes versicolor and Coprinus spp.
  • the fungal source is selected from Hexagonia tenuis, Pleurotis sajor caju MTCC-141, Pleutoris ostreatus MTCC-1801, Xylaria polymorpha MTCC- 1100, Trametes hirsuta MTCC-1171, Coprinus spp. or any other analog or equivalent thereof with similar or equivalent enzymatic activity such as F.
  • the kit further includes a co-factor for laccase oxidation.
  • the cofactor is selected from 2, 2,6,6- tetramethylpiperidine-l-oxyl (TEMPO), acetosyringone, syringaldehyde, para- coumaric acid 2,2’-azino-bis(3-ethylbenzothiazoline-6-sulfonate (ABTS), violuric acid (VLA), N-acetyl-N-phenylhydroxylamine (NHA), N-hydroxybenzotriazole (HBT), and N-hydroxyphthalimide (HPI).
  • TEMPO 2, 2,6,6- tetramethylpiperidine-l-oxyl
  • ABTS para- coumaric acid 2,2’-azino-bis(3-ethylbenzothiazoline-6-sulfonate
  • VLA violuric acid
  • NHA N-acetyl-N-phenylhydroxylamine
  • HBT N-hydroxybenzo
  • the kit further includes ten-eleven translocation dioxygenase (TET).
  • TET is selected from mouse TET1, TET2 or TET3 (mTETl, 2 or 3), human TET1, TET2 or TET3 (hTETl, 2 or 3), Naegleria TET (NgTET), Coprinopsis cinerea (CcTET) or any other analog or equivalent thereof with similar or equivalent enzymatic activity.
  • the kit further includes malononitrile.
  • malononitrile is present in a non-aqueous solvent.
  • the non- aqueous solvent is selected from ethanol and methanol.
  • the kit instead of including the non-aqueous solvent, the kit includes instructions on using the non-aqueous solvent (such as ethanol or methanol) in a method of detecting DNA methylation with malononitrile as described herein.
  • the kit further comprises an organic acid and a primary, a secondary or a tertiary amine.
  • the organic acid may be acetic acid and the amine may be triethanolamine.
  • the kit includes instructions on using the organic acid and the amine (such as acetic acid and triethanolamine or piperidine) in a method of detecting DNA methylation with malononitrile as described herein.
  • the kit further includes a buffer such as MES or TRIS.
  • the kit further includes reagents for distinguishing 5mC in nucleic acids from 5hmC by protecting 5hmC while 5mC is chemically reacted.
  • the kit includes a glucose compound and a glucosyltransferase capable of transferring the glucose moiety to the 5-hydroxyl moiety of 5hmC to form 5-glucosylhydroxymethyl cytosine (5ghmC).
  • the kit includes a beta-glucosyltransferase (BGT) and a UDP-glucose.
  • the BGT is T4 BGT.
  • the method further comprises assessment of a status of a subject (e.g., a patient) based on the methylation status of one or more genetic loci in the patient’s genome.
  • the method comprises determining in the patient’s sample, the genomic location and optionally, amount of methylated cytosines (5mC and/or 5hmC) in the genome.
  • genetic loci known to be biomarkers of disease are assessed for methylation.
  • the method further comprises diagnosis of disease or condition in the patient or selecting or changing a treatment based on the presence or amount of methylation in the nucleic acid isolated from the patient.
  • the invention includes a method of detecting tissue-specific DNA methylation patterns using the methylation detection methods disclosed herein.
  • the method may further include identifying a tissue of origin of the methylated DNA present in the sample.
  • the method further includes identifying a tissue of origin of cell-free DNA isolated from blood.
  • the invention includes detection of organ failure or organ injury, including organ transplant rejection in a transplant recipient using methylation patterns of cell-free DNA.
  • the invention includes detecting circulating cell-free DNA with the organ- specific methylation pattern, wherein the presence of such cell-free DNA indicates organ transplant rejection.
  • the invention includes monitoring for transplant rejection by periodically sampling circulating cell -free DNA and measuring changes in the level of cell -free DNA with the organ-specific methylation pattern, wherein an increase in the level of such cell-free DNA indicates organ transplant rejection.
  • the invention includes a method of diagnosis or screening for the presence of a cancerous tumor in a patient or subject.
  • the invention includes detection of a tumor using methylation patterns of cell-free DNA using the methylation detection methods disclosed herein.
  • the invention includes detecting a tumor originating from a particular tissue or organ by detecting circulating cell-free DNA with the tissue or organ-specific methylation pattern detected using the methylation detection methods disclosed herein, wherein the presence of such cell-free DNA indicates the presence of a tumor originating from the tissue or organ.
  • the invention includes monitoring the growth or shrinkage of a tumor by periodically sampling circulating cell-free DNA and measuring changes in the level of cell-free DNA with the tumor-specific methylation pattern, wherein an increase in the level of such cell-free DNA indicates tumor growth, while a decrease in the level of such cell-free DNA indicates tumor shrinkage.
  • the invention includes a method of monitoring the effectiveness of treatment of cancer in a patient or subject.
  • the invention includes detection of tumor dynamics correlated with treatment using methylation patterns of cell-free DNA detected using the methylation detection methods disclosed herein.
  • the invention includes detecting effects of treatment on a tumor originating from a particular tissue or organ by periodically sampling circulating cell-free DNA and measuring changes in the level of cell-free DNA with the tissue or organ-specific methylation pattern, wherein an increase in the level of such cell-free DNA indicates tumor growth and ineffectiveness of treatment, while a decrease in the level of such cell-free DNA indicates tumor shrinkage and effectiveness of treatment, and a stable level of such cell -free DNA indicates stable disease and effectiveness of treatment.
  • the invention includes a method of diagnosis or minimal residual disease (MRD) in a cancer patient following a treatment.
  • MRD minimal residual disease
  • National Cancer Institute defines MRD as a very small number of cancer cells that remain in the body during or after treatment when the patient has no signs or symptoms of the disease.
  • the invention includes a method of detecting MRD using methylation patterns of cell-free DNA detected using the methylation detection methods disclosed herein.
  • the invention includes detecting MRD from tumor originating from a particular tissue or organ by detecting circulating cell-free DNA with the tissue or organ- specific methylation pattern, wherein the presence of such cell-free DNA indicates the presence of MRD from the tumor.
  • the invention includes a method of diagnosis or screening for the presence or status of an autoimmune disease in a patient or subject.
  • the invention includes detection of an autoimmune disease using methylation patterns of cell-free DNA detected using the methylation detection methods disclosed herein.
  • the invention includes detecting autoimmune disease characterized by damage to a particular tissue or organ by detecting circulating cell-free DNA with the tissue or organ-specific methylation pattern, wherein the presence of such cell-free DNA indicates organ damage resulting from the autoimmune disease and the presence of the autoimmune disease.
  • the invention includes monitoring for flare-ups or remission of an autoimmune disease by periodically sampling circulating cell-free DNA and measuring changes in the level of cell-free DNA with the tissue or organ-specific methylation pattern, wherein an increase in the level of such cell-free DNA indicates increased organ damage and a flare-up of the autoimmune disease, while a decrease in the level of such cell-free DNA indicates decreased organ damage and remission of the autoimmune disease.
  • Some embodiments of the disclosure are directed to a method for detecting 5-hydroxymethylcytosine (5hmC) in a target nudeic acid from a sample, wherein the method comprises the following steps: (a) contacting the target nucleic acid with laccase or copper(II) perchlorate and 2,2,6,6-tetramethylpiperidine-l-oxyl (Cu(II)/TEMPO), wherein the laccase or Cu(II)/TEMPO converts 5hmC to 5- formylcytosine (5fC), thereby producing a nucleic acid comprising one or more 5- formylcytosine (5fC); (b) contacting the nucleic acid comprising one or more 5fC of step (a) with malononitrile, wherein the malononitrile converts 5fC to 5fC-M adduct, thereby producing a nucleic acid comprising one or more 5fC-M adduct; (c) contacting the nucleic acid comprising one or more 5
  • the target nucleic acid is contacted with laccase in step (a), and wherein step (a) occurs in less than 22 hours, or step (a) occurs in less than 5 hours, or step (a) occurs in less than 4 hours, or step (a) occurs in less than 3 hours, or step (a) occurs in in 3 hours.
  • step (a) occurs at around 25°C, or step (a) occurs at 25°C, or step (a) occurs at around 37°C, or step (a) occurs at 37°C.
  • the target nucleic acid is contacted with Cu(II)/TEMPO in step (a), and wherein step (a) occurs in less than 24 hours, or step
  • step (a) occurs in 22 hours.
  • step (b) occurs at around 60°C.
  • step (b) occurs at 60°C
  • step (b) occurs in a reaction mixture, wherein the reaction mixture comprises a buffer.
  • the buffer comprises 25 mM Tris.
  • the buffer is at a pH of around 8.
  • the nucleic acid comprising one or more 5fC is contacted with malononitrile for 1.5 hours.
  • the method further comprises an additional step between step (a) and step (b), wherein the additional step between step (a) and step (b) comprises contacting the nucleic acid comprising one or more 5fC with NaOH.
  • the nucleic acid comprising one or more 5fC is contacted with malononitrile in step (b) for less than 1 hour. In another embodiment, the nucleic acid comprising one or more 5fC is contacted with malononitrile in step (b) for about 1 hour. In another embodiment, the nucleic acid comprising one or more 5fC is contacted with malononitrile in step
  • step (b) for less than 30 minutes.
  • the nudeic acid comprising one or more 5fC is contacted with malononitrile in step (b) for about 30 minutes.
  • step (c) occurs in a reaction mixture, wherein the reaction mixture comprises a buffer.
  • the buffer comprises the following components: (i) 5% dimethyl sulfoxide (DMSO); (ii) 0.85M Betaine; (iii) 70 mM tetramethylammonium chloride (TMAC); (iv) 2.1 mM dATP; (v) 2.25 mM MgCl2; and (vi) 15 mM ammonium sulfate.
  • Another embodiment of the disclosure is directed to a method for detecting 5-methylcytosine (5mC) in a target nucleic acid from a sample, wherein the method comprises the following steps: (a) contacting the target nucleic acid with ten-eleven-translocation (TET), wherein the TET converts 5mC to 5- hydroxymethylcytosine (5hmC), thereby producing a nucleic acid comprising one or more 5hmC; (b) contacting the nucleic acid comprising one or more 5hmC of step (a) with laccase or copper(II) perchlorate and 2,2,6,6-tetramethylpiperidine-l-oxyl (Cu(II)/TEMPO), wherein the laccase or Cu(II)/TEMPO converts 5hmC to 5- formylcytosine (5fC), thereby producing a nucleic acid comprising one or more 5- formylcytosine (5fC); (c) contacting the nucleic acid comprising one or more 5fC
  • step (a) occurs in a reaction mixture, wherein the reaction mixture comprises a buffer comprising an amine catalyst.
  • the amine catalyst is 2-amino-5- methoxybenzoic acid.
  • the buffer comprises sodium phosphate and has a pH of around 5.2.
  • the amine catalyst is 2-(aminomethyl)imidazole dihydrochloride.
  • the buffer comprises Tris and has a pH of around 8.
  • the target nucleic acid is contacted with laccase in step (b).
  • step (b) occurs in less than 22 hours.
  • step (b) occurs in less than 5 hours.
  • step (b) occurs in less than 4 hours.
  • step (b) occurs in less than 3 hours. In another embodiment, step (b) occurs in 3 hours. In another embodiment, step (b) occurs at around 25°C. In another embodiment, step (b) occurs at 25°C. In another embodiment, step (b) occurs at around 37°C. In another embodiment, step (b) occurs at 37°C. In another embodiment, step (a) and step (b) are combined in a single step. In another embodiment, the combination of step (a) and step (b) occurs in a reaction mixture, wherein the reaction mixture comprises a buffer, and wherein the buffer comprises both TET and laccase.
  • the target nucleic acid is contacted with TET, wherein the TET converts 5mC to 5hmC, thereby producing a nucleic acid comprising one or more 5hmQ and wherein the laccase converts 5hmC to 5fC, thereby producing a nudeic acid comprising one or more 5fC.
  • the target nucleic acid is contacted with Cu(II)/TEMPO in step (b).
  • step (b) occurs in less than 24 hours.
  • step (b) occurs in 22 hours.
  • step (c) occurs at around 60°C.
  • step (c) occurs at 60°C.
  • step (c) occurs in a reaction mixture, wherein the reaction mixture comprises a buffer.
  • the buffer comprises 25 mM Tris.
  • the buffer is at a pH of around 8.
  • the nucleic acid comprising one or more 5fC is contacted with malononitrile for 1.5 hours.
  • the method further comprises an additional step between step (b) and step (c), wherein the additional step between step (b) and step (c) comprises contacting the nucleic acid comprising one or more 5fC with NaOH.
  • the nucleic acid comprising one or more 5fC is contacted with malononitrile in step (c) for less than 1 hour.
  • the nucleic acid comprising one or more 5fC is contacted with malononitrile in step (c) for about 1 hour. In another embodiment, the nucleic acid comprising one or more 5fC is contacted with malononitrile in step (c) for less than 30 minutes. In another embodiment, the nucleic acid comprising one or more 5fC is contacted with malononitrile in step (c) for about 30 minutes. In another embodiment, step (d) occurs in a reaction mixture, wherein the reaction mixture comprises a buffer.
  • the buffer comprises the following components: (i) 5% dimethyl sulfoxide (DMSO); (ii) 0.85M Betaine; (iii) 70 mM tetramethylammonium chloride (TMAC); (iv) 2.1 mM dATP; (v) 2.25 mM MgCl2; and (vi) 15 mM ammonium sulfate.
  • Another embodiment of the disclosure is directed to a method for converting 5-hydroxymethylcytosine (5hmC), in a target nucleic acid from a sample, to Thymine (T), wherein the method comprises the following steps: (a) contacting the target nucleic acid with laccase or copper(II) perchlorate and 2, 2,6,6- tetramethylpiperidine-l-oxyl (Cu(II)/TEMPO), wherein the laccase or Cu(II)/TEMPO converts 5hmC to 5-formylcytosine (5fC), thereby producing a nucleic acid comprising one or more 5-formylcytosine (5fC); (b) contacting the nucleic acid comprising one or more 5fC of step (a) with malononitrile, wherein the malononitrile converts 5fC to 5fC-M adduct, thereby producing a nucleic acid comprising one or more 5fC-M adduct; and (c) contacting the nucle
  • the target nucleic acid is contacted with laccase in step (a). In another embodiment, step (a) occurs in less than 22 hours. In another embodiment, step (a) occurs in less than 5 hours. In another embodiment, step (a) occurs in less than 4 hours. In another embodiment, step (a) occurs in less than 3 hours. In another embodiment, step (a) occurs in 3 hours. In another embodiment, step (a) occurs at around 25°C. In another embodiment, step (a) occurs at 25°C. In another embodiment, step (a) occurs at around 37°C. In another embodiment, step (a) occurs at 37°C. In another embodiment, the target nucleic acid is contacted with Cu(II)/TEMPO in step (a).
  • step (a) occurs in less than 24 hours. In another embodiment, step (a) occurs in 22 hours. In another embodiment, step (b) occurs at around 60°C. In another embodiment, step (b) occurs at 60°C. In another embodiment, step (b) occurs in a reaction mixture, wherein the reaction mixture comprises a buffer. In another embodiment, the buffer comprises 25 mM Tris. In another embodiment, the buffer is at a pH of around 8. In another embodiment, the nucleic acid comprising one or more 5fC is contacted with malononitrile for 1.5 hours.
  • the method further comprises an additional step between step (a) and step (b), wherein the additional step between step (a) and step (b) comprises contacting the nudeic acid comprising one or more 5fC with NaOH.
  • the nucleic acid comprising one or more 5fC is contacted with malononitrile in step (b) for less than 1 hour.
  • the nucleic acid comprising one or more 5fC is contacted with malononitrile in step (b) for about 1 hour.
  • the nucleic acid comprising one or more 5fC is contacted with malononitrile in step (b) for less than 30 minutes.
  • the nucleic acid comprising one or more 5fC is contacted with malononitrile in step (b) for about 30 minutes.
  • step (c) occurs in a reaction mixture, wherein the reaction mixture comprises a buffer.
  • the buffer comprises the following components: (i) 5% dimethyl sulfoxide (DMSO); (ii) 0.85M Betaine; (iii) 70 mM tetramethylammonium chloride (TMAC); (iv) 2.1 mM dATP; (v) 2.25 mM MgCl2; and (vi) 15 mM ammonium sulfate.
  • Another embodiment of the disclosure is directed to a method for converting 5-methylcytosine (5mC), in a target nucleic acid from a sample, to Thymine (T), wherein the method comprises the following steps: (a) contacting the target nucleic acid with ten- eleven-translocation (TET), wherein the TET converts 5mC to 5-hydroxymethylcytosine (5hmC), thereby producing a nucleic acid comprising one or more 5hmC; (b) contacting the nudeic acid comprising one or more 5hmC of step (a) with laccase or copper(II) perchlorate and 2, 2,6,6- tetramethylpiperidine-l-oxyl (Cu(II)/TEMPO), wherein the laccase or Cu(II)/TEMPO converts 5hmC to 5-formylcytosine (5fC), thereby producing a nucleic acid comprising one or more 5-formylcytosine (5fC); (c) contacting the nucleic acid comprising
  • step (a) occurs in a reaction mixture, wherein the reaction mixture comprises a buffer comprising an amine catalyst.
  • the amine catalyst is 2-amino-5-methoxybenzoic acid.
  • the buffer comprises sodium phosphate and has a pH of around 5.2.
  • the amine catalyst is 2- (aminomethyl)imidazole dihydrochloride.
  • the buffer comprises Tris and has a pH of around 8.
  • the target nucleic acid is contacted with laccase in step (b).
  • step (b) occurs in less than 22 hours.
  • step (b) occurs in less than 5 hours.
  • step (b) occurs in less than 4 hours.
  • step (b) occurs in less than 3 hours. In another embodiment, step (b) occurs in 3 hours. In another embodiment, step (b) occurs at around 25°C. In another embodiment, step (b) occurs at 25°C. In another embodiment, step (b) occurs at around 37°C. In another embodiment, step (b) occurs at 37°C. In another embodiment, step (a) and step (b) are combined in a single step. In another embodiment, the combination of step (a) and step (b) occurs in a reaction mixture, wherein the reaction mixture comprises a buffer, and wherein the buffer comprises both TET and laccase.
  • the target nucleic acid is contacted with TET, wherein the TET converts 5mC to 5hmC, thereby producing a nucleic acid comprising one or more 5hmQ and wherein the laccase converts 5hmC to 5fC, thereby producing a nucleic acid comprising one or more 5fC.
  • the target nucleic acid is contacted with Cu(II)/TEMPO in step (b).
  • step (b) occurs in less than 24 hours.
  • step (b) occurs in 22 hours.
  • step (c) occurs at around 60°C.
  • step (c) occurs at 60°C.
  • step (c) occurs in a reaction mixture, wherein the reaction mixture comprises a buffer.
  • the buffer comprises 25 mM Tris.
  • the buffer is at a pH of around 8.
  • the nucleic acid comprising one or more 5fC is contacted with malononitrile for 1.5 hours.
  • the method further comprises an additional step between step (b) and step (c), wherein the additional step between step (b) and step (c) comprises contacting the nucleic acid comprising one or more 5fC with NaOH.
  • the nucleic acid comprising one or more 5fC is contacted with malononitrile in step (c) for less than 1 hour.
  • the nucleic acid comprising one or more 5fC is contacted with malononitrile in step (c) for about 1 hour. In another embodiment, the nucleic acid comprising one or more 5fC is contacted with malononitrile in step (c) for less than 30 minutes. In another embodiment, the nucleic acid comprising one or more 5fC is contacted with malononitrile in step (c) for about 30 minutes. In another embodiment, step (d) occurs in a reaction mixture, wherein the reaction mixture comprises a buffer.
  • the buffer comprises the following components: (i) 5% dimethyl sulfoxide (DMSO); (ii) 0.85M Betaine; (iii) 70 mM tetramethylammonium chloride (TMAC); (iv) 2.1 mM dATP; (v) 2.25 mM MgCl2; and (vi) 15 mM ammonium sulfate.
  • Example 1 Using Ethanol as Co-Solvent of Picoline-Borane in Methylation Detection Assays (TAPS).
  • TAPS Methylation Detection Assays
  • a synthetic oligonucleotide with a caC nucleotide was subjected to reduction by DHU under novel conditions.
  • 2-Picoline borane was dissolved in absolute ethanol or methanol (lmg/5uL, 1.87mM).
  • the synthetic oligonucleotide, TAPS-caC SEQ ID NO: 1 (2.7nmol, 5’-Phos- CACGT CCAGAT CAAT (caC)GACTAT GAGCAGT ACA), was dissolved in 35uL of sodium acetate (3M, pH 4.3) and mixed with 25uL of picohne borane solution to a final concentration of 790mM.
  • Example 1 the oligonucleotide of Example 1 was used under the conditions described in Example 1 (SEQ ID NO: 1) , except picoline-borane was present in methanol/acetic acid solution.
  • the reaction contained 1.39nmol TAPS caC oligonucleotide, 250mM picborane in 10:1 v:v acetic acid and 200mM MES pH 6.
  • We were able to reduce the effective borane concentration to as low as 25 mM, and reaction time reduced further to 1 hr.
  • the products of the reaction were detected as described in Example 1. Results are shown in FIG. 2.
  • Example 3 Using Malononitrile in a Sodium Acetate Buffer in Methylation Detection Assays.
  • Example 4 Using Malononitrile in an Ethanol-TRIS Buffer in Methylation Detection Assays.
  • TAPS-fC oligo SEQ ID NO: 1 reacts with malononitrile solution (200mM, 25uL) in ethanol and TRIS (pH 8, 20mM, 26uL).
  • LCMS analysis of the sample at different incubation time show most of 5fC was converted to product after 4 hours at 37C. Results are shown in FIG. 4. Mass of TAPS-fC is 9894 and mass of the reacted product is 9942.
  • Example 5 Using Malononitrile in an Ethanol-Triethylamine Buffer in Methylation Detection Assays.
  • Example 6 A single-tube methylation detection assay with TET and malononitrile.
  • Example 7 Oxidation of5hmC with laccase enzyme.
  • a synthetic oligonucleotide with a 5hmC nucleotide was subjected to oxidation with laccase.
  • the 5hmC synthetic oligonucleotide (SEQ ID NO: 2) had the sequence 5‘-ATT ATT TAT TTA TThmC GTA TTA TTT ATT ATT-3‘.
  • the 150 m ⁇ reaction mixture comprised 50 nmol oligonucleotide, 1.6mg (IOmthoI) 2,2,6,6-tetramethylpiperidine-l-oxyl (TEMPO), 2 mg laccase from Trametes versicolor (Sigma, Cat. No. 38429), 50 mM phosphate buffer pH 5.2.
  • the reaction was allowed to proceed for 5 hrs at r.t.
  • the first control reaction contained all the above reagents (including TEMPO) except laccase.
  • the second control reaction contained all the above reagents (including laccase and TEMPO) except laccase and TEMPO were added at 1:1000 dilution.
  • the third control reaction contained all the above reagents except the oligonucleotide SEQ. ID. NO: 2 contained 5mC instead of 5hmC: 5‘-ATT ATT TAT TTA TTmC GTA TTA TTT ATT ATT-3‘.
  • the reaction ixture was analyzed by liquid chromatography - mass- spectrometry (LCMS). The molecular weights are as follows:
  • FIG. 7 shows the 5fC peak at 9174 Da resulting from the reaction.
  • an additional oxidation occurs at the hydroxyl group of the ribose sugar (+14 Da).
  • FIG. 8 shows no reaction with 5mC (5mC peak unchanged at 9160 Da). To some extent, an oxidation occurs at the hydroxyl group of the ribose sugar (+14 Da). Control reactions with no enzyme and diluted enzyme showed no change in the 5hmC starting material (data not shown).
  • Example 8 (prophetic). Oxidizing 5mC into 5hmC prior to reacting 5hmC with laccase.
  • the reaction mixture contains 3 ug TET protein and 2 pg of oligonucleotide substrates in 50 mM HEPES, pH 8, 50 mM NaCl, 2 mM Ascorbic Acid, ImM 2-oxoglutarate, 100 mM ferrous ammonium sulfate (Fe2+), and 1 mMDTT and is incubated for 3 hours at 37°C as described by Tahiliani et al, (2009) Conversion of 5-Methylcytosine to 5-Hydroxymethylcytosine in Mammalian DNA byMLL Partner TET1, Science 324 (5929):930-935.
  • the resulting nucleic acid is purified e.g., by SPRI. An aliquot of the reaction mixture is incubated with laccase as described in Example 7.
  • Example 9 Use of amine buffer catalysts to modulate TET activity to 5hmC/5fC.
  • FIG. 9A shows TET with 5 mM AMBA
  • the bottom graph of FIG. 9A shows TET with 10 mM AMBA.
  • FIG. 9A shows that AMBA promotes TET- mediated oxidation of 5mC to 5hmC/5fC efficiently, in a dose-dependent manner, without undue accumulation of unwanted 5caC product.
  • the catalyst 2-(Aminomethyl)imidazole dihydrochloride (AMI) was added to modulate TET activity in a buffer containing Tris, and having a pH of 8 (results depicted in FIG. 9B).
  • the top graph of FIG. 9B shows TET without any AMI
  • FIG. 9B shows TET with 5 mM AMI
  • the bottom graph of FIG. 9B shows TET with 10 mM AMI.
  • FIG. 9B shows that AMI promotes TET-mediated oxidation of 5mC to 5hmC/5fC efficiently, in a dose-dependent manner, without undue accumulation of unwanted 5caC product.
  • amine catalysts, AMBA and AMI in a buffer, in the presence of TET, favor the oxidation of 5mC to 5hmC/5fC.
  • amine catalysts such as 2- Amino-5-methoxybenzoic acid (AMBA) and 2-(Aminomethyl)imidazole dihydrochloride (AMI) maybe useful to promote TET-mediated oxidation of 5mC to favor generation of 5hmC and 5fC species, while discouraging full oxidation to the 5caC species.
  • amine catalysts such as 2-Amino-5-methoxybenzoic (AMBA) and 2-(Aminomethyl)imidazole dihydrochloride (AMI) maybe ultimately be useful and employed in workflows to detect 5mC and 5hmC species by ultimately promoting Thymine production.
  • Example 10 Detection and accumulation of 5fC species by oxidation of 5hmC with laccase enzyme as early as within 3 hours.
  • laccase enzyme converts 5hmC to 5fC after 22 hours. Studies were conducted in order to determine if 5fC could be detected earlier/faster than 22 hours. To that end, the ability of laccase (2 mg laccase (from Trametes versicolor (Sigma Catalog No. 38429)) to oxidize 5hmC from 50 nM of a 5hmC-containing oligonucleotide substrate was tested, in the presence of 1.6 mg (10 mM), 2,2,6,6-tetramethylpiperidine-l-oxyl (TEMPO), and 150 m ⁇ of 50 mM phosphate buffer (pH 5.2), at two different temperatures (25°C and 37°C).
  • FIG. 10 shows that significant amounts of 5fC generated/ accumulated by the laccase oxidation of 5hmC can be detected as early as 3 hours, at 25°C and 37°C.
  • laccase is efficient at converting 5hmC to 5fC, within as early as 3 hours, at either 25°C and 37°C, without the unwanted accumulation/ conversion of unwanted 5caC (as shown in FIG. 10).
  • laccase can be employed as an important enzyme for detecting methylated species (e.g., 5hmC) by promoting the conversion to 5fC, which can then be subsequently converted to Thymine.
  • Example 11 Optimization of malononitrile activity in converting 5fC to 5fC-M adduct.
  • FIG. 11A shows LC-MS data of the effect of malononitrile on conversion of 5fC to 5fC-M adduct under various buffer conditions.
  • the top graph of FIG. 11A shows buffer conditions of 40°C for 1 hour
  • the middle graph of FIG. 11 A shows buffer conditions of 60°C for 1 hour
  • the bottom graph of FIG. 11A shows buffer conditions of 95°C for 10 minutes.
  • FIG. 11A shows dramatic accumulation of 5fC-M adduct at elevated temperatures (60°C) after 1 hour.
  • FIG. 11A also shows dramatic accumulation of 5fC-M adduct at elevated temperatures (95°C) after only 10 minutes (but with the existence of degradation products).
  • FIG. 11A shows that malononitrile activity in converting 5fC to 5fC-M adduct is optimized at elevated temperatures (60°C), which represents a significant improvement over the art.
  • Example 12 Optimization of copper(II) perchlorate and 2, 2, 6, 6- tetramethylpiperidine-l-oxyl (Cu(II) /TEMPO).
  • FIG. 13C shows the data showing conversion of 5fC-M adduct to T using standard buffer (“BufferA”) and an optimized buffer (“DOE_l”).
  • FIG. 13C shows that the conversion of 5fC-M adduct to T is increased/enhanced with the optimized buffer (“DOE_l”) compared to standard buffer.
  • the conversion rate with the optimized buffer 94.12) is significantly greater than the conversion rate with the standard buffer (76.94)
  • CGC the conversion rate with the optimized buffer (86.28) is significantly greater than the conversion rate with the standard buffer (69.46).

Landscapes

  • Chemical & Material Sciences (AREA)
  • Organic Chemistry (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • Analytical Chemistry (AREA)
  • Zoology (AREA)
  • Wood Science & Technology (AREA)
  • Proteomics, Peptides & Aminoacids (AREA)
  • Health & Medical Sciences (AREA)
  • Engineering & Computer Science (AREA)
  • Biophysics (AREA)
  • Immunology (AREA)
  • Microbiology (AREA)
  • Molecular Biology (AREA)
  • Biotechnology (AREA)
  • Physics & Mathematics (AREA)
  • Chemical Kinetics & Catalysis (AREA)
  • Biochemistry (AREA)
  • Bioinformatics & Cheminformatics (AREA)
  • General Engineering & Computer Science (AREA)
  • General Health & Medical Sciences (AREA)
  • Genetics & Genomics (AREA)
  • Measuring Or Testing Involving Enzymes Or Micro-Organisms (AREA)

Abstract

L'invention concerne des procédés et des compositions améliorés pour détecter la méthylation dans des acides nucléiques. En particulier, l'invention concerne des procédés de conversion de 5-hydroxyméthyl cytosine (5 hmC) et/ou de 5-méthylcytosine (5 mC) en thymine (T). De plus, l'invention concerne également des procédés de détection de 5 hmC et/ou de 5 mC dans un échantillon.
EP22705528.2A 2021-02-09 2022-02-08 Procédés de détection de méthylation de base dans des acides nucléiques Pending EP4291677A2 (fr)

Applications Claiming Priority (4)

Application Number Priority Date Filing Date Title
US202163147307P 2021-02-09 2021-02-09
US202163150019P 2021-02-16 2021-02-16
US202163191079P 2021-05-20 2021-05-20
PCT/EP2022/052979 WO2022171606A2 (fr) 2021-02-09 2022-02-08 Procédés de détection de méthylation de base dans des acides nucléiques

Publications (1)

Publication Number Publication Date
EP4291677A2 true EP4291677A2 (fr) 2023-12-20

Family

ID=80446200

Family Applications (1)

Application Number Title Priority Date Filing Date
EP22705528.2A Pending EP4291677A2 (fr) 2021-02-09 2022-02-08 Procédés de détection de méthylation de base dans des acides nucléiques

Country Status (3)

Country Link
EP (1) EP4291677A2 (fr)
JP (1) JP7646850B2 (fr)
WO (1) WO2022171606A2 (fr)

Families Citing this family (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JP2026501670A (ja) * 2023-01-04 2026-01-16 ナテラ, インコーポレイテッド メチル化標的dna分子を増幅するための方法および組成物
CN121605113A (zh) * 2023-06-14 2026-03-03 豪夫迈·罗氏有限公司 5-甲酰基胞嘧啶的加速标记及在核酸甲基化测序中的应用
WO2025078657A1 (fr) 2023-10-13 2025-04-17 F. Hoffmann-La Roche Ag Flux de travail d'enrichissement cible sans amplification pour détection directe de modifications d'acide nucléique

Family Cites Families (27)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US5010005A (en) 1989-04-28 1991-04-23 Duff Sheldon J B Bio-oxidation of high alcohols in non-aqueous reaction media
DE10149266A1 (de) 2001-10-05 2003-04-17 Basf Ag Verfahren zur selektiven Oxidation von substituierten Toluolen durch mikrobielle Peroxidasen aus Coprinus
DE102004047774A1 (de) 2004-09-28 2006-03-30 Jenabios Gmbh Verfahren zur enzymatischen Hydroxylierung nicht-aktivierter Kohlenwasserstoffe
US7393665B2 (en) 2005-02-10 2008-07-01 Population Genetics Technologies Ltd Methods and compositions for tagging and identifying polynucleotides
WO2008093098A2 (fr) 2007-02-02 2008-08-07 Illumina Cambridge Limited Procedes pour indexer des echantillons et sequencer de multiples matrices nucleotidiques
MX2010010600A (es) 2008-03-28 2011-03-30 Pacific Biosciences California Inc Composiciones y metodos para secuenciacion de acidos nucleicos.
WO2010037001A2 (fr) 2008-09-26 2010-04-01 Immune Disease Institute, Inc. Oxydation sélective de 5-méthylcytosine par des protéines de la famille tet
WO2011070441A2 (fr) 2009-12-11 2011-06-16 Adam Wasserstrom Catégorisation d'échantillons d'adn
US9783850B2 (en) 2010-02-19 2017-10-10 Nucleix Identification of source of DNA samples
CA2811185C (fr) 2010-09-21 2020-09-22 Population Genetics Technologies Ltd. Niveau de confiance accru d'appels d'alleles par un comptage moleculaire
EP2652153B1 (fr) 2010-12-17 2017-07-05 The Trustees of Columbia University in the City of New York Séquençage d'adn par une synthèse utilisant des nucléotides modifiés et une détection par nanopores
JP6333179B2 (ja) 2012-01-20 2018-05-30 ジニア テクノロジーズ, インコーポレイテッド ナノポアベースの分子検出および配列決定
EP3540077B1 (fr) 2012-02-16 2021-01-27 Genia Technologies, Inc. Procédés de création de bicouches destinées à être utilisées avec des capteurs de nanopore
EP3674412A1 (fr) 2012-06-20 2020-07-01 The Trustees of Columbia University in the City of New York Séquençage d'acide nucléique par détection de nanopores de molécules d'étiquettes
US9605309B2 (en) 2012-11-09 2017-03-28 Genia Technologies, Inc. Nucleic acid sequencing using tags
JP6243013B2 (ja) * 2013-09-27 2017-12-06 北京大学Peking University 5−ホルミルシトシン特異的な化学標識法及びその利用
CN106957350B (zh) 2017-02-28 2019-09-27 北京大学 5-醛基胞嘧啶的标记方法及其在单碱基分辨率测序中的应用
US11691141B2 (en) 2017-11-13 2023-07-04 Roche Sequencing Solutions, Inc. Devices for sample analysis using epitachophoresis
JP7136899B2 (ja) * 2018-01-08 2022-09-13 ルドウイグ インスティテュート フォー キャンサー リサーチ エルティーディー シトシン修飾の、亜硫酸水素塩非含有、塩基分解能特定
CN111989407B (zh) 2018-03-13 2024-10-29 格里尔公司 异常的片段检测及分类
JP7620803B2 (ja) 2018-03-15 2025-01-24 グレイル インコーポレイテッド 組織特異的メチル化マーカー
CA3096678A1 (fr) 2018-04-13 2019-10-17 Grail, Inc. Modele de prediction de dosages multiples pour la detection du cancer
EP3864403A1 (fr) 2018-10-12 2021-08-18 F. Hoffmann-La Roche AG Procédés de détection pour l'automatisation de flux de travail d'épitachophorèse
US20200385813A1 (en) 2018-12-18 2020-12-10 Grail, Inc. Systems and methods for estimating cell source fractions using methylation information
WO2020132572A1 (fr) 2018-12-21 2020-06-25 Grail, Inc. Source de déconvolution d'origine basée sur des fragments de méthylation dans des échantillons d'adn acellulaire
CN109678802B (zh) 2019-01-28 2020-12-29 四川大学 衍生醛基嘧啶的方法、检测5-醛基胞嘧啶的方法以及醛基嘧啶衍生物的应用
IL265451B (en) 2019-03-18 2020-01-30 Frumkin Dan Methods and systems for the detection of methylation changes in DNA samples

Also Published As

Publication number Publication date
JP2024506899A (ja) 2024-02-15
WO2022171606A3 (fr) 2022-10-06
JP7646850B2 (ja) 2025-03-17
WO2022171606A2 (fr) 2022-08-18

Similar Documents

Publication Publication Date Title
US11274335B2 (en) Methods for the epigenetic analysis of DNA, particularly cell-free DNA
JP7646850B2 (ja) 核酸中のメチル化の塩基レベル検出のための方法
US20250283170A1 (en) Modified adapters for enzymatic dna deamination and methods of use thereof for epigenetic sequencing of free and immobilized dna
CN115181783A (zh) 胞嘧啶修饰的免亚硫酸氢盐的碱基分辨率鉴定
US12601006B2 (en) Targeted, long-read nucleic acid sequencing for the determination of cytosine modifications
US20170101675A1 (en) Ion sensor dna and rna sequencing by synthesis using nucleotide reversible terminators
KR20220024778A (ko) 차세대 시퀀싱 라이브러리를 제조하기 위한 핵산 표지화에 유용한 올리고뉴클레오티드-테더된 트리포스페이트 뉴클레오티드
US20250283145A1 (en) Methods for base-level detection of methylation in nucleic acids
AU2018328324B2 (en) Selective labeling of 5-methylcytosine in circulating cell-free DNA
CN116806266A (zh) 用于核酸中的甲基化的碱基水平检测的方法
CN120019160A (zh) Tet辅助的吡啶硼烷测序
US20250361549A1 (en) Detection of epigenetic cytosine modification
US20250115960A1 (en) Nucleic acid testing method and system
WO2025083068A1 (fr) Procédé de capture d'adn modifié épigénétiquement
AU2024365033A1 (en) Method for capturing epigenetically modified dna
EP4727953A2 (fr) Marquage accéléré de 5-formyl cytosine et utilisation dans le séquençage de méthylation d'acide nucléique
CN121443752A (zh) Dna测序方法
JP2025527591A (ja) エピジェネティック修飾の検出
HK40081618A (en) Bisulfite-free, base-resolution identification of cytosine modifications
CN120677253A (zh) 用于原位分析rna样品中单核苷酸变异的方法及使用于原位分析rna样品中单核苷酸变异的试剂盒

Legal Events

Date Code Title Description
STAA Information on the status of an ep patent application or granted ep patent

Free format text: STATUS: UNKNOWN

STAA Information on the status of an ep patent application or granted ep patent

Free format text: STATUS: THE INTERNATIONAL PUBLICATION HAS BEEN MADE

PUAI Public reference made under article 153(3) epc to a published international application that has entered the european phase

Free format text: ORIGINAL CODE: 0009012

STAA Information on the status of an ep patent application or granted ep patent

Free format text: STATUS: REQUEST FOR EXAMINATION WAS MADE

17P Request for examination filed

Effective date: 20230911

AK Designated contracting states

Kind code of ref document: A2

Designated state(s): AL AT BE BG CH CY CZ DE DK EE ES FI FR GB GR HR HU IE IS IT LI LT LU LV MC MK MT NL NO PL PT RO RS SE SI SK SM TR

DAV Request for validation of the european patent (deleted)
DAX Request for extension of the european patent (deleted)