WO2022005988A2 - Solid support for synthesizing nucleic acid sequences and methods for making and using - Google Patents

Solid support for synthesizing nucleic acid sequences and methods for making and using

Info

Publication number
WO2022005988A2
WO2022005988A2 PCT/US2021/039403 US2021039403W WO2022005988A2 WO 2022005988 A2 WO2022005988 A2 WO 2022005988A2 US 2021039403 W US2021039403 W US 2021039403W WO 2022005988 A2 WO2022005988 A2 WO 2022005988A2
Authority
WO
WIPO (PCT)
Prior art keywords
solid support
protected
exocyclic amine
formula
nucleic acid
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Ceased
Application number
PCT/US2021/039403
Other languages
French (fr)
Other versions
WO2022005988A3 (en)
Inventor
Serge L. Beaucage
Andrzej M. GRAJKOWSKI
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
US Department of Health and Human Services
Original Assignee
US Department of Health and Human Services
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by US Department of Health and Human Services
Priority to US18/003,404 (patent US11987599B2)
Publication of WO2022005988A2
Publication of WO2022005988A3

Classifications

    • C - CHEMISTRY; METALLURGY
    • C07 - ORGANIC CHEMISTRY
    • C07H - SUGARS; DERIVATIVES THEREOF; NUCLEOSIDES; NUCLEOTIDES; NUCLEIC ACIDS
    • C07H21/00 - Compounds containing two or more mononucleotide units having separate phosphate or polyphosphate groups linked by saccharide radicals of nucleoside groups, e.g. nucleic acids
    • C07H21/04 - Compounds containing two or more mononucleotide units having separate phosphate or polyphosphate groups linked by saccharide radicals of nucleoside groups, e.g. nucleic acids with deoxyribosyl as saccharide radical
    • C - CHEMISTRY; METALLURGY
    • C07 - ORGANIC CHEMISTRY
    • C07F - ACYCLIC, CARBOCYCLIC OR HETEROCYCLIC COMPOUNDS CONTAINING ELEMENTS OTHER THAN CARBON, HYDROGEN, HALOGEN, OXYGEN, NITROGEN, SULFUR, SELENIUM OR TELLURIUM
    • C07F9/00 - Compounds containing elements of Groups 5 or 15 of the Periodic Table
    • C07F9/02 - Phosphorus compounds
    • C07F9/06 - Phosphorus compounds without P-C bonds
    • C07F9/08 - Esters of oxyacids of phosphorus
    • C07F9/09 - Esters of phosphoric acids
    • C07F9/093 - Polyol derivatives esterified at least twice by phosphoric acid groups
    • C - CHEMISTRY; METALLURGY
    • C07 - ORGANIC CHEMISTRY
    • C07F - ACYCLIC, CARBOCYCLIC OR HETEROCYCLIC COMPOUNDS CONTAINING ELEMENTS OTHER THAN CARBON, HYDROGEN, HALOGEN, OXYGEN, NITROGEN, SULFUR, SELENIUM OR TELLURIUM
    • C07F9/00 - Compounds containing elements of Groups 5 or 15 of the Periodic Table
    • C07F9/02 - Phosphorus compounds
    • C07F9/06 - Phosphorus compounds without P-C bonds
    • C07F9/08 - Esters of oxyacids of phosphorus
    • C07F9/09 - Esters of phosphoric acids
    • C07F9/098 - Esters of polyphosphoric acids or anhydrides
    • C - CHEMISTRY; METALLURGY
    • C07 - ORGANIC CHEMISTRY
    • C07F - ACYCLIC, CARBOCYCLIC OR HETEROCYCLIC COMPOUNDS CONTAINING ELEMENTS OTHER THAN CARBON, HYDROGEN, HALOGEN, OXYGEN, NITROGEN, SULFUR, SELENIUM OR TELLURIUM
    • C07F9/00 - Compounds containing elements of Groups 5 or 15 of the Periodic Table
    • C07F9/02 - Phosphorus compounds
    • C07F9/547 - Heterocyclic compounds, e.g. containing phosphorus as a ring hetero atom
    • C07F9/655 - Heterocyclic compounds, e.g. containing phosphorus as a ring hetero atom having oxygen atoms, with or without sulfur, selenium, or tellurium atoms, as the only ring hetero atoms
    • C07F9/65515 - Heterocyclic compounds, e.g. containing phosphorus as a ring hetero atom having oxygen atoms, with or without sulfur, selenium, or tellurium atoms, as the only ring hetero atoms the oxygen atom being part of a five-membered ring
    • C - CHEMISTRY; METALLURGY
    • C07 - ORGANIC CHEMISTRY
    • C07H - SUGARS; DERIVATIVES THEREOF; NUCLEOSIDES; NUCLEOTIDES; NUCLEIC ACIDS
    • C07H21/00 - Compounds containing two or more mononucleotide units having separate phosphate or polyphosphate groups linked by saccharide radicals of nucleoside groups, e.g. nucleic acids
    • C - CHEMISTRY; METALLURGY
    • C07 - ORGANIC CHEMISTRY
    • C07H - SUGARS; DERIVATIVES THEREOF; NUCLEOSIDES; NUCLEOTIDES; NUCLEIC ACIDS
    • C07H21/00 - Compounds containing two or more mononucleotide units having separate phosphate or polyphosphate groups linked by saccharide radicals of nucleoside groups, e.g. nucleic acids
    • C07H21/02 - Compounds containing two or more mononucleotide units having separate phosphate or polyphosphate groups linked by saccharide radicals of nucleoside groups, e.g. nucleic acids with ribosyl as saccharide radical
    • C - CHEMISTRY; METALLURGY
    • C12 - BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12N - MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
    • C12N15/00 - Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
    • C12N15/09 - Recombinant DNA-technology
    • C12N15/10 - Processes for the isolation, preparation or purification of DNA or RNA
    • C12N15/1034 - Isolating an individual clone by screening libraries
    • C12N15/1068 - Template (nucleic acid) mediated chemical library synthesis, e.g. chemical and enzymatical DNA-templated organic molecule synthesis, libraries prepared by non ribosomal polypeptide synthesis [NRPS], DNA/RNA-polymerase mediated polypeptide synthesis
    • C - CHEMISTRY; METALLURGY
    • C12 - BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12N - MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
    • C12N15/00 - Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
    • C12N15/09 - Recombinant DNA-technology
    • C12N15/11 - DNA or RNA fragments; Modified forms thereof; Non-coding nucleic acids having a biological activity
    • C12N15/113 - Non-coding nucleic acids modulating the expression of genes, e.g. antisense oligonucleotides; Antisense DNA or RNA; Triplex-forming oligonucleotides; Catalytic nucleic acids, e.g. ribozymes; Nucleic acids used in co-suppression or gene silencing
    • C - CHEMISTRY; METALLURGY
    • C40 - COMBINATORIAL TECHNOLOGY
    • C40B - COMBINATORIAL CHEMISTRY; LIBRARIES, e.g. CHEMICAL LIBRARIES
    • C40B50/00 - Methods of creating libraries, e.g. combinatorial synthesis
    • C40B50/14 - Solid phase synthesis, i.e. wherein one or more library building blocks are bound to a solid support during library creation; Particular methods of cleavage from the solid support
    • C40B50/18 - Solid phase synthesis, i.e. wherein one or more library building blocks are bound to a solid support during library creation; Particular methods of cleavage from the solid support using a particular method of attachment to the solid support

Definitions

  • the application concerns a solid support for synthesizing nucleic acid sequences and methods for making and using the solid support.
  • the purity of synthetic nucleic acid sequences is important for the production of safe and efficacious nucleic acid-based drugs, such as those for antisense or RNA interference in vivo therapies.
  • Highly pure synthetic DNA sequences are also important for the construction of entire genes to be used in synthetic biology applications (e.g., mRNA and/or genome editing).
  • asDNA antisense DNA
  • siRNA small interfering RNA
  • the clinical applications of these nucleic acid sequences for the treatment of human diseases have been hindered by various factors including: (i) instability in biological media; (ii) poor delivery to target cells; (iii) poor uptake by target cells; and (iv) dose-related toxicities. Severe thrombocytopenic or peripheral neuropathy adverse events have been reported in patients treated with asDNA sequences or siRNAs, respectively.
  • n-1 DNA sequences are difficult to remove from the full-length DNA product and can potentially elicit immune responses and/or adverse events arising from off-target activities upon administration to patients under antisense therapy settings. Accordingly, there is a need to minimize the formation of those process-related impurities to levels that should not become a safety concern to patients.
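  • To illustrate why n-1 sequences are hard to separate from the full-length product, the following minimal Python sketch enumerates the distinct single-deletion variants of a 20-mer. The input is SEQ ID NO: 1 as written in the figure descriptions of this document; the helper name `n_minus_one_variants` is hypothetical and introduced only for illustration. It models deletions in the sequence string only and says nothing about how such impurities actually form during synthesis.

```python
def n_minus_one_variants(seq: str) -> set:
    """Return the distinct sequences obtained by deleting exactly one nucleotide."""
    return {seq[:i] + seq[i + 1:] for i in range(len(seq))}


# SEQ ID NO: 1 as written in the figure descriptions (a 20-mer).
full_length = "CTGAGTAGCGAACGTGAAGA"
variants = n_minus_one_variants(full_length)

# Every variant is exactly one nucleotide shorter than the full-length product.
for variant in sorted(variants):
    print(len(variant), variant)
```

Each variant differs from the desired product by a single nucleotide in length, which is consistent with the statement above that such species are difficult to remove.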
  • a solid support suitable for solid phase synthesis of nucleic acid sequences.
  • Using the disclosed solid support may result in a nucleic acid composition that has a reduced amount of impurities, compared to the same nucleic acid sequence being produced using current commercially available solid supports.
  • the disclosed solid support has a structure according to Formula I
  • CPG is controlled pore glass
  • m is from 2 to 6, such as 2, 3, 4, 5, or 6, and in some embodiments, m is 2, 3 or 4, and may be 3.
  • x is from 1 to 5, such as 1, 2, 3, 4, or 5, and in some embodiments, x is 1, 2 or 3, and may be 1.
  • y is from 2 to 12, and in some embodiments, y is from 3 to 10, and may be 6.
  • n is from 3 to 10, such as 3, 4, 5, 6, 7, 8, 9 or 10, and in some embodiments, n is from 3 to 7, and may be 5.
  • each R 1 independently is C1-6alkyl, -(CH2)1-6CN, -(CH2)1-6OR’ or a thermolabile phosphate protecting group, where R’ is aliphatic, aryl, or aralkyl.
  • R 1 may be C1-4alkyl or -(CH2)1-4CN, and in certain embodiments, R 1 is -CH2CH2CN.
  • R 2 may be a moiety (structure not reproduced here) in which p is from 2 to 10, such as 2, 3, 4, 5, 6, 7, 8, 9 or 10, and in certain embodiments, p is 6.
  • R 3 may
  • R 4 is H or OR 6 ; and B p is a nucleic acid base where the exocyclic amine group, if present, is protected.
  • R 5 is PG or a nucleic acid sequence, where PG is a protecting group.
  • PG may be 4,4’-dimethoxytrityl (DMTr).
  • R 6 may be 9-phenylxanthyl (pixyl), tert-butyldimethylsilyl (TBDMS), tert-butyldiphenylsilyl (TBDPS), trimethylsilyl (TMS), triethylsilyl (TES), or triisopropylsilyl (TIPS), and in some embodiments, R 6 is TBDMS.
  • m, x, y and n may be selected to produce a support backbone length from the silicon atom to the R 2 moiety of from 50 atoms to 400 atoms, such as from 100 atoms to 150 atoms.
  • B p may be a nucleic acid base with exocyclic amine group(s) protected if present, such as exocyclic amine-protected adenine, exocyclic amine-protected cytosine, exocyclic amine-protected guanine, thymine, uracil, hypoxanthine, xanthine, exocyclic amine-protected 7-methylguanine, 5,6-dihydrouracil, exocyclic amine-protected 5-methylcytosine, or exocyclic amine-protected 5-hydroxymethylcytosine, and may be exocyclic amine-protected adenine, exocyclic amine-protected cytosine, exocyclic amine-protected guanine, thymine, or uracil.
  • B p is adenine, cytosine, or guanine, where the exocyclic amine is protected by a benzoyl (Bz), isobutyryl (iBu), phenoxyacetyl (Pac), phenylsulfonylethoxycarbonyl, p-nitrophenyloxycarbonyl, allyloxycarbonyl, or levulinyl group.
  • B p is thymine or uracil.
  • R 4 is H and/or B p is exocyclic amine-protected adenine, exocyclic amine-protected cytosine, exocyclic amine-protected guanine, or thymine.
  • R 4 is OR 6 and/or B p is exocyclic amine-protected adenine, exocyclic amine-protected cytosine, exocyclic amine-protected guanine, or uracil.
  • R 6 may be TBDMS, TBDPS, TMS, TES, or TIPS, such as TBDMS.
  • Exocyclic amine-protected adenine may be N 6 -benzoyl adenine (A Bz ) or N 6 -phenoxyacetyl adenine (A Pac ).
  • Exocyclic amine-protected cytosine may be N 4 -benzoyl cytosine (C Bz ) or N 4 -phenoxyacetyl cytosine (C Pac ).
  • And/or exocyclic amine-protected guanine may be N 2 -isobutyryl guanine (G iBu ) or N 2 -phenoxyacetyl guanine (G Pac ).
  • a loading of the support on the CPG may be from 5 µmol/g to about 125 µmol/g.
  • the solid support has a formula selected from:
  • t may be 2.
  • R 5 may be PG, and in some embodiments, is DMTr.
  • R 5 may be a nucleic acid sequence, and may comprise one or more DNA sequences, such as one or more antisense DNA sequences.
  • the nucleic acid sequence comprises one or more RNA sequences, such as one or more antisense RNA sequences, one or more microRNA (miRNA) sequences, one or more small interfering RNA (siRNA) sequences, one or more repeat-associated small interfering RNA (rasiRNA) sequences, or combinations thereof.
  • miRNA microRNA
  • siRNA small interfering RNA sequences
  • rasiRNA repeat-associated small interfering RNA
  • the universal linker may have a structure:
  • Embodiments of a method for synthesizing a nucleic acid sequence using the disclosed solid support also are disclosed herein.
  • the method comprises loading a solid support according to any one of the disclosed embodiments into a DNA/RNA synthesizer, and operating the synthesizer to produce a desired nucleic acid sequence.
  • the solid support is a solid support where R 5 is PG, such as DMTr.
  • kits comprising a solid support according to any one of the disclosed embodiments, and may comprise a protected 2'-deoxynucleoside, ribonucleoside, and/or chemically modified nucleoside wherein an exocyclic amine on the deoxynucleoside, ribonucleoside or chemically modified nucleoside, if present, also is protected.
  • the 2'-deoxynucleoside may be DMTrdA Bz , DMTrdC Bz , DMTrdG iBu , or DMTrT, and/or the ribonucleoside may be DMTrA Pac -2’-OTBDMS, DMTrC Pac -2’-OTBDMS, DMTrG Pac -2’-OTBDMS, or DMTrU-2’-OTBDMS.
  • the kit comprises a universal linker phosphoramidite, such as the universal linker phosphoramidite disclosed herein.
  • the kit further comprises ammonium hydroxide.
  • FIG. 1 is a graph of retention time versus absorbance units at 254 nm, illustrating the HPLC profiles of unpurified 5’-d(CTGAGTAGCGAACGTGAAGA) (SEQ ID NO: 1) produced by an embodiment of the disclosed solid support structure comprising 3 hexaethylene glycol phosphate repeating units, and comparing it to the same sequence produced using a commercial long chain alkylamine-controlled pore glass (LCAA-CPG) support.
  • LCAA-CPG commercial long chain alkylamine-controlled pore glass
  • FIG. 2 is a graph of retention time versus absorbance units at 254 nm, illustrating the HPLC profiles of unpurified 5’-d(CTGAGTAGCGAACGTGAAGA) (SEQ ID NO: 1) produced by an embodiment of the disclosed solid support structure comprising 5 hexaethylene glycol phosphate repeating units, and comparing it to the same sequence produced using a commercial LCAA-CPG support.
  • FIG. 3 is a graph of retention time versus absorbance units at 254 nm, illustrating the HPLC profiles of unpurified 5’-d(CTGAGTAGCGAACGTGAAGA) (SEQ ID NO: 1) produced by an embodiment of the disclosed solid support structure comprising 7 hexaethylene glycol phosphate repeating units, and comparing it to the same sequence produced using a commercial LCAA-CPG support.
  • FIG. 4 is a graph of retention time versus absorbance units at 254 nm, comparing the HPLC profiles of unpurified 5’-d(CTGAGTAGCGAACGTGAAGA) (SEQ ID NO: 1) produced by embodiments of the disclosed solid support structure comprising 5 (blue) or 7 (black) hexaethylene glycol phosphate repeating units.
  • FIG. 5 is a graph of retention time versus absorbance units at 254 nm, illustrating the HPLC profiles of unpurified 5’-r(UCUUGGUUACAUGAAAUCCU) (SEQ ID NO: 3) produced by an embodiment of the disclosed solid support structure comprising 5 hexaethylene glycol phosphate repeating units, and comparing it to the same sequence produced using a commercial LCAA-CPG support.
  • FIG. 6 is a graph of retention time versus absorbance units at 254 nm, illustrating the HPLC profiles of unpurified 5’-d(TCTTGGTTACATGAAATCCT) (SEQ ID NO: 2) produced by an embodiment of the disclosed solid support structure comprising 5 hexaethylene glycol phosphate repeating units, and comparing it to the same sequence produced using a commercial LCAA-CPG support.
  • FIG. 7 is a graph of retention time versus absorbance units at 254 nm, illustrating the HPLC profiles of unpurified 5’-d(ATAGTGTGCATCGATGCCAC) (SEQ ID NO: 5) produced by an embodiment of the disclosed solid support structure comprising 5 hexaethylene glycol phosphate repeating units, and comparing it to the same sequence produced using a commercial LCAA-CPG support.
  • FIG. 8 is a graph of retention time versus absorbance units at 254 nm, illustrating the HPLC profiles of unpurified 5’-d(CTCTGTACCTTACGTCTTCG) (SEQ ID NO: 4) produced by an embodiment of the disclosed solid support structure comprising 5 hexaethylene glycol phosphate repeating units, and comparing it to the same sequence produced using a commercial LCAA-CPG support.
  • FIG. 9 provides stacked expanded HPLC profiles from FIG. 6, illustrating the approximate 50% reduction in impurities in the product made using the disclosed CPG support, compared to the product made using the commercial LCAA-CPG support.
  • nucleic acid sequences listed in the accompanying sequence listing are shown using standard letter abbreviations for nucleotide bases as defined in 37 C.F.R. 1.822. Only one strand of each nucleic acid sequence is shown, but the complementary strand is understood as included by any reference to the displayed strand.
  • sequence Listing is submitted as an ASCII text file, created on June 28, 2021, 4 KB, which is incorporated by reference herein in its entirety. In the accompanying sequence listing:
  • SEQ ID NOs: 1-5 are nucleic acid sequences produced using exemplary embodiments of the disclosed solid support structure.
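  • As a quick consistency check on the sequences quoted in the figure descriptions, the Python sketch below verifies that each one uses only the expected alphabet and is 20 nucleotides long. It also checks that the RNA sequence (SEQ ID NO: 3) is the uracil-for-thymine counterpart of SEQ ID NO: 2. The SEQ ID assignments are copied from the figure descriptions; the dictionary layout and the helper names `is_valid` and `dna_to_rna` are assumptions made for illustration and are not taken from the sequence listing file itself.

```python
# Sequences as written in the figure descriptions, keyed by SEQ ID NO.
SEQUENCES = {
    1: ("DNA", "CTGAGTAGCGAACGTGAAGA"),
    2: ("DNA", "TCTTGGTTACATGAAATCCT"),
    3: ("RNA", "UCUUGGUUACAUGAAAUCCU"),
    4: ("DNA", "CTCTGTACCTTACGTCTTCG"),
    5: ("DNA", "ATAGTGTGCATCGATGCCAC"),
}

ALPHABETS = {"DNA": set("ACGT"), "RNA": set("ACGU")}


def is_valid(kind: str, seq: str) -> bool:
    """True if seq contains only the standard bases for the given kind."""
    return set(seq) <= ALPHABETS[kind]


def dna_to_rna(seq: str) -> str:
    """Replace thymine with uracil, leaving the base order unchanged."""
    return seq.replace("T", "U")


for seq_id, (kind, seq) in SEQUENCES.items():
    print(seq_id, kind, len(seq), is_valid(kind, seq))

# SEQ ID NO: 3 is the RNA counterpart of SEQ ID NO: 2.
print(dna_to_rna(SEQUENCES[2][1]) == SEQUENCES[3][1])
```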
  • compounds such as the solid supports disclosed herein, may exhibit the phenomena of tautomerism, conformational isomerism, geometric isomerism, and/or optical isomerism.
  • certain disclosed compounds can include one or more chiral centers and/or double bonds and as a consequence can exist as stereoisomers, such as double-bond isomers (i.e., geometric isomers), enantiomers, diastereomers, and mixtures thereof, such as racemic mixtures.
  • the compounds disclosed herein are synthesized in or are purified to be in substantially enantiopure form, such as in an 85% enantiomeric excess (e.e.), a 90% enantiomeric excess, a 95% enantiomeric excess, a 97% enantiomeric excess, a 98% enantiomeric excess, a 99% enantiomeric excess, or even in greater than a 99% enantiomeric excess, such as in a substantially enantiopure form.
  • the compounds are in a racemic form, having substantially a 50:50 mixture of enantiomers.
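  • The enantiomeric excess values quoted above relate to the enantiomer ratio through the standard definition, written here for a generic pair of enantiomers R and S. The symbols [R] and [S] (amounts of each enantiomer) are introduced for illustration and do not appear in the disclosure.

```latex
\mathrm{e.e.} = \frac{\bigl|\,[R] - [S]\,\bigr|}{[R] + [S]} \times 100\%
```

On this definition an 85% e.e. corresponds to a 92.5:7.5 mixture of enantiomers, a 99% e.e. to a 99.5:0.5 mixture, and the racemic 50:50 mixture to an e.e. of 0%.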
  • certain disclosed compounds can exist in several tautomeric forms, including the enol form, the keto form, and mixtures thereof.
  • a compound may have a moiety exhibiting the following isomerization:
  • any or all hydrogens present in the compound, or in a particular group or moiety within the compound may be replaced by a deuterium or a tritium.
  • a recitation of alkyl includes deuterated alkyl, where from one to the maximum number of hydrogens present may be replaced by deuterium.
  • ethyl may be C 2 H 5 or C 2 H 5 where from 1 to 5 hydrogens are replaced by deuterium, such as in
  • substituted refers to all subsequent modifiers in a term, for example in the term “substituted arylC1-8alkyl,” substitution may occur on the “C1-8alkyl” portion, the “aryl” portion or both portions of the arylC1-8alkyl group.
  • Aliphatic A substantially hydrocarbon-based group or moiety.
  • An aliphatic group or moiety can be acyclic, including alkyl, alkenyl, or alkynyl groups, cyclic versions thereof, such as cycloaliphatic and/or spiroaliphatic groups or moieties including cycloalkyl, cycloalkenyl, cycloalkynyl, or spiroalkyl and further including straight- and branched-chain arrangements, and all stereo and position isomers as well.
  • an aliphatic group contains from one to twenty-five carbon atoms (C1-25), for example, from one to fifteen (C1-15), from one to ten (C1-10), from one to six (C1-6), or from one to four carbon atoms (C1-4) for an acyclic alkyl group or moiety; from two to twenty-five carbon atoms (C2-25), for example, from two to fifteen (C2-15), from two to ten (C2-10), from two to six (C2-6), or from two to four carbon atoms (C2-4) for an acyclic alkenyl or alkynyl group or moiety; from three to fifteen carbon atoms (C3-15), such as from three to ten (C3-10), from three to eight (C3-8), from three to six (C3-6), or from three to four (C3-4) carbon atoms for a cycloaliphatic group or
  • An aliphatic group may be substituted or unsubstituted, unless expressly referred to as an “unsubstituted aliphatic” or a “substituted aliphatic.”
  • Alkyl A saturated aliphatic hydrocarbyl group having from 1 to 10 (C1-10) or more carbon atoms, more typically 1 to 8 (C1-8) carbon atoms such as 1 to 6 (C1-6) carbon atoms or 1 to 4 (C1-4) carbon atoms.
  • An alkyl moiety may be substituted or unsubstituted.
  • This term includes, by way of example, linear and branched hydrocarbyl groups such as methyl (-CH3), ethyl (-CH2CH3), n-propyl (-CH2CH2CH3), isopropyl (-CH(CH3)2), n-butyl (-CH2CH2CH2CH3), or isobutyl (-CH2CH(CH3)2).
  • Cycloaliphatic refers to a cyclic aliphatic group having a single ring (e.g., cyclohexyl), or multiple rings, such as in a fused, bridged or spirocyclic system, at least one of which is aliphatic. Typically, the point of attachment to the parent structure is through an aliphatic portion of the multiple ring system. Cycloaliphatic includes saturated and unsaturated systems, including cycloalkyl, cycloalkenyl and cycloalkynyl. A cycloaliphatic group may contain from three to twenty-five carbon atoms; for example, from three to fifteen, from three to ten, or from three to six carbon atoms.
  • a cycloaliphatic group may be substituted or unsubstituted.
  • exemplary cycloaliphatic groups include, but are not limited to, cyclopropyl, cyclobutyl, cyclopentyl, cyclohexyl, cycloheptyl, cyclopentenyl, or cyclohexenyl.
  • Aryl refers to an aromatic carbocyclic group of, unless specified otherwise, from 6 to 15 carbon atoms having a single ring (e.g., phenyl) or multiple condensed rings in which at least one ring is aromatic (e.g., naphthalene). If any aromatic ring portion contains a heteroatom, the group is heteroaryl and not aryl.
  • Aryl groups may be, for example, monocyclic, bicyclic, tricyclic or tetracyclic. Unless otherwise stated, an aryl group may be substituted or unsubstituted.
  • Aralkyl refers to an aryl group attached to the parent via an alkyl moiety.
  • exemplary aralkyl groups include benzyl and phenylethyl.
  • Exocyclic amine is an amine moiety that is not part of a ring structure, i.e., the nitrogen atom of the exocyclic amine is not a ring atom.
  • Exemplary exocyclic amines include, but are not limited to, the amine at the N 6 position of adenine, the amine at the N 2 position of guanine, and the amine at the N 4 position of cytosine.
  • An exocyclic amine may be unprotected or protected, such as by a suitable amine protecting group.
  • protecting groups include, but are not limited to, isobutyryl(iBu); phenoxyacetyl (Pac); levulinyl; amidine protecting groups, such as carbamate protecting groups, such as 9-fluorenylmethyl carbamate (Fmoc), 1,1 -dimethyl-
  • Heteroaryl An aromatic group or moiety of, unless specified otherwise, from 5 to 15 ring atoms comprising at least one carbon atom and at least one heteroatom, such as N, S, O, P, or Si, preferably N, S or O.
  • a heteroaryl group or moiety may comprise a single ring (e.g., pyridinyl, or pyrazine) or multiple condensed rings (e.g., indolyl).
  • Heteroaryl groups or moiety may be, for example, monocyclic, bicyclic, tricyclic or tetracyclic. Unless otherwise stated, a heteroaryl group or moiety may be substituted or unsubstituted.
  • Heterocyclyl, heterocyclo or heterocycle Aromatic and non-aromatic ring systems, and more specifically refer to a stable three- to fifteen-membered ring moiety comprising at least one carbon atom, and typically plural carbon atoms, and at least one, such as from one to five, heteroatoms.
  • the heteroatom(s) may be nitrogen, phosphorus, oxygen, silicon or sulfur atom(s), preferably N, S or O.
  • the heterocyclyl moiety may be a monocyclic moiety, or may comprise multiple rings, such as in a bicyclic or tricyclic ring system, provided that at least one of the rings contains a heteroatom.
  • Such a multiple ring moiety can include fused or bridged ring systems as well as spirocyclic systems; and any nitrogen, phosphorus, carbon, silicon or sulfur atoms in the heterocyclyl moiety can be optionally oxidized to various oxidation states.
  • nitrogens particularly, but not exclusively, those defined as annular aromatic nitrogens, are meant to include their corresponding N-oxide form, although not explicitly defined as such in a particular example.
  • annular nitrogen atoms can be optionally quaternized.
  • Heterocycle includes heteroaryl moieties, and heterocycloaliphatic moieties, such as heterocycloalkyl moieties, which are heterocyclyl rings that are partially or fully saturated. Unless otherwise stated, a heterocyclyl group or moiety may be substituted or unsubstituted.
  • heterocyclyl groups include, but are not limited to, azetidinyl, oxetanyl, acridinyl, benzodioxolyl, benzodioxanyl, benzofuranyl, dioxolanyl, indolizinyl, naphthyridinyl, phenazinyl, phenothiazinyl, phenoxazinyl, phthalazinyl, pteridinyl, purinyl, quinazolinyl, quinoxalinyl, quinolinyl, isoquinolinyl, tetrazolyl, tetrahydroisoquinolyl, piperidinyl, piperazinyl, 2-oxopiperazinyl, 2-oxopiperidinyl, 2-oxopyrrolidinyl, 2-oxoazepinyl, azepinyl, pyrrolyl, 4-piperidonyl
  • Halo, halide or halogen refers to fluoro, chloro, bromo or iodo.
  • Nucleic acid sequence refers to DNA and RNA sequences, such as cDNA and mRNA.
  • includes antisense nucleic acid sequences such as antisense RNA or antisense DNA
  • microRNAs miRNAs
  • siRNAs small interfering RNAs
  • rasiRNAs repeat-associated small interfering RNAs
  • a nucleic acid sequence is a therapeutic nucleic acid sequence, such as a DNA therapeutic (e.g., antisense oligonucleotide, DNA aptamer) or RNA therapeutic (e.g., miRNA, siRNA, ribozyme, or RNA decoy).
  • a nucleic acid sequence can include naturally occurring and/or non-naturally occurring nucleotides.
  • Nucleosides The major nucleosides of DNA are deoxyadenosine (dA), deoxyguanosine (dG), deoxycytidine (dC) and deoxythymidine (T).
  • the major nucleosides of RNA are adenosine (rA), guanosine (rG), cytidine (rC) and uridine (U).
  • rA adenosine
  • rG guanosine
  • rC cytidine
  • U uridine
  • nucleosides containing modified bases and modified sugar moieties for example as described in U.S. Patent No. 5,866,336 to Nazarenko et al. (herein incorporated by reference).
  • modified sugar moieties which may be used to modify nucleotides at any position on its structure include, but are not limited to: arabinose, 2-fluoroarabinose, xylose, and hexose.
  • a nucleoside is a 2'-deoxynucleoside (dA, dC, dG, or T).
  • a nucleoside is chemically modified (e.g., LNA, BNA or UNA).
  • Embodiments of the solid support structure may facilitate synthesizing nucleic acid sequences having reduced process-related impurities and/or increased yield, compared to the same sequence synthesized using commercial solid supports.
  • the impurities may comprise, but are not limited to, nucleic acid sequences having shorter lengths than a desired nucleic acid sequence, such as one or more nucleotides shorter; partially alkylated thymine or uracil bases in DNA or RNA sequences, possibly resulting from exposure to acrylonitrile produced during the deprotection of 2-cyanoethyl phosphate protecting groups under basic conditions; and/or impurities from removed protecting groups, such as tert-butyldimethylsilyl fluoride or tetrabutylammonium fluoride, that may contaminate the sequence, particularly solid-phase purified RNA sequences.
  • the disclosed solid support structure has a formula I:
  • CPG is controlled pore glass.
  • the CPG has a pore size of from 250 Å to 1500 Å or more, such as from 500 Å to 1500 Å, from 500 Å to 1250 Å or from 500 Å to 1000 Å, and in certain embodiments, the CPG has a pore size of about 500 Å.
  • m is 2, 3, 4, 5, or 6, such as 2, 3, or 4, and in certain embodiments, m is 3.
  • x is 1, 2, 3, 4, or 5, such as 1, 2, or 3.
  • x is 1 or 2, and in certain embodiments, x is 1.
  • y is from 2 to 12, such as 2, 3, 4, 5, 6, 7, 8, 9, 10, 11 or 12, and may be from 3 to 10, or from 4 to 8, and in some embodiments, y is 6.
  • n is from 3 to 10 or more, such as 3, 4, 5, 6, or 7, and may be 5, 6 or 7. In certain embodiments, n is 5.
  • m, x, y and/or n are selected to produce a carbon/oxygen/phosphorus backbone chain from the silicon atom to the R 2 moiety of 50 atoms or more in length, such as from 50 atoms to 400 atoms, from 60 atoms to 350 atoms, from 100 atoms to 210 atoms, or from 100 atoms to 150 atoms.
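  • The stated ranges for the integer variables of Formula I can be captured in a small validation helper. The Python sketch below is illustrative only: the class name `FormulaIParams` and the `RANGES` table are hypothetical, the defaults are the values named for certain embodiments (m = 3, x = 1, y = 6, n = 5), and the "or more" allowance for n is deliberately not modeled. It does not compute the backbone length, since that depends on the Formula I structure, which is not reproduced here.

```python
from dataclasses import dataclass

# Inclusive ranges as stated in the text for the variables of Formula I.
# Assumption: n is capped at 10 here, although the text allows "10 or more".
RANGES = {"m": (2, 6), "x": (1, 5), "y": (2, 12), "n": (3, 10)}


@dataclass(frozen=True)
class FormulaIParams:
    """Hypothetical container for the integer variables m, x, y and n."""

    m: int = 3  # value named for certain embodiments
    x: int = 1
    y: int = 6
    n: int = 5

    def __post_init__(self):
        # Reject any value that falls outside its stated inclusive range.
        for name, (low, high) in RANGES.items():
            value = getattr(self, name)
            if not low <= value <= high:
                raise ValueError(
                    f"{name}={value} is outside the stated range {low} to {high}"
                )


preferred = FormulaIParams()
print(preferred)
```

Constructing `FormulaIParams(y=13)` raises a `ValueError`, because y is stated to be from 2 to 12.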
  • Each R 1 independently is C1-6alkyl, -(CH2)1-6CN, -(CH2)1-6OR’ or a thermolabile phosphate protecting group, where R’ is aliphatic, aryl or aralkyl.
  • R’ may be alkyl, such as C1-6alkyl; alkenyl, such as C2-6alkenyl; alkynyl, such as C2-6alkynyl; cycloalkyl, such as C3-8cycloalkyl; aryl, such as phenyl; or aralkyl, such as benzyl.
  • the thermolabile phosphate protecting group may have a structure
  • X is O or S.
  • R 7 is H, R a , OR a , SR a , or N(R b )2, where R a is R d ; and R b is H, R d or two R b s together with the nitrogen to which they are attached, form a 3- to 7-membered heterocyclyl.
  • Each R 8 independently is H or R d , or one R 8 together with Z forms an aryl ring, such as phenyl.
  • Each R 9 independently is H or R d or one R 9 and one R 8 together with the atoms to which they are attached, forms a moiety having a formula where r is 0 to 6, and each R 10 independently is H, C1-6alkyl, NO2, -N(C1-6alkyl)2, -OC1-6alkyl, -SC1-6alkyl, -CN, or halogen, provided that the aromatic ring substituted with R 10 is one carbon removed from the phosphate oxygen of Formula I.
  • R d is alkyl, alkenyl, alkynyl, cycloalkyl, aryl, or aralkyl.
  • the thermolabile phosphate protecting group is selected from a set of structures (not reproduced here). Information concerning thermolabile phosphate protecting groups can be found in U.S. Patent No. 6,762,298, which is incorporated herein by reference in its entirety.
  • each R 1 independently is C1-4alkyl or -(CH2)1-4CN, and may be methyl, ethyl, propyl, -CH2CN or -CH2CH2CN, and in certain embodiments, R 1 is -CH2CH2CN.
  • each R 1 independently is a thermolabile phosphate protecting group as defined herein.
  • each R 1 is the same, but in other embodiments, the support comprises two or more R 1 moieties, such as from 2 to the maximum number of R 1 moieties present in the structure. In certain embodiments, each R 1 is -CH2CH2CN.
  • B p is a nucleic acid base where the exocyclic amine, if present, is protected.
  • the protecting group can be any suitable protecting group, and may be a protecting group as disclosed herein.
  • B p is a nucleic acid base where the exocyclic amine, if present, is protected by a benzoyl (Bz), isobutyryl (iBu), phenoxyacetyl (Pac), phenylsulfonylethoxycarbonyl, p-nitrophenyloxycarbonyl, allyloxycarbonyl, or levulinyl group.
  • B p is N 6 -benzoyl adenine (A Bz ), N 4 -benzoyl cytosine (C Bz ), N 2 -isobutyryl guanine (G iBu ), thymine (T), N 6 -phenoxyacetyl adenine (A Pac ), N 4 -phenoxyacetyl cytosine (C Pac ), N 2 -phenoxyacetyl guanine (G Pac ), uracil (U), and/or similarly exocyclic amine-protected (where applicable) hypoxanthine, xanthine, 7-methylguanine, 5,6-dihydrouracil, 5-methylcytosine, or 5-hydroxymethylcytosine.
  • B p is A Bz , C Bz , G iBu , T or A Pac , C Pac , G Pac , or U.
  • R 5 is PG or a nucleic acid sequence.
  • the nucleic acid sequence may comprise one or more DNA sequences and/or one or more RNA sequences.
  • An exemplary DNA sequence is an antisense DNA sequence.
  • An exemplary RNA sequence is an antisense RNA sequence, microRNA (miRNA) sequence, small interfering RNA (siRNA) sequence, repeat-associated small interfering RNA (rasiRNA) sequence, or a combination thereof.
  • R 6 is a hydroxyl protecting group that can be removed with fluoride ions or under essentially neutral conditions.
  • R 6 is 9-phenylxanthyl (pixyl) or a silyl protecting group, such as tert-butyldimethylsilyl (TBDMS), tert-butyldiphenylsilyl (TBDPS), trimethylsilyl (TMS), triethylsilyl (TES), or triisopropylsilyl (TIPS).
  • R 6 is TBDMS, TBDPS, TMS, TES, or TIPS, and may be TBDMS.
  • PG is any protecting group suitable for use in DNA or RNA synthesis.
  • PG is dimethoxytrityl (DMTr), triphenylmethyl (trityl), p-monomethoxytrityl (MMTr), trimethoxytrityl (TMTr), 9-phenylxanthen-9-yl, 9-(p-methoxyphenyl)xanthen-9-yl, 9-phenylthioxanthen-9-yl, or 7-chloro-9-phenylthioxanthen-9-yl.
  • PG is DMTr.
  • R 4 is H and B p is an exocyclic amine-protected adenine, an exocyclic amine-protected cytosine, an exocyclic amine-protected guanine, or thymine, for example:
  • R 4 is OR 6 and B p is an exocyclic amine-protected adenine, an exocyclic amine-protected cytosine, an exocyclic amine-protected guanine, or uracil, for example:
  • x is 1, leading to a support structure according to Formula II:
  • CPG, m, y, n, R 1 and R 2 are as previously defined for Formula I.
  • the solid support structure has a formula according to Formula III:
  • CPG, m, n, R 1 and R 2 are as previously defined for Formula I.
  • m is 3, leading to a solid support structure according to Formula IV.
  • CPG, n, x, y, R 1 and R 2 are as previously defined for Formula I.
  • x is 1 and y is 6, leading to a solid support structure according to Formula V
  • CPG, n, R 1 and R 2 are as previously defined for Formula I.
  • R 2 is H.
  • R 1 and R 5 are as previously defined for Formula I.
  • the solid support structure has a formula according to Formula VI or VII
  • CPG, m, n, x, y, R 1 and R 5 are as previously defined for Formula I.
  • R 5 is PG, where PG is as previously defined for Formula I.
  • R 2 is OR 1 . In certain embodiments, the solid support structure has a formula according to Formula VIII or IX
  • CPG, m, n, p, x, y, R 1 and R 3 are as previously defined for Formula I.
  • R 3 is H. In other embodiments, are as previously defined for
  • the solid support structure has a formula according to Formula X, XI, XII or XIII
  • R 5 is PG.
  • R 4 is H and B p is an exocyclic amine-protected adenine, an exocyclic amine-protected cytosine, an exocyclic amine-protected guanine, or thymine.
  • R 4 is OR 6 and B p is an exocyclic amine-protected adenine, an exocyclic amine-protected cytosine, an exocyclic amine-protected guanine, or uracil.
  • each R 1 may be the same, and in certain embodiments, each R 1 is -CH2CH2CN.
  • n is 3, 4, 5, 6, 7, 8, 9 or 10, such as 3, 4, 5, 6, or 7, and may be selected from 3, 5, or 7, or from 4, 5, 6 or 7, such as 5, 6, or 7. And in certain examples, n is 5.
  • R 5 is PG, such as DMTr.
  • R 5 is a nucleic acid sequence.
  • R 5 is, or comprises, a DNA sequence.
  • R 5 is, or comprises, an RNA sequence.
  • R 5 is a DNA sequence
  • R 4 is H and B p is an exocyclic amine-protected adenine, an exocyclic amine-protected cytosine, an exocyclic amine-protected guanine, or thymine.
  • R 5 is an RNA sequence
  • R 4 is OR 6
  • B p is an exocyclic amine-protected adenine, an exocyclic amine-protected cytosine, an exocyclic amine-protected guanine, or uracil.
  • Certain disclosed exemplary solid support structures within the scope of one or more of the general formulas include: where B p is an exocyclic amine-protected adenine, an exocyclic amine-protected cytosine, an exocyclic amine-protected guanine, or thymine; where B p is an exocyclic amine-protected adenine, an exocyclic amine-protected cytosine, an exocyclic amine-protected guanine, or thymine; where B p is an exocyclic amine-protected adenine, an exocyclic amine-protected cytosine, an exocyclic amine-protected guanine, or thymine; where B p is an exocyclic amine-protected adenine, an exocyclic amine-protected cytosine, an exocyclic amine-protected guanine, or th
  • the loading of the support on the CPG is from greater than zero to 125 µmol/g or more, such as from 5 µmol/g to 125 µmol/g, from 10 µmol/g to 100 µmol/g, or from 15 µmol/g to 75 µmol/g.
  • the disclosed solid support structures can be prepared as exemplified below, as illustrated for specific supports in the examples, and as will be understood by a person of ordinary skill in the art of organic synthesis.
  • An exemplary synthesis may include the following first reaction step according to Scheme 1.
  • CPG A is treated with a trialkoxysilane, such as trimethoxysilane as illustrated in Scheme 1, in a suitable solvent, such as an aprotic solvent, for example, toluene.
  • the mixture is agitated, such as by stirring or shaking, at a temperature suitable to facilitate production of compound B.
  • the temperature may be from 20 °C or lower, to 100 °C or more, such as from 25 °C to 75 °C or from 40 °C to 60 °C, and in some embodiments, a temperature of about 50 °C is used.
  • the reaction may proceed until the reaction is complete, such as by reaching an equilibrium, and may proceed for from 6 hours to 48 hours, from 12 hours to 36 hours, or from 18 hours to 30 hours, and in some embodiments, the reaction proceeds for about 24 hours.
  • Compound B is then treated with a suitable base, such as aqueous ammonia, to form compound C.
  • the mixture is agitated, such as by stirring or shaking, at a temperature suitable to facilitate the reaction, such as from 25 °C or less to 75 °C or more, or from 40 °C to 60 °C, and in some embodiments, a temperature of about 55 °C is used.
  • the reaction may proceed from greater than zero to 6 hours or more, such as from 1 hour to 4 hours, or for about 2 hours.
  • Compound C is then isolated by a suitable technique, such as filtration.
  • Compound C is treated with phosphoramidite D under a standard solid-phase DNA synthesis protocol, such as conditions recommended by a manufacturer of an automated DNA/RNA synthesizer, to form Compound E.
  • the protecting group is any protecting group suitable to facilitate solid phase DNA synthesis, such as DMTr.
  • each alkyl group in the (alkyl)2N moiety in phosphoramidite D may be C1-6alkyl or the two alkyl moieties together with the nitrogen to which they are attached form a 5- to 7-membered heterocycloaliphatic group.
  • Suitable (alkyl)2N moieties include, but are not limited to, dimethylamino (N(CH3)2), diethylamino (N(CH2CH3)2), di-n-propylamino, diisopropylamino, di-n-butylamino, diisobutylamino, di-sec-butylamino, di-tert-butylamino, di-n-hexylamino, or morpholino.
  • Compound E is exposed to an aqueous solution of iodine and then to reagents needed to inactivate any unreacted hydroxyl groups, such as a 1:1 (v/v) solution of Cap A (Ac2O/THF/pyridine) and Cap B (10% 1-methylimidazole in THF).
  • the protecting group is then removed under acidic conditions to form Compound F.
  • a person of ordinary skill in the art understands the conditions required to remove a particular protecting group, and additional information concerning suitable protecting groups and how to remove them can be found in “Greene’s Protective Groups in Organic Synthesis, Fourth Edition,” published by John Wiley and Sons, Inc, April 10, 2006.
  • a DMTr protecting group may be removed by treatment with 3% trichloroacetic acid in a suitable solvent, such as a chlorinated solvent (for example, chloroform or dichloromethane).
  • the chain length can be extended as desired by repeating the steps above, as illustrated in Scheme 3.
  • Compound G is obtained after oxidation of the phosphite triester intermediate with an aqueous solution of iodine, followed by treatment with a 1:1 (v/v) Cap A:Cap B solution to inactivate unreacted hydroxyl groups, and after removal of the protecting group under acidic conditions as previously described with respect to Scheme 2.
  • Compound G is then treated with phosphoramidite H according to standard solid-phase DNA synthesis protocols.
  • the amino moiety is protected by a suitable protecting group, such as a 4-monomethoxytrityl group. Removal of such a group produces Compound J. A person of ordinary skill in the art understands how to remove such protecting groups.
  • a 4-monomethoxytrityl amino protecting group may be removed using 3% trichloroacetic acid (TCA) in a chlorinated solvent, such as dichloromethane, over a period of 15 minutes at about 25 °C.
  • a fourth reaction step in the exemplary synthesis is provided below according to
  • Compound J is treated with a suitable 5’-O-deoxy- or ribonucleoside K comprising a nucleic acid base suitable for the 3’-end of the resultant nucleic acid sequence, and a linker suitable to attach the nucleoside to the solid support.
  • a suitable protecting group such as a protecting group disclosed herein.
  • an exemplary succinate linker is shown, but a person of ordinary skill in the art understands that any linker suitable to facilitate the DNA or RNA sequence synthesis may be used.
  • the reaction proceeds in the presence of a suitable coupling agent, such as dicyclohexylcarbodiimide (DCC), ethyl-(N’,N’-dimethylamino)propylcarbodiimide hydrochloride (EDC), diisopropylcarbodiimide (DIC), carbonyldiimidazole (CDI), BOP, PyBOP, BOP-Cl, or HATU.
  • a solvent suitable to facilitate the coupling reaction such as pyridine, DMF, acetonitrile, toluene, a chlorinated solvent,
  • the reaction mixture is treated with a 1:1 (v/v) Cap A:Cap B solution to inactivate any unreacted amine moieties, and the solid support is filtered and treated with a suitable reagent, such as TCA in a chlorinated solvent, such as dichloromethane, to remove the protecting group to form compound L.
  • Compound G is treated with phosphoramidite M according to standard solid-phase DNA synthesis protocols. After treatment with an aqueous solution of iodine, unreacted hydroxyl groups are inactivated by a 1:1 (v/v) Cap A:Cap B solution as previously described with respect to Scheme 2.
  • a suitable protecting group such as a 4-monomethoxytrityl or 4,4’-dimethoxytrityl group. Removal of such a group produces Compound N, and a person of ordinary skill in the art understands the conditions used to remove such protecting groups.
  • a 4-monomethoxytrityl or 4,4’-dimethoxytrityl hydroxyl protecting group may be removed using 3% TCA in a chlorinated solvent, such as dichloromethane, over a period of 15 minutes at about 25 °C.
  • phosphoramidite M is [structure not reproduced]. Using either solid support L or solid support N, the DNA or RNA sequence can be synthesized in an automated DNA/RNA synthesizer using the standard protocols recommended by the manufacturer. Upon completion of the automated solid-phase synthesis, the DNA sequence is released by passing ammonium hydroxide through the synthesis column over a suitable period, such as from greater than zero to 1 hour or more, or from 10 minutes to 30 minutes, while collecting the eluate.
  • the eluate then is heated to a temperature suitable to ensure complete deprotection, such as from 30 °C or less to 100 °C or more, for example, from 40 °C to 75 °C or from 50 °C to 60 °C, and in some embodiments, the temperature is about 55 °C.
  • the eluate is heated for a time period to facilitate deprotection, such as from 6 hours or less to 30 hours or more, from 12 hours to 24 hours, or from 15 hours to 20 hours, and in some embodiments, the eluate is heated for about 18 hours.
  • nucleic acid sequences, such as RNA sequences, may be manually released by suspending each support in an alcoholic solution of concentrated ammonium hydroxide, typically an ethanolic solution, at an approximate ratio of from 1:1 v/v to 1:5 or more v/v, such as about EtOH:NH4OH (1:3 v/v).
  • the mixture is maintained, typically in a closed container, at ambient temperature, such as from 20 °C to 30 °C or about 25 °C, for a time period suitable to facilitate release of the nucleic acid sequence.
  • the time period may be from 6 hours or less to 24 hours or more, such as from 12 hours to 18 hours, and in some embodiments, the time period is about 16 hours.
  • For RNA sequences, the residue is dissolved in a suitable solvent, such as DMSO, and treated under conditions suitable to remove the OH protecting group.
  • the residue is treated with a fluoride reagent, such as triethylamine trihydrofluoride.
  • the mixtures are heated, such as on a heat block, at a temperature suitable to facilitate OH deprotection, such as from 50 °C or less to 100 °C or more, from 55 °C to 75 °C or about 65 °C for a suitable time period, such as from 1 hour or less to 56 hours or more, from 2 hours to 4 hours or about 3 hours.
IV. Examples
  • the CPG support 6 was produced upon exposing 5 to a solution of 3% trichloroacetic acid (TCA) in CH2Cl2 to cleave the 4,4’-dimethoxytrityl (DMTr) group according to a standard automated DNA synthesis protocol.
  • the released DMTr cation solution obtained from an accurately weighed sample of support 5 was collected into a 10-mL volumetric flask and spectrophotometrically measured at 498 nm to reveal a functional hydroxyl concentration of 108 µmole OH/gram of CPG support 6.
  • the procedure comprises repeating all the steps described above at the same scale, under the same conditions, using CPG support 6 as the starting material.
  • Cleavage of the 4-monomethoxytritylamino protecting group was performed manually, off the automated DNA/RNA synthesizer, using 3% TCA in CH2Cl2 over a period of 15 min at about 25 °C. Multiple batches of each CPG support were needed to generate enough material to initiate solid-phase synthesis of each DNA or RNA sequence at the 1 µmole scale on each support.
  • CPG support 10, 11 or 12 (50 mg)
  • the glass vial and its content were then subjected to high vacuum for 2 hours at about 25 °C.
  • a solution of 10% dry pyridine in anhydrous DMF (200 µL) was added by syringe to the glass vial, which was immediately sealed with a Teflon-lined screw cap and shaken at about 25 °C over a period of 24 hours.
  • the suspension was filtered, washed with dry CH3CN (10 mL) and treated with 2 mL of a 1:1 (v/v) Cap A:Cap B solution to inactivate any unreacted amine functions.
  • the CPG support was again filtered, washed with dry
  • Automated syntheses of DNA and RNA sequences were performed on a DNA/RNA synthesizer, employing commercial long chain alkylamine controlled-pore glass supports (LCAA-CPG) or modified CPG supports 13, 14 and 15 pre-loaded with suitably protected leader deoxy- or ribonucleosides. Each solid support was accurately weighed, based on its leader nucleoside load, to provide one micromole of leader nucleoside per synthesis column. The synthesis of DNA or RNA sequences was conducted, side-by-side, on LCAA-CPG and CPG support 13 according to the standard (trityl-off) DNA or RNA protocol conditions recommended by the manufacturer of the DNA/RNA synthesizer.
  • Each RNA sequence linked to LCAA-CPG and CPG 17 supports was manually released upon suspending each support in 1 mL of an ethanolic solution of concentrated ammonium hydroxide [EtOH:NH4OH (1:3 v/v)] kept in capped 4-mL screw cap glass vials over 16 hours at about 25 °C.
  • the support of each vial was then filtered and washed with RNase free water (0.5 mL) twice, and the filtrates were placed in 1.5 mL polypropylene microcentrifuge tubes and concentrated to dryness using a speedvac concentrator.
  • Each RNA sequence was dissolved in DMSO (100 µL) to which was added triethylamine trihydrofluoride (125 µL).
  • Each RNA sequence solution was heated on a heat block at 65 °C for 3 hours.
  • Each deprotected RNA sequence solution was cooled to room temperature, diluted with 775 µL of RNase free water, and desalted through a PD-10 column.
  • Each desalted RNA solution was immediately analyzed by RP-HPLC as described below.
  • a first attempt to reduce the level of process-related impurities in synthetic DNA and RNA sequences used a CPG 500 support functionalized with one hexaethylene glycol spacer under typical solid-phase synthesis conditions.
  • a DNA sequence (20-mer) was produced in a yield not better than that obtained (86%) when employing the standard commercial LCAA-CPG support.
  • Hexaethylene glycol has about the same number of carbon-carbon (C-C) bond lengths (about 18) as the long chain alkylamine spacer of LCAA-CPG.
  • CPG support 6 (see Example 2) was functionalized with two, four and six additional hexaethylene glycol spacers to provide the CPG supports 13, 14 and 15, from which solid-phase syntheses of DNA and RNA sequences were conducted to provide supports 16, 17, and 18, respectively.
  • FIG. 1 provides expanded HPLC profiles of unpurified 5’-d(CTGAGTAGCGAACGTGAAGA) (SEQ ID NO: 1), which was released from commercial LCAA-CPG (red profile) or CPG support 16 (blue profile) after complete deprotection. Peak heights of each profile were normalized to the highest peak, which was then set to 0.15 absorbance unit (AU) at 254 nm.
  • FIG. 2 provides the expanded HPLC profiles of unpurified SEQ ID NO: 1, which was released from commercial LCAA-CPG (red profile) or CPG support 17 (blue profile) after complete deprotection. Peak heights of each profile were normalized to the highest peak, which was then set to 0.15 absorbance unit (AU) at 254 nm.
  • FIG. 2 demonstrates that the product released from CPG support 17 was of higher quality than that obtained from the commercial LCAA-CPG support.
  • the presence of the shoulder on the right side of the peak shown at rt: 20.4 min and of the peak at 21.8 min in the red profile was considerably reduced in the profile corresponding to CPG support 17.
  • FIG. 3 provides the extended HPLC profiles of unpurified SEQ ID NO: 1 that were released from commercial LCAA-CPG (red) or disclosed CPG support 18 (blue) after complete deprotection.
  • FIG. 4 provides the extended HPLC profiles of unpurified SEQ ID NO: 1 that were released from disclosed CPG support 17 (blue) or support 18 (black) after complete deprotection.
  • peak heights of each profile were normalized to the highest peak, which was then set to 0.15 absorbance unit (AU) at 254 nm.
  • FIGS. 3 and 4 show that the release of SEQ ID NO: 1 from CPG support 18 was highly comparable to that obtained from support 17 based on side-by-side comparison of their chromatographic profiles.
  • FIG. 5 provides expanded HPLC profiles of unpurified SEQ ID NO: 3, which was released from commercial LCAA-CPG (red profile) or CPG support 17 (blue profile) after complete deprotection.
  • FIGS. 6, 7 and 8 provide the expanded HPLC profiles for SEQ ID NOs: 2, 4 and 5, respectively, released from the disclosed CPG support 17 (blue) and LCAA-CPG (red), after complete deprotection.
  • FIG. 9 provides stacked HPLC profiles for SEQ ID NO: 2 produced using an LCAA-CPG support and CPG support 17 as disclosed herein, illustrating the approximate 50% reduction in impurities in the product made using CPG support 17, compared to the product made using LCAA-CPG.
  • the loading on the CPG decreases as the length increases, possibly due to issues resulting from synthesizing such long chain supports.
  • the CPG support 14 is composed of five hexaethylene glycol spacers and was found to be as efficient as the CPG support 15, carrying seven hexaethylene glycol spacers, for minimizing process-related impurities in synthetic DNA sequences.
  • a reduction of process-related impurities of up to 53% in synthetic nucleic acid sequences was achieved when using 14, instead of LCAA-CPG, for the solid-phase synthesis of nucleic acid sequences.
  • although the levels of process-related impurities may vary depending on the composition of the nucleic acid sequence, a lower content of residual process-related impurities facilitates the removal of those impurities from the full-length nucleic acid sequences, to ultimately provide nucleic acid-based drugs of extremely high purity for safer and more efficacious therapies for human diseases.
  • the universal support is produced using a universal linker phosphoramidite, such as the universal linker phosphoramidite shown below, according to the method illustrated in Example 4. Unreacted hydroxyl moieties are inactivated using a 1:1 (v/v) Cap A:Cap B solution as described in Example 4. And standard DNA or RNA synthesis proceeds as described in Examples 6 and 7.
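
The trityl assay described in the Examples (the DMTr cation released from an accurately weighed sample is collected into a 10-mL volumetric flask and read at 498 nm) converts to a loading value through the Beer-Lambert law. The following is a minimal sketch of that arithmetic, not the authors' procedure: the function and parameter names are hypothetical, and the molar absorptivity of the DMTr cation is not stated in the source, so it must be supplied by the user.

```python
def dmtr_loading_umol_per_g(absorbance, volume_ml, support_mass_mg,
                            epsilon_m1_cm1, path_cm=1.0):
    """Estimate support loading (umol/g) from a DMTr cation absorbance reading.

    absorbance       -- absorbance of the collected solution (e.g. at 498 nm)
    volume_ml        -- final volume of the collected solution, in mL
    support_mass_mg  -- mass of the weighed support sample, in mg
    epsilon_m1_cm1   -- molar absorptivity in M^-1 cm^-1 (ASSUMPTION: user-supplied)
    path_cm          -- cuvette path length in cm
    """
    if absorbance < 0 or min(volume_ml, support_mass_mg, epsilon_m1_cm1, path_cm) <= 0:
        raise ValueError("inputs must be positive (absorbance may be zero)")
    # Beer-Lambert: concentration (mol/L) = A / (epsilon * path length)
    conc_mol_per_l = absorbance / (epsilon_m1_cm1 * path_cm)
    # Amount of DMTr cation released, converted from mol to umol
    released_umol = conc_mol_per_l * (volume_ml / 1000.0) * 1e6
    # Normalize to grams of support
    return released_umol / (support_mass_mg / 1000.0)


if __name__ == "__main__":
    # Hypothetical reading; 70000 M^-1 cm^-1 is an assumed, commonly quoted
    # approximate value, not a figure taken from the source.
    print(dmtr_loading_umol_per_g(0.7, 10.0, 1.0, 70000.0))
```

If the collected solution is diluted before measurement, the result would additionally need to be multiplied by the dilution factor.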

Abstract

Disclosed herein are embodiments of a solid support suitable for synthesizing nucleic acid sequences. The solid support may have a structure according to Formula (I), where CPG is controlled pore glass, and m, n, x, y, R1 and R2 are as defined herein. Also disclosed are methods for making and using the solid support, kits including solid support, and a universal linker phosphoramidite suitable for use in the solid support.

Description

SOLID SUPPORT FOR SYNTHESIZING NUCLEIC ACID SEQUENCES AND METHODS FOR MAKING AND USING
CROSS REFERENCE TO RELATED APPLICATION
This application claims the benefit of the earlier filing date of U.S. provisional patent application No. 63/046,413, filed June 30, 2020, which is incorporated herein by reference in its entirety.
FIELD
The application concerns a solid support for synthesizing nucleic acid sequences and methods for making and using the solid support.
BACKGROUND
The purity of synthetic nucleic acid sequences is important for the production of safe and efficacious nucleic acid-based drugs, such as those for antisense or RNA interference in vivo therapies. Highly pure synthetic DNA sequences are also important for the construction of entire genes to be used in synthetic biology applications (e.g., mRNA and/or genome editing). Although antisense DNA (asDNA) sequences and small interfering RNA (siRNA) duplexes have been demonstrated to be highly potent at silencing the expression of disease-causing proteins in vitro, the clinical application of these nucleic acid sequences for the treatment of human diseases has been hindered by various factors including: (i) instability in biological media; (ii) poor delivery to target cells; (iii) poor uptake by target cells; and (iv) dose-related toxicities. Severe thrombocytopenic or peripheral neuropathy adverse events have been reported in patients treated with asDNA sequences or siRNAs, respectively. These limitations have prompted the use of chemical modifications and/or formulations to improve nuclease resistance and binding affinity of asDNAs to their respective targets with the aims of enhancing cellular delivery, potency and efficacy of nucleic acid-based drugs. Identification of the root cause leading to adverse events associated with the use of asDNA sequences is challenging given the various structural modifications made to DNA sequences to ensure their stability in a biological environment and affinity to targeted mRNA sequences. Furthermore, even though the phosphoramidite-based manufacture of synthetic DNA and RNA sequences is highly efficient, synthetic DNA and RNA sequences are still contaminated with process-related impurities. These impurities include partially protected and/or 5’-uncapped DNA or RNA sequences leading to the production of shorter than full-length sequences.
The distinct shorter than full-length (n-1) DNA sequences are difficult to remove from the full-length DNA product and can potentially elicit immune responses and/or adverse events arising from off-target activities upon administration to patients under antisense therapy settings. Accordingly, there is a need to minimize the formation of those process-related impurities to levels that should not become a safety concern to patients.
SUMMARY
Disclosed herein are embodiments of a solid support suitable for solid phase synthesis of nucleic acid sequences. Using the disclosed solid support may result in a nucleic acid composition that has a reduced amount of impurities, compared to the same nucleic acid sequence being produced using current commercially available solid supports. In some embodiments, the disclosed solid support has a structure according to Formula I
Figure imgf000004_0001
Formula I.
With respect to Formula I, CPG is controlled pore glass. m is from 2 to 6, such as 2, 3, 4, 5, or 6, and in some embodiments, m is 2, 3 or 4, and may be 3. x is from 1 to 5, such as 1, 2, 3, 4, or 5, and in some embodiments, x is 1, 2 or 3, and may be 1. y is from 2 to 12, and in some embodiments, y is from 3 to 10, and may be 6. n is from 3 to 10, such as 3, 4, 5, 6, 7, 8, 9 or 10, and in some embodiments, n is from 3 to 7, and may be 5. And each R1 independently is C1-6alkyl, -(CH2)1-6CN, -(CH2)1-6OR’ or a thermolabile phosphate protecting group, where R’ is aliphatic, aryl, or aralkyl. R1 may be C1-4alkyl or -(CH2)1-4CN, and in certain embodiments, R1 is -CH2CH2CN.
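The index ranges recited above for Formula I can be captured in a small bookkeeping helper, for example when tabulating candidate supports. This is only an illustrative sketch: the class and constant names are hypothetical, the defaults are the "may be" values from the paragraph above, and only the summary ranges are checked (other passages of the disclosure allow n to exceed 10).

```python
from __future__ import annotations

from dataclasses import dataclass

# Inclusive (low, high) ranges as recited in the Formula I summary.
FORMULA_I_RANGES = {"m": (2, 6), "x": (1, 5), "y": (2, 12), "n": (3, 10)}


@dataclass(frozen=True)
class SupportIndices:
    """Hypothetical container for the Formula I indices m, x, y and n."""
    m: int = 3
    x: int = 1
    y: int = 6
    n: int = 5

    def out_of_range(self) -> list[str]:
        """Return the names of indices falling outside the summary ranges."""
        bad = []
        for name, (low, high) in FORMULA_I_RANGES.items():
            if not low <= getattr(self, name) <= high:
                bad.append(name)
        return bad


if __name__ == "__main__":
    print(SupportIndices().out_of_range())       # default values from the summary
    print(SupportIndices(n=12).out_of_range())   # n above the summary range
```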
In any embodiments, R2 may
Figure imgf000004_0002
Figure imgf000005_0001
from 2 to 10, such as 2, 3, 4, 5, 6, 7, 8, 9 or 10, and in certain embodiments, p is 6. And R3 may
Figure imgf000005_0002
2, 3 or 4, and may be 2; R4 is H or OR6; and Bp is a nucleic acid base where the exocyclic amine group, if present, is protected.
R5 is PG or a nucleic acid sequence, where PG is a protecting group. In any embodiments, PG may be 4,4’-dimethoxytrityl (DMTr).
R6 may be 9-phenylxanthyl (pixyl), tert-butyldimethylsilyl (TBDMS), tert-butyldiphenylsilyl (TBDPS), trimethylsilyl (TMS), triethylsilyl (TES), or triisopropylsilyl (TIPS), and in some embodiments, R6 is TBDMS. m, x, y and n may be selected to produce a support backbone length from the silicon atom to the R2 moiety of from 50 atoms to 400 atoms, such as from 100 atoms to 150 atoms.
In any embodiments, Bp may be a nucleic acid base with exocyclic amine group(s) protected if present, such as exocyclic amine-protected adenine, exocyclic amine-protected cytosine, exocyclic amine-protected guanine, thymine, uracil, hypoxanthine, xanthine, exocyclic amine-protected 7-methylguanine, 5,6-dihydrouracil, exocyclic amine-protected 5-methylcytosine, or exocyclic amine-protected 5-hydroxymethylcytosine, and may be exocyclic amine-protected adenine, exocyclic amine-protected cytosine, exocyclic amine-protected guanine, thymine, or uracil. In some embodiments, Bp is adenine, cytosine, or guanine, where the exocyclic amine is protected by a benzoyl (Bz), isobutyryl (iBu), phenoxyacetyl (Pac), phenylsulfonylethoxycarbonyl, p-nitrophenyloxycarbonyl, allyloxycarbonyl, or levulinyl group. In other embodiments, Bp is thymine or uracil.
In some embodiments, R4 is H and/or Bp is exocyclic amine-protected adenine, exocyclic amine-protected cytosine, exocyclic amine-protected guanine, or thymine. In other embodiments, R4 is OR6 and/or Bp is exocyclic amine-protected adenine, exocyclic amine-protected cytosine, exocyclic amine-protected guanine, or uracil. In such embodiments, R6 may be TBDMS, TBDPS, TMS, TES, or TIPS, such as TBDMS. Exocyclic amine-protected adenine may be N6-benzoyl adenine (ABz) or N6-phenoxyacetyl adenine (APac). Exocyclic amine-protected cytosine may be N4-benzoyl cytosine (CBz) or N4-phenoxyacetyl cytosine (CPac). And/or exocyclic amine-protected guanine may be N2-isobutyryl guanine (GiBu) or N2-phenoxyacetyl guanine (GPac). In any embodiments, a loading of the support on the CPG may be from 5 µmol/g to about 125 µmol/g.
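Because the loading is expressed in µmol per gram, the mass of support to weigh out for a given synthesis scale follows by simple division; the Examples describe weighing each support to provide one micromole of leader nucleoside per column. The sketch below shows that arithmetic under stated assumptions: the function name is hypothetical and the loadings used are illustrative values within the recited 5 to 125 µmol/g range, not measured data.

```python
def support_mass_mg(scale_umol, loading_umol_per_g):
    """Mass of support (mg) that carries `scale_umol` at the given loading."""
    if scale_umol <= 0 or loading_umol_per_g <= 0:
        raise ValueError("scale and loading must be positive")
    # grams = umol / (umol per gram); multiply by 1000 to express in mg
    return scale_umol / loading_umol_per_g * 1000.0


if __name__ == "__main__":
    # Illustrative loadings only (assumed values inside the recited range).
    for loading in (5.0, 50.0, 125.0):
        mass = support_mass_mg(1.0, loading)
        print(f"{loading:6.1f} umol/g -> {mass:.1f} mg for a 1 umol synthesis")
```

A lower loading therefore requires proportionally more support per column for the same synthesis scale.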
And in some embodiments, the solid support has a formula selected from:
Figure imgf000006_0001
Figure imgf000007_0001
In any embodiments, t may be 2. Also in any embodiments, R5 may be PG, and in some embodiments, is DMTr. Alternatively, R5 may be a nucleic acid sequence, and may comprise one or more DNA sequences, such as one or more antisense DNA sequences. In other embodiments, the nucleic acid sequence comprises one or more RNA sequences, such as one or more antisense RNA sequences, one or more microRNA (miRNA) sequences, one or more small interfering RNA (siRNA) sequences, one or more repeat-associated small interfering RNA (rasiRNA) sequences, or combinations thereof.
Also disclosed is a universal linker phosphoramidite suitable for use with certain embodiments of the disclosed solid support. The universal linker may have a structure:
Figure imgf000007_0002
Embodiments of a method for synthesizing a nucleic acid sequence using the disclosed solid support also are disclosed herein. In some embodiments, the method comprises loading a solid support according to any one of the disclosed embodiments into a DNA/RNA synthesizer, and operating the synthesizer to produce a desired nucleic acid sequence. In some embodiments, the solid support is a solid support where R5 is PG, such as DMTr.
Also disclosed herein is a kit comprising a solid support according to any one of the disclosed embodiments, and may comprise a protected 2'-deoxynucleoside, ribonucleoside, and/or chemically modified nucleoside wherein an exocyclic amine on the deoxynucleoside, ribonucleoside or chemically modified nucleoside, if present, also is protected. The 2'-deoxynucleoside may be DMTrdABz, DMTrdCBz, DMTrdGiBu, or DMTrT, and/or the ribonucleosides may be DMTrAPac-2’-OTBDMS, DMTrCPac-2’-OTBDMS, DMTrGPac-2’-OTBDMS, or DMTrU-2’-OTBDMS. In some embodiments, the kit comprises a universal linker phosphoramidite, such as the universal linker phosphoramidite disclosed herein. In some embodiments, the kit further comprises ammonium hydroxide.
The foregoing and other objects and features of the disclosure will become more apparent from the following detailed description, which proceeds with reference to the accompanying figures.
BRIEF DESCRIPTION OF THE DRAWINGS
FIG. 1 is a graph of retention time versus absorbance units at 254 nm, illustrating the HPLC profiles of unpurified 5’-d(CTGAGTAGCGAACGTGAAGA) (SEQ ID NO: 1) produced by an embodiment of the disclosed solid support structure comprising 3 hexaethylene glycol phosphate repeating units, and comparing it to the same sequence produced using a commercial long chain alkylamine-controlled pore glass (LCAA-CPG) support.
FIG. 2 is a graph of retention time versus absorbance units at 254 nm, illustrating the HPLC profiles of unpurified 5’-d(CTGAGTAGCGAACGTGAAGA) (SEQ ID NO: 1) produced by an embodiment of the disclosed solid support structure comprising 5 hexaethylene glycol phosphate repeating units, and comparing it to the same sequence produced using a commercial LCAA-CPG support.
FIG. 3 is a graph of retention time versus absorbance units at 254 nm, illustrating the HPLC profiles of unpurified 5’-d(CTGAGTAGCGAACGTGAAGA) (SEQ ID NO: 1) produced by an embodiment of the disclosed solid support structure comprising 7 hexaethylene glycol phosphate repeating units, and comparing it to the same sequence produced using a commercial LCAA-CPG support.
FIG. 4 is a graph of retention time versus absorbance units at 254 nm, comparing the HPLC profiles of unpurified 5’-d(CTGAGTAGCGAACGTGAAGA) (SEQ ID NO: 1) produced by embodiments of the disclosed solid support structure comprising 5 (blue) or 7 (black) hexaethylene glycol phosphate repeating units.
FIG. 5 is a graph of retention time versus absorbance units at 254 nm, illustrating the HPLC profiles of unpurified 5’-r(UCUUGGUUACAUGAAAUCCU) (SEQ ID NO: 3) produced by an embodiment of the disclosed solid support structure comprising 5 hexaethylene glycol phosphate repeating units, and comparing it to the same sequence produced using a commercial LCAA-CPG support.
FIG. 6 is a graph of retention time versus absorbance units at 254 nm, illustrating the HPLC profiles of unpurified 5’-d(TCTTGGTTACATGAAATCCT) (SEQ ID NO: 2) produced by an embodiment of the disclosed solid support structure comprising 5 hexaethylene glycol phosphate repeating units, and comparing it to the same sequence produced using a commercial LCAA-CPG support.
FIG. 7 is a graph of retention time versus absorbance units at 254 nm, illustrating the HPLC profiles of unpurified 5’-d(ATAGTGTGCATCGATGCCAC) (SEQ ID NO: 5) produced by an embodiment of the disclosed solid support structure comprising 5 hexaethylene glycol phosphate repeating units, and comparing it to the same sequence produced using a commercial LCAA-CPG support.
FIG. 8 is a graph of retention time versus absorbance units at 254 nm, illustrating the HPLC profiles of unpurified 5’-d(CTCTGTACCTTACGTCTTCG) (SEQ ID NO: 4) produced by an embodiment of the disclosed solid support structure comprising 5 hexaethylene glycol phosphate repeating units, and comparing it to the same sequence produced using a commercial LCAA-CPG support.
FIG. 9 provides stacked expanded HPLC profiles of the spectra from FIG. 6, illustrating the approximately 50% reduction in impurities in the product made using the disclosed CPG support, compared to the product made using the commercial LCAA-CPG support.
FIG. 10 provides stacked expanded HPLC profiles for sequences according to SEQ ID NO: 2 produced by CPG supports where n = 5 and n = 10, and illustrating that longer support structures, such as n = 10, provide substantially the same purity benefits as support structures where n = 5.
FIG. 11 provides stacked expanded HPLC profiles for sequences according to SEQ ID NO: 1 produced by CPG supports where n is 1, 3 or 5, illustrating the improved purity achieved by using supports where n is 3 and 5 compared to the purity achieved when n = 1.
SEQUENCE LISTING
The nucleic acid sequences listed in the accompanying sequence listing are shown using standard letter abbreviations for nucleotide bases as defined in 37 C.F.R. 1.822. Only one strand of each nucleic acid sequence is shown, but the complementary strand is understood as included by any reference to the displayed strand. The Sequence Listing is submitted as an ASCII text file, created on June 28, 2021, 4 KB, which is incorporated by reference herein in its entirety. In the accompanying sequence listing:
SEQ ID NOs: 1-5 are nucleic acid sequences produced using exemplary embodiments of the disclosed solid support structure.
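As an illustrative aid only, and not part of the sequence listing itself, the sequences recited in the descriptions of FIGS. 1-8 can be collected and sanity-checked in a few lines. The following minimal sketch assumes the sequences exactly as written in those figure descriptions; the dictionary and helper names are hypothetical and introduced solely for illustration.

```python
# Sequences as recited in the descriptions of FIGS. 1-8, written 5' to 3'.
# The container and function names below are hypothetical.
SEQUENCES = {
    1: ("DNA", "CTGAGTAGCGAACGTGAAGA"),
    2: ("DNA", "TCTTGGTTACATGAAATCCT"),
    3: ("RNA", "UCUUGGUUACAUGAAAUCCU"),
    4: ("DNA", "CTCTGTACCTTACGTCTTCG"),
    5: ("DNA", "ATAGTGTGCATCGATGCCAC"),
}

# Allowed one-letter base codes for each kind of sequence.
ALPHABET = {"DNA": set("ACGT"), "RNA": set("ACGU")}


def is_valid(kind: str, seq: str) -> bool:
    """Return True if seq is non-empty and uses only the bases allowed for kind."""
    return bool(seq) and set(seq) <= ALPHABET[kind]


def dna_to_rna(seq: str) -> str:
    """Return the RNA counterpart of a DNA sequence (T replaced by U)."""
    return seq.replace("T", "U")


if __name__ == "__main__":
    for seq_id, (kind, seq) in sorted(SEQUENCES.items()):
        print(f"SEQ ID NO: {seq_id}  {kind}  length={len(seq)}  valid={is_valid(kind, seq)}")
```

Under these assumptions, each sequence is a 20-mer, and applying `dna_to_rna` to SEQ ID NO: 2 gives the string recited for SEQ ID NO: 3, which suggests that the RNA example is the ribonucleotide counterpart of that DNA example.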
DETAILED DESCRIPTION
I. Terms
The following explanations of terms and methods are provided to better describe the present disclosure and to guide those of ordinary skill in the art in the practice of the present disclosure. The singular forms “a,” “an,” and “the” refer to one or more than one, unless the context clearly dictates otherwise. The term “or” refers to a single element of stated alternative elements or a combination of two or more elements, unless the context clearly indicates otherwise. As used herein, “comprises” means “includes.” Thus, “comprising A or B,” means “including A, B, or A and B,” without excluding additional elements. All references, including patents and patent applications cited herein, are incorporated by reference in their entireties.
Unless otherwise indicated, all numbers expressing quantities of components, molecular weights, percentages, temperatures, times, and so forth, as used in the specification or claims are to be understood as being modified by the term “about.” Accordingly, unless otherwise indicated, implicitly or explicitly, the numerical parameters set forth are approximations that may depend on the desired properties sought and/or limits of detection under standard test conditions/methods. When directly and explicitly distinguishing embodiments from discussed prior art, the embodiment numbers are not approximates unless the word “about” is recited.
Unless explained otherwise, all technical and scientific terms used herein have the same meaning as commonly understood to one of ordinary skill in the art to which this disclosure belongs. Although methods and materials similar or equivalent to those described herein can be used in the practice or testing of the present disclosure, suitable methods and materials are described below. The materials, methods, and examples are illustrative only and not intended to be limiting.
When chemical structures are depicted or described, unless explicitly stated otherwise, all carbons are assumed to include implicit hydrogens such that each carbon conforms to a valence of four. For example, in the structure on the left-hand side of the schematic below there are nine hydrogen atoms implied. The nine hydrogen atoms are depicted in the right-hand structure.
Figure imgf000011_0001
Sometimes a particular atom in a structure is described in textual formula as having a hydrogen or hydrogen atoms, for example -CH2CH2-. It will be understood by a person of ordinary skill in the art that the aforementioned descriptive techniques are common in the chemical arts to provide brevity and simplicity to description of organic structures.
A person of ordinary skill in the art will appreciate that compounds, such as the solid supports disclosed herein, may exhibit the phenomena of tautomerism, conformational isomerism, geometric isomerism, and/or optical isomerism. For example, certain disclosed compounds can include one or more chiral centers and/or double bonds and as a consequence can exist as stereoisomers, such as double-bond isomers (i.e., geometric isomers), enantiomers, diastereomers, and mixtures thereof, such as racemic mixtures. In certain embodiments the compounds disclosed herein are synthesized in or are purified to be in substantially enantiopure form, such as in an 85% enantiomeric excess (e.e.), a 90% enantiomeric excess, a 95% enantiomeric excess, a 97% enantiomeric excess, a 98% enantiomeric excess, a 99% enantiomeric excess, or even in greater than a 99% enantiomeric excess, such as in a substantially enantiopure form. In other embodiments, the compounds are in a racemic form, having substantially a 50:50 mixture of enantiomers.
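For readers less familiar with the term, enantiomeric excess is conventionally calculated as the absolute difference between the amounts of the two enantiomers divided by their sum, expressed as a percentage. The short sketch below illustrates that conventional definition; the function name and the example amounts are hypothetical and are not taken from the disclosure.

```python
def enantiomeric_excess(major: float, minor: float) -> float:
    """Return the enantiomeric excess (percent) for two enantiomer amounts.

    The amounts may be in any consistent unit (moles, mass, peak area).
    """
    if major < 0 or minor < 0:
        raise ValueError("amounts must be non-negative")
    total = major + minor
    if total == 0:
        raise ValueError("at least one amount must be greater than zero")
    return 100.0 * abs(major - minor) / total


if __name__ == "__main__":
    # A 92.5:7.5 mixture corresponds to an 85% e.e.
    print(enantiomeric_excess(92.5, 7.5))
    # A racemic 50:50 mixture has an e.e. of zero.
    print(enantiomeric_excess(50.0, 50.0))
```

On this definition, an 85% e.e. corresponds to a 92.5:7.5 ratio of enantiomers, and a 99% e.e. corresponds to a 99.5:0.5 ratio.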
As another example, certain disclosed compounds can exist in several tautomeric forms, including the enol form, the keto form, and mixtures thereof. For example, a compound may have a moiety exhibiting the following isomerization:
Figure imgf000011_0002
As the various compound names, formulae and compound drawings within the specification and claims can represent only one of the possible tautomeric, conformational isomeric, optical isomeric, or geometric isomeric forms, a person of ordinary skill in the art will appreciate that the disclosed compounds encompass any tautomeric, conformational isomeric, optical isomeric, and/or geometric isomeric forms of the compounds described herein, as well as mixtures of these various different isomeric forms. In cases of limited rotation, e.g. around the amide bond, atropisomers are also possible and are also specifically included in the compounds of the invention.
In any embodiments, any or all hydrogens present in the compound, or in a particular group or moiety within the compound, may be replaced by a deuterium or a tritium. Thus, a recitation of alkyl includes deuterated alkyl, where from one to the maximum number of hydrogens present may be replaced by deuterium. For example, ethyl may be C2H5 or C2H5 where from 1 to 5 hydrogens are replaced by deuterium, such as in C2DxH5-x.
As used herein, the term “substituted” refers to all subsequent modifiers in a term, for example in the term “substituted arylC1-8alkyl,” substitution may occur on the “C1-8alkyl” portion, the “aryl” portion or both portions of the arylC1-8alkyl group.
Aliphatic: A substantially hydrocarbon-based group or moiety. An aliphatic group or moiety can be acyclic, including alkyl, alkenyl, or alkynyl groups, cyclic versions thereof, such as cycloaliphatic and/or spiroaliphatic groups or moieties including cycloalkyl, cycloalkenyl, cycloalkynyl, or spiroalkyl and further including straight- and branched-chain arrangements, and all stereo and position isomers as well. Unless expressly stated otherwise, an aliphatic group contains from one to twenty-five carbon atoms (C1-25), for example, from one to fifteen (C1-15), from one to ten (C1-10), from one to six (C1-6), or from one to four carbon atoms (C1-4) for an acyclic alkyl group or moiety; from two to twenty-five carbon atoms (C2-25), for example, from two to fifteen (C2-15), from two to ten (C2-10), from two to six (C2-6), or from two to four carbon atoms (C2-4) for an acyclic alkenyl or alkynyl group or moiety; from three to fifteen carbon atoms (C3-15), such as from three to ten (C3-10), from three to eight (C3-8), from three to six (C3-6), or from three to four (C3-4) carbon atoms for a cycloaliphatic group or moiety; or from three to fifteen (C6-15) carbon atoms for a spiroaliphatic group or moiety. An aliphatic group may be substituted or unsubstituted, unless expressly referred to as an “unsubstituted aliphatic” or a “substituted aliphatic.” An aliphatic group can be substituted with one or more substituents (up to two substituents for each methylene carbon in an aliphatic chain, or up to one substituent for each carbon of a -C=C- double bond in an aliphatic chain, or up to one substituent for a carbon of a terminal methine group).
Alkyl: A saturated aliphatic hydrocarbyl group having from 1 to 10 (C1-10) or more carbon atoms, more typically 1 to 8 (C1-8) carbon atoms such as 1 to 6 (C1-6) carbon atoms or 1 to 4 (C1-4) carbon atoms. An alkyl moiety may be substituted or unsubstituted. This term includes, by way of example, linear and branched hydrocarbyl groups such as methyl (-CH3), ethyl (-CH2CH3), n-propyl (-CH2CH2CH3), isopropyl (-CH(CH3)2), n-butyl (-CH2CH2CH2CH3), or isobutyl (-CH2CH(CH3)2).
Cycloaliphatic: Refers to a cyclic aliphatic group having a single ring (e.g., cyclohexyl), or multiple rings, such as in a fused, bridged or spirocyclic system, at least one of which is aliphatic. Typically, the point of attachment to the parent structure is through an aliphatic portion of the multiple ring system. Cycloaliphatic includes saturated and unsaturated systems, including cycloalkyl, cycloalkenyl and cycloalkynyl. A cycloaliphatic group may contain from three to twenty-five carbon atoms; for example, from three to fifteen, from three to ten, or from three to six carbon atoms. Unless otherwise stated, a cycloaliphatic group may be substituted or unsubstituted. Exemplary cycloaliphatic groups include, but are not limited to, cyclopropyl, cyclobutyl, cyclopentyl, cyclohexyl, cycloheptyl, cyclopentenyl, or cyclohexenyl.
Aryl: Refers to an aromatic carbocyclic group of, unless specified otherwise, from 6 to 15 carbon atoms having a single ring (e.g., phenyl) or multiple condensed rings in which at least one ring is aromatic (e.g., naphthalene). If any aromatic ring portion contains a heteroatom, the group is heteroaryl and not aryl. Aryl groups may be, for example, monocyclic, bicyclic, tricyclic or tetracyclic. Unless otherwise stated, an aryl group may be substituted or unsubstituted.
Aralkyl: Refers to an aryl group attached to the parent via an alkyl moiety. Exemplary aralkyl groups include benzyl and phenylethyl.
Exocyclic amine: As used herein, an exocyclic amine is an amine moiety that is not part of a ring structure, i.e., the nitrogen atom of the exocyclic amine is not a ring atom. Exemplary exocyclic amines include, but are not limited to, the amine at the N6 position of adenine, the amine at the N2 position of guanine, and the amine at the N4 position of cytosine. An exocyclic amine may be unprotected or protected, such as by a suitable amine protecting group. Exemplary protecting groups include, but are not limited to, isobutyryl (iBu); phenoxyacetyl (Pac); levulinyl; amidine protecting groups, such as
Figure imgf000014_0001
carbamate protecting groups, such as 9-fluorenylmethyl carbamate (Fmoc), 1,1-dimethyl-2,2,2-trichloroethyl carbamate (TCBOC),
Figure imgf000014_0002
(where R is H, Cl or NO2), 2-(4-nitrophenyl)ethyl carbamate, benzyl carbamate (Cbz), allyl carbamate (allyloxycarbonyl), 4-nitrophenyloxycarbonyl, or (CH3)2CHCH2OC(=O)-; or amide protecting groups, such as formamide, acetamide, CH3CH2C(=O)-, (CH3)2CHC(=O)-, (CH3)3CC(=O)-,
Figure imgf000014_0008
MeOCH2C(=O)-, i-PrOCH2C(=O)-,
Figure imgf000014_0003
(where R is H, 2-Cl or 4-t-butyl), MeC(=O)CH2CH2C(=O)-, benzoyl (Bz),
Figure imgf000014_0004
(where R is 4-methoxy, 4-Cl, 4-nitro, 4-NMe2, 4-tert-butyl, 2-methyl, 3-Cl, 3,4-dichloro, or 3-methoxy-4-phenoxy),
Figure imgf000014_0005
is Ac, or Ph),
Figure imgf000014_0007
or
Figure imgf000014_0006
Additional information concerning protecting groups for exocyclic amines on nucleic acid bases can be found in Beaucage, S. L. and Iyer, R. P. “Advances in the Synthesis of Oligonucleotides by the Phosphoramidite Approach,” Tetrahedron, 1992, Vol. 48(12), pp 2223-2311, which is incorporated herein by reference in its entirety.
Heteroaryl: An aromatic group or moiety of, unless specified otherwise, from 5 to 15 ring atoms comprising at least one carbon atom and at least one heteroatom, such as N, S, O, P, or Si, preferably N, S or O. A heteroaryl group or moiety may comprise a single ring (e.g., pyridinyl or pyrazinyl) or multiple condensed rings (e.g., indolyl). A heteroaryl group or moiety may be, for example, monocyclic, bicyclic, tricyclic or tetracyclic. Unless otherwise stated, a heteroaryl group or moiety may be substituted or unsubstituted.
Heterocyclyl, heterocyclo or heterocycle: Aromatic and non-aromatic ring systems, and more specifically refer to a stable three- to fifteen-membered ring moiety comprising at least one carbon atom, and typically plural carbon atoms, and at least one, such as from one to five, heteroatoms. The heteroatom(s) may be nitrogen, phosphorus, oxygen, silicon or sulfur atom(s), preferably N, S or O. The heterocyclyl moiety may be a monocyclic moiety, or may comprise multiple rings, such as in a bicyclic or tricyclic ring system, provided that at least one of the rings contains a heteroatom. Such a multiple ring moiety can include fused or bridged ring systems as well as spirocyclic systems; and any nitrogen, phosphorus, carbon, silicon or sulfur atoms in the heterocyclyl moiety can be optionally oxidized to various oxidation states. For convenience, nitrogens, particularly, but not exclusively, those defined as annular aromatic nitrogens, are meant to include their corresponding N-oxide form, although not explicitly defined as such in a particular example. Thus, for a compound having, for example, a pyridinyl ring, the corresponding pyridinyl-N-oxide is included as another compound of the invention, unless expressly excluded or excluded by context. In addition, annular nitrogen atoms can be optionally quaternized. Heterocycle includes heteroaryl moieties, and heterocycloaliphatic moieties, such as heterocycloalkyl moieties, which are heterocyclyl rings that are partially or fully saturated. Unless otherwise stated, a heterocyclyl group or moiety may be substituted or unsubstituted. 
Examples of heterocyclyl groups include, but are not limited to, azetidinyl, oxetanyl, acridinyl, benzodioxolyl, benzodioxanyl, benzofuranyl, dioxolanyl, indolizinyl, naphthyridinyl, phenazinyl, phenothiazinyl, phenoxazinyl, phthalazinyl, pteridinyl, purinyl, quinazolinyl, quinoxalinyl, quinolinyl, isoquinolinyl, tetrazolyl, tetrahydroisoquinolyl, piperidinyl, piperazinyl, 2-oxopiperazinyl, 2-oxopiperidinyl, 2-oxopyrrolidinyl, 2-oxoazepinyl, azepinyl, pyrrolyl, 4-piperidonyl, pyrrolidinyl, pyrazolyl, pyrazolidinyl, imidazolyl, imidazolinyl, imidazolidinyl, dihydropyridinyl, tetrahydropyridinyl, pyridinyl, pyrazinyl, pyrimidinyl, pyridazinyl, oxazolyl, oxazolinyl, oxazolidinyl, triazolyl, isoxazolyl, isoxazolidinyl, morpholinyl, thiazolyl, thiazolinyl, thiazolidinyl, isothiazolyl, quinuclidinyl, isothiazolidinyl, indolyl, isoindolyl, indolinyl, isoindolinyl, octahydroindolyl, octahydroisoindolyl, quinolyl, isoquinolyl, decahydroisoquinolyl, benzimidazolyl, thiadiazolyl, benzopyranyl, benzothiazolyl, benzoxazolyl, furyl, tetrahydrofuryl, tetrahydropyranyl, thienyl, benzothienyl, thiamorpholinyl, thiamorpholinyl sulfoxide, thiamorpholinyl sulfone, and oxadiazolyl.
Halo, halide or halogen: Refers to fluoro, chloro, bromo or iodo.
Nucleic acid sequence: Refers to DNA and RNA sequences, such as cDNA and mRNA. In one example, a nucleic acid sequence includes antisense nucleic acid sequences (such as antisense RNA or antisense DNA), microRNAs (miRNAs), small interfering RNAs (siRNAs), and repeat-associated small interfering RNAs (rasiRNAs). In one example, a nucleic acid sequence is a therapeutic nucleic acid sequence, such as a DNA therapeutic (e.g., antisense oligonucleotide, DNA aptamers) or RNA therapeutic (e.g., miRNA, siRNA, ribozyme, or RNA decoy). A nucleic acid sequence can include naturally occurring and/or non-naturally occurring nucleotides.
Nucleosides: The major nucleosides of DNA are deoxyadenosine (dA), deoxyguanosine (dG), deoxycytidine (dC) and deoxythymidine (T). The major nucleosides of RNA are adenosine (rA), guanosine (rG), cytidine (rC) and uridine (U). Includes nucleosides containing modified bases and modified sugar moieties, for example as described in U.S. Patent No. 5,866,336 to Nazarenko et al. (herein incorporated by reference). Examples of modified sugar moieties which may be used to modify nucleotides at any position on its structure include, but are not limited to: arabinose, 2-fluoroarabinose, xylose, and hexose. In one example, a nucleoside is a 2'-deoxynucleoside (dA, dC, dG, or T). In one example, a nucleoside is chemically modified (e.g., LNA, BNA or UNA).
II. Solid Support Structure
Disclosed herein is a solid support structure suitable for synthesizing nucleic acid sequences. Embodiments of the solid support structure may facilitate synthesizing nucleic acid sequences having reduced process-related impurities and/or increased yield, compared to the same sequence synthesized using commercial solid supports. The impurities may comprise, but are not limited to, nucleic acid sequences having shorter lengths than a desired nucleic acid sequence, such as one or more nucleotides shorter; partially alkylated thymine or uracil bases in DNA or RNA sequences, possibly resulting from exposure to acrylonitrile produced during the deprotection of 2-cyanoethyl phosphate protecting groups under basic conditions; and/or impurities from removed protecting groups, such as tert-butyldimethylsilyl fluoride or tetrabutylammonium fluoride, that may contaminate the sequence, particularly solid-phase purified RNA sequences.
In some embodiments, the disclosed solid support structure has a formula I:
Figure imgf000017_0001
Formula I
With respect to formula I, CPG is controlled pore glass. In some embodiments, the CPG has a pore size of from 250 Å to 1500 Å or more, such as from 500 Å to 1500 Å, from 500 Å to 1250 Å or from 500 Å to 1000 Å, and in certain embodiments, the CPG has a pore size of about 500 Å. m is 2, 3, 4, 5, or 6, such as 2, 3, or 4, and in certain embodiments, m is 3. x is 1, 2, 3, 4, or 5, such as 1, 2, or 3. In some embodiments, x is 1 or 2, and in certain embodiments, x is 1. y is from 2 to 12, such as 2, 3, 4, 5, 6, 7, 8, 9, 10, 11 or 12, and may be from 3 to 10, or from 4 to 8, and in some embodiments, y is 6. n is from 3 to 10 or more, such as 3, 4, 5, 6, or 7, and may be 5, 6 or 7. In certain embodiments, n is 5.
In some embodiments, m, x, y and/or n are selected to produce a carbon/oxygen/phosphorus backbone chain from the silicon atom to the R2 moiety of 50 atoms or more in length, such as from 50 atoms to 400 atoms, from 60 atoms to 350 atoms, from 100 atoms to 210 atoms, or from 100 atoms to 150 atoms.
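The ranges recited above for m, x, y and n can be summarized as a small validated record. The sketch below is an illustration only, assuming the integer ranges stated for Formula I (m from 2 to 6, x from 1 to 5, y from 2 to 12, and n of 3 or more); the class name is hypothetical, and the defaults reflect the values recited for certain embodiments (m = 3, x = 1, y = 6, n = 5). It does not attempt to compute the backbone chain length, which depends on the structure shown in Formula I.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class FormulaIParams:
    """Hypothetical container for the Formula I indices m, x, y and n."""

    m: int = 3  # recited range: 2 to 6
    x: int = 1  # recited range: 1 to 5
    y: int = 6  # recited range: 2 to 12
    n: int = 5  # recited range: 3 to 10 or more (no upper bound enforced)

    def __post_init__(self) -> None:
        if not 2 <= self.m <= 6:
            raise ValueError(f"m must be from 2 to 6, got {self.m}")
        if not 1 <= self.x <= 5:
            raise ValueError(f"x must be from 1 to 5, got {self.x}")
        if not 2 <= self.y <= 12:
            raise ValueError(f"y must be from 2 to 12, got {self.y}")
        if self.n < 3:
            raise ValueError(f"n must be 3 or more, got {self.n}")


if __name__ == "__main__":
    print(FormulaIParams())       # values recited for certain embodiments
    print(FormulaIParams(n=7))    # a longer support structure
```

Because n is recited as "from 3 to 10 or more," the sketch enforces only the lower bound on n.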
Each R1 independently is C1-6alkyl, -(CH2)1-6CN, -(CH2)1-6OR’ or a thermolabile phosphate protecting group, where R’ is aliphatic, aryl or aralkyl. R’ may be alkyl, such as C1-6alkyl; alkenyl, such as C2-6alkenyl; alkynyl, such as C2-6alkynyl; cycloalkyl, such as C3-8cycloalkyl; aryl, such as phenyl; or aralkyl, such as benzyl. The thermolabile phosphate protecting group may have a structure
[Chemical structure of the thermolabile phosphate protecting group, showing substituents R7, R8, R9, Z and X]
With respect to this structure, X is O or S. R7 is H, Ra, ORa, SRa, or N(Rb)2, where Ra is Rd; and Rb is H, Rd or two Rbs together with the nitrogen to which they are attached, form a 3- to 7-membered heterocyclyl.
Z is O, S, N(Rc), C(Rc)2 or C(Rc)2C(Rc)2 where each Rc independently is H or Rd or one Rd in combination with the C=X moiety and one Ra or Rb from R7 together form a 3- to 7-membered cycloaliphatic or heterocyclyl ring.
Each R8 independently is H or Rd, or one R8 together with Z forms an aryl ring, such as phenyl.
Each R9 independently is H or Rd or one R9 and one R8 together with the atoms to which they are attached, forms a moiety having a formula
Figure imgf000018_0001
where r is 0 to 6, and each R10 independently is H, C1-6alkyl, NO2, -N(C1-6alkyl)2, -OC1-6alkyl, -SC1-6alkyl, -CN, or halogen, provided that the aromatic ring substituted with R10 is one carbon removed from the phosphate oxygen of Formula I.
Rd is alkyl, alkenyl, alkynyl, cycloalkyl, aryl, or aralkyl.
In some embodiments, the thermolabile phosphate protecting group is selected from
Figure imgf000018_0002
Additional information concerning thermolabile phosphate protecting groups can be found in U.S. Patent No. 6,762,298, which is incorporated herein by reference in its entirety.
In some embodiments, each R1 independently is C1-4alkyl or -(CH2)1-4CN, and may be methyl, ethyl, propyl, -CH2CN or -CH2CH2CN, and in certain embodiments, R1 is -CH2CH2CN.
In other embodiments, each R1 independently is a thermolabile phosphate protecting group as defined herein.
And in some embodiments, each R1 is the same, but in other embodiments, the support comprises two or more different R1 moieties, such as from 2 to the maximum number of R1 moieties present in the structure. In certain embodiments, each R1 is -CH2CH2CN.
Figure imgf000019_0001
from 2 to 10, such as from 3 to 8 or from 4 to 8, and in certain embodiments, p is 6.
Figure imgf000019_0002
such as 2; R4 is H or OR6; and Bp is a nucleic acid base where the exocyclic amine, if present, is protected. The protecting group can be any suitable protecting group, and may be a protecting group as disclosed herein. In some embodiments, Bp is a nucleic acid base where the exocyclic amine, if present, is protected by a benzoyl (Bz), isobutyryl (iBu), phenoxyacetyl (Pac), phenylsulfonylethoxycarbonyl, p-nitrophenyloxycarbonyl, allyloxycarbonyl, or levulinyl group. In certain embodiments, Bp is N6-benzoyl adenine (ABz), N4-benzoyl cytosine (CBz), N2-isobutyryl guanine (GiBu), thymine (T), N6-phenoxyacetyl adenine (APac), N4-phenoxyacetyl cytosine (CPac), N2-phenoxyacetyl guanine (GPac), uracil (U), and/or similarly exocyclic amine-protected (where applicable) hypoxanthine, xanthine, 7-methylguanine, 5,6-dihydrouracil, 5-methylcytosine, or 5-hydroxymethylcytosine. Additional information concerning modified nucleic acid bases that can be used with the disclosed technology can be found in U.S. Patent Nos. 7,355,037 and 7,612,197, which are incorporated herein by reference in their entireties. In some embodiments, Bp is ABz, CBz, GiBu, T or APac, CPac, GPac, or U.
R5 is PG or a nucleic acid sequence. The nucleic acid sequence may comprise one or more DNA sequences and/or one or more RNA sequences. An exemplary DNA sequence is an antisense DNA sequence. An exemplary RNA sequence is an antisense RNA sequence, microRNA (miRNA) sequence, small interfering RNA (siRNA) sequence, repeat-associated small interfering RNA (rasiRNA) sequence, or a combination thereof. A person of ordinary skill in the art understands that when R5 is a nucleic acid sequence, the nucleic acid sequence is attached to the support via a phosphate moiety at the 3’ end of the nucleic acid sequence, in the same manner as nucleotides are typically attached together to form a nucleic acid sequence.
R6 is a hydroxyl protecting group that can be removed with fluoride ions or under essentially neutral conditions. Typically, R6 is 9-phenylxanthyl (pixyl) or a silyl protecting group, such as tert-butyldimethylsilyl (TBDMS), tert-butyldiphenylsilyl (TBDPS), trimethylsilyl (TMS), triethylsilyl (TES), or triisopropylsilyl (TIPS). In certain embodiments, R6 is TBDMS, TBDPS, TMS, TES, or TIPS, and may be TBDMS.
PG is any protecting group suitable for use in DNA or RNA synthesis. In some embodiments PG is dimethoxytrityl (DMTr), triphenylmethyl (trityl), p-monomethoxytrityl (MMTr), trimethoxytrityl (TMTr), 9-phenylxanthen-9-yl, 9-(p-methoxyphenyl)xanthen-9-yl, 9-phenylthioxanthen-9-yl, or 7-chloro-9-phenylthioxanthen-9-yl. In certain embodiments, PG is DMTr.
Figure imgf000020_0001
DMTr.
In some embodiments, R4 is H and Bp is an exocyclic amine-protected adenine, an exocyclic amine-protected cytosine, an exocyclic amine-protected guanine, or thymine, for example:
Figure imgf000021_0001
In other embodiments, R4 is OR6 and Bp is an exocyclic amine-protected adenine, an exocyclic amine-protected cytosine, an exocyclic amine-protected guanine, or uracil, for example:
Figure imgf000021_0002
In some embodiments, x is 1, leading to a support structure according to Formula II:
Figure imgf000021_0003
Formula II.
With respect to formula II, CPG, m, y, n, R1 and R2 are as previously defined for Formula I.
In some embodiments of Formulas I and II, y is 6. In particular embodiments, the solid support structure has a formula according to Formula III:
Figure imgf000021_0004
Formula III.
With respect to Formula III, CPG, m, n, R1 and R2 are as previously defined for Formula I.
In some embodiments of Formula I, m is 3, leading to a solid support structure according to Formula IV
Figure imgf000021_0005
Formula IV.
With respect to Formula IV, CPG, n, x, y, R1 and R2 are as previously defined for Formula I. In certain embodiments of Formula IV, x is 1 and y is 6, leading to a solid support structure according to Formula V
Figure imgf000022_0001
Formula V.
With respect to Formula V, CPG, n, R1 and R2 are as previously defined for Formula I. In some embodiments of Formulas I-V, R2 is H.
In other embodiments of Formulas
Figure imgf000022_0002
R1 and R5 are as previously defined for Formula I. In certain embodiments, the solid support structure has a formula according to Formula VI or VII
Figure imgf000022_0003
Formula VII.
With respect to Formulas VI and VII, CPG, m, n, x, y, R1 and R5 are as previously defined for Formula I. In certain embodiments of Formulas VI and VII, R5 is PG, where PG is as previously defined for Formula I.
In some other embodiments of Formulas I to V, R2 is -P(=O)(OR1)-O-(CH2)p-NHR3. In certain embodiments, the solid support structure has a formula according to Formula VIII or IX
Figure imgf000023_0001
Formula IX.
With respect to Formulas VIII and IX, CPG, m, n, p, x, y, R1 and R3 are as previously defined for Formula I.
In some embodiments of Formulas I-V or VIII-IX, R3 is H. In other embodiments,
Figure imgf000023_0002
are as previously defined for Formula I. And in certain embodiments, the solid support structure has a formula according to Formula X, XI, XII or XIII
Figure imgf000023_0003
Formula XI
Figure imgf000024_0001
Formula XIII.
With respect to Formulas X-XIII, CPG, m, n, p, t, x, y, R1, R4, R5, and Bp are as previously defined for Formula I. In some embodiments, R5 is PG.
In some embodiments of Formulas X-XIII, R4 is H and Bp is an exocyclic amine-protected adenine, an exocyclic amine-protected cytosine, an exocyclic amine-protected guanine, or thymine. In other embodiments of Formulas X-XIII, R4 is OR6 and Bp is an exocyclic amine-protected adenine, an exocyclic amine-protected cytosine, an exocyclic amine-protected guanine, or uracil.
In particular embodiments of Formulas I-V and VIII-XIII, p is 6.
In any embodiments, each R1 may be the same, and in certain embodiments, each R1 is -CH2CH2CN.
In any embodiments, n is 3, 4, 5, 6, 7, 8, 9 or 10, such as 3, 4, 5, 6, or 7, and may be selected from 3, 5, or 7, or from 4, 5, 6 or 7, such as 5, 6, or 7. And in certain examples, n is 5.
In certain embodiments of Formulas I-XIII, R5 is PG, such as DMTr. In other embodiments, R5 is a nucleic acid sequence. In certain embodiments, R5 is, or comprises, a DNA sequence. In certain other embodiments, R5 is, or comprises, an RNA sequence. In particular embodiments, R5 is a DNA sequence, R4 is H and Bp is an exocyclic amine-protected adenine, an exocyclic amine-protected cytosine, an exocyclic amine-protected guanine, or thymine. In other particular embodiments, R5 is an RNA sequence, R4 is OR6 and Bp is an exocyclic amine-protected adenine, an exocyclic amine-protected cytosine, an exocyclic amine-protected guanine, or uracil. Certain disclosed exemplary solid support structures within the scope of one or more of the general formulas include:
Figure imgf000025_0001
Figure imgf000026_0001
Figure imgf000027_0001
where Bp is an exocyclic amine-protected adenine, an exocyclic amine-protected cytosine, an exocyclic amine-protected guanine, or thymine;
Figure imgf000027_0002
where Bp is an exocyclic amine-protected adenine, an exocyclic amine-protected cytosine, an exocyclic amine-protected guanine, or thymine;
Figure imgf000027_0003
where Bp is an exocyclic amine-protected adenine, an exocyclic amine-protected cytosine, an exocyclic amine-protected guanine, or thymine;
Figure imgf000028_0001
where Bp is an exocyclic amine-protected adenine, an exocyclic amine-protected cytosine, an exocyclic amine-protected guanine, or thymine;
Figure imgf000028_0002
where Bp is an exocyclic amine-protected adenine, an exocyclic amine-protected cytosine, an exocyclic amine-protected guanine, or thymine;
Figure imgf000028_0003
where Bp is an exocyclic amine-protected adenine, an exocyclic amine-protected cytosine, an exocyclic amine-protected guanine, or thymine;
Figure imgf000028_0004
where Bp is an exocyclic amine-protected adenine, an exocyclic amine-protected cytosine, an exocyclic amine-protected guanine, or uracil;
Figure imgf000028_0005
where Bp is an exocyclic amine-protected adenine, an exocyclic amine-protected cytosine, an exocyclic amine-protected guanine, or uracil;
Figure imgf000029_0001
where Bp is an exocyclic amine-protected adenine, an exocyclic amine-protected cytosine, an exocyclic amine-protected guanine, or uracil;
Figure imgf000029_0002
where Bp is an exocyclic amine-protected adenine, an exocyclic amine-protected cytosine, an exocyclic amine-protected guanine, or uracil;
Figure imgf000029_0003
where Bp is an exocyclic amine-protected adenine, an exocyclic amine-protected cytosine, an exocyclic amine-protected guanine, or uracil;
Figure imgf000029_0004
where Bp is an exocyclic amine-protected adenine, an exocyclic amine-protected cytosine, an exocyclic amine-protected guanine, or uracil.
In any embodiments, the loading of the support on the CPG is from greater than zero to 125 μmol/g or more, such as from 5 μmol/g to 125 μmol/g, from 10 μmol/g to 100 μmol/g, or from 15 μmol/g to 75 μmol/g.
III. Method for Making the Solid Support Structure
The disclosed solid support structures can be prepared as exemplified below, as illustrated for specific supports in the examples, and as will be understood by a person of ordinary skill in the art of organic synthesis. An exemplary synthesis may include the following first reaction step according to Scheme 1.
Figure imgf000030_0001
A B C
Scheme 1
CPG A is treated with a trialkoxysilane, such as trimethoxysilane as illustrated in Scheme 1, in a suitable solvent, such as an aprotic solvent, for example, toluene. The mixture is agitated, such as by stirring or shaking, at a temperature suitable to facilitate production of compound B. The temperature may be from 20 °C or lower, to 100 °C or more, such as from 25 °C to 75 °C or from 40 °C to 60 °C, and in some embodiments, a temperature of about 50 °C is used. The reaction may proceed until the reaction is complete, such as by reaching an equilibrium, and may proceed for from 6 hours to 48 hours, from 12 hours to 36 hours, or from 18 hours to 30 hours, and in some embodiments, the reaction proceeds for about 24 hours. Compound B is then treated with a suitable base, such as aqueous ammonia, to form compound C. The mixture is agitated, such as by stirring or shaking, at a temperature suitable to facilitate the reaction, such as from 25 °C or less to 75 °C or more, or from 40 °C to 60 °C, and in some embodiments, a temperature of about 55 °C is used. The reaction may proceed from greater than zero to 6 hours or more, such as from 1 hour to 4 hours, or for about 2 hours. Compound C is then isolated by a suitable technique, such as filtration.
A second reaction step in the exemplary synthesis is provided below according to Scheme 2.
Figure imgf000031_0001
Scheme 2
Compound C is treated with phosphoramidite D under a standard solid-phase DNA synthesis protocol, such as conditions recommended by a manufacturer of an automated DNA/RNA synthesizer, to form Compound E. The protecting group is any protecting group suitable to facilitate solid phase DNA synthesis, such as DMTr. Each alkyl group in the (alkyl)2N moiety in phosphoramidite D may be C1-6alkyl, or the two alkyl moieties together with the nitrogen to which they are attached form a 5- to 7-membered heterocycloaliphatic group. Suitable (alkyl)2N moieties include, but are not limited to, dimethylamino (N(CH3)2), diethylamino (N(CH2CH3)2), di-n-propylamino, diisopropylamino, di-n-butylamino, diisobutylamino, di-sec-butylamino, di-tert-butylamino, di-n-hexylamino, or morpholino.
Compound E is exposed to an aqueous solution of iodine and then to reagents needed to inactivate any unreacted hydroxyl groups, such as a 1:1 (v/v) solution of Cap A (Ac2O/THF/pyridine) and Cap B (10% 1-methylimidazole in THF). The protecting group is then removed under acidic conditions to form Compound F. A person of ordinary skill in the art understands the conditions required to remove a particular protecting group, and additional information concerning suitable protecting groups and how to remove them can be found in “Greene’s Protective Groups in Organic Synthesis, Fourth Edition,” published by John Wiley and Sons, Inc, April 10, 2006. For example, a DMTr protecting group may be removed by treatment with 3% trichloroacetic acid in a suitable solvent, such as a chlorinated solvent (for example, chloroform or dichloromethane).
The chain length can be extended as desired by repeating the steps above, as illustrated in Scheme 3.
Figure imgf000032_0002
Scheme 3
A third reaction step in the exemplary synthesis is provided below according to Scheme 4.
C6-amino modifier phosphoramidite
Figure imgf000032_0001
Scheme 4
Compound G is obtained after oxidation of the phosphite triester intermediate with an aqueous solution of iodine followed by treatment with a 1:1 (v/v) Cap A:Cap B solution to inactivate unreacted hydroxyl groups and after removal of the protecting group under acidic conditions as previously described with respect to Scheme 2. Compound G is then treated with phosphoramidite H according to standard solid-phase DNA synthesis protocols. Typically, the amino moiety is protected by a suitable protecting group, such as a 4-monomethoxytrityl group. Removal of such a group produces Compound J. A person of ordinary skill in the art understands how to remove such protecting groups. For example, a 4-monomethoxytrityl amino protecting group may be removed using 3% trichloroacetic acid (TCA) in a chlorinated solvent, such as dichloromethane, over a period of 15 minutes at about 25 °C.
A fourth reaction step in the exemplary synthesis is provided below according to Scheme 5.
5'-O-PG-2'-O-deoxy- or ribo-nucleoside 3'-O-linker
Figure imgf000033_0001
Scheme 5
Compound J is treated with a suitable 5’-O-deoxy- or ribonucleoside K comprising a nucleic acid base suitable for the 3’-end of the resultant nucleic acid sequence, and a linker suitable to attach the nucleoside to the solid support. A person of ordinary skill in the art understands that if the nucleic acid base comprises an exocyclic amine, such amine likely will be protected by a suitable protecting group, such as a protecting group disclosed herein. In Scheme 5, an exemplary succinate linker is shown, but a person of ordinary skill in the art understands that any linker suitable to facilitate the DNA or RNA sequence synthesis may be used. The reaction proceeds in the presence of a suitable coupling agent, such as dicyclohexylcarbodiimide (DCC), ethyl-(N’,N’-dimethylamino)propylcarbodiimide hydrochloride (EDC), diisopropylcarbodiimide (DIC), carbonyldiimidazole (CDI), BOP, PyBOP, BOP-Cl, or HATU. The reaction is performed in a solvent suitable to facilitate the coupling reaction, such as pyridine, DMF, acetonitrile, toluene, a chlorinated solvent, such as chloroform, dichloromethane, or dichloroethane, or any combination thereof. Pyridine may be used in combination with another solvent to further facilitate the reaction.
After the reaction is complete, the reaction mixture is treated with a 1:1 (v/v) Cap A:Cap B solution to inactivate any unreacted amine moieties, and the solid support is filtered and treated with a suitable reagent, such as TCA in a chlorinated solvent, such as dichloromethane, to remove the protecting group to form compound L.
An alternative exemplary synthesis to those illustrated by Schemes 4 and 5 is shown in Scheme 6.
Figure imgf000034_0001
Scheme 6
Compound G is treated with phosphoramidite M according to standard solid-phase DNA synthesis protocols. After treatment with an aqueous solution of iodine, unreacted hydroxyl groups are inactivated by a 1:1 (v/v) Cap A:Cap B solution as previously described with respect to Scheme 2. Typically, the hydroxyl moiety of M is protected by a suitable protecting group, such as a 4-monomethoxytrityl or 4,4’-dimethoxytrityl group. Removal of such a group produces Compound N, and a person of ordinary skill in the art understands the conditions used to remove such protecting groups. For example, a 4-monomethoxytrityl or 4,4’-dimethoxytrityl hydroxyl protecting group may be removed using 3% TCA in a chlorinated solvent, such as dichloromethane, over a period of 15 minutes at about 25 °C.
In some embodiments, phosphoramidite M is
Figure imgf000034_0002
Using either solid support L or solid support N, the DNA or RNA sequence can be synthesized in an automated DNA/RNA synthesizer using the standard protocols recommended by the manufacturer. Upon completion of the automated solid-phase synthesis, the DNA sequence is released by passing ammonium hydroxide through the synthesis column over a suitable period, such as from greater than zero to 1 hour or more, or from 10 minutes to 30 minutes, while collecting the eluate. The eluate then is heated to a temperature suitable to ensure complete deprotection, such as from 30 °C or less to 100 °C or less, for example, from 40 °C to 75 °C or from 50 °C to 60 °C, and in some embodiments, the temperature is about 55 °C. The eluate is heated for a time period to facilitate deprotection, such as from 6 hours or less to 30 hours or more, from 12 hours to 24 hours, or from 15 hours to 20 hours, and in some embodiments, the eluate is heated for about 18 hours.
Alternatively, nucleic acid sequences, such as RNA sequences, may be manually released by suspending each support in an alcoholic solution of concentrated ammonium hydroxide, typically an ethanolic solution, at an approximate ratio of from 1:1 v/v to 1:5 or more v/v, such as about EtOH/NH4OH (1:3 v/v). The mixture is maintained, typically in a closed container, at ambient temperature, such as from 20 °C to 30 °C or about 25 °C, for a time period suitable to facilitate release of the nucleic acid sequence. The time period may be from 6 hours or less to 24 hours or more, such as from 12 hours to 18 hours, and in some embodiments, the time period is about 16 hours. The support is then filtered and washed with RNase free water. The filtrates are concentrated to dryness, such as by centrifugation and/or a speedvac concentrator. For RNA sequences, the residue is dissolved in a suitable solvent, such as DMSO, and treated with conditions suitable to remove the OH protecting group. In some embodiments, the OH protecting group is fluoride-labile, and the residue is treated with a fluoride reagent, such as triethylamine trihydrofluoride. The mixtures are heated, such as on a heat block, at a temperature suitable to facilitate OH deprotection, such as from 50 °C or less to 100 °C or more, from 55 °C to 75 °C or about 65 °C for a suitable time period, such as from 1 hour or less to 56 hours or more, from 2 hours to 4 hours or about 3 hours.
IV. Examples
Example 1
Preparation of the 3-hydroxypropylated CPG Support 3
Figure imgf000036_0001
3
To CPG (500 Å, 1.00 g, 1) placed in a 4-mL screw-capped glass vial was added a solution of 3-acetoxypropyltrimethoxysilane (890 mg, 4.00 mmol) in dry toluene (4 mL). The suspension was then shaken at 50 °C over a period of 24 hours. The 3-acetoxypropylated support 2 was filtered, washed with acetonitrile (10 mL), air-dried and transferred to a 7-mL screw-capped glass vial. Concentrated aqueous ammonia (4 mL) was added to the vial, which was immediately capped; the suspension was shaken at 55 °C over 2 hours. The 3-hydroxypropylated support 3 was filtered and successively washed with water (10 mL) and acetonitrile (10 mL), air-dried and then left under high vacuum for 1 hour at about 25 °C.
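The reagent quantities in Example 1 can be cross-checked with a short calculation. The sketch below is illustrative only: the molecular formula C8H18O5Si for 3-acetoxypropyltrimethoxysilane is derived here from its name, the atomic weights are rounded standard values, and the function names are hypothetical.

```python
# Rounded standard atomic weights in g/mol (general constants, not from the source).
ATOMIC_WEIGHT = {"C": 12.011, "H": 1.008, "O": 15.999, "Si": 28.086}

# 3-acetoxypropyltrimethoxysilane, CH3C(O)O(CH2)3Si(OCH3)3 = C8H18O5Si
# (formula derived from the reagent name; an assumption of this sketch).
SILANE = {"C": 8, "H": 18, "O": 5, "Si": 1}


def molar_mass(composition):
    """Return the molar mass in g/mol for an element -> count mapping."""
    return sum(ATOMIC_WEIGHT[element] * count for element, count in composition.items())


def mmol_from_mg(mass_mg, composition):
    """Convert a mass in mg to mmol (mg divided by g/mol gives mmol)."""
    return mass_mg / molar_mass(composition)


silane_mmol = mmol_from_mg(890.0, SILANE)
print(round(molar_mass(SILANE), 1))  # molar mass in g/mol
print(round(silane_mmol, 2))         # mmol of silane per 1.00 g of CPG
```

Under these assumptions, 890 mg corresponds to the stated 4.00 mmol, that is, about 4 mmol of silane per gram of CPG.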
Example 2
Conversion of the CPG support 3 to CPG supports 5 and 6
Figure imgf000036_0002
A 0.1 M solution of commercial phosphoramidite 4 in anhydrous CH3CN was employed for the phosphitylation of CPG support 3, which was performed via a standard 1 μmole scale solid-phase DNA synthesis protocol under conditions recommended by the manufacturer of the automated DNA/RNA synthesizer. The CPG support 5 was then exposed to an aqueous solution of iodine followed by a 1:1 (v/v) solution of Cap A (Ac2O/THF/pyridine) and Cap B (10% 1-methylimidazole in THF) to inactivate unreacted hydroxyls. The CPG support 6 was produced upon exposing 5 to a solution of 3% trichloroacetic acid (TCA) in CH2Cl2 to cleave the 4,4’-dimethoxytrityl (DMTr) group according to a standard automated DNA synthesis protocol. The released DMTr cation solution, obtained from an accurately weighed sample of support 5, was collected into a 10-mL volumetric flask and spectrophotometrically measured at 498 nm to reveal a functional hydroxyl concentration of 108 μmole OH/gram of CPG support 6.
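The DMTr cation assay described above reduces to a Beer-Lambert calculation. The following is a minimal sketch, not the procedure actually used: the molar absorptivity of 70,000 M^-1 cm^-1 near 498 nm is a commonly quoted literature value that the source does not state, and the absorbance, sample mass and dilution factor are hypothetical numbers chosen only to reproduce a loading of the same magnitude.

```python
def dmtr_loading_umol_per_g(absorbance, volume_ml, mass_mg,
                            epsilon=70000.0, path_cm=1.0, dilution=1.0):
    """Estimate support loading (umol/g) from the DMTr cation absorbance.

    epsilon is an ASSUMED molar absorptivity (M^-1 cm^-1) near 498 nm;
    it is not given in the source and should be replaced by the value
    valid for the acid/solvent system actually used.
    """
    # Beer-Lambert law: concentration of the undiluted solution in mol/L.
    conc_mol_per_l = absorbance * dilution / (epsilon * path_cm)
    # Amount of DMTr cation in the volumetric flask, in micromoles.
    amount_umol = conc_mol_per_l * (volume_ml / 1000.0) * 1e6
    # Normalize to grams of support.
    return amount_umol / (mass_mg / 1000.0)


# Hypothetical reading: 5.0 mg of support, 10 mL flask, tenfold dilution.
example_loading = dmtr_loading_umol_per_g(
    absorbance=0.378, volume_ml=10.0, mass_mg=5.0, dilution=10.0
)
print(round(example_loading, 1))
```

The dilution factor is included because a reading taken directly from a 10-mL flask at this loading would likely fall outside the linear range of most spectrophotometers.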
Example 3
General procedure for the automated preparation of CPG supports 7, 8 or 9
Repeat coupling reaction 4
Figure imgf000037_0001
The procedure comprises repeating all the steps described above at the same scale, under the same conditions, using CPG support 6 as the starting material.
Example 4
Typical procedure for the automated preparation of CPG supports 10, 11 or 12
C6-amino modifier phosphoramidite,1H-tetrazole,
7 n = 3 CH3CN, followed by steps 1 through 3 from Example 2
Figure imgf000038_0001
A 0.1 M solution of commercial 6-(4-monomethoxytritylamino)hexyl-(2-cyanoethyl)-(N,N-diisopropyl)-phosphoramidite in anhydrous CH3CN was employed for the phosphitylation of CPG support 7, 8 or 9, at the 1 μmole scale, according to a standard solid-phase DNA synthesis protocol. The CPG support 10, 11, or 12, was then treated with an aqueous solution of iodine followed by a 1:1 (v/v) Cap A:Cap B solution to inactivate unreacted hydroxyls. Cleavage of the 4-monomethoxytritylamino protecting group was performed manually, off the automated DNA/RNA synthesizer, using 3% TCA in CH2Cl2 over a period of 15 min at about 25 °C. Multiple batches of each CPG support were needed to generate enough material to initiate solid-phase synthesis of each DNA or RNA sequence at the 1 μmole scale on each support.
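For orientation, the mass of support that corresponds to a given synthesis scale follows directly from the leader nucleoside loading. The sketch below is a hedged illustration that uses the loadings reported in Example 5 for CPG supports 13, 14 and 15 carrying the dABz leader nucleoside; the function and variable names are hypothetical.

```python
def support_mass_mg(scale_umol, loading_umol_per_g):
    """Mass of support (mg) that carries scale_umol of leader nucleoside."""
    if loading_umol_per_g <= 0:
        raise ValueError("loading must be positive")
    return scale_umol / loading_umol_per_g * 1000.0


# Loadings in umol/g as reported in Example 5 (dABz leader nucleoside).
LOADINGS = {"13": 57.0, "14": 43.0, "15": 26.0}

# Mass of each support needed for a 1 umole synthesis column.
masses = {name: round(support_mass_mg(1.0, loading), 1)
          for name, loading in LOADINGS.items()}
print(masses)
```

Because the required mass scales inversely with loading, a lower-loaded support needs proportionally more material per column.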
Example 5
General procedure for the preparation of CPG supports 13, 14 or 15.
Figure imgf000039_0001
To a flame-dried 4-mL glass vial was added CPG support 10, 11 or 12 (50 mg) and a 5’-O-(4,4’-dimethoxytrityl)-2’-deoxythymidine-3’-O-succinate, N4-benzoyl-5’-O-(4,4’-dimethoxytrityl)-2’-deoxycytidine-3’-O-succinate, N6-benzoyl-5’-O-(4,4’-dimethoxytrityl)-2’-deoxyadenosine-3’-O-succinate, N2-isobutyryl-5’-O-(4,4’-dimethoxytrityl)-2’-deoxyguanosine-3’-O-succinate or 5’-O-(4,4’-dimethoxytrityl)-2’-O-tert-butyldimethylsilyl uridine-3’-O-succinate salt (30 mg) along with N,N’-dicyclohexylcarbodiimide (15 mg). The glass vial and its content were then subjected to high vacuum for 2 hours at about 25 °C. A solution of 10% dry pyridine in anhydrous DMF (200 μL) was added by syringe to the glass vial, which was immediately sealed with a teflon-lined screw cap and shaken at about 25 °C over a period of 24 hours. The suspension was filtered, washed with dry CH3CN (10 mL) and treated with 2 mL of a 1:1 (v/v) Cap A:Cap B solution to inactivate any unreacted amine functions. The CPG support was again filtered, washed with dry CH3CN (10 mL) and air-dried. An accurately weighed sample of support 13, 14 or 15 was mixed with a solution of 3% TCA in CH2Cl2 (10 mL), over 5 minutes at about 25 °C, to spectrophotometrically measure at 498 nm the concentration of the leader nucleoside (dABz) covalently linked to the support. DMTr cation measurements revealed a 5’-hydroxyl concentration of: 57 μmole/gram of CPG support 13; 43 μmole/gram of CPG support 14 or 26 μmole/gram of CPG support 15. When the leader nucleoside is dT, dCBz, dGiBu or U, DMTr cation measurements were: 51 μmole/gram, 49 μmole/gram, 50 μmole/gram or 49 μmole/gram of CPG support 14, respectively.
Example 6
Protocol for automated synthesis of DNA or RNA sequences on commercial LCAA-CPG and CPG supports 13, 14 and 15
13 n = 3
14 n = 5
15 n = 7
1. Cap A:Cap B, 1:1 (v/v)
2. 3% TCA in CH2Cl2
3. Standard solid-phase DNA synthesis
T
Figure imgf000040_0001
Automated syntheses of DNA and RNA sequences were performed on a DNA/RNA synthesizer, employing commercial long chain alkylamine controlled-pore glass supports (LCAA-CPG) or modified CPG supports 13, 14 and 15 pre-loaded with suitably protected leader deoxy- or ribonucleosides. Each solid support was accurately weighed, based on its leader nucleoside load, to provide one micromole of leader nucleoside per synthesis column. The synthesis of DNA or RNA sequences was conducted, side-by-side, on LCAA-CPG and CPG support 13 according to the standard (trityl-off) DNA or RNA protocol conditions recommended by the manufacturer of the DNA/RNA synthesizer. Side-by-side syntheses were carried out on the same day by the same operator, using the same batches of deoxy- or ribonucleoside phosphoramidites under identical conditions in terms of concentration, activation/coupling times and subsequent usage of reagents through all the steps of each synthesis cycle. This protocol was repeated under identical conditions for the side-by-side synthesis of DNA sequences on LCAA-CPG and CPG supports 14 and 15.
Example 7
Deprotection of the DNA or RNA sequences released from LCAA-CPG and CPG supports 16, 17 and 18
16 n = 3
17 n = 5
18 n = 7
Deprotection and release of the DNA or RNA sequences from the solid supports
Figure imgf000041_0001
5'-d(CTGAGTAGCGAACGTGAAGA) from CPG supports 16, 17 and 18
5’-d(ATAGTGTGCATCGATGCCAC)
5’-d(CTCTGTACCTTACGTCTTCG)
5'-d(TCTTGGTTACATGAAATCCT) each from CPG support 17
5'-r(UCUUGGUUACAUGAAAUCCU)
+ shorter than full length DNA or RNA sequences
Upon completion of the automated solid-phase synthesis of DNA or RNA sequences, the synthesis columns containing the DNA sequences linked to LCAA-CPG support and CPG supports 16, 17 or 18 were taken off the DNA/RNA synthesizer, and each DNA sequence of each CPG support was manually released by passing concentrated ammonium hydroxide (1 mL) through the synthesis column over a period of 15 minutes while collecting the eluate in a 4-mL screw cap glass vial. Each glass vial was then capped and heated at 55 °C for 18 hours on a heat block to ensure complete deprotection. The ammoniacal solution of each DNA sequence was then concentrated to about 50% of its original volume using a stream of air to remove most of the ammonia from each solution.
The RNA sequence linked to LCAA-CPG and CPG support 17 was manually released upon suspending each support in 1 mL of an ethanolic solution of concentrated ammonium hydroxide [EtOH/NH4OH (1:3 v/v)] kept in capped 4-mL screw cap glass vials over 16 hours at about 25 °C. The support of each vial was then filtered and washed with RNase free water (0.5 mL) twice, and the filtrates were placed in 1.5 mL polypropylene microcentrifuge tubes and concentrated to dryness using a speedvac concentrator. Each RNA sequence was dissolved in DMSO (100 μL) to which was added triethylamine trihydrofluoride (125 μL). The solutions were heated on a heat block at 65 °C for 3 hours. Each deprotected RNA sequence solution was cooled to room temperature, diluted with 775 μL of RNase free water, and desalted through a PD-10 column. Each desalted RNA solution was immediately analyzed by RP-HPLC as described below.
The identity of all nucleic acid sequences released from the CPG support 17 was verified by mass spectrometry.
5’-d(CTGAGTAGCGAACGTGAAGA) (SEQ ID NO: 1): MALDI: m/z calcd for C197H244N85O114P19: 6215 [M+H]+; found: 6212.
5’-d(TCTTGGTTACATGAAATCCT) (SEQ ID NO: 2): MALDI: m/z calcd for C196H249N68O121P19: 6082 [M+H]+; found: 6077.
5’-d(CTCTGTACCTTACGTCTTCG) (SEQ ID NO: 4): MALDI: m/z calcd for C193H249N62O124P19: 6010 [M+H]+; found: 6014.
5’-d(ATAGTGTGCATCGATGCCAC) (SEQ ID NO: 5): MALDI: m/z calcd for C195H246N75O118P19: 6117 [M+H]+; found: 6118.
5’-r(UCUUGGUUACAUGAAAUCCU) (SEQ ID NO: 3): MALDI: m/z calcd for C188H233N68O141P19: 6290 [M+H]+; found: 6282.
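The agreement between calculated and observed masses listed above can be summarized numerically. The following sketch simply tabulates the difference (found minus calculated) and the relative deviation for the five [M+H]+ values reported; the variable and function names are illustrative, and no acceptance threshold is implied by the source.

```python
# (sequence identifier, calculated m/z, found m/z) as listed in Example 7.
MALDI_RESULTS = [
    ("SEQ ID NO: 1", 6215, 6212),
    ("SEQ ID NO: 2", 6082, 6077),
    ("SEQ ID NO: 4", 6010, 6014),
    ("SEQ ID NO: 5", 6117, 6118),
    ("SEQ ID NO: 3", 6290, 6282),
]


def mass_deviation(calcd, found):
    """Return (difference in m/z units, relative deviation in percent)."""
    delta = found - calcd
    return delta, 100.0 * delta / calcd


deviations = {name: mass_deviation(calcd, found)
              for name, calcd, found in MALDI_RESULTS}

for name, (delta, percent) in deviations.items():
    print(f"{name}: delta = {delta:+d}, relative = {percent:+.3f}%")
```

All five observed values lie within 8 m/z units of the calculated values, the largest deviation being that of the RNA sequence.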
Example 8
Comparative RP-HPLC analyses of unpurified DNA or RNA sequences released from LCAA-CPG and CPG supports 16, 17 and 18
All analyses were performed using an Agilent Technologies 1260 Infinity II HPLC system equipped with a diode array detector for spectral analysis. The OpenLAB CDS ChemStation software provides peak integration capabilities needed for comparative analyses. Optimally, 0.2 OD260 unit of fully deprotected and unpurified DNA or RNA sequences released from the above CPG supports were each analyzed using an Agilent ion-pair reversed-phase AdvanceBio Oligonucleotide column under the following chromatographic conditions: from 0.1 M triethylammonium acetate (pH 7.0), a linear gradient of 0.66% CH3CN/min is pumped at a flow rate of 0.8 mL/min for 30 minutes. Chromatographic peak areas were measured using the OpenLAB CDS ChemStation software by perpendicularly extending the start and end of DNA or RNA peak elution points to base line.
Example 9
Results and Discussion
In this study, a controlled-pore glass support functionalized with multiple hexaethylene glycol spacers was designed, implemented and demonstrated to reduce the level of process-related impurities in synthetic DNA and RNA sequences when compared to that achieved using commercial long-chain alkylamine controlled-pore glass supports (also see Grajkowski et al., Bioorg. Med. Chem. 28: 115779, 2020, herein incorporated by reference in its entirety).
A first attempt to reduce the level of process-related impurities in synthetic DNA and RNA sequences used a CPG 500 support functionalized with one hexaethylene glycol spacer under typical solid-phase synthesis conditions. A DNA sequence (20-mer) was produced in a yield not better than that obtained (86%) when employing the standard commercial LCAA-CPG support. Hexaethylene glycol has about the same number of carbon-carbon (C-C) bond lengths (about 18) as the long chain alkylamine spacer of LCAA-CPG.
It was hypothesized that a CPG support functionalized with either a spacer much larger in length than the alkylamine spacer of LCAA-CPG or multiple hexaethylene glycol spacers would improve access of activated phosphoramidites and required reagents to the leader nucleoside for efficient initiation of solid-phase DNA or RNA synthesis. Therefore, the CPG support 6 (see Example 2) was functionalized with two, four and six additional hexaethylene glycol spacers to provide the CPG supports 13, 14 and 15, from which solid-phase syntheses of DNA and RNA sequences were conducted to provide supports 16, 17, and 18, respectively. The quality of the nucleic acid sequences obtained from these CPG supports was assessed by HPLC and compared with that obtained from the same sequences made from the commercial LCAA-CPG support. FIG. 1 provides expanded HPLC profiles of unpurified 5’-d(CTGAGTAGCGAACGTGAAGA) (SEQ ID NO: 1), which was released from commercial LCAA-CPG (red profile) or CPG support 16 (blue profile) after complete deprotection. Peak heights of each profile were normalized to the highest peak, which was then set to 0.15 absorbance unit (AU) at 254 nm. FIG. 1 clearly illustrates that the shoulder on the right side of the red profile peak at retention time (rt: 20.3 min) was significantly reduced in the blue profile corresponding to the sequence produced using the disclosed solid support structure (rt: 20.5 min), whereas the red profile peak at 21.7 minutes, which corresponds to the sequence made using the commercial LCAA-CPG support, was essentially absent in the blue profile.
FIG. 2 provides the expanded HPLC profiles of unpurified SEQ ID NO: 1, which was released from commercial LCAA-CPG (red profile) or CPG support 17 (blue profile) after complete deprotection. Peak heights of each profile were normalized to the highest peak, which was then set to 0.15 absorbance unit (AU) at 254 nm. FIG. 2 demonstrates that the product released from CPG support 17 was of higher quality than that obtained from the commercial LCAA-CPG support. The presence of the shoulder on the right side of the peak shown at rt: 20.4 min and of the peak at 21.8 min in the red profile was considerably reduced in the profile corresponding to CPG support 17. Another notable difference between the profiles provided by FIGS. 1 and 2 was the absence of the relatively large shoulder on the right side of the LCAA-CPG main peak at rt: 21.2 minutes. Furthermore, the shape of the main peak observed in the blue profile of FIG. 2 was much slimmer than that of the red LCAA-CPG profile. Without being bound to a particular theory, this indicated a substantial reduction of process-related impurities superimposed on the main product peak.
The results obtained by using CPG support 14 prompted an investigation as to whether better results could be obtained using CPG support 15 for solid-phase synthesis of SEQ ID NO: 1. FIG. 3 provides the extended HPLC profiles of unpurified SEQ ID NO: 1 that were released from commercial LCAA-CPG (red) or disclosed CPG support 18 (blue) after complete deprotection. FIG. 4 provides the extended HPLC profiles of unpurified SEQ ID NO: 1 that were released from disclosed CPG support 17 (blue) or support 18 (black) after complete deprotection. For both FIGS. 3 and 4, peak heights of each profile were normalized to the highest peak, which was then set to 0.15 absorbance unit (AU) at 254 nm. FIGS. 3 and 4 show that the release of SEQ ID NO: 1 from CPG support 18 was highly comparable to that obtained from support 17 based on side-by-side comparison of their chromatographic profiles.
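The peak-height normalization applied to the profiles in FIGS. 1-4 amounts to rescaling each trace so that its tallest point equals 0.15 AU. A minimal sketch of that rescaling is given below; the trace values are hypothetical, the function name is illustrative, and the actual processing in the chromatography software may differ.

```python
def normalize_profile(absorbances, target_max=0.15):
    """Rescale a chromatographic trace so its highest point equals target_max AU."""
    peak = max(absorbances)
    if peak <= 0:
        raise ValueError("trace has no positive peak to normalize against")
    scale = target_max / peak
    return [value * scale for value in absorbances]


# Hypothetical absorbance readings (AU at 254 nm) along one trace.
trace = [0.00, 0.02, 0.30, 0.05, 0.01]
normalized = normalize_profile(trace)
print(max(normalized))
```

Rescaling by a single factor preserves the relative heights of the main peak and impurity peaks within a profile, which is what allows two normalized profiles to be overlaid for visual comparison.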
Results of the above experiments demonstrated that the disclosed CPG support provided a significant reduction in the formation of process-related impurities, compared to the commercial LCAA-CPG support, during solid-phase synthesis of nucleic acid sequences.
The solid-phase syntheses of one RNA sequence (SEQ ID NO: 3) and three additional DNA sequences (SEQ ID NOs: 2, 4 and 5) were therefore conducted on LCAA-CPG and CPG support 14 to demonstrate that the minimization of process-related impurities was not limited to one particular nucleic acid sequence. FIG. 5 provides expanded HPLC profiles of unpurified SEQ ID NO: 3, which was released from commercial LCAA-CPG (red profile) or CPG support 17 (blue profile) after complete deprotection. FIGS. 6, 7 and 8 provide the expanded HPLC profiles for SEQ ID NOs: 2, 4 and 5, respectively, released from the disclosed CPG support 17 (blue) and LCAA-CPG (red), after complete deprotection. In each case, peak heights of each profile were normalized to the highest peak, which was then set to 0.15 absorbance unit (AU) at 254 nm. FIG. 9 provides stacked HPLC profiles for SEQ ID NO: 2 produced using an LCAA-CPG support and CPG support 17 as disclosed herein, illustrating the approximate 50% reduction in impurities in the product made using CPG support 17, compared to the product made using LCAA-CPG.
Table 1. Minimization of process-related impurities in synthetic nucleic acid sequences^a
Figure imgf000045_0001
a mPA, main peak area; PA-pRI, peak area of process-related impurities; Mi-pRI, relative minimization of process-related impurities resulting from the use of CPG support 14 and calculated according to the following equation: % Mi-pRI = [1 - (% PA-pRI_CPG-14 ÷ % PA-pRI_LCAA-CPG)] × 100
b percent of total DNA- or RNA-related peak areas.
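The Mi-pRI equation in the footnote to Table 1 can be expressed as a one-line function. The sketch below is illustrative only: the input percentages are hypothetical and are not taken from Table 1, which is available here only as a figure, and the function name is an assumption.

```python
def mi_pri_percent(pa_pri_cpg14, pa_pri_lcaa_cpg):
    """Relative minimization of process-related impurities, in percent.

    Both arguments are peak areas of process-related impurities expressed
    as a percentage of the total DNA- or RNA-related peak area.
    """
    if pa_pri_lcaa_cpg <= 0:
        raise ValueError("reference impurity peak area must be positive")
    return (1.0 - pa_pri_cpg14 / pa_pri_lcaa_cpg) * 100.0


# Hypothetical inputs: 8% impurities with CPG support 14 versus 16% with LCAA-CPG.
example_mi_pri = mi_pri_percent(8.0, 16.0)
print(example_mi_pri)
```

A value of zero means the two supports gave the same impurity fraction, and a negative value would indicate more impurities with CPG support 14 than with LCAA-CPG.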
As shown in Table 1, synthesizing nucleic acid sequences using embodiments of the disclosed solid support structures results in significant reductions in process-related impurities. In contrast, using a similar solid support structure having only a single hexaethylene glycol spacer, i.e., n = 1 in the solid phase supports from Examples 3-7, resulted in substantially less pure nucleic acid sequences than those made using solid phase supports where n was 3 or more (FIG. 11).
Figure imgf000046_0001
Table 2 provides comparative data illustrating the purity of sequences made using supports where n = 1, 3, 5, 7, and 10. FIG. 10 provides a comparison of the HPLC profiles for sequences according to SEQ ID NO: 2 produced by CPG supports where n = 5 and n = 10. FIG. 10 demonstrates that longer support structures, such as n = 10, provide substantially the same purity benefits as support structures where n = 5 or 7. However, the loading on the CPG decreases as the length increases, possibly due to issues resulting from synthesizing such long chain supports. In some embodiments, the loading when n = 10 is about 5-10 μmol/g, compared to about 50 μmol/g when n = 5.
Table 2. Purity of DNA sequences made using different length solid phase supports as a percentage of total peak area
Figure imgf000046_0002
CONCLUSIONS
As demonstrated herein, modification of a CPG support with the addition of multiple hexaethylene glycol spacers led to the synthesis of DNA and RNA sequences of significantly greater purity than that obtained from the current, state-of-the-art, LCAA-CPG support. The CPG support 14 is composed of five hexaethylene glycol spacers and was found to be as efficient as the CPG support 15, carrying seven hexaethylene glycol spacers, for minimizing process-related impurities in synthetic DNA sequences. A reduction of process-related impurities of up to 53% in synthetic nucleic acid sequences was achieved when using 14, instead of LCAA-CPG, for the solid-phase synthesis of nucleic acid sequences. Although the reduction of process-related impurities may vary depending on the composition of the nucleic acid sequence, a lower content of residual process-related impurities facilitates the removal of those impurities from the full-length nucleic acid sequences, to ultimately provide nucleic acid-based drugs of exquisite purity for safer and more efficacious therapies for human diseases.
Example 10
Typical procedure for the automated preparation of a CPG support comprising a universal linker
Universal linker phosphoramidite, 1H-tetrazole, CH3CN followed by steps 1 through 3 from Example 2
7 n = 3
Figure imgf000047_0001
The universal support is produced using a universal linker phosphoramidite, such as the universal linker phosphoramidite shown below, according to the method illustrated in Example 4. Unreacted hydroxyl moieties are inactivated using a 1:1 (v/v) Cap A:Cap B solution as described in Example 4. And standard DNA or RNA synthesis proceeds as described in Examples 6 and 7.
Figure imgf000048_0001
Exemplary universal linker phosphoramidite,
In view of the many possible embodiments to which the principles of the disclosed technology may be applied, it should be recognized that the illustrated embodiments are only examples of the technology and should not be taken as limiting the scope of the disclosure. Rather, the scope of the disclosure is defined by the following claims. We therefore claim as our invention all that comes within the scope and spirit of these claims.

Claims

We claim:
1. A solid support according to Formula I
Figure imgf000049_0001
Formula I wherein:
CPG is controlled pore glass;
m is 2, 3, 4, 5 or 6;
x is 1, 2, 3, 4 or 5;
y is from 2 to 12;
n is 3, 4, 5, 6, 7, 8, 9 or 10;
each R1 independently is C1-6alkyl, -(CH2)1-6CN, -(CH2)1-6OR’ or a thermolytic phosphate protecting group;
R’ is aliphatic, aryl, or aralkyl;
Figure imgf000049_0002
R4 is H or OR6;
R5 is PG or a nucleic acid sequence;
R6 is pixyl, TBDMS, TBDPS, TMS, TES, or TIPS;
t is 1, 2, 3, or 4;
Bp is a nucleic acid base where the exocyclic amine, if present, is protected; and
PG is a protecting group.
2. The solid support of claim 1, wherein PG is 4,4’-dimethoxytrityl (DMTr).
3. The solid support of claim 1 or claim 2, wherein m is 2, 3 or 4.
4. The solid support of any one of claims 1-3, wherein the solid support has a Formula IV
Figure imgf000050_0001
Formula IV.
5. The solid support of any one of claims 1-4, wherein x is 1, 2 or 3.
6. The solid support of any one of claims 1-5, wherein the solid support has a Formula II
Figure imgf000050_0002
Formula II.
7. The solid support of any one of claims 1-6, wherein y is from 3 to 10.
8. The solid support of any one of claims 1-7, wherein the solid support has a Formula III
Figure imgf000050_0003
Formula III.
9. The solid support of any one of claims 1-8, wherein n is from 3 to 7.
10. The solid support of any one of claims 1-9, wherein n is 5.
11. The solid support of any one of claims 1-10, wherein m, x, y and n are selected to produce a support backbone length from the silicon atom to the R2 moiety of from 50 atoms to 400 atoms.
12. The solid support of claim 11, wherein the support backbone length is from 100 atoms to 150 atoms.
13. The solid support of any one of claims 1-12, wherein the thermolabile phosphate protecting group has a structure
Figure imgf000051_0001
wherein:
X is O or S;
R7 is H, Ra, ORa, SRa, or N(Rb)2;
Ra is Rd;
Rb is H, Rd or two Rbs together with the nitrogen to which they are attached, form a 3- to 7-membered heterocyclyl;
Z is O, S, N(Rc), C(Rc)2 or C(Rc)2C(Rc)2;
each Rc independently is H or Rd, or one Rd in combination with the C=X moiety and one Ra or Rb from R7 together form a 3- to 7-membered cycloaliphatic or heterocyclyl ring;
Rd is alkyl, alkenyl, alkynyl, cycloalkyl, aryl, or aralkyl;
each R8 independently is H or Rd, or one R8 together with Z forms an aryl ring;
each R9 independently is H or Rd, or one R9 and one R8 together with the atoms to which they are attached, forms a moiety having a formula
Figure imgf000052_0001
wherein r is 0 to 6; and each R10 independently is H, C1-6alkyl, NO2, -N(C1-6alkyl)2, -OC1-6alkyl, -SC1-6alkyl, -CN, or halogen, provided that the aromatic ring substituted with R10 is one carbon removed from the phosphate oxygen of Formula I.
14. The solid support of any one of claims 1-13, wherein the thermolabile phosphate protecting group is selected from:
Figure imgf000052_0002
15. The solid support of any one of claims 1-12, wherein each R1 independently is C1-4alkyl or -(CH2)1-4CN.
16. The solid support of claim 15, wherein each R1 is -CH2CH2CN.
17. The solid support of any one of claims 1-16, wherein R2 is H.
18. The solid support of any one of claims 1-16, wherein R2 is
Figure imgf000053_0002
20. The solid support of claim 19, wherein p is 6.
21. The solid support of claim 19 or claim 20, wherein R3 is H.
22. The solid support of claim 19 or claim 20, wherein R3 is
Figure imgf000053_0001
23. The solid support of claim 22, wherein Bp is exocyclic amine-protected adenine, exocyclic amine-protected cytosine, exocyclic amine-protected guanine, thymine, uracil, hypoxanthine, xanthine, exocyclic amine-protected 7-methylguanine, 5,6-dihydrouracil, exocyclic amine-protected 5-methylcytosine, or exocyclic amine-protected 5-hydroxymethylcytosine.
24. The solid support of any one of claims 19-23, wherein R4 is H.
25. The solid support of claim 24, wherein Bp is exocyclic amine-protected adenine, exocyclic amine-protected cytosine, exocyclic amine-protected guanine, or thymine.
26. The solid support of any one of claims 19-23, wherein R4 is OR6.
27. The solid support of claim 26, wherein Bp is exocyclic amine-protected adenine, exocyclic amine-protected cytosine, exocyclic amine-protected guanine, or uracil.
28. The solid support of any one of claims 19-27, wherein:
Bp is adenine, cytosine, or guanine, where the exocyclic amine is protected by a benzoyl (Bz), isobutyryl (iBu), phenoxyacetyl (Pac), phenylsulfonylethoxycarbonyl, p-nitrophenyloxycarbonyl, allyloxycarbonyl, or levulinyl group; or
Bp is thymine or uracil.
29. The solid support of any one of claims 1-28, wherein the solid support has a Formula V
Figure imgf000054_0001
30. The solid support of any one of claims 1-29, wherein the solid support has a Formula VII
Figure imgf000054_0002
Formula VII.
31. The solid support of any one of claims 1-29, wherein the solid support has a Formula XI or XIII
Figure imgf000055_0001
Formula XIII.
32. The solid support of any one of claims 1-31, wherein R6 is TBDMS.
33. The solid support of any one of claims 1-32, wherein t is 2.
34. The solid support of any one of claims 1-33, wherein R5 is PG.
35. The solid support of any one of claims 1-33, wherein R5 is a nucleic acid sequence.
36. The solid support of claim 35, wherein the nucleic acid sequence comprises one or more DNA sequences.
37. The solid support of claim 36, wherein the one or more DNA sequences comprise one or more antisense DNA sequences.
38. The solid support of claim 35, wherein the nucleic acid sequence comprises one or more RNA sequences.
39. The solid support of claim 38, wherein the one or more RNA sequences comprise one or more antisense RNA sequences, one or more microRNA (miRNA) sequences, one or more small interfering RNA (siRNA) sequences, one or more repeat-associated small interfering RNA (rasiRNA) sequences, or combinations thereof.
40. The solid support of claim 1, selected from:
Figure imgf000056_0001
Figure imgf000057_0001
Figure imgf000058_0001
Figure imgf000059_0001
Figure imgf000060_0001
wherein Bp is an exocyclic amine-protected adenine, an exocyclic amine-protected cytosine, an exocyclic amine-protected guanine, thymine or uracil.
41. The solid support of any one of claims 1-40, wherein Bp comprises a protecting group selected from benzoyl, isobutyryl, or phenoxyacetyl.
42. The solid support of claim 41, wherein Bp is selected from N6-benzoyl adenine (ABz), N4-benzoyl cytosine (CBz), N2-isobutyryl guanine
Figure imgf000060_0002
thymine (T), N6-phenoxyacetyl adenine (APac), N4-phenoxyacetyl cytosine (CPac), N2-phenoxyacetyl guanine (GPac), or uracil (U).
43. The solid support of any one of claims 1-42, wherein a loading of the support on the CPG is from 5 μmol/g to 50 μmol/g.
44. A method for synthesizing a nucleic acid sequence, comprising: loading a solid support according to any one of claims 1-43 into a DNA/RNA synthesizer; and operating the synthesizer to produce a desired nucleic acid sequence.
45. The method of claim 44, wherein the solid support is a solid support according to claim 34.
46. The method of claim 45, wherein R5 is DMTr.
47. A universal linker phosphoramidite having a structure
Figure imgf000061_0001
48. A kit, comprising the solid support of any one of claims 1-43.
49. The kit of claim 48, further comprising the phosphoramidite of claim 47.
50. The kit of claim 48, wherein the kit comprises a solid support of claim 40.
51. The kit of claim 50, wherein the kit comprises a solid support selected from:
Figure imgf000061_0002
Figure imgf000062_0001
52. The kit of claim 48, wherein the kit comprises a solid support selected from:
Figure imgf000062_0002
Figure imgf000063_0001
wherein Bp is an exocyclic amine-protected adenine, an exocyclic amine-protected cytosine, an exocyclic amine-protected guanine, thymine or uracil.
53. The kit of any one of claims 48-52, further comprising a protected 2'-deoxynucleoside, ribonucleoside, and/or chemically modified nucleoside wherein an exocyclic amine on the deoxynucleoside, ribonucleoside or chemically modified nucleoside, if present, also is protected.
54. The kit of claim 53, wherein the 2'-deoxynucleoside is DMTrdABz, DMTrdCBz, DMTrdGiBu, or DMTrT, or the ribonucleoside is DMTrAPac-2’-OTBDMS, DMTrCPac-2’-OTBDMS, DMTrGPac-2’-OTBDMS, or DMTrU-2’-OTBDMS.
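By way of illustration only, the numeric ranges recited in claim 1 and the loading recited in claim 43 can be captured in a short sketch. It assumes that m, x, y, n and t are integers and reads the loading range as 5 to 50 μmol/g. The class name, function name, and example values (other than n = 5 from claim 10 and t = 2 from claim 33) are hypothetical and do not appear in the disclosure.

```python
from dataclasses import dataclass
from typing import List

# Inclusive ranges recited in claim 1 (integer values are an assumption for y).
CLAIM_1_RANGES = {"m": (2, 6), "x": (1, 5), "y": (2, 12), "n": (3, 10), "t": (1, 4)}

# Loading range read from claim 43, assumed to be in micromoles per gram.
LOADING_RANGE_UMOL_PER_G = (5.0, 50.0)


@dataclass(frozen=True)
class FormulaIParameters:
    """Illustrative container for the integer parameters of Formula I."""
    m: int
    x: int
    y: int
    n: int
    t: int = 2  # claim 33 recites t = 2

    def violations(self) -> List[str]:
        """List every parameter that falls outside the range recited in claim 1."""
        problems = []
        for name, (low, high) in CLAIM_1_RANGES.items():
            value = getattr(self, name)
            if not low <= value <= high:
                problems.append(f"{name}={value} is outside {low}-{high}")
        return problems


def support_mass_mg(scale_umol: float, loading_umol_per_g: float) -> float:
    """Mass of support (mg) that carries `scale_umol` of sites at a given loading."""
    low, high = LOADING_RANGE_UMOL_PER_G
    if not low <= loading_umol_per_g <= high:
        raise ValueError(f"loading must be between {low} and {high} umol/g")
    if scale_umol <= 0:
        raise ValueError("synthesis scale must be positive")
    return scale_umol / loading_umol_per_g * 1000.0


# Hypothetical parameter set; m, x and y are arbitrary in-range values.
params = FormulaIParameters(m=3, x=1, y=6, n=5)
print(params.violations())
print(support_mass_mg(scale_umol=1.0, loading_umol_per_g=50.0))
```

Under these assumptions, a 1 μmol synthesis would call for roughly 20 mg of support at the upper loading limit and roughly 200 mg at the lower limit, since the required mass is the scale divided by the loading.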
PCT/US2021/039403 2020-06-30 2021-06-28 Solid support for synthesizing nucleic acid sequences and methods for making and using Ceased WO2022005988A2 (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
US18/003,404 US11987599B2 (en) 2020-06-30 2021-06-28 Solid support for synthesizing nucleic acid sequences and methods for making and using

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
US202063046413P 2020-06-30 2020-06-30
US63/046,413 2020-06-30

Publications (2)

Publication Number Publication Date
WO2022005988A2 (en) 2022-01-06
WO2022005988A3 (en) 2022-02-10

Family

ID=77022308

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/US2021/039403 Ceased WO2022005988A2 (en) 2020-06-30 2021-06-28 Solid support for synthesizing nucleic acid sequences and methods for making and using

Country Status (2)

Country Link
US (1) US11987599B2 (en)
WO (1) WO2022005988A2 (en)

Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US5866336A (en) 1996-07-16 1999-02-02 Oncor, Inc. Nucleic acid amplification oligonucleotides with molecular energy transfer labels and methods based thereon
US6762298B2 (en) 1999-03-24 2004-07-13 The United States Of America As Represented By The Department Of Health And Human Services Thermolabile phosphorus protecting groups, associated intermediates and methods of use
US7355037B2 (en) 2001-12-03 2008-04-08 The United States Of America As Represented By The Department Of Health And Human Services Thermolabile hydroxyl protecting groups and methods of use
US7612197B2 (en) 2003-05-09 2009-11-03 The United States of America as repesented by the Secretary of the Department of Health and Human Services Thermolabile hydroxyl protecting groups and methods of use

Family Cites Families (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
AU2021227185A1 (en) * 2020-02-24 2022-08-18 Integrated Dna Technologies, Inc. Oligonucleotide synthesis on solid support

Non-Patent Citations (3)

* Cited by examiner, † Cited by third party
Title
"Greene's Protective Groups in Organic Synthesis", 10 April 2006, JOHN WILEY AND SONS, INC
BEAUCAGE, S. L.; IYER, R. L.: "Advances in the Synthesis of Oligonucleotides by the Phosphoramidite Approach", TETRAHEDRON, vol. 48, no. 12, 1992, pages 2223 - 2311, XP000915225, DOI: 10.1016/S0040-4020(01)88752-4
GRAJKOWSKI ET AL., BIOORG. MED. CHEM., vol. 28, 2020, pages 115779

Cited By (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2024114776A1 (en) 2022-12-02 2024-06-06 上海舶望制药有限公司 Bicyclic abasic nucleic acid analogs and oligomeric compounds prepared therefrom
EP4628583A1 (en) 2022-12-02 2025-10-08 Shanghai Argo Biopharmaceutical Co., Ltd. Bicyclic abasic nucleic acid analogs and oligomeric compounds prepared therefrom

Also Published As

Publication number Publication date
US11987599B2 (en) 2024-05-21
US20230271998A1 (en) 2023-08-31
WO2022005988A3 (en) 2022-02-10

Legal Events

Date Code Title Description
121 Ep: the EPO has been informed by WIPO that EP was designated in this application. Ref document number: 21745610; Country of ref document: EP; Kind code of ref document: A2
NENP Non-entry into the national phase. Ref country code: DE
122 Ep: PCT application non-entry in European phase. Ref document number: 21745610; Country of ref document: EP; Kind code of ref document: A2