WO2025178954A1 - Procédés de séquençage et de comptage d'acides nucléiques - Google Patents

Procédés de séquençage et de comptage d'acides nucléiques

Info

Publication number
WO2025178954A1
WO2025178954A1 PCT/US2025/016475 US2025016475W WO2025178954A1 WO 2025178954 A1 WO2025178954 A1 WO 2025178954A1 US 2025016475 W US2025016475 W US 2025016475W WO 2025178954 A1 WO2025178954 A1 WO 2025178954A1
Authority
WO
WIPO (PCT)
Prior art keywords
sequence
sample
random
reads
sequencing
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
PCT/US2025/016475
Other languages
English (en)
Inventor
Robert MELTZER
Christopher D'amato
Yi XUE
Trinity SMITHERS
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Illumina Inc
Original Assignee
Illumina Inc
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Illumina Inc filed Critical Illumina Inc
Publication of WO2025178954A1 publication Critical patent/WO2025178954A1/fr
Pending legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Classifications

    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12QMEASURING OR TESTING PROCESSES INVOLVING ENZYMES, NUCLEIC ACIDS OR MICROORGANISMS; COMPOSITIONS OR TEST PAPERS THEREFOR; PROCESSES OF PREPARING SUCH COMPOSITIONS; CONDITION-RESPONSIVE CONTROL IN MICROBIOLOGICAL OR ENZYMOLOGICAL PROCESSES
    • C12Q1/00Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions
    • C12Q1/68Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions involving nucleic acids
    • C12Q1/6806Preparing nucleic acids for analysis, e.g. for polymerase chain reaction [PCR] assay
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12NMICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
    • C12N15/00Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
    • C12N15/09Recombinant DNA-technology
    • C12N15/10Processes for the isolation, preparation or purification of DNA or RNA
    • C12N15/1034Isolating an individual clone by screening libraries
    • C12N15/1065Preparation or screening of tagged libraries, e.g. tagged microorganisms by STM-mutagenesis, tagged polynucleotides, gene tags
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12QMEASURING OR TESTING PROCESSES INVOLVING ENZYMES, NUCLEIC ACIDS OR MICROORGANISMS; COMPOSITIONS OR TEST PAPERS THEREFOR; PROCESSES OF PREPARING SUCH COMPOSITIONS; CONDITION-RESPONSIVE CONTROL IN MICROBIOLOGICAL OR ENZYMOLOGICAL PROCESSES
    • C12Q1/00Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions
    • C12Q1/68Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions involving nucleic acids
    • C12Q1/6869Methods for sequencing

Definitions

  • the present invention provides methods for nucleic acid sequencing in which the number of sequence reads is representative of the number of molecules that were present in a sample.
  • nucleic acids in a sample are copied in a manner such that each copy has a segment of bases that uniquely identifies the original molecule. This can be done by capturing the target molecules with primer oligonucleotides that each have a random sequence at the 3' end. The primer oligonucleotides anneal to the target molecules at random locations, due the random 3' ends. When the primer oligonucleotides are extended to copy the target molecules, the 5' ends of the resultant copies will include random stretches of bases.
  • the random start sites provide each of the copies of the nucleic acid molecules with a unique sequence.
  • the unique sequence in each of the copies is copied (directly or as, e.g., a second-strand copy) from a segment of a transcript transcribed by an organism.
  • aspects of the disclosure provide methods of preparing a sample for sequencing and counting molecules therein.
  • the methods include providing a sample comprising a plurality of sample polynucleotides and generating a plurality of copies of the sample polynucleotides.
  • a copy of the plurality of copies comprises (i) a sample sequence copied from a sample polynucleotide of the plurality of sample polynucleotides, and (ii) a tag sequence copied from the sample polynucleotide, the tag sequence distinguishing said sample polynucleotide from other sample polynucleotides from the sample.
  • Dual IMI embodiments may include fragmenting the plurality of sample polynucleotides at a plurality of random break sites. Each copy may include the tag sequence copied from the sample polynucleotide at a 5' end and a second tag sequence at a 3' end, the second tag sequence copied from the sample polynucleotide from a segment adjacent one break site of the plurality of random break sites.
  • FIG. 2 shows compositions used in methods of the invention.
  • FIG. 3 shows melting temperatures over [Mg++] for randomer capture sequences.
  • the invention provides methods and compositions that are useful for nucleic acid sequencing methods.
  • claimed methods are useful with particle-templated instant partitions (PIPs) when sequencing target nucleic acids (PIPseq).
  • PIPs particle-templated instant partitions
  • Methods disclosed herein are optimized for preserved, fixed, or otherwise challenging samples such as formalin-fixed, paraffin embedded (FFPE) tissue samples.
  • Methods and compositions of the invention are quantitative in that they are useful to determine identities and quantities of individual molecules present in samples.
  • methods herein are useful for transcriptomics, to measure and show what quantities of different RNA transcripts are being expressed in samples.
  • aqueous partitions such as PIPs, methods may be performed for individual cells and even for multiple individual cells that are liberated from, or interrogated within, a single sample.
  • a count of reads that map to each transcript provides a count of how many transcripts from each gene are in a sample (e.g., single cell) and those counts can serve as expression levels, or an expression profile, or a transcriptome of the sample.
  • An IMI may be understood to be a transcript-informed sequence that is also used for transcript capture using a randomer capture sequence or that is copied from a random fragmentation position. With random priming (or random fragmentation) the identifying information at the end(s) of each molecule is generated by the transcript sequence. That identifying information is a function of gene identity and the position at which each transcript was annealed to the capture moiety. Randomer capture with IMI analysis has multiple potential uses including, for example, analysis of non-polyadenylated RNA; bacterial transcriptomics; viral gene transcription; analysis of non-coding RNA (IncRNA, etc.); analysis on sequencing platforms that only support single-end reads; and sequencing on other platforms such as Ultima sequencing.
  • Methods may use an IMI or may use dual-IMIs.
  • dual IMI embodiments each molecule has randomized initiation positions at both 3’ and 5’ ends.
  • Dual IMI analysis may have particular advantages in identification of very short transcript fragments.
  • a specific application of dual-IMI approaches may include recovery of severely fragmented RNA from clinical formaldehyde fixed samples or forensic or archeological samples.
  • Preferred embodiments of the disclosure subject the target nucleic acid to fragmentation, even to fragmentation conditions and chemistries that would be recognized as too promiscuous for conventional methods, causing too many random breaks, yielding many short fragments that terminate at essentially random points of breakage.
  • the 3' random ends of the capture oligos anneal to those fragments. After a capture oligo anneals to a fragment, it is extended by a polymerase or reverse transcriptase — copying the fragment until the enzyme reaches the end, the random point of breakage, of the fragment.
  • the enzyme generates a copy of the fragment for which the 5' end starts with the capture oligo and the 3' includes a copy of the broken end of the fragment (plus any additional sequence that may be added 3' of that copy).
  • Some embodiments use template-switching oligos.
  • Some embodiments use blunt-ending and adapter ligation.
  • the 5' end of the copy has one IMI generated by random sequence of the capture oligo.
  • the 3' end of the copy has a second IMI generated by copying the segment of the fragment near the random point of breakage.
  • each copy of one of the fragments includes first and second IMIs, or dual IMIs.
  • a transposase is used to randomly cut the cDNA.
  • Tn5 transposase
  • RNA sequencing by direct tagmentation of RNA/DNA hybrids PNAS117 (6) 2886- 2893, incorporated by reference.
  • the Tn5 transposase randomly binds and cuts doublestranded RNA/DNA and attaches its end sequence to the random cut site.
  • some embodiments of the invention use Tn5 transposase to directly tagment RNA/DNA hybrids and form polynucleotide libraries with intrinsic molecular identifiers (essentially unique sequences of bases originating in genetic material of the organism or biological system being studied).
  • Tn5 a RNase H superfamily member
  • the desired oligo is preferably a PCR handle (aka a universal primer binding site, a sequencing adaptor, a synthetic oligo of known sequence to which a PCR primer anneals, etc.).
  • Methods of the invention may be used with various amounts of input sample, from single cells to large numbers of cells, with a dynamic range spanning numerous orders of magnitude.
  • each copy of a fragment has a first IMI at one end and a second IMI at the other end.
  • Those copies are preferably subject to amplification (either with adapter ligation or with tailed primers) to generate amplicons (e.g., with sequencing adaptors and/or primer binding sites at both ends).
  • amplicons e.g., with sequencing adaptors and/or primer binding sites at both ends.
  • all of the amplicons have first and second IMIs at their respective ends.
  • an IMI functions like a molecular barcode, it is not the case that dual IMIs are equivalent to a single, but double-length, barcode.
  • Methods and compositions of the invention are compatible with cell preparation techniques that include aldehyde or formaldehyde fixation protocols.
  • Cell fixation and cell preservation techniques incorporate programmable fixation times, reversible bond formation and cleavage, chemo-selective reactions, and analyte recovery using, e.g., materials and techniques such as those discussed in Gallion, 2021, Preserving single cells in space and time for analytical assays, Trends Anal Chem 122: 115723, incorporated by reference.
  • Samples may be obtained from fixed cells or fixed tissue blocks. Samples that are formaldehyde-fixed, paraffin-embedded (FFPE) may be used.
  • FFPE formaldehyde-fixed, paraffin-embedded
  • methods of the disclosure may include steps from nuclei extraction from FFPE and formaldehyde fixed tissue. According to methods of the invention, fixed cells or nuclei may be captured as normal. Methods may include treatment with proteinase K, optionally using heat-triggered proteinase K activation, with added lysis buffer enzymes to efficiently dissolve and liberate cells and/or nuclei.
  • nucleic acids of interest typically include mRNA or precursor mRNA transcripts.
  • Other RNAs, such as ribosomal RNA (rRNA) may not be of-interest to the assay being performed.
  • Methods of the invention include a step for the depletion or removal of nontarget nucleic acids such as rRNA.
  • Ribosomal RNA removal using CRISPR may generally be referred to as CRISPR-based depletion or CRISPR-depletion or CRISPR ribodepletion or similar when applied to ribosomal RNA.
  • CRISPR-depletion technology harnesses the specificity of CRISPR to degrade abundant, uninformative sequences.
  • CRISPR-depletion may be integrated into a stranded total RNA sequencing library prep protocol by adding Cas9/RNA complexes after the adapter ligation step.
  • the Cas9/RNA complexes include a pool of guide RNAs that target and deplete unwanted sequences.
  • CRISPR-depletion technologies are not specific to sequencing platforms.
  • CRISPR-depletion or bulk ribodepletion reagents may be used to remove overabundant human, mouse, or rat rRNA from RNA-Seq libraries to improve sequencing sensitivity and performance. Kits are available commercially to provide a bulk post-library depletion reagent with multiplexing format - designed to remove overabundant human 5S, 5.8S, 18S, 28S, 45S (precursor), mitochondrial 12S and 16S ribosomal RNA.
  • CRISPR-based depletion may include the methods and materials for CRISPR-depletion sold under the trademark CRISPRCLEAN by Jumpcode Genomics, Inc. (San Diego, CA).
  • CRISPR-depletion may use materials or techniques described in US 2022/0145359 or US 2023/0265528, incorporated by reference.
  • magnetic pull-down depletion is an option, using rRNA-specific probes linked to magnetic beads, or example.
  • Single cell embodiments may involve isolating single cells into aqueous partitions or compartments to sequence and count nucleic acid molecules of a single cell. Isolation into partitions may be accomplished by any suitable mechanism and any suitable type of aqueous partition may be used. Exemplary suitable partitions include droplets, wells in a plate, or other fluid portioning structures. For example, the partitions may be wells, cavities, pockets, or openings in a pico-, nano-, or microtiter plate or substrate, or fluidic harbors (see, e.g., US Pub 2010/0041046 Al, incorporated by reference).
  • the partitions may be well in a multi-well plate such as a 96-well plate, 384 well plate, a 1536 well plate, a 3456 well plate, or a 9600 well plate.
  • the partitions may be separate chambers (see, e.g., 20210178395 Al, incorporated by reference).
  • the partitions may be distinct regions defined within a fluidic device (see, e.g., 20200269248 Al, incorporated by reference).
  • the partitions are droplets of an emulsion such as a water-in-oil (W/O) emulsion or a water-in-oil-in-water (W/O/W) emulsion.
  • W/O water-in-oil
  • W/O/W water-in-oil-in-water
  • template particles are in the mixture that is sheared or vortexed (e.g., dozens, hundreds, thousands, tens of thousands, more) is the number of droplets that are formed (as well as some "satellite droplets, or mere bubbles, of miscellaneous sizes and integrities that are not relevant to downstream analysis and can be simply ignored).
  • the resultant droplets that each include a template particle are monodisperse (e.g., same number of polymer subunits among the beads aka particles, and/or same mass each, and/or essentially same size/volume among the particles, e.g., same diameter when viewed under a microscope)
  • the resultant droplets that each include a template particle are monodisperse (each the size of one template particle plus a small, thin shell of aqueous fluid around it). If a droplet forms that contains two or more template particles, then during shearing or vortexing, that droplet breaks into one droplet per each template particle.
  • the capture moiety e.g., 3' sequence of the hybrid capture oligo
  • genomic information that is encoded within the sequence of the source gene for a molecule. Where the other end is generated by random fragmentation, both ends of each molecule are informed by the specific gene identity and the position of the molecule along that gene.
  • FIG. 2 shows compositions used in methods of the invention.
  • Methods and compositions of the invention preferably use a template particle 205, such as a hydrogel bead, linked to one or more capture oligos 207.
  • a 3' end of the capture oligo 207 defines a primer 219 for hybrid capture of a target nucleic acid 213.
  • the primer 219 preferably includes a stretch of random bases and may be referred to as a randomer or random priming sequence. Any suitable number (e g., from 3 to 30, or fewer, or more) bases may constitute the random priming sequence.
  • Preferred embodiments use about 6 to about 10, e.g., 8, random bases to make up the primer 219.
  • the capture oligo hybridizes to the target nucleic acid 213 at a random location and primes the synthesis of a first strand copy 235, where a 5' end of the first strand copy 235 begins at a random start location 232 in the target nucleic acid 231.
  • the first strand copy has a 3' end that includes a copy of bases adjacent a second random location 231.
  • Amplification may be performed to generate amplicons ready for sequencing, or sequencing libraries, from the target molecules.
  • Each amplicon may have an IMI or dual IMIs.
  • the disclosed system may use IMIs to deduplicate sequence reads and count template molecules in the sample.
  • Preferred embodiments use dual IMIs.
  • the dual IMI system involves randomer capture (by a segment of random bases on a 3' end of the capture oligo, e.g., that is attached to a bead) to read a random sequence from a 3 1 end of each molecule (e.g., template RNA).
  • Libraries have randomized segments at both 5' and 3' ends of molecules. Either or both may be used as IMI(s) for sequence read deduplication.
  • Attachment of the end sequence to the cDNA at the random cut site produces a construct, a contiguous DNA molecule that includes a first PCR handle (PEI), a cell barcode, a capture segment, a portion of the cDNA optionally terminating at the random cut site, and a second PCR handle (PE2).
  • PEI first PCR handle
  • PE2 cell barcode
  • Amplification of the construct yields amplicons.
  • constructs are amplified with a P5-PE1 hybrid oligo and P7 index primer directly into a sequencing library.
  • the library may be sequenced to assess RNA expression, for example, as described in Hrdlickova, 2017, RNA-Seq methods for transcriptome analysis, WIREs RNA 8(1): 10.1002, incorporated by reference.
  • Constructs or amplicons may include certain primer and index sequences or copies thereof, such as, P5s and P7s.
  • Those sequences may be any arbitrary sequence useful in downstream analysis. For example, they may be additional universal primer binding sites or sequencing adaptors.
  • either or both of the P5s and P7s may be arbitrary universal priming sequence (universal meaning that the sequence information is not specific to the naturally occurring genomic sequence being studied but is instead suited to being amplified using a pair of cognate universal primers, by design).
  • the index segment may be any suitable barcode or index such as may be useful in downstream information processing.
  • Libraries may be sequenced by any suitable method. Suitable methods include Sanger sequencing, Illumina or Ultima sequencing, Roche pyrosequencing, single-molecule, long-read sequencing using platforms offered by Pacific Biosciences or Oxford Nanopore.
  • An example of a sequencing technology that can be used is Illumina sequencing. Illumina sequencing is based on the amplification of DNA on a solid surface using fold-back PCR and anchored primers. Genomic DNA is fragmented and attached to the surface of flow cell channels. Four fluorophore-labeled, reversibly terminating nucleotides are used to perform sequencing. After nucleotide incorporation, a laser is used to excite the fluorophores, and an image is captured, and the identity of the first base is recorded.
  • SAM sequence alignment map format
  • Other methods useful for processing and analyzing sequence reads are discussed in U.S. Pat. No. 8,209,130, which is incorporated by reference. Determining gene expression generally involves counting numbers of unique sequence reads that uniquely map to a human reference genome. Mapping reads to a reference to identify genes may be performed using computer software packages known in the art.
  • mapping reads to a reference and identifying genes gives a quantitative result when reads are deduplicated to yield one read per mRNA from which those reads originated. Because each mRNA is typically copied into cDNA and each cDNA is typically copied into an unpredictably large number of amplicons in the sequencing library, and because each library member is often amplified or read redundantly as part of a sequencing technique, a number of raw sequence reads does not necessarily correlate to numbers of input molecules from the single cells. Nevertheless, one cell may include abundant transcripts that map to one gene.
  • compositions and methods of the invention give each cDNA at least one unique intrinsic identifier (and preferably dual-IMIs) that can be identified within, and used to deduplicate, sequence reads. After those sequence reads are identified by gene and deduplicated, then counts of those reads provide a quantitative measure of gene expression levels. Methods may include, prior to the deduplicating step, saving the sequence reads in memory, coupled to at least on processor in a computer system, as a FASTA or FASTQ file, wherein the deduplicating and mapping are performed by the computer system. [0069] As discussed, methods of the invention are useful for scRNA-Seq and specifically for expression analysis.
  • cells are isolated into, and lysed within, aqueous partitions with capture oligos.
  • the capture oligos anneal to RNAs released from the cells.
  • the capture oligos preferably include partition-specific barcodes and PCR handles. Once the capture oligos have hybridized to the RNAs, those duplexes may be released from partitions and pooled at any subsequent stage. Because capture oligos with partition-specific barcodes are used to capture and tag RNA from cells isolated in the partition, any arbitrary number of cells may be captured in parallel (simultaneously).
  • the cell barcodes in the sequencing data can be used to “bin” the sequence data by original cell, i.e., assign each sequence read (or assembled contigs or sequences therefrom) back to originating cells.
  • Read deduplication 131 and transcript counting is preferably performed by a computer system operably linked to a sequencing instrument and executing program instructions causing the computer system to perform those functions.
  • sequence reads may arrive at the computer system in FASTQ format.
  • Each entry — each "sequence read — in a FASTQ file (or FASTA) will include a segment of sequence information read from the target molecules.
  • Those sequence reads will also each include at least one IMI.
  • each target molecule will generate two reads, a forward read and a reverse read, and each read will have an IMI read from the target molecule.
  • sequence information is read from the target molecule is used to identify the gene of origin for that molecule (e.g., transcript). But each IMI is also read from the target molecule.
  • the computer system can execute program software to deduplicate the reads (simply identifying all duplicates and saving only one, i.e., "collapsing" the reads to a single read; or by leaving the FASTQ files intact but only "counting" all duplicate reads as 1, e.g., in a count file.
  • the system may also identify gene information for each read, e.g., by mapping to a reference or by querying a transcript database.
  • Read mapping can proceed by known methods including, for example, by methods that involve pairwise alignment of each read to a reference, such as a published human genome such as the 36 th build of the human genome refer e d to in industry as HG36 or hg36. Comparison to references may also proceed by building hashes of k-mers in the reference and in the query (the sequence read) and looking up the hashed k-mers of the query in the target (the reference). Read-mapping may involve transforming each sequence in order or in characters via an informatic transform such as the Burroughs-Wheeler transform (BWT) after which comparison of the BWT of the query to the target is trivial.
  • BWT Burroughs-Wheeler transform
  • RNA transcript a gene (as it is found in the DNA within the genome of an organism that is being studied) that has been transcribed into an RNA transcript, that was copied with a randomer and amplified to generate amplicons with at least one IMI, which amplicons are sequenced to generate sequence reads that include the IMI is assigned to the sequence read, and by implication, the RNA transcript is identified as having been transcribed from that gene. For that gene, each unduplicated read is used to increment a count of transcripts by one.
  • the read counts are a measure of a number of actual transcript molecules that were present in the sample.
  • the gene identities and read counts are a measure of expression levels of the gene in that cell and are also thus a transcriptome or transcriptomic profile for the cell.
  • Each copy includes (i) a sample sequence copied from a sample polynucleotide of the plurality of sample polynucleotides, and (ii) a tag sequence copied from the sample polynucleotide, the tag sequence distinguishing said sample polynucleotide from other sample polynucleotides from the sample.
  • the tag sequence functions like an IMI discussed above.
  • the information of the tag sequence is information that was in the sample polynucleotides originally, i.e., it is genetic information of the organism being studied.
  • Generating the copies may include annealing, to the sample polynucleotide, a primer (e.g., random er of a capture oligo) having a random sequence of bases at a 3' end and extending the primer to generate the tag sequence.
  • the random sequence may be about 8 bases in length, e.g., about 6 to 9.
  • the plurality of sample polynucleotides may be RNA and each of the plurality of copies of the sample polynucleotides may include a random tag sequence copied from the RNA.
  • the unique sequence in each of the copies is copied from a segment of a transcript transcribed by an organism.
  • the methods may include performing the recited steps for a plurality of cells to generate, for each cell of the plurality, a transcriptome profile based on, for that cell, the count of mapped unique reads.
  • the fluid may comprise reagents such as surfactants (e.g., octylphenol ethoxylate and/or octylphenoxypolyethoxyethanol), reducing agents (e.g., DTT, beta mercaptoethanol, or combinations thereof).
  • surfactants e.g., octylphenol ethoxylate and/or octylphenoxypolyethoxyethanol
  • reducing agents e.g., DTT, beta mercaptoethanol, or combinations thereof.

Landscapes

  • Chemical & Material Sciences (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • Health & Medical Sciences (AREA)
  • Organic Chemistry (AREA)
  • Genetics & Genomics (AREA)
  • Engineering & Computer Science (AREA)
  • Zoology (AREA)
  • Wood Science & Technology (AREA)
  • General Engineering & Computer Science (AREA)
  • Bioinformatics & Cheminformatics (AREA)
  • Biotechnology (AREA)
  • Molecular Biology (AREA)
  • Biochemistry (AREA)
  • Proteomics, Peptides & Aminoacids (AREA)
  • Biomedical Technology (AREA)
  • Microbiology (AREA)
  • Physics & Mathematics (AREA)
  • Analytical Chemistry (AREA)
  • Biophysics (AREA)
  • General Health & Medical Sciences (AREA)
  • Bioinformatics & Computational Biology (AREA)
  • Crystallography & Structural Chemistry (AREA)
  • Chemical Kinetics & Catalysis (AREA)
  • Plant Pathology (AREA)
  • Immunology (AREA)
  • Measuring Or Testing Involving Enzymes Or Micro-Organisms (AREA)

Abstract

L'invention concerne des procédés de séquençage (127) et de comptage d'acides nucléiques. Les procédés peuvent comprendre la réalisation de copies (113) de molécules d'acide nucléique à partir de sites de départ aléatoires, le séquençage (127) des copies pour obtenir des lectures de séquence, la déduplication (131) des lectures de séquence pour produire uniquement des lectures uniques, et le mappage des lectures uniques sur des gènes et, pour chacun des gènes, la fourniture d'un nombre de lectures uniques mappées.
PCT/US2025/016475 2024-02-23 2025-02-19 Procédés de séquençage et de comptage d'acides nucléiques Pending WO2025178954A1 (fr)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
US202463557130P 2024-02-23 2024-02-23
US63/557,130 2024-02-23

Publications (1)

Publication Number Publication Date
WO2025178954A1 true WO2025178954A1 (fr) 2025-08-28

Family

ID=94974271

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/US2025/016475 Pending WO2025178954A1 (fr) 2024-02-23 2025-02-19 Procédés de séquençage et de comptage d'acides nucléiques

Country Status (1)

Country Link
WO (1) WO2025178954A1 (fr)

Citations (33)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US6210891B1 (en) 1996-09-27 2001-04-03 Pyrosequencing Ab Method of sequencing DNA
US6306597B1 (en) 1995-04-17 2001-10-23 Lynx Therapeutics, Inc. DNA sequencing by parallel oligonucleotide extensions
US6828100B1 (en) 1999-01-22 2004-12-07 Biotage Ab Method of DNA sequencing
US6833246B2 (en) 1999-09-29 2004-12-21 Solexa, Ltd. Polynucleotide sequencing
US6911345B2 (en) 1999-06-28 2005-06-28 California Institute Of Technology Methods and apparatus for analyzing polynucleotide sequences
US20060024681A1 (en) 2003-10-31 2006-02-02 Agencourt Bioscience Corporation Methods for producing a paired tag from a nucleic acid sequence and methods of use thereof
US20060292611A1 (en) 2005-06-06 2006-12-28 Jan Berka Paired end sequencing
US20070114362A1 (en) 2005-11-23 2007-05-24 Illumina, Inc. Confocal imaging methods and apparatus
US7232656B2 (en) 1998-07-30 2007-06-19 Solexa Ltd. Arrayed biomolecules and their use in sequencing
US7598035B2 (en) 1998-02-23 2009-10-06 Solexa, Inc. Method and compositions for ordering restriction fragments
US20100041046A1 (en) 2008-08-15 2010-02-18 University Of Washington Method and apparatus for the discretization and manipulation of sample volumes
US7835871B2 (en) 2007-01-26 2010-11-16 Illumina, Inc. Nucleic acid sequencing system and method
US7960120B2 (en) 2006-10-06 2011-06-14 Illumina Cambridge Ltd. Method for pair-wise sequencing a plurality of double stranded target polynucleotides
US8209130B1 (en) 2012-04-04 2012-06-26 Good Start Genetics, Inc. Sequence assembly
US9005891B2 (en) 2009-11-10 2015-04-14 Genomic Health, Inc. Methods for depleting RNA from nucleic acid samples
WO2018136248A1 (fr) * 2017-01-18 2018-07-26 Illuminia, Inc. Procédés et systèmes de génération et de correction d'erreur d'ensembles d'indices moléculaires uniques ayant des longueurs moléculaires hétérogènes
WO2019139650A2 (fr) 2017-09-29 2019-07-18 The Regents Of The University Of California Procédé de génération d'émulsions monodispersées
US10421992B2 (en) 2012-09-13 2019-09-24 Takara Bio Usa, Inc. Methods of depleting a target nucleic acid in a sample and kits for practicing the same
US10472666B2 (en) 2016-02-15 2019-11-12 Roche Sequencing Solutions, Inc. System and method for targeted depletion of nucleic acids
US20200269248A1 (en) 2007-02-06 2020-08-27 Brandeis University Manipulation of fluids and reactions in microfluidic systems
US20210178395A1 (en) 2007-04-19 2021-06-17 President And Fellows Of Harvard College Manipulation of fluids, fluid components and reactions in microfluidic systems
US20210215591A1 (en) 2020-01-13 2021-07-15 Fluent Biosciences Inc. Devices for generating monodisperse droplets from a bulk liquid
US20210214721A1 (en) 2020-01-13 2021-07-15 Fluent Biosciences Inc. Reverse transcription during template emulsification
US20210214792A1 (en) 2020-01-13 2021-07-15 Fluent Biosciences Inc. Methods and systems for single cell gene profiling
US11115853B2 (en) 2009-01-07 2021-09-07 Yamaha Corporation Wireless network system and wireless communication method
US20210340596A1 (en) 2018-09-28 2021-11-04 Fluent Biosciences Inc. Target capture and barcoding in monodisperse droplets
US20210381064A1 (en) 2020-01-13 2021-12-09 Fluent Biosciences Inc. Single cell sequencing
US20220145359A1 (en) 2019-02-12 2022-05-12 Jumpcode Genomics, Inc. Methods for targeted depletion of nucleic acids
WO2022098726A1 (fr) * 2020-11-03 2022-05-12 Fluent Biosciences Inc. Systèmes et procédés de fabrication de banques de séquençage
US20230265528A1 (en) 2020-08-12 2023-08-24 Jumpcode Genomics, Inc. Methods for targeted depletion of nucleic acids
US20230279490A1 (en) 2022-03-01 2023-09-07 Watchmaker Genomics, Inc. Depletion probes
WO2024077114A1 (fr) * 2022-10-06 2024-04-11 Fluent Biosciences Inc. Séquençage et quantification
WO2024259274A2 (fr) * 2023-06-16 2024-12-19 Illumina, Inc. Procédés d'analyse de déduplication moléculaire

Patent Citations (37)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US6306597B1 (en) 1995-04-17 2001-10-23 Lynx Therapeutics, Inc. DNA sequencing by parallel oligonucleotide extensions
US6210891B1 (en) 1996-09-27 2001-04-03 Pyrosequencing Ab Method of sequencing DNA
US7598035B2 (en) 1998-02-23 2009-10-06 Solexa, Inc. Method and compositions for ordering restriction fragments
US7232656B2 (en) 1998-07-30 2007-06-19 Solexa Ltd. Arrayed biomolecules and their use in sequencing
US6828100B1 (en) 1999-01-22 2004-12-07 Biotage Ab Method of DNA sequencing
US6911345B2 (en) 1999-06-28 2005-06-28 California Institute Of Technology Methods and apparatus for analyzing polynucleotide sequences
US6833246B2 (en) 1999-09-29 2004-12-21 Solexa, Ltd. Polynucleotide sequencing
US20060024681A1 (en) 2003-10-31 2006-02-02 Agencourt Bioscience Corporation Methods for producing a paired tag from a nucleic acid sequence and methods of use thereof
US20060292611A1 (en) 2005-06-06 2006-12-28 Jan Berka Paired end sequencing
US20070114362A1 (en) 2005-11-23 2007-05-24 Illumina, Inc. Confocal imaging methods and apparatus
US7960120B2 (en) 2006-10-06 2011-06-14 Illumina Cambridge Ltd. Method for pair-wise sequencing a plurality of double stranded target polynucleotides
US7835871B2 (en) 2007-01-26 2010-11-16 Illumina, Inc. Nucleic acid sequencing system and method
US20110009278A1 (en) 2007-01-26 2011-01-13 Illumina, Inc. Nucleic acid sequencing system and method
US20200269248A1 (en) 2007-02-06 2020-08-27 Brandeis University Manipulation of fluids and reactions in microfluidic systems
US20210178395A1 (en) 2007-04-19 2021-06-17 President And Fellows Of Harvard College Manipulation of fluids, fluid components and reactions in microfluidic systems
US20100041046A1 (en) 2008-08-15 2010-02-18 University Of Washington Method and apparatus for the discretization and manipulation of sample volumes
US11115853B2 (en) 2009-01-07 2021-09-07 Yamaha Corporation Wireless network system and wireless communication method
US11149297B2 (en) 2009-11-10 2021-10-19 Genomic Health, Inc. Methods for depleting RNA from nucleic acid samples
US9005891B2 (en) 2009-11-10 2015-04-14 Genomic Health, Inc. Methods for depleting RNA from nucleic acid samples
US8209130B1 (en) 2012-04-04 2012-06-26 Good Start Genetics, Inc. Sequence assembly
US10421992B2 (en) 2012-09-13 2019-09-24 Takara Bio Usa, Inc. Methods of depleting a target nucleic acid in a sample and kits for practicing the same
US10472666B2 (en) 2016-02-15 2019-11-12 Roche Sequencing Solutions, Inc. System and method for targeted depletion of nucleic acids
WO2018136248A1 (fr) * 2017-01-18 2018-07-26 Illuminia, Inc. Procédés et systèmes de génération et de correction d'erreur d'ensembles d'indices moléculaires uniques ayant des longueurs moléculaires hétérogènes
US20200261879A1 (en) 2017-09-29 2020-08-20 The Regents Of The University Of California Method of generating monodisperse emulsions
WO2019139650A2 (fr) 2017-09-29 2019-07-18 The Regents Of The University Of California Procédé de génération d'émulsions monodispersées
US20210340596A1 (en) 2018-09-28 2021-11-04 Fluent Biosciences Inc. Target capture and barcoding in monodisperse droplets
US20220145359A1 (en) 2019-02-12 2022-05-12 Jumpcode Genomics, Inc. Methods for targeted depletion of nucleic acids
US20210381064A1 (en) 2020-01-13 2021-12-09 Fluent Biosciences Inc. Single cell sequencing
US20210214792A1 (en) 2020-01-13 2021-07-15 Fluent Biosciences Inc. Methods and systems for single cell gene profiling
US20210215591A1 (en) 2020-01-13 2021-07-15 Fluent Biosciences Inc. Devices for generating monodisperse droplets from a bulk liquid
US20210214721A1 (en) 2020-01-13 2021-07-15 Fluent Biosciences Inc. Reverse transcription during template emulsification
US11773452B2 (en) 2020-01-13 2023-10-03 Fluent Biosciences Inc. Single cell sequencing
US20230265528A1 (en) 2020-08-12 2023-08-24 Jumpcode Genomics, Inc. Methods for targeted depletion of nucleic acids
WO2022098726A1 (fr) * 2020-11-03 2022-05-12 Fluent Biosciences Inc. Systèmes et procédés de fabrication de banques de séquençage
US20230279490A1 (en) 2022-03-01 2023-09-07 Watchmaker Genomics, Inc. Depletion probes
WO2024077114A1 (fr) * 2022-10-06 2024-04-11 Fluent Biosciences Inc. Séquençage et quantification
WO2024259274A2 (fr) * 2023-06-16 2024-12-19 Illumina, Inc. Procédés d'analyse de déduplication moléculaire

Non-Patent Citations (5)

* Cited by examiner, † Cited by third party
Title
BOLGER: "Trimmomatic: a flexible trimmer for Illumina sequence data", BIOINFORMATICS, vol. 30, no. 15, 2014, pages 2114 - 2120, XP055862121, DOI: 10.1093/bioinformatics/btu170
BOWMAN: "Multiplexed Illumina sequencing libraries from picogram quantities of DNA", BMC GENOMICS, vol. 14, 2013, pages 466, XP021156155, DOI: 10.1186/1471-2164-14-466
CONTE MATILDE I ET AL: "Opportunities and tradeoffs in single-cell transcriptomic technologies", TRENDS IN GENETICS, ELSEVIER SCIENCE PUBLISHERS B.V. AMSTERDAM, NL, vol. 40, no. 1, 10 November 2023 (2023-11-10), pages 83 - 93, XP087448218, ISSN: 0168-9525, Retrieved from the Internet <URL:10.1016/J.TIG.2023.10.003> [retrieved on 20231110], DOI: 10.1016/J.TIG.2023.10.003 *
GALLION: "Preserving single cells in space and time for analytical assays", TRENDS ANAL CHEM, vol. 122, 2021, pages 115723
LIN: "RNA sequencing by direct tagmentation of RNA/DNA hybrids", PNAS, vol. 117, no. 6, 2020, pages 2886 - 2893, XP055866200, DOI: 10.1073/pnas.1919800117

Similar Documents

Publication Publication Date Title
US20220333185A1 (en) Methods and compositions for whole transcriptome amplification
EP4240870B1 (fr) Systèmes et procédés de fabrication de banques de séquençage
JP7234114B2 (ja) 細胞区画内の生体分子への直交アクセスおよび細胞区画内の生体分子のタグ付けのための分析システム
JP2024149636A (ja) ヌクレアーゼを利用したrna枯渇
JP2021522797A (ja) ハイスループットマルチオミクスサンプル解析
US11976325B2 (en) Quantitative detection and analysis of molecules
KR20170020704A (ko) 개별 세포 또는 세포 개체군으로부터 핵산을 분석하는 방법
AU2016331058B2 (en) Methods for subtyping diffuse large b-cell lymphoma (dlbcl)
US20200157600A1 (en) Methods and compositions for whole transcriptome amplification
JP2024502028A (ja) 配列決定ライブラリー調製のための方法および組成物
US12534721B2 (en) Quantitative detection and analysis of moleculesip
EP4413158A2 (fr) Traitement atacseq à base de billes
KR20220121826A (ko) 시료 처리 바코딩된 비드 조성물, 방법, 제조, 및 시스템
US20240050949A1 (en) Highly efficient partition loading of single cells
US20250046395A1 (en) Molecular deduplication analysis methods
JP2025078774A (ja) 連結された標的捕捉
WO2025178954A1 (fr) Procédés de séquençage et de comptage d&#39;acides nucléiques
JP7152599B2 (ja) 塩基配列決定のためのモジュール式およびコンビナトリアル核酸試料調製のためのシステムおよび方法
JP2024546177A (ja) シングルセル核酸を標識及び分析する方法
US20240067959A1 (en) Library preparation from fixed samples
HK40061304A (en) Linked target capture

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 25711913

Country of ref document: EP

Kind code of ref document: A1