WO2003025219A2 - Verfahren zur sequenzierung von nukleinsäuren - Google Patents
Verfahren zur sequenzierung von nukleinsäuren Download PDFInfo
- Publication number
- WO2003025219A2 WO2003025219A2 PCT/EP2002/010235 EP0210235W WO03025219A2 WO 2003025219 A2 WO2003025219 A2 WO 2003025219A2 EP 0210235 W EP0210235 W EP 0210235W WO 03025219 A2 WO03025219 A2 WO 03025219A2
- Authority
- WO
- WIPO (PCT)
- Prior art keywords
- sequence
- spectrum
- calculated
- measured
- value
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Ceased
Links
Classifications
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12Q—MEASURING OR TESTING PROCESSES INVOLVING ENZYMES, NUCLEIC ACIDS OR MICROORGANISMS; COMPOSITIONS OR TEST PAPERS THEREFOR; PROCESSES OF PREPARING SUCH COMPOSITIONS; CONDITION-RESPONSIVE CONTROL IN MICROBIOLOGICAL OR ENZYMOLOGICAL PROCESSES
- C12Q1/00—Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions
- C12Q1/68—Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions involving nucleic acids
- C12Q1/6869—Methods for sequencing
- C12Q1/6872—Methods for sequencing involving mass spectrometry
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12Q—MEASURING OR TESTING PROCESSES INVOLVING ENZYMES, NUCLEIC ACIDS OR MICROORGANISMS; COMPOSITIONS OR TEST PAPERS THEREFOR; PROCESSES OF PREPARING SUCH COMPOSITIONS; CONDITION-RESPONSIVE CONTROL IN MICROBIOLOGICAL OR ENZYMOLOGICAL PROCESSES
- C12Q1/00—Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions
- C12Q1/68—Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions involving nucleic acids
- C12Q1/6813—Hybridisation assays
- C12Q1/6827—Hybridisation assays for detection of mutation or polymorphism
Definitions
- the invention relates to a method for sequencing nucleic acids, the values of the CID spectrum calculated for an assumed sequence being compared with the measured spectrum of the nucleic acid and the degree of agreement being calculated.
- nucleic acids are first ionized, selected in a first spectrometer and then fragmented by collision with gaseous atoms or molecules (referred to as collision induced dissociation (CID)).
- CID collision induced dissociation
- 6,017,693 (Yates et al) describes a method of the type defined in the introduction, in which a library of peptides is checked to determine whether it contains a compound corresponding to the measured spectrum.
- the extension of the method to oligonucleotides is also suggested, but the method is typically limited to 10 mers and below because of the huge number of initial values to be checked.
- the invention has for its object to make the known methods applicable for significantly longer sequences. This is essentially achieved by not primarily asking the question about the correct sequence, but rather whether an improvement in the agreement with the measured spectrum can be achieved by systematically changing an initial sequence.
- the invention is thus characterized in that at least one position of the assumed sequence is changed and the degree of agreement with the measured spectrum is recalculated.
- the invention can be used to achieve an increasing approximation to the correct sequence starting from a randomly assumed starting sequence by repeated application of the method.
- the method's real strength lies in the detection and localization of point mutations and the omission or insertion of nucleic acid monomers into a known sequence.
- the number of nucleotides is known, it is advantageous if each of the four possible nucleotides is used at a specific point in the assumed sequence and the degree of agreement with the spectrum is calculated in each case. If this procedure is carried out for each position in the sequence and the nucleotide with the best match is inserted into the sequence, the result is an optimal match for the entire sequence.
- the values of m / z for a particular fragment can be calculated with great accuracy, while the intensity of the measured values, that is to say the frequency distribution of the various fragments, is difficult to predict. On this basis, it makes sense initially to refrain from forecasting the intensities at all, but to rate the arrival of a predicted measured value in the measured spectrum the higher the greater the measured intensity. On the other hand, since the values of m / z are to be calculated with greater accuracy than to be measured, a difference value has to be defined up to which the prediction is considered to be fulfilled. This stipulation is based on the requirement that the number of wrong hits and false reports should be minimized.
- Fig. 4 shows the relative importance of the factors of the algorithm of Fig. 2,
- Collision-induced fragmentation cleaves nucleic acids in a sequence-specific manner, resulting in fragments of the precursor ion, which are designated according to the type of bond break in the phosphodiester group and the charge remaining. While loaded fragments with an intact 5'-terminal end are referred to as a, b, c, d fragments, loaded fragments with an intact 3'-terminal end are referred to as w, x, y, z fragments (FIG. 1a ). The number of deoxyribose residues of a fragment is given with subscripts and the charge state with superscripts.
- (a 5 ) 2 stands for a fragment of the a series, which consists of five nucleotide units and is negatively charged twice. Fragments with an additional cleavage of a base are obtained by adding -B, for example as (a n -B n ) m " .
- fragmentation can in principle take place on any binding, it has been shown that there is a main fragmentation pathway for oligonucleotide anions.
- the 3'-C-O bond of the sugar breaks, which subsequently leads to a loss of the base and formation of w- and complementary (a-B) ions (FIG. 1b).
- Fragment ions of a certain type belong to a certain ion series, the mass differences within an ion series being determined by the masses of the bases in the sequence. The sequence of a fragmented nucleic acid can therefore be read from the mass differences in ion series (FIG. 1c).
- the starting point of the sequencing approach according to the invention is the fact that a complete redetermination of the sequence is not necessary for many genome-specific applications. It is usually sufficient to verify a sequence or to find minor deviations from a known sequence. Sequencing using MS / MS is an attractive alternative to the sequencing method conventionally used by Sanger. The reason for this is that sequence-specific data can be obtained within seconds by collision-induced fragmentation and subsequent analysis of the fragment ions.
- the quality of correspondence between a measured MS / MS spectrum and a set of fragment ions can be described by practically any mathematical algorithm.
- an algorithm is explained in which the correspondence between the measured spectrum and the calculated fragment ions is characterized by the value of fitness (FS).
- FS value of fitness
- the smaller the value of fitness the greater the agreement of the measured spectrum with the calculated m / z values.
- the first reference sequence is varied sequentially by incorporating all four possible nucleotides A, T, G and C at each position in the sequence. The sequence with the lowest fitness value is then identified as the correct one.
- the input parameters are on the one hand the reference sequence together with the charge state of the precursor ion and on the other hand a list of experimentally determined m / z values and their relative intensities 1%.
- a list of monoisotopic m / z values for the a, aB, w and wB ion series is calculated, taking into account all possible charge states of the fragment ions from 1 to the charge state of the precursor ion (step 1, Fig. 2).
- the calculated and experimentally found m / z values are then compared with one another (step 2a, FIG. 2).
- the result of this comparison is the number of fragment ions (K) to which a calculated m / z value could be assigned and the sum of uncovered a or w positions (M).
- K fragment ions
- M sum of uncovered a or w positions
- a positions that are not covered are those positions for which neither an aB nor an a fragment of any possible charge state could be found in the spectrum. It also applies to the uncovered w positions that neither a wB nor a w fragment could be assigned to any possible charge state in the spectrum.
- the absolute value of the difference between the measured and the calculated m / z value must be less than or equal to a tolerance value ( ⁇ ).
- the sizes of K and M are calculated increasing for all ⁇ values between 0.2 and 0.8 in 0.1 steps (step 2b, Fig. 2).
- the optimum ⁇ value is then selected under the criterion that the number of false positive ( ⁇ too large) or false negative ( ⁇ too small) assignments is minimized (step 2c, FIG. 2).
- the optimal ⁇ value can be derived from a plot of M against ⁇ as the ⁇ value at which the curve is just reaching the constant value (FIG. 3).
- the next step of the algorithm is the calculation of a match factor MF, which reflects the quality of the assignment of the m / z values in relation to mass deviation and intensity (step 3, FIG. 2).
- the coefficient a is set to the value 1, whereas b and c are determined empirically.
- a value of 0.1 for the coefficient c was found to be suitable for all examples given here. Inserting the coefficients in the above equation gives FS (step 5, Fig. 2):
- the primarily calculated value of the fitness is a measure of the agreement of the measured MS / MS spectrum with the set of fragment ions calculated from the reference sequence.
- the reference sequence is systematically varied at each position and then a new fitness is calculated for each of these mutated sequences. For this, the base is replaced sequentially at each position by the three other bases, while the rest of the sequence is kept constant (step 6a, FIG. 2).
- step 6b-e 3n calculations of the FS are carried out for a sequence of n nucleotides (step 6b-e, FIG. 2).
- the result of the entire calculation is a matrix which contains the fitness of the reference sequence (bold numbers, Fig. 2) and the varied sequences at the different positions of the sequence (Fig. 2).
- the FS (T) value in row 1 of the matrix (-5.53) represents the fitness of the TGGC sequence.
- FS (A) is the value for the sequence in which the T at position 1 was replaced by an A (15.93).
- A 15.93
- the smallest FS value was obtained for the reference sequence, which leads to the conclusion that the best match between the experimental spectrum and the reference sequence exists here.
- the exchange of a base would be indicated by an FS value that is less than the FS value of the reference sequence for a mutated sequence.
- Example 1 Comparative sequencing of a 20-mer oligodeoxynucleotide
- the mass spectroscopic data obtained in this way (m / z, 1%) then acted as input for the comparative sequencing algorithm.
- the sequence GACAGGAAAGACATTCTGGC (Seq. ID No .: 2) was used as the reference sequence, which has an A at position 13 contained a T.
- Table 1 summarizes the m / z values of the a, aB, w and wB fragment ions for the charge states from 1 to 4 calculated for this reference sequence using the fragmentation mechanisms described above.
- the matrix of FS values was then calculated according to the information in the previous part.
- the matrix was shown in the form of a diagram in which the FS values of the individual bases, represented by the letters A, T, G and C, were plotted against the position in the oligodeoxynucleotide sequence. It can be seen in FIG. 6a that the reference sequence for the first 6 bases provided the best match, but that from position 7 the FS values for other bases fit better. In particular, the exchange of T for A led to significantly lower fitness values, which suggested a mutation from A to T very closely. The exact position resulted from the minimum of the FS values, which clearly demonstrated the mutation from A to T at position 13.
- the base sequence GACAGGAAAGACTTTCTGGC
- PCR polymerase chain reaction
- an amplified PCR product must have a length of more than 40 base pairs in order to contain relevant sequence information.
- a 51-mer (molecular mass: 15580.0) of the sequence AAACCACATT CTGAGCATAG CCCCAAAAAA TTTCATGCCG AAGCTGTGGT C (Seq. ID No .: 3) was fragmented and measured.
- FIG. 8 Another example that is intended to show the applicability of the algorithm for identifying long oligodeoxynucleotides is shown in FIG. 8.
- the correctness of the sequence of an 80-mer was checked by comparison with the MS / MS spectrum (CCCCAGTGCT GCAATGATAC CGCGAGACCC ACGCTCACCG GCTCCAGATT TATCAGCAAT AAACCAGCCA GCCGGAAGGG) (Seq. ID No .: 4).
Landscapes
- Life Sciences & Earth Sciences (AREA)
- Chemical & Material Sciences (AREA)
- Proteomics, Peptides & Aminoacids (AREA)
- Organic Chemistry (AREA)
- Health & Medical Sciences (AREA)
- Wood Science & Technology (AREA)
- Engineering & Computer Science (AREA)
- Zoology (AREA)
- Physics & Mathematics (AREA)
- Immunology (AREA)
- Bioinformatics & Cheminformatics (AREA)
- Molecular Biology (AREA)
- Analytical Chemistry (AREA)
- Biophysics (AREA)
- Biotechnology (AREA)
- Biochemistry (AREA)
- Microbiology (AREA)
- General Engineering & Computer Science (AREA)
- General Health & Medical Sciences (AREA)
- Genetics & Genomics (AREA)
- Spectroscopy & Molecular Physics (AREA)
- Measuring Or Testing Involving Enzymes Or Micro-Organisms (AREA)
- Other Investigation Or Analysis Of Materials By Electrical Means (AREA)
- Investigating Or Analysing Biological Materials (AREA)
Abstract
Description
Claims
Priority Applications (7)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| AT02798646T ATE432505T1 (de) | 2001-09-14 | 2002-09-12 | Verfahren zur sequenzierung von nukleinsäuren |
| AU2002335313A AU2002335313A1 (en) | 2001-09-14 | 2002-09-12 | Method for sequencing nucleic acids |
| EP02798646A EP1430437B1 (de) | 2001-09-14 | 2002-09-12 | Verfahren zur sequenzierung von nukleinsäuren |
| DE50213578T DE50213578D1 (de) | 2001-09-14 | 2002-09-12 | Verfahren zur sequenzierung von nukleinsäuren |
| US10/490,074 US7720613B2 (en) | 2001-09-14 | 2002-09-12 | Method for sequencing nucleic acids |
| CA2460329A CA2460329C (en) | 2001-09-14 | 2002-09-12 | Method for sequencing nucleic acids |
| JP2003529991A JP4550416B2 (ja) | 2001-09-14 | 2002-09-12 | 核酸の配列決定方法 |
Applications Claiming Priority (2)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| ATA1444/2001 | 2001-09-14 | ||
| AT0144401A AT410983B (de) | 2001-09-14 | 2001-09-14 | Verfahren zur ermittlung der sequenz von nukleinsäuren |
Publications (2)
| Publication Number | Publication Date |
|---|---|
| WO2003025219A2 true WO2003025219A2 (de) | 2003-03-27 |
| WO2003025219A3 WO2003025219A3 (de) | 2004-02-12 |
Family
ID=3688238
Family Applications (1)
| Application Number | Title | Priority Date | Filing Date |
|---|---|---|---|
| PCT/EP2002/010235 Ceased WO2003025219A2 (de) | 2001-09-14 | 2002-09-12 | Verfahren zur sequenzierung von nukleinsäuren |
Country Status (8)
| Country | Link |
|---|---|
| US (1) | US7720613B2 (de) |
| EP (1) | EP1430437B1 (de) |
| JP (1) | JP4550416B2 (de) |
| AT (2) | AT410983B (de) |
| AU (1) | AU2002335313A1 (de) |
| CA (1) | CA2460329C (de) |
| DE (1) | DE50213578D1 (de) |
| WO (1) | WO2003025219A2 (de) |
Cited By (1)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| EP1762629A1 (de) | 2005-09-12 | 2007-03-14 | Roche Diagnostics GmbH | Nachweis biologischer DNA |
Families Citing this family (1)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| JP2009031128A (ja) * | 2007-07-27 | 2009-02-12 | Univ Of Tokyo | 核酸の塩基配列及び塩基修飾を解析する装置、方法及びプログラム |
Family Cites Families (3)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| US6017693A (en) | 1994-03-14 | 2000-01-25 | University Of Washington | Identification of nucleotides, amino acids, or carbohydrates by mass spectrometry |
| ATE298004T1 (de) | 1999-04-30 | 2005-07-15 | Methexis Genomics N V | Diagnostische sequenzierung durch eine kombination von spezifischer spaltung und massenspektrometrie |
| JP2004503780A (ja) * | 2000-06-12 | 2004-02-05 | ユニバーシティ オブ ワシントン | リンペプチドの選択的標識および単離ならびにプロテオーム分析への適用 |
-
2001
- 2001-09-14 AT AT0144401A patent/AT410983B/de not_active IP Right Cessation
-
2002
- 2002-09-12 AU AU2002335313A patent/AU2002335313A1/en not_active Abandoned
- 2002-09-12 DE DE50213578T patent/DE50213578D1/de not_active Expired - Lifetime
- 2002-09-12 US US10/490,074 patent/US7720613B2/en not_active Expired - Fee Related
- 2002-09-12 WO PCT/EP2002/010235 patent/WO2003025219A2/de not_active Ceased
- 2002-09-12 JP JP2003529991A patent/JP4550416B2/ja not_active Expired - Fee Related
- 2002-09-12 AT AT02798646T patent/ATE432505T1/de not_active IP Right Cessation
- 2002-09-12 EP EP02798646A patent/EP1430437B1/de not_active Expired - Lifetime
- 2002-09-12 CA CA2460329A patent/CA2460329C/en not_active Expired - Fee Related
Cited By (1)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| EP1762629A1 (de) | 2005-09-12 | 2007-03-14 | Roche Diagnostics GmbH | Nachweis biologischer DNA |
Also Published As
| Publication number | Publication date |
|---|---|
| ATE432505T1 (de) | 2009-06-15 |
| AU2002335313A1 (en) | 2003-04-01 |
| DE50213578D1 (de) | 2009-07-09 |
| US20040265826A1 (en) | 2004-12-30 |
| JP2005503570A (ja) | 2005-02-03 |
| ATA14442001A (de) | 2003-01-15 |
| WO2003025219A3 (de) | 2004-02-12 |
| CA2460329A1 (en) | 2003-03-27 |
| EP1430437B1 (de) | 2009-05-27 |
| CA2460329C (en) | 2011-04-12 |
| JP4550416B2 (ja) | 2010-09-22 |
| US7720613B2 (en) | 2010-05-18 |
| AT410983B (de) | 2003-09-25 |
| EP1430437A2 (de) | 2004-06-23 |
Similar Documents
| Publication | Publication Date | Title |
|---|---|---|
| DE60219132T2 (de) | Mutationsanalyse mittels PCR und Massenspektrometrie | |
| DE69426731T2 (de) | Verfahren zur massenspektroskopischen sequenzanalyse einer nukleinsäure mittels primerverlängerung | |
| EP1232287B1 (de) | Oligomer-array mit pna- und/oder dna-oligomeren auf einer oberfläche | |
| DE19802905C2 (de) | Verfahren zur bevorzugten Herstellung nur eines Stranges selektierten Genmaterials für massenspektrometrische Messungen | |
| DE10108453A1 (de) | Massenspektrometrische Mutationsanalyse mit photolytisch spaltbaren Primern | |
| WO2003057909A2 (de) | Verfahren zum nachweis von cytosin-methylierungsmustern durch exponentielle ligation hybridisierter sondenoligonukleotide (mla) | |
| WO1999028498A2 (de) | Verfahren zur herstellung komplexer dna-methylierungs-fingerabdrücke | |
| WO1998007885A1 (de) | Verfahren zum nachweis von nukleinsäuren unter ermittlung der masse | |
| EP1294946B1 (de) | Multiplex-sequenzierungsverfahren | |
| DE19853398C1 (de) | Verfahren zur Identifikation von Cytosin-Methylierungsmustern in genomischer DNA | |
| EP1654388A2 (de) | Verfahren zum nachweis von cytosin-methylierungen in dna | |
| EP1228247B1 (de) | Verfahren zur kontrollierbaren durchführung komplexer pcr-amplifikationen | |
| EP0840802B1 (de) | Verfahren zur sequenzierung durch oligomerhybridisierung | |
| AT410983B (de) | Verfahren zur ermittlung der sequenz von nukleinsäuren | |
| DE10112387B4 (de) | Massenspektrometrische Genotypisierung | |
| DE10162536A1 (de) | Evaneszenz-basierendes Multiplex-Sequenzierungsverfahren | |
| EP1369493A1 (de) | Verfahren zur quantitiven Bestimmung des Methylierungsgrades von Cytosinen in CpG-Positionen | |
| EP1309728A2 (de) | Verfahren zur hochparallelen analyse von polymorphismen | |
| DE102005059227A1 (de) | Verfahren zur Bestimmung des Genotyps aus einer biologischen Probe enthaltend Nukleinsäuren unterschiedlicher Individuen | |
| DE19713556A1 (de) | Verfahren zur Identifizierung von eukaryontischen und viralen Spezies durch Bestimmung relativer und absoluter Oligonukleotidhäufigkeiten | |
| DE102008007112B3 (de) | Verfahren zur Sequenzierung eines RNA-Moleküls mittels Massenspektrometrie | |
| DE10051714A1 (de) | Verfahren zur kontrollierbaren Durchführung komplexer PCR-Amplifikationen | |
| EP1811043A1 (de) | Massenspektroskopische Analyse von Nukleinsäurefragmenten | |
| EP0750047A1 (de) | Verwendung der Auftrennung von DNA-Fragmenten unter Anlagerung von polymergekoppelten DNA-Liganden zur Unterscheidung von Organismen, Viren, Plasmiden und Genen und zur Bestimmung ihrer Vielfalt | |
| DE102004049532A1 (de) | Sequenzierung von Nukleinsäuren über Strand Displacement |
Legal Events
| Date | Code | Title | Description |
|---|---|---|---|
| AK | Designated states |
Kind code of ref document: A2 Designated state(s): AE AG AL AM AT AU AZ BA BB BG BY BZ CA CH CN CO CR CU CZ DE DM DZ EC EE ES FI GB GD GE GH HR HU ID IL IN IS JP KE KG KP KR LC LK LR LS LT LU LV MA MD MG MN MW MX MZ NO NZ OM PH PL PT RU SD SE SG SI SK SL TJ TM TN TR TZ UA UG US UZ VN YU ZA ZM |
|
| AL | Designated countries for regional patents |
Kind code of ref document: A2 Designated state(s): GH GM KE LS MW MZ SD SL SZ UG ZM ZW AM AZ BY KG KZ RU TJ TM AT BE BG CH CY CZ DK EE ES FI FR GB GR IE IT LU MC PT SE SK TR BF BJ CF CG CI GA GN GQ GW ML MR NE SN TD TG |
|
| DFPE | Request for preliminary examination filed prior to expiration of 19th month from priority date (pct application filed before 20040101) | ||
| 121 | Ep: the epo has been informed by wipo that ep was designated in this application | ||
| WWE | Wipo information: entry into national phase |
Ref document number: 2002798646 Country of ref document: EP |
|
| WWE | Wipo information: entry into national phase |
Ref document number: 2460329 Country of ref document: CA |
|
| WWE | Wipo information: entry into national phase |
Ref document number: 2003529991 Country of ref document: JP |
|
| WWE | Wipo information: entry into national phase |
Ref document number: 10490074 Country of ref document: US |
|
| WWP | Wipo information: published in national office |
Ref document number: 2002798646 Country of ref document: EP |
|
| REG | Reference to national code |
Ref country code: DE Ref legal event code: 8642 |