ES2656023T3 - Procedure, system and computer-readable medium to determine base information in a predetermined region of the fetus genome - Google Patents
Procedure, system and computer-readable medium to determine base information in a predetermined region of the fetus genome Download PDFInfo
- Publication number
- ES2656023T3 ES2656023T3 ES12876917.1T ES12876917T ES2656023T3 ES 2656023 T3 ES2656023 T3 ES 2656023T3 ES 12876917 T ES12876917 T ES 12876917T ES 2656023 T3 ES2656023 T3 ES 2656023T3
- Authority
- ES
- Spain
- Prior art keywords
- sequencing
- predetermined region
- fetus
- base information
- information
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Active
Links
Classifications
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12Q—MEASURING OR TESTING PROCESSES INVOLVING ENZYMES, NUCLEIC ACIDS OR MICROORGANISMS; COMPOSITIONS OR TEST PAPERS THEREFOR; PROCESSES OF PREPARING SUCH COMPOSITIONS; CONDITION-RESPONSIVE CONTROL IN MICROBIOLOGICAL OR ENZYMOLOGICAL PROCESSES
- C12Q1/00—Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions
- C12Q1/68—Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions involving nucleic acids
- C12Q1/6876—Nucleic acid products used in the analysis of nucleic acids, e.g. primers or probes
- C12Q1/6883—Nucleic acid products used in the analysis of nucleic acids, e.g. primers or probes for diseases caused by alterations of genetic material
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12Q—MEASURING OR TESTING PROCESSES INVOLVING ENZYMES, NUCLEIC ACIDS OR MICROORGANISMS; COMPOSITIONS OR TEST PAPERS THEREFOR; PROCESSES OF PREPARING SUCH COMPOSITIONS; CONDITION-RESPONSIVE CONTROL IN MICROBIOLOGICAL OR ENZYMOLOGICAL PROCESSES
- C12Q1/00—Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions
- C12Q1/68—Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions involving nucleic acids
- C12Q1/6809—Methods for determination or identification of nucleic acids involving differential detection
-
- G—PHYSICS
- G16—INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR SPECIFIC APPLICATION FIELDS
- G16B—BIOINFORMATICS, i.e. INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR GENETIC OR PROTEIN-RELATED DATA PROCESSING IN COMPUTATIONAL MOLECULAR BIOLOGY
- G16B20/00—ICT specially adapted for functional genomics or proteomics, e.g. genotype-phenotype associations
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12Q—MEASURING OR TESTING PROCESSES INVOLVING ENZYMES, NUCLEIC ACIDS OR MICROORGANISMS; COMPOSITIONS OR TEST PAPERS THEREFOR; PROCESSES OF PREPARING SUCH COMPOSITIONS; CONDITION-RESPONSIVE CONTROL IN MICROBIOLOGICAL OR ENZYMOLOGICAL PROCESSES
- C12Q1/00—Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions
- C12Q1/68—Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions involving nucleic acids
- C12Q1/6844—Nucleic acid amplification reactions
- C12Q1/6858—Allele-specific amplification
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12Q—MEASURING OR TESTING PROCESSES INVOLVING ENZYMES, NUCLEIC ACIDS OR MICROORGANISMS; COMPOSITIONS OR TEST PAPERS THEREFOR; PROCESSES OF PREPARING SUCH COMPOSITIONS; CONDITION-RESPONSIVE CONTROL IN MICROBIOLOGICAL OR ENZYMOLOGICAL PROCESSES
- C12Q1/00—Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions
- C12Q1/68—Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions involving nucleic acids
- C12Q1/6869—Methods for sequencing
-
- G—PHYSICS
- G16—INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR SPECIFIC APPLICATION FIELDS
- G16B—BIOINFORMATICS, i.e. INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR GENETIC OR PROTEIN-RELATED DATA PROCESSING IN COMPUTATIONAL MOLECULAR BIOLOGY
- G16B20/00—ICT specially adapted for functional genomics or proteomics, e.g. genotype-phenotype associations
- G16B20/10—Ploidy or copy number detection
-
- G—PHYSICS
- G16—INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR SPECIFIC APPLICATION FIELDS
- G16B—BIOINFORMATICS, i.e. INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR GENETIC OR PROTEIN-RELATED DATA PROCESSING IN COMPUTATIONAL MOLECULAR BIOLOGY
- G16B20/00—ICT specially adapted for functional genomics or proteomics, e.g. genotype-phenotype associations
- G16B20/20—Allele or variant detection, e.g. single nucleotide polymorphism [SNP] detection
-
- G—PHYSICS
- G16—INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR SPECIFIC APPLICATION FIELDS
- G16B—BIOINFORMATICS, i.e. INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR GENETIC OR PROTEIN-RELATED DATA PROCESSING IN COMPUTATIONAL MOLECULAR BIOLOGY
- G16B30/00—ICT specially adapted for sequence analysis involving nucleotides or amino acids
-
- G—PHYSICS
- G16—INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR SPECIFIC APPLICATION FIELDS
- G16B—BIOINFORMATICS, i.e. INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR GENETIC OR PROTEIN-RELATED DATA PROCESSING IN COMPUTATIONAL MOLECULAR BIOLOGY
- G16B30/00—ICT specially adapted for sequence analysis involving nucleotides or amino acids
- G16B30/10—Sequence alignment; Homology search
-
- G—PHYSICS
- G16—INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR SPECIFIC APPLICATION FIELDS
- G16B—BIOINFORMATICS, i.e. INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR GENETIC OR PROTEIN-RELATED DATA PROCESSING IN COMPUTATIONAL MOLECULAR BIOLOGY
- G16B30/00—ICT specially adapted for sequence analysis involving nucleotides or amino acids
- G16B30/20—Sequence assembly
-
- G—PHYSICS
- G16—INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR SPECIFIC APPLICATION FIELDS
- G16B—BIOINFORMATICS, i.e. INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR GENETIC OR PROTEIN-RELATED DATA PROCESSING IN COMPUTATIONAL MOLECULAR BIOLOGY
- G16B40/00—ICT specially adapted for biostatistics; ICT specially adapted for bioinformatics-related machine learning or data mining, e.g. knowledge discovery or pattern finding
-
- G—PHYSICS
- G16—INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR SPECIFIC APPLICATION FIELDS
- G16H—HEALTHCARE INFORMATICS, i.e. INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR THE HANDLING OR PROCESSING OF MEDICAL OR HEALTHCARE DATA
- G16H50/00—ICT specially adapted for medical diagnosis, medical simulation or medical data mining; ICT specially adapted for detecting, monitoring or modelling epidemics or pandemics
- G16H50/20—ICT specially adapted for medical diagnosis, medical simulation or medical data mining; ICT specially adapted for detecting, monitoring or modelling epidemics or pandemics for computer-aided diagnosis, e.g. based on medical expert systems
-
- G—PHYSICS
- G16—INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR SPECIFIC APPLICATION FIELDS
- G16H—HEALTHCARE INFORMATICS, i.e. INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR THE HANDLING OR PROCESSING OF MEDICAL OR HEALTHCARE DATA
- G16H50/00—ICT specially adapted for medical diagnosis, medical simulation or medical data mining; ICT specially adapted for detecting, monitoring or modelling epidemics or pandemics
- G16H50/30—ICT specially adapted for medical diagnosis, medical simulation or medical data mining; ICT specially adapted for detecting, monitoring or modelling epidemics or pandemics for calculating health indices; for individual health risk assessment
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12Q—MEASURING OR TESTING PROCESSES INVOLVING ENZYMES, NUCLEIC ACIDS OR MICROORGANISMS; COMPOSITIONS OR TEST PAPERS THEREFOR; PROCESSES OF PREPARING SUCH COMPOSITIONS; CONDITION-RESPONSIVE CONTROL IN MICROBIOLOGICAL OR ENZYMOLOGICAL PROCESSES
- C12Q2600/00—Oligonucleotides characterized by their use
- C12Q2600/118—Prognosis of disease development
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12Q—MEASURING OR TESTING PROCESSES INVOLVING ENZYMES, NUCLEIC ACIDS OR MICROORGANISMS; COMPOSITIONS OR TEST PAPERS THEREFOR; PROCESSES OF PREPARING SUCH COMPOSITIONS; CONDITION-RESPONSIVE CONTROL IN MICROBIOLOGICAL OR ENZYMOLOGICAL PROCESSES
- C12Q2600/00—Oligonucleotides characterized by their use
- C12Q2600/156—Polymorphic or mutational markers
Landscapes
- Life Sciences & Earth Sciences (AREA)
- Health & Medical Sciences (AREA)
- Engineering & Computer Science (AREA)
- Chemical & Material Sciences (AREA)
- Physics & Mathematics (AREA)
- Proteomics, Peptides & Aminoacids (AREA)
- Bioinformatics & Cheminformatics (AREA)
- General Health & Medical Sciences (AREA)
- Medical Informatics (AREA)
- Analytical Chemistry (AREA)
- Biotechnology (AREA)
- Biophysics (AREA)
- Organic Chemistry (AREA)
- Theoretical Computer Science (AREA)
- Evolutionary Biology (AREA)
- Spectroscopy & Molecular Physics (AREA)
- Bioinformatics & Computational Biology (AREA)
- Genetics & Genomics (AREA)
- Molecular Biology (AREA)
- Wood Science & Technology (AREA)
- Zoology (AREA)
- Public Health (AREA)
- Biomedical Technology (AREA)
- General Engineering & Computer Science (AREA)
- Immunology (AREA)
- Microbiology (AREA)
- Biochemistry (AREA)
- Pathology (AREA)
- Data Mining & Analysis (AREA)
- Databases & Information Systems (AREA)
- Epidemiology (AREA)
- Primary Health Care (AREA)
- Chemical Kinetics & Catalysis (AREA)
- Artificial Intelligence (AREA)
- Bioethics (AREA)
- Computer Vision & Pattern Recognition (AREA)
- Evolutionary Computation (AREA)
- Software Systems (AREA)
- Measuring Or Testing Involving Enzymes Or Micro-Organisms (AREA)
- Apparatus Associated With Microorganisms And Enzymes (AREA)
Abstract
Procedimiento para determinar la información de bases de una región predeterminada en un genoma fetal, que comprende las siguientes etapas: extraer una muestra de ADN genómico de un feto de sangre periférica de una mujer embarazada, construir una biblioteca de secuenciación basándose en una muestra de ADN genómico del feto; someter la biblioteca de secuenciación a secuenciación para obtener un resultado de la secuenciación del feto que consiste en una pluralidad de datos de secuenciación; y determinar la información de bases de la región predeterminada basándose en el resultado de la secuenciación del feto en combinación con la información genética de un individuo relacionado utilizando un modelo oculto de Markov, en el que la información de bases de la región predeterminada comprende la recombinación más posible de haplotipos fetales, y la información de bases de la región predeterminada se obtiene mediante: la construcción de un vector de distribución de probabilidades de un estado inicial y una matriz de transición de la recombinación de haplotipos; la construcción de una matriz de probabilidades de las observaciones; la construcción de una matriz de probabilidades parciales y un cursor de inversión; la determinación de un estado final y el retroceso por una ruta opcional; y la obtención de un resultado para analizar la secuencia de ADN del genoma fetal.Procedure for determining the base information of a predetermined region in a fetal genome, which comprises the following steps: extracting a genomic DNA sample from a peripheral blood fetus of a pregnant woman, constructing a sequencing library based on a DNA sample genomic of the fetus; Sequencing the sequencing library to obtain a fetus sequencing result consisting of a plurality of sequencing data; and determining the base information of the predetermined region based on the result of the sequencing of the fetus in combination with the genetic information of a related individual using a hidden Markov model, in which the base information of the predetermined region comprises recombination most possible of fetal haplotypes, and the baseline information of the predetermined region is obtained by: the construction of a probability distribution vector of an initial state and a transition matrix of the haplotype recombination; the construction of a matrix of probabilities of the observations; the construction of a matrix of partial probabilities and an investment cursor; the determination of a final state and the withdrawal by an optional route; and obtaining a result to analyze the fetal genome DNA sequence.
Description
5 5
15 fifteen
25 25
35 35
45 Four. Five
55 55
65 65
Etapa 5, salida de resultados Stage 5, output of results
De este modo, se puede analizar de manera eficaz la secuencia del genoma fetal. En comparación con otro procedimiento existente de detección prenatal, el procedimiento de la presente divulgación puede tener las siguientes ventajas técnicas, que incorpora principalmente la precisión y cantidad de información genética que puede obtenerse: In this way, the fetal genome sequence can be analyzed effectively. Compared to another existing prenatal screening procedure, the procedure of the present disclosure may have the following technical advantages, which mainly incorporates the precision and amount of genetic information that can be obtained:
1) Según las realizaciones de la presente divulgación, un sitio a detectar no se limita a un sitio parental, para un sitio materno, es decir, un sitio heterocigoto materno, si un feto hereda un sitio de patopoiesia materna también puede detectarse de forma excelente, con una precisión de hasta el 95% o más; y se puede detectar una pluralidad de tipos de anormalidad que agranda el intervalo de detección de la enfermedad. 1) According to the embodiments of the present disclosure, a site to be detected is not limited to a parental site, for a maternal site, that is, a maternal heterozygous site, if a fetus inherits a maternal pathopoietic site it can also be detected excellently , with an accuracy of up to 95% or more; and a plurality of types of abnormality that enlarge the disease detection range can be detected.
2) Según las realizaciones de la presente divulgación, la información de una pluralidad de sitios y enfermedades se puede obtener mediante una sola secuenciación; mientras que las secuencias de genes, que tienen una baja cobertura en el plasma de la mujer embarazada que no se puede determinar con precisión solamente mejorando la profundidad de la secuenciación, se pueden obtener mediante el procedimiento de la presente divulgación con un resultado preciso y fiable. 2) According to the embodiments of the present disclosure, information from a plurality of sites and diseases can be obtained by a single sequencing; while gene sequences, which have a low plasma coverage of the pregnant woman that cannot be determined with precision only by improving the depth of sequencing, can be obtained by the method of the present disclosure with a precise and reliable result .
3. Según las realizaciones de la presente divulgación, se puede realizar una representación gráfica con una enfermedad genética, algunas enfermedades relacionadas se pueden deducir directamente con la información de otros sitios, con una gran cantidad de información obtenida en una vez, lo cual tiene un significado más instructivo para la detección clínica. 3. According to the embodiments of the present disclosure, a graphic representation can be made with a genetic disease, some related diseases can be deduced directly with the information from other sites, with a large amount of information obtained at one time, which has a more instructive meaning for clinical detection.
Además, según las realizaciones de la presente divulgación, el procedimiento de determinación de la información de bases de una región predeterminada en un genoma fetal, no limitado a ciertos sitios polimórficos genéticos, tales como SNP o STR, está adaptado para todos los sitios polimórficos genéticos, el cual se puede utilizar en paralelo para una pluralidad de sitios para verificarse entre sí. Además de aplicar la información genómica de detección no invasiva prenatal de un feto, consiguiendo el propósito de la detección de la enfermedad, el procedimiento de la presente divulgación también se puede utilizar en la identificación de paternidad prenatal no invasiva, es decir, la determinación de la identidad del padre del feto antes del nacimiento, la disposición de asistencia para disputas que implican responsabilidades y obligaciones de crianza, bienes y casos de agresión sexual, etc. Furthermore, according to the embodiments of the present disclosure, the method of determining the base information of a predetermined region in a fetal genome, not limited to certain genetic polymorphic sites, such as SNP or STR, is adapted for all genetic polymorphic sites. , which can be used in parallel for a plurality of sites to verify each other. In addition to applying the genomic information of non-invasive prenatal detection of a fetus, achieving the purpose of the detection of the disease, the method of the present disclosure can also be used in the identification of non-invasive prenatal paternity, that is, the determination of the identity of the father of the fetus before birth, the provision of assistance for disputes involving responsibilities and obligations of parenting, property and cases of sexual assault, etc.
Sistema para determinar la información de bases de una región predeterminada en un genoma fetal System for determining the base information of a predetermined region in a fetal genome
En otro aspecto de la presente divulgación, se da a conocer un sistema para determinar la información de bases de una región predeterminada en un genoma fetal. Según las realizaciones de la presente divulgación, en referencia a la figura 2, el sistema -1000-puede comprender: un aparato de construcción de una biblioteca -100-, un aparato de secuenciación -200-y un aparato de análisis -400-. In another aspect of the present disclosure, a system for determining the base information of a predetermined region in a fetal genome is disclosed. According to the embodiments of the present disclosure, with reference to Figure 2, the system -1000-may comprise: a construction apparatus of a library -100-, a sequencing apparatus -200- and an analysis apparatus -400-.
Según las realizaciones de la presente divulgación, aparato de construcción de una biblioteca -100-está adaptado para construir una biblioteca de secuenciación basándose en una muestra de ADN genómico de un feto. Según las realizaciones de la presente divulgación, el aparato de secuenciación -200-está conectado al aparato de construcción de una biblioteca -100-y está adaptado para someter la biblioteca de secuenciación a secuenciación para obtener un resultado de la secuenciación del feto que consiste en una pluralidad de datos de secuenciación. Según las realizaciones de la presente divulgación, el sistema -1000-puede comprender también un aparato de extracción de muestras de ADN adaptado para extraer la muestra de ADN genómico del feto a partir de sangre periférica de mujer embarazada. De este modo, el sistema puede estar adaptado para la detección prenatal no invasiva. According to the embodiments of the present disclosure, a 100-library building apparatus is adapted to construct a sequencing library based on a genomic DNA sample of a fetus. According to the embodiments of the present disclosure, the sequencing apparatus -200-is connected to the apparatus for building a library -100- and is adapted to subject the sequencing library to sequencing to obtain a result of the sequencing of the fetus consisting of a plurality of sequencing data. According to the embodiments of the present disclosure, the -1000-system may also comprise a DNA sample extraction apparatus adapted to extract the genomic DNA sample from the fetus from peripheral blood of pregnant woman. In this way, the system can be adapted for non-invasive prenatal detection.
Según las realizaciones de la presente divulgación, opcionalmente, el sistema puede comprender también un aparato de alineación -300-. Según las realizaciones de la presente divulgación, el aparato de alineación -300-está conectado al aparato de secuenciación -200-y está adaptado para alinear el resultado de la secuenciación del feto con una secuencia de referencia, para determinar el resultado de la secuenciación que deriva de la región predeterminada. Según las realizaciones de la presente divulgación, los procedimientos y dispositivos para la secuenciación no están sometidos a ningún tipo de restricción especial, incluyendo, pero sin limitarse a los mismos, procedimiento de terminación de cadena (Sanger); es preferente un procedimiento de secuenciación de alto rendimiento. De este modo, utilizando características de secuenciación de alto rendimiento y profundas de estos aparatos, puede mejorarse adicionalmente la eficacia, mediante lo cual se puede mejorar adicionalmente la precisión y exactitud del análisis posterior con los datos de secuenciación, tal como una prueba estadística. El procedimiento de secuenciación de alto rendimiento incluye, pero sin limitarse a las mismas, una tecnología de secuenciación Next-Generation o una tecnología de secuenciación única. La plataforma de secuenciación Next-Generation (Metzker ML. Sequencing technologies-the next generation. Nat Rev Genet. Enero 2010; 11 (1): 31-46) incluye, pero sin limitarse a las mismas, las plataformas de secuenciación Illumina-Solexa (GATM, HiSeq2000TM), etc), ABI-Solid y Roche-454 (pirosecuenciación); la plataforma (tecnología) de secuenciación única incluye, pero sin limitarse a las mismas, secuenciación True Single Molecule DNA de Helicos Company, secuenciación de una única molécula en tiempo real (SMRT®) de Pacific Biosciences Company y la tecnología de secuenciación por nonaporos de Oxford Nanopore According to the embodiments of the present disclosure, optionally, the system may also comprise an alignment apparatus -300-. According to the embodiments of the present disclosure, the alignment apparatus -300-is connected to the sequencing apparatus -200- and is adapted to align the result of the sequencing of the fetus with a reference sequence, to determine the result of the sequencing that derives from the default region. According to the embodiments of the present disclosure, the procedures and devices for sequencing are not subject to any special restriction, including, but not limited to, chain termination procedure (Sanger); a high performance sequencing process is preferred. Thus, using high performance and deep sequencing characteristics of these devices, the efficiency can be further improved, whereby the accuracy and accuracy of the subsequent analysis with the sequencing data, such as a statistical test, can be further improved. The high performance sequencing procedure includes, but is not limited to, a Next-Generation sequencing technology or a unique sequencing technology. The Next-Generation sequencing platform (Metzker ML. Sequencing technologies-the next generation. Nat Rev Genet. January 2010; 11 (1): 31-46) includes, but is not limited to, Illumina-Solexa sequencing platforms (GATM, HiSeq2000TM), etc.), ABI-Solid and Roche-454 (pyrosequencing); the unique sequencing platform (technology) includes, but is not limited to, True Single Molecule DNA sequencing from Helicos Company, real-time single molecule (SMRT®) sequencing from Pacific Biosciences Company, and nonaporous sequencing technology from Oxford Nanopore
7 7
Technologies (Rusk, Nicole (). Cheap Third-Generation Sequencing. Nature Methods 6 (4): 244-245), etc. Con el desarrollo gradual de la tecnología de secuenciación, un experto en la materia puede comprender otros procedimientos de secuenciación y también se pueden utilizar aparatos para la secuenciación del genoma completo. Según los ejemplos específicos de la presente divulgación, la biblioteca de secuenciación del genoma completo se 5 puede someter a secuenciación, como mínimo, mediante un procedimiento seleccionado de Illumina-Solexa, ABI-SOLID, Roche-454 y un aparato de secuenciación de una única molécula. Según las realizaciones de la presente divulgación, un tipo de secuencia de referencia utilizada puede no estar sometido a ningún tipo de restricción especial, la cual puede ser cualquiera de las secuencias conocidas contenidas en una región diana. Según las realizaciones de la presente divulgación, la secuencia de referencia puede utilizar un genoma de referencia humano Technologies (Rusk, Nicole (). Cheap Third-Generation Sequencing. Nature Methods 6 (4): 244-245), etc. With the gradual development of sequencing technology, one skilled in the art can understand other sequencing procedures and apparatus for sequencing the entire genome can also be used. According to the specific examples of the present disclosure, the complete genome sequencing library can be subjected to sequencing, at a minimum, by a method selected from Illumina-Solexa, ABI-SOLID, Roche-454 and a single sequencing apparatus. molecule. According to the embodiments of the present disclosure, a type of reference sequence used may not be subject to any special restriction, which may be any of the known sequences contained in a target region. According to the embodiments of the present disclosure, the reference sequence can use a human reference genome
10 conocido. Por ejemplo, según las realizaciones de la presente divulgación, el genoma de referencia humano es NCBI 36.3, HG18. Además, según las realizaciones de la presente divulgación, los procedimientos de alineación no están sometidos a ninguna restricción especial. Según ejemplos específicos, se puede utilizar SOAP para la alineación. 10 known. For example, according to the embodiments of the present disclosure, the human reference genome is NCBI 36.3, HG18. In addition, according to the embodiments of the present disclosure, the alignment procedures are not subject to any special restrictions. According to specific examples, SOAP can be used for alignment.
15 Según las realizaciones de la presente divulgación, el aparato de análisis -400-está conectado al aparato de secuenciación y está adaptado para determinar la información de bases de la región predeterminada basándose en el resultado de la secuenciación del feto en combinación con información genética de un individuo relacionado utilizando un modelo oculto de Markov. According to the embodiments of the present disclosure, the analysis apparatus -400-is connected to the sequencing apparatus and is adapted to determine the base information of the predetermined region based on the result of the sequencing of the fetus in combination with genetic information of a related individual using a hidden Markov model.
20 Según las realizaciones de la presente divulgación, en el algoritmo de Viterbi, se utiliza 0,25 como una distribución de probabilidades de un estado inicial, se utiliza re/N como una probabilidad de recombinación, siendo re 25∼30, de manera preferente, siendo re 25 y siendo N la longitud de la región predeterminada, 20 According to the embodiments of the present disclosure, in the Viterbi algorithm, 0.25 is used as a probability distribution of an initial state, re / N is used as a probability of recombination, being re 25 re30, preferably , being re 25 and N being the length of the predetermined region,
30 se utiliza como una matriz de transición de la recombinación, siendo pr re/N. Según las realizaciones de la presente divulgación, el aparato de alineación está adaptado para determinar una 30 is used as a transition matrix of recombination, being pr re / N. According to the embodiments of the present disclosure, the alignment apparatus is adapted to determine a
El análisis con datos de secuenciación, que se describe en detalle anteriormente, está también adaptado al sistema The analysis with sequencing data, which is described in detail above, is also adapted to the system
45 para determinar la información de bases de una región predeterminada en un genoma fetal, el cual se omite por razones de brevedad. 45 to determine the base information of a predetermined region in a fetal genome, which is omitted for reasons of brevity.
De este modo, utilizando el sistema se puede implementar de manera eficaz el procedimiento anterior de determinación de la información de bases de una región predeterminada en un genoma fetal, el cual puede Thus, using the system, the above procedure for determining the base information of a predetermined region in a fetal genome can be effectively implemented, which can
50 determinar la secuencia de ácidos nucleicos de una región predeterminada en un genoma fetal gracias al modelo oculto de Markov, por ejemplo utilizando el algoritmo de Viterbi, y con referencia a la información genética de un individuo relacionado, mediante lo cual se puede realizar de manera eficaz una detección genética prenatal con la información genética del genoma fetal. 50 determine the nucleic acid sequence of a predetermined region in a fetal genome thanks to the hidden Markov model, for example using the Viterbi algorithm, and with reference to the genetic information of a related individual, whereby it can be performed in a manner effective a prenatal genetic detection with the genetic information of the fetal genome.
55 Además, según las realizaciones de la presente divulgación, la región predeterminada es un sitio en el que se ha determinado previamente que tiene un polimorfismo genético y el polimorfismo genético es, como mínimo, un polimorfismo seleccionado de polimorfismo de nucleótido único y STR. In addition, according to the embodiments of the present disclosure, the predetermined region is a site where it has been previously determined to have a genetic polymorphism and the genetic polymorphism is at least a polymorphism selected from single nucleotide polymorphism and STR.
El término "conectado" debe entenderse ampliamente, el cual puede referirse a una conexión directa o una conexión 60 indirecta, siempre que se consiga la conexión funcional anterior. The term "connected" should be broadly understood, which may refer to a direct connection or an indirect connection 60, provided the previous functional connection is achieved.
Cabe señalar que un experto en la materia puede entender que las características y ventajas del procedimiento de determinación de la información de bases de una región predeterminada en un genoma fetal descrito anteriormente también pueden adaptarse al sistema para determinar la información de bases de una región predeterminada en un It should be noted that one skilled in the art can understand that the characteristics and advantages of the procedure for determining the base information of a predetermined region in a fetal genome described above can also be adapted to the system to determine the base information of a predetermined region in a
65 genoma fetal, lo cual se omite por razones de brevedad. 65 fetal genome, which is omitted for reasons of brevity.
8 8
- c. C.
- Calcular la información de secuenciación de cada sitio y la construcción de una matriz de probabilidades de las observaciones (otras eran iguales que las descritas en el "procedimiento general"); Calculate the sequencing information of each site and the construction of a probability matrix of the observations (others were the same as those described in the "general procedure");
- d. d.
- La construcción de una matriz de probabilidades parciales y un cursor de inversión (otras eran iguales que las descritas en el "procedimiento general"); The construction of a matrix of partial probabilities and an investment cursor (others were the same as those described in the "general procedure");
- e. and.
- Determinar un estado final y retroceder por una ruta opcional; y Determine an end state and go back through an optional route; Y
- f. F.
- Obtener resultados To get results
Según los resultados del genotipado, la exactitud del mismo se muestra a continuación: According to the results of the genotyping, its accuracy is shown below:
madre mother
total homocigosis heterocigosis número número exactitud número número exactitud total homozygosis heterozygous number number accuracy number number accuracy
número número exactitud number number accuracy
de sitio exacto de sitio exacto from exact site to exact site
de sitio exacto homocigosis 199.552 199.552 100,00% 66.238 63.968 96,57% Exact site homozygosis 199,552 199,552 100.00% 66,238 63,968 96.57%
265.790 263.520 99,15% padre 265,790 263,520 99.15% father
autosoma heterocigosis 65.409 64.735 98,97% 41.849 39.944 95,45% 107.258 104.679 97,60% autosomal heterozygosis 65,409 64,735 98.97% 41,849 39,944 95.45% 107,258 104,679 97.60%
264.961 264.287 99,75% 108.087 103.912 96,14% 264,961 264,287 99.75% 108,087 103,912 96.14%
373.048 368.199 98,70% cromosoma X 4.881 4.881 100,00% 1.718 1.478 86,03% 373,048 368,199 98.70% X chromosome 4,881 4,881 100.00% 1,718 1,478 86.03%
6.599 6.359 96,36% 6,599 6,359 96.36%
10 Aplicabilidad industrial 10 Industrial Applicability
El procedimiento de determinación de la información de bases de una región predeterminada en un genoma fetal, el sistema para determinar la información de bases de una región predeterminada en un genoma fetal y un medio The procedure for determining the base information of a predetermined region in a fetal genome, the system for determining the base information of a predetermined region in a fetal genome and a means
15 legible por ordenador, según la presente invención, se pueden aplicar de manera eficaz en el análisis de la secuencia de ácidos nucleicos de la región predeterminada en el genoma fetal. 15 readable by computer, according to the present invention, can be effectively applied in the analysis of the nucleic acid sequence of the predetermined region in the fetal genome.
La referencia a lo largo de la presente memoria descriptiva a "una realización", "algunas realizaciones", "otro ejemplo", "un ejemplo", "un ejemplo específico" o "algunos ejemplos", significa que un rasgo, estructura, material o 20 característica particular descritos en relación con la realización o ejemplo se incluyen, como mínimo, en una realización o ejemplo de la presente divulgación. De este modo, las apariciones de frases, tales como "en algunas realizaciones", "en una realización", "en otro ejemplo”, “en un ejemplo”, “en un ejemplo específico” o “en algunos ejemplos” en varios lugares a lo largo de la presente memoria descriptiva no necesariamente se refieren a la misma realización o ejemplo de la presente divulgación. Además, los rasgos, estructuras, materiales o características The reference throughout the present specification to "one embodiment", "some embodiments", "another example", "an example", "a specific example" or "some examples", means that a feature, structure, material or 20 particular feature described in connection with the embodiment or example are included, at a minimum, in one embodiment or example of the present disclosure. Thus, the occurrences of phrases, such as "in some embodiments", "in one embodiment", "in another example", "in an example", "in a specific example" or "in some examples" in various places throughout the present specification they do not necessarily refer to the same embodiment or example of the present disclosure In addition, the features, structures, materials or characteristics
25 particulares se pueden combinar en cualquier manera adecuada en una o más realizaciones o ejemplos. Individuals may be combined in any suitable manner in one or more embodiments or examples.
14 14
Claims (1)
Applications Claiming Priority (1)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| PCT/CN2012/075478 WO2013170429A1 (en) | 2012-05-14 | 2012-05-14 | Method, system and computer readable medium for determining base information in predetermined area of fetus genome |
Publications (1)
| Publication Number | Publication Date |
|---|---|
| ES2656023T3 true ES2656023T3 (en) | 2018-02-22 |
Family
ID=49582977
Family Applications (1)
| Application Number | Title | Priority Date | Filing Date |
|---|---|---|---|
| ES12876917.1T Active ES2656023T3 (en) | 2012-05-14 | 2012-05-14 | Procedure, system and computer-readable medium to determine base information in a predetermined region of the fetus genome |
Country Status (11)
| Country | Link |
|---|---|
| US (2) | US20150094210A1 (en) |
| EP (1) | EP2851431B1 (en) |
| JP (1) | JP6045686B2 (en) |
| KR (1) | KR101770460B1 (en) |
| CN (1) | CN104053789B (en) |
| AU (1) | AU2012380221B2 (en) |
| ES (1) | ES2656023T3 (en) |
| PL (1) | PL2851431T3 (en) |
| RU (1) | RU2597981C2 (en) |
| SG (1) | SG11201407515RA (en) |
| WO (1) | WO2013170429A1 (en) |
Families Citing this family (15)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| US20130309666A1 (en) | 2013-01-25 | 2013-11-21 | Sequenom, Inc. | Methods and processes for non-invasive assessment of genetic variations |
| US20160026759A1 (en) * | 2014-07-22 | 2016-01-28 | Yourgene Bioscience | Detecting Chromosomal Aneuploidy |
| WO2017004612A1 (en) * | 2015-07-02 | 2017-01-05 | Arima Genomics, Inc. | Accurate molecular deconvolution of mixtures samples |
| CN106011244B (en) * | 2016-05-31 | 2019-07-16 | 中国人民解放军军事医学科学院放射与辐射医学研究所 | Detect the application of the region 7q21.13 SNP reagent |
| MX2019003344A (en) * | 2016-09-22 | 2019-09-04 | Illumina Inc | Somatic copy number variation detection. |
| CN109996894A (en) * | 2016-11-18 | 2019-07-09 | 香港中文大学 | The antenatal test of the Noninvasive based on general haplotype for single-gene disorder |
| NZ759171A (en) | 2017-11-16 | 2022-05-27 | Illumina Inc | Systems and methods for determining microsatellite instability |
| CN108048541B (en) * | 2018-01-25 | 2020-11-20 | 广州精科医学检验所有限公司 | System for determining fetal alpha thalassemia gene haplotype |
| CN110349631B (en) * | 2019-07-30 | 2021-10-29 | 苏州亿康医学检验有限公司 | Analysis method and device for determining haplotype of offspring object |
| TWI752593B (en) | 2019-08-16 | 2022-01-11 | 香港中文大學 | Determination of base modifications of nucleic acids |
| US20230028790A1 (en) * | 2019-11-29 | 2023-01-26 | GC Genome Corporation | Artificial intelligence-based chromosomal abnormality detection method |
| CN113308548B (en) * | 2021-01-26 | 2023-03-28 | 天津华大医学检验所有限公司 | Method, device and storage medium for detecting fetal gene haplotype |
| CN112885408B (en) * | 2021-02-22 | 2024-10-01 | 中国农业大学 | Method and device for detecting SNP marker loci based on low-depth sequencing |
| CN119301269A (en) * | 2022-05-26 | 2025-01-10 | 深圳华大生命科学研究院 | Fetal genotype detection method based on haplotype |
| CN117392673B (en) * | 2023-12-12 | 2024-02-13 | 深圳赛陆医疗科技有限公司 | Base identification method and device, gene sequencer and medium |
Family Cites Families (7)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| US20130071837A1 (en) * | 2004-10-06 | 2013-03-21 | Stephen N. Winters-Hilt | Method and System for Characterizing or Identifying Molecules and Molecular Mixtures |
| TR201910868T4 (en) * | 2006-02-02 | 2019-08-21 | Univ Leland Stanford Junior | Non-invasive fetal genetic screening with digital analysis. |
| US8003326B2 (en) * | 2008-01-02 | 2011-08-23 | Children's Medical Center Corporation | Method for diagnosing autism spectrum disorder |
| CN102317473A (en) * | 2008-12-11 | 2012-01-11 | 加利福尼亚太平洋生物科学股份有限公司 | Shenzhen tcl new technology co. , ltd |
| LT3783110T (en) | 2009-11-05 | 2023-01-25 | The Chinese University Of Hong Kong | FETAL GENOME ANALYSIS FROM THE BIOLOGICAL SAMPLE OF THE MOTHER |
| US8725422B2 (en) * | 2010-10-13 | 2014-05-13 | Complete Genomics, Inc. | Methods for estimating genome-wide copy number variations |
| CN102127818A (en) * | 2010-12-15 | 2011-07-20 | 张康 | Method for creating fetus DNA library by utilizing peripheral blood of pregnant woman |
-
2012
- 2012-05-14 PL PL12876917T patent/PL2851431T3/en unknown
- 2012-05-14 US US14/395,065 patent/US20150094210A1/en not_active Abandoned
- 2012-05-14 CN CN201280067404.1A patent/CN104053789B/en active Active
- 2012-05-14 WO PCT/CN2012/075478 patent/WO2013170429A1/en not_active Ceased
- 2012-05-14 KR KR1020147031796A patent/KR101770460B1/en active Active
- 2012-05-14 RU RU2014150655/10A patent/RU2597981C2/en active IP Right Revival
- 2012-05-14 EP EP12876917.1A patent/EP2851431B1/en active Active
- 2012-05-14 AU AU2012380221A patent/AU2012380221B2/en active Active
- 2012-05-14 ES ES12876917.1T patent/ES2656023T3/en active Active
- 2012-05-14 SG SG11201407515RA patent/SG11201407515RA/en unknown
- 2012-05-14 JP JP2015511887A patent/JP6045686B2/en active Active
-
2018
- 2018-07-19 US US16/039,543 patent/US20180320235A1/en not_active Abandoned
Also Published As
| Publication number | Publication date |
|---|---|
| US20180320235A1 (en) | 2018-11-08 |
| EP2851431B1 (en) | 2017-12-13 |
| JP6045686B2 (en) | 2016-12-14 |
| RU2597981C2 (en) | 2016-09-20 |
| JP2015525062A (en) | 2015-09-03 |
| WO2013170429A1 (en) | 2013-11-21 |
| RU2014150655A (en) | 2016-07-10 |
| HK1196401A1 (en) | 2014-12-12 |
| CN104053789A (en) | 2014-09-17 |
| SG11201407515RA (en) | 2014-12-30 |
| KR20140146193A (en) | 2014-12-24 |
| KR101770460B1 (en) | 2017-08-22 |
| US20150094210A1 (en) | 2015-04-02 |
| AU2012380221A1 (en) | 2014-11-06 |
| AU2012380221B2 (en) | 2016-09-29 |
| EP2851431A4 (en) | 2016-01-27 |
| PL2851431T3 (en) | 2018-04-30 |
| CN104053789B (en) | 2016-02-10 |
| EP2851431A1 (en) | 2015-03-25 |
Similar Documents
| Publication | Publication Date | Title |
|---|---|---|
| ES2656023T3 (en) | Procedure, system and computer-readable medium to determine base information in a predetermined region of the fetus genome | |
| US12494267B2 (en) | Methods for non-invasive prenatal ploidy calling | |
| ES2770342T3 (en) | Noninvasive Prenatal Paternity Testing Procedures | |
| ES2628874T3 (en) | Fetal genomic analysis from a maternal biological sample | |
| US20220042103A1 (en) | Methods for non-invasive prenatal ploidy calling | |
| AU2011358564B9 (en) | Methods for non-invasive prenatal ploidy calling | |
| ES2913402T3 (en) | Assay systems for genetic analysis | |
| EP2925348B1 (en) | Individualized vaccines for cancer | |
| EP2902500B1 (en) | Methods for non-invasive prenatal ploidy calling | |
| JP2016101168A5 (en) | ||
| WO2013053183A1 (en) | Method and system for genotyping predetermined region in nucleic acid sample | |
| US20130261984A1 (en) | Methods and systems for determining fetal chromosomal abnormalities | |
| CN105385755A (en) | Method for conducting SNP-haplotype analysis by means of multiplex PCR technology | |
| CN119724336A (en) | A sequence graph-based tool for identifying changes in short tandem repeat regions | |
| CN103397089B (en) | Semi-specific amplification primer group, method and kit for quickly detecting chromosome number abnormality | |
| ES2625079T3 (en) | Compositions and methods by highly multiplexed PCR | |
| AU2015252046A1 (en) | Methods for Non-Invasive Prenatal Ploidy Calling | |
| Cantor | have evolved from multi-allelic to 2.4 million bi-allelic | |
| HK1213600B (en) | Methods for non-invasive prenatal ploidy calling | |
| HK1213600A1 (en) | Methods for non-invasive prenatal ploidy calling | |
| HK1193173A (en) | Methods for non-invasive prenatal ploidy calling |