ES2656023T3 - Procedure, system and computer-readable medium to determine base information in a predetermined region of the fetus genome - Google Patents

Procedure, system and computer-readable medium to determine base information in a predetermined region of the fetus genome Download PDF

Info

Publication number
ES2656023T3
ES2656023T3 ES12876917.1T ES12876917T ES2656023T3 ES 2656023 T3 ES2656023 T3 ES 2656023T3 ES 12876917 T ES12876917 T ES 12876917T ES 2656023 T3 ES2656023 T3 ES 2656023T3
Authority
ES
Spain
Prior art keywords
sequencing
predetermined region
fetus
base information
information
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active
Application number
ES12876917.1T
Other languages
Spanish (es)
Inventor
Shengpei CHEN
Huijuan GE
Xuchao LI
Shang YI
Jian Wang
Jun Wang
Huanming Yang
Xiuqing Zhang
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
BGI Genomics Co Ltd
Original Assignee
BGI Genomics Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by BGI Genomics Co Ltd filed Critical BGI Genomics Co Ltd
Application granted granted Critical
Publication of ES2656023T3 publication Critical patent/ES2656023T3/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Classifications

    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12QMEASURING OR TESTING PROCESSES INVOLVING ENZYMES, NUCLEIC ACIDS OR MICROORGANISMS; COMPOSITIONS OR TEST PAPERS THEREFOR; PROCESSES OF PREPARING SUCH COMPOSITIONS; CONDITION-RESPONSIVE CONTROL IN MICROBIOLOGICAL OR ENZYMOLOGICAL PROCESSES
    • C12Q1/00Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions
    • C12Q1/68Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions involving nucleic acids
    • C12Q1/6876Nucleic acid products used in the analysis of nucleic acids, e.g. primers or probes
    • C12Q1/6883Nucleic acid products used in the analysis of nucleic acids, e.g. primers or probes for diseases caused by alterations of genetic material
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12QMEASURING OR TESTING PROCESSES INVOLVING ENZYMES, NUCLEIC ACIDS OR MICROORGANISMS; COMPOSITIONS OR TEST PAPERS THEREFOR; PROCESSES OF PREPARING SUCH COMPOSITIONS; CONDITION-RESPONSIVE CONTROL IN MICROBIOLOGICAL OR ENZYMOLOGICAL PROCESSES
    • C12Q1/00Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions
    • C12Q1/68Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions involving nucleic acids
    • C12Q1/6809Methods for determination or identification of nucleic acids involving differential detection
    • GPHYSICS
    • G16INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR SPECIFIC APPLICATION FIELDS
    • G16BBIOINFORMATICS, i.e. INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR GENETIC OR PROTEIN-RELATED DATA PROCESSING IN COMPUTATIONAL MOLECULAR BIOLOGY
    • G16B20/00ICT specially adapted for functional genomics or proteomics, e.g. genotype-phenotype associations
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12QMEASURING OR TESTING PROCESSES INVOLVING ENZYMES, NUCLEIC ACIDS OR MICROORGANISMS; COMPOSITIONS OR TEST PAPERS THEREFOR; PROCESSES OF PREPARING SUCH COMPOSITIONS; CONDITION-RESPONSIVE CONTROL IN MICROBIOLOGICAL OR ENZYMOLOGICAL PROCESSES
    • C12Q1/00Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions
    • C12Q1/68Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions involving nucleic acids
    • C12Q1/6844Nucleic acid amplification reactions
    • C12Q1/6858Allele-specific amplification
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12QMEASURING OR TESTING PROCESSES INVOLVING ENZYMES, NUCLEIC ACIDS OR MICROORGANISMS; COMPOSITIONS OR TEST PAPERS THEREFOR; PROCESSES OF PREPARING SUCH COMPOSITIONS; CONDITION-RESPONSIVE CONTROL IN MICROBIOLOGICAL OR ENZYMOLOGICAL PROCESSES
    • C12Q1/00Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions
    • C12Q1/68Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions involving nucleic acids
    • C12Q1/6869Methods for sequencing
    • GPHYSICS
    • G16INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR SPECIFIC APPLICATION FIELDS
    • G16BBIOINFORMATICS, i.e. INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR GENETIC OR PROTEIN-RELATED DATA PROCESSING IN COMPUTATIONAL MOLECULAR BIOLOGY
    • G16B20/00ICT specially adapted for functional genomics or proteomics, e.g. genotype-phenotype associations
    • G16B20/10Ploidy or copy number detection
    • GPHYSICS
    • G16INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR SPECIFIC APPLICATION FIELDS
    • G16BBIOINFORMATICS, i.e. INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR GENETIC OR PROTEIN-RELATED DATA PROCESSING IN COMPUTATIONAL MOLECULAR BIOLOGY
    • G16B20/00ICT specially adapted for functional genomics or proteomics, e.g. genotype-phenotype associations
    • G16B20/20Allele or variant detection, e.g. single nucleotide polymorphism [SNP] detection
    • GPHYSICS
    • G16INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR SPECIFIC APPLICATION FIELDS
    • G16BBIOINFORMATICS, i.e. INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR GENETIC OR PROTEIN-RELATED DATA PROCESSING IN COMPUTATIONAL MOLECULAR BIOLOGY
    • G16B30/00ICT specially adapted for sequence analysis involving nucleotides or amino acids
    • GPHYSICS
    • G16INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR SPECIFIC APPLICATION FIELDS
    • G16BBIOINFORMATICS, i.e. INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR GENETIC OR PROTEIN-RELATED DATA PROCESSING IN COMPUTATIONAL MOLECULAR BIOLOGY
    • G16B30/00ICT specially adapted for sequence analysis involving nucleotides or amino acids
    • G16B30/10Sequence alignment; Homology search
    • GPHYSICS
    • G16INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR SPECIFIC APPLICATION FIELDS
    • G16BBIOINFORMATICS, i.e. INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR GENETIC OR PROTEIN-RELATED DATA PROCESSING IN COMPUTATIONAL MOLECULAR BIOLOGY
    • G16B30/00ICT specially adapted for sequence analysis involving nucleotides or amino acids
    • G16B30/20Sequence assembly
    • GPHYSICS
    • G16INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR SPECIFIC APPLICATION FIELDS
    • G16BBIOINFORMATICS, i.e. INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR GENETIC OR PROTEIN-RELATED DATA PROCESSING IN COMPUTATIONAL MOLECULAR BIOLOGY
    • G16B40/00ICT specially adapted for biostatistics; ICT specially adapted for bioinformatics-related machine learning or data mining, e.g. knowledge discovery or pattern finding
    • GPHYSICS
    • G16INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR SPECIFIC APPLICATION FIELDS
    • G16HHEALTHCARE INFORMATICS, i.e. INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR THE HANDLING OR PROCESSING OF MEDICAL OR HEALTHCARE DATA
    • G16H50/00ICT specially adapted for medical diagnosis, medical simulation or medical data mining; ICT specially adapted for detecting, monitoring or modelling epidemics or pandemics
    • G16H50/20ICT specially adapted for medical diagnosis, medical simulation or medical data mining; ICT specially adapted for detecting, monitoring or modelling epidemics or pandemics for computer-aided diagnosis, e.g. based on medical expert systems
    • GPHYSICS
    • G16INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR SPECIFIC APPLICATION FIELDS
    • G16HHEALTHCARE INFORMATICS, i.e. INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR THE HANDLING OR PROCESSING OF MEDICAL OR HEALTHCARE DATA
    • G16H50/00ICT specially adapted for medical diagnosis, medical simulation or medical data mining; ICT specially adapted for detecting, monitoring or modelling epidemics or pandemics
    • G16H50/30ICT specially adapted for medical diagnosis, medical simulation or medical data mining; ICT specially adapted for detecting, monitoring or modelling epidemics or pandemics for calculating health indices; for individual health risk assessment
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12QMEASURING OR TESTING PROCESSES INVOLVING ENZYMES, NUCLEIC ACIDS OR MICROORGANISMS; COMPOSITIONS OR TEST PAPERS THEREFOR; PROCESSES OF PREPARING SUCH COMPOSITIONS; CONDITION-RESPONSIVE CONTROL IN MICROBIOLOGICAL OR ENZYMOLOGICAL PROCESSES
    • C12Q2600/00Oligonucleotides characterized by their use
    • C12Q2600/118Prognosis of disease development
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12QMEASURING OR TESTING PROCESSES INVOLVING ENZYMES, NUCLEIC ACIDS OR MICROORGANISMS; COMPOSITIONS OR TEST PAPERS THEREFOR; PROCESSES OF PREPARING SUCH COMPOSITIONS; CONDITION-RESPONSIVE CONTROL IN MICROBIOLOGICAL OR ENZYMOLOGICAL PROCESSES
    • C12Q2600/00Oligonucleotides characterized by their use
    • C12Q2600/156Polymorphic or mutational markers

Landscapes

  • Life Sciences & Earth Sciences (AREA)
  • Health & Medical Sciences (AREA)
  • Engineering & Computer Science (AREA)
  • Chemical & Material Sciences (AREA)
  • Physics & Mathematics (AREA)
  • Proteomics, Peptides & Aminoacids (AREA)
  • Bioinformatics & Cheminformatics (AREA)
  • General Health & Medical Sciences (AREA)
  • Medical Informatics (AREA)
  • Analytical Chemistry (AREA)
  • Biotechnology (AREA)
  • Biophysics (AREA)
  • Organic Chemistry (AREA)
  • Theoretical Computer Science (AREA)
  • Evolutionary Biology (AREA)
  • Spectroscopy & Molecular Physics (AREA)
  • Bioinformatics & Computational Biology (AREA)
  • Genetics & Genomics (AREA)
  • Molecular Biology (AREA)
  • Wood Science & Technology (AREA)
  • Zoology (AREA)
  • Public Health (AREA)
  • Biomedical Technology (AREA)
  • General Engineering & Computer Science (AREA)
  • Immunology (AREA)
  • Microbiology (AREA)
  • Biochemistry (AREA)
  • Pathology (AREA)
  • Data Mining & Analysis (AREA)
  • Databases & Information Systems (AREA)
  • Epidemiology (AREA)
  • Primary Health Care (AREA)
  • Chemical Kinetics & Catalysis (AREA)
  • Artificial Intelligence (AREA)
  • Bioethics (AREA)
  • Computer Vision & Pattern Recognition (AREA)
  • Evolutionary Computation (AREA)
  • Software Systems (AREA)
  • Measuring Or Testing Involving Enzymes Or Micro-Organisms (AREA)
  • Apparatus Associated With Microorganisms And Enzymes (AREA)

Abstract

Procedimiento para determinar la información de bases de una región predeterminada en un genoma fetal, que comprende las siguientes etapas: extraer una muestra de ADN genómico de un feto de sangre periférica de una mujer embarazada, construir una biblioteca de secuenciación basándose en una muestra de ADN genómico del feto; someter la biblioteca de secuenciación a secuenciación para obtener un resultado de la secuenciación del feto que consiste en una pluralidad de datos de secuenciación; y determinar la información de bases de la región predeterminada basándose en el resultado de la secuenciación del feto en combinación con la información genética de un individuo relacionado utilizando un modelo oculto de Markov, en el que la información de bases de la región predeterminada comprende la recombinación más posible de haplotipos fetales, y la información de bases de la región predeterminada se obtiene mediante: la construcción de un vector de distribución de probabilidades de un estado inicial y una matriz de transición de la recombinación de haplotipos; la construcción de una matriz de probabilidades de las observaciones; la construcción de una matriz de probabilidades parciales y un cursor de inversión; la determinación de un estado final y el retroceso por una ruta opcional; y la obtención de un resultado para analizar la secuencia de ADN del genoma fetal.Procedure for determining the base information of a predetermined region in a fetal genome, which comprises the following steps: extracting a genomic DNA sample from a peripheral blood fetus of a pregnant woman, constructing a sequencing library based on a DNA sample genomic of the fetus; Sequencing the sequencing library to obtain a fetus sequencing result consisting of a plurality of sequencing data; and determining the base information of the predetermined region based on the result of the sequencing of the fetus in combination with the genetic information of a related individual using a hidden Markov model, in which the base information of the predetermined region comprises recombination most possible of fetal haplotypes, and the baseline information of the predetermined region is obtained by: the construction of a probability distribution vector of an initial state and a transition matrix of the haplotype recombination; the construction of a matrix of probabilities of the observations; the construction of a matrix of partial probabilities and an investment cursor; the determination of a final state and the withdrawal by an optional route; and obtaining a result to analyze the fetal genome DNA sequence.

Description

imagen1image 1

imagen2image2

imagen3image3

imagen4image4

imagen5image5

5 5

15 fifteen

25 25

35 35

45 Four. Five

55 55

65 65

Etapa 5, salida de resultados Stage 5, output of results

De este modo, se puede analizar de manera eficaz la secuencia del genoma fetal. En comparación con otro procedimiento existente de detección prenatal, el procedimiento de la presente divulgación puede tener las siguientes ventajas técnicas, que incorpora principalmente la precisión y cantidad de información genética que puede obtenerse: In this way, the fetal genome sequence can be analyzed effectively. Compared to another existing prenatal screening procedure, the procedure of the present disclosure may have the following technical advantages, which mainly incorporates the precision and amount of genetic information that can be obtained:

1) Según las realizaciones de la presente divulgación, un sitio a detectar no se limita a un sitio parental, para un sitio materno, es decir, un sitio heterocigoto materno, si un feto hereda un sitio de patopoiesia materna también puede detectarse de forma excelente, con una precisión de hasta el 95% o más; y se puede detectar una pluralidad de tipos de anormalidad que agranda el intervalo de detección de la enfermedad. 1) According to the embodiments of the present disclosure, a site to be detected is not limited to a parental site, for a maternal site, that is, a maternal heterozygous site, if a fetus inherits a maternal pathopoietic site it can also be detected excellently , with an accuracy of up to 95% or more; and a plurality of types of abnormality that enlarge the disease detection range can be detected.

2) Según las realizaciones de la presente divulgación, la información de una pluralidad de sitios y enfermedades se puede obtener mediante una sola secuenciación; mientras que las secuencias de genes, que tienen una baja cobertura en el plasma de la mujer embarazada que no se puede determinar con precisión solamente mejorando la profundidad de la secuenciación, se pueden obtener mediante el procedimiento de la presente divulgación con un resultado preciso y fiable. 2) According to the embodiments of the present disclosure, information from a plurality of sites and diseases can be obtained by a single sequencing; while gene sequences, which have a low plasma coverage of the pregnant woman that cannot be determined with precision only by improving the depth of sequencing, can be obtained by the method of the present disclosure with a precise and reliable result .

3. Según las realizaciones de la presente divulgación, se puede realizar una representación gráfica con una enfermedad genética, algunas enfermedades relacionadas se pueden deducir directamente con la información de otros sitios, con una gran cantidad de información obtenida en una vez, lo cual tiene un significado más instructivo para la detección clínica. 3. According to the embodiments of the present disclosure, a graphic representation can be made with a genetic disease, some related diseases can be deduced directly with the information from other sites, with a large amount of information obtained at one time, which has a more instructive meaning for clinical detection.

Además, según las realizaciones de la presente divulgación, el procedimiento de determinación de la información de bases de una región predeterminada en un genoma fetal, no limitado a ciertos sitios polimórficos genéticos, tales como SNP o STR, está adaptado para todos los sitios polimórficos genéticos, el cual se puede utilizar en paralelo para una pluralidad de sitios para verificarse entre sí. Además de aplicar la información genómica de detección no invasiva prenatal de un feto, consiguiendo el propósito de la detección de la enfermedad, el procedimiento de la presente divulgación también se puede utilizar en la identificación de paternidad prenatal no invasiva, es decir, la determinación de la identidad del padre del feto antes del nacimiento, la disposición de asistencia para disputas que implican responsabilidades y obligaciones de crianza, bienes y casos de agresión sexual, etc. Furthermore, according to the embodiments of the present disclosure, the method of determining the base information of a predetermined region in a fetal genome, not limited to certain genetic polymorphic sites, such as SNP or STR, is adapted for all genetic polymorphic sites. , which can be used in parallel for a plurality of sites to verify each other. In addition to applying the genomic information of non-invasive prenatal detection of a fetus, achieving the purpose of the detection of the disease, the method of the present disclosure can also be used in the identification of non-invasive prenatal paternity, that is, the determination of the identity of the father of the fetus before birth, the provision of assistance for disputes involving responsibilities and obligations of parenting, property and cases of sexual assault, etc.

Sistema para determinar la información de bases de una región predeterminada en un genoma fetal System for determining the base information of a predetermined region in a fetal genome

En otro aspecto de la presente divulgación, se da a conocer un sistema para determinar la información de bases de una región predeterminada en un genoma fetal. Según las realizaciones de la presente divulgación, en referencia a la figura 2, el sistema -1000-puede comprender: un aparato de construcción de una biblioteca -100-, un aparato de secuenciación -200-y un aparato de análisis -400-. In another aspect of the present disclosure, a system for determining the base information of a predetermined region in a fetal genome is disclosed. According to the embodiments of the present disclosure, with reference to Figure 2, the system -1000-may comprise: a construction apparatus of a library -100-, a sequencing apparatus -200- and an analysis apparatus -400-.

Según las realizaciones de la presente divulgación, aparato de construcción de una biblioteca -100-está adaptado para construir una biblioteca de secuenciación basándose en una muestra de ADN genómico de un feto. Según las realizaciones de la presente divulgación, el aparato de secuenciación -200-está conectado al aparato de construcción de una biblioteca -100-y está adaptado para someter la biblioteca de secuenciación a secuenciación para obtener un resultado de la secuenciación del feto que consiste en una pluralidad de datos de secuenciación. Según las realizaciones de la presente divulgación, el sistema -1000-puede comprender también un aparato de extracción de muestras de ADN adaptado para extraer la muestra de ADN genómico del feto a partir de sangre periférica de mujer embarazada. De este modo, el sistema puede estar adaptado para la detección prenatal no invasiva. According to the embodiments of the present disclosure, a 100-library building apparatus is adapted to construct a sequencing library based on a genomic DNA sample of a fetus. According to the embodiments of the present disclosure, the sequencing apparatus -200-is connected to the apparatus for building a library -100- and is adapted to subject the sequencing library to sequencing to obtain a result of the sequencing of the fetus consisting of a plurality of sequencing data. According to the embodiments of the present disclosure, the -1000-system may also comprise a DNA sample extraction apparatus adapted to extract the genomic DNA sample from the fetus from peripheral blood of pregnant woman. In this way, the system can be adapted for non-invasive prenatal detection.

Según las realizaciones de la presente divulgación, opcionalmente, el sistema puede comprender también un aparato de alineación -300-. Según las realizaciones de la presente divulgación, el aparato de alineación -300-está conectado al aparato de secuenciación -200-y está adaptado para alinear el resultado de la secuenciación del feto con una secuencia de referencia, para determinar el resultado de la secuenciación que deriva de la región predeterminada. Según las realizaciones de la presente divulgación, los procedimientos y dispositivos para la secuenciación no están sometidos a ningún tipo de restricción especial, incluyendo, pero sin limitarse a los mismos, procedimiento de terminación de cadena (Sanger); es preferente un procedimiento de secuenciación de alto rendimiento. De este modo, utilizando características de secuenciación de alto rendimiento y profundas de estos aparatos, puede mejorarse adicionalmente la eficacia, mediante lo cual se puede mejorar adicionalmente la precisión y exactitud del análisis posterior con los datos de secuenciación, tal como una prueba estadística. El procedimiento de secuenciación de alto rendimiento incluye, pero sin limitarse a las mismas, una tecnología de secuenciación Next-Generation o una tecnología de secuenciación única. La plataforma de secuenciación Next-Generation (Metzker ML. Sequencing technologies-the next generation. Nat Rev Genet. Enero 2010; 11 (1): 31-46) incluye, pero sin limitarse a las mismas, las plataformas de secuenciación Illumina-Solexa (GATM, HiSeq2000TM), etc), ABI-Solid y Roche-454 (pirosecuenciación); la plataforma (tecnología) de secuenciación única incluye, pero sin limitarse a las mismas, secuenciación True Single Molecule DNA de Helicos Company, secuenciación de una única molécula en tiempo real (SMRT®) de Pacific Biosciences Company y la tecnología de secuenciación por nonaporos de Oxford Nanopore According to the embodiments of the present disclosure, optionally, the system may also comprise an alignment apparatus -300-. According to the embodiments of the present disclosure, the alignment apparatus -300-is connected to the sequencing apparatus -200- and is adapted to align the result of the sequencing of the fetus with a reference sequence, to determine the result of the sequencing that derives from the default region. According to the embodiments of the present disclosure, the procedures and devices for sequencing are not subject to any special restriction, including, but not limited to, chain termination procedure (Sanger); a high performance sequencing process is preferred. Thus, using high performance and deep sequencing characteristics of these devices, the efficiency can be further improved, whereby the accuracy and accuracy of the subsequent analysis with the sequencing data, such as a statistical test, can be further improved. The high performance sequencing procedure includes, but is not limited to, a Next-Generation sequencing technology or a unique sequencing technology. The Next-Generation sequencing platform (Metzker ML. Sequencing technologies-the next generation. Nat Rev Genet. January 2010; 11 (1): 31-46) includes, but is not limited to, Illumina-Solexa sequencing platforms (GATM, HiSeq2000TM), etc.), ABI-Solid and Roche-454 (pyrosequencing); the unique sequencing platform (technology) includes, but is not limited to, True Single Molecule DNA sequencing from Helicos Company, real-time single molecule (SMRT®) sequencing from Pacific Biosciences Company, and nonaporous sequencing technology from Oxford Nanopore

7 7

Technologies (Rusk, Nicole (). Cheap Third-Generation Sequencing. Nature Methods 6 (4): 244-245), etc. Con el desarrollo gradual de la tecnología de secuenciación, un experto en la materia puede comprender otros procedimientos de secuenciación y también se pueden utilizar aparatos para la secuenciación del genoma completo. Según los ejemplos específicos de la presente divulgación, la biblioteca de secuenciación del genoma completo se 5 puede someter a secuenciación, como mínimo, mediante un procedimiento seleccionado de Illumina-Solexa, ABI-SOLID, Roche-454 y un aparato de secuenciación de una única molécula. Según las realizaciones de la presente divulgación, un tipo de secuencia de referencia utilizada puede no estar sometido a ningún tipo de restricción especial, la cual puede ser cualquiera de las secuencias conocidas contenidas en una región diana. Según las realizaciones de la presente divulgación, la secuencia de referencia puede utilizar un genoma de referencia humano Technologies (Rusk, Nicole (). Cheap Third-Generation Sequencing. Nature Methods 6 (4): 244-245), etc. With the gradual development of sequencing technology, one skilled in the art can understand other sequencing procedures and apparatus for sequencing the entire genome can also be used. According to the specific examples of the present disclosure, the complete genome sequencing library can be subjected to sequencing, at a minimum, by a method selected from Illumina-Solexa, ABI-SOLID, Roche-454 and a single sequencing apparatus. molecule. According to the embodiments of the present disclosure, a type of reference sequence used may not be subject to any special restriction, which may be any of the known sequences contained in a target region. According to the embodiments of the present disclosure, the reference sequence can use a human reference genome

10 conocido. Por ejemplo, según las realizaciones de la presente divulgación, el genoma de referencia humano es NCBI 36.3, HG18. Además, según las realizaciones de la presente divulgación, los procedimientos de alineación no están sometidos a ninguna restricción especial. Según ejemplos específicos, se puede utilizar SOAP para la alineación. 10 known. For example, according to the embodiments of the present disclosure, the human reference genome is NCBI 36.3, HG18. In addition, according to the embodiments of the present disclosure, the alignment procedures are not subject to any special restrictions. According to specific examples, SOAP can be used for alignment.

15 Según las realizaciones de la presente divulgación, el aparato de análisis -400-está conectado al aparato de secuenciación y está adaptado para determinar la información de bases de la región predeterminada basándose en el resultado de la secuenciación del feto en combinación con información genética de un individuo relacionado utilizando un modelo oculto de Markov. According to the embodiments of the present disclosure, the analysis apparatus -400-is connected to the sequencing apparatus and is adapted to determine the base information of the predetermined region based on the result of the sequencing of the fetus in combination with genetic information of a related individual using a hidden Markov model.

20 Según las realizaciones de la presente divulgación, en el algoritmo de Viterbi, se utiliza 0,25 como una distribución de probabilidades de un estado inicial, se utiliza re/N como una probabilidad de recombinación, siendo re 25∼30, de manera preferente, siendo re 25 y siendo N la longitud de la región predeterminada, 20 According to the embodiments of the present disclosure, in the Viterbi algorithm, 0.25 is used as a probability distribution of an initial state, re / N is used as a probability of recombination, being re 25 re30, preferably , being re 25 and N being the length of the predetermined region,

imagen6image6

30 se utiliza como una matriz de transición de la recombinación, siendo pr re/N. Según las realizaciones de la presente divulgación, el aparato de alineación está adaptado para determinar una 30 is used as a transition matrix of recombination, being pr re / N. According to the embodiments of the present disclosure, the alignment apparatus is adapted to determine a

imagen7image7

El análisis con datos de secuenciación, que se describe en detalle anteriormente, está también adaptado al sistema The analysis with sequencing data, which is described in detail above, is also adapted to the system

45 para determinar la información de bases de una región predeterminada en un genoma fetal, el cual se omite por razones de brevedad. 45 to determine the base information of a predetermined region in a fetal genome, which is omitted for reasons of brevity.

De este modo, utilizando el sistema se puede implementar de manera eficaz el procedimiento anterior de determinación de la información de bases de una región predeterminada en un genoma fetal, el cual puede Thus, using the system, the above procedure for determining the base information of a predetermined region in a fetal genome can be effectively implemented, which can

50 determinar la secuencia de ácidos nucleicos de una región predeterminada en un genoma fetal gracias al modelo oculto de Markov, por ejemplo utilizando el algoritmo de Viterbi, y con referencia a la información genética de un individuo relacionado, mediante lo cual se puede realizar de manera eficaz una detección genética prenatal con la información genética del genoma fetal. 50 determine the nucleic acid sequence of a predetermined region in a fetal genome thanks to the hidden Markov model, for example using the Viterbi algorithm, and with reference to the genetic information of a related individual, whereby it can be performed in a manner effective a prenatal genetic detection with the genetic information of the fetal genome.

55 Además, según las realizaciones de la presente divulgación, la región predeterminada es un sitio en el que se ha determinado previamente que tiene un polimorfismo genético y el polimorfismo genético es, como mínimo, un polimorfismo seleccionado de polimorfismo de nucleótido único y STR. In addition, according to the embodiments of the present disclosure, the predetermined region is a site where it has been previously determined to have a genetic polymorphism and the genetic polymorphism is at least a polymorphism selected from single nucleotide polymorphism and STR.

El término "conectado" debe entenderse ampliamente, el cual puede referirse a una conexión directa o una conexión 60 indirecta, siempre que se consiga la conexión funcional anterior. The term "connected" should be broadly understood, which may refer to a direct connection or an indirect connection 60, provided the previous functional connection is achieved.

Cabe señalar que un experto en la materia puede entender que las características y ventajas del procedimiento de determinación de la información de bases de una región predeterminada en un genoma fetal descrito anteriormente también pueden adaptarse al sistema para determinar la información de bases de una región predeterminada en un It should be noted that one skilled in the art can understand that the characteristics and advantages of the procedure for determining the base information of a predetermined region in a fetal genome described above can also be adapted to the system to determine the base information of a predetermined region in a

65 genoma fetal, lo cual se omite por razones de brevedad. 65 fetal genome, which is omitted for reasons of brevity.

8 8

imagen8image8

imagen9image9

imagen10image10

imagen11image11

imagen12image12

c. C.
Calcular la información de secuenciación de cada sitio y la construcción de una matriz de probabilidades de las observaciones (otras eran iguales que las descritas en el "procedimiento general"); Calculate the sequencing information of each site and the construction of a probability matrix of the observations (others were the same as those described in the "general procedure");

d. d.
La construcción de una matriz de probabilidades parciales y un cursor de inversión (otras eran iguales que las descritas en el "procedimiento general"); The construction of a matrix of partial probabilities and an investment cursor (others were the same as those described in the "general procedure");

e. and.
Determinar un estado final y retroceder por una ruta opcional; y Determine an end state and go back through an optional route; Y

f. F.
Obtener resultados To get results

Según los resultados del genotipado, la exactitud del mismo se muestra a continuación: According to the results of the genotyping, its accuracy is shown below:

madre mother

total homocigosis heterocigosis número número exactitud número número exactitud total homozygosis heterozygous number number accuracy number number accuracy

número número exactitud number number accuracy

de sitio exacto de sitio exacto from exact site to exact site

de sitio exacto homocigosis 199.552 199.552 100,00% 66.238 63.968 96,57% Exact site homozygosis 199,552 199,552 100.00% 66,238 63,968 96.57%

265.790 263.520 99,15% padre 265,790 263,520 99.15% father

autosoma heterocigosis 65.409 64.735 98,97% 41.849 39.944 95,45% 107.258 104.679 97,60% autosomal heterozygosis 65,409 64,735 98.97% 41,849 39,944 95.45% 107,258 104,679 97.60%

264.961 264.287 99,75% 108.087 103.912 96,14% 264,961 264,287 99.75% 108,087 103,912 96.14%

373.048 368.199 98,70% cromosoma X 4.881 4.881 100,00% 1.718 1.478 86,03% 373,048 368,199 98.70% X chromosome 4,881 4,881 100.00% 1,718 1,478 86.03%

6.599 6.359 96,36% 6,599 6,359 96.36%

10 Aplicabilidad industrial 10 Industrial Applicability

El procedimiento de determinación de la información de bases de una región predeterminada en un genoma fetal, el sistema para determinar la información de bases de una región predeterminada en un genoma fetal y un medio The procedure for determining the base information of a predetermined region in a fetal genome, the system for determining the base information of a predetermined region in a fetal genome and a means

15 legible por ordenador, según la presente invención, se pueden aplicar de manera eficaz en el análisis de la secuencia de ácidos nucleicos de la región predeterminada en el genoma fetal. 15 readable by computer, according to the present invention, can be effectively applied in the analysis of the nucleic acid sequence of the predetermined region in the fetal genome.

La referencia a lo largo de la presente memoria descriptiva a "una realización", "algunas realizaciones", "otro ejemplo", "un ejemplo", "un ejemplo específico" o "algunos ejemplos", significa que un rasgo, estructura, material o 20 característica particular descritos en relación con la realización o ejemplo se incluyen, como mínimo, en una realización o ejemplo de la presente divulgación. De este modo, las apariciones de frases, tales como "en algunas realizaciones", "en una realización", "en otro ejemplo”, “en un ejemplo”, “en un ejemplo específico” o “en algunos ejemplos” en varios lugares a lo largo de la presente memoria descriptiva no necesariamente se refieren a la misma realización o ejemplo de la presente divulgación. Además, los rasgos, estructuras, materiales o características The reference throughout the present specification to "one embodiment", "some embodiments", "another example", "an example", "a specific example" or "some examples", means that a feature, structure, material or 20 particular feature described in connection with the embodiment or example are included, at a minimum, in one embodiment or example of the present disclosure. Thus, the occurrences of phrases, such as "in some embodiments", "in one embodiment", "in another example", "in an example", "in a specific example" or "in some examples" in various places throughout the present specification they do not necessarily refer to the same embodiment or example of the present disclosure In addition, the features, structures, materials or characteristics

25 particulares se pueden combinar en cualquier manera adecuada en una o más realizaciones o ejemplos. Individuals may be combined in any suitable manner in one or more embodiments or examples.

14 14

Claims (1)

imagen1image 1 imagen2image2 imagen3image3
ES12876917.1T 2012-05-14 2012-05-14 Procedure, system and computer-readable medium to determine base information in a predetermined region of the fetus genome Active ES2656023T3 (en)

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
PCT/CN2012/075478 WO2013170429A1 (en) 2012-05-14 2012-05-14 Method, system and computer readable medium for determining base information in predetermined area of fetus genome

Publications (1)

Publication Number Publication Date
ES2656023T3 true ES2656023T3 (en) 2018-02-22

Family

ID=49582977

Family Applications (1)

Application Number Title Priority Date Filing Date
ES12876917.1T Active ES2656023T3 (en) 2012-05-14 2012-05-14 Procedure, system and computer-readable medium to determine base information in a predetermined region of the fetus genome

Country Status (11)

Country Link
US (2) US20150094210A1 (en)
EP (1) EP2851431B1 (en)
JP (1) JP6045686B2 (en)
KR (1) KR101770460B1 (en)
CN (1) CN104053789B (en)
AU (1) AU2012380221B2 (en)
ES (1) ES2656023T3 (en)
PL (1) PL2851431T3 (en)
RU (1) RU2597981C2 (en)
SG (1) SG11201407515RA (en)
WO (1) WO2013170429A1 (en)

Families Citing this family (15)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20130309666A1 (en) 2013-01-25 2013-11-21 Sequenom, Inc. Methods and processes for non-invasive assessment of genetic variations
US20160026759A1 (en) * 2014-07-22 2016-01-28 Yourgene Bioscience Detecting Chromosomal Aneuploidy
WO2017004612A1 (en) * 2015-07-02 2017-01-05 Arima Genomics, Inc. Accurate molecular deconvolution of mixtures samples
CN106011244B (en) * 2016-05-31 2019-07-16 中国人民解放军军事医学科学院放射与辐射医学研究所 Detect the application of the region 7q21.13 SNP reagent
MX2019003344A (en) * 2016-09-22 2019-09-04 Illumina Inc Somatic copy number variation detection.
CN109996894A (en) * 2016-11-18 2019-07-09 香港中文大学 The antenatal test of the Noninvasive based on general haplotype for single-gene disorder
NZ759171A (en) 2017-11-16 2022-05-27 Illumina Inc Systems and methods for determining microsatellite instability
CN108048541B (en) * 2018-01-25 2020-11-20 广州精科医学检验所有限公司 System for determining fetal alpha thalassemia gene haplotype
CN110349631B (en) * 2019-07-30 2021-10-29 苏州亿康医学检验有限公司 Analysis method and device for determining haplotype of offspring object
TWI752593B (en) 2019-08-16 2022-01-11 香港中文大學 Determination of base modifications of nucleic acids
US20230028790A1 (en) * 2019-11-29 2023-01-26 GC Genome Corporation Artificial intelligence-based chromosomal abnormality detection method
CN113308548B (en) * 2021-01-26 2023-03-28 天津华大医学检验所有限公司 Method, device and storage medium for detecting fetal gene haplotype
CN112885408B (en) * 2021-02-22 2024-10-01 中国农业大学 Method and device for detecting SNP marker loci based on low-depth sequencing
CN119301269A (en) * 2022-05-26 2025-01-10 深圳华大生命科学研究院 Fetal genotype detection method based on haplotype
CN117392673B (en) * 2023-12-12 2024-02-13 深圳赛陆医疗科技有限公司 Base identification method and device, gene sequencer and medium

Family Cites Families (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20130071837A1 (en) * 2004-10-06 2013-03-21 Stephen N. Winters-Hilt Method and System for Characterizing or Identifying Molecules and Molecular Mixtures
TR201910868T4 (en) * 2006-02-02 2019-08-21 Univ Leland Stanford Junior Non-invasive fetal genetic screening with digital analysis.
US8003326B2 (en) * 2008-01-02 2011-08-23 Children's Medical Center Corporation Method for diagnosing autism spectrum disorder
CN102317473A (en) * 2008-12-11 2012-01-11 加利福尼亚太平洋生物科学股份有限公司 Shenzhen tcl new technology co. , ltd
LT3783110T (en) 2009-11-05 2023-01-25 The Chinese University Of Hong Kong FETAL GENOME ANALYSIS FROM THE BIOLOGICAL SAMPLE OF THE MOTHER
US8725422B2 (en) * 2010-10-13 2014-05-13 Complete Genomics, Inc. Methods for estimating genome-wide copy number variations
CN102127818A (en) * 2010-12-15 2011-07-20 张康 Method for creating fetus DNA library by utilizing peripheral blood of pregnant woman

Also Published As

Publication number Publication date
US20180320235A1 (en) 2018-11-08
EP2851431B1 (en) 2017-12-13
JP6045686B2 (en) 2016-12-14
RU2597981C2 (en) 2016-09-20
JP2015525062A (en) 2015-09-03
WO2013170429A1 (en) 2013-11-21
RU2014150655A (en) 2016-07-10
HK1196401A1 (en) 2014-12-12
CN104053789A (en) 2014-09-17
SG11201407515RA (en) 2014-12-30
KR20140146193A (en) 2014-12-24
KR101770460B1 (en) 2017-08-22
US20150094210A1 (en) 2015-04-02
AU2012380221A1 (en) 2014-11-06
AU2012380221B2 (en) 2016-09-29
EP2851431A4 (en) 2016-01-27
PL2851431T3 (en) 2018-04-30
CN104053789B (en) 2016-02-10
EP2851431A1 (en) 2015-03-25

Similar Documents

Publication Publication Date Title
ES2656023T3 (en) Procedure, system and computer-readable medium to determine base information in a predetermined region of the fetus genome
US12494267B2 (en) Methods for non-invasive prenatal ploidy calling
ES2770342T3 (en) Noninvasive Prenatal Paternity Testing Procedures
ES2628874T3 (en) Fetal genomic analysis from a maternal biological sample
US20220042103A1 (en) Methods for non-invasive prenatal ploidy calling
AU2011358564B9 (en) Methods for non-invasive prenatal ploidy calling
ES2913402T3 (en) Assay systems for genetic analysis
EP2925348B1 (en) Individualized vaccines for cancer
EP2902500B1 (en) Methods for non-invasive prenatal ploidy calling
JP2016101168A5 (en)
WO2013053183A1 (en) Method and system for genotyping predetermined region in nucleic acid sample
US20130261984A1 (en) Methods and systems for determining fetal chromosomal abnormalities
CN105385755A (en) Method for conducting SNP-haplotype analysis by means of multiplex PCR technology
CN119724336A (en) A sequence graph-based tool for identifying changes in short tandem repeat regions
CN103397089B (en) Semi-specific amplification primer group, method and kit for quickly detecting chromosome number abnormality
ES2625079T3 (en) Compositions and methods by highly multiplexed PCR
AU2015252046A1 (en) Methods for Non-Invasive Prenatal Ploidy Calling
Cantor have evolved from multi-allelic to 2.4 million bi-allelic
HK1213600B (en) Methods for non-invasive prenatal ploidy calling
HK1213600A1 (en) Methods for non-invasive prenatal ploidy calling
HK1193173A (en) Methods for non-invasive prenatal ploidy calling