CL2019002276A1 - Method and systems for the efficient compression of genomic sequence reads - Google Patents
Method and systems for the efficient compression of genomic sequence reads
Info
- Publication number
- CL2019002276A1
- Authority
- CL
- Chile
- Prior art keywords
- classification
- systems
- genomic sequence
- sequence reads
- efficient compression
Classifications
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/20—Information retrieval; Database structures therefor; File system structures therefor of structured data, e.g. relational data
- G06F16/22—Indexing; Data structures therefor; Storage structures
- G06F16/2282—Tablespace storage structures; Management thereof
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/20—Information retrieval; Database structures therefor; File system structures therefor of structured data, e.g. relational data
- G06F16/23—Updating
- G06F16/2365—Ensuring data consistency and integrity
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/20—Information retrieval; Database structures therefor; File system structures therefor of structured data, e.g. relational data
- G06F16/28—Databases characterised by their database models, e.g. relational or object models
- G06F16/284—Relational databases
- G06F16/285—Clustering or classification
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F21/00—Security arrangements for protecting computers, components thereof, programs or data against unauthorised activity
- G06F21/60—Protecting data
- G06F21/602—Providing cryptographic facilities or services
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F21/00—Security arrangements for protecting computers, components thereof, programs or data against unauthorised activity
- G06F21/60—Protecting data
- G06F21/62—Protecting access to data via a platform, e.g. using keys or access control rules
- G06F21/6218—Protecting access to data via a platform, e.g. using keys or access control rules to a system of files or objects, e.g. local or distributed file system or database
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F21/00—Security arrangements for protecting computers, components thereof, programs or data against unauthorised activity
- G06F21/60—Protecting data
- G06F21/62—Protecting access to data via a platform, e.g. using keys or access control rules
- G06F21/6218—Protecting access to data via a platform, e.g. using keys or access control rules to a system of files or objects, e.g. local or distributed file system or database
- G06F21/6245—Protecting personal data, e.g. for financial or medical purposes
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F3/00—Input arrangements for transferring data to be processed into a form capable of being handled by the computer; Output arrangements for transferring data from processing unit to output unit, e.g. interface arrangements
- G06F3/01—Input arrangements or combined input and output arrangements for interaction between user and computer
- G06F3/048—Interaction techniques based on graphical user interfaces [GUI]
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F7/00—Methods or arrangements for processing data by operating upon the order or content of the data handled
-
- G—PHYSICS
- G16—INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR SPECIFIC APPLICATION FIELDS
- G16B—BIOINFORMATICS, i.e. INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR GENETIC OR PROTEIN-RELATED DATA PROCESSING IN COMPUTATIONAL MOLECULAR BIOLOGY
- G16B20/00—ICT specially adapted for functional genomics or proteomics, e.g. genotype-phenotype associations
- G16B20/10—Ploidy or copy number detection
-
- G—PHYSICS
- G16—INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR SPECIFIC APPLICATION FIELDS
- G16B—BIOINFORMATICS, i.e. INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR GENETIC OR PROTEIN-RELATED DATA PROCESSING IN COMPUTATIONAL MOLECULAR BIOLOGY
- G16B20/00—ICT specially adapted for functional genomics or proteomics, e.g. genotype-phenotype associations
- G16B20/20—Allele or variant detection, e.g. single nucleotide polymorphism [SNP] detection
-
- G—PHYSICS
- G16—INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR SPECIFIC APPLICATION FIELDS
- G16B—BIOINFORMATICS, i.e. INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR GENETIC OR PROTEIN-RELATED DATA PROCESSING IN COMPUTATIONAL MOLECULAR BIOLOGY
- G16B30/00—ICT specially adapted for sequence analysis involving nucleotides or amino acids
-
- G—PHYSICS
- G16—INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR SPECIFIC APPLICATION FIELDS
- G16B—BIOINFORMATICS, i.e. INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR GENETIC OR PROTEIN-RELATED DATA PROCESSING IN COMPUTATIONAL MOLECULAR BIOLOGY
- G16B30/00—ICT specially adapted for sequence analysis involving nucleotides or amino acids
- G16B30/10—Sequence alignment; Homology search
-
- G—PHYSICS
- G16—INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR SPECIFIC APPLICATION FIELDS
- G16B—BIOINFORMATICS, i.e. INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR GENETIC OR PROTEIN-RELATED DATA PROCESSING IN COMPUTATIONAL MOLECULAR BIOLOGY
- G16B30/00—ICT specially adapted for sequence analysis involving nucleotides or amino acids
- G16B30/20—Sequence assembly
-
- G—PHYSICS
- G16—INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR SPECIFIC APPLICATION FIELDS
- G16B—BIOINFORMATICS, i.e. INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR GENETIC OR PROTEIN-RELATED DATA PROCESSING IN COMPUTATIONAL MOLECULAR BIOLOGY
- G16B40/00—ICT specially adapted for biostatistics; ICT specially adapted for bioinformatics-related machine learning or data mining, e.g. knowledge discovery or pattern finding
-
- G—PHYSICS
- G16—INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR SPECIFIC APPLICATION FIELDS
- G16B—BIOINFORMATICS, i.e. INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR GENETIC OR PROTEIN-RELATED DATA PROCESSING IN COMPUTATIONAL MOLECULAR BIOLOGY
- G16B40/00—ICT specially adapted for biostatistics; ICT specially adapted for bioinformatics-related machine learning or data mining, e.g. knowledge discovery or pattern finding
- G16B40/10—Signal processing, e.g. from mass spectrometry [MS] or from PCR
-
- G—PHYSICS
- G16—INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR SPECIFIC APPLICATION FIELDS
- G16B—BIOINFORMATICS, i.e. INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR GENETIC OR PROTEIN-RELATED DATA PROCESSING IN COMPUTATIONAL MOLECULAR BIOLOGY
- G16B45/00—ICT specially adapted for bioinformatics-related data visualisation, e.g. displaying of maps or networks
-
- G—PHYSICS
- G16—INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR SPECIFIC APPLICATION FIELDS
- G16B—BIOINFORMATICS, i.e. INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR GENETIC OR PROTEIN-RELATED DATA PROCESSING IN COMPUTATIONAL MOLECULAR BIOLOGY
- G16B50/00—ICT programming tools or database systems specially adapted for bioinformatics
-
- G—PHYSICS
- G16—INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR SPECIFIC APPLICATION FIELDS
- G16B—BIOINFORMATICS, i.e. INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR GENETIC OR PROTEIN-RELATED DATA PROCESSING IN COMPUTATIONAL MOLECULAR BIOLOGY
- G16B50/00—ICT programming tools or database systems specially adapted for bioinformatics
- G16B50/10—Ontologies; Annotations
-
- G—PHYSICS
- G16—INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR SPECIFIC APPLICATION FIELDS
- G16B—BIOINFORMATICS, i.e. INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR GENETIC OR PROTEIN-RELATED DATA PROCESSING IN COMPUTATIONAL MOLECULAR BIOLOGY
- G16B50/00—ICT programming tools or database systems specially adapted for bioinformatics
- G16B50/30—Data warehousing; Computing architectures
-
- G—PHYSICS
- G16—INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR SPECIFIC APPLICATION FIELDS
- G16B—BIOINFORMATICS, i.e. INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR GENETIC OR PROTEIN-RELATED DATA PROCESSING IN COMPUTATIONAL MOLECULAR BIOLOGY
- G16B50/00—ICT programming tools or database systems specially adapted for bioinformatics
- G16B50/40—Encryption of genetic data
-
- G—PHYSICS
- G16—INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR SPECIFIC APPLICATION FIELDS
- G16B—BIOINFORMATICS, i.e. INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR GENETIC OR PROTEIN-RELATED DATA PROCESSING IN COMPUTATIONAL MOLECULAR BIOLOGY
- G16B50/00—ICT programming tools or database systems specially adapted for bioinformatics
- G16B50/50—Compression of genetic data
-
- G—PHYSICS
- G16—INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR SPECIFIC APPLICATION FIELDS
- G16B—BIOINFORMATICS, i.e. INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR GENETIC OR PROTEIN-RELATED DATA PROCESSING IN COMPUTATIONAL MOLECULAR BIOLOGY
- G16B99/00—Subject matter not provided for in other groups of this subclass
-
- H—ELECTRICITY
- H03—ELECTRONIC CIRCUITRY
- H03M—CODING; DECODING; CODE CONVERSION IN GENERAL
- H03M7/00—Conversion of a code where information is represented by a given sequence or number of digits to a code where the same, similar or subset of information is represented by a different sequence or number of digits
- H03M7/30—Compression; Expansion; Suppression of unnecessary data, e.g. redundancy reduction
- H03M7/3084—Compression; Expansion; Suppression of unnecessary data, e.g. redundancy reduction using adaptive string matching, e.g. the Lempel-Ziv method
- H03M7/3086—Compression; Expansion; Suppression of unnecessary data, e.g. redundancy reduction using adaptive string matching, e.g. the Lempel-Ziv method employing a sliding window, e.g. LZ77
-
- H—ELECTRICITY
- H03—ELECTRONIC CIRCUITRY
- H03M—CODING; DECODING; CODE CONVERSION IN GENERAL
- H03M7/00—Conversion of a code where information is represented by a given sequence or number of digits to a code where the same, similar or subset of information is represented by a different sequence or number of digits
- H03M7/30—Compression; Expansion; Suppression of unnecessary data, e.g. redundancy reduction
- H03M7/70—Type of the data to be coded, other than image and sound
Landscapes
- Engineering & Computer Science (AREA)
- Physics & Mathematics (AREA)
- Theoretical Computer Science (AREA)
- Life Sciences & Earth Sciences (AREA)
- Health & Medical Sciences (AREA)
- General Health & Medical Sciences (AREA)
- Medical Informatics (AREA)
- Bioinformatics & Cheminformatics (AREA)
- Spectroscopy & Molecular Physics (AREA)
- Bioinformatics & Computational Biology (AREA)
- Biotechnology (AREA)
- Evolutionary Biology (AREA)
- Biophysics (AREA)
- Databases & Information Systems (AREA)
- Bioethics (AREA)
- General Engineering & Computer Science (AREA)
- General Physics & Mathematics (AREA)
- Analytical Chemistry (AREA)
- Proteomics, Peptides & Aminoacids (AREA)
- Chemical & Material Sciences (AREA)
- Data Mining & Analysis (AREA)
- Software Systems (AREA)
- Genetics & Genomics (AREA)
- Molecular Biology (AREA)
- Computer Security & Cryptography (AREA)
- Computer Hardware Design (AREA)
- Computer Vision & Pattern Recognition (AREA)
- Public Health (AREA)
- Evolutionary Computation (AREA)
- Epidemiology (AREA)
- Artificial Intelligence (AREA)
- Human Computer Interaction (AREA)
- Signal Processing (AREA)
- Management, Administration, Business Operations System, And Electronic Commerce (AREA)
- Information Retrieval, Db Structures And Fs Structures Therefor (AREA)
- Measuring Or Testing Involving Enzymes Or Micro-Organisms (AREA)
- Compression, Expansion, Code Conversion, And Decoders (AREA)
- Television Signal Processing For Recording (AREA)
- Labeling Devices (AREA)
- Detection And Prevention Of Errors In Transmission (AREA)
Abstract
Method and apparatus for the compression of genome sequence data produced by genome sequencing machines. Sequence reads are encoded by aligning them to pre-existing or constructed reference sequences. The encoding process consists of classifying the reads into data classes, followed by encoding each class in terms of a multiplicity of genomic descriptors. Genomic descriptors of the same type are organized into blocks, which are compressed by applying successive stages of transformation, binarization, and entropy coding. Specific source models and entropy coders are used for each data class and for each associated descriptor.
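The stages named in the abstract (classification of aligned reads into data classes, per-class genomic descriptors, blocks per descriptor type, then transformation, binarization, and entropy coding) can be illustrated with a minimal Python sketch. Everything specific here is an assumption made for illustration and is not taken from the patent: the class names (`perfect`, `n_only`, `substitution`), the descriptor names (`pos`, `rlen`, `mmcount`, `mmpos`, `mmtype`), delta coding as the transformation, LEB128-style bytes as the binarization, and `zlib` (DEFLATE) as a stand-in for the class-specific entropy coders. Reads are assumed to lie fully inside the reference and to use only the letters A, C, G, T, N.

```python
import zlib
from collections import defaultdict

BASES = "ACGTN"  # assumed alphabet for this sketch


def classify(read, ref, pos):
    """Return (class_name, mismatches) for a read aligned at ref[pos:]."""
    window = ref[pos:pos + len(read)]
    mismatches = [(i, b) for i, (a, b) in enumerate(zip(window, read)) if a != b]
    if not mismatches:
        return "perfect", mismatches
    if all(b == "N" for _, b in mismatches):
        return "n_only", mismatches
    return "substitution", mismatches


def delta(values):
    """Transformation: replace each value by its difference from the previous one."""
    prev, out = 0, []
    for v in values:
        out.append(v - prev)
        prev = v
    return out


def varint(values):
    """Binarization: encode non-negative integers as LEB128-style bytes."""
    out = bytearray()
    for v in values:
        while v >= 0x80:
            out.append((v & 0x7F) | 0x80)
            v >>= 7
        out.append(v)
    return bytes(out)


def encode(aligned_reads, ref):
    """Compress (position, read) pairs, assumed sorted by position."""
    blocks = defaultdict(list)  # (class, descriptor) -> list of integers
    for pos, read in aligned_reads:
        cls, mismatches = classify(read, ref, pos)
        blocks[(cls, "pos")].append(pos)
        blocks[(cls, "rlen")].append(len(read))
        if cls != "perfect":
            blocks[(cls, "mmcount")].append(len(mismatches))
        for offset, base in mismatches:
            blocks[(cls, "mmpos")].append(offset)
            blocks[(cls, "mmtype")].append(BASES.index(base))
    compressed = {}
    for (cls, name), values in blocks.items():
        if name == "pos":
            values = delta(values)  # sorted positions give non-negative deltas
        # zlib stands in for a per-class, per-descriptor entropy coder
        compressed[(cls, name)] = zlib.compress(varint(values))
    return compressed


if __name__ == "__main__":
    ref = "ACGTACGTACGT"
    reads = [(0, "ACGT"), (2, "GTNC"), (4, "ACCT")]
    for key, payload in sorted(encode(reads, ref).items()):
        print(key, len(payload), "bytes")
```

Keeping one block per (class, descriptor) pair means each stream holds values with similar statistics, which is what lets a coder tuned to that stream do better than a single coder applied to the interleaved data.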
Applications Claiming Priority (6)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| PCT/EP2016/074311 WO2018068830A1 (en) | 2016-10-11 | 2016-10-11 | Method and system for the transmission of bioinformatics data |
| PCT/EP2016/074301 WO2018068828A1 (en) | 2016-10-11 | 2016-10-11 | Method and system for storing and accessing bioinformatics data |
| PCT/EP2016/074307 WO2018068829A1 (en) | 2016-10-11 | 2016-10-11 | Method and apparatus for compact representation of bioinformatics data |
| PCT/EP2016/074297 WO2018068827A1 (en) | 2016-10-11 | 2016-10-11 | Efficient data structures for bioinformatics information representation |
| PCT/US2017/017842 WO2018071055A1 (en) | 2016-10-11 | 2017-02-14 | Method and apparatus for the compact representation of bioinformatics data |
| PCT/US2017/041579 WO2018071078A1 (en) | 2016-10-11 | 2017-07-11 | Method and apparatus for the access to bioinformatics data structured in access units |
Publications (1)
| Publication Number | Publication Date |
|---|---|
| CL2019002276A1 (es) | 2019-11-29 |
Family
ID=61905752
Family Applications (6)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| CL2019000973A CL2019000973A1 (es) | 2016-10-11 | 2019-04-10 | Method and systems for the indexing of bioinformatics data. |
| CL2019000968A CL2019000968A1 (es) | 2016-10-11 | 2019-04-10 | Method and system for selective access to stored or transmitted bioinformatics data. |
| CL2019000972A CL2019000972A1 (es) | 2016-10-11 | 2019-04-10 | Method and systems for the representation and processing of bioinformatics data using reference sequences. |
| CL2019002275A CL2019002275A1 (es) | 2016-10-11 | 2019-08-12 | Method and systems for the reconstruction of genomic reference sequences from compressed genomic sequence reads. |
| CL2019002277A CL2019002277A1 (es) | 2016-10-11 | 2019-08-12 | Method and apparatus for the compact representation of bioinformatics data using multiple genomic descriptors. |
| CL2019002276A CL2019002276A1 (es) | 2016-10-11 | 2019-08-12 | Method and systems for the efficient compression of genomic sequence reads. |
Family Applications Before (5)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| CL2019000973A CL2019000973A1 (es) | 2016-10-11 | 2019-04-10 | Method and systems for the indexing of bioinformatics data. |
| CL2019000968A CL2019000968A1 (es) | 2016-10-11 | 2019-04-10 | Method and system for selective access to stored or transmitted bioinformatics data. |
| CL2019000972A CL2019000972A1 (es) | 2016-10-11 | 2019-04-10 | Method and systems for the representation and processing of bioinformatics data using reference sequences. |
| CL2019002275A CL2019002275A1 (es) | 2016-10-11 | 2019-08-12 | Method and systems for the reconstruction of genomic reference sequences from compressed genomic sequence reads. |
| CL2019002277A CL2019002277A1 (es) | 2016-10-11 | 2019-08-12 | Method and apparatus for the compact representation of bioinformatics data using multiple genomic descriptors. |
Country Status (17)
| Country | Link |
|---|---|
| US (6) | US20200042735A1 (es) |
| EP (3) | EP3526694A4 (es) |
| JP (4) | JP2020505702A (es) |
| KR (4) | KR20190073426A (es) |
| CN (6) | CN110168651A (es) |
| AU (3) | AU2017342688A1 (es) |
| BR (7) | BR112019007359A2 (es) |
| CA (3) | CA3040138A1 (es) |
| CL (6) | CL2019000973A1 (es) |
| CO (6) | CO2019003639A2 (es) |
| EA (2) | EA201990917A1 (es) |
| IL (3) | IL265879B2 (es) |
| MX (2) | MX2019004130A (es) |
| PE (7) | PE20191058A1 (es) |
| PH (6) | PH12019550058A1 (es) |
| SG (3) | SG11201903270RA (es) |
| WO (4) | WO2018071055A1 (es) |
Families Citing this family (46)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| GB2526598B (en) | 2014-05-29 | 2018-11-28 | Imagination Tech Ltd | Allocation of primitives to primitive blocks |
| US11574287B2 (en) | 2017-10-10 | 2023-02-07 | Text IQ, Inc. | Automatic document classification |
| US11030324B2 (en) * | 2017-11-30 | 2021-06-08 | Koninklijke Philips N.V. | Proactive resistance to re-identification of genomic data |
| WO2019191083A1 (en) * | 2018-03-26 | 2019-10-03 | Colorado State University Research Foundation | Apparatuses, systems and methods for generating and tracking molecular digital signatures to ensure authenticity and integrity of synthetic dna molecules |
| MX2020012672A (es) * | 2018-05-31 | 2021-02-09 | Koninklijke Philips Nv | System and method for allele interpretation using a graph-based reference genome. |
| CN108753765B (zh) * | 2018-06-08 | 2020-12-08 | 中国科学院遗传与发育生物学研究所 | A genome assembly method for constructing ultra-long continuous DNA sequences |
| US12210904B2 (en) * | 2018-06-29 | 2025-01-28 | International Business Machines Corporation | Hybridized storage optimization for genomic workloads |
| US11474978B2 (en) * | 2018-07-06 | 2022-10-18 | Capital One Services, Llc | Systems and methods for a data search engine based on data profiles |
| US12300358B2 (en) * | 2018-08-20 | 2025-05-13 | The Board Of Trustees Of The Leland Stanford Junior University | Systems and methods for compressing genetic sequencing data and uses thereof |
| GB2585816A (en) * | 2018-12-12 | 2021-01-27 | Univ York | Proof-of-work for blockchain applications |
| US20210074381A1 (en) * | 2019-09-11 | 2021-03-11 | Enancio | Method for the compression of genome sequence data |
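| JP2022551261A (ja) * | 2019-10-01 | 2022-12-08 | コーニンクレッカ フィリップス エヌ ヴェ | System and method for efficient identification and extraction of sequence paths in genome graphs |
| CN110797087B (zh) * | 2019-10-17 | 2020-11-03 | 南京医基云医疗数据研究院有限公司 | Sequencing read processing method and apparatus, storage medium, and electronic device |
| JP7631330B2 (ja) * | 2019-10-18 | 2025-02-18 | コーニンクレッカ フィリップス エヌ ヴェ | System and method for effective compression, representation, and decompression of diverse tabular data |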
| US12322477B1 (en) * | 2019-12-04 | 2025-06-03 | John Hayward | Methods of efficiently transforming and comparing recombinable DNA information |
| US12445148B2 (en) | 2020-01-03 | 2025-10-14 | Koninklijke Philips N.V. | System and method for effective compression representation and decompression of diverse tabulated data |
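| CN115088038A (zh) * | 2020-02-07 | 2022-09-20 | 皇家飞利浦有限公司 | Improved quality value compression framework for aligned sequencing data based on novel contexts |
| CN111243663B (zh) * | 2020-02-26 | 2022-06-07 | 西安交通大学 | A gene variant detection method based on a pattern-growth algorithm |
| CN111370070B (zh) * | 2020-02-27 | 2023-10-27 | 中国科学院计算技术研究所 | A compression method for big-data gene sequencing files |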
| US12006539B2 (en) | 2020-03-17 | 2024-06-11 | Western Digital Technologies, Inc. | Reference-guided genome sequencing |
| US12014802B2 (en) | 2020-03-17 | 2024-06-18 | Western Digital Technologies, Inc. | Devices and methods for locating a sample read in a reference genome |
| US11837330B2 (en) | 2020-03-18 | 2023-12-05 | Western Digital Technologies, Inc. | Reference-guided genome sequencing |
| BR112022020116A2 (pt) | 2020-04-07 | 2022-12-20 | Koninklijke Philips Nv | Method and system for grouping genomic data |
| EP3896698A1 (en) * | 2020-04-15 | 2021-10-20 | Genomsys SA | Method and system for the efficient data compression in mpeg-g |
| CN111459208A (zh) * | 2020-04-17 | 2020-07-28 | 南京铁道职业技术学院 | Control system and method for the electrical energy of a subway power supply system |
| US12224042B2 (en) | 2020-06-22 | 2025-02-11 | SanDisk Technologies, Inc. | Devices and methods for genome sequencing |
| US12093803B2 (en) | 2020-07-01 | 2024-09-17 | International Business Machines Corporation | Downsampling genomic sequence data |
| CN115917657A (zh) * | 2020-09-14 | 2023-04-04 | Illumina公司 | Custom data files for personalized medicine |
| US20230377692A1 (en) * | 2020-10-06 | 2023-11-23 | Koninklijke Philips N.V. | Methods and systems for storing genomic data in a file structure comprising an information metadata structure |
| IL301901A (en) * | 2020-10-06 | 2023-06-01 | Koninklijke Philips Nv | Methods and systems for storing genomic data in a file structure that includes protection metadata |
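| CN112836355B (zh) * | 2021-01-14 | 2023-04-18 | 西安科技大学 | A method for predicting the probability of roof weighting at a coal mining working face |
| JP7118199B1 (ja) * | 2021-03-26 | 2022-08-15 | エヌ・ティ・ティ・コミュニケーションズ株式会社 | Processing system, processing method, and processing program |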
| US12406413B2 (en) | 2021-05-10 | 2025-09-02 | Optum Services (Ireland) Limited | Predictive data analysis using image representations of genomic data |
| ES2930699A1 (es) * | 2021-06-10 | 2022-12-20 | Veritas Intercontinental S L | Method of genomic analysis on a bioinformatics platform |
| CN113670643B (zh) * | 2021-08-30 | 2023-05-12 | 四川虹美智能科技有限公司 | Intelligent air conditioner testing method and system |
| CN113643761B (zh) * | 2021-10-13 | 2022-01-18 | 苏州赛美科基因科技有限公司 | A method for extracting the data required to interpret next-generation sequencing results |
| US20230187020A1 (en) * | 2021-12-15 | 2023-06-15 | Illumina Software, Inc. | Systems and methods for iterative and scalable population-scale variant analysis |
| US12431218B2 (en) | 2022-03-08 | 2025-09-30 | Illumina, Inc. | Multi-pass software-accelerated genomic read mapping engine |
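| CN115458050B (zh) * | 2022-08-05 | 2026-01-06 | 武汉大学 | Multi-gene discovery network construction method, apparatus, device, and storage medium |
| CN115391284B (zh) * | 2022-10-31 | 2023-02-03 | 四川大学华西医院 | Method and system for rapid identification of gene data files, and computer-readable storage medium |
| CN118435282A (zh) * | 2022-12-02 | 2024-08-02 | 香港城市大学 | Reinforcement-learning-based network transmission of compressed genomic sequences |
| CN116541348B (zh) * | 2023-03-22 | 2023-09-26 | 河北热点科技股份有限公司 | Intelligent data storage method and all-in-one terminal query machine |
| CN116739646B (zh) * | 2023-08-15 | 2023-11-24 | 南京易联阳光信息技术股份有限公司 | Big data analysis method and analysis system for online transactions |
| CN117153270B (zh) * | 2023-10-30 | 2024-02-02 | 吉林华瑞基因科技有限公司 | A method for processing next-generation gene sequencing data |
| CN117708755B (zh) * | 2023-12-17 | 2024-06-21 | 重庆文理学院 | Data processing method and apparatus based on the ecological environment |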
| WO2025137316A1 (en) * | 2023-12-21 | 2025-06-26 | Illumina, Inc. | Sequence data processing, retention, and recovery |
Family Cites Families (54)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| US6303297B1 (en) * | 1992-07-17 | 2001-10-16 | Incyte Pharmaceuticals, Inc. | Database for storage and analysis of full-length sequences |
| JP3429674B2 (ja) | 1998-04-28 | 2003-07-22 | 沖電気工業株式会社 | Multiplex communication system |
| EP1410301A4 (en) * | 2000-04-12 | 2008-01-23 | Cleveland Clinic Foundation | SYSTEM FOR IDENTIFYING AND ANALYZING GENE EXPRESSION CONTAINING ELEMENTS RICH IN ADENYLATE URIDYLATE (ARE) |
| FR2820563B1 (fr) * | 2001-02-02 | 2003-05-16 | Expway | Method for compressing/decompressing a structured document |
| US20040153255A1 (en) * | 2003-02-03 | 2004-08-05 | Ahn Tae-Jin | Apparatus and method for encoding DNA sequence, and computer readable medium |
| DE10320711A1 (de) * | 2003-05-08 | 2004-12-16 | Siemens Ag | Method and arrangement for setting up and updating a user interface for accessing information pages in a data network |
| WO2005024562A2 (en) * | 2003-08-11 | 2005-03-17 | Eloret Corporation | System and method for pattern recognition in sequential data |
| US7805282B2 (en) * | 2004-03-30 | 2010-09-28 | New York University | Process, software arrangement and computer-accessible medium for obtaining information associated with a haplotype |
| WO2006052242A1 (en) * | 2004-11-08 | 2006-05-18 | Seirad, Inc. | Methods and systems for compressing and comparing genomic data |
| US20130332133A1 (en) * | 2006-05-11 | 2013-12-12 | Ramot At Tel Aviv University Ltd. | Classification of Protein Sequences and Uses of Classified Proteins |
| SE531398C2 (sv) | 2007-02-16 | 2009-03-24 | Scalado Ab | Generating a data stream and identifying positions within a data stream |
| KR101369745B1 (ko) * | 2007-04-11 | 2014-03-07 | 삼성전자주식회사 | Method and apparatus for multiplexing and demultiplexing asynchronous bitstreams |
| US8832112B2 (en) * | 2008-06-17 | 2014-09-09 | International Business Machines Corporation | Encoded matrix index |
| GB2477703A (en) * | 2008-11-14 | 2011-08-10 | Real Time Genomics Inc | A method and system for analysing data sequences |
| US20100217532A1 (en) * | 2009-02-25 | 2010-08-26 | University Of Delaware | Systems and methods for identifying structurally or functionally significant amino acid sequences |
| EP2494060B1 (en) * | 2009-10-30 | 2016-04-27 | Synthetic Genomics, Inc. | Encoding text into nucleic acid sequences |
| EP2362657B1 (en) * | 2010-02-18 | 2013-04-24 | Research In Motion Limited | Parallel entropy coding and decoding methods and devices |
| US20140228223A1 (en) * | 2010-05-10 | 2014-08-14 | Andreas Gnirke | High throughput paired-end sequencing of large-insert clone libraries |
| EP3657507A1 (en) * | 2010-05-25 | 2020-05-27 | The Regents of The University of California | Bambam: parallel comparative analysis of high-throughput sequencing data |
| CN111192634A (zh) * | 2011-01-19 | 2020-05-22 | 皇家飞利浦电子股份有限公司 | Method for processing genomic data |
| US20120230338A1 (en) * | 2011-03-09 | 2012-09-13 | Annai Systems, Inc. | Biological data networks and methods therefor |
| EP2718862B1 (en) * | 2011-06-06 | 2018-10-31 | Koninklijke Philips N.V. | Method for assembly of nucleic acid sequence data |
| PL3343781T3 (pl) * | 2011-06-16 | 2022-03-28 | Ge Video Compression, Llc | Context initialization in entropy coding |
| US8707289B2 (en) * | 2011-07-20 | 2014-04-22 | Google Inc. | Multiple application versions |
| JP6130839B2 (ja) * | 2011-10-06 | 2017-05-17 | フラウンホーファー−ゲゼルシャフト・ツール・フェルデルング・デル・アンゲヴァンテン・フォルシュング・アインゲトラーゲネル・フェライン | Entropy coding |
| CN104094266A (zh) * | 2011-11-07 | 2014-10-08 | 独创系统公司 | Methods and systems for identifying causal genomic variants |
| KR101922129B1 (ko) * | 2011-12-05 | 2018-11-26 | 삼성전자주식회사 | Method and apparatus for compressing and decompressing genetic information obtained using next-generation sequencing |
| JP6025859B2 (ja) * | 2011-12-08 | 2016-11-16 | ファイヴ3 ゲノミクス,エルエルシー | Distributed system providing dynamic indexing and visualization of genomic data |
| EP2608096B1 (en) * | 2011-12-24 | 2020-08-05 | Tata Consultancy Services Ltd. | Compression of genomic data file |
| US9600625B2 (en) * | 2012-04-23 | 2017-03-21 | Bina Technologies, Inc. | Systems and methods for processing nucleic acid sequence data |
| CN103049680B (zh) * | 2012-12-29 | 2016-09-07 | 深圳先进技术研究院 | Method and system for reading gene sequencing data |
| US9679104B2 (en) * | 2013-01-17 | 2017-06-13 | Edico Genome, Corp. | Bioinformatics systems, apparatuses, and methods executed on an integrated circuit processing platform |
| WO2014145503A2 (en) * | 2013-03-15 | 2014-09-18 | Lieber Institute For Brain Development | Sequence alignment using divide and conquer maximum oligonucleotide mapping (dcmom), apparatus, system and method related thereto |
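| JP6054790B2 (ja) * | 2013-03-28 | 2016-12-27 | 三菱スペース・ソフトウエア株式会社 | Genetic information storage device, genetic information search device, genetic information storage program, genetic information search program, genetic information storage method, genetic information search method, and genetic information search system |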
| GB2512829B (en) * | 2013-04-05 | 2015-05-27 | Canon Kk | Method and apparatus for encoding or decoding an image with inter layer motion information prediction according to motion information compression scheme |
| WO2014186604A1 (en) * | 2013-05-15 | 2014-11-20 | Edico Genome Corp. | Bioinformatics systems, apparatuses, and methods executed on an integrated circuit processing platform |
| KR101522087B1 (ko) * | 2013-06-19 | 2015-05-28 | 삼성에스디에스 주식회사 | System and method for aligning base sequences while accounting for mismatches |
| CN103336916B (zh) * | 2013-07-05 | 2016-04-06 | 中国科学院数学与系统科学研究院 | A sequencing read mapping method and system |
| US20150032711A1 (en) * | 2013-07-06 | 2015-01-29 | Victor Kunin | Methods for identification of organisms, assigning reads to organisms, and identification of genes in metagenomic sequences |
| KR101493982B1 (ko) * | 2013-09-26 | 2015-02-23 | 대한민국 | Variety identification coding system and coding method using the same |
| CN104699998A (zh) * | 2013-12-06 | 2015-06-10 | 国际商业机器公司 | Method and apparatus for compressing and decompressing a genome |
| US10902937B2 (en) * | 2014-02-12 | 2021-01-26 | International Business Machines Corporation | Lossless compression of DNA sequences |
| US9639542B2 (en) * | 2014-02-14 | 2017-05-02 | Sap Se | Dynamic mapping of extensible datasets to relational database schemas |
| WO2015127058A1 (en) * | 2014-02-19 | 2015-08-27 | Hospodor Andrew | Efficient encoding and storage and retrieval of genomic data |
| US9354922B2 (en) | 2014-04-02 | 2016-05-31 | International Business Machines Corporation | Metadata-driven workflows and integration with genomic data processing systems and techniques |
| US20150379195A1 (en) * | 2014-06-25 | 2015-12-31 | The Board Of Trustees Of The Leland Stanford Junior University | Software haplotying of hla loci |
| GB2527588B (en) * | 2014-06-27 | 2016-05-18 | Gurulogic Microsystems Oy | Encoder and decoder |
| US20160019339A1 (en) * | 2014-07-06 | 2016-01-21 | Mercator BioLogic Incorporated | Bioinformatics tools, systems and methods for sequence assembly |
| US10230390B2 (en) * | 2014-08-29 | 2019-03-12 | Bonnie Berger Leighton | Compressively-accelerated read mapping framework for next-generation sequencing |
| US10116632B2 (en) * | 2014-09-12 | 2018-10-30 | New York University | System, method and computer-accessible medium for secure and compressed transmission of genomic data |
| US20160125130A1 (en) * | 2014-11-05 | 2016-05-05 | Agilent Technologies, Inc. | Method for assigning target-enriched sequence reads to a genomic location |
| WO2016202918A1 (en) * | 2015-06-16 | 2016-12-22 | Gottfried Wilhelm Leibniz Universität Hannover | Method for compressing genomic data |
| CN105956417A (zh) * | 2016-05-04 | 2016-09-21 | 西安电子科技大学 | Edit-distance-based similar base sequence query method in a cloud environment |
| CN105975811B (zh) * | 2016-05-09 | 2019-03-15 | 管仁初 | A gene sequence analysis device with intelligent alignment |
-
2017
- 2017-02-14 SG SG11201903270RA patent/SG11201903270RA/en unknown
- 2017-02-14 EP EP17859972.6A patent/EP3526694A4/en not_active Withdrawn
- 2017-02-14 WO PCT/US2017/017842 patent/WO2018071055A1/en not_active Ceased
- 2017-02-14 WO PCT/US2017/017841 patent/WO2018071054A1/en not_active Ceased
- 2017-02-14 JP JP2019540510A patent/JP2020505702A/ja not_active Withdrawn
- 2017-02-14 US US16/341,426 patent/US20200042735A1/en not_active Abandoned
- 2017-02-14 AU AU2017342688A patent/AU2017342688A1/en not_active Abandoned
- 2017-02-14 PE PE2019000804A patent/PE20191058A1/es unknown
- 2017-02-14 BR BR112019007359A patent/BR112019007359A2/pt not_active IP Right Cessation
- 2017-02-14 CN CN201780062919.5A patent/CN110168651A/zh active Pending
- 2017-02-14 CA CA3040138A patent/CA3040138A1/en not_active Abandoned
- 2017-02-14 KR KR1020197013567A patent/KR20190073426A/ko not_active Withdrawn
- 2017-02-14 MX MX2019004130A patent/MX2019004130A/es unknown
- 2017-07-11 EP EP17860868.3A patent/EP3526707A4/en not_active Withdrawn
- 2017-07-11 BR BR112019007360A patent/BR112019007360A2/pt not_active Application Discontinuation
- 2017-07-11 JP JP2019540511A patent/JP7079786B2/ja active Active
- 2017-07-11 SG SG11201903272XA patent/SG11201903272XA/en unknown
- 2017-07-11 CN CN201780063013.5A patent/CN110506272B/zh active Active
- 2017-07-11 US US16/337,642 patent/US11404143B2/en active Active
- 2017-07-11 BR BR112019007357A patent/BR112019007357A2/pt not_active Application Discontinuation
- 2017-07-11 JP JP2019540512A patent/JP2019537172A/ja not_active Withdrawn
- 2017-07-11 CA CA3040145A patent/CA3040145A1/en not_active Abandoned
- 2017-07-11 CA CA3040147A patent/CA3040147A1/en not_active Abandoned
- 2017-07-11 US US16/337,639 patent/US20190214111A1/en not_active Abandoned
- 2017-07-11 PE PE2019000803A patent/PE20191057A1/es unknown
- 2017-07-11 SG SG11201903271UA patent/SG11201903271UA/en unknown
- 2017-07-11 EA EA201990917A patent/EA201990917A1/ru unknown
- 2017-07-11 PE PE2019000805A patent/PE20191227A1/es unknown
- 2017-07-11 WO PCT/US2017/041591 patent/WO2018071080A2/en not_active Ceased
- 2017-07-11 JP JP2019540513A patent/JP2020500383A/ja not_active Withdrawn
- 2017-07-11 EA EA201990916A patent/EA201990916A1/ru unknown
- 2017-07-11 IL IL265879A patent/IL265879B2/en unknown
- 2017-07-11 KR KR1020197013418A patent/KR20190062541A/ko not_active Withdrawn
- 2017-07-11 CN CN201780062885.XA patent/CN110114830B/zh active Active
- 2017-07-11 MX MX2019004128A patent/MX2019004128A/es unknown
- 2017-07-11 AU AU2017341685A patent/AU2017341685A1/en not_active Abandoned
- 2017-07-11 KR KR1020197013419A patent/KR20190069469A/ko not_active Withdrawn
- 2017-07-11 BR BR112019007363A patent/BR112019007363A2/pt not_active Application Discontinuation
- 2017-07-11 EP EP17860980.6A patent/EP3526657A4/en active Pending
- 2017-07-11 CN CN201780063014.XA patent/CN110121577B/zh active Active
- 2017-07-11 AU AU2017341684A patent/AU2017341684A1/en not_active Abandoned
- 2017-07-11 WO PCT/US2017/041585 patent/WO2018071079A1/en not_active Ceased
- 2017-07-11 PE PE2019000802A patent/PE20191056A1/es unknown
- 2017-12-14 KR KR1020197026863A patent/KR20190117652A/ko not_active Withdrawn
- 2017-12-14 BR BR112019016230A patent/BR112019016230A2/pt not_active Application Discontinuation
- 2017-12-14 US US16/485,623 patent/US20190385702A1/en active Pending
- 2017-12-14 PE PE2019001667A patent/PE20200323A1/es unknown
- 2017-12-14 CN CN201780086529.1A patent/CN110603595B/zh active Active
- 2017-12-15 PE PE2019001669A patent/PE20200226A1/es unknown
- 2017-12-15 US US16/485,649 patent/US20200051667A1/en active Pending
- 2017-12-15 CN CN201780086770.4A patent/CN110678929B/zh active Active
- 2017-12-15 BR BR112019016232A patent/BR112019016232A2/pt not_active Application Discontinuation
-
2018
- 2018-02-14 BR BR112019016236A patent/BR112019016236A2/pt unknown
- 2018-02-14 PE PE2019001668A patent/PE20200227A1/es unknown
- 2018-02-14 US US16/485,670 patent/US20200051665A1/en active Pending
-
2019
- 2019-04-08 IL IL265928A patent/IL265928B/en active IP Right Grant
- 2019-04-10 CL CL2019000973A patent/CL2019000973A1/es unknown
- 2019-04-10 CL CL2019000968A patent/CL2019000968A1/es unknown
- 2019-04-10 CL CL2019000972A patent/CL2019000972A1/es unknown
- 2019-04-11 CO CONC2019/0003639A patent/CO2019003639A2/es unknown
- 2019-04-11 PH PH12019550058A patent/PH12019550058A1/en unknown
- 2019-04-11 IL IL265972A patent/IL265972A/en unknown
- 2019-04-11 PH PH12019550057A patent/PH12019550057A1/en unknown
- 2019-04-11 PH PH12019550060A patent/PH12019550060A1/en unknown
- 2019-04-11 PH PH12019550059A patent/PH12019550059A1/en unknown
- 2019-04-11 CO CONC2019/0003638A patent/CO2019003638A2/es unknown
- 2019-04-11 CO CONC2019/0003595A patent/CO2019003595A2/es unknown
- 2019-04-15 CO CONC2019/0003842A patent/CO2019003842A2/es unknown
- 2019-08-12 CL CL2019002275A patent/CL2019002275A1/es unknown
- 2019-08-12 CL CL2019002277A patent/CL2019002277A1/es unknown
- 2019-08-12 CL CL2019002276A patent/CL2019002276A1/es unknown
- 2019-08-13 PH PH12019501879A patent/PH12019501879A1/en unknown
- 2019-08-13 PH PH12019501881A patent/PH12019501881A1/en unknown
- 2019-09-12 CO CONC2019/0009922A patent/CO2019009922A2/es unknown
- 2019-09-12 CO CONC2019/0009920A patent/CO2019009920A2/es unknown
Similar Documents
| Publication | Publication Date | Title |
|---|---|---|
| CL2019002276A1 (es) | | Method and systems for the efficient compression of genomic sequence reads. |
| CO2019009919A2 (es) | | Method and systems for the efficient compression of genomic sequence reads |
| MX2024007272A (es) | | Three-dimensional data encoding method, three-dimensional data decoding method, three-dimensional data encoding device, and three-dimensional data decoding device. |
| MX2019013109A (es) | | Data processing device and data processing method. |
| MY204138A (en) | | Three-dimensional data encoding method, three-dimensional data decoding method, three-dimensional data encoding device, and three-dimensional data decoding device |
| CO2017007499A2 (es) | | Palette index grouping for video coding |
| MX2019010795A (es) | | Video coding method using binary tree block partitioning. |
| GB2545070A (en) | | Generating molecular encoding information for data storage |
| CO6721060A2 (es) | | Image processing apparatus and method. |
| AR092787A1 (es) | | Performance improvement for CABAC coefficient level coding |
| MX2019015119A (es) | | Data processing device and data processing method. |
| MX2019015354A (es) | | Data processing device and data processing method. |
| MX2019009680A (es) | | Method and apparatus for the compact representation of bioinformatics data using multiple genomic descriptors. |
| MX373707B (es) | | Parity interleaving apparatus for encoding variable-length signaling information and parity interleaving method using the same. |
| PH12019500294A1 (en) | | Method and apparatus for coding and decoding polar codes |
| AR102919A1 (es) | | Lipase variants and polynucleotides encoding them |
| PH12019500793A1 (en) | | Method and apparatus for compact representation of bioinformatics data |
| EA201991906A1 (ru) | | Method and systems for the reconstruction of genomic reference sequences from compressed genomic sequence reads |
| EP3193260A3 (en) | | Encoding program, encoding method, encoding device, decoding program, decoding method, and decoding device |
| MX2018006141A (es) | | Video decoding method and device, and video encoding method and device thereof. |
| MX2019009681A (es) | | Method and systems for the efficient compression of genomic sequence reads. |
| AR110436A1 (es) | | Video encoding method, video decoding method, video encoding device, and video decoding device |
| EA201990923A1 (ru) | | Method and apparatus for accessing bioinformatics data structured in access units |
| BR112016029539A2 (pt) | | Systems and methods for intra-block copy |
| TH167859A (th) | | Audio encoding method and apparatus |