ES2167101T3 - Procedimiento para el agrupamiento de secuencias en familias. - Google Patents

Procedimiento para el agrupamiento de secuencias en familias.

Info

Publication number
ES2167101T3
ES2167101T3 ES98951173T ES98951173T ES2167101T3 ES 2167101 T3 ES2167101 T3 ES 2167101T3 ES 98951173 T ES98951173 T ES 98951173T ES 98951173 T ES98951173 T ES 98951173T ES 2167101 T3 ES2167101 T3 ES 2167101T3
Authority
ES
Spain
Prior art keywords
sequences
groups
grouping
families
procedure
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Expired - Lifetime
Application number
ES98951173T
Other languages
English (en)
Inventor
Martin Vingron
Antje Krause
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Deutsches Krebsforschungszentrum DKFZ
Original Assignee
Deutsches Krebsforschungszentrum DKFZ
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Deutsches Krebsforschungszentrum DKFZ filed Critical Deutsches Krebsforschungszentrum DKFZ
Application granted granted Critical
Publication of ES2167101T3 publication Critical patent/ES2167101T3/es
Anticipated expiration legal-status Critical
Expired - Lifetime legal-status Critical Current

Links

Classifications

    • GPHYSICS
    • G16INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR SPECIFIC APPLICATION FIELDS
    • G16BBIOINFORMATICS, i.e. INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR GENETIC OR PROTEIN-RELATED DATA PROCESSING IN COMPUTATIONAL MOLECULAR BIOLOGY
    • G16B40/00ICT specially adapted for biostatistics; ICT specially adapted for bioinformatics-related machine learning or data mining, e.g. knowledge discovery or pattern finding
    • GPHYSICS
    • G16INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR SPECIFIC APPLICATION FIELDS
    • G16BBIOINFORMATICS, i.e. INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR GENETIC OR PROTEIN-RELATED DATA PROCESSING IN COMPUTATIONAL MOLECULAR BIOLOGY
    • G16B40/00ICT specially adapted for biostatistics; ICT specially adapted for bioinformatics-related machine learning or data mining, e.g. knowledge discovery or pattern finding
    • G16B40/30Unsupervised data analysis
    • GPHYSICS
    • G16INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR SPECIFIC APPLICATION FIELDS
    • G16BBIOINFORMATICS, i.e. INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR GENETIC OR PROTEIN-RELATED DATA PROCESSING IN COMPUTATIONAL MOLECULAR BIOLOGY
    • G16B50/00ICT programming tools or database systems specially adapted for bioinformatics
    • G16B50/30Data warehousing; Computing architectures
    • GPHYSICS
    • G16INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR SPECIFIC APPLICATION FIELDS
    • G16BBIOINFORMATICS, i.e. INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR GENETIC OR PROTEIN-RELATED DATA PROCESSING IN COMPUTATIONAL MOLECULAR BIOLOGY
    • G16B50/00ICT programming tools or database systems specially adapted for bioinformatics
    • YGENERAL TAGGING OF NEW TECHNOLOGICAL DEVELOPMENTS; GENERAL TAGGING OF CROSS-SECTIONAL TECHNOLOGIES SPANNING OVER SEVERAL SECTIONS OF THE IPC; TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
    • Y10TECHNICAL SUBJECTS COVERED BY FORMER USPC
    • Y10STECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
    • Y10S707/00Data processing: database and file management or data structures
    • Y10S707/99931Database or file accessing
    • Y10S707/99932Access augmentation or optimizing
    • YGENERAL TAGGING OF NEW TECHNOLOGICAL DEVELOPMENTS; GENERAL TAGGING OF CROSS-SECTIONAL TECHNOLOGIES SPANNING OVER SEVERAL SECTIONS OF THE IPC; TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
    • Y10TECHNICAL SUBJECTS COVERED BY FORMER USPC
    • Y10STECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
    • Y10S707/00Data processing: database and file management or data structures
    • Y10S707/99941Database schema or data structure
    • Y10S707/99944Object-oriented database structure
    • Y10S707/99945Object-oriented database structure processing
    • YGENERAL TAGGING OF NEW TECHNOLOGICAL DEVELOPMENTS; GENERAL TAGGING OF CROSS-SECTIONAL TECHNOLOGIES SPANNING OVER SEVERAL SECTIONS OF THE IPC; TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
    • Y10TECHNICAL SUBJECTS COVERED BY FORMER USPC
    • Y10STECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
    • Y10S707/00Data processing: database and file management or data structures
    • Y10S707/99951File or database maintenance
    • Y10S707/99952Coherency, e.g. same view to multiple users
    • Y10S707/99953Recoverability

Landscapes

  • Engineering & Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • Health & Medical Sciences (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • Medical Informatics (AREA)
  • Theoretical Computer Science (AREA)
  • Spectroscopy & Molecular Physics (AREA)
  • Databases & Information Systems (AREA)
  • Bioinformatics & Cheminformatics (AREA)
  • Bioinformatics & Computational Biology (AREA)
  • Biotechnology (AREA)
  • Evolutionary Biology (AREA)
  • General Health & Medical Sciences (AREA)
  • Biophysics (AREA)
  • Bioethics (AREA)
  • Data Mining & Analysis (AREA)
  • Computer Vision & Pattern Recognition (AREA)
  • Artificial Intelligence (AREA)
  • Epidemiology (AREA)
  • Evolutionary Computation (AREA)
  • Public Health (AREA)
  • Software Systems (AREA)
  • Information Retrieval, Db Structures And Fs Structures Therefor (AREA)
  • Peptides Or Proteins (AREA)
  • Compression Or Coding Systems Of Tv Signals (AREA)
  • Table Devices Or Equipment (AREA)
  • Magnetic Resonance Imaging Apparatus (AREA)
  • Crystals, And After-Treatments Of Crystals (AREA)

Abstract

Procedimiento para el agrupamiento de secuencias en familias, en el que { con un programa de búsqueda en banco de datos se averiguan a partir de un banco de datos de secuencias, como cantidad positiva, todas las secuencias similares a una secuencia de consulta, secuencias para las que la probabilidad de que aparezca la semejanza casualmente queda por debajo de un valor umbral preestablecido, { de esta cantidad positiva se elige al menos una secuencia como secuencia de búsqueda, { a continuación, el procedimiento de búsqueda descrito con la secuencia de búsqueda averiguada como secuencia de consulta se repite hasta que la cantidad positiva recién averiguada contenga secuencias que no estén contenidas en las cantidades positivas previamente averiguadas y haya una cantidad de corte entre la cantidad positiva de la secuencia de consulta y la cantidad positiva de la secuencia actual de búsqueda y { todas las diferentes secuencias contenidas en las cantidades positivas calculadas se extraen comoracimo.
ES98951173T 1997-10-17 1998-08-14 Procedimiento para el agrupamiento de secuencias en familias. Expired - Lifetime ES2167101T3 (es)

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
DE19745665A DE19745665C1 (de) 1997-10-17 1997-10-17 Verfahren zur Eingruppierung von Sequenzen in Familien

Publications (1)

Publication Number Publication Date
ES2167101T3 true ES2167101T3 (es) 2002-05-01

Family

ID=7845680

Family Applications (1)

Application Number Title Priority Date Filing Date
ES98951173T Expired - Lifetime ES2167101T3 (es) 1997-10-17 1998-08-14 Procedimiento para el agrupamiento de secuencias en familias.

Country Status (9)

Country Link
US (1) US6304868B1 (es)
EP (1) EP1027669B1 (es)
AT (1) ATE208068T1 (es)
CA (1) CA2305318A1 (es)
DE (2) DE19745665C1 (es)
DK (1) DK1027669T3 (es)
ES (1) ES2167101T3 (es)
PT (1) PT1027669E (es)
WO (1) WO1999021107A1 (es)

Families Citing this family (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US8473215B2 (en) * 2003-04-25 2013-06-25 Leland Stanford Junior University Method for clustering data items through distance-merging and density-merging techniques
DE10323917A1 (de) * 2003-05-23 2004-12-16 Protagen Ag Verfahren und System zur Aufklärung der Primärstruktur von Biopolymeren
US20040249791A1 (en) * 2003-06-03 2004-12-09 Waters Michael D. Method and system for developing and querying a sequence driven contextual knowledge base
GB0400974D0 (en) * 2004-01-16 2004-02-18 Solexa Ltd Multiple inexact matching
US20060106545A1 (en) * 2004-11-12 2006-05-18 Jubilant Biosys Ltd. Methods of clustering proteins
EP2000935A3 (en) * 2007-05-10 2012-07-18 F. Hoffmann-La Roche AG Method of processing protein peptide data and system
CN108388771B (zh) * 2018-01-24 2021-10-08 安徽微分基因科技有限公司 一种生物多样性自动分析方法

Family Cites Families (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JPH0793370A (ja) * 1993-09-27 1995-04-07 Hitachi Device Eng Co Ltd 遺伝子データベース検索システム
US5664174A (en) * 1995-05-09 1997-09-02 International Business Machines Corporation System and method for discovering similar time sequences in databases

Also Published As

Publication number Publication date
EP1027669A1 (de) 2000-08-16
WO1999021107A1 (de) 1999-04-29
DE59801996D1 (de) 2001-12-06
US6304868B1 (en) 2001-10-16
ATE208068T1 (de) 2001-11-15
EP1027669B1 (de) 2001-10-31
DE19745665C1 (de) 1999-05-12
PT1027669E (pt) 2002-04-29
CA2305318A1 (en) 1999-04-29
DK1027669T3 (da) 2002-01-28

Similar Documents

Publication Publication Date Title
BR9809154A (pt) Aparelho de teste microbiológico automatizado e métodos para o mesmo
ATE210191T1 (de) Nukleotid-sequenzierungsmethode
GB0400974D0 (en) Multiple inexact matching
SG143036A1 (en) Managing filesystem versions
BR0111133A (pt) Instalação para produzir pneus de diferentes tipos simultaneamente e método para fabricar pneus de diferentes tipos em uma instalação automática
EP0777750A4 (en) HIGH SALES METHOD FOR DETECTING SEQUENCES OR GENETIC CHANGES IN NUCLEIC ACIDS
AR012415A1 (es) Metodo y disposicion de computadora para procesar de forma dinamica un indice para crear una serie de preguntas para uso en una disposicion deretiro de informacion
AP9901654A0 (en) Gene sequencer and methods.
BR9707056A (pt) Processos e composições para determinar a sequência de moléculas de ácido nucleico
WO2002103028A3 (en) In silico screening for phenotype-associated expressed sequences
ATE425985T1 (de) Verfahren zur markierung von rns
ES2167101T3 (es) Procedimiento para el agrupamiento de secuencias en familias.
WO2002086081A3 (en) Methods and systems for identifying proteins
DE60208431D1 (de) Verfahren zur automatischen etikettierung von zweigen
GB2377272A (en) Method and apparatus for DNA sequencing
Ogura et al. Proteomic characterization of seeds from yellow lupin (L upinus luteus L.)
PT879296E (pt) Marcadores de adn para o tamanho da ninhada no suino
WO2003083720A3 (en) Database searching method and system
ATE481694T1 (de) Verfahren zur auswahl von ästen für das ausrichten von sonden
WO2003096223A1 (en) Mutant sequence analyzer
ES2184335T3 (es) Procedimiento de representacion de interconexiones entre una pluralidad de archivos de datos y un aparato y un programa de ordenador para realizar todas las etapas de dicho procedimiento.
ES2175116T3 (es) Metodos para determinar el genotipo del color del pelaje de un cerdo.
DE69734586D1 (de) Retinoid-metabolisierendes protein
AU2003201908A1 (en) Method of constructing stereostructure of protein having plural number of chains
Pagani et al. Reference proteome of highly purified human Th1 cells reveals strong effects on metabolism and protein ubiquitination upon differentiation