ES2128395T3 - Aparato y metodo a utilizar en la alineacion de cuerpos de texto bilingues. - Google Patents

Aparato y metodo a utilizar en la alineacion de cuerpos de texto bilingues.

Info

Publication number
ES2128395T3
ES2128395T3 ES93308661T ES93308661T ES2128395T3 ES 2128395 T3 ES2128395 T3 ES 2128395T3 ES 93308661 T ES93308661 T ES 93308661T ES 93308661 T ES93308661 T ES 93308661T ES 2128395 T3 ES2128395 T3 ES 2128395T3
Authority
ES
Spain
Prior art keywords
files
commands
text
alignment
similar
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Expired - Lifetime
Application number
ES93308661T
Other languages
English (en)
Inventor
Timothy Francis O'donoghue
Thomas Juliusz Wachtel
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Canon Technology Europe Ltd
Canon Europa NV
Original Assignee
Canon Research Centre Europe Ltd
Canon Europa NV
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Canon Research Centre Europe Ltd, Canon Europa NV filed Critical Canon Research Centre Europe Ltd
Application granted granted Critical
Publication of ES2128395T3 publication Critical patent/ES2128395T3/es
Anticipated expiration legal-status Critical
Expired - Lifetime legal-status Critical Current

Links

Classifications

    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F40/00Handling natural language data
    • G06F40/10Text processing
    • G06F40/103Formatting, i.e. changing of presentation of documents
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F40/00Handling natural language data
    • G06F40/40Processing or translation of natural language
    • G06F40/42Data-driven translation
    • G06F40/45Example-based machine translation; Alignment

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Health & Medical Sciences (AREA)
  • Computational Linguistics (AREA)
  • General Health & Medical Sciences (AREA)
  • Physics & Mathematics (AREA)
  • General Engineering & Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Artificial Intelligence (AREA)
  • Document Processing Apparatus (AREA)
  • Machine Translation (AREA)
  • Information Retrieval, Db Structures And Fs Structures Therefor (AREA)

Abstract

SE LEEN UNA PLURALIDAD DE ARCHIVOS DE TEXTO FUENTE, QUE CONTIENEN INFORMACION SIMILAR PERO ESTAN ESCRITOS EN DIFERENTES LENGUAJES NATURALES. LOS ARCHIVOS TIENEN FORMATOS CORRELACIONADOS YA QUE SE UTILIZAN LOS MISMOS COMANDOS DE FORMATO EN PUNTOS PARECIDOS DE LOS ARCHIVOS. LOS TEXTOS QUE SON SIMILARES, DE LOS ARCHIVOS RESPECTIVOS, SE ALINEAN A BASE DE IDENTIFICAR SU POSICION ENTRE COMANDOS DE PROCESAMIENTO DE TEXTOS EQUIVALENTES. PREFERENTEMENTE SE CREAN ARCHIVOS INTERMEDIOS EN QUE LOS COMANDOS DE PROCESAMIENTO DE TEXTOS (WP) SE CONVIERTEN A UNA FORMA IDENTIFICABLE. SE IDENTIFICA EL TEXTO ALINEADO, QUE DIFIERE DE LOS ARCHIVOS INTERMEDIOS MIENTRAS QUE LOS COMANDOS DE WP NO DIFIEREN, MEDIANTE UNA OPERACION DE COMPARACION DIFERENCIAL, TAL COMO UNA LLAMADA A UN DIFF DENTRO DE UN ENTORNO UNIX.
ES93308661T 1992-10-30 1993-10-29 Aparato y metodo a utilizar en la alineacion de cuerpos de texto bilingues. Expired - Lifetime ES2128395T3 (es)

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
GB9222768A GB2272091B (en) 1992-10-30 1992-10-30 Apparatus for use in aligning bilingual corpora

Publications (1)

Publication Number Publication Date
ES2128395T3 true ES2128395T3 (es) 1999-05-16

Family

ID=10724271

Family Applications (1)

Application Number Title Priority Date Filing Date
ES93308661T Expired - Lifetime ES2128395T3 (es) 1992-10-30 1993-10-29 Aparato y metodo a utilizar en la alineacion de cuerpos de texto bilingues.

Country Status (5)

Country Link
US (1) US5893134A (es)
EP (1) EP0597611B1 (es)
DE (1) DE69322741T2 (es)
ES (1) ES2128395T3 (es)
GB (1) GB2272091B (es)

Families Citing this family (68)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
DE69837979T2 (de) * 1997-06-27 2008-03-06 International Business Machines Corp. System zum Extrahieren einer mehrsprachigen Terminologie
JPH1185756A (ja) * 1997-09-03 1999-03-30 Sharp Corp 翻訳装置及び翻訳装置制御プログラムを記憶した媒体
US6151022A (en) * 1997-12-01 2000-11-21 Microsoft Corporation Method and apparatus for statically testing visual resources
US6345243B1 (en) * 1998-05-27 2002-02-05 Lionbridge Technologies, Inc. System, method, and product for dynamically propagating translations in a translation-memory system
US6345244B1 (en) * 1998-05-27 2002-02-05 Lionbridge Technologies, Inc. System, method, and product for dynamically aligning translations in a translation-memory system
US6658626B1 (en) 1998-07-31 2003-12-02 The Regents Of The University Of California User interface for displaying document comparison information
US6240409B1 (en) 1998-07-31 2001-05-29 The Regents Of The University Of California Method and apparatus for detecting and summarizing document similarity within large document sets
US6493709B1 (en) * 1998-07-31 2002-12-10 The Regents Of The University Of California Method and apparatus for digitally shredding similar documents within large document sets in a data processing environment
US6490051B1 (en) * 1998-09-21 2002-12-03 Microsoft Corporation Printer driver and method for supporting worldwide single binary font format with built in support for double byte characters
GB9821969D0 (en) 1998-10-08 1998-12-02 Canon Kk Apparatus and method for processing natural language
US7133873B1 (en) * 1999-12-14 2006-11-07 United Parcel Service Of America, Inc. System and method for modifying output of computer program without source code modifications
US6473729B1 (en) * 1999-12-20 2002-10-29 Xerox Corporation Word phrase translation using a phrase index
US6484178B1 (en) * 1999-12-30 2002-11-19 The Merallis Company Universal claims formatter
US8706747B2 (en) 2000-07-06 2014-04-22 Google Inc. Systems and methods for searching using queries written in a different character-set and/or language from the target pages
US7137108B1 (en) * 2000-10-26 2006-11-14 International Business Machines Corporation Identifying non-externalized text strings that are not hard-coded
US7016829B2 (en) * 2001-05-04 2006-03-21 Microsoft Corporation Method and apparatus for unsupervised training of natural language processing units
WO2003005166A2 (en) 2001-07-03 2003-01-16 University Of Southern California A syntax-based statistical translation model
US7146321B2 (en) * 2001-10-31 2006-12-05 Dictaphone Corporation Distributed speech recognition system
US7133829B2 (en) * 2001-10-31 2006-11-07 Dictaphone Corporation Dynamic insertion of a speech recognition engine within a distributed speech recognition system
US6785654B2 (en) 2001-11-30 2004-08-31 Dictaphone Corporation Distributed speech recognition system with speech recognition engines offering multiple functionalities
US6766294B2 (en) * 2001-11-30 2004-07-20 Dictaphone Corporation Performance gauge for a distributed speech recognition system
US20030128856A1 (en) * 2002-01-08 2003-07-10 Boor Steven E. Digitally programmable gain amplifier
US20030154071A1 (en) * 2002-02-11 2003-08-14 Shreve Gregory M. Process for the document management and computer-assisted translation of documents utilizing document corpora constructed by intelligent agents
AU2003269808A1 (en) * 2002-03-26 2004-01-06 University Of Southern California Constructing a translation lexicon from comparable, non-parallel corpora
US7236931B2 (en) * 2002-05-01 2007-06-26 Usb Ag, Stamford Branch Systems and methods for automatic acoustic speaker adaptation in computer-assisted transcription systems
US7292975B2 (en) * 2002-05-01 2007-11-06 Nuance Communications, Inc. Systems and methods for evaluating speaker suitability for automatic speech recognition aided transcription
JP3943582B2 (ja) * 2003-05-30 2007-07-11 富士通株式会社 対訳文対応付け装置
US8548794B2 (en) * 2003-07-02 2013-10-01 University Of Southern California Statistical noun phrase translation
US7711545B2 (en) * 2003-07-02 2010-05-04 Language Weaver, Inc. Empirical methods for splitting compound words with application to machine translation
US20070033154A1 (en) * 2003-10-29 2007-02-08 Trainum Michael W System and method managing documents
US8296127B2 (en) 2004-03-23 2012-10-23 University Of Southern California Discovery of parallel text portions in comparable collections of corpora and training using comparable texts
US8666725B2 (en) 2004-04-16 2014-03-04 University Of Southern California Selection and use of nonstatistical translation components in a statistical machine translation framework
US8972444B2 (en) 2004-06-25 2015-03-03 Google Inc. Nonstandard locality-based text entry
US8392453B2 (en) 2004-06-25 2013-03-05 Google Inc. Nonstandard text entry
DE112005002534T5 (de) * 2004-10-12 2007-11-08 University Of Southern California, Los Angeles Training für eine Text-Text-Anwendung, die eine Zeichenketten-Baum-Umwandlung zum Training und Decodieren verwendet
US7680646B2 (en) * 2004-12-21 2010-03-16 Xerox Corporation Retrieval method for translation memories containing highly structured documents
US7882116B2 (en) 2005-05-18 2011-02-01 International Business Machines Corporation Method for localization of programming modeling resources
US8676563B2 (en) 2009-10-01 2014-03-18 Language Weaver, Inc. Providing human-generated and machine-generated trusted translations
US8886517B2 (en) 2005-06-17 2014-11-11 Language Weaver, Inc. Trust scoring for language translation systems
US7624020B2 (en) * 2005-09-09 2009-11-24 Language Weaver, Inc. Adapter for allowing both online and offline training of a text to text system
US8032372B1 (en) 2005-09-13 2011-10-04 Escription, Inc. Dictation selection
US10319252B2 (en) 2005-11-09 2019-06-11 Sdl Inc. Language capability assessment and training apparatus and techniques
CN101030196B (zh) * 2006-02-28 2010-05-12 株式会社东芝 训练双语词对齐模型的方法和装置、双语词对齐方法和装置
CN101030197A (zh) * 2006-02-28 2007-09-05 株式会社东芝 双语词对齐方法和装置、训练双语词对齐模型的方法和装置
US8943080B2 (en) 2006-04-07 2015-01-27 University Of Southern California Systems and methods for identifying parallel documents and sentence fragments in multilingual document collections
US7805289B2 (en) * 2006-07-10 2010-09-28 Microsoft Corporation Aligning hierarchal and sequential document trees to identify parallel data
US8886518B1 (en) 2006-08-07 2014-11-11 Language Weaver, Inc. System and method for capitalizing machine translated text
US7979425B2 (en) 2006-10-25 2011-07-12 Google Inc. Server-side match
US8433556B2 (en) 2006-11-02 2013-04-30 University Of Southern California Semi-supervised training for statistical word alignment
US9122674B1 (en) 2006-12-15 2015-09-01 Language Weaver, Inc. Use of annotations in statistical machine translation
US8468149B1 (en) 2007-01-26 2013-06-18 Language Weaver, Inc. Multi-lingual online community
US8615389B1 (en) 2007-03-16 2013-12-24 Language Weaver, Inc. Generation and exploitation of an approximate language model
US8831928B2 (en) * 2007-04-04 2014-09-09 Language Weaver, Inc. Customizable machine translation service
US8825466B1 (en) 2007-06-08 2014-09-02 Language Weaver, Inc. Modification of annotated bilingual segment pairs in syntax-based machine translation
US8990064B2 (en) 2009-07-28 2015-03-24 Language Weaver, Inc. Translating documents based on content
US8380486B2 (en) 2009-10-01 2013-02-19 Language Weaver, Inc. Providing machine-generated translations and corresponding trust levels
US10417646B2 (en) 2010-03-09 2019-09-17 Sdl Inc. Predicting the cost associated with translating textual content
TWI431491B (zh) * 2010-12-20 2014-03-21 King Yuan Electronics Co Ltd 晶圓機台測試檔案之比對裝置以及比對方法
US11003838B2 (en) 2011-04-18 2021-05-11 Sdl Inc. Systems and methods for monitoring post translation editing
US8694303B2 (en) 2011-06-15 2014-04-08 Language Weaver, Inc. Systems and methods for tuning parameters in statistical machine translation
US9305082B2 (en) * 2011-09-30 2016-04-05 Thomson Reuters Global Resources Systems, methods, and interfaces for analyzing conceptually-related portions of text
US8886515B2 (en) 2011-10-19 2014-11-11 Language Weaver, Inc. Systems and methods for enhancing machine translation post edit review processes
US8942973B2 (en) 2012-03-09 2015-01-27 Language Weaver, Inc. Content page URL translation
CN103383680A (zh) * 2012-05-02 2013-11-06 成都勤智数码科技股份有限公司 It运维系统的多语化系统
CN103383679A (zh) * 2012-05-02 2013-11-06 成都勤智数码科技股份有限公司 多语统一的it运维知识库实现方法
US10261994B2 (en) 2012-05-25 2019-04-16 Sdl Inc. Method and system for automatic management of reputation of translators
US9152622B2 (en) 2012-11-26 2015-10-06 Language Weaver, Inc. Personalized machine translation via online adaptation
US9213694B2 (en) 2013-10-10 2015-12-15 Language Weaver, Inc. Efficient online domain adaptation

Family Cites Families (14)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JPS58101365A (ja) * 1981-12-14 1983-06-16 Hitachi Ltd 機械翻訳システム
US4734036A (en) * 1984-11-30 1988-03-29 Helene Kasha Method and device for learning a language
JPH0664585B2 (ja) * 1984-12-25 1994-08-22 株式会社東芝 翻訳編集装置
JPS61223978A (ja) * 1985-03-29 1986-10-04 Toshiba Corp 翻訳表示装置
US5351189A (en) * 1985-03-29 1994-09-27 Kabushiki Kaisha Toshiba Machine translation system including separated side-by-side display of original and corresponding translated sentences
US4965763A (en) * 1987-03-03 1990-10-23 International Business Machines Corporation Computer method for automatic extraction of commonly specified information from business correspondence
NL8900600A (nl) * 1989-03-13 1990-10-01 Bso Buro Voor Systeemontwikkel Tweetalige kennisbank.
GB9103080D0 (en) * 1991-02-14 1991-04-03 British And Foreign Bible The Analysing textual documents
JPH04264971A (ja) * 1991-02-20 1992-09-21 Nippon Computer Kenkyusho:Kk 学習型共起辞書作成装置
US5477451A (en) * 1991-07-25 1995-12-19 International Business Machines Corp. Method and system for natural language translation
JPH05151260A (ja) * 1991-11-29 1993-06-18 Hitachi Ltd 翻訳テンプレート学習方法および翻訳テンプレート学習システム
JP3220560B2 (ja) * 1992-05-26 2001-10-22 シャープ株式会社 機械翻訳装置
WO1994025921A1 (en) * 1993-04-26 1994-11-10 Taligent, Inc. Text transliteration system
US5331556A (en) * 1993-06-28 1994-07-19 General Electric Company Method for natural language data processing using morphological and part-of-speech information

Also Published As

Publication number Publication date
EP0597611B1 (en) 1998-12-23
US5893134A (en) 1999-04-06
GB2272091B (en) 1996-10-23
DE69322741D1 (de) 1999-02-04
GB9222768D0 (en) 1992-12-09
EP0597611A2 (en) 1994-05-18
DE69322741T2 (de) 1999-06-02
EP0597611A3 (en) 1994-09-21
GB2272091A (en) 1994-05-04

Similar Documents

Publication Publication Date Title
ES2128395T3 (es) Aparato y metodo a utilizar en la alineacion de cuerpos de texto bilingues.
ES2189962T3 (es) Metodo y sistema para visualizar una guia electronica de programas.
ES2072261T3 (es) Sistema de composicion de documento que utiliza formatos designados y conjunto de caracteres designado.
EP0569133A3 (es)
ATE538440T1 (de) Interaktive, elektronisch präsentierte karte
ATE392667T1 (de) Verfahren und computersystem zum indexieren strukturierter dokumente
KR920004953A (ko) 문자처리방법 및 장치
ES2240190T3 (es) Metodo para determinacion de atributos de poblacion y programa de ordenador para realizar el metodo.
MX165527B (es) Sistema de informacion direccional
ES8609699A1 (es) Un dispositivo y un metodo para extraer por cartografia la informacion de una imagen sobre un plano.
EA200400472A1 (ru) Способ и система для обеспечения тактической информации в случае кризисных ситуаций
WO2001022284A3 (en) A generalized system for automatically hyperlinking multimedia product documents
ES2159140T3 (es) Tratamiento de datos digitales e informacion sobre guia de programas.
GB2336984A (en) Method for rendering shadows on a graphical display
DE69714020D1 (de) Rückgewinnung von taxanen aus koniferen
ES2185375T3 (es) Sistema de aplicacion cooperativa de television con multiples dispositivos de equipos de television de usuarios.
DE602004010807D1 (de) Techniken zur bereitstellung eines virtuellen arbeitsraums, bestehend aus einer vielzahl elektronischer einrichtungen
CA2266724A1 (en) System for translating visual display object files and method of operation thereof
DE69834574D1 (de) Miscmicellen aus lipopeptiden für induzierung einer immunantwort
ES2167026T3 (es) Procedimiento y terminal para la asignacion espacial de informaciones referidas a un lugar.
WO2003058513A3 (en) System and method for compiling and displaying a pruned information set
ES2187504T3 (es) Vectores no oncovirales, basados en el vmmp, con encapsidacion defectuosa.
SE0001034L (sv) Anordning och datorprogram för att understödja läsning av en text
KR20040090867A (ko) 그레비티 레이어 방식을 이용한 xsl 스타일 시트의자동 생성 방법
PE20010812A1 (es) Metodo para obtener informacion de consumidores

Legal Events

Date Code Title Description
FG2A Definitive protection

Ref document number: 597611

Country of ref document: ES