ATE400047T1 - Verfahren und system zum automatischen bereitstellen linguistischer formulierungen, die ausserhalb einer erkennungsdomäne eines automatischen spracherkennungssystems liegen - Google Patents

Verfahren und system zum automatischen bereitstellen linguistischer formulierungen, die ausserhalb einer erkennungsdomäne eines automatischen spracherkennungssystems liegen

Info

Publication number
ATE400047T1
ATE400047T1 AT05716729T AT05716729T ATE400047T1 AT E400047 T1 ATE400047 T1 AT E400047T1 AT 05716729 T AT05716729 T AT 05716729T AT 05716729 T AT05716729 T AT 05716729T AT E400047 T1 ATE400047 T1 AT E400047T1
Authority
AT
Austria
Prior art keywords
speech recognition
recognition
outside
automatically providing
domain
Prior art date
Application number
AT05716729T
Other languages
English (en)
Inventor
Daniele Colibro
Claudio Vair
Luciano Fissore
Cosmin Popovici
Original Assignee
Loquendo Spa
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Loquendo Spa filed Critical Loquendo Spa
Application granted granted Critical
Publication of ATE400047T1 publication Critical patent/ATE400047T1/de

Links

Classifications

    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00Speech recognition
    • G10L15/08Speech classification or search
    • G10L15/18Speech classification or search using natural language modelling
    • G10L15/183Speech classification or search using natural language modelling using context dependencies, e.g. language models
    • G10L15/19Grammatical context, e.g. disambiguation of the recognition hypotheses based on word sequence rules
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00Speech recognition
    • G10L15/08Speech classification or search
    • G10L15/18Speech classification or search using natural language modelling
    • G10L15/183Speech classification or search using natural language modelling using context dependencies, e.g. language models
    • G10L15/187Phonemic context, e.g. pronunciation rules, phonotactical constraints or phoneme n-grams
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00Speech recognition
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00Speech recognition
    • G10L15/06Creation of reference templates; Training of speech recognition systems, e.g. adaptation to the characteristics of the speaker's voice

Landscapes

  • Engineering & Computer Science (AREA)
  • Artificial Intelligence (AREA)
  • Computational Linguistics (AREA)
  • Health & Medical Sciences (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Human Computer Interaction (AREA)
  • Physics & Mathematics (AREA)
  • Acoustics & Sound (AREA)
  • Multimedia (AREA)
  • Machine Translation (AREA)
  • Data Exchanges In Wide-Area Networks (AREA)
AT05716729T 2005-02-17 2005-02-17 Verfahren und system zum automatischen bereitstellen linguistischer formulierungen, die ausserhalb einer erkennungsdomäne eines automatischen spracherkennungssystems liegen ATE400047T1 (de)

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
PCT/EP2005/050712 WO2006087040A1 (en) 2005-02-17 2005-02-17 Method and system for automatically providing linguistic formulations that are outside a recognition domain of an automatic speech recognition system

Publications (1)

Publication Number Publication Date
ATE400047T1 true ATE400047T1 (de) 2008-07-15

Family

ID=34960407

Family Applications (1)

Application Number Title Priority Date Filing Date
AT05716729T ATE400047T1 (de) 2005-02-17 2005-02-17 Verfahren und system zum automatischen bereitstellen linguistischer formulierungen, die ausserhalb einer erkennungsdomäne eines automatischen spracherkennungssystems liegen

Country Status (7)

Country Link
US (1) US9224391B2 (de)
EP (1) EP1851756B1 (de)
AT (1) ATE400047T1 (de)
CA (1) CA2597803C (de)
DE (1) DE602005007939D1 (de)
ES (1) ES2309728T3 (de)
WO (1) WO2006087040A1 (de)

Families Citing this family (15)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
EP1851756B1 (de) * 2005-02-17 2008-07-02 Loquendo S.p.A. Verfahren und system zum automatischen bereitstellen linguistischer formulierungen, die ausserhalb einer erkennungsdomäne eines automatischen spracherkennungssystems liegen
JP5088701B2 (ja) * 2006-05-31 2012-12-05 日本電気株式会社 言語モデル学習システム、言語モデル学習方法、および言語モデル学習用プログラム
US8135590B2 (en) 2007-01-11 2012-03-13 Microsoft Corporation Position-dependent phonetic models for reliable pronunciation identification
GB2471811B (en) * 2008-05-09 2012-05-16 Fujitsu Ltd Speech recognition dictionary creating support device,computer readable medium storing processing program, and processing method
US8364481B2 (en) 2008-07-02 2013-01-29 Google Inc. Speech recognition with parallel recognition tasks
US8478592B2 (en) * 2008-07-08 2013-07-02 Nuance Communications, Inc. Enhancing media playback with speech recognition
JP4703787B2 (ja) * 2009-01-28 2011-06-15 三菱電機株式会社 音声認識装置
US9280969B2 (en) * 2009-06-10 2016-03-08 Microsoft Technology Licensing, Llc Model training for automatic speech recognition from imperfect transcription data
US10957310B1 (en) 2012-07-23 2021-03-23 Soundhound, Inc. Integrated programming framework for speech and text understanding with meaning parsing
JP6233798B2 (ja) * 2013-09-11 2017-11-22 インターナショナル・ビジネス・マシーンズ・コーポレーションInternational Business Machines Corporation データを変換する装置及び方法
US11295730B1 (en) * 2014-02-27 2022-04-05 Soundhound, Inc. Using phonetic variants in a local context to improve natural language understanding
US10614108B2 (en) * 2015-11-10 2020-04-07 International Business Machines Corporation User interface for streaming spoken query
CN113383384A (zh) * 2019-01-25 2021-09-10 索美智能有限公司 语音动画的实时生成
US11443734B2 (en) 2019-08-26 2022-09-13 Nice Ltd. System and method for combining phonetic and automatic speech recognition search
US20220383868A1 (en) * 2021-05-31 2022-12-01 Analog Devices, Inc. Natural language interfaces

Family Cites Families (52)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US4723290A (en) * 1983-05-16 1988-02-02 Kabushiki Kaisha Toshiba Speech recognition apparatus
US4882757A (en) * 1986-04-25 1989-11-21 Texas Instruments Incorporated Speech recognition system
US4977598A (en) * 1989-04-13 1990-12-11 Texas Instruments Incorporated Efficient pruning algorithm for hidden markov model speech recognition
US5349645A (en) * 1991-12-31 1994-09-20 Matsushita Electric Industrial Co., Ltd. Word hypothesizer for continuous speech decoding using stressed-vowel centered bidirectional tree searches
US5384893A (en) * 1992-09-23 1995-01-24 Emerson & Stern Associates, Inc. Method and apparatus for speech synthesis based on prosodic analysis
US5878164A (en) * 1994-01-21 1999-03-02 Lucent Technologies Inc. Interleaved segmental method for handwriting recognition
CA2372671C (en) * 1994-10-25 2007-01-02 British Telecommunications Public Limited Company Voice-operated services
US5617488A (en) * 1995-02-01 1997-04-01 The Research Foundation Of State University Of New York Relaxation word recognizer
US5710866A (en) * 1995-05-26 1998-01-20 Microsoft Corporation System and method for speech recognition using dynamically adjusted confidence measure
US5806029A (en) * 1995-09-15 1998-09-08 At&T Corp Signal conditioned minimum error rate training for continuous speech recognition
US5799276A (en) * 1995-11-07 1998-08-25 Accent Incorporated Knowledge-based speech recognition system and methods having frame length computed based upon estimated pitch period of vocalic intervals
US5797123A (en) * 1996-10-01 1998-08-18 Lucent Technologies Inc. Method of key-phase detection and verification for flexible speech understanding
US5884259A (en) * 1997-02-12 1999-03-16 International Business Machines Corporation Method and apparatus for a time-synchronous tree-based search strategy
US6006181A (en) * 1997-09-12 1999-12-21 Lucent Technologies Inc. Method and apparatus for continuous speech recognition using a layered, self-adjusting decoder network
US6108410A (en) * 1997-09-16 2000-08-22 Nynex Science And Technology Inc. Methods and apparatus for automating the detection, reporting and correction of operator input errors
US6757652B1 (en) * 1998-03-03 2004-06-29 Koninklijke Philips Electronics N.V. Multiple stage speech recognizer
US7043426B2 (en) * 1998-04-01 2006-05-09 Cyberpulse, L.L.C. Structured speech recognition
ITTO980383A1 (it) * 1998-05-07 1999-11-07 Cselt Centro Studi Lab Telecom Procedimento e dispositivo di riconoscimento vocale con doppio passo di riconoscimento neurale e markoviano.
DE19842151A1 (de) * 1998-09-15 2000-03-23 Philips Corp Intellectual Pty Verfahren zur Adaption von linguistischen Sprachmodellen
US6188976B1 (en) * 1998-10-23 2001-02-13 International Business Machines Corporation Apparatus and method for building domain-specific language models
US6438520B1 (en) * 1999-01-20 2002-08-20 Lucent Technologies Inc. Apparatus, method and system for cross-speaker speech recognition for telecommunication applications
US6356865B1 (en) * 1999-01-29 2002-03-12 Sony Corporation Method and apparatus for performing spoken language translation
US6282507B1 (en) * 1999-01-29 2001-08-28 Sony Corporation Method and apparatus for interactive source language expression recognition and alternative hypothesis presentation and selection
CA2376277C (en) * 1999-06-11 2011-03-15 Telstra New Wave Pty Ltd A method of developing an interactive system
US6691089B1 (en) * 1999-09-30 2004-02-10 Mindspeed Technologies Inc. User configurable levels of security for a speaker verification system
AU2001268293A1 (en) * 2000-06-12 2001-12-24 L And H Holdings Usa, Inc. Using utterance-level confidence estimates
JP3379090B2 (ja) * 2001-03-02 2003-02-17 インターナショナル・ビジネス・マシーンズ・コーポレーション 機械翻訳システム、機械翻訳方法、及び機械翻訳用プログラム
WO2002086864A1 (en) * 2001-04-18 2002-10-31 Rutgers, The State University Of New Jersey System and method for adaptive language understanding by computers
US20030009335A1 (en) * 2001-07-05 2003-01-09 Johan Schalkwyk Speech recognition with dynamic grammars
WO2004023455A2 (en) * 2002-09-06 2004-03-18 Voice Signal Technologies, Inc. Methods, systems, and programming for performing speech recognition
US7016849B2 (en) * 2002-03-25 2006-03-21 Sri International Method and apparatus for providing speech-driven routing between spoken language applications
US7092883B1 (en) * 2002-03-29 2006-08-15 At&T Generating confidence scores from word lattices
US7197457B2 (en) * 2003-04-30 2007-03-27 Robert Bosch Gmbh Method for statistical language modeling in speech recognition
US7603267B2 (en) * 2003-05-01 2009-10-13 Microsoft Corporation Rules-based grammar for slots and statistical model for preterminals in natural language understanding system
US7383172B1 (en) * 2003-08-15 2008-06-03 Patrick William Jamieson Process and system for semantically recognizing, correcting, and suggesting domain specific speech
GB0406619D0 (en) * 2004-03-24 2004-04-28 British Telecomm Induction of grammar rules
US20060009974A1 (en) * 2004-07-09 2006-01-12 Matsushita Electric Industrial Co., Ltd. Hands-free voice dialing for portable and remote devices
US7574356B2 (en) * 2004-07-19 2009-08-11 At&T Intellectual Property Ii, L.P. System and method for spelling recognition using speech and non-speech input
US20070016401A1 (en) * 2004-08-12 2007-01-18 Farzad Ehsani Speech-to-speech translation system with user-modifiable paraphrasing grammars
CA2499305A1 (en) * 2005-03-04 2006-09-04 668158 B.C. Ltd. Method and apparatus for providing geographically targeted information and advertising
ATE403213T1 (de) * 2004-12-28 2008-08-15 Loquendo Spa System und verfahren zur automatischen spracherkennung
US7379870B1 (en) * 2005-02-03 2008-05-27 Hrl Laboratories, Llc Contextual filtering
EP1851756B1 (de) * 2005-02-17 2008-07-02 Loquendo S.p.A. Verfahren und system zum automatischen bereitstellen linguistischer formulierungen, die ausserhalb einer erkennungsdomäne eines automatischen spracherkennungssystems liegen
US7624020B2 (en) * 2005-09-09 2009-11-24 Language Weaver, Inc. Adapter for allowing both online and offline training of a text to text system
WO2007046267A1 (ja) * 2005-10-20 2007-04-26 Nec Corporation 音声判別システム、音声判別方法及び音声判別用プログラム
JP2009521736A (ja) * 2005-11-07 2009-06-04 スキャンスカウト,インコーポレイテッド リッチメディアと共に広告をレンダリングするための技術
US7899669B2 (en) * 2005-12-12 2011-03-01 Gregory John Gadbois Multi-voice speech recognition
JP5248867B2 (ja) * 2006-01-31 2013-07-31 本田技研工業株式会社 会話システムおよび会話ソフトウェア
EP2523442A1 (de) * 2006-02-10 2012-11-14 Spinvox Limited Benutzerunabhängiges, vorrichtungsunabhängiges Multiskala-Sprachnachrichten-zu-Text-Umwandlungssystem
US7890325B2 (en) * 2006-03-16 2011-02-15 Microsoft Corporation Subword unit posterior probability for measuring confidence
US20070226164A1 (en) * 2006-03-21 2007-09-27 Honeywell International Inc. Type variables and/or temporal constraints in plan recognition
US20080133245A1 (en) * 2006-12-04 2008-06-05 Sehda, Inc. Methods for speech-to-speech translation

Also Published As

Publication number Publication date
US20080270129A1 (en) 2008-10-30
US9224391B2 (en) 2015-12-29
CA2597803A1 (en) 2006-08-24
EP1851756B1 (de) 2008-07-02
ES2309728T3 (es) 2008-12-16
DE602005007939D1 (de) 2008-08-14
CA2597803C (en) 2014-05-13
EP1851756A1 (de) 2007-11-07
WO2006087040A1 (en) 2006-08-24

Similar Documents

Publication Publication Date Title
ATE400047T1 (de) Verfahren und system zum automatischen bereitstellen linguistischer formulierungen, die ausserhalb einer erkennungsdomäne eines automatischen spracherkennungssystems liegen
DE602005018552D1 (de) Verfahren zum anpassen eines neuronalen netzwerks einer automatischen spracherkennungseinrichtung
ATE403213T1 (de) System und verfahren zur automatischen spracherkennung
EP4625408A3 (de) Spracherkennungssysteme und -verfahren
WO2007115088A3 (en) A system and method for applying dynamic contextual grammars and language models to improve automatic speech recognition accuracy
SG11201912061WA (en) Automatically determining language for speech recognition of spoken utterance received via an automated assistant interface
WO2011074771A3 (ko) 외국어 학습 장치 및 그 제공 방법.
EP1696421A3 (de) Lernen zur Spracherkennung
EP4235648A3 (de) Beeinflussung eines sprachenmodells
EP4270385A3 (de) Automatische bestimmung der sprache zur spracherkennung von gesprochener äusserung, die über eine automatisierte assistentenschnittstelle empfangen wird
ATE524777T1 (de) Automatische aktualisierung eines sprachmodells
WO2007118020A3 (en) Method and system for managing pronunciation dictionaries in a speech application
DE602004018290D1 (de) Spracherkennungs- und korrektursystem, korrekturvorrichtung und verfahren zur erstellung eines lexikons von alternativen
ATE419616T1 (de) Verfahren, einrichtung und computerprogramm zur spracherkennung
WO2019161193A3 (en) System and method for adaptive detection of spoken language via multiple speech models
DE60228716D1 (de) Verfahren zum bereitstellen von kontoinformation und system zum aufschreiben von diktiertem text
DE602007004733D1 (de) Sprechererkennung
TW200638337A (en) Using a spoken utterance for disambiguation of spelling inputs into a speech recognition system
ATE297588T1 (de) Anpassung des phonetischen kontextes zur verbesserung der spracherkennung
WO2008084575A1 (ja) 車載用音声認識装置
ATE362632T1 (de) Nachrichtenübertragungsgerät
ATE457510T1 (de) Spracherkennungssystem mit riesigem vokabular
ATE470218T1 (de) System und verfahren zur verbesserung der genauigkeit der spracherkennung
TW200630958A (en) Method and device of speech recognition and language-understanding analysis and nature-language dialogue system using the method
BRPI0406937A (pt) Método e aparelho para supressão de ruìdo dentro de um sistema de reconhecimento de fala distribuìdo

Legal Events

Date Code Title Description
RER Ceased as to paragraph 5 lit. 3 law introducing patent treaties