ATE400047T1 - Method and system for automatically providing linguistic formulations that are outside a recognition domain of an automatic speech recognition system
Info
- Publication number
- ATE400047T1
- Authority
- AT
- Austria
- Prior art keywords
- speech recognition
- recognition
- outside
- automatically providing
- domain
- Prior art date
Classifications
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/06—Creation of reference templates; Training of speech recognition systems, e.g. adaptation to the characteristics of the speaker's voice
- G10L15/08—Speech classification or search
- G10L15/18—Speech classification or search using natural language modelling
- G10L15/183—Speech classification or search using natural language modelling using context dependencies, e.g. language models
- G10L15/187—Phonemic context, e.g. pronunciation rules, phonotactical constraints or phoneme n-grams
- G10L15/19—Grammatical context, e.g. disambiguation of the recognition hypotheses based on word sequence rules
Landscapes
- Engineering & Computer Science (AREA)
- Artificial Intelligence (AREA)
- Computational Linguistics (AREA)
- Health & Medical Sciences (AREA)
- Audiology, Speech & Language Pathology (AREA)
- Human Computer Interaction (AREA)
- Physics & Mathematics (AREA)
- Acoustics & Sound (AREA)
- Multimedia (AREA)
- Machine Translation (AREA)
- Data Exchanges In Wide-Area Networks (AREA)
Applications Claiming Priority (1)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| PCT/EP2005/050712 WO2006087040A1 (en) | 2005-02-17 | 2005-02-17 | Method and system for automatically providing linguistic formulations that are outside a recognition domain of an automatic speech recognition system |
Publications (1)
| Publication Number | Publication Date |
|---|---|
| ATE400047T1 (de) | 2008-07-15 |
Family
ID=34960407
Family Applications (1)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| AT05716729T ATE400047T1 (de) | 2005-02-17 | 2005-02-17 | Method and system for automatically providing linguistic formulations that are outside a recognition domain of an automatic speech recognition system |
Country Status (7)
| Country | Link |
|---|---|
| US (1) | US9224391B2 (de) |
| EP (1) | EP1851756B1 (de) |
| AT (1) | ATE400047T1 (de) |
| CA (1) | CA2597803C (de) |
| DE (1) | DE602005007939D1 (de) |
| ES (1) | ES2309728T3 (de) |
| WO (1) | WO2006087040A1 (de) |
Families Citing this family (15)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| WO2006087040A1 (en) * | 2005-02-17 | 2006-08-24 | Loquendo S.P.A. | Method and system for automatically providing linguistic formulations that are outside a recognition domain of an automatic speech recognition system |
| JP5088701B2 (ja) * | 2006-05-31 | 2012-12-05 | 日本電気株式会社 | Language model learning system, language model learning method, and language model learning program |
| US8135590B2 (en) | 2007-01-11 | 2012-03-13 | Microsoft Corporation | Position-dependent phonetic models for reliable pronunciation identification |
| GB2471811B (en) * | 2008-05-09 | 2012-05-16 | Fujitsu Ltd | Speech recognition dictionary creating support device,computer readable medium storing processing program, and processing method |
| US8364481B2 (en) * | 2008-07-02 | 2013-01-29 | Google Inc. | Speech recognition with parallel recognition tasks |
| US8478592B2 (en) * | 2008-07-08 | 2013-07-02 | Nuance Communications, Inc. | Enhancing media playback with speech recognition |
| US8099290B2 (en) * | 2009-01-28 | 2012-01-17 | Mitsubishi Electric Corporation | Voice recognition device |
| US9280969B2 (en) * | 2009-06-10 | 2016-03-08 | Microsoft Technology Licensing, Llc | Model training for automatic speech recognition from imperfect transcription data |
| US10957310B1 (en) | 2012-07-23 | 2021-03-23 | Soundhound, Inc. | Integrated programming framework for speech and text understanding with meaning parsing |
| JP6233798B2 (ja) | 2013-09-11 | 2017-11-22 | International Business Machines Corporation | Apparatus and method for converting data |
| US11295730B1 (en) * | 2014-02-27 | 2022-04-05 | Soundhound, Inc. | Using phonetic variants in a local context to improve natural language understanding |
| US10614108B2 (en) * | 2015-11-10 | 2020-04-07 | International Business Machines Corporation | User interface for streaming spoken query |
| CN113383384A (zh) * | 2019-01-25 | 2021-09-10 | 索美智能有限公司 | Real-time generation of speech animation |
| US11443734B2 (en) | 2019-08-26 | 2022-09-13 | Nice Ltd. | System and method for combining phonetic and automatic speech recognition search |
| US20220383868A1 (en) * | 2021-05-31 | 2022-12-01 | Analog Devices, Inc. | Natural language interfaces |
Family Cites Families (52)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| US4723290A (en) * | 1983-05-16 | 1988-02-02 | Kabushiki Kaisha Toshiba | Speech recognition apparatus |
| US4882757A (en) * | 1986-04-25 | 1989-11-21 | Texas Instruments Incorporated | Speech recognition system |
| US4977598A (en) * | 1989-04-13 | 1990-12-11 | Texas Instruments Incorporated | Efficient pruning algorithm for hidden markov model speech recognition |
| US5349645A (en) * | 1991-12-31 | 1994-09-20 | Matsushita Electric Industrial Co., Ltd. | Word hypothesizer for continuous speech decoding using stressed-vowel centered bidirectional tree searches |
| US5384893A (en) * | 1992-09-23 | 1995-01-24 | Emerson & Stern Associates, Inc. | Method and apparatus for speech synthesis based on prosodic analysis |
| US5878164A (en) * | 1994-01-21 | 1999-03-02 | Lucent Technologies Inc. | Interleaved segmental method for handwriting recognition |
| EP0800698B1 (de) * | 1994-10-25 | 2002-01-23 | BRITISH TELECOMMUNICATIONS public limited company | Announcement services with voice input |
| US5617488A (en) * | 1995-02-01 | 1997-04-01 | The Research Foundation Of State University Of New York | Relaxation word recognizer |
| US5710866A (en) * | 1995-05-26 | 1998-01-20 | Microsoft Corporation | System and method for speech recognition using dynamically adjusted confidence measure |
| US5806029A (en) * | 1995-09-15 | 1998-09-08 | At&T Corp | Signal conditioned minimum error rate training for continuous speech recognition |
| US5799276A (en) * | 1995-11-07 | 1998-08-25 | Accent Incorporated | Knowledge-based speech recognition system and methods having frame length computed based upon estimated pitch period of vocalic intervals |
| US5797123A (en) * | 1996-10-01 | 1998-08-18 | Lucent Technologies Inc. | Method of key-phase detection and verification for flexible speech understanding |
| US5884259A (en) * | 1997-02-12 | 1999-03-16 | International Business Machines Corporation | Method and apparatus for a time-synchronous tree-based search strategy |
| US6006181A (en) * | 1997-09-12 | 1999-12-21 | Lucent Technologies Inc. | Method and apparatus for continuous speech recognition using a layered, self-adjusting decoder network |
| US6108410A (en) * | 1997-09-16 | 2000-08-22 | Nynex Science And Technology Inc. | Methods and apparatus for automating the detection, reporting and correction of operator input errors |
| US6757652B1 (en) * | 1998-03-03 | 2004-06-29 | Koninklijke Philips Electronics N.V. | Multiple stage speech recognizer |
| US7043426B2 (en) * | 1998-04-01 | 2006-05-09 | Cyberpulse, L.L.C. | Structured speech recognition |
| ITTO980383A1 (it) * | 1998-05-07 | 1999-11-07 | Cselt Centro Studi Lab Telecom | Method and device for speech recognition with a double step of neural and Markovian recognition |
| DE19842151A1 (de) * | 1998-09-15 | 2000-03-23 | Philips Corp Intellectual Pty | Method for adapting linguistic language models |
| US6188976B1 (en) * | 1998-10-23 | 2001-02-13 | International Business Machines Corporation | Apparatus and method for building domain-specific language models |
| US6438520B1 (en) * | 1999-01-20 | 2002-08-20 | Lucent Technologies Inc. | Apparatus, method and system for cross-speaker speech recognition for telecommunication applications |
| US6282507B1 (en) * | 1999-01-29 | 2001-08-28 | Sony Corporation | Method and apparatus for interactive source language expression recognition and alternative hypothesis presentation and selection |
| US6356865B1 (en) * | 1999-01-29 | 2002-03-12 | Sony Corporation | Method and apparatus for performing spoken language translation |
| ATE411591T1 (de) * | 1999-06-11 | 2008-10-15 | Telstra Corp Ltd | Method for developing an interactive system |
| US6691089B1 (en) * | 1999-09-30 | 2004-02-10 | Mindspeed Technologies Inc. | User configurable levels of security for a speaker verification system |
| US7003456B2 (en) * | 2000-06-12 | 2006-02-21 | Scansoft, Inc. | Methods and systems of routing utterances based on confidence estimates |
| JP3379090B2 (ja) * | 2001-03-02 | 2003-02-17 | インターナショナル・ビジネス・マシーンズ・コーポレーション | Machine translation system, machine translation method, and machine translation program |
| WO2002086864A1 (en) * | 2001-04-18 | 2002-10-31 | Rutgers, The State University Of New Jersey | System and method for adaptive language understanding by computers |
| US20030009335A1 (en) * | 2001-07-05 | 2003-01-09 | Johan Schalkwyk | Speech recognition with dynamic grammars |
| US7225130B2 (en) * | 2001-09-05 | 2007-05-29 | Voice Signal Technologies, Inc. | Methods, systems, and programming for performing speech recognition |
| US7016849B2 (en) * | 2002-03-25 | 2006-03-21 | Sri International | Method and apparatus for providing speech-driven routing between spoken language applications |
| US7092883B1 (en) * | 2002-03-29 | 2006-08-15 | At&T | Generating confidence scores from word lattices |
| US7197457B2 (en) * | 2003-04-30 | 2007-03-27 | Robert Bosch Gmbh | Method for statistical language modeling in speech recognition |
| US7603267B2 (en) * | 2003-05-01 | 2009-10-13 | Microsoft Corporation | Rules-based grammar for slots and statistical model for preterminals in natural language understanding system |
| US7383172B1 (en) * | 2003-08-15 | 2008-06-03 | Patrick William Jamieson | Process and system for semantically recognizing, correcting, and suggesting domain specific speech |
| GB0406619D0 (en) * | 2004-03-24 | 2004-04-28 | British Telecomm | Induction of grammar rules |
| US20060009974A1 (en) * | 2004-07-09 | 2006-01-12 | Matsushita Electric Industrial Co., Ltd. | Hands-free voice dialing for portable and remote devices |
| US7574356B2 (en) * | 2004-07-19 | 2009-08-11 | At&T Intellectual Property Ii, L.P. | System and method for spelling recognition using speech and non-speech input |
| US20070016401A1 (en) * | 2004-08-12 | 2007-01-18 | Farzad Ehsani | Speech-to-speech translation system with user-modifiable paraphrasing grammars |
| CA2499305A1 (en) * | 2005-03-04 | 2006-09-04 | 668158 B.C. Ltd. | Method and apparatus for providing geographically targeted information and advertising |
| US7912713B2 (en) * | 2004-12-28 | 2011-03-22 | Loquendo S.P.A. | Automatic speech recognition system and method using weighted confidence measure |
| US7379870B1 (en) * | 2005-02-03 | 2008-05-27 | Hrl Laboratories, Llc | Contextual filtering |
| WO2006087040A1 (en) * | 2005-02-17 | 2006-08-24 | Loquendo S.P.A. | Method and system for automatically providing linguistic formulations that are outside a recognition domain of an automatic speech recognition system |
| US7624020B2 (en) * | 2005-09-09 | 2009-11-24 | Language Weaver, Inc. | Adapter for allowing both online and offline training of a text to text system |
| WO2007046267A1 (ja) * | 2005-10-20 | 2007-04-26 | Nec Corporation | Voice discrimination system, voice discrimination method, and voice discrimination program |
| WO2007056451A2 (en) * | 2005-11-07 | 2007-05-18 | Scanscout, Inc. | Techniques for rendering advertisments with rich media |
| DE602006010505D1 (de) * | 2005-12-12 | 2009-12-31 | Gregory John Gadbois | Multi-voice speech recognition |
| WO2007088877A1 (ja) * | 2006-01-31 | 2007-08-09 | Honda Motor Co., Ltd. | Conversation system and conversation software |
| EP2523441B1 (de) * | 2006-02-10 | 2014-01-29 | Nuance Communications, Inc. | User-independent, device-independent, multiscale voice-message-to-text conversion system |
| US7890325B2 (en) * | 2006-03-16 | 2011-02-15 | Microsoft Corporation | Subword unit posterior probability for measuring confidence |
| US20070226164A1 (en) * | 2006-03-21 | 2007-09-27 | Honeywell International Inc. | Type variables and/or temporal constraints in plan recognition |
| US20080133245A1 (en) * | 2006-12-04 | 2008-06-05 | Sehda, Inc. | Methods for speech-to-speech translation |
2005
- 2005-02-17 WO PCT/EP2005/050712 (WO2006087040A1): Ceased
- 2005-02-17 DE DE602005007939T (DE602005007939D1): Expired - Lifetime
- 2005-02-17 US US11/884,473 (US9224391B2): Expired - Fee Related
- 2005-02-17 CA CA2597803A (CA2597803C): Expired - Fee Related
- 2005-02-17 ES ES05716729T (ES2309728T3): Expired - Lifetime
- 2005-02-17 EP EP05716729A (EP1851756B1): Expired - Lifetime
- 2005-02-17 AT AT05716729T (ATE400047T1): IP Right Cessation
Also Published As
| Publication number | Publication date |
|---|---|
| US9224391B2 (en) | 2015-12-29 |
| WO2006087040A1 (en) | 2006-08-24 |
| EP1851756A1 (de) | 2007-11-07 |
| US20080270129A1 (en) | 2008-10-30 |
| CA2597803A1 (en) | 2006-08-24 |
| CA2597803C (en) | 2014-05-13 |
| ES2309728T3 (es) | 2008-12-16 |
| EP1851756B1 (de) | 2008-07-02 |
| DE602005007939D1 (de) | 2008-08-14 |
Similar Documents
| Publication | Publication Date | Title |
|---|---|---|
| ATE400047T1 (de) | | Method and system for automatically providing linguistic formulations that are outside a recognition domain of an automatic speech recognition system |
| DE602005018552D1 (de) | | Method for adapting a neural network of an automatic speech recognition device |
| ATE403213T1 (de) | | System and method for automatic speech recognition |
| WO2007115088A3 (en) | | A system and method for applying dynamic contextual grammars and language models to improve automatic speech recognition accuracy |
| SG11201912061WA (en) | | Automatically determining language for speech recognition of spoken utterance received via an automated assistant interface |
| EP1696421A3 (de) | | Learning for speech recognition |
| ATE524777T1 (de) | | Automatic updating of a language model |
| WO2007118020A3 (en) | | Method and system for managing pronunciation dictionaries in a speech application |
| EP4235648A3 (de) | | Biasing a language model |
| ATE419616T1 (de) | | Method, device and computer program for speech recognition |
| WO2019161193A3 (en) | | System and method for adaptive detection of spoken language via multiple speech models |
| ATE457511T1 (de) | | Speaker recognition |
| DE60228716D1 (de) | | Method for providing account information and system for transcribing dictated text |
| TW200638337A (en) | | Using a spoken utterance for disambiguation of spelling inputs into a speech recognition system |
| ATE417346T1 (de) | | Speech recognition and correction system, correction device and method for creating a lexicon of alternatives |
| ATE297588T1 (de) | | Adapting the phonetic context to improve speech recognition |
| WO2008084575A1 (ja) | | In-vehicle speech recognition device |
| ATE362632T1 (de) | | Message transmission device |
| ATE457510T1 (de) | | Speech recognition system with a huge vocabulary |
| DE602005021665D1 (de) | | System and method for improving the accuracy of speech recognition |
| BRPI0406937A (pt) | | Method and apparatus for noise suppression within a distributed speech recognition system |
| EP4053837A4 (de) | | Automatic speech recognizer and speech recognition method with keyboard macro function |
| ATE405920T1 (de) | | Generating a speech recognition grammar for alphanumeric expressions |
| DE602005019070D1 (de) | | Her einheiten und sprachsynthesevorrichtung |
| DE60219030D1 (de) | | Method for multilingual speech recognition |
Legal Events
| Date | Code | Title | Description |
|---|---|---|---|
| | RER | Ceased as to paragraph 5 lit. 3 law introducing patent treaties | |