ATE445215T1 - Spracherkennung für grosse dynamische vokabulare - Google Patents
Spracherkennung für grosse dynamische vokabulareInfo
- Publication number
- ATE445215T1 ATE445215T1 AT04767631T AT04767631T ATE445215T1 AT E445215 T1 ATE445215 T1 AT E445215T1 AT 04767631 T AT04767631 T AT 04767631T AT 04767631 T AT04767631 T AT 04767631T AT E445215 T1 ATE445215 T1 AT E445215T1
- Authority
- AT
- Austria
- Prior art keywords
- large dynamic
- language recognition
- markov
- vocabulary
- vocabularies
- Prior art date
Links
- 238000000034 method Methods 0.000 abstract 1
- 238000013138 pruning Methods 0.000 abstract 1
- 238000013518 transcription Methods 0.000 abstract 1
- 230000035897 transcription Effects 0.000 abstract 1
Classifications
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/08—Speech classification or search
- G10L15/14—Speech classification or search using statistical models, e.g. Hidden Markov Models [HMMs]
- G10L15/142—Hidden Markov Models [HMMs]
- G10L15/144—Training of HMMs
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/08—Speech classification or search
- G10L15/083—Recognition networks
Landscapes
- Engineering & Computer Science (AREA)
- Physics & Mathematics (AREA)
- Computational Linguistics (AREA)
- Health & Medical Sciences (AREA)
- Audiology, Speech & Language Pathology (AREA)
- Human Computer Interaction (AREA)
- Acoustics & Sound (AREA)
- Multimedia (AREA)
- Probability & Statistics with Applications (AREA)
- Machine Translation (AREA)
- Compression, Expansion, Code Conversion, And Decoders (AREA)
- Image Processing (AREA)
Applications Claiming Priority (2)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| FR0308341A FR2857528B1 (fr) | 2003-07-08 | 2003-07-08 | Reconnaissance vocale pour les larges vocabulaires dynamiques |
| PCT/FR2004/001799 WO2005006308A1 (fr) | 2003-07-08 | 2004-07-08 | Reconnaissance vocale pour les larges vocabulaires dynamiques |
Publications (1)
| Publication Number | Publication Date |
|---|---|
| ATE445215T1 true ATE445215T1 (de) | 2009-10-15 |
Family
ID=33522861
Family Applications (1)
| Application Number | Title | Priority Date | Filing Date |
|---|---|---|---|
| AT04767631T ATE445215T1 (de) | 2003-07-08 | 2004-07-08 | Spracherkennung für grosse dynamische vokabulare |
Country Status (8)
| Country | Link |
|---|---|
| US (1) | US20070038451A1 (de) |
| EP (1) | EP1642264B1 (de) |
| AT (1) | ATE445215T1 (de) |
| AU (1) | AU2004256561A1 (de) |
| CA (1) | CA2531496C (de) |
| DE (1) | DE602004023508D1 (de) |
| FR (1) | FR2857528B1 (de) |
| WO (1) | WO2005006308A1 (de) |
Families Citing this family (12)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| JP4579595B2 (ja) * | 2004-06-29 | 2010-11-10 | キヤノン株式会社 | 音声認識文法作成装置、音声認識文法作成方法、プログラム、及び記憶媒体 |
| DE602005012596D1 (de) * | 2004-10-19 | 2009-03-19 | France Telecom | Spracherkennungsverfahren mit temporaler markereinfügung und entsprechendes system |
| EP1890578B1 (de) | 2005-04-18 | 2010-02-10 | Koninklijke Philips Electronics N.V. | Kaffeemaschine mit mitteln zur erzeugung einer drehung in einem getränkestrom |
| US8510109B2 (en) | 2007-08-22 | 2013-08-13 | Canyon Ip Holdings Llc | Continuous speech transcription performance indication |
| US7902447B1 (en) * | 2006-10-03 | 2011-03-08 | Sony Computer Entertainment Inc. | Automatic composition of sound sequences using finite state automata |
| US9973450B2 (en) | 2007-09-17 | 2018-05-15 | Amazon Technologies, Inc. | Methods and systems for dynamically updating web service profile information by parsing transcribed message strings |
| US8447120B2 (en) * | 2008-10-04 | 2013-05-21 | Microsoft Corporation | Incremental feature indexing for scalable location recognition |
| KR20110006004A (ko) * | 2009-07-13 | 2011-01-20 | 삼성전자주식회사 | 결합인식단위 최적화 장치 및 그 방법 |
| US9063931B2 (en) * | 2011-02-16 | 2015-06-23 | Ming-Yuan Wu | Multiple language translation system |
| US8914286B1 (en) * | 2011-04-14 | 2014-12-16 | Canyon IP Holdings, LLC | Speech recognition with hierarchical networks |
| US9607612B2 (en) | 2013-05-20 | 2017-03-28 | Intel Corporation | Natural human-computer interaction for virtual personal assistant systems |
| CN107293298B (zh) * | 2016-04-05 | 2021-02-19 | 富泰华工业(深圳)有限公司 | 语音控制系统及方法 |
Family Cites Families (10)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| JPH0782348B2 (ja) * | 1992-03-21 | 1995-09-06 | 株式会社エイ・ティ・アール自動翻訳電話研究所 | 音声認識用サブワードモデル生成方法 |
| US6073095A (en) * | 1997-10-15 | 2000-06-06 | International Business Machines Corporation | Fast vocabulary independent method and apparatus for spotting words in speech |
| US5983180A (en) * | 1997-10-23 | 1999-11-09 | Softsound Limited | Recognition of sequential data using finite state sequence models organized in a tree structure |
| US6456970B1 (en) * | 1998-07-31 | 2002-09-24 | Texas Instruments Incorporated | Minimization of search network in speech recognition |
| US6324510B1 (en) * | 1998-11-06 | 2001-11-27 | Lernout & Hauspie Speech Products N.V. | Method and apparatus of hierarchically organizing an acoustic model for speech recognition and adaptation of the model to unseen domains |
| US6629073B1 (en) * | 2000-04-27 | 2003-09-30 | Microsoft Corporation | Speech recognition method and apparatus utilizing multi-unit models |
| AU2001262407A1 (en) * | 2000-05-23 | 2001-12-03 | Thomson Licensing S.A. | Dynamic language models for speech recognition |
| US7035802B1 (en) * | 2000-07-31 | 2006-04-25 | Matsushita Electric Industrial Co., Ltd. | Recognition system using lexical trees |
| US20020087313A1 (en) * | 2000-12-29 | 2002-07-04 | Lee Victor Wai Leung | Computer-implemented intelligent speech model partitioning method and system |
| JP2003208195A (ja) * | 2002-01-16 | 2003-07-25 | Sharp Corp | 連続音声認識装置および連続音声認識方法、連続音声認識プログラム、並びに、プログラム記録媒体 |
-
2003
- 2003-07-08 FR FR0308341A patent/FR2857528B1/fr not_active Expired - Fee Related
-
2004
- 2004-07-08 AU AU2004256561A patent/AU2004256561A1/en not_active Abandoned
- 2004-07-08 DE DE602004023508T patent/DE602004023508D1/de not_active Expired - Lifetime
- 2004-07-08 WO PCT/FR2004/001799 patent/WO2005006308A1/fr not_active Ceased
- 2004-07-08 EP EP04767631A patent/EP1642264B1/de not_active Expired - Lifetime
- 2004-07-08 US US10/563,624 patent/US20070038451A1/en not_active Abandoned
- 2004-07-08 CA CA2531496A patent/CA2531496C/fr not_active Expired - Fee Related
- 2004-07-08 AT AT04767631T patent/ATE445215T1/de not_active IP Right Cessation
Also Published As
| Publication number | Publication date |
|---|---|
| EP1642264A1 (de) | 2006-04-05 |
| AU2004256561A1 (en) | 2005-01-20 |
| CA2531496A1 (fr) | 2005-01-20 |
| CA2531496C (fr) | 2014-05-06 |
| EP1642264B1 (de) | 2009-10-07 |
| FR2857528B1 (fr) | 2006-01-06 |
| WO2005006308A1 (fr) | 2005-01-20 |
| US20070038451A1 (en) | 2007-02-15 |
| DE602004023508D1 (de) | 2009-11-19 |
| FR2857528A1 (fr) | 2005-01-14 |
Similar Documents
| Publication | Publication Date | Title |
|---|---|---|
| US11676585B1 (en) | Hybrid decoding using hardware and software for automatic speech recognition systems | |
| US8972243B1 (en) | Parse information encoding in a finite state transducer | |
| US10140973B1 (en) | Text-to-speech processing using previously speech processed data | |
| CN105118501B (zh) | 语音识别的方法及系统 | |
| DE69827667D1 (de) | Vokoder basierter spracherkenner | |
| DE60111329D1 (de) | Anpassung des phonetischen Kontextes zur Verbesserung der Spracherkennung | |
| ATE445215T1 (de) | Spracherkennung für grosse dynamische vokabulare | |
| ATE457510T1 (de) | Spracherkennungssystem mit riesigem vokabular | |
| US11705116B2 (en) | Language and grammar model adaptation using model weight data | |
| CN103021408B (zh) | 一种发音稳定段辅助的语音识别优化解码方法及装置 | |
| KR20170134115A (ko) | Wfst의 최적화를 이용하는 음성 인식 장치 및 음성 인식 방법 | |
| DE602005009091D1 (de) | Erzeugen einer Spracherkennungsgrammatik für alphanumerische Ausdrücke | |
| WO2007034478A3 (en) | System and method for correcting speech | |
| ATE263997T1 (de) | Zwischen-wörter verbindung phonemische modelle | |
| TW200627376A (en) | Method and apparatus for constructing Chinese new words by the input voice | |
| JP4581549B2 (ja) | 音声処理装置および方法、記録媒体、並びにプログラム | |
| CN101751924A (zh) | 嵌入式平台大词汇量语音命令词的识别方法 | |
| Li et al. | Adapting grapheme-to-phoneme conversion for name recognition | |
| WO2001026092A3 (en) | Attribute-based word modeling | |
| Eide | Automatic modeling of pronunciation variations | |
| Tolba et al. | Speech recognition by intelligent machines | |
| JP4972660B2 (ja) | 音声学習装置及びプログラム | |
| KR101578766B1 (ko) | 음성 인식용 탐색 공간 생성 장치 및 방법 | |
| Chen et al. | Large vocabulary word recognition based on tree-trellis search | |
| Rotovnik et al. | A comparison of HTK, ISIP and julius in slovenian large vocabulary continuous speech recognition |
Legal Events
| Date | Code | Title | Description |
|---|---|---|---|
| RER | Ceased as to paragraph 5 lit. 3 law introducing patent treaties |