ATE403213T1 - System und verfahren zur automatischen spracherkennung - Google Patents
System und verfahren zur automatischen spracherkennungInfo
- Publication number
- ATE403213T1 ATE403213T1 AT04805044T AT04805044T ATE403213T1 AT E403213 T1 ATE403213 T1 AT E403213T1 AT 04805044 T AT04805044 T AT 04805044T AT 04805044 T AT04805044 T AT 04805044T AT E403213 T1 ATE403213 T1 AT E403213T1
- Authority
- AT
- Austria
- Prior art keywords
- hypothesis
- computing
- confidence measure
- recognition
- differential
- Prior art date
Links
- 238000000034 method Methods 0.000 title abstract 2
- 238000012935 Averaging Methods 0.000 abstract 1
- 230000001186 cumulative effect Effects 0.000 abstract 1
- 238000009826 distribution Methods 0.000 abstract 1
- 238000005315 distribution function Methods 0.000 abstract 1
Classifications
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/08—Speech classification or search
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/08—Speech classification or search
- G10L15/14—Speech classification or search using statistical models, e.g. Hidden Markov Models [HMMs]
- G10L15/142—Hidden Markov Models [HMMs]
- G10L15/144—Training of HMMs
Landscapes
- Engineering & Computer Science (AREA)
- Computational Linguistics (AREA)
- Health & Medical Sciences (AREA)
- Audiology, Speech & Language Pathology (AREA)
- Human Computer Interaction (AREA)
- Physics & Mathematics (AREA)
- Acoustics & Sound (AREA)
- Multimedia (AREA)
- Machine Translation (AREA)
- Document Processing Apparatus (AREA)
- Information Retrieval, Db Structures And Fs Structures Therefor (AREA)
Applications Claiming Priority (1)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| PCT/EP2004/053718 WO2006069600A1 (en) | 2004-12-28 | 2004-12-28 | Automatic speech recognition system and method |
Publications (1)
| Publication Number | Publication Date |
|---|---|
| ATE403213T1 true ATE403213T1 (de) | 2008-08-15 |
Family
ID=34959840
Family Applications (1)
| Application Number | Title | Priority Date | Filing Date |
|---|---|---|---|
| AT04805044T ATE403213T1 (de) | 2004-12-28 | 2004-12-28 | System und verfahren zur automatischen spracherkennung |
Country Status (7)
| Country | Link |
|---|---|
| US (1) | US7912713B2 (de) |
| EP (1) | EP1831870B1 (de) |
| AT (1) | ATE403213T1 (de) |
| CA (1) | CA2592861C (de) |
| DE (1) | DE602004015518D1 (de) |
| ES (1) | ES2311872T3 (de) |
| WO (1) | WO2006069600A1 (de) |
Families Citing this family (41)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| US7844464B2 (en) * | 2005-07-22 | 2010-11-30 | Multimodal Technologies, Inc. | Content-based audio playback emphasis |
| WO2006087040A1 (en) * | 2005-02-17 | 2006-08-24 | Loquendo S.P.A. | Method and system for automatically providing linguistic formulations that are outside a recognition domain of an automatic speech recognition system |
| US7809566B2 (en) * | 2005-10-14 | 2010-10-05 | Nuance Communications, Inc. | One-step repair of misrecognized recognition strings |
| US20070124147A1 (en) * | 2005-11-30 | 2007-05-31 | International Business Machines Corporation | Methods and apparatus for use in speech recognition systems for identifying unknown words and for adding previously unknown words to vocabularies and grammars of speech recognition systems |
| US8126881B1 (en) | 2007-12-12 | 2012-02-28 | Vast.com, Inc. | Predictive conversion systems and methods |
| CN101465123B (zh) * | 2007-12-20 | 2011-07-06 | 株式会社东芝 | 说话人认证的验证方法和装置以及说话人认证系统 |
| US9280969B2 (en) * | 2009-06-10 | 2016-03-08 | Microsoft Technology Licensing, Llc | Model training for automatic speech recognition from imperfect transcription data |
| KR20110010939A (ko) * | 2009-07-27 | 2011-02-08 | 삼성전자주식회사 | 휴대용 단말기에서 음성 인식 성능을 향상시키기 위한 장치 및 방법 |
| US8983845B1 (en) * | 2010-03-26 | 2015-03-17 | Google Inc. | Third-party audio subsystem enhancement |
| US8639508B2 (en) * | 2011-02-14 | 2014-01-28 | General Motors Llc | User-specific confidence thresholds for speech recognition |
| US20130080165A1 (en) * | 2011-09-24 | 2013-03-28 | Microsoft Corporation | Model Based Online Normalization of Feature Distribution for Noise Robust Speech Recognition |
| KR20130059476A (ko) * | 2011-11-28 | 2013-06-07 | 한국전자통신연구원 | 음성 인식용 탐색 공간 생성 방법 및 장치 |
| US8990080B2 (en) | 2012-01-27 | 2015-03-24 | Microsoft Corporation | Techniques to normalize names efficiently for name-based speech recognition grammars |
| US9269349B2 (en) * | 2012-05-24 | 2016-02-23 | Nuance Communications, Inc. | Automatic methods to predict error rates and detect performance degradation |
| US9336771B2 (en) * | 2012-11-01 | 2016-05-10 | Google Inc. | Speech recognition using non-parametric models |
| US9697827B1 (en) * | 2012-12-11 | 2017-07-04 | Amazon Technologies, Inc. | Error reduction in speech processing |
| JP6199994B2 (ja) * | 2013-01-22 | 2017-09-20 | インタラクティブ・インテリジェンス・インコーポレイテッド | コンテキスト情報を使用した音声認識システムにおける誤警報低減 |
| US9465873B1 (en) | 2013-03-07 | 2016-10-11 | Vast.com, Inc. | Systems, methods, and devices for identifying and presenting identifications of significant attributes of unique items |
| US9104718B1 (en) | 2013-03-07 | 2015-08-11 | Vast.com, Inc. | Systems, methods, and devices for measuring similarity of and generating recommendations for unique items |
| US10007946B1 (en) | 2013-03-07 | 2018-06-26 | Vast.com, Inc. | Systems, methods, and devices for measuring similarity of and generating recommendations for unique items |
| US9830635B1 (en) | 2013-03-13 | 2017-11-28 | Vast.com, Inc. | Systems, methods, and devices for determining and displaying market relative position of unique items |
| US9159317B2 (en) * | 2013-06-14 | 2015-10-13 | Mitsubishi Electric Research Laboratories, Inc. | System and method for recognizing speech |
| US10438581B2 (en) * | 2013-07-31 | 2019-10-08 | Google Llc | Speech recognition using neural networks |
| US10867597B2 (en) * | 2013-09-02 | 2020-12-15 | Microsoft Technology Licensing, Llc | Assignment of semantic labels to a sequence of words using neural network architectures |
| CN103530528A (zh) * | 2013-10-29 | 2014-01-22 | 华为技术有限公司 | 评估方法及装置 |
| US9613619B2 (en) * | 2013-10-30 | 2017-04-04 | Genesys Telecommunications Laboratories, Inc. | Predicting recognition quality of a phrase in automatic speech recognition systems |
| US10127596B1 (en) | 2013-12-10 | 2018-11-13 | Vast.com, Inc. | Systems, methods, and devices for generating recommendations of unique items |
| GB2523353B (en) * | 2014-02-21 | 2017-03-01 | Jaguar Land Rover Ltd | System for use in a vehicle |
| US10127901B2 (en) | 2014-06-13 | 2018-11-13 | Microsoft Technology Licensing, Llc | Hyper-structure recurrent neural networks for text-to-speech |
| JP6461660B2 (ja) * | 2015-03-19 | 2019-01-30 | 株式会社東芝 | 検出装置、検出方法およびプログラム |
| KR102413692B1 (ko) * | 2015-07-24 | 2022-06-27 | 삼성전자주식회사 | 음성 인식을 위한 음향 점수 계산 장치 및 방법, 음성 인식 장치 및 방법, 전자 장치 |
| KR102434604B1 (ko) * | 2016-01-05 | 2022-08-23 | 한국전자통신연구원 | 개인화된 음성 인식을 수행하기 위한 음성 인식 단말, 음성 인식 서버 및 음성 인식 방법 |
| US10176799B2 (en) * | 2016-02-02 | 2019-01-08 | Mitsubishi Electric Research Laboratories, Inc. | Method and system for training language models to reduce recognition errors |
| JP6495850B2 (ja) * | 2016-03-14 | 2019-04-03 | 株式会社東芝 | 情報処理装置、情報処理方法、プログラムおよび認識システム |
| CN109496334B (zh) * | 2016-08-09 | 2022-03-11 | 华为技术有限公司 | 用于评估语音质量的设备和方法 |
| US10403268B2 (en) * | 2016-09-08 | 2019-09-03 | Intel IP Corporation | Method and system of automatic speech recognition using posterior confidence scores |
| US10607601B2 (en) * | 2017-05-11 | 2020-03-31 | International Business Machines Corporation | Speech recognition by selecting and refining hot words |
| US10268704B1 (en) | 2017-10-12 | 2019-04-23 | Vast.com, Inc. | Partitioned distributed database systems, devices, and methods |
| WO2020231188A1 (ko) * | 2019-05-13 | 2020-11-19 | 삼성전자주식회사 | 검증 뉴럴 네트워크를 이용한 분류 결과 검증 방법, 분류 결과 학습 방법 및 상기 방법을 수행하는 컴퓨팅 장치 |
| US11551671B2 (en) | 2019-05-16 | 2023-01-10 | Samsung Electronics Co., Ltd. | Electronic device and method of controlling thereof |
| US12106358B1 (en) | 2020-07-24 | 2024-10-01 | Vast.com, Inc. | Systems, methods, and devices for unified e-commerce platforms for unique items |
Family Cites Families (4)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| US5710866A (en) * | 1995-05-26 | 1998-01-20 | Microsoft Corporation | System and method for speech recognition using dynamically adjusted confidence measure |
| DE19842405A1 (de) * | 1998-09-16 | 2000-03-23 | Philips Corp Intellectual Pty | Spracherkennungsverfahren mit Konfidenzmaßbewertung |
| US6539353B1 (en) * | 1999-10-12 | 2003-03-25 | Microsoft Corporation | Confidence measures using sub-word-dependent weighting of sub-word confidence scores for robust speech recognition |
| ITTO20020170A1 (it) | 2002-02-28 | 2003-08-28 | Loquendo Spa | Metodo per velocizzare l'esecuzione di reti neurali per il riconoscimento della voce e relativo dispositivo di riconoscimento vocale. |
-
2004
- 2004-12-28 US US11/794,356 patent/US7912713B2/en not_active Expired - Fee Related
- 2004-12-28 ES ES04805044T patent/ES2311872T3/es not_active Expired - Lifetime
- 2004-12-28 AT AT04805044T patent/ATE403213T1/de not_active IP Right Cessation
- 2004-12-28 CA CA2592861A patent/CA2592861C/en not_active Expired - Fee Related
- 2004-12-28 WO PCT/EP2004/053718 patent/WO2006069600A1/en not_active Ceased
- 2004-12-28 EP EP04805044A patent/EP1831870B1/de not_active Expired - Lifetime
- 2004-12-28 DE DE602004015518T patent/DE602004015518D1/de not_active Expired - Lifetime
Also Published As
| Publication number | Publication date |
|---|---|
| WO2006069600A1 (en) | 2006-07-06 |
| DE602004015518D1 (de) | 2008-09-11 |
| WO2006069600A8 (en) | 2007-03-29 |
| CA2592861C (en) | 2015-10-27 |
| US7912713B2 (en) | 2011-03-22 |
| ES2311872T3 (es) | 2009-02-16 |
| EP1831870B1 (de) | 2008-07-30 |
| CA2592861A1 (en) | 2006-07-06 |
| US20080114595A1 (en) | 2008-05-15 |
| EP1831870A1 (de) | 2007-09-12 |
Similar Documents
| Publication | Publication Date | Title |
|---|---|---|
| ATE403213T1 (de) | System und verfahren zur automatischen spracherkennung | |
| Cucchiarini et al. | Different aspects of expert pronunciation quality ratings and their relation to scores produced by speech recognition algorithms | |
| Hu et al. | A tandem algorithm for pitch estimation and voiced speech segregation | |
| US9489864B2 (en) | Systems and methods for an automated pronunciation assessment system for similar vowel pairs | |
| TW200638337A (en) | Using a spoken utterance for disambiguation of spelling inputs into a speech recognition system | |
| DE602005018552D1 (de) | Verfahren zum anpassen eines neuronalen netzwerks einer automatischen spracherkennungseinrichtung | |
| Mustafa et al. | Exploring the influence of general and specific factors on the recognition accuracy of an ASR system for dysarthric speaker | |
| Keshet | Automatic speech recognition: A primer for speech-language pathology researchers | |
| ATE457510T1 (de) | Spracherkennungssystem mit riesigem vokabular | |
| WO2007015869A3 (en) | Spoken language proficiency assessment by computer | |
| Barker et al. | Speech fragment decoding techniques for simultaneous speaker identification and speech recognition | |
| Ullmann et al. | Objective intelligibility assessment of text-to-speech systems through utterance verification | |
| Xue et al. | Measuring the intelligibility of dysarthric speech through automatic speech recognition in a pluricentric language | |
| DE602004004572D1 (de) | Verfolgen von Vokaltraktresonanzen unter Verwendung einer zielgeführten Einschränkung | |
| ATE400047T1 (de) | Verfahren und system zum automatischen bereitstellen linguistischer formulierungen, die ausserhalb einer erkennungsdomäne eines automatischen spracherkennungssystems liegen | |
| Sinclair et al. | A semi-markov model for speech segmentation with an utterance-break prior | |
| KR102274751B1 (ko) | 평가정보를 제공하는 사용자 맞춤형 발음 평가 시스템 | |
| Wahidah et al. | Makhraj recognition using speech processing | |
| Rose | Forensic voice comparison with monophthongal formant trajectories-a likelihood ratio-based discrimination of “schwa” vowel acoustics in a close social group of young Australian females | |
| Rajpal et al. | Native Language Identification Using Spectral and Source-Based Features. | |
| Li | Tone sandhi and tonal coarticulation in Fuzhou Min | |
| Hammer et al. | Balancing word lists in speech audiometry through large spoken language corpora | |
| Córdoba et al. | Language identification techniques based on full recognition in an air traffic control task | |
| Seaward et al. | Improving the accuracy of automated cleft speech evaluation | |
| Bhat et al. | Automatic assessment of articulation errors in hindi speech at phone level |
Legal Events
| Date | Code | Title | Description |
|---|---|---|---|
| RER | Ceased as to paragraph 5 lit. 3 law introducing patent treaties |