ATE403213T1 - System und verfahren zur automatischen spracherkennung - Google Patents

System und verfahren zur automatischen spracherkennung

Info

Publication number
ATE403213T1
ATE403213T1 AT04805044T AT04805044T ATE403213T1 AT E403213 T1 ATE403213 T1 AT E403213T1 AT 04805044 T AT04805044 T AT 04805044T AT 04805044 T AT04805044 T AT 04805044T AT E403213 T1 ATE403213 T1 AT E403213T1
Authority
AT
Austria
Prior art keywords
hypothesis
computing
confidence measure
recognition
differential
Prior art date
Application number
AT04805044T
Other languages
English (en)
Inventor
Daniele Colibro
Claudio Vair
Original Assignee
Loquendo Spa
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Loquendo Spa filed Critical Loquendo Spa
Application granted granted Critical
Publication of ATE403213T1 publication Critical patent/ATE403213T1/de

Links

Classifications

    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00Speech recognition
    • G10L15/08Speech classification or search
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00Speech recognition
    • G10L15/08Speech classification or search
    • G10L15/14Speech classification or search using statistical models, e.g. Hidden Markov Models [HMMs]
    • G10L15/142Hidden Markov Models [HMMs]
    • G10L15/144Training of HMMs

Landscapes

  • Engineering & Computer Science (AREA)
  • Computational Linguistics (AREA)
  • Health & Medical Sciences (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Human Computer Interaction (AREA)
  • Physics & Mathematics (AREA)
  • Acoustics & Sound (AREA)
  • Multimedia (AREA)
  • Machine Translation (AREA)
  • Document Processing Apparatus (AREA)
  • Information Retrieval, Db Structures And Fs Structures Therefor (AREA)
AT04805044T 2004-12-28 2004-12-28 System und verfahren zur automatischen spracherkennung ATE403213T1 (de)

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
PCT/EP2004/053718 WO2006069600A1 (en) 2004-12-28 2004-12-28 Automatic speech recognition system and method

Publications (1)

Publication Number Publication Date
ATE403213T1 true ATE403213T1 (de) 2008-08-15

Family

ID=34959840

Family Applications (1)

Application Number Title Priority Date Filing Date
AT04805044T ATE403213T1 (de) 2004-12-28 2004-12-28 System und verfahren zur automatischen spracherkennung

Country Status (7)

Country Link
US (1) US7912713B2 (de)
EP (1) EP1831870B1 (de)
AT (1) ATE403213T1 (de)
CA (1) CA2592861C (de)
DE (1) DE602004015518D1 (de)
ES (1) ES2311872T3 (de)
WO (1) WO2006069600A1 (de)

Families Citing this family (41)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US7844464B2 (en) * 2005-07-22 2010-11-30 Multimodal Technologies, Inc. Content-based audio playback emphasis
WO2006087040A1 (en) * 2005-02-17 2006-08-24 Loquendo S.P.A. Method and system for automatically providing linguistic formulations that are outside a recognition domain of an automatic speech recognition system
US7809566B2 (en) * 2005-10-14 2010-10-05 Nuance Communications, Inc. One-step repair of misrecognized recognition strings
US20070124147A1 (en) * 2005-11-30 2007-05-31 International Business Machines Corporation Methods and apparatus for use in speech recognition systems for identifying unknown words and for adding previously unknown words to vocabularies and grammars of speech recognition systems
US8126881B1 (en) 2007-12-12 2012-02-28 Vast.com, Inc. Predictive conversion systems and methods
CN101465123B (zh) * 2007-12-20 2011-07-06 株式会社东芝 说话人认证的验证方法和装置以及说话人认证系统
US9280969B2 (en) * 2009-06-10 2016-03-08 Microsoft Technology Licensing, Llc Model training for automatic speech recognition from imperfect transcription data
KR20110010939A (ko) * 2009-07-27 2011-02-08 삼성전자주식회사 휴대용 단말기에서 음성 인식 성능을 향상시키기 위한 장치 및 방법
US8983845B1 (en) * 2010-03-26 2015-03-17 Google Inc. Third-party audio subsystem enhancement
US8639508B2 (en) * 2011-02-14 2014-01-28 General Motors Llc User-specific confidence thresholds for speech recognition
US20130080165A1 (en) * 2011-09-24 2013-03-28 Microsoft Corporation Model Based Online Normalization of Feature Distribution for Noise Robust Speech Recognition
KR20130059476A (ko) * 2011-11-28 2013-06-07 한국전자통신연구원 음성 인식용 탐색 공간 생성 방법 및 장치
US8990080B2 (en) 2012-01-27 2015-03-24 Microsoft Corporation Techniques to normalize names efficiently for name-based speech recognition grammars
US9269349B2 (en) * 2012-05-24 2016-02-23 Nuance Communications, Inc. Automatic methods to predict error rates and detect performance degradation
US9336771B2 (en) * 2012-11-01 2016-05-10 Google Inc. Speech recognition using non-parametric models
US9697827B1 (en) * 2012-12-11 2017-07-04 Amazon Technologies, Inc. Error reduction in speech processing
JP6199994B2 (ja) * 2013-01-22 2017-09-20 インタラクティブ・インテリジェンス・インコーポレイテッド コンテキスト情報を使用した音声認識システムにおける誤警報低減
US9465873B1 (en) 2013-03-07 2016-10-11 Vast.com, Inc. Systems, methods, and devices for identifying and presenting identifications of significant attributes of unique items
US9104718B1 (en) 2013-03-07 2015-08-11 Vast.com, Inc. Systems, methods, and devices for measuring similarity of and generating recommendations for unique items
US10007946B1 (en) 2013-03-07 2018-06-26 Vast.com, Inc. Systems, methods, and devices for measuring similarity of and generating recommendations for unique items
US9830635B1 (en) 2013-03-13 2017-11-28 Vast.com, Inc. Systems, methods, and devices for determining and displaying market relative position of unique items
US9159317B2 (en) * 2013-06-14 2015-10-13 Mitsubishi Electric Research Laboratories, Inc. System and method for recognizing speech
US10438581B2 (en) * 2013-07-31 2019-10-08 Google Llc Speech recognition using neural networks
US10867597B2 (en) * 2013-09-02 2020-12-15 Microsoft Technology Licensing, Llc Assignment of semantic labels to a sequence of words using neural network architectures
CN103530528A (zh) * 2013-10-29 2014-01-22 华为技术有限公司 评估方法及装置
US9613619B2 (en) * 2013-10-30 2017-04-04 Genesys Telecommunications Laboratories, Inc. Predicting recognition quality of a phrase in automatic speech recognition systems
US10127596B1 (en) 2013-12-10 2018-11-13 Vast.com, Inc. Systems, methods, and devices for generating recommendations of unique items
GB2523353B (en) * 2014-02-21 2017-03-01 Jaguar Land Rover Ltd System for use in a vehicle
US10127901B2 (en) 2014-06-13 2018-11-13 Microsoft Technology Licensing, Llc Hyper-structure recurrent neural networks for text-to-speech
JP6461660B2 (ja) * 2015-03-19 2019-01-30 株式会社東芝 検出装置、検出方法およびプログラム
KR102413692B1 (ko) * 2015-07-24 2022-06-27 삼성전자주식회사 음성 인식을 위한 음향 점수 계산 장치 및 방법, 음성 인식 장치 및 방법, 전자 장치
KR102434604B1 (ko) * 2016-01-05 2022-08-23 한국전자통신연구원 개인화된 음성 인식을 수행하기 위한 음성 인식 단말, 음성 인식 서버 및 음성 인식 방법
US10176799B2 (en) * 2016-02-02 2019-01-08 Mitsubishi Electric Research Laboratories, Inc. Method and system for training language models to reduce recognition errors
JP6495850B2 (ja) * 2016-03-14 2019-04-03 株式会社東芝 情報処理装置、情報処理方法、プログラムおよび認識システム
CN109496334B (zh) * 2016-08-09 2022-03-11 华为技术有限公司 用于评估语音质量的设备和方法
US10403268B2 (en) * 2016-09-08 2019-09-03 Intel IP Corporation Method and system of automatic speech recognition using posterior confidence scores
US10607601B2 (en) * 2017-05-11 2020-03-31 International Business Machines Corporation Speech recognition by selecting and refining hot words
US10268704B1 (en) 2017-10-12 2019-04-23 Vast.com, Inc. Partitioned distributed database systems, devices, and methods
WO2020231188A1 (ko) * 2019-05-13 2020-11-19 삼성전자주식회사 검증 뉴럴 네트워크를 이용한 분류 결과 검증 방법, 분류 결과 학습 방법 및 상기 방법을 수행하는 컴퓨팅 장치
US11551671B2 (en) 2019-05-16 2023-01-10 Samsung Electronics Co., Ltd. Electronic device and method of controlling thereof
US12106358B1 (en) 2020-07-24 2024-10-01 Vast.com, Inc. Systems, methods, and devices for unified e-commerce platforms for unique items

Family Cites Families (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US5710866A (en) * 1995-05-26 1998-01-20 Microsoft Corporation System and method for speech recognition using dynamically adjusted confidence measure
DE19842405A1 (de) * 1998-09-16 2000-03-23 Philips Corp Intellectual Pty Spracherkennungsverfahren mit Konfidenzmaßbewertung
US6539353B1 (en) * 1999-10-12 2003-03-25 Microsoft Corporation Confidence measures using sub-word-dependent weighting of sub-word confidence scores for robust speech recognition
ITTO20020170A1 (it) 2002-02-28 2003-08-28 Loquendo Spa Metodo per velocizzare l'esecuzione di reti neurali per il riconoscimento della voce e relativo dispositivo di riconoscimento vocale.

Also Published As

Publication number Publication date
WO2006069600A1 (en) 2006-07-06
DE602004015518D1 (de) 2008-09-11
WO2006069600A8 (en) 2007-03-29
CA2592861C (en) 2015-10-27
US7912713B2 (en) 2011-03-22
ES2311872T3 (es) 2009-02-16
EP1831870B1 (de) 2008-07-30
CA2592861A1 (en) 2006-07-06
US20080114595A1 (en) 2008-05-15
EP1831870A1 (de) 2007-09-12

Similar Documents

Publication Publication Date Title
ATE403213T1 (de) System und verfahren zur automatischen spracherkennung
Cucchiarini et al. Different aspects of expert pronunciation quality ratings and their relation to scores produced by speech recognition algorithms
Hu et al. A tandem algorithm for pitch estimation and voiced speech segregation
US9489864B2 (en) Systems and methods for an automated pronunciation assessment system for similar vowel pairs
TW200638337A (en) Using a spoken utterance for disambiguation of spelling inputs into a speech recognition system
DE602005018552D1 (de) Verfahren zum anpassen eines neuronalen netzwerks einer automatischen spracherkennungseinrichtung
Mustafa et al. Exploring the influence of general and specific factors on the recognition accuracy of an ASR system for dysarthric speaker
Keshet Automatic speech recognition: A primer for speech-language pathology researchers
ATE457510T1 (de) Spracherkennungssystem mit riesigem vokabular
WO2007015869A3 (en) Spoken language proficiency assessment by computer
Barker et al. Speech fragment decoding techniques for simultaneous speaker identification and speech recognition
Ullmann et al. Objective intelligibility assessment of text-to-speech systems through utterance verification
Xue et al. Measuring the intelligibility of dysarthric speech through automatic speech recognition in a pluricentric language
DE602004004572D1 (de) Verfolgen von Vokaltraktresonanzen unter Verwendung einer zielgeführten Einschränkung
ATE400047T1 (de) Verfahren und system zum automatischen bereitstellen linguistischer formulierungen, die ausserhalb einer erkennungsdomäne eines automatischen spracherkennungssystems liegen
Sinclair et al. A semi-markov model for speech segmentation with an utterance-break prior
KR102274751B1 (ko) 평가정보를 제공하는 사용자 맞춤형 발음 평가 시스템
Wahidah et al. Makhraj recognition using speech processing
Rose Forensic voice comparison with monophthongal formant trajectories-a likelihood ratio-based discrimination of “schwa” vowel acoustics in a close social group of young Australian females
Rajpal et al. Native Language Identification Using Spectral and Source-Based Features.
Li Tone sandhi and tonal coarticulation in Fuzhou Min
Hammer et al. Balancing word lists in speech audiometry through large spoken language corpora
Córdoba et al. Language identification techniques based on full recognition in an air traffic control task
Seaward et al. Improving the accuracy of automated cleft speech evaluation
Bhat et al. Automatic assessment of articulation errors in hindi speech at phone level

Legal Events

Date Code Title Description
RER Ceased as to paragraph 5 lit. 3 law introducing patent treaties