ATE403213T1 - System und verfahren zur automatischen spracherkennung - Google Patents

System und verfahren zur automatischen spracherkennung

Info

Publication number
ATE403213T1
ATE403213T1 AT04805044T AT04805044T ATE403213T1 AT E403213 T1 ATE403213 T1 AT E403213T1 AT 04805044 T AT04805044 T AT 04805044T AT 04805044 T AT04805044 T AT 04805044T AT E403213 T1 ATE403213 T1 AT E403213T1
Authority
AT
Austria
Prior art keywords
hypothesis
computing
confidence measure
recognition
differential
Prior art date
Application number
AT04805044T
Other languages
English (en)
Inventor
Daniele Colibro
Claudio Vair
Original Assignee
Loquendo Spa
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Loquendo Spa filed Critical Loquendo Spa
Application granted granted Critical
Publication of ATE403213T1 publication Critical patent/ATE403213T1/de

Links

Classifications

    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00Speech recognition
    • G10L15/08Speech classification or search
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00Speech recognition
    • G10L15/08Speech classification or search
    • G10L15/14Speech classification or search using statistical models, e.g. Hidden Markov Models [HMMs]
    • G10L15/142Hidden Markov Models [HMMs]
    • G10L15/144Training of HMMs

Landscapes

  • Engineering & Computer Science (AREA)
  • Computational Linguistics (AREA)
  • Health & Medical Sciences (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Human Computer Interaction (AREA)
  • Physics & Mathematics (AREA)
  • Acoustics & Sound (AREA)
  • Multimedia (AREA)
  • Machine Translation (AREA)
  • Document Processing Apparatus (AREA)
  • Information Retrieval, Db Structures And Fs Structures Therefor (AREA)
AT04805044T 2004-12-28 2004-12-28 System und verfahren zur automatischen spracherkennung ATE403213T1 (de)

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
PCT/EP2004/053718 WO2006069600A1 (en) 2004-12-28 2004-12-28 Automatic speech recognition system and method

Publications (1)

Publication Number Publication Date
ATE403213T1 true ATE403213T1 (de) 2008-08-15

Family

ID=34959840

Family Applications (1)

Application Number Title Priority Date Filing Date
AT04805044T ATE403213T1 (de) 2004-12-28 2004-12-28 System und verfahren zur automatischen spracherkennung

Country Status (7)

Country Link
US (1) US7912713B2 (de)
EP (1) EP1831870B1 (de)
AT (1) ATE403213T1 (de)
CA (1) CA2592861C (de)
DE (1) DE602004015518D1 (de)
ES (1) ES2311872T3 (de)
WO (1) WO2006069600A1 (de)

Families Citing this family (41)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US7844464B2 (en) * 2005-07-22 2010-11-30 Multimodal Technologies, Inc. Content-based audio playback emphasis
EP1851756B1 (de) * 2005-02-17 2008-07-02 Loquendo S.p.A. Verfahren und system zum automatischen bereitstellen linguistischer formulierungen, die ausserhalb einer erkennungsdomäne eines automatischen spracherkennungssystems liegen
US7809566B2 (en) * 2005-10-14 2010-10-05 Nuance Communications, Inc. One-step repair of misrecognized recognition strings
US20070124147A1 (en) * 2005-11-30 2007-05-31 International Business Machines Corporation Methods and apparatus for use in speech recognition systems for identifying unknown words and for adding previously unknown words to vocabularies and grammars of speech recognition systems
US8126881B1 (en) 2007-12-12 2012-02-28 Vast.com, Inc. Predictive conversion systems and methods
CN101465123B (zh) * 2007-12-20 2011-07-06 株式会社东芝 说话人认证的验证方法和装置以及说话人认证系统
US9280969B2 (en) * 2009-06-10 2016-03-08 Microsoft Technology Licensing, Llc Model training for automatic speech recognition from imperfect transcription data
KR20110010939A (ko) * 2009-07-27 2011-02-08 삼성전자주식회사 휴대용 단말기에서 음성 인식 성능을 향상시키기 위한 장치 및 방법
US8983845B1 (en) * 2010-03-26 2015-03-17 Google Inc. Third-party audio subsystem enhancement
US8639508B2 (en) * 2011-02-14 2014-01-28 General Motors Llc User-specific confidence thresholds for speech recognition
US20130080165A1 (en) * 2011-09-24 2013-03-28 Microsoft Corporation Model Based Online Normalization of Feature Distribution for Noise Robust Speech Recognition
KR20130059476A (ko) * 2011-11-28 2013-06-07 한국전자통신연구원 음성 인식용 탐색 공간 생성 방법 및 장치
US8990080B2 (en) 2012-01-27 2015-03-24 Microsoft Corporation Techniques to normalize names efficiently for name-based speech recognition grammars
US9269349B2 (en) * 2012-05-24 2016-02-23 Nuance Communications, Inc. Automatic methods to predict error rates and detect performance degradation
US9336771B2 (en) * 2012-11-01 2016-05-10 Google Inc. Speech recognition using non-parametric models
US9697827B1 (en) * 2012-12-11 2017-07-04 Amazon Technologies, Inc. Error reduction in speech processing
WO2014116199A1 (en) * 2013-01-22 2014-07-31 Interactive Intelligence, Inc. False alarm reduction in speech recognition systems using contextual information
US9104718B1 (en) 2013-03-07 2015-08-11 Vast.com, Inc. Systems, methods, and devices for measuring similarity of and generating recommendations for unique items
US10007946B1 (en) 2013-03-07 2018-06-26 Vast.com, Inc. Systems, methods, and devices for measuring similarity of and generating recommendations for unique items
US9465873B1 (en) 2013-03-07 2016-10-11 Vast.com, Inc. Systems, methods, and devices for identifying and presenting identifications of significant attributes of unique items
US9830635B1 (en) 2013-03-13 2017-11-28 Vast.com, Inc. Systems, methods, and devices for determining and displaying market relative position of unique items
US9159317B2 (en) * 2013-06-14 2015-10-13 Mitsubishi Electric Research Laboratories, Inc. System and method for recognizing speech
US10438581B2 (en) * 2013-07-31 2019-10-08 Google Llc Speech recognition using neural networks
US10867597B2 (en) * 2013-09-02 2020-12-15 Microsoft Technology Licensing, Llc Assignment of semantic labels to a sequence of words using neural network architectures
CN103530528A (zh) * 2013-10-29 2014-01-22 华为技术有限公司 评估方法及装置
US9613619B2 (en) * 2013-10-30 2017-04-04 Genesys Telecommunications Laboratories, Inc. Predicting recognition quality of a phrase in automatic speech recognition systems
US10127596B1 (en) 2013-12-10 2018-11-13 Vast.com, Inc. Systems, methods, and devices for generating recommendations of unique items
GB2523353B (en) * 2014-02-21 2017-03-01 Jaguar Land Rover Ltd System for use in a vehicle
US10127901B2 (en) 2014-06-13 2018-11-13 Microsoft Technology Licensing, Llc Hyper-structure recurrent neural networks for text-to-speech
JP6461660B2 (ja) * 2015-03-19 2019-01-30 株式会社東芝 検出装置、検出方法およびプログラム
KR102413692B1 (ko) * 2015-07-24 2022-06-27 삼성전자주식회사 음성 인식을 위한 음향 점수 계산 장치 및 방법, 음성 인식 장치 및 방법, 전자 장치
KR102434604B1 (ko) * 2016-01-05 2022-08-23 한국전자통신연구원 개인화된 음성 인식을 수행하기 위한 음성 인식 단말, 음성 인식 서버 및 음성 인식 방법
US10176799B2 (en) * 2016-02-02 2019-01-08 Mitsubishi Electric Research Laboratories, Inc. Method and system for training language models to reduce recognition errors
JP6495850B2 (ja) * 2016-03-14 2019-04-03 株式会社東芝 情報処理装置、情報処理方法、プログラムおよび認識システム
CN109496334B (zh) * 2016-08-09 2022-03-11 华为技术有限公司 用于评估语音质量的设备和方法
US10403268B2 (en) * 2016-09-08 2019-09-03 Intel IP Corporation Method and system of automatic speech recognition using posterior confidence scores
US10607601B2 (en) * 2017-05-11 2020-03-31 International Business Machines Corporation Speech recognition by selecting and refining hot words
US10268704B1 (en) 2017-10-12 2019-04-23 Vast.com, Inc. Partitioned distributed database systems, devices, and methods
WO2020231188A1 (ko) * 2019-05-13 2020-11-19 삼성전자주식회사 검증 뉴럴 네트워크를 이용한 분류 결과 검증 방법, 분류 결과 학습 방법 및 상기 방법을 수행하는 컴퓨팅 장치
WO2020231151A1 (en) 2019-05-16 2020-11-19 Samsung Electronics Co., Ltd. Electronic device and method of controlling thereof
US12106358B1 (en) 2020-07-24 2024-10-01 Vast.com, Inc. Systems, methods, and devices for unified e-commerce platforms for unique items

Family Cites Families (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US5710866A (en) * 1995-05-26 1998-01-20 Microsoft Corporation System and method for speech recognition using dynamically adjusted confidence measure
DE19842405A1 (de) * 1998-09-16 2000-03-23 Philips Corp Intellectual Pty Spracherkennungsverfahren mit Konfidenzmaßbewertung
US6539353B1 (en) * 1999-10-12 2003-03-25 Microsoft Corporation Confidence measures using sub-word-dependent weighting of sub-word confidence scores for robust speech recognition
ITTO20020170A1 (it) 2002-02-28 2003-08-28 Loquendo Spa Metodo per velocizzare l'esecuzione di reti neurali per il riconoscimento della voce e relativo dispositivo di riconoscimento vocale.

Also Published As

Publication number Publication date
CA2592861C (en) 2015-10-27
EP1831870A1 (de) 2007-09-12
ES2311872T3 (es) 2009-02-16
DE602004015518D1 (de) 2008-09-11
EP1831870B1 (de) 2008-07-30
US7912713B2 (en) 2011-03-22
CA2592861A1 (en) 2006-07-06
WO2006069600A8 (en) 2007-03-29
US20080114595A1 (en) 2008-05-15
WO2006069600A1 (en) 2006-07-06

Similar Documents

Publication Publication Date Title
ATE403213T1 (de) System und verfahren zur automatischen spracherkennung
Cucchiarini et al. Different aspects of expert pronunciation quality ratings and their relation to scores produced by speech recognition algorithms
Hu et al. A tandem algorithm for pitch estimation and voiced speech segregation
US9489864B2 (en) Systems and methods for an automated pronunciation assessment system for similar vowel pairs
TW200638337A (en) Using a spoken utterance for disambiguation of spelling inputs into a speech recognition system
DE602005018552D1 (de) Verfahren zum anpassen eines neuronalen netzwerks einer automatischen spracherkennungseinrichtung
ATE403928T1 (de) Sprachdialogkontrolle basierend auf signalvorverarbeitung
WO2009025356A1 (ja) 音声認識装置および音声認識方法
Mustafa et al. Exploring the influence of general and specific factors on the recognition accuracy of an ASR system for dysarthric speaker
ATE457510T1 (de) Spracherkennungssystem mit riesigem vokabular
Ullmann et al. Objective intelligibility assessment of text-to-speech systems through utterance verification
DE602004004572D1 (de) Verfolgen von Vokaltraktresonanzen unter Verwendung einer zielgeführten Einschränkung
ATE400047T1 (de) Verfahren und system zum automatischen bereitstellen linguistischer formulierungen, die ausserhalb einer erkennungsdomäne eines automatischen spracherkennungssystems liegen
Sinclair et al. A semi-markov model for speech segmentation with an utterance-break prior
KR102274751B1 (ko) 평가정보를 제공하는 사용자 맞춤형 발음 평가 시스템
Wahidah et al. Makhraj recognition using speech processing
Rose Forensic voice comparison with monophthongal formant trajectories-a likelihood ratio-based discrimination of “schwa” vowel acoustics in a close social group of young Australian females
Rajpal et al. Native Language Identification Using Spectral and Source-Based Features.
Li Tone sandhi and tonal coarticulation in Fuzhou Min
Hammer et al. Balancing word lists in speech audiometry through large spoken language corpora
Seaward et al. Improving the accuracy of automated cleft speech evaluation
Córdoba et al. Language identification techniques based on full recognition in an air traffic control task
Bhat et al. Automatic assessment of articulation errors in hindi speech at phone level
Cho et al. Automatic Detection of Prosodic Focus in American English.
DE602004014416D1 (de) Spracherkennung durch kontextuelle modellierung der spracheinheiten

Legal Events

Date Code Title Description
RER Ceased as to paragraph 5 lit. 3 law introducing patent treaties