ATE403213T1 - System und verfahren zur automatischen spracherkennung - Google Patents

System und Verfahren zur automatischen Spracherkennung (Automatic speech recognition system and method)

Info

Publication number: ATE403213T1
Authority: AT; Austria
Prior art keywords: hypothesis; computing; confidence measure; recognition; differential
Prior art date: 2004-12-28

Application number

AT04805044T

Other languages

English (en)

Inventor

Daniele Colibro

Claudio Vair

Original Assignee

Loquendo Spa

Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)

2004-12-28

Filing date

2004-12-28

Publication date

2008-08-15

2004-12-28 Application filed by Loquendo Spa

2008-08-15 Application granted

2008-08-15 Publication of ATE403213T1

Classifications

- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/08—Speech classification or search
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/08—Speech classification or search
- G10L15/14—Speech classification or search using statistical models, e.g. Hidden Markov Models [HMMs]
- G10L15/142—Hidden Markov Models [HMMs]
- G10L15/144—Training of HMMs

Landscapes

Engineering & Computer Science (AREA)
Computational Linguistics (AREA)
Health & Medical Sciences (AREA)
Audiology, Speech & Language Pathology (AREA)
Human Computer Interaction (AREA)
Physics & Mathematics (AREA)
Acoustics & Sound (AREA)
Multimedia (AREA)
Machine Translation (AREA)
Document Processing Apparatus (AREA)
Information Retrieval, Db Structures And Fs Structures Therefor (AREA)

AT04805044T 2004-12-28 2004-12-28 System und verfahren zur automatischen spracherkennung ATE403213T1 (de)

Applications Claiming Priority (1)

Application Number	Priority Date	Filing Date	Title
PCT/EP2004/053718 WO2006069600A1 (en)	2004-12-28	2004-12-28	Automatic speech recognition system and method

Publications (1)

Publication Number	Publication Date
ATE403213T1 (de)	2008-08-15

Family

ID=34959840

Family Applications (1)

Application Number	Priority Date	Filing Date	Title
AT04805044T ATE403213T1 (de)	2004-12-28	2004-12-28	System und verfahren zur automatischen spracherkennung

Country Status (7)

Country	Link
US (1)	US7912713B2 (de)
EP (1)	EP1831870B1 (de)
AT (1)	ATE403213T1 (de)
CA (1)	CA2592861C (de)
DE (1)	DE602004015518D1 (de)
ES (1)	ES2311872T3 (de)
WO (1)	WO2006069600A1 (de)

Families Citing this family (41)

* Cited by examiner, † Cited by third party
Publication number	Priority date	Publication date	Assignee	Title
US7844464B2 (en) *	2005-07-22	2010-11-30	Multimodal Technologies, Inc.	Content-based audio playback emphasis
EP1851756B1 (de) *	2005-02-17	2008-07-02	Loquendo S.p.A.	Verfahren und system zum automatischen bereitstellen linguistischer formulierungen, die ausserhalb einer erkennungsdomäne eines automatischen spracherkennungssystems liegen
US7809566B2 (en) *	2005-10-14	2010-10-05	Nuance Communications, Inc.	One-step repair of misrecognized recognition strings
US20070124147A1 (en) *	2005-11-30	2007-05-31	International Business Machines Corporation	Methods and apparatus for use in speech recognition systems for identifying unknown words and for adding previously unknown words to vocabularies and grammars of speech recognition systems
US8126881B1 (en)	2007-12-12	2012-02-28	Vast.com, Inc.	Predictive conversion systems and methods
CN101465123B (zh) *	2007-12-20	2011-07-06	株式会社东芝	说话人认证的验证方法和装置以及说话人认证系统
US9280969B2 (en) *	2009-06-10	2016-03-08	Microsoft Technology Licensing, Llc	Model training for automatic speech recognition from imperfect transcription data
KR20110010939A (ko) *	2009-07-27	2011-02-08	삼성전자주식회사	휴대용 단말기에서 음성 인식 성능을 향상시키기 위한 장치 및 방법
US8983845B1 (en) *	2010-03-26	2015-03-17	Google Inc.	Third-party audio subsystem enhancement
US8639508B2 (en) *	2011-02-14	2014-01-28	General Motors Llc	User-specific confidence thresholds for speech recognition
US20130080165A1 (en) *	2011-09-24	2013-03-28	Microsoft Corporation	Model Based Online Normalization of Feature Distribution for Noise Robust Speech Recognition
KR20130059476A (ko) *	2011-11-28	2013-06-07	한국전자통신연구원	음성 인식용 탐색 공간 생성 방법 및 장치
US8990080B2 (en)	2012-01-27	2015-03-24	Microsoft Corporation	Techniques to normalize names efficiently for name-based speech recognition grammars
US9269349B2 (en) *	2012-05-24	2016-02-23	Nuance Communications, Inc.	Automatic methods to predict error rates and detect performance degradation
US9336771B2 (en) *	2012-11-01	2016-05-10	Google Inc.	Speech recognition using non-parametric models
US9697827B1 (en) *	2012-12-11	2017-07-04	Amazon Technologies, Inc.	Error reduction in speech processing
WO2014116199A1 (en) *	2013-01-22	2014-07-31	Interactive Intelligence, Inc.	False alarm reduction in speech recognition systems using contextual information
US9104718B1 (en)	2013-03-07	2015-08-11	Vast.com, Inc.	Systems, methods, and devices for measuring similarity of and generating recommendations for unique items
US10007946B1 (en)	2013-03-07	2018-06-26	Vast.com, Inc.	Systems, methods, and devices for measuring similarity of and generating recommendations for unique items
US9465873B1 (en)	2013-03-07	2016-10-11	Vast.com, Inc.	Systems, methods, and devices for identifying and presenting identifications of significant attributes of unique items
US9830635B1 (en)	2013-03-13	2017-11-28	Vast.com, Inc.	Systems, methods, and devices for determining and displaying market relative position of unique items
US9159317B2 (en) *	2013-06-14	2015-10-13	Mitsubishi Electric Research Laboratories, Inc.	System and method for recognizing speech
US10438581B2 (en) *	2013-07-31	2019-10-08	Google Llc	Speech recognition using neural networks
US10867597B2 (en) *	2013-09-02	2020-12-15	Microsoft Technology Licensing, Llc	Assignment of semantic labels to a sequence of words using neural network architectures
CN103530528A (zh) *	2013-10-29	2014-01-22	华为技术有限公司	评估方法及装置
US9613619B2 (en) *	2013-10-30	2017-04-04	Genesys Telecommunications Laboratories, Inc.	Predicting recognition quality of a phrase in automatic speech recognition systems
US10127596B1 (en)	2013-12-10	2018-11-13	Vast.com, Inc.	Systems, methods, and devices for generating recommendations of unique items
GB2523353B (en) *	2014-02-21	2017-03-01	Jaguar Land Rover Ltd	System for use in a vehicle
US10127901B2 (en)	2014-06-13	2018-11-13	Microsoft Technology Licensing, Llc	Hyper-structure recurrent neural networks for text-to-speech
JP6461660B2 (ja) *	2015-03-19	2019-01-30	株式会社東芝	検出装置、検出方法およびプログラム
KR102413692B1 (ko) *	2015-07-24	2022-06-27	삼성전자주식회사	음성 인식을 위한 음향 점수 계산 장치 및 방법, 음성 인식 장치 및 방법, 전자 장치
KR102434604B1 (ko) *	2016-01-05	2022-08-23	한국전자통신연구원	개인화된 음성 인식을 수행하기 위한 음성 인식 단말, 음성 인식 서버 및 음성 인식 방법
US10176799B2 (en) *	2016-02-02	2019-01-08	Mitsubishi Electric Research Laboratories, Inc.	Method and system for training language models to reduce recognition errors
JP6495850B2 (ja) *	2016-03-14	2019-04-03	株式会社東芝	情報処理装置、情報処理方法、プログラムおよび認識システム
CN109496334B (zh) *	2016-08-09	2022-03-11	华为技术有限公司	用于评估语音质量的设备和方法
US10403268B2 (en) *	2016-09-08	2019-09-03	Intel IP Corporation	Method and system of automatic speech recognition using posterior confidence scores
US10607601B2 (en) *	2017-05-11	2020-03-31	International Business Machines Corporation	Speech recognition by selecting and refining hot words
US10268704B1 (en)	2017-10-12	2019-04-23	Vast.com, Inc.	Partitioned distributed database systems, devices, and methods
WO2020231188A1 (ko) *	2019-05-13	2020-11-19	삼성전자주식회사	검증 뉴럴 네트워크를 이용한 분류 결과 검증 방법, 분류 결과 학습 방법 및 상기 방법을 수행하는 컴퓨팅 장치
WO2020231151A1 (en)	2019-05-16	2020-11-19	Samsung Electronics Co., Ltd.	Electronic device and method of controlling thereof
US12106358B1 (en)	2020-07-24	2024-10-01	Vast.com, Inc.	Systems, methods, and devices for unified e-commerce platforms for unique items

Family Cites Families (4)

* Cited by examiner, † Cited by third party
Publication number	Priority date	Publication date	Assignee	Title
US5710866A (en) *	1995-05-26	1998-01-20	Microsoft Corporation	System and method for speech recognition using dynamically adjusted confidence measure
DE19842405A1 (de) *	1998-09-16	2000-03-23	Philips Corp Intellectual Pty	Spracherkennungsverfahren mit Konfidenzmaßbewertung
US6539353B1 (en) *	1999-10-12	2003-03-25	Microsoft Corporation	Confidence measures using sub-word-dependent weighting of sub-word confidence scores for robust speech recognition
ITTO20020170A1 (it)	2002-02-28	2003-08-28	Loquendo Spa	Metodo per velocizzare l'esecuzione di reti neurali per il riconoscimento della voce e relativo dispositivo di riconoscimento vocale.

2004
- 2004-12-28 AT: application AT04805044T, patent ATE403213T1 (de); status: not active, IP Right Cessation
- 2004-12-28 US: application US11/794,356, patent US7912713B2 (en); status: not active, Expired - Fee Related
- 2004-12-28 CA: application CA2592861A, patent CA2592861C (en); status: not active, Expired - Fee Related
- 2004-12-28 EP: application EP04805044A, patent EP1831870B1 (de); status: not active, Expired - Lifetime
- 2004-12-28 ES: application ES04805044T, patent ES2311872T3 (es); status: not active, Expired - Lifetime
- 2004-12-28 DE: application DE602004015518T, patent DE602004015518D1 (de); status: not active, Expired - Lifetime
- 2004-12-28 WO: application PCT/EP2004/053718, patent WO2006069600A1 (en); status: not active, Ceased

Also Published As

Publication number	Publication date
CA2592861C (en)	2015-10-27
EP1831870A1 (de)	2007-09-12
ES2311872T3 (es)	2009-02-16
DE602004015518D1 (de)	2008-09-11
EP1831870B1 (de)	2008-07-30
US7912713B2 (en)	2011-03-22
CA2592861A1 (en)	2006-07-06
WO2006069600A8 (en)	2007-03-29
US20080114595A1 (en)	2008-05-15
WO2006069600A1 (en)	2006-07-06

Similar Documents

Publication	Publication Date	Title
ATE403213T1 (de)	2008-08-15	System und verfahren zur automatischen spracherkennung
Cucchiarini et al.	2000	Different aspects of expert pronunciation quality ratings and their relation to scores produced by speech recognition algorithms
Hu et al.	2010	A tandem algorithm for pitch estimation and voiced speech segregation
US9489864B2 (en)	2016-11-08	Systems and methods for an automated pronunciation assessment system for similar vowel pairs
TW200638337A (en)	2006-11-01	Using a spoken utterance for disambiguation of spelling inputs into a speech recognition system
DE602005018552D1 (de)	2010-02-04	Verfahren zum anpassen eines neuronalen netzwerks einer automatischen spracherkennungseinrichtung
ATE403928T1 (de)	2008-08-15	Sprachdialogkontrolle basierend auf signalvorverarbeitung
WO2009025356A1 (ja)	2009-02-26	音声認識装置および音声認識方法
Mustafa et al.	2015	Exploring the influence of general and specific factors on the recognition accuracy of an ASR system for dysarthric speaker
ATE457510T1 (de)	2010-02-15	Spracherkennungssystem mit riesigem vokabular
Ullmann et al.	2015	Objective intelligibility assessment of text-to-speech systems through utterance verification
DE602004004572D1 (de)	2007-03-22	Verfolgen von Vokaltraktresonanzen unter Verwendung einer zielgeführten Einschränkung
ATE400047T1 (de)	2008-07-15	Verfahren und system zum automatischen bereitstellen linguistischer formulierungen, die ausserhalb einer erkennungsdomäne eines automatischen spracherkennungssystems liegen
Sinclair et al.	2014	A semi-markov model for speech segmentation with an utterance-break prior
KR102274751B1 (ko)	2021-07-08	평가정보를 제공하는 사용자 맞춤형 발음 평가 시스템
Wahidah et al.	2012	Makhraj recognition using speech processing
Rose	2015	Forensic voice comparison with monophthongal formant trajectories-a likelihood ratio-based discrimination of “schwa” vowel acoustics in a close social group of young Australian females
Rajpal et al.	2016	Native Language Identification Using Spectral and Source-Based Features.
Li	2015	Tone sandhi and tonal coarticulation in Fuzhou Min
Hammer et al.	2013	Balancing word lists in speech audiometry through large spoken language corpora
Seaward et al.	2018	Improving the accuracy of automated cleft speech evaluation
Córdoba et al.	2004	Language identification techniques based on full recognition in an air traffic control task
Bhat et al.	2015	Automatic assessment of articulation errors in hindi speech at phone level
Cho et al.	2019	Automatic Detection of Prosodic Focus in American English.
DE602004014416D1 (de)	2008-07-24	Spracherkennung durch kontextuelle modellierung der spracheinheiten

Legal Events

Date	Code	Title	Description
2009-01-15	RER	Ceased as to paragraph 5 lit. 3 law introducing patent treaties