ATE398323T1 - Diskriminatives trainieren von hidden markov modellen für die erkennung fliessender sprache - Google Patents

Diskriminatives trainieren von hidden markov modellen für die erkennung fliessender sprache

Info

Publication number: ATE398323T1
Authority: AT; Austria
Prior art keywords: segment; correct; incorrect; state sequence; recognition
Prior art date: 2000-04-05

Application number

AT01923898T

Other languages

English (en)

Inventor

Girija Yegnanarayanan

Vladimir Sejnoha

Ramesh Sarukkai

Original Assignee

Lernout & Hauspie Speechprod

Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)

2000-04-05

Filing date

2001-04-03

Publication date

2008-07-15

2001-04-03 Application filed by Lernout & Hauspie Speechprod filed Critical Lernout & Hauspie Speechprod

2008-07-15 Application granted granted Critical

2008-07-15 Publication of ATE398323T1 publication Critical patent/ATE398323T1/de

Links

238000002864 sequence alignment Methods 0.000 abstract 3
238000000034 method Methods 0.000 abstract 1

Classifications

- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/08—Speech classification or search
- G10L15/14—Speech classification or search using statistical models, e.g. Hidden Markov Models [HMMs]
- G10L15/142—Hidden Markov Models [HMMs]
- G10L15/144—Training of HMMs
- G10L15/146—Training of HMMs with insufficient amount of training data, e.g. state sharing, tying, deleted interpolation
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/08—Speech classification or search
- G10L15/14—Speech classification or search using statistical models, e.g. Hidden Markov Models [HMMs]
- G10L15/142—Hidden Markov Models [HMMs]
- G10L15/144—Training of HMMs

Landscapes

Engineering & Computer Science (AREA)
Physics & Mathematics (AREA)
Multimedia (AREA)
Health & Medical Sciences (AREA)
Audiology, Speech & Language Pathology (AREA)
Human Computer Interaction (AREA)
Acoustics & Sound (AREA)
Probability & Statistics with Applications (AREA)
Computational Linguistics (AREA)
Machine Translation (AREA)
Character Discrimination (AREA)
Image Analysis (AREA)
Measuring Pulse, Heart Rate, Blood Pressure Or Blood Flow (AREA)
Measuring Temperature Or Quantity Of Heat (AREA)
Electrophonic Musical Instruments (AREA)
Document Processing Apparatus (AREA)
Pens And Brushes (AREA)
Display Devices Of Pinball Game Machines (AREA)
Management, Administration, Business Operations System, And Electronic Commerce (AREA)

AT01923898T 2000-04-05 2001-04-03 Diskriminatives trainieren von hidden markov modellen für die erkennung fliessender sprache ATE398323T1 (de)

Applications Claiming Priority (1)

Application Number	Priority Date	Filing Date	Title
US09/543,202 US6490555B1 (en)	1997-03-14	2000-04-05	Discriminatively trained mixture models in continuous speech recognition

Publications (1)

Publication Number	Publication Date
ATE398323T1 true ATE398323T1 (de)	2008-07-15

Family

ID=24167006

Family Applications (1)

Application Number	Title	Priority Date	Filing Date
AT01923898T ATE398323T1 (de)	2000-04-05	2001-04-03	Diskriminatives trainieren von hidden markov modellen für die erkennung fliessender sprache

Country Status (7)

Country	Link
US (1)	US6490555B1 (de)
EP (1)	EP1269464B1 (de)
JP (1)	JP5134751B2 (de)
AT (1)	ATE398323T1 (de)
AU (1)	AU2001250579A1 (de)
DE (1)	DE60134395D1 (de)
WO (1)	WO2001075862A2 (de)

Families Citing this family (32)

* Cited by examiner, † Cited by third party
Publication number	Priority date	Publication date	Assignee	Title
US7020845B1 (en) *	1999-11-15	2006-03-28	Gottfurcht Elliot A	Navigating internet content on a television using a simplified interface and a remote control
US7003455B1 (en) *	2000-10-16	2006-02-21	Microsoft Corporation	Method of noise reduction using correction and scaling vectors with partitioning of the acoustic space in the domain of noisy speech
DE10120513C1 (de)	2001-04-26	2003-01-09	Siemens Ag	Verfahren zur Bestimmung einer Folge von Lautbausteinen zum Synthetisieren eines Sprachsignals einer tonalen Sprache
AUPR579601A0 (en) *	2001-06-19	2001-07-12	Syrinx Speech Systems Pty Limited	On-line environmental and speaker model adaptation
US20040150676A1 (en) *	2002-03-25	2004-08-05	Gottfurcht Elliot A.	Apparatus and method for simple wide-area network navigation
US7117148B2 (en) *	2002-04-05	2006-10-03	Microsoft Corporation	Method of noise reduction using correction vectors based on dynamic aspects of speech and noise normalization
DE10220524B4 (de)	2002-05-08	2006-08-10	Sap Ag	Verfahren und System zur Verarbeitung von Sprachdaten und zur Erkennung einer Sprache
EP1363271A1 (de)	2002-05-08	2003-11-19	Sap Ag	Verfahren und System zur Verarbeitung und Speicherung von Sprachinformationen eines Dialogs
FI121583B (fi) *	2002-07-05	2011-01-14	Syslore Oy	Symbolijonon etsintä
US7752045B2 (en) *	2002-10-07	2010-07-06	Carnegie Mellon University	Systems and methods for comparing speech elements
EP1450350A1 (de) *	2003-02-20	2004-08-25	Sony International (Europe) GmbH	Verfahren zur Spracherkennung mittels Attributen
US20040193412A1 (en) *	2003-03-18	2004-09-30	Aurilab, Llc	Non-linear score scrunching for more efficient comparison of hypotheses
US20040186714A1 (en) *	2003-03-18	2004-09-23	Aurilab, Llc	Speech recognition improvement through post-processsing
US8019602B2 (en) *	2004-01-20	2011-09-13	Microsoft Corporation	Automatic speech recognition learning using user corrections
GB0420464D0 (en) *	2004-09-14	2004-10-20	Zentian Ltd	A speech recognition circuit and method
EP1743897A1 (de) *	2005-07-15	2007-01-17	Gesellschaft für Biotechnologische Forschung mbH	Aus Sorangium cellulosum erhältliche biologisch aktive Verbindungen
US20070083373A1 (en) *	2005-10-11	2007-04-12	Matsushita Electric Industrial Co., Ltd.	Discriminative training of HMM models using maximum margin estimation for speech recognition
US8301449B2 (en) *	2006-10-16	2012-10-30	Microsoft Corporation	Minimum classification error training with growth transformation optimization
US7885812B2 (en) *	2006-11-15	2011-02-08	Microsoft Corporation	Joint training of feature extraction and acoustic model parameters for speech recognition
US20080147579A1 (en) *	2006-12-14	2008-06-19	Microsoft Corporation	Discriminative training using boosted lasso
US7856351B2 (en) *	2007-01-19	2010-12-21	Microsoft Corporation	Integrated speech recognition and semantic classification
US8423364B2 (en) *	2007-02-20	2013-04-16	Microsoft Corporation	Generic framework for large-margin MCE training in speech recognition
EP2133868A4 (de) *	2007-02-28	2013-01-16	Nec Corp	Gewichtskoeffizienten-lernsystem und audioerkennungssystem
US20080243503A1 (en) *	2007-03-30	2008-10-02	Microsoft Corporation	Minimum divergence based discriminative training for pattern recognition
US8239332B2 (en)	2007-11-20	2012-08-07	Microsoft Corporation	Constrained line search optimization for discriminative training of HMMS
US8843370B2 (en) *	2007-11-26	2014-09-23	Nuance Communications, Inc.	Joint discriminative training of multiple speech recognizers
JP5327054B2 (ja) *	2007-12-18	2013-10-30	日本電気株式会社	発音変動規則抽出装置、発音変動規則抽出方法、および発音変動規則抽出用プログラム
US9240184B1 (en) *	2012-11-15	2016-01-19	Google Inc.	Frame-level combination of deep neural network and gaussian mixture models
US9817881B2 (en) *	2013-10-16	2017-11-14	Cypress Semiconductor Corporation	Hidden markov model processing engine
JP6461308B2 (ja) *	2015-04-16	2019-01-30	三菱電機株式会社	音声認識装置およびリスコアリング装置
CN111354344B (zh) *	2020-03-09	2023-08-22	第四范式（北京）技术有限公司	语音识别模型的训练方法、装置、电子设备及存储介质
CN114387959B (zh) *	2020-10-19	2024-10-11	北京爱语吧科技有限公司	一种基于语音的日语发音评测方法和系统

Family Cites Families (14)

* Cited by examiner, † Cited by third party
Publication number	Priority date	Publication date	Assignee	Title
US4741036A (en)	1985-01-31	1988-04-26	International Business Machines Corporation	Determination of phone weights for markov models in a speech recognition system
US5027406A (en) *	1988-12-06	1991-06-25	Dragon Systems, Inc.	Method for interactive speech recognition and training
US5388183A (en)	1991-09-30	1995-02-07	Kurzwell Applied Intelligence, Inc.	Speech recognition providing multiple outputs
US5280563A (en)	1991-12-20	1994-01-18	Kurzweil Applied Intelligence, Inc.	Method of optimizing a composite speech recognition expert
ES2128390T3 (es)	1992-03-02	1999-05-16	At & T Corp	Metodo de adiestramiento y dispositivo para reconocimiento de voz.
US5832430A (en) *	1994-12-29	1998-11-03	Lucent Technologies, Inc.	Devices and methods for speech recognition of vocabulary words with simultaneous detection and verification
US5675706A (en) *	1995-03-31	1997-10-07	Lucent Technologies Inc.	Vocabulary independent discriminative utterance verification for non-keyword rejection in subword based speech recognition
US5737489A (en) *	1995-09-15	1998-04-07	Lucent Technologies Inc.	Discriminative utterance verification for connected digits recognition
US5895447A (en) *	1996-02-02	1999-04-20	International Business Machines Corporation	Speech recognition using thresholded speaker class model selection or model adaptation
US5991720A (en) *	1996-05-06	1999-11-23	Matsushita Electric Industrial Co., Ltd.	Speech recognition system employing multiple grammar networks
JPH10207485A (ja) *	1997-01-22	1998-08-07	Toshiba Corp	音声認識装置及び話者適応方法
US6122613A (en) *	1997-01-30	2000-09-19	Dragon Systems, Inc.	Speech recognition using multiple recognizers (selectively) applied to the same input sample
US6292778B1 (en) *	1998-10-30	2001-09-18	Lucent Technologies Inc.	Task-independent utterance verification with subword-based minimum verification error training
US7216079B1 (en)	1999-11-02	2007-05-08	Speechworks International, Inc.	Method and apparatus for discriminative training of acoustic models of a speech recognition system

2000
- 2000-04-05 US US09/543,202 patent/US6490555B1/en not_active Expired - Lifetime
2001
- 2001-04-03 AT AT01923898T patent/ATE398323T1/de not_active IP Right Cessation
- 2001-04-03 AU AU2001250579A patent/AU2001250579A1/en not_active Abandoned
- 2001-04-03 WO PCT/IB2001/000726 patent/WO2001075862A2/en not_active Ceased
- 2001-04-03 JP JP2001573458A patent/JP5134751B2/ja not_active Expired - Fee Related
- 2001-04-03 DE DE60134395T patent/DE60134395D1/de not_active Expired - Lifetime
- 2001-04-03 EP EP01923898A patent/EP1269464B1/de not_active Expired - Lifetime

Also Published As

Publication number	Publication date
EP1269464B1 (de)	2008-06-11
JP5134751B2 (ja)	2013-01-30
WO2001075862A3 (en)	2002-01-10
WO2001075862A2 (en)	2001-10-11
EP1269464A2 (de)	2003-01-02
DE60134395D1 (de)	2008-07-24
JP2004512544A (ja)	2004-04-22
US6490555B1 (en)	2002-12-03
AU2001250579A1 (en)	2001-10-15

Legal Events

Date	Code	Title	Description
2008-12-15	RER	Ceased as to paragraph 5 lit. 3 law introducing patent treaties

Publication	Publication Date	Title
ATE398323T1 (de)	2008-07-15	Diskriminatives trainieren von hidden markov modellen für die erkennung fliessender sprache
US8498857B2 (en)	2013-07-30	System and method for rapid prototyping of existing speech recognition solutions in different languages
CN101246685B (zh)	2011-03-30	计算机辅助语言学习系统中的发音质量评价方法
WO2009025356A1 (ja)	2009-02-26	音声認識装置および音声認識方法
CA2177638A1 (en)	1997-02-12	Utterance verification using word based minimum verification error training for recognizing a keyword string
US20020087317A1 (en)	2002-07-04	Computer-implemented dynamic pronunciation method and system
CN105261246A (zh)	2016-01-20	一种基于大数据挖掘技术的英语口语纠错系统
Gallwitz et al.	2002	Integrated recognition of words and prosodic phrase boundaries
EP1460615A1 (de)	2004-09-22	Sprachverarbeitungseinrichtung und -verfahren, aufzeichnungsmedium und programm
Bernstein et al.	1996	Speech recognition by computer
WO2007034478A3 (en)	2009-04-30	System and method for correcting speech
DE69916297D1 (de)	2004-05-13	Zwischen-wörter verbindung phonemische modelle
Li et al.	2018	Improving mandarin tone mispronunciation detection for non-native learners with soft-target tone labels and blstm-based deep models
Alfadhli et al.	2024	Qari: A Hybrid CTC/Attention-Based Model for Quran Recitation Recognition Using Bidirectional LSTMP in an End-to-End Architecture
Yamashita et al.	2005	Automatic scoring for prosodic proficiency of English sentences spoken by Japanese based on utterance comparison
Uchat	2006	Hidden Markov Model and Speech Recognition
Rayner et al.	2015	Supervised learning of response grammars in a spoken call system.
Sawada et al.	2014	Overview of NITECH HMM-based text-to-speech system for Blizzard Challenge 2014.
Fujisawa et al.	1998	Evaluation of Japanese manners of generating word accent of English based on a stressed syllable detection technique.
Hagen et al.	2005	Data driven subword unit modeling for speech recognition and its application to interactive reading tutors.
Deville et al.	1999	Automatic detection and correction of pronunciation errors for foreign language learners: the demosthenes application.
Vicsi	2012	Thinking about the present and future of the complex speech recognition
Hernández-Mena et al.	2015	Creating a grammar-based speech recognition parser for Mexican Spanish using HTK, compatible with CMU Sphinx-III system
Ibaoc et al.	2026	Cebuano Speech Tutor Using Hybrid TDNN-HMM Phoneme and Prosodic Feature Recognition
KR100570262B1 (ko)	2006-04-12	발음의 유창성을 평가하는 방법