ATE398323T1 - Diskriminatives trainieren von hidden markov modellen für die erkennung fliessender sprache - Google Patents
Diskriminatives trainieren von hidden markov modellen für die erkennung fliessender spracheInfo
- Publication number
- ATE398323T1 ATE398323T1 AT01923898T AT01923898T ATE398323T1 AT E398323 T1 ATE398323 T1 AT E398323T1 AT 01923898 T AT01923898 T AT 01923898T AT 01923898 T AT01923898 T AT 01923898T AT E398323 T1 ATE398323 T1 AT E398323T1
- Authority
- AT
- Austria
- Prior art keywords
- segment
- correct
- incorrect
- state sequence
- recognition
- Prior art date
Links
- 238000002864 sequence alignment Methods 0.000 abstract 3
- 238000000034 method Methods 0.000 abstract 1
Classifications
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/08—Speech classification or search
- G10L15/14—Speech classification or search using statistical models, e.g. Hidden Markov Models [HMMs]
- G10L15/142—Hidden Markov Models [HMMs]
- G10L15/144—Training of HMMs
- G10L15/146—Training of HMMs with insufficient amount of training data, e.g. state sharing, tying, deleted interpolation
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/08—Speech classification or search
- G10L15/14—Speech classification or search using statistical models, e.g. Hidden Markov Models [HMMs]
- G10L15/142—Hidden Markov Models [HMMs]
- G10L15/144—Training of HMMs
Landscapes
- Engineering & Computer Science (AREA)
- Physics & Mathematics (AREA)
- Multimedia (AREA)
- Health & Medical Sciences (AREA)
- Audiology, Speech & Language Pathology (AREA)
- Human Computer Interaction (AREA)
- Acoustics & Sound (AREA)
- Probability & Statistics with Applications (AREA)
- Computational Linguistics (AREA)
- Machine Translation (AREA)
- Character Discrimination (AREA)
- Image Analysis (AREA)
- Measuring Pulse, Heart Rate, Blood Pressure Or Blood Flow (AREA)
- Measuring Temperature Or Quantity Of Heat (AREA)
- Electrophonic Musical Instruments (AREA)
- Document Processing Apparatus (AREA)
- Pens And Brushes (AREA)
- Display Devices Of Pinball Game Machines (AREA)
- Management, Administration, Business Operations System, And Electronic Commerce (AREA)
Applications Claiming Priority (1)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| US09/543,202 US6490555B1 (en) | 1997-03-14 | 2000-04-05 | Discriminatively trained mixture models in continuous speech recognition |
Publications (1)
| Publication Number | Publication Date |
|---|---|
| ATE398323T1 true ATE398323T1 (de) | 2008-07-15 |
Family
ID=24167006
Family Applications (1)
| Application Number | Title | Priority Date | Filing Date |
|---|---|---|---|
| AT01923898T ATE398323T1 (de) | 2000-04-05 | 2001-04-03 | Diskriminatives trainieren von hidden markov modellen für die erkennung fliessender sprache |
Country Status (7)
| Country | Link |
|---|---|
| US (1) | US6490555B1 (de) |
| EP (1) | EP1269464B1 (de) |
| JP (1) | JP5134751B2 (de) |
| AT (1) | ATE398323T1 (de) |
| AU (1) | AU2001250579A1 (de) |
| DE (1) | DE60134395D1 (de) |
| WO (1) | WO2001075862A2 (de) |
Families Citing this family (32)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| US7020845B1 (en) * | 1999-11-15 | 2006-03-28 | Gottfurcht Elliot A | Navigating internet content on a television using a simplified interface and a remote control |
| US7003455B1 (en) * | 2000-10-16 | 2006-02-21 | Microsoft Corporation | Method of noise reduction using correction and scaling vectors with partitioning of the acoustic space in the domain of noisy speech |
| DE10120513C1 (de) | 2001-04-26 | 2003-01-09 | Siemens Ag | Verfahren zur Bestimmung einer Folge von Lautbausteinen zum Synthetisieren eines Sprachsignals einer tonalen Sprache |
| AUPR579601A0 (en) * | 2001-06-19 | 2001-07-12 | Syrinx Speech Systems Pty Limited | On-line environmental and speaker model adaptation |
| US20040150676A1 (en) * | 2002-03-25 | 2004-08-05 | Gottfurcht Elliot A. | Apparatus and method for simple wide-area network navigation |
| US7117148B2 (en) * | 2002-04-05 | 2006-10-03 | Microsoft Corporation | Method of noise reduction using correction vectors based on dynamic aspects of speech and noise normalization |
| DE10220524B4 (de) | 2002-05-08 | 2006-08-10 | Sap Ag | Verfahren und System zur Verarbeitung von Sprachdaten und zur Erkennung einer Sprache |
| EP1363271A1 (de) | 2002-05-08 | 2003-11-19 | Sap Ag | Verfahren und System zur Verarbeitung und Speicherung von Sprachinformationen eines Dialogs |
| FI121583B (fi) * | 2002-07-05 | 2011-01-14 | Syslore Oy | Symbolijonon etsintä |
| US7752045B2 (en) * | 2002-10-07 | 2010-07-06 | Carnegie Mellon University | Systems and methods for comparing speech elements |
| EP1450350A1 (de) * | 2003-02-20 | 2004-08-25 | Sony International (Europe) GmbH | Verfahren zur Spracherkennung mittels Attributen |
| US20040193412A1 (en) * | 2003-03-18 | 2004-09-30 | Aurilab, Llc | Non-linear score scrunching for more efficient comparison of hypotheses |
| US20040186714A1 (en) * | 2003-03-18 | 2004-09-23 | Aurilab, Llc | Speech recognition improvement through post-processsing |
| US8019602B2 (en) * | 2004-01-20 | 2011-09-13 | Microsoft Corporation | Automatic speech recognition learning using user corrections |
| GB0420464D0 (en) * | 2004-09-14 | 2004-10-20 | Zentian Ltd | A speech recognition circuit and method |
| EP1743897A1 (de) * | 2005-07-15 | 2007-01-17 | Gesellschaft für Biotechnologische Forschung mbH | Aus Sorangium cellulosum erhältliche biologisch aktive Verbindungen |
| US20070083373A1 (en) * | 2005-10-11 | 2007-04-12 | Matsushita Electric Industrial Co., Ltd. | Discriminative training of HMM models using maximum margin estimation for speech recognition |
| US8301449B2 (en) * | 2006-10-16 | 2012-10-30 | Microsoft Corporation | Minimum classification error training with growth transformation optimization |
| US7885812B2 (en) * | 2006-11-15 | 2011-02-08 | Microsoft Corporation | Joint training of feature extraction and acoustic model parameters for speech recognition |
| US20080147579A1 (en) * | 2006-12-14 | 2008-06-19 | Microsoft Corporation | Discriminative training using boosted lasso |
| US7856351B2 (en) * | 2007-01-19 | 2010-12-21 | Microsoft Corporation | Integrated speech recognition and semantic classification |
| US8423364B2 (en) * | 2007-02-20 | 2013-04-16 | Microsoft Corporation | Generic framework for large-margin MCE training in speech recognition |
| EP2133868A4 (de) * | 2007-02-28 | 2013-01-16 | Nec Corp | Gewichtskoeffizienten-lernsystem und audioerkennungssystem |
| US20080243503A1 (en) * | 2007-03-30 | 2008-10-02 | Microsoft Corporation | Minimum divergence based discriminative training for pattern recognition |
| US8239332B2 (en) | 2007-11-20 | 2012-08-07 | Microsoft Corporation | Constrained line search optimization for discriminative training of HMMS |
| US8843370B2 (en) * | 2007-11-26 | 2014-09-23 | Nuance Communications, Inc. | Joint discriminative training of multiple speech recognizers |
| JP5327054B2 (ja) * | 2007-12-18 | 2013-10-30 | 日本電気株式会社 | 発音変動規則抽出装置、発音変動規則抽出方法、および発音変動規則抽出用プログラム |
| US9240184B1 (en) * | 2012-11-15 | 2016-01-19 | Google Inc. | Frame-level combination of deep neural network and gaussian mixture models |
| US9817881B2 (en) * | 2013-10-16 | 2017-11-14 | Cypress Semiconductor Corporation | Hidden markov model processing engine |
| JP6461308B2 (ja) * | 2015-04-16 | 2019-01-30 | 三菱電機株式会社 | 音声認識装置およびリスコアリング装置 |
| CN111354344B (zh) * | 2020-03-09 | 2023-08-22 | 第四范式(北京)技术有限公司 | 语音识别模型的训练方法、装置、电子设备及存储介质 |
| CN114387959B (zh) * | 2020-10-19 | 2024-10-11 | 北京爱语吧科技有限公司 | 一种基于语音的日语发音评测方法和系统 |
Family Cites Families (14)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| US4741036A (en) | 1985-01-31 | 1988-04-26 | International Business Machines Corporation | Determination of phone weights for markov models in a speech recognition system |
| US5027406A (en) * | 1988-12-06 | 1991-06-25 | Dragon Systems, Inc. | Method for interactive speech recognition and training |
| US5388183A (en) | 1991-09-30 | 1995-02-07 | Kurzwell Applied Intelligence, Inc. | Speech recognition providing multiple outputs |
| US5280563A (en) | 1991-12-20 | 1994-01-18 | Kurzweil Applied Intelligence, Inc. | Method of optimizing a composite speech recognition expert |
| ES2128390T3 (es) | 1992-03-02 | 1999-05-16 | At & T Corp | Metodo de adiestramiento y dispositivo para reconocimiento de voz. |
| US5832430A (en) * | 1994-12-29 | 1998-11-03 | Lucent Technologies, Inc. | Devices and methods for speech recognition of vocabulary words with simultaneous detection and verification |
| US5675706A (en) * | 1995-03-31 | 1997-10-07 | Lucent Technologies Inc. | Vocabulary independent discriminative utterance verification for non-keyword rejection in subword based speech recognition |
| US5737489A (en) * | 1995-09-15 | 1998-04-07 | Lucent Technologies Inc. | Discriminative utterance verification for connected digits recognition |
| US5895447A (en) * | 1996-02-02 | 1999-04-20 | International Business Machines Corporation | Speech recognition using thresholded speaker class model selection or model adaptation |
| US5991720A (en) * | 1996-05-06 | 1999-11-23 | Matsushita Electric Industrial Co., Ltd. | Speech recognition system employing multiple grammar networks |
| JPH10207485A (ja) * | 1997-01-22 | 1998-08-07 | Toshiba Corp | 音声認識装置及び話者適応方法 |
| US6122613A (en) * | 1997-01-30 | 2000-09-19 | Dragon Systems, Inc. | Speech recognition using multiple recognizers (selectively) applied to the same input sample |
| US6292778B1 (en) * | 1998-10-30 | 2001-09-18 | Lucent Technologies Inc. | Task-independent utterance verification with subword-based minimum verification error training |
| US7216079B1 (en) | 1999-11-02 | 2007-05-08 | Speechworks International, Inc. | Method and apparatus for discriminative training of acoustic models of a speech recognition system |
-
2000
- 2000-04-05 US US09/543,202 patent/US6490555B1/en not_active Expired - Lifetime
-
2001
- 2001-04-03 AT AT01923898T patent/ATE398323T1/de not_active IP Right Cessation
- 2001-04-03 AU AU2001250579A patent/AU2001250579A1/en not_active Abandoned
- 2001-04-03 WO PCT/IB2001/000726 patent/WO2001075862A2/en not_active Ceased
- 2001-04-03 JP JP2001573458A patent/JP5134751B2/ja not_active Expired - Fee Related
- 2001-04-03 DE DE60134395T patent/DE60134395D1/de not_active Expired - Lifetime
- 2001-04-03 EP EP01923898A patent/EP1269464B1/de not_active Expired - Lifetime
Also Published As
| Publication number | Publication date |
|---|---|
| EP1269464B1 (de) | 2008-06-11 |
| JP5134751B2 (ja) | 2013-01-30 |
| WO2001075862A3 (en) | 2002-01-10 |
| WO2001075862A2 (en) | 2001-10-11 |
| EP1269464A2 (de) | 2003-01-02 |
| DE60134395D1 (de) | 2008-07-24 |
| JP2004512544A (ja) | 2004-04-22 |
| US6490555B1 (en) | 2002-12-03 |
| AU2001250579A1 (en) | 2001-10-15 |
Similar Documents
| Publication | Publication Date | Title |
|---|---|---|
| ATE398323T1 (de) | Diskriminatives trainieren von hidden markov modellen für die erkennung fliessender sprache | |
| US8498857B2 (en) | System and method for rapid prototyping of existing speech recognition solutions in different languages | |
| CN101246685B (zh) | 计算机辅助语言学习系统中的发音质量评价方法 | |
| WO2009025356A1 (ja) | 音声認識装置および音声認識方法 | |
| CA2177638A1 (en) | Utterance verification using word based minimum verification error training for recognizing a keyword string | |
| US20020087317A1 (en) | Computer-implemented dynamic pronunciation method and system | |
| CN105261246A (zh) | 一种基于大数据挖掘技术的英语口语纠错系统 | |
| Gallwitz et al. | Integrated recognition of words and prosodic phrase boundaries | |
| EP1460615A1 (de) | Sprachverarbeitungseinrichtung und -verfahren, aufzeichnungsmedium und programm | |
| Bernstein et al. | Speech recognition by computer | |
| WO2007034478A3 (en) | System and method for correcting speech | |
| DE69916297D1 (de) | Zwischen-wörter verbindung phonemische modelle | |
| Li et al. | Improving mandarin tone mispronunciation detection for non-native learners with soft-target tone labels and blstm-based deep models | |
| Alfadhli et al. | Qari: A Hybrid CTC/Attention-Based Model for Quran Recitation Recognition Using Bidirectional LSTMP in an End-to-End Architecture | |
| Yamashita et al. | Automatic scoring for prosodic proficiency of English sentences spoken by Japanese based on utterance comparison | |
| Uchat | Hidden Markov Model and Speech Recognition | |
| Rayner et al. | Supervised learning of response grammars in a spoken call system. | |
| Sawada et al. | Overview of NITECH HMM-based text-to-speech system for Blizzard Challenge 2014. | |
| Fujisawa et al. | Evaluation of Japanese manners of generating word accent of English based on a stressed syllable detection technique. | |
| Hagen et al. | Data driven subword unit modeling for speech recognition and its application to interactive reading tutors. | |
| Deville et al. | Automatic detection and correction of pronunciation errors for foreign language learners: the demosthenes application. | |
| Vicsi | Thinking about the present and future of the complex speech recognition | |
| Hernández-Mena et al. | Creating a grammar-based speech recognition parser for Mexican Spanish using HTK, compatible with CMU Sphinx-III system | |
| Ibaoc et al. | Cebuano Speech Tutor Using Hybrid TDNN-HMM Phoneme and Prosodic Feature Recognition | |
| KR100570262B1 (ko) | 발음의 유창성을 평가하는 방법 |
Legal Events
| Date | Code | Title | Description |
|---|---|---|---|
| RER | Ceased as to paragraph 5 lit. 3 law introducing patent treaties |