ATE398324T1 - Spracherkennung durch kontextuelle modellierung der spracheinheiten - Google Patents
Spracherkennung durch kontextuelle modellierung der spracheinheitenInfo
- Publication number
- ATE398324T1 ATE398324T1 AT04742550T AT04742550T ATE398324T1 AT E398324 T1 ATE398324 T1 AT E398324T1 AT 04742550 T AT04742550 T AT 04742550T AT 04742550 T AT04742550 T AT 04742550T AT E398324 T1 ATE398324 T1 AT E398324T1
- Authority
- AT
- Austria
- Prior art keywords
- units
- language
- acoustic
- voice units
- states
- Prior art date
Links
- 230000001419 dependent effect Effects 0.000 abstract 2
Classifications
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/08—Speech classification or search
- G10L15/14—Speech classification or search using statistical models, e.g. Hidden Markov Models [HMMs]
- G10L15/142—Hidden Markov Models [HMMs]
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/02—Feature extraction for speech recognition; Selection of recognition unit
- G10L2015/022—Demisyllables, biphones or triphones being the recognition units
Landscapes
- Engineering & Computer Science (AREA)
- Physics & Mathematics (AREA)
- Probability & Statistics with Applications (AREA)
- Computational Linguistics (AREA)
- Health & Medical Sciences (AREA)
- Audiology, Speech & Language Pathology (AREA)
- Human Computer Interaction (AREA)
- Acoustics & Sound (AREA)
- Multimedia (AREA)
- Machine Translation (AREA)
- Fittings On The Vehicle Exterior For Carrying Loads, And Devices For Holding Or Mounting Articles (AREA)
Applications Claiming Priority (1)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| PCT/FR2004/000972 WO2005112000A1 (fr) | 2004-04-20 | 2004-04-20 | Procede et systeme de reconnaissance vocale par modelisation contextuelle d’unites vocales |
Publications (1)
| Publication Number | Publication Date |
|---|---|
| ATE398324T1 true ATE398324T1 (de) | 2008-07-15 |
Family
ID=34958050
Family Applications (1)
| Application Number | Title | Priority Date | Filing Date |
|---|---|---|---|
| AT04742550T ATE398324T1 (de) | 2004-04-20 | 2004-04-20 | Spracherkennung durch kontextuelle modellierung der spracheinheiten |
Country Status (5)
| Country | Link |
|---|---|
| US (1) | US7818172B2 (de) |
| EP (1) | EP1741092B1 (de) |
| AT (1) | ATE398324T1 (de) |
| DE (1) | DE602004014416D1 (de) |
| WO (1) | WO2005112000A1 (de) |
Families Citing this family (3)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| US8145488B2 (en) * | 2008-09-16 | 2012-03-27 | Microsoft Corporation | Parameter clustering and sharing for variable-parameter hidden markov models |
| US8160878B2 (en) * | 2008-09-16 | 2012-04-17 | Microsoft Corporation | Piecewise-based variable-parameter Hidden Markov Models and the training thereof |
| KR101625304B1 (ko) * | 2014-11-18 | 2016-05-27 | 경희대학교 산학협력단 | 음향 정보에 기초한 사용자 다수 행위 인식 방법 |
Family Cites Families (9)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| JPH0782348B2 (ja) * | 1992-03-21 | 1995-09-06 | 株式会社エイ・ティ・アール自動翻訳電話研究所 | 音声認識用サブワードモデル生成方法 |
| GB9223066D0 (en) * | 1992-11-04 | 1992-12-16 | Secr Defence | Children's speech training aid |
| US5737490A (en) * | 1993-09-30 | 1998-04-07 | Apple Computer, Inc. | Method and apparatus for constructing continuous parameter fenonic hidden markov models by replacing phonetic models with continous fenonic models |
| US5794197A (en) * | 1994-01-21 | 1998-08-11 | Micrsoft Corporation | Senone tree representation and evaluation |
| JP3581401B2 (ja) * | 1994-10-07 | 2004-10-27 | キヤノン株式会社 | 音声認識方法 |
| US5937384A (en) * | 1996-05-01 | 1999-08-10 | Microsoft Corporation | Method and system for speech recognition using continuous density hidden Markov models |
| US5806030A (en) * | 1996-05-06 | 1998-09-08 | Matsushita Electric Industrial Co Ltd | Low complexity, high accuracy clustering method for speech recognizer |
| US20060074664A1 (en) * | 2000-01-10 | 2006-04-06 | Lam Kwok L | System and method for utterance verification of chinese long and short keywords |
| JP2002366187A (ja) * | 2001-06-08 | 2002-12-20 | Sony Corp | 音声認識装置および音声認識方法、並びにプログラムおよび記録媒体 |
-
2004
- 2004-04-20 AT AT04742550T patent/ATE398324T1/de not_active IP Right Cessation
- 2004-04-20 EP EP04742550A patent/EP1741092B1/de not_active Expired - Lifetime
- 2004-04-20 DE DE602004014416T patent/DE602004014416D1/de not_active Expired - Lifetime
- 2004-04-20 US US11/587,136 patent/US7818172B2/en not_active Expired - Fee Related
- 2004-04-20 WO PCT/FR2004/000972 patent/WO2005112000A1/fr not_active Ceased
Also Published As
| Publication number | Publication date |
|---|---|
| WO2005112000A1 (fr) | 2005-11-24 |
| EP1741092A1 (de) | 2007-01-10 |
| EP1741092B1 (de) | 2008-06-11 |
| DE602004014416D1 (de) | 2008-07-24 |
| US7818172B2 (en) | 2010-10-19 |
| US20070271096A1 (en) | 2007-11-22 |
Similar Documents
| Publication | Publication Date | Title |
|---|---|---|
| US10074363B2 (en) | Method and apparatus for keyword speech recognition | |
| Tao et al. | Exploring deep learning architectures for automatically grading non-native spontaneous speech | |
| CN111433847B (zh) | 语音转换的方法及训练方法、智能装置和存储介质 | |
| Bell et al. | Prosodic adaptation in human-computer interaction | |
| JP7070894B2 (ja) | 時系列情報の学習システム、方法およびニューラルネットワークモデル | |
| Furui et al. | Fundamental technologies in modern speech recognition | |
| CN107871496B (zh) | 语音识别方法和装置 | |
| ATE419616T1 (de) | Verfahren, einrichtung und computerprogramm zur spracherkennung | |
| RU2432623C2 (ru) | Способ и устройство для естественно-речевого распознавания речевого высказывания | |
| WO2008087934A1 (ja) | 拡張認識辞書学習装置と音声認識システム | |
| ATE403213T1 (de) | System und verfahren zur automatischen spracherkennung | |
| ATE457510T1 (de) | Spracherkennungssystem mit riesigem vokabular | |
| EP1507255A3 (de) | Blasteilung für kompakte akustische Modelle | |
| CN105590625A (zh) | 声学模型自适应方法及系统 | |
| DE60134395D1 (de) | Diskriminatives Trainieren von Hidden Markov Modellen für die Erkennung fliessender Sprache | |
| CN104934028A (zh) | 用于语音合成的深度神经网络模型的训练方法及装置 | |
| CN109147774B (zh) | 一种改进的延时神经网络声学模型 | |
| CN110706714A (zh) | 说话者模型制作系统 | |
| US20180308501A1 (en) | Multi speaker attribution using personal grammar detection | |
| ATE349750T1 (de) | Verfahren zur beschleunigung der durchführung von spracherkennung mit neuralen netzwerken, sowie entsprechende vorrichtung | |
| KR20220090171A (ko) | 음성 인식 장치, 프로그램 및 그것의 학습 제어 방법 | |
| CN116778967A (zh) | 基于预训练模型的多模态情感识别方法及装置 | |
| ATE353156T1 (de) | Verfolgen von vokaltraktresonanzen unter verwendung einer zielgeführten einschränkung | |
| CN107910008A (zh) | 一种用于个人设备的基于多声学模型的语音识别方法 | |
| ATE398324T1 (de) | Spracherkennung durch kontextuelle modellierung der spracheinheiten |
Legal Events
| Date | Code | Title | Description |
|---|---|---|---|
| RER | Ceased as to paragraph 5 lit. 3 law introducing patent treaties |