ATE398324T1 - Speech recognition by contextual modeling of speech units

Info

Publication number
ATE398324T1
Authority
AT
Austria
Prior art keywords
units
language
acoustic
voice units
states
Application number
AT04742550T
Other languages
English (en)
Inventor
Ronaldo Messina
Denis Jouvet
Original Assignee
France Telecom
Priority date
2004-04-20
Filing date
2004-04-20
Publication date
2008-07-15
Application filed by France Telecom
Application granted
Publication of ATE398324T1

Classifications

    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00 Speech recognition
    • G10L15/08 Speech classification or search
    • G10L15/14 Speech classification or search using statistical models, e.g. Hidden Markov Models [HMMs]
    • G10L15/142 Hidden Markov Models [HMMs]
    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00 Speech recognition
    • G10L15/02 Feature extraction for speech recognition; Selection of recognition unit
    • G10L2015/022 Demisyllables, biphones or triphones being the recognition units

Landscapes

  • Engineering & Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • Probability & Statistics with Applications (AREA)
  • Computational Linguistics (AREA)
  • Health & Medical Sciences (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Human Computer Interaction (AREA)
  • Acoustics & Sound (AREA)
  • Multimedia (AREA)
  • Machine Translation (AREA)
  • Fittings On The Vehicle Exterior For Carrying Loads, And Devices For Holding Or Mounting Articles (AREA)
AT04742550T 2004-04-20 2004-04-20 Speech recognition by contextual modeling of speech units ATE398324T1 (de)

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
PCT/FR2004/000972 WO2005112000A1 (fr) 2004-04-20 2004-04-20 Method and system for voice recognition by contextual modeling of voice units

Publications (1)

Publication Number Publication Date
ATE398324T1 (de) 2008-07-15

Family

ID=34958050

Family Applications (1)

Application Number Title Priority Date Filing Date
AT04742550T ATE398324T1 (de) 2004-04-20 2004-04-20 Speech recognition by contextual modeling of speech units

Country Status (5)

Country Link
US (1) US7818172B2 (de)
EP (1) EP1741092B1 (de)
AT (1) ATE398324T1 (de)
DE (1) DE602004014416D1 (de)
WO (1) WO2005112000A1 (de)

Families Citing this family (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US8145488B2 (en) * 2008-09-16 2012-03-27 Microsoft Corporation Parameter clustering and sharing for variable-parameter hidden markov models
US8160878B2 (en) * 2008-09-16 2012-04-17 Microsoft Corporation Piecewise-based variable-parameter Hidden Markov Models and the training thereof
KR101625304B1 (ko) * 2014-11-18 2016-05-27 경희대학교 산학협력단 Method for recognizing multiple user activities based on acoustic information

Family Cites Families (9)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JPH0782348B2 (ja) * 1992-03-21 1995-09-06 株式会社エイ・ティ・アール自動翻訳電話研究所 Method for generating subword models for speech recognition
GB9223066D0 (en) * 1992-11-04 1992-12-16 Secr Defence Children's speech training aid
US5737490A (en) * 1993-09-30 1998-04-07 Apple Computer, Inc. Method and apparatus for constructing continuous parameter fenonic hidden markov models by replacing phonetic models with continuous fenonic models
US5794197A (en) * 1994-01-21 1998-08-11 Microsoft Corporation Senone tree representation and evaluation
JP3581401B2 (ja) * 1994-10-07 2004-10-27 キヤノン株式会社 Speech recognition method
US5937384A (en) * 1996-05-01 1999-08-10 Microsoft Corporation Method and system for speech recognition using continuous density hidden Markov models
US5806030A (en) * 1996-05-06 1998-09-08 Matsushita Electric Industrial Co Ltd Low complexity, high accuracy clustering method for speech recognizer
US20060074664A1 (en) * 2000-01-10 2006-04-06 Lam Kwok L System and method for utterance verification of chinese long and short keywords
JP2002366187A (ja) * 2001-06-08 2002-12-20 Sony Corp Speech recognition apparatus, speech recognition method, program, and recording medium

Also Published As

Publication number Publication date
WO2005112000A1 (fr) 2005-11-24
EP1741092A1 (de) 2007-01-10
EP1741092B1 (de) 2008-06-11
DE602004014416D1 (de) 2008-07-24
US7818172B2 (en) 2010-10-19
US20070271096A1 (en) 2007-11-22

Similar Documents

Publication Publication Date Title
US10074363B2 (en) Method and apparatus for keyword speech recognition
Tao et al. Exploring deep learning architectures for automatically grading non-native spontaneous speech
CN111433847B (zh) Voice conversion method and training method, intelligent device, and storage medium
Bell et al. Prosodic adaptation in human-computer interaction
JP7070894B2 (ja) Learning system, method, and neural network model for time-series information
Furui et al. Fundamental technologies in modern speech recognition
CN107871496B (zh) Speech recognition method and apparatus
ATE419616T1 (de) Method, device, and computer program for speech recognition
RU2432623C2 (ru) Method and device for natural-language recognition of a spoken utterance
WO2008087934A1 (ja) Extended recognition dictionary learning device and speech recognition system
ATE403213T1 (de) System and method for automatic speech recognition
ATE457510T1 (de) Speech recognition system with a very large vocabulary
EP1507255A3 (de) Bubble splitting for compact acoustic models
CN105590625A (zh) Acoustic model adaptation method and system
DE60134395D1 (de) Discriminative training of hidden Markov models for continuous speech recognition
CN104934028A (zh) Training method and apparatus for a deep neural network model for speech synthesis
CN109147774B (zh) An improved time-delay neural network acoustic model
CN110706714A (zh) Speaker model creation system
US20180308501A1 (en) Multi speaker attribution using personal grammar detection
ATE349750T1 (de) Method for accelerating the execution of speech recognition with neural networks, and corresponding device
KR20220090171A (ko) Speech recognition apparatus, program, and learning control method thereof
CN116778967A (zh) Multimodal emotion recognition method and apparatus based on a pre-trained model
ATE353156T1 (de) Tracking vocal tract resonances using a target-guided constraint
CN107910008A (zh) A speech recognition method based on multiple acoustic models for personal devices
ATE398324T1 (de) Speech recognition by contextual modeling of speech units

Legal Events

Date Code Title Description
RER Ceased as to paragraph 5 lit. 3 law introducing patent treaties