ATE231642T1 - Adaptation of a speech recognizer for dialectal and linguistic domain variations

Adaptation of a speech recognizer for dialectal and linguistic domain variations

Info

Publication number
ATE231642T1
Authority
AT
Austria
Prior art keywords
generator
speech recognizer
smoothing
speech
speech data
Application number
AT99924814T
Other languages
English (en)
Inventor
Volker Fischer
Yuqing Gao
Michael A Picheny
Siegfried Kunzmann
Original Assignee
IBM
Application filed by IBM
Publication of ATE231642T1

Classifications

    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00 Speech recognition
    • G10L15/06 Creation of reference templates; Training of speech recognition systems, e.g. adaptation to the characteristics of the speaker's voice
    • G10L15/065 Adaptation
    • G10L15/07 Adaptation to the speaker

Landscapes

  • Engineering & Computer Science (AREA)
  • Artificial Intelligence (AREA)
  • Computational Linguistics (AREA)
  • Health & Medical Sciences (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Human Computer Interaction (AREA)
  • Physics & Mathematics (AREA)
  • Acoustics & Sound (AREA)
  • Multimedia (AREA)
  • Machine Translation (AREA)
  • Diaphragms For Electromechanical Transducers (AREA)
  • Financial Or Insurance-Related Operations Such As Payment And Settlement (AREA)
AT99924814T 1998-04-22 1999-04-21 Adaptation of a speech recognizer for dialectal and linguistic domain variations ATE231642T1 (de)

Applications Claiming Priority (3)

Application Number Priority Date Filing Date Title
US8265698P 1998-04-22 1998-04-22
US6611398A 1998-04-23 1998-04-23
PCT/EP1999/002673 WO1999054869A1 (en) 1998-04-22 1999-04-21 Adaptation of a speech recognizer for dialectal and linguistic domain variations

Publications (1)

Publication Number Publication Date
ATE231642T1 (de) 2003-02-15

Family

ID=26746379

Family Applications (1)

Application Number Title Priority Date Filing Date
AT99924814T ATE231642T1 (de) 1998-04-22 1999-04-21 Adaptation of a speech recognizer for dialectal and linguistic domain variations

Country Status (6)

Country Link
EP (1) EP1074019B1 (de)
CN (1) CN1157711C (de)
AT (1) ATE231642T1 (de)
DE (1) DE69905030T2 (de)
TW (1) TW477964B (de)
WO (1) WO1999054869A1 (de)

Families Citing this family (12)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
DE10014337A1 (de) * 2000-03-24 2001-09-27 Philips Corp Intellectual Pty Method for generating a language model and an acoustic model for a speech recognition system
ATE297588T1 (de) 2000-11-14 2005-06-15 IBM Adaptation of the phonetic context to improve speech recognition
ES2208210T3 (es) * 2000-12-18 2004-06-16 Siemens Aktiengesellschaft Method and arrangement for speech recognition for a small device
DE602006013969D1 (de) * 2006-08-11 2010-06-10 Harman Becker Automotive Sys Speech recognition by means of a statistical language model using square-root smoothing
CN102543071B (zh) * 2011-12-16 2013-12-11 安徽科大讯飞信息科技股份有限公司 Speech recognition system and method for mobile devices
CN103839546A (zh) * 2014-03-26 2014-06-04 合肥新涛信息科技有限公司 Speech recognition system based on the Jianghuai dialect family
CN104766607A (zh) * 2015-03-05 2015-07-08 广州视源电子科技股份有限公司 Television program recommendation method and system
CN104751844A (zh) * 2015-03-12 2015-07-01 深圳市富途网络科技有限公司 Speech recognition method and system for securities information interaction
CN106384587B (zh) * 2015-07-24 2019-11-15 科大讯飞股份有限公司 Speech recognition method and system
CN107452403B (zh) * 2017-09-12 2020-07-07 清华大学 Speaker labeling method
CN112133290A (zh) * 2019-06-25 2020-12-25 南京航空航天大学 Transfer-learning-based speech recognition method for the civil aviation air-ground communication domain
CN112767961B (zh) * 2021-02-07 2022-06-03 哈尔滨琦音科技有限公司 Accent correction method based on cloud computing

Also Published As

Publication number Publication date
TW477964B (en) 2002-03-01
DE69905030T2 (de) 2003-11-27
EP1074019B1 (de) 2003-01-22
CN1298533A (zh) 2001-06-06
EP1074019A1 (de) 2001-02-07
CN1157711C (zh) 2004-07-14
WO1999054869A1 (en) 1999-10-28
DE69905030D1 (de) 2003-02-27

Similar Documents

Publication Publication Date Title
JPH06332494A (ja) Device for enhancing speech understanding when translating speech from a first language into a second language
ATE231642T1 (de) Adaptation of a speech recognizer for dialectal and linguistic domain variations
FI910360A0 (fi) Language training.
EP0749109A3 (de) Speech recognition for tonal languages
Erro et al. Flexible harmonic/stochastic speech synthesis.
Seresangtakul et al. Analysis of pitch contour of Thai tone using Fujisaki's model
JP3220163B2 (ja) Sound source generation device, speech synthesis device, and method
Hisada et al. Real-time clarification of esophageal speech using a comb filter
Gutiérrez-Arriola et al. A new multi-speaker formant synthesizer that applies voice conversion techniques.
JPH0580791A (ja) Rule-based speech synthesis device and method
Cheng et al. HMM-based mandarin singing voice synthesis using tailored synthesis units and question sets
JP3270668B2 (ja) Prosody synthesis device based on an artificial neural network for text-to-speech
Banga et al. Concatenative Text-to-Speech Synthesis based on Sinusoidal Modeling
Seresangtakul et al. Analysis and synthesis of pitch contour of Thai tone using Fujisaki's model
JP2001100777A (ja) Speech synthesis method and device
Muralishankar et al. Human touch to Tamil speech synthesizer
Lai F0 control model for mandarin singing voice synthesis
ATE214831T1 (de) Method and arrangement for determining spectral speech characteristics in a spoken utterance
Fujisaki et al. The command-response model for the generation of F0 contours of Cantonese utterances
Minematsu et al. Prosodic manipulation system of speech material for perceptual experiments
JPH08171394A (ja) Speech synthesis device
Wen et al. Prosody modification for vocoder based on amplitude spectrum of residual signal
Rudzicz Speech Synthesis
Seresangtakul et al. Synthesis of polysyllabic sequences of Thai tones using a generative model of fundamental frequency contours
Takara et al. A study on the pitch pattern of a singing voice synthesis system based on the cepstral method.

Legal Events

Date Code Title Description
RER Ceased as to paragraph 5 lit. 3 law introducing patent treaties