ATE231642T1 - Anpassung eines spracherkenners zu dialektischen und linguistischen gebietsvarianten - Google Patents

Anpassung eines spracherkenners zu dialektischen und linguistischen gebietsvarianten

Info

Publication number: ATE231642T1
Authority: AT; Austria
Prior art keywords: generator; speech recognizer; smoothing; speech; speech data
Prior art date: 1998-04-22

Application number

AT99924814T

Other languages

English (en)

Inventor

Volker Fischer

Yuqing Gao

Michael A Picheny

Siegfried Kunzmann

Original Assignee

Ibm

Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)

1998-04-22

Filing date

1999-04-21

Publication date

2003-02-15

1999-04-21 Application filed by Ibm filed Critical Ibm

2003-02-15 Application granted granted Critical

2003-02-15 Publication of ATE231642T1 publication Critical patent/ATE231642T1/de

Links

230000006978 adaptation Effects 0.000 title 1
238000009499 grossing Methods 0.000 abstract 4
238000000034 method Methods 0.000 abstract 3
230000009286 beneficial effect Effects 0.000 abstract 1

Classifications

- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/06—Creation of reference templates; Training of speech recognition systems, e.g. adaptation to the characteristics of the speaker's voice
- G10L15/065—Adaptation
- G10L15/07—Adaptation to the speaker

Landscapes

Engineering & Computer Science (AREA)
Artificial Intelligence (AREA)
Computational Linguistics (AREA)
Health & Medical Sciences (AREA)
Audiology, Speech & Language Pathology (AREA)
Human Computer Interaction (AREA)
Physics & Mathematics (AREA)
Acoustics & Sound (AREA)
Multimedia (AREA)
Machine Translation (AREA)
Financial Or Insurance-Related Operations Such As Payment And Settlement (AREA)
Diaphragms For Electromechanical Transducers (AREA)

AT99924814T 1998-04-22 1999-04-21 Anpassung eines spracherkenners zu dialektischen und linguistischen gebietsvarianten ATE231642T1 (de)

Applications Claiming Priority (3)

Application Number	Priority Date	Filing Date	Title
US8265698P	1998-04-22	1998-04-22
US6611398A	1998-04-23	1998-04-23
PCT/EP1999/002673 WO1999054869A1 (en)	1998-04-22	1999-04-21	Adaptation of a speech recognizer for dialectal and linguistic domain variations

Publications (1)

Publication Number	Publication Date
ATE231642T1 true ATE231642T1 (de)	2003-02-15

Family

ID=26746379

Family Applications (1)

Application Number	Title	Priority Date	Filing Date
AT99924814T ATE231642T1 (de)	1998-04-22	1999-04-21	Anpassung eines spracherkenners zu dialektischen und linguistischen gebietsvarianten

Country Status (6)

Country	Link
EP (1)	EP1074019B1 (de)
CN (1)	CN1157711C (de)
AT (1)	ATE231642T1 (de)
DE (1)	DE69905030T2 (de)
TW (1)	TW477964B (de)
WO (1)	WO1999054869A1 (de)

Families Citing this family (12)

* Cited by examiner, † Cited by third party
Publication number	Priority date	Publication date	Assignee	Title
DE10014337A1 (de) *	2000-03-24	2001-09-27	Philips Corp Intellectual Pty	Verfahren zum Erzeugen eines Sprachmodells und eines akustischen Modells für ein Spracherkennungssystem
DE60111329T2 (de)	2000-11-14	2006-03-16	International Business Machines Corp.	Anpassung des phonetischen Kontextes zur Verbesserung der Spracherkennung
EP1215653B1 (de) *	2000-12-18	2003-09-17	Siemens Aktiengesellschaft	Verfahren und Anordnung zur Spracherkennung für ein Kleingerät
EP1887562B1 (de) *	2006-08-11	2010-04-28	Harman/Becker Automotive Systems GmbH	Spracherkennung mittels eines statistischen Sprachmodells unter Verwendung von Quadratwurzelglättung
CN102543071B (zh) *	2011-12-16	2013-12-11	安徽科大讯飞信息科技股份有限公司	用于移动设备的语音识别系统和方法
CN103839546A (zh) *	2014-03-26	2014-06-04	合肥新涛信息科技有限公司	一种基于江淮语系的语音识别系统
CN104766607A (zh) *	2015-03-05	2015-07-08	广州视源电子科技股份有限公司	一种电视节目推荐方法与系统
CN104751844A (zh) *	2015-03-12	2015-07-01	深圳市富途网络科技有限公司	用于证券信息交互的语音识别方法及其系统
CN106384587B (zh) *	2015-07-24	2019-11-15	科大讯飞股份有限公司	一种语音识别方法及系统
CN107452403B (zh) *	2017-09-12	2020-07-07	清华大学	一种说话人标记方法
CN112133290A (zh) *	2019-06-25	2020-12-25	南京航空航天大学	一种针对民航陆空通话领域的基于迁移学习的语音识别方法
CN112767961B (zh) *	2021-02-07	2022-06-03	哈尔滨琦音科技有限公司	一种基于云端计算的口音矫正方法

1999
- 1999-03-12 TW TW088103857A patent/TW477964B/zh not_active IP Right Cessation
- 1999-04-21 AT AT99924814T patent/ATE231642T1/de not_active IP Right Cessation
- 1999-04-21 CN CNB99805299XA patent/CN1157711C/zh not_active Expired - Fee Related
- 1999-04-21 DE DE69905030T patent/DE69905030T2/de not_active Expired - Lifetime
- 1999-04-21 WO PCT/EP1999/002673 patent/WO1999054869A1/en not_active Ceased
- 1999-04-21 EP EP99924814A patent/EP1074019B1/de not_active Expired - Lifetime

Also Published As

Publication number	Publication date
EP1074019A1 (de)	2001-02-07
EP1074019B1 (de)	2003-01-22
TW477964B (en)	2002-03-01
DE69905030T2 (de)	2003-11-27
CN1298533A (zh)	2001-06-06
DE69905030D1 (de)	2003-02-27
CN1157711C (zh)	2004-07-14
WO1999054869A1 (en)	1999-10-28

Similar Documents

Publication	Publication Date	Title
JPH06332494A (ja)	1994-12-02	音声を第１の言語から第２の言語に翻訳する際に音声理解を高めるための装置
ATE231642T1 (de)	2003-02-15	Anpassung eines spracherkenners zu dialektischen und linguistischen gebietsvarianten
EP0749109A3 (de)	1998-04-29	Spracherkennung für Tonsprachen
Erro et al.	2007	Flexible harmonic/stochastic speech synthesis.
Seresangtakul et al.	2002	Analysis of pitch contour of Thai tone using Fujisaki's model
JP3220163B2 (ja)	2001-10-22	音源生成装置、音声合成装置および方法
Seresangtakul et al.	2003	A generative model of fundamental frequency contours for polysyllabic words of Thai tones
Hisada et al.	2002	Real-time clarification of esophageal speech using a comb filter
Gutiérrez-Arriola et al.	2001	A new multi-speaker formant synthesizer that applies voice conversion techniques.
Gu et al.	2005	Analysis of the effects of word emphasis and echo question on F0 contours of Cantonese utterances.
JPH0580791A (ja)	1993-04-02	音声規則合成装置および方法
Cheng et al.	2013	HMM-based mandarin singing voice synthesis using tailored synthesis units and question sets
JP3270668B2 (ja)	2002-04-02	テキストからスピーチへの人工的ニューラルネットワークに基づく韻律の合成装置
Banga et al.	2002	Concatenative Text-to-Speech Synthesis based on Sinusoidal Modeling
Seresangtakul et al.	2003	Analysis and synthesis of pitch contour of Thai tone using Fujisaki's model
JP2001100777A (ja)	2001-04-13	音声合成方法及び装置
Muralishankar et al.	2001	Human touch to Tamil speech synthesizer
Lai	2007	F0 control model for mandarin singing voice synthesis
ATE214831T1 (de)	2002-04-15	Verfahren und anordnung zur bestimmung spektraler sprachcharakteristika in einer gesprochenen äusserung
Fujisaki et al.	2004	The command-response model for the generation of F/sub 0/contours of Cantonese utterances
Minematsu et al.	1996	Prosodic manipulation system of speech material for perceptual experiments
JPH08171394A (ja)	1996-07-02	音声合成装置
Wen et al.	2012	Prosody modification for vocoder based on amplitude spectrum of residual signal
Rudzicz	2022	Speech Synthesis
Seresangtakul et al.	2005	Synthesis of polysyllabic sequences of Thai tones using a generative model of fundamental frequency contours

Legal Events

Date	Code	Title	Description
2003-07-15	RER	Ceased as to paragraph 5 lit. 3 law introducing patent treaties