ATE312398T1 - Sprecheranpassung für die spracherkennung - Google Patents

Sprecheranpassung für die spracherkennung

Info

Publication number: ATE312398T1
Authority: AT; Austria
Prior art keywords: speaker adaptation; voice recognition; domain; adaptation; feature
Prior art date: 2001-05-24

Application number

AT02253651T

Other languages

English (en)

Inventor

Luca Rigazio

Patrick Nguyen

David Kryze

Jean-Claude Junqua

Original Assignee

Matsushita Electric Industrial Co Ltd

Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)

2001-05-24

Filing date

2002-05-23

Publication date

2005-12-15

2002-05-23 Application filed by Matsushita Electric Industrial Co Ltd filed Critical Matsushita Electric Industrial Co Ltd

2005-12-15 Application granted granted Critical

2005-12-15 Publication of ATE312398T1 publication Critical patent/ATE312398T1/de

Links

Classifications

- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/06—Creation of reference templates; Training of speech recognition systems, e.g. adaptation to the characteristics of the speaker's voice
- G10L15/065—Adaptation
- G10L15/07—Adaptation to the speaker
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L21/00—Speech or voice signal processing techniques to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
- G10L21/02—Speech enhancement, e.g. noise reduction or echo cancellation
- G10L21/0208—Noise filtering

Landscapes

Engineering & Computer Science (AREA)
Multimedia (AREA)
Acoustics & Sound (AREA)
Health & Medical Sciences (AREA)
Audiology, Speech & Language Pathology (AREA)
Human Computer Interaction (AREA)
Physics & Mathematics (AREA)
Computational Linguistics (AREA)
Artificial Intelligence (AREA)
Signal Processing (AREA)
Quality & Reliability (AREA)
Measurement Of Mechanical Vibrations Or Ultrasonic Waves (AREA)
Noise Elimination (AREA)
Fittings On The Vehicle Exterior For Carrying Loads, And Devices For Holding Or Mounting Articles (AREA)
Complex Calculations (AREA)
Soundproofing, Sound Blocking, And Sound Damping (AREA)

AT02253651T 2001-05-24 2002-05-23 Sprecheranpassung für die spracherkennung ATE312398T1 (de)

Applications Claiming Priority (1)

Application Number	Priority Date	Filing Date	Title
US09/864,838 US6915259B2 (en)	2001-05-24	2001-05-24	Speaker and environment adaptation based on linear separation of variability sources

Publications (1)

Publication Number	Publication Date
ATE312398T1 true ATE312398T1 (de)	2005-12-15

Family

ID=25344185

Family Applications (1)

Application Number	Title	Priority Date	Filing Date
AT02253651T ATE312398T1 (de)	2001-05-24	2002-05-23	Sprecheranpassung für die spracherkennung

Country Status (4)

Country	Link
US (1)	US6915259B2 (de)
EP (1)	EP1262953B1 (de)
AT (1)	ATE312398T1 (de)
DE (1)	DE60207784T9 (de)

Families Citing this family (37)

* Cited by examiner, † Cited by third party
Publication number	Priority date	Publication date	Assignee	Title
JP2002366187A (ja) *	2001-06-08	2002-12-20	Sony Corp	音声認識装置および音声認識方法、並びにプログラムおよび記録媒体
CN1453767A (zh) *	2002-04-26	2003-11-05	日本先锋公司	语音识别装置以及语音识别方法
US7103540B2 (en)	2002-05-20	2006-09-05	Microsoft Corporation	Method of pattern recognition using noise reduction uncertainty
US7107210B2 (en) *	2002-05-20	2006-09-12	Microsoft Corporation	Method of noise reduction based on dynamic aspects of speech
US7174292B2 (en)	2002-05-20	2007-02-06	Microsoft Corporation	Method of determining uncertainty associated with acoustic distortion-based noise reduction
US7340396B2 (en) *	2003-02-18	2008-03-04	Motorola, Inc.	Method and apparatus for providing a speaker adapted speech recognition model set
US7729909B2 (en) *	2005-03-04	2010-06-01	Panasonic Corporation	Block-diagonal covariance joint subspace tying and model compensation for noise robust automatic speech recognition
US7729908B2 (en) *	2005-03-04	2010-06-01	Panasonic Corporation	Joint signal and model based noise matching noise robustness method for automatic speech recognition
US9571652B1 (en)	2005-04-21	2017-02-14	Verint Americas Inc.	Enhanced diarization systems, media and methods of use
US7877255B2 (en) *	2006-03-31	2011-01-25	Voice Signal Technologies, Inc.	Speech recognition using channel verification
US8566093B2 (en) *	2006-05-16	2013-10-22	Loquendo S.P.A.	Intersession variability compensation for automatic extraction of information from voice
US8180637B2 (en) *	2007-12-03	2012-05-15	Microsoft Corporation	High performance HMM adaptation with joint compensation of additive and convolutive distortions
US8798994B2 (en) *	2008-02-06	2014-08-05	International Business Machines Corporation	Resource conservative transformation based unsupervised speaker adaptation
US8751227B2 (en) *	2008-04-30	2014-06-10	Nec Corporation	Acoustic model learning device and speech recognition device
US9798653B1 (en) *	2010-05-05	2017-10-24	Nuance Communications, Inc.	Methods, apparatus and data structure for cross-language speech adaptation
KR20120054845A (ko) *	2010-11-22	2012-05-31	삼성전자주식회사	로봇의 음성인식방법
GB2493413B (en)	2011-07-25	2013-12-25	Ibm	Maintaining and supplying speech models
US8543398B1 (en)	2012-02-29	2013-09-24	Google Inc.	Training an automatic speech recognition system using compressed word frequencies
US9984678B2 (en) *	2012-03-23	2018-05-29	Microsoft Technology Licensing, Llc	Factored transforms for separable adaptation of acoustic models
US8374865B1 (en)	2012-04-26	2013-02-12	Google Inc.	Sampling training data for an automatic speech recognition system based on a benchmark classification distribution
US8571859B1 (en)	2012-05-31	2013-10-29	Google Inc.	Multi-stage speaker adaptation
US8805684B1 (en) *	2012-05-31	2014-08-12	Google Inc.	Distributed speaker adaptation
US8554559B1 (en)	2012-07-13	2013-10-08	Google Inc.	Localized speech recognition with offload
US9368116B2 (en)	2012-09-07	2016-06-14	Verint Systems Ltd.	Speaker separation in diarization
US9123333B2 (en)	2012-09-12	2015-09-01	Google Inc.	Minimum bayesian risk methods for automatic speech recognition
US10134401B2 (en) *	2012-11-21	2018-11-20	Verint Systems Ltd.	Diarization using linguistic labeling
JP6000094B2 (ja) *	2012-12-03	2016-09-28	日本電信電話株式会社	話者適応化装置、話者適応化方法、プログラム
US9275638B2 (en)	2013-03-12	2016-03-01	Google Technology Holdings LLC	Method and apparatus for training a voice recognition model database
US9460722B2 (en)	2013-07-17	2016-10-04	Verint Systems Ltd.	Blind diarization of recorded calls with arbitrary number of speakers
US9984706B2 (en)	2013-08-01	2018-05-29	Verint Systems Ltd.	Voice activity detection using a soft decision mechanism
US9875742B2 (en)	2015-01-26	2018-01-23	Verint Systems Ltd.	Word-level blind diarization of recorded calls with arbitrary number of speakers
US9865256B2 (en)	2015-02-27	2018-01-09	Storz Endoskop Produktions Gmbh	System and method for calibrating a speech recognition system to an operating environment
US11538128B2 (en)	2018-05-14	2022-12-27	Verint Americas Inc.	User interface for fraud alert management
US10887452B2 (en)	2018-10-25	2021-01-05	Verint Americas Inc.	System architecture for fraud detection
IL303147B2 (en)	2019-06-20	2024-09-01	Verint Americas Inc	Systems and methods for authentication and fraud detection
US11868453B2 (en)	2019-11-07	2024-01-09	Verint Americas Inc.	Systems and methods for customer authentication based on audio-of-interest
CN113261056B (zh)	2019-12-04	2024-08-02	谷歌有限责任公司	使用说话者相关语音模型的说话者感知

Family Cites Families (10)

* Cited by examiner, † Cited by third party
Publication number	Priority date	Publication date	Assignee	Title
US5131043A (en) *	1983-09-05	1992-07-14	Matsushita Electric Industrial Co., Ltd.	Method of and apparatus for speech recognition wherein decisions are made based on phonemes
US5345536A (en) *	1990-12-21	1994-09-06	Matsushita Electric Industrial Co., Ltd.	Method of speech recognition
JP2870224B2 (ja) *	1991-06-19	1999-03-17	松下電器産業株式会社	音声認識方法
NO179421C (no) *	1993-03-26	1996-10-02	Statoil As	Apparat for fordeling av en ström av injeksjonsfluid i adskilte soner i en grunnformasjon
US5664059A (en) *	1993-04-29	1997-09-02	Panasonic Technologies, Inc.	Self-learning speaker adaptation based on spectral variation source decomposition
JP3114468B2 (ja) *	1993-11-25	2000-12-04	松下電器産業株式会社	音声認識方法
US5822728A (en) *	1995-09-08	1998-10-13	Matsushita Electric Industrial Co., Ltd.	Multistage word recognizer based on reliably detected phoneme similarity regions
US5684925A (en) *	1995-09-08	1997-11-04	Matsushita Electric Industrial Co., Ltd.	Speech representation by feature-based word prototypes comprising phoneme targets having reliable high similarity
JP3001037B2 (ja)	1995-12-13	2000-01-17	日本電気株式会社	音声認識装置
US6026359A (en) *	1996-09-20	2000-02-15	Nippon Telegraph And Telephone Corporation	Scheme for model adaptation in pattern recognition based on Taylor expansion

2001
- 2001-05-24 US US09/864,838 patent/US6915259B2/en not_active Expired - Lifetime
2002
- 2002-05-23 DE DE60207784T patent/DE60207784T9/de not_active Expired - Fee Related
- 2002-05-23 EP EP02253651A patent/EP1262953B1/de not_active Expired - Lifetime
- 2002-05-23 AT AT02253651T patent/ATE312398T1/de not_active IP Right Cessation

Also Published As

Publication number	Publication date
US20030050780A1 (en)	2003-03-13
US6915259B2 (en)	2005-07-05
DE60207784D1 (de)	2006-01-12
EP1262953A3 (de)	2004-04-07
EP1262953A2 (de)	2002-12-04
DE60207784T9 (de)	2006-12-14
EP1262953B1 (de)	2005-12-07
DE60207784T2 (de)	2006-07-06

Legal Events

Date	Code	Title	Description
2006-06-15	RER	Ceased as to paragraph 5 lit. 3 law introducing patent treaties

Publication	Publication Date	Title
ATE312398T1 (de)	2005-12-15	Sprecheranpassung für die spracherkennung
Scherer	2003	Vocal communication of emotion: A review of research paradigms
DE60233763D1 (de)	2009-10-29	Spracherkennungsystem mittels impliziter Sprecheradaptation
DE60004331D1 (de)	2003-09-11	Sprecher-erkennung
DE60125542D1 (de)	2007-02-08	System und verfahren zur spracherkennung mit einer vielzahl von spracherkennungsvorrichtungen
DE60124408D1 (de)	2006-12-21	Kombination von digitaler zeitverschiebung und hmm in sprecherabhängiger- und sprecherunabhängiger weise für die spracherkennung
DE50209455D1 (de)	2007-03-29	Verfahren zum Training oder zur Adaption eines Spracherkenners
WO2006023631A3 (en)	2007-02-15	Document transcription system training
ATE541287T1 (de)	2012-01-15	Rechnerisch effizienter hintergrundrauschunterdrücker für die sprachcodierung und spracherkennung
ATE347162T1 (de)	2006-12-15	Rauschunterdrückung zur robusten spracherkennung
DE60002584D1 (de)	2003-06-12	Anwendung von Referenzdaten für Spracherkennung
ATE363120T1 (de)	2007-06-15	Audio-dialogsystem und sprachgesteuertes browsing-verfahren
EP1533791A3 (de)	2008-04-23	Sprachaktivitätsdetektion und Verbesserung der Sprachverständlichkeit
DE60212617D1 (de)	2006-08-03	Vorrichtung zur sprachverbesserung
DE502004002300D1 (de)	2007-01-25	Verfahren zur sprecherabhängigen spracherkennung und spracherkennungssystem
EP1189204A3 (de)	2002-08-28	HMM-basierte Erkennung von verrauschter Sprache
DE602005001995D1 (de)	2007-09-27	Basisband-Modem und Verfahren zur Spracherkennung und verwendendes Mobilkommunikationsendgerät
CN105931651B (zh)	2019-09-24	助听设备中的语音信号处理方法、装置及助听设备
AU2002364899A8 (en)	2003-06-10	A method and apparatus to perform speech recognition over a voice channel
ATE316283T1 (de)	2006-02-15	Vorrichtung zur verbesserung der spracherkennung
DE60109650D1 (de)	2005-04-28	Taktiles kommunikationssystem
ATE441918T1 (de)	2009-09-15	Sprachdialogverfahren und -system
JP2000276190A (ja)	2000-10-06	発声を必要としない音声通話装置
WO2005020208A3 (en)	2005-04-28	Topological voiceprints for speaker identification
Bonde et al.	2005	Noise robust automatic speech recognition with adaptive quantile based noise estimation and speech band emphasizing filter bank