ATE312398T1 - Sprecheranpassung für die spracherkennung - Google Patents
Sprecheranpassung für die spracherkennungInfo
- Publication number
- ATE312398T1 ATE312398T1 AT02253651T AT02253651T ATE312398T1 AT E312398 T1 ATE312398 T1 AT E312398T1 AT 02253651 T AT02253651 T AT 02253651T AT 02253651 T AT02253651 T AT 02253651T AT E312398 T1 ATE312398 T1 AT E312398T1
- Authority
- AT
- Austria
- Prior art keywords
- speaker adaptation
- voice recognition
- domain
- adaptation
- feature
- Prior art date
Links
Classifications
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/06—Creation of reference templates; Training of speech recognition systems, e.g. adaptation to the characteristics of the speaker's voice
- G10L15/065—Adaptation
- G10L15/07—Adaptation to the speaker
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L21/00—Speech or voice signal processing techniques to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
- G10L21/02—Speech enhancement, e.g. noise reduction or echo cancellation
- G10L21/0208—Noise filtering
Landscapes
- Engineering & Computer Science (AREA)
- Acoustics & Sound (AREA)
- Computational Linguistics (AREA)
- Multimedia (AREA)
- Health & Medical Sciences (AREA)
- Audiology, Speech & Language Pathology (AREA)
- Human Computer Interaction (AREA)
- Physics & Mathematics (AREA)
- Quality & Reliability (AREA)
- Artificial Intelligence (AREA)
- Signal Processing (AREA)
- Measurement Of Mechanical Vibrations Or Ultrasonic Waves (AREA)
- Noise Elimination (AREA)
- Complex Calculations (AREA)
- Fittings On The Vehicle Exterior For Carrying Loads, And Devices For Holding Or Mounting Articles (AREA)
- Soundproofing, Sound Blocking, And Sound Damping (AREA)
Applications Claiming Priority (1)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| US09/864,838 US6915259B2 (en) | 2001-05-24 | 2001-05-24 | Speaker and environment adaptation based on linear separation of variability sources |
Publications (1)
| Publication Number | Publication Date |
|---|---|
| ATE312398T1 true ATE312398T1 (de) | 2005-12-15 |
Family
ID=25344185
Family Applications (1)
| Application Number | Title | Priority Date | Filing Date |
|---|---|---|---|
| AT02253651T ATE312398T1 (de) | 2001-05-24 | 2002-05-23 | Sprecheranpassung für die spracherkennung |
Country Status (4)
| Country | Link |
|---|---|
| US (1) | US6915259B2 (de) |
| EP (1) | EP1262953B1 (de) |
| AT (1) | ATE312398T1 (de) |
| DE (1) | DE60207784T9 (de) |
Families Citing this family (37)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| JP2002366187A (ja) * | 2001-06-08 | 2002-12-20 | Sony Corp | 音声認識装置および音声認識方法、並びにプログラムおよび記録媒体 |
| CN1453767A (zh) * | 2002-04-26 | 2003-11-05 | 日本先锋公司 | 语音识别装置以及语音识别方法 |
| US7103540B2 (en) * | 2002-05-20 | 2006-09-05 | Microsoft Corporation | Method of pattern recognition using noise reduction uncertainty |
| US7107210B2 (en) * | 2002-05-20 | 2006-09-12 | Microsoft Corporation | Method of noise reduction based on dynamic aspects of speech |
| US7174292B2 (en) | 2002-05-20 | 2007-02-06 | Microsoft Corporation | Method of determining uncertainty associated with acoustic distortion-based noise reduction |
| US7340396B2 (en) * | 2003-02-18 | 2008-03-04 | Motorola, Inc. | Method and apparatus for providing a speaker adapted speech recognition model set |
| US7729908B2 (en) * | 2005-03-04 | 2010-06-01 | Panasonic Corporation | Joint signal and model based noise matching noise robustness method for automatic speech recognition |
| US7729909B2 (en) * | 2005-03-04 | 2010-06-01 | Panasonic Corporation | Block-diagonal covariance joint subspace tying and model compensation for noise robust automatic speech recognition |
| US9571652B1 (en) | 2005-04-21 | 2017-02-14 | Verint Americas Inc. | Enhanced diarization systems, media and methods of use |
| US7877255B2 (en) * | 2006-03-31 | 2011-01-25 | Voice Signal Technologies, Inc. | Speech recognition using channel verification |
| CA2652302C (en) * | 2006-05-16 | 2015-04-07 | Loquendo S.P.A. | Intersession variability compensation for automatic extraction of information from voice |
| US8180637B2 (en) * | 2007-12-03 | 2012-05-15 | Microsoft Corporation | High performance HMM adaptation with joint compensation of additive and convolutive distortions |
| US8798994B2 (en) * | 2008-02-06 | 2014-08-05 | International Business Machines Corporation | Resource conservative transformation based unsupervised speaker adaptation |
| JP5423670B2 (ja) * | 2008-04-30 | 2014-02-19 | 日本電気株式会社 | 音響モデル学習装置および音声認識装置 |
| US9798653B1 (en) * | 2010-05-05 | 2017-10-24 | Nuance Communications, Inc. | Methods, apparatus and data structure for cross-language speech adaptation |
| KR20120054845A (ko) * | 2010-11-22 | 2012-05-31 | 삼성전자주식회사 | 로봇의 음성인식방법 |
| GB2493413B (en) | 2011-07-25 | 2013-12-25 | Ibm | Maintaining and supplying speech models |
| US8543398B1 (en) | 2012-02-29 | 2013-09-24 | Google Inc. | Training an automatic speech recognition system using compressed word frequencies |
| US9984678B2 (en) * | 2012-03-23 | 2018-05-29 | Microsoft Technology Licensing, Llc | Factored transforms for separable adaptation of acoustic models |
| US8374865B1 (en) | 2012-04-26 | 2013-02-12 | Google Inc. | Sampling training data for an automatic speech recognition system based on a benchmark classification distribution |
| US8805684B1 (en) * | 2012-05-31 | 2014-08-12 | Google Inc. | Distributed speaker adaptation |
| US8571859B1 (en) | 2012-05-31 | 2013-10-29 | Google Inc. | Multi-stage speaker adaptation |
| US8880398B1 (en) | 2012-07-13 | 2014-11-04 | Google Inc. | Localized speech recognition with offload |
| US9368116B2 (en) | 2012-09-07 | 2016-06-14 | Verint Systems Ltd. | Speaker separation in diarization |
| US9123333B2 (en) | 2012-09-12 | 2015-09-01 | Google Inc. | Minimum bayesian risk methods for automatic speech recognition |
| US10134401B2 (en) * | 2012-11-21 | 2018-11-20 | Verint Systems Ltd. | Diarization using linguistic labeling |
| JP6000094B2 (ja) * | 2012-12-03 | 2016-09-28 | 日本電信電話株式会社 | 話者適応化装置、話者適応化方法、プログラム |
| US9275638B2 (en) | 2013-03-12 | 2016-03-01 | Google Technology Holdings LLC | Method and apparatus for training a voice recognition model database |
| US9460722B2 (en) | 2013-07-17 | 2016-10-04 | Verint Systems Ltd. | Blind diarization of recorded calls with arbitrary number of speakers |
| US9984706B2 (en) | 2013-08-01 | 2018-05-29 | Verint Systems Ltd. | Voice activity detection using a soft decision mechanism |
| US9875743B2 (en) | 2015-01-26 | 2018-01-23 | Verint Systems Ltd. | Acoustic signature building for a speaker from multiple sessions |
| US9865256B2 (en) | 2015-02-27 | 2018-01-09 | Storz Endoskop Produktions Gmbh | System and method for calibrating a speech recognition system to an operating environment |
| US11538128B2 (en) | 2018-05-14 | 2022-12-27 | Verint Americas Inc. | User interface for fraud alert management |
| US10887452B2 (en) | 2018-10-25 | 2021-01-05 | Verint Americas Inc. | System architecture for fraud detection |
| IL303147B2 (en) | 2019-06-20 | 2024-09-01 | Verint Americas Inc | Systems and methods for authentication and fraud detection |
| US11868453B2 (en) | 2019-11-07 | 2024-01-09 | Verint Americas Inc. | Systems and methods for customer authentication based on audio-of-interest |
| US11238847B2 (en) | 2019-12-04 | 2022-02-01 | Google Llc | Speaker awareness using speaker dependent speech model(s) |
Family Cites Families (10)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| US5131043A (en) | 1983-09-05 | 1992-07-14 | Matsushita Electric Industrial Co., Ltd. | Method of and apparatus for speech recognition wherein decisions are made based on phonemes |
| US5345536A (en) | 1990-12-21 | 1994-09-06 | Matsushita Electric Industrial Co., Ltd. | Method of speech recognition |
| JP2870224B2 (ja) | 1991-06-19 | 1999-03-17 | 松下電器産業株式会社 | 音声認識方法 |
| NO179421C (no) * | 1993-03-26 | 1996-10-02 | Statoil As | Apparat for fordeling av en ström av injeksjonsfluid i adskilte soner i en grunnformasjon |
| US5664059A (en) | 1993-04-29 | 1997-09-02 | Panasonic Technologies, Inc. | Self-learning speaker adaptation based on spectral variation source decomposition |
| JP3114468B2 (ja) | 1993-11-25 | 2000-12-04 | 松下電器産業株式会社 | 音声認識方法 |
| US5684925A (en) | 1995-09-08 | 1997-11-04 | Matsushita Electric Industrial Co., Ltd. | Speech representation by feature-based word prototypes comprising phoneme targets having reliable high similarity |
| US5822728A (en) | 1995-09-08 | 1998-10-13 | Matsushita Electric Industrial Co., Ltd. | Multistage word recognizer based on reliably detected phoneme similarity regions |
| JP3001037B2 (ja) | 1995-12-13 | 2000-01-17 | 日本電気株式会社 | 音声認識装置 |
| US6026359A (en) * | 1996-09-20 | 2000-02-15 | Nippon Telegraph And Telephone Corporation | Scheme for model adaptation in pattern recognition based on Taylor expansion |
-
2001
- 2001-05-24 US US09/864,838 patent/US6915259B2/en not_active Expired - Lifetime
-
2002
- 2002-05-23 EP EP02253651A patent/EP1262953B1/de not_active Expired - Lifetime
- 2002-05-23 DE DE60207784T patent/DE60207784T9/de not_active Expired - Fee Related
- 2002-05-23 AT AT02253651T patent/ATE312398T1/de not_active IP Right Cessation
Also Published As
| Publication number | Publication date |
|---|---|
| DE60207784T2 (de) | 2006-07-06 |
| US20030050780A1 (en) | 2003-03-13 |
| DE60207784T9 (de) | 2006-12-14 |
| EP1262953A3 (de) | 2004-04-07 |
| US6915259B2 (en) | 2005-07-05 |
| DE60207784D1 (de) | 2006-01-12 |
| EP1262953A2 (de) | 2002-12-04 |
| EP1262953B1 (de) | 2005-12-07 |
Similar Documents
| Publication | Publication Date | Title |
|---|---|---|
| ATE312398T1 (de) | Sprecheranpassung für die spracherkennung | |
| Scherer | Vocal communication of emotion: A review of research paradigms | |
| ATE443316T1 (de) | Spracherkennungsystem mittels impliziter sprecheradaptation | |
| DE60004331D1 (de) | Sprecher-erkennung | |
| ATE410768T1 (de) | System und verfahren zum betrieb eines spracherkennungssystems in einem fahrzeug | |
| DE60125542D1 (de) | System und verfahren zur spracherkennung mit einer vielzahl von spracherkennungsvorrichtungen | |
| ATE297588T1 (de) | Anpassung des phonetischen kontextes zur verbesserung der spracherkennung | |
| EP1022722A3 (de) | Sprecheradaptation auf der Basis von Stimm-Eigenvektoren | |
| DE60124408D1 (de) | Kombination von digitaler zeitverschiebung und hmm in sprecherabhängiger- und sprecherunabhängiger weise für die spracherkennung | |
| DE50209455D1 (de) | Verfahren zum Training oder zur Adaption eines Spracherkenners | |
| ATE541287T1 (de) | Rechnerisch effizienter hintergrundrauschunterdrücker für die sprachcodierung und spracherkennung | |
| ATE347162T1 (de) | Rauschunterdrückung zur robusten spracherkennung | |
| DE60002584D1 (de) | Anwendung von Referenzdaten für Spracherkennung | |
| ATE363120T1 (de) | Audio-dialogsystem und sprachgesteuertes browsing-verfahren | |
| EP1533791A3 (de) | Sprachaktivitätsdetektion und Verbesserung der Sprachverständlichkeit | |
| DE60212617D1 (de) | Vorrichtung zur sprachverbesserung | |
| DE502004002300D1 (de) | Verfahren zur sprecherabhängigen spracherkennung und spracherkennungssystem | |
| EP1189204A3 (de) | HMM-basierte Erkennung von verrauschter Sprache | |
| DE602005001995D1 (de) | Basisband-Modem und Verfahren zur Spracherkennung und verwendendes Mobilkommunikationsendgerät | |
| WO2002054719A3 (en) | Method and apparatus for active reduction of speakerphone echo | |
| CN105931651B (zh) | 助听设备中的语音信号处理方法、装置及助听设备 | |
| DE60303278D1 (de) | Vorrichtung zur Verbesserung der Spracherkennung | |
| DE60109650D1 (de) | Taktiles kommunikationssystem | |
| ATE441918T1 (de) | Sprachdialogverfahren und -system | |
| JP2000276190A (ja) | 発声を必要としない音声通話装置 |
Legal Events
| Date | Code | Title | Description |
|---|---|---|---|
| RER | Ceased as to paragraph 5 lit. 3 law introducing patent treaties |