ATE347162T1 - Noise suppression for robust speech recognition
- Publication number
- ATE347162T1
- Authority
- AT
- Austria
- Prior art keywords
- speech recognition
- noise cancellation
- noise
- robust speech
- sum
- Prior art date
Classifications
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/20—Speech recognition techniques specially adapted for robustness in adverse environments, e.g. in noise, of stress induced speech
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L21/00—Speech or voice signal processing techniques to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
- G10L21/02—Speech enhancement, e.g. noise reduction or echo cancellation
- G10L21/0208—Noise filtering
Landscapes
- Engineering & Computer Science (AREA)
- Computational Linguistics (AREA)
- Health & Medical Sciences (AREA)
- Audiology, Speech & Language Pathology (AREA)
- Human Computer Interaction (AREA)
- Physics & Mathematics (AREA)
- Acoustics & Sound (AREA)
- Multimedia (AREA)
- Quality & Reliability (AREA)
- Signal Processing (AREA)
- Compression, Expansion, Code Conversion, And Decoders (AREA)
- Noise Elimination (AREA)
- Measurement Of Mechanical Vibrations Or Ultrasonic Waves (AREA)
- Telephonic Communication Services (AREA)
Applications Claiming Priority (1)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| US10/647,586 US7516067B2 (en) | 2003-08-25 | 2003-08-25 | Method and apparatus using harmonic-model-based front end for robust speech recognition |
Publications (1)
| Publication Number | Publication Date |
|---|---|
| ATE347162T1 (de) | 2006-12-15 |
Family
ID=34104651
Family Applications (1)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| AT04103533T ATE347162T1 (de) | 2003-08-25 | 2004-07-23 | Noise suppression for robust speech recognition |
Country Status (7)
| Country | Link |
|---|---|
| US (1) | US7516067B2 (de) |
| EP (1) | EP1511011B1 (de) |
| JP (1) | JP4731855B2 (de) |
| KR (1) | KR101087319B1 (de) |
| CN (1) | CN1591574B (de) |
| AT (1) | ATE347162T1 (de) |
| DE (1) | DE602004003439T2 (de) |
Families Citing this family (27)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| US7447630B2 (en) * | 2003-11-26 | 2008-11-04 | Microsoft Corporation | Method and apparatus for multi-sensory speech enhancement |
| KR100744352B1 (ko) * | 2005-08-01 | 2007-07-30 | 삼성전자주식회사 | Method and apparatus for extracting voiced/unvoiced classification information using harmonic components of a speech signal |
| US9185487B2 (en) | 2006-01-30 | 2015-11-10 | Audience, Inc. | System and method for providing noise suppression utilizing null processing noise subtraction |
| US8005671B2 (en) * | 2006-12-04 | 2011-08-23 | Qualcomm Incorporated | Systems and methods for dynamic normalization to reduce loss in precision for low-level signals |
| JP5089295B2 (ja) * | 2007-08-31 | 2012-12-05 | インターナショナル・ビジネス・マシーンズ・コーポレーション | Speech processing system, method, and program |
| KR100919223B1 (ko) * | 2007-09-19 | 2009-09-28 | 한국전자통신연구원 | Method and apparatus for speech recognition in noisy environments using subband uncertainty information |
| US8306817B2 (en) * | 2008-01-08 | 2012-11-06 | Microsoft Corporation | Speech recognition with non-linear noise reduction on Mel-frequency cepstra |
| JP5640238B2 (ja) * | 2008-02-28 | 2014-12-17 | 株式会社通信放送国際研究所 | Singularity signal processing system and program therefor |
| US8473287B2 (en) | 2010-04-19 | 2013-06-25 | Audience, Inc. | Method for jointly optimizing noise reduction and voice quality in a mono or multi-microphone system |
| US8538035B2 (en) | 2010-04-29 | 2013-09-17 | Audience, Inc. | Multi-microphone robust noise suppression |
| US8798290B1 (en) | 2010-04-21 | 2014-08-05 | Audience, Inc. | Systems and methods for adaptive signal equalization |
| US8781137B1 (en) | 2010-04-27 | 2014-07-15 | Audience, Inc. | Wind noise detection and suppression |
| US9558755B1 (en) | 2010-05-20 | 2017-01-31 | Knowles Electronics, Llc | Noise suppression assisted automatic speech recognition |
| US9245538B1 (en) * | 2010-05-20 | 2016-01-26 | Audience, Inc. | Bandwidth enhancement of speech signals assisted by noise reduction |
| US8447596B2 (en) * | 2010-07-12 | 2013-05-21 | Audience, Inc. | Monaural noise suppression based on computational auditory scene analysis |
| US9792925B2 (en) * | 2010-11-25 | 2017-10-17 | Nec Corporation | Signal processing device, signal processing method and signal processing program |
| FR2980620A1 (fr) * | 2011-09-23 | 2013-03-29 | France Telecom | Processing for improving the quality of decoded audio-frequency signals |
| US9640194B1 (en) | 2012-10-04 | 2017-05-02 | Knowles Electronics, Llc | Noise suppression for speech processing based on machine-learning mask estimation |
| CN106797512B (zh) | 2014-08-28 | 2019-10-25 | 美商楼氏电子有限公司 | Method, system, and non-transitory computer-readable storage medium for multi-source noise suppression |
| US9953646B2 (en) | 2014-09-02 | 2018-04-24 | Belleau Technologies | Method and system for dynamic speech recognition and tracking of prewritten script |
| RU2712125C2 (ru) * | 2015-09-25 | 2020-01-24 | Фраунхофер-Гезелльшафт Цур Фердерунг Дер Ангевандтен Форшунг Е.Ф. | Encoder and method for encoding an audio signal with reduced background noise using linear predictive coding |
| WO2017143334A1 (en) * | 2016-02-19 | 2017-08-24 | New York University | Method and system for multi-talker babble noise reduction using q-factor based signal decomposition |
| CN108175436A (zh) * | 2017-12-28 | 2018-06-19 | 北京航空航天大学 | Intelligent automatic recognition method for bowel sounds |
| US11545143B2 (en) * | 2021-05-18 | 2023-01-03 | Boris Fridman-Mintz | Recognition or synthesis of human-uttered harmonic sounds |
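| CN114141246B (zh) * | 2021-12-10 | 2025-07-08 | 北京百度网讯科技有限公司 | Method for recognizing speech, and method and apparatus for training a model |
| CN114999500B (zh) * | 2022-05-30 | 2025-07-04 | 广东电网有限责任公司 | Voiceprint recognition method and apparatus based on fundamental-frequency information |
| CN118430566B (zh) * | 2024-07-03 | 2024-10-11 | 陕西大才科技有限公司 | Voice communication method and system |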
Family Cites Families (20)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| JPH06289897A (ja) * | 1993-03-31 | 1994-10-18 | Sony Corp | Speech signal processing device |
| US5701390A (en) * | 1995-02-22 | 1997-12-23 | Digital Voice Systems, Inc. | Synthesis of MBE-based coded speech using regenerated phase information |
| GB9512284D0 (en) * | 1995-06-16 | 1995-08-16 | Nokia Mobile Phones Ltd | Speech Synthesiser |
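| JP3591068B2 (ja) * | 1995-06-30 | 2004-11-17 | ソニー株式会社 | Noise reduction method for speech signals |
| JPH0944186A (ja) * | 1995-07-31 | 1997-02-14 | Matsushita Electric Ind Co Ltd | Noise suppression device |
| JP4132109B2 (ja) * | 1995-10-26 | 2008-08-13 | ソニー株式会社 | Speech signal reproduction method and apparatus, speech decoding method and apparatus, and speech synthesis method and apparatus |
| JPH09152891A (ja) * | 1995-11-28 | 1997-06-10 | Takayoshi Hirata | Quasi-periodic noise removal system using an inharmonic period detection method |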
| US5913187A (en) | 1997-08-29 | 1999-06-15 | Nortel Networks Corporation | Nonlinear filter for noise suppression in linear prediction speech processing devices |
| US6453285B1 (en) * | 1998-08-21 | 2002-09-17 | Polycom, Inc. | Speech activity detector for use in noise reduction system, and methods therefor |
| US6253171B1 (en) * | 1999-02-23 | 2001-06-26 | Comsat Corporation | Method of determining the voicing probability of speech signals |
| US6529868B1 (en) * | 2000-03-28 | 2003-03-04 | Tellabs Operations, Inc. | Communication system noise cancellation power signal calculation techniques |
| TW466471B (en) * | 2000-04-07 | 2001-12-01 | Ind Tech Res Inst | Method for performing noise adaptation in voice recognition unit |
| US20020039425A1 (en) * | 2000-07-19 | 2002-04-04 | Burnett Gregory C. | Method and apparatus for removing noise from electronic signals |
| US7020605B2 (en) * | 2000-09-15 | 2006-03-28 | Mindspeed Technologies, Inc. | Speech coding system with time-domain noise attenuation |
| JP3586205B2 (ja) * | 2001-02-22 | 2004-11-10 | 日本電信電話株式会社 | Speech spectrum improvement method, speech spectrum improvement device, speech spectrum improvement program, and storage medium storing the program |
| US7120580B2 (en) * | 2001-08-15 | 2006-10-10 | Sri International | Method and apparatus for recognizing speech in a noisy environment |
| US6952482B2 (en) * | 2001-10-02 | 2005-10-04 | Siemens Corporation Research, Inc. | Method and apparatus for noise filtering |
| US7447630B2 (en) * | 2003-11-26 | 2008-11-04 | Microsoft Corporation | Method and apparatus for multi-sensory speech enhancement |
| US7464029B2 (en) * | 2005-07-22 | 2008-12-09 | Qualcomm Incorporated | Robust separation of speech signals in a noisy environment |
| KR101414233B1 (ko) * | 2007-01-05 | 2014-07-02 | 삼성전자 주식회사 | Apparatus and method for improving the intelligibility of a speech signal |
- 2003-08-25: US, application US10/647,586, patent US7516067B2 (en), status: not active (Expired - Fee Related)
- 2004-07-23: DE, application DE602004003439T, patent DE602004003439T2 (de), status: not active (Expired - Lifetime)
- 2004-07-23: AT, application AT04103533T, patent ATE347162T1 (de), status: not active (IP Right Cessation)
- 2004-07-23: EP, application EP04103533A, patent EP1511011B1 (de), status: not active (Expired - Lifetime)
- 2004-08-19: JP, application JP2004239995A, patent JP4731855B2 (ja), status: not active (Expired - Fee Related)
- 2004-08-24: KR, application KR1020040066834A, patent KR101087319B1 (ko), status: not active (Expired - Fee Related)
- 2004-08-25: CN, application CN200410068536.3A, patent CN1591574B (zh), status: not active (Expired - Fee Related)
Also Published As
| Publication number | Publication date |
|---|---|
| JP4731855B2 (ja) | 2011-07-27 |
| DE602004003439D1 (de) | 2007-01-11 |
| EP1511011B1 (de) | 2006-11-29 |
| CN1591574B (zh) | 2010-06-23 |
| KR101087319B1 (ko) | 2011-11-25 |
| US7516067B2 (en) | 2009-04-07 |
| JP2005070779A (ja) | 2005-03-17 |
| EP1511011A2 (de) | 2005-03-02 |
| EP1511011A3 (de) | 2005-04-13 |
| US20050049857A1 (en) | 2005-03-03 |
| KR20050022371A (ko) | 2005-03-07 |
| CN1591574A (zh) | 2005-03-09 |
| DE602004003439T2 (de) | 2007-03-29 |
Similar Documents
| Publication | Title | Publication Date |
|---|---|---|
| ATE347162T1 (de) | Noise suppression for robust speech recognition | |
| WO2005055197A3 (en) | Noise suppressor for speech coding and speech recognition | |
| DE60329446D1 (de) | Nonlinear model for noise suppression of distorted signals | |
| US9570072B2 (en) | System and method for noise reduction in processing speech signals by targeting speech and disregarding noise | |
| SE0004163D0 (sv) | Enhancing perceptual performance of high frequency reconstruction coding methods by adaptive filtering | |
| ATE425532T1 (de) | Model-based enhancement of speech signals | |
| ATE492015T1 (de) | Improving speech intelligibility with a psychoacoustic model and an oversampled filter bank | |
| DE60321786D1 (de) | Method and arrangement for fundamental-frequency enhancement of a decoded speech signal | |
| DE602007004738D1 (de) | Method for suppressing residual acoustic echoes after echo cancellation in a hands-free device | |
| WO2002093876A3 (en) | Final signal from a near-end signal and a far-end signal | |
| WO2004045244A8 (en) | Adaptative noise canceling microphone system | |
| EP1308932A3 (de) | Adaptive postfiltering methods and devices for speech decoding | |
| AU2003245443A1 (en) | Improving speech recognition of mobile devices | |
| FI20100431A7 (fi) | System and method for enabling noise cancellation using noise-reduction processing | |
| CA2485800A1 (en) | Method and apparatus for multi-sensory speech enhancement | |
| DE59914782D1 (de) | Method for removing interference from a microphone signal | |
| FR2898209B1 (fr) | Method for denoising an audio signal | |
| WO2009151578A3 (en) | Method and apparatus for blind signal recovery in noisy, reverberant environments | |
| WO2007111646A3 (en) | Speech post-processing using mdct coefficients | |
| DE69920461D1 (de) | Method and apparatus for robust feature extraction for speech recognition | |
| DE60038279D1 (de) | Wideband speech coding with parametric coding of the high-frequency component | |
| DE60117558D1 (de) | Method for noise-robust classification in speech coding | |
| DE60034429D1 (de) | Method and apparatus for determining speech coding parameters | |
| DE602005004464D1 (de) | Speech enhancement | |
| WO2004002028A3 (en) | Audio signal processing apparatus and method |
Legal Events
| Date | Code | Title | Description |
|---|---|---|---|
| | RER | Ceased as to paragraph 5 lit. 3 law introducing patent treaties | |