ATE347162T1 - Rauschunterdrückung zur robusten spracherkennung - Google Patents

Rauschunterdrückung zur robusten spracherkennung

Info

Publication number
ATE347162T1
ATE347162T1 AT04103533T AT04103533T ATE347162T1 AT E347162 T1 ATE347162 T1 AT E347162T1 AT 04103533 T AT04103533 T AT 04103533T AT 04103533 T AT04103533 T AT 04103533T AT E347162 T1 ATE347162 T1 AT E347162T1
Authority
AT
Austria
Prior art keywords
speech recognition
noise cancellation
noise
robust speech
sum
Prior art date
Application number
AT04103533T
Other languages
English (en)
Inventor
Michael L Seltzer
James Droppo
Alejandro Acero
Original Assignee
Microsoft Corp
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Microsoft Corp filed Critical Microsoft Corp
Application granted granted Critical
Publication of ATE347162T1 publication Critical patent/ATE347162T1/de

Links

Classifications

    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00Speech recognition
    • G10L15/20Speech recognition techniques specially adapted for robustness in adverse environments, e.g. in noise, of stress induced speech
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L21/00Speech or voice signal processing techniques to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
    • G10L21/02Speech enhancement, e.g. noise reduction or echo cancellation
    • G10L21/0208Noise filtering

Landscapes

  • Engineering & Computer Science (AREA)
  • Computational Linguistics (AREA)
  • Health & Medical Sciences (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Human Computer Interaction (AREA)
  • Physics & Mathematics (AREA)
  • Acoustics & Sound (AREA)
  • Multimedia (AREA)
  • Quality & Reliability (AREA)
  • Signal Processing (AREA)
  • Compression, Expansion, Code Conversion, And Decoders (AREA)
  • Noise Elimination (AREA)
  • Measurement Of Mechanical Vibrations Or Ultrasonic Waves (AREA)
  • Telephonic Communication Services (AREA)
AT04103533T 2003-08-25 2004-07-23 Rauschunterdrückung zur robusten spracherkennung ATE347162T1 (de)

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
US10/647,586 US7516067B2 (en) 2003-08-25 2003-08-25 Method and apparatus using harmonic-model-based front end for robust speech recognition

Publications (1)

Publication Number Publication Date
ATE347162T1 true ATE347162T1 (de) 2006-12-15

Family

ID=34104651

Family Applications (1)

Application Number Title Priority Date Filing Date
AT04103533T ATE347162T1 (de) 2003-08-25 2004-07-23 Rauschunterdrückung zur robusten spracherkennung

Country Status (7)

Country Link
US (1) US7516067B2 (de)
EP (1) EP1511011B1 (de)
JP (1) JP4731855B2 (de)
KR (1) KR101087319B1 (de)
CN (1) CN1591574B (de)
AT (1) ATE347162T1 (de)
DE (1) DE602004003439T2 (de)

Families Citing this family (27)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US7447630B2 (en) * 2003-11-26 2008-11-04 Microsoft Corporation Method and apparatus for multi-sensory speech enhancement
KR100744352B1 (ko) * 2005-08-01 2007-07-30 삼성전자주식회사 음성 신호의 하모닉 성분을 이용한 유/무성음 분리 정보를추출하는 방법 및 그 장치
US9185487B2 (en) 2006-01-30 2015-11-10 Audience, Inc. System and method for providing noise suppression utilizing null processing noise subtraction
US8005671B2 (en) * 2006-12-04 2011-08-23 Qualcomm Incorporated Systems and methods for dynamic normalization to reduce loss in precision for low-level signals
JP5089295B2 (ja) * 2007-08-31 2012-12-05 インターナショナル・ビジネス・マシーンズ・コーポレーション 音声処理システム、方法及びプログラム
KR100919223B1 (ko) * 2007-09-19 2009-09-28 한국전자통신연구원 부대역의 불확실성 정보를 이용한 잡음환경에서의 음성인식 방법 및 장치
US8306817B2 (en) * 2008-01-08 2012-11-06 Microsoft Corporation Speech recognition with non-linear noise reduction on Mel-frequency cepstra
JP5640238B2 (ja) * 2008-02-28 2014-12-17 株式会社通信放送国際研究所 特異点信号処理システムおよびそのプログラム
US8473287B2 (en) 2010-04-19 2013-06-25 Audience, Inc. Method for jointly optimizing noise reduction and voice quality in a mono or multi-microphone system
US8538035B2 (en) 2010-04-29 2013-09-17 Audience, Inc. Multi-microphone robust noise suppression
US8798290B1 (en) 2010-04-21 2014-08-05 Audience, Inc. Systems and methods for adaptive signal equalization
US8781137B1 (en) 2010-04-27 2014-07-15 Audience, Inc. Wind noise detection and suppression
US9558755B1 (en) 2010-05-20 2017-01-31 Knowles Electronics, Llc Noise suppression assisted automatic speech recognition
US9245538B1 (en) * 2010-05-20 2016-01-26 Audience, Inc. Bandwidth enhancement of speech signals assisted by noise reduction
US8447596B2 (en) * 2010-07-12 2013-05-21 Audience, Inc. Monaural noise suppression based on computational auditory scene analysis
US9792925B2 (en) * 2010-11-25 2017-10-17 Nec Corporation Signal processing device, signal processing method and signal processing program
FR2980620A1 (fr) * 2011-09-23 2013-03-29 France Telecom Traitement d'amelioration de la qualite des signaux audiofrequences decodes
US9640194B1 (en) 2012-10-04 2017-05-02 Knowles Electronics, Llc Noise suppression for speech processing based on machine-learning mask estimation
CN106797512B (zh) 2014-08-28 2019-10-25 美商楼氏电子有限公司 多源噪声抑制的方法、系统和非瞬时计算机可读存储介质
US9953646B2 (en) 2014-09-02 2018-04-24 Belleau Technologies Method and system for dynamic speech recognition and tracking of prewritten script
RU2712125C2 (ru) * 2015-09-25 2020-01-24 Фраунхофер-Гезелльшафт Цур Фердерунг Дер Ангевандтен Форшунг Е.Ф. Кодер и способ кодирования аудиосигнала с уменьшенным фоновым шумом с использованием кодирования с линейным предсказанием
WO2017143334A1 (en) * 2016-02-19 2017-08-24 New York University Method and system for multi-talker babble noise reduction using q-factor based signal decomposition
CN108175436A (zh) * 2017-12-28 2018-06-19 北京航空航天大学 一种肠鸣音智能自动识别方法
US11545143B2 (en) * 2021-05-18 2023-01-03 Boris Fridman-Mintz Recognition or synthesis of human-uttered harmonic sounds
CN114141246B (zh) * 2021-12-10 2025-07-08 北京百度网讯科技有限公司 用于识别语音的方法、用于训练模型的方法及装置
CN114999500B (zh) * 2022-05-30 2025-07-04 广东电网有限责任公司 一种基于基频信息的声纹识别方法及装置
CN118430566B (zh) * 2024-07-03 2024-10-11 陕西大才科技有限公司 一种语音通联方法及系统

Family Cites Families (20)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JPH06289897A (ja) * 1993-03-31 1994-10-18 Sony Corp 音声信号処理装置
US5701390A (en) * 1995-02-22 1997-12-23 Digital Voice Systems, Inc. Synthesis of MBE-based coded speech using regenerated phase information
GB9512284D0 (en) * 1995-06-16 1995-08-16 Nokia Mobile Phones Ltd Speech Synthesiser
JP3591068B2 (ja) * 1995-06-30 2004-11-17 ソニー株式会社 音声信号の雑音低減方法
JPH0944186A (ja) * 1995-07-31 1997-02-14 Matsushita Electric Ind Co Ltd 雑音抑制装置
JP4132109B2 (ja) * 1995-10-26 2008-08-13 ソニー株式会社 音声信号の再生方法及び装置、並びに音声復号化方法及び装置、並びに音声合成方法及び装置
JPH09152891A (ja) * 1995-11-28 1997-06-10 Takayoshi Hirata 非調和的周期検出法を用いた準周期的雑音の除去方式
US5913187A (en) 1997-08-29 1999-06-15 Nortel Networks Corporation Nonlinear filter for noise suppression in linear prediction speech processing devices
US6453285B1 (en) * 1998-08-21 2002-09-17 Polycom, Inc. Speech activity detector for use in noise reduction system, and methods therefor
US6253171B1 (en) * 1999-02-23 2001-06-26 Comsat Corporation Method of determining the voicing probability of speech signals
US6529868B1 (en) * 2000-03-28 2003-03-04 Tellabs Operations, Inc. Communication system noise cancellation power signal calculation techniques
TW466471B (en) * 2000-04-07 2001-12-01 Ind Tech Res Inst Method for performing noise adaptation in voice recognition unit
US20020039425A1 (en) * 2000-07-19 2002-04-04 Burnett Gregory C. Method and apparatus for removing noise from electronic signals
US7020605B2 (en) * 2000-09-15 2006-03-28 Mindspeed Technologies, Inc. Speech coding system with time-domain noise attenuation
JP3586205B2 (ja) * 2001-02-22 2004-11-10 日本電信電話株式会社 音声スペクトル改善方法、音声スペクトル改善装置、音声スペクトル改善プログラム、プログラムを記憶した記憶媒体
US7120580B2 (en) * 2001-08-15 2006-10-10 Sri International Method and apparatus for recognizing speech in a noisy environment
US6952482B2 (en) * 2001-10-02 2005-10-04 Siemens Corporation Research, Inc. Method and apparatus for noise filtering
US7447630B2 (en) * 2003-11-26 2008-11-04 Microsoft Corporation Method and apparatus for multi-sensory speech enhancement
US7464029B2 (en) * 2005-07-22 2008-12-09 Qualcomm Incorporated Robust separation of speech signals in a noisy environment
KR101414233B1 (ko) * 2007-01-05 2014-07-02 삼성전자 주식회사 음성 신호의 명료도를 향상시키는 장치 및 방법

Also Published As

Publication number Publication date
JP4731855B2 (ja) 2011-07-27
DE602004003439D1 (de) 2007-01-11
EP1511011B1 (de) 2006-11-29
CN1591574B (zh) 2010-06-23
KR101087319B1 (ko) 2011-11-25
US7516067B2 (en) 2009-04-07
JP2005070779A (ja) 2005-03-17
EP1511011A2 (de) 2005-03-02
EP1511011A3 (de) 2005-04-13
US20050049857A1 (en) 2005-03-03
KR20050022371A (ko) 2005-03-07
CN1591574A (zh) 2005-03-09
DE602004003439T2 (de) 2007-03-29

Similar Documents

Publication Publication Date Title
ATE347162T1 (de) Rauschunterdrückung zur robusten spracherkennung
WO2005055197A3 (en) Noise suppressor for speech coding and speech recognition
DE60329446D1 (de) Nichtlineares Modell zur Geräuschunterdrückung von verzerrten Signalen
US9570072B2 (en) System and method for noise reduction in processing speech signals by targeting speech and disregarding noise
SE0004163D0 (sv) Enhancing perceptual performance of high frequency reconstruction coding methods by adaptive filtering
ATE425532T1 (de) Modellbasierte verbesserung von sprachsignalen
ATE492015T1 (de) Verbesserung der sprachverständlichkeit mit einem psychoakustischen model und einer überabgetasteten filterbank
DE60321786D1 (de) Verfahren und anordnung zur grundfrequenzverbesserung eines decodierten sprachsignals
DE602007004738D1 (de) Verfahren zur unterdrückung akustischer restechos nach echounterdrückung bei einer freisprecheinrichtung
WO2002093876A3 (en) Final signal from a near-end signal and a far-end signal
WO2004045244A8 (en) Adaptative noise canceling microphone system
EP1308932A3 (de) Adaptive Postfilterverfahren und Vorrichtungen zur Sprachdekodierung
AU2003245443A1 (en) Improving speech recognition of mobile devices
FI20100431A7 (fi) Järjestelmä ja menetelmä häiriönpoiston mahdollistamiseksi käyttäen häiriönvähennyskäsittelyä
CA2485800A1 (en) Method and apparatus for multi-sensory speech enhancement
DE59914782D1 (de) Verfahren zur Störbefreiung eines Mikrophonsignals
FR2898209B1 (fr) Procede de debruitage d'un signal audio
WO2009151578A3 (en) Method and apparatus for blind signal recovery in noisy, reverberant environments
WO2007111646A3 (en) Speech post-processing using mdct coefficients
DE69920461D1 (de) Verfahren und Vorrichtung zur robusten Merkmalsextraktion für die Spracherkennung
DE60038279D1 (de) Beitband Sprachkodierung mit parametrischer Kodierung des Hochfrequenzanteils
DE60117558D1 (de) Verfahren zur rauschrobusten klassifikation in der sprachkodierung
DE60034429D1 (de) Verfahren und vorrichtung zur bestimmung von sprachkodierparametern
DE602005004464D1 (de) Sprachverbesserung
WO2004002028A3 (en) Audio signal processing apparatus and method

Legal Events

Date Code Title Description
RER Ceased as to paragraph 5 lit. 3 law introducing patent treaties