ATE508452T1 - Unterscheidung zwischen vordergrundsprache und hintergrundgeräuschen - Google Patents

Unterscheidung zwischen vordergrundsprache und hintergrundgeräuschen

Info

Publication number
ATE508452T1
ATE508452T1 AT07021933T AT07021933T ATE508452T1 AT E508452 T1 ATE508452 T1 AT E508452T1 AT 07021933 T AT07021933 T AT 07021933T AT 07021933 T AT07021933 T AT 07021933T AT E508452 T1 ATE508452 T1 AT E508452T1
Authority
AT
Austria
Prior art keywords
differentiation
background noise
stochastic
foreground
model
Prior art date
Application number
AT07021933T
Other languages
English (en)
Inventor
Tobias Herbig
Oliver Gaupp
Franz Gerl
Original Assignee
Harman Becker Automotive Sys
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Harman Becker Automotive Sys filed Critical Harman Becker Automotive Sys
Application granted granted Critical
Publication of ATE508452T1 publication Critical patent/ATE508452T1/de

Links

Classifications

    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L25/00Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
    • G10L25/78Detection of presence or absence of voice signals
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L21/00Speech or voice signal processing techniques to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
    • G10L21/02Speech enhancement, e.g. noise reduction or echo cancellation
    • G10L21/0208Noise filtering
    • G10L21/0216Noise filtering characterised by the method used for estimating noise
    • G10L2021/02161Number of inputs available containing the signal or the noise to be suppressed
    • G10L2021/02166Microphone arrays; Beamforming
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L25/00Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
    • G10L25/03Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 characterised by the type of extracted parameters
    • G10L25/24Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 characterised by the type of extracted parameters the extracted parameters being the cepstrum

Landscapes

  • Engineering & Computer Science (AREA)
  • Computational Linguistics (AREA)
  • Signal Processing (AREA)
  • Health & Medical Sciences (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Human Computer Interaction (AREA)
  • Physics & Mathematics (AREA)
  • Acoustics & Sound (AREA)
  • Multimedia (AREA)
  • Circuit For Audible Band Transducer (AREA)
  • Machine Translation (AREA)
  • Measurement And Recording Of Electrical Phenomena And Electrical Characteristics Of The Living Body (AREA)
  • Details Of Television Scanning (AREA)
AT07021933T 2007-11-12 2007-11-12 Unterscheidung zwischen vordergrundsprache und hintergrundgeräuschen ATE508452T1 (de)

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
EP07021933A EP2058797B1 (de) 2007-11-12 2007-11-12 Unterscheidung zwischen Vordergrundsprache und Hintergrundgeräuschen

Publications (1)

Publication Number Publication Date
ATE508452T1 true ATE508452T1 (de) 2011-05-15

Family

ID=39015777

Family Applications (1)

Application Number Title Priority Date Filing Date
AT07021933T ATE508452T1 (de) 2007-11-12 2007-11-12 Unterscheidung zwischen vordergrundsprache und hintergrundgeräuschen

Country Status (4)

Country Link
US (1) US8131544B2 (de)
EP (1) EP2058797B1 (de)
AT (1) ATE508452T1 (de)
DE (1) DE602007014382D1 (de)

Families Citing this family (60)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US8949120B1 (en) 2006-05-25 2015-02-03 Audience, Inc. Adaptive noise cancelation
JP4867516B2 (ja) * 2006-08-01 2012-02-01 ヤマハ株式会社 音声会議システム
JP2009086581A (ja) * 2007-10-03 2009-04-23 Toshiba Corp 音声認識の話者モデルを作成する装置およびプログラム
US8355511B2 (en) * 2008-03-18 2013-01-15 Audience, Inc. System and method for envelope-based acoustic echo cancellation
US8521530B1 (en) 2008-06-30 2013-08-27 Audience, Inc. System and method for enhancing a monaural audio signal
EP2189976B1 (de) * 2008-11-21 2012-10-24 Nuance Communications, Inc. Verfahren zur Adaption eines Codierungsbuches für Spracherkennung
US8275148B2 (en) * 2009-07-28 2012-09-25 Fortemedia, Inc. Audio processing apparatus and method
KR101581885B1 (ko) * 2009-08-26 2016-01-04 삼성전자주식회사 복소 스펙트럼 잡음 제거 장치 및 방법
CN102725715B (zh) * 2009-10-20 2016-11-09 谱瑞科技股份有限公司 减少触控屏幕控制器中的耦合噪声影响的方法和设备
US9838784B2 (en) 2009-12-02 2017-12-05 Knowles Electronics, Llc Directional audio capture
US9008329B1 (en) * 2010-01-26 2015-04-14 Audience, Inc. Noise reduction using multi-feature cluster tracker
US8473287B2 (en) 2010-04-19 2013-06-25 Audience, Inc. Method for jointly optimizing noise reduction and voice quality in a mono or multi-microphone system
US8538035B2 (en) 2010-04-29 2013-09-17 Audience, Inc. Multi-microphone robust noise suppression
US8781137B1 (en) 2010-04-27 2014-07-15 Audience, Inc. Wind noise detection and suppression
US9558755B1 (en) 2010-05-20 2017-01-31 Knowles Electronics, Llc Noise suppression assisted automatic speech recognition
US8447596B2 (en) * 2010-07-12 2013-05-21 Audience, Inc. Monaural noise suppression based on computational auditory scene analysis
US9128570B2 (en) 2011-02-07 2015-09-08 Cypress Semiconductor Corporation Noise filtering devices, systems and methods for capacitance sensing devices
CN102655006A (zh) * 2011-03-03 2012-09-05 富泰华工业(深圳)有限公司 语音传输装置及其语音传输方法
US9224388B2 (en) 2011-03-04 2015-12-29 Qualcomm Incorporated Sound recognition method and system
US8849663B2 (en) * 2011-03-21 2014-09-30 The Intellisis Corporation Systems and methods for segmenting and/or classifying an audio signal from transformed audio information
US9142220B2 (en) 2011-03-25 2015-09-22 The Intellisis Corporation Systems and methods for reconstructing an audio signal from transformed audio information
US9170322B1 (en) 2011-04-05 2015-10-27 Parade Technologies, Ltd. Method and apparatus for automating noise reduction tuning in real time
US9323385B2 (en) 2011-04-05 2016-04-26 Parade Technologies, Ltd. Noise detection for a capacitance sensing panel
CN103650040B (zh) * 2011-05-16 2017-08-25 谷歌公司 使用多特征建模分析语音/噪声可能性的噪声抑制方法和装置
KR101801327B1 (ko) * 2011-07-29 2017-11-27 삼성전자주식회사 감정 정보 생성 장치, 감정 정보 생성 방법 및 감정 정보 기반 기능 추천 장치
US8620646B2 (en) * 2011-08-08 2013-12-31 The Intellisis Corporation System and method for tracking sound pitch across an audio signal using harmonic envelope
US9183850B2 (en) 2011-08-08 2015-11-10 The Intellisis Corporation System and method for tracking sound pitch across an audio signal
US8548803B2 (en) 2011-08-08 2013-10-01 The Intellisis Corporation System and method of processing a sound signal including transforming the sound signal into a frequency-chirp domain
MX346827B (es) * 2011-10-17 2017-04-03 Koninklijke Philips Nv Sistema de monitoreo medico con base en analisis del sonido en un entorno medico.
US20150287406A1 (en) * 2012-03-23 2015-10-08 Google Inc. Estimating Speech in the Presence of Noise
US9881616B2 (en) * 2012-06-06 2018-01-30 Qualcomm Incorporated Method and systems having improved speech recognition
TWI557722B (zh) * 2012-11-15 2016-11-11 緯創資通股份有限公司 語音干擾的濾除方法、系統,與電腦可讀記錄媒體
CN103971685B (zh) * 2013-01-30 2015-06-10 腾讯科技(深圳)有限公司 语音命令识别方法和系统
US9520138B2 (en) * 2013-03-15 2016-12-13 Broadcom Corporation Adaptive modulation filtering for spectral feature enhancement
US9489965B2 (en) * 2013-03-15 2016-11-08 Sri International Method and apparatus for acoustic signal characterization
US9570087B2 (en) * 2013-03-15 2017-02-14 Broadcom Corporation Single channel suppression of interfering sources
US9536540B2 (en) * 2013-07-19 2017-01-03 Knowles Electronics, Llc Speech signal separation and synthesis based on auditory scene analysis and speech modeling
CN104143326B (zh) 2013-12-03 2016-11-02 腾讯科技(深圳)有限公司 一种语音命令识别方法和装置
CN106797512B (zh) 2014-08-28 2019-10-25 美商楼氏电子有限公司 多源噪声抑制的方法、系统和非瞬时计算机可读存储介质
WO2016040885A1 (en) 2014-09-12 2016-03-17 Audience, Inc. Systems and methods for restoration of speech components
TWI584275B (zh) * 2014-11-25 2017-05-21 宏達國際電子股份有限公司 電子裝置和聲音信號的分析與播放方法
US9922668B2 (en) 2015-02-06 2018-03-20 Knuedge Incorporated Estimating fractional chirp rate with multiple frequency representations
US9870785B2 (en) 2015-02-06 2018-01-16 Knuedge Incorporated Determining features of harmonic signals
US9842611B2 (en) 2015-02-06 2017-12-12 Knuedge Incorporated Estimating pitch using peak-to-peak distances
CN105096121B (zh) * 2015-06-25 2017-07-25 百度在线网络技术(北京)有限公司 声纹认证方法和装置
US20170150254A1 (en) * 2015-11-19 2017-05-25 Vocalzoom Systems Ltd. System, device, and method of sound isolation and signal enhancement
US9820042B1 (en) 2016-05-02 2017-11-14 Knowles Electronics, Llc Stereo separation and directional suppression with omni-directional microphones
CN105933323B (zh) * 2016-06-01 2019-05-31 百度在线网络技术(北京)有限公司 声纹注册、认证方法及装置
US20180166073A1 (en) * 2016-12-13 2018-06-14 Ford Global Technologies, Llc Speech Recognition Without Interrupting The Playback Audio
US10558421B2 (en) 2017-05-22 2020-02-11 International Business Machines Corporation Context based identification of non-relevant verbal communications
US10356362B1 (en) * 2018-01-16 2019-07-16 Google Llc Controlling focus of audio signals on speaker during videoconference
WO2021125037A1 (ja) * 2019-12-17 2021-06-24 ソニーグループ株式会社 信号処理装置、信号処理方法、プログラムおよび信号処理システム
US11274965B2 (en) 2020-02-10 2022-03-15 International Business Machines Corporation Noise model-based converter with signal steps based on uncertainty
CN113870879B (zh) * 2020-06-12 2024-12-13 青岛海尔电冰箱有限公司 智能家电麦克风的共享方法、智能家电和可读存储介质
US11694692B2 (en) 2020-11-11 2023-07-04 Bank Of America Corporation Systems and methods for audio enhancement and conversion
CN113870871A (zh) * 2021-08-19 2021-12-31 阿里巴巴达摩院(杭州)科技有限公司 音频处理方法、装置、存储介质、电子设备
CN115547308B (zh) * 2022-09-01 2024-09-20 北京达佳互联信息技术有限公司 一种音频识别模型训练方法、音频识别方法、装置、电子设备及存储介质
US20250201237A1 (en) * 2023-12-15 2025-06-19 Paypal, Inc. Split-and-merge framework for audio content processing
CN118098260B (zh) * 2024-03-26 2024-08-23 荣耀终端有限公司 一种语音信号处理方法及相关设备
CN119274568B (zh) * 2024-12-06 2025-03-14 深圳市宝立创科技有限公司 一种声学早教机的控制方法和系统

Family Cites Families (11)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US5353376A (en) * 1992-03-20 1994-10-04 Texas Instruments Incorporated System and method for improved speech acquisition for hands-free voice telecommunication in a noisy environment
US6615170B1 (en) 2000-03-07 2003-09-02 International Business Machines Corporation Model-based voice activity detection system and method using a log-likelihood ratio and pitch
US6993481B2 (en) * 2000-12-04 2006-01-31 Global Ip Sound Ab Detection of speech activity using feature model adaptation
US7072834B2 (en) * 2002-04-05 2006-07-04 Intel Corporation Adapting to adverse acoustic environment in speech processing using playback training data
JP2005249816A (ja) * 2004-03-01 2005-09-15 Internatl Business Mach Corp <Ibm> 信号強調装置、方法及びプログラム、並びに音声認識装置、方法及びプログラム
JP2007093630A (ja) * 2005-09-05 2007-04-12 Advanced Telecommunication Research Institute International 音声強調装置
CA2536976A1 (en) * 2006-02-20 2007-08-20 Diaphonics, Inc. Method and apparatus for detecting speaker change in a voice transaction
US20070239441A1 (en) * 2006-03-29 2007-10-11 Jiri Navratil System and method for addressing channel mismatch through class specific transforms
CA2652302C (en) * 2006-05-16 2015-04-07 Loquendo S.P.A. Intersession variability compensation for automatic extraction of information from voice
US9966085B2 (en) 2006-12-30 2018-05-08 Google Technology Holdings LLC Method and noise suppression circuit incorporating a plurality of noise suppression techniques
ATE457511T1 (de) * 2007-10-10 2010-02-15 Harman Becker Automotive Sys Sprechererkennung

Also Published As

Publication number Publication date
US20090228272A1 (en) 2009-09-10
EP2058797A1 (de) 2009-05-13
US8131544B2 (en) 2012-03-06
DE602007014382D1 (de) 2011-06-16
EP2058797B1 (de) 2011-05-04

Similar Documents

Publication Publication Date Title
ATE508452T1 (de) Unterscheidung zwischen vordergrundsprache und hintergrundgeräuschen
DE602006006664D1 (de) Reduzierung von Hintergrundrauschen in Freisprechsystemen
EP2105040A4 (de) Ohrmodul für ein individuelles tonsystem
PL4503026T3 (pl) Sposób powiększania szerokości pasma sygnału audio
ATE473603T1 (de) Akustische lokalisierung eines sprechers
DE602006013365D1 (de) Mikrofon und Schallverstärkungssystem
WO2009078146A1 (ja) 騒音低減装置および騒音低減システム
ATE540398T1 (de) Sprachaktivitätsdetektionseinrichtung und verfahren
DE602009000122D1 (de) Kompression und Mischen für Hörgeräte
EP1711031A4 (de) Membran für einen lautsprecher und lautsprecher
EP1852383A4 (de) Regler für aufzug
EP1770044A4 (de) Geschwindigkeitsregler für aufzug
ATE484160T1 (de) Verfahren zur rückkopplungslöschung in einem hörgerät und hörgerät
EP2265038A4 (de) Mikrophoneinheit, nahspracheingabegerät, informationsverarbeitungssystem und verfahren zur herstellung der mikrophoneinheit
ATE422789T1 (de) Mikrofoneinrichtung mit orientierungssensor und entsprechendes verfahren zum betreiben der mikrofoneinrichtung
EP1940199A4 (de) Lautsprecher, membran für einen lautsprecher und aufhängung
EP2296143A4 (de) Audiosignal-decodierungseinrichtung und gleichgewichtseinstellverfahren für eine audiosignal-decodierungseinrichtung
ATE446572T1 (de) Verfahren und system zur bereitstellung eines tonsignals mit erweiterter bandbreite
DK1885156T3 (da) Høreapparat med en audiosignalgenerator
BRPI0914701A2 (pt) suporte de bobina de voz para uma estrutura de motor de transdutor de bobina, método para fabricar o mesmo, estrutura de motor de transdutor de bobina, e, alto-falante
DK1862033T3 (da) Transducerarrangement der forbedrer naturligheden af lyde
DE502004012101D1 (de) Mikrophonschutz für Hörgeräte
ATE501603T1 (de) Hörhilfe mit uv-sensor und betriebsverfahren
EP1992192A4 (de) Schallschwamm für lautsprecher
DK2257081T3 (da) Høreindretning med to eller flere mikrofoner

Legal Events

Date Code Title Description
RER Ceased as to paragraph 5 lit. 3 law introducing patent treaties