ES2570961T3 - Estimación de varianza de ruido para mejorar la calidad de voz - Google Patents
Estimación de varianza de ruido para mejorar la calidad de vozInfo
- Publication number
- ES2570961T3 ES2570961T3 ES08726859T ES08726859T ES2570961T3 ES 2570961 T3 ES2570961 T3 ES 2570961T3 ES 08726859 T ES08726859 T ES 08726859T ES 08726859 T ES08726859 T ES 08726859T ES 2570961 T3 ES2570961 T3 ES 2570961T3
- Authority
- ES
- Spain
- Prior art keywords
- audio signal
- noise components
- amplitude
- estimate
- estimation
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Active
Links
- 230000005236 sound signal Effects 0.000 abstract 9
Classifications
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/20—Speech recognition techniques specially adapted for robustness in adverse environments, e.g. in noise, of stress induced speech
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L21/00—Speech or voice signal processing techniques to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
- G10L21/02—Speech enhancement, e.g. noise reduction or echo cancellation
- G10L21/0208—Noise filtering
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/08—Speech classification or search
- G10L15/12—Speech classification or search using dynamic programming techniques, e.g. dynamic time warping [DTW]
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L25/00—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
- G10L25/03—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 characterised by the type of extracted parameters
- G10L25/18—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 characterised by the type of extracted parameters the extracted parameters being spectral information of each sub-band
Landscapes
- Engineering & Computer Science (AREA)
- Multimedia (AREA)
- Acoustics & Sound (AREA)
- Computational Linguistics (AREA)
- Health & Medical Sciences (AREA)
- Audiology, Speech & Language Pathology (AREA)
- Human Computer Interaction (AREA)
- Physics & Mathematics (AREA)
- Quality & Reliability (AREA)
- Signal Processing (AREA)
- Compression, Expansion, Code Conversion, And Decoders (AREA)
- Circuit For Audible Band Transducer (AREA)
- Noise Elimination (AREA)
- Monitoring And Testing Of Transmission In General (AREA)
- Telephone Function (AREA)
Abstract
Un procedimiento para obtener una estimación de varianza en componentes de ruido de una señal de audio formada por componentes de voz y de ruido, que comprende: obtener dicha estimación de varianza en componentes de ruido de una señal de audio a partir del promedio de estimaciones previas de la amplitud de las componentes de ruido de la señal de audio, en el que las estimaciones de la amplitud de las componentes de ruido de la señal de audio que tienen valores mayores que un umbral se excluyen de o se ponderan con un valor bajo en el promedio de las estimaciones previas de la amplitud de las componentes de ruido de la señal de audio, y en el que cada estimación de la amplitud de las componentes de ruido de la señal de audio es una función de una estimación de varianza en las componentes de ruido de la señal de audio, una estimación de varianza en las componentes de voz de la señal de audio y la amplitud de la señal de audio.
Applications Claiming Priority (2)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| US91896407P | 2007-03-19 | 2007-03-19 | |
| PCT/US2008/003436 WO2008115435A1 (en) | 2007-03-19 | 2008-03-14 | Noise variance estimator for speech enhancement |
Publications (1)
| Publication Number | Publication Date |
|---|---|
| ES2570961T3 true ES2570961T3 (es) | 2016-05-23 |
Family
ID=39468801
Family Applications (1)
| Application Number | Title | Priority Date | Filing Date |
|---|---|---|---|
| ES08726859T Active ES2570961T3 (es) | 2007-03-19 | 2008-03-14 | Estimación de varianza de ruido para mejorar la calidad de voz |
Country Status (8)
| Country | Link |
|---|---|
| US (1) | US8280731B2 (es) |
| EP (2) | EP2137728B1 (es) |
| JP (1) | JP5186510B2 (es) |
| KR (1) | KR101141033B1 (es) |
| CN (1) | CN101647061B (es) |
| ES (1) | ES2570961T3 (es) |
| TW (1) | TWI420509B (es) |
| WO (1) | WO2008115435A1 (es) |
Families Citing this family (32)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| US8949120B1 (en) | 2006-05-25 | 2015-02-03 | Audience, Inc. | Adaptive noise cancelation |
| US8521530B1 (en) * | 2008-06-30 | 2013-08-27 | Audience, Inc. | System and method for enhancing a monaural audio signal |
| KR101581885B1 (ko) * | 2009-08-26 | 2016-01-04 | 삼성전자주식회사 | 복소 스펙트럼 잡음 제거 장치 및 방법 |
| US20110178800A1 (en) * | 2010-01-19 | 2011-07-21 | Lloyd Watts | Distortion Measurement for Noise Suppression System |
| US8798290B1 (en) | 2010-04-21 | 2014-08-05 | Audience, Inc. | Systems and methods for adaptive signal equalization |
| US9558755B1 (en) | 2010-05-20 | 2017-01-31 | Knowles Electronics, Llc | Noise suppression assisted automatic speech recognition |
| SG187743A1 (en) | 2010-08-12 | 2013-03-28 | Fraunhofer Ges Forschung | Resampling output signals of qmf based audio codecs |
| JP5643686B2 (ja) * | 2011-03-11 | 2014-12-17 | 株式会社東芝 | 音声判別装置、音声判別方法および音声判別プログラム |
| US9173025B2 (en) | 2012-02-08 | 2015-10-27 | Dolby Laboratories Licensing Corporation | Combined suppression of noise, echo, and out-of-location signals |
| EP2828853B1 (en) | 2012-03-23 | 2018-09-12 | Dolby Laboratories Licensing Corporation | Method and system for bias corrected speech level determination |
| EP2828854B1 (en) | 2012-03-23 | 2016-03-16 | Dolby Laboratories Licensing Corporation | Hierarchical active voice detection |
| JP6182895B2 (ja) * | 2012-05-01 | 2017-08-23 | 株式会社リコー | 処理装置、処理方法、プログラム及び処理システム |
| US9640194B1 (en) | 2012-10-04 | 2017-05-02 | Knowles Electronics, Llc | Noise suppression for speech processing based on machine-learning mask estimation |
| US10306389B2 (en) | 2013-03-13 | 2019-05-28 | Kopin Corporation | Head wearable acoustic system with noise canceling microphone geometry apparatuses and methods |
| US12380906B2 (en) | 2013-03-13 | 2025-08-05 | Solos Technology Limited | Microphone configurations for eyewear devices, systems, apparatuses, and methods |
| US9312826B2 (en) | 2013-03-13 | 2016-04-12 | Kopin Corporation | Apparatuses and methods for acoustic channel auto-balancing during multi-channel signal extraction |
| US9536540B2 (en) | 2013-07-19 | 2017-01-03 | Knowles Electronics, Llc | Speech signal separation and synthesis based on auditory scene analysis and speech modeling |
| CN103559887B (zh) * | 2013-11-04 | 2016-08-17 | 深港产学研基地 | 用于语音增强系统的背景噪声估计方法 |
| JP6361156B2 (ja) * | 2014-02-10 | 2018-07-25 | 沖電気工業株式会社 | 雑音推定装置、方法及びプログラム |
| CN103824563A (zh) * | 2014-02-21 | 2014-05-28 | 深圳市微纳集成电路与系统应用研究院 | 一种基于模块复用的助听器去噪装置和方法 |
| CN103854662B (zh) * | 2014-03-04 | 2017-03-15 | 中央军委装备发展部第六十三研究所 | 基于多域联合估计的自适应语音检测方法 |
| US9799330B2 (en) | 2014-08-28 | 2017-10-24 | Knowles Electronics, Llc | Multi-sourced noise suppression |
| JP6508491B2 (ja) * | 2014-12-12 | 2019-05-08 | ホアウェイ・テクノロジーズ・カンパニー・リミテッド | マルチチャネルオーディオ信号内の音声成分を強調するための信号処理装置 |
| CN105810214B (zh) * | 2014-12-31 | 2019-11-05 | 展讯通信(上海)有限公司 | 语音激活检测方法及装置 |
| EP3118851B1 (en) * | 2015-07-01 | 2021-01-06 | Oticon A/s | Enhancement of noisy speech based on statistical speech and noise models |
| US11631421B2 (en) * | 2015-10-18 | 2023-04-18 | Solos Technology Limited | Apparatuses and methods for enhanced speech recognition in variable environments |
| US20190137549A1 (en) * | 2017-11-03 | 2019-05-09 | Velodyne Lidar, Inc. | Systems and methods for multi-tier centroid calculation |
| EP3573058B1 (en) * | 2018-05-23 | 2021-02-24 | Harman Becker Automotive Systems GmbH | Dry sound and ambient sound separation |
| CN110164467B (zh) * | 2018-12-18 | 2022-11-25 | 腾讯科技(深圳)有限公司 | 语音降噪的方法和装置、计算设备和计算机可读存储介质 |
| CN110136738A (zh) * | 2019-06-13 | 2019-08-16 | 苏州思必驰信息科技有限公司 | 噪声估计方法及装置 |
| CN111613239B (zh) * | 2020-05-29 | 2023-09-05 | 北京达佳互联信息技术有限公司 | 音频去噪方法和装置、服务器、存储介质 |
| CN115188391B (zh) * | 2021-04-02 | 2025-06-13 | 深圳市三诺数字科技有限公司 | 一种远场双麦克风的语音增强方法及装置 |
Family Cites Families (19)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| US5706395A (en) * | 1995-04-19 | 1998-01-06 | Texas Instruments Incorporated | Adaptive weiner filtering using a dynamic suppression factor |
| SE506034C2 (sv) * | 1996-02-01 | 1997-11-03 | Ericsson Telefon Ab L M | Förfarande och anordning för förbättring av parametrar representerande brusigt tal |
| US6415253B1 (en) * | 1998-02-20 | 2002-07-02 | Meta-C Corporation | Method and apparatus for enhancing noise-corrupted speech |
| US6453285B1 (en) * | 1998-08-21 | 2002-09-17 | Polycom, Inc. | Speech activity detector for use in noise reduction system, and methods therefor |
| US6289309B1 (en) * | 1998-12-16 | 2001-09-11 | Sarnoff Corporation | Noise spectrum tracking for speech enhancement |
| US6910011B1 (en) * | 1999-08-16 | 2005-06-21 | Haman Becker Automotive Systems - Wavemakers, Inc. | Noisy acoustic signal enhancement |
| US6757395B1 (en) * | 2000-01-12 | 2004-06-29 | Sonic Innovations, Inc. | Noise reduction apparatus and method |
| US6804640B1 (en) * | 2000-02-29 | 2004-10-12 | Nuance Communications | Signal noise reduction using magnitude-domain spectral subtraction |
| JP3342864B2 (ja) * | 2000-09-13 | 2002-11-11 | 株式会社エントロピーソフトウェア研究所 | 音声の類似度検出方法及びその検出値を用いた音声認識方法、並びに、振動波の類似度検出方法及びその検出値を用いた機械の異常判定方法、並びに、画像の類似度検出方法及びその検出値を用いた画像認識方法、並びに、立体の類似度検出方法及びその検出値を用いた立体認識方法、並びに、動画像の類似度検出方法及びその検出値を用いた動画像認識方法 |
| JP4195267B2 (ja) * | 2002-03-14 | 2008-12-10 | インターナショナル・ビジネス・マシーンズ・コーポレーション | 音声認識装置、その音声認識方法及びプログラム |
| US20030187637A1 (en) * | 2002-03-29 | 2003-10-02 | At&T | Automatic feature compensation based on decomposition of speech and noise |
| EP1652404B1 (en) * | 2003-07-11 | 2010-11-03 | Cochlear Limited | Method and device for noise reduction |
| US7133825B2 (en) * | 2003-11-28 | 2006-11-07 | Skyworks Solutions, Inc. | Computationally efficient background noise suppressor for speech coding and speech recognition |
| CA2454296A1 (en) * | 2003-12-29 | 2005-06-29 | Nokia Corporation | Method and device for speech enhancement in the presence of background noise |
| US7492889B2 (en) | 2004-04-23 | 2009-02-17 | Acoustic Technologies, Inc. | Noise suppression based on bark band wiener filtering and modified doblinger noise estimate |
| US7454332B2 (en) * | 2004-06-15 | 2008-11-18 | Microsoft Corporation | Gain constrained noise suppression |
| US7742914B2 (en) * | 2005-03-07 | 2010-06-22 | Daniel A. Kosek | Audio spectral noise reduction method and apparatus |
| EP1760696B1 (en) * | 2005-09-03 | 2016-02-03 | GN ReSound A/S | Method and apparatus for improved estimation of non-stationary noise for speech enhancement |
| US8538763B2 (en) * | 2007-09-12 | 2013-09-17 | Dolby Laboratories Licensing Corporation | Speech enhancement with noise level estimation adjustment |
-
2008
- 2008-03-14 ES ES08726859T patent/ES2570961T3/es active Active
- 2008-03-14 US US12/531,690 patent/US8280731B2/en active Active
- 2008-03-14 CN CN2008800088867A patent/CN101647061B/zh active Active
- 2008-03-14 EP EP08726859.5A patent/EP2137728B1/en active Active
- 2008-03-14 EP EP16151957.4A patent/EP3070714B1/en active Active
- 2008-03-14 KR KR1020097019499A patent/KR101141033B1/ko active Active
- 2008-03-14 TW TW097109065A patent/TWI420509B/zh active
- 2008-03-14 WO PCT/US2008/003436 patent/WO2008115435A1/en not_active Ceased
- 2008-03-14 JP JP2009553646A patent/JP5186510B2/ja active Active
Also Published As
| Publication number | Publication date |
|---|---|
| KR20090122251A (ko) | 2009-11-26 |
| KR101141033B1 (ko) | 2012-05-03 |
| EP2137728B1 (en) | 2016-03-09 |
| EP2137728A1 (en) | 2009-12-30 |
| WO2008115435A1 (en) | 2008-09-25 |
| TW200844978A (en) | 2008-11-16 |
| CN101647061B (zh) | 2012-04-11 |
| JP5186510B2 (ja) | 2013-04-17 |
| US20100100386A1 (en) | 2010-04-22 |
| CN101647061A (zh) | 2010-02-10 |
| TWI420509B (zh) | 2013-12-21 |
| EP3070714A1 (en) | 2016-09-21 |
| US8280731B2 (en) | 2012-10-02 |
| JP2010521704A (ja) | 2010-06-24 |
| EP3070714B1 (en) | 2018-03-14 |
Similar Documents
| Publication | Publication Date | Title |
|---|---|---|
| ES2570961T3 (es) | Estimación de varianza de ruido para mejorar la calidad de voz | |
| LTPA2018510I1 (lt) | Dioksa-biciklo[3.2.1]oktan-2,3,4-triolio dariniai | |
| AR094279A1 (es) | Agregado de ruido de confort para modelar el ruido de fondo a bajas tasas de bits | |
| PL4503026T3 (pl) | Sposób powiększania szerokości pasma sygnału audio | |
| EP2486654A4 (en) | ADAPTIVE DYNAMIC RANGE EXTENSION OF AUDIO RECORDS | |
| MX2019005799A (es) | Estimacion del ruido de fondo en las señales de audio. | |
| ECSP12011946A (es) | Derivados de dioxa- biciclo [3.2.1] octano- 2 ,3,4- triol | |
| AR073992A1 (es) | Composiciones de recubrimiento acuosas | |
| GB201701046D0 (en) | Dynamic acoustic model switching to improve noisy speech recognition | |
| BR112016009563A2 (pt) | Extensão de largura de banda de áudio através da inserção de ruído temporal préformado no domínio de frequência | |
| WO2013138122A3 (en) | Automatic realtime speech impairment correction | |
| MX2013003803A (es) | Aparato y metodo para la estimacion de nivel de cuadros de audio codificados en el dominio de un flujo de bits. | |
| EP2738763A3 (en) | Speech enhancement apparatus and speech enhancement method | |
| MY183940A (en) | Gain shape estimation for improved tracking of high-band temporal characteristics | |
| Delcroix et al. | Speech recognition in living rooms: Integrated speech enhancement and recognition system based on spatial, spectral and temporal modeling of sounds | |
| PT3438979T (pt) | Estimativa de ruído de fundo em sinais de áudio | |
| MX2019001193A (es) | Metodo para procesar señal de voz/audio y aparato. | |
| AR101320A1 (es) | Método para estimar ruido en una señal de audio, estimador de ruido, codificador de audio, decodificador de audio, y sistema para transmitir señales de audio | |
| NZ747445A (en) | Composition containing caffeine and cycloalanylalanine | |
| MX390857B (es) | Mariscos pasteurizados, inocuos, con una vida de anaquel en refrigeracion extendida. | |
| WO2016100747A3 (en) | Method and apparatus for estimating waveform onset time | |
| DK3118851T3 (da) | Forbedring af støjende tale baseret på statistiske tale- og støjmodeller | |
| FI20075765A7 (fi) | Juusto ja menetelmä sen valmistamiseksi | |
| EP4265727A4 (en) | MODIFIED GLUCOSE DEHYDROGENASE | |
| WO2012150392A3 (fr) | Solution de rincage de greffon ou de tissu et procede de rincage dudit greffon ou tissu avant revascularisation |