ATE347161T1 - Rauschrobuste mustererkennung - Google Patents

Rauschrobuste mustererkennung

Info

Publication number: ATE347161T1
Authority: AT; Austria
Prior art keywords: noise; pattern recognition; training; signal; recognition model
Prior art date: 2000-10-16

Application number

AT01124141T

Other languages

English (en)

Inventor

Li Deng

Xuedong Huang

Michael D Plumpe

Original Assignee

Microsoft Corp

Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)

2000-10-16

Filing date

2001-10-10

Publication date

2006-12-15

2001-10-10 Application filed by Microsoft Corp filed Critical Microsoft Corp

2006-12-15 Application granted granted Critical

2006-12-15 Publication of ATE347161T1 publication Critical patent/ATE347161T1/de

Links

Classifications

- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L21/00—Speech or voice signal processing techniques to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
- G10L21/02—Speech enhancement, e.g. noise reduction or echo cancellation
- G10L21/0208—Noise filtering
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F18/00—Pattern recognition
- G06F18/20—Analysing
- G06F18/21—Design or setup of recognition systems or techniques; Extraction of features in feature space; Blind source separation
- G06F18/214—Generating training patterns; Bootstrap methods, e.g. bagging or boosting
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/06—Creation of reference templates; Training of speech recognition systems, e.g. adaptation to the characteristics of the speaker's voice
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/20—Speech recognition techniques specially adapted for robustness in adverse environments, e.g. in noise, of stress induced speech
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L21/00—Speech or voice signal processing techniques to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
- G10L21/02—Speech enhancement, e.g. noise reduction or echo cancellation
- G10L21/0208—Noise filtering
- G10L21/0216—Noise filtering characterised by the method used for estimating noise

Landscapes

Engineering & Computer Science (AREA)
Physics & Mathematics (AREA)
Multimedia (AREA)
Acoustics & Sound (AREA)
Human Computer Interaction (AREA)
Audiology, Speech & Language Pathology (AREA)
Health & Medical Sciences (AREA)
Computational Linguistics (AREA)
Theoretical Computer Science (AREA)
Artificial Intelligence (AREA)
Data Mining & Analysis (AREA)
Bioinformatics & Cheminformatics (AREA)
General Physics & Mathematics (AREA)
Computer Vision & Pattern Recognition (AREA)
Quality & Reliability (AREA)
Signal Processing (AREA)
Bioinformatics & Computational Biology (AREA)
Evolutionary Computation (AREA)
Evolutionary Biology (AREA)
Life Sciences & Earth Sciences (AREA)
General Engineering & Computer Science (AREA)
Filters That Use Time-Delay Elements (AREA)
Noise Elimination (AREA)
Circuit For Audible Band Transducer (AREA)
Compression, Expansion, Code Conversion, And Decoders (AREA)
Holo Graphy (AREA)
Inspection Of Paper Currency And Valuable Securities (AREA)

AT01124141T 2000-10-16 2001-10-10 Rauschrobuste mustererkennung ATE347161T1 (de)

Applications Claiming Priority (1)

Application Number	Priority Date	Filing Date	Title
US09/688,950 US6876966B1 (en)	2000-10-16	2000-10-16	Pattern recognition training method and apparatus using inserted noise followed by noise reduction

Publications (1)

Publication Number	Publication Date
ATE347161T1 true ATE347161T1 (de)	2006-12-15

Family

ID=24766456

Family Applications (1)

Application Number	Title	Priority Date	Filing Date
AT01124141T ATE347161T1 (de)	2000-10-16	2001-10-10	Rauschrobuste mustererkennung

Country Status (5)

Country	Link
US (1)	US6876966B1 (de)
EP (1)	EP1199708B1 (de)
JP (1)	JP4195211B2 (de)
AT (1)	ATE347161T1 (de)
DE (1)	DE60124842T2 (de)

Families Citing this family (71)

* Cited by examiner, † Cited by third party
Publication number	Priority date	Publication date	Assignee	Title
US7542961B2 (en) *	2001-05-02	2009-06-02	Victor Gogolak	Method and system for analyzing drug adverse effects
US6778994B2 (en)	2001-05-02	2004-08-17	Victor Gogolak	Pharmacovigilance database
US7925612B2 (en) *	2001-05-02	2011-04-12	Victor Gogolak	Method for graphically depicting drug adverse effect risks
US7461006B2 (en) *	2001-08-29	2008-12-02	Victor Gogolak	Method and system for the analysis and association of patient-specific and population-based genomic data with drug safety adverse event data
US7165028B2 (en) *	2001-12-12	2007-01-16	Texas Instruments Incorporated	Method of speech recognition resistant to convolutive distortion and additive distortion
US7209881B2 (en) *	2001-12-20	2007-04-24	Matsushita Electric Industrial Co., Ltd.	Preparing acoustic models by sufficient statistics and noise-superimposed speech data
US7130776B2 (en) *	2002-03-25	2006-10-31	Lockheed Martin Corporation	Method and computer program product for producing a pattern recognition training set
US7117148B2 (en)	2002-04-05	2006-10-03	Microsoft Corporation	Method of noise reduction using correction vectors based on dynamic aspects of speech and noise normalization
US7103540B2 (en) *	2002-05-20	2006-09-05	Microsoft Corporation	Method of pattern recognition using noise reduction uncertainty
US7107210B2 (en) *	2002-05-20	2006-09-12	Microsoft Corporation	Method of noise reduction based on dynamic aspects of speech
US7174292B2 (en)	2002-05-20	2007-02-06	Microsoft Corporation	Method of determining uncertainty associated with acoustic distortion-based noise reduction
JP4352790B2 (ja) *	2002-10-31	2009-10-28	セイコーエプソン株式会社	音響モデル作成方法および音声認識装置ならびに音声認識装置を有する乗り物
US7370057B2 (en) *	2002-12-03	2008-05-06	Lockheed Martin Corporation	Framework for evaluating data cleansing applications
CN100356391C (zh) *	2003-05-21	2007-12-19	皇家飞利浦电子股份有限公司	验证身份的方法、识别设备和读/写设备
US8041026B1 (en)	2006-02-07	2011-10-18	Avaya Inc.	Event driven noise cancellation
US20070239444A1 (en) *	2006-03-29	2007-10-11	Motorola, Inc.	Voice signal perturbation for speech recognition
JP4245617B2 (ja) *	2006-04-06	2009-03-25	株式会社東芝	特徴量補正装置、特徴量補正方法および特徴量補正プログラム
JP4316583B2 (ja)	2006-04-07	2009-08-19	株式会社東芝	特徴量補正装置、特徴量補正方法および特徴量補正プログラム
US7840287B2 (en) *	2006-04-13	2010-11-23	Fisher-Rosemount Systems, Inc.	Robust process model identification in model based control techniques
US8407160B2 (en) *	2006-11-15	2013-03-26	The Trustees Of Columbia University In The City Of New York	Systems, methods, and media for generating sanitized data, sanitizing anomaly detection models, and/or generating sanitized anomaly detection models
US8195453B2 (en) *	2007-09-13	2012-06-05	Qnx Software Systems Limited	Distributed intelligibility testing system
WO2009039897A1 (en) *	2007-09-26	2009-04-02	Fraunhofer - Gesellschaft Zur Förderung Der Angewandten Forschung E.V.	Apparatus and method for extracting an ambient signal in an apparatus and method for obtaining weighting coefficients for extracting an ambient signal and computer program
US8615397B2 (en) *	2008-04-04	2013-12-24	Intuit Inc.	Identifying audio content using distorted target patterns
NO328622B1 (no)	2008-06-30	2010-04-06	Tandberg Telecom As	Anordning og fremgangsmate for reduksjon av tastaturstoy i konferanseutstyr
JP5150542B2 (ja) *	2009-03-26	2013-02-20	株式会社東芝	パターン認識装置、パターン認識方法、及び、プログラム
US11416214B2 (en)	2009-12-23	2022-08-16	Google Llc	Multi-modal input on an electronic device
EP3091535B1 (de)	2009-12-23	2023-10-11	Google LLC	Multimodale eingabe in eine elektronische vorrichtung
US8660842B2 (en) *	2010-03-09	2014-02-25	Honda Motor Co., Ltd.	Enhancing speech recognition using visual information
US8265928B2 (en) *	2010-04-14	2012-09-11	Google Inc.	Geotagged environmental audio for enhanced speech recognition accuracy
US8468012B2 (en)	2010-05-26	2013-06-18	Google Inc.	Acoustic model adaptation using geographic information
US8484023B2 (en) *	2010-09-24	2013-07-09	Nuance Communications, Inc.	Sparse representation features for speech recognition
US8352245B1 (en)	2010-12-30	2013-01-08	Google Inc.	Adjusting language models
US8296142B2 (en)	2011-01-21	2012-10-23	Google Inc.	Speech recognition using dock context
HUP1200018A2 (en)	2012-01-11	2013-07-29	77 Elektronika Mueszeripari Kft	Method of training a neural network, as well as a neural network
US8484017B1 (en)	2012-09-10	2013-07-09	Google Inc.	Identifying media content
US20140074466A1 (en)	2012-09-10	2014-03-13	Google Inc.	Answering questions using environmental context
US9734819B2 (en)	2013-02-21	2017-08-15	Google Technology Holdings LLC	Recognizing accented speech
US20140278393A1 (en)	2013-03-12	2014-09-18	Motorola Mobility Llc	Apparatus and Method for Power Efficient Signal Conditioning for a Voice Recognition System
US9237225B2 (en)	2013-03-12	2016-01-12	Google Technology Holdings LLC	Apparatus with dynamic audio signal pre-conditioning and methods therefor
US9275638B2 (en)	2013-03-12	2016-03-01	Google Technology Holdings LLC	Method and apparatus for training a voice recognition model database
US20140270249A1 (en)	2013-03-12	2014-09-18	Motorola Mobility Llc	Method and Apparatus for Estimating Variability of Background Noise for Noise Suppression
CN105580071B (zh) *	2013-05-06	2020-08-21	谷歌技术控股有限责任公司	用于训练声音识别模型数据库的方法和装置
CN103310789B (zh) *	2013-05-08	2016-04-06	北京大学深圳研究生院	一种基于改进的并行模型组合的声音事件识别方法
US9842592B2 (en)	2014-02-12	2017-12-12	Google Inc.	Language models using non-linguistic context
US9412365B2 (en)	2014-03-24	2016-08-09	Google Inc.	Enhanced maximum entropy models
US9858922B2 (en)	2014-06-23	2018-01-02	Google Inc.	Caching speech recognition scores
US9953646B2 (en)	2014-09-02	2018-04-24	Belleau Technologies	Method and system for dynamic speech recognition and tracking of prewritten script
US9299347B1 (en) *	2014-10-22	2016-03-29	Google Inc.	Speech recognition using associative mapping
KR102167719B1 (ko)	2014-12-08	2020-10-19	삼성전자주식회사	언어 모델 학습 방법 및 장치, 음성 인식 방법 및 장치
US9535905B2 (en) *	2014-12-12	2017-01-03	International Business Machines Corporation	Statistical process control and analytics for translation supply chain operational management
KR101988222B1 (ko) *	2015-02-12	2019-06-13	한국전자통신연구원	대어휘 연속 음성 인식 장치 및 방법
US10134394B2 (en)	2015-03-20	2018-11-20	Google Llc	Speech recognition using log-linear model
US9786270B2 (en)	2015-07-09	2017-10-10	Google Inc.	Generating acoustic models
KR102494139B1 (ko) *	2015-11-06	2023-01-31	삼성전자주식회사	뉴럴 네트워크 학습 장치 및 방법과, 음성 인식 장치 및 방법
US20170148466A1 (en) *	2015-11-25	2017-05-25	Tim Jackson	Method and system for reducing background sounds in a noisy environment
CN105448303B (zh) *	2015-11-27	2020-02-04	百度在线网络技术（北京）有限公司	语音信号的处理方法和装置
US10229672B1 (en)	2015-12-31	2019-03-12	Google Llc	Training acoustic models using connectionist temporal classification
US9978367B2 (en)	2016-03-16	2018-05-22	Google Llc	Determining dialog states for language models
US20180018973A1 (en)	2016-07-15	2018-01-18	Google Inc.	Speaker verification
US10832664B2 (en)	2016-08-19	2020-11-10	Google Llc	Automated speech recognition using language models that selectively use domain-specific model components
US10311860B2 (en)	2017-02-14	2019-06-04	Google Llc	Language model biasing system
US10706840B2 (en)	2017-08-18	2020-07-07	Google Llc	Encoder-decoder models for sequence to sequence mapping
JP7019096B2 (ja)	2018-08-30	2022-02-14	ドルビー・インターナショナル・アーベー	低ビットレート符号化オーディオの増強を制御する方法及び機器
CN110505332B (zh) *	2019-09-05	2025-05-09	深圳传音控股股份有限公司	一种降噪方法、装置、移动终端及存储介质
CN111210810A (zh) *	2019-12-17	2020-05-29	秒针信息技术有限公司	模型训练方法和装置
EP3862782A1 (de) *	2020-02-04	2021-08-11	Infineon Technologies AG	Vorrichtung und verfahren zur korrektur eines eingangssignals
CN111429930B (zh) *	2020-03-16	2023-02-28	云知声智能科技股份有限公司	一种基于自适应采样率的降噪模型处理方法及系统
CN111863008A (zh) *	2020-07-07	2020-10-30	北京达佳互联信息技术有限公司	一种音频降噪方法、装置及存储介质
CN112614484B (zh) *	2020-11-23	2022-05-20	北京百度网讯科技有限公司	特征信息挖掘方法、装置及电子设备
CN113515556A (zh) *	2021-04-15	2021-10-19	阿里巴巴新加坡控股有限公司	数据处理方法、客户端及电子设备
CN114190953B (zh) *	2021-12-09	2024-07-23	四川新源生物电子科技有限公司	针对脑电采集设备的脑电信号降噪模型的训练方法和系统

Family Cites Families (6)

* Cited by examiner, † Cited by third party
Publication number	Priority date	Publication date	Assignee	Title
DE4309985A1 (de) *	1993-03-29	1994-10-06	Sel Alcatel Ag	Geräuschreduktion zur Spracherkennung
DE4322372A1 (de) *	1993-07-06	1995-01-12	Sel Alcatel Ag	Verfahren und Vorrichtung zur Spracherkennung
US6067517A (en) *	1996-02-02	2000-05-23	International Business Machines Corporation	Transcription of speech data with segments from acoustically dissimilar environments
US6026359A (en) *	1996-09-20	2000-02-15	Nippon Telegraph And Telephone Corporation	Scheme for model adaptation in pattern recognition based on Taylor expansion
US5950157A (en) *	1997-02-28	1999-09-07	Sri International	Method for establishing handset-dependent normalizing models for speaker recognition
US6529872B1 (en) *	2000-04-18	2003-03-04	Matsushita Electric Industrial Co., Ltd.	Method for noise adaptation in automatic speech recognition using transformed matrices

2000
- 2000-10-16 US US09/688,950 patent/US6876966B1/en not_active Expired - Lifetime
2001
- 2001-10-10 AT AT01124141T patent/ATE347161T1/de not_active IP Right Cessation
- 2001-10-10 DE DE60124842T patent/DE60124842T2/de not_active Expired - Lifetime
- 2001-10-10 EP EP01124141A patent/EP1199708B1/de not_active Expired - Lifetime
- 2001-10-16 JP JP2001317824A patent/JP4195211B2/ja not_active Expired - Fee Related

Also Published As

Publication number	Publication date
JP4195211B2 (ja)	2008-12-10
EP1199708B1 (de)	2006-11-29
DE60124842T2 (de)	2007-04-12
EP1199708A3 (de)	2003-10-15
DE60124842D1 (de)	2007-01-11
US6876966B1 (en)	2005-04-05
JP2002140089A (ja)	2002-05-17
EP1199708A2 (de)	2002-04-24

Legal Events

Date	Code	Title	Description
2007-05-15	RER	Ceased as to paragraph 5 lit. 3 law introducing patent treaties

Publication	Publication Date	Title
ATE347161T1 (de)	2006-12-15	Rauschrobuste mustererkennung
DE69823947D1 (de)	2004-06-24	Verfahren, Vorrichtung und Aufzeichnungsmedium zur Erzeugung von Tondaten
DE60139877D1 (de)	2009-10-22	Teileerkennungsdatenerzeugungsverfahren und vorrichtung, anbringvorrichtung für elektronische teile und aufzeichnungsmedium
TW356548B (en)	1999-04-21	Sound identifying device method of sound identification and the game machine using the said device
DE50211921D1 (de)	2008-04-30	Verfahren zum Abspielen von Audiodaten mit einem Unterhaltungsgerät
TW200705387A (en)	2007-02-01	Systems, methods, and apparatus for highband time warping
ATE308098T1 (de)	2005-11-15	Klassifizierung von schallquellen
DE60222739D1 (de)	2007-11-15	Gerät und Verfahren zur Erzeugung von digitalen Signalen, die jeweils einen analogen Signalwert kodieren
DE3769007D1 (de)	1991-05-08	Verfahren und vorrichtung zur aufzeichnung und/oder wiedergabe eines bildsignals und eines zugehoerigen tonsignals auf einen aufzeichnungstraeger und aufzeichungstraeger, gewonnen mittels dieses verfahrens.
DE69807807D1 (de)	2002-10-17	Verfahren und vorrichtung zur übertragung von inhaltsinformation und darauf bezogener zusatzinformation
DE60128270D1 (de)	2007-06-14	Verfahren und System zur Erzeugung von Sprechererkennungsdaten, und Verfahren und System zur Sprechererkennung
ATE412941T1 (de)	2008-11-15	Speicherschnittstellenprotokoll zur unterscheidung von statusinformationen von lesedaten
DE60130223D1 (de)	2007-10-11	Signalverarbeitung
DE69800320D1 (de)	2000-10-26	Verfahren und Vorrichtung zur Sprechererkennung durch Prüfung von mündlicher Information mittels Zwangsdekodierung
DE60138696D1 (de)	2009-06-25	Verfahren und system zum speichern eines codierungsmusters
ATE319160T1 (de)	2006-03-15	Verfahren zur rauschrobusten klassifikation in der sprachkodierung
DE60235211D1 (de)	2010-03-11	Verfahren zum Vorlöschen von Rauschen eines Bildes.
ATE487212T1 (de)	2010-11-15	Verstekte bedingte zufallfeldermodelle für phonetische klassifizierung und spracherkennung
DE60227308D1 (de)	2008-08-14	System, Verfahren und Vorrichtung zur Bestimmung der Grenze eines Informationselements
ATE286334T1 (de)	2005-01-15	Vorrichtung zur klassifikation von komplexen signalen mit linearer digitaler modulation
DE60140595D1 (de)	2010-01-07	Verfahren zur Geräuschunterdrückung
ATE381915T1 (de)	2008-01-15	Audioinformationsübertragungsvorrichtung und zugehöriges verfahren
DE69728469D1 (de)	2004-05-13	Gerät und Verfahren zur Ermitlung der Zeichenlinie mittels vereinfachter Projektionsinformation; Zeichenerkennungsgerät und Verfahren
DE69202990D1 (de)	1995-07-27	Verfahren zur Erkennung eines Störsignal für einen digitalen Datendemodulator, und Einrichtung zur Durchführung eines solchen Verfahrens.
ATE420526T1 (de)	2009-01-15	Verfahren und vorrichtung zur rauschverminderung in einem schallsignal