ATE282235T1 - Robuste merkmale für die erkennung von verrauschten sprachsignalen - Google Patents

Robuste merkmale für die erkennung von verrauschten sprachsignalen

Info

Publication number: ATE282235T1
Authority: AT; Austria
Prior art keywords: vectors; voice signals; speech; robust features; detecting noise
Prior art date: 2000-05-04

Application number

AT01925226T

Other languages

English (en)

Inventor

Stephane Dupont

Original Assignee

Faculte Polytechnique De Mons

Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)

2000-05-04

Filing date

2001-04-25

Publication date

2004-11-15

2001-04-25 Application filed by Faculte Polytechnique De Mons filed Critical Faculte Polytechnique De Mons

2004-11-15 Application granted granted Critical

2004-11-15 Publication of ATE282235T1 publication Critical patent/ATE282235T1/de

Links

239000013598 vector Substances 0.000 abstract 3

Classifications

- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L21/00—Speech or voice signal processing techniques to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
- G10L21/02—Speech enhancement, e.g. noise reduction or echo cancellation
- G10L21/0208—Noise filtering
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/08—Speech classification or search
- G10L15/16—Speech classification or search using artificial neural networks
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L19/00—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
- G10L19/02—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using spectral analysis, e.g. transform vocoders or subband vocoders
- G10L19/0212—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using spectral analysis, e.g. transform vocoders or subband vocoders using orthogonal transformation

Landscapes

Engineering & Computer Science (AREA)
Multimedia (AREA)
Acoustics & Sound (AREA)
Computational Linguistics (AREA)
Health & Medical Sciences (AREA)
Audiology, Speech & Language Pathology (AREA)
Human Computer Interaction (AREA)
Physics & Mathematics (AREA)
Quality & Reliability (AREA)
Signal Processing (AREA)
Artificial Intelligence (AREA)
Evolutionary Computation (AREA)
Machine Translation (AREA)
Measurement Of Mechanical Vibrations Or Ultrasonic Waves (AREA)
Noise Elimination (AREA)

AT01925226T 2000-05-04 2001-04-25 Robuste merkmale für die erkennung von verrauschten sprachsignalen ATE282235T1 (de)

Applications Claiming Priority (2)

Application Number	Priority Date	Filing Date	Title
EP00870094A EP1152399A1 (de)	2000-05-04	2000-05-04	Teilband-Sprachverarbeitung mit neuronalen Netzwerken
PCT/BE2001/000072 WO2001084537A1 (fr)	2000-05-04	2001-04-25	Parametres robustes pour la reconnaissance de parole bruitee

Publications (1)

Publication Number	Publication Date
ATE282235T1 true ATE282235T1 (de)	2004-11-15

Family

ID=8175744

Family Applications (1)

Application Number	Title	Priority Date	Filing Date
AT01925226T ATE282235T1 (de)	2000-05-04	2001-04-25	Robuste merkmale für die erkennung von verrauschten sprachsignalen

Country Status (8)

Country	Link
US (1)	US7212965B2 (de)
EP (2)	EP1152399A1 (de)
JP (1)	JP2003532162A (de)
AT (1)	ATE282235T1 (de)
AU (1)	AU776919B2 (de)
CA (1)	CA2404441C (de)
DE (1)	DE60107072T2 (de)
WO (1)	WO2001084537A1 (de)

Families Citing this family (16)

* Cited by examiner, † Cited by third party
Publication number	Priority date	Publication date	Assignee	Title
EP1416472A1 (de) *	2002-10-30	2004-05-06	Swisscom AG	Bandbreitenabhängiges Spracherkennungssystem
US7620546B2 (en) *	2004-03-23	2009-11-17	Qnx Software Systems (Wavemakers), Inc.	Isolating speech signals utilizing neural networks
US20060206320A1 (en) *	2005-03-14	2006-09-14	Li Qi P	Apparatus and method for noise reduction and speech enhancement with microphones and loudspeakers
US20070239444A1 (en) *	2006-03-29	2007-10-11	Motorola, Inc.	Voice signal perturbation for speech recognition
US8386125B2 (en) *	2006-11-22	2013-02-26	General Motors Llc	Adaptive communication between a vehicle telematics unit and a call center based on acoustic conditions
CN101996628A (zh) *	2009-08-21	2011-03-30	索尼株式会社	提取语音信号的韵律特征的方法和装置
US8972256B2 (en) *	2011-10-17	2015-03-03	Nuance Communications, Inc.	System and method for dynamic noise adaptation for robust automatic speech recognition
US9418674B2 (en) *	2012-01-17	2016-08-16	GM Global Technology Operations LLC	Method and system for using vehicle sound information to enhance audio prompting
US9934780B2 (en)	2012-01-17	2018-04-03	GM Global Technology Operations LLC	Method and system for using sound related vehicle information to enhance spoken dialogue by modifying dialogue's prompt pitch
US9263040B2 (en)	2012-01-17	2016-02-16	GM Global Technology Operations LLC	Method and system for using sound related vehicle information to enhance speech recognition
US8571871B1 (en) *	2012-10-02	2013-10-29	Google Inc.	Methods and systems for adaptation of synthetic speech in an environment
US9280968B2 (en)	2013-10-04	2016-03-08	At&T Intellectual Property I, L.P.	System and method of using neural transforms of robust audio features for speech processing
US10720165B2 (en) *	2017-01-23	2020-07-21	Qualcomm Incorporated	Keyword voice authentication
US10283140B1 (en) *	2018-01-12	2019-05-07	Alibaba Group Holding Limited	Enhancing audio signals using sub-band deep neural networks
US10997967B2 (en) *	2019-04-18	2021-05-04	Honeywell International Inc.	Methods and systems for cockpit speech recognition acoustic model training with multi-level corpus data augmentation
CN110047468B (zh) *	2019-05-20	2022-01-25	北京达佳互联信息技术有限公司	语音识别方法、装置及存储介质

Family Cites Families (12)

* Cited by examiner, † Cited by third party
Publication number	Priority date	Publication date	Assignee	Title
JP2776848B2 (ja) *	1988-12-14	1998-07-16	株式会社日立製作所	雑音除去方法、それに用いるニューラルネットワークの学習方法
JP3084721B2 (ja) *	1990-02-23	2000-09-04	ソニー株式会社	雑音除去回路
JPH0566795A (ja) *	1991-09-06	1993-03-19	Gijutsu Kenkyu Kumiai Iryo Fukushi Kiki Kenkyusho	雑音抑圧装置とその調整装置
US5381512A (en) *	1992-06-24	1995-01-10	Moscom Corporation	Method and apparatus for speech feature recognition based on models of auditory signal processing
US6070140A (en) *	1995-06-05	2000-05-30	Tran; Bao Q.	Speech recognizer
US5806025A (en) *	1996-08-07	1998-09-08	U S West, Inc.	Method and system for adaptive filtering of speech signals using signal-to-noise ratio to choose subband filter bank
US5963899A (en) *	1996-08-07	1999-10-05	U S West, Inc.	Method and system for region based filtering of speech
US6035048A (en) *	1997-06-18	2000-03-07	Lucent Technologies Inc.	Method and apparatus for reducing noise in speech and audio signals
FR2765715B1 (fr) *	1997-07-04	1999-09-17	Sextant Avionique	Procede de recherche d'un modele de bruit dans des signaux sonores bruites
US6230122B1 (en) *	1998-09-09	2001-05-08	Sony Corporation	Speech detection with noise suppression based on principal components analysis
US6173258B1 (en) *	1998-09-09	2001-01-09	Sony Corporation	Method for reducing noise distortions in a speech recognition system
US6347297B1 (en) *	1998-10-05	2002-02-12	Legerity, Inc.	Matrix quantization with vector quantization error compensation and neural network postprocessing for robust speech recognition

2000
- 2000-05-04 EP EP00870094A patent/EP1152399A1/de not_active Withdrawn
2001
- 2001-04-25 EP EP01925226A patent/EP1279166B1/de not_active Expired - Lifetime
- 2001-04-25 AT AT01925226T patent/ATE282235T1/de not_active IP Right Cessation
- 2001-04-25 DE DE60107072T patent/DE60107072T2/de not_active Expired - Lifetime
- 2001-04-25 AU AU52051/01A patent/AU776919B2/en not_active Ceased
- 2001-04-25 US US10/275,451 patent/US7212965B2/en not_active Expired - Fee Related
- 2001-04-25 WO PCT/BE2001/000072 patent/WO2001084537A1/fr not_active Ceased
- 2001-04-25 CA CA002404441A patent/CA2404441C/fr not_active Expired - Fee Related
- 2001-04-25 JP JP2001581270A patent/JP2003532162A/ja active Pending

Also Published As

Publication number	Publication date
CA2404441A1 (fr)	2001-11-08
US7212965B2 (en)	2007-05-01
DE60107072T2 (de)	2005-10-27
EP1279166A1 (de)	2003-01-29
DE60107072D1 (de)	2004-12-16
WO2001084537A1 (fr)	2001-11-08
AU5205101A (en)	2001-11-12
EP1152399A1 (de)	2001-11-07
EP1279166B1 (de)	2004-11-10
US20030182114A1 (en)	2003-09-25
AU776919B2 (en)	2004-09-23
CA2404441C (fr)	2009-07-14
JP2003532162A (ja)	2003-10-28

Legal Events

Date	Code	Title	Description
2005-05-15	RER	Ceased as to paragraph 5 lit. 3 law introducing patent treaties

Publication	Publication Date	Title
ATE282235T1 (de)	2004-11-15	Robuste merkmale für die erkennung von verrauschten sprachsignalen
ATE352836T1 (de)	2007-02-15	Detektion von emotionen in sprachsignalen mittels analyse einer vielzahl von sprachsignalparametern
DK215690A (da)	1990-09-07	Taleaktivitetsdetektor
EP0788091A3 (de)	1999-02-24	Verfahren und Vorrichtung zur Sprachkodierung und -dekodierung
IL154397A0 (en)	2003-09-17	Voice enhancement system
WO2002052542A3 (de)	2002-11-07	Verfahren und anordnung zur bestimmung eines geräuschsignals einer geräuschquelle
EP1067800A4 (de)	2005-07-27	Verfahren zur signalverarbeitung und vorrichtung zur verarbeitung von bild/ton
EP0785419A3 (de)	1998-09-02	Sprachaktivitätserkennung
WO1998034216A3 (en)	2001-12-20	System and method for detecting a recorded voice
EP1168306A3 (de)	2002-10-02	Verfahren und Vorrichtung zur Verbesserung von der Verständlichkeit eines digital komprimierten Sprachsignals
IL125649A0 (en)	1999-04-11	A method and device for recognizing a sampled sound signal in noise
WO1994022131A3 (en)	1995-01-12	Speech recognition with pause detection
EP1113417A3 (de)	2001-12-05	Vorrichtung, Verfahren und Aufzeichnungsmedium zur Sprachsynthese
EP1861846A4 (de)	2010-06-23	Adaptive stimmenmodus-erweiterung für einen stimmenaktivitäts-detektor
CA2228948A1 (en)	1997-03-06	Pattern recognition
DE60325881D1 (de)	2009-03-05	Verfahren zum betreiben eines spracherkennungssystemes
WO1996028808A3 (de)	1996-10-24	Verfahren zur erkennung einer signalpause zwischen zwei mustern, welche in einem zeitvarianten mess-signal vorhanden sind
CA2162407A1 (en)	1996-05-11	A robust pitch estimation method and device for telephone speech
EP1093112A3 (de)	2002-02-06	Verfahren zur Erzeugung von Sprachmerkmalsignalen und Vorrichtung zu seiner Durchführung
EP1096475A3 (de)	2001-09-12	Verziehung der Frequenzen für Spracherkennung
CA2076606A1 (en)	1993-04-26	Method for detecting voice presence on a communication line
DE69012446D1 (de)	1994-10-20	Detektor für Niederfrequenz-Wechselstromsignale für eine Telefonverbindungsleitung.
ATE282879T1 (de)	2004-12-15	Signalverarbeitungsverfahren zur analyse von sprachsignal-transienten
EP1489597A8 (de)	2005-04-06	Vorrichtung zur Sprachdetektion
JP2564821B2 (ja)	1996-12-18	音声判定検出装置