ATE267443T1 - Vorrichtung zur sprachdetektion bei umgebungsgeräuschen - Google Patents

Vorrichtung zur sprachdetektion bei umgebungsgeräuschen

Info

Publication number
ATE267443T1
ATE267443T1 AT99301823T AT99301823T ATE267443T1 AT E267443 T1 ATE267443 T1 AT E267443T1 AT 99301823 T AT99301823 T AT 99301823T AT 99301823 T AT99301823 T AT 99301823T AT E267443 T1 ATE267443 T1 AT E267443T1
Authority
AT
Austria
Prior art keywords
speech
input signal
signal
term
frequency band
Prior art date
Application number
AT99301823T
Other languages
English (en)
Inventor
Yi Zhao
Jean-Claude Junqua
Original Assignee
Matsushita Electric Industrial Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Matsushita Electric Industrial Co Ltd filed Critical Matsushita Electric Industrial Co Ltd
Application granted granted Critical
Publication of ATE267443T1 publication Critical patent/ATE267443T1/de

Links

Classifications

    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L25/00Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
    • G10L25/78Detection of presence or absence of voice signals
    • G10L25/84Detection of presence or absence of voice signals for discriminating voice from noise
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L25/00Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
    • G10L25/78Detection of presence or absence of voice signals
    • G10L25/87Detection of discrete points within a voice signal

Landscapes

  • Engineering & Computer Science (AREA)
  • Computational Linguistics (AREA)
  • Signal Processing (AREA)
  • Health & Medical Sciences (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Human Computer Interaction (AREA)
  • Physics & Mathematics (AREA)
  • Acoustics & Sound (AREA)
  • Multimedia (AREA)
  • Telephonic Communication Services (AREA)
  • Telephone Function (AREA)
  • Mobile Radio Communication Systems (AREA)
  • Time-Division Multiplex Systems (AREA)
  • Image Analysis (AREA)
AT99301823T 1998-03-24 1999-03-11 Vorrichtung zur sprachdetektion bei umgebungsgeräuschen ATE267443T1 (de)

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
US09/047,276 US6480823B1 (en) 1998-03-24 1998-03-24 Speech detection for noisy conditions

Publications (1)

Publication Number Publication Date
ATE267443T1 true ATE267443T1 (de) 2004-06-15

Family

ID=21948048

Family Applications (1)

Application Number Title Priority Date Filing Date
AT99301823T ATE267443T1 (de) 1998-03-24 1999-03-11 Vorrichtung zur sprachdetektion bei umgebungsgeräuschen

Country Status (9)

Country Link
US (1) US6480823B1 (de)
EP (1) EP0945854B1 (de)
JP (1) JPH11327582A (de)
KR (1) KR100330478B1 (de)
CN (1) CN1113306C (de)
AT (1) ATE267443T1 (de)
DE (1) DE69917361T2 (de)
ES (1) ES2221312T3 (de)
TW (1) TW436759B (de)

Families Citing this family (84)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US6873953B1 (en) * 2000-05-22 2005-03-29 Nuance Communications Prosody based endpoint detection
US6640208B1 (en) * 2000-09-12 2003-10-28 Motorola, Inc. Voiced/unvoiced speech classifier
US6754623B2 (en) * 2001-01-31 2004-06-22 International Business Machines Corporation Methods and apparatus for ambient noise removal in speech recognition
US7277853B1 (en) * 2001-03-02 2007-10-02 Mindspeed Technologies, Inc. System and method for a endpoint detection of speech for improved speech recognition in noisy environments
US20020147585A1 (en) * 2001-04-06 2002-10-10 Poulsen Steven P. Voice activity detection
EP1391106B1 (de) 2001-04-30 2014-02-26 Polycom, Inc. Audiokonferenzplattform mit dynamischer sprachdetektionsschwelle
US6782363B2 (en) * 2001-05-04 2004-08-24 Lucent Technologies Inc. Method and apparatus for performing real-time endpoint detection in automatic speech recognition
US7289626B2 (en) * 2001-05-07 2007-10-30 Siemens Communications, Inc. Enhancement of sound quality for computer telephony systems
US7236929B2 (en) * 2001-05-09 2007-06-26 Plantronics, Inc. Echo suppression and speech detection techniques for telephony applications
US7277585B2 (en) * 2001-05-25 2007-10-02 Ricoh Company, Ltd. Image encoding method, image encoding apparatus and storage medium
JP2003087547A (ja) * 2001-09-12 2003-03-20 Ricoh Co Ltd 画像処理装置
US6901363B2 (en) * 2001-10-18 2005-05-31 Siemens Corporate Research, Inc. Method of denoising signal mixtures
US7299173B2 (en) 2002-01-30 2007-11-20 Motorola Inc. Method and apparatus for speech detection using time-frequency variance
DE10251113A1 (de) * 2002-11-02 2004-05-19 Philips Intellectual Property & Standards Gmbh Verfahren zum Betrieb eines Spracherkennungssystems
CN1830025A (zh) * 2003-08-01 2006-09-06 皇家飞利浦电子股份有限公司 驱动对话系统的方法
JP4587160B2 (ja) * 2004-03-26 2010-11-24 キヤノン株式会社 信号処理装置および方法
US7278092B2 (en) * 2004-04-28 2007-10-02 Amplify, Llc System, method and apparatus for selecting, displaying, managing, tracking and transferring access to content of web pages and other sources
JP4483468B2 (ja) * 2004-08-02 2010-06-16 ソニー株式会社 ノイズ低減回路、電子機器、ノイズ低減方法
US7457747B2 (en) * 2004-08-23 2008-11-25 Nokia Corporation Noise detection for audio encoding by mean and variance energy ratio
US7545435B2 (en) * 2004-10-15 2009-06-09 Lifesize Communications, Inc. Automatic backlight compensation and exposure control
US8149739B2 (en) * 2004-10-15 2012-04-03 Lifesize Communications, Inc. Background call validation
US20060106929A1 (en) * 2004-10-15 2006-05-18 Kenoyer Michael L Network conference communications
US7692683B2 (en) * 2004-10-15 2010-04-06 Lifesize Communications, Inc. Video conferencing system transcoder
KR100677396B1 (ko) * 2004-11-20 2007-02-02 엘지전자 주식회사 음성인식장치의 음성구간 검출방법
US7590529B2 (en) * 2005-02-04 2009-09-15 Microsoft Corporation Method and apparatus for reducing noise corruption from an alternative sensor signal during multi-sensory speech enhancement
US20060241937A1 (en) * 2005-04-21 2006-10-26 Ma Changxue C Method and apparatus for automatically discriminating information bearing audio segments and background noise audio segments
US20060248210A1 (en) * 2005-05-02 2006-11-02 Lifesize Communications, Inc. Controlling video display mode in a video conferencing system
US8170875B2 (en) 2005-06-15 2012-05-01 Qnx Software Systems Limited Speech end-pointer
US7664635B2 (en) * 2005-09-08 2010-02-16 Gables Engineering, Inc. Adaptive voice detection method and system
GB0519051D0 (en) * 2005-09-19 2005-10-26 Nokia Corp Search algorithm
US20070100611A1 (en) * 2005-10-27 2007-05-03 Intel Corporation Speech codec apparatus with spike reduction
KR100800873B1 (ko) * 2005-10-28 2008-02-04 삼성전자주식회사 음성 신호 검출 시스템 및 방법
KR100717401B1 (ko) * 2006-03-02 2007-05-11 삼성전자주식회사 역방향 누적 히스토그램을 이용한 음성 특징 벡터의 정규화방법 및 그 장치
CN101320559B (zh) * 2007-06-07 2011-05-18 华为技术有限公司 一种声音激活检测装置及方法
US8319814B2 (en) 2007-06-22 2012-11-27 Lifesize Communications, Inc. Video conferencing system which allows endpoints to perform continuous presence layout selection
US8139100B2 (en) 2007-07-13 2012-03-20 Lifesize Communications, Inc. Virtual multiway scaler compensation
CN101393744B (zh) * 2007-09-19 2011-09-14 华为技术有限公司 调整声音激活检测门限值的方法及装置
US9661267B2 (en) * 2007-09-20 2017-05-23 Lifesize, Inc. Videoconferencing system discovery
KR101437830B1 (ko) * 2007-11-13 2014-11-03 삼성전자주식회사 음성 구간 검출 방법 및 장치
KR20110023878A (ko) * 2008-06-09 2011-03-08 코닌클리케 필립스 일렉트로닉스 엔.브이. 오디오/비주얼 데이터 스트림의 요약을 생성하기 위한 방법 및 장치
CN101625857B (zh) * 2008-07-10 2012-05-09 新奥特(北京)视频技术有限公司 一种自适应的语音端点检测方法
US8514265B2 (en) 2008-10-02 2013-08-20 Lifesize Communications, Inc. Systems and methods for selecting videoconferencing endpoints for display in a composite video image
US20100110160A1 (en) * 2008-10-30 2010-05-06 Brandt Matthew K Videoconferencing Community with Live Images
WO2010048999A1 (en) * 2008-10-30 2010-05-06 Telefonaktiebolaget Lm Ericsson (Publ) Telephony content signal discrimination
SG174207A1 (en) * 2009-03-03 2011-10-28 Agency Science Tech & Res Methods for determining whether a signal includes a wanted signal and apparatuses configured to determine whether a signal includes a wanted signal
US8456510B2 (en) * 2009-03-04 2013-06-04 Lifesize Communications, Inc. Virtual distributed multipoint control unit
US8643695B2 (en) * 2009-03-04 2014-02-04 Lifesize Communications, Inc. Videoconferencing endpoint extension
JP5772591B2 (ja) * 2009-03-18 2015-09-02 日本電気株式会社 音声信号処理装置
US8305421B2 (en) * 2009-06-29 2012-11-06 Lifesize Communications, Inc. Automatic determination of a configuration for a conference
ES2371619B1 (es) * 2009-10-08 2012-08-08 Telefónica, S.A. Procedimiento de detección de segmentos de voz.
CN102044243B (zh) * 2009-10-15 2012-08-29 华为技术有限公司 语音激活检测方法与装置、编码器
US8350891B2 (en) * 2009-11-16 2013-01-08 Lifesize Communications, Inc. Determining a videoconference layout based on numbers of participants
CN102201231B (zh) * 2010-03-23 2012-10-24 创杰科技股份有限公司 语音侦测方法
JP2012058358A (ja) * 2010-09-07 2012-03-22 Sony Corp 雑音抑圧装置、雑音抑圧方法およびプログラム
JP5949550B2 (ja) * 2010-09-17 2016-07-06 日本電気株式会社 音声認識装置、音声認識方法、及びプログラム
ES2740173T3 (es) 2010-12-24 2020-02-05 Huawei Tech Co Ltd Un método y un aparato para realizar una detección de actividad de voz
WO2012083555A1 (en) 2010-12-24 2012-06-28 Huawei Technologies Co., Ltd. Method and apparatus for adaptively detecting voice activity in input audio signal
US9280982B1 (en) * 2011-03-29 2016-03-08 Google Technology Holdings LLC Nonstationary noise estimator (NNSE)
CN102800322B (zh) * 2011-05-27 2014-03-26 中国科学院声学研究所 一种噪声功率谱估计与语音活动性检测方法
US9280984B2 (en) * 2012-05-14 2016-03-08 Htc Corporation Noise cancellation method
CN103455021B (zh) * 2012-05-31 2016-08-24 科域半导体有限公司 改变检测系统和方法
CN103730110B (zh) * 2012-10-10 2017-03-01 北京百度网讯科技有限公司 一种检测语音端点的方法和装置
CN103839544B (zh) * 2012-11-27 2016-09-07 展讯通信(上海)有限公司 语音激活检测方法和装置
US9190061B1 (en) * 2013-03-15 2015-11-17 Google Inc. Visual speech detection using facial landmarks
CN103413554B (zh) * 2013-08-27 2016-02-03 广州顶毅电子有限公司 Dsp延时调整的去噪方法及装置
JP6045511B2 (ja) * 2014-01-08 2016-12-14 Psソリューションズ株式会社 音響信号検出システム、音響信号検出方法、音響信号検出サーバー、音響信号検出装置、及び音響信号検出プログラム
US9330684B1 (en) * 2015-03-27 2016-05-03 Continental Automotive Systems, Inc. Real-time wind buffet noise detection
WO2016188593A1 (en) * 2015-05-26 2016-12-01 Katholieke Universiteit Leuven Speech recognition system and method using an adaptive incremental learning approach
US9596502B1 (en) 2015-12-21 2017-03-14 Max Abecassis Integration of multiple synchronization methodologies
US9516373B1 (en) 2015-12-21 2016-12-06 Max Abecassis Presets of synchronized second screen functions
CN106887241A (zh) 2016-10-12 2017-06-23 阿里巴巴集团控股有限公司 一种语音信号检测方法与装置
KR102517939B1 (ko) * 2017-01-04 2023-04-04 하만 베커 오토모티브 시스템즈 게엠베하 원거리 장 사운드 캡처링
WO2019061055A1 (zh) * 2017-09-27 2019-04-04 深圳传音通讯有限公司 电子设备的测试方法及系统
CN109767774A (zh) 2017-11-08 2019-05-17 阿里巴巴集团控股有限公司 一种交互方法和设备
US10948581B2 (en) 2018-05-30 2021-03-16 Richwave Technology Corp. Methods and apparatus for detecting presence of an object in an environment
US10928502B2 (en) * 2018-05-30 2021-02-23 Richwave Technology Corp. Methods and apparatus for detecting presence of an object in an environment
CN108962249B (zh) * 2018-08-21 2023-03-31 广州市保伦电子有限公司 一种基于mfcc语音特征的语音匹配方法及存储介质
CN109065043B (zh) * 2018-08-21 2022-07-05 广州市保伦电子有限公司 一种命令词识别方法及计算机存储介质
US11380321B2 (en) * 2019-08-01 2022-07-05 Semiconductor Components Industries, Llc Methods and apparatus for a voice detector
CN112687273B (zh) * 2020-12-26 2024-04-16 科大讯飞股份有限公司 一种语音转写方法及装置
CN113345472B (zh) * 2021-05-08 2022-03-25 北京百度网讯科技有限公司 语音端点检测方法、装置、电子设备及存储介质
JP2024543155A (ja) * 2021-11-25 2024-11-19 華為技術有限公司 スピーチ認識方法、スピーチ認識機器及びシステム
CN115376513B (zh) * 2022-10-19 2023-05-12 广州小鹏汽车科技有限公司 语音交互方法、服务器及计算机可读存储介质
CN119375698A (zh) * 2024-12-27 2025-01-28 中建科工集团有限公司 基于音频侦测的充电桩继电器状态检测方法、装置及设备

Family Cites Families (28)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US3909532A (en) * 1974-03-29 1975-09-30 Bell Telephone Labor Inc Apparatus and method for determining the beginning and the end of a speech utterance
US4032711A (en) 1975-12-31 1977-06-28 Bell Telephone Laboratories, Incorporated Speaker recognition arrangement
US4052568A (en) * 1976-04-23 1977-10-04 Communications Satellite Corporation Digital voice switch
JPS56104399A (en) 1980-01-23 1981-08-20 Hitachi Ltd Voice interval detection system
US4357491A (en) * 1980-09-16 1982-11-02 Northern Telecom Limited Method of and apparatus for detecting speech in a voice channel signal
USRE32172E (en) 1980-12-19 1986-06-03 At&T Bell Laboratories Endpoint detector
FR2502370A1 (fr) 1981-03-18 1982-09-24 Trt Telecom Radio Electr Dispositif de reduction du bruit dans un signal de parole mele de bruit
US4410763A (en) 1981-06-09 1983-10-18 Northern Telecom Limited Speech detector
US4531228A (en) 1981-10-20 1985-07-23 Nissan Motor Company, Limited Speech recognition system for an automotive vehicle
JPS5876899A (ja) * 1981-10-31 1983-05-10 株式会社東芝 音声区間検出装置
FR2535854A1 (fr) 1982-11-10 1984-05-11 Cit Alcatel Procede et dispositif d'evaluation du niveau de bruit sur une voie telephonique
JPS59139099A (ja) 1983-01-31 1984-08-09 株式会社東芝 音声区間検出装置
US4627091A (en) 1983-04-01 1986-12-02 Rca Corporation Low-energy-content voice detection apparatus
JPS603700A (ja) 1983-06-22 1985-01-10 日本電気株式会社 音声検出方式
AU584904B2 (en) * 1984-06-08 1989-06-08 Plessey Australia Pty. Limited Adaptive speech detector system
US4630304A (en) * 1985-07-01 1986-12-16 Motorola, Inc. Automatic background noise estimator for a noise suppression system
US4815136A (en) 1986-11-06 1989-03-21 American Telephone And Telegraph Company Voiceband signal classification
JPH01169499A (ja) 1987-12-24 1989-07-04 Fujitsu Ltd 単語音声区間切出し方式
US5222147A (en) 1989-04-13 1993-06-22 Kabushiki Kaisha Toshiba Speech recognition LSI system including recording/reproduction device
AU633673B2 (en) * 1990-01-18 1993-02-04 Matsushita Electric Industrial Co., Ltd. Signal processing device
US5313531A (en) * 1990-11-05 1994-05-17 International Business Machines Corporation Method and apparatus for speech analysis and speech recognition
US5305422A (en) * 1992-02-28 1994-04-19 Panasonic Technologies, Inc. Method for determining boundaries of isolated words within a speech signal
US5323337A (en) 1992-08-04 1994-06-21 Loral Aerospace Corp. Signal detector employing mean energy and variance of energy content comparison for noise detection
US5579431A (en) * 1992-10-05 1996-11-26 Panasonic Technologies, Inc. Speech detection in presence of noise by determining variance over time of frequency band limited energy
US5617508A (en) * 1992-10-05 1997-04-01 Panasonic Technologies Inc. Speech detection device for the detection of speech end points based on variance of frequency band limited energy
US5479560A (en) * 1992-10-30 1995-12-26 Technology Research Association Of Medical And Welfare Apparatus Formant detecting device and speech processing apparatus
US5459814A (en) * 1993-03-26 1995-10-17 Hughes Aircraft Company Voice activity detector for speech signals in variable background noise
US6266633B1 (en) * 1998-12-22 2001-07-24 Itt Manufacturing Enterprises Noise suppression and channel equalization preprocessor for speech and speaker recognizers: method and apparatus

Also Published As

Publication number Publication date
TW436759B (en) 2001-05-28
CN1242553A (zh) 2000-01-26
ES2221312T3 (es) 2004-12-16
DE69917361D1 (de) 2004-06-24
CN1113306C (zh) 2003-07-02
JPH11327582A (ja) 1999-11-26
KR100330478B1 (ko) 2002-04-01
EP0945854B1 (de) 2004-05-19
KR19990077910A (ko) 1999-10-25
EP0945854A3 (de) 1999-12-29
EP0945854A2 (de) 1999-09-29
DE69917361T2 (de) 2005-06-02
US6480823B1 (en) 2002-11-12

Similar Documents

Publication Publication Date Title
ATE267443T1 (de) Vorrichtung zur sprachdetektion bei umgebungsgeräuschen
ATE427546T1 (de) Sprachubertragungssystem und verfahren zur behandlung verlorener datenrahmen
ATE302991T1 (de) Verfahren zur signalgesteuerten schaltung zwischen verschiedenen audiokodierungssystemen
ATE214832T1 (de) Verfahren und vorrichtung zur sprachverbesserung in einem sprachübertragungssystem
EP1791115A3 (de) Klassiefizierungsbasierte Verschleierung von Rahmenausfall für Audiosignale
TW356548B (en) Sound identifying device method of sound identification and the game machine using the said device
WO1998001956A3 (en) Microphone noise rejection system
DE59914782D1 (de) Verfahren zur Störbefreiung eines Mikrophonsignals
ATE413751T1 (de) Verfahren und vorrichtung zur zweistufigen paketklassifikation unter verwendung einer spezifischen filteranpassung und gemeinsamen benutzung auf transportebene
ATE347161T1 (de) Rauschrobuste mustererkennung
ATE319160T1 (de) Verfahren zur rauschrobusten klassifikation in der sprachkodierung
DE60325881D1 (de) Verfahren zum betreiben eines spracherkennungssystemes
KR890013951A (ko) 보청기 신호처리 시스템
KR930020862A (ko) 잡음억압장치
ATE287153T1 (de) Verfahren und vorrichtung zur bereitstellung einem verbesserten bereitschaftsbetrieb für infrarotsenderempfängern
DE60331475D1 (de) Verfahren und vorrichtung zur analyse von audiosignalen
EP1047047A3 (de) Verfahren und Vorrichtung zur Kodierung und Dekodierung von Audiosignalen und Aufzeichnungsträger mit Programmen dafür
DE602005017884D1 (de) Verfahren und Vorrichtung zur Sprachgeschwindigkeitsumwandlung
EP0780828A3 (de) Verfahren und System für die Spracherkennung
EP1160769A3 (de) Verfahren und Vorrichtung zur Darstellung der Verdeckungsschwelle in perzeptuellen Audiokodierern
DE60116559D1 (de) Verbessertes Verfahren zur Ermittlung der Qualität eines Sprachsignals
Chandra et al. Usable speech detection using the modified spectral autocorrelation peak to valley ratio using the LPC residual
DE60224142D1 (de) Verfahren und Vorrichtung zur Wellenformdämpfung von fehlerbehafteten Sprachrahmen
ATE420526T1 (de) Verfahren und vorrichtung zur rauschverminderung in einem schallsignal
DE60128245D1 (de) Verfahren und Vorrichtung zur Durchführung einer adaptiven Vorverzerrung

Legal Events

Date Code Title Description
RER Ceased as to paragraph 5 lit. 3 law introducing patent treaties