ATE275750T1 - Detektion von reiner sprache in einem audio signal, mit hilfe einer detektionsgrösse (valley percentage) - Google Patents

Detektion von reiner sprache in einem audio signal, mit hilfe einer detektionsgrösse (valley percentage)

Info

Publication number
ATE275750T1
ATE275750T1 AT99968458T AT99968458T ATE275750T1 AT E275750 T1 ATE275750 T1 AT E275750T1 AT 99968458 T AT99968458 T AT 99968458T AT 99968458 T AT99968458 T AT 99968458T AT E275750 T1 ATE275750 T1 AT E275750T1
Authority
AT
Austria
Prior art keywords
speech
audio signal
pure
detection
valley
Prior art date
Application number
AT99968458T
Other languages
English (en)
Inventor
Chuang Gu
Ming-Chieh Lee
Wei-Ge Chen
Original Assignee
Microsoft Corp
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Microsoft Corp filed Critical Microsoft Corp
Application granted granted Critical
Publication of ATE275750T1 publication Critical patent/ATE275750T1/de

Links

Classifications

    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L25/00Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
    • G10L25/78Detection of presence or absence of voice signals

Landscapes

  • Engineering & Computer Science (AREA)
  • Computational Linguistics (AREA)
  • Signal Processing (AREA)
  • Health & Medical Sciences (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Human Computer Interaction (AREA)
  • Physics & Mathematics (AREA)
  • Acoustics & Sound (AREA)
  • Multimedia (AREA)
  • Compression, Expansion, Code Conversion, And Decoders (AREA)
  • Monitoring And Testing Of Exchanges (AREA)
  • Machine Translation (AREA)
AT99968458T 1998-11-30 1999-11-30 Detektion von reiner sprache in einem audio signal, mit hilfe einer detektionsgrösse (valley percentage) ATE275750T1 (de)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
US09/201,705 US6205422B1 (en) 1998-11-30 1998-11-30 Morphological pure speech detection using valley percentage
PCT/US1999/028401 WO2000033294A1 (en) 1998-11-30 1999-11-30 Pure speech detection using valley percentage

Publications (1)

Publication Number Publication Date
ATE275750T1 true ATE275750T1 (de) 2004-09-15

Family

ID=22746956

Family Applications (1)

Application Number Title Priority Date Filing Date
AT99968458T ATE275750T1 (de) 1998-11-30 1999-11-30 Detektion von reiner sprache in einem audio signal, mit hilfe einer detektionsgrösse (valley percentage)

Country Status (6)

Country Link
US (1) US6205422B1 (de)
EP (1) EP1141938B1 (de)
JP (1) JP4652575B2 (de)
AT (1) ATE275750T1 (de)
DE (1) DE69920047T2 (de)
WO (1) WO2000033294A1 (de)

Families Citing this family (21)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US6801895B1 (en) * 1998-12-07 2004-10-05 At&T Corp. Method and apparatus for segmenting a multi-media program based upon audio events
KR100429896B1 (ko) * 2001-11-22 2004-05-03 한국전자통신연구원 잡음 환경에서의 음성신호 검출방법 및 그 장치
WO2005124722A2 (en) * 2004-06-12 2005-12-29 Spl Development, Inc. Aural rehabilitation system and method
KR100713366B1 (ko) * 2005-07-11 2007-05-04 삼성전자주식회사 모폴로지를 이용한 오디오 신호의 피치 정보 추출 방법 및그 장치
US20070011001A1 (en) * 2005-07-11 2007-01-11 Samsung Electronics Co., Ltd. Apparatus for predicting the spectral information of voice signals and a method therefor
KR100800873B1 (ko) 2005-10-28 2008-02-04 삼성전자주식회사 음성 신호 검출 시스템 및 방법
KR100790110B1 (ko) * 2006-03-18 2008-01-02 삼성전자주식회사 모폴로지 기반의 음성 신호 코덱 방법 및 장치
KR100762596B1 (ko) * 2006-04-05 2007-10-01 삼성전자주식회사 음성 신호 전처리 시스템 및 음성 신호 특징 정보 추출방법
US8949120B1 (en) 2006-05-25 2015-02-03 Audience, Inc. Adaptive noise cancelation
KR100860830B1 (ko) * 2006-12-13 2008-09-30 삼성전자주식회사 음성 신호의 스펙트럼 정보 추정 장치 및 방법
US8935158B2 (en) 2006-12-13 2015-01-13 Samsung Electronics Co., Ltd. Apparatus and method for comparing frames using spectral information of audio signal
US8355511B2 (en) * 2008-03-18 2013-01-15 Audience, Inc. System and method for envelope-based acoustic echo cancellation
US8521530B1 (en) * 2008-06-30 2013-08-27 Audience, Inc. System and method for enhancing a monaural audio signal
US8798290B1 (en) 2010-04-21 2014-08-05 Audience, Inc. Systems and methods for adaptive signal equalization
EP2724340B1 (de) * 2011-07-07 2019-05-15 Nuance Communications, Inc. Einkanalige unterdrückung von impulsartigen interferenzen in geräuschbehafteten sprachsignalen
US9286907B2 (en) * 2011-11-23 2016-03-15 Creative Technology Ltd Smart rejecter for keyboard click noise
US9640194B1 (en) 2012-10-04 2017-05-02 Knowles Electronics, Llc Noise suppression for speech processing based on machine-learning mask estimation
US9536540B2 (en) 2013-07-19 2017-01-03 Knowles Electronics, Llc Speech signal separation and synthesis based on auditory scene analysis and speech modeling
WO2016033364A1 (en) 2014-08-28 2016-03-03 Audience, Inc. Multi-sourced noise suppression
US20170264942A1 (en) * 2016-03-11 2017-09-14 Mediatek Inc. Method and Apparatus for Aligning Multiple Audio and Video Tracks for 360-Degree Reconstruction
US12016098B1 (en) 2019-09-12 2024-06-18 Renesas Electronics America System and method for user presence detection based on audio events

Family Cites Families (15)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US4063033A (en) * 1975-12-30 1977-12-13 Rca Corporation Signal quality evaluator
US4281218A (en) * 1979-10-26 1981-07-28 Bell Telephone Laboratories, Incorporated Speech-nonspeech detector-classifier
US4630304A (en) * 1985-07-01 1986-12-16 Motorola, Inc. Automatic background noise estimator for a noise suppression system
US4628529A (en) * 1985-07-01 1986-12-09 Motorola, Inc. Noise suppression system
JPH01158499A (ja) * 1987-12-16 1989-06-21 Hitachi Ltd 定常雑音除去方式
CA2011775C (en) * 1989-03-10 1995-06-27 Yutaka Kaneda Method of detecting acoustic signal
US4975657A (en) * 1989-11-02 1990-12-04 Motorola Inc. Speech detector for automatic level control systems
US5323337A (en) * 1992-08-04 1994-06-21 Loral Aerospace Corp. Signal detector employing mean energy and variance of energy content comparison for noise detection
US5479560A (en) * 1992-10-30 1995-12-26 Technology Research Association Of Medical And Welfare Apparatus Formant detecting device and speech processing apparatus
JP3626492B2 (ja) * 1993-07-07 2005-03-09 ポリコム・インコーポレイテッド 会話の品質向上のための背景雑音の低減
US5826230A (en) 1994-07-18 1998-10-20 Matsushita Electric Industrial Co., Ltd. Speech detection device
US6037988A (en) 1996-03-22 2000-03-14 Microsoft Corp Method for generating sprites for object-based coding sytems using masks and rounding average
US6075875A (en) 1996-09-30 2000-06-13 Microsoft Corporation Segmentation of image features using hierarchical analysis of multi-valued image data and weighted averaging of segmentation results
JP3607450B2 (ja) * 1997-03-05 2005-01-05 Kddi株式会社 オーディオ情報分類装置
JP3160228B2 (ja) * 1997-04-30 2001-04-25 日本放送協会 音声区間検出方法およびその装置

Also Published As

Publication number Publication date
JP4652575B2 (ja) 2011-03-16
WO2000033294A9 (en) 2001-07-05
JP2002531882A (ja) 2002-09-24
EP1141938B1 (de) 2004-09-08
DE69920047T2 (de) 2005-01-20
US6205422B1 (en) 2001-03-20
WO2000033294A1 (en) 2000-06-08
EP1141938A1 (de) 2001-10-10
DE69920047D1 (de) 2004-10-14

Similar Documents

Publication Publication Date Title
ATE275750T1 (de) Detektion von reiner sprache in einem audio signal, mit hilfe einer detektionsgrösse (valley percentage)
Chatlani et al. Local binary patterns for 1-D signal processing
US8046215B2 (en) Method and apparatus to detect voice activity by adding a random signal
Singh et al. Speech in noisy environments: robust automatic segmentation, feature extraction, and hypothesis combination
US20090076814A1 (en) Apparatus and method for determining speech signal
CN102930870A (zh) 利用抗噪幂归一化倒谱系数的鸟类声音识别方法
CN102194452A (zh) 复杂背景噪声中的语音激活检测方法
Jaafar et al. Automatic syllables segmentation for frog identification system
US5101434A (en) Voice recognition using segmented time encoded speech
Kwon et al. Speaker change detection using a new weighted distance measure.
Pourhomayoun et al. Bioacoustic signal classification based on continuous region processing, grid masking and artificial neural network
Chandra et al. Usable speech detection using the modified spectral autocorrelation peak to valley ratio using the LPC residual
CN110299133B (zh) 基于关键字判定非法广播的方法
Kumar et al. Classification of voiced and non-voiced speech signals using empirical wavelet transform and multi-level local patterns
JPH0462398B2 (de)
Hu et al. Separation of stop consonants
Song et al. Feature extraction and classification for audio information in news video
RU94014278A (ru) Способ распознавания изолированных слов речи с адаптацией к диктору
Pencak et al. The NP speech activity detection algorithm
Abu-Shikhah et al. A novel pitch estimation technique using the Teager energy function
Iyer et al. Structural usable speech measure using lpc residual
Benincasa et al. Voicing state determination of co-channel speech
de la Torre et al. Noise robust model-based voice activity detection.
You et al. Environmental sounds recognition using tespar
KR100835993B1 (ko) 마스킹 확률을 이용한 음성 인식 전처리 방법 및 전처리장치

Legal Events

Date Code Title Description
RER Ceased as to paragraph 5 lit. 3 law introducing patent treaties