EP0874352A3 - Voice activity detection - Google Patents

Voice activity detection

Info

Publication number
EP0874352A3
Authority
EP
European Patent Office
Prior art keywords
speech
activity identification
voice activity
activity detection
controlling
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
EP98102842A
Other languages
English (en)
French (fr)
Other versions
EP0874352A2 (de)
EP0874352B1 (de)
Inventor
Joachim Dipl.-Ing. Stegmann
Gerhard Dipl.-Ing. Schröder
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Deutsche Telekom AG
Original Assignee
Deutsche Telekom AG
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
1997-04-22
Filing date
1998-02-19
Publication date
1999-06-02
Application filed by Deutsche Telekom AG
Publication of EP0874352A2
Publication of EP0874352A3
Application granted
Publication of EP0874352B1
Anticipated expiration
Expired - Lifetime

Classifications

    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L25/00 Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
    • G10L25/78 Detection of presence or absence of voice signals
    • G10L25/27 Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 characterised by the analysis technique

Landscapes

  • Engineering & Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • Multimedia (AREA)
  • Health & Medical Sciences (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Human Computer Interaction (AREA)
  • Computational Linguistics (AREA)
  • Acoustics & Sound (AREA)
  • Signal Processing (AREA)
  • Compression, Expansion, Code Conversion, And Decoders (AREA)
  • Time-Division Multiplex Systems (AREA)
  • Transmission Systems Not Characterized By The Medium Used For Transmission (AREA)
  • Measurement Of The Respiration, Hearing Ability, Form, And Blood Characteristics Of Living Organisms (AREA)
  • Geophysics And Detection Of Objects (AREA)
  • Measuring Pulse, Heart Rate, Blood Pressure Or Blood Flow (AREA)

Abstract

A method and a circuit arrangement for automatic voice activity detection based on the wavelet transform are specified. For source-controlled reduction of the mean transmission rate, a voice activity detection circuit or module is used to control a speech coder and a speech decoder, as well as a background noise coder and a background noise decoder. After a speech signal has been segmented, a wavelet transform is calculated for each frame. A set of parameters is determined from the transform, and from these parameters a set of binary decision variables is calculated with the aid of fixed thresholds in a computing circuit (32). The decision variables control a decision logic (42), whose result, after temporal smoothing in a circuit (44), yields a "speech present / no speech" statement for each frame. The circuit itself consists essentially of a segmentation circuit (28), a wavelet transform circuit (30), a computing circuit for the energy quantities (32), a pause detection circuit (34), a circuit for the stationarity measure (35), a first and a second background detector (36 and 37, respectively), a downstream decision logic (42), and the temporal smoothing circuit (44), which supplies the desired statement at its output (45).
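
The processing chain described in the abstract (segmentation, per-frame wavelet transform, parameters, fixed thresholds, decision logic, temporal smoothing) can be illustrated with a minimal Python sketch. It is not the patented implementation. The Haar wavelet, the use of band energies as the parameter set, all threshold values, the stationarity measure, the single simplified background test, and the hangover rule are assumptions chosen for illustration; the abstract does not specify any of them.

```python
import math

def segment(signal, frame_len):
    """Split the signal into consecutive, non-overlapping frames."""
    return [signal[i:i + frame_len]
            for i in range(0, len(signal) - frame_len + 1, frame_len)]

def haar_dwt(frame, levels):
    """Multi-level Haar transform: [detail_1, ..., detail_levels, approximation].

    The Haar wavelet is an assumption; frame length must be divisible by 2**levels.
    """
    bands, approx = [], list(frame)
    for _ in range(levels):
        pairs = list(zip(approx[0::2], approx[1::2]))
        bands.append([(a - b) / math.sqrt(2) for a, b in pairs])
        approx = [(a + b) / math.sqrt(2) for a, b in pairs]
    bands.append(approx)
    return bands

def detect_voice_activity(signal, frame_len=64, levels=3, pause_thr=0.5,
                          low_thr=4.0, stat_thr=0.2, hangover=2):
    """Return one boolean per frame: True means "speech present".

    All threshold values are hypothetical placeholders.
    """
    decisions, prev, hang = [], None, 0
    for frame in segment(signal, frame_len):
        # Parameter set: energy of each wavelet band, plus the frame total.
        energies = [sum(c * c for c in band) for band in haar_dwt(frame, levels)]
        total = sum(energies)
        # Fixed thresholds turn the parameters into binary decision variables.
        is_pause = total < pause_thr
        is_low = total < low_thr
        if prev is None:
            is_stationary = True  # no history yet: assume stationary
        else:
            change = sum(abs(e - p) for e, p in zip(energies, prev))
            is_stationary = change / max(total, sum(prev), 1e-12) < stat_thr
        prev = energies
        # Decision logic (simplified): background = stationary and low level.
        raw_speech = not is_pause and not (is_stationary and is_low)
        # Temporal smoothing: hold "speech" for `hangover` frames after it ends.
        if raw_speech:
            hang = hangover
            decisions.append(True)
        elif hang > 0:
            hang -= 1
            decisions.append(True)
        else:
            decisions.append(False)
    return decisions
```

Calling `detect_voice_activity(samples)` on a list of float samples yields one decision per 64-sample frame. The hangover keeps the output at "speech" for a few frames after the raw decision drops, which is one common way to realise temporal smoothing of this kind.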
EP98102842A 1997-04-22 1998-02-19 Voice activity detection Expired - Lifetime EP0874352B1 (de)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
DE19716862 1997-04-22
DE19716862A DE19716862A1 (de) 1997-04-22 Voice activity detection

Publications (3)

Publication Number Publication Date
EP0874352A2 (de) 1998-10-28
EP0874352A3 (de) 1999-06-02
EP0874352B1 (de) 2003-10-15

Family

ID=7827317

Family Applications (1)

Application Number Title Priority Date Filing Date
EP98102842A Expired - Lifetime EP0874352B1 (de) 1997-04-22 1998-02-19 Voice activity detection

Country Status (4)

Country Link
US (1) US6374211B2 (de)
EP (1) EP0874352B1 (de)
AT (1) ATE252265T1 (de)
DE (2) DE19716862A1 (de)

Families Citing this family (15)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
DE10026872A1 (de) * 2000-04-28 2001-10-31 Deutsche Telekom Ag Method for calculating a voice activity decision (voice activity detector)
US7254532B2 (en) 2000-04-28 2007-08-07 Deutsche Telekom Ag Method for making a voice activity decision
US7505594B2 (en) * 2000-12-19 2009-03-17 Qualcomm Incorporated Discontinuous transmission (DTX) controller system and method
US6725191B2 (en) * 2001-07-19 2004-04-20 Vocaltec Communications Limited Method and apparatus for transmitting voice over internet
US8315865B2 (en) * 2004-05-04 2012-11-20 Hewlett-Packard Development Company, L.P. Method and apparatus for adaptive conversation detection employing minimal computation
US7574353B2 (en) * 2004-11-18 2009-08-11 Lsi Logic Corporation Transmit/receive data paths for voice-over-internet (VoIP) communication systems
US7983922B2 (en) * 2005-04-15 2011-07-19 Fraunhofer-Gesellschaft Zur Foerderung Der Angewandten Forschung E.V. Apparatus and method for generating multi-channel synthesizer control signal and apparatus and method for multi-channel synthesizing
KR100655953B1 (ko) * 2006-02-06 2006-12-11 한양대학교 산학협력단 Speech processing system using wavelet packet transform and method therefor
US7680657B2 (en) * 2006-08-15 2010-03-16 Microsoft Corporation Auto segmentation based partitioning and clustering approach to robust endpointing
KR100789084B1 (ko) 2006-11-21 2007-12-26 한양대학교 산학협력단 Method for improving sound quality by an overweighting gain with a nonlinear structure in the wavelet packet domain
US9361883B2 (en) * 2012-05-01 2016-06-07 Microsoft Technology Licensing, Llc Dictation with incremental recognition of speech
CN104019885A (zh) 2013-02-28 2014-09-03 杜比实验室特许公司 Sound field analysis system
EP2974253B1 (de) 2013-03-15 2019-05-08 Dolby Laboratories Licensing Corporation Normalization of sound field orientations based on auditory scene analysis
US10917611B2 (en) 2015-06-09 2021-02-09 Avaya Inc. Video adaptation in conferencing using power or view indications
WO2020252782A1 (zh) * 2019-06-21 2020-12-24 深圳市汇顶科技股份有限公司 Voice detection method, voice detection apparatus, voice processing chip and electronic device

Citations (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US5459814A (en) * 1993-03-26 1995-10-17 Hughes Aircraft Company Voice activity detector for speech signals in variable background noise
EP0751495A2 (de) * 1995-06-30 1997-01-02 Deutsche Telekom AG Method and arrangement for coding speech signals

Family Cites Families (8)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US5152007A (en) * 1991-04-23 1992-09-29 Motorola, Inc. Method and apparatus for detecting speech
GB2272554A (en) * 1992-11-13 1994-05-18 Creative Tech Ltd Recognizing speech by using wavelet transform and transient response therefrom
US5388182A (en) * 1993-02-16 1995-02-07 Prometheus, Inc. Nonlinear method and apparatus for coding and decoding acoustic signals with data compression and noise suppression using cochlear filters, wavelet analysis, and irregular sampling reconstruction
JP3090842B2 (ja) * 1994-04-28 2000-09-25 沖電気工業株式会社 Transmitter adapted to the Viterbi decoding method
FR2727236B1 (fr) * 1994-11-22 1996-12-27 Alcatel Mobile Comm France Voice activity detection
US5822726A (en) * 1995-01-31 1998-10-13 Motorola, Inc. Speech presence detector based on sparse time-random signal samples
DE19538852A1 (de) * 1995-06-30 1997-01-02 Deutsche Telekom Ag Method and arrangement for classifying speech signals
US5781881A (en) * 1995-10-19 1998-07-14 Deutsche Telekom Ag Variable-subframe-length speech-coding classes derived from wavelet-transform parameters

Non-Patent Citations (3)

* Cited by examiner, † Cited by third party
Title
"Digital cellular telecommunications system; Discontinuous Transmission (DTX) for Enhanced Full Rate (EFR) speech traffic channels (GSM 06.81)", EUROPEAN TELECOMMUNICATION STANDARD, FINAL DRAFT PRETS 300 729, November 1996 (1996-11-01), European Telecommunications Standards Institute (ETSI), XP002098616 *
BENYASSINE A ET AL: "ITU-T RECOMMENDATION G.729 ANNEX B: A SILENCE COMPRESSION SCHEME FOR USE WITH G.729 OPTIMIZED FOR V.70 DIGITAL SIMULTANEOUS VOICE AND DATA APPLICATIONS", IEEE COMMUNICATIONS MAGAZINE, vol. 35, no. 9, September 1997 (1997-09-01), pages 64 - 73, XP000704425 *
STEGMANN J ET AL: "ROBUST VOICE-ACTIVITY DETECTION BASED ON THE WAVELET TRANSFORM", PROCEEDINGS OF THE IEEE WORKSHOP ON SPEECH CODING FOR TELECOMMUNICATIONS, 7 September 1997 (1997-09-07), pages 99 - 100, XP002073237 *

Also Published As

Publication number Publication date
EP0874352A2 (de) 1998-10-28
US6374211B2 (en) 2002-04-16
US20010014854A1 (en) 2001-08-16
ATE252265T1 (de) 2003-11-15
DE59809897D1 (de) 2003-11-20
EP0874352B1 (de) 2003-10-15
DE19716862A1 (de) 1998-10-29

Similar Documents

Publication Publication Date Title
EP0874352A3 (de) Voice activity detection
EP0932141A3 (de) Method for signal-controlled switching between different audio coding systems
WO1995028824A3 (en) Method of encoding a signal containing speech
CA2228948A1 (en) Pattern recognition
WO2000031719A3 (en) Speech coding with comfort noise variability feature for increased fidelity
EP1083542A3 (de) Method and apparatus for speech detection
EP0770989A3 (de) Method and apparatus for speech coding
DE60117144D1 (de) Speech transmission system and method for handling lost data frames
CA2124643A1 (en) Method and Device for Speech Signal Pitch Period Estimation and Classification in Digital Speech Coders
CA2177422A1 (en) Voiced/Unvoiced Classification of Speech for Use in Speech Decoding During Frame Erasures
WO1999016052A3 (en) Speech recognition system for recognizing continuous and isolated speech
WO2000031720A3 (en) Complex signal activity detection for improved speech/noise classification of an audio signal
CA2158849A1 (en) Speech Recognition with Pause Detection
GB2307582A (en) System for recognizing spoken sounds from continuous speech and method of using same
CA2188369A1 (en) Method and an arrangement for classifying speech signals
EP0862162A3 (de) Speech recognition with non-parametric speech models
GB2308483A (en) Method and system for recognizing a boundary between sounds in continuous speech
AU2112700A (en) A method and apparatus for determining speech coding parameters
EP0651521A3 (de) Method for distinguishing between noise and received signals
MXPA00001875A (es) Speech recognition system and method.
AU642311B2 (en) Method and system for speech recognition without noise interference
WO1999001942A3 (en) A method of noise reduction in speech signals and an apparatus for performing the method
CA2315324A1 (en) Speech signal decoding method and apparatus
FI98162B (fi) Speech recognition method based on an HMM model
EP0798695A3 (de) Method and apparatus for speech recognition

Legal Events

Date Code Title Description
PUAI Public reference made under article 153(3) epc to a published international application that has entered the european phase

Free format text: ORIGINAL CODE: 0009012

AK Designated contracting states

Kind code of ref document: A2

Designated state(s): AT BE CH DE DK ES FI FR GB GR IE IT LI LU MC NL PT SE

AX Request for extension of the european patent

Free format text: AL;LT;LV;MK;RO;SI

PUAL Search report despatched

Free format text: ORIGINAL CODE: 0009013

AK Designated contracting states

Kind code of ref document: A3

Designated state(s): AT BE CH DE DK ES FI FR GB GR IE IT LI LU MC NL PT SE

AX Request for extension of the european patent

Free format text: AL;LT;LV;MK;RO;SI

17P Request for examination filed

Effective date: 19991202

AKX Designation fees paid

Free format text: AT BE CH DE DK ES FI FR GB GR IE IT LI LU MC NL PT SE

GRAH Despatch of communication of intention to grant a patent

Free format text: ORIGINAL CODE: EPIDOS IGRA

RIC1 Information provided on ipc code assigned before grant

Ipc: 7G 10L 11/02 A

GRAS Grant fee paid

Free format text: ORIGINAL CODE: EPIDOSNIGR3

GRAA (expected) grant

Free format text: ORIGINAL CODE: 0009210

AK Designated contracting states

Kind code of ref document: B1

Designated state(s): AT BE CH DE DK ES FI FR GB GR IE IT LI LU MC NL PT SE

PG25 Lapsed in a contracting state [announced via postgrant information from national office to epo]

Ref country code: IT

Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT;WARNING: LAPSES OF ITALIAN PATENTS WITH EFFECTIVE DATE BEFORE 2007 MAY HAVE OCCURRED AT ANY TIME BEFORE 2007. THE CORRECT EFFECTIVE DATE MAY BE DIFFERENT FROM THE ONE RECORDED.

Effective date: 20031015

Ref country code: IE

Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT

Effective date: 20031015

Ref country code: FI

Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT

Effective date: 20031015

Ref country code: ES

Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT

Effective date: 20031015

REG Reference to a national code

Ref country code: GB

Ref legal event code: FG4D

Free format text: NOT ENGLISH

Ref country code: CH

Ref legal event code: EP

REG Reference to a national code

Ref country code: IE

Ref legal event code: FG4D

Free format text: GERMAN

REF Corresponds to:

Ref document number: 59809897

Country of ref document: DE

Date of ref document: 20031120

Kind code of ref document: P

PG25 Lapsed in a contracting state [announced via postgrant information from national office to epo]

Ref country code: SE

Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT

Effective date: 20040115

Ref country code: GR

Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT

Effective date: 20040115

Ref country code: DK

Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT

Effective date: 20040115

GBT Gb: translation of ep patent filed (gb section 77(6)(a)/1977)

Effective date: 20040123

PG25 Lapsed in a contracting state [announced via postgrant information from national office to epo]

Ref country code: LU

Free format text: LAPSE BECAUSE OF NON-PAYMENT OF DUE FEES

Effective date: 20040219

PG25 Lapsed in a contracting state [announced via postgrant information from national office to epo]

Ref country code: MC

Free format text: LAPSE BECAUSE OF NON-PAYMENT OF DUE FEES

Effective date: 20040228

PG25 Lapsed in a contracting state [announced via postgrant information from national office to epo]

Ref country code: LI

Free format text: LAPSE BECAUSE OF NON-PAYMENT OF DUE FEES

Effective date: 20040229

Ref country code: CH

Free format text: LAPSE BECAUSE OF NON-PAYMENT OF DUE FEES

Effective date: 20040229

REG Reference to a national code

Ref country code: IE

Ref legal event code: FD4D

ET Fr: translation filed

PLBE No opposition filed within time limit

Free format text: ORIGINAL CODE: 0009261

STAA Information on the status of an ep patent application or granted ep patent

Free format text: STATUS: NO OPPOSITION FILED WITHIN TIME LIMIT

26N No opposition filed

Effective date: 20040716

REG Reference to a national code

Ref country code: CH

Ref legal event code: PL

PG25 Lapsed in a contracting state [announced via postgrant information from national office to epo]

Ref country code: PT

Free format text: LAPSE BECAUSE OF NON-PAYMENT OF DUE FEES

Effective date: 20040315

REG Reference to a national code

Ref country code: FR

Ref legal event code: PLFP

Year of fee payment: 19

PGFP Annual fee paid to national office [announced via postgrant information from national office to epo]

Ref country code: DE

Payment date: 20160218

Year of fee payment: 19

PGFP Annual fee paid to national office [announced via postgrant information from national office to epo]

Ref country code: GB

Payment date: 20160222

Year of fee payment: 19

Ref country code: NL

Payment date: 20160222

Year of fee payment: 19

Ref country code: BE

Payment date: 20160222

Year of fee payment: 19

Ref country code: AT

Payment date: 20160218

Year of fee payment: 19

Ref country code: FR

Payment date: 20160222

Year of fee payment: 19

PG25 Lapsed in a contracting state [announced via postgrant information from national office to epo]

Ref country code: BE

Free format text: LAPSE BECAUSE OF NON-PAYMENT OF DUE FEES

Effective date: 20170228

REG Reference to a national code

Ref country code: DE

Ref legal event code: R119

Ref document number: 59809897

Country of ref document: DE

REG Reference to a national code

Ref country code: NL

Ref legal event code: MM

Effective date: 20170301

REG Reference to a national code

Ref country code: AT

Ref legal event code: MM01

Ref document number: 252265

Country of ref document: AT

Kind code of ref document: T

Effective date: 20170219

GBPC Gb: european patent ceased through non-payment of renewal fee

Effective date: 20170219

PG25 Lapsed in a contracting state [announced via postgrant information from national office to epo]

Ref country code: AT

Free format text: LAPSE BECAUSE OF NON-PAYMENT OF DUE FEES

Effective date: 20170219

PG25 Lapsed in a contracting state [announced via postgrant information from national office to epo]

Ref country code: NL

Free format text: LAPSE BECAUSE OF NON-PAYMENT OF DUE FEES

Effective date: 20170301

REG Reference to a national code

Ref country code: FR

Ref legal event code: ST

Effective date: 20171031

PG25 Lapsed in a contracting state [announced via postgrant information from national office to epo]

Ref country code: DE

Free format text: LAPSE BECAUSE OF NON-PAYMENT OF DUE FEES

Effective date: 20170901

Ref country code: FR

Free format text: LAPSE BECAUSE OF NON-PAYMENT OF DUE FEES

Effective date: 20170228

REG Reference to a national code

Ref country code: BE

Ref legal event code: MM

Effective date: 20170228

PG25 Lapsed in a contracting state [announced via postgrant information from national office to epo]

Ref country code: GB

Free format text: LAPSE BECAUSE OF NON-PAYMENT OF DUE FEES

Effective date: 20170219