EP1288914A2 - Procédé pour la correction de mesures de la qualité vocale - Google Patents

Procédé pour la correction de mesures de la qualité vocale Download PDF

Info

Publication number
EP1288914A2
EP1288914A2 EP02012790A EP02012790A EP1288914A2 EP 1288914 A2 EP1288914 A2 EP 1288914A2 EP 02012790 A EP02012790 A EP 02012790A EP 02012790 A EP02012790 A EP 02012790A EP 1288914 A2 EP1288914 A2 EP 1288914A2
Authority
EP
European Patent Office
Prior art keywords
speech
speech quality
value
measured
correction
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
EP02012790A
Other languages
German (de)
English (en)
Other versions
EP1288914A3 (fr
EP1288914B1 (fr
Inventor
Jens Dr. Berger
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Deutsche Telekom AG
Original Assignee
Deutsche Telekom AG
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Deutsche Telekom AG filed Critical Deutsche Telekom AG
Publication of EP1288914A2 publication Critical patent/EP1288914A2/fr
Publication of EP1288914A3 publication Critical patent/EP1288914A3/fr
Application granted granted Critical
Publication of EP1288914B1 publication Critical patent/EP1288914B1/fr
Anticipated expiration legal-status Critical
Expired - Lifetime legal-status Critical Current

Links

Classifications

    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L25/00Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
    • G10L25/48Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 specially adapted for particular use
    • G10L25/69Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 specially adapted for particular use for evaluating synthetic or decoded voice signals
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L21/00Speech or voice signal processing techniques to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
    • G10L21/02Speech enhancement, e.g. noise reduction or echo cancellation
    • G10L21/0208Noise filtering
    • G10L21/0216Noise filtering characterised by the method used for estimating noise
    • G10L2021/02168Noise filtering characterised by the method used for estimating noise the estimation exclusively taking place during speech pauses

Definitions

  • the invention relates to instrumental methods for measuring the speech quality of recorded or transmitted voice signals. Doing so will measure of speech quality assumed that e.g. with the ITU-T standard P.862 ("Perceptual Evaluation of Speech Quality (PESQ), an Objective Method for end-to-end Speech Quality Assessment of Narrow-band Telephone Networks and Speech Codecs ", ITU-T, Geneva, 2001) become.
  • P.862 Perceptual Evaluation of Speech Quality (PESQ), an Objective Method for end-to-end Speech Quality Assessment of Narrow-band Telephone Networks and Speech Codecs
  • the perceived speech quality z. B. for telephone connections or radio transmissions is mainly caused by simultaneous speech disorders, i.e. disorders during the Speech activity, determined. But noises in speech breaks also go into that Quality assessment, especially with high quality speech reproduction quality.
  • Speech quality determinations of speech signals are generally made using auditory ("subjective") investigations with test persons.
  • the goal of Instrumental ("objective") procedures for determining speech quality are made up of Properties of the speech signal to be evaluated by means of suitable computing methods To determine characteristic values that determine the speech quality of the speech signal to be assessed describe without having to resort to judgments from test subjects.
  • Known methods for instrumental determination of speech quality determine the speech quality based on a comparison between undisturbed reference speech signal (source speech signal) and the to be evaluated and possibly disturbed signal.
  • sample connection systems at which a known reference speech signal (source speech signal) is fed in at the source and z. B. transmitted over a telephone connection and recorded at the sink. After The speech signal is recorded and a speech quality value is calculated.
  • source speech signal source speech signal
  • Instrumental procedures for determining language quality are usually limited to Evaluation of sections with language activity.
  • the current ITU-T standard P.862 is also only limited to sections with the determination of the speech quality active language. Especially with high quality speech reproduction and noises that these methods (e.g. measuring methods subsequently only occur during pauses in speech) ITU-T Rec. P.862) unreliable quality values.
  • the speech quality here too optimistic because the speech quality felt by a listener is based on the entire signal including possible noises in the speech pauses.
  • Some instrumental procedures for determining speech quality such as B. the procedure according to ITU-T Rec. P.862, take into account the calculation of the speech quality values No sounds during the pauses in speech. The resulting measured values are especially with high playback quality in the case of voice activity but occurring noises during pauses in speech, unreliable. With the present method, the background noises are intended be taken into account in the pauses in the speech when determining the speech quality values.
  • the solution to the task assumes that the background noise in the Speech breaks regarding their disturbing influence on the perceived speech quality be rated.
  • intensity characteristics of the background noise are determined and with these values the speech quality measured values, which are determined by an instrumental Process, e.g. B. according to ITU-T Rec. P.862, were corrected.
  • the speech quality value using the source speech signal and the evaluating disturbed speech signal e.g. B. with the method according to ITU-T Rec. P.962, calculated.
  • These speech signals are also available to the subsequent correction process Input parameters available.
  • the correction procedure described here is required still the calculated speech quality value, the z. B. with the method according to ITU-T Rec. P.862 was calculated.
  • one or more intensity parameters are generated of the noise in pauses in speech.
  • This can e.g. B. the average loudness according to ISO 532 of the background noise during speech pauses.
  • Other intensity parameters e.g. Sharpness, impulsiveness, fluctuation strength
  • sharpness, impulsiveness, fluctuation strength can be included in the correction value. It it is assumed that increasing intensity values of one also increasing disturbance caused by the noise in speech pauses and thus lead to a greater reduction in the perceived speech quality.
  • the speech quality is rated too high by the described speech quality measurement methods.
  • One or more background noise intensity values during speech pauses are used to correct the measured speech quality value. Assuming that the measured speech quality value on a scale from 1 (low quality) to 5 (very good quality) according to ITU-T Recommendation P.800 ("Methods for objective and subjective assessment of quality", ITU-T, Geneva 1996 ), all values above a certain speech quality threshold value (e.g. above 3.0 in the ITU-T Rec. P.862 procedure) are reduced if background noise occurs during speech pauses.
  • This reduction depends on the intensity characteristics of the background noise, the proportion of the speech pauses in the overall signal PA and the speech quality value Y.
  • the function a (N) represents a clear and increasing weighting function of the intensity characteristic value N.
  • the corrected speech quality value is always less than or equal to the uncorrected value.
  • the correction is small if the intensity of the noise is low (N is small), there are only a few pauses in speech ( PA small) or the speech quality value is close to the speech quality threshold ( Y-YS small). Corrections are made more strongly if there are strong pause noises with an otherwise high speech quality Y.
  • the intensity parameters are weighted with the weighting functions a (N), b (M) and c (O) . Since an increasing disturbance with increasing values is assumed, clear and increasing weighting functions must also be used here.
  • the exemplary embodiment presented here shows an example of a correction of the speech quality values determined using the method according to ITU-T Rec. P.862 "PESQ" (status 2001).
  • This method provides a speech quality value by comparing an undisturbed source speech signal with the disturbed speech signal to be evaluated. These two speech signals are used to determine the mean loudness of the background noise in accordance with patent application DE 101 20 168.
  • the value N in sone thus calculated is used to correct the quality value calculated using the method according to ITU-T Rec. P.862 "PESQ” (status 2001) used.
  • the value of the threshold crossing YD is determined, ie by what proportion the measured speech quality Y exceeds the speech quality threshold YS .

Landscapes

  • Engineering & Computer Science (AREA)
  • Computational Linguistics (AREA)
  • Signal Processing (AREA)
  • Health & Medical Sciences (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Human Computer Interaction (AREA)
  • Physics & Mathematics (AREA)
  • Acoustics & Sound (AREA)
  • Multimedia (AREA)
  • Measurement Of Mechanical Vibrations Or Ultrasonic Waves (AREA)
  • Monitoring And Testing Of Exchanges (AREA)
  • Noise Elimination (AREA)
EP02012790A 2001-08-29 2002-06-10 Procédé pour la correction de mesures de la qualité vocale Expired - Lifetime EP1288914B1 (fr)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
DE10142846A DE10142846A1 (de) 2001-08-29 2001-08-29 Verfahren zur Korrektur von gemessenen Sprachqualitätswerten
DE10142846 2001-08-29

Publications (3)

Publication Number Publication Date
EP1288914A2 true EP1288914A2 (fr) 2003-03-05
EP1288914A3 EP1288914A3 (fr) 2004-05-19
EP1288914B1 EP1288914B1 (fr) 2005-12-21

Family

ID=7697364

Family Applications (1)

Application Number Title Priority Date Filing Date
EP02012790A Expired - Lifetime EP1288914B1 (fr) 2001-08-29 2002-06-10 Procédé pour la correction de mesures de la qualité vocale

Country Status (3)

Country Link
EP (1) EP1288914B1 (fr)
AT (1) ATE313846T1 (fr)
DE (2) DE10142846A1 (fr)

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2010119216A1 (fr) * 2009-04-17 2010-10-21 France Telecom Procede et dispositif d'evaluation objective de la qualite vocale d'un signal de parole prenant en compte la classification du bruit de fond contenu dans le signal

Family Cites Families (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
FI94810C (fi) * 1993-10-11 1995-10-25 Nokia Mobile Phones Ltd Menetelmä huonon GSM-puhekehyksen tunnistamiseksi
US5684921A (en) * 1995-07-13 1997-11-04 U S West Technologies, Inc. Method and system for identifying a corrupted speech message signal
US5809414A (en) * 1995-11-22 1998-09-15 Northern Telecom Limited User out-of-range indication for digital wireless systems
SE506341C2 (sv) * 1996-04-10 1997-12-08 Ericsson Telefon Ab L M Metod och anordning för rekonstruktion av en mottagen talsignal
EP0980064A1 (fr) * 1998-06-26 2000-02-16 Ascom AG Méthode pour effectuer une évaluation automatique de la qualité de transmission de signaux audio
DE19840548C2 (de) * 1998-08-27 2001-02-15 Deutsche Telekom Ag Verfahren zur instrumentellen Sprachqualitätsbestimmung
GB9911777D0 (en) * 1999-05-20 1999-07-21 Univ Southampton Transceiver

Cited By (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2010119216A1 (fr) * 2009-04-17 2010-10-21 France Telecom Procede et dispositif d'evaluation objective de la qualite vocale d'un signal de parole prenant en compte la classification du bruit de fond contenu dans le signal
FR2944640A1 (fr) * 2009-04-17 2010-10-22 France Telecom Procede et dispositif d'evaluation objective de la qualite vocale d'un signal de parole prenant en compte la classification du bruit de fond contenu dans le signal.
US8886529B2 (en) 2009-04-17 2014-11-11 France Telecom Method and device for the objective evaluation of the voice quality of a speech signal taking into account the classification of the background noise contained in the signal

Also Published As

Publication number Publication date
ATE313846T1 (de) 2006-01-15
EP1288914A3 (fr) 2004-05-19
DE10142846A1 (de) 2003-03-20
DE50205328D1 (de) 2006-01-26
EP1288914B1 (fr) 2005-12-21

Similar Documents

Publication Publication Date Title
DE69520067T2 (de) Verfahren und Einrichtung zur Kennzeichnung eines Eingangssignales
DE60126274T2 (de) Dynamische dienstqualität überwachung
DE19952538C2 (de) Automatische Verstärkungsregelung in einem Spracherkennungssystem
DE60108401T2 (de) System zur erhöhung der sprachqualität
DE60205232T2 (de) Verfahren und vorrichtung zur bestimmung der qualität eines sprachsignals
DE60122751T2 (de) Verfahren und vorrichtung für die objektive bewertung der sprachqualität ohne referenzsignal
EP1386307B1 (fr) Procede et dispositif pour determiner un niveau de qualite d'un signal audio
DE10017646A1 (de) Geräuschunterdrückung im Zeitbereich
DE60222770T2 (de) Verbessertes verfahren zur ermittlung der qualität eines sprachsignals
DE19957221A1 (de) Exponentielle Echo- und Geräuschabsenkung in Sprachpausen
DE112018003662T5 (de) Sprachsignalnivellierung
EP1382034B1 (fr) Procede de determination de valeurs caracteristiques d'intensite de bruits de fond dans des pauses de voix de signaux vocaux
EP1048025B1 (fr) Procede de determination instrumentale de la qualite vocale
DE2021126A1 (de) Spracherkennungsvorrichtung
EP1634277A1 (fr) Extraction de sections de signaux d'essai pour la mesure de la qualite d'un signal audio
DE602004006912T2 (de) Verfahren zur Verarbeitung eines akustischen Signals und ein Hörgerät
DE60110541T2 (de) Verfahren zur Spracherkennung mit geräuschabhängiger Normalisierung der Varianz
DE102009016656A1 (de) Verfahren und Hörvorrichtung zum Einstellen eines Hörgeräts mit in einer externen Einheit aufgezeichneten Daten
EP1288914B1 (fr) Procédé pour la correction de mesures de la qualité vocale
EP0946015B1 (fr) Procédé et dispositif d'estimation de la qualité de transmission
EP1005016A2 (fr) Procédé et dispositif de circuit pour mesurer le niveau de parole dans un système de traitement du signal de parole
DE10048157B4 (de) Verfahren zum Messen einer Frequenzselektivität und Verfahren und Vorrichtung zum Abschätzen einer Hörfilterform durch ein Frequenzselektivitäts-Meßverfahren
EP1351550A1 (fr) Procédé d'adaptation d'une amplification de signal dans une prothèse auditive et prothèse auditive
EP0535425A2 (fr) Procédé d'amplification de signaux acoustiques pour les malentendants et dispositif pour la réalisation du procédé
EP3796676A1 (fr) Procédé de fonctionnement d'un appareil auditif et appareil auditif

Legal Events

Date Code Title Description
PUAI Public reference made under article 153(3) epc to a published international application that has entered the european phase

Free format text: ORIGINAL CODE: 0009012

AK Designated contracting states

Kind code of ref document: A2

Designated state(s): AT BE CH CY DE DK ES FI FR GB GR IE IT LI LU MC NL PT SE TR

AX Request for extension of the european patent

Extension state: AL LT LV MK RO SI

PUAL Search report despatched

Free format text: ORIGINAL CODE: 0009013

AK Designated contracting states

Kind code of ref document: A3

Designated state(s): AT BE CH CY DE DK ES FI FR GB GR IE IT LI LU MC NL PT SE TR

AX Request for extension of the european patent

Extension state: AL LT LV MK RO SI

RIC1 Information provided on ipc code assigned before grant

Ipc: 7H 04M 3/22 B

Ipc: 7G 10L 19/00 A

17P Request for examination filed

Effective date: 20041119

AKX Designation fees paid

Designated state(s): AT BE CH CY DE DK ES FI FR GB GR IE IT LI LU MC NL PT SE TR

GRAP Despatch of communication of intention to grant a patent

Free format text: ORIGINAL CODE: EPIDOSNIGR1

GRAS Grant fee paid

Free format text: ORIGINAL CODE: EPIDOSNIGR3

GRAA (expected) grant

Free format text: ORIGINAL CODE: 0009210

AK Designated contracting states

Kind code of ref document: B1

Designated state(s): AT BE CH CY DE DK ES FI FR GB GR IE IT LI LU MC NL PT SE TR

PG25 Lapsed in a contracting state [announced via postgrant information from national office to epo]

Ref country code: IT

Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT;WARNING: LAPSES OF ITALIAN PATENTS WITH EFFECTIVE DATE BEFORE 2007 MAY HAVE OCCURRED AT ANY TIME BEFORE 2007. THE CORRECT EFFECTIVE DATE MAY BE DIFFERENT FROM THE ONE RECORDED.

Effective date: 20051221

Ref country code: IE

Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT

Effective date: 20051221

Ref country code: NL

Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT

Effective date: 20051221

Ref country code: FI

Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT

Effective date: 20051221

REG Reference to a national code

Ref country code: GB

Ref legal event code: FG4D

Free format text: NOT ENGLISH

REG Reference to a national code

Ref country code: CH

Ref legal event code: EP

REG Reference to a national code

Ref country code: IE

Ref legal event code: FG4D

Free format text: LANGUAGE OF EP DOCUMENT: GERMAN

REF Corresponds to:

Ref document number: 50205328

Country of ref document: DE

Date of ref document: 20060126

Kind code of ref document: P

PG25 Lapsed in a contracting state [announced via postgrant information from national office to epo]

Ref country code: SE

Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT

Effective date: 20060321

Ref country code: GR

Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT

Effective date: 20060321

Ref country code: DK

Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT

Effective date: 20060321

GBT Gb: translation of ep patent filed (gb section 77(6)(a)/1977)

Effective date: 20060306

PG25 Lapsed in a contracting state [announced via postgrant information from national office to epo]

Ref country code: ES

Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT

Effective date: 20060401

PG25 Lapsed in a contracting state [announced via postgrant information from national office to epo]

Ref country code: PT

Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT

Effective date: 20060522

NLV1 Nl: lapsed or annulled due to failure to fulfill the requirements of art. 29p and 29m of the patents act
PG25 Lapsed in a contracting state [announced via postgrant information from national office to epo]

Ref country code: BE

Free format text: LAPSE BECAUSE OF NON-PAYMENT OF DUE FEES

Effective date: 20060630

Ref country code: LI

Free format text: LAPSE BECAUSE OF NON-PAYMENT OF DUE FEES

Effective date: 20060630

Ref country code: CH

Free format text: LAPSE BECAUSE OF NON-PAYMENT OF DUE FEES

Effective date: 20060630

Ref country code: MC

Free format text: LAPSE BECAUSE OF NON-PAYMENT OF DUE FEES

Effective date: 20060630

REG Reference to a national code

Ref country code: IE

Ref legal event code: FD4D

ET Fr: translation filed
PLBE No opposition filed within time limit

Free format text: ORIGINAL CODE: 0009261

STAA Information on the status of an ep patent application or granted ep patent

Free format text: STATUS: NO OPPOSITION FILED WITHIN TIME LIMIT

26N No opposition filed

Effective date: 20060922

REG Reference to a national code

Ref country code: CH

Ref legal event code: PL

PG25 Lapsed in a contracting state [announced via postgrant information from national office to epo]

Ref country code: AT

Free format text: LAPSE BECAUSE OF NON-PAYMENT OF DUE FEES

Effective date: 20060610

BERE Be: lapsed

Owner name: DEUTSCHE TELEKOM A.G.

Effective date: 20060630

PG25 Lapsed in a contracting state [announced via postgrant information from national office to epo]

Ref country code: TR

Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT

Effective date: 20051221

Ref country code: LU

Free format text: LAPSE BECAUSE OF NON-PAYMENT OF DUE FEES

Effective date: 20060610

PG25 Lapsed in a contracting state [announced via postgrant information from national office to epo]

Ref country code: CY

Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT

Effective date: 20051221

REG Reference to a national code

Ref country code: FR

Ref legal event code: PLFP

Year of fee payment: 15

PGFP Annual fee paid to national office [announced via postgrant information from national office to epo]

Ref country code: GB

Payment date: 20160628

Year of fee payment: 15

Ref country code: DE

Payment date: 20160622

Year of fee payment: 15

PGFP Annual fee paid to national office [announced via postgrant information from national office to epo]

Ref country code: FR

Payment date: 20160621

Year of fee payment: 15

REG Reference to a national code

Ref country code: DE

Ref legal event code: R119

Ref document number: 50205328

Country of ref document: DE

GBPC Gb: european patent ceased through non-payment of renewal fee

Effective date: 20170610

REG Reference to a national code

Ref country code: FR

Ref legal event code: ST

Effective date: 20180228

PG25 Lapsed in a contracting state [announced via postgrant information from national office to epo]

Ref country code: GB

Free format text: LAPSE BECAUSE OF NON-PAYMENT OF DUE FEES

Effective date: 20170610

Ref country code: DE

Free format text: LAPSE BECAUSE OF NON-PAYMENT OF DUE FEES

Effective date: 20180103

PG25 Lapsed in a contracting state [announced via postgrant information from national office to epo]

Ref country code: FR

Free format text: LAPSE BECAUSE OF NON-PAYMENT OF DUE FEES

Effective date: 20170630