EP0953970A3 - Procédé et dispositif utilisant des arbres de décision pour générer et juger des prononciations multiples - Google Patents

Procédé et dispositif utilisant des arbres de décision pour générer et juger des prononciations multiples Download PDF

Info

Publication number
EP0953970A3
EP0953970A3 EP99303390A EP99303390A EP0953970A3 EP 0953970 A3 EP0953970 A3 EP 0953970A3 EP 99303390 A EP99303390 A EP 99303390A EP 99303390 A EP99303390 A EP 99303390A EP 0953970 A3 EP0953970 A3 EP 0953970A3
Authority
EP
European Patent Office
Prior art keywords
pronunciations
spelled word
generate
decision trees
mixed
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
EP99303390A
Other languages
German (de)
English (en)
Other versions
EP0953970B1 (fr
EP0953970A2 (fr
Inventor
Roland Kuhn
Jean-Claude Junqua
Matteo Contolini
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Panasonic Holdings Corp
Original Assignee
Matsushita Electric Industrial Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Priority claimed from US09/067,764 external-priority patent/US6016471A/en
Priority claimed from US09/069,308 external-priority patent/US6230131B1/en
Priority claimed from US09/070,300 external-priority patent/US6029132A/en
Application filed by Matsushita Electric Industrial Co Ltd filed Critical Matsushita Electric Industrial Co Ltd
Publication of EP0953970A2 publication Critical patent/EP0953970A2/fr
Publication of EP0953970A3 publication Critical patent/EP0953970A3/fr
Application granted granted Critical
Publication of EP0953970B1 publication Critical patent/EP0953970B1/fr
Anticipated expiration legal-status Critical
Expired - Lifetime legal-status Critical Current

Links

Classifications

    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L13/00Speech synthesis; Text to speech systems
    • G10L13/08Text analysis or generation of parameters for speech synthesis out of text, e.g. grapheme to phoneme translation, prosody generation or stress or intonation determination
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L13/00Speech synthesis; Text to speech systems
    • G10L13/02Methods for producing synthetic speech; Speech synthesisers
    • G10L13/04Details of speech synthesis systems, e.g. synthesiser structure or memory management

Landscapes

  • Engineering & Computer Science (AREA)
  • Computational Linguistics (AREA)
  • Health & Medical Sciences (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Human Computer Interaction (AREA)
  • Physics & Mathematics (AREA)
  • Acoustics & Sound (AREA)
  • Multimedia (AREA)
  • Electrically Operated Instructional Devices (AREA)
  • Machine Translation (AREA)
  • Document Processing Apparatus (AREA)
EP99303390A 1998-04-29 1999-04-29 Procédé et dispositif utilisant des arbres de décision pour générer et juger des prononciations multiples Expired - Lifetime EP0953970B1 (fr)

Applications Claiming Priority (6)

Application Number Priority Date Filing Date Title
US09/067,764 US6016471A (en) 1998-04-29 1998-04-29 Method and apparatus using decision trees to generate and score multiple pronunciations for a spelled word
US67764 1998-04-29
US69308 1998-04-29
US09/069,308 US6230131B1 (en) 1998-04-29 1998-04-29 Method for generating spelling-to-pronunciation decision tree
US09/070,300 US6029132A (en) 1998-04-30 1998-04-30 Method for letter-to-sound in text-to-speech synthesis
US70300 1998-04-30

Publications (3)

Publication Number Publication Date
EP0953970A2 EP0953970A2 (fr) 1999-11-03
EP0953970A3 true EP0953970A3 (fr) 2000-01-19
EP0953970B1 EP0953970B1 (fr) 2004-03-03

Family

ID=27371225

Family Applications (1)

Application Number Title Priority Date Filing Date
EP99303390A Expired - Lifetime EP0953970B1 (fr) 1998-04-29 1999-04-29 Procédé et dispositif utilisant des arbres de décision pour générer et juger des prononciations multiples

Country Status (7)

Country Link
EP (1) EP0953970B1 (fr)
JP (1) JP3481497B2 (fr)
KR (1) KR100509797B1 (fr)
CN (1) CN1118770C (fr)
AT (1) ATE261171T1 (fr)
DE (1) DE69915162D1 (fr)
TW (1) TW422967B (fr)

Families Citing this family (30)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JP2002539482A (ja) * 1999-03-08 2002-11-19 シーメンス アクチエンゲゼルシヤフト 見本音声を決定するための方法及び装置
WO2001048737A2 (fr) * 1999-12-23 2001-07-05 Intel Corporation Systeme de reconnaissance vocale dote d"un arbre lexical utilisant le modele de langage de type n-gram
US6684187B1 (en) 2000-06-30 2004-01-27 At&T Corp. Method and system for preselection of suitable units for concatenative speech
US6505158B1 (en) 2000-07-05 2003-01-07 At&T Corp. Synthesis-based pre-selection of suitable units for concatenative speech
AU2000276394A1 (en) * 2000-09-30 2002-04-15 Intel Corporation Method and system for generating and searching an optimal maximum likelihood decision tree for hidden markov model (hmm) based speech recognition
US6718232B2 (en) * 2000-10-13 2004-04-06 Sony Corporation Robot device and behavior control method for robot device
US6845358B2 (en) 2001-01-05 2005-01-18 Matsushita Electric Industrial Co., Ltd. Prosody template matching for text-to-speech systems
US20040078191A1 (en) * 2002-10-22 2004-04-22 Nokia Corporation Scalable neural network-based language identification from written text
US7146319B2 (en) * 2003-03-31 2006-12-05 Novauris Technologies Ltd. Phonetically based speech recognition system and method
FI118062B (fi) * 2003-04-30 2007-06-15 Nokia Corp Pienimuistinen päätöspuu
EP1638080B1 (fr) * 2004-08-11 2007-10-03 International Business Machines Corporation Procédé et système pour la conversion de texte en parole
US7558389B2 (en) * 2004-10-01 2009-07-07 At&T Intellectual Property Ii, L.P. Method and system of generating a speech signal with overlayed random frequency signal
GB2428853A (en) 2005-07-22 2007-02-07 Novauris Technologies Ltd Speech recognition application specific dictionary
JP2009525492A (ja) * 2005-08-01 2009-07-09 一秋 上川 英語音、および他のヨーロッパ言語音の表現方法と発音テクニックのシステム
JP4769223B2 (ja) * 2007-04-26 2011-09-07 旭化成株式会社 テキスト発音記号変換辞書作成装置、認識語彙辞書作成装置、及び音声認識装置
CN101452701B (zh) * 2007-12-05 2011-09-07 株式会社东芝 基于反模型的置信度估计方法及装置
KR101250897B1 (ko) * 2009-08-14 2013-04-04 한국전자통신연구원 전자사전에서 음성인식을 이용한 단어 탐색 장치 및 그 방법
US20110238412A1 (en) * 2010-03-26 2011-09-29 Antoine Ezzat Method for Constructing Pronunciation Dictionaries
KR101780760B1 (ko) * 2011-06-30 2017-10-10 구글 인코포레이티드 가변길이 문맥을 이용한 음성인식
US9336771B2 (en) 2012-11-01 2016-05-10 Google Inc. Speech recognition using non-parametric models
US9483581B2 (en) * 2013-06-10 2016-11-01 Google Inc. Evaluation of substitution contexts
US9741339B2 (en) * 2013-06-28 2017-08-22 Google Inc. Data driven word pronunciation learning and scoring with crowd sourcing based on the word's phonemes pronunciation scores
JP6234134B2 (ja) * 2013-09-25 2017-11-22 三菱電機株式会社 音声合成装置
US9858922B2 (en) 2014-06-23 2018-01-02 Google Inc. Caching speech recognition scores
US9299347B1 (en) 2014-10-22 2016-03-29 Google Inc. Speech recognition using associative mapping
CN107767858B (zh) * 2017-09-08 2021-05-04 科大讯飞股份有限公司 发音词典生成方法及装置、存储介质、电子设备
CN109376358B (zh) * 2018-10-25 2021-07-16 陈逸天 一种借用历史拼读经验的单词学习方法、装置和电子设备
KR102605159B1 (ko) * 2020-02-11 2023-11-23 주식회사 케이티 음성 인식 서비스를 제공하는 서버, 방법 및 컴퓨터 프로그램
CN117083669A (zh) * 2021-05-28 2023-11-17 微软技术许可有限责任公司 检测和改进单词实时误读的方法和系统
US12361936B2 (en) 2021-08-24 2025-07-15 Microsoft Technology Licensing, Llc Method and system of automated question generation for speech assistance

Citations (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
EP0562138A1 (fr) * 1992-03-25 1993-09-29 International Business Machines Corporation Méthode et dispositif pour créer automatiquement des modèles de Markov de mots nouveaux devant être ajoutés à un vocabulaire destiné à la reconnaissance de la parole

Family Cites Families (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US4852173A (en) * 1987-10-29 1989-07-25 International Business Machines Corporation Design and construction of a binary-tree system for language modelling
KR100355393B1 (ko) * 1995-06-30 2002-12-26 삼성전자 주식회사 음성합성에있어서의음소길이결정방법및음소길이결정트리의학습방법
JP3627299B2 (ja) * 1995-07-19 2005-03-09 ソニー株式会社 音声認識方法及び装置
US5758024A (en) * 1996-06-25 1998-05-26 Microsoft Corporation Method and system for encoding pronunciation prefix trees

Patent Citations (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
EP0562138A1 (fr) * 1992-03-25 1993-09-29 International Business Machines Corporation Méthode et dispositif pour créer automatiquement des modèles de Markov de mots nouveaux devant être ajoutés à un vocabulaire destiné à la reconnaissance de la parole

Non-Patent Citations (1)

* Cited by examiner, † Cited by third party
Title
ANDERSEN O ET AL: "Comparison of two tree-structured approaches for grapheme-to-phoneme conversion", PROCEEDINGS ICSLP 96. FOURTH INTERNATIONAL CONFERENCE ON SPOKEN LANGUAGE PROCESSING (CAT. NO.96TH8206), PROCEEDING OF FOURTH INTERNATIONAL CONFERENCE ON SPOKEN LANGUAGE PROCESSING. ICSLP '96, PHILADELPHIA, PA, USA, 3-6 OCT. 1996, 1996, New York, NY, USA, IEEE, USA, pages 1700 - 1703 vol.3, XP002123689, ISBN: 0-7803-3555-4 *

Also Published As

Publication number Publication date
CN1233803A (zh) 1999-11-03
TW422967B (en) 2001-02-21
EP0953970B1 (fr) 2004-03-03
JPH11344990A (ja) 1999-12-14
ATE261171T1 (de) 2004-03-15
EP0953970A2 (fr) 1999-11-03
CN1118770C (zh) 2003-08-20
KR100509797B1 (ko) 2005-08-23
DE69915162D1 (de) 2004-04-08
JP3481497B2 (ja) 2003-12-22
KR19990083555A (ko) 1999-11-25

Similar Documents

Publication Publication Date Title
EP0953970A3 (fr) Procédé et dispositif utilisant des arbres de décision pour générer et juger des prononciations multiples
US6016471A (en) Method and apparatus using decision trees to generate and score multiple pronunciations for a spelled word
Lamel et al. Bref, a large vocabulary spoken corpus for french1
US6233553B1 (en) Method and system for automatically determining phonetic transcriptions associated with spelled words
CN101826325B (zh) 对中英文语音信号进行识别的方法和装置
CN1731510B (zh) 混合语言文语转换
Selouani et al. Algerian Arabic speech database (ALGASD): corpus design and automatic speech recognition application
EP0867858A3 (fr) Génération des prononciations dans la reconnaissance de la parole
EP0874353A3 (fr) Génération des prononciations dans la reconnaissance de la parole
EP0387602A3 (fr) Procédé et dispositif pour la détermination automatique des règles phonologiques pour un système de reconnaissance de la parole continue
François et al. Design of an optimal continuous speech database for text-to-speech synthesis considered as a set covering problem.
Grocholewski CORPORA-speech database for Polish diphones.
US20040044528A1 (en) Method and apparatus for generating decision tree questions for speech processing
Byrd Sex, dialects and reduction
Filipsson et al. LUKAS-a preliminary report on a new Swedish speech synthesis
Raptis et al. Expressive Speech Synthesis for Storytelling: The INNOETICS'Entry to the Blizzard Challenge 2016.
Rögnvaldsson The Icelandic speech recognition project Hjal
Engstrand et al. Phonetics and phonology of Swedish dialects around the year 2000: a research plan
Roux et al. Developing a Multilingual Telephone Based Information System in African Languages.
Sečujski et al. An overview of the AlfaNum text-to-speech synthesis system
KR100451919B1 (ko) 영어 발음 기호의 분해 및 합성 방법
Bamberg et al. Adaptable phoneme-based models for large-vocabulary speech recognition
Schaden A Database for the Analysis of Cross-Lingual Pronunciation Variants of European City Names.
Maghbouleh A logistic regression model for detecting prominences
Koffi et al. Speech Synthesis by Syllable a Concatenation: Experimentation with Betine

Legal Events

Date Code Title Description
PUAI Public reference made under article 153(3) epc to a published international application that has entered the european phase

Free format text: ORIGINAL CODE: 0009012

AK Designated contracting states

Kind code of ref document: A2

Designated state(s): AT BE CH CY DE DK ES FI FR GB GR IE IT LI LU MC NL PT SE

AX Request for extension of the european patent

Free format text: AL;LT;LV;MK;RO;SI

PUAL Search report despatched

Free format text: ORIGINAL CODE: 0009013

AK Designated contracting states

Kind code of ref document: A3

Designated state(s): AT BE CH CY DE DK ES FI FR GB GR IE IT LI LU MC NL PT SE

AX Request for extension of the european patent

Free format text: AL;LT;LV;MK;RO;SI

17P Request for examination filed

Effective date: 20000515

AKX Designation fees paid

Free format text: AT BE CH CY DE DK ES FI FR GB GR IE IT LI LU MC NL PT SE

17Q First examination report despatched

Effective date: 20020712

RIC1 Information provided on ipc code assigned before grant

Ipc: 7G 10L 13/08 A

GRAP Despatch of communication of intention to grant a patent

Free format text: ORIGINAL CODE: EPIDOSNIGR1

GRAS Grant fee paid

Free format text: ORIGINAL CODE: EPIDOSNIGR3

GRAA (expected) grant

Free format text: ORIGINAL CODE: 0009210

AK Designated contracting states

Kind code of ref document: B1

Designated state(s): AT BE CH CY DE DK ES FI FR GB GR IE IT LI LU MC NL PT SE

PG25 Lapsed in a contracting state [announced via postgrant information from national office to epo]

Ref country code: NL

Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT

Effective date: 20040303

Ref country code: LI

Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT

Effective date: 20040303

Ref country code: IT

Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT;WARNING: LAPSES OF ITALIAN PATENTS WITH EFFECTIVE DATE BEFORE 2007 MAY HAVE OCCURRED AT ANY TIME BEFORE 2007. THE CORRECT EFFECTIVE DATE MAY BE DIFFERENT FROM THE ONE RECORDED.

Effective date: 20040303

Ref country code: FR

Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT

Effective date: 20040303

Ref country code: FI

Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT

Effective date: 20040303

Ref country code: CY

Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT

Effective date: 20040303

Ref country code: CH

Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT

Effective date: 20040303

Ref country code: BE

Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT

Effective date: 20040303

Ref country code: AT

Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT

Effective date: 20040303

REG Reference to a national code

Ref country code: GB

Ref legal event code: FG4D

REG Reference to a national code

Ref country code: CH

Ref legal event code: EP

REG Reference to a national code

Ref country code: IE

Ref legal event code: FG4D

REF Corresponds to:

Ref document number: 69915162

Country of ref document: DE

Date of ref document: 20040408

Kind code of ref document: P

PG25 Lapsed in a contracting state [announced via postgrant information from national office to epo]

Ref country code: LU

Free format text: LAPSE BECAUSE OF NON-PAYMENT OF DUE FEES

Effective date: 20040429

Ref country code: IE

Free format text: LAPSE BECAUSE OF NON-PAYMENT OF DUE FEES

Effective date: 20040429

PG25 Lapsed in a contracting state [announced via postgrant information from national office to epo]

Ref country code: MC

Free format text: LAPSE BECAUSE OF NON-PAYMENT OF DUE FEES

Effective date: 20040430

PG25 Lapsed in a contracting state [announced via postgrant information from national office to epo]

Ref country code: SE

Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT

Effective date: 20040603

Ref country code: GR

Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT

Effective date: 20040603

Ref country code: DK

Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT

Effective date: 20040603

PG25 Lapsed in a contracting state [announced via postgrant information from national office to epo]

Ref country code: DE

Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT

Effective date: 20040604

PG25 Lapsed in a contracting state [announced via postgrant information from national office to epo]

Ref country code: ES

Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT

Effective date: 20040614

NLV1 Nl: lapsed or annulled due to failure to fulfill the requirements of art. 29p and 29m of the patents act
REG Reference to a national code

Ref country code: CH

Ref legal event code: PL

PLBE No opposition filed within time limit

Free format text: ORIGINAL CODE: 0009261

STAA Information on the status of an ep patent application or granted ep patent

Free format text: STATUS: NO OPPOSITION FILED WITHIN TIME LIMIT

GBPC Gb: european patent ceased through non-payment of renewal fee

Effective date: 20040603

REG Reference to a national code

Ref country code: IE

Ref legal event code: MM4A

EN Fr: translation not filed
26N No opposition filed

Effective date: 20041206

REG Reference to a national code

Ref country code: GB

Ref legal event code: 728V

REG Reference to a national code

Ref country code: GB

Ref legal event code: 728Y

PG25 Lapsed in a contracting state [announced via postgrant information from national office to epo]

Ref country code: PT

Free format text: LAPSE BECAUSE OF NON-PAYMENT OF DUE FEES

Effective date: 20040803

REG Reference to a national code

Ref country code: GB

Ref legal event code: 732E

Free format text: REGISTERED BETWEEN 20140612 AND 20140618

PGFP Annual fee paid to national office [announced via postgrant information from national office to epo]

Ref country code: GB

Payment date: 20180329

Year of fee payment: 20

REG Reference to a national code

Ref country code: GB

Ref legal event code: PE20

Expiry date: 20190428

PG25 Lapsed in a contracting state [announced via postgrant information from national office to epo]

Ref country code: GB

Free format text: LAPSE BECAUSE OF EXPIRATION OF PROTECTION

Effective date: 20190428