EP0953970A3 - Procédé et dispositif utilisant des arbres de décision pour générer et juger des prononciations multiples - Google Patents
Procédé et dispositif utilisant des arbres de décision pour générer et juger des prononciations multiples Download PDFInfo
- Publication number
- EP0953970A3 EP0953970A3 EP99303390A EP99303390A EP0953970A3 EP 0953970 A3 EP0953970 A3 EP 0953970A3 EP 99303390 A EP99303390 A EP 99303390A EP 99303390 A EP99303390 A EP 99303390A EP 0953970 A3 EP0953970 A3 EP 0953970A3
- Authority
- EP
- European Patent Office
- Prior art keywords
- pronunciations
- spelled word
- generate
- decision trees
- mixed
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Granted
Links
Classifications
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L13/00—Speech synthesis; Text to speech systems
- G10L13/08—Text analysis or generation of parameters for speech synthesis out of text, e.g. grapheme to phoneme translation, prosody generation or stress or intonation determination
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L13/00—Speech synthesis; Text to speech systems
- G10L13/02—Methods for producing synthetic speech; Speech synthesisers
- G10L13/04—Details of speech synthesis systems, e.g. synthesiser structure or memory management
Landscapes
- Engineering & Computer Science (AREA)
- Computational Linguistics (AREA)
- Health & Medical Sciences (AREA)
- Audiology, Speech & Language Pathology (AREA)
- Human Computer Interaction (AREA)
- Physics & Mathematics (AREA)
- Acoustics & Sound (AREA)
- Multimedia (AREA)
- Electrically Operated Instructional Devices (AREA)
- Machine Translation (AREA)
- Document Processing Apparatus (AREA)
Applications Claiming Priority (6)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| US09/067,764 US6016471A (en) | 1998-04-29 | 1998-04-29 | Method and apparatus using decision trees to generate and score multiple pronunciations for a spelled word |
| US67764 | 1998-04-29 | ||
| US69308 | 1998-04-29 | ||
| US09/069,308 US6230131B1 (en) | 1998-04-29 | 1998-04-29 | Method for generating spelling-to-pronunciation decision tree |
| US09/070,300 US6029132A (en) | 1998-04-30 | 1998-04-30 | Method for letter-to-sound in text-to-speech synthesis |
| US70300 | 1998-04-30 |
Publications (3)
| Publication Number | Publication Date |
|---|---|
| EP0953970A2 EP0953970A2 (fr) | 1999-11-03 |
| EP0953970A3 true EP0953970A3 (fr) | 2000-01-19 |
| EP0953970B1 EP0953970B1 (fr) | 2004-03-03 |
Family
ID=27371225
Family Applications (1)
| Application Number | Title | Priority Date | Filing Date |
|---|---|---|---|
| EP99303390A Expired - Lifetime EP0953970B1 (fr) | 1998-04-29 | 1999-04-29 | Procédé et dispositif utilisant des arbres de décision pour générer et juger des prononciations multiples |
Country Status (7)
| Country | Link |
|---|---|
| EP (1) | EP0953970B1 (fr) |
| JP (1) | JP3481497B2 (fr) |
| KR (1) | KR100509797B1 (fr) |
| CN (1) | CN1118770C (fr) |
| AT (1) | ATE261171T1 (fr) |
| DE (1) | DE69915162D1 (fr) |
| TW (1) | TW422967B (fr) |
Families Citing this family (30)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| JP2002539482A (ja) * | 1999-03-08 | 2002-11-19 | シーメンス アクチエンゲゼルシヤフト | 見本音声を決定するための方法及び装置 |
| WO2001048737A2 (fr) * | 1999-12-23 | 2001-07-05 | Intel Corporation | Systeme de reconnaissance vocale dote d"un arbre lexical utilisant le modele de langage de type n-gram |
| US6684187B1 (en) | 2000-06-30 | 2004-01-27 | At&T Corp. | Method and system for preselection of suitable units for concatenative speech |
| US6505158B1 (en) | 2000-07-05 | 2003-01-07 | At&T Corp. | Synthesis-based pre-selection of suitable units for concatenative speech |
| AU2000276394A1 (en) * | 2000-09-30 | 2002-04-15 | Intel Corporation | Method and system for generating and searching an optimal maximum likelihood decision tree for hidden markov model (hmm) based speech recognition |
| US6718232B2 (en) * | 2000-10-13 | 2004-04-06 | Sony Corporation | Robot device and behavior control method for robot device |
| US6845358B2 (en) | 2001-01-05 | 2005-01-18 | Matsushita Electric Industrial Co., Ltd. | Prosody template matching for text-to-speech systems |
| US20040078191A1 (en) * | 2002-10-22 | 2004-04-22 | Nokia Corporation | Scalable neural network-based language identification from written text |
| US7146319B2 (en) * | 2003-03-31 | 2006-12-05 | Novauris Technologies Ltd. | Phonetically based speech recognition system and method |
| FI118062B (fi) * | 2003-04-30 | 2007-06-15 | Nokia Corp | Pienimuistinen päätöspuu |
| EP1638080B1 (fr) * | 2004-08-11 | 2007-10-03 | International Business Machines Corporation | Procédé et système pour la conversion de texte en parole |
| US7558389B2 (en) * | 2004-10-01 | 2009-07-07 | At&T Intellectual Property Ii, L.P. | Method and system of generating a speech signal with overlayed random frequency signal |
| GB2428853A (en) | 2005-07-22 | 2007-02-07 | Novauris Technologies Ltd | Speech recognition application specific dictionary |
| JP2009525492A (ja) * | 2005-08-01 | 2009-07-09 | 一秋 上川 | 英語音、および他のヨーロッパ言語音の表現方法と発音テクニックのシステム |
| JP4769223B2 (ja) * | 2007-04-26 | 2011-09-07 | 旭化成株式会社 | テキスト発音記号変換辞書作成装置、認識語彙辞書作成装置、及び音声認識装置 |
| CN101452701B (zh) * | 2007-12-05 | 2011-09-07 | 株式会社东芝 | 基于反模型的置信度估计方法及装置 |
| KR101250897B1 (ko) * | 2009-08-14 | 2013-04-04 | 한국전자통신연구원 | 전자사전에서 음성인식을 이용한 단어 탐색 장치 및 그 방법 |
| US20110238412A1 (en) * | 2010-03-26 | 2011-09-29 | Antoine Ezzat | Method for Constructing Pronunciation Dictionaries |
| KR101780760B1 (ko) * | 2011-06-30 | 2017-10-10 | 구글 인코포레이티드 | 가변길이 문맥을 이용한 음성인식 |
| US9336771B2 (en) | 2012-11-01 | 2016-05-10 | Google Inc. | Speech recognition using non-parametric models |
| US9483581B2 (en) * | 2013-06-10 | 2016-11-01 | Google Inc. | Evaluation of substitution contexts |
| US9741339B2 (en) * | 2013-06-28 | 2017-08-22 | Google Inc. | Data driven word pronunciation learning and scoring with crowd sourcing based on the word's phonemes pronunciation scores |
| JP6234134B2 (ja) * | 2013-09-25 | 2017-11-22 | 三菱電機株式会社 | 音声合成装置 |
| US9858922B2 (en) | 2014-06-23 | 2018-01-02 | Google Inc. | Caching speech recognition scores |
| US9299347B1 (en) | 2014-10-22 | 2016-03-29 | Google Inc. | Speech recognition using associative mapping |
| CN107767858B (zh) * | 2017-09-08 | 2021-05-04 | 科大讯飞股份有限公司 | 发音词典生成方法及装置、存储介质、电子设备 |
| CN109376358B (zh) * | 2018-10-25 | 2021-07-16 | 陈逸天 | 一种借用历史拼读经验的单词学习方法、装置和电子设备 |
| KR102605159B1 (ko) * | 2020-02-11 | 2023-11-23 | 주식회사 케이티 | 음성 인식 서비스를 제공하는 서버, 방법 및 컴퓨터 프로그램 |
| CN117083669A (zh) * | 2021-05-28 | 2023-11-17 | 微软技术许可有限责任公司 | 检测和改进单词实时误读的方法和系统 |
| US12361936B2 (en) | 2021-08-24 | 2025-07-15 | Microsoft Technology Licensing, Llc | Method and system of automated question generation for speech assistance |
Citations (1)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| EP0562138A1 (fr) * | 1992-03-25 | 1993-09-29 | International Business Machines Corporation | Méthode et dispositif pour créer automatiquement des modèles de Markov de mots nouveaux devant être ajoutés à un vocabulaire destiné à la reconnaissance de la parole |
Family Cites Families (4)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| US4852173A (en) * | 1987-10-29 | 1989-07-25 | International Business Machines Corporation | Design and construction of a binary-tree system for language modelling |
| KR100355393B1 (ko) * | 1995-06-30 | 2002-12-26 | 삼성전자 주식회사 | 음성합성에있어서의음소길이결정방법및음소길이결정트리의학습방법 |
| JP3627299B2 (ja) * | 1995-07-19 | 2005-03-09 | ソニー株式会社 | 音声認識方法及び装置 |
| US5758024A (en) * | 1996-06-25 | 1998-05-26 | Microsoft Corporation | Method and system for encoding pronunciation prefix trees |
-
1999
- 1999-04-28 KR KR10-1999-0015176A patent/KR100509797B1/ko not_active Expired - Lifetime
- 1999-04-28 TW TW088106840A patent/TW422967B/zh not_active IP Right Cessation
- 1999-04-28 JP JP12171099A patent/JP3481497B2/ja not_active Expired - Fee Related
- 1999-04-29 EP EP99303390A patent/EP0953970B1/fr not_active Expired - Lifetime
- 1999-04-29 DE DE69915162T patent/DE69915162D1/de not_active Expired - Lifetime
- 1999-04-29 CN CN99106310A patent/CN1118770C/zh not_active Expired - Lifetime
- 1999-04-29 AT AT99303390T patent/ATE261171T1/de not_active IP Right Cessation
Patent Citations (1)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| EP0562138A1 (fr) * | 1992-03-25 | 1993-09-29 | International Business Machines Corporation | Méthode et dispositif pour créer automatiquement des modèles de Markov de mots nouveaux devant être ajoutés à un vocabulaire destiné à la reconnaissance de la parole |
Non-Patent Citations (1)
| Title |
|---|
| ANDERSEN O ET AL: "Comparison of two tree-structured approaches for grapheme-to-phoneme conversion", PROCEEDINGS ICSLP 96. FOURTH INTERNATIONAL CONFERENCE ON SPOKEN LANGUAGE PROCESSING (CAT. NO.96TH8206), PROCEEDING OF FOURTH INTERNATIONAL CONFERENCE ON SPOKEN LANGUAGE PROCESSING. ICSLP '96, PHILADELPHIA, PA, USA, 3-6 OCT. 1996, 1996, New York, NY, USA, IEEE, USA, pages 1700 - 1703 vol.3, XP002123689, ISBN: 0-7803-3555-4 * |
Also Published As
| Publication number | Publication date |
|---|---|
| CN1233803A (zh) | 1999-11-03 |
| TW422967B (en) | 2001-02-21 |
| EP0953970B1 (fr) | 2004-03-03 |
| JPH11344990A (ja) | 1999-12-14 |
| ATE261171T1 (de) | 2004-03-15 |
| EP0953970A2 (fr) | 1999-11-03 |
| CN1118770C (zh) | 2003-08-20 |
| KR100509797B1 (ko) | 2005-08-23 |
| DE69915162D1 (de) | 2004-04-08 |
| JP3481497B2 (ja) | 2003-12-22 |
| KR19990083555A (ko) | 1999-11-25 |
Similar Documents
| Publication | Publication Date | Title |
|---|---|---|
| EP0953970A3 (fr) | Procédé et dispositif utilisant des arbres de décision pour générer et juger des prononciations multiples | |
| US6016471A (en) | Method and apparatus using decision trees to generate and score multiple pronunciations for a spelled word | |
| Lamel et al. | Bref, a large vocabulary spoken corpus for french1 | |
| US6233553B1 (en) | Method and system for automatically determining phonetic transcriptions associated with spelled words | |
| CN101826325B (zh) | 对中英文语音信号进行识别的方法和装置 | |
| CN1731510B (zh) | 混合语言文语转换 | |
| Selouani et al. | Algerian Arabic speech database (ALGASD): corpus design and automatic speech recognition application | |
| EP0867858A3 (fr) | Génération des prononciations dans la reconnaissance de la parole | |
| EP0874353A3 (fr) | Génération des prononciations dans la reconnaissance de la parole | |
| EP0387602A3 (fr) | Procédé et dispositif pour la détermination automatique des règles phonologiques pour un système de reconnaissance de la parole continue | |
| François et al. | Design of an optimal continuous speech database for text-to-speech synthesis considered as a set covering problem. | |
| Grocholewski | CORPORA-speech database for Polish diphones. | |
| US20040044528A1 (en) | Method and apparatus for generating decision tree questions for speech processing | |
| Byrd | Sex, dialects and reduction | |
| Filipsson et al. | LUKAS-a preliminary report on a new Swedish speech synthesis | |
| Raptis et al. | Expressive Speech Synthesis for Storytelling: The INNOETICS'Entry to the Blizzard Challenge 2016. | |
| Rögnvaldsson | The Icelandic speech recognition project Hjal | |
| Engstrand et al. | Phonetics and phonology of Swedish dialects around the year 2000: a research plan | |
| Roux et al. | Developing a Multilingual Telephone Based Information System in African Languages. | |
| Sečujski et al. | An overview of the AlfaNum text-to-speech synthesis system | |
| KR100451919B1 (ko) | 영어 발음 기호의 분해 및 합성 방법 | |
| Bamberg et al. | Adaptable phoneme-based models for large-vocabulary speech recognition | |
| Schaden | A Database for the Analysis of Cross-Lingual Pronunciation Variants of European City Names. | |
| Maghbouleh | A logistic regression model for detecting prominences | |
| Koffi et al. | Speech Synthesis by Syllable a Concatenation: Experimentation with Betine |
Legal Events
| Date | Code | Title | Description |
|---|---|---|---|
| PUAI | Public reference made under article 153(3) epc to a published international application that has entered the european phase |
Free format text: ORIGINAL CODE: 0009012 |
|
| AK | Designated contracting states |
Kind code of ref document: A2 Designated state(s): AT BE CH CY DE DK ES FI FR GB GR IE IT LI LU MC NL PT SE |
|
| AX | Request for extension of the european patent |
Free format text: AL;LT;LV;MK;RO;SI |
|
| PUAL | Search report despatched |
Free format text: ORIGINAL CODE: 0009013 |
|
| AK | Designated contracting states |
Kind code of ref document: A3 Designated state(s): AT BE CH CY DE DK ES FI FR GB GR IE IT LI LU MC NL PT SE |
|
| AX | Request for extension of the european patent |
Free format text: AL;LT;LV;MK;RO;SI |
|
| 17P | Request for examination filed |
Effective date: 20000515 |
|
| AKX | Designation fees paid |
Free format text: AT BE CH CY DE DK ES FI FR GB GR IE IT LI LU MC NL PT SE |
|
| 17Q | First examination report despatched |
Effective date: 20020712 |
|
| RIC1 | Information provided on ipc code assigned before grant |
Ipc: 7G 10L 13/08 A |
|
| GRAP | Despatch of communication of intention to grant a patent |
Free format text: ORIGINAL CODE: EPIDOSNIGR1 |
|
| GRAS | Grant fee paid |
Free format text: ORIGINAL CODE: EPIDOSNIGR3 |
|
| GRAA | (expected) grant |
Free format text: ORIGINAL CODE: 0009210 |
|
| AK | Designated contracting states |
Kind code of ref document: B1 Designated state(s): AT BE CH CY DE DK ES FI FR GB GR IE IT LI LU MC NL PT SE |
|
| PG25 | Lapsed in a contracting state [announced via postgrant information from national office to epo] |
Ref country code: NL Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT Effective date: 20040303 Ref country code: LI Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT Effective date: 20040303 Ref country code: IT Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT;WARNING: LAPSES OF ITALIAN PATENTS WITH EFFECTIVE DATE BEFORE 2007 MAY HAVE OCCURRED AT ANY TIME BEFORE 2007. THE CORRECT EFFECTIVE DATE MAY BE DIFFERENT FROM THE ONE RECORDED. Effective date: 20040303 Ref country code: FR Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT Effective date: 20040303 Ref country code: FI Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT Effective date: 20040303 Ref country code: CY Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT Effective date: 20040303 Ref country code: CH Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT Effective date: 20040303 Ref country code: BE Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT Effective date: 20040303 Ref country code: AT Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT Effective date: 20040303 |
|
| REG | Reference to a national code |
Ref country code: GB Ref legal event code: FG4D |
|
| REG | Reference to a national code |
Ref country code: CH Ref legal event code: EP |
|
| REG | Reference to a national code |
Ref country code: IE Ref legal event code: FG4D |
|
| REF | Corresponds to: |
Ref document number: 69915162 Country of ref document: DE Date of ref document: 20040408 Kind code of ref document: P |
|
| PG25 | Lapsed in a contracting state [announced via postgrant information from national office to epo] |
Ref country code: LU Free format text: LAPSE BECAUSE OF NON-PAYMENT OF DUE FEES Effective date: 20040429 Ref country code: IE Free format text: LAPSE BECAUSE OF NON-PAYMENT OF DUE FEES Effective date: 20040429 |
|
| PG25 | Lapsed in a contracting state [announced via postgrant information from national office to epo] |
Ref country code: MC Free format text: LAPSE BECAUSE OF NON-PAYMENT OF DUE FEES Effective date: 20040430 |
|
| PG25 | Lapsed in a contracting state [announced via postgrant information from national office to epo] |
Ref country code: SE Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT Effective date: 20040603 Ref country code: GR Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT Effective date: 20040603 Ref country code: DK Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT Effective date: 20040603 |
|
| PG25 | Lapsed in a contracting state [announced via postgrant information from national office to epo] |
Ref country code: DE Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT Effective date: 20040604 |
|
| PG25 | Lapsed in a contracting state [announced via postgrant information from national office to epo] |
Ref country code: ES Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT Effective date: 20040614 |
|
| NLV1 | Nl: lapsed or annulled due to failure to fulfill the requirements of art. 29p and 29m of the patents act | ||
| REG | Reference to a national code |
Ref country code: CH Ref legal event code: PL |
|
| PLBE | No opposition filed within time limit |
Free format text: ORIGINAL CODE: 0009261 |
|
| STAA | Information on the status of an ep patent application or granted ep patent |
Free format text: STATUS: NO OPPOSITION FILED WITHIN TIME LIMIT |
|
| GBPC | Gb: european patent ceased through non-payment of renewal fee |
Effective date: 20040603 |
|
| REG | Reference to a national code |
Ref country code: IE Ref legal event code: MM4A |
|
| EN | Fr: translation not filed | ||
| 26N | No opposition filed |
Effective date: 20041206 |
|
| REG | Reference to a national code |
Ref country code: GB Ref legal event code: 728V |
|
| REG | Reference to a national code |
Ref country code: GB Ref legal event code: 728Y |
|
| PG25 | Lapsed in a contracting state [announced via postgrant information from national office to epo] |
Ref country code: PT Free format text: LAPSE BECAUSE OF NON-PAYMENT OF DUE FEES Effective date: 20040803 |
|
| REG | Reference to a national code |
Ref country code: GB Ref legal event code: 732E Free format text: REGISTERED BETWEEN 20140612 AND 20140618 |
|
| PGFP | Annual fee paid to national office [announced via postgrant information from national office to epo] |
Ref country code: GB Payment date: 20180329 Year of fee payment: 20 |
|
| REG | Reference to a national code |
Ref country code: GB Ref legal event code: PE20 Expiry date: 20190428 |
|
| PG25 | Lapsed in a contracting state [announced via postgrant information from national office to epo] |
Ref country code: GB Free format text: LAPSE BECAUSE OF EXPIRATION OF PROTECTION Effective date: 20190428 |