ATE263997T1 - Zwischen-wörter verbindung phonemische modelle - Google Patents

Zwischen-wörter verbindung phonemische modelle

Info

Publication number
ATE263997T1
ATE263997T1 AT99952974T AT99952974T ATE263997T1 AT E263997 T1 ATE263997 T1 AT E263997T1 AT 99952974 T AT99952974 T AT 99952974T AT 99952974 T AT99952974 T AT 99952974T AT E263997 T1 ATE263997 T1 AT E263997T1
Authority
AT
Austria
Prior art keywords
word
phone
models
vocabulary
input utterance
Prior art date
Application number
AT99952974T
Other languages
English (en)
Inventor
Vladimir Sejnoha
Tom Lynch
Ramesh Sarukkai
Original Assignee
Lernout & Hauspie Speechprod
Vladimir Sejnoha
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Lernout & Hauspie Speechprod, Vladimir Sejnoha filed Critical Lernout & Hauspie Speechprod
Application granted granted Critical
Publication of ATE263997T1 publication Critical patent/ATE263997T1/de

Links

Classifications

    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00Speech recognition
    • G10L15/04Segmentation; Word boundary detection
    • G10L15/05Word boundary detection
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00Speech recognition
    • G10L15/08Speech classification or search
    • G10L15/18Speech classification or search using natural language modelling
    • G10L15/183Speech classification or search using natural language modelling using context dependencies, e.g. language models
    • G10L15/187Phonemic context, e.g. pronunciation rules, phonotactical constraints or phoneme n-grams
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00Speech recognition
    • G10L15/02Feature extraction for speech recognition; Selection of recognition unit
    • G10L2015/022Demisyllables, biphones or triphones being the recognition units

Landscapes

  • Engineering & Computer Science (AREA)
  • Computational Linguistics (AREA)
  • Health & Medical Sciences (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Human Computer Interaction (AREA)
  • Physics & Mathematics (AREA)
  • Acoustics & Sound (AREA)
  • Multimedia (AREA)
  • Artificial Intelligence (AREA)
  • Machine Translation (AREA)
  • Navigation (AREA)
  • Telephone Function (AREA)
  • Telephone Set Structure (AREA)
AT99952974T 1998-09-29 1999-09-29 Zwischen-wörter verbindung phonemische modelle ATE263997T1 (de)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
US10237398P 1998-09-29 1998-09-29
PCT/US1999/022501 WO2000019409A1 (en) 1998-09-29 1999-09-29 Inter-word triphone models

Publications (1)

Publication Number Publication Date
ATE263997T1 true ATE263997T1 (de) 2004-04-15

Family

ID=22289500

Family Applications (1)

Application Number Title Priority Date Filing Date
AT99952974T ATE263997T1 (de) 1998-09-29 1999-09-29 Zwischen-wörter verbindung phonemische modelle

Country Status (7)

Country Link
US (1) US6606594B1 (de)
EP (1) EP1116218B1 (de)
AT (1) ATE263997T1 (de)
AU (1) AU6501999A (de)
CA (1) CA2395012A1 (de)
DE (1) DE69916297D1 (de)
WO (1) WO2000019409A1 (de)

Families Citing this family (13)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
DE19939102C1 (de) * 1999-08-18 2000-10-26 Siemens Ag Verfahren und Anordnung zum Erkennen von Sprache
DE10120513C1 (de) * 2001-04-26 2003-01-09 Siemens Ag Verfahren zur Bestimmung einer Folge von Lautbausteinen zum Synthetisieren eines Sprachsignals einer tonalen Sprache
JP2003208195A (ja) * 2002-01-16 2003-07-25 Sharp Corp 連続音声認識装置および連続音声認識方法、連続音声認識プログラム、並びに、プログラム記録媒体
TWI454955B (zh) * 2006-12-29 2014-10-01 Nuance Communications Inc 使用模型檔產生動畫的方法及電腦可讀取的訊號承載媒體
KR100897554B1 (ko) * 2007-02-21 2009-05-15 삼성전자주식회사 분산 음성인식시스템 및 방법과 분산 음성인식을 위한 단말기
US8536976B2 (en) * 2008-06-11 2013-09-17 Veritrix, Inc. Single-channel multi-factor authentication
US8166297B2 (en) * 2008-07-02 2012-04-24 Veritrix, Inc. Systems and methods for controlling access to encrypted data stored on a mobile device
WO2010051342A1 (en) * 2008-11-03 2010-05-06 Veritrix, Inc. User authentication for social networks
US8914279B1 (en) * 2011-09-23 2014-12-16 Google Inc. Efficient parsing with structured prediction cascades
US9602666B2 (en) 2015-04-09 2017-03-21 Avaya Inc. Silence density models
US10134425B1 (en) * 2015-06-29 2018-11-20 Amazon Technologies, Inc. Direction-based speech endpointing
US10121471B2 (en) * 2015-06-29 2018-11-06 Amazon Technologies, Inc. Language model speech endpointing
US11615239B2 (en) * 2020-03-31 2023-03-28 Adobe Inc. Accuracy of natural language input classification utilizing response delay

Family Cites Families (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JPS57178295A (en) * 1981-04-27 1982-11-02 Nippon Electric Co Continuous word recognition apparatus
US5268990A (en) 1991-01-31 1993-12-07 Sri International Method for recognizing speech using linguistically-motivated hidden Markov models
US5502790A (en) * 1991-12-24 1996-03-26 Oki Electric Industry Co., Ltd. Speech recognition method and system using triphones, diphones, and phonemes
JPH0728487A (ja) 1993-03-26 1995-01-31 Texas Instr Inc <Ti> 音声認識方法
US5819221A (en) * 1994-08-31 1998-10-06 Texas Instruments Incorporated Speech recognition using clustered between word and/or phrase coarticulation
US5937384A (en) * 1996-05-01 1999-08-10 Microsoft Corporation Method and system for speech recognition using continuous density hidden Markov models
US6163769A (en) * 1997-10-02 2000-12-19 Microsoft Corporation Text-to-speech using clustered context-dependent phoneme-based units

Also Published As

Publication number Publication date
US6606594B1 (en) 2003-08-12
DE69916297D1 (de) 2004-05-13
CA2395012A1 (en) 2000-04-06
WO2000019409A1 (en) 2000-04-06
EP1116218B1 (de) 2004-04-07
WO2000019409A9 (en) 2000-08-31
EP1116218A1 (de) 2001-07-18
AU6501999A (en) 2000-04-17

Similar Documents

Publication Publication Date Title
CA2315832A1 (en) System for using silence in speech recognition
CN100401375C (zh) 语音处理系统及方法
WO2007117814A3 (en) Voice signal perturbation for speech recognition
CA2275774A1 (en) Selection of superwords based on criteria relevant to both speech recognition and understanding
ATE263997T1 (de) Zwischen-wörter verbindung phonemische modelle
EP1291848A3 (de) Ausprachen in mehreren Sprachen zur Spracherkennung
WO2002054033A3 (en) Hierarchical language models for speech recognition
CA2202656A1 (en) Speech recognition
WO2006062707A3 (en) System and method for speech recognition-enabled automated call routing
ATE395685T1 (de) Spracherkennung durch wort-in-phrase-befehl
EP1205908A3 (de) Aussprache von neuen Wörtern zur Sprachverarbeitung
ATE314718T1 (de) Srecherangepasste spracherkennung
EP1629464A4 (de) Spracherkennungssystem und verfahren auf phonetischer basis
MX9505299A (es) Sistemas, metodos y articulos de fabricacion para realizar la hipotesizacion de n-cadenas optimas de alta resolucion.
AU2001250579A1 (en) Discriminatively trained mixture models in continuous speech recognition
ATE235733T1 (de) Anordnung und verfahren zur erkennung eines vorgegebenen wortschatzes in gesprochener sprache durch einen rechner
KR19980070329A (ko) 사용자 정의 문구의 화자 독립 인식을 위한 방법 및 시스템
CN109754790A (zh) 一种基于混合声学模型的语音识别系统及方法
Boite et al. A new approach towards keyword spotting.
ATE445215T1 (de) Spracherkennung für grosse dynamische vokabulare
CN110782895A (zh) 一种基于人工智能的人机语音系统
EP0916972A3 (de) Spracherkennungsverfahren und Spracherkennungsvorrichtung
WO2001026092A3 (en) Attribute-based word modeling
Roe Deployment of human-machine dialogue systems.
ATE441918T1 (de) Sprachdialogverfahren und -system

Legal Events

Date Code Title Description
RER Ceased as to paragraph 5 lit. 3 law introducing patent treaties