CA2085842A1 - Systeme et methode de reconnaissance d'echantillons de paroles utilisant unr reseau neuronal - Google Patents

Systeme et methode de reconnaissance d'echantillons de paroles utilisant unr reseau neuronal

Info

Publication number
CA2085842A1
CA2085842A1 CA2085842A CA2085842A CA2085842A1 CA 2085842 A1 CA2085842 A1 CA 2085842A1 CA 2085842 A CA2085842 A CA 2085842A CA 2085842 A CA2085842 A CA 2085842A CA 2085842 A1 CA2085842 A1 CA 2085842A1
Authority
CA
Canada
Prior art keywords
letters
utterance
probability match
neural network
discriminate
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
CA2085842A
Other languages
English (en)
Other versions
CA2085842C (fr
Inventor
Ronald A. Cole
Mark A. Fanty
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Oregon Health and Science University
Original Assignee
Individual
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Individual filed Critical Individual
Publication of CA2085842A1 publication Critical patent/CA2085842A1/fr
Application granted granted Critical
Publication of CA2085842C publication Critical patent/CA2085842C/fr
Anticipated expiration legal-status Critical
Expired - Fee Related legal-status Critical Current

Links

Classifications

    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00Speech recognition
    • G10L15/08Speech classification or search
    • G10L15/16Speech classification or search using artificial neural networks

Landscapes

  • Engineering & Computer Science (AREA)
  • Artificial Intelligence (AREA)
  • Evolutionary Computation (AREA)
  • Computational Linguistics (AREA)
  • Health & Medical Sciences (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Human Computer Interaction (AREA)
  • Physics & Mathematics (AREA)
  • Acoustics & Sound (AREA)
  • Multimedia (AREA)
  • Machine Translation (AREA)
  • Telephonic Communication Services (AREA)
CA002085842A 1991-12-20 1992-12-18 Systeme et methode de reconnaissance d'echantillons de paroles utilisant unr reseau neuronal Expired - Fee Related CA2085842C (fr)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
US07/811,819 US5621857A (en) 1991-12-20 1991-12-20 Method and system for identifying and recognizing speech
US07/811,819 1991-12-20

Publications (2)

Publication Number Publication Date
CA2085842A1 true CA2085842A1 (fr) 1993-06-21
CA2085842C CA2085842C (fr) 1996-05-21

Family

ID=25207682

Family Applications (1)

Application Number Title Priority Date Filing Date
CA002085842A Expired - Fee Related CA2085842C (fr) 1991-12-20 1992-12-18 Systeme et methode de reconnaissance d'echantillons de paroles utilisant unr reseau neuronal

Country Status (4)

Country Link
US (1) US5621857A (fr)
EP (1) EP0549265A2 (fr)
AU (1) AU3024892A (fr)
CA (1) CA2085842C (fr)

Families Citing this family (33)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JPH07210190A (ja) * 1993-12-30 1995-08-11 Internatl Business Mach Corp <Ibm> 音声認識方法及びシステム
WO1996010795A1 (fr) * 1994-10-03 1996-04-11 Helfgott & Karas, P.C. Systeme d'acces a une base de donnees
US6446038B1 (en) * 1996-04-01 2002-09-03 Qwest Communications International, Inc. Method and system for objectively evaluating speech
US5924066A (en) * 1997-09-26 1999-07-13 U S West, Inc. System and method for classifying a speech signal
US6438523B1 (en) 1998-05-20 2002-08-20 John A. Oberteuffer Processing handwritten and hand-drawn input and speech input
US6192337B1 (en) * 1998-08-14 2001-02-20 International Business Machines Corporation Apparatus and methods for rejecting confusible words during training associated with a speech recognition system
US6269335B1 (en) 1998-08-14 2001-07-31 International Business Machines Corporation Apparatus and methods for identifying homophones among words in a speech recognition system
US6185530B1 (en) 1998-08-14 2001-02-06 International Business Machines Corporation Apparatus and methods for identifying potential acoustic confusibility among words in a speech recognition system
US6304844B1 (en) * 2000-03-30 2001-10-16 Verbaltek, Inc. Spelling speech recognition apparatus and method for communications
US7028250B2 (en) * 2000-05-25 2006-04-11 Kanisa, Inc. System and method for automatically classifying text
US6625600B2 (en) 2001-04-12 2003-09-23 Telelogue, Inc. Method and apparatus for automatically processing a user's communication
US7610205B2 (en) * 2002-02-12 2009-10-27 Dolby Laboratories Licensing Corporation High quality time-scaling and pitch-scaling of audio signals
US7283954B2 (en) * 2001-04-13 2007-10-16 Dolby Laboratories Licensing Corporation Comparing audio using characterizations based on auditory events
US7711123B2 (en) * 2001-04-13 2010-05-04 Dolby Laboratories Licensing Corporation Segmenting audio signals into auditory events
US7461002B2 (en) * 2001-04-13 2008-12-02 Dolby Laboratories Licensing Corporation Method for time aligning audio signals using characterizations based on auditory events
DK1386312T3 (da) * 2001-05-10 2008-06-09 Dolby Lab Licensing Corp Forbedring af transient ydeevne af audio kodningssystemer med lav bithastighed ved reduktion af forudgående stöj
US20020172349A1 (en) * 2001-05-17 2002-11-21 Shea Phillip N. Neural net-call progress tone detector
US20030149566A1 (en) * 2002-01-02 2003-08-07 Esther Levin System and method for a spoken language interface to a large database of changing records
KR100462989B1 (ko) * 2002-10-07 2004-12-23 엘지전자 주식회사 음성 프롬프트 보드의 음성 토큰 관리 방법
KR100486735B1 (ko) * 2003-02-28 2005-05-03 삼성전자주식회사 최적구획 분류신경망 구성방법과 최적구획 분류신경망을이용한 자동 레이블링방법 및 장치
US20070171061A1 (en) * 2006-01-13 2007-07-26 Alpha Security Products, Inc. Theft deterrent device with dual sensor assembly
US8255216B2 (en) * 2006-10-30 2012-08-28 Nuance Communications, Inc. Speech recognition of character sequences
US8983832B2 (en) * 2008-07-03 2015-03-17 The Board Of Trustees Of The University Of Illinois Systems and methods for identifying speech sound features
US20100057452A1 (en) * 2008-08-28 2010-03-04 Microsoft Corporation Speech interfaces
US20130090926A1 (en) * 2011-09-16 2013-04-11 Qualcomm Incorporated Mobile device context information using speech detection
US10438581B2 (en) 2013-07-31 2019-10-08 Google Llc Speech recognition using neural networks
US10733979B2 (en) * 2015-10-09 2020-08-04 Google Llc Latency constraints for acoustic modeling
WO2017127646A1 (fr) * 2016-01-22 2017-07-27 Knowles Electronics, Llc Authentification vocale secrète partagée
CN107123420A (zh) * 2016-11-10 2017-09-01 厦门创材健康科技有限公司 一种语音识别系统及其交互方法
US11100932B2 (en) * 2017-02-10 2021-08-24 Synaptics Incorporated Robust start-end point detection algorithm using neural network
US11853884B2 (en) 2017-02-10 2023-12-26 Synaptics Incorporated Many or one detection classification systems and methods
CA3206223A1 (fr) * 2017-03-29 2018-10-04 Google Llc Conversion de texte en parole de bout en bout
US11081106B2 (en) * 2017-08-25 2021-08-03 Microsoft Technology Licensing, Llc Contextual spoken language understanding in a spoken dialogue system

Family Cites Families (16)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US4040215A (en) * 1975-05-26 1977-08-09 Totsuka Komuten Co., Ltd. Decay-resisting construction of lower structure for wooden buildings
US4908865A (en) * 1984-12-27 1990-03-13 Texas Instruments Incorporated Speaker independent speech recognition method and system
US4977599A (en) * 1985-05-29 1990-12-11 International Business Machines Corporation Speech recognition employing a set of Markov models that includes Markov models representing transitions to and from silence
JPH0638199B2 (ja) * 1985-09-02 1994-05-18 日本電気株式会社 音声認識装置
WO1987002816A1 (fr) * 1985-10-30 1987-05-07 Central Institute For The Deaf Procedes et appareil de traitement de la parole
US4856067A (en) * 1986-08-21 1989-08-08 Oki Electric Industry Co., Ltd. Speech recognition system wherein the consonantal characteristics of input utterances are extracted
US4852170A (en) * 1986-12-18 1989-07-25 R & D Associates Real time computer speech recognition system
DE3888547T2 (de) * 1987-01-16 1994-06-30 Sharp Kk Gerät zur Sprachanalyse und -synthese.
US4752179A (en) * 1987-01-27 1988-06-21 Cascade Corporation Push-pull load handler for forklift truck
US4937872A (en) * 1987-04-03 1990-06-26 American Telephone And Telegraph Company Neural computation by time concentration
US4905285A (en) * 1987-04-03 1990-02-27 American Telephone And Telegraph Company, At&T Bell Laboratories Analysis arrangement based on a model of human neural responses
US5121428A (en) * 1988-01-20 1992-06-09 Ricoh Company, Ltd. Speaker verification system
JP2739950B2 (ja) * 1988-03-31 1998-04-15 株式会社東芝 パターン認識装置
US5278911A (en) * 1989-05-18 1994-01-11 Smiths Industries Public Limited Company Speech recognition using a neural net
US5212730A (en) * 1991-07-01 1993-05-18 Texas Instruments Incorporated Voice recognition of proper names using text-derived recognition models
US5263097A (en) * 1991-07-24 1993-11-16 Texas Instruments Incorporated Parameter normalized features for classification procedures, systems and methods

Also Published As

Publication number Publication date
EP0549265A3 (fr) 1994-01-26
EP0549265A2 (fr) 1993-06-30
AU3024892A (en) 1993-06-24
CA2085842C (fr) 1996-05-21
US5621857A (en) 1997-04-15

Similar Documents

Publication Publication Date Title
CA2085842A1 (fr) Systeme et methode de reconnaissance d&#39;echantillons de paroles utilisant unr reseau neuronal
US5787230A (en) System and method of intelligent Mandarin speech input for Chinese computers
Lippmann Speech recognition by machines and humans
Zissman et al. Automatic language identification
US6067520A (en) System and method of recognizing continuous mandarin speech utilizing chinese hidden markou models
Schlippe et al. Hausa large vocabulary continuous speech recognition.
Ahmed et al. Verification system for Quran recitation recordings
EP1398758B1 (fr) Procédé et système de génération des questions d&#39;un arbre de décision pour le traitement de la parole
Kurzekar et al. Continuous speech recognition system: A review
Haraty et al. CASRA+: A colloquial Arabic speech recognition application
Berkling et al. Language identification of six languages based on a common set of broad phonemes.
Abdo et al. Semi-automatic segmentation system for syllables extraction from continuous Arabic audio signal
Geutner et al. Transcribing multilingual broadcast news using hypothesis driven lexical adaptation
Oukas et al. Arabic speech recognition using deep learning and common voice dataset
Benıtez et al. Different confidence measures for word verification in speech recognition
JPH0827638B2 (ja) 音素を単位とした音声認識装置
Mari et al. Hidden Markov models and selectively trained neural networks for connected confusable word recognition
Mousa et al. Sub-lexical language models for German LVCSR
Elhadj et al. An accurate recognizer for basic arabic sounds
JP2813209B2 (ja) 大語彙音声認識装置
Ananthakrishna et al. Effect of time-domain windowing on isolated speech recognition system performance
Hecht et al. German broadcast news transcription.
JP3110025B2 (ja) 発声変形検出装置
Pitt et al. Using pronunciation data as a starting point in modeling word recognition
Schmid et al. Real-time, neural network-based, French alphabet recognition with telephone speech

Legal Events

Date Code Title Description
EEER Examination request
MKLA Lapsed