ATE442641T1 - Spracherkennungsverfahren und -system, das an die eigenschaften von nichtmuttersprachlern angepasst ist - Google Patents

Spracherkennungsverfahren und -system, das an die eigenschaften von nichtmuttersprachlern angepasst ist

Info

Publication number
ATE442641T1
ATE442641T1 AT04767759T AT04767759T ATE442641T1 AT E442641 T1 ATE442641 T1 AT E442641T1 AT 04767759 T AT04767759 T AT 04767759T AT 04767759 T AT04767759 T AT 04767759T AT E442641 T1 ATE442641 T1 AT E442641T1
Authority
AT
Austria
Prior art keywords
recognition method
acoustic models
system adapted
language recognition
native speakers
Prior art date
Application number
AT04767759T
Other languages
English (en)
Inventor
Denis Jouvet
Katarina Bartkova
Original Assignee
France Telecom
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by France Telecom filed Critical France Telecom
Application granted granted Critical
Publication of ATE442641T1 publication Critical patent/ATE442641T1/de

Links

Classifications

    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00Speech recognition
    • G10L15/005Language recognition
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00Speech recognition
    • G10L15/28Constructional details of speech recognition systems
    • G10L15/32Multiple recognisers used in sequence or in parallel; Score combination systems therefor, e.g. voting systems

Landscapes

  • Engineering & Computer Science (AREA)
  • Computational Linguistics (AREA)
  • Health & Medical Sciences (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Human Computer Interaction (AREA)
  • Physics & Mathematics (AREA)
  • Acoustics & Sound (AREA)
  • Multimedia (AREA)
  • Machine Translation (AREA)
  • Electrically Operated Instructional Devices (AREA)
  • Stereophonic System (AREA)
AT04767759T 2004-07-22 2004-07-22 Spracherkennungsverfahren und -system, das an die eigenschaften von nichtmuttersprachlern angepasst ist ATE442641T1 (de)

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
PCT/FR2004/001958 WO2006021623A1 (fr) 2004-07-22 2004-07-22 Procede et systeme de reconnaissance vocale adaptes aux caracteristiques de locuteurs non-natifs

Publications (1)

Publication Number Publication Date
ATE442641T1 true ATE442641T1 (de) 2009-09-15

Family

ID=34958888

Family Applications (1)

Application Number Title Priority Date Filing Date
AT04767759T ATE442641T1 (de) 2004-07-22 2004-07-22 Spracherkennungsverfahren und -system, das an die eigenschaften von nichtmuttersprachlern angepasst ist

Country Status (5)

Country Link
US (1) US20070294082A1 (de)
EP (1) EP1769489B1 (de)
AT (1) ATE442641T1 (de)
DE (1) DE602004023134D1 (de)
WO (1) WO2006021623A1 (de)

Families Citing this family (23)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US8825482B2 (en) 2005-09-15 2014-09-02 Sony Computer Entertainment Inc. Audio, video, simulation, and user interface paradigms
WO2008033095A1 (en) * 2006-09-15 2008-03-20 Agency For Science, Technology And Research Apparatus and method for speech utterance verification
US7472061B1 (en) * 2008-03-31 2008-12-30 International Business Machines Corporation Systems and methods for building a native language phoneme lexicon having native pronunciations of non-native words derived from non-native pronunciations
US20100105015A1 (en) * 2008-10-23 2010-04-29 Judy Ravin System and method for facilitating the decoding or deciphering of foreign accents
EP2192575B1 (de) * 2008-11-27 2014-04-30 Nuance Communications, Inc. Spracherkennung auf Grundlage eines mehrsprachigen akustischen Modells
JP5326892B2 (ja) * 2008-12-26 2013-10-30 富士通株式会社 情報処理装置、プログラム、および音響モデルを生成する方法
US8301446B2 (en) * 2009-03-30 2012-10-30 Adacel Systems, Inc. System and method for training an acoustic model with reduced feature space variation
TWI391915B (zh) * 2009-11-17 2013-04-01 Inst Information Industry 語音變異模型建立裝置、方法及應用該裝置之語音辨識系統和方法
WO2011089651A1 (ja) * 2010-01-22 2011-07-28 三菱電機株式会社 認識辞書作成装置、音声認識装置及び音声合成装置
US8949125B1 (en) * 2010-06-16 2015-02-03 Google Inc. Annotating maps with user-contributed pronunciations
US8756062B2 (en) * 2010-12-10 2014-06-17 General Motors Llc Male acoustic model adaptation based on language-independent female speech data
US9679496B2 (en) 2011-12-01 2017-06-13 Arkady Zilberman Reverse language resonance systems and methods for foreign language acquisition
US9401140B1 (en) * 2012-08-22 2016-07-26 Amazon Technologies, Inc. Unsupervised acoustic model training
CN104143328B (zh) * 2013-08-15 2015-11-25 腾讯科技(深圳)有限公司 一种关键词检测方法和装置
JP6080978B2 (ja) * 2013-11-20 2017-02-15 三菱電機株式会社 音声認識装置および音声認識方法
US9747897B2 (en) * 2013-12-17 2017-08-29 Google Inc. Identifying substitute pronunciations
WO2015112149A1 (en) * 2014-01-23 2015-07-30 Nuance Communications, Inc. Method and apparatus for exploiting language skill information in automatic speech recognition
WO2016048350A1 (en) * 2014-09-26 2016-03-31 Nuance Communications, Inc. Improving automatic speech recognition of multilingual named entities
US10446136B2 (en) * 2017-05-11 2019-10-15 Ants Technology (Hk) Limited Accent invariant speech recognition
US10783873B1 (en) * 2017-12-15 2020-09-22 Educational Testing Service Native language identification with time delay deep neural networks trained separately on native and non-native english corpora
JP6970345B2 (ja) * 2018-08-21 2021-11-24 日本電信電話株式会社 学習装置、音声認識装置、学習方法、音声認識方法およびプログラム
CN109817213B (zh) * 2019-03-11 2024-01-23 腾讯科技(深圳)有限公司 用于自适应语种进行语音识别的方法、装置及设备
US20220223066A1 (en) * 2021-01-08 2022-07-14 Ping An Technology (Shenzhen) Co., Ltd. Method, device, and computer program product for english pronunciation assessment

Family Cites Families (14)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US5865626A (en) * 1996-08-30 1999-02-02 Gte Internetworking Incorporated Multi-dialect speech recognition method and apparatus
US6085160A (en) * 1998-07-10 2000-07-04 Lernout & Hauspie Speech Products N.V. Language independent speech recognition
US6912499B1 (en) * 1999-08-31 2005-06-28 Nortel Networks Limited Method and apparatus for training a multilingual speech model set
US6549883B2 (en) * 1999-11-02 2003-04-15 Nortel Networks Limited Method and apparatus for generating multilingual transcription groups
EP1134726A1 (de) * 2000-03-15 2001-09-19 Siemens Aktiengesellschaft Verfahren zur Erkennung von Sprachäusserungen nicht-muttersprachlicher Sprecher in einem Sprachverarbeitungssystem
DE60111329T2 (de) * 2000-11-14 2006-03-16 International Business Machines Corp. Anpassung des phonetischen Kontextes zur Verbesserung der Spracherkennung
US6738738B2 (en) * 2000-12-23 2004-05-18 Tellme Networks, Inc. Automated transformation from American English to British English
EP1233406A1 (de) * 2001-02-14 2002-08-21 Sony International (Europe) GmbH Angepasste Spracherkennung für ausländische Sprecher
US7043431B2 (en) * 2001-08-31 2006-05-09 Nokia Corporation Multilingual speech recognition system using text derived recognition models
DE60219030T2 (de) * 2002-11-06 2007-12-06 Swisscom Fixnet Ag Verfahren zur mehrsprachigen Spracherkennung
WO2004047077A1 (en) * 2002-11-15 2004-06-03 Voice Signal Technologies, Inc. Multilingual speech recognition
US7593849B2 (en) * 2003-01-28 2009-09-22 Avaya, Inc. Normalization of speech accent
US7415411B2 (en) * 2004-03-04 2008-08-19 Telefonaktiebolaget L M Ericsson (Publ) Method and apparatus for generating acoustic models for speaker independent speech recognition of foreign words uttered by non-native speakers
US20050197837A1 (en) * 2004-03-08 2005-09-08 Janne Suontausta Enhanced multilingual speech recognition system

Also Published As

Publication number Publication date
WO2006021623A1 (fr) 2006-03-02
DE602004023134D1 (de) 2009-10-22
US20070294082A1 (en) 2007-12-20
EP1769489A1 (de) 2007-04-04
EP1769489B1 (de) 2009-09-09

Similar Documents

Publication Publication Date Title
ATE442641T1 (de) Spracherkennungsverfahren und -system, das an die eigenschaften von nichtmuttersprachlern angepasst ist
JP6954680B2 (ja) 話者の確認方法及び話者の確認装置
WO2020256257A3 (ko) 잡음 환경에 강인한 화자 인식을 위한 심화신경망 기반의 특징 강화 및 변형된 손실 함수를 이용한 결합 학습 방법 및 장치
EP4531037A3 (de) End-zu-end-sprachumwandlung
WO2021074721A3 (en) System for automatic assessment of fluency in spoken language and a method thereof
WO2008114448A1 (ja) 音声認識システム、音声認識プログラムおよび音声認識方法
WO2006023631A3 (en) Document transcription system training
WO2020117639A3 (en) Text independent speaker recognition
WO2019161193A3 (en) System and method for adaptive detection of spoken language via multiple speech models
JP2017511915A5 (de)
EP4235648A3 (de) Beeinflussung eines sprachenmodells
EP4625408A3 (de) Spracherkennungssysteme und -verfahren
WO2007103520A3 (en) Codebook-less speech conversion method and system
ATE426526T1 (de) System und verfahren zur auswahl eines benutzersprachprofils fur eine vorrichtung in einem fahrzeug
WO2004100638A3 (en) Source-dependent text-to-speech system
JPWO2003015076A1 (ja) 鳴声の音声的特徴分析に基づく犬の感情判別装置及びその方法
CN105609101B (zh) 语音识别系统及语音识别方法
EP4425488A3 (de) Training eines akustischen modells mit korrigierten begriffen
ATE457510T1 (de) Spracherkennungssystem mit riesigem vokabular
WO2007129156A3 (en) Soft alignment in gaussian mixture model based transformation
CN113921026B (zh) 语音增强方法和装置
DK1933590T3 (da) Fremgangsmåde til höreapparattilpasning
CN107818792A (zh) 音频转换方法及装置
Leykum Acoustic characteristics of verbal irony in standard Austrian German
ATE357723T1 (de) Verfahren zur mehrsprachigen spracherkennung

Legal Events

Date Code Title Description
RER Ceased as to paragraph 5 lit. 3 law introducing patent treaties