ATE442641T1 - Spracherkennungsverfahren und -system, das an die eigenschaften von nichtmuttersprachlern angepasst ist - Google Patents
Spracherkennungsverfahren und -system, das an die eigenschaften von nichtmuttersprachlern angepasst istInfo
- Publication number
- ATE442641T1 ATE442641T1 AT04767759T AT04767759T ATE442641T1 AT E442641 T1 ATE442641 T1 AT E442641T1 AT 04767759 T AT04767759 T AT 04767759T AT 04767759 T AT04767759 T AT 04767759T AT E442641 T1 ATE442641 T1 AT E442641T1
- Authority
- AT
- Austria
- Prior art keywords
- recognition method
- acoustic models
- system adapted
- language recognition
- native speakers
- Prior art date
Links
- 238000000034 method Methods 0.000 title abstract 4
Classifications
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/005—Language recognition
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/28—Constructional details of speech recognition systems
- G10L15/32—Multiple recognisers used in sequence or in parallel; Score combination systems therefor, e.g. voting systems
Landscapes
- Engineering & Computer Science (AREA)
- Computational Linguistics (AREA)
- Health & Medical Sciences (AREA)
- Audiology, Speech & Language Pathology (AREA)
- Human Computer Interaction (AREA)
- Physics & Mathematics (AREA)
- Acoustics & Sound (AREA)
- Multimedia (AREA)
- Machine Translation (AREA)
- Electrically Operated Instructional Devices (AREA)
- Stereophonic System (AREA)
Applications Claiming Priority (1)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| PCT/FR2004/001958 WO2006021623A1 (fr) | 2004-07-22 | 2004-07-22 | Procede et systeme de reconnaissance vocale adaptes aux caracteristiques de locuteurs non-natifs |
Publications (1)
| Publication Number | Publication Date |
|---|---|
| ATE442641T1 true ATE442641T1 (de) | 2009-09-15 |
Family
ID=34958888
Family Applications (1)
| Application Number | Title | Priority Date | Filing Date |
|---|---|---|---|
| AT04767759T ATE442641T1 (de) | 2004-07-22 | 2004-07-22 | Spracherkennungsverfahren und -system, das an die eigenschaften von nichtmuttersprachlern angepasst ist |
Country Status (5)
| Country | Link |
|---|---|
| US (1) | US20070294082A1 (de) |
| EP (1) | EP1769489B1 (de) |
| AT (1) | ATE442641T1 (de) |
| DE (1) | DE602004023134D1 (de) |
| WO (1) | WO2006021623A1 (de) |
Families Citing this family (23)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| US8825482B2 (en) | 2005-09-15 | 2014-09-02 | Sony Computer Entertainment Inc. | Audio, video, simulation, and user interface paradigms |
| WO2008033095A1 (en) * | 2006-09-15 | 2008-03-20 | Agency For Science, Technology And Research | Apparatus and method for speech utterance verification |
| US7472061B1 (en) * | 2008-03-31 | 2008-12-30 | International Business Machines Corporation | Systems and methods for building a native language phoneme lexicon having native pronunciations of non-native words derived from non-native pronunciations |
| US20100105015A1 (en) * | 2008-10-23 | 2010-04-29 | Judy Ravin | System and method for facilitating the decoding or deciphering of foreign accents |
| EP2192575B1 (de) * | 2008-11-27 | 2014-04-30 | Nuance Communications, Inc. | Spracherkennung auf Grundlage eines mehrsprachigen akustischen Modells |
| JP5326892B2 (ja) * | 2008-12-26 | 2013-10-30 | 富士通株式会社 | 情報処理装置、プログラム、および音響モデルを生成する方法 |
| US8301446B2 (en) * | 2009-03-30 | 2012-10-30 | Adacel Systems, Inc. | System and method for training an acoustic model with reduced feature space variation |
| TWI391915B (zh) * | 2009-11-17 | 2013-04-01 | Inst Information Industry | 語音變異模型建立裝置、方法及應用該裝置之語音辨識系統和方法 |
| WO2011089651A1 (ja) * | 2010-01-22 | 2011-07-28 | 三菱電機株式会社 | 認識辞書作成装置、音声認識装置及び音声合成装置 |
| US8949125B1 (en) * | 2010-06-16 | 2015-02-03 | Google Inc. | Annotating maps with user-contributed pronunciations |
| US8756062B2 (en) * | 2010-12-10 | 2014-06-17 | General Motors Llc | Male acoustic model adaptation based on language-independent female speech data |
| US9679496B2 (en) | 2011-12-01 | 2017-06-13 | Arkady Zilberman | Reverse language resonance systems and methods for foreign language acquisition |
| US9401140B1 (en) * | 2012-08-22 | 2016-07-26 | Amazon Technologies, Inc. | Unsupervised acoustic model training |
| CN104143328B (zh) * | 2013-08-15 | 2015-11-25 | 腾讯科技(深圳)有限公司 | 一种关键词检测方法和装置 |
| JP6080978B2 (ja) * | 2013-11-20 | 2017-02-15 | 三菱電機株式会社 | 音声認識装置および音声認識方法 |
| US9747897B2 (en) * | 2013-12-17 | 2017-08-29 | Google Inc. | Identifying substitute pronunciations |
| WO2015112149A1 (en) * | 2014-01-23 | 2015-07-30 | Nuance Communications, Inc. | Method and apparatus for exploiting language skill information in automatic speech recognition |
| WO2016048350A1 (en) * | 2014-09-26 | 2016-03-31 | Nuance Communications, Inc. | Improving automatic speech recognition of multilingual named entities |
| US10446136B2 (en) * | 2017-05-11 | 2019-10-15 | Ants Technology (Hk) Limited | Accent invariant speech recognition |
| US10783873B1 (en) * | 2017-12-15 | 2020-09-22 | Educational Testing Service | Native language identification with time delay deep neural networks trained separately on native and non-native english corpora |
| JP6970345B2 (ja) * | 2018-08-21 | 2021-11-24 | 日本電信電話株式会社 | 学習装置、音声認識装置、学習方法、音声認識方法およびプログラム |
| CN109817213B (zh) * | 2019-03-11 | 2024-01-23 | 腾讯科技(深圳)有限公司 | 用于自适应语种进行语音识别的方法、装置及设备 |
| US20220223066A1 (en) * | 2021-01-08 | 2022-07-14 | Ping An Technology (Shenzhen) Co., Ltd. | Method, device, and computer program product for english pronunciation assessment |
Family Cites Families (14)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| US5865626A (en) * | 1996-08-30 | 1999-02-02 | Gte Internetworking Incorporated | Multi-dialect speech recognition method and apparatus |
| US6085160A (en) * | 1998-07-10 | 2000-07-04 | Lernout & Hauspie Speech Products N.V. | Language independent speech recognition |
| US6912499B1 (en) * | 1999-08-31 | 2005-06-28 | Nortel Networks Limited | Method and apparatus for training a multilingual speech model set |
| US6549883B2 (en) * | 1999-11-02 | 2003-04-15 | Nortel Networks Limited | Method and apparatus for generating multilingual transcription groups |
| EP1134726A1 (de) * | 2000-03-15 | 2001-09-19 | Siemens Aktiengesellschaft | Verfahren zur Erkennung von Sprachäusserungen nicht-muttersprachlicher Sprecher in einem Sprachverarbeitungssystem |
| DE60111329T2 (de) * | 2000-11-14 | 2006-03-16 | International Business Machines Corp. | Anpassung des phonetischen Kontextes zur Verbesserung der Spracherkennung |
| US6738738B2 (en) * | 2000-12-23 | 2004-05-18 | Tellme Networks, Inc. | Automated transformation from American English to British English |
| EP1233406A1 (de) * | 2001-02-14 | 2002-08-21 | Sony International (Europe) GmbH | Angepasste Spracherkennung für ausländische Sprecher |
| US7043431B2 (en) * | 2001-08-31 | 2006-05-09 | Nokia Corporation | Multilingual speech recognition system using text derived recognition models |
| DE60219030T2 (de) * | 2002-11-06 | 2007-12-06 | Swisscom Fixnet Ag | Verfahren zur mehrsprachigen Spracherkennung |
| WO2004047077A1 (en) * | 2002-11-15 | 2004-06-03 | Voice Signal Technologies, Inc. | Multilingual speech recognition |
| US7593849B2 (en) * | 2003-01-28 | 2009-09-22 | Avaya, Inc. | Normalization of speech accent |
| US7415411B2 (en) * | 2004-03-04 | 2008-08-19 | Telefonaktiebolaget L M Ericsson (Publ) | Method and apparatus for generating acoustic models for speaker independent speech recognition of foreign words uttered by non-native speakers |
| US20050197837A1 (en) * | 2004-03-08 | 2005-09-08 | Janne Suontausta | Enhanced multilingual speech recognition system |
-
2004
- 2004-07-22 WO PCT/FR2004/001958 patent/WO2006021623A1/fr not_active Ceased
- 2004-07-22 AT AT04767759T patent/ATE442641T1/de not_active IP Right Cessation
- 2004-07-22 EP EP04767759A patent/EP1769489B1/de not_active Expired - Lifetime
- 2004-07-22 US US11/658,010 patent/US20070294082A1/en not_active Abandoned
- 2004-07-22 DE DE602004023134T patent/DE602004023134D1/de not_active Expired - Lifetime
Also Published As
| Publication number | Publication date |
|---|---|
| WO2006021623A1 (fr) | 2006-03-02 |
| DE602004023134D1 (de) | 2009-10-22 |
| US20070294082A1 (en) | 2007-12-20 |
| EP1769489A1 (de) | 2007-04-04 |
| EP1769489B1 (de) | 2009-09-09 |
Similar Documents
| Publication | Publication Date | Title |
|---|---|---|
| ATE442641T1 (de) | Spracherkennungsverfahren und -system, das an die eigenschaften von nichtmuttersprachlern angepasst ist | |
| JP6954680B2 (ja) | 話者の確認方法及び話者の確認装置 | |
| WO2020256257A3 (ko) | 잡음 환경에 강인한 화자 인식을 위한 심화신경망 기반의 특징 강화 및 변형된 손실 함수를 이용한 결합 학습 방법 및 장치 | |
| EP4531037A3 (de) | End-zu-end-sprachumwandlung | |
| WO2021074721A3 (en) | System for automatic assessment of fluency in spoken language and a method thereof | |
| WO2008114448A1 (ja) | 音声認識システム、音声認識プログラムおよび音声認識方法 | |
| WO2006023631A3 (en) | Document transcription system training | |
| WO2020117639A3 (en) | Text independent speaker recognition | |
| WO2019161193A3 (en) | System and method for adaptive detection of spoken language via multiple speech models | |
| JP2017511915A5 (de) | ||
| EP4235648A3 (de) | Beeinflussung eines sprachenmodells | |
| EP4625408A3 (de) | Spracherkennungssysteme und -verfahren | |
| WO2007103520A3 (en) | Codebook-less speech conversion method and system | |
| ATE426526T1 (de) | System und verfahren zur auswahl eines benutzersprachprofils fur eine vorrichtung in einem fahrzeug | |
| WO2004100638A3 (en) | Source-dependent text-to-speech system | |
| JPWO2003015076A1 (ja) | 鳴声の音声的特徴分析に基づく犬の感情判別装置及びその方法 | |
| CN105609101B (zh) | 语音识别系统及语音识别方法 | |
| EP4425488A3 (de) | Training eines akustischen modells mit korrigierten begriffen | |
| ATE457510T1 (de) | Spracherkennungssystem mit riesigem vokabular | |
| WO2007129156A3 (en) | Soft alignment in gaussian mixture model based transformation | |
| CN113921026B (zh) | 语音增强方法和装置 | |
| DK1933590T3 (da) | Fremgangsmåde til höreapparattilpasning | |
| CN107818792A (zh) | 音频转换方法及装置 | |
| Leykum | Acoustic characteristics of verbal irony in standard Austrian German | |
| ATE357723T1 (de) | Verfahren zur mehrsprachigen spracherkennung |
Legal Events
| Date | Code | Title | Description |
|---|---|---|---|
| RER | Ceased as to paragraph 5 lit. 3 law introducing patent treaties |