ATE407420T1 - Verteiltes spracherkennungssystem unter verwendung von akustischer merkmalsvektor- modifizierung - Google Patents
Verteiltes spracherkennungssystem unter verwendung von akustischer merkmalsvektor- modifizierungInfo
- Publication number
- ATE407420T1 ATE407420T1 AT02702130T AT02702130T ATE407420T1 AT E407420 T1 ATE407420 T1 AT E407420T1 AT 02702130 T AT02702130 T AT 02702130T AT 02702130 T AT02702130 T AT 02702130T AT E407420 T1 ATE407420 T1 AT E407420T1
- Authority
- AT
- Austria
- Prior art keywords
- acoustic feature
- speaker
- recognition system
- feature vector
- feature vectors
- Prior art date
Links
- 239000013598 vector Substances 0.000 title abstract 6
- 230000004048 modification Effects 0.000 title abstract 3
- 238000012986 modification Methods 0.000 title abstract 3
- 230000006978 adaptation Effects 0.000 abstract 2
- 230000001419 dependent effect Effects 0.000 abstract 2
Classifications
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/02—Feature extraction for speech recognition; Selection of recognition unit
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/06—Creation of reference templates; Training of speech recognition systems, e.g. adaptation to the characteristics of the speaker's voice
- G10L15/065—Adaptation
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/28—Constructional details of speech recognition systems
- G10L15/30—Distributed recognition, e.g. in client-server systems, for mobile phones or network applications
Landscapes
- Engineering & Computer Science (AREA)
- Multimedia (AREA)
- Acoustics & Sound (AREA)
- Health & Medical Sciences (AREA)
- Audiology, Speech & Language Pathology (AREA)
- Human Computer Interaction (AREA)
- Physics & Mathematics (AREA)
- Computational Linguistics (AREA)
- Artificial Intelligence (AREA)
- Computer Vision & Pattern Recognition (AREA)
- Telephonic Communication Services (AREA)
- Image Analysis (AREA)
- Image Processing (AREA)
- Mobile Radio Communication Systems (AREA)
- Compression, Expansion, Code Conversion, And Decoders (AREA)
- Devices For Executing Special Programs (AREA)
Applications Claiming Priority (1)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| US09/773,831 US7024359B2 (en) | 2001-01-31 | 2001-01-31 | Distributed voice recognition system using acoustic feature vector modification |
Publications (1)
| Publication Number | Publication Date |
|---|---|
| ATE407420T1 true ATE407420T1 (de) | 2008-09-15 |
Family
ID=25099445
Family Applications (1)
| Application Number | Title | Priority Date | Filing Date |
|---|---|---|---|
| AT02702130T ATE407420T1 (de) | 2001-01-31 | 2002-01-30 | Verteiltes spracherkennungssystem unter verwendung von akustischer merkmalsvektor- modifizierung |
Country Status (11)
| Country | Link |
|---|---|
| US (1) | US7024359B2 (de) |
| EP (1) | EP1356453B1 (de) |
| JP (2) | JP4567290B2 (de) |
| KR (1) | KR100879410B1 (de) |
| CN (1) | CN1284133C (de) |
| AT (1) | ATE407420T1 (de) |
| AU (1) | AU2002235513A1 (de) |
| BR (1) | BR0206836A (de) |
| DE (1) | DE60228682D1 (de) |
| TW (1) | TW546633B (de) |
| WO (1) | WO2002065453A2 (de) |
Families Citing this family (48)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| USRE46109E1 (en) | 2001-03-29 | 2016-08-16 | Lg Electronics Inc. | Vehicle navigation system and method |
| US7392191B2 (en) * | 2001-03-29 | 2008-06-24 | Intellisist, Inc. | Method and device to distinguish between voice conversation and automated speech recognition |
| US20020143611A1 (en) * | 2001-03-29 | 2002-10-03 | Gilad Odinak | Vehicle parking validation system and method |
| US7406421B2 (en) * | 2001-10-26 | 2008-07-29 | Intellisist Inc. | Systems and methods for reviewing informational content in a vehicle |
| US6487494B2 (en) * | 2001-03-29 | 2002-11-26 | Wingcast, Llc | System and method for reducing the amount of repetitive data sent by a server to a client for vehicle navigation |
| US20050065779A1 (en) * | 2001-03-29 | 2005-03-24 | Gilad Odinak | Comprehensive multiple feature telematics system |
| US8175886B2 (en) | 2001-03-29 | 2012-05-08 | Intellisist, Inc. | Determination of signal-processing approach based on signal destination characteristics |
| US6885735B2 (en) * | 2001-03-29 | 2005-04-26 | Intellisist, Llc | System and method for transmitting voice input from a remote location over a wireless data channel |
| EP1293964A3 (de) * | 2001-09-13 | 2004-05-12 | Matsushita Electric Industrial Co., Ltd. | Anpassung eines Spracherkennungsverfahrens an bestimmte Benutzer und Umgebungen mit Datenübertragung zwischen einem Endgerät und einem Server |
| GB2391679B (en) * | 2002-02-04 | 2004-03-24 | Zentian Ltd | Speech recognition circuit using parallel processors |
| US8249880B2 (en) * | 2002-02-14 | 2012-08-21 | Intellisist, Inc. | Real-time display of system instructions |
| US7330538B2 (en) * | 2002-03-28 | 2008-02-12 | Gotvoice, Inc. | Closed-loop command and response system for automatic communications between interacting computer systems over an audio communications channel |
| US8239197B2 (en) | 2002-03-28 | 2012-08-07 | Intellisist, Inc. | Efficient conversion of voice messages into text |
| WO2003098946A1 (en) | 2002-05-16 | 2003-11-27 | Intellisist, Llc | System and method for dynamically configuring wireless network geographic coverage or service levels |
| TW567465B (en) * | 2002-09-02 | 2003-12-21 | Ind Tech Res Inst | Configurable distributed speech recognition system |
| GB0226648D0 (en) * | 2002-11-15 | 2002-12-24 | Koninkl Philips Electronics Nv | Usage data harvesting |
| US7533023B2 (en) * | 2003-02-12 | 2009-05-12 | Panasonic Corporation | Intermediary speech processor in network environments transforming customized speech parameters |
| DE10353068A1 (de) * | 2003-11-13 | 2005-06-23 | Voice Trust Ag | Verfahren zur Authentifizierung eines Benutzers anhand dessen Stimmprofils |
| US20050216266A1 (en) * | 2004-03-29 | 2005-09-29 | Yifan Gong | Incremental adjustment of state-dependent bias parameters for adaptive speech recognition |
| US7720012B1 (en) | 2004-07-09 | 2010-05-18 | Arrowhead Center, Inc. | Speaker identification in the presence of packet losses |
| GB2418764B (en) * | 2004-09-30 | 2008-04-09 | Fluency Voice Technology Ltd | Improving pattern recognition accuracy with distortions |
| US20060095261A1 (en) * | 2004-10-30 | 2006-05-04 | Ibm Corporation | Voice packet identification based on celp compression parameters |
| CN1811911B (zh) * | 2005-01-28 | 2010-06-23 | 北京捷通华声语音技术有限公司 | 自适应的语音变换处理方法 |
| JP4527679B2 (ja) | 2006-03-24 | 2010-08-18 | 学校法人早稲田大学 | 音声の類似度の評価を行う方法および装置 |
| US7725316B2 (en) * | 2006-07-05 | 2010-05-25 | General Motors Llc | Applying speech recognition adaptation in an automated speech recognition system of a telematics-equipped vehicle |
| JP4427530B2 (ja) * | 2006-09-21 | 2010-03-10 | 株式会社東芝 | 音声認識装置、プログラムおよび音声認識方法 |
| WO2008137616A1 (en) * | 2007-05-04 | 2008-11-13 | Nuance Communications, Inc. | Multi-class constrained maximum likelihood linear regression |
| US20090018826A1 (en) * | 2007-07-13 | 2009-01-15 | Berlin Andrew A | Methods, Systems and Devices for Speech Transduction |
| US8352265B1 (en) | 2007-12-24 | 2013-01-08 | Edward Lin | Hardware implemented backend search engine for a high-rate speech recognition system |
| US8639510B1 (en) | 2007-12-24 | 2014-01-28 | Kai Yu | Acoustic scoring unit implemented on a single FPGA or ASIC |
| US8463610B1 (en) | 2008-01-18 | 2013-06-11 | Patrick J. Bourke | Hardware-implemented scalable modular engine for low-power speech recognition |
| KR101217525B1 (ko) * | 2008-12-22 | 2013-01-18 | 한국전자통신연구원 | 비터비 디코더와 이를 이용한 음성 인식 방법 |
| US9418662B2 (en) * | 2009-01-21 | 2016-08-16 | Nokia Technologies Oy | Method, apparatus and computer program product for providing compound models for speech recognition adaptation |
| US8189925B2 (en) * | 2009-06-04 | 2012-05-29 | Microsoft Corporation | Geocoding by image matching |
| US8554562B2 (en) * | 2009-11-15 | 2013-10-08 | Nuance Communications, Inc. | Method and system for speaker diarization |
| CA2856496A1 (en) * | 2010-11-22 | 2012-05-31 | Listening Methods, Llc | System and method for pattern recognition and analysis |
| US10229701B2 (en) | 2013-02-28 | 2019-03-12 | Nuance Communications, Inc. | Server-side ASR adaptation to speaker, device and noise condition via non-ASR audio transmission |
| WO2014133525A1 (en) * | 2013-02-28 | 2014-09-04 | Nuance Communication, Inc. | Server-side asr adaptation to speaker, device and noise condition via non-asr audio transmission |
| US9282096B2 (en) | 2013-08-31 | 2016-03-08 | Steven Goldstein | Methods and systems for voice authentication service leveraging networking |
| US10405163B2 (en) | 2013-10-06 | 2019-09-03 | Staton Techiya, Llc | Methods and systems for establishing and maintaining presence information of neighboring bluetooth devices |
| US20170092278A1 (en) * | 2015-09-30 | 2017-03-30 | Apple Inc. | Speaker recognition |
| WO2017216786A1 (en) * | 2016-06-14 | 2017-12-21 | Omry Netzer | Automatic speech recognition |
| CN106782504B (zh) * | 2016-12-29 | 2019-01-22 | 百度在线网络技术(北京)有限公司 | 语音识别方法和装置 |
| EP3719679B1 (de) * | 2019-04-03 | 2021-06-09 | Fondation de L'institut de Recherche Idiap | Verfahren zum schutz biometrischer vorlagen sowie system und verfahren zur überprüfung der identität eines sprechers |
| US11545132B2 (en) | 2019-08-28 | 2023-01-03 | International Business Machines Corporation | Speech characterization using a synthesized reference audio signal |
| US11238847B2 (en) | 2019-12-04 | 2022-02-01 | Google Llc | Speaker awareness using speaker dependent speech model(s) |
| CN113345428B (zh) * | 2021-06-04 | 2023-08-04 | 北京华捷艾米科技有限公司 | 语音识别模型的匹配方法、装置、设备和存储介质 |
| WO2024111023A1 (ja) * | 2022-11-21 | 2024-05-30 | 楽天グループ株式会社 | 情報処理システム、情報処理方法、及び情報処理プログラム |
Family Cites Families (13)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| US4926488A (en) * | 1987-07-09 | 1990-05-15 | International Business Machines Corporation | Normalization of speech by adaptive labelling |
| JP2980382B2 (ja) * | 1990-12-19 | 1999-11-22 | 富士通株式会社 | 話者適応音声認識方法および装置 |
| JPH06214596A (ja) * | 1993-01-14 | 1994-08-05 | Ricoh Co Ltd | 音声認識装置および話者適応化方法 |
| JP3413861B2 (ja) * | 1993-01-18 | 2003-06-09 | ヤマハ株式会社 | 電子楽器の鍵盤装置 |
| ZA948426B (en) | 1993-12-22 | 1995-06-30 | Qualcomm Inc | Distributed voice recognition system |
| JPH07210190A (ja) | 1993-12-30 | 1995-08-11 | Internatl Business Mach Corp <Ibm> | 音声認識方法及びシステム |
| US5864810A (en) * | 1995-01-20 | 1999-01-26 | Sri International | Method and apparatus for speech recognition adapted to an individual speaker |
| JP3697748B2 (ja) | 1995-08-21 | 2005-09-21 | セイコーエプソン株式会社 | 端末、音声認識装置 |
| JP3001037B2 (ja) * | 1995-12-13 | 2000-01-17 | 日本電気株式会社 | 音声認識装置 |
| DE69822296T2 (de) | 1997-10-20 | 2005-02-24 | Koninklijke Philips Electronics N.V. | Mustererkennungsregistrierung in einem verteilten system |
| JP2000276188A (ja) * | 1999-03-24 | 2000-10-06 | Sony Corp | 音声認識装置、音声認識方法、音声認識用制御プログラムを記録した記録媒体、通信端末装置、通信方法、音声認識通信の制御用プログラムを記録した記録媒体、サーバ装置、音声認識用データの送受信方法及び音声認識用データの送受信制御プログラムを記録した記録媒体 |
| JP3456444B2 (ja) * | 1999-05-10 | 2003-10-14 | 日本電気株式会社 | 音声判定装置及び方法並びに記録媒体 |
| US6421641B1 (en) * | 1999-11-12 | 2002-07-16 | International Business Machines Corporation | Methods and apparatus for fast adaptation of a band-quantized speech decoding system |
-
2001
- 2001-01-31 US US09/773,831 patent/US7024359B2/en not_active Expired - Lifetime
-
2002
- 2002-01-30 EP EP02702130A patent/EP1356453B1/de not_active Expired - Lifetime
- 2002-01-30 TW TW091101575A patent/TW546633B/zh not_active IP Right Cessation
- 2002-01-30 KR KR1020037010130A patent/KR100879410B1/ko not_active Expired - Lifetime
- 2002-01-30 JP JP2002565298A patent/JP4567290B2/ja not_active Expired - Lifetime
- 2002-01-30 AU AU2002235513A patent/AU2002235513A1/en not_active Abandoned
- 2002-01-30 CN CNB028060687A patent/CN1284133C/zh not_active Expired - Lifetime
- 2002-01-30 BR BR0206836-2A patent/BR0206836A/pt unknown
- 2002-01-30 AT AT02702130T patent/ATE407420T1/de not_active IP Right Cessation
- 2002-01-30 WO PCT/US2002/003014 patent/WO2002065453A2/en not_active Ceased
- 2002-01-30 DE DE60228682T patent/DE60228682D1/de not_active Expired - Lifetime
-
2009
- 2009-01-14 JP JP2009006033A patent/JP4976432B2/ja not_active Expired - Lifetime
Also Published As
| Publication number | Publication date |
|---|---|
| US20020103639A1 (en) | 2002-08-01 |
| US7024359B2 (en) | 2006-04-04 |
| DE60228682D1 (de) | 2008-10-16 |
| CN1284133C (zh) | 2006-11-08 |
| JP4567290B2 (ja) | 2010-10-20 |
| KR100879410B1 (ko) | 2009-01-19 |
| EP1356453B1 (de) | 2008-09-03 |
| JP2004536330A (ja) | 2004-12-02 |
| JP2009151318A (ja) | 2009-07-09 |
| EP1356453A2 (de) | 2003-10-29 |
| WO2002065453A2 (en) | 2002-08-22 |
| TW546633B (en) | 2003-08-11 |
| AU2002235513A1 (en) | 2002-08-28 |
| JP4976432B2 (ja) | 2012-07-18 |
| HK1062738A1 (en) | 2004-11-19 |
| KR20040062433A (ko) | 2004-07-07 |
| CN1494712A (zh) | 2004-05-05 |
| BR0206836A (pt) | 2006-01-17 |
| WO2002065453A3 (en) | 2002-10-24 |
Similar Documents
| Publication | Publication Date | Title |
|---|---|---|
| ATE407420T1 (de) | Verteiltes spracherkennungssystem unter verwendung von akustischer merkmalsvektor- modifizierung | |
| DE60125542D1 (de) | System und verfahren zur spracherkennung mit einer vielzahl von spracherkennungsvorrichtungen | |
| DK1374223T3 (da) | Stemmegenkendelsessystem, der gör brug af implicit talertilpasning | |
| DE60124408D1 (de) | Kombination von digitaler zeitverschiebung und hmm in sprecherabhängiger- und sprecherunabhängiger weise für die spracherkennung | |
| GB0207343D0 (en) | Signal processing system | |
| ATE297588T1 (de) | Anpassung des phonetischen kontextes zur verbesserung der spracherkennung | |
| AU3164800A (en) | Recognition engines with complementary language models | |
| GB2366434A (en) | Selective speaker adaption for an in-vehicle speech recognition system | |
| WO2003019528A1 (en) | Intonation generating method, speech synthesizing device by the method, and voice server | |
| WO2004090866A3 (en) | Phonetically based speech recognition system and method | |
| DE69822179D1 (de) | Verfahren zum lernen von mustern für die sprach- oder die sprechererkennung | |
| WO2006033044A3 (en) | Method of training a robust speaker-dependent speech recognition system with speaker-dependent expressions and robust speaker-dependent speech recognition system | |
| WO2007117814A3 (en) | Voice signal perturbation for speech recognition | |
| TW200601263A (en) | Apparatus and method for synthesized audible response to an utterance in speaker-independent voice recognition | |
| WO2003036617A1 (en) | Speech recognition apparatus and speech recognition method | |
| DE60004331D1 (de) | Sprecher-erkennung | |
| DE69413912D1 (de) | Sprachumsetzungsverfahren | |
| DE602005009091D1 (de) | Erzeugen einer Spracherkennungsgrammatik für alphanumerische Ausdrücke | |
| DE50003680D1 (de) | Verfahren zur sprachgesteuerten identifizierung des nutzers eines telekommunikationsanschlusses im telekommunikationsnetz beim dialog mit einem sprachgesteuerten dialogsystem | |
| PL342208A1 (en) | Method of increasing probability speech recognition in speech recognising system | |
| DE50106405D1 (de) | Sprachgeführtes gerätesteuerungsverfahren mit einer optimierung für einen benutzer | |
| DE59802584D1 (de) | Vefahren zur spracherkennung unter verwendung von einer grammatik | |
| Books | User-Customized Password HMM Based Speaker Verification, BenZeghiba, Mohamed Faouzi and Bourlard, Hervé, Idiap-RR-35-2002 | |
| Books | Type of publication: Idiap-RR Citation: BenZeghiba-02a Number: Idiap-RR-10-2002 Year: 2002 Institution: IDIAP | |
| MXPA05006672A (es) | Sistema y metodo de reconocimiento de voz. |
Legal Events
| Date | Code | Title | Description |
|---|---|---|---|
| RER | Ceased as to paragraph 5 lit. 3 law introducing patent treaties |