ATE464635T1 - Method for generating and using a vector codebook, method and device for compressing data, and distributed speech recognition system - Google Patents
Method for generating and using a vector codebook, method and device for compressing data, and distributed speech recognition system
- Publication number
- ATE464635T1
- Authority
- AT
- Austria
- Prior art keywords
- sub
- sets
- generating
- compressing data
- feature
Classifications
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/28—Constructional details of speech recognition systems
- G10L15/30—Distributed recognition, e.g. in client-server systems, for mobile phones or network applications
- G10L15/06—Creation of reference templates; Training of speech recognition systems, e.g. adaptation to the characteristics of the speaker's voice
Landscapes
- Engineering & Computer Science (AREA)
- Computational Linguistics (AREA)
- Health & Medical Sciences (AREA)
- Audiology, Speech & Language Pathology (AREA)
- Human Computer Interaction (AREA)
- Physics & Mathematics (AREA)
- Acoustics & Sound (AREA)
- Multimedia (AREA)
- Artificial Intelligence (AREA)
- Compression, Expansion, Code Conversion, And Decoders (AREA)
Applications Claiming Priority (1)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| PCT/EP2004/008372 WO2006007871A1 (en) | 2004-07-23 | 2004-07-23 | Method for generating a vector codebook, method and device for compressing data, and distributed speech recognition system |
Publications (1)
| Publication Number | Publication Date |
|---|---|
| ATE464635T1 (de) | 2010-04-15 |
Family
ID=34958455
Family Applications (1)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| AT04763512T ATE464635T1 (de) | 2004-07-23 | 2004-07-23 | Method for generating and using a vector codebook, method and device for compressing data, and distributed speech recognition system |
Country Status (8)
| Country | Link |
|---|---|
| US (1) | US8214204B2 (de) |
| EP (1) | EP1771841B1 (de) |
| JP (1) | JP4703648B2 (de) |
| KR (1) | KR101010585B1 (de) |
| CN (1) | CN101019171B (de) |
| AT (1) | ATE464635T1 (de) |
| DE (1) | DE602004026645D1 (de) |
| WO (1) | WO2006007871A1 (de) |
Families Citing this family (12)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| US7587314B2 (en) * | 2005-08-29 | 2009-09-08 | Nokia Corporation | Single-codebook vector quantization for multiple-rate applications |
| US20070299667A1 (en) * | 2006-06-22 | 2007-12-27 | Texas Instruments, Incorporated | System and method for reducing storage requirements for a model containing mixed weighted distributions and automatic speech recognition model incorporating the same |
| CN101335004B (zh) * | 2007-11-02 | 2010-04-21 | 华为技术有限公司 | Method and device for multi-stage quantization |
| GB0901262D0 (en) * | 2009-01-26 | 2009-03-11 | Mitsubishi Elec R&D Ct Europe | Video identification |
| KR101711158B1 (ko) * | 2010-12-22 | 2017-03-14 | 한국전자통신연구원 | Method for controlling interference between adjacent cells in a cellular system |
| US9779731B1 (en) * | 2012-08-20 | 2017-10-03 | Amazon Technologies, Inc. | Echo cancellation based on shared reference signals |
| US10147441B1 (en) | 2013-12-19 | 2018-12-04 | Amazon Technologies, Inc. | Voice controlled system |
| CN103837890B (zh) * | 2014-02-26 | 2016-07-06 | 中国石油集团川庆钻探工程有限公司地球物理勘探公司 | Method and device for acquiring seismic data |
| CA3001839C (en) * | 2015-10-14 | 2018-10-23 | Pindrop Security, Inc. | Call detail record analysis to identify fraudulent activity and fraud detection in interactive voice response systems |
| CN107564535B (zh) * | 2017-08-29 | 2020-09-01 | 中国人民解放军理工大学 | Distributed low-rate voice call method |
| US11470194B2 (en) | 2019-08-19 | 2022-10-11 | Pindrop Security, Inc. | Caller verification via carrier metadata |
| CN112445943B (zh) * | 2019-09-05 | 2025-03-14 | 阿里巴巴集团控股有限公司 | Data processing method, device, and system |
Family Cites Families (20)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| US4958225A (en) * | 1989-06-09 | 1990-09-18 | Utah State University Foundation | Full-search-equivalent method for matching data and a vector quantizer utilizing such method |
| US5061924B1 (en) | 1991-01-25 | 1996-04-30 | American Telephone & Telegraph | Efficient vector codebook |
| US5651026A (en) * | 1992-06-01 | 1997-07-22 | Hughes Electronics | Robust vector quantization of line spectral frequencies |
| JP3093879B2 (ja) * | 1992-07-27 | 2000-10-03 | オリンパス光学工業株式会社 | Vector quantization codebook creation and search device |
| US5774839A (en) * | 1995-09-29 | 1998-06-30 | Rockwell International Corporation | Delayed decision switched prediction multi-stage LSF vector quantization |
| GB9622055D0 (en) * | 1996-10-23 | 1996-12-18 | Univ Strathclyde | Vector quantisation |
| US6009387A (en) * | 1997-03-20 | 1999-12-28 | International Business Machines Corporation | System and method of compression/decompressing a speech signal by using split vector quantization and scalar quantization |
| US6161086A (en) * | 1997-07-29 | 2000-12-12 | Texas Instruments Incorporated | Low-complexity speech coding with backward and inverse filtered target matching and a tree structured mutitap adaptive codebook search |
| US5946653A (en) | 1997-10-01 | 1999-08-31 | Motorola, Inc. | Speaker independent speech recognition system and method |
| US6067515A (en) * | 1997-10-27 | 2000-05-23 | Advanced Micro Devices, Inc. | Split matrix quantization with split vector quantization error compensation and selective enhanced processing for robust speech recognition |
| US5966688A (en) * | 1997-10-28 | 1999-10-12 | Hughes Electronics Corporation | Speech mode based multi-stage vector quantizer |
| US6148283A (en) * | 1998-09-23 | 2000-11-14 | Qualcomm Inc. | Method and apparatus using multi-path multi-stage vector quantizer |
| AU1445100A (en) | 1998-10-13 | 2000-05-01 | Hadasit Medical Research Services & Development Company Ltd | Method and system for determining a vector index to represent a plurality of speech parameters in signal processing for identifying an utterance |
| US7389227B2 (en) * | 2000-01-14 | 2008-06-17 | C & S Technology Co., Ltd. | High-speed search method for LSP quantizer using split VQ and fixed codebook of G.729 speech encoder |
| JP3483513B2 (ja) * | 2000-03-02 | 2004-01-06 | 沖電気工業株式会社 | Voice recording and playback device |
| JP3367931B2 (ja) * | 2000-03-06 | 2003-01-20 | 日本電信電話株式会社 | Conjugate structure vector quantization method |
| US6633839B2 (en) * | 2001-02-02 | 2003-10-14 | Motorola, Inc. | Method and apparatus for speech reconstruction in a distributed speech recognition system |
| US7003454B2 (en) * | 2001-05-16 | 2006-02-21 | Nokia Corporation | Method and system for line spectral frequency vector quantization in speech codec |
| CN1190772C (zh) * | 2002-09-30 | 2005-02-23 | 中国科学院声学研究所 | Speech recognition system and method for compressing feature vector sets for a speech recognition system |
| US20040176950A1 (en) * | 2003-03-04 | 2004-09-09 | Docomo Communications Laboratories Usa, Inc. | Methods and apparatuses for variable dimension vector quantization |
2004
- 2004-07-23 CN application CN2004800439812A, patent CN101019171B (zh): not active, Expired - Fee Related
- 2004-07-23 WO application PCT/EP2004/008372, patent WO2006007871A1 (en): not active, Ceased
- 2004-07-23 US application US11/658,090, patent US8214204B2 (en): not active, Expired - Fee Related
- 2004-07-23 KR application KR1020077004401A, patent KR101010585B1 (ko): not active, Expired - Fee Related
- 2004-07-23 AT application AT04763512T, patent ATE464635T1 (de): not active, IP Right Cessation
- 2004-07-23 JP application JP2007521800A, patent JP4703648B2 (ja): not active, Expired - Fee Related
- 2004-07-23 DE application DE602004026645T, patent DE602004026645D1 (de): not active, Expired - Lifetime
- 2004-07-23 EP application EP04763512A, patent EP1771841B1 (de): not active, Expired - Lifetime
Also Published As
| Publication number | Publication date |
|---|---|
| JP4703648B2 (ja) | 2011-06-15 |
| CN101019171A (zh) | 2007-08-15 |
| JP2008507718A (ja) | 2008-03-13 |
| EP1771841A1 (de) | 2007-04-11 |
| KR101010585B1 (ko) | 2011-01-24 |
| KR20070047795A (ko) | 2007-05-07 |
| US8214204B2 (en) | 2012-07-03 |
| DE602004026645D1 (de) | 2010-05-27 |
| WO2006007871A8 (en) | 2006-03-16 |
| EP1771841B1 (de) | 2010-04-14 |
| CN101019171B (zh) | 2011-08-10 |
| WO2006007871A1 (en) | 2006-01-26 |
| US20090037172A1 (en) | 2009-02-05 |
Similar Documents
| Publication | Title | Publication Date |
|---|---|---|
| US7835910B1 (en) | Exploiting unlabeled utterances for spoken language understanding | |
| Yamamoto et al. | Probability density distillation with generative adversarial networks for high-quality parallel waveform generation | |
| ATE464635T1 (de) | Method for generating and using a vector codebook, method and device for compressing data, and distributed speech recognition system | |
| EP4531037A3 (de) | End-to-end speech conversion | |
| WO2019191556A1 (en) | Knowledge transfer in permutation invariant training for single-channel multi-talker speech recognition | |
| EP4235369A3 (de) | Modality learning on mobile devices | |
| ATE417346T1 (de) | Speech recognition and correction system, correction device, and method for creating a lexicon of alternatives | |
| DE602004017024D1 (de) | System and method for operating a speech recognition system in a vehicle | |
| ATE297588T1 (de) | Adaptation of the phonetic context to improve speech recognition | |
| CN108986798B (zh) | Method, apparatus, and device for processing speech data | |
| CN103000172A (zh) | Signal classification method and device | |
| CN107369451B (zh) | Bird sound recognition method to assist phenological research on the bird breeding period | |
| DE69923026D1 (de) | Speaker and environment adaptation based on voice eigenvectors and the maximum likelihood method | |
| MX2008002500A (es) | Incorporation of speech training into an interactive user tutorial. | |
| MX2025000752A (es) | Context coding for information on the transform kernel set in an image coding system | |
| CA2947957A1 (en) | Method for forming the excitation signal for a glottal pulse model based parametric speech synthesis system | |
| Tu et al. | Mutual information enhanced training for speaker embedding | |
| Weninger et al. | Recognition of nonprototypical emotions in reverberated and noisy speech by nonnegative matrix factorization | |
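| CN106875944A (zh) | System for voice control of a smart home terminal | |
| CN111008531A (zh) | Training method and device for a sentence word-selection model, and sentence word-selection method and device | |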
| US8462984B2 (en) | Data pattern recognition and separation engine | |
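| CN110544472B (zh) | Method for improving the performance of speech tasks that use a CNN network structure | |
| CN111640450A (zh) | Multi-voice audio processing method, apparatus, device, and readable storage medium | |
| CN105336327B (zh) | Gain control method and device for audio data | |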
| Kawamura et al. | BitTTS: Highly Compact Text-to-Speech Using 1.58-bit Quantization and Weight Indexing | |
Legal Events
| Date | Code | Title | Description |
|---|---|---|---|
| | RER | Ceased as to paragraph 5 lit. 3 law introducing patent treaties | |