ATE272885T1 - Multimodaler Sprachkodierer (Multimode speech encoder)
Info
- Publication number
- ATE272885T1
- Authority
- AT
- Austria
- Prior art keywords
- rate
- speech
- compression system
- rate codec
- codec
- Prior art date
Classifications
- G—PHYSICS
  - G10—MUSICAL INSTRUMENTS; ACOUSTICS
    - G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
      - G10L19/00—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
        - G10L19/04—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using predictive techniques
          - G10L19/16—Vocoder architecture
            - G10L19/167—Audio streaming, i.e. formatting and decoding of an encoded audio signal representation into a data stream for transmission or storage purposes
            - G10L19/18—Vocoders using multiple modes
              - G10L19/24—Variable rate codecs, e.g. for generating different qualities using a scalable representation such as hierarchical encoding or layered encoding
- H—ELECTRICITY
  - H03—ELECTRONIC CIRCUITRY
    - H03G—CONTROL OF AMPLIFICATION
      - H03G3/00—Gain control in amplifiers or frequency changers
Landscapes
- Engineering & Computer Science (AREA)
- Multimedia (AREA)
- Computational Linguistics (AREA)
- Health & Medical Sciences (AREA)
- Audiology, Speech & Language Pathology (AREA)
- Human Computer Interaction (AREA)
- Physics & Mathematics (AREA)
- Signal Processing (AREA)
- Acoustics & Sound (AREA)
- Quality & Reliability (AREA)
- Compression, Expansion, Code Conversion, And Decoders (AREA)
- Transmission Systems Not Characterized By The Medium Used For Transmission (AREA)
- Lubricants (AREA)
- Ink Jet (AREA)
- Graft Or Block Polymers (AREA)
- Reduction Or Emphasis Of Bandwidth Of Signals (AREA)
Applications Claiming Priority (3)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| US15532199P | 1999-09-22 | 1999-09-22 | |
| US09/574,396 US6782360B1 (en) | 1999-09-22 | 2000-05-19 | Gain quantization for a CELP speech coder |
| PCT/US2000/025182 WO2001022402A1 (en) | 1999-09-22 | 2000-09-15 | Multimode speech encoder |
Publications (1)
| Publication Number | Publication Date |
|---|---|
| ATE272885T1 (de) | 2004-08-15 |
Family
ID=26852220
Family Applications (1)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| AT00963447T ATE272885T1 (de) | 1999-09-22 | 2000-09-15 | Multimodaler Sprachkodierer (Multimode speech encoder) |
Country Status (8)
| Country | Link |
|---|---|
| EP (1) | EP1214706B9 (de) |
| JP (2) | JP4176349B2 (de) |
| KR (1) | KR100488080B1 (de) |
| CN (1) | CN1245706C (de) |
| AT (1) | ATE272885T1 (de) |
| AU (1) | AU7486200A (de) |
| BR (1) | BRPI0014212B1 (de) |
| DE (1) | DE60012760T2 (de) |
Families Citing this family (29)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| KR100463418B1 (ko) * | 2002-11-11 | 2004-12-23 | 한국전자통신연구원 | Variable fixed codebook search method and apparatus for use in a CELP speech coder |
| CA2415105A1 (en) * | 2002-12-24 | 2004-06-24 | Voiceage Corporation | A method and device for robust predictive vector quantization of linear prediction parameters in variable bit rate speech coding |
| FR2867649A1 (fr) * | 2003-12-10 | 2005-09-16 | France Telecom | Optimized multiple coding method |
| CN101138174B (zh) * | 2005-03-14 | 2013-04-24 | 松下电器产业株式会社 | Scalable decoding apparatus and scalable decoding method |
| US7177804B2 (en) * | 2005-05-31 | 2007-02-13 | Microsoft Corporation | Sub-band voice codec with multi-stage codebooks and redundant coding |
| CN101371295B (zh) * | 2006-01-18 | 2011-12-21 | Lg电子株式会社 | Apparatus and method for encoding and decoding a signal |
| US8451915B2 (en) | 2007-03-21 | 2013-05-28 | Samsung Electronics Co., Ltd. | Efficient uplink feedback in a wireless communication system |
| KR20100006492A (ko) * | 2008-07-09 | 2010-01-19 | 삼성전자주식회사 | Method and apparatus for determining an encoding scheme |
| JP5710476B2 (ja) | 2008-07-10 | 2015-04-30 | ヴォイスエイジ・コーポレーション | Device and method for quantizing and inverse quantizing LPC filters in a super-frame |
| KR101170466B1 (ko) | 2008-07-29 | 2012-08-03 | 한국전자통신연구원 | Post-processing method and apparatus in the MDCT domain |
| JP2010122617A (ja) | 2008-11-21 | 2010-06-03 | Yamaha Corp | Noise gate and sound collecting device |
| JP2010160496A (ja) * | 2010-02-15 | 2010-07-22 | Toshiba Corp | Signal processing apparatus and signal processing method |
| US9047875B2 (en) * | 2010-07-19 | 2015-06-02 | Futurewei Technologies, Inc. | Spectrum flatness control for bandwidth extension |
| EP3686888B1 (de) * | 2011-02-15 | 2025-04-02 | VoiceAge EVS LLC | Device and method for quantizing the gains of the adaptive and fixed contributions of the excitation in a CELP codec |
| US9626982B2 (en) | 2011-02-15 | 2017-04-18 | Voiceage Corporation | Device and method for quantizing the gains of the adaptive and fixed contributions of the excitation in a CELP codec |
| US9026434B2 (en) * | 2011-04-11 | 2015-05-05 | Samsung Electronic Co., Ltd. | Frame erasure concealment for a multi rate speech and audio codec |
| US9336789B2 (en) * | 2013-02-21 | 2016-05-10 | Qualcomm Incorporated | Systems and methods for determining an interpolation factor set for synthesizing a speech signal |
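| CN104517612B (zh) * | 2013-09-30 | 2018-10-12 | 上海爱聊信息科技有限公司 | Variable bit-rate encoder and decoder based on AMR-NB speech signals, and encoding and decoding methods thereof |
| JP5981408B2 (ja) * | 2013-10-29 | 2016-08-31 | 株式会社Nttドコモ | Audio signal processing device, audio signal processing method, and audio signal processing program |
| KR102745244B1 (ko) | 2014-03-28 | 2024-12-20 | 삼성전자주식회사 | Method and apparatus for quantizing linear prediction coefficients, and method and apparatus for inverse quantization |
| CN112927702B (zh) * | 2014-05-07 | 2024-11-12 | 三星电子株式会社 | Method and apparatus for quantizing linear prediction coefficients, and method and apparatus for dequantization |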
| MY203900A (en) * | 2014-07-28 | 2024-07-23 | Ericsson Telefon Ab L M | Pyramid vector quantizer shape search |
| US10109284B2 (en) * | 2016-02-12 | 2018-10-23 | Qualcomm Incorporated | Inter-channel encoding and decoding of multiple high-band audio signals |
| US10373630B2 (en) * | 2017-03-31 | 2019-08-06 | Intel Corporation | Systems and methods for energy efficient and low power distributed automatic speech recognition on wearable devices |
| CN111183476B (zh) * | 2017-10-06 | 2024-03-22 | 索尼欧洲有限公司 | Audio file envelope based on RMS power within sequences of sub-windows |
| CN108122552B (zh) * | 2017-12-15 | 2021-10-15 | 上海智臻智能网络科技股份有限公司 | Speech emotion recognition method and apparatus |
| WO2021029642A1 (en) * | 2019-08-13 | 2021-02-18 | Samsung Electronics Co., Ltd. | System and method for recognizing user's speech |
| CN113593521B (zh) * | 2021-07-29 | 2022-09-20 | 北京三快在线科技有限公司 | Speech synthesis method, apparatus, device, and readable storage medium |
| CN118430508B (zh) * | 2024-05-29 | 2024-09-17 | 中国矿业大学 | Speech synthesis method based on a neural audio codec |
Family Cites Families (2)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| JP3353852B2 (ja) * | 1994-02-15 | 2002-12-03 | 日本電信電話株式会社 | Speech coding method |
| US5701390A (en) * | 1995-02-22 | 1997-12-23 | Digital Voice Systems, Inc. | Synthesis of MBE-based coded speech using regenerated phase information |
| Date | Country | Application | Publication | Status |
|---|---|---|---|---|
| 2000-09-12 | AU | AU74862/00A | AU7486200A | Abandoned (not active) |
| 2000-09-15 | KR | KR10-2002-7003768A | KR100488080B1 | Expired - Lifetime (not active) |
| 2000-09-15 | DE | DE60012760T | DE60012760T2 | Expired - Lifetime (not active) |
| 2000-09-15 | CN | CNB008159408A | CN1245706C | Expired - Fee Related (not active) |
| 2000-09-15 | AT | AT00963447T | ATE272885T1 | IP Right Cessation (not active) |
| 2000-09-15 | JP | JP2001525686A | JP4176349B2 | Expired - Fee Related (not active) |
| 2000-09-15 | BR | BRPI0014212A | BRPI0014212B1 | IP Right Cessation (not active) |
| 2000-09-15 | EP | EP00963447A | EP1214706B9 | Expired - Lifetime (not active) |
| 2005-07-11 | JP | JP2005202337A | JP2005338872A | Pending (active) |
Also Published As
| Publication number | Publication date |
|---|---|
| BRPI0014212B1 (pt) | 2016-07-26 |
| JP4176349B2 (ja) | 2008-11-05 |
| JP2003513296A (ja) | 2003-04-08 |
| EP1214706A1 (de) | 2002-06-19 |
| CN1451155A (zh) | 2003-10-22 |
| AU7486200A (en) | 2001-04-24 |
| KR100488080B1 (ko) | 2005-05-06 |
| CN1245706C (zh) | 2006-03-15 |
| DE60012760D1 (de) | 2004-09-09 |
| EP1214706B9 (de) | 2005-01-05 |
| JP2005338872A (ja) | 2005-12-08 |
| EP1214706B1 (de) | 2004-08-04 |
| KR20020033819A (ko) | 2002-05-07 |
| BR0014212A (pt) | 2003-06-10 |
| DE60012760T2 (de) | 2005-08-04 |
Similar Documents
| Publication | Title | Publication Date |
|---|---|---|
| ATE272885T1 (de) | Multimodaler Sprachkodierer (Multimode speech encoder) | |
| AU2001287969A1 (en) | Codebook structure and search for speech coding | |
| US7596486B2 (en) | Encoding an audio signal using different audio coder modes | |
| AU2003278014A8 (en) | Methods for interoperation between adaptive multi-rate wideband (AMR-WB) and multi-mode variable bit-rate wideband (VMR-WB) speech codecs | |
| CA2096991A1 (en) | Celp-based speech compressor | |
| CA2306098A1 (en) | Multimode speech coding apparatus and decoding apparatus | |
| DE60024123D1 (de) | LPC-harmonic speech coder with superframe format | |
| BR0304540A (pt) | Methods for encoding an audio signal and for decoding an encoded audio signal, encoder for encoding an audio signal, apparatus for supplying an audio signal, encoded audio signal, storage medium, and decoder for decoding an encoded audio signal | |
| CA2611829A1 (en) | Sub-band voice codec with multi-stage codebooks and redundant coding | |
| CN101141644B (zh) | Integrated encoding system and method, and integrated decoding system and method | |
| CN101087319B (zh) | Method and apparatus for sending and receiving background noise, and silence compression system | |
| EP1204092A3 (de) | Speech decoder with reproduction of background noise | |
| DE60027140D1 (de) | Speech synthesizer based on variable bit-rate speech coding | |
| Choudhary et al. | Study and performance of amr codecs for gsm | |
| WO2002023533A3 (en) | System for improved use of pitch enhancement with subcodebooks | |
| Wang et al. | Transcoding Scheme between AMR-WB and VMR-WB | |
| PL1756806T3 (pl) | Quantization method for a very low bit-rate speech coder | |
| BRPI0520115A2 (pt) | Methods for encoding and decoding audio signals, and encoder and decoder for audio signals | |
| Srinonchat et al. | New Bit Rate CELP coder for Speaker Dependent Coding System | |
| Ozawa et al. | M-LCELP speech coding at bit-rates below 4kbps | |
| Shikui et al. | Speech transcoding from AMR to G.729 in excitation domain | |
Legal Events
| Date | Code | Title | Description |
|---|---|---|---|
| | RER | Ceased as to paragraph 5 lit. 3 law introducing patent treaties | |