ATE272885T1 - Multimodaler sprachkodierer - Google Patents

Multimodaler sprachkodierer

Info

Publication number
ATE272885T1
ATE272885T1 AT00963447T AT00963447T ATE272885T1 AT E272885 T1 ATE272885 T1 AT E272885T1 AT 00963447 T AT00963447 T AT 00963447T AT 00963447 T AT00963447 T AT 00963447T AT E272885 T1 ATE272885 T1 AT E272885T1
Authority
AT
Austria
Prior art keywords
rate
speech
compression system
rate codec
codec
Prior art date
Application number
AT00963447T
Other languages
English (en)
Inventor
Yang Gao
Adil Benyassine
Jes Thyssen
Eyal Sholomot
Huan-Yu Su
Original Assignee
Conexant Systems Inc
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Priority claimed from US09/574,396 external-priority patent/US6782360B1/en
Application filed by Conexant Systems Inc filed Critical Conexant Systems Inc
Application granted granted Critical
Publication of ATE272885T1 publication Critical patent/ATE272885T1/de

Links

Classifications

    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L19/00Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L19/00Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
    • G10L19/04Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using predictive techniques
    • G10L19/16Vocoder architecture
    • G10L19/18Vocoders using multiple modes
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L19/00Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
    • G10L19/04Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using predictive techniques
    • G10L19/16Vocoder architecture
    • G10L19/167Audio streaming, i.e. formatting and decoding of an encoded audio signal representation into a data stream for transmission or storage purposes
    • HELECTRICITY
    • H03ELECTRONIC CIRCUITRY
    • H03GCONTROL OF AMPLIFICATION
    • H03G3/00Gain control in amplifiers or frequency changers
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L19/00Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
    • G10L19/04Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using predictive techniques
    • G10L19/16Vocoder architecture
    • G10L19/18Vocoders using multiple modes
    • G10L19/24Variable rate codecs, e.g. for generating different qualities using a scalable representation such as hierarchical encoding or layered encoding

Landscapes

  • Engineering & Computer Science (AREA)
  • Multimedia (AREA)
  • Computational Linguistics (AREA)
  • Health & Medical Sciences (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Human Computer Interaction (AREA)
  • Physics & Mathematics (AREA)
  • Signal Processing (AREA)
  • Acoustics & Sound (AREA)
  • Quality & Reliability (AREA)
  • Compression, Expansion, Code Conversion, And Decoders (AREA)
  • Transmission Systems Not Characterized By The Medium Used For Transmission (AREA)
  • Lubricants (AREA)
  • Ink Jet (AREA)
  • Graft Or Block Polymers (AREA)
  • Reduction Or Emphasis Of Bandwidth Of Signals (AREA)
AT00963447T 1999-09-22 2000-09-15 Multimodaler sprachkodierer ATE272885T1 (de)

Applications Claiming Priority (3)

Application Number Priority Date Filing Date Title
US15532199P 1999-09-22 1999-09-22
US09/574,396 US6782360B1 (en) 1999-09-22 2000-05-19 Gain quantization for a CELP speech coder
PCT/US2000/025182 WO2001022402A1 (en) 1999-09-22 2000-09-15 Multimode speech encoder

Publications (1)

Publication Number Publication Date
ATE272885T1 true ATE272885T1 (de) 2004-08-15

Family

ID=26852220

Family Applications (1)

Application Number Title Priority Date Filing Date
AT00963447T ATE272885T1 (de) 1999-09-22 2000-09-15 Multimodaler sprachkodierer

Country Status (8)

Country Link
EP (1) EP1214706B9 (de)
JP (2) JP4176349B2 (de)
KR (1) KR100488080B1 (de)
CN (1) CN1245706C (de)
AT (1) ATE272885T1 (de)
AU (1) AU7486200A (de)
BR (1) BRPI0014212B1 (de)
DE (1) DE60012760T2 (de)

Families Citing this family (29)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
KR100463418B1 (ko) * 2002-11-11 2004-12-23 한국전자통신연구원 Celp 음성 부호화기에서 사용되는 가변적인 고정코드북 검색방법 및 장치
CA2415105A1 (en) * 2002-12-24 2004-06-24 Voiceage Corporation A method and device for robust predictive vector quantization of linear prediction parameters in variable bit rate speech coding
FR2867649A1 (fr) * 2003-12-10 2005-09-16 France Telecom Procede de codage multiple optimise
CN101138174B (zh) * 2005-03-14 2013-04-24 松下电器产业株式会社 可扩展解码装置和可扩展解码方法
US7177804B2 (en) * 2005-05-31 2007-02-13 Microsoft Corporation Sub-band voice codec with multi-stage codebooks and redundant coding
CN101371295B (zh) * 2006-01-18 2011-12-21 Lg电子株式会社 用于编码和解码信号的设备和方法
US8451915B2 (en) 2007-03-21 2013-05-28 Samsung Electronics Co., Ltd. Efficient uplink feedback in a wireless communication system
KR20100006492A (ko) * 2008-07-09 2010-01-19 삼성전자주식회사 부호화 방식 결정 방법 및 장치
JP5710476B2 (ja) 2008-07-10 2015-04-30 ヴォイスエイジ・コーポレーション スーパーフレームにおいてlpcフィルタの量子化および逆量子化を行うためのデバイスおよび方法
KR101170466B1 (ko) 2008-07-29 2012-08-03 한국전자통신연구원 Mdct 영역에서의 후처리 방법, 및 장치
JP2010122617A (ja) 2008-11-21 2010-06-03 Yamaha Corp ノイズゲート、及び収音装置
JP2010160496A (ja) * 2010-02-15 2010-07-22 Toshiba Corp 信号処理装置および信号処理方法
US9047875B2 (en) * 2010-07-19 2015-06-02 Futurewei Technologies, Inc. Spectrum flatness control for bandwidth extension
EP3686888B1 (de) * 2011-02-15 2025-04-02 VoiceAge EVS LLC Vorrichtung und verfahren zur quantisierung der verstärkung von adaptiven und festen beiträgen der anregung in einem celp-koder-dekoder
US9626982B2 (en) 2011-02-15 2017-04-18 Voiceage Corporation Device and method for quantizing the gains of the adaptive and fixed contributions of the excitation in a CELP codec
US9026434B2 (en) * 2011-04-11 2015-05-05 Samsung Electronic Co., Ltd. Frame erasure concealment for a multi rate speech and audio codec
US9336789B2 (en) * 2013-02-21 2016-05-10 Qualcomm Incorporated Systems and methods for determining an interpolation factor set for synthesizing a speech signal
CN104517612B (zh) * 2013-09-30 2018-10-12 上海爱聊信息科技有限公司 基于amr-nb语音信号的可变码率编码器和解码器及其编码和解码方法
JP5981408B2 (ja) * 2013-10-29 2016-08-31 株式会社Nttドコモ 音声信号処理装置、音声信号処理方法、及び音声信号処理プログラム
KR102745244B1 (ko) 2014-03-28 2024-12-20 삼성전자주식회사 선형예측계수 양자화방법 및 장치와 역양자화 방법 및 장치
CN112927702B (zh) * 2014-05-07 2024-11-12 三星电子株式会社 对线性预测系数量化的方法和装置及解量化的方法和装置
MY203900A (en) * 2014-07-28 2024-07-23 Ericsson Telefon Ab L M Pyramid vector quantizer shape search
US10109284B2 (en) * 2016-02-12 2018-10-23 Qualcomm Incorporated Inter-channel encoding and decoding of multiple high-band audio signals
US10373630B2 (en) * 2017-03-31 2019-08-06 Intel Corporation Systems and methods for energy efficient and low power distributed automatic speech recognition on wearable devices
CN111183476B (zh) * 2017-10-06 2024-03-22 索尼欧洲有限公司 基于子窗口序列内的rms功率的音频文件包络
CN108122552B (zh) * 2017-12-15 2021-10-15 上海智臻智能网络科技股份有限公司 语音情绪识别方法和装置
WO2021029642A1 (en) * 2019-08-13 2021-02-18 Samsung Electronics Co., Ltd. System and method for recognizing user's speech
CN113593521B (zh) * 2021-07-29 2022-09-20 北京三快在线科技有限公司 语音合成方法、装置、设备及可读存储介质
CN118430508B (zh) * 2024-05-29 2024-09-17 中国矿业大学 基于神经音频编解码器的语音合成方法

Family Cites Families (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JP3353852B2 (ja) * 1994-02-15 2002-12-03 日本電信電話株式会社 音声の符号化方法
US5701390A (en) * 1995-02-22 1997-12-23 Digital Voice Systems, Inc. Synthesis of MBE-based coded speech using regenerated phase information

Also Published As

Publication number Publication date
BRPI0014212B1 (pt) 2016-07-26
JP4176349B2 (ja) 2008-11-05
JP2003513296A (ja) 2003-04-08
EP1214706A1 (de) 2002-06-19
CN1451155A (zh) 2003-10-22
AU7486200A (en) 2001-04-24
KR100488080B1 (ko) 2005-05-06
CN1245706C (zh) 2006-03-15
DE60012760D1 (de) 2004-09-09
EP1214706B9 (de) 2005-01-05
JP2005338872A (ja) 2005-12-08
EP1214706B1 (de) 2004-08-04
KR20020033819A (ko) 2002-05-07
BR0014212A (pt) 2003-06-10
DE60012760T2 (de) 2005-08-04

Similar Documents

Publication Publication Date Title
ATE272885T1 (de) Multimodaler sprachkodierer
AU2001287969A1 (en) Codebook structure and search for speech coding
US7596486B2 (en) Encoding an audio signal using different audio coder modes
AU2003278014A8 (en) Methods for interoperation between adaptive multi-rate wideband (amr-wb) and multi-mode variable bit-rate wideband (wmr-wb) speech codecs
CA2096991A1 (en) Celp-based speech compressor
CA2306098A1 (en) Multimode speech coding apparatus and decoding apparatus
DE60024123D1 (de) Lpc-harmonischer sprachkodierer mit überrahmenformat
BR0304540A (pt) Métodos para codificar um sinal de áudio, e para decodificar um sinal de áudio codificado, codificador para codificar um sinal de áudio, aparelho para fornecer um sinal de áudio, sinal de áudio codificado, meio de armazenagem, e, decodificador para decodificar um sinal de áudio codificado
CA2611829A1 (en) Sub-band voice codec with multi-stage codebooks and redundant coding
CN101141644B (zh) 编码集成系统和方法与解码集成系统和方法
CN101087319B (zh) 一种发送和接收背景噪声的方法和装置及静音压缩系统
EP1204092A3 (de) Sprachdekoder mit Wiedergabe des Hintergrundrauschens
DE60027140D1 (de) Sprachsynthetisierer auf der basis von sprachkodierung mit veränderlicher bit-rate
Choudhary et al. Study and performance of amr codecs for gsm
WO2002023533A3 (en) System for improved use of pitch enhancement with subcodebooks
Wang et al. Transcoding Scheme between AMR-WB and VMR-WB
PL1756806T3 (pl) Sposób kwantyzacji kodera mowy o bardzo małej przepływności
BRPI0520115A2 (pt) métodos para codificar e para decodificar sinais de áudio e codificador e decodificador para sinais de áudio
Srinonchat et al. New Bit Rate CELP coder for Speaker Dependent Coding System
Ozawa et al. M-LCELP speech coding at bit-rates below 4kbps
Shikui et al. Speech transcoding from AMR to G. 729 in excitation domain

Legal Events

Date Code Title Description
RER Ceased as to paragraph 5 lit. 3 law introducing patent treaties