ATE272885T1 - Multimodaler sprachkodierer - Google Patents

Multimodaler sprachkodierer

Info

Publication number: ATE272885T1
Authority: AT; Austria
Prior art keywords: rate; speech; compression system; rate codec; codec
Prior art date: 1999-09-22

Application number

AT00963447T

Other languages

English (en)

Inventor

Yang Gao

Adil Benyassine

Jes Thyssen

Eyal Sholomot

Huan-Yu Su

Original Assignee

Conexant Systems Inc

Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)

1999-09-22

Filing date

2000-09-15

Publication date

2004-08-15

2000-05-19 Priority claimed from US09/574,396 external-priority patent/US6782360B1/en

2000-09-15 Application filed by Conexant Systems Inc filed Critical Conexant Systems Inc

2004-08-15 Application granted granted Critical

2004-08-15 Publication of ATE272885T1 publication Critical patent/ATE272885T1/de

Links

230000006835 compression Effects 0.000 abstract 3
238000007906 compression Methods 0.000 abstract 3

Classifications

- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L19/00—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L19/00—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
- G10L19/04—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using predictive techniques
- G10L19/16—Vocoder architecture
- G10L19/18—Vocoders using multiple modes
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L19/00—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
- G10L19/04—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using predictive techniques
- G10L19/16—Vocoder architecture
- G10L19/167—Audio streaming, i.e. formatting and decoding of an encoded audio signal representation into a data stream for transmission or storage purposes
- H—ELECTRICITY
- H03—ELECTRONIC CIRCUITRY
- H03G—CONTROL OF AMPLIFICATION
- H03G3/00—Gain control in amplifiers or frequency changers
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L19/00—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
- G10L19/04—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using predictive techniques
- G10L19/16—Vocoder architecture
- G10L19/18—Vocoders using multiple modes
- G10L19/24—Variable rate codecs, e.g. for generating different qualities using a scalable representation such as hierarchical encoding or layered encoding

Landscapes

Engineering & Computer Science (AREA)
Computational Linguistics (AREA)
Signal Processing (AREA)
Health & Medical Sciences (AREA)
Audiology, Speech & Language Pathology (AREA)
Human Computer Interaction (AREA)
Physics & Mathematics (AREA)
Acoustics & Sound (AREA)
Multimedia (AREA)
Quality & Reliability (AREA)
Compression, Expansion, Code Conversion, And Decoders (AREA)
Transmission Systems Not Characterized By The Medium Used For Transmission (AREA)
Lubricants (AREA)
Ink Jet (AREA)
Graft Or Block Polymers (AREA)
Reduction Or Emphasis Of Bandwidth Of Signals (AREA)

AT00963447T 1999-09-22 2000-09-15 Multimodaler sprachkodierer ATE272885T1 (de)

Applications Claiming Priority (3)

Application Number	Priority Date	Filing Date	Title
US15532199P	1999-09-22	1999-09-22
US09/574,396 US6782360B1 (en)	1999-09-22	2000-05-19	Gain quantization for a CELP speech coder
PCT/US2000/025182 WO2001022402A1 (en)	1999-09-22	2000-09-15	Multimode speech encoder

Publications (1)

Publication Number	Publication Date
ATE272885T1 true ATE272885T1 (de)	2004-08-15

Family

ID=26852220

Family Applications (1)

Application Number	Title	Priority Date	Filing Date
AT00963447T ATE272885T1 (de)	1999-09-22	2000-09-15	Multimodaler sprachkodierer

Country Status (8)

Country	Link
EP (1)	EP1214706B9 (de)
JP (2)	JP4176349B2 (de)
KR (1)	KR100488080B1 (de)
CN (1)	CN1245706C (de)
AT (1)	ATE272885T1 (de)
AU (1)	AU7486200A (de)
BR (1)	BRPI0014212B1 (de)
DE (1)	DE60012760T2 (de)

Families Citing this family (29)

* Cited by examiner, † Cited by third party
Publication number	Priority date	Publication date	Assignee	Title
KR100463418B1 (ko) *	2002-11-11	2004-12-23	한국전자통신연구원	Ｃｅｌｐ 음성 부호화기에서 사용되는 가변적인 고정코드북 검색방법 및 장치
CA2415105A1 (en) *	2002-12-24	2004-06-24	Voiceage Corporation	A method and device for robust predictive vector quantization of linear prediction parameters in variable bit rate speech coding
FR2867649A1 (fr) *	2003-12-10	2005-09-16	France Telecom	Procede de codage multiple optimise
WO2006098274A1 (ja) *	2005-03-14	2006-09-21	Matsushita Electric Industrial Co., Ltd.	スケーラブル復号化装置およびスケーラブル復号化方法
US7177804B2 (en) *	2005-05-31	2007-02-13	Microsoft Corporation	Sub-band voice codec with multi-stage codebooks and redundant coding
CN101371296B (zh) *	2006-01-18	2012-08-29	Lg电子株式会社	用于编码和解码信号的设备和方法
US8451915B2 (en)	2007-03-21	2013-05-28	Samsung Electronics Co., Ltd.	Efficient uplink feedback in a wireless communication system
KR20100006492A (ko) *	2008-07-09	2010-01-19	삼성전자주식회사	부호화 방식 결정 방법 및 장치
NO2313887T3 (de)	2008-07-10	2018-02-10
KR101170466B1 (ko)	2008-07-29	2012-08-03	한국전자통신연구원	Ｍｄｃｔ 영역에서의 후처리 방법, 및 장치
JP2010122617A (ja) *	2008-11-21	2010-06-03	Yamaha Corp	ノイズゲート、及び収音装置
JP2010160496A (ja) *	2010-02-15	2010-07-22	Toshiba Corp	信号処理装置および信号処理方法
US9047875B2 (en) *	2010-07-19	2015-06-02	Futurewei Technologies, Inc.	Spectrum flatness control for bandwidth extension
US9626982B2 (en)	2011-02-15	2017-04-18	Voiceage Corporation	Device and method for quantizing the gains of the adaptive and fixed contributions of the excitation in a CELP codec
DE20163502T1 (de) *	2011-02-15	2020-12-10	Voiceage Evs Gmbh & Co. Kg	Vorrichtung und verfahren zur quantisierung der verstärkung von adaptiven und festen beiträgen der anregung in einem celp-koder-dekoder
US9026434B2 (en) *	2011-04-11	2015-05-05	Samsung Electronic Co., Ltd.	Frame erasure concealment for a multi rate speech and audio codec
US9336789B2 (en) *	2013-02-21	2016-05-10	Qualcomm Incorporated	Systems and methods for determining an interpolation factor set for synthesizing a speech signal
CN104517612B (zh) *	2013-09-30	2018-10-12	上海爱聊信息科技有限公司	基于amr-nb语音信号的可变码率编码器和解码器及其编码和解码方法
JP5981408B2 (ja) *	2013-10-29	2016-08-31	株式会社Ｎｔｔドコモ	音声信号処理装置、音声信号処理方法、及び音声信号処理プログラム
SG11201608787UA (en)	2014-03-28	2016-12-29	Samsung Electronics Co Ltd	Method and device for quantization of linear prediction coefficient and method and device for inverse quantization
CN112927702B (zh)	2014-05-07	2024-11-12	三星电子株式会社	对线性预测系数量化的方法和装置及解量化的方法和装置
SG10202000575WA (en) *	2014-07-28	2020-03-30	Ericsson Telefon Ab L M	Pyramid vector quantizer shape search
US10109284B2 (en) *	2016-02-12	2018-10-23	Qualcomm Incorporated	Inter-channel encoding and decoding of multiple high-band audio signals
US10373630B2 (en) *	2017-03-31	2019-08-06	Intel Corporation	Systems and methods for energy efficient and low power distributed automatic speech recognition on wearable devices
WO2019068915A1 (en) *	2017-10-06	2019-04-11	Sony Europe Limited	AUDIO FILE ENVELOPE BASED ON RMS POWER IN SUB-WINDOW SEQUENCES
CN108122552B (zh) *	2017-12-15	2021-10-15	上海智臻智能网络科技股份有限公司	语音情绪识别方法和装置
US11532310B2 (en) *	2019-08-13	2022-12-20	Samsung Electronics Co., Ltd.	System and method for recognizing user's speech
CN113593521B (zh) *	2021-07-29	2022-09-20	北京三快在线科技有限公司	语音合成方法、装置、设备及可读存储介质
CN118430508B (zh) *	2024-05-29	2024-09-17	中国矿业大学	基于神经音频编解码器的语音合成方法

Family Cites Families (2)

* Cited by examiner, † Cited by third party
Publication number	Priority date	Publication date	Assignee	Title
JP3353852B2 (ja) *	1994-02-15	2002-12-03	日本電信電話株式会社	音声の符号化方法
US5701390A (en) *	1995-02-22	1997-12-23	Digital Voice Systems, Inc.	Synthesis of MBE-based coded speech using regenerated phase information

2000
- 2000-09-12 AU AU74862/00A patent/AU7486200A/en not_active Abandoned
- 2000-09-15 CN CNB008159408A patent/CN1245706C/zh not_active Expired - Fee Related
- 2000-09-15 AT AT00963447T patent/ATE272885T1/de not_active IP Right Cessation
- 2000-09-15 EP EP00963447A patent/EP1214706B9/de not_active Expired - Lifetime
- 2000-09-15 JP JP2001525686A patent/JP4176349B2/ja not_active Expired - Fee Related
- 2000-09-15 KR KR10-2002-7003768A patent/KR100488080B1/ko not_active Expired - Lifetime
- 2000-09-15 BR BRPI0014212A patent/BRPI0014212B1/pt not_active IP Right Cessation
- 2000-09-15 DE DE60012760T patent/DE60012760T2/de not_active Expired - Lifetime
2005
- 2005-07-11 JP JP2005202337A patent/JP2005338872A/ja active Pending

Also Published As

Publication number	Publication date
AU7486200A (en)	2001-04-24
JP4176349B2 (ja)	2008-11-05
EP1214706B9 (de)	2005-01-05
JP2003513296A (ja)	2003-04-08
KR100488080B1 (ko)	2005-05-06
EP1214706A1 (de)	2002-06-19
CN1451155A (zh)	2003-10-22
EP1214706B1 (de)	2004-08-04
KR20020033819A (ko)	2002-05-07
BR0014212A (pt)	2003-06-10
DE60012760D1 (de)	2004-09-09
JP2005338872A (ja)	2005-12-08
DE60012760T2 (de)	2005-08-04
CN1245706C (zh)	2006-03-15
BRPI0014212B1 (pt)	2016-07-26

Legal Events

Date	Code	Title	Description
2005-02-15	RER	Ceased as to paragraph 5 lit. 3 law introducing patent treaties

Publication	Publication Date	Title
ATE272885T1 (de)	2004-08-15	Multimodaler sprachkodierer
AU2001287969A1 (en)	2002-04-02	Codebook structure and search for speech coding
US7596486B2 (en)	2009-09-29	Encoding an audio signal using different audio coder modes
AU2003278014A8 (en)	2004-05-04	Methods for interoperation between adaptive multi-rate wideband (amr-wb) and multi-mode variable bit-rate wideband (wmr-wb) speech codecs
CA2096991A1 (en)	1993-12-02	Celp-based speech compressor
AU7830300A (en)	2001-04-24	Lpc-harmonic vocoder with superframe structure
CA2306098A1 (en)	2000-03-02	Multimode speech coding apparatus and decoding apparatus
CA2611829A1 (en)	2006-12-07	Sub-band voice codec with multi-stage codebooks and redundant coding
CN101141644B (zh)	2010-12-08	编码集成系统和方法与解码集成系统和方法
CN101087319B (zh)	2012-01-04	一种发送和接收背景噪声的方法和装置及静音压缩系统
EP1204092A3 (de)	2003-11-19	Sprachdekoder mit Wiedergabe des Hintergrundrauschens
DE60027140D1 (de)	2006-05-18	Sprachsynthetisierer auf der basis von sprachkodierung mit veränderlicher bit-rate
KR20010087393A (ko)	2001-09-15	폐루프 가변-레이트 다중모드 예측 음성 코더
WO2002023533A3 (en)	2002-08-15	System for improved use of pitch enhancement with subcodebooks
Wang et al.	2009	Transcoding Scheme between AMR-WB and VMR-WB
PL1756806T3 (pl)	2010-06-30	Sposób kwantyzacji kodera mowy o bardzo małej przepływności
BRPI0520115A2 (pt)	2009-09-15	métodos para codificar e para decodificar sinais de áudio e codificador e decodificador para sinais de áudio
Srinonchat et al.	2004	New Bit Rate CELP coder for Speaker Dependent Coding System
Ozawa et al.	1993	M-LCELP speech coding at bit-rates below 4kbps
Shikui et al.	2010	Speech transcoding from AMR to G. 729 in excitation domain
Xu et al.	2008	A novel transcoding algorithm between 3GPP AMR-NB (7.95 kbit/s) and ITU-t g. 729a (8kbit/s)