PL2242045T3 - Sposób kodowania i syntezy mowy - Google Patents

Sposób kodowania i syntezy mowy

Info

Publication number: PL2242045T3
Authority: PL; Poland
Prior art keywords: speech synthesis; coding methods; coding; methods; speech
Prior art date: 2009-04-16

Application number

PL09158056T

Other languages

English (en)

Inventor

Thomas Drugman

Geoffrey Wilfart

Thierry Dutoit

Original Assignee

Univ Mons

Acapela Group S A

Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)

2009-04-16

Filing date

2009-04-16

Publication date

2013-02-28

2009-04-16 Application filed by Univ Mons, Acapela Group S A filed Critical Univ Mons

2013-02-28 Publication of PL2242045T3 publication Critical patent/PL2242045T3/pl

Links

230000015572 biosynthetic process Effects 0.000 title 1
238000000034 method Methods 0.000 title 1
238000003786 synthesis reaction Methods 0.000 title 1

Classifications

- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L13/00—Speech synthesis; Text to speech systems
- G10L13/06—Elementary speech units used in speech synthesisers; Concatenation rules
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L19/00—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
- G10L19/04—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using predictive techniques
- G10L19/08—Determination or coding of the excitation function; Determination or coding of the long-term prediction parameters
- G10L19/12—Determination or coding of the excitation function; Determination or coding of the long-term prediction parameters the excitation function being a code excitation, e.g. in code excited linear prediction [CELP] vocoders
- G10L19/125—Pitch excitation, e.g. pitch synchronous innovation CELP [PSI-CELP]
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L13/00—Speech synthesis; Text to speech systems
- G10L13/02—Methods for producing synthetic speech; Speech synthesisers
- G10L13/033—Voice editing, e.g. manipulating the voice of the synthesiser
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L13/00—Speech synthesis; Text to speech systems
- G10L13/02—Methods for producing synthetic speech; Speech synthesisers
- G10L13/04—Details of speech synthesis systems, e.g. synthesiser structure or memory management
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L19/00—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
- G10L19/04—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using predictive techniques
- G10L19/08—Determination or coding of the excitation function; Determination or coding of the long-term prediction parameters
- G10L19/12—Determination or coding of the excitation function; Determination or coding of the long-term prediction parameters the excitation function being a code excitation, e.g. in code excited linear prediction [CELP] vocoders

Landscapes

Engineering & Computer Science (AREA)
Computational Linguistics (AREA)
Health & Medical Sciences (AREA)
Audiology, Speech & Language Pathology (AREA)
Human Computer Interaction (AREA)
Physics & Mathematics (AREA)
Acoustics & Sound (AREA)
Multimedia (AREA)
Signal Processing (AREA)
Compression, Expansion, Code Conversion, And Decoders (AREA)

PL09158056T 2009-04-16 2009-04-16 Sposób kodowania i syntezy mowy PL2242045T3 (pl)

Applications Claiming Priority (1)

Application Number	Priority Date	Filing Date	Title
EP09158056A EP2242045B1 (en)	2009-04-16	2009-04-16	Speech synthesis and coding methods

Publications (1)

Publication Number	Publication Date
PL2242045T3 true PL2242045T3 (pl)	2013-02-28

Family

ID=40846430

Family Applications (1)

Application Number	Title	Priority Date	Filing Date
PL09158056T PL2242045T3 (pl)	2009-04-16	2009-04-16	Sposób kodowania i syntezy mowy

Country Status (10)

Country	Link
US (1)	US8862472B2 (pl)
EP (1)	EP2242045B1 (pl)
JP (1)	JP5581377B2 (pl)
KR (1)	KR101678544B1 (pl)
CA (1)	CA2757142C (pl)
DK (1)	DK2242045T3 (pl)
IL (1)	IL215628A (pl)
PL (1)	PL2242045T3 (pl)
RU (1)	RU2557469C2 (pl)
WO (1)	WO2010118953A1 (pl)

Families Citing this family (20)

* Cited by examiner, † Cited by third party
Publication number	Priority date	Publication date	Assignee	Title
EP2507794B1 (en) *	2009-12-02	2018-10-17	Agnitio S.L.	Obfuscated speech synthesis
JP5591080B2 (ja) *	2010-11-26	2014-09-17	三菱電機株式会社	データ圧縮装置及びデータ処理システム及びコンピュータプログラム及びデータ圧縮方法
KR101402805B1 (ko) *	2012-03-27	2014-06-03	광주과학기술원	음성분석장치, 음성합성장치, 및 음성분석합성시스템
US9978359B1 (en) *	2013-12-06	2018-05-22	Amazon Technologies, Inc.	Iterative text-to-speech with user feedback
US10255903B2 (en)	2014-05-28	2019-04-09	Interactive Intelligence Group, Inc.	Method for forming the excitation signal for a glottal pulse model based parametric speech synthesis system
EP3149727B1 (en) *	2014-05-28	2021-01-27	Interactive Intelligence Group, Inc.	Method for forming the excitation signal for a glottal pulse model based parametric speech synthesis system
US10014007B2 (en)	2014-05-28	2018-07-03	Interactive Intelligence, Inc.	Method for forming the excitation signal for a glottal pulse model based parametric speech synthesis system
US9607610B2 (en) *	2014-07-03	2017-03-28	Google Inc.	Devices and methods for noise modulation in a universal vocoder synthesizer
JP6293912B2 (ja) *	2014-09-19	2018-03-14	株式会社東芝	音声合成装置、音声合成方法およびプログラム
CN108369803B (zh) *	2015-10-06	2023-04-04	交互智能集团有限公司	用于形成基于声门脉冲模型的参数语音合成系统的激励信号的方法
US10140089B1 (en)	2017-08-09	2018-11-27	2236008 Ontario Inc.	Synthetic speech for in vehicle communication
US10347238B2 (en)	2017-10-27	2019-07-09	Adobe Inc.	Text-based insertion and replacement in audio narration
CN108281150B (zh) *	2018-01-29	2020-11-17	上海泰亿格康复医疗科技股份有限公司	一种基于微分声门波模型的语音变调变嗓音方法
US10770063B2 (en)	2018-04-13	2020-09-08	Adobe Inc.	Real-time speaker-dependent neural vocoder
CN109036375B (zh) *	2018-07-25	2023-03-24	腾讯科技（深圳）有限公司	语音合成方法、模型训练方法、装置和计算机设备
WO2021015523A1 (ko) *	2019-07-19	2021-01-28	주식회사 윌러스표준기술연구소	비디오 신호 처리 방법 및 장치
CN112634914B (zh) *	2020-12-15	2024-03-29	中国科学技术大学	基于短时谱一致性的神经网络声码器训练方法
CN113539231B (zh) *	2020-12-30	2024-06-18	腾讯科技（深圳）有限公司	音频处理方法、声码器、装置、设备及存储介质
US12175995B2 (en)	2021-06-03	2024-12-24	Y.E. Hub Armenia LLC	Method and a server for generating a waveform
EP4643106A1 (en) *	2022-12-29	2025-11-05	Med-El Elektromedizinische Geraete GmbH	Synthesis of ling sounds

Family Cites Families (16)

* Cited by examiner, † Cited by third party
Publication number	Priority date	Publication date	Assignee	Title
JPS6423300A (en) *	1987-07-17	1989-01-25	Ricoh Kk	Spectrum generation system
US5754976A (en) *	1990-02-23	1998-05-19	Universite De Sherbrooke	Algebraic codebook with signal-selected pulse amplitude/position combinations for fast coding of speech
EP0481107B1 (en) *	1990-10-16	1995-09-06	International Business Machines Corporation	A phonetic Hidden Markov Model speech synthesizer
DE69203186T2 (de) *	1991-09-20	1996-02-01	Philips Electronics Nv	Verarbeitungsgerät für die menschliche Sprache zum Detektieren des Schliessens der Stimmritze.
JPH06250690A (ja) *	1993-02-26	1994-09-09	N T T Data Tsushin Kk	振幅特徴抽出装置及び合成音声振幅制御装置
JP3093113B2 (ja) *	1994-09-21	2000-10-03	日本アイ・ビー・エム株式会社	音声合成方法及びシステム
JP3747492B2 (ja) *	1995-06-20	2006-02-22	ソニー株式会社	音声信号の再生方法及び再生装置
US6304846B1 (en) *	1997-10-22	2001-10-16	Texas Instruments Incorporated	Singing voice synthesis
JP3268750B2 (ja) *	1998-01-30	2002-03-25	株式会社東芝	音声合成方法及びシステム
US6631363B1 (en) *	1999-10-11	2003-10-07	I2 Technologies Us, Inc.	Rules-based notification system
DE10041512B4 (de) *	2000-08-24	2005-05-04	Infineon Technologies Ag	Verfahren und Vorrichtung zur künstlichen Erweiterung der Bandbreite von Sprachsignalen
WO2002023523A2 (en) *	2000-09-15	2002-03-21	Lernout & Hauspie Speech Products N.V.	Fast waveform synchronization for concatenation and time-scale modification of speech
JP2004117662A (ja) *	2002-09-25	2004-04-15	Matsushita Electric Ind Co Ltd	音声合成システム
AU2003284654A1 (en) *	2002-11-25	2004-06-18	Matsushita Electric Industrial Co., Ltd.	Speech synthesis method and speech synthesis device
US7842874B2 (en) *	2006-06-15	2010-11-30	Massachusetts Institute Of Technology	Creating music by concatenative synthesis
US8140326B2 (en) *	2008-06-06	2012-03-20	Fuji Xerox Co., Ltd.	Systems and methods for reducing speech intelligibility while preserving environmental sounds

2009
- 2009-04-16 PL PL09158056T patent/PL2242045T3/pl unknown
- 2009-04-16 EP EP09158056A patent/EP2242045B1/en not_active Not-in-force
- 2009-04-16 DK DK09158056.3T patent/DK2242045T3/da active
2010
- 2010-03-30 US US13/264,571 patent/US8862472B2/en not_active Expired - Fee Related
- 2010-03-30 CA CA2757142A patent/CA2757142C/en not_active Expired - Fee Related
- 2010-03-30 KR KR1020117027296A patent/KR101678544B1/ko not_active Expired - Fee Related
- 2010-03-30 WO PCT/EP2010/054244 patent/WO2010118953A1/en not_active Ceased
- 2010-03-30 RU RU2011145669/08A patent/RU2557469C2/ru not_active IP Right Cessation
- 2010-03-30 JP JP2012505115A patent/JP5581377B2/ja not_active Expired - Fee Related
2011
- 2011-10-09 IL IL215628A patent/IL215628A/en not_active IP Right Cessation

Also Published As

Publication number	Publication date
RU2011145669A (ru)	2013-05-27
WO2010118953A1 (en)	2010-10-21
US20120123782A1 (en)	2012-05-17
KR20120040136A (ko)	2012-04-26
US8862472B2 (en)	2014-10-14
CA2757142A1 (en)	2010-10-21
JP5581377B2 (ja)	2014-08-27
DK2242045T3 (da)	2012-09-24
CA2757142C (en)	2017-11-07
IL215628A0 (en)	2012-01-31
EP2242045B1 (en)	2012-06-27
JP2012524288A (ja)	2012-10-11
EP2242045A1 (en)	2010-10-20
KR101678544B1 (ko)	2016-11-22
RU2557469C2 (ru)	2015-07-20
IL215628A (en)	2013-11-28

Publication	Publication Date	Title
IL215628A0 (en)	2012-01-31	Speech synthesis and coding methods
GB2466675B (en)	2013-03-06	Speech coding
GB2466666B (en)	2013-01-23	Speech coding
GB2466671B (en)	2013-03-27	Speech encoding
GB2466672B (en)	2013-03-13	Speech coding
GB2466670B (en)	2012-11-14	Speech encoding
GB2466669B (en)	2013-03-06	Speech coding
GB2476041B (en)	2017-03-01	Encoding and decoding speech signals
GB0900144D0 (en)	2009-02-11	Speech coding
ZA201203570B (en)	2013-05-29	Multi-mode audio codec and celp coding adapted therefore
GB2473139B (en)	2012-04-11	Enhanced audio decoder
GB2466673B (en)	2012-11-07	Quantization
GB0900138D0 (en)	2009-02-11	Filtering speech
EP2411024A4 (en)	2013-02-27	Variants of factor viii and associated methods of use
GB0921227D0 (en)	2010-01-20	Personal audio equipment
LT2462586T (lt)	2017-12-27	Kalbos sintezės būdas
ZA201200894B (en)	2012-10-31	Synthesis and use of zsm-12
GB0903154D0 (en)	2009-04-08	Speech clarity
GB2476043B (en)	2016-10-26	Decoding speech signals
EP2645365A4 (en)	2015-01-07	SPEECH SIGNAL ENCODING METHOD AND SPEECH SIGNAL DECODING METHOD
GB0912744D0 (en)	2009-08-26	Methods and uses
GB0901529D0 (en)	2009-03-11	Mooring limb
TWM370808U (en)	2009-12-11	Capo
GB0800863D0 (en)	2008-02-27	Unvoiced speech interface
PH32009000191S1 (en)	2013-01-25	Voice synthesizer