PL2242045T3 - Sposób kodowania i syntezy mowy - Google Patents
Sposób kodowania i syntezy mowyInfo
- Publication number
- PL2242045T3 PL2242045T3 PL09158056T PL09158056T PL2242045T3 PL 2242045 T3 PL2242045 T3 PL 2242045T3 PL 09158056 T PL09158056 T PL 09158056T PL 09158056 T PL09158056 T PL 09158056T PL 2242045 T3 PL2242045 T3 PL 2242045T3
- Authority
- PL
- Poland
- Prior art keywords
- speech synthesis
- coding methods
- coding
- methods
- speech
- Prior art date
Links
- 230000015572 biosynthetic process Effects 0.000 title 1
- 238000000034 method Methods 0.000 title 1
- 238000003786 synthesis reaction Methods 0.000 title 1
Classifications
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L13/00—Speech synthesis; Text to speech systems
- G10L13/06—Elementary speech units used in speech synthesisers; Concatenation rules
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L19/00—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
- G10L19/04—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using predictive techniques
- G10L19/08—Determination or coding of the excitation function; Determination or coding of the long-term prediction parameters
- G10L19/12—Determination or coding of the excitation function; Determination or coding of the long-term prediction parameters the excitation function being a code excitation, e.g. in code excited linear prediction [CELP] vocoders
- G10L19/125—Pitch excitation, e.g. pitch synchronous innovation CELP [PSI-CELP]
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L13/00—Speech synthesis; Text to speech systems
- G10L13/02—Methods for producing synthetic speech; Speech synthesisers
- G10L13/033—Voice editing, e.g. manipulating the voice of the synthesiser
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L13/00—Speech synthesis; Text to speech systems
- G10L13/02—Methods for producing synthetic speech; Speech synthesisers
- G10L13/04—Details of speech synthesis systems, e.g. synthesiser structure or memory management
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L19/00—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
- G10L19/04—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using predictive techniques
- G10L19/08—Determination or coding of the excitation function; Determination or coding of the long-term prediction parameters
- G10L19/12—Determination or coding of the excitation function; Determination or coding of the long-term prediction parameters the excitation function being a code excitation, e.g. in code excited linear prediction [CELP] vocoders
Landscapes
- Engineering & Computer Science (AREA)
- Computational Linguistics (AREA)
- Health & Medical Sciences (AREA)
- Audiology, Speech & Language Pathology (AREA)
- Human Computer Interaction (AREA)
- Physics & Mathematics (AREA)
- Acoustics & Sound (AREA)
- Multimedia (AREA)
- Signal Processing (AREA)
- Compression, Expansion, Code Conversion, And Decoders (AREA)
Applications Claiming Priority (1)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| EP09158056A EP2242045B1 (en) | 2009-04-16 | 2009-04-16 | Speech synthesis and coding methods |
Publications (1)
| Publication Number | Publication Date |
|---|---|
| PL2242045T3 true PL2242045T3 (pl) | 2013-02-28 |
Family
ID=40846430
Family Applications (1)
| Application Number | Title | Priority Date | Filing Date |
|---|---|---|---|
| PL09158056T PL2242045T3 (pl) | 2009-04-16 | 2009-04-16 | Sposób kodowania i syntezy mowy |
Country Status (10)
| Country | Link |
|---|---|
| US (1) | US8862472B2 (pl) |
| EP (1) | EP2242045B1 (pl) |
| JP (1) | JP5581377B2 (pl) |
| KR (1) | KR101678544B1 (pl) |
| CA (1) | CA2757142C (pl) |
| DK (1) | DK2242045T3 (pl) |
| IL (1) | IL215628A (pl) |
| PL (1) | PL2242045T3 (pl) |
| RU (1) | RU2557469C2 (pl) |
| WO (1) | WO2010118953A1 (pl) |
Families Citing this family (20)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| EP2507794B1 (en) * | 2009-12-02 | 2018-10-17 | Agnitio S.L. | Obfuscated speech synthesis |
| JP5591080B2 (ja) * | 2010-11-26 | 2014-09-17 | 三菱電機株式会社 | データ圧縮装置及びデータ処理システム及びコンピュータプログラム及びデータ圧縮方法 |
| KR101402805B1 (ko) * | 2012-03-27 | 2014-06-03 | 광주과학기술원 | 음성분석장치, 음성합성장치, 및 음성분석합성시스템 |
| US9978359B1 (en) * | 2013-12-06 | 2018-05-22 | Amazon Technologies, Inc. | Iterative text-to-speech with user feedback |
| US10014007B2 (en) | 2014-05-28 | 2018-07-03 | Interactive Intelligence, Inc. | Method for forming the excitation signal for a glottal pulse model based parametric speech synthesis system |
| BR112016027537B1 (pt) * | 2014-05-28 | 2022-05-10 | Interactive Intelligence, Inc | Método para criar um banco de dados de pulso glotal a partir de um sinal de discurso, em um sistema de síntese de discurso, método para criar modelos paramétricos para o uso no treinamento do sistema de síntese de discurso executado por um processador de computador genérico, e método para sintetizar o discurso usando o texto de entrada |
| US10255903B2 (en) | 2014-05-28 | 2019-04-09 | Interactive Intelligence Group, Inc. | Method for forming the excitation signal for a glottal pulse model based parametric speech synthesis system |
| US9607610B2 (en) * | 2014-07-03 | 2017-03-28 | Google Inc. | Devices and methods for noise modulation in a universal vocoder synthesizer |
| JP6293912B2 (ja) * | 2014-09-19 | 2018-03-14 | 株式会社東芝 | 音声合成装置、音声合成方法およびプログラム |
| EP3363015A4 (en) * | 2015-10-06 | 2019-06-12 | Interactive Intelligence Group, Inc. | METHOD FOR GENERATING THE SOUNDPROOF SIGNAL FOR A GLOTTAL IMPULSE MODEL-BASED PARAMETRIC LANGUAGE SYNTHESIS SYSTEM |
| US10140089B1 (en) | 2017-08-09 | 2018-11-27 | 2236008 Ontario Inc. | Synthetic speech for in vehicle communication |
| US10347238B2 (en) | 2017-10-27 | 2019-07-09 | Adobe Inc. | Text-based insertion and replacement in audio narration |
| CN108281150B (zh) * | 2018-01-29 | 2020-11-17 | 上海泰亿格康复医疗科技股份有限公司 | 一种基于微分声门波模型的语音变调变嗓音方法 |
| US10770063B2 (en) | 2018-04-13 | 2020-09-08 | Adobe Inc. | Real-time speaker-dependent neural vocoder |
| CN109036375B (zh) * | 2018-07-25 | 2023-03-24 | 腾讯科技(深圳)有限公司 | 语音合成方法、模型训练方法、装置和计算机设备 |
| CN121056626A (zh) * | 2019-07-19 | 2025-12-02 | 韦勒斯标准与技术协会公司 | 视频信号处理方法和设备 |
| CN112634914B (zh) * | 2020-12-15 | 2024-03-29 | 中国科学技术大学 | 基于短时谱一致性的神经网络声码器训练方法 |
| CN113539231B (zh) * | 2020-12-30 | 2024-06-18 | 腾讯科技(深圳)有限公司 | 音频处理方法、声码器、装置、设备及存储介质 |
| US12175995B2 (en) | 2021-06-03 | 2024-12-24 | Y.E. Hub Armenia LLC | Method and a server for generating a waveform |
| AU2023418288A1 (en) * | 2022-12-29 | 2025-07-24 | Med-El Elektromedizinische Geraete Gmbh | Synthesis of ling sounds |
Family Cites Families (16)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| JPS6423300A (en) * | 1987-07-17 | 1989-01-25 | Ricoh Kk | Spectrum generation system |
| US5754976A (en) * | 1990-02-23 | 1998-05-19 | Universite De Sherbrooke | Algebraic codebook with signal-selected pulse amplitude/position combinations for fast coding of speech |
| EP0481107B1 (en) * | 1990-10-16 | 1995-09-06 | International Business Machines Corporation | A phonetic Hidden Markov Model speech synthesizer |
| DE69203186T2 (de) * | 1991-09-20 | 1996-02-01 | Philips Electronics Nv | Verarbeitungsgerät für die menschliche Sprache zum Detektieren des Schliessens der Stimmritze. |
| JPH06250690A (ja) * | 1993-02-26 | 1994-09-09 | N T T Data Tsushin Kk | 振幅特徴抽出装置及び合成音声振幅制御装置 |
| JP3093113B2 (ja) * | 1994-09-21 | 2000-10-03 | 日本アイ・ビー・エム株式会社 | 音声合成方法及びシステム |
| JP3747492B2 (ja) * | 1995-06-20 | 2006-02-22 | ソニー株式会社 | 音声信号の再生方法及び再生装置 |
| US6304846B1 (en) * | 1997-10-22 | 2001-10-16 | Texas Instruments Incorporated | Singing voice synthesis |
| JP3268750B2 (ja) * | 1998-01-30 | 2002-03-25 | 株式会社東芝 | 音声合成方法及びシステム |
| US6631363B1 (en) * | 1999-10-11 | 2003-10-07 | I2 Technologies Us, Inc. | Rules-based notification system |
| DE10041512B4 (de) * | 2000-08-24 | 2005-05-04 | Infineon Technologies Ag | Verfahren und Vorrichtung zur künstlichen Erweiterung der Bandbreite von Sprachsignalen |
| ATE357042T1 (de) * | 2000-09-15 | 2007-04-15 | Lernout & Hauspie Speechprod | Schnelle wellenformsynchronisation für die verkettung und zeitskalenmodifikation von sprachsignalen |
| JP2004117662A (ja) * | 2002-09-25 | 2004-04-15 | Matsushita Electric Ind Co Ltd | 音声合成システム |
| CN100365704C (zh) * | 2002-11-25 | 2008-01-30 | 松下电器产业株式会社 | 声音合成方法以及声音合成装置 |
| US7842874B2 (en) * | 2006-06-15 | 2010-11-30 | Massachusetts Institute Of Technology | Creating music by concatenative synthesis |
| US8140326B2 (en) * | 2008-06-06 | 2012-03-20 | Fuji Xerox Co., Ltd. | Systems and methods for reducing speech intelligibility while preserving environmental sounds |
-
2009
- 2009-04-16 PL PL09158056T patent/PL2242045T3/pl unknown
- 2009-04-16 DK DK09158056.3T patent/DK2242045T3/da active
- 2009-04-16 EP EP09158056A patent/EP2242045B1/en not_active Not-in-force
-
2010
- 2010-03-30 WO PCT/EP2010/054244 patent/WO2010118953A1/en not_active Ceased
- 2010-03-30 JP JP2012505115A patent/JP5581377B2/ja not_active Expired - Fee Related
- 2010-03-30 CA CA2757142A patent/CA2757142C/en not_active Expired - Fee Related
- 2010-03-30 RU RU2011145669/08A patent/RU2557469C2/ru not_active IP Right Cessation
- 2010-03-30 KR KR1020117027296A patent/KR101678544B1/ko not_active Expired - Fee Related
- 2010-03-30 US US13/264,571 patent/US8862472B2/en not_active Expired - Fee Related
-
2011
- 2011-10-09 IL IL215628A patent/IL215628A/en not_active IP Right Cessation
Also Published As
| Publication number | Publication date |
|---|---|
| IL215628A0 (en) | 2012-01-31 |
| KR20120040136A (ko) | 2012-04-26 |
| JP5581377B2 (ja) | 2014-08-27 |
| CA2757142C (en) | 2017-11-07 |
| RU2557469C2 (ru) | 2015-07-20 |
| EP2242045A1 (en) | 2010-10-20 |
| US8862472B2 (en) | 2014-10-14 |
| JP2012524288A (ja) | 2012-10-11 |
| WO2010118953A1 (en) | 2010-10-21 |
| KR101678544B1 (ko) | 2016-11-22 |
| RU2011145669A (ru) | 2013-05-27 |
| IL215628A (en) | 2013-11-28 |
| EP2242045B1 (en) | 2012-06-27 |
| CA2757142A1 (en) | 2010-10-21 |
| DK2242045T3 (da) | 2012-09-24 |
| US20120123782A1 (en) | 2012-05-17 |
Similar Documents
| Publication | Publication Date | Title |
|---|---|---|
| IL215628A0 (en) | Speech synthesis and coding methods | |
| GB2466675B (en) | Speech coding | |
| GB2466666B (en) | Speech coding | |
| GB2466671B (en) | Speech encoding | |
| GB2466672B (en) | Speech coding | |
| GB2466670B (en) | Speech encoding | |
| GB2466669B (en) | Speech coding | |
| GB2476041B (en) | Encoding and decoding speech signals | |
| GB0900144D0 (en) | Speech coding | |
| ZA201203570B (en) | Multi-mode audio codec and celp coding adapted therefore | |
| GB2473139B (en) | Enhanced audio decoder | |
| GB2466673B (en) | Quantization | |
| GB0900138D0 (en) | Filtering speech | |
| EP2411024A4 (en) | Variants of factor viii and associated methods of use | |
| GB0921227D0 (en) | Personal audio equipment | |
| LT2462586T (lt) | Kalbos sintezės būdas | |
| ZA201200894B (en) | Synthesis and use of zsm-12 | |
| GB0903154D0 (en) | Speech clarity | |
| GB2476043B (en) | Decoding speech signals | |
| EP2645365A4 (en) | SPEECH SIGNAL ENCODING METHOD AND SPEECH SIGNAL DECODING METHOD | |
| GB0912744D0 (en) | Methods and uses | |
| GB0901529D0 (en) | Mooring limb | |
| TWM370808U (en) | Capo | |
| GB0800863D0 (en) | Unvoiced speech interface | |
| PH32009000191S1 (en) | Voice synthesizer |