ES2146155B1 - Sintetizadores de voz, metodos para sintetizar voz y para mejorar una voz sintetizada y los correspondientes dispositivo de radio y señal de sintesis. - Google Patents

Sintetizadores de voz, metodos para sintetizar voz y para mejorar una voz sintetizada y los correspondientes dispositivo de radio y señal de sintesis.

Info

Publication number
ES2146155B1
ES2146155B1 ES009750009A ES9750009A ES2146155B1 ES 2146155 B1 ES2146155 B1 ES 2146155B1 ES 009750009 A ES009750009 A ES 009750009A ES 9750009 A ES9750009 A ES 9750009A ES 2146155 B1 ES2146155 B1 ES 2146155B1
Authority
ES
Spain
Prior art keywords
voice
code book
signal
improve
methods
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Expired - Fee Related
Application number
ES009750009A
Other languages
English (en)
Other versions
ES2146155A1 (es
Inventor
Kari Jarvinen
Tero Honkanen
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Nokia Technologies Oy
Original Assignee
Nokia Mobile Phones Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Family has litigation
First worldwide family litigation filed litigation Critical https://patents.darts-ip.com/?family=10776197&utm_source=google_patent&utm_medium=platform_link&utm_campaign=public_patent_search&patent=ES2146155(B1) "Global patent litigation dataset” by Darts-ip is licensed under a Creative Commons Attribution 4.0 International License.
Application filed by Nokia Mobile Phones Ltd filed Critical Nokia Mobile Phones Ltd
Publication of ES2146155A1 publication Critical patent/ES2146155A1/es
Application granted granted Critical
Publication of ES2146155B1 publication Critical patent/ES2146155B1/es
Anticipated expiration legal-status Critical
Expired - Fee Related legal-status Critical Current

Links

Classifications

    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L19/00Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
    • G10L19/04Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using predictive techniques
    • G10L19/26Pre-filtering or post-filtering
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L19/00Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
    • G10L19/04Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using predictive techniques

Landscapes

  • Engineering & Computer Science (AREA)
  • Multimedia (AREA)
  • Health & Medical Sciences (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Human Computer Interaction (AREA)
  • Physics & Mathematics (AREA)
  • Signal Processing (AREA)
  • Acoustics & Sound (AREA)
  • Computational Linguistics (AREA)
  • Compression, Expansion, Code Conversion, And Decoders (AREA)
  • Analogue/Digital Conversion (AREA)
  • Transmission And Conversion Of Sensor Element Output (AREA)
  • Telephonic Communication Services (AREA)
  • Transmission Systems Not Characterized By The Medium Used For Transmission (AREA)
  • Soundproofing, Sound Blocking, And Sound Damping (AREA)
  • Reduction Or Emphasis Of Bandwidth Of Signals (AREA)
  • Magnetically Actuated Valves (AREA)

Abstract

Sintetizador de voz, métodos para sintetizar y mejorar una voz sintetizada y los correspondientes dispositivo de radio y señal de síntesis. Se describen un post-procesador (317) y un método substancialmente para mejorar voz sintetizada. El post-procesador (317) actúa sobre una señal ex(n) derivada de un generador de excitación (211) que comprende típicamente un libro de códigos fijos (203) y un libro de códigos adaptables (204), siendo formada la señal ex (n) a partir de la adición de salidas escaladas del libro de códigos fijos (203) y del libro de códigos adaptables (204). El post-procesador actúa sobre ex(n) añadiéndose una señal escalada pv(n) derivada del libro de códigos adaptables (204). Un factor de ganancia o de escala p es determinado por coeficientes de voz introducidos en el generador de excitación (211). La señal combinada ex (n)+pv (n) es normalizada por la unidad para control de energía adaptable (316) e introducida en un PLC o filtro de síntesis de voz (208), antes de serintroducida en una unidad de proceso de audio (209). Figura 3.
ES009750009A 1995-06-16 1996-06-13 Sintetizadores de voz, metodos para sintetizar voz y para mejorar una voz sintetizada y los correspondientes dispositivo de radio y señal de sintesis. Expired - Fee Related ES2146155B1 (es)

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
GBGB9512284.2A GB9512284D0 (en) 1995-06-16 1995-06-16 Speech Synthesiser

Publications (2)

Publication Number Publication Date
ES2146155A1 ES2146155A1 (es) 2000-07-16
ES2146155B1 true ES2146155B1 (es) 2001-02-01

Family

ID=10776197

Family Applications (1)

Application Number Title Priority Date Filing Date
ES009750009A Expired - Fee Related ES2146155B1 (es) 1995-06-16 1996-06-13 Sintetizadores de voz, metodos para sintetizar voz y para mejorar una voz sintetizada y los correspondientes dispositivo de radio y señal de sintesis.

Country Status (12)

Country Link
US (2) US6029128A (es)
EP (1) EP0832482B1 (es)
JP (1) JP3483891B2 (es)
CN (2) CN1199151C (es)
AT (1) ATE206843T1 (es)
AU (1) AU714752B2 (es)
BR (1) BR9608479A (es)
DE (1) DE69615839T2 (es)
ES (1) ES2146155B1 (es)
GB (1) GB9512284D0 (es)
RU (1) RU2181481C2 (es)
WO (1) WO1997000516A1 (es)

Families Citing this family (51)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US5913187A (en) * 1997-08-29 1999-06-15 Nortel Networks Corporation Nonlinear filter for noise suppression in linear prediction speech processing devices
US7072832B1 (en) * 1998-08-24 2006-07-04 Mindspeed Technologies, Inc. System for speech encoding having an adaptive encoding arrangement
US7117146B2 (en) * 1998-08-24 2006-10-03 Mindspeed Technologies, Inc. System for improved use of pitch enhancement with subcodebooks
US6104992A (en) * 1998-08-24 2000-08-15 Conexant Systems, Inc. Adaptive gain reduction to produce fixed codebook target signal
US6260010B1 (en) * 1998-08-24 2001-07-10 Conexant Systems, Inc. Speech encoder using gain normalization that combines open and closed loop gains
JP3365360B2 (ja) * 1999-07-28 2003-01-08 日本電気株式会社 音声信号復号方法および音声信号符号化復号方法とその装置
US6480827B1 (en) * 2000-03-07 2002-11-12 Motorola, Inc. Method and apparatus for voice communication
US6581030B1 (en) * 2000-04-13 2003-06-17 Conexant Systems, Inc. Target signal reference shifting employed in code-excited linear prediction speech coding
US6466904B1 (en) * 2000-07-25 2002-10-15 Conexant Systems, Inc. Method and apparatus using harmonic modeling in an improved speech decoder
US7283961B2 (en) * 2000-08-09 2007-10-16 Sony Corporation High-quality speech synthesis device and method by classification and prediction processing of synthesized sound
EP1944760B1 (en) * 2000-08-09 2009-09-23 Sony Corporation Voice data processing device and processing method
JP3558031B2 (ja) * 2000-11-06 2004-08-25 日本電気株式会社 音声復号化装置
US7103539B2 (en) * 2001-11-08 2006-09-05 Global Ip Sound Europe Ab Enhanced coded speech
CA2388352A1 (en) * 2002-05-31 2003-11-30 Voiceage Corporation A method and device for frequency-selective pitch enhancement of synthesized speed
DE10236694A1 (de) * 2002-08-09 2004-02-26 Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. Vorrichtung und Verfahren zum skalierbaren Codieren und Vorrichtung und Verfahren zum skalierbaren Decodieren
US7516067B2 (en) * 2003-08-25 2009-04-07 Microsoft Corporation Method and apparatus using harmonic-model-based front end for robust speech recognition
US7447630B2 (en) * 2003-11-26 2008-11-04 Microsoft Corporation Method and apparatus for multi-sensory speech enhancement
CA2457988A1 (en) * 2004-02-18 2005-08-18 Voiceage Corporation Methods and devices for audio compression based on acelp/tcx coding and multi-rate lattice vector quantization
JP4398323B2 (ja) * 2004-08-09 2010-01-13 ユニデン株式会社 デジタル無線通信装置
US20070147518A1 (en) * 2005-02-18 2007-06-28 Bruno Bessette Methods and devices for low-frequency emphasis during audio compression based on ACELP/TCX
US20060217983A1 (en) * 2005-03-28 2006-09-28 Tellabs Operations, Inc. Method and apparatus for injecting comfort noise in a communications system
US20060217988A1 (en) * 2005-03-28 2006-09-28 Tellabs Operations, Inc. Method and apparatus for adaptive level control
US20060217972A1 (en) * 2005-03-28 2006-09-28 Tellabs Operations, Inc. Method and apparatus for modifying an encoded signal
US20060217970A1 (en) * 2005-03-28 2006-09-28 Tellabs Operations, Inc. Method and apparatus for noise reduction
US20060215683A1 (en) * 2005-03-28 2006-09-28 Tellabs Operations, Inc. Method and apparatus for voice quality enhancement
US7562021B2 (en) * 2005-07-15 2009-07-14 Microsoft Corporation Modification of codewords in dictionary used for efficient coding of digital media spectral data
US7590523B2 (en) * 2006-03-20 2009-09-15 Mindspeed Technologies, Inc. Speech post-processing using MDCT coefficients
US8005671B2 (en) * 2006-12-04 2011-08-23 Qualcomm Incorporated Systems and methods for dynamic normalization to reduce loss in precision for low-level signals
EP2096631A4 (en) * 2006-12-13 2012-07-25 Panasonic Corp TONE DECODING DEVICE AND POWER ADJUSTMENT METHOD
WO2008072736A1 (ja) * 2006-12-15 2008-06-19 Panasonic Corporation 適応音源ベクトル量子化装置および適応音源ベクトル量子化方法
US8688437B2 (en) 2006-12-26 2014-04-01 Huawei Technologies Co., Ltd. Packet loss concealment for speech coding
CN101286319B (zh) * 2006-12-26 2013-05-01 华为技术有限公司 改进语音丢包修补质量的语音编码方法
CN101266797B (zh) * 2007-03-16 2011-06-01 展讯通信(上海)有限公司 语音信号后处理滤波方法
RU2343563C1 (ru) * 2007-05-21 2009-01-10 Федеральное государственное унитарное предприятие "ПЕНЗЕНСКИЙ НАУЧНО-ИССЛЕДОВАТЕЛЬСКИЙ ЭЛЕКТРОТЕХНИЧЕСКИЙ ИНСТИТУТ" (ФГУП "ПНИЭИ") Способ передачи и приема закодированной речи
US8209190B2 (en) * 2007-10-25 2012-06-26 Motorola Mobility, Inc. Method and apparatus for generating an enhancement layer within an audio coding system
CN100578620C (zh) * 2007-11-12 2010-01-06 华为技术有限公司 固定码书搜索方法及搜索器
CN101179716B (zh) * 2007-11-30 2011-12-07 华南理工大学 一种压缩域的传输数据流音频自动增益控制方法
US20090287489A1 (en) * 2008-05-15 2009-11-19 Palm, Inc. Speech processing for plurality of users
US8442837B2 (en) * 2009-12-31 2013-05-14 Motorola Mobility Llc Embedded speech and audio coding using a switchable model core
US8990094B2 (en) * 2010-09-13 2015-03-24 Qualcomm Incorporated Coding and decoding a transient frame
US8862465B2 (en) * 2010-09-17 2014-10-14 Qualcomm Incorporated Determining pitch cycle energy and scaling an excitation signal
EP2816556B1 (en) * 2011-04-15 2016-05-04 Telefonaktiebolaget LM Ericsson (publ) Method and a decoder for attenuation of signal regions reconstructed with low accuracy
CN103827965B (zh) * 2011-07-29 2016-05-25 Dts有限责任公司 自适应语音可理解性处理器
EP2704142B1 (en) * 2012-08-27 2015-09-02 Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. Apparatus and method for reproducing an audio signal, apparatus and method for generating a coded audio signal, computer program and coded audio signal
CN104299614B (zh) * 2013-07-16 2017-12-29 华为技术有限公司 解码方法和解码装置
US9620134B2 (en) * 2013-10-10 2017-04-11 Qualcomm Incorporated Gain shape estimation for improved tracking of high-band temporal characteristics
ES2839086T3 (es) * 2013-10-18 2021-07-05 Fraunhofer Ges Forschung Concepto para codificar una señal de audio y decodificar una señal de audio usando información determinista y con características de ruido
PL3058568T3 (pl) * 2013-10-18 2021-07-05 Fraunhofer Gesellschaft zur Förderung der angewandten Forschung e.V. Koncepcja kodowania sygnału audio i dekodowania sygnału audio z wykorzystaniem związanych z mową informacji kształtowania widmowego
JP6885221B2 (ja) 2017-06-30 2021-06-09 ブラザー工業株式会社 表示制御装置、表示制御方法及び表示制御プログラム
CN110444192A (zh) * 2019-08-15 2019-11-12 广州科粤信息科技有限公司 一种基于语音技术的智能语音机器人
CN113241082B (zh) * 2021-04-22 2024-02-20 杭州网易智企科技有限公司 变声方法、装置、设备和介质

Family Cites Families (28)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US4220819A (en) * 1979-03-30 1980-09-02 Bell Telephone Laboratories, Incorporated Residual excited predictive speech coding system
JPS5681900A (en) * 1979-12-10 1981-07-04 Nippon Electric Co Voice synthesizer
US4815135A (en) * 1984-07-10 1989-03-21 Nec Corporation Speech signal processor
US4617676A (en) * 1984-09-04 1986-10-14 At&T Bell Laboratories Predictive communication system filtering arrangement
GB8621932D0 (en) * 1986-09-11 1986-10-15 British Telecomm Speech coding
US4969192A (en) * 1987-04-06 1990-11-06 Voicecraft, Inc. Vector adaptive predictive coder for speech and audio
GB8806185D0 (en) * 1988-03-16 1988-04-13 Univ Surrey Speech coding
US5029211A (en) * 1988-05-30 1991-07-02 Nec Corporation Speech analysis and synthesis system
US5247357A (en) * 1989-05-31 1993-09-21 Scientific Atlanta, Inc. Image compression method and apparatus employing distortion adaptive tree search vector quantization with avoidance of transmission of redundant image data
GB2235354A (en) * 1989-08-16 1991-02-27 Philips Electronic Associated Speech coding/encoding using celp
US5241650A (en) * 1989-10-17 1993-08-31 Motorola, Inc. Digital speech decoder having a postfilter with reduced spectral distortion
WO1991006091A1 (en) * 1989-10-17 1991-05-02 Motorola, Inc. Lpc based speech synthesis with adaptive pitch prefilter
CA2010830C (en) * 1990-02-23 1996-06-25 Jean-Pierre Adoul Dynamic codebook for efficient speech coding based on algebraic codes
JP3102015B2 (ja) * 1990-05-28 2000-10-23 日本電気株式会社 音声復号化方法
FI91457C (fi) * 1991-03-08 1994-06-27 Nokia Mobile Phones Ltd Menetelmä puheen tallentamiseksi muistivälineelle ja tallennetun puheen toistamiseksi sekä menetelmää käyttävä laite
RU2007763C1 (ru) * 1991-04-04 1994-02-15 Завод "Калугаприбор" Способ выделения основного тона из речевого сигнала
DE69232202T2 (de) * 1991-06-11 2002-07-25 Qualcomm, Inc. Vocoder mit veraendlicher bitrate
JP3076086B2 (ja) * 1991-06-28 2000-08-14 シャープ株式会社 音声合成装置用ポストフィルタ
GB9118217D0 (en) * 1991-08-23 1991-10-09 British Telecomm Speech processing apparatus
US5233660A (en) * 1991-09-10 1993-08-03 At&T Bell Laboratories Method and apparatus for low-delay celp speech coding and decoding
WO1993018505A1 (en) * 1992-03-02 1993-09-16 The Walt Disney Company Voice transformation system
US5495555A (en) * 1992-06-01 1996-02-27 Hughes Aircraft Company High quality low bit rate celp-based speech codec
US5327520A (en) * 1992-06-04 1994-07-05 At&T Bell Laboratories Method of use of voice message coder/decoder
FI91345C (fi) * 1992-06-24 1994-06-10 Nokia Mobile Phones Ltd Menetelmä kanavanvaihdon tehostamiseksi
CA2108623A1 (en) * 1992-11-02 1994-05-03 Yi-Sheng Wang Adaptive pitch pulse enhancer and method for use in a codebook excited linear prediction (celp) search loop
AU675322B2 (en) * 1993-04-29 1997-01-30 Unisearch Limited Use of an auditory model to improve quality or lower the bit rate of speech synthesis systems
DE19501517C1 (de) * 1995-01-19 1996-05-02 Siemens Ag Verfahren, Sendegerät und Empfangsgerät zur Übertragung von Sprachinformation
US5664055A (en) * 1995-06-07 1997-09-02 Lucent Technologies Inc. CS-ACELP speech compression system with adaptive pitch prediction filter gain based on a measure of periodicity

Also Published As

Publication number Publication date
DE69615839D1 (de) 2001-11-15
EP0832482A1 (en) 1998-04-01
RU2181481C2 (ru) 2002-04-20
JPH11507739A (ja) 1999-07-06
WO1997000516A1 (en) 1997-01-03
US5946651A (en) 1999-08-31
CN1192817A (zh) 1998-09-09
CN1199151C (zh) 2005-04-27
ES2146155A1 (es) 2000-07-16
EP0832482B1 (en) 2001-10-10
ATE206843T1 (de) 2001-10-15
AU714752B2 (en) 2000-01-13
CN1652207A (zh) 2005-08-10
US6029128A (en) 2000-02-22
GB9512284D0 (en) 1995-08-16
JP3483891B2 (ja) 2004-01-06
DE69615839T2 (de) 2002-05-16
BR9608479A (pt) 1999-07-06
AU6230996A (en) 1997-01-15

Similar Documents

Publication Publication Date Title
ES2146155B1 (es) Sintetizadores de voz, metodos para sintetizar voz y para mejorar una voz sintetizada y los correspondientes dispositivo de radio y señal de sintesis.
EP1094447A3 (en) Vector quantization codebook generation method
GB2185370B (en) Speech synthesis system of rule-synthesis type
US6829581B2 (en) Method for prosody generation by unit selection from an imitation speech database
NO20045257L (no) Fremgangsmate og innretning for a gjenvinne hoyfrekvensinnhold av oversamplet, syntetisert bredbandssignal
MX9505299A (es) Sistemas, metodos y articulos de fabricacion para realizar la hipotesizacion de n-cadenas optimas de alta resolucion.
SE9501026D0 (sv) Anordning vid mobilradiosystem
CA2017703A1 (en) Text-to-speech synthesizer having formant-rule and speech-parameter synthesis modes
AR001928A1 (es) Filtro para el mejoramiento y modificación de señales
CA2169822A1 (en) Synthesis of speech using regenerated phase information
ATE233424T1 (de) Stimmentransformation nach einer zielstimme
TR199600519A2 (tr) Konusma sinyallerinin olusturulmasina mahsus yöntem ve cihaz ve sinyallerin iletilmesine mahsus yöntem.
AU6353600A (en) Spectral magnitude quantization for a speech coder
EP1045372A3 (en) Speech sound communication system
CA2315324A1 (en) Speech signal decoding method and apparatus
IT1165641B (it) Sintetizzatore numerico multicanale della voce
CA2090205A1 (en) Speech coding system
NO975869L (no) Forbedret syntese av 2,4,6,8,10,12-heksabenzyl-2,4,6,8,10,12-heksaazatetracyklo£5,5,0,05,9,03,11|dodecan
EP0954849A2 (en) A method and apparatus for audio representation of speech that has been encoded according to the lpc principle, through adding noise to constituent signals therein
WO1997007499A3 (en) A method and device for preparing and using diphones for multilingual text-to-speech generating
EP0852373A3 (en) Improved synthesizer and method
AU1941697A (en) Sound source generator, voice synthesizer and voice synthesizing method
CA2224688A1 (en) Speech coder
JPS6442697A (en) Voice synthesization system
EP0606520A3 (en) Method for realizing tone curves for voice messages and method for speech synthesis and device for its application.

Legal Events

Date Code Title Description
PC2A Transfer of patent
EC2A Search report published

Date of ref document: 20000716

Kind code of ref document: A1

Effective date: 20000716

PC2A Transfer of patent
PC2A Transfer of patent

Owner name: NOKIA CORPORATION

Effective date: 20150811

PC2A Transfer of patent

Owner name: NOKIA TECHNOLOGIES OY

Effective date: 20151124

FD2A Announcement of lapse in spain

Effective date: 20160926