WO1998025260A3 - Speech synthesis using dual neural networks - Google Patents

Speech synthesis using dual neural networks

Info

Publication number
WO1998025260A3
Authority
WO
WIPO (PCT)
Prior art keywords
speech
neural networks
speech synthesis
dual neural
speech parameters
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Ceased
Application number
PCT/US1997/018815
Other languages
French (fr)
Other versions
WO1998025260A2 (en)
Inventor
Orhan Karaali
Noel Massey
Gerald Corrigan
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Motorola Solutions Inc
Original Assignee
Motorola Inc
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Motorola Inc
Priority to EP97946261A (EP0932896A2)
Publication of WO1998025260A2
Publication of WO1998025260A3
Anticipated expiration
Current legal status: Ceased

Classifications

    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L13/00 Speech synthesis; Text to speech systems
    • G10L13/02 Methods for producing synthetic speech; Speech synthesisers
    • G10L13/04 Details of speech synthesis systems, e.g. synthesiser structure or memory management
    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00 Speech recognition
    • G10L15/06 Creation of reference templates; Training of speech recognition systems, e.g. adaptation to the characteristics of the speaker's voice
    • G10L15/063 Training
    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L25/00 Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
    • G10L25/27 Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 characterised by the analysis technique
    • G10L25/30 Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 characterised by the analysis technique using neural networks

Landscapes

  • Engineering & Computer Science (AREA)
  • Computational Linguistics (AREA)
  • Health & Medical Sciences (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Human Computer Interaction (AREA)
  • Physics & Mathematics (AREA)
  • Acoustics & Sound (AREA)
  • Multimedia (AREA)
  • Compression, Expansion, Code Conversion, And Decoders (AREA)
  • Telephonic Communication Services (AREA)

Abstract

A method (500, 600), device (201 and 206) and system (203) provide, in response to text/linguistic information, efficient generation of a parametric representation of speech. A coder parameter generating system provides a principal set and a supplementary set of speech parameters, the principal set of speech parameters being the parametric representation of speech. Then feedback is provided to the coder parameter generating system using the supplementary set of speech parameters to modify the principal set of speech parameters.
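The feedback loop described in the abstract can be illustrated with a minimal sketch. The abstract does not give the network architecture, parameter dimensions, or training procedure, so everything below is an assumption made for illustration: the function names (`make_layer`, `apply_layer`, `synthesize_parameters`), the single-layer networks, the untrained random weights, and the choice to feed the supplementary parameters back as extra inputs for the next frame are all hypothetical and are not taken from the patent.

```python
import math
import random


def make_layer(n_in, n_out, rng):
    """Return a random weight matrix with n_out rows and n_in columns."""
    return [[rng.uniform(-0.5, 0.5) for _ in range(n_in)] for _ in range(n_out)]


def apply_layer(weights, inputs):
    """Apply one dense layer with a tanh activation."""
    return [math.tanh(sum(w * x for w, x in zip(row, inputs))) for row in weights]


def synthesize_parameters(frames, n_principal=4, n_supplementary=2, seed=0):
    """Produce one principal parameter vector per frame of linguistic features.

    Hypothetical layout: two networks share the same inputs. One emits the
    principal set (the parametric representation of speech); the other emits
    the supplementary set, which is fed back as extra input on the next frame.
    Weights are random and untrained; this only shows the data flow.
    """
    rng = random.Random(seed)
    n_in = len(frames[0]) + n_supplementary
    principal_net = make_layer(n_in, n_principal, rng)
    supplementary_net = make_layer(n_in, n_supplementary, rng)

    feedback = [0.0] * n_supplementary  # no feedback before the first frame
    outputs = []
    for features in frames:
        inputs = list(features) + feedback
        outputs.append(apply_layer(principal_net, inputs))
        # The supplementary set is never output; it only shapes later frames.
        feedback = apply_layer(supplementary_net, inputs)
    return outputs


if __name__ == "__main__":
    # Three identical frames of made-up linguistic features.
    frames = [[1.0, 0.0, 0.5]] * 3
    for vector in synthesize_parameters(frames):
        print([round(v, 3) for v in vector])
```

In this sketch, identical input frames yield different principal vectors once feedback is present, because the supplementary parameters from one frame alter the inputs of the next. Setting `n_supplementary=0` removes the feedback path, and identical frames then produce identical outputs.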

Priority Applications (1)

Application Number Priority Date Filing Date Title
EP97946261A EP0932896A2 (en) 1996-12-05 1997-10-15 Method, device and system for supplementary speech parameter feedback for coder parameter generating systems used in speech synthesis

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
US76162796A 1996-12-05 1996-12-05
US08/761,627 1996-12-05

Publications (2)

Publication Number Publication Date
WO1998025260A2 (en) 1998-06-11
WO1998025260A3 (en) 1998-08-06

Family

ID=25062802

Family Applications (1)

Application Number Status Publication Priority Date Filing Date Title
PCT/US1997/018815 Ceased WO1998025260A2 (en) 1996-12-05 1997-10-15 Speech synthesis using dual neural networks

Country Status (2)

Country Link
EP (1) EP0932896A2 (en)
WO (1) WO1998025260A2 (en)

Families Citing this family (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US5930754A (en) * 1997-06-13 1999-07-27 Motorola, Inc. Method, device and article of manufacture for neural-network based orthography-phonetics transformation

Citations (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US5165008A (en) * 1991-09-18 1992-11-17 U S West Advanced Technologies, Inc. Speech synthesis using perceptual linear prediction parameters

Family Cites Families (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CA2161540C (en) * 1994-04-28 2000-06-13 Orhan Karaali A method and apparatus for converting text into audible signals using a neural network

Patent Citations (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US5165008A (en) * 1991-09-18 1992-11-17 U S West Advanced Technologies, Inc. Speech synthesis using perceptual linear prediction parameters

Non-Patent Citations (2)

* Cited by examiner, † Cited by third party
Title
ARTIFICIAL NEURAL NETWORKS, 1993 THIRD INTERNATIONAL CONFERENCE ON NEURAL NETWORKS, CAWLEY G.C. et al., "LSP Speech Synthesis Using Backpropagation Networks", pages 291-294. *
See also references of EP0932896A4 *

Also Published As

Publication number Publication date
EP0932896A4 (en) 1999-09-08
WO1998025260A2 (en) 1998-06-11
EP0932896A2 (en) 1999-08-04

Similar Documents

Publication Publication Date Title
GB2331826B (en) Context dependent phoneme networks for encoding speech information
AU1191899A (en) System and method for representing complex information auditorially
AU8593998A (en) Method and system for using speech recognition to access the internet, including access via a telephone
AU4705796A (en) System and method for generating and using context dependent sub-syllable models to recognize a tonal language
WO1998024020A3 (en) Method and system for generating software code
AU1067900A (en) Network and language models for use in a speech recognition system
WO1999066496A8 (en) Intelligent text-to-speech synthesis
ZA983549B (en) Method for producing oxidized product and generating power using a solid electrolyte membrane integrated with a gas turbine
AU5170793A (en) Improved membrane computer keyboard and method
CA2161540A1 (en) A Method and Apparatus for Converting Text Into Audible Signals Using a Neural Network
MXPA98008052A (en) Method and apparatus for generating semantically consistent inputs to a dialog manager.
AU3274395A (en) Method and system for continuous speech recognition using voting techniques
KR970000310A (en) A method of generating oxygen and generating power using a solid electrolyte membrane integrated with a gas turbine
EP0750293A3 (en) State transition model design method and voice recognition method and apparatus using same
EP0922279A3 (en) Method and apparatus for executing a human-machine dialogue in the form of two-sided speech as based on a modular dialogue structure
AU5017393A (en) Keyboard and method for producing
AU1028697A (en) Method of operating a gas-turbine-powered generating set using low-calorific-value fuel
AU680788B2 (en) Method for producing oxygen and hydrogen
GB9824762D0 (en) Self-service terminal
EP0586714A4 (en) Speech recognition apparatus using neural network, and learning method therefor
GR3032375T3 (en) Speech recognition based on HMMs.
EP0646896A3 (en) System and method for generating a solid model.
BR9610837A (en) Reactor composite membrane and method for the synthesis of hydrogen peroxide
AUPO199796A0 (en) Method and device for generating hydrogen and oxygen
WO1998025260A3 (en) Speech synthesis using dual neural networks

Legal Events

Date Code Title Description
AL Designated countries for regional patents

Kind code of ref document: A2

Designated state(s): AT BE CH DE DK ES FI FR GB GR IE IT LU MC NL PT SE

WWE Wipo information: entry into national phase

Ref document number: 1997946261

Country of ref document: EP

121 Ep: the epo has been informed by wipo that ep was designated in this application
WWP Wipo information: published in national office

Ref document number: 1997946261

Country of ref document: EP

WWW Wipo information: withdrawn in national office

Ref document number: 1997946261

Country of ref document: EP