SE520065C2 - Anordning och metod för prosodigenerering vid visuell talsyntes - Google Patents

Anordning och metod för prosodigenerering vid visuell talsyntes

Info

Publication number
SE520065C2
SE520065C2 SE9701101A SE9701101A SE520065C2 SE 520065 C2 SE520065 C2 SE 520065C2 SE 9701101 A SE9701101 A SE 9701101A SE 9701101 A SE9701101 A SE 9701101A SE 520065 C2 SE520065 C2 SE 520065C2
Authority
SE
Sweden
Prior art keywords
face
movement
speech
recorded
words
Prior art date
Application number
SE9701101A
Other languages
English (en)
Swedish (sv)
Other versions
SE9701101L (sv
SE9701101D0 (sv
Inventor
Bertil Lyberg
Original Assignee
Telia Ab
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Telia Ab filed Critical Telia Ab
Priority to SE9701101A priority Critical patent/SE520065C2/sv
Publication of SE9701101D0 publication Critical patent/SE9701101D0/xx
Priority to DK98911338T priority patent/DK0970465T3/da
Priority to PCT/SE1998/000506 priority patent/WO1998043235A2/en
Priority to DE69816049T priority patent/DE69816049T2/de
Priority to EP98911338A priority patent/EP0970465B1/en
Priority to EEP199900419A priority patent/EE03883B1/et
Priority to JP54446198A priority patent/JP2001517326A/ja
Priority to US09/381,632 priority patent/US6389396B1/en
Publication of SE9701101L publication Critical patent/SE9701101L/xx
Priority to NO19994599A priority patent/NO318698B1/no
Publication of SE520065C2 publication Critical patent/SE520065C2/sv

Links

Classifications

    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06TIMAGE DATA PROCESSING OR GENERATION, IN GENERAL
    • G06T13/00Animation
    • G06T13/20Three-dimensional [3D] animation
    • G06T13/205Three-dimensional [3D] animation driven by audio data
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06TIMAGE DATA PROCESSING OR GENERATION, IN GENERAL
    • G06T13/00Animation
    • G06T13/20Three-dimensional [3D] animation
    • G06T13/40Three-dimensional [3D] animation of characters, e.g. humans, animals or virtual beings
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L13/00Speech synthesis; Text to speech systems
    • G10L13/06Elementary speech units used in speech synthesisers; Concatenation rules
    • G10L13/07Concatenation rules
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L13/00Speech synthesis; Text to speech systems
    • G10L13/02Methods for producing synthetic speech; Speech synthesisers
    • G10L13/04Details of speech synthesis systems, e.g. synthesiser structure or memory management
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L21/00Speech or voice signal processing techniques to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
    • G10L21/06Transformation of speech into a non-audible representation, e.g. speech visualisation or speech processing for tactile aids
    • G10L21/10Transforming into visible information
    • G10L2021/105Synthesis of the lips movements from speech, e.g. for talking heads

Landscapes

  • Engineering & Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • General Physics & Mathematics (AREA)
  • Theoretical Computer Science (AREA)
  • Multimedia (AREA)
  • Health & Medical Sciences (AREA)
  • Computational Linguistics (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Human Computer Interaction (AREA)
  • Acoustics & Sound (AREA)
  • Machine Translation (AREA)
  • Processing Or Creating Images (AREA)
  • Steroid Compounds (AREA)
SE9701101A 1997-03-25 1997-03-25 Anordning och metod för prosodigenerering vid visuell talsyntes SE520065C2 (sv)

Priority Applications (9)

Application Number Priority Date Filing Date Title
SE9701101A SE520065C2 (sv) 1997-03-25 1997-03-25 Anordning och metod för prosodigenerering vid visuell talsyntes
US09/381,632 US6389396B1 (en) 1997-03-25 1998-03-20 Device and method for prosody generation at visual synthesis
EP98911338A EP0970465B1 (en) 1997-03-25 1998-03-20 Device and method for prosody generation for visual synthesis
PCT/SE1998/000506 WO1998043235A2 (en) 1997-03-25 1998-03-20 Device and method for prosody generation at visual synthesis
DE69816049T DE69816049T2 (de) 1997-03-25 1998-03-20 Vorrichtung und verfahren zur prosodie-erzeugung bei der visuellen synthese
DK98911338T DK0970465T3 (da) 1997-03-25 1998-03-20 Indretning og fremgangsmåde til prosodigenerering til visuel syntese
EEP199900419A EE03883B1 (et) 1997-03-25 1998-03-20 Seade ja meetod prosoodia genereerimiseks visuaalsünteesil
JP54446198A JP2001517326A (ja) 1997-03-25 1998-03-20 視覚的合成における韻律生成のための装置および方法
NO19994599A NO318698B1 (no) 1997-03-25 1999-09-22 Anordning og fremgangsmate for prosodigenering av visuell syntese

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
SE9701101A SE520065C2 (sv) 1997-03-25 1997-03-25 Anordning och metod för prosodigenerering vid visuell talsyntes

Publications (3)

Publication Number Publication Date
SE9701101D0 SE9701101D0 (sv) 1997-03-25
SE9701101L SE9701101L (sv) 1998-09-26
SE520065C2 true SE520065C2 (sv) 2003-05-20

Family

ID=20406308

Family Applications (1)

Application Number Title Priority Date Filing Date
SE9701101A SE520065C2 (sv) 1997-03-25 1997-03-25 Anordning och metod för prosodigenerering vid visuell talsyntes

Country Status (9)

Country Link
US (1) US6389396B1 (et)
EP (1) EP0970465B1 (et)
JP (1) JP2001517326A (et)
DE (1) DE69816049T2 (et)
DK (1) DK0970465T3 (et)
EE (1) EE03883B1 (et)
NO (1) NO318698B1 (et)
SE (1) SE520065C2 (et)
WO (1) WO1998043235A2 (et)

Families Citing this family (12)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US6947044B1 (en) * 1999-05-21 2005-09-20 Kulas Charles J Creation and playback of computer-generated productions using script-controlled rendering engines
US20020194006A1 (en) * 2001-03-29 2002-12-19 Koninklijke Philips Electronics N.V. Text to visual speech system and method incorporating facial emotions
CN1159702C (zh) 2001-04-11 2004-07-28 国际商业机器公司 具有情感的语音-语音翻译系统和方法
US7076430B1 (en) 2002-05-16 2006-07-11 At&T Corp. System and method of providing conversational visual prosody for talking heads
US20060009978A1 (en) * 2004-07-02 2006-01-12 The Regents Of The University Of Colorado Methods and systems for synthesis of accurate visible speech via transformation of motion capture data
JP4985714B2 (ja) * 2009-06-12 2012-07-25 カシオ計算機株式会社 音声表示出力制御装置、および音声表示出力制御処理プログラム
US8447610B2 (en) * 2010-02-12 2013-05-21 Nuance Communications, Inc. Method and apparatus for generating synthetic speech with contrastive stress
US8949128B2 (en) * 2010-02-12 2015-02-03 Nuance Communications, Inc. Method and apparatus for providing speech output for speech-enabled applications
US8571870B2 (en) * 2010-02-12 2013-10-29 Nuance Communications, Inc. Method and apparatus for generating synthetic speech with contrastive stress
AU2012100262B4 (en) * 2011-12-15 2012-05-24 Nguyen, Phan Thi My Ngoc Ms Speech visualisation tool
JP2012098753A (ja) * 2012-01-27 2012-05-24 Casio Comput Co Ltd 音声表示出力制御装置、画像表示制御装置、および音声表示出力制御処理プログラム、画像表示制御処理プログラム
CN112100352B (zh) * 2020-09-14 2024-08-20 北京百度网讯科技有限公司 与虚拟对象的对话方法、装置、客户端及存储介质

Family Cites Families (10)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JP2518683B2 (ja) * 1989-03-08 1996-07-24 国際電信電話株式会社 画像合成方法及びその装置
GB9019829D0 (en) 1990-09-11 1990-10-24 British Telecomm Speech analysis and image synthesis
US5878396A (en) * 1993-01-21 1999-03-02 Apple Computer, Inc. Method and apparatus for synthetic speech in facial animation
US6122616A (en) * 1993-01-21 2000-09-19 Apple Computer, Inc. Method and apparatus for diphone aliasing
SE9301596L (sv) 1993-05-10 1994-05-24 Televerket Anordning för att öka talförståelsen vid översätttning av tal från ett första språk till ett andra språk
SE516526C2 (sv) 1993-11-03 2002-01-22 Telia Ab Metod och anordning vid automatisk extrahering av prosodisk information
US5657426A (en) * 1994-06-10 1997-08-12 Digital Equipment Corporation Method and apparatus for producing audio-visual synthetic speech
KR960018988A (ko) 1994-11-07 1996-06-17 엠, 케이. 영 음향 보조 영상 처리 방법 및 장치
SE519244C2 (sv) 1995-12-06 2003-02-04 Telia Ab Anordning och metod vid talsyntes
SE9600959L (sv) * 1996-03-13 1997-09-14 Telia Ab Metod och anordning vid tal-till-talöversättning

Also Published As

Publication number Publication date
SE9701101L (sv) 1998-09-26
JP2001517326A (ja) 2001-10-02
DK0970465T3 (da) 2003-10-27
DE69816049D1 (de) 2003-08-07
EP0970465A2 (en) 2000-01-12
DE69816049T2 (de) 2004-04-22
WO1998043235A2 (en) 1998-10-01
EE9900419A (et) 2000-04-17
WO1998043235A3 (en) 1998-12-23
SE9701101D0 (sv) 1997-03-25
NO994599L (no) 1999-12-14
EE03883B1 (et) 2002-10-15
EP0970465B1 (en) 2003-07-02
NO318698B1 (no) 2005-04-25
US6389396B1 (en) 2002-05-14
NO994599D0 (no) 1999-09-22

Similar Documents

Publication Publication Date Title
SE519244C2 (sv) Anordning och metod vid talsyntes
Tatham et al. Developments in speech synthesis
Granström et al. Prosodic cues in multimodal speech perception
US8364488B2 (en) Voice models for document narration
Bulut et al. Expressive speech synthesis using a concatenative synthesizer.
US20010042057A1 (en) Emotion expressing device
Gahlawat et al. Natural speech synthesizer for blind persons using hybrid approach
SE520065C2 (sv) Anordning och metod för prosodigenerering vid visuell talsyntes
Lundeberg et al. Developing a 3D-agent for the august dialogue system.
US6385580B1 (en) Method of speech synthesis
Aaron et al. Conversational computers
JP2806364B2 (ja) 発声訓練装置
Minnis et al. Modeling visual coarticulation in synthetic talking heads using a lip motion unit inventory with concatenative synthesis.
De Pijper High-quality message-to-speech generation in a practical application
Roehling et al. Towards expressive speech synthesis in english on a robotic platform
JPH03273280A (ja) 発声練習用音声合成方式
Theobald Audiovisual speech synthesis
Ouni et al. Internationalization of a talking head
Klabbers High-quality speech output generation through advanced phrase concatenation
Greenberg Pronunciation variation is key to understanding spoken language
Granström et al. Eyebrow movements as a cue to prominence
Gambino et al. Virtual conversation with a real talking head
Safabakhsh et al. AUT-Talk: a farsi talking head
Farner et al. Voice transformation and speech synthesis for video games
Karjalainen Review of speech synthesis technology

Legal Events

Date Code Title Description
NUG Patent has lapsed