SE519244C2 - Device and method for speech synthesis (Anordning och metod vid talsyntes) - Google Patents

Device and method for speech synthesis

Info

Publication number
SE519244C2
Authority
SE
Sweden
Prior art keywords
face
polyphones
language
model
image
Application number
SE9504367A
Other languages
English (en)
Swedish (sv)
Other versions
SE9504367D0 (sv)
SE9504367L (sv)
Inventor
Bertil Lyberg
Original Assignee
Telia Ab
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
1995-12-06
Filing date
1995-12-06
Publication date
2003-02-04
Application filed by Telia Ab
Priority to SE9504367A (SE519244C2)
Publication of SE9504367D0
Priority to DE69632901T (DE69632901T2)
Priority to EP96850181A (EP0778560B1)
Priority to DK96850181T (DK0778560T3)
Priority to NO19965147A (NO311546B1)
Priority to US08/760,811 (US5826234A)
Publication of SE9504367L
Publication of SE519244C2

Classifications

    • G PHYSICS
    • G06 COMPUTING OR CALCULATING; COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F40/00 Handling natural language data
    • G06F40/40 Processing or translation of natural language
    • G06F40/42 Data-driven translation
    • G06F40/47 Machine-assisted translation, e.g. using translation memory
    • G PHYSICS
    • G06 COMPUTING OR CALCULATING; COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F40/00 Handling natural language data
    • G06F40/40 Processing or translation of natural language
    • G06F40/55 Rule-based translation
    • G PHYSICS
    • G06 COMPUTING OR CALCULATING; COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F40/00 Handling natural language data
    • G06F40/40 Processing or translation of natural language
    • G06F40/55 Rule-based translation
    • G06F40/56 Natural language generation
    • G PHYSICS
    • G06 COMPUTING OR CALCULATING; COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F40/00 Handling natural language data
    • G06F40/40 Processing or translation of natural language
    • G06F40/58 Use of machine translation, e.g. for multi-lingual retrieval, for server-side translation for client devices or for real-time translation
    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L13/00 Speech synthesis; Text to speech systems
    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L13/00 Speech synthesis; Text to speech systems
    • G10L13/02 Methods for producing synthetic speech; Speech synthesisers
    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L21/00 Speech or voice signal processing techniques to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
    • G10L21/06 Transformation of speech into a non-audible representation, e.g. speech visualisation or speech processing for tactile aids
    • G10L21/10 Transforming into visible information
    • G10L2021/105 Synthesis of the lips movements from speech, e.g. for talking heads

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Computational Linguistics (AREA)
  • Health & Medical Sciences (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Physics & Mathematics (AREA)
  • Artificial Intelligence (AREA)
  • General Health & Medical Sciences (AREA)
  • General Engineering & Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Multimedia (AREA)
  • Acoustics & Sound (AREA)
  • Human Computer Interaction (AREA)
  • Processing Or Creating Images (AREA)
  • Image Processing (AREA)
  • Electrically Operated Instructional Devices (AREA)
  • Machine Translation (AREA)

Priority Applications (6)

Application Number Priority Date Filing Date Title
SE9504367A SE519244C2 (sv) 1995-12-06 1995-12-06 Device and method for speech synthesis
DE69632901T DE69632901T2 (de) 1995-12-06 1996-10-30 Device and method for speech synthesis
EP96850181A EP0778560B1 (en) 1995-12-06 1996-10-30 Device and method at speech synthesis
DK96850181T DK0778560T3 (da) 1995-12-06 1996-10-30 Device and method for speech synthesis
NO19965147A NO311546B1 (no) 1995-12-06 1996-12-03 Device and method for speech synthesis
US08/760,811 US5826234A (en) 1995-12-06 1996-12-05 Device and method for dubbing an audio-visual presentation which generates synthesized speech and corresponding facial movements

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
SE9504367A SE519244C2 (sv) 1995-12-06 1995-12-06 Device and method for speech synthesis

Publications (3)

Publication Number Publication Date
SE9504367D0 (sv) 1995-12-06
SE9504367L (sv) 1997-06-07
SE519244C2 (sv) 2003-02-04

Family

ID=20400494

Family Applications (1)

Application Number Priority Date Filing Date Title
SE9504367A SE519244C2 (sv) 1995-12-06 1995-12-06 Device and method for speech synthesis

Country Status (6)

Country Link
US (1) US5826234A (no)
EP (1) EP0778560B1 (no)
DE (1) DE69632901T2 (no)
DK (1) DK0778560T3 (no)
NO (1) NO311546B1 (no)
SE (1) SE519244C2 (no)

Families Citing this family (38)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
SE519679C2 (sv) * 1997-03-25 2003-03-25 Telia Ab Method for speech synthesis
SE520065C2 (sv) * 1997-03-25 2003-05-20 Telia Ab Device and method for prosody generation in visual speech synthesis
SE511927C2 (sv) * 1997-05-27 1999-12-20 Telia Ab Improvements in, or relating to, visual speech synthesis
US6016148A (en) * 1997-06-06 2000-01-18 Digital Equipment Corporation Automated mapping of facial images to animation wireframes topologies
US6567779B1 (en) * 1997-08-05 2003-05-20 At&T Corp. Method and system for aligning natural and synthetic video to speech synthesis
US7366670B1 (en) 1997-08-05 2008-04-29 At&T Corp. Method and system for aligning natural and synthetic video to speech synthesis
WO1999012128A1 (en) * 1997-09-01 1999-03-11 Koninklijke Philips Electronics N.V. A method and apparatus for synchronizing a computer-animated model with an audio wave output
AU2998099A (en) * 1998-03-11 1999-09-27 Entropic, Inc. Face synthesis system and methodology
US20020069048A1 (en) * 2000-04-07 2002-06-06 Sadhwani Deepak Kishinchand Communication system
DE10018143C5 (de) * 2000-04-12 2012-09-06 Oerlikon Trading Ag, Trübbach DLC layer system and method and device for producing such a layer system
US7106887B2 (en) * 2000-04-13 2006-09-12 Fuji Photo Film Co., Ltd. Image processing method using conditions corresponding to an identified person
AU2001292963A1 (en) * 2000-09-21 2002-04-02 The Regents Of The University Of California Visual display methods for use in computer-animated speech production models
US20080040227A1 (en) 2000-11-03 2008-02-14 At&T Corp. System and method of marketing using a multi-media communication system
US6963839B1 (en) 2000-11-03 2005-11-08 At&T Corp. System and method of controlling sound in a multi-media communication application
US7091976B1 (en) 2000-11-03 2006-08-15 At&T Corp. System and method of customizing animated entities for use in a multi-media communication application
US6976082B1 (en) 2000-11-03 2005-12-13 At&T Corp. System and method for receiving multi-media messages
US7203648B1 (en) 2000-11-03 2007-04-10 At&T Corp. Method for sending multi-media messages with customized audio
US7035803B1 (en) 2000-11-03 2006-04-25 At&T Corp. Method for sending multi-media messages using customizable background images
US6990452B1 (en) 2000-11-03 2006-01-24 At&T Corp. Method for sending multi-media messages using emoticons
KR100831755B1 (ko) * 2000-11-17 2008-05-23 Tate & Lyle Public Limited Company Meltable form of sucralose
US6778252B2 (en) * 2000-12-22 2004-08-17 Film Language Film language
US6661418B1 (en) * 2001-01-22 2003-12-09 Digital Animations Limited Character animation system
US7671861B1 (en) 2001-11-02 2010-03-02 At&T Intellectual Property Ii, L.P. Apparatus and method of customizing animated entities for use in a multi-media communication application
US7663628B2 (en) * 2002-01-22 2010-02-16 Gizmoz Israel 2002 Ltd. Apparatus and method for efficient animation of believable speaking 3D characters in real time
US7209882B1 (en) 2002-05-10 2007-04-24 At&T Corp. System and method for triphone-based unit selection for visual speech synthesis
US8788274B1 (en) 2003-07-03 2014-07-22 Jose Estevan Guzman Language converter and transmitting system
GB0606977D0 (en) * 2006-04-06 2006-05-17 Freemantle Media Ltd Interactive video medium
CN101971262A (zh) * 2007-12-21 2011-02-09 Koninklijke Philips Electronics N.V. Method and device for playing pictures
US8655152B2 (en) 2012-01-31 2014-02-18 Golden Monkey Entertainment Method and system of presenting foreign films in a native language
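KR20140146965A (ko) * 2013-06-18 2014-12-29 Samsung Electronics Co., Ltd. Display apparatus, conversion system including a server, and control method of the display apparatus
KR102127351B1 (ko) 2013-07-23 2020-06-26 Samsung Electronics Co., Ltd. User terminal device and control method thereof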
US9564128B2 (en) * 2013-12-09 2017-02-07 Qualcomm Incorporated Controlling a speech recognition process of a computing device
US9607609B2 (en) * 2014-09-25 2017-03-28 Intel Corporation Method and apparatus to synthesize voice based on facial structures
WO2017137947A1 (en) * 2016-02-10 2017-08-17 Vats Nitin Producing realistic talking face with expression using images text and voice
US10657972B2 (en) * 2018-02-02 2020-05-19 Max T. Hall Method of translating and synthesizing a foreign language
US11908478B2 (en) * 2021-08-04 2024-02-20 Q (Cue) Ltd. Determining speech from facial skin movements using a housing supported by ear or associated with an earphone
US12216749B2 (en) 2021-08-04 2025-02-04 Q (Cue) Ltd. Using facial skin micromovements to identify a user
JP2025528023A (ja) 2022-07-20 2025-08-26 Q (Cue) Ltd. Detection and utilization of facial micromovements

Family Cites Families (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US5111409A (en) * 1989-07-21 1992-05-05 Elon Gasper Authoring and use systems for sound synchronized animation
US5293584A (en) * 1992-05-21 1994-03-08 International Business Machines Corporation Speech recognition system for natural language translation
US5687280A (en) * 1992-11-02 1997-11-11 Matsushita Electric Industrial Co., Ltd. Speech input device including display of spatial displacement of lip position relative to predetermined position
US5482048A (en) * 1993-06-30 1996-01-09 University Of Pittsburgh System and method for measuring and quantitating facial movements
JPH07302351A (ja) * 1994-05-09 1995-11-14 Canon Inc Image and voice response device and image and voice response method
US5657426A (en) * 1994-06-10 1997-08-12 Digital Equipment Corporation Method and apparatus for producing audio-visual synthetic speech
US5615301A (en) * 1994-09-28 1997-03-25 Rivers; W. L. Automated language translation system

Also Published As

Publication number Publication date
SE9504367D0 (sv) 1995-12-06
DE69632901T2 (de) 2005-08-04
NO311546B1 (no) 2001-12-03
EP0778560B1 (en) 2004-07-14
EP0778560A2 (en) 1997-06-11
NO965147L (no) 1997-06-09
NO965147D0 (no) 1996-12-03
DE69632901D1 (de) 2004-08-19
DK0778560T3 (da) 2004-11-22
SE9504367L (sv) 1997-06-07
EP0778560A3 (en) 1998-09-09
US5826234A (en) 1998-10-20

Similar Documents

Publication Publication Date Title
SE519244C2 (sv) Device and method for speech synthesis
US5940797A (en) Speech synthesis method utilizing auxiliary information, medium recorded thereon the method and apparatus utilizing the method
US5880788A (en) Automated synchronization of video image sequences to new soundtracks
Chen et al. Audio-visual integration in multimodal communication
US6250928B1 (en) Talking facial display method and apparatus
US6097381A (en) Method and apparatus for synthesizing realistic animations of a human speaking using a computer
Kshirsagar et al. Visyllable based speech animation
KR102778688B1 (ko) System for synthesizing a speaking video of a photorealistic person according to a human voice
US6389396B1 (en) Device and method for prosody generation at visual synthesis
JP4631078B2 (ja) Statistical probability model creation device for lip-sync animation creation, parameter sequence synthesis device, lip-sync animation creation system, and computer program
US6385580B1 (en) Method of speech synthesis
Cox et al. The development and evaluation of a speech-to-sign translation system to assist transactions
Minnis et al. Modeling visual coarticulation in synthetic talking heads using a lip motion unit inventory with concatenative synthesis.
JPH03273280A (ja) Speech synthesis method for vocalization practice
Bothe et al. The Development of a Computer Animation Program for the Teaching of
Hwang et al. Neural network-based F0 text-to-speech synthesiser for Mandarin
Nordstrand et al. Measurements of articulatory variation and communicative signals in expressive speech.
Goecke A stereo vision lip tracking algorithm and subsequent statistical analyses of the audio-video correlation in Australian English
CN117727306B (zh) Sound pickup and translation method, device, and storage medium based on native voiceprint features
Chen et al. Speech driven MPEG-4 based face animation via neural network
Granström et al. Eyebrow movements as a cue to prominence
Williams Speech-to-video conversion for individuals with impaired hearing
CN114515138A (zh) Language disorder assessment and correction system
Stork et al. Machine recognition and applications
Cole et al. A platform for multilingual research in spoken dialogue systems

Legal Events

Date Code Title Description
NUG Patent has lapsed