JP5120826B2 - 発音診断装置、発音診断方法、記録媒体、及び、発音診断プログラム - Google Patents
発音診断装置、発音診断方法、記録媒体、及び、発音診断プログラム Download PDFInfo
- Publication number
- JP5120826B2 JP5120826B2 JP2006147171A JP2006147171A JP5120826B2 JP 5120826 B2 JP5120826 B2 JP 5120826B2 JP 2006147171 A JP2006147171 A JP 2006147171A JP 2006147171 A JP2006147171 A JP 2006147171A JP 5120826 B2 JP5120826 B2 JP 5120826B2
- Authority
- JP
- Japan
- Prior art keywords
- articulatory
- attribute
- state
- pronunciation
- tongue
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Active
Links
Images
Classifications
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L25/00—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
- G10L25/48—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 specially adapted for particular use
- G10L25/51—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 specially adapted for particular use for comparison or discrimination
-
- G—PHYSICS
- G09—EDUCATION; CRYPTOGRAPHY; DISPLAY; ADVERTISING; SEALS
- G09B—EDUCATIONAL OR DEMONSTRATION APPLIANCES; APPLIANCES FOR TEACHING, OR COMMUNICATING WITH, THE BLIND, DEAF OR MUTE; MODELS; PLANETARIA; GLOBES; MAPS; DIAGRAMS
- G09B19/00—Teaching not covered by other main groups of this subclass
- G09B19/04—Speaking
-
- G—PHYSICS
- G09—EDUCATION; CRYPTOGRAPHY; DISPLAY; ADVERTISING; SEALS
- G09B—EDUCATIONAL OR DEMONSTRATION APPLIANCES; APPLIANCES FOR TEACHING, OR COMMUNICATING WITH, THE BLIND, DEAF OR MUTE; MODELS; PLANETARIA; GLOBES; MAPS; DIAGRAMS
- G09B19/00—Teaching not covered by other main groups of this subclass
- G09B19/06—Foreign languages
-
- G—PHYSICS
- G09—EDUCATION; CRYPTOGRAPHY; DISPLAY; ADVERTISING; SEALS
- G09B—EDUCATIONAL OR DEMONSTRATION APPLIANCES; APPLIANCES FOR TEACHING, OR COMMUNICATING WITH, THE BLIND, DEAF OR MUTE; MODELS; PLANETARIA; GLOBES; MAPS; DIAGRAMS
- G09B5/00—Electrically-operated educational appliances
- G09B5/06—Electrically-operated educational appliances with both visual and audible presentation of the material to be studied
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/24—Speech recognition using non-acoustical features
- G10L15/25—Speech recognition using non-acoustical features using position of the lips, movement of the lips or face analysis
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L17/00—Speaker identification or verification techniques
- G10L17/26—Recognition of special voice characteristics, e.g. for use in lie detectors; Recognition of animal voices
Landscapes
- Engineering & Computer Science (AREA)
- Physics & Mathematics (AREA)
- Business, Economics & Management (AREA)
- Audiology, Speech & Language Pathology (AREA)
- Health & Medical Sciences (AREA)
- Acoustics & Sound (AREA)
- Educational Administration (AREA)
- Educational Technology (AREA)
- General Physics & Mathematics (AREA)
- Theoretical Computer Science (AREA)
- Human Computer Interaction (AREA)
- Multimedia (AREA)
- Entrepreneurship & Innovation (AREA)
- Computational Linguistics (AREA)
- General Health & Medical Sciences (AREA)
- Signal Processing (AREA)
- Electrically Operated Instructional Devices (AREA)
- Measurement Of The Respiration, Hearing Ability, Form, And Blood Characteristics Of Living Organisms (AREA)
Description
話者が発した音声信号からの音響的特徴としての周波数的特徴量、音量、持続時間、それらの変化量、またはそれらの変化パターンおよびそれらの少なくとも一つ以上の組合せを抽出する手段と、抽出された音響的特徴に基づいて、前記調音的属性に関する属性値を推定する属性値推定手段と、推定された属性値を前記望ましい調音的属性データと比較することにより、発声者の発音に関する判定を行う手段とを備える。
従って本発明は、話者の発した単語の音声を登録されているスペルの単語に対応付けることによって、発音を診断するものであるので、単語を構成する音素ごとに、正しい調音器官の状態や調音の様式で発音が行われているか否かを診断することができる。よって、本発明により話者に正しい調音器官の状態や様式で発音するように指導することができる。
図6に、カテゴリの例を示す。
Vb/Vo/Vc/Vo)を出力する。
(1)有声音の分類例
強い狭窄を伴う子音(Vc)
強い狭窄を伴わない子音や母音(Vo)
有声破裂音(Vb)
(2)無声音の分類例
無声破裂音(Bu)
その他の無声音(Vl)
(3)無音の音間(Sl)
Claims (9)
- 音声言語体系毎に、それを構成する子音毎に、その子音を発声する際の望ましい発音に対応する調音的属性値を有する調音的属性データベースと、
発声者が発した音声信号から子音の音響的特徴を抽出する手段と、
前記子音毎に予め定められた複数の種類の調音的属性の各々について形成される複数の分布であって、当該子音を発声する際の複数の音響的特徴の組合せによりそれぞれ定められる複数の分布の各々を、前記調音的属性への属否の境界となる閾値によって複数の領域に分割し、抽出された音響的特徴がいずれの領域に属するかを判定することにより、前記複数の種類の調音的属性の調音的属性値をそれぞれ推定する属性値推定手段と、
前記推定された複数の種類の調音的属性の調音的属性値を前記望ましい発音に対応する複数の種類の調音的属性の調音的属性値とそれぞれ比較することにより、前記発声者の子音の発音に関する判定を行う手段と、
を備え、
前記調音的属性値は、調音器官状態と、前記調音器官状態に対する力の入れ方と、前記調音器官状態に対する呼気の状態と、のうちの少なくとも一つを含む調音的属性を数値化した値であって、
前記調音器官状態は、舌の高さと、舌の位置と、舌の形状と、舌の動きと、唇の形状と、唇の開き方と、唇の動きと、声門の状態と、声帯の状態と、口蓋垂の状態と、鼻腔の状態と、上下の歯の位置と、顎の状態と、顎の動きと、のうちの少なくとも一つを含み、
前記音響的特徴は、周波数的特徴量と、音量と、持続時間と、前記周波数的特徴量の変化量と、前記音量の変化量と、前記持続時間の変化量と、前記周波数的特徴量の変化パターンと、前記音量の変化パターンと、前記持続時間の変化パターンと、のうちの少なくとも一つを含む、
発音診断装置。 - 前記発声者の発音診断結果を出力する手段を備えることを特徴とする請求項1記載の発音診断装置。
- 音声言語体系毎に、それを構成する子音毎に、当該子音に対して予め定められた複数の種類の調音的属性の各々について、調音的属性の分布を形成するための調音的属性分布形成手段と、
発声者が発した音声信号に含まれる子音の音響的特徴を抽出する音響的特徴抽出手段と、
前記調音的属性分布形成手段で形成された分布の各々を前記調音的属性への属否の境界となる音響的特徴の閾値でもって複数の領域に分割し、前記抽出された子音の音響的特徴がいずれの領域に属するかによって前記複数の種類の調音的属性の調音的属性値をそれぞれ決定する属性値推定手段と、
前記決定された複数の種類の調音的属性の調音的属性値を、前記子音を発声する際の望ましい発音に対応する複数の種類の調音的属性の調音的属性値と比較することにより、前記発声者の子音の発音に関する判定を行う手段と、
を備え、
前記調音的属性の分布の各々は、前記子音を発声する際の複数の音響的特徴の組合せにより定められ、
前記音響的特徴は、周波数的特徴量と、音量と、持続時間と、前記周波数的特徴量の変化量と、前記音量の変化量と、前記持続時間の変化量と、前記周波数的特徴量の変化パターンと、前記音量の変化パターンと、前記持続時間の変化パターンと、のうちの少なくとも一つを含み、
前記調音的属性は、調音器官状態と、前記調音器官状態に対する力の入れ方と、前記調音器官状態に対する呼気の状態と、のうちの少なくとも一つを含み、
前記調音的属性値は、前記調音的属性を数値化した値であって、
前記調音器官状態は、舌の高さと、舌の位置と、舌の形状と、舌の動きと、唇の形状と、唇の開き方と、唇の動きと、声門の状態と、声帯の状態と、口蓋垂の状態と、鼻腔の状態と、上下の歯の位置と、顎の状態と、顎の動きと、のうちの少なくとも一つを含む、
発音診断装置。 - 前記閾値を可変する閾値可変手段を備えることを特徴とする請求項3記載の発音診断装置。
- 発声者が発した音声信号から子音の音響的特徴を抽出する工程と、
前記子音毎に予め定められた複数の種類の調音的属性の各々について形成される複数の分布であって、当該子音を発声する際の複数の音響的特徴の組合せによりそれぞれ定められる複数の分布の各々を、前記調音的属性への属否の境界となる閾値によって複数の領域に分割し、前記抽出された子音の音響的特徴がいずれの領域に属するかを判定することにより、前記複数の種類の調音的属性の調音的属性値をそれぞれ推定する属性値推定工程と、
前記推定された複数の種類の調音的属性の調音的属性値を望ましい子音の発音に対応する複数の種類の調音的属性の調音的属性値とそれぞれ比較して前記発声者の子音の発音に関する判定を行う工程と、
発声者の発音診断結果を出力する工程と、
を備え、
前記調音的属性値は、音声言語体系毎に、それを構成する音素毎に、その音素を発声する際の調音的属性を数値化した値であって、
前記調音的属性は、調音器官状態と、前記調音器官状態に対する力の入れ方と、前記調音器官状態に対する呼気の状態と、のうちの少なくとも一つを含み、
前記調音器官状態は、舌の高さと、舌の位置と、舌の形状と、舌の動きと、唇の形状と、唇の開き方と、唇の動きと、声門の状態と、声帯の状態と、口蓋垂の状態と、鼻腔の状態と、上下の歯の位置と、顎の状態と、顎の動きと、のうちの少なくとも一つを含み、
前記音響的特徴は、周波数的特徴量と、音量と、持続時間と、前記周波数的特徴量の変化量と、前記音量の変化量と、前記持続時間の変化量と、前記周波数的特徴量の変化パターンと、前記音量の変化パターンと、前記持続時間の変化パターンと、のうちの少なくとも一つを含む、
発音診断方法。 - 音声言語体系毎に、それを構成する子音毎に、当該子音に対して予め定められた複数の種類の調音的属性の各々について、調音的属性の分布を形成するための調音的属性分布形成工程と、
発声者が発した音声信号に含まれる子音の音響的特徴を抽出する音響的特徴抽出工程と、
前記調音的属性分布形成工程において形成された分布の各々を前記調音的属性への属否の境界となる音響的特徴の閾値でもって複数の領域に分割し、前記抽出された子音の音響的特徴がいずれの領域に属するかによって前記複数の種類の調音的属性の調音的属性値をそれぞれ決定する属性値推定工程と、
前記決定された複数の種類の調音的属性の調音的属性値を、前記子音を発声する際の望ましい発音に対応する複数の種類の調音的属性の調音的属性値と比較して前記発声者の子音の発音に関する判定を行う工程と、
を備え、
前記調音的属性の分布の各々は、前記子音を発声する際の複数の音響的特徴の組合せにより定められ、
前記音響的特徴は、周波数的特徴量と、音量と、持続時間と、前記周波数的特徴量の変化量と、前記音量の変化量と、前記持続時間の変化量と、前記周波数的特徴量の変化パターンと、前記音量の変化パターンと、前記持続時間の変化パターンと、のうちの少なくとも一つを含み、
前記調音的属性は、調音器官状態と、前記調音器官状態に対する力の入れ方と、前記調音器官状態に対する呼気の状態と、のうちの少なくとも一つを含み、
前記調音的属性値は、前記調音的属性を数値化した値であって、
前記調音器官状態は、舌の高さと、舌の位置と、舌の形状と、舌の動きと、唇の形状と、唇の開き方と、唇の動きと、声門の状態と、声帯の状態と、口蓋垂の状態と、鼻腔の状態と、上下の歯の位置と、顎の状態と、顎の動きと、のうちの少なくとも一つを含む、
発音診断方法。 - 前記閾値を可変する閾値可変工程をさらに備えることを特徴とする請求項6記載の発音診断方法。
- コンピュータに請求項5〜7のいずれか一項に記載の方法を実行させるプログラムを記録した記録媒体。
- コンピュータに請求項5〜7のいずれか一項に記載の方法を実行させるコンピュータプログラム。
Priority Applications (6)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| JP2006147171A JP5120826B2 (ja) | 2005-09-29 | 2006-05-26 | 発音診断装置、発音診断方法、記録媒体、及び、発音診断プログラム |
| PCT/JP2006/319428 WO2007037356A1 (ja) | 2005-09-29 | 2006-09-29 | 発音診断装置、発音診断方法、記録媒体、及び、発音診断プログラム |
| KR1020087008240A KR20080059180A (ko) | 2005-09-29 | 2006-09-29 | 발음진단 장치, 발음진단 방법, 기록 매체, 및, 발음진단프로그램 |
| EP06810834A EP1947643A4 (en) | 2005-09-29 | 2006-09-29 | PRONOUNCIATION DIAGNOSIS DEVICE, PROMISE DIAGNOSTIC PROCEDURE, RECORD MEDIUM, AND PROMISE DIAGNOSTIC PROGRAM |
| US12/088,614 US20090305203A1 (en) | 2005-09-29 | 2006-09-29 | Pronunciation diagnosis device, pronunciation diagnosis method, recording medium, and pronunciation diagnosis program |
| TW095136432A TW200721109A (en) | 2005-09-29 | 2006-09-29 | Pronunciation diagnosis device, pronunciation diagnosis method, recording medium, and pronunciation diagnosis program |
Applications Claiming Priority (3)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| JP2005285217 | 2005-09-29 | ||
| JP2005285217 | 2005-09-29 | ||
| JP2006147171A JP5120826B2 (ja) | 2005-09-29 | 2006-05-26 | 発音診断装置、発音診断方法、記録媒体、及び、発音診断プログラム |
Publications (2)
| Publication Number | Publication Date |
|---|---|
| JP2007122004A JP2007122004A (ja) | 2007-05-17 |
| JP5120826B2 true JP5120826B2 (ja) | 2013-01-16 |
Family
ID=37899777
Family Applications (1)
| Application Number | Title | Priority Date | Filing Date |
|---|---|---|---|
| JP2006147171A Active JP5120826B2 (ja) | 2005-09-29 | 2006-05-26 | 発音診断装置、発音診断方法、記録媒体、及び、発音診断プログラム |
Country Status (6)
| Country | Link |
|---|---|
| US (1) | US20090305203A1 (ja) |
| EP (1) | EP1947643A4 (ja) |
| JP (1) | JP5120826B2 (ja) |
| KR (1) | KR20080059180A (ja) |
| TW (1) | TW200721109A (ja) |
| WO (1) | WO2007037356A1 (ja) |
Families Citing this family (154)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| US8677377B2 (en) | 2005-09-08 | 2014-03-18 | Apple Inc. | Method and apparatus for building an intelligent automated assistant |
| US10002189B2 (en) | 2007-12-20 | 2018-06-19 | Apple Inc. | Method and apparatus for searching using an active ontology |
| US8271281B2 (en) * | 2007-12-28 | 2012-09-18 | Nuance Communications, Inc. | Method for assessing pronunciation abilities |
| US9330720B2 (en) | 2008-01-03 | 2016-05-03 | Apple Inc. | Methods and apparatus for altering audio output signals |
| JP5157488B2 (ja) * | 2008-01-31 | 2013-03-06 | ヤマハ株式会社 | パラメータ設定装置、音響生成装置およびプログラム |
| US20100030549A1 (en) | 2008-07-31 | 2010-02-04 | Lee Michael M | Mobile device having human language translation capability with positional feedback |
| US8676904B2 (en) | 2008-10-02 | 2014-03-18 | Apple Inc. | Electronic devices with voice command and contextual data processing capabilities |
| US8457965B2 (en) * | 2009-10-06 | 2013-06-04 | Rothenberg Enterprises | Method for the correction of measured values of vowel nasalance |
| US10276170B2 (en) | 2010-01-18 | 2019-04-30 | Apple Inc. | Intelligent automated assistant |
| US8682667B2 (en) | 2010-02-25 | 2014-03-25 | Apple Inc. | User profiling for selecting user specific voice input processing information |
| US11062615B1 (en) | 2011-03-01 | 2021-07-13 | Intelligibility Training LLC | Methods and systems for remote language learning in a pandemic-aware world |
| US10019995B1 (en) | 2011-03-01 | 2018-07-10 | Alice J. Stiebel | Methods and systems for language learning based on a series of pitch patterns |
| US9262612B2 (en) | 2011-03-21 | 2016-02-16 | Apple Inc. | Device access using voice authentication |
| US8948892B2 (en) | 2011-03-23 | 2015-02-03 | Audible, Inc. | Managing playback of synchronized content |
| US9734153B2 (en) | 2011-03-23 | 2017-08-15 | Audible, Inc. | Managing related digital content |
| US8855797B2 (en) | 2011-03-23 | 2014-10-07 | Audible, Inc. | Managing playback of synchronized content |
| US9760920B2 (en) | 2011-03-23 | 2017-09-12 | Audible, Inc. | Synchronizing digital content |
| US9703781B2 (en) | 2011-03-23 | 2017-07-11 | Audible, Inc. | Managing related digital content |
| US9706247B2 (en) | 2011-03-23 | 2017-07-11 | Audible, Inc. | Synchronized digital content samples |
| US10057736B2 (en) | 2011-06-03 | 2018-08-21 | Apple Inc. | Active transport based notifications |
| US8805673B1 (en) | 2011-07-14 | 2014-08-12 | Globalenglish Corporation | System and method for sharing region specific pronunciations of phrases |
| US10469623B2 (en) * | 2012-01-26 | 2019-11-05 | ZOOM International a.s. | Phrase labeling within spoken audio recordings |
| US10134385B2 (en) | 2012-03-02 | 2018-11-20 | Apple Inc. | Systems and methods for name pronunciation |
| KR101599030B1 (ko) * | 2012-03-26 | 2016-03-14 | 강진호 | 음성분석기술을 이용한 시각적 영어 발음 교정시스템 및 교정법 |
| US10417037B2 (en) | 2012-05-15 | 2019-09-17 | Apple Inc. | Systems and methods for integrating third party services with a digital assistant |
| US9317500B2 (en) | 2012-05-30 | 2016-04-19 | Audible, Inc. | Synchronizing translated digital content |
| US9721563B2 (en) * | 2012-06-08 | 2017-08-01 | Apple Inc. | Name recognition system |
| US9141257B1 (en) | 2012-06-18 | 2015-09-22 | Audible, Inc. | Selecting and conveying supplemental content |
| US9536439B1 (en) | 2012-06-27 | 2017-01-03 | Audible, Inc. | Conveying questions with content |
| US9679608B2 (en) | 2012-06-28 | 2017-06-13 | Audible, Inc. | Pacing content |
| US10109278B2 (en) | 2012-08-02 | 2018-10-23 | Audible, Inc. | Aligning body matter across content formats |
| US9367196B1 (en) | 2012-09-26 | 2016-06-14 | Audible, Inc. | Conveying branched content |
| US9632647B1 (en) | 2012-10-09 | 2017-04-25 | Audible, Inc. | Selecting presentation positions in dynamic content |
| US9223830B1 (en) | 2012-10-26 | 2015-12-29 | Audible, Inc. | Content presentation analysis |
| FR3000593B1 (fr) * | 2012-12-27 | 2016-05-06 | Lipeo | Procede de communication entre un locuteur et un appareil electronique et appareil electronique associe |
| FR3000592B1 (fr) * | 2012-12-27 | 2016-04-01 | Lipeo | Module de reconnaissance vocale |
| US9280906B2 (en) | 2013-02-04 | 2016-03-08 | Audible. Inc. | Prompting a user for input during a synchronous presentation of audio content and textual content |
| US9472113B1 (en) | 2013-02-05 | 2016-10-18 | Audible, Inc. | Synchronizing playback of digital content with physical content |
| EP2954514B1 (en) | 2013-02-07 | 2021-03-31 | Apple Inc. | Voice trigger for a digital assistant |
| US9076347B2 (en) | 2013-03-14 | 2015-07-07 | Better Accent, LLC | System and methods for improving language pronunciation |
| TWI508033B (zh) * | 2013-04-26 | 2015-11-11 | Wistron Corp | 語言學習方法與裝置以及電腦可讀記錄媒體 |
| US9317486B1 (en) | 2013-06-07 | 2016-04-19 | Audible, Inc. | Synchronizing playback of digital content with captured physical content |
| WO2014197335A1 (en) | 2013-06-08 | 2014-12-11 | Apple Inc. | Interpreting and acting upon commands that involve sharing information with remote devices |
| US10176167B2 (en) | 2013-06-09 | 2019-01-08 | Apple Inc. | System and method for inferring user intent from speech inputs |
| HK1220268A1 (zh) | 2013-06-09 | 2017-04-28 | 苹果公司 | 用於實現跨數字助理的兩個或更多個實例的會話持續性的設備、方法、和圖形用戶界面 |
| KR20150024180A (ko) * | 2013-08-26 | 2015-03-06 | 주식회사 셀리이노베이션스 | 발음 교정 장치 및 방법 |
| US9489360B2 (en) | 2013-09-05 | 2016-11-08 | Audible, Inc. | Identifying extra material in companion content |
| US10296160B2 (en) | 2013-12-06 | 2019-05-21 | Apple Inc. | Method for extracting salient dialog usage from live data |
| JP5805804B2 (ja) * | 2014-02-03 | 2015-11-10 | 山本 一郎 | 構音訓練用録画・録音装置 |
| JP5843894B2 (ja) * | 2014-02-03 | 2016-01-13 | 山本 一郎 | 構音訓練用録画・録音装置 |
| US20150339950A1 (en) * | 2014-05-22 | 2015-11-26 | Keenan A. Wyrobek | System and Method for Obtaining Feedback on Spoken Audio |
| US9715875B2 (en) | 2014-05-30 | 2017-07-25 | Apple Inc. | Reducing the need for manual start/end-pointing and trigger phrases |
| US10170123B2 (en) | 2014-05-30 | 2019-01-01 | Apple Inc. | Intelligent assistant for home automation |
| US9633004B2 (en) | 2014-05-30 | 2017-04-25 | Apple Inc. | Better resolution when referencing to concepts |
| US9966065B2 (en) | 2014-05-30 | 2018-05-08 | Apple Inc. | Multi-command single utterance input method |
| US9430463B2 (en) | 2014-05-30 | 2016-08-30 | Apple Inc. | Exemplar-based natural language processing |
| US9338493B2 (en) | 2014-06-30 | 2016-05-10 | Apple Inc. | Intelligent automated assistant for TV user interactions |
| JP2016045420A (ja) * | 2014-08-25 | 2016-04-04 | カシオ計算機株式会社 | 発音学習支援装置およびプログラム |
| US9818400B2 (en) | 2014-09-11 | 2017-11-14 | Apple Inc. | Method and apparatus for discovering trending terms in speech requests |
| US9668121B2 (en) | 2014-09-30 | 2017-05-30 | Apple Inc. | Social reminders |
| US10074360B2 (en) | 2014-09-30 | 2018-09-11 | Apple Inc. | Providing an indication of the suitability of speech recognition |
| US10127911B2 (en) | 2014-09-30 | 2018-11-13 | Apple Inc. | Speaker identification and unsupervised speaker adaptation techniques |
| KR102278008B1 (ko) * | 2014-12-19 | 2021-07-14 | 박현선 | 사용자 단말기를 이용한 보이스 컨설팅 제공 방법 |
| US10152299B2 (en) | 2015-03-06 | 2018-12-11 | Apple Inc. | Reducing response latency of intelligent automated assistants |
| US9886953B2 (en) | 2015-03-08 | 2018-02-06 | Apple Inc. | Virtual assistant activation |
| US9721566B2 (en) | 2015-03-08 | 2017-08-01 | Apple Inc. | Competing devices responding to voice triggers |
| US10567477B2 (en) | 2015-03-08 | 2020-02-18 | Apple Inc. | Virtual assistant continuity |
| US10460227B2 (en) | 2015-05-15 | 2019-10-29 | Apple Inc. | Virtual assistant in a communication session |
| US10083688B2 (en) | 2015-05-27 | 2018-09-25 | Apple Inc. | Device voice control for selecting a displayed affordance |
| US9578173B2 (en) | 2015-06-05 | 2017-02-21 | Apple Inc. | Virtual assistant aided communication with 3rd party service in a communication session |
| US11025565B2 (en) | 2015-06-07 | 2021-06-01 | Apple Inc. | Personalized prediction of responses for instant messaging |
| US20160378747A1 (en) | 2015-06-29 | 2016-12-29 | Apple Inc. | Virtual assistant for media playback |
| US10956666B2 (en) | 2015-11-09 | 2021-03-23 | Apple Inc. | Unconventional virtual assistant interactions |
| US10049668B2 (en) | 2015-12-02 | 2018-08-14 | Apple Inc. | Applying neural network language models to weighted finite state transducers for automatic speech recognition |
| US10223066B2 (en) | 2015-12-23 | 2019-03-05 | Apple Inc. | Proactive assistance based on dialog communication between devices |
| US11227589B2 (en) | 2016-06-06 | 2022-01-18 | Apple Inc. | Intelligent list reading |
| US10049663B2 (en) | 2016-06-08 | 2018-08-14 | Apple, Inc. | Intelligent automated assistant for media exploration |
| US10586535B2 (en) | 2016-06-10 | 2020-03-10 | Apple Inc. | Intelligent digital assistant in a multi-tasking environment |
| DK201670540A1 (en) | 2016-06-11 | 2018-01-08 | Apple Inc | Application integration with a digital assistant |
| DK179415B1 (en) | 2016-06-11 | 2018-06-14 | Apple Inc | Intelligent device arbitration and control |
| US10474753B2 (en) | 2016-09-07 | 2019-11-12 | Apple Inc. | Language identification using recurrent neural networks |
| US10043516B2 (en) | 2016-09-23 | 2018-08-07 | Apple Inc. | Intelligent automated assistant |
| US11281993B2 (en) | 2016-12-05 | 2022-03-22 | Apple Inc. | Model and ensemble compression for metric learning |
| US11204787B2 (en) | 2017-01-09 | 2021-12-21 | Apple Inc. | Application integration with a digital assistant |
| GB201706078D0 (en) * | 2017-04-18 | 2017-05-31 | Univ Oxford Innovation Ltd | System and method for automatic speech analysis |
| US10417266B2 (en) | 2017-05-09 | 2019-09-17 | Apple Inc. | Context-aware ranking of intelligent response suggestions |
| DK201770383A1 (en) | 2017-05-09 | 2018-12-14 | Apple Inc. | USER INTERFACE FOR CORRECTING RECOGNITION ERRORS |
| US10726832B2 (en) | 2017-05-11 | 2020-07-28 | Apple Inc. | Maintaining privacy of personal information |
| US10395654B2 (en) | 2017-05-11 | 2019-08-27 | Apple Inc. | Text normalization based on a data-driven learning network |
| DK201770439A1 (en) | 2017-05-11 | 2018-12-13 | Apple Inc. | Offline personal assistant |
| DK179745B1 (en) | 2017-05-12 | 2019-05-01 | Apple Inc. | SYNCHRONIZATION AND TASK DELEGATION OF A DIGITAL ASSISTANT |
| DK201770429A1 (en) | 2017-05-12 | 2018-12-14 | Apple Inc. | LOW-LATENCY INTELLIGENT AUTOMATED ASSISTANT |
| US11301477B2 (en) | 2017-05-12 | 2022-04-12 | Apple Inc. | Feedback analysis of a digital assistant |
| DK179496B1 (en) | 2017-05-12 | 2019-01-15 | Apple Inc. | USER-SPECIFIC Acoustic Models |
| DK201770431A1 (en) | 2017-05-15 | 2018-12-20 | Apple Inc. | Optimizing dialogue policy decisions for digital assistants using implicit feedback |
| DK201770432A1 (en) | 2017-05-15 | 2018-12-21 | Apple Inc. | Hierarchical belief states for digital assistants |
| US10403278B2 (en) | 2017-05-16 | 2019-09-03 | Apple Inc. | Methods and systems for phonetic matching in digital assistant services |
| US10303715B2 (en) | 2017-05-16 | 2019-05-28 | Apple Inc. | Intelligent automated assistant for media exploration |
| US10311144B2 (en) | 2017-05-16 | 2019-06-04 | Apple Inc. | Emoji word sense disambiguation |
| DK179549B1 (en) | 2017-05-16 | 2019-02-12 | Apple Inc. | FAR-FIELD EXTENSION FOR DIGITAL ASSISTANT SERVICES |
| US11068659B2 (en) * | 2017-05-23 | 2021-07-20 | Vanderbilt University | System, method and computer program product for determining a decodability index for one or more words |
| US10657328B2 (en) | 2017-06-02 | 2020-05-19 | Apple Inc. | Multi-task recurrent neural network architecture for efficient morphology handling in neural language modeling |
| US10445429B2 (en) | 2017-09-21 | 2019-10-15 | Apple Inc. | Natural language understanding using vocabularies with compressed serialized tries |
| US10755051B2 (en) | 2017-09-29 | 2020-08-25 | Apple Inc. | Rule-based natural language processing |
| US10636424B2 (en) | 2017-11-30 | 2020-04-28 | Apple Inc. | Multi-turn canned dialog |
| US10733982B2 (en) | 2018-01-08 | 2020-08-04 | Apple Inc. | Multi-directional dialog |
| JP6909733B2 (ja) * | 2018-01-26 | 2021-07-28 | 株式会社日立製作所 | 音声分析装置および音声分析方法 |
| US10733375B2 (en) | 2018-01-31 | 2020-08-04 | Apple Inc. | Knowledge-based framework for improving natural language understanding |
| US10789959B2 (en) | 2018-03-02 | 2020-09-29 | Apple Inc. | Training speaker recognition models for digital assistants |
| US10592604B2 (en) | 2018-03-12 | 2020-03-17 | Apple Inc. | Inverse text normalization for automatic speech recognition |
| US10818288B2 (en) | 2018-03-26 | 2020-10-27 | Apple Inc. | Natural assistant interaction |
| US10909331B2 (en) | 2018-03-30 | 2021-02-02 | Apple Inc. | Implicit identification of translation payload with neural machine translation |
| US11145294B2 (en) | 2018-05-07 | 2021-10-12 | Apple Inc. | Intelligent automated assistant for delivering content from user experiences |
| US10928918B2 (en) | 2018-05-07 | 2021-02-23 | Apple Inc. | Raise to speak |
| GB2575423B (en) * | 2018-05-11 | 2022-05-04 | Speech Engineering Ltd | Computer implemented method and apparatus for recognition of speech patterns and feedback |
| US10984780B2 (en) | 2018-05-21 | 2021-04-20 | Apple Inc. | Global semantic word embeddings using bi-directional recurrent neural networks |
| US11386266B2 (en) | 2018-06-01 | 2022-07-12 | Apple Inc. | Text correction |
| DK201870355A1 (en) | 2018-06-01 | 2019-12-16 | Apple Inc. | VIRTUAL ASSISTANT OPERATION IN MULTI-DEVICE ENVIRONMENTS |
| DK180639B1 (en) | 2018-06-01 | 2021-11-04 | Apple Inc | DISABILITY OF ATTENTION-ATTENTIVE VIRTUAL ASSISTANT |
| DK179822B1 (da) | 2018-06-01 | 2019-07-12 | Apple Inc. | Voice interaction at a primary device to access call functionality of a companion device |
| US10892996B2 (en) | 2018-06-01 | 2021-01-12 | Apple Inc. | Variable latency device coordination |
| US10496705B1 (en) | 2018-06-03 | 2019-12-03 | Apple Inc. | Accelerated task performance |
| US11010561B2 (en) | 2018-09-27 | 2021-05-18 | Apple Inc. | Sentiment prediction from textual data |
| US11462215B2 (en) | 2018-09-28 | 2022-10-04 | Apple Inc. | Multi-modal inputs for voice commands |
| US10839159B2 (en) | 2018-09-28 | 2020-11-17 | Apple Inc. | Named entity normalization in a spoken dialog system |
| US11170166B2 (en) | 2018-09-28 | 2021-11-09 | Apple Inc. | Neural typographical error modeling via generative adversarial networks |
| US11475898B2 (en) | 2018-10-26 | 2022-10-18 | Apple Inc. | Low-latency multi-speaker speech recognition |
| US11638059B2 (en) | 2019-01-04 | 2023-04-25 | Apple Inc. | Content playback on multiple devices |
| KR102207812B1 (ko) * | 2019-02-18 | 2021-01-26 | 충북대학교 산학협력단 | 발화 장애인들 및 외국인의 보편적 의사소통을 위한 음성 개선 방법 |
| CN110491382B (zh) * | 2019-03-11 | 2020-12-04 | 腾讯科技(深圳)有限公司 | 基于人工智能的语音识别方法、装置及语音交互设备 |
| US11348573B2 (en) | 2019-03-18 | 2022-05-31 | Apple Inc. | Multimodality in digital assistant systems |
| US11307752B2 (en) | 2019-05-06 | 2022-04-19 | Apple Inc. | User configurable task triggers |
| DK201970509A1 (en) | 2019-05-06 | 2021-01-15 | Apple Inc | Spoken notifications |
| US11475884B2 (en) | 2019-05-06 | 2022-10-18 | Apple Inc. | Reducing digital assistant latency when a language is incorrectly determined |
| US11423908B2 (en) | 2019-05-06 | 2022-08-23 | Apple Inc. | Interpreting spoken requests |
| US11140099B2 (en) | 2019-05-21 | 2021-10-05 | Apple Inc. | Providing message response suggestions |
| US11289073B2 (en) | 2019-05-31 | 2022-03-29 | Apple Inc. | Device text to speech |
| US11496600B2 (en) | 2019-05-31 | 2022-11-08 | Apple Inc. | Remote execution of machine-learned models |
| DK180129B1 (en) | 2019-05-31 | 2020-06-02 | Apple Inc. | User activity shortcut suggestions |
| US11360641B2 (en) | 2019-06-01 | 2022-06-14 | Apple Inc. | Increasing the relevance of new available information |
| KR102121227B1 (ko) * | 2019-07-02 | 2020-06-10 | 경북대학교 산학협력단 | 정상압 수두증의 경과를 확인하기 위한 조음 상태 분류 방법 및 그 시스템 |
| US11410642B2 (en) * | 2019-08-16 | 2022-08-09 | Soundhound, Inc. | Method and system using phoneme embedding |
| JP7131518B2 (ja) * | 2019-09-20 | 2022-09-06 | カシオ計算機株式会社 | 電子機器、発音学習方法、サーバ装置、発音学習処理システムおよびプログラム |
| US11488406B2 (en) | 2019-09-25 | 2022-11-01 | Apple Inc. | Text detection using global geometry estimators |
| CN111047922A (zh) * | 2019-12-27 | 2020-04-21 | 浙江工业大学之江学院 | 一种发音教学方法、装置、系统、计算机设备和存储介质 |
| CN115066716A (zh) * | 2020-02-19 | 2022-09-16 | 松下知识产权经营株式会社 | 口腔功能可视化系统、口腔功能可视化方法及程序 |
| KR102395760B1 (ko) * | 2020-04-22 | 2022-05-10 | 한국외국어대학교 연구산학협력단 | 다중 디바이스의 음성인식 제어를 위한 다채널 보이스 트리거 시스템 및 그 제어 방법 |
| CN111833859B (zh) * | 2020-07-22 | 2024-02-13 | 科大讯飞股份有限公司 | 发音检错方法、装置、电子设备及存储介质 |
| CN112687291B (zh) * | 2020-12-21 | 2023-12-01 | 科大讯飞股份有限公司 | 一种发音缺陷识别模型训练方法以及发音缺陷识别方法 |
| CN113077819B (zh) * | 2021-03-19 | 2024-11-22 | 北京有竹居网络技术有限公司 | 发音评价方法和装置、存储介质和电子设备 |
| US11688106B2 (en) * | 2021-03-29 | 2023-06-27 | International Business Machines Corporation | Graphical adjustment recommendations for vocalization |
| CN113506563A (zh) * | 2021-07-06 | 2021-10-15 | 北京一起教育科技有限责任公司 | 一种发音识别的方法、装置及电子设备 |
| CN115376547B (zh) * | 2022-08-12 | 2024-06-04 | 腾讯科技(深圳)有限公司 | 发音评测方法、装置、计算机设备和存储介质 |
| KR20250067504A (ko) | 2023-11-08 | 2025-05-15 | (주)링글잉글리시에듀케이션서비스 | 발음 진단 및 학습 시스템 및 그 방법 |
Family Cites Families (15)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| US5175793A (en) * | 1989-02-01 | 1992-12-29 | Sharp Kabushiki Kaisha | Recognition apparatus using articulation positions for recognizing a voice |
| US5440661A (en) * | 1990-01-31 | 1995-08-08 | The United States Of America As Represented By The United States Department Of Energy | Time series association learning |
| US5536171A (en) * | 1993-05-28 | 1996-07-16 | Panasonic Technologies, Inc. | Synthesis-based speech training system and method |
| US5340316A (en) * | 1993-05-28 | 1994-08-23 | Panasonic Technologies, Inc. | Synthesis-based speech training system |
| JPH06348297A (ja) * | 1993-06-10 | 1994-12-22 | Osaka Gas Co Ltd | 発音練習装置 |
| JP2908720B2 (ja) * | 1994-04-12 | 1999-06-21 | 松下電器産業株式会社 | 合成を基本とした会話訓練装置及び方法 |
| JP2780639B2 (ja) * | 1994-05-20 | 1998-07-30 | 日本電気株式会社 | 発声訓練装置 |
| JPH08305277A (ja) * | 1995-04-28 | 1996-11-22 | Matsushita Electric Ind Co Ltd | 発声訓練装置 |
| WO1998014934A1 (en) * | 1996-10-02 | 1998-04-09 | Sri International | Method and system for automatic text-independent grading of pronunciation for language instruction |
| AU2998099A (en) * | 1998-03-11 | 1999-09-27 | Entropic, Inc. | Face synthesis system and methodology |
| JP2000242292A (ja) * | 1999-02-19 | 2000-09-08 | Nippon Telegr & Teleph Corp <Ntt> | 音声認識方法、この方法を実施する装置およびこの方法を実行するプログラムを記憶した記憶媒体 |
| JP3520022B2 (ja) * | 2000-01-14 | 2004-04-19 | 株式会社国際電気通信基礎技術研究所 | 外国語学習装置、外国語学習方法および媒体 |
| US6728680B1 (en) * | 2000-11-16 | 2004-04-27 | International Business Machines Corporation | Method and apparatus for providing visual feedback of speed production |
| AU2003283892A1 (en) * | 2002-11-27 | 2004-06-18 | Visual Pronunciation Software Limited | A method, system and software for teaching pronunciation |
| US20070055523A1 (en) * | 2005-08-25 | 2007-03-08 | Yang George L | Pronunciation training system |
-
2006
- 2006-05-26 JP JP2006147171A patent/JP5120826B2/ja active Active
- 2006-09-29 TW TW095136432A patent/TW200721109A/zh unknown
- 2006-09-29 WO PCT/JP2006/319428 patent/WO2007037356A1/ja not_active Ceased
- 2006-09-29 EP EP06810834A patent/EP1947643A4/en not_active Withdrawn
- 2006-09-29 KR KR1020087008240A patent/KR20080059180A/ko not_active Withdrawn
- 2006-09-29 US US12/088,614 patent/US20090305203A1/en not_active Abandoned
Also Published As
| Publication number | Publication date |
|---|---|
| EP1947643A4 (en) | 2009-03-11 |
| JP2007122004A (ja) | 2007-05-17 |
| US20090305203A1 (en) | 2009-12-10 |
| KR20080059180A (ko) | 2008-06-26 |
| EP1947643A1 (en) | 2008-07-23 |
| TW200721109A (en) | 2007-06-01 |
| WO2007037356A1 (ja) | 2007-04-05 |
Similar Documents
| Publication | Publication Date | Title |
|---|---|---|
| JP5120826B2 (ja) | 発音診断装置、発音診断方法、記録媒体、及び、発音診断プログラム | |
| JP3520022B2 (ja) | 外国語学習装置、外国語学習方法および媒体 | |
| JP4114888B2 (ja) | 声質変化箇所特定装置 | |
| Feraru et al. | Cross-language acoustic emotion recognition: An overview and some tendencies | |
| CN101105939B (zh) | 发音指导方法 | |
| Arora et al. | Phonological feature-based speech recognition system for pronunciation training in non-native language learning | |
| AU2003300130A1 (en) | Speech recognition method | |
| JP5148026B1 (ja) | 音声合成装置および音声合成方法 | |
| CN102376182A (zh) | 语言学习系统、语言学习方法及其程序产品 | |
| Proença et al. | Automatic evaluation of reading aloud performance in children | |
| Nance et al. | Phonetic typology and articulatory constraints: The realization of secondary articulations in Scottish Gaelic rhotics | |
| Amrouche et al. | Balanced Arabic corpus design for speech synthesis | |
| KR102333029B1 (ko) | 발음 평가 방법 및 이를 이용한 디바이스 | |
| Iriondo et al. | Automatic refinement of an expressive speech corpus assembling subjective perception and automatic classification | |
| CN101292281A (zh) | 发音诊断装置、发音诊断方法、存储媒介、以及发音诊断程序 | |
| WO2006034569A1 (en) | A speech training system and method for comparing utterances to baseline speech | |
| Park | Human and Machine Judgment of Non-Native Speakers’ Speech Proficiency | |
| KR20260053610A (ko) | 목소리 교정을 위한 스피치분석시스템 및 스피치분석시스템의 동작 방법 | |
| Prinsloo | A comparative acoustic analysis of the long vowels and diphthongs of Afrikaans and South African English | |
| Penney et al. | Electroglottographic analysis of coda voicelessness in Australian English | |
| GAOL | Students‟ Ability in Pronouncing English Words by Using ELSA Speak Application of the Second-Year Students of SMA Eka Prasetya Medan | |
| Rosdi | Fuzzy Petri Nets as a Classification Method for Automatic Speech Intelligibility Detection of Children with Speech Impairments | |
| CN121545552A (zh) | 汉语连续语流中语音变调自动检测方法及其系统 | |
| MUSTIKA | ANALYSIS OF A GLOTTAL STOP ALLOPHONE [ʔ] OF THE PHONEME/p/,/t/,/k/IN THE THREE ERNEST HEMINGWAY’S POETRIES | |
| Lennon | Experience and learning in cross-dialect perception: Derhoticised/r/in Glasgow |
Legal Events
| Date | Code | Title | Description |
|---|---|---|---|
| A621 | Written request for application examination |
Free format text: JAPANESE INTERMEDIATE CODE: A621 Effective date: 20090428 |
|
| A521 | Request for written amendment filed |
Free format text: JAPANESE INTERMEDIATE CODE: A523 Effective date: 20090518 |
|
| A521 | Request for written amendment filed |
Free format text: JAPANESE INTERMEDIATE CODE: A523 Effective date: 20090813 |
|
| A131 | Notification of reasons for refusal |
Free format text: JAPANESE INTERMEDIATE CODE: A131 Effective date: 20110927 |
|
| A521 | Request for written amendment filed |
Free format text: JAPANESE INTERMEDIATE CODE: A523 Effective date: 20111128 |
|
| A02 | Decision of refusal |
Free format text: JAPANESE INTERMEDIATE CODE: A02 Effective date: 20120327 |
|
| A521 | Request for written amendment filed |
Free format text: JAPANESE INTERMEDIATE CODE: A523 Effective date: 20120626 |
|
| A521 | Request for written amendment filed |
Free format text: JAPANESE INTERMEDIATE CODE: A523 Effective date: 20120702 |
|
| A521 | Request for written amendment filed |
Free format text: JAPANESE INTERMEDIATE CODE: A523 Effective date: 20120807 |
|
| A911 | Transfer to examiner for re-examination before appeal (zenchi) |
Free format text: JAPANESE INTERMEDIATE CODE: A911 Effective date: 20120828 |
|
| TRDD | Decision of grant or rejection written | ||
| A01 | Written decision to grant a patent or to grant a registration (utility model) |
Free format text: JAPANESE INTERMEDIATE CODE: A01 Effective date: 20121002 |
|
| A01 | Written decision to grant a patent or to grant a registration (utility model) |
Free format text: JAPANESE INTERMEDIATE CODE: A01 |
|
| A61 | First payment of annual fees (during grant procedure) |
Free format text: JAPANESE INTERMEDIATE CODE: A61 Effective date: 20121016 |
|
| FPAY | Renewal fee payment (event date is renewal date of database) |
Free format text: PAYMENT UNTIL: 20151102 Year of fee payment: 3 |
|
| R150 | Certificate of patent or registration of utility model |
Free format text: JAPANESE INTERMEDIATE CODE: R150 Ref document number: 5120826 Country of ref document: JP Free format text: JAPANESE INTERMEDIATE CODE: R150 |
|
| S533 | Written request for registration of change of name |
Free format text: JAPANESE INTERMEDIATE CODE: R313533 |
|
| R350 | Written notification of registration of transfer |
Free format text: JAPANESE INTERMEDIATE CODE: R350 |
|
| R250 | Receipt of annual fees |
Free format text: JAPANESE INTERMEDIATE CODE: R250 |
|
| R250 | Receipt of annual fees |
Free format text: JAPANESE INTERMEDIATE CODE: R250 |
|
| R250 | Receipt of annual fees |
Free format text: JAPANESE INTERMEDIATE CODE: R250 |
|
| R250 | Receipt of annual fees |
Free format text: JAPANESE INTERMEDIATE CODE: R250 |
|
| R250 | Receipt of annual fees |
Free format text: JAPANESE INTERMEDIATE CODE: R250 |
|
| R250 | Receipt of annual fees |
Free format text: JAPANESE INTERMEDIATE CODE: R250 |
|
| R250 | Receipt of annual fees |
Free format text: JAPANESE INTERMEDIATE CODE: R250 |