PT3371808T - Sistema e método de processamento de fala - Google Patents

Sistema e método de processamento de fala

Info

Publication number
PT3371808T
PT3371808T PT168012839T PT16801283T PT3371808T PT 3371808 T PT3371808 T PT 3371808T PT 168012839 T PT168012839 T PT 168012839T PT 16801283 T PT16801283 T PT 16801283T PT 3371808 T PT3371808 T PT 3371808T
Authority
PT
Portugal
Prior art keywords
processing system
speech processing
speech
processing
Prior art date
Application number
PT168012839T
Other languages
English (en)
Original Assignee
The Chancellor Masters And Scholars Of The Univ Of Cambridge
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by The Chancellor Masters And Scholars Of The Univ Of Cambridge filed Critical The Chancellor Masters And Scholars Of The Univ Of Cambridge
Publication of PT3371808T publication Critical patent/PT3371808T/pt

Links

Classifications

    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00Speech recognition
    • G10L15/08Speech classification or search
    • G10L15/14Speech classification or search using statistical models, e.g. Hidden Markov Models [HMMs]
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00Speech recognition
    • G10L15/08Speech classification or search
    • G10L15/18Speech classification or search using natural language modelling
    • G10L15/183Speech classification or search using natural language modelling using context dependencies, e.g. language models
    • G10L15/19Grammatical context, e.g. disambiguation of the recognition hypotheses based on word sequence rules
    • G10L15/193Formal grammars, e.g. finite state automata, context free grammars or word networks
    • GPHYSICS
    • G09EDUCATION; CRYPTOGRAPHY; DISPLAY; ADVERTISING; SEALS
    • G09BEDUCATIONAL OR DEMONSTRATION APPLIANCES; APPLIANCES FOR TEACHING, OR COMMUNICATING WITH, THE BLIND, DEAF OR MUTE; MODELS; PLANETARIA; GLOBES; MAPS; DIAGRAMS
    • G09B19/00Teaching not covered by other main groups of this subclass
    • G09B19/06Foreign languages
    • GPHYSICS
    • G09EDUCATION; CRYPTOGRAPHY; DISPLAY; ADVERTISING; SEALS
    • G09BEDUCATIONAL OR DEMONSTRATION APPLIANCES; APPLIANCES FOR TEACHING, OR COMMUNICATING WITH, THE BLIND, DEAF OR MUTE; MODELS; PLANETARIA; GLOBES; MAPS; DIAGRAMS
    • G09B5/00Electrically-operated educational appliances
    • G09B5/04Electrically-operated educational appliances with audible presentation of the material to be studied
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00Speech recognition
    • G10L15/08Speech classification or search
    • G10L15/18Speech classification or search using natural language modelling
    • G10L15/183Speech classification or search using natural language modelling using context dependencies, e.g. language models
    • G10L15/187Phonemic context, e.g. pronunciation rules, phonotactical constraints or phoneme n-grams
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L21/00Speech or voice signal processing techniques to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
    • G10L21/06Transformation of speech into a non-audible representation, e.g. speech visualisation or speech processing for tactile aids
    • G10L21/10Transforming into visible information
    • G10L21/12Transforming into visible information by displaying time domain information
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L25/00Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
    • G10L25/48Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 specially adapted for particular use
    • G10L25/51Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 specially adapted for particular use for comparison or discrimination
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L25/00Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
    • G10L25/78Detection of presence or absence of voice signals
    • G10L25/87Detection of discrete points within a voice signal
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L25/00Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
    • G10L25/78Detection of presence or absence of voice signals

Landscapes

  • Engineering & Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • Multimedia (AREA)
  • Acoustics & Sound (AREA)
  • Health & Medical Sciences (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Human Computer Interaction (AREA)
  • Computational Linguistics (AREA)
  • Business, Economics & Management (AREA)
  • Artificial Intelligence (AREA)
  • Signal Processing (AREA)
  • Educational Administration (AREA)
  • Educational Technology (AREA)
  • General Physics & Mathematics (AREA)
  • Theoretical Computer Science (AREA)
  • Entrepreneurship & Innovation (AREA)
  • Data Mining & Analysis (AREA)
  • Quality & Reliability (AREA)
  • Probability & Statistics with Applications (AREA)
  • Machine Translation (AREA)
PT168012839T 2015-11-04 2016-11-04 Sistema e método de processamento de fala PT3371808T (pt)

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
GB1519494.7A GB2544070B (en) 2015-11-04 2015-11-04 Speech processing system and method

Publications (1)

Publication Number Publication Date
PT3371808T true PT3371808T (pt) 2020-06-01

Family

ID=55130676

Family Applications (1)

Application Number Title Priority Date Filing Date
PT168012839T PT3371808T (pt) 2015-11-04 2016-11-04 Sistema e método de processamento de fala

Country Status (7)

Country Link
US (2) US10783880B2 (pt)
EP (1) EP3371808B8 (pt)
CN (1) CN108496219B (pt)
ES (1) ES2794573T3 (pt)
GB (1) GB2544070B (pt)
PT (1) PT3371808T (pt)
WO (1) WO2017077330A1 (pt)

Families Citing this family (36)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JP6446993B2 (ja) * 2014-10-20 2019-01-09 ヤマハ株式会社 音声制御装置およびプログラム
US11020624B2 (en) * 2016-04-19 2021-06-01 KFT Fire Trainer, LLC Fire simulator
US20180061260A1 (en) * 2016-08-31 2018-03-01 International Business Machines Corporation Automated language learning
CN107507628B (zh) * 2017-08-31 2021-01-15 广州酷狗计算机科技有限公司 唱歌评分方法、装置及终端
CN107678561A (zh) * 2017-09-29 2018-02-09 百度在线网络技术(北京)有限公司 基于人工智能的语音输入纠错方法及装置
GB2568902B (en) * 2017-11-29 2020-09-09 Auris Tech Ltd System for speech evaluation
US11538455B2 (en) * 2018-02-16 2022-12-27 Dolby Laboratories Licensing Corporation Speech style transfer
EP3544001B8 (en) * 2018-03-23 2022-01-12 Articulate.XYZ Ltd Processing speech-to-text transcriptions
CN109036464B (zh) * 2018-09-17 2022-02-22 腾讯科技(深圳)有限公司 发音检错方法、装置、设备及存储介质
US11410641B2 (en) * 2018-11-28 2022-08-09 Google Llc Training and/or using a language selection model for automatically determining language for speech recognition of spoken utterance
CN109326277B (zh) * 2018-12-05 2022-02-08 四川长虹电器股份有限公司 半监督的音素强制对齐模型建立方法及系统
WO2020122293A1 (ko) * 2018-12-14 2020-06-18 엘지전자 주식회사 세탁 스케쥴링 장치
EP3788620B1 (en) 2018-12-28 2023-09-06 Google LLC Supplementing voice inputs to an automated assistant according to selected suggestions
JP7332132B2 (ja) * 2019-03-28 2023-08-23 国立研究開発法人情報通信研究機構 言語識別装置及びそのためのコンピュータプログラム
CN109979484B (zh) * 2019-04-03 2021-06-08 北京儒博科技有限公司 发音检错方法、装置、电子设备及存储介质
JP7131518B2 (ja) * 2019-09-20 2022-09-06 カシオ計算機株式会社 電子機器、発音学習方法、サーバ装置、発音学習処理システムおよびプログラム
US11341331B2 (en) * 2019-10-04 2022-05-24 Microsoft Technology Licensing, Llc Speaking technique improvement assistant
CN110728994B (zh) * 2019-12-19 2020-05-05 北京海天瑞声科技股份有限公司 语音库的语音获取方法、装置、电子设备及存储介质
CN111105813B (zh) * 2019-12-31 2022-09-02 科大讯飞股份有限公司 朗读评分方法、装置、设备及可读存储介质
KR102862266B1 (ko) * 2020-05-07 2025-09-19 구글 엘엘씨 종단 간 모델로 단어 타이밍 방출
WO2022003104A1 (en) * 2020-07-01 2022-01-06 Iliescu Alexandru System and method for interactive and handsfree language learning
KR102739457B1 (ko) * 2020-09-08 2024-12-09 한국전자통신연구원 외국어 학습자의 외국어 문장 평가에 기반한 외국어 교육 제공 장치 및 방법
CN112133277B (zh) * 2020-11-20 2021-02-26 北京猿力未来科技有限公司 样本生成方法及装置
CN112542159B (zh) * 2020-12-01 2024-04-09 腾讯音乐娱乐科技(深圳)有限公司 一种数据处理方法以及设备
US11610581B2 (en) * 2021-02-05 2023-03-21 International Business Machines Corporation Multi-step linear interpolation of language models
CN112951274A (zh) * 2021-02-07 2021-06-11 脸萌有限公司 语音相似度确定方法及设备、程序产品
EP4248441A4 (en) * 2021-03-25 2024-07-10 Samsung Electronics Co., Ltd. SPEECH RECOGNITION METHOD, DEVICE, ELECTRONIC DEVICE AND COMPUTER-READABLE STORAGE MEDIUM
US12046147B2 (en) * 2021-04-21 2024-07-23 Lumos Information Services LLC System and method for analysing an audio to measure oral reading fluency
CN115346421A (zh) * 2021-05-12 2022-11-15 北京猿力未来科技有限公司 一种口语流利度评分方法、计算设备及存储介质
CN117083669A (zh) 2021-05-28 2023-11-17 微软技术许可有限责任公司 检测和改进单词实时误读的方法和系统
CN113314124B (zh) * 2021-06-15 2022-03-25 宿迁硅基智能科技有限公司 文本输出方法及系统、存储介质、电子装置
US12361936B2 (en) 2021-08-24 2025-07-15 Microsoft Technology Licensing, Llc Method and system of automated question generation for speech assistance
US12367867B2 (en) * 2021-10-26 2025-07-22 Samsung Electronics Co., Ltd. System for generating voice in an ongoing call session based on artificial intelligent techniques
CN114625857B (zh) * 2022-03-23 2023-08-25 南京硅基智能科技有限公司 一种提词器及英文文本跟踪方法、存储介质、电子设备
WO2025121794A1 (ko) * 2023-12-05 2025-06-12 삼성전자주식회사 프롬프팅을 위한 전자 장치 및 이의 동작 방법
CN119479702B (zh) * 2025-01-08 2025-04-25 成都佳发安泰教育科技股份有限公司 发音评分方法、装置、电子设备和存储介质

Family Cites Families (32)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US4805219A (en) * 1987-04-03 1989-02-14 Dragon Systems, Inc. Method for speech recognition
JP3008799B2 (ja) * 1995-01-26 2000-02-14 日本電気株式会社 音声適応化装置,単語音声認識装置,連続音声認識装置およびワードスポッティング装置
US7062441B1 (en) * 1999-05-13 2006-06-13 Ordinate Corporation Automated language assessment using speech recognition modeling
US6389394B1 (en) 2000-02-09 2002-05-14 Speechworks International, Inc. Method and apparatus for improved speech recognition by modifying a pronunciation dictionary based on pattern definitions of alternate word pronunciations
GB0011798D0 (en) * 2000-05-16 2000-07-05 Canon Kk Database annotation and retrieval
GB0027178D0 (en) * 2000-11-07 2000-12-27 Canon Kk Speech processing system
GB0028277D0 (en) * 2000-11-20 2001-01-03 Canon Kk Speech processing system
US7668718B2 (en) * 2001-07-17 2010-02-23 Custom Speech Usa, Inc. Synchronized pattern recognition source data processed by manual or automatic means for creation of shared speaker-dependent speech user profile
US7219059B2 (en) * 2002-07-03 2007-05-15 Lucent Technologies Inc. Automatic pronunciation scoring for language learning
US7299188B2 (en) * 2002-07-03 2007-11-20 Lucent Technologies Inc. Method and apparatus for providing an interactive language tutor
KR100495667B1 (ko) * 2003-01-13 2005-06-16 삼성전자주식회사 아날로그/디지털 입력 모드를 제공하는 입출력 버퍼
US20040230431A1 (en) * 2003-05-14 2004-11-18 Gupta Sunil K. Automatic assessment of phonological processes for speech therapy and language instruction
US7302389B2 (en) * 2003-05-14 2007-11-27 Lucent Technologies Inc. Automatic assessment of phonological processes
US7590533B2 (en) * 2004-03-10 2009-09-15 Microsoft Corporation New-word pronunciation learning using a pronunciation graph
US8221126B2 (en) * 2004-11-22 2012-07-17 Bravobrava L.L.C. System and method for performing programmatic language learning tests and evaluations
JP4156639B2 (ja) * 2006-08-14 2008-09-24 インターナショナル・ビジネス・マシーンズ・コーポレーション 音声インターフェースの設計を支援するための装置、方法、プログラム
US8972268B2 (en) * 2008-04-15 2015-03-03 Facebook, Inc. Enhanced speech-to-speech translation system and methods for adding a new word
US8226416B2 (en) * 2006-12-08 2012-07-24 Sri International Method and apparatus for reading education
US8457959B2 (en) * 2007-03-01 2013-06-04 Edward C. Kaiser Systems and methods for implicitly interpreting semantically redundant communication modes
US8392190B2 (en) * 2008-12-01 2013-03-05 Educational Testing Service Systems and methods for assessment of non-native spontaneous speech
US8073693B2 (en) 2008-12-04 2011-12-06 At&T Intellectual Property I, L.P. System and method for pronunciation modeling
CN101826263B (zh) * 2009-03-04 2012-01-04 中国科学院自动化研究所 基于客观标准的自动化口语评估系统
US8972253B2 (en) * 2010-09-15 2015-03-03 Microsoft Technology Licensing, Llc Deep belief network for large vocabulary continuous speech recognition
US8880399B2 (en) * 2010-09-27 2014-11-04 Rosetta Stone, Ltd. Utterance verification and pronunciation scoring by lattice transduction
US9123339B1 (en) * 2010-11-23 2015-09-01 Google Inc. Speech recognition using repeated utterances
KR101780760B1 (ko) * 2011-06-30 2017-10-10 구글 인코포레이티드 가변길이 문맥을 이용한 음성인식
TWI484475B (zh) * 2012-06-05 2015-05-11 Quanta Comp Inc 文字顯示方法與語音轉文字裝置以及電腦程式產品
WO2014039828A2 (en) * 2012-09-06 2014-03-13 Simmons Aaron M A method and system for reading fluency training
US9336771B2 (en) * 2012-11-01 2016-05-10 Google Inc. Speech recognition using non-parametric models
US9449522B2 (en) * 2012-11-16 2016-09-20 Educational Testing Service Systems and methods for evaluating difficulty of spoken text
CN103065626B (zh) * 2012-12-20 2015-03-11 中国科学院声学研究所 英语口语考试系统中的朗读题自动评分方法和设备
US9437189B2 (en) * 2014-05-29 2016-09-06 Google Inc. Generating language models

Also Published As

Publication number Publication date
US20180315420A1 (en) 2018-11-01
CN108496219B (zh) 2022-12-30
US12380882B2 (en) 2025-08-05
US20200320987A1 (en) 2020-10-08
EP3371808A1 (en) 2018-09-12
US10783880B2 (en) 2020-09-22
CN108496219A (zh) 2018-09-04
GB201519494D0 (en) 2015-12-16
EP3371808B8 (en) 2020-04-08
WO2017077330A1 (en) 2017-05-11
GB2544070B (en) 2021-12-29
EP3371808B1 (en) 2020-02-26
GB2544070A (en) 2017-05-10
ES2794573T3 (es) 2020-11-18

Similar Documents

Publication Publication Date Title
GB2544070B (en) Speech processing system and method
ZA201900536B (en) Blockchain-implemented method and system
ZA201900509B (en) Blockchain-implemented method and system
ZA201900535B (en) Blockchain implemented method and system
GB201719944D0 (en) Parking-lot-navigation system and method
GB201712642D0 (en) Order processing method and system
EP3306607A4 (en) METHOD AND SYSTEM FOR KARAOKE PROCESSING
PL3320404T3 (pl) System i sposób obróbki powietrza
SG11201801808RA (en) Audio recognition method and system
GB2551499B (en) A speech processing system and speech processing method
GB201510480D0 (en) System and method
GB201601134D0 (en) System and process
GB2542548B (en) System and method
GB2536729B (en) A speech processing system and speech processing method
SG11201709848SA (en) Transaction processing method and system
GB201803529D0 (en) Radio-station-recommendation system and method
GB201604012D0 (en) Refridgeration system and method
ZA201901386B (en) System and process
GB201515115D0 (en) System and method
GB201620926D0 (en) Method and system
FI20165381A7 (fi) Menetelmä ja järjestelmä
GB2549103B (en) A speech processing system and speech processing method
GB2537924B (en) A Speech Processing System and Method
GB2537923B (en) A speech processing system and speech processing method
ZA201702361B (en) Processing system and method