PL399698A1 - Sposób doboru zlozonosci dyskretnego modelu akustycznego w systemie automatycznego rozpoznawania mowy - Google Patents

Sposób doboru zlozonosci dyskretnego modelu akustycznego w systemie automatycznego rozpoznawania mowy

Info

Publication number
PL399698A1
PL399698A1 PL399698A PL39969812A PL399698A1 PL 399698 A1 PL399698 A1 PL 399698A1 PL 399698 A PL399698 A PL 399698A PL 39969812 A PL39969812 A PL 39969812A PL 399698 A1 PL399698 A1 PL 399698A1
Authority
PL
Poland
Prior art keywords
acoustic model
complexity
discrete acoustic
selecting
recognition system
Prior art date
Application number
PL399698A
Other languages
English (en)
Inventor
Marcin Kuropatwinski
Original Assignee
Voice Lab Spólka Z Ograniczona Odpowiedzialnoscia
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Voice Lab Spólka Z Ograniczona Odpowiedzialnoscia filed Critical Voice Lab Spólka Z Ograniczona Odpowiedzialnoscia
Priority to PL399698A priority Critical patent/PL399698A1/pl
Priority to US13/567,963 priority patent/US20140006021A1/en
Publication of PL399698A1 publication Critical patent/PL399698A1/pl

Links

Classifications

    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00Speech recognition
    • G10L15/06Creation of reference templates; Training of speech recognition systems, e.g. adaptation to the characteristics of the speaker's voice
    • G10L15/063Training
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00Speech recognition
    • G10L15/06Creation of reference templates; Training of speech recognition systems, e.g. adaptation to the characteristics of the speaker's voice
    • G10L15/063Training
    • G10L2015/0631Creating reference templates; Clustering
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00Speech recognition
    • G10L15/08Speech classification or search
    • G10L2015/085Methods for reducing search complexity, pruning

Landscapes

  • Engineering & Computer Science (AREA)
  • Artificial Intelligence (AREA)
  • Computational Linguistics (AREA)
  • Health & Medical Sciences (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Human Computer Interaction (AREA)
  • Physics & Mathematics (AREA)
  • Acoustics & Sound (AREA)
  • Multimedia (AREA)
  • Compression, Expansion, Code Conversion, And Decoders (AREA)
  • Machine Translation (AREA)

Abstract

Wynalazek dotyczy sposobu doboru zlozonosci dyskretnego modelu akustycznego w systemie automatycznego rozpoznawania mowy, obejmujacym dyskretny model akustyczny, slownik wymowy i opcjonalnie model jezyka badz gramatyke, gdzie przy zadanej bazie danych mowy, obejmujacej wiele par, skladajacych sie z nagrania mowy zwanego przebiegiem czasowym sygnalu mowy i transkrypcji ortograficznej przebiegu czasowego, konstruuje sie modele akustyczne, poprzez: konwersje zapisu ortograficznego na fonetyczny, parametryzacje przebiegów czasowych poprzez obliczanie wektorów cech i normalizacje ciagów wektorów cech i charakteryzuje sie tym, ze zlozonosc Pl dyskretnego modelu akustycznego ustawia sie wedlug procedury, przy zalozonym wspólczynniku generalizacji N.
PL399698A 2012-06-27 2012-06-27 Sposób doboru zlozonosci dyskretnego modelu akustycznego w systemie automatycznego rozpoznawania mowy PL399698A1 (pl)

Priority Applications (2)

Application Number Priority Date Filing Date Title
PL399698A PL399698A1 (pl) 2012-06-27 2012-06-27 Sposób doboru zlozonosci dyskretnego modelu akustycznego w systemie automatycznego rozpoznawania mowy
US13/567,963 US20140006021A1 (en) 2012-06-27 2012-08-06 Method for adjusting discrete model complexity in an automatic speech recognition system

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
PL399698A PL399698A1 (pl) 2012-06-27 2012-06-27 Sposób doboru zlozonosci dyskretnego modelu akustycznego w systemie automatycznego rozpoznawania mowy

Publications (1)

Publication Number Publication Date
PL399698A1 true PL399698A1 (pl) 2014-01-07

Family

ID=49779004

Family Applications (1)

Application Number Title Priority Date Filing Date
PL399698A PL399698A1 (pl) 2012-06-27 2012-06-27 Sposób doboru zlozonosci dyskretnego modelu akustycznego w systemie automatycznego rozpoznawania mowy

Country Status (2)

Country Link
US (1) US20140006021A1 (pl)
PL (1) PL399698A1 (pl)

Families Citing this family (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN113300890B (zh) * 2021-05-24 2022-06-14 同济大学 一种网络化机器学习系统的自适应通信方法
CN115050355B (zh) * 2022-05-31 2024-07-16 北京小米移动软件有限公司 语音识别模型的训练方法和装置、电子设备和存储介质
CN116052682B (zh) * 2023-02-24 2026-03-24 阳光保险集团股份有限公司 一种方言语音转换的方法、装置、设备及介质

Family Cites Families (12)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US5535305A (en) * 1992-12-31 1996-07-09 Apple Computer, Inc. Sub-partitioned vector quantization of probability density functions
US5794197A (en) * 1994-01-21 1998-08-11 Micrsoft Corporation Senone tree representation and evaluation
JP2690027B2 (ja) * 1994-10-05 1997-12-10 株式会社エイ・ティ・アール音声翻訳通信研究所 パターン認識方法及び装置
US5806030A (en) * 1996-05-06 1998-09-08 Matsushita Electric Industrial Co Ltd Low complexity, high accuracy clustering method for speech recognizer
US6895375B2 (en) * 2001-10-04 2005-05-17 At&T Corp. System for bandwidth extension of Narrow-band speech
US20040006470A1 (en) * 2002-07-03 2004-01-08 Pioneer Corporation Word-spotting apparatus, word-spotting method, and word-spotting program
US8214213B1 (en) * 2006-04-27 2012-07-03 At&T Intellectual Property Ii, L.P. Speech recognition based on pronunciation modeling
US7617103B2 (en) * 2006-08-25 2009-11-10 Microsoft Corporation Incrementally regulated discriminative margins in MCE training for speech recognition
US8423364B2 (en) * 2007-02-20 2013-04-16 Microsoft Corporation Generic framework for large-margin MCE training in speech recognition
US8200797B2 (en) * 2007-11-16 2012-06-12 Nec Laboratories America, Inc. Systems and methods for automatic profiling of network event sequences
RU2409897C1 (ru) * 2009-05-18 2011-01-20 Самсунг Электроникс Ко., Лтд Кодер, передающее устройство, система передачи и способ кодирования информационных объектов
KR20120045582A (ko) * 2010-10-29 2012-05-09 한국전자통신연구원 음향 모델 생성 장치 및 방법

Also Published As

Publication number Publication date
US20140006021A1 (en) 2014-01-02

Similar Documents

Publication Publication Date Title
WO2015009586A3 (en) Performing an operation relative to tabular data based upon voice input
WO2014197334A3 (en) System and method for user-specified pronunciation of words for speech synthesis and recognition
EP4700764A3 (en) Joint audio-video facial animation system
EP4235647A3 (en) Determining dialog states for language models
WO2020117639A3 (en) Text independent speaker recognition
WO2009025356A1 (ja) 音声認識装置および音声認識方法
WO2015057907A3 (en) System and method for learning alternate pronunciations for speech recognition
WO2014124332A3 (en) Voice trigger for a digital assistant
SG11201912053XA (en) Automatically determining language for speech recognition of spoken utterance received via an automated assistant interface
MX2015009812A (es) Metodo y sistema para el reconicimiento de comandos de voz.
PH12014500482A1 (en) Systems and methods for language learning
WO2014005142A3 (en) Modeling l1-specific phonological errors
GB2552623A (en) Systems and methods for automated evaluation of human speech
MX2017001121A (es) Reconocimiento del habla en base a acustica y a dominio para vehiculos.
TW200638337A (en) Using a spoken utterance for disambiguation of spelling inputs into a speech recognition system
EP4235649A3 (en) Language model biasing
WO2016139670A8 (en) System and method for generating accurate speech transcription from natural speech audio signals
WO2008087934A1 (ja) 拡張認識辞書学習装置と音声認識システム
ATE457510T1 (de) Spracherkennungssystem mit riesigem vokabular
ATE531031T1 (de) Segmentbasierte tonale modellierung für tonale sprachen
GB2486038B (en) Speech-to-text conversion
EP4224467C0 (en) TRAINING A SPEECH SYNTHESIS MODEL FOR A SPECIFIC SPEAKER'S VOICE BASED ON A PRE-TRAINED MODEL
EP4529677A4 (en) SYSTEM FOR PROVIDING NATURAL SPEECH BY A VOICE ASSISTANT AND ASSOCIATED METHOD
PL399698A1 (pl) Sposób doboru zlozonosci dyskretnego modelu akustycznego w systemie automatycznego rozpoznawania mowy
MX2015014413A (es) Simulacion de respuesta al impulso acustico.