PL399698A1 - Sposób doboru zlozonosci dyskretnego modelu akustycznego w systemie automatycznego rozpoznawania mowy - Google Patents
Sposób doboru zlozonosci dyskretnego modelu akustycznego w systemie automatycznego rozpoznawania mowyInfo
- Publication number
- PL399698A1 PL399698A1 PL399698A PL39969812A PL399698A1 PL 399698 A1 PL399698 A1 PL 399698A1 PL 399698 A PL399698 A PL 399698A PL 39969812 A PL39969812 A PL 39969812A PL 399698 A1 PL399698 A1 PL 399698A1
- Authority
- PL
- Poland
- Prior art keywords
- acoustic model
- complexity
- discrete acoustic
- selecting
- recognition system
- Prior art date
Links
- 238000000034 method Methods 0.000 title abstract 3
- 239000013598 vector Substances 0.000 abstract 2
- 238000010606 normalization Methods 0.000 abstract 1
- 238000013518 transcription Methods 0.000 abstract 1
- 230000035897 transcription Effects 0.000 abstract 1
Classifications
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/06—Creation of reference templates; Training of speech recognition systems, e.g. adaptation to the characteristics of the speaker's voice
- G10L15/063—Training
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/06—Creation of reference templates; Training of speech recognition systems, e.g. adaptation to the characteristics of the speaker's voice
- G10L15/063—Training
- G10L2015/0631—Creating reference templates; Clustering
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/08—Speech classification or search
- G10L2015/085—Methods for reducing search complexity, pruning
Landscapes
- Engineering & Computer Science (AREA)
- Artificial Intelligence (AREA)
- Computational Linguistics (AREA)
- Health & Medical Sciences (AREA)
- Audiology, Speech & Language Pathology (AREA)
- Human Computer Interaction (AREA)
- Physics & Mathematics (AREA)
- Acoustics & Sound (AREA)
- Multimedia (AREA)
- Compression, Expansion, Code Conversion, And Decoders (AREA)
- Machine Translation (AREA)
Abstract
Wynalazek dotyczy sposobu doboru zlozonosci dyskretnego modelu akustycznego w systemie automatycznego rozpoznawania mowy, obejmujacym dyskretny model akustyczny, slownik wymowy i opcjonalnie model jezyka badz gramatyke, gdzie przy zadanej bazie danych mowy, obejmujacej wiele par, skladajacych sie z nagrania mowy zwanego przebiegiem czasowym sygnalu mowy i transkrypcji ortograficznej przebiegu czasowego, konstruuje sie modele akustyczne, poprzez: konwersje zapisu ortograficznego na fonetyczny, parametryzacje przebiegów czasowych poprzez obliczanie wektorów cech i normalizacje ciagów wektorów cech i charakteryzuje sie tym, ze zlozonosc Pl dyskretnego modelu akustycznego ustawia sie wedlug procedury, przy zalozonym wspólczynniku generalizacji N.
Priority Applications (2)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| PL399698A PL399698A1 (pl) | 2012-06-27 | 2012-06-27 | Sposób doboru zlozonosci dyskretnego modelu akustycznego w systemie automatycznego rozpoznawania mowy |
| US13/567,963 US20140006021A1 (en) | 2012-06-27 | 2012-08-06 | Method for adjusting discrete model complexity in an automatic speech recognition system |
Applications Claiming Priority (1)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| PL399698A PL399698A1 (pl) | 2012-06-27 | 2012-06-27 | Sposób doboru zlozonosci dyskretnego modelu akustycznego w systemie automatycznego rozpoznawania mowy |
Publications (1)
| Publication Number | Publication Date |
|---|---|
| PL399698A1 true PL399698A1 (pl) | 2014-01-07 |
Family
ID=49779004
Family Applications (1)
| Application Number | Title | Priority Date | Filing Date |
|---|---|---|---|
| PL399698A PL399698A1 (pl) | 2012-06-27 | 2012-06-27 | Sposób doboru zlozonosci dyskretnego modelu akustycznego w systemie automatycznego rozpoznawania mowy |
Country Status (2)
| Country | Link |
|---|---|
| US (1) | US20140006021A1 (pl) |
| PL (1) | PL399698A1 (pl) |
Families Citing this family (3)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| CN113300890B (zh) * | 2021-05-24 | 2022-06-14 | 同济大学 | 一种网络化机器学习系统的自适应通信方法 |
| CN115050355B (zh) * | 2022-05-31 | 2024-07-16 | 北京小米移动软件有限公司 | 语音识别模型的训练方法和装置、电子设备和存储介质 |
| CN116052682B (zh) * | 2023-02-24 | 2026-03-24 | 阳光保险集团股份有限公司 | 一种方言语音转换的方法、装置、设备及介质 |
Family Cites Families (12)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| US5535305A (en) * | 1992-12-31 | 1996-07-09 | Apple Computer, Inc. | Sub-partitioned vector quantization of probability density functions |
| US5794197A (en) * | 1994-01-21 | 1998-08-11 | Micrsoft Corporation | Senone tree representation and evaluation |
| JP2690027B2 (ja) * | 1994-10-05 | 1997-12-10 | 株式会社エイ・ティ・アール音声翻訳通信研究所 | パターン認識方法及び装置 |
| US5806030A (en) * | 1996-05-06 | 1998-09-08 | Matsushita Electric Industrial Co Ltd | Low complexity, high accuracy clustering method for speech recognizer |
| US6895375B2 (en) * | 2001-10-04 | 2005-05-17 | At&T Corp. | System for bandwidth extension of Narrow-band speech |
| US20040006470A1 (en) * | 2002-07-03 | 2004-01-08 | Pioneer Corporation | Word-spotting apparatus, word-spotting method, and word-spotting program |
| US8214213B1 (en) * | 2006-04-27 | 2012-07-03 | At&T Intellectual Property Ii, L.P. | Speech recognition based on pronunciation modeling |
| US7617103B2 (en) * | 2006-08-25 | 2009-11-10 | Microsoft Corporation | Incrementally regulated discriminative margins in MCE training for speech recognition |
| US8423364B2 (en) * | 2007-02-20 | 2013-04-16 | Microsoft Corporation | Generic framework for large-margin MCE training in speech recognition |
| US8200797B2 (en) * | 2007-11-16 | 2012-06-12 | Nec Laboratories America, Inc. | Systems and methods for automatic profiling of network event sequences |
| RU2409897C1 (ru) * | 2009-05-18 | 2011-01-20 | Самсунг Электроникс Ко., Лтд | Кодер, передающее устройство, система передачи и способ кодирования информационных объектов |
| KR20120045582A (ko) * | 2010-10-29 | 2012-05-09 | 한국전자통신연구원 | 음향 모델 생성 장치 및 방법 |
-
2012
- 2012-06-27 PL PL399698A patent/PL399698A1/pl unknown
- 2012-08-06 US US13/567,963 patent/US20140006021A1/en not_active Abandoned
Also Published As
| Publication number | Publication date |
|---|---|
| US20140006021A1 (en) | 2014-01-02 |
Similar Documents
| Publication | Publication Date | Title |
|---|---|---|
| WO2015009586A3 (en) | Performing an operation relative to tabular data based upon voice input | |
| WO2014197334A3 (en) | System and method for user-specified pronunciation of words for speech synthesis and recognition | |
| EP4700764A3 (en) | Joint audio-video facial animation system | |
| EP4235647A3 (en) | Determining dialog states for language models | |
| WO2020117639A3 (en) | Text independent speaker recognition | |
| WO2009025356A1 (ja) | 音声認識装置および音声認識方法 | |
| WO2015057907A3 (en) | System and method for learning alternate pronunciations for speech recognition | |
| WO2014124332A3 (en) | Voice trigger for a digital assistant | |
| SG11201912053XA (en) | Automatically determining language for speech recognition of spoken utterance received via an automated assistant interface | |
| MX2015009812A (es) | Metodo y sistema para el reconicimiento de comandos de voz. | |
| PH12014500482A1 (en) | Systems and methods for language learning | |
| WO2014005142A3 (en) | Modeling l1-specific phonological errors | |
| GB2552623A (en) | Systems and methods for automated evaluation of human speech | |
| MX2017001121A (es) | Reconocimiento del habla en base a acustica y a dominio para vehiculos. | |
| TW200638337A (en) | Using a spoken utterance for disambiguation of spelling inputs into a speech recognition system | |
| EP4235649A3 (en) | Language model biasing | |
| WO2016139670A8 (en) | System and method for generating accurate speech transcription from natural speech audio signals | |
| WO2008087934A1 (ja) | 拡張認識辞書学習装置と音声認識システム | |
| ATE457510T1 (de) | Spracherkennungssystem mit riesigem vokabular | |
| ATE531031T1 (de) | Segmentbasierte tonale modellierung für tonale sprachen | |
| GB2486038B (en) | Speech-to-text conversion | |
| EP4224467C0 (en) | TRAINING A SPEECH SYNTHESIS MODEL FOR A SPECIFIC SPEAKER'S VOICE BASED ON A PRE-TRAINED MODEL | |
| EP4529677A4 (en) | SYSTEM FOR PROVIDING NATURAL SPEECH BY A VOICE ASSISTANT AND ASSOCIATED METHOD | |
| PL399698A1 (pl) | Sposób doboru zlozonosci dyskretnego modelu akustycznego w systemie automatycznego rozpoznawania mowy | |
| MX2015014413A (es) | Simulacion de respuesta al impulso acustico. |