ATE343197T1 - Vorrichtung zur bestimmung von parametern eines gauss'schen mischungmodells (gmm) oder eines gmm basierten hidden markov modells - Google Patents

Vorrichtung zur bestimmung von parametern eines gauss'schen mischungmodells (gmm) oder eines gmm basierten hidden markov modells

Info

Publication number
ATE343197T1
ATE343197T1 AT03712399T AT03712399T ATE343197T1 AT E343197 T1 ATE343197 T1 AT E343197T1 AT 03712399 T AT03712399 T AT 03712399T AT 03712399 T AT03712399 T AT 03712399T AT E343197 T1 ATE343197 T1 AT E343197T1
Authority
AT
Austria
Prior art keywords
gmm
model
hidden markov
gaussic
determining parameters
Prior art date
Application number
AT03712399T
Other languages
English (en)
Inventor
Christopher J Webber
Original Assignee
Qinetiq Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Qinetiq Ltd filed Critical Qinetiq Ltd
Application granted granted Critical
Publication of ATE343197T1 publication Critical patent/ATE343197T1/de

Links

Classifications

    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00Speech recognition
    • G10L15/08Speech classification or search
    • G10L15/14Speech classification or search using statistical models, e.g. Hidden Markov Models [HMMs]
    • G10L15/142Hidden Markov Models [HMMs]
    • G10L15/144Training of HMMs

Landscapes

  • Engineering & Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • Multimedia (AREA)
  • Health & Medical Sciences (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Human Computer Interaction (AREA)
  • Acoustics & Sound (AREA)
  • Probability & Statistics with Applications (AREA)
  • Computational Linguistics (AREA)
  • Compression, Expansion, Code Conversion, And Decoders (AREA)
  • Communication Control (AREA)
  • Ultra Sonic Daignosis Equipment (AREA)
  • Fats And Perfumes (AREA)
  • Absorbent Articles And Supports Therefor (AREA)
  • Image Analysis (AREA)
  • Machine Translation (AREA)
  • Character Discrimination (AREA)
  • Analysing Materials By The Use Of Radiation (AREA)
AT03712399T 2002-03-28 2003-03-24 Vorrichtung zur bestimmung von parametern eines gauss'schen mischungmodells (gmm) oder eines gmm basierten hidden markov modells ATE343197T1 (de)

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
GB0207343A GB2387008A (en) 2002-03-28 2002-03-28 Signal Processing System

Publications (1)

Publication Number Publication Date
ATE343197T1 true ATE343197T1 (de) 2006-11-15

Family

ID=9933907

Family Applications (1)

Application Number Title Priority Date Filing Date
AT03712399T ATE343197T1 (de) 2002-03-28 2003-03-24 Vorrichtung zur bestimmung von parametern eines gauss'schen mischungmodells (gmm) oder eines gmm basierten hidden markov modells

Country Status (8)

Country Link
US (1) US7664640B2 (de)
EP (1) EP1488411B1 (de)
JP (1) JP4264006B2 (de)
AT (1) ATE343197T1 (de)
AU (1) AU2003217013A1 (de)
DE (1) DE60309142T2 (de)
GB (1) GB2387008A (de)
WO (1) WO2003083831A1 (de)

Families Citing this family (53)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US7167587B2 (en) * 2002-08-30 2007-01-23 Lockheed Martin Corporation Sequential classifier for use in pattern recognition system
US20040086185A1 (en) * 2002-10-31 2004-05-06 Eastman Kodak Company Method and system for multiple cue integration
JP2005141601A (ja) * 2003-11-10 2005-06-02 Nec Corp モデル選択計算装置,動的モデル選択装置,動的モデル選択方法およびプログラム
JP4511850B2 (ja) * 2004-03-03 2010-07-28 学校法人早稲田大学 人物属性識別方法およびそのシステム
US8010356B2 (en) * 2006-02-17 2011-08-30 Microsoft Corporation Parameter learning in a hidden trajectory model
US20070219796A1 (en) * 2006-03-20 2007-09-20 Microsoft Corporation Weighted likelihood ratio for pattern recognition
CN101416237B (zh) * 2006-05-01 2012-05-30 日本电信电话株式会社 基于源和室内声学的概率模型的语音去混响方法和设备
US8234116B2 (en) * 2006-08-22 2012-07-31 Microsoft Corporation Calculating cost measures between HMM acoustic models
US7937270B2 (en) * 2007-01-16 2011-05-03 Mitsubishi Electric Research Laboratories, Inc. System and method for recognizing speech securely using a secure multi-party computation protocol
US20080275743A1 (en) * 2007-05-03 2008-11-06 Kadambe Shubha L Systems and methods for planning
US8521530B1 (en) 2008-06-30 2013-08-27 Audience, Inc. System and method for enhancing a monaural audio signal
US9767806B2 (en) * 2013-09-24 2017-09-19 Cirrus Logic International Semiconductor Ltd. Anti-spoofing
AU2010201891B2 (en) * 2009-05-13 2015-02-12 The University Of Sydney A method and system for data analysis and synthesis
US9008329B1 (en) * 2010-01-26 2015-04-14 Audience, Inc. Noise reduction using multi-feature cluster tracker
US8473287B2 (en) 2010-04-19 2013-06-25 Audience, Inc. Method for jointly optimizing noise reduction and voice quality in a mono or multi-microphone system
US8538035B2 (en) 2010-04-29 2013-09-17 Audience, Inc. Multi-microphone robust noise suppression
US8781137B1 (en) 2010-04-27 2014-07-15 Audience, Inc. Wind noise detection and suppression
US9558755B1 (en) 2010-05-20 2017-01-31 Knowles Electronics, Llc Noise suppression assisted automatic speech recognition
US8447596B2 (en) 2010-07-12 2013-05-21 Audience, Inc. Monaural noise suppression based on computational auditory scene analysis
US9047867B2 (en) 2011-02-21 2015-06-02 Adobe Systems Incorporated Systems and methods for concurrent signal recognition
US8554553B2 (en) * 2011-02-21 2013-10-08 Adobe Systems Incorporated Non-negative hidden Markov modeling of signals
US8849663B2 (en) 2011-03-21 2014-09-30 The Intellisis Corporation Systems and methods for segmenting and/or classifying an audio signal from transformed audio information
US9142220B2 (en) 2011-03-25 2015-09-22 The Intellisis Corporation Systems and methods for reconstructing an audio signal from transformed audio information
US9183850B2 (en) 2011-08-08 2015-11-10 The Intellisis Corporation System and method for tracking sound pitch across an audio signal
US8620646B2 (en) 2011-08-08 2013-12-31 The Intellisis Corporation System and method for tracking sound pitch across an audio signal using harmonic envelope
US8548803B2 (en) 2011-08-08 2013-10-01 The Intellisis Corporation System and method of processing a sound signal including transforming the sound signal into a frequency-chirp domain
US8843364B2 (en) 2012-02-29 2014-09-23 Adobe Systems Incorporated Language informed source separation
US9640194B1 (en) 2012-10-04 2017-05-02 Knowles Electronics, Llc Noise suppression for speech processing based on machine-learning mask estimation
US9058820B1 (en) 2013-05-21 2015-06-16 The Intellisis Corporation Identifying speech portions of a sound model using various statistics thereof
US9484044B1 (en) 2013-07-17 2016-11-01 Knuedge Incorporated Voice enhancement and/or speech features extraction on noisy audio signals using successively refined transforms
US9530434B1 (en) 2013-07-18 2016-12-27 Knuedge Incorporated Reducing octave errors during pitch determination for noisy audio signals
US9208794B1 (en) 2013-08-07 2015-12-08 The Intellisis Corporation Providing sound models of an input signal using continuous and/or linear fitting
KR101559364B1 (ko) * 2014-04-17 2015-10-12 한국과학기술원 페이스 투 페이스 인터랙션 모니터링을 수행하는 모바일 장치, 이를 이용하는 인터랙션 모니터링 방법, 이를 포함하는 인터랙션 모니터링 시스템 및 이에 의해 수행되는 인터랙션 모니터링 모바일 애플리케이션
CN106797512B (zh) 2014-08-28 2019-10-25 美商楼氏电子有限公司 多源噪声抑制的方法、系统和非瞬时计算机可读存储介质
KR101904423B1 (ko) * 2014-09-03 2018-11-28 삼성전자주식회사 오디오 신호를 학습하고 인식하는 방법 및 장치
US9922668B2 (en) 2015-02-06 2018-03-20 Knuedge Incorporated Estimating fractional chirp rate with multiple frequency representations
US9870785B2 (en) 2015-02-06 2018-01-16 Knuedge Incorporated Determining features of harmonic signals
US9842611B2 (en) 2015-02-06 2017-12-12 Knuedge Incorporated Estimating pitch using peak-to-peak distances
US9721569B2 (en) * 2015-05-27 2017-08-01 Intel Corporation Gaussian mixture model accelerator with direct memory access engines corresponding to individual data streams
US10056076B2 (en) * 2015-09-06 2018-08-21 International Business Machines Corporation Covariance matrix estimation with structural-based priors for speech processing
US20170255864A1 (en) * 2016-03-05 2017-09-07 Panoramic Power Ltd. Systems and Methods Thereof for Determination of a Device State Based on Current Consumption Monitoring and Machine Learning Thereof
CN105933323B (zh) * 2016-06-01 2019-05-31 百度在线网络技术(北京)有限公司 声纹注册、认证方法及装置
US10754959B1 (en) * 2017-01-20 2020-08-25 University Of South Florida Non-linear stochastic models for predicting exploitability
JP7103235B2 (ja) * 2017-02-17 2022-07-20 日本電気株式会社 パラメタ算出装置、パラメタ算出方法、及び、パラメタ算出プログラム
US10650150B1 (en) * 2017-02-28 2020-05-12 University Of South Florida Vulnerability life cycle exploitation timing modeling
US10659488B1 (en) * 2017-02-28 2020-05-19 University Of South Florida Statistical predictive model for expected path length
US11017096B2 (en) * 2018-06-01 2021-05-25 University Of South Florida Prediction of software vulnerabilities
US11190534B1 (en) 2019-03-21 2021-11-30 Snap Inc. Level of network suspicion detection
US11349857B1 (en) * 2019-03-21 2022-05-31 Snap Inc. Suspicious group detection
CN111327558B (zh) * 2020-02-28 2022-06-21 杭州电子科技大学 用于滤波器多载波调制光通信的gmm非均匀量化的方法及系统
US11483335B1 (en) * 2020-12-03 2022-10-25 University Of South Florida Cybersecurity: reliability of a computer network
CN115824481B (zh) * 2022-10-01 2024-07-02 同济大学 一种基于递归演化的实时索杆力识别方法
CN117134968B (zh) * 2023-08-29 2025-11-04 燕山大学 基于高斯混合隐马尔可夫及迁移学习的多阶段攻击检测算法

Family Cites Families (9)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US5450523A (en) * 1990-11-15 1995-09-12 Matsushita Electric Industrial Co., Ltd. Training module for estimating mixture Gaussian densities for speech unit models in speech recognition systems
JP2871561B2 (ja) * 1995-11-30 1999-03-17 株式会社エイ・ティ・アール音声翻訳通信研究所 不特定話者モデル生成装置及び音声認識装置
US6044344A (en) * 1997-01-03 2000-03-28 International Business Machines Corporation Constrained corrective training for continuous parameter system
US6374221B1 (en) * 1999-06-22 2002-04-16 Lucent Technologies Inc. Automatic retraining of a speech recognizer while using reliable transcripts
US6993452B2 (en) * 2000-05-04 2006-01-31 At&T Corp. Distance measure for probability distribution function of mixture type
US6609093B1 (en) * 2000-06-01 2003-08-19 International Business Machines Corporation Methods and apparatus for performing heteroscedastic discriminant analysis in pattern recognition systems
GB0017989D0 (en) 2000-07-24 2001-08-08 Secr Defence Target recognition system
TW473704B (en) * 2000-08-30 2002-01-21 Ind Tech Res Inst Adaptive voice recognition method with noise compensation
US7295978B1 (en) * 2000-09-05 2007-11-13 Verizon Corporate Services Group Inc. Systems and methods for using one-dimensional gaussian distributions to model speech

Also Published As

Publication number Publication date
DE60309142T2 (de) 2007-08-16
EP1488411B1 (de) 2006-10-18
GB0207343D0 (en) 2002-05-08
GB2387008A (en) 2003-10-01
US20060178887A1 (en) 2006-08-10
DE60309142D1 (de) 2006-11-30
WO2003083831A1 (en) 2003-10-09
JP2005521906A (ja) 2005-07-21
US7664640B2 (en) 2010-02-16
AU2003217013A1 (en) 2003-10-13
EP1488411A1 (de) 2004-12-22
JP4264006B2 (ja) 2009-05-13

Similar Documents

Publication Publication Date Title
DE60309142D1 (de) Vorrichtung zur bestimmung von parametern eines gauss'schen mischungmodells (gmm) oder eines gmm basierten hidden markov modells
DE69519297D1 (de) Verfahren und vorrichtung zur spracherkennung mittels optimierter partieller buendelung von wahrscheinlichkeitsmischungen
CN101136199B (zh) 语音数据处理方法和设备
DE59010131D1 (de) Verfahren zur sprecheradaptiven Erkennung von Sprache
EP1022722A3 (de) Sprecheradaptation auf der Basis von Stimm-Eigenvektoren
DE59904741D1 (de) Anordnung und verfahren zur erkennung eines vorgegebenen wortschatzes in gesprochener sprache durch einen rechner
ATE265083T1 (de) Verfahren und vorrichtung zum unterscheidenden training von akustischen modellen in einem spracherkennungssystem
ATE363712T1 (de) Parametrische online-histogramm normierung zur rauschrobusten spracherkennung
Matsui et al. A text-independent speaker recognition method robust against utterance variations
CN108877784B (zh) 一种基于口音识别的鲁棒语音识别方法
AU2001273410A1 (en) Method and apparatus for constructing voice templates for a speaker-independent voice recognition system
CN104658538A (zh) 一种基于鸟鸣声的移动式鸟类识别方法
DE60325881D1 (de) Verfahren zum betreiben eines spracherkennungssystemes
EP0852374A3 (de) Verfahren und System zur sprecherunabhängigen Erkennung von benutzerdefinierten Sätzen
DE60128479D1 (de) Verfahren und vorrichtung zur bestimmung eines synthetischen höheren bandsignals in einem sprachkodierer
DE60117558D1 (de) Verfahren zur rauschrobusten klassifikation in der sprachkodierung
Funaki et al. On Robust speech analysis based on time-varying complex AR model
Sailaja et al. Text independent speaker identification with finite multivariate generalized gaussian mixture model and hierarchical clustering algorithm
KR20070045772A (ko) 성대신호 인식 장치 및 그 방법
RU2002129029A (ru) Способ дикторонезависимого распознавания звуков речи
Meen et al. Improving phone label alignment accuracy by utilizing voicing information
Wang et al. A Self adapting Endpoint Detection Algorithm for Speech Recognition in Noisy Environment Based on 1/f Process
Hoshimi et al. Speaker independent speech recognition method using training speech from a small number of speakers
Peng et al. Stochastic Segment Model Decoding Algorithm Based on Neighboring Segments and its Application in LVCSR
WO2006034152A3 (en) Discriminative training of document transcription system

Legal Events

Date Code Title Description
RER Ceased as to paragraph 5 lit. 3 law introducing patent treaties