ATE491202T1 - Kompensation der variabilität zwischen sitzungen zur automatischen extraktion von informationen aus sprache - Google Patents

Kompensation der variabilität zwischen sitzungen zur automatischen extraktion von informationen aus sprache

Info

Publication number
ATE491202T1
ATE491202T1 AT06742938T AT06742938T ATE491202T1 AT E491202 T1 ATE491202 T1 AT E491202T1 AT 06742938 T AT06742938 T AT 06742938T AT 06742938 T AT06742938 T AT 06742938T AT E491202 T1 ATE491202 T1 AT E491202T1
Authority
AT
Austria
Prior art keywords
compensating
speech
variability
voice signal
input voice
Prior art date
Application number
AT06742938T
Other languages
English (en)
Inventor
Claudio Vair
Daniele Colibro
Pietro Laface
Original Assignee
Loquendo Spa
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Loquendo Spa filed Critical Loquendo Spa
Application granted granted Critical
Publication of ATE491202T1 publication Critical patent/ATE491202T1/de

Links

Classifications

    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L17/00Speaker identification or verification techniques
    • G10L17/04Training, enrolment or model building

Landscapes

  • Engineering & Computer Science (AREA)
  • Health & Medical Sciences (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Human Computer Interaction (AREA)
  • Physics & Mathematics (AREA)
  • Acoustics & Sound (AREA)
  • Multimedia (AREA)
  • Stereophonic System (AREA)
  • Circuit For Audible Band Transducer (AREA)
  • Complex Calculations (AREA)
  • Machine Translation (AREA)
  • Electrically Operated Instructional Devices (AREA)
AT06742938T 2006-05-16 2006-05-16 Kompensation der variabilität zwischen sitzungen zur automatischen extraktion von informationen aus sprache ATE491202T1 (de)

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
PCT/EP2006/004598 WO2007131530A1 (en) 2006-05-16 2006-05-16 Intersession variability compensation for automatic extraction of information from voice

Publications (1)

Publication Number Publication Date
ATE491202T1 true ATE491202T1 (de) 2010-12-15

Family

ID=37057050

Family Applications (1)

Application Number Title Priority Date Filing Date
AT06742938T ATE491202T1 (de) 2006-05-16 2006-05-16 Kompensation der variabilität zwischen sitzungen zur automatischen extraktion von informationen aus sprache

Country Status (8)

Country Link
US (1) US8566093B2 (de)
EP (1) EP2022042B1 (de)
AT (1) ATE491202T1 (de)
AU (1) AU2006343470B2 (de)
CA (1) CA2652302C (de)
DE (1) DE602006018795D1 (de)
ES (1) ES2357674T3 (de)
WO (1) WO2007131530A1 (de)

Families Citing this family (29)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US8504366B2 (en) * 2005-12-19 2013-08-06 Nuance Communications, Inc. Joint factor analysis scoring for speech processing systems
EP2058797B1 (de) * 2007-11-12 2011-05-04 Harman Becker Automotive Systems GmbH Unterscheidung zwischen Vordergrundsprache und Hintergrundgeräuschen
US9020816B2 (en) * 2008-08-14 2015-04-28 21Ct, Inc. Hidden markov model for speech processing with training method
US8412525B2 (en) 2009-04-30 2013-04-02 Microsoft Corporation Noise robust speech classifier ensemble
US9177557B2 (en) * 2009-07-07 2015-11-03 General Motors Llc. Singular value decomposition for improved voice recognition in presence of multi-talker background noise
FR2965377A1 (fr) 2010-09-24 2012-03-30 Univ D Avignon Et Des Pays De Vaucluse Procede de classification de donnees biometriques
US9042867B2 (en) * 2012-02-24 2015-05-26 Agnitio S.L. System and method for speaker recognition on mobile devices
US9984678B2 (en) * 2012-03-23 2018-05-29 Microsoft Technology Licensing, Llc Factored transforms for separable adaptation of acoustic models
DK2713367T3 (en) * 2012-09-28 2017-02-20 Agnitio S L Speech Recognition
US9240184B1 (en) * 2012-11-15 2016-01-19 Google Inc. Frame-level combination of deep neural network and gaussian mixture models
US9406298B2 (en) * 2013-02-07 2016-08-02 Nuance Communications, Inc. Method and apparatus for efficient i-vector extraction
US20140222423A1 (en) * 2013-02-07 2014-08-07 Nuance Communications, Inc. Method and Apparatus for Efficient I-Vector Extraction
US9865266B2 (en) * 2013-02-25 2018-01-09 Nuance Communications, Inc. Method and apparatus for automated speaker parameters adaptation in a deployed speaker verification system
US9489965B2 (en) * 2013-03-15 2016-11-08 Sri International Method and apparatus for acoustic signal characterization
US9258425B2 (en) 2013-05-22 2016-02-09 Nuance Communications, Inc. Method and system for speaker verification
US10438581B2 (en) 2013-07-31 2019-10-08 Google Llc Speech recognition using neural networks
US9514753B2 (en) 2013-11-04 2016-12-06 Google Inc. Speaker identification using hash-based indexing
WO2015147662A1 (en) * 2014-03-28 2015-10-01 Intel Corporation Training classifiers using selected cohort sample subsets
US10014007B2 (en) 2014-05-28 2018-07-03 Interactive Intelligence, Inc. Method for forming the excitation signal for a glottal pulse model based parametric speech synthesis system
US10255903B2 (en) 2014-05-28 2019-04-09 Interactive Intelligence Group, Inc. Method for forming the excitation signal for a glottal pulse model based parametric speech synthesis system
US9792899B2 (en) * 2014-07-15 2017-10-17 International Business Machines Corporation Dataset shift compensation in machine learning
CN107104994B (zh) * 2016-02-22 2021-07-20 华硕电脑股份有限公司 语音识别方法、电子装置及语音识别系统
NZ749370A (en) 2016-06-02 2020-03-27 Genesys Telecommunications Laboratories Inc Technologies for authenticating a speaker using voice biometrics
DE102017207876A1 (de) * 2017-05-10 2018-11-15 Robert Bosch Gmbh Parallelisierte Verarbeitung
CN109146450A (zh) 2017-06-16 2019-01-04 阿里巴巴集团控股有限公司 支付方法、客户端、电子设备、存储介质和服务器
US10304475B1 (en) * 2017-08-14 2019-05-28 Amazon Technologies, Inc. Trigger word based beam selection
US11289098B2 (en) * 2019-03-08 2022-03-29 Samsung Electronics Co., Ltd. Method and apparatus with speaker recognition registration
CN111833887A (zh) * 2020-07-14 2020-10-27 山东理工大学 一种基于局部保持判别投影的说话人确认方法
US20240428101A1 (en) * 2023-06-22 2024-12-26 Toyota Connected North America, Inc. Systems and methods of voiceprint authentication and interpolation

Family Cites Families (13)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO1999023643A1 (en) * 1997-11-03 1999-05-14 T-Netix, Inc. Model adaptation system and method for speaker verification
US6327565B1 (en) * 1998-04-30 2001-12-04 Matsushita Electric Industrial Co., Ltd. Speaker and environment adaptation based on eigenvoices
US6141644A (en) * 1998-09-04 2000-10-31 Matsushita Electric Industrial Co., Ltd. Speaker verification and speaker identification based on eigenvoices
US6571208B1 (en) * 1999-11-29 2003-05-27 Matsushita Electric Industrial Co., Ltd. Context-dependent acoustic models for medium and large vocabulary speech recognition with eigenvoice training
US6529872B1 (en) * 2000-04-18 2003-03-04 Matsushita Electric Industrial Co., Ltd. Method for noise adaptation in automatic speech recognition using transformed matrices
DE10047723A1 (de) * 2000-09-27 2002-04-11 Philips Corp Intellectual Pty Verfahren zur Ermittlung eines Eigenraums zur Darstellung einer Mehrzahl von Trainingssprechern
US6895376B2 (en) * 2001-05-04 2005-05-17 Matsushita Electric Industrial Co., Ltd. Eigenvoice re-estimation technique of acoustic models for speech recognition, speaker identification and speaker verification
US6915259B2 (en) * 2001-05-24 2005-07-05 Matsushita Electric Industrial Co., Ltd. Speaker and environment adaptation based on linear separation of variability sources
KR101011713B1 (ko) * 2003-07-01 2011-01-28 프랑스 텔레콤 화자의 압축된 표시를 위한 음성 신호 분석 방법 및 시스템
WO2005055200A1 (en) * 2003-12-05 2005-06-16 Queensland University Of Technology Model adaptation system and method for speaker recognition
EP1889255A1 (de) * 2005-05-24 2008-02-20 Loquendo S.p.A. Automatische textunabhängige, sprachenunabhänige sprecher-voice-print-erzeugung und sprechererkennung
ATE457511T1 (de) * 2007-10-10 2010-02-15 Harman Becker Automotive Sys Sprechererkennung
US8050920B2 (en) * 2008-01-18 2011-11-01 Universidad De Chile Biometric control method on the telephone network with speaker verification technology by using an intra speaker variability and additive noise unsupervised compensation

Also Published As

Publication number Publication date
WO2007131530A1 (en) 2007-11-22
US20110040561A1 (en) 2011-02-17
CA2652302A1 (en) 2007-11-22
ES2357674T3 (es) 2011-04-28
DE602006018795D1 (de) 2011-01-20
US8566093B2 (en) 2013-10-22
AU2006343470B2 (en) 2012-07-19
AU2006343470A1 (en) 2007-11-22
CA2652302C (en) 2015-04-07
EP2022042B1 (de) 2010-12-08
EP2022042A1 (de) 2009-02-11

Similar Documents

Publication Publication Date Title
ATE491202T1 (de) Kompensation der variabilität zwischen sitzungen zur automatischen extraktion von informationen aus sprache
WO2020256257A3 (ko) 잡음 환경에 강인한 화자 인식을 위한 심화신경망 기반의 특징 강화 및 변형된 손실 함수를 이용한 결합 학습 방법 및 장치
MX2010008372A (es) Aparato y metodo para calcular coeficientes de filtro para supresion de eco.
WO2006091551A3 (en) Audio signal de-identification
TW200741650A (en) Method and apparatus for processing a audio signal
WO2008087934A1 (ja) 拡張認識辞書学習装置と音声認識システム
NO20083580L (no) Autentisering av taler
DE602005001142D1 (de) Nachrichtenübertragungsgerät
DK2027581T3 (da) Signalseparator, fremgangsmåde til bestemmelse af outputsignaler på basis af mikrofonsignaler og computerprogram
WO2008114448A1 (ja) 音声認識システム、音声認識プログラムおよび音声認識方法
ATE425532T1 (de) Modellbasierte verbesserung von sprachsignalen
WO2008126355A1 (ja) キーワード抽出装置
ATE403213T1 (de) System und verfahren zur automatischen spracherkennung
WO2021074721A3 (en) System for automatic assessment of fluency in spoken language and a method thereof
WO2014145960A3 (en) Method and system for generating advanced feature discrimination vectors for use in speech recognition
WO2007035183A3 (en) Method, system, and program product for measuring audio video synchronization independent of speaker characteristics
GB201212435D0 (en) A transcription device and a method for transcribing speech
ATE442641T1 (de) Spracherkennungsverfahren und -system, das an die eigenschaften von nichtmuttersprachlern angepasst ist
DK2165567T3 (da) Fremgangsmåde til feedbackophævelse i et høreapparat og et høreapparat
WO2007129156A3 (en) Soft alignment in gaussian mixture model based transformation
Nesta et al. Blind source extraction for robust speech recognition in multisource noisy environments
WO2015012680A3 (en) A method for speech watermarking in speaker verification
DE602005007939D1 (de) Verfahren und system zum automatischen bereitstellen linguistischer formulierungen, die ausserhalb ekennungssystems liegen
EP1675102A3 (de) Verfahren zum Extrahieren von Merkmalvektoren für Spracherkennung
DK2360951T3 (da) Fremgangsmåde til adaptiv tilpasning af et høresystems mikrofoner samt et høresystem

Legal Events

Date Code Title Description
RER Ceased as to paragraph 5 lit. 3 law introducing patent treaties