ATE457511T1 - Sprechererkennung - Google Patents

Sprechererkennung

Info

Publication number: ATE457511T1
Authority: AT; Austria
Prior art keywords: speaker; speaker model; speech input; received speech; speaker recognition
Prior art date: 2007-10-10

Application number

AT07019849T

Other languages

English (en)

Inventor

Franz Gerl

Tobias Herbig

Original Assignee

Harman Becker Automotive Sys

Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)

2007-10-10

Filing date

2007-10-10

Publication date

2010-02-15

2007-10-10 Application filed by Harman Becker Automotive Sys filed Critical Harman Becker Automotive Sys

2010-02-15 Application granted granted Critical

2010-02-15 Publication of ATE457511T1 publication Critical patent/ATE457511T1/de

Links

Classifications

- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L17/00—Speaker identification or verification techniques
- G10L17/04—Training, enrolment or model building

Landscapes

Engineering & Computer Science (AREA)
Health & Medical Sciences (AREA)
Audiology, Speech & Language Pathology (AREA)
Human Computer Interaction (AREA)
Physics & Mathematics (AREA)
Acoustics & Sound (AREA)
Multimedia (AREA)
Telephonic Communication Services (AREA)
Measuring Fluid Pressure (AREA)
Magnetic Resonance Imaging Apparatus (AREA)
Measuring Pulse, Heart Rate, Blood Pressure Or Blood Flow (AREA)
Telephone Function (AREA)

AT07019849T 2007-10-10 2007-10-10 Sprechererkennung ATE457511T1 (de)

Applications Claiming Priority (1)

Application Number	Priority Date	Filing Date	Title
EP07019849A EP2048656B1 (de)	2007-10-10	2007-10-10	Sprechererkennung

Publications (1)

Publication Number	Publication Date
ATE457511T1 true ATE457511T1 (de)	2010-02-15

Family

ID=38769925

Family Applications (1)

Application Number	Title	Priority Date	Filing Date
AT07019849T ATE457511T1 (de)	2007-10-10	2007-10-10	Sprechererkennung

Country Status (4)

Country	Link
US (1)	US20090119103A1 (de)
EP (1)	EP2048656B1 (de)
AT (1)	ATE457511T1 (de)
DE (1)	DE602007004733D1 (de)

Families Citing this family (66)

* Cited by examiner, † Cited by third party
Publication number	Priority date	Publication date	Assignee	Title
US8504366B2 (en) *	2005-12-19	2013-08-06	Nuance Communications, Inc.	Joint factor analysis scoring for speech processing systems
US8566093B2 (en) *	2006-05-16	2013-10-22	Loquendo S.P.A.	Intersession variability compensation for automatic extraction of information from voice
FI20060666A0 (fi) *	2006-07-07	2006-07-07	Nokia Corp	Menetelmä ja järjestelmä epäjatkuvan lähetyksen toiminnallisuuden parantamiseksi
US7970614B2 (en) *	2007-05-08	2011-06-28	Nuance Communications, Inc.	Continuous adaptation in detection systems via self-tuning from target population subsets
DE602007014382D1 (de) *	2007-11-12	2011-06-16	Harman Becker Automotive Sys	Unterscheidung zwischen Vordergrundsprache und Hintergrundgeräuschen
US9418662B2 (en) *	2009-01-21	2016-08-16	Nokia Technologies Oy	Method, apparatus and computer program product for providing compound models for speech recognition adaptation
EP2216775B1 (de) *	2009-02-05	2012-11-21	Nuance Communications, Inc.	Sprechererkennung
US20100217590A1 (en) *	2009-02-24	2010-08-26	Broadcom Corporation	Speaker localization system and method
US8184180B2 (en) *	2009-03-25	2012-05-22	Broadcom Corporation	Spatially synchronized audio and video capture
US9583095B2 (en) *	2009-07-17	2017-02-28	Nec Corporation	Speech processing device, method, and storage medium
US8160877B1 (en) *	2009-08-06	2012-04-17	Narus, Inc.	Hierarchical real-time speaker recognition for biometric VoIP verification and targeting
US8233352B2 (en) *	2009-08-17	2012-07-31	Broadcom Corporation	Audio source localization system and method
GB2478314B (en) *	2010-03-02	2012-09-12	Toshiba Res Europ Ltd	A speech processor, a speech processing method and a method of training a speech processor
US9009040B2 (en) *	2010-05-05	2015-04-14	Cisco Technology, Inc.	Training a transcription system
US8234111B2 (en) *	2010-06-14	2012-07-31	Google Inc.	Speech and noise models for speech recognition
CN102486922B (zh) *	2010-12-03	2014-12-03	株式会社理光	说话人识别方法、装置和系统
US8639508B2 (en) *	2011-02-14	2014-01-28	General Motors Llc	User-specific confidence thresholds for speech recognition
US9082403B2 (en) *	2011-12-15	2015-07-14	Microsoft Technology Licensing, Llc	Spoken utterance classification training for a speech recognition system
US9147401B2 (en) *	2011-12-21	2015-09-29	Sri International	Method and apparatus for speaker-calibrated speaker detection
US8935164B2 (en) *	2012-05-02	2015-01-13	Gentex Corporation	Non-spatial speech detection system and method of using same
CN102664011B (zh) *	2012-05-17	2014-03-12	吉林大学	一种快速说话人识别方法
US9881616B2 (en) *	2012-06-06	2018-01-30	Qualcomm Incorporated	Method and systems having improved speech recognition
US8429103B1 (en)	2012-06-22	2013-04-23	Google Inc.	Native machine learning service for user adaptation on a mobile platform
US8510238B1 (en)	2012-06-22	2013-08-13	Google, Inc.	Method to predict session duration on mobile devices using native machine learning
US8886576B1 (en)	2012-06-22	2014-11-11	Google Inc.	Automatic label suggestions for albums based on machine learning
US9368116B2 (en)	2012-09-07	2016-06-14	Verint Systems Ltd.	Speaker separation in diarization
US9336771B2 (en) *	2012-11-01	2016-05-10	Google Inc.	Speech recognition using non-parametric models
US9837078B2 (en) *	2012-11-09	2017-12-05	Mattersight Corporation	Methods and apparatus for identifying fraudulent callers
US20140136204A1 (en) *	2012-11-13	2014-05-15	GM Global Technology Operations LLC	Methods and systems for speech systems
US9190057B2 (en) *	2012-12-12	2015-11-17	Amazon Technologies, Inc.	Speech model retrieval in distributed speech recognition systems
US20160049163A1 (en) *	2013-05-13	2016-02-18	Thomson Licensing	Method, apparatus and system for isolating microphone audio
US9460722B2 (en)	2013-07-17	2016-10-04	Verint Systems Ltd.	Blind diarization of recorded calls with arbitrary number of speakers
US9984706B2 (en)	2013-08-01	2018-05-29	Verint Systems Ltd.	Voice activity detection using a soft decision mechanism
US10561361B2 (en) *	2013-10-20	2020-02-18	Massachusetts Institute Of Technology	Using correlation structure of speech dynamics to detect neurological changes
CN104143326B (zh) *	2013-12-03	2016-11-02	腾讯科技（深圳）有限公司	一种语音命令识别方法和装置
US20150161999A1 (en) *	2013-12-09	2015-06-11	Ravi Kalluri	Media content consumption with individualized acoustic speech recognition
US9507852B2 (en) *	2013-12-10	2016-11-29	Google Inc.	Techniques for discriminative dependency parsing
US10540979B2 (en) *	2014-04-17	2020-01-21	Qualcomm Incorporated	User interface for secure access to a device using speaker verification
JP6464650B2 (ja) *	2014-10-03	2019-02-06	日本電気株式会社	音声処理装置、音声処理方法、およびプログラム
WO2016095218A1 (en)	2014-12-19	2016-06-23	Dolby Laboratories Licensing Corporation	Speaker identification using spatial information
US9875742B2 (en) *	2015-01-26	2018-01-23	Verint Systems Ltd.	Word-level blind diarization of recorded calls with arbitrary number of speakers
JP6596376B2 (ja) *	2015-04-22	2019-10-23	パナソニック株式会社	話者識別方法及び話者識別装置
US10147442B1 (en) *	2015-09-29	2018-12-04	Amazon Technologies, Inc.	Robust neural network acoustic model with side task prediction of reference signals
WO2017157423A1 (en) *	2016-03-15	2017-09-21	Telefonaktiebolaget Lm Ericsson (Publ)	System, apparatus, and method for performing speaker verification using a universal background model
AU2017294791B2 (en) *	2016-07-11	2021-06-03	Ftr, Ltd.	Method and system for automatically diarising a sound recording
GB2557375A (en) *	2016-12-02	2018-06-20	Cirrus Logic Int Semiconductor Ltd	Speaker identification
CN108305619B (zh) *	2017-03-10	2020-08-04	腾讯科技（深圳）有限公司	语音数据集训练方法和装置
US10468032B2 (en) *	2017-04-10	2019-11-05	Intel Corporation	Method and system of speaker recognition using context aware confidence modeling
IT201700044093A1 (it)	2017-04-21	2018-10-21	Telecom Italia Spa	Metodo e sistema di riconoscimento del parlatore
KR102371313B1 (ko) *	2017-05-29	2022-03-08	삼성전자주식회사	사용자 발화를 처리하는 전자 장치 및 그 전자 장치의 제어 방법
US20180366127A1 (en) *	2017-06-14	2018-12-20	Intel Corporation	Speaker recognition based on discriminant analysis
EP3451330A1 (de)	2017-08-31	2019-03-06	Thomson Licensing	Vorrichtung und verfahren zur erkennung eines sprechers in einem haus
WO2019048063A1 (en)	2017-09-11	2019-03-14	Telefonaktiebolaget Lm Ericsson (Publ)	VOICE COMMAND MANAGEMENT OF USER PROFILES
WO2019048062A1 (en)	2017-09-11	2019-03-14	Telefonaktiebolaget Lm Ericsson (Publ)	MANAGING USER PROFILES WITH VOICE COMMAND
CN107978311B (zh) *	2017-11-24	2020-08-25	腾讯科技（深圳）有限公司	一种语音数据处理方法、装置以及语音交互设备
US11545157B2 (en) *	2018-04-23	2023-01-03	Google Llc	Speaker diartzation using an end-to-end model
US10818296B2 (en)	2018-06-21	2020-10-27	Intel Corporation	Method and system of robust speaker recognition activation
US10762905B2 (en) *	2018-07-31	2020-09-01	Cirrus Logic, Inc.	Speaker verification
RU2744063C1 (ru) *	2018-12-18	2021-03-02	Общество С Ограниченной Ответственностью "Яндекс"	Способ и система определения говорящего пользователя управляемого голосом устройства
US11741986B2 (en) *	2019-11-05	2023-08-29	Samsung Electronics Co., Ltd.	System and method for passive subject specific monitoring
CN113261056B (zh) *	2019-12-04	2024-08-02	谷歌有限责任公司	使用说话者相关语音模型的说话者感知
FR3104797B1 (fr) *	2019-12-17	2022-01-07	Renault Sas	Procede d’identification d’au moins une personne a bord d’un vehicule automobile par analyse vocale
CN112749508B (zh) *	2020-12-29	2024-03-05	浙江天行健智能科技有限公司	一种基于gmm和bp神经网络的路感模拟方法
CN112786058B (zh) *	2021-03-08	2024-03-29	北京百度网讯科技有限公司	声纹模型训练方法、装置、设备以及存储介质
US11996087B2 (en) *	2021-04-30	2024-05-28	Comcast Cable Communications, Llc	Method and apparatus for intelligent voice recognition
CN114974258B (zh) *	2022-07-27	2022-12-16	深圳市北科瑞声科技股份有限公司	基于语音处理的说话人分离方法、装置、设备及存储介质

Family Cites Families (6)

* Cited by examiner, † Cited by third party
Publication number	Priority date	Publication date	Assignee	Title
US5598507A (en) *	1994-04-12	1997-01-28	Xerox Corporation	Method of speaker clustering for unknown speakers in conversational audio data
DE69924596T2 (de) *	1999-01-20	2006-02-09	Sony International (Europe) Gmbh	Auswahl akustischer Modelle mittels Sprecherverifizierung
US6766295B1 (en) *	1999-05-10	2004-07-20	Nuance Communications	Adaptation of a speech recognition system across multiple remote sessions with a speaker
CN1236423C (zh) *	2001-05-10	2006-01-11	皇家菲利浦电子有限公司	说话人声音的后台学习
US7292977B2 (en) *	2002-10-17	2007-11-06	Bbnt Solutions Llc	Systems and methods for providing online fast speaker adaptation in speech recognition
US8386254B2 (en) *	2007-05-04	2013-02-26	Nuance Communications, Inc.	Multi-class constrained maximum likelihood linear regression

2007
- 2007-10-10 EP EP07019849A patent/EP2048656B1/de active Active
- 2007-10-10 DE DE602007004733T patent/DE602007004733D1/de active Active
- 2007-10-10 AT AT07019849T patent/ATE457511T1/de not_active IP Right Cessation
2008
- 2008-10-10 US US12/249,089 patent/US20090119103A1/en not_active Abandoned

Also Published As

Publication number	Publication date
US20090119103A1 (en)	2009-05-07
EP2048656A1 (de)	2009-04-15
DE602007004733D1 (de)	2010-03-25
EP2048656B1 (de)	2010-02-10

Legal Events

Date	Code	Title	Description
2010-08-15	RER	Ceased as to paragraph 5 lit. 3 law introducing patent treaties

Publication	Publication Date	Title
ATE457511T1 (de)	2010-02-15	Sprechererkennung
ATE527652T1 (de)	2011-10-15	Mehrstufige spracherkennung
ATE424329T1 (de)	2009-03-15	Sprachsteuerung von fahrzeugelementen von ausserhalb einer fahrzeugkabine
MX2008001615A (es)	2008-04-07	Confirmacion selectiva para ejecucion de una interfase de usuario activada por voz.
WO2008144638A3 (en)	2009-04-02	Systems and methods of a structured grammar for a speech recognition command system
DE602007014382D1 (de)	2011-06-16	Unterscheidung zwischen Vordergrundsprache und Hintergrundgeräuschen
ATE479983T1 (de)	2010-09-15	Verfahren und system zur spracherkennung zum durchsuchen einer datenbank
WO2012036424A3 (en)	2012-06-28	Method and apparatus for performing microphone beamforming
WO2009016631A3 (en)	2010-03-04	Automatic context sensitive language correction and enhancement using an internet corpus
ATE499180T1 (de)	2011-03-15	Werkzeugmaschinensicherheitsvorrichtung
ATE440334T1 (de)	2009-09-15	System für sprachgesteuerte auswahl einer audiodatei und verfahren dafür
WO2010117712A3 (en)	2011-02-24	Systems and methods for measuring speech intelligibility
DE602005018552D1 (de)	2010-02-04	Verfahren zum anpassen eines neuronalen netzwerks einer automatischen spracherkennungseinrichtung
WO2014063104A3 (en)	2014-06-19	Keyword voice activation in vehicles
EP2482277A4 (de)	2013-04-10	Verfahren zur identiifizierung eines sprechers basierend auf zufälligen sprachphonogrammen unter verwendung der formantenentzerrung
WO2008084575A1 (ja)	2008-07-17	車載用音声認識装置
WO2009145508A3 (ko)	2010-01-21	실시간 호출명령어 인식을 이용한 잡음환경에서의 음성구간검출과 연속음성인식 시스템
WO2006002299A3 (en)	2006-09-08	Method and apparatus for recognizing 3-d objects
NO20075732L (no)	2008-03-17	Flersensorisk taleforbedring ved bruk av sannsynligheten for ren tale
ATE449403T1 (de)	2009-12-15	Mehrstimmige spracherkennung
WO2008005711A3 (en)	2008-09-25	Non-enrolled continuous dictation
ATE407411T1 (de)	2008-09-15	Verfahren zum bereitstellen von kontoinformation und system zum aufschreiben von diktiertem text
WO2012134877A3 (en)	2014-05-01	Computer-implemented systems and methods evaluating prosodic features of speech
WO2014115115A3 (en)	2014-11-06	Determining apnea-hypopnia index ahi from speech
GB0506528D0 (en)	2005-05-04	System and method for automatic speech recognition