Vocoder-based voice recognizer (Vokoder basierter Spracherkenner)

Info

Publication number: ATE282881T1
Authority: AT; Austria
Prior art keywords: word; vocoder; lpc; data; recognition features
Prior art date: 1998-01-08

Application number

AT98933871T

Other languages

English (en)

Inventor

Yehuda Hershkovits

Gabriel Ilan

Original Assignee

Art Advanced Recognition Tech

Priority date

1998-01-08

Filing date

1998-07-22

Publication date

2004-12-15

1998-07-22 Application filed by Art Advanced Recognition Tech

2004-12-15 Application granted

2004-12-15 Publication of ATE282881T1

Classifications

- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L13/00—Speech synthesis; Text to speech systems
- G10L15/00—Speech recognition
- G10L15/28—Constructional details of speech recognition systems
- G10L15/30—Distributed recognition, e.g. in client-server systems, for mobile phones or network applications
- G10L15/02—Feature extraction for speech recognition; Selection of recognition unit
- G10L19/00—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
- G10L19/04—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using predictive techniques
- G10L25/00—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
- G10L25/78—Detection of presence or absence of voice signals
- G10L2025/783—Detection of presence or absence of voice signals based on threshold decision

Landscapes

Engineering & Computer Science (AREA)
Multimedia (AREA)
Health & Medical Sciences (AREA)
Audiology, Speech & Language Pathology (AREA)
Human Computer Interaction (AREA)
Physics & Mathematics (AREA)
Acoustics & Sound (AREA)
Computational Linguistics (AREA)
Computer Vision & Pattern Recognition (AREA)
Compression, Expansion, Code Conversion, And Decoders (AREA)
Mobile Radio Communication Systems (AREA)
Photoreceptors In Electrophotography (AREA)
Telephonic Communication Services (AREA)
Steering Control In Accordance With Driving Conditions (AREA)
Diaphragms For Electromechanical Transducers (AREA)
Character Discrimination (AREA)
Machine Translation (AREA)
Telephone Function (AREA)

AT98933871T 1998-01-08 1998-07-22 Vocoder-based voice recognizer ATE282881T1 (de)

Applications Claiming Priority (2)

Application Number	Priority Date	Filing Date	Title
US09/002,616 US6003004A (en)	1998-01-08	1998-01-08	Speech recognition method and system using compressed speech data
PCT/IL1998/000341 WO1999035639A1 (en)	1998-01-08	1998-07-22	A vocoder-based voice recognizer

Publications (1)

Publication Number	Publication Date
ATE282881T1 (de)	2004-12-15

Family

ID=21701631

Family Applications (1)

Application Number	Priority Date	Filing Date	Title
AT98933871T ATE282881T1 (de)	1998-01-08	1998-07-22	Vocoder-based voice recognizer

Country Status (12)

Country	Link
US (3)	US6003004A (de)
EP (1)	EP1046154B1 (de)
JP (1)	JP2001510595A (de)
KR (1)	KR100391287B1 (de)
CN (1)	CN1125432C (de)
AT (1)	ATE282881T1 (de)
AU (1)	AU8355398A (de)
DE (1)	DE69827667T2 (de)
IL (1)	IL132449A (de)
RU (1)	RU99124623A (de)
TW (1)	TW394925B (de)
WO (1)	WO1999035639A1 (de)

Families Citing this family (38)

* Cited by examiner, † Cited by third party
Publication number	Priority date	Publication date	Assignee	Title
US6370504B1 (en) *	1997-05-29	2002-04-09	University Of Washington	Speech recognition on MPEG/Audio encoded files
US6134283A (en) *	1997-11-18	2000-10-17	Amati Communications Corporation	Method and system for synchronizing time-division-duplexed transceivers
US6003004A (en) *	1998-01-08	1999-12-14	Advanced Recognition Technologies, Inc.	Speech recognition method and system using compressed speech data
KR100277105B1 (ko) *	1998-02-27	2001-01-15	윤종용	Apparatus and method for determining speech recognition data
US6223157B1 (en) *	1998-05-07	2001-04-24	Dsc Telecom, L.P.	Method for direct recognition of encoded speech data
JP4081858B2 (ja) *	1998-06-04	2008-04-30	ソニー株式会社	Computer system, computer terminal device, and recording medium
US6321197B1 (en) *	1999-01-22	2001-11-20	Motorola, Inc.	Communication device and method for endpointing speech utterances
US6411926B1 (en) *	1999-02-08	2002-06-25	Qualcomm Incorporated	Distributed voice recognition system
US6792405B2 (en) *	1999-12-10	2004-09-14	At&T Corp.	Bitstream-based feature extraction method for a front-end speech recognizer
US6795698B1 (en) *	2000-04-12	2004-09-21	Northrop Grumman Corporation	Method and apparatus for embedding global positioning system (GPS) data in mobile telephone call data
US6564182B1 (en)	2000-05-12	2003-05-13	Conexant Systems, Inc.	Look-ahead pitch determination
US6999923B1 (en) *	2000-06-23	2006-02-14	International Business Machines Corporation	System and method for control of lights, signals, alarms using sound detection
US7203651B2 (en) *	2000-12-07	2007-04-10	Art-Advanced Recognition Technologies, Ltd.	Voice control system with multiple voice recognition engines
US7155387B2 (en) *	2001-01-08	2006-12-26	Art - Advanced Recognition Technologies Ltd.	Noise spectrum subtraction method and system
US7089184B2 (en) *	2001-03-22	2006-08-08	Nurv Center Technologies, Inc.	Speech recognition for recognizing speaker-independent, continuous speech
US20030028386A1 (en) *	2001-04-02	2003-02-06	Zinser Richard L.	Compressed domain universal transcoder
US7319703B2 (en) *	2001-09-04	2008-01-15	Nokia Corporation	Method and apparatus for reducing synchronization delay in packet-based voice terminals by resynchronizing during talk spurts
US7050969B2 (en) *	2001-11-27	2006-05-23	Mitsubishi Electric Research Laboratories, Inc.	Distributed speech recognition with codec parameters
US7079657B2 (en) *	2002-02-26	2006-07-18	Broadcom Corporation	System and method of performing digital multi-channel audio signal decoding
US7024353B2 (en) *	2002-08-09	2006-04-04	Motorola, Inc.	Distributed speech recognition with back-end voice activity detection apparatus and method
US20040073428A1 (en) *	2002-10-10	2004-04-15	Igor Zlokarnik	Apparatus, methods, and programming for speech synthesis via bit manipulations of compressed database
FI20021936A7 (fi) *	2002-10-31	2004-05-01	Nokia Corp	Variable-rate speech codec
CN1302454C (zh) *	2003-07-11	2007-02-28	中国科学院声学研究所	Probability-weighted-average method for reconstructing missing feature data in speech recognition
US7558736B2 (en) *	2003-12-31	2009-07-07	United States Cellular Corporation	System and method for providing talker arbitration in point-to-point/group communication
KR100647290B1 (ko) *	2004-09-22	2006-11-23	삼성전자주식회사	Speech encoding/decoding apparatus and method that select quantization/dequantization using characteristics of synthesized speech
US7533018B2 (en) *	2004-10-19	2009-05-12	Motorola, Inc.	Tailored speaker-independent voice recognition system
US20060095261A1 (en) *	2004-10-30	2006-05-04	Ibm Corporation	Voice packet identification based on celp compression parameters
US20060224381A1 (en) *	2005-04-04	2006-10-05	Nokia Corporation	Detecting speech frames belonging to a low energy sequence
US7697827B2 (en)	2005-10-17	2010-04-13	Konicek Jeffrey C	User-friendlier interfaces for a camera
GB0710211D0 (en) *	2007-05-29	2007-07-11	Intrasonics Ltd	AMR Spectrography
US20090094026A1 (en) *	2007-10-03	2009-04-09	Binshi Cao	Method of determining an estimated frame energy of a communication
US9208796B2 (en)	2011-08-22	2015-12-08	Genband Us Llc	Estimation of speech energy based on code excited linear prediction (CELP) parameters extracted from a partially-decoded CELP-encoded bit stream and applications of same
DK2981963T3 (en)	2013-04-05	2017-02-27	Dolby Laboratories Licensing Corp	COMPRESSION APPARATUS AND PROCEDURE TO REDUCE QUANTIZATION NOISE USING ADVANCED SPECTRAL EXTENSION
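CN104683959B (zh) *	2013-11-27	2018-09-18	深圳市盛天龙视听科技有限公司	Instant-messaging portable audio device and account loading method thereof
KR20150096217A (ko) *	2014-02-14	2015-08-24	한국전자통신연구원	Digital data compression method and apparatus
TWI631556B (zh) *	2017-05-05	2018-08-01	英屬開曼群島商捷鼎創新股份有限公司	Data compression device and data compression method thereof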
US10460749B1 (en) *	2018-06-28	2019-10-29	Nuvoton Technology Corporation	Voice activity detection using vocal tract area information
US12322383B2 (en) *	2021-10-05	2025-06-03	Google Llc	Predicting word boundaries for on-device batching of end-to-end speech recognition models

Family Cites Families (14)

* Cited by examiner, † Cited by third party
Publication number	Priority date	Publication date	Assignee	Title
US3909532A (en) *	1974-03-29	1975-09-30	Bell Telephone Labor Inc	Apparatus and method for determining the beginning and the end of a speech utterance
US4475189A (en) *	1982-05-27	1984-10-02	At&T Bell Laboratories	Automatic interactive conference arrangement
US4519094A (en) *	1982-08-26	1985-05-21	At&T Bell Laboratories	LPC Word recognizer utilizing energy features
US4866777A (en) *	1984-11-09	1989-09-12	Alcatel Usa Corporation	Apparatus for extracting features from a speech signal
US4908865A (en) *	1984-12-27	1990-03-13	Texas Instruments Incorporated	Speaker independent speech recognition method and system
US5548647A (en) *	1987-04-03	1996-08-20	Texas Instruments Incorporated	Fixed text speaker verification method and apparatus
US5208897A (en) *	1990-08-21	1993-05-04	Emerson & Stern Associates, Inc.	Method and apparatus for speech recognition based on subsyllable spellings
US5371853A (en) *	1991-10-28	1994-12-06	University Of Maryland At College Park	Method and system for CELP speech coding and codebook for use therewith
US5305422A (en) *	1992-02-28	1994-04-19	Panasonic Technologies, Inc.	Method for determining boundaries of isolated words within a speech signal
GB2272554A (en) *	1992-11-13	1994-05-18	Creative Tech Ltd	Recognizing speech by using wavelet transform and transient response therefrom
ZA948426B (en) *	1993-12-22	1995-06-30	Qualcomm Inc	Distributed voice recognition system
AU684872B2 (en) *	1994-03-10	1998-01-08	Cable And Wireless Plc	Communication system
US5704009A (en) *	1995-06-30	1997-12-30	International Business Machines Corporation	Method and apparatus for transmitting a voice sample to a voice activated data processing system
US6003004A (en) *	1998-01-08	1999-12-14	Advanced Recognition Technologies, Inc.	Speech recognition method and system using compressed speech data

1998
- 1998-01-08 US: application US09/002,616, publication US6003004A, status: Expired - Lifetime
- 1998-07-13 TW: application TW087111338A, publication TW394925B, status: IP Right Cessation
- 1998-07-22 DE: application DE69827667T, publication DE69827667T2, status: Expired - Lifetime
- 1998-07-22 KR: application KR10-1999-7009488A, publication KR100391287B1, status: Expired - Fee Related
- 1998-07-22 IL: application IL13244998A, publication IL132449A, status: IP Right Cessation
- 1998-07-22 AT: application AT98933871T, publication ATE282881T1, status: IP Right Cessation
- 1998-07-22 CN: application CN98808942A, publication CN1125432C, status: Expired - Fee Related
- 1998-07-22 RU: application RU99124623/09A, publication RU99124623A, status: Application Discontinuation
- 1998-07-22 JP: application JP53591099A, publication JP2001510595A, status: Ceased
- 1998-07-22 WO: application PCT/IL1998/000341, publication WO1999035639A1, status: Ceased
- 1998-07-22 EP: application EP98933871A, publication EP1046154B1, status: Expired - Lifetime
- 1998-07-22 AU: application AU83553/98A, publication AU8355398A, status: Abandoned
1999
- 1999-10-05 US: application US09/412,406, publication US6377923B1, status: Expired - Lifetime
2002
- 2002-01-22 US: application US10/051,350, publication US20030018472A1, status: Abandoned

Also Published As

Publication number	Publication date
AU8355398A (en)	1999-07-26
US20030018472A1 (en)	2003-01-23
CN1273662A (zh)	2000-11-15
CN1125432C (zh)	2003-10-22
JP2001510595A (ja)	2001-07-31
KR20010006401A (ko)	2001-01-26
US6377923B1 (en)	2002-04-23
DE69827667D1 (de)	2004-12-23
DE69827667T2 (de)	2005-10-06
TW394925B (en)	2000-06-21
US6003004A (en)	1999-12-14
WO1999035639A1 (en)	1999-07-15
EP1046154B1 (de)	2004-11-17
EP1046154A4 (de)	2001-02-07
IL132449A (en)	2005-07-25
IL132449A0 (en)	2001-03-19
RU99124623A (ru)	2001-09-27
KR100391287B1 (ko)	2003-07-12
EP1046154A1 (de)	2000-10-25

Legal Events

Date	Code	Title	Description
2005-05-15	RER	Ceased as to paragraph 5 lit. 3 law introducing patent treaties

Publication	Publication Date	Title
ATE282881T1 (de)	2004-12-15	Vocoder-based voice recognizer
US11854545B2 (en)	2023-12-26	Privacy mode based on speaker identifier
US11062694B2 (en)	2021-07-13	Text-to-speech processing with emphasized output audio
US6243680B1 (en)	2001-06-05	Method and apparatus for obtaining a transcription of phrases through text and spoken utterances
US8140336B2 (en)	2012-03-20	Speech recognition system with huge vocabulary
DE69922971D1 (de)	2005-02-03	Network-interactive user interface using speech recognition and natural language processing
WO2001097213A8 (en)	2002-01-17	Speech recognition using utterance-level confidence estimates
WO2004090866A3 (en)	2006-04-06	Phonetically based speech recognition system and method
CA2069675A1 (en)	1993-04-09	Flexible vocabulary recognition
ATE395685T1 (de)	2008-05-15	Speech recognition by word-in-phrase command
EP1220197A3 (de)	2003-05-02	System and method for speech recognition
BR9913524A (pt)	2001-06-05	Voice recognizer and voice recognition process
DE60002584D1 (de)	2003-06-12	Use of reference data for speech recognition
ATE449401T1 (de)	2009-12-15	Automatic generation of a word pronunciation for speech recognition
Kim et al.	2005	Robust DTW-based recognition algorithm for hand-held consumer devices
Price et al.	2014	Combining linguistic with statistical methods in modeling prosody
Tolba et al.	2001	Speech recognition by intelligent machines
JP3727436B2 (ja)	2005-12-14	Apparatus and method for optimal matching of speech to a manuscript
Chen et al.	1994	Large vocabulary word recognition based on tree-trellis search
Tajchman et al.	1995	Learning phonological rule probabilities from speech corpora with exploratory computational phonology
Choi et al.	2000	Lexical tree decoding with a class-based language model for Chinese speech recognition.
WO2000026901A3 (en)	2000-11-09	Performing spoken recorded actions
Hofmann et al.	2010	Improving spontaneous English ASR using a joint-sequence pronunciation model
Okada	1991	A unification-grammar-directed one-pass search algorithm for parsing spoken language
Nagai et al.	1991	Phoneme-context-dependent LR parsing algorithms for HMM-based continuous speech recognition