ATE282881T1 - Vocoder-based voice recognizer - Google Patents
Vocoder-based voice recognizer
Info
- Publication number
- ATE282881T1
- Authority
- AT
- Austria
- Prior art keywords
- word
- vocoder
- lpc
- data
- recognition features
- Prior art date
Classifications
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L13/00—Speech synthesis; Text to speech systems
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/28—Constructional details of speech recognition systems
- G10L15/30—Distributed recognition, e.g. in client-server systems, for mobile phones or network applications
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/02—Feature extraction for speech recognition; Selection of recognition unit
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L19/00—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
- G10L19/04—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using predictive techniques
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L25/00—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
- G10L25/78—Detection of presence or absence of voice signals
- G10L2025/783—Detection of presence or absence of voice signals based on threshold decision
Landscapes
- Engineering & Computer Science (AREA)
- Computational Linguistics (AREA)
- Health & Medical Sciences (AREA)
- Audiology, Speech & Language Pathology (AREA)
- Human Computer Interaction (AREA)
- Physics & Mathematics (AREA)
- Acoustics & Sound (AREA)
- Multimedia (AREA)
- Computer Vision & Pattern Recognition (AREA)
- Compression, Expansion, Code Conversion, And Decoders (AREA)
- Mobile Radio Communication Systems (AREA)
- Telephonic Communication Services (AREA)
- Steering Control In Accordance With Driving Conditions (AREA)
- Photoreceptors In Electrophotography (AREA)
- Telephone Function (AREA)
- Diaphragms For Electromechanical Transducers (AREA)
- Character Discrimination (AREA)
- Machine Translation (AREA)
Applications Claiming Priority (2)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| US09/002,616 US6003004A (en) | 1998-01-08 | 1998-01-08 | Speech recognition method and system using compressed speech data |
| PCT/IL1998/000341 WO1999035639A1 (en) | 1998-01-08 | 1998-07-22 | A vocoder-based voice recognizer |
Publications (1)
| Publication Number | Publication Date |
|---|---|
| ATE282881T1 (de) | 2004-12-15 |
Family
ID=21701631
Family Applications (1)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| AT98933871T ATE282881T1 (de) | 1998-01-08 | 1998-07-22 | Vocoder-based voice recognizer |
Country Status (12)
| Country | Link |
|---|---|
| US (3) | US6003004A (de) |
| EP (1) | EP1046154B1 (de) |
| JP (1) | JP2001510595A (de) |
| KR (1) | KR100391287B1 (de) |
| CN (1) | CN1125432C (de) |
| AT (1) | ATE282881T1 (de) |
| AU (1) | AU8355398A (de) |
| DE (1) | DE69827667T2 (de) |
| IL (1) | IL132449A (de) |
| RU (1) | RU99124623A (de) |
| TW (1) | TW394925B (de) |
| WO (1) | WO1999035639A1 (de) |
Families Citing this family (38)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| US6370504B1 (en) * | 1997-05-29 | 2002-04-09 | University Of Washington | Speech recognition on MPEG/Audio encoded files |
| US6134283A (en) * | 1997-11-18 | 2000-10-17 | Amati Communications Corporation | Method and system for synchronizing time-division-duplexed transceivers |
| US6003004A (en) * | 1998-01-08 | 1999-12-14 | Advanced Recognition Technologies, Inc. | Speech recognition method and system using compressed speech data |
| KR100277105B1 (ko) * | 1998-02-27 | 2001-01-15 | 윤종용 | Apparatus and method for determining speech recognition data |
| US6223157B1 (en) * | 1998-05-07 | 2001-04-24 | Dsc Telecom, L.P. | Method for direct recognition of encoded speech data |
| JP4081858B2 (ja) * | 1998-06-04 | 2008-04-30 | ソニー株式会社 | Computer system, computer terminal device, and recording medium |
| US6321197B1 (en) * | 1999-01-22 | 2001-11-20 | Motorola, Inc. | Communication device and method for endpointing speech utterances |
| US6411926B1 (en) * | 1999-02-08 | 2002-06-25 | Qualcomm Incorporated | Distributed voice recognition system |
| US6792405B2 (en) * | 1999-12-10 | 2004-09-14 | At&T Corp. | Bitstream-based feature extraction method for a front-end speech recognizer |
| US6795698B1 (en) * | 2000-04-12 | 2004-09-21 | Northrop Grumman Corporation | Method and apparatus for embedding global positioning system (GPS) data in mobile telephone call data |
| US6564182B1 (en) | 2000-05-12 | 2003-05-13 | Conexant Systems, Inc. | Look-ahead pitch determination |
| US6999923B1 (en) * | 2000-06-23 | 2006-02-14 | International Business Machines Corporation | System and method for control of lights, signals, alarms using sound detection |
| US7203651B2 (en) * | 2000-12-07 | 2007-04-10 | Art-Advanced Recognition Technologies, Ltd. | Voice control system with multiple voice recognition engines |
| US7155387B2 (en) * | 2001-01-08 | 2006-12-26 | Art - Advanced Recognition Technologies Ltd. | Noise spectrum subtraction method and system |
| US7089184B2 (en) * | 2001-03-22 | 2006-08-08 | Nurv Center Technologies, Inc. | Speech recognition for recognizing speaker-independent, continuous speech |
| US20030028386A1 (en) * | 2001-04-02 | 2003-02-06 | Zinser Richard L. | Compressed domain universal transcoder |
| US7319703B2 (en) * | 2001-09-04 | 2008-01-15 | Nokia Corporation | Method and apparatus for reducing synchronization delay in packet-based voice terminals by resynchronizing during talk spurts |
| US7050969B2 (en) * | 2001-11-27 | 2006-05-23 | Mitsubishi Electric Research Laboratories, Inc. | Distributed speech recognition with codec parameters |
| US7079657B2 (en) * | 2002-02-26 | 2006-07-18 | Broadcom Corporation | System and method of performing digital multi-channel audio signal decoding |
| US7024353B2 (en) * | 2002-08-09 | 2006-04-04 | Motorola, Inc. | Distributed speech recognition with back-end voice activity detection apparatus and method |
| US20040073428A1 (en) * | 2002-10-10 | 2004-04-15 | Igor Zlokarnik | Apparatus, methods, and programming for speech synthesis via bit manipulations of compressed database |
| FI20021936L (fi) * | 2002-10-31 | 2004-05-01 | Nokia Corp | Variable-rate speech codec |
| CN1302454C (zh) * | 2003-07-11 | 2007-02-28 | 中国科学院声学研究所 | Probability-weighted-average method for reconstructing missing feature data in speech recognition |
| US7558736B2 (en) * | 2003-12-31 | 2009-07-07 | United States Cellular Corporation | System and method for providing talker arbitration in point-to-point/group communication |
| KR100647290B1 (ko) * | 2004-09-22 | 2006-11-23 | 삼성전자주식회사 | Speech encoding/decoding apparatus and method that select quantization/dequantization using characteristics of the synthesized speech |
| US7533018B2 (en) * | 2004-10-19 | 2009-05-12 | Motorola, Inc. | Tailored speaker-independent voice recognition system |
| US20060095261A1 (en) * | 2004-10-30 | 2006-05-04 | Ibm Corporation | Voice packet identification based on celp compression parameters |
| US20060224381A1 (en) * | 2005-04-04 | 2006-10-05 | Nokia Corporation | Detecting speech frames belonging to a low energy sequence |
| US7697827B2 (en) | 2005-10-17 | 2010-04-13 | Konicek Jeffrey C | User-friendlier interfaces for a camera |
| GB0710211D0 (en) * | 2007-05-29 | 2007-07-11 | Intrasonics Ltd | AMR Spectrography |
| US20090094026A1 (en) * | 2007-10-03 | 2009-04-09 | Binshi Cao | Method of determining an estimated frame energy of a communication |
| US9208796B2 (en) * | 2011-08-22 | 2015-12-08 | Genband Us Llc | Estimation of speech energy based on code excited linear prediction (CELP) parameters extracted from a partially-decoded CELP-encoded bit stream and applications of same |
| PL2981963T3 (pl) | 2013-04-05 | 2017-06-30 | Dolby Int Ab | Companding apparatus and method for reducing quantization noise using advanced spectral extension |
| CN104683959B (zh) * | 2013-11-27 | 2018-09-18 | 深圳市盛天龙视听科技有限公司 | Instant-messaging portable audio device and account loading method thereof |
| KR20150096217A (ko) * | 2014-02-14 | 2015-08-24 | 한국전자통신연구원 | Method and apparatus for compressing digital data |
| TWI631556B (zh) * | 2017-05-05 | 2018-08-01 | 英屬開曼群島商捷鼎創新股份有限公司 | Data compression device and data compression method thereof |
| US10460749B1 (en) * | 2018-06-28 | 2019-10-29 | Nuvoton Technology Corporation | Voice activity detection using vocal tract area information |
| US12322383B2 (en) * | 2021-10-05 | 2025-06-03 | Google Llc | Predicting word boundaries for on-device batching of end-to-end speech recognition models |
Family Cites Families (14)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| US3909532A (en) * | 1974-03-29 | 1975-09-30 | Bell Telephone Labor Inc | Apparatus and method for determining the beginning and the end of a speech utterance |
| US4475189A (en) * | 1982-05-27 | 1984-10-02 | At&T Bell Laboratories | Automatic interactive conference arrangement |
| US4519094A (en) * | 1982-08-26 | 1985-05-21 | At&T Bell Laboratories | LPC Word recognizer utilizing energy features |
| US4866777A (en) * | 1984-11-09 | 1989-09-12 | Alcatel Usa Corporation | Apparatus for extracting features from a speech signal |
| US4908865A (en) * | 1984-12-27 | 1990-03-13 | Texas Instruments Incorporated | Speaker independent speech recognition method and system |
| US5548647A (en) * | 1987-04-03 | 1996-08-20 | Texas Instruments Incorporated | Fixed text speaker verification method and apparatus |
| US5208897A (en) * | 1990-08-21 | 1993-05-04 | Emerson & Stern Associates, Inc. | Method and apparatus for speech recognition based on subsyllable spellings |
| US5371853A (en) * | 1991-10-28 | 1994-12-06 | University Of Maryland At College Park | Method and system for CELP speech coding and codebook for use therewith |
| US5305422A (en) * | 1992-02-28 | 1994-04-19 | Panasonic Technologies, Inc. | Method for determining boundaries of isolated words within a speech signal |
| GB2272554A (en) * | 1992-11-13 | 1994-05-18 | Creative Tech Ltd | Recognizing speech by using wavelet transform and transient response therefrom |
| ZA948426B (en) * | 1993-12-22 | 1995-06-30 | Qualcomm Inc | Distributed voice recognition system |
| AU684872B2 (en) * | 1994-03-10 | 1998-01-08 | Cable And Wireless Plc | Communication system |
| US5704009A (en) * | 1995-06-30 | 1997-12-30 | International Business Machines Corporation | Method and apparatus for transmitting a voice sample to a voice activated data processing system |
| US6003004A (en) * | 1998-01-08 | 1999-12-14 | Advanced Recognition Technologies, Inc. | Speech recognition method and system using compressed speech data |
-
1998
- 1998-01-08 US US09/002,616 patent/US6003004A/en not_active Expired - Lifetime
- 1998-07-13 TW TW087111338A patent/TW394925B/zh not_active IP Right Cessation
- 1998-07-22 EP EP98933871A patent/EP1046154B1/de not_active Expired - Lifetime
- 1998-07-22 DE DE69827667T patent/DE69827667T2/de not_active Expired - Lifetime
- 1998-07-22 JP JP53591099A patent/JP2001510595A/ja not_active Ceased
- 1998-07-22 WO PCT/IL1998/000341 patent/WO1999035639A1/en not_active Ceased
- 1998-07-22 AU AU83553/98A patent/AU8355398A/en not_active Abandoned
- 1998-07-22 RU RU99124623/09A patent/RU99124623A/ru not_active Application Discontinuation
- 1998-07-22 IL IL13244998A patent/IL132449A/xx not_active IP Right Cessation
- 1998-07-22 AT AT98933871T patent/ATE282881T1/de not_active IP Right Cessation
- 1998-07-22 CN CN98808942A patent/CN1125432C/zh not_active Expired - Fee Related
- 1998-07-22 KR KR10-1999-7009488A patent/KR100391287B1/ko not_active Expired - Fee Related
-
1999
- 1999-10-05 US US09/412,406 patent/US6377923B1/en not_active Expired - Lifetime
-
2002
- 2002-01-22 US US10/051,350 patent/US20030018472A1/en not_active Abandoned
Also Published As
| Publication number | Publication date |
|---|---|
| RU99124623A (ru) | 2001-09-27 |
| IL132449A (en) | 2005-07-25 |
| US6003004A (en) | 1999-12-14 |
| KR20010006401A (ko) | 2001-01-26 |
| CN1273662A (zh) | 2000-11-15 |
| TW394925B (en) | 2000-06-21 |
| DE69827667D1 (de) | 2004-12-23 |
| CN1125432C (zh) | 2003-10-22 |
| EP1046154A1 (de) | 2000-10-25 |
| KR100391287B1 (ko) | 2003-07-12 |
| AU8355398A (en) | 1999-07-26 |
| US6377923B1 (en) | 2002-04-23 |
| DE69827667T2 (de) | 2005-10-06 |
| WO1999035639A1 (en) | 1999-07-15 |
| EP1046154A4 (de) | 2001-02-07 |
| EP1046154B1 (de) | 2004-11-17 |
| JP2001510595A (ja) | 2001-07-31 |
| US20030018472A1 (en) | 2003-01-23 |
| IL132449A0 (en) | 2001-03-19 |
Similar Documents
| Publication | Publication Date | Title |
|---|---|---|
| ATE282881T1 (de) | | Vocoder-based voice recognizer |
| US11062694B2 (en) | | Text-to-speech processing with emphasized output audio |
| US11138974B2 (en) | | Privacy mode based on speaker identifier |
| US8666745B2 (en) | | Speech recognition system with huge vocabulary |
| US6243680B1 (en) | | Method and apparatus for obtaining a transcription of phrases through text and spoken utterances |
| EP1629464A4 (de) | | Phonetically based speech recognition system and method |
| DE69922971D1 (de) | | Network interactive user interface using speech recognition and natural language processing |
| WO2001097213A8 (en) | | Speech recognition using utterance-level confidence estimates |
| CA2069675A1 (en) | | Flexible vocabulary recognition |
| ATE395685T1 (de) | | Speech recognition by word-in-phrase command |
| EP1220197A3 (de) | | Speech recognition system and method |
| BR9913524A (pt) | | Voice recognizer and voice recognition process |
| DE60002584D1 (de) | | Use of reference data for speech recognition |
| ATE449401T1 (de) | | Automatic generation of a word pronunciation for speech recognition |
| Kim et al. | | Robust DTW-based recognition algorithm for hand-held consumer devices |
| Price et al. | | Combining linguistic with statistical methods in modeling prosody |
| Tolba et al. | | Speech recognition by intelligent machines |
| JP3727436B2 (ja) | | 音声原稿最適照合装置および方法 |
| Chen et al. | | Large vocabulary word recognition based on tree-trellis search |
| Tajchman et al. | | Learning phonological rule probabilities from speech corpora with exploratory computational phonology |
| Choi et al. | | Lexical tree decoding with a class-based language model for Chinese speech recognition. |
| WO2000026901A3 (en) | | Performing spoken recorded actions |
| Hofmann et al. | | Improving spontaneous English ASR using a joint-sequence pronunciation model |
| Okada | | A unification-grammar-directed one-pass search algorithm for parsing spoken language |
| Nagai et al. | | Phoneme-context-dependent LR parsing algorithms for HMM-based continuous speech recognition |
Legal Events
| Date | Code | Title | Description |
|---|---|---|---|
| | RER | Ceased as to paragraph 5 lit. 3 law introducing patent treaties | |