BRPI0406952A - Class quantization for distributed speech recognition
Class quantization for distributed speech recognition
- Publication number
- BRPI0406952A
- Authority
- BR
- Brazil
- Prior art keywords
- class
- tone
- frame
- codeword
- quantification
- Prior art date
Classifications
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L25/00—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
- G10L25/48—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 specially adapted for particular use
- G10L25/72—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 specially adapted for particular use for transmitting results of analysis
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L19/00—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
- G10L19/02—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using spectral analysis, e.g. transform vocoders or subband vocoders
- G10L19/032—Quantisation or dequantisation of spectral components
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L25/00—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
- G10L25/90—Pitch determination of speech signals
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L25/00—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
- G10L25/93—Discriminating between voiced and unvoiced parts of speech signals
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/28—Constructional details of speech recognition systems
- G10L15/30—Distributed recognition, e.g. in client-server systems, for mobile phones or network applications
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L25/00—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
- G10L25/93—Discriminating between voiced and unvoiced parts of speech signals
- G10L2025/935—Mixed voiced class; Transitions
Landscapes
- Engineering & Computer Science (AREA)
- Physics & Mathematics (AREA)
- Audiology, Speech & Language Pathology (AREA)
- Computational Linguistics (AREA)
- Health & Medical Sciences (AREA)
- Human Computer Interaction (AREA)
- Acoustics & Sound (AREA)
- Multimedia (AREA)
- Signal Processing (AREA)
- Spectroscopy & Molecular Physics (AREA)
- Compression, Expansion, Code Conversion, And Decoders (AREA)
- Mobile Radio Communication Systems (AREA)
- Telephonic Communication Services (AREA)
Abstract
Applications Claiming Priority (3)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| US10/360,582 | 2003-02-07 | | |
| US10/360,582 US6961696B2 (en) | 2003-02-07 | 2003-02-07 | Class quantization for distributed speech recognition |
| PCT/US2004/003419 WO2004072948A2 (en) | 2003-02-07 | 2004-02-05 | Class quantization for distributed speech recognition |
Publications (2)
| Publication Number | Publication Date |
|---|---|
| BRPI0406952A (pt) | 2006-01-03 |
| BRPI0406952B1 (pt) | 2018-02-27 |
Family
ID=32824044
Family Applications (1)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| BRPI0406952-8A BRPI0406952B1 (pt) | 2003-02-07 | 2004-02-05 | Quantization of class information for distributed speech recognition |
Country Status (8)
| Country | Link |
|---|---|
| US (1) | US6961696B2 (pt) |
| EP (1) | EP1595249B1 (pt) |
| KR (1) | KR100763325B1 (pt) |
| CN (1) | CN101160380B (pt) |
| BR (1) | BRPI0406952B1 (pt) |
| RU (1) | RU2348019C2 (pt) |
| TW (1) | TWI326447B (pt) |
| WO (1) | WO2004072948A2 (pt) |
Families Citing this family (11)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| US7783488B2 (en) * | 2005-12-19 | 2010-08-24 | Nuance Communications, Inc. | Remote tracing and debugging of automatic speech recognition servers by speech reconstruction from cepstra and pitch information |
| CN102256372B (zh) * | 2010-05-17 | 2016-06-22 | 中兴通讯股份有限公司 (ZTE Corporation) | MTC terminal access method and system, and MTC terminal |
| US9495968B2 (en) | 2013-05-29 | 2016-11-15 | Qualcomm Incorporated | Identifying sources from which higher order ambisonic audio data is generated |
| US9466305B2 (en) | 2013-05-29 | 2016-10-11 | Qualcomm Incorporated | Performing positional analysis to code spherical harmonic coefficients |
| US9922656B2 (en) | 2014-01-30 | 2018-03-20 | Qualcomm Incorporated | Transitioning of ambient higher-order ambisonic coefficients |
| US9502045B2 (en) * | 2014-01-30 | 2016-11-22 | Qualcomm Incorporated | Coding independent frames of ambient higher-order ambisonic coefficients |
| US10770087B2 (en) | 2014-05-16 | 2020-09-08 | Qualcomm Incorporated | Selecting codebooks for coding vectors decomposed from higher-order ambisonic audio signals |
| US9852737B2 (en) | 2014-05-16 | 2017-12-26 | Qualcomm Incorporated | Coding vectors decomposed from higher-order ambisonics audio signals |
| US9620137B2 (en) | 2014-05-16 | 2017-04-11 | Qualcomm Incorporated | Determining between scalar and vector quantization in higher order ambisonic coefficients |
| US9747910B2 (en) | 2014-09-26 | 2017-08-29 | Qualcomm Incorporated | Switching between predictive and non-predictive quantization techniques in a higher order ambisonics (HOA) framework |
| RU2701120C1 (ru) * | 2018-05-14 | 2019-09-24 | Федеральное государственное казенное военное образовательное учреждение высшего образования "Военный учебно-научный центр Военно-Морского Флота "Военно-морская академия имени Адмирала флота Советского Союза Н.Г. Кузнецова" | Device for speech signal processing |
Family Cites Families (20)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| US5226084A (en) * | 1990-12-05 | 1993-07-06 | Digital Voice Systems, Inc. | Methods for speech quantization and error correction |
| US5680508A (en) * | 1991-05-03 | 1997-10-21 | Itt Corporation | Enhancement of speech coding in background noise for low-rate speech coder |
| US5233660A (en) * | 1991-09-10 | 1993-08-03 | At&T Bell Laboratories | Method and apparatus for low-delay celp speech coding and decoding |
| AU684872B2 (en) * | 1994-03-10 | 1998-01-08 | Cable And Wireless Plc | Communication system |
| US5732389A (en) * | 1995-06-07 | 1998-03-24 | Lucent Technologies Inc. | Voiced/unvoiced classification of speech for excitation codebook selection in celp speech decoding during frame erasures |
| US5699485A (en) * | 1995-06-07 | 1997-12-16 | Lucent Technologies Inc. | Pitch delay modification during frame erasures |
| SE512613C2 (sv) * | 1996-12-30 | 2000-04-10 | Ericsson Telefon Ab L M | Method and means for information handling |
| US6058205A (en) * | 1997-01-09 | 2000-05-02 | International Business Machines Corporation | System and method for partitioning the feature space of a classifier in a pattern classification system |
| JP3011678B2 (ja) * | 1997-07-09 | 2000-02-21 | 株式会社精研 | Scrubbing brush |
| US5924066A (en) * | 1997-09-26 | 1999-07-13 | U S West, Inc. | System and method for classifying a speech signal |
| US6199037B1 (en) * | 1997-12-04 | 2001-03-06 | Digital Voice Systems, Inc. | Joint quantization of speech subframe voicing metrics and fundamental frequencies |
| US6038535A (en) * | 1998-03-23 | 2000-03-14 | Motorola, Inc. | Speech classifier and method using delay elements |
| GB9811019D0 (en) * | 1998-05-21 | 1998-07-22 | Univ Surrey | Speech coders |
| US6377915B1 (en) * | 1999-03-17 | 2002-04-23 | Yrp Advanced Mobile Communication Systems Research Laboratories Co., Ltd. | Speech decoding using mix ratio table |
| RU2166804C2 (ru) * | 1999-04-05 | 2001-05-10 | ОАО "НПП "Звукотехника" | Method of speech conversion and device for its implementation |
| US6377916B1 (en) * | 1999-11-29 | 2002-04-23 | Digital Voice Systems, Inc. | Multiband harmonic transform coder |
| US20020016161A1 (en) * | 2000-02-10 | 2002-02-07 | Telefonaktiebolaget Lm Ericsson (Publ) | Method and apparatus for compression of speech encoded parameters |
| US6934756B2 (en) * | 2000-11-01 | 2005-08-23 | International Business Machines Corporation | Conversational networking via transport, coding and control conversational protocols |
| US6915256B2 (en) * | 2003-02-07 | 2005-07-05 | Motorola, Inc. | Pitch quantization for distributed speech recognition |
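| KR20060068278A (ko) * | 2004-12-16 | 2006-06-21 | 한국전자통신연구원 (Electronics and Telecommunications Research Institute) | Method and apparatus for quantizing mel-cepstrum coefficients in a distributed speech recognition system |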
-
2003
- 2003-02-07 US US10/360,582 patent/US6961696B2/en not_active Expired - Lifetime
-
2004
- 2004-02-05 RU RU2005127871/09A patent/RU2348019C2/ru active
- 2004-02-05 KR KR1020057012452A patent/KR100763325B1/ko not_active Expired - Lifetime
- 2004-02-05 EP EP04708622.8A patent/EP1595249B1/en not_active Expired - Lifetime
- 2004-02-05 WO PCT/US2004/003419 patent/WO2004072948A2/en not_active Ceased
- 2004-02-05 CN CN2004800036671A patent/CN101160380B/zh not_active Expired - Lifetime
- 2004-02-05 BR BRPI0406952-8A patent/BRPI0406952B1/pt active IP Right Grant
- 2004-02-06 TW TW093102827A patent/TWI326447B/zh not_active IP Right Cessation
Also Published As
| Publication number | Publication date |
|---|---|
| CN101160380A (zh) | 2008-04-09 |
| RU2005127871A (ru) | 2006-01-20 |
| TW200501055A (en) | 2005-01-01 |
| WO2004072948A2 (en) | 2004-08-26 |
| RU2348019C2 (ru) | 2009-02-27 |
| US20040158461A1 (en) | 2004-08-12 |
| KR20050097928A (ko) | 2005-10-10 |
| EP1595249B1 (en) | 2017-07-12 |
| US6961696B2 (en) | 2005-11-01 |
| BRPI0406952B1 (pt) | 2018-02-27 |
| TWI326447B (en) | 2010-06-21 |
| EP1595249A4 (en) | 2007-06-20 |
| KR100763325B1 (ko) | 2007-10-05 |
| WO2004072948A3 (en) | 2004-12-16 |
| CN101160380B (zh) | 2011-09-21 |
| EP1595249A2 (en) | 2005-11-16 |
Similar Documents
| Publication | Publication Date | Title |
|---|---|---|
| Coupé et al. | | Different languages, similar encoding efficiency: Comparable information rates across the human communicative niche |
| CY1123159T1 (el) | | System and method for identifying and processing data in a data stream |
| BRPI0406952A (pt) | | Class quantization for distributed speech recognition |
| EP1347395A3 (en) | | Systems and methods for determining the topic structure of a portion of text |
| BRPI0502554A (pt) | | Formats for using common graphics |
| DK1393034T3 (da) | | Model-based alarming |
| BRPI0412184A (pt) | | Rendering advertisements with documents having one or more topics using user topic interest information |
| EP1229547A3 (en) | | System and method for thematically analyzing and annotating an audio-visual sequence |
| BR0107718A (pt) | | Method and system for providing a personalized media list |
| WO2005048038A3 (en) | | Personal information space management system and method |
| BRPI0410320A (pt) | | Method and apparatus for representing image granularity by one or more parameters |
| BRPI0415606A (pt) | | Wireless communication device and method for performing a full-string evaluation of compound words |
| ATE325384T1 (de) | | Systems and methods for integrity certification and verification of content consumption environments |
| BRPI0402518A (pt) | | Method and system for managing low memory in a computing device |
| Allen et al. | | A linguistic ‘time capsule’: the Newcastle Electronic Corpus of Tyneside English |
| EP1072985A3 (en) | | Automatic wrapper grammar generation |
| BR112022018339A2 (pt) | | Apparatus and method for synthesizing a spatially extended sound source using cue information items |
| DE60300476D1 (de) | | System and method for barcode recognition |
| EA200400855A1 (ru) | | System and method for creating a multilingual database |
| Tang et al. | | Mutual intelligibility and similarity of Chinese dialects: Predicting judgments from objective measures |
| BR0315443A (pt) | | Two-layer encoder for hybrid high-definition DVD |
| BRPI0406956A (pt) | | Pitch quantization for distributed speech recognition |
| DE60336188D1 (de) | | Data filtering management device |
| BR0206446A (pt) | | Method and arrangement for adjusting a supplementary data signal to be embedded in an information signal, device for embedding a supplementary data signal in an information signal, information signal having a supplementary data signal embedded therein, and storage medium |
| DE59902143D1 (de) | | Method and device for outputting information and/or messages by voice |
Legal Events
| Date | Code | Title | Description |
|---|---|---|---|
| | B25D | Requested change of name of applicant approved | Owner name: INTERNATIONAL BUSINESS MACHINES CORPORATION (US) |
| | B25A | Requested transfer of rights approved | Owner name: INTERNATIONAL BUSINESS MACHINES CORPORATION (US) |
| | B25G | Requested change of headquarter approved | Owner name: INTERNATIONAL BUSINESS MACHINES CORPORATION (US) |
| | B25E | Requested change of name of applicant rejected | Owner name: INTERNATIONAL BUSINESS MACHINES CORPORATION (US). Free format text: The change of name of the second applicant, requested through petition no. 020130025782-RJ of 28/03/2013, was denied because the corresponding fee was not paid. |
| | B25D | Requested change of name of applicant approved | Owner name: INTERNATIONAL BUSINESS MACHINES CORPORATION (US) |
| | B25A | Requested transfer of rights approved | Owner name: INTERNATIONAL BUSINESS MACHINES CORPORATION (US) |
| | B15K | Others concerning applications: alteration of classification | Ipc: G10L 25/72 (2013.01), G10L 25/90 (2013.01), G10L 2 |
| | B06A | Notification to applicant to reply to the report for non-patentability or inadequacy of the application [chapter 6.1 patent gazette] | |
| | B09A | Decision: intention to grant [chapter 9.1 patent gazette] | |
| | B16A | Patent or certificate of addition of invention granted | |