BRPI0406765A - Method and apparatus for speech reconstruction within a distributed speech recognition system - Google Patents

Method and apparatus for speech reconstruction within a distributed speech recognition system

Info

Publication number
BRPI0406765A
Authority
BR
Brazil
Prior art keywords
speech
recognition system
mfccs
reconstruction
distributed
Prior art date
Application number
BR0406765-7A
Other languages
English (en)
Inventor
Tenkasi V Ramabadran
Original Assignee
Motorola Inc
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Motorola Inc
Publication of BRPI0406765A
Publication of BRPI0406765B1

Classifications

    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00 Speech recognition
    • G10L15/28 Constructional details of speech recognition systems
    • G10L15/30 Distributed recognition, e.g. in client-server systems, for mobile phones or network applications
    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L21/00 Speech or voice signal processing techniques to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
    • G10L21/02 Speech enhancement, e.g. noise reduction or echo cancellation
    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L21/00 Speech or voice signal processing techniques to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
    • G10L21/02 Speech enhancement, e.g. noise reduction or echo cancellation
    • G10L21/038 Speech enhancement, e.g. noise reduction or echo cancellation using band spreading techniques
    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00 Speech recognition
    • G10L15/02 Feature extraction for speech recognition; Selection of recognition unit
    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L25/00 Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
    • G10L25/03 Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 characterised by the type of extracted parameters
    • G10L25/24 Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 characterised by the type of extracted parameters the extracted parameters being the cepstrum

Landscapes

  • Engineering & Computer Science (AREA)
  • Multimedia (AREA)
  • Health & Medical Sciences (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Human Computer Interaction (AREA)
  • Physics & Mathematics (AREA)
  • Acoustics & Sound (AREA)
  • Computational Linguistics (AREA)
  • Quality & Reliability (AREA)
  • Signal Processing (AREA)
  • Compression, Expansion, Code Conversion, And Decoders (AREA)
  • Mobile Radio Communication Systems (AREA)
  • Radio Relay Systems (AREA)

Abstract

"METHOD AND APPARATUS FOR SPEECH RECONSTRUCTION WITHIN A DISTRIBUTED SPEECH RECOGNITION SYSTEM". A method and apparatus are provided herein for speech reconstruction within a distributed speech recognition system. Missing MFCCs are reconstructed and used to generate speech. In particular, partial recovery of the missing MFCCs is achieved by exploiting the dependence of the missing MFCCs on the transmitted pitch period P as well as on the transmitted MFCCs. Harmonic magnitudes are then obtained from the transmitted and reconstructed MFCCs, and the speech is reconstructed using these harmonic magnitudes.
BRPI0406765-7A 2003-01-14 2004-01-13 Method and apparatus for speech reconstruction within a distributed speech recognition system BRPI0406765B1 (pt)
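
The last step named in the abstract, obtaining harmonic magnitudes from a set of MFCCs and a pitch period P, can be illustrated with a minimal Python sketch. It is not the patented method: the pitch-dependent recovery of the missing MFCCs is not described in the abstract, so the sketch substitutes plain zero-padding of the missing higher-order coefficients as a stand-in. The function names, the 23-band mel filterbank, the 64 Hz lower edge, the 8 kHz sample rate, the unnormalised DCT-II convention, and the linear interpolation in the mel domain are all assumptions made for illustration.

```python
import math


def hz_to_mel(freq_hz):
    """Common mel-scale mapping (an assumption; the source does not specify one)."""
    return 2595.0 * math.log10(1.0 + freq_hz / 700.0)


def idct_log_mel(mfccs, num_bands):
    """Invert an unnormalised DCT-II, c_i = sum_j logE_j * cos(pi*i*(j+0.5)/N).

    Missing higher-order MFCCs are zero-padded. This is a placeholder for the
    pitch-dependent recovery mentioned in the abstract, not a reproduction of it.
    """
    padded = list(mfccs) + [0.0] * (num_bands - len(mfccs))
    log_energies = []
    for j in range(num_bands):
        acc = padded[0] / num_bands
        for i in range(1, num_bands):
            acc += (2.0 / num_bands) * padded[i] * math.cos(
                math.pi * i * (j + 0.5) / num_bands
            )
        log_energies.append(acc)
    return log_energies


def harmonic_magnitudes(mfccs, pitch_period, sample_rate=8000.0,
                        num_bands=23, f_low=64.0):
    """Estimate magnitudes at the pitch harmonics k * sample_rate / pitch_period.

    pitch_period is in samples. Band centres are assumed to be equally spaced
    on the mel scale between f_low and the Nyquist frequency.
    """
    log_energies = idct_log_mel(mfccs, num_bands)
    mel_lo = hz_to_mel(f_low)
    mel_hi = hz_to_mel(sample_rate / 2.0)
    step = (mel_hi - mel_lo) / (num_bands + 1)
    f0 = sample_rate / pitch_period
    magnitudes = []
    k = 1
    while k * f0 < sample_rate / 2.0:
        # Fractional band index of this harmonic, clamped to the outer bands.
        pos = (hz_to_mel(k * f0) - mel_lo) / step - 1.0
        pos = min(max(pos, 0.0), num_bands - 1.0)
        lo = int(math.floor(pos))
        hi = min(lo + 1, num_bands - 1)
        frac = pos - lo
        log_value = (1.0 - frac) * log_energies[lo] + frac * log_energies[hi]
        magnitudes.append(math.exp(log_value))
        k += 1
    return magnitudes
```

For example, `harmonic_magnitudes(mfccs, pitch_period=80.0)` with 13 transmitted coefficients returns one magnitude per harmonic of 100 Hz below 4 kHz. A sinusoidal synthesiser could then use these magnitudes to regenerate the waveform.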

Applications Claiming Priority (3)

Application Number Priority Date Filing Date Title
US10/341,726 2003-01-14
US10/341,726 US7027979B2 (en) 2003-01-14 2003-01-14 Method and apparatus for speech reconstruction within a distributed speech recognition system
PCT/US2004/000871 WO2004066269A2 (en) 2003-01-14 2004-01-13 Method and apparatus for speech reconstruction within a distributed speech recognition system

Publications (2)

Publication Number Publication Date
BRPI0406765A BRPI0406765A (pt) 2005-12-20
BRPI0406765B1 BRPI0406765B1 (pt) 2018-08-07

Family

ID=32711568

Family Applications (1)

Application Number Title Priority Date Filing Date
BRPI0406765-7A BRPI0406765B1 (pt) 2003-01-14 2004-01-13 Method and apparatus for speech reconstruction within a distributed speech recognition system

Country Status (7)

Country Link
US (1) US7027979B2 (pt)
EP (1) EP1588354B1 (pt)
KR (1) KR101059640B1 (pt)
CN (1) CN100371988C (pt)
BR (1) BRPI0406765B1 (pt)
RU (1) RU2366007C2 (pt)
WO (1) WO2004066269A2 (pt)

Families Citing this family (27)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US8412526B2 (en) * 2003-04-01 2013-04-02 Nuance Communications, Inc. Restoration of high-order Mel frequency cepstral coefficients
US7305339B2 (en) * 2003-04-01 2007-12-04 International Business Machines Corporation Restoration of high-order Mel Frequency Cepstral Coefficients
US7386443B1 (en) * 2004-01-09 2008-06-10 At&T Corp. System and method for mobile automatic speech recognition
CN101223581A (zh) * 2005-07-14 2008-07-16 皇家飞利浦电子股份有限公司 Audio signal synthesis
US20070191736A1 (en) * 2005-10-04 2007-08-16 Don Alden Method for loading penetrating members in a collection device
US7783488B2 (en) 2005-12-19 2010-08-24 Nuance Communications, Inc. Remote tracing and debugging of automatic speech recognition servers by speech reconstruction from cepstra and pitch information
KR100735343B1 (ko) * 2006-04-11 2007-07-04 삼성전자주식회사 Apparatus and method for extracting pitch information from a speech signal
US8306817B2 (en) * 2008-01-08 2012-11-06 Microsoft Corporation Speech recognition with non-linear noise reduction on Mel-frequency cepstra
EP3296992B1 (en) * 2008-03-20 2021-09-22 Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. Apparatus and method for modifying a parameterized representation
US9020816B2 (en) * 2008-08-14 2015-04-28 21Ct, Inc. Hidden markov model for speech processing with training method
US9767806B2 (en) * 2013-09-24 2017-09-19 Cirrus Logic International Semiconductor Ltd. Anti-spoofing
US20100174539A1 (en) * 2009-01-06 2010-07-08 Qualcomm Incorporated Method and apparatus for vector quantization codebook search
KR101712101B1 (ko) * 2010-01-28 2017-03-03 삼성전자 주식회사 Signal processing method and apparatus
US8595005B2 (en) * 2010-05-31 2013-11-26 Simple Emotion, Inc. System and method for recognizing emotional state from a speech signal
CN104766608A (zh) * 2014-01-07 2015-07-08 深圳市中兴微电子技术有限公司 Voice control method and device
US9549068B2 (en) 2014-01-28 2017-01-17 Simple Emotion, Inc. Methods for adaptive voice interaction
RU2610285C1 (ru) * 2016-02-15 2017-02-08 федеральное государственное казенное военное образовательное учреждение высшего образования "Военная академия связи имени Маршала Советского Союза С.М. Буденного" Министерства обороны Российской Федерации Method for recognizing low-rate coding protocols
CN106856093A (zh) * 2017-02-23 2017-06-16 海信集团有限公司 Audio information processing method, intelligent terminal and voice control terminal
CN106847280B (zh) * 2017-02-23 2020-09-15 海信集团有限公司 Audio information processing method, intelligent terminal and voice control terminal
CN107527611A (zh) * 2017-08-23 2017-12-29 武汉斗鱼网络科技有限公司 MFCC speech recognition method, storage medium, electronic device and system
RU2667462C1 (ru) * 2017-10-24 2018-09-19 федеральное государственное казенное военное образовательное учреждение высшего образования "Военная академия связи имени Маршала Советского Союза С.М. Буденного" Министерства обороны Российской Федерации Method for recognizing low-rate speech coding protocols
CN109616129B (zh) * 2018-11-13 2021-07-30 南京南大电子智慧型服务机器人研究院有限公司 Hybrid multiple-description sinusoidal coder method for improving speech frame-loss compensation performance
US11227579B2 (en) * 2019-08-08 2022-01-18 International Business Machines Corporation Data augmentation by frame insertion for speech data
CN110610700B (zh) 2019-10-16 2022-01-14 科大讯飞股份有限公司 Decoding network construction method, speech recognition method, apparatus, device and storage medium
CN111199747A (zh) * 2020-03-05 2020-05-26 北京花兰德科技咨询服务有限公司 Artificial intelligence communication system and communication method
RU2748935C1 (ru) * 2020-09-03 2021-06-01 федеральное государственное казенное военное образовательное учреждение высшего образования "Военная академия связи имени Маршала Советского Союза С.М. Буденного" Министерства обороны Российской Федерации Method for recognizing new low-rate coding protocols
CN115966212A (zh) * 2022-09-06 2023-04-14 深圳市声菲特科技技术有限公司 Efficient network audio transmission method based on speech reconstruction

Family Cites Families (9)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
DE19549621B4 (de) * 1995-10-06 2004-07-01 Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. Device for coding audio signals
US5745874A (en) * 1996-03-04 1998-04-28 National Semiconductor Corporation Preprocessor for automatic speech recognition system
FR2766604B1 (fr) * 1997-07-22 1999-10-01 France Telecom Method and device for blind equalization of the effects of a transmission channel on a digital speech signal
US6173260B1 (en) * 1997-10-29 2001-01-09 Interval Research Corporation System and method for automatic classification of speech based upon affective content
US6076058A (en) * 1998-03-02 2000-06-13 Lucent Technologies Inc. Linear trajectory models incorporating preprocessing parameters for speech recognition
GB2355834A (en) * 1999-10-29 2001-05-02 Nokia Mobile Phones Ltd Speech recognition
FI19992350L (fi) * 1999-10-29 2001-04-30 Nokia Mobile Phones Ltd Improved speech recognition
US6377916B1 (en) * 1999-11-29 2002-04-23 Digital Voice Systems, Inc. Multiband harmonic transform coder
US6633839B2 (en) * 2001-02-02 2003-10-14 Motorola, Inc. Method and apparatus for speech reconstruction in a distributed speech recognition system

Also Published As

Publication number Publication date
KR20050092112A (ko) 2005-09-20
KR101059640B1 (ko) 2011-08-25
US20040138888A1 (en) 2004-07-15
CN1739143A (zh) 2006-02-22
CN100371988C (zh) 2008-02-27
BRPI0406765B1 (pt) 2018-08-07
RU2366007C2 (ru) 2009-08-27
EP1588354B1 (en) 2011-08-24
WO2004066269A3 (en) 2005-01-27
WO2004066269A2 (en) 2004-08-05
EP1588354A4 (en) 2006-03-01
RU2005125737A (ru) 2006-01-10
EP1588354A2 (en) 2005-10-26
US7027979B2 (en) 2006-04-11

Similar Documents

Publication Publication Date Title
BRPI0406765A (pt) Method and apparatus for speech reconstruction within a distributed speech recognition system
NO20032919L (no) Method and device for converting energy in, or supplying energy to, water under pressure.
BRPI0413294A (pt) Device, method and system
BR0015274A (pt) Diagnosis and repair system and method
TR200502538T2 (tr) Information system method and apparatus
BRPI0520294A2 (pt) Method, apparatus and software code for supporting satellite-based positioning of a mobile device using assistance data
DE602004008639D1 (de) Method for operating a self-protecting wave energy converter
BRPI0518375A2 (pt) Generator apparatus driven by water current
WO2011073864A3 (en) Reconstructing an object of interest
BRPI0520295A2 (pt) Method, apparatus and software code for supporting satellite-based positioning of a mobile device using assistance data
WO2007038499A3 (en) Fuel cell water purification system and method
ITBO20040812A1 (it) System for converting wind energy into electrical energy
ATE458283T1 (de) Management system for a fuel cell and method therefor
WO2006055294A3 (en) Methods and apparatus for energy conversion using materials comprising molecular deuterium and molecular hydrogen-deuterium
WO2002049998A8 (en) Hydrocarbon conversion system and method with a plurality of sources of compressed oxygen-containing gas
BR112012017381A2 (pt) Method for recovering storable palatable drinking water, storable palatable drinking water and bottled water
ATE504147T1 (de) Method and computer-readable storage medium for providing secure access between devices
JP2007085323A (ja) Animal-powered power generation plant
ATE246163T1 (de) Process for producing 1,1,1,3,3-pentafluoropropene and 1,1,1,3,3-pentafluoropropane
NO20050247L (no) Low-molecular-weight oversulfated polysaccharide
BRPI0416580A (pt) Water treatment system and method
BR0201510A (pt) Distributed incremental actuation for large assemblies of implementation units
EA201070616A1 (ru) Method
NL1023408A1 (nl) Device for supplying energy, heating and carbon dioxide to greenhouses.
BR8302373U (pt) Equipment for doubling the wheel set of harvesters

Legal Events

Date Code Title Description
B25D Requested change of name of applicant approved

Owner name: MOTOROLA SOLUTIONS, INC. (US)

B25A Requested transfer of rights approved

Owner name: MOTOROLA MOBILITY, INC. (US)

B25G Requested change of headquarter approved

Owner name: MOTOROLA MOBILITY, INC. (US)

B25E Requested change of name of applicant rejected

Owner name: MOTOROLA MOBILITY, INC. (US)

Free format text: The change of name requested through petition No. 0020130027070-RJ, dated 28/03/2013, is rejected because the corresponding official fee was not paid.

B25D Requested change of name of applicant approved

Owner name: MOTOROLA MOBILITY, LLC (US)

B25A Requested transfer of rights approved

Owner name: GOOGLE TECHNOLOGY HOLDINGS LLC (US)

B15K Others concerning applications: alteration of classification

Ipc: G10L 15/30 (2013.01), G10L 21/038 (2013.01)

B06A Patent application procedure suspended [chapter 6.1 patent gazette]
B07A Application suspended after technical examination (opinion) [chapter 7.1 patent gazette]
B09A Decision: intention to grant [chapter 9.1 patent gazette]
B16A Patent or certificate of addition of invention granted [chapter 16.1 patent gazette]
B21F Lapse acc. art. 78, item iv - on non-payment of the annual fees in time

Free format text: Regarding the 20th annuity.

B24J Lapse because of non-payment of annual fees (definitively: art 78 iv lpi, resolution 113/2013 art. 12)

Free format text: In view of the lapse published in RPI 2757 of 07-11-2023, and considering the absence of any response within the legal time limits, the lapse of the patent and its certificates is to be maintained, in accordance with Article 12 of Resolution 113/2013.