BRPI0910793A2 - método e discriminador para a classificação de diferentes segmentos de um sinal - Google Patents

método e discriminador para a classificação de diferentes segmentos de um sinal

Info

Publication number
BRPI0910793A2
BRPI0910793A2 BRPI0910793A BRPI0910793A BRPI0910793A2 BR PI0910793 A2 BRPI0910793 A2 BR PI0910793A2 BR PI0910793 A BRPI0910793 A BR PI0910793A BR PI0910793 A BRPI0910793 A BR PI0910793A BR PI0910793 A2 BRPI0910793 A2 BR PI0910793A2
Authority
BR
Brazil
Prior art keywords
discriminator
classification
signal
different segments
segments
Prior art date
Application number
BRPI0910793A
Other languages
English (en)
Inventor
Frederik Nagel
Guillaume Fuchs
Jens Hirschfeld
Juergen Herre
Jérémie Lecomte
Nikolaus Rettelbach
Stefan Bayer
Stefan Wabnik
Yoshikazu Yokotani
Original Assignee
Fraunhofer Ges Forschung
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Fraunhofer Ges Forschung filed Critical Fraunhofer Ges Forschung
Publication of BRPI0910793A2 publication Critical patent/BRPI0910793A2/pt
Publication of BRPI0910793B1 publication Critical patent/BRPI0910793B1/pt
Publication of BRPI0910793B8 publication Critical patent/BRPI0910793B8/pt

Links

Classifications

    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L25/00Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
    • G10L25/78Detection of presence or absence of voice signals
    • G10L25/81Detection of presence or absence of voice signals for discriminating voice from music
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L19/00Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
    • G10L19/04Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using predictive techniques
    • G10L19/16Vocoder architecture
    • G10L19/18Vocoders using multiple modes
    • G10L19/20Vocoders using multiple modes using sound class specific coding, hybrid encoders or object based coding
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L19/00Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L19/00Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
    • G10L19/04Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using predictive techniques
    • G10L19/16Vocoder architecture
    • G10L19/18Vocoders using multiple modes
    • G10L19/22Mode decision, i.e. based on audio signal content versus external parameters
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L25/00Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L25/00Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
    • G10L25/78Detection of presence or absence of voice signals
    • G10L2025/783Detection of presence or absence of voice signals based on threshold decision
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L25/00Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
    • G10L25/48Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 specially adapted for particular use
    • G10L25/51Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 specially adapted for particular use for comparison or discrimination

Landscapes

  • Engineering & Computer Science (AREA)
  • Computational Linguistics (AREA)
  • Signal Processing (AREA)
  • Health & Medical Sciences (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Human Computer Interaction (AREA)
  • Physics & Mathematics (AREA)
  • Acoustics & Sound (AREA)
  • Multimedia (AREA)
  • Compression, Expansion, Code Conversion, And Decoders (AREA)
  • Image Analysis (AREA)
BRPI0910793A 2008-07-11 2009-06-16 Método e discriminador para a classificação de diferentes segmentos de um sinal BRPI0910793B8 (pt)

Applications Claiming Priority (3)

Application Number Priority Date Filing Date Title
US7987508P 2008-07-11 2008-07-11
US61/079,875 2008-07-11
PCT/EP2009/004339 WO2010003521A1 (en) 2008-07-11 2009-06-16 Method and discriminator for classifying different segments of a signal

Publications (3)

Publication Number Publication Date
BRPI0910793A2 true BRPI0910793A2 (pt) 2016-08-02
BRPI0910793B1 BRPI0910793B1 (pt) 2020-11-24
BRPI0910793B8 BRPI0910793B8 (pt) 2021-08-24

Family

ID=40851974

Family Applications (1)

Application Number Title Priority Date Filing Date
BRPI0910793A BRPI0910793B8 (pt) 2008-07-11 2009-06-16 Método e discriminador para a classificação de diferentes segmentos de um sinal

Country Status (19)

Country Link
US (1) US8571858B2 (pt)
EP (1) EP2301011B1 (pt)
JP (1) JP5325292B2 (pt)
KR (2) KR101281661B1 (pt)
CN (1) CN102089803B (pt)
AR (1) AR072863A1 (pt)
AU (1) AU2009267507B2 (pt)
BR (1) BRPI0910793B8 (pt)
CA (1) CA2730196C (pt)
CO (1) CO6341505A2 (pt)
ES (1) ES2684297T3 (pt)
MX (1) MX2011000364A (pt)
MY (1) MY153562A (pt)
PL (1) PL2301011T3 (pt)
PT (1) PT2301011T (pt)
RU (1) RU2507609C2 (pt)
TW (1) TWI441166B (pt)
WO (1) WO2010003521A1 (pt)
ZA (1) ZA201100088B (pt)

Families Citing this family (48)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
MY181247A (en) * 2008-07-11 2020-12-21 Frauenhofer Ges Zur Forderung Der Angenwandten Forschung E V Audio encoder and decoder for encoding and decoding audio samples
CN101847412B (zh) * 2009-03-27 2012-02-15 华为技术有限公司 音频信号的分类方法及装置
KR101666521B1 (ko) * 2010-01-08 2016-10-14 삼성전자 주식회사 입력 신호의 피치 주기 검출 방법 및 그 장치
AR083303A1 (es) 2010-10-06 2013-02-13 Fraunhofer Ges Forschung Aparato y metodo para procesar una señal de audio y para otorgar una mayor granularidad temporal para un codificador-decodificador combinado y unificado de voz y audio (usac)
US8521541B2 (en) * 2010-11-02 2013-08-27 Google Inc. Adaptive audio transcoding
CN103000172A (zh) * 2011-09-09 2013-03-27 中兴通讯股份有限公司 信号分类方法和装置
US20130090926A1 (en) * 2011-09-16 2013-04-11 Qualcomm Incorporated Mobile device context information using speech detection
US20140058737A1 (en) * 2011-10-28 2014-02-27 Panasonic Corporation Hybrid sound signal decoder, hybrid sound signal encoder, sound signal decoding method, and sound signal encoding method
CN105163398B (zh) 2011-11-22 2019-01-18 华为技术有限公司 连接建立方法和用户设备
US9111531B2 (en) 2012-01-13 2015-08-18 Qualcomm Incorporated Multiple coding mode signal classification
JP5724044B2 (ja) * 2012-02-17 2015-05-27 華為技術有限公司Huawei Technologies Co.,Ltd. 多重チャネル・オーディオ信号の符号化のためのパラメトリック型符号化装置
US20130317821A1 (en) * 2012-05-24 2013-11-28 Qualcomm Incorporated Sparse signal detection with mismatched models
ES2661924T3 (es) 2012-08-31 2018-04-04 Telefonaktiebolaget Lm Ericsson (Publ) Método y dispositivo para detectar la actividad vocal
US9589570B2 (en) * 2012-09-18 2017-03-07 Huawei Technologies Co., Ltd. Audio classification based on perceptual quality for low or medium bit rates
CN108074579B (zh) * 2012-11-13 2022-06-24 三星电子株式会社 用于确定编码模式的方法以及音频编码方法
US9100255B2 (en) * 2013-02-19 2015-08-04 Futurewei Technologies, Inc. Frame structure for filter bank multi-carrier (FBMC) waveforms
ES2736309T3 (es) 2013-02-20 2019-12-27 Fraunhofer Ges Forschung Aparato y procedimiento para codificar o descodificar una señal de audio utilizando una superposición que depende de una ubicación de transitorios
CN104347067B (zh) 2013-08-06 2017-04-12 华为技术有限公司 一种音频信号分类方法和装置
US9666202B2 (en) 2013-09-10 2017-05-30 Huawei Technologies Co., Ltd. Adaptive bandwidth extension and apparatus for the same
KR101498113B1 (ko) * 2013-10-23 2015-03-04 광주과학기술원 사운드 신호의 대역폭 확장 장치 및 방법
CN106256001B (zh) * 2014-02-24 2020-01-21 三星电子株式会社 信号分类方法和装置以及使用其的音频编码方法和装置
CN105096958B (zh) 2014-04-29 2017-04-12 华为技术有限公司 音频编码方法及相关装置
RU2765985C2 (ru) * 2014-05-15 2022-02-07 Телефонактиеболагет Лм Эрикссон (Пабл) Классификация и кодирование аудиосигналов
CN105336338B (zh) * 2014-06-24 2017-04-12 华为技术有限公司 音频编码方法和装置
US9886963B2 (en) * 2015-04-05 2018-02-06 Qualcomm Incorporated Encoder selection
PL3522155T3 (pl) * 2015-05-20 2021-04-19 Telefonaktiebolaget Lm Ericsson (Publ) Kodowanie wielokanałowych sygnałów audio
US10706873B2 (en) * 2015-09-18 2020-07-07 Sri International Real-time speaker state analytics platform
WO2017196422A1 (en) * 2016-05-12 2017-11-16 Nuance Communications, Inc. Voice activity detection feature based on modulation-phase differences
US10699538B2 (en) * 2016-07-27 2020-06-30 Neosensory, Inc. Method and system for determining and providing sensory experiences
WO2018048907A1 (en) 2016-09-06 2018-03-15 Neosensory, Inc. C/O Tmc+260 Method and system for providing adjunct sensory information to a user
CN107895580B (zh) * 2016-09-30 2021-06-01 华为技术有限公司 一种音频信号的重建方法和装置
US10744058B2 (en) 2017-04-20 2020-08-18 Neosensory, Inc. Method and system for providing information to a user
US10325588B2 (en) * 2017-09-28 2019-06-18 International Business Machines Corporation Acoustic feature extractor selected according to status flag of frame of acoustic signal
RU2768224C1 (ru) * 2018-12-13 2022-03-23 Долби Лабораторис Лайсэнзин Корпорейшн Двусторонняя медийная аналитика
RU2761940C1 (ru) * 2018-12-18 2021-12-14 Общество С Ограниченной Ответственностью "Яндекс" Способы и электронные устройства для идентификации пользовательского высказывания по цифровому аудиосигналу
JP7651474B2 (ja) * 2019-04-18 2025-03-26 ドルビー ラボラトリーズ ライセンシング コーポレイション ダイアログ検出器
CN110288983B (zh) * 2019-06-26 2021-10-01 上海电机学院 一种基于机器学习的语音处理方法
WO2021062276A1 (en) 2019-09-25 2021-04-01 Neosensory, Inc. System and method for haptic stimulation
US11467668B2 (en) 2019-10-21 2022-10-11 Neosensory, Inc. System and method for representing virtual object information with haptic stimulation
WO2021142162A1 (en) 2020-01-07 2021-07-15 Neosensory, Inc. Method and system for haptic stimulation
WO2021207825A1 (en) * 2020-04-16 2021-10-21 Voiceage Corporation Method and device for speech/music classification and core encoder selection in a sound codec
US11497675B2 (en) 2020-10-23 2022-11-15 Neosensory, Inc. Method and system for multimodal stimulation
US20240321285A1 (en) * 2021-01-08 2024-09-26 Voiceage Corporation Method and device for unified time-domain / frequency domain coding of a sound signal
US11862147B2 (en) 2021-08-13 2024-01-02 Neosensory, Inc. Method and system for enhancing the intelligibility of information for a user
US12272341B2 (en) * 2021-11-08 2025-04-08 Lemon Inc. Controllable music generation
US11995240B2 (en) 2021-11-16 2024-05-28 Neosensory, Inc. Method and system for conveying digital texture information to a user
US12300259B2 (en) 2022-03-10 2025-05-13 Roku, Inc. Automatic classification of audio content as either primarily speech or primarily non-speech, to facilitate dynamic application of dialogue enhancement
CN116070174A (zh) * 2023-03-23 2023-05-05 长沙融创智胜电子科技有限公司 一种多类别目标识别方法及系统

Family Cites Families (23)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
IT1232084B (it) * 1989-05-03 1992-01-23 Cselt Centro Studi Lab Telecom Sistema di codifica per segnali audio a banda allargata
JPH0490600A (ja) * 1990-08-03 1992-03-24 Sony Corp 音声認識装置
JPH04342298A (ja) * 1991-05-20 1992-11-27 Nippon Telegr & Teleph Corp <Ntt> 瞬時ピッチ分析方法及び有声・無声判定方法
RU2049456C1 (ru) * 1993-06-22 1995-12-10 Вячеслав Алексеевич Сапрыкин Способ передачи речевых сигналов
US6134518A (en) * 1997-03-04 2000-10-17 International Business Machines Corporation Digital audio signal coding using a CELP coder and a transform coder
JP3700890B2 (ja) * 1997-07-09 2005-09-28 ソニー株式会社 信号識別装置及び信号識別方法
RU2132593C1 (ru) * 1998-05-13 1999-06-27 Академия управления МВД России Многоканальное устройство для передачи речевых сигналов
SE0004187D0 (sv) 2000-11-15 2000-11-15 Coding Technologies Sweden Ab Enhancing the performance of coding systems that use high frequency reconstruction methods
US6785645B2 (en) * 2001-11-29 2004-08-31 Microsoft Corporation Real-time speech and music classifier
CN1279512C (zh) 2001-11-29 2006-10-11 编码技术股份公司 用于改善高频重建的方法和装置
AUPS270902A0 (en) * 2002-05-31 2002-06-20 Canon Kabushiki Kaisha Robust detection and classification of objects in audio using limited training data
JP4348970B2 (ja) * 2003-03-06 2009-10-21 ソニー株式会社 情報検出装置及び方法、並びにプログラム
JP2004354589A (ja) * 2003-05-28 2004-12-16 Nippon Telegr & Teleph Corp <Ntt> 音響信号判別方法、音響信号判別装置、音響信号判別プログラム
KR100816601B1 (ko) * 2004-06-01 2008-03-24 닛본 덴끼 가부시끼가이샤 정보 제공 시스템, 방법 및 정보 제공용 프로그램을 기록한 기록 매체
US7130795B2 (en) * 2004-07-16 2006-10-31 Mindspeed Technologies, Inc. Music detection with low-complexity pitch correlation algorithm
JP4587916B2 (ja) * 2005-09-08 2010-11-24 シャープ株式会社 音声信号判別装置、音質調整装置、コンテンツ表示装置、プログラム、及び記録媒体
JP2010503881A (ja) 2006-09-13 2010-02-04 テレフオンアクチーボラゲット エル エム エリクソン(パブル) 音声・音響送信器及び受信器のための方法及び装置
CN1920947B (zh) * 2006-09-15 2011-05-11 清华大学 用于低比特率音频编码的语音/音乐检测器
WO2008045846A1 (en) * 2006-10-10 2008-04-17 Qualcomm Incorporated Method and apparatus for encoding and decoding audio signals
EP2052548B1 (en) * 2006-12-12 2012-02-29 Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. Encoder, decoder and methods for encoding and decoding data segments representing a time-domain data stream
KR100964402B1 (ko) * 2006-12-14 2010-06-17 삼성전자주식회사 오디오 신호의 부호화 모드 결정 방법 및 장치와 이를 이용한 오디오 신호의 부호화/복호화 방법 및 장치
KR100883656B1 (ko) * 2006-12-28 2009-02-18 삼성전자주식회사 오디오 신호의 분류 방법 및 장치와 이를 이용한 오디오신호의 부호화/복호화 방법 및 장치
WO2010001393A1 (en) * 2008-06-30 2010-01-07 Waves Audio Ltd. Apparatus and method for classification and segmentation of audio content, based on the audio signal

Also Published As

Publication number Publication date
PT2301011T (pt) 2018-10-26
RU2507609C2 (ru) 2014-02-20
EP2301011A1 (en) 2011-03-30
BRPI0910793B8 (pt) 2021-08-24
AR072863A1 (es) 2010-09-29
US20110202337A1 (en) 2011-08-18
KR20130036358A (ko) 2013-04-11
JP5325292B2 (ja) 2013-10-23
RU2011104001A (ru) 2012-08-20
CA2730196C (en) 2014-10-21
MX2011000364A (es) 2011-02-25
KR20110039254A (ko) 2011-04-15
AU2009267507B2 (en) 2012-08-02
CO6341505A2 (es) 2011-11-21
KR101281661B1 (ko) 2013-07-03
KR101380297B1 (ko) 2014-04-02
CN102089803B (zh) 2013-02-27
ZA201100088B (en) 2011-08-31
TW201009813A (en) 2010-03-01
HK1158804A1 (en) 2012-07-20
WO2010003521A1 (en) 2010-01-14
ES2684297T3 (es) 2018-10-02
US8571858B2 (en) 2013-10-29
PL2301011T3 (pl) 2019-03-29
EP2301011B1 (en) 2018-07-25
BRPI0910793B1 (pt) 2020-11-24
CN102089803A (zh) 2011-06-08
CA2730196A1 (en) 2010-01-14
JP2011527445A (ja) 2011-10-27
MY153562A (en) 2015-02-27
AU2009267507A1 (en) 2010-01-14
TWI441166B (zh) 2014-06-11

Similar Documents

Publication Publication Date Title
BRPI0910793A2 (pt) método e discriminador para a classificação de diferentes segmentos de um sinal
BRPI0820488A2 (pt) método e equipamento para processar um sinal
BRPI0822345A2 (pt) Método e aparelho para mascarar perda de sinal
BRPI1012717A2 (pt) método e aparelho para a integração de dados de lugar fornecidos por comunidade
BRPI0918550A2 (pt) projeto de sinal de referência para lte avancada
BR112012003186A2 (pt) método de determinação de recurso de sinal
EP2380111A4 (en) FACE DETECTION ACCELERATION METHOD
BRPI0917910A2 (pt) método para detectar amostra de aspereza e aparelho para o método
BRPI1013585A2 (pt) método e dispositivo para classificação de sinal de áudio
BRPI0822112A8 (pt) sensor, método para detecção e método para produzir um sensor
BRPI1012752A2 (pt) método e aparelho para detectar um fenômeno físico.
BRPI0912299A2 (pt) método e dispositivo para a detecção de eventos de microssono
BRPI0910807A2 (pt) dispositivo de bloqueio para bloquear um sinal de radio e método para bloquear um sinal alvo
BRPI0810718A2 (pt) Método e aparelho para a formação de múltiplos microcondutos
BRPI1006217A2 (pt) aparelho e método para manipulação de um sinal de áudio
BRPI1010670A2 (pt) método e aparelho para sensoriamento ótico
DK2464555T3 (da) Fremgangsmåde og instrumentering til detektion af skinnedefekter, især detekter på skinneoversiden
BRPI0912610A2 (pt) métodos e sistemas para triagem universal de portador
BRPI1006542A2 (pt) sistema e método para detectar uma queda de um usuário
BRPI0913074A2 (pt) pneumático e método para sua fabricação
BRPI0902903A2 (pt) método de avaliação de sinal e aparelho de avaliação de sinal
BRPI1015922A2 (pt) sistemas e métodos para testar analitos
PT2207037T (pt) Método para a deteção de cancro
IT1401573B1 (it) Metodo e relativo apparato per la identificazione e classificazione antropometrica
BR112012018103A2 (pt) método para detectar a presença de uma amostra de testemunhagem dentro de uma ferramenta de testemunhagem, método para perfurar dentro de um poço com uma ferramenta de testemunhagem, e método para detectar a presença de uma amostra de testemunhagem dentro de uma ferramenta de testemunhagem

Legal Events

Date Code Title Description
B06F Objections, documents and/or translations needed after an examination request according [chapter 6.6 patent gazette]
B06T Formal requirements before examination [chapter 6.20 patent gazette]
B15K Others concerning applications: alteration of classification

Free format text: AS CLASSIFICACOES ANTERIORES ERAM: F27D 1/18 , F27D 11/10 , H05B 7/10 , F16C 32/06

Ipc: G10L 19/20 (2013.01), G10L 19/22 (2013.01), G10L 2

B06A Patent application procedure suspended [chapter 6.1 patent gazette]
B09A Decision: intention to grant [chapter 9.1 patent gazette]
B16A Patent or certificate of addition of invention granted [chapter 16.1 patent gazette]

Free format text: PRAZO DE VALIDADE: 10 (DEZ) ANOS CONTADOS A PARTIR DE 24/11/2020, OBSERVADAS AS CONDICOES LEGAIS.

B16C Correction of notification of the grant [chapter 16.3 patent gazette]

Free format text: REF. RPI 2603 DE 24/11/2020 QUANTO AO ENDERECO.