MX346732B - Low-complexity tonality-adaptive audio signal quantization

Low-complexity tonality-adaptive audio signal quantization

Info

Publication number
MX346732B
Authority
MX
Mexico
Prior art keywords
audio signal
tonality
dead
zone
spectrum
Prior art date
Application number
MX2015009753A
Other languages
English (en)
Other versions
MX2015009753A (es)
Inventor
Fuchs Guillaume
Helmrich Christian
Dietz Martin
Markovic Goran
Original Assignee
Fraunhofer Ges Forschung
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Fraunhofer Ges Forschung
Publication of MX2015009753A
Publication of MX346732B

Classifications

    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L19/00 Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
    • G10L19/02 Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using spectral analysis, e.g. transform vocoders or subband vocoders
    • G10L19/032 Quantisation or dequantisation of spectral components
    • G10L19/035 Scalar quantisation
    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10H ELECTROPHONIC MUSICAL INSTRUMENTS; INSTRUMENTS IN WHICH THE TONES ARE GENERATED BY ELECTROMECHANICAL MEANS OR ELECTRONIC GENERATORS, OR IN WHICH THE TONES ARE SYNTHESISED FROM A DATA STORE
    • G10H1/00 Details of electrophonic musical instruments
    • G10H1/02 Means for controlling the tone frequencies, e.g. attack or decay; Means for producing special musical effects, e.g. vibratos or glissandos
    • G10H1/06 Circuits for establishing the harmonic content of tones, or other arrangements for changing the tone colour
    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L19/00 Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
    • G10L19/02 Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using spectral analysis, e.g. transform vocoders or subband vocoders
    • G10L19/0212 Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using spectral analysis, e.g. transform vocoders or subband vocoders using orthogonal transformation
    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L25/00 Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
    • G10L25/03 Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 characterised by the type of extracted parameters
    • G10L25/18 Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 characterised by the type of extracted parameters the extracted parameters being spectral information of each sub-band
    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L25/00 Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
    • G10L25/03 Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 characterised by the type of extracted parameters
    • G10L25/21 Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 characterised by the type of extracted parameters the extracted parameters being power information
    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L25/00 Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
    • G10L25/45 Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 characterised by the type of analysis window
    • H ELECTRICITY
    • H03 ELECTRONIC CIRCUITRY
    • H03M CODING; DECODING; CODE CONVERSION IN GENERAL
    • H03M7/00 Conversion of a code where information is represented by a given sequence or number of digits to a code where the same, similar or subset of information is represented by a different sequence or number of digits
    • H03M7/30 Compression; Expansion; Suppression of unnecessary data, e.g. redundancy reduction
    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10H ELECTROPHONIC MUSICAL INSTRUMENTS; INSTRUMENTS IN WHICH THE TONES ARE GENERATED BY ELECTROMECHANICAL MEANS OR ELECTRONIC GENERATORS, OR IN WHICH THE TONES ARE SYNTHESISED FROM A DATA STORE
    • G10H2210/00 Aspects or methods of musical processing having intrinsic musical character, i.e. involving musical theory or musical parameters or relying on musical knowledge, as applied in electrophonic musical tools or instruments
    • G10H2210/555 Tonality processing, involving the key in which a musical piece or melody is played
    • G10H2210/561 Changing the tonality within a musical piece
    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L25/00 Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
    • G10L25/03 Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 characterised by the type of extracted parameters

Landscapes

  • Engineering & Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • Acoustics & Sound (AREA)
  • Multimedia (AREA)
  • Spectroscopy & Molecular Physics (AREA)
  • Computational Linguistics (AREA)
  • Signal Processing (AREA)
  • Health & Medical Sciences (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Human Computer Interaction (AREA)
  • Theoretical Computer Science (AREA)
  • Compression, Expansion, Code Conversion, And Decoders (AREA)

Abstract

The invention provides an audio encoder for encoding an audio signal (AS) so as to produce therefrom an encoded signal (ES), the audio encoder (1) comprising: a framing device (2) configured to extract frames (F) from the audio signal (AS); a quantizer (3) configured to map spectral lines (SL1-32) of a spectrum signal (SPS) derived from the frame (F) of the audio signal (AS) to quantization indices (I0, I1), wherein the quantizer (3) has a dead zone (DZ), in which the input spectral lines (SL) are mapped to quantization index zero (I0); and a control device (4) configured to modify the dead zone (DZ); wherein the control device (4) comprises a tonality calculating device (5) configured to calculate at least one tonality indicating value (TI5-32) for at least one spectrum line (SL1-32) or for at least one group of spectral lines (SL1-32), wherein the control device (4) is configured to modify the dead zone (DZ) for the at least one spectrum line (SL1-32) or the at least one group of spectrum lines (SL1-32) depending on the respective tonality indicating value (TI5-32).
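The arrangement described in the abstract can be illustrated with a minimal Python sketch. It is not the patented implementation: the abstract does not say how the tonality indicating value is computed or in which direction the dead zone is changed. The neighbourhood-ratio tonality measure, the rounding offsets (0.5 and 0.3), the threshold (2.0), and the function names `tonality_indicator`, `quantize_line` and `adaptive_quantize` are all assumptions made for illustration. The dead zone is modelled through a rounding offset: magnitudes below `(1 - offset) * step` map to index zero, so a smaller offset means a wider dead zone.

```python
def tonality_indicator(spectrum, k, radius=2):
    """Hypothetical tonality measure (not from the patent): magnitude of
    line k divided by the mean magnitude of its neighbourhood."""
    lo = max(0, k - radius)
    hi = min(len(spectrum), k + radius + 1)
    mags = [abs(x) for x in spectrum[lo:hi]]
    mean_mag = sum(mags) / len(mags)
    return abs(spectrum[k]) / mean_mag if mean_mag > 0 else 0.0


def quantize_line(x, step, offset):
    """Uniform scalar quantizer with a rounding offset.
    Magnitudes below (1 - offset) * step fall in the dead zone (index 0)."""
    index = int(abs(x) / step + offset)
    return index if x >= 0 else -index


def adaptive_quantize(spectrum, step, tonal_offset=0.5, noisy_offset=0.3,
                      threshold=2.0):
    """Control logic: pick the dead zone per spectral line from its tonality.
    Assumption: noise-like lines get the wider dead zone (smaller offset)."""
    indices = []
    for k, x in enumerate(spectrum):
        ti = tonality_indicator(spectrum, k)
        offset = tonal_offset if ti >= threshold else noisy_offset
        indices.append(quantize_line(x, step, offset))
    return indices


if __name__ == "__main__":
    flat = [0.6, 0.6, 0.6, 0.6, 0.6]   # noise-like: no line stands out
    peak = [0.0, 0.0, 0.6, 0.0, 0.0]   # tonal: one isolated line
    print(adaptive_quantize(flat, step=1.0))
    print(adaptive_quantize(peak, step=1.0))
```

With these assumed parameters, a line of magnitude 0.6 is quantized to zero when it sits in a flat, noise-like neighbourhood, but survives as index 1 when it is an isolated peak, which is the kind of per-line dead-zone control the abstract describes.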
MX2015009753A 2013-01-29 2014-01-28 Low-complexity tonality-adaptive audio signal quantization MX346732B (es)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
US201361758191P 2013-01-29 2013-01-29
PCT/EP2014/051624 WO2014118171A1 (en) 2013-01-29 2014-01-28 Low-complexity tonality-adaptive audio signal quantization

Publications (2)

Publication Number Publication Date
MX2015009753A (es) 2015-11-06
MX346732B (es) 2017-03-30

Family

ID=50023575

Family Applications (1)

Application Number Title Priority Date Filing Date
MX2015009753A MX346732B (es) 2013-01-29 2014-01-28 Low-complexity tonality-adaptive audio signal quantization

Country Status (19)

Country Link
US (3) US10468043B2 (es)
EP (1) EP2939235B1 (es)
JP (3) JP6334564B2 (es)
KR (1) KR101757341B1 (es)
CN (2) CN105103226B (es)
AR (1) AR095087A1 (es)
AU (1) AU2014211539B2 (es)
BR (1) BR112015018050B1 (es)
CA (1) CA2898789C (es)
ES (1) ES2613651T3 (es)
MX (1) MX346732B (es)
MY (1) MY172848A (es)
PL (1) PL2939235T3 (es)
PT (1) PT2939235T (es)
RU (1) RU2621003C2 (es)
SG (1) SG11201505922XA (es)
TW (1) TWI524331B (es)
WO (1) WO2014118171A1 (es)
ZA (1) ZA201506319B (es)

Families Citing this family (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JP6334564B2 (ja) 2013-01-29 2018-05-30 フラウンホーファーゲゼルシャフト ツール フォルデルング デル アンゲヴァンテン フォルシユング エー.フアー. Low-complexity tonality-adaptive audio signal quantization
EP3396670B1 (en) * 2017-04-28 2020-11-25 Nxp B.V. Speech signal processing
CN113539281B (zh) * 2020-04-21 2024-09-06 华为技术有限公司 Audio signal encoding method and apparatus
US11348594B2 (en) 2020-06-11 2022-05-31 Qualcomm Incorporated Stream conformant bit error resilience
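WO2022119304A1 (ko) * 2020-12-01 2022-06-09 현대자동차주식회사 Point cloud coding apparatus and method using adaptive dead-zone quantization
CN118395096B (zh) * 2024-06-27 2024-09-17 江西飞尚科技有限公司 Signal frequency correction, apparatus, readable storage medium and electronic device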

Family Cites Families (40)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JP2841765B2 (ja) * 1990-07-13 1998-12-24 日本電気株式会社 Adaptive bit allocation method and apparatus
TW224553B (en) * 1993-03-01 1994-06-01 Sony Co Ltd Method and apparatus for inverse discrete cosine transform and coding/decoding of moving picture
EP0692880B1 (en) * 1993-11-04 2001-09-26 Sony Corporation Signal encoder, signal decoder, recording medium and signal encoding method
US6167093A (en) * 1994-08-16 2000-12-26 Sony Corporation Method and apparatus for encoding the information, method and apparatus for decoding the information and method for information transmission
DE19505435C1 (de) 1995-02-17 1995-12-07 Fraunhofer Ges Forschung Method and device for determining the tonality of an audio signal
JP3308764B2 (ja) * 1995-05-31 2002-07-29 日本電気株式会社 Speech coding apparatus
DE19614108C1 (de) * 1996-04-10 1997-10-23 Fraunhofer Ges Forschung Arrangement for measuring the coordinates of a retroreflector attached to an object
US5924064A (en) * 1996-10-07 1999-07-13 Picturetel Corporation Variable length coding using a plurality of region bit allocation patterns
US6301304B1 (en) * 1998-06-17 2001-10-09 Lsi Logic Corporation Architecture and method for inverse quantization of discrete cosine transform coefficients in MPEG decoders
CA2246532A1 (en) * 1998-09-04 2000-03-04 Northern Telecom Limited Perceptual audio coding
DE10134471C2 (de) * 2001-02-28 2003-05-22 Fraunhofer Ges Forschung Method and device for characterizing a signal and method and device for generating an indexed signal
US7447631B2 (en) * 2002-06-17 2008-11-04 Dolby Laboratories Licensing Corporation Audio coding system using spectral hole filling
US7280700B2 (en) 2002-07-05 2007-10-09 Microsoft Corporation Optimization techniques for data compression
US8090577B2 (en) * 2002-08-08 2012-01-03 Qualcomm Incorporated Bandwidth-adaptive quantization
US7502743B2 (en) 2002-09-04 2009-03-10 Microsoft Corporation Multi-channel audio encoding and decoding with multi-channel transform selection
JP3881943B2 (ja) * 2002-09-06 2007-02-14 松下電器産業株式会社 Audio encoding apparatus and audio encoding method
US7318027B2 (en) * 2003-02-06 2008-01-08 Dolby Laboratories Licensing Corporation Conversion of synthesized spectral components for encoding and low-complexity transcoding
US7333930B2 (en) 2003-03-14 2008-02-19 Agere Systems Inc. Tonal analysis for perceptual audio coding using a compressed spectral representation
US7738554B2 (en) * 2003-07-18 2010-06-15 Microsoft Corporation DC coefficient signaling at small quantization step sizes
JP4168976B2 (ja) * 2004-05-28 2008-10-22 ソニー株式会社 Audio signal encoding apparatus and method
FR2882458A1 (fr) * 2005-02-18 2006-08-25 France Telecom Method for measuring the annoyance caused by noise in an audio signal
US7983922B2 (en) * 2005-04-15 2011-07-19 Fraunhofer-Gesellschaft Zur Foerderung Der Angewandten Forschung E.V. Apparatus and method for generating multi-channel synthesizer control signal and apparatus and method for multi-channel synthesizing
US8059721B2 (en) * 2006-04-07 2011-11-15 Microsoft Corporation Estimating sample-domain distortion in the transform domain with rounding compensation
US7995649B2 (en) * 2006-04-07 2011-08-09 Microsoft Corporation Quantization adjustment based on texture level
US20080049950A1 (en) * 2006-08-22 2008-02-28 Poletti Mark A Nonlinear Processor for Audio Signals
EP2122615B1 (en) * 2006-10-20 2011-05-11 Dolby Sweden AB Apparatus and method for encoding an information signal
JP5065687B2 (ja) * 2007-01-09 2012-11-07 株式会社東芝 Audio data processing device and terminal device
US8498335B2 (en) * 2007-03-26 2013-07-30 Microsoft Corporation Adaptive deadzone size adjustment in quantization
DE602008005250D1 (de) * 2008-01-04 2011-04-14 Dolby Sweden Ab Audio encoder and decoder
JP5262171B2 (ja) 2008-02-19 2013-08-14 富士通株式会社 Encoding device, encoding method and encoding program
WO2010001020A2 (fr) * 2008-06-06 2010-01-07 France Telecom Improved coding/decoding by bit planes
ES2988414T3 (es) * 2008-07-11 2024-11-20 Fraunhofer Ges Zur Foerderung der Angewandten Forschung E V Audio decoder
JP4932917B2 (ja) 2009-04-03 2012-05-16 株式会社エヌ・ティ・ティ・ドコモ Speech decoding device, speech decoding method, and speech decoding program
WO2010134963A1 (en) * 2009-05-16 2010-11-25 Thomson Licensing Methods and apparatus for improved quantization rounding offset adjustment for video encoding and decoding
CA2992917C (en) * 2010-04-09 2020-05-26 Dolby International Ab Mdct-based complex prediction stereo coding
CA2833874C (en) 2011-04-21 2019-11-05 Ho-Sang Sung Method of quantizing linear predictive coding coefficients, sound encoding method, method of de-quantizing linear predictive coding coefficients, sound decoding method, and recording medium
TWI473078B (zh) * 2011-08-26 2015-02-11 Univ Nat Central Audio processing method and apparatus
US8885706B2 (en) * 2011-09-16 2014-11-11 Google Inc. Apparatus and methodology for a video codec system with noise reduction capability
JP6334564B2 (ja) 2013-01-29 2018-05-30 フラウンホーファーゲゼルシャフト ツール フォルデルング デル アンゲヴァンテン フォルシユング エー.フアー. Low-complexity tonality-adaptive audio signal quantization
EP3483879A1 (en) * 2017-11-10 2019-05-15 Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. Analysis/synthesis windowing function for modulated lapped transformation

Also Published As

Publication number Publication date
BR112015018050B1 (pt) 2021-02-23
JP2016510426A (ja) 2016-04-07
AU2014211539A1 (en) 2015-09-17
TW201440039A (zh) 2014-10-16
JP6334564B2 (ja) 2018-05-30
ZA201506319B (en) 2016-07-27
ES2613651T3 (es) 2017-05-25
KR101757341B1 (ko) 2017-07-14
US20200090671A1 (en) 2020-03-19
US20210366499A1 (en) 2021-11-25
EP2939235B1 (en) 2016-11-16
JP6526091B2 (ja) 2019-06-05
PT2939235T (pt) 2017-02-07
CN105103226A (zh) 2015-11-25
JP6979048B2 (ja) 2021-12-08
CA2898789C (en) 2017-12-05
MY172848A (en) 2019-12-12
BR112015018050A2 (pt) 2017-07-18
JP2019164367A (ja) 2019-09-26
TWI524331B (zh) 2016-03-01
KR20150118954A (ko) 2015-10-23
CA2898789A1 (en) 2014-08-07
SG11201505922XA (en) 2015-08-28
PL2939235T3 (pl) 2017-04-28
US20160027448A1 (en) 2016-01-28
AR095087A1 (es) 2015-09-30
CN110047499B (zh) 2023-08-29
WO2014118171A1 (en) 2014-08-07
CN110047499A (zh) 2019-07-23
JP2017151454A (ja) 2017-08-31
CN105103226B (zh) 2019-04-16
RU2015136242A (ru) 2017-03-07
AU2014211539B2 (en) 2017-04-20
US11094332B2 (en) 2021-08-17
US10468043B2 (en) 2019-11-05
EP2939235A1 (en) 2015-11-04
US11694701B2 (en) 2023-07-04
RU2621003C2 (ru) 2017-05-30
HK1216263A1 (en) 2016-10-28
MX2015009753A (es) 2015-11-06

Similar Documents

Publication Publication Date Title
MX346732B (es) Low-complexity tonality-adaptive audio signal quantization
PH12016500079A1 (en) Palette prediction in palette-based video coding
PH12021550947A1 (en) Coefficient processing for video encoding and decoding
BR112017018441A2 (pt) Audio encoder for encoding a multichannel signal and audio decoder for decoding an encoded audio signal
MY192214A (en) Multi-channel audio decoder, multi-channel audio encoder, methods and computer program using a residual-signal-based adjustment of a contribution of a decorrelated signal
MX346927B (es) Low-frequency emphasis for LPC-based (linear predictive coding) coding in the frequency domain
MX2017001243A (es) Audio encoder and decoder using a frequency-domain processor, a time-domain processor and a cross processor for continuous initialization
MY172757A (en) Determining contexts for coding transform coefficient data in video coding
MY155785A (en) Noise filler, noise filling parameter calculator encoded audio signal representation, methods and computer program
PH12014502044A1 (en) Deriving context for last position coding for video coding
IN2014MN02210A (es)
BR112017019185A2 (pt) Audio encoder, audio decoder, method for encoding an audio signal and method for decoding an encoded audio signal
MY180722A (en) Concept for encoding an audio signal and decoding an audio signal using speech related spectral shaping information
MX358178B (es) POC value design for multi-layer video coding
PH12018501871A1 (en) Signal encoding method and device
MX350687B (es) Methods and apparatus for adapting audio information in spatial audio object coding
MX2016001016A (es) Scan orders for non-transform coding
EP4375992A3 (en) Method and device for quantizing linear predictive coefficient, and method and device for dequantizing same
MX2019003952A (es) Image decoding device and image decoding method
MY173129A (en) Audio encoding method and apparatus
BR112017021424A2 (pt) Audio encoder and method for encoding an audio signal
BR112015030852A2 (pt) Signal encoding and decoding method and device
AR098072A1 (es) Concept for encoding an audio signal and decoding an audio signal using speech-related spectral shaping information
TH93931B (th) Low-complexity tonality-adaptive audio signal quantization
TH171863A (th) Low-complexity tonality-adaptive audio signal quantization

Legal Events

Date Code Title Description
FG Grant or registration