WO2003019532A1 - Audio coding with non-uniform filter bank - Google Patents
Audio coding with non-uniform filter bank Download PDFInfo
- Publication number
- WO2003019532A1 WO2003019532A1 PCT/IB2002/003316 IB0203316W WO03019532A1 WO 2003019532 A1 WO2003019532 A1 WO 2003019532A1 IB 0203316 W IB0203316 W IB 0203316W WO 03019532 A1 WO03019532 A1 WO 03019532A1
- Authority
- WO
- WIPO (PCT)
- Prior art keywords
- filters
- matrix
- segmentation
- filter bank
- uniform
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Ceased
Links
Classifications
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L19/00—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
- G10L19/02—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using spectral analysis, e.g. transform vocoders or subband vocoders
- G10L19/0204—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using spectral analysis, e.g. transform vocoders or subband vocoders using subband decomposition
- G10L19/0208—Subband vocoders
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04B—TRANSMISSION
- H04B1/00—Details of transmission systems, not covered by a single one of groups H04B3/00 - H04B13/00; Details of transmission systems not characterised by the medium used for transmission
- H04B1/66—Details of transmission systems, not covered by a single one of groups H04B3/00 - H04B13/00; Details of transmission systems not characterised by the medium used for transmission for reducing bandwidth of signals; for improving efficiency of transmission
- H04B1/667—Details of transmission systems, not covered by a single one of groups H04B3/00 - H04B13/00; Details of transmission systems not characterised by the medium used for transmission for reducing bandwidth of signals; for improving efficiency of transmission using a division in frequency subbands
-
- H—ELECTRICITY
- H03—ELECTRONIC CIRCUITRY
- H03H—IMPEDANCE NETWORKS, e.g. RESONANT CIRCUITS; RESONATORS
- H03H17/00—Networks using digital techniques
- H03H17/02—Frequency selective networks
- H03H17/0248—Filters characterised by a particular frequency response or filtering method
- H03H17/0264—Filter sets with mutual related characteristics
- H03H17/0266—Filter banks
Definitions
- the present invention relates to coding and decoding audio signals.
- Figure 1(a) shows a basic block diagram for a system including a conventional M-channel analysis filter bank 10 and a synthesis filter bank 12.
- the synthesis filter bank comprises a collection of filters F k (z) each with an associated input channel and a common output y(n).
- each channel is decimated by a factor M and in the synthesis filter bank 12, it is interpolated by a factor M. If the degree of interpolation is equal to the degree of decimation, as in the example, the filter bank is critically sampled and if all the filters have the same bandwidth, the filter bank is a uniform filter bank.
- the M-channels output by the analysis filter bank 10 can be processed in any number of ways. For example, if the analysis filter bank 10 forms part of an audio encoder, then for a given update interval, the channel data and possibly the filter bank structure can be encoded in a bitstream representing the audio signal x(n). If the synthesis filter bank 12 forms part of an audio decoder, then the synthesis filter bank structure are combined with the channel data to generate the signal y(n). Alternatively, both banks 10, 12 may be included in an audio processing system where, for example, the signal x(n) is subjected to some form of post-processing with the processed signal y(n) being stored on a storage medium or relayed on a transmission medium.
- the analysis and synthesis filters are cosine-modulated versions of a single prototype filter.
- a known formula for the analysis and synthesis filters is:
- pseudo-QMF Quadrature Mirror Filter
- non-uniform filter banks i.e. filter banks where the filters have varying bandwidths.
- filter banks that can adapt to the time-frequency energy distribution and characteristics of the input signal.
- the design of non-uniform filter banks is in general quite complex, but some recent methods allow for the design of non-uniform CMF banks.
- H.S. Malvar "Biorthogonal and non-uniform lapped transforms for transform coding with reduced blocking and ringing artefacts," IEEE Trans. Signal Processing, vol. 46, no. 4, pp. 1043-1053, April 1998; and H.S. Malvar, "Enhancing the performance of sub-band audio coders for speech signals,” in Proc. Int. Symp. Circuits and Systems '98, nn. 90-101, June 1998; and US Patent No. 6,115,689, Malvar disclose a method for constructing non-uniform modulated lapped transforms (MLT). This involves combining sub-band filters of a uniform MLT and will be referred to herein as sub-band merging.
- MLT non-uniform modulated lapped transforms
- the combined sub-band filters have better time localization than the non-combined filters at the expense of a decrease in frequency localization. Since the non-uniform filter banks are obtained by simply taking linear combinations of the filters of a uniform MLT, the method allows for an efficient implementation of time- varying transforms.
- Malvar discloses that sub- band merging can be used beneficially for reducing ringing artefacts, e.g. reverberation and pre-echo, in audio and speech coding.
- the design of such transforms is restricted in several ways: Only 2 or 4 subband filters can be combined and only a fixed number of pairs of high-frequency coefficients is combined, i.e. 16x2 filters, 8x4 filters. Furthermore no systematic design procedure is disclosed. In particular, in the case of combined 4 sub-band filters a difficult set of parameters is chosen to provide the required output.
- the present invention provides a sub-band merging method which allows an arbitrary number of sub-bands to be combined in a systematic way.
- the preferred embodiments show that starting from a uniform CMF bank, linear combinations of the constituent filters can be taken such that the resulting combined filters have good frequency selective properties and flat pass-band response.
- Figure 1(a) is a block diagram of a conventional analysis/synthesis filter bank
- Figure 1(b) is a block diagram of an analysis/synthesis filter bank according to a preferred embodiment of the invention
- Figure 2 illustrates the characteristics of a prototype filter Po employed in the preferred embodiment of the invention
- Figures 3(a) and (b) compare time-domain responses of a filter bank of the preferred embodiment with those of a prior art filter bank (a) refers to prior art, (b) to preferred embodiment;
- Figures 4(a) and (b) compare magnitude responses of a filter bank of the preferred embodiment with those of a prior art filter bank (a) refers to prior art, (b) to preferred embodiment; and Figure 5 shows a practical embodiment of a filter bank according to the present invention.
- an M-channel maximally decimated uniform CMF bank 10, 12 comprises filters H ⁇ ), F ⁇ ) derived by cosine modulation of a single prototype filter Po ideally as illustrated in Figure 2.
- a localisation module 14 determines from an analysis of the time-frequency energy distribution and signal characteristics of the signal x(n) in a given time interval, that it is preferable to de- localise frequency segmentation in favour of increased time resolution to provide improved encoded signal quality. (Alternatively the module 14 may determine that a lowering of overall bit-rate may be possible while maintaining the same level of quality if frequency segmentation is de-localised.)
- the module 14 determines that x groups of filters comprising any number p ⁇ M adjacent filters in the uniform CMF bank are to be combined in segmentation matrices S i ... S x to provide a non-uniform filter bank.
- the encoded signal including channel data and indications of the frequency segmentation to be employed in any given time interval is decoded in inverse segmentation matrices S " S ...S _1 x to provide inputs for a uniform synthesis filter bank 12.
- the magnitude characteristics of its filters must exhibit good frequency selectivity and flat passband response.
- a merged filter p> ⁇ ) to be a linear combination ofp adjacent filters starting from the k filter in a uniform CMF bank, i.e.
- H p> ⁇ has a flat passband response and a transition bandwidth similar to those of the underlying uniformly spaced sub- band filters. If the prototype filter satisfies the condition on the stopband reduction (as the exemplary Po), there is no spectral overlap between filters H ⁇ ) and H ⁇ ) for
- the prototype filter Po can't be implemented in practical applications since it requires infinite length filters. Therefore, in practical situations, overlapping terms in the frequency domain of non-adjacent filters do exist and result in ripples in the passband of the combined filters. However, by keeping the stop-band attenuation of the prototype filter high, these ripples are kept to a minimum.
- condition on ⁇ is a new restriction on the under-lying umform CMF bank, but this is not the case.
- Most CMF banks known from literature satisfy the condition on ⁇ since it cancels first-order aliasing and magnitude distortion at ⁇ e ⁇ 0, ⁇ .
- condition on b k this amounts to choosing combinatorial coefficients of magnitude 1 that can only differ in sign.
- the combination operation can be represented by a matrix multiplication.
- a matrix A containing the impulse responses of the analysis filters of the uniform CMF bank as:
- the combinatorial coefficients b k are found in the rows of the block-diagonal element of S, which in this case is a size 2 Hadamard matrix - a non-singular matrix.
- the non-singular block-diagonal element in S is of size p x p having entries ⁇ 1.
- such a non-singular matrix is the p x p principal sub-matrix of a size N ⁇ p Hadamard matrix.
- PR non-uniform CMF banks representing a desired filter bank structure can be provided in an encoder through a matrix multiplication of the component filters and non-singular blocks from Hadamard matrices.
- the transform AS - S ⁇ A' can be made unitary (orthonormal) by scaling the combinatorial coefficients b k properly, so that, assuming the original uniform filter bank is unitary, the non-uniform filter bank is unitary as well.
- the matrix S is:
- FIG. 5 illustrates an analysis filter bank 10' of the form employed in an MPEG encoder.
- the input signal x(n) is connected through a tapped delay line with each successively delayed signal being decimated by a factor M.
- this schema means that only decimated signals are , filtered rather than vice versa.
- the decimated signals are filtered by respective pairs of filter functions G m (-z 2 ) and their outputs are cross-linked within a cosine modulation module which produces M output channels.
- a localisation module 14 determines that frequency de-localisation in a given sub-band will improve the quality of response by improving time resolution, then one or more groups of adjacent filter output channels are combined accordingly within the segmentation matrix system S which comprises one or more principle submatrices of Hadamard matrices as described above.
- an audio encoder including the segmentation matrices according to the preferred embodiment of the present invention can lower its bit-rate so saving in overall bandwidth.
- improved quality will be provided for the same bit-rate.
Landscapes
- Engineering & Computer Science (AREA)
- Signal Processing (AREA)
- Physics & Mathematics (AREA)
- Acoustics & Sound (AREA)
- Computational Linguistics (AREA)
- Health & Medical Sciences (AREA)
- Audiology, Speech & Language Pathology (AREA)
- Human Computer Interaction (AREA)
- Spectroscopy & Molecular Physics (AREA)
- Multimedia (AREA)
- Computer Networks & Wireless Communication (AREA)
- Compression, Expansion, Code Conversion, And Decoders (AREA)
- Signal Processing Not Specific To The Method Of Recording And Reproducing (AREA)
- Piezo-Electric Or Mechanical Vibrators, Or Delay Or Filter Circuits (AREA)
- Analogue/Digital Conversion (AREA)
- Reduction Or Emphasis Of Bandwidth Of Signals (AREA)
Abstract
Description
Claims
Priority Applications (4)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| EP02755519A EP1421579B1 (en) | 2001-08-21 | 2002-08-14 | Audio coding with non-uniform filter bank |
| JP2003522909A JP2005501277A (en) | 2001-08-21 | 2002-08-14 | Audio coding using a non-uniform filter bank. |
| DE60210479T DE60210479T2 (en) | 2001-08-21 | 2002-08-14 | AUDIO CODERS WITH IRREGULAR FILTER BANK |
| US10/487,164 US20040254797A1 (en) | 2001-08-21 | 2002-08-14 | Audio coding with non-uniform filter bank |
Applications Claiming Priority (2)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| EP01203161 | 2001-08-21 | ||
| EP01203161.3 | 2001-08-21 |
Publications (1)
| Publication Number | Publication Date |
|---|---|
| WO2003019532A1 true WO2003019532A1 (en) | 2003-03-06 |
Family
ID=8180810
Family Applications (1)
| Application Number | Title | Priority Date | Filing Date |
|---|---|---|---|
| PCT/IB2002/003316 Ceased WO2003019532A1 (en) | 2001-08-21 | 2002-08-14 | Audio coding with non-uniform filter bank |
Country Status (7)
| Country | Link |
|---|---|
| US (1) | US20040254797A1 (en) |
| EP (1) | EP1421579B1 (en) |
| JP (1) | JP2005501277A (en) |
| CN (1) | CN1223992C (en) |
| AT (1) | ATE322734T1 (en) |
| DE (1) | DE60210479T2 (en) |
| WO (1) | WO2003019532A1 (en) |
Cited By (3)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| EP1750483A1 (en) * | 2005-08-02 | 2007-02-07 | GN ReSound A/S | A hearing aid with suppression of wind noise |
| US7778196B2 (en) * | 2005-05-31 | 2010-08-17 | Avaya Inc. | Method and apparatus for link performance measurements in a packet-switched network |
| CN109495085A (en) * | 2009-02-18 | 2019-03-19 | 杜比国际公司 | Low latency modulated filter group |
Families Citing this family (5)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| RU2420814C2 (en) * | 2006-03-29 | 2011-06-10 | Конинклейке Филипс Электроникс Н.В. | Audio decoding |
| HUE071544T2 (en) * | 2009-10-21 | 2025-09-28 | Dolby Int Ab | Oversampling in a combined transposer filter bank |
| US10158375B1 (en) | 2018-03-21 | 2018-12-18 | Nxp Usa, Inc. | PDM bitstream to PCM data converter using Walsh-Hadamard transform |
| TWI866996B (en) | 2019-06-26 | 2024-12-21 | 美商杜拜研究特許公司 | Low latency audio filterbank with improved frequency resolution |
| IL290390B2 (en) | 2019-09-03 | 2025-05-01 | Dolby Laboratories Licensing Corp | Audio filterbank with decorrelating components |
Family Cites Families (10)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| US4790016A (en) * | 1985-11-14 | 1988-12-06 | Gte Laboratories Incorporated | Adaptive method and apparatus for coding speech |
| US4754492A (en) * | 1985-06-03 | 1988-06-28 | Picturetel Corporation | Method and system for adapting a digitized signal processing system for block processing with minimal blocking artifacts |
| US5109417A (en) * | 1989-01-27 | 1992-04-28 | Dolby Laboratories Licensing Corporation | Low bit rate transform coder, decoder, and encoder/decoder for high-quality audio |
| DE3902948A1 (en) * | 1989-02-01 | 1990-08-09 | Telefunken Fernseh & Rundfunk | METHOD FOR TRANSMITTING A SIGNAL |
| US5502789A (en) * | 1990-03-07 | 1996-03-26 | Sony Corporation | Apparatus for encoding digital data with reduction of perceptible noise |
| US5285498A (en) * | 1992-03-02 | 1994-02-08 | At&T Bell Laboratories | Method and apparatus for coding audio signals based on perceptual model |
| US5913186A (en) * | 1996-03-25 | 1999-06-15 | Prometheus, Inc. | Discrete one dimensional signal processing apparatus and method using energy spreading coding |
| US5805739A (en) * | 1996-04-02 | 1998-09-08 | Picturetel Corporation | Lapped orthogonal vector quantization |
| US5848391A (en) * | 1996-07-11 | 1998-12-08 | Fraunhofer-Gesellschaft Zur Forderung Der Angewandten Forschung E.V. | Method subband of coding and decoding audio signals using variable length windows |
| US6115689A (en) * | 1998-05-27 | 2000-09-05 | Microsoft Corporation | Scalable audio coder and decoder |
-
2002
- 2002-08-14 EP EP02755519A patent/EP1421579B1/en not_active Expired - Lifetime
- 2002-08-14 AT AT02755519T patent/ATE322734T1/en not_active IP Right Cessation
- 2002-08-14 DE DE60210479T patent/DE60210479T2/en not_active Expired - Fee Related
- 2002-08-14 CN CNB028162773A patent/CN1223992C/en not_active Expired - Fee Related
- 2002-08-14 US US10/487,164 patent/US20040254797A1/en not_active Abandoned
- 2002-08-14 WO PCT/IB2002/003316 patent/WO2003019532A1/en not_active Ceased
- 2002-08-14 JP JP2003522909A patent/JP2005501277A/en active Pending
Non-Patent Citations (3)
| Title |
|---|
| MALVAR H: "Enhancing the performance of subband audio coders for speech signals", CIRCUITS AND SYSTEMS, 1998. ISCAS '98. PROCEEDINGS OF THE 1998 IEEE INTERNATIONAL SYMPOSIUM ON MONTEREY, CA, USA 31 MAY-3 JUNE 1998, NEW YORK, NY, USA,IEEE, US, 31 May 1998 (1998-05-31), pages 98 - 101, XP010289988, ISBN: 0-7803-4455-3 * |
| MOON HO LEE ET AL: "The design of multidimensional filter bank using reverse jacket matrix", TENCON 99. PROCEEDINGS OF THE IEEE REGION 10 CONFERENCE CHEJU ISLAND, SOUTH KOREA 15-17 SEPT. 1999, PISCATAWAY, NJ, USA,IEEE, US, 15 September 1999 (1999-09-15), pages 637 - 641, XP010368257, ISBN: 0-7803-5739-6 * |
| PAINTER T ET AL: "A review of algorithms for perceptual coding of digital audio signals", DIGITAL SIGNAL PROCESSING PROCEEDINGS, 1997. DSP 97., 1997 13TH INTERNATIONAL CONFERENCE ON SANTORINI, GREECE 2-4 JULY 1997, NEW YORK, NY, USA,IEEE, US, 2 July 1997 (1997-07-02), pages 179 - 208, XP010251044, ISBN: 0-7803-4137-6 * |
Cited By (6)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| US7778196B2 (en) * | 2005-05-31 | 2010-08-17 | Avaya Inc. | Method and apparatus for link performance measurements in a packet-switched network |
| EP1750483A1 (en) * | 2005-08-02 | 2007-02-07 | GN ReSound A/S | A hearing aid with suppression of wind noise |
| US8019103B2 (en) | 2005-08-02 | 2011-09-13 | Gn Resound A/S | Hearing aid with suppression of wind noise |
| CN109495085A (en) * | 2009-02-18 | 2019-03-19 | 杜比国际公司 | Low latency modulated filter group |
| US11735198B2 (en) | 2009-02-18 | 2023-08-22 | Dolby International Ab | Digital filterbank for spectral envelope adjustment |
| US12159642B2 (en) | 2009-02-18 | 2024-12-03 | Dolby International Ab | Digital filterbank for spectral envelope adjustment |
Also Published As
| Publication number | Publication date |
|---|---|
| US20040254797A1 (en) | 2004-12-16 |
| CN1223992C (en) | 2005-10-19 |
| EP1421579A1 (en) | 2004-05-26 |
| JP2005501277A (en) | 2005-01-13 |
| ATE322734T1 (en) | 2006-04-15 |
| EP1421579B1 (en) | 2006-04-05 |
| DE60210479T2 (en) | 2007-04-12 |
| DE60210479D1 (en) | 2006-05-18 |
| CN1545697A (en) | 2004-11-10 |
Similar Documents
| Publication | Publication Date | Title |
|---|---|---|
| EP3937378B1 (en) | Complex exponential modulated filter bank for high frequency reconstruction or parametric stereo | |
| CN101405791B (en) | Apparatus and method for generating audio subband values and apparatus and method for generating time-domain audio samples | |
| Saramaki et al. | Multirate systems and filterbanks | |
| EP1421579B1 (en) | Audio coding with non-uniform filter bank | |
| Jiang et al. | High-performance IIR QMF banks for speech subband coding | |
| Kumar et al. | An improved and simplified approach for designing cosine modulated filter bank using window technique | |
| HK40065946B (en) | Complex exponential modulated filter bank for high frequency reconstruction or parametric stereo | |
| HK40065946A (en) | Complex exponential modulated filter bank for high frequency reconstruction or parametric stereo | |
| HK40088852A (en) | Complex exponential modulated filter bank for high frequency reconstruction or parametric stereo processing | |
| HK40088852B (en) | Complex exponential modulated filter bank for high frequency reconstruction or parametric stereo processing | |
| HK40020890A (en) | Complex exponential modulated filter bank for high frequency reconstruction or parametric stereo processing | |
| HK40020890B (en) | Complex exponential modulated filter bank for high frequency reconstruction or parametric stereo processing | |
| Levine | Critically sampled third octave filter banks | |
| HK1243561B (en) | Low delay modulated filter bank | |
| HK1244112A1 (en) | Complex exponential modulated filter bank for high frequency reconstruction | |
| HK1244112B (en) | Complex exponential modulated filter bank for high frequency reconstruction | |
| HK1218997B (en) | Low delay modulated filter bank | |
| HK1218996B (en) | Low delay modulated filter bank | |
| HK1218592B (en) | Low delay modulated filter bank |
Legal Events
| Date | Code | Title | Description |
|---|---|---|---|
| AK | Designated states |
Kind code of ref document: A1 Designated state(s): CN JP Kind code of ref document: A1 Designated state(s): CN JP US |
|
| AL | Designated countries for regional patents |
Kind code of ref document: A1 Designated state(s): AT BE BG CH CY CZ DE DK EE ES FR GB GR IE IT LU MC NL PT SE SK TR Kind code of ref document: A1 Designated state(s): AT BE BG CH CY CZ DE DK EE ES FI FR GB GR IE IT LU MC NL PT SE SK TR |
|
| WWE | Wipo information: entry into national phase |
Ref document number: 2003522909 Country of ref document: JP |
|
| 121 | Ep: the epo has been informed by wipo that ep was designated in this application | ||
| WWE | Wipo information: entry into national phase |
Ref document number: 2002755519 Country of ref document: EP |
|
| WWE | Wipo information: entry into national phase |
Ref document number: 10487164 Country of ref document: US |
|
| WWE | Wipo information: entry into national phase |
Ref document number: 20028162773 Country of ref document: CN |
|
| WWP | Wipo information: published in national office |
Ref document number: 2002755519 Country of ref document: EP |
|
| WWG | Wipo information: grant in national office |
Ref document number: 2002755519 Country of ref document: EP |


