WO2008035275A3 - Encoding and decoding of audio objects - Google Patents

Encoding and decoding of audio objects Download PDF

Info

Publication number
WO2008035275A3
WO2008035275A3 PCT/IB2007/053748 IB2007053748W WO2008035275A3 WO 2008035275 A3 WO2008035275 A3 WO 2008035275A3 IB 2007053748 W IB2007053748 W IB 2007053748W WO 2008035275 A3 WO2008035275 A3 WO 2008035275A3
Authority
WO
WIPO (PCT)
Prior art keywords
audio objects
encoding
encoder
audio
decoder
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Ceased
Application number
PCT/IB2007/053748
Other languages
French (fr)
Other versions
WO2008035275A2 (en
Inventor
Dirk J Breebaart
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Koninklijke Philips NV
Original Assignee
Koninklijke Philips Electronics NV
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Koninklijke Philips Electronics NV filed Critical Koninklijke Philips Electronics NV
Priority to DE602007012730T priority Critical patent/DE602007012730D1/en
Priority to JP2009527954A priority patent/JP5281575B2/en
Priority to BRPI0716854-3A priority patent/BRPI0716854B1/en
Priority to US12/441,538 priority patent/US8271290B2/en
Priority to AT07826410T priority patent/ATE499677T1/en
Priority to KR1020097007892A priority patent/KR101396140B1/en
Priority to CN2007800345382A priority patent/CN101517637B/en
Priority to MX2009002795A priority patent/MX2009002795A/en
Priority to EP07826410A priority patent/EP2067138B1/en
Priority to PL07826410T priority patent/PL2067138T3/en
Publication of WO2008035275A2 publication Critical patent/WO2008035275A2/en
Publication of WO2008035275A3 publication Critical patent/WO2008035275A3/en
Anticipated expiration legal-status Critical
Ceased legal-status Critical Current

Links

Classifications

    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L19/00Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
    • G10L19/04Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using predictive techniques
    • G10L19/16Vocoder architecture
    • G10L19/18Vocoders using multiple modes
    • G10L19/20Vocoders using multiple modes using sound class specific coding, hybrid encoders or object based coding
    • HELECTRICITY
    • H03ELECTRONIC CIRCUITRY
    • H03MCODING; DECODING; CODE CONVERSION IN GENERAL
    • H03M7/00Conversion of a code where information is represented by a given sequence or number of digits to a code where the same, similar or subset of information is represented by a different sequence or number of digits
    • H03M7/30Compression; Expansion; Suppression of unnecessary data, e.g. redundancy reduction
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04SSTEREOPHONIC SYSTEMS 
    • H04S3/00Systems employing more than two channels, e.g. quadraphonic
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L19/00Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
    • G10L19/008Multichannel audio signal coding or decoding using interchannel correlation to reduce redundancy, e.g. joint-stereo, intensity-coding or matrixing

Landscapes

  • Engineering & Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • Signal Processing (AREA)
  • Acoustics & Sound (AREA)
  • Multimedia (AREA)
  • Human Computer Interaction (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Health & Medical Sciences (AREA)
  • Computational Linguistics (AREA)
  • Mathematical Physics (AREA)
  • Theoretical Computer Science (AREA)
  • Stereophonic System (AREA)
  • Reduction Or Emphasis Of Bandwidth Of Signals (AREA)
  • Circuit For Audible Band Transducer (AREA)
  • Telephonic Communication Services (AREA)
  • Telephone Function (AREA)
  • Compression, Expansion, Code Conversion, And Decoders (AREA)

Abstract

An audio system comprises an encoder (209) which encodes audio objects in an encoding unit (403) that generates a down-mix audio signal and parametric data representing the plurality of audio objects. The down-mix audio signal and parametric data is transmitted to a decoder (215) which comprises a decoding unit (301) which generates approximate replicas of the audio objects and a rendering unit (303) which generates an output signal from the audio objects. The decoder (215) furthermore contains a processor (501) for generating encoding modification data which is sent to the encoder (209). The encoder (209) then modifies the encoding of the audio objects, and in particular modifies the parametric data, in response to the encoding modification data. The approach allows manipulation of the audio objects to be controlled by the decoder (215) but performed fully or partly by the encoder (209). Thus, the manipulation may be performed on the actual independent audio objects rather than on approximate replicas thereby providing improved performance.
PCT/IB2007/053748 2006-09-18 2007-09-17 Encoding and decoding of audio objects Ceased WO2008035275A2 (en)

Priority Applications (10)

Application Number Priority Date Filing Date Title
DE602007012730T DE602007012730D1 (en) 2006-09-18 2007-09-17 CODING AND DECODING AUDIO OBJECTS
JP2009527954A JP5281575B2 (en) 2006-09-18 2007-09-17 Audio object encoding and decoding
BRPI0716854-3A BRPI0716854B1 (en) 2006-09-18 2007-09-17 ENCODER FOR ENCODING AUDIO OBJECTS, DECODER FOR DECODING AUDIO OBJECTS, TELECONFERENCE DISTRIBUTOR CENTER, AND METHOD FOR DECODING AUDIO SIGNALS
US12/441,538 US8271290B2 (en) 2006-09-18 2007-09-17 Encoding and decoding of audio objects
AT07826410T ATE499677T1 (en) 2006-09-18 2007-09-17 ENCODING AND DECODING AUDIO OBJECTS
KR1020097007892A KR101396140B1 (en) 2006-09-18 2007-09-17 Encoding and decoding of audio objects
CN2007800345382A CN101517637B (en) 2006-09-18 2007-09-17 Audio codec, codec method, hub, transmitter receiver, transmitter and receiver method, communication system, playback device
MX2009002795A MX2009002795A (en) 2006-09-18 2007-09-17 Encoding and decoding of audio objects.
EP07826410A EP2067138B1 (en) 2006-09-18 2007-09-17 Encoding and decoding of audio objects
PL07826410T PL2067138T3 (en) 2006-09-18 2007-09-17 Encoding and decoding of audio objects

Applications Claiming Priority (4)

Application Number Priority Date Filing Date Title
EP06120819.5 2006-09-18
EP06120819 2006-09-18
EP06123799.6 2006-11-10
EP06123799 2006-11-10

Publications (2)

Publication Number Publication Date
WO2008035275A2 WO2008035275A2 (en) 2008-03-27
WO2008035275A3 true WO2008035275A3 (en) 2008-05-29

Family

ID=39079648

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/IB2007/053748 Ceased WO2008035275A2 (en) 2006-09-18 2007-09-17 Encoding and decoding of audio objects

Country Status (12)

Country Link
US (1) US8271290B2 (en)
EP (1) EP2067138B1 (en)
JP (1) JP5281575B2 (en)
KR (1) KR101396140B1 (en)
CN (1) CN101517637B (en)
AT (1) ATE499677T1 (en)
BR (1) BRPI0716854B1 (en)
DE (1) DE602007012730D1 (en)
MX (1) MX2009002795A (en)
PL (1) PL2067138T3 (en)
RU (1) RU2460155C2 (en)
WO (1) WO2008035275A2 (en)

Families Citing this family (51)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US8959016B2 (en) 2002-09-27 2015-02-17 The Nielsen Company (Us), Llc Activating functions in processing devices using start codes embedded in audio
US9711153B2 (en) 2002-09-27 2017-07-18 The Nielsen Company (Us), Llc Activating functions in processing devices using encoded audio and detecting audio signatures
CA2621175C (en) 2005-09-13 2015-12-22 Srs Labs, Inc. Systems and methods for audio processing
EP2005787B1 (en) 2006-04-03 2012-01-25 Srs Labs, Inc. Audio signal processing
MX2008012315A (en) * 2006-09-29 2008-10-10 Lg Electronics Inc Methods and apparatuses for encoding and decoding object-based audio signals.
WO2008060111A1 (en) 2006-11-15 2008-05-22 Lg Electronics Inc. A method and an apparatus for decoding an audio signal
EP2102855A4 (en) 2006-12-07 2010-07-28 Lg Electronics Inc A method and an apparatus for decoding an audio signal
WO2008069593A1 (en) 2006-12-07 2008-06-12 Lg Electronics Inc. A method and an apparatus for processing an audio signal
KR101230691B1 (en) 2008-07-10 2013-02-07 한국전자통신연구원 Method and apparatus for editing audio object in multi object audio coding based spatial information
WO2010005264A2 (en) * 2008-07-10 2010-01-14 한국전자통신연구원 Method and apparatus for editing audio object in spatial information-based multi-object audio coding apparatus
CN102138176B (en) * 2008-07-11 2013-11-06 日本电气株式会社 Signal analyzing device, signal control device, and method therefor
MX2011011399A (en) * 2008-10-17 2012-06-27 Univ Friedrich Alexander Er Audio coding using downmix.
US9667365B2 (en) 2008-10-24 2017-05-30 The Nielsen Company (Us), Llc Methods and apparatus to perform audio watermarking and watermark detection and extraction
US8359205B2 (en) 2008-10-24 2013-01-22 The Nielsen Company (Us), Llc Methods and apparatus to perform audio watermarking and watermark detection and extraction
US8121830B2 (en) * 2008-10-24 2012-02-21 The Nielsen Company (Us), Llc Methods and apparatus to extract data encoded in media content
US8508357B2 (en) 2008-11-26 2013-08-13 The Nielsen Company (Us), Llc Methods and apparatus to encode and decode audio for shopper location and advertisement presentation tracking
JP5274359B2 (en) * 2009-04-27 2013-08-28 三菱電機株式会社 3D video and audio recording method, 3D video and audio playback method, 3D video and audio recording device, 3D video and audio playback device, 3D video and audio recording medium
CA3008502C (en) 2009-05-01 2020-11-10 The Nielsen Company (Us), Llc Methods, apparatus and articles of manufacture to provide secondary content in association with primary broadcast media content
US20100324915A1 (en) * 2009-06-23 2010-12-23 Electronic And Telecommunications Research Institute Encoding and decoding apparatuses for high quality multi-channel audio codec
KR101805212B1 (en) 2009-08-14 2017-12-05 디티에스 엘엘씨 Object-oriented audio streaming system
EP2323130A1 (en) * 2009-11-12 2011-05-18 Koninklijke Philips Electronics N.V. Parametric encoding and decoding
CN101877643B (en) * 2010-06-29 2014-12-10 中兴通讯股份有限公司 Multipoint sound-mixing distant view presenting method, device and system
TWI896112B (en) * 2010-12-03 2025-09-01 美商杜比實驗室特許公司 Audio decoding device, audio decoding method, and audio encoding method
US9165558B2 (en) 2011-03-09 2015-10-20 Dts Llc System for dynamically creating and rendering audio objects
CN103050124B (en) 2011-10-13 2016-03-30 华为终端有限公司 Sound mixing method, Apparatus and system
WO2014035864A1 (en) 2012-08-31 2014-03-06 Dolby Laboratories Licensing Corporation Processing audio objects in principal and supplementary encoded audio signals
CN103152500B (en) * 2013-02-21 2015-06-24 黄文明 Method for eliminating echo from multi-party call
US9786286B2 (en) 2013-03-29 2017-10-10 Dolby Laboratories Licensing Corporation Methods and apparatuses for generating and using low-resolution preview tracks with high-quality encoded object and multichannel audio signals
US9559651B2 (en) * 2013-03-29 2017-01-31 Apple Inc. Metadata for loudness and dynamic range control
US9558785B2 (en) 2013-04-05 2017-01-31 Dts, Inc. Layered audio coding and transmission
BR112015029113B1 (en) * 2013-05-24 2022-03-22 Dolby International Ab Method for encoding audio objects as a data stream, method for reconstructing audio objects based on a data stream, and decoder for reconstructing audio objects based on a data stream
MY204539A (en) 2013-05-24 2024-09-03 Dolby Int Ab Coding of audio scenes
RU2630754C2 (en) * 2013-05-24 2017-09-12 Долби Интернешнл Аб Effective coding of sound scenes containing sound objects
CN105229731B (en) 2013-05-24 2017-03-15 杜比国际公司 Reconstruct according to lower mixed audio scene
BR112015028914B1 (en) * 2013-05-24 2021-12-07 Dolby International Ab METHOD AND APPARATUS TO RECONSTRUCT A TIME/FREQUENCY BLOCK OF AUDIO OBJECTS N, METHOD AND ENCODER TO GENERATE AT LEAST ONE WEIGHTING PARAMETER, AND COMPUTER-READable MEDIUM
JP6396452B2 (en) * 2013-10-21 2018-09-26 ドルビー・インターナショナル・アーベー Audio encoder and decoder
EP2879131A1 (en) * 2013-11-27 2015-06-03 Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. Decoder, encoder and method for informed loudness estimation in object-based audio coding systems
CN104882145B (en) * 2014-02-28 2019-10-29 杜比实验室特许公司 It is clustered using the audio object of the time change of audio object
US10037202B2 (en) 2014-06-03 2018-07-31 Microsoft Technology Licensing, Llc Techniques to isolating a portion of an online computing service
CN105336339B (en) 2014-06-03 2019-05-03 华为技术有限公司 Method and device for processing speech and audio signals
US9510125B2 (en) * 2014-06-20 2016-11-29 Microsoft Technology Licensing, Llc Parametric wave field coding for real-time sound propagation for dynamic sources
CN105989845B (en) * 2015-02-25 2020-12-08 杜比实验室特许公司 Video Content Assisted Audio Object Extraction
CN107358959B (en) * 2016-05-10 2021-10-26 华为技术有限公司 Coding method and coder for multi-channel signal
CN109479178B (en) 2016-07-20 2021-02-26 杜比实验室特许公司 Audio object aggregation based on renderer awareness perception differences
US11074921B2 (en) * 2017-03-28 2021-07-27 Sony Corporation Information processing device and information processing method
US10602296B2 (en) * 2017-06-09 2020-03-24 Nokia Technologies Oy Audio object adjustment for phase compensation in 6 degrees of freedom audio
US10602298B2 (en) 2018-05-15 2020-03-24 Microsoft Technology Licensing, Llc Directional propagation
US10932081B1 (en) 2019-08-22 2021-02-23 Microsoft Technology Licensing, Llc Bidirectional propagation of sound
CN111462767B (en) * 2020-04-10 2024-01-09 全景声科技南京有限公司 Incremental coding method and device for audio signal
US11662975B2 (en) 2020-10-06 2023-05-30 Tencent America LLC Method and apparatus for teleconference
WO2022248620A1 (en) 2021-05-27 2022-12-01 Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. Encoding and decoding of acoustic environment

Citations (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20030026441A1 (en) * 2001-05-04 2003-02-06 Christof Faller Perceptual synthesis of auditory scenes
US20030035553A1 (en) * 2001-08-10 2003-02-20 Frank Baumgarte Backwards-compatible perceptual coding of spatial cues

Family Cites Families (13)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US6782360B1 (en) * 1999-09-22 2004-08-24 Mindspeed Technologies, Inc. Gain quantization for a CELP speech coder
SE0202159D0 (en) * 2001-07-10 2002-07-09 Coding Technologies Sweden Ab Efficientand scalable parametric stereo coding for low bitrate applications
JP2003188731A (en) * 2001-12-18 2003-07-04 Yrp Mobile Telecommunications Key Tech Res Lab Co Ltd Variable rate coding method, coding device and decoding device
AU2003269551A1 (en) * 2002-10-15 2004-05-04 Electronics And Telecommunications Research Institute Method for generating and consuming 3d audio scene with extended spatiality of sound source
CN1748247B (en) * 2003-02-11 2011-06-15 皇家飞利浦电子股份有限公司 Audio coding
DE10344638A1 (en) * 2003-08-04 2005-03-10 Fraunhofer Ges Forschung Generation, storage or processing device and method for representation of audio scene involves use of audio signal processing circuit and display device and may use film soundtrack
JP2005352396A (en) * 2004-06-14 2005-12-22 Matsushita Electric Ind Co Ltd Acoustic signal encoding apparatus and acoustic signal decoding apparatus
JP4892184B2 (en) * 2004-10-14 2012-03-07 パナソニック株式会社 Acoustic signal encoding apparatus and acoustic signal decoding apparatus
DE102005008369A1 (en) * 2005-02-23 2006-09-07 Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. Apparatus and method for simulating a wave field synthesis system
US7974422B1 (en) * 2005-08-25 2011-07-05 Tp Lab, Inc. System and method of adjusting the sound of multiple audio objects directed toward an audio output device
KR20080093422A (en) * 2006-02-09 2008-10-21 엘지전자 주식회사 Object-based audio signal encoding and decoding method and apparatus therefor
ATE527833T1 (en) * 2006-05-04 2011-10-15 Lg Electronics Inc IMPROVE STEREO AUDIO SIGNALS WITH REMIXING
US8295494B2 (en) * 2007-08-13 2012-10-23 Lg Electronics Inc. Enhancing audio with remixing capability

Patent Citations (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20030026441A1 (en) * 2001-05-04 2003-02-06 Christof Faller Perceptual synthesis of auditory scenes
US20030035553A1 (en) * 2001-08-10 2003-02-20 Frank Baumgarte Backwards-compatible perceptual coding of spatial cues

Non-Patent Citations (1)

* Cited by examiner, † Cited by third party
Title
BREEBART J ET AL: "MPEG Spatial Audio Coding / MPEG surround: Overview and Current Status", AUDIO ENGINEERING SOCIETY CONVENTION PAPER, NEW YORK, NY, US, 7 October 2005 (2005-10-07), pages 1 - 17, XP002379094 *

Also Published As

Publication number Publication date
US20090326960A1 (en) 2009-12-31
PL2067138T3 (en) 2011-07-29
EP2067138A2 (en) 2009-06-10
RU2460155C2 (en) 2012-08-27
MX2009002795A (en) 2009-04-01
BRPI0716854A2 (en) 2013-10-01
WO2008035275A2 (en) 2008-03-27
ATE499677T1 (en) 2011-03-15
EP2067138B1 (en) 2011-02-23
CN101517637B (en) 2012-08-15
KR20090080945A (en) 2009-07-27
JP5281575B2 (en) 2013-09-04
DE602007012730D1 (en) 2011-04-07
CN101517637A (en) 2009-08-26
KR101396140B1 (en) 2014-05-20
BRPI0716854B1 (en) 2020-09-15
JP2010503887A (en) 2010-02-04
RU2009114741A (en) 2010-10-27
BRPI0716854A8 (en) 2019-01-15
US8271290B2 (en) 2012-09-18

Similar Documents

Publication Publication Date Title
WO2008035275A3 (en) Encoding and decoding of audio objects
US11562758B2 (en) System and method for processing audio data into a plurality of frequency components
WO2007102782A3 (en) Methods and arrangements for audio coding and decoding
WO2008084427A3 (en) Audio decoder
EP4583541A3 (en) Methods, apparatus and systems for a pre-rendered signal for audio rendering
MX2010004220A (en) Audio coding using downmix.
ATE456261T1 (en) AUDIO CODING AND AUDIO DECODING
WO2007005750A3 (en) Method, apparatus and system for use in multimedia signal encoding
NO20070560L (en) Multi-channel synthesizer and method for generating a multi-channel output signal.
MY165328A (en) Audio signal decoder, audio signal encoder, method for providing an upmix signal representation, method for providing a downmix signal representation, computer program and bitstream using a common inter-object-correlation parameter value
MX2011011399A (en) Audio coding using downmix.
BRPI0608945B8 (en) multi-channel audio encoder, multi-channel audio decoder, method of encoding n audio signals into m audio signals and associated parametric data, method of decoding k audio signals and associated parametric data, method of transmitting and receiving an encoded multi-channel audio signal, computer-readable storage media, and broadcast system
ATE406651T1 (en) AUDIO CODING AND AUDIO DECODING
TW200628002A (en) Method, device, encoder apparatus, decoder apparatus and audio system
DE602005006777D1 (en) MULTI-CHANNEL CODER
KR102710843B1 (en) Audio coding/decoding apparatus using reverberation signal of object audio signal
ATE524808T1 (en) METHOD FOR ENCODING A SOURCE AUDIO SIGNAL, CORRESPONDING ENCODING DEVICE, DECODING METHOD AND DEVICE, SIGNAL AND COMPUTER PROGRAM PRODUCTS
PL1902443T3 (en) Audio encoding and decoding
KR102335911B1 (en) Audio coding/decoding apparatus using reverberation signal of object audio signal
TW200616443A (en) Video processing device and method thereof
TW200732960A (en) Method and apparatus for decoding a signal

Legal Events

Date Code Title Description
WWE Wipo information: entry into national phase

Ref document number: 200780034538.2

Country of ref document: CN

121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 07826410

Country of ref document: EP

Kind code of ref document: A2

WWE Wipo information: entry into national phase

Ref document number: 2007826410

Country of ref document: EP

ENP Entry into the national phase

Ref document number: 2009527954

Country of ref document: JP

Kind code of ref document: A

WWE Wipo information: entry into national phase

Ref document number: MX/A/2009/002795

Country of ref document: MX

WWE Wipo information: entry into national phase

Ref document number: 12441538

Country of ref document: US

NENP Non-entry into the national phase

Ref country code: DE

WWE Wipo information: entry into national phase

Ref document number: 1970/CHENP/2009

Country of ref document: IN

WWE Wipo information: entry into national phase

Ref document number: 1020097007892

Country of ref document: KR

ENP Entry into the national phase

Ref document number: 2009114741

Country of ref document: RU

Kind code of ref document: A

ENP Entry into the national phase

Ref document number: PI0716854

Country of ref document: BR

Kind code of ref document: A2

Effective date: 20090316