SG11202003125SA - Apparatus, method and computer program for encoding, decoding, scene processing and other procedures related to dirac based spatial audio coding - Google Patents

Apparatus, method and computer program for encoding, decoding, scene processing and other procedures related to dirac based spatial audio coding

Info

Publication number
SG11202003125SA
SG11202003125SA SG11202003125SA SG11202003125SA SG11202003125SA SG 11202003125S A SG11202003125S A SG 11202003125SA SG 11202003125S A SG11202003125S A SG 11202003125SA SG 11202003125S A SG11202003125S A SG 11202003125SA SG 11202003125S A SG11202003125S A SG 11202003125SA
Authority
SG
Singapore
Prior art keywords
decoding
encoding
computer program
audio coding
spatial audio
Prior art date
Application number
SG11202003125SA
Other languages
English (en)
Inventor
Guillaume Fuchs
Jürgen Herre
Fabian Küch
Stefan Döhla
Markus Multrus
Oliver Thiergart
Oliver Wübbolt
Florin Ghido
Stefan Bayer
Wolfgang Jaegers
Original Assignee
Fraunhofer Ges Forschung
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Fraunhofer Ges Forschung filed Critical Fraunhofer Ges Forschung
Publication of SG11202003125SA publication Critical patent/SG11202003125SA/en

Links

Classifications

    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04RLOUDSPEAKERS, MICROPHONES, GRAMOPHONE PICK-UPS OR LIKE ACOUSTIC ELECTROMECHANICAL TRANSDUCERS; ELECTRIC HEARING AIDS; PUBLIC ADDRESS SYSTEMS
    • H04R5/00Stereophonic arrangements
    • H04R5/04Circuit arrangements, e.g. for selective connection of amplifier inputs/outputs to loudspeakers, for loudspeaker detection, or for adaptation of settings to personal preferences or hearing impairments
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L19/00Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
    • G10L19/04Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using predictive techniques
    • G10L19/16Vocoder architecture
    • G10L19/173Transcoding, i.e. converting between two coded representations avoiding cascaded coding-decoding
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L19/00Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
    • G10L19/008Multichannel audio signal coding or decoding using interchannel correlation to reduce redundancy, e.g. joint-stereo, intensity-coding or matrixing
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L19/00Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
    • G10L19/04Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using predictive techniques
    • G10L19/16Vocoder architecture
    • G10L19/167Audio streaming, i.e. formatting and decoding of an encoded audio signal representation into a data stream for transmission or storage purposes
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04SSTEREOPHONIC SYSTEMS 
    • H04S7/00Indicating arrangements; Control arrangements, e.g. balance control
    • H04S7/30Control circuits for electronic adaptation of the sound field
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04SSTEREOPHONIC SYSTEMS 
    • H04S7/00Indicating arrangements; Control arrangements, e.g. balance control
    • H04S7/30Control circuits for electronic adaptation of the sound field
    • H04S7/302Electronic adaptation of stereophonic sound system to listener position or orientation
    • H04S7/303Tracking of listener position or orientation
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04SSTEREOPHONIC SYSTEMS 
    • H04S7/00Indicating arrangements; Control arrangements, e.g. balance control
    • H04S7/40Visual indication of stereophonic sound image
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04RLOUDSPEAKERS, MICROPHONES, GRAMOPHONE PICK-UPS OR LIKE ACOUSTIC ELECTROMECHANICAL TRANSDUCERS; ELECTRIC HEARING AIDS; PUBLIC ADDRESS SYSTEMS
    • H04R2205/00Details of stereophonic arrangements covered by H04R5/00 but not provided for in any of its subgroups
    • H04R2205/024Positioning of loudspeaker enclosures for spatial sound reproduction

Landscapes

  • Engineering & Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • Acoustics & Sound (AREA)
  • Signal Processing (AREA)
  • Health & Medical Sciences (AREA)
  • Human Computer Interaction (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Computational Linguistics (AREA)
  • Multimedia (AREA)
  • Mathematical Physics (AREA)
  • Stereophonic System (AREA)
  • Compression, Expansion, Code Conversion, And Decoders (AREA)
  • Signal Processing For Digital Recording And Reproducing (AREA)
SG11202003125SA 2017-10-04 2018-10-01 Apparatus, method and computer program for encoding, decoding, scene processing and other procedures related to dirac based spatial audio coding SG11202003125SA (en)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
EP17194816 2017-10-04
PCT/EP2018/076641 WO2019068638A1 (en) 2017-10-04 2018-10-01 APPARATUS, METHOD AND COMPUTER PROGRAM FOR CODING, DECODING, SCENE PROCESSING AND OTHER PROCEDURES RELATED TO DIRAC-BASED SPATIAL AUDIO CODING

Publications (1)

Publication Number Publication Date
SG11202003125SA true SG11202003125SA (en) 2020-05-28

Family

ID=60185972

Family Applications (1)

Application Number Title Priority Date Filing Date
SG11202003125SA SG11202003125SA (en) 2017-10-04 2018-10-01 Apparatus, method and computer program for encoding, decoding, scene processing and other procedures related to dirac based spatial audio coding

Country Status (19)

Country Link
US (3) US11368790B2 (de)
EP (2) EP3975176A3 (de)
JP (2) JP7297740B2 (de)
KR (2) KR102468780B1 (de)
CN (2) CN111630592B (de)
AR (2) AR117384A1 (de)
AU (2) AU2018344830B2 (de)
BR (1) BR112020007486A2 (de)
CA (4) CA3076703C (de)
ES (1) ES2907377T3 (de)
MX (2) MX2020003506A (de)
MY (1) MY202120A (de)
PL (1) PL3692523T3 (de)
PT (1) PT3692523T (de)
RU (1) RU2759160C2 (de)
SG (1) SG11202003125SA (de)
TW (2) TWI700687B (de)
WO (1) WO2019068638A1 (de)
ZA (1) ZA202001726B (de)

Families Citing this family (26)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2019204214A2 (en) 2018-04-16 2019-10-24 Dolby Laboratories Licensing Corporation Methods, apparatus and systems for encoding and decoding of directional sound sources
EP3818524B1 (de) * 2018-07-02 2023-12-13 Dolby Laboratories Licensing Corporation Verfahren und vorrichtungen zur erzeugung oder decodierung eines bitstroms mit immersiven audiosignalen
ES2974219T3 (es) 2018-11-13 2024-06-26 Dolby Laboratories Licensing Corp Procesamiento de audio en servicios de audio inversivos
ES2985934T3 (es) 2018-11-13 2024-11-07 Dolby Laboratories Licensing Corp Representar audio espacial por medio de una señal de audio y metadatos asociados
KR102692707B1 (ko) * 2018-12-07 2024-08-07 프라운호퍼-게젤샤프트 추르 푀르데룽 데어 안제반텐 포르슝 에 파우 낮은 차수, 중간 차수 및 높은 차수 컴포넌트 생성기를 사용하는 DirAC 기반 공간 오디오 코딩과 관련된 인코딩, 디코딩, 장면 처리 및 기타 절차를 위한 장치, 방법 및 컴퓨터 프로그램
US11158335B1 (en) * 2019-03-28 2021-10-26 Amazon Technologies, Inc. Audio beam selection
WO2020217781A1 (ja) * 2019-04-24 2020-10-29 パナソニック インテレクチュアル プロパティ コーポレーション オブ アメリカ 到来方向推定装置、システム、及び、到来方向推定方法
WO2021018378A1 (en) 2019-07-29 2021-02-04 Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. Apparatus, method or computer program for processing a sound field representation in a spatial transform domain
AU2020320270B2 (en) 2019-08-01 2025-10-23 Dolby Laboratories Licensing Corporation Encoding and decoding IVAS bitstreams
GB2586126A (en) * 2019-08-02 2021-02-10 Nokia Technologies Oy MASA with embedded near-far stereo for mobile devices
GB2587335A (en) 2019-09-17 2021-03-31 Nokia Technologies Oy Direction estimation enhancement for parametric spatial audio capture using broadband estimates
US11430451B2 (en) * 2019-09-26 2022-08-30 Apple Inc. Layered coding of audio with discrete objects
IL322658A (en) 2019-10-30 2025-10-01 Dolby Laboratories Licensing Corp Data rate decentralization in embedded voice and audio services
US11636866B2 (en) * 2020-03-24 2023-04-25 Qualcomm Incorporated Transform ambisonic coefficients using an adaptive network
US20210304879A1 (en) * 2020-03-31 2021-09-30 Change Healthcare Holdings Llc Methods, systems, and computer program products for dividing health care service responsibilities between entities
CN111885414B (zh) * 2020-07-24 2023-03-21 腾讯科技(深圳)有限公司 一种数据处理方法、装置、设备及可读存储介质
EP4229630A1 (de) * 2020-10-13 2023-08-23 Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. Vorrichtung und verfahren zur codierung mehrerer audioobjekte unter verwendung von richtungsinformationen während eines downmixings oder vorrichtung und verfahren zur decodierung mit optimierter kovarianzsynthese
CN116648931A (zh) * 2020-10-13 2023-08-25 弗劳恩霍夫应用研究促进协会 在下混期间使用方向信息对多个音频对象进行编码的装置和方法或使用优化的协方差合成进行解码的装置和方法
MX2023004247A (es) * 2020-10-13 2023-06-07 Fraunhofer Ges Forschung Aparato y metodo para codificar una pluralidad de objetos de audio o aparato y metodo para decodificacion usando dos o mas objetos de audio relevantes.
TWI816071B (zh) * 2020-12-09 2023-09-21 宏正自動科技股份有限公司 音訊轉換裝置及音訊處理方法
CN117501362B (zh) * 2021-06-15 2025-05-09 北京字跳网络技术有限公司 音频渲染系统、方法和电子设备
GB2608406A (en) 2021-06-30 2023-01-04 Nokia Technologies Oy Creating spatial audio stream from audio objects with spatial extent
DE112022007568T5 (de) * 2022-09-28 2025-05-08 Mitsubishi Electric Corporation Schallraumkonstruktionseinrichtung, schallraumkonstruktionssystem, programm und schallraumkonstruktionsverfahren
US20240298130A1 (en) * 2023-03-03 2024-09-05 Sony Interactive Entertainment Inc. Systems and methods for generating and applying audio-based basis functions
WO2025054331A1 (en) * 2023-09-05 2025-03-13 Virtuel Works Llc Spatial audio scene description and rendering
WO2025097318A1 (zh) * 2023-11-07 2025-05-15 北京小米移动软件有限公司 一种音频信号编解码方法及装置、通信系统、通信设备、存储介质

Family Cites Families (28)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
TW432806B (en) * 1996-12-09 2001-05-01 Matsushita Electric Industrial Co Ltd Audio decoding device
US8872979B2 (en) 2002-05-21 2014-10-28 Avaya Inc. Combined-media scene tracking for audio-video summarization
TW200742359A (en) * 2006-04-28 2007-11-01 Compal Electronics Inc Internet communication system
US9014377B2 (en) * 2006-05-17 2015-04-21 Creative Technology Ltd Multichannel surround format conversion and generalized upmix
US20080004729A1 (en) * 2006-06-30 2008-01-03 Nokia Corporation Direct encoding into a directional audio coding format
US9015051B2 (en) 2007-03-21 2015-04-21 Fraunhofer-Gesellschaft Zur Foerderung Der Angewandten Forschung E.V. Reconstruction of audio channels with direction parameters indicating direction of origin
US8290167B2 (en) * 2007-03-21 2012-10-16 Fraunhofer-Gesellschaft Zur Foerderung Der Angewandten Forschung E.V. Method and apparatus for conversion between multi-channel audio formats
US8509454B2 (en) * 2007-11-01 2013-08-13 Nokia Corporation Focusing on a portion of an audio scene for an audio signal
US20110002469A1 (en) * 2008-03-03 2011-01-06 Nokia Corporation Apparatus for Capturing and Rendering a Plurality of Audio Channels
EP2154911A1 (de) * 2008-08-13 2010-02-17 Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. Vorrichtung zur Bestimmung eines räumlichen Mehrkanalausgangsaudiosignals
EP2154910A1 (de) * 2008-08-13 2010-02-17 Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. Vorrichtung zum Mischen von Raumtonströmen
ES2425814T3 (es) * 2008-08-13 2013-10-17 Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. Aparato para determinar una señal de audio espacial convertida
US8504184B2 (en) * 2009-02-04 2013-08-06 Panasonic Corporation Combination device, telecommunication system, and combining method
EP2249334A1 (de) 2009-05-08 2010-11-10 Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. Audioformat-Transkodierer
WO2011104418A1 (en) * 2010-02-26 2011-09-01 Nokia Corporation Modifying spatial image of a plurality of audio signals
DE102010030534A1 (de) * 2010-06-25 2011-12-29 Iosono Gmbh Vorrichtung zum Veränderung einer Audio-Szene und Vorrichtung zum Erzeugen einer Richtungsfunktion
EP2448289A1 (de) 2010-10-28 2012-05-02 Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. Vorrichtung und Verfahren zur Ableitung einer direktionalen Information und Systeme
EP2464146A1 (de) * 2010-12-10 2012-06-13 Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. Vorrichtung und Verfahren zur Dekomposition eines Eingabesignals mit einer im Voraus berechneten Bezugskurve
EP2600343A1 (de) 2011-12-02 2013-06-05 Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. Vorrichtung und Verfahren zum Mischen von Raumtoncodierungsstreams auf Geometriebasis
US9955280B2 (en) * 2012-04-19 2018-04-24 Nokia Technologies Oy Audio scene apparatus
US9190065B2 (en) * 2012-07-15 2015-11-17 Qualcomm Incorporated Systems, methods, apparatus, and computer-readable media for three-dimensional audio coding using basis function coefficients
CN103236255A (zh) * 2013-04-03 2013-08-07 广西环球音乐图书有限公司 音频文件转化midi文件
DE102013105375A1 (de) 2013-05-24 2014-11-27 Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. Tonsignalerzeuger, Verfahren und Computerprogramm zum Bereitstellen eines Tonsignals
US9847088B2 (en) * 2014-08-29 2017-12-19 Qualcomm Incorporated Intermediate compression for higher order ambisonic audio data
KR101993348B1 (ko) * 2014-09-24 2019-06-26 한국전자통신연구원 동적 포맷 변환을 지원하는 오디오 메타데이터 제공 장치 및 오디오 데이터 재생 장치, 상기 장치가 수행하는 방법 그리고 상기 동적 포맷 변환들이 기록된 컴퓨터에서 판독 가능한 기록매체
US9983139B2 (en) 2014-11-10 2018-05-29 Donald Channing Cooper Modular illumination and sensor chamber
US9794721B2 (en) * 2015-01-30 2017-10-17 Dts, Inc. System and method for capturing, encoding, distributing, and decoding immersive audio
CN104768053A (zh) 2015-04-15 2015-07-08 冯山泉 一种基于流分解和流重组的格式转换方法及系统

Also Published As

Publication number Publication date
US11729554B2 (en) 2023-08-15
RU2020115048A3 (de) 2021-11-08
JP7564295B2 (ja) 2024-10-08
JP2020536286A (ja) 2020-12-10
CN117395593A (zh) 2024-01-12
CA3076703C (en) 2024-01-02
CA3076703A1 (en) 2019-04-11
CA3219540A1 (en) 2019-04-11
EP3975176A3 (de) 2022-07-27
ES2907377T3 (es) 2022-04-25
EP3692523A1 (de) 2020-08-12
TWI700687B (zh) 2020-08-01
AU2021290361B2 (en) 2024-02-22
MY202120A (en) 2024-04-04
US20200221230A1 (en) 2020-07-09
AR117384A1 (es) 2021-08-04
RU2020115048A (ru) 2021-11-08
CA3219566C (en) 2025-10-07
CN111630592A (zh) 2020-09-04
AR125562A2 (es) 2023-07-26
TW202016925A (zh) 2020-05-01
ZA202001726B (en) 2021-10-27
CA3219566A1 (en) 2019-04-11
RU2759160C2 (ru) 2021-11-09
JP7297740B2 (ja) 2023-06-26
CA3219540C (en) 2025-10-07
TW201923744A (zh) 2019-06-16
EP3975176A2 (de) 2022-03-30
US12058501B2 (en) 2024-08-06
AU2021290361A1 (en) 2022-02-03
MX2020003506A (es) 2020-07-22
AU2018344830A1 (en) 2020-05-21
MX2024003251A (es) 2024-04-04
PT3692523T (pt) 2022-03-02
WO2019068638A1 (en) 2019-04-11
AU2018344830A8 (en) 2020-06-18
US11368790B2 (en) 2022-06-21
EP3692523B1 (de) 2021-12-22
KR102468780B1 (ko) 2022-11-21
CN111630592B (zh) 2023-10-27
AU2018344830B2 (en) 2021-09-23
JP2023126225A (ja) 2023-09-07
KR102700687B1 (ko) 2024-08-30
BR112020007486A2 (pt) 2020-10-27
CA3134343A1 (en) 2019-04-11
KR20200053614A (ko) 2020-05-18
US20220150635A1 (en) 2022-05-12
TWI834760B (zh) 2024-03-11
US20220150633A1 (en) 2022-05-12
PL3692523T3 (pl) 2022-05-02
KR20220133311A (ko) 2022-10-04

Similar Documents

Publication Publication Date Title
ZA202001726B (en) Apparatus, method and computer program for encoding, decoding, scene processing and other procedures related to dirac based spatial audio coding
ZA202103739B (en) Apparatus, method and computer program for encoding, decoding, scene processing and other procedures related to dirac based spatial audio coding using direct component compensation
PL3695602T3 (pl) Urządzenie, sposób i program komputerowy do kodowania i dekodowania wideo
ZA201802567B (en) An apparatus, a method and a computer program for video coding and decoding
EP3539291A4 (de) Vorrichtung, verfahren und computerprogramm zur videocodierung und -decodierung
GB201500400D0 (en) An apparatus, a method and a computer program for video coding and decoding
EP3535977A4 (de) Vorrichtung, verfahren und computerprogramm zur codierung und decodierung von videoinhalten
PL3346709T3 (pl) Urządzenie, sposób i program komputerowy do kodowania oraz dekodowania wideo
EP3566445A4 (de) Vorrichtung, verfahren und computerprogramm zur videocodierung und -decodierung
EP3275191A4 (de) Vorrichtung, verfahren und computerprogramm zur codierung und decodierung von videoinhalten
EP3311572A4 (de) Vorrichtung, verfahren und computerprogramm zur videocodierung und -decodierung
EP3364655A4 (de) Verfahren und vorrichtung zur videodecodierung sowie verfahren und vorrichtung zur videocodierung
ZA201601010B (en) Apparatus, method and computer program for decoding an encoded audio signal
EP3022917A4 (de) Verfahren, vorrichtung und computerprogrammprodukt zur codierung und decodierung von videos
GB201313113D0 (en) An apparatus, a method and a computer program for video coding and decoding
GB201312460D0 (en) An apparatus, a method and a computer program for video coding and decoding
GB201616808D0 (en) An apparatus, a method and a computer program for video coding and decoding
GB201508620D0 (en) An apparatus, a method and a computer program for video coding and decoding
SG11201505278TA (en) An apparatus, a method and a computer program for video coding and decoding
EP3348061A4 (de) Vorrichtung, verfahren und computerprogramm zur codierung und decodierung von videoinhalten
GB201601489D0 (en) Apparatus, methods and computer computer programs for encoding and decoding audio signals
GB201707792D0 (en) An apparatus, a method and a computer program for video coding and decoding
IL275034A (en) Method and apparatus for encoding and decoding video based on block shape
EP3580935A4 (de) Vorrichtung, verfahren und computerprogramm zur videocodierung und -decodierung
IL273437A (en) Method and device for video decoding, and method and device for video encoding