WO2017019781A1 - System and method for spatial processing of soundfield signals - Google Patents

System and method for spatial processing of soundfield signals Download PDF

Info

Publication number
WO2017019781A1
WO2017019781A1 PCT/US2016/044286 US2016044286W WO2017019781A1 WO 2017019781 A1 WO2017019781 A1 WO 2017019781A1 US 2016044286 W US2016044286 W US 2016044286W WO 2017019781 A1 WO2017019781 A1 WO 2017019781A1
Authority
WO
WIPO (PCT)
Prior art keywords
signal
soundfield
arrival
input
output
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Ceased
Application number
PCT/US2016/044286
Other languages
English (en)
French (fr)
Inventor
David S. Mcgrath
Rhonda Wilson
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Dolby Laboratories Licensing Corp
Original Assignee
Dolby Laboratories Licensing Corp
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Dolby Laboratories Licensing Corp filed Critical Dolby Laboratories Licensing Corp
Priority to CN201680043670.9A priority Critical patent/CN107851432B/zh
Priority to EP16747709.0A priority patent/EP3329485B1/de
Priority to CN202111507803.2A priority patent/CN114302315B/zh
Priority to US15/746,787 priority patent/US10932078B2/en
Publication of WO2017019781A1 publication Critical patent/WO2017019781A1/en
Anticipated expiration legal-status Critical
Priority to US17/166,162 priority patent/US11381927B2/en
Ceased legal-status Critical Current

Links

Classifications

    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04SSTEREOPHONIC SYSTEMS 
    • H04S3/00Systems employing more than two channels, e.g. quadraphonic
    • H04S3/02Systems employing more than two channels, e.g. quadraphonic of the matrix type, i.e. in which input signals are combined algebraically, e.g. after having been phase shifted with respect to each other
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10KSOUND-PRODUCING DEVICES; METHODS OR DEVICES FOR PROTECTING AGAINST, OR FOR DAMPING, NOISE OR OTHER ACOUSTIC WAVES IN GENERAL; ACOUSTICS NOT OTHERWISE PROVIDED FOR
    • G10K15/00Acoustics not otherwise provided for
    • G10K15/08Arrangements for producing a reverberation or echo sound
    • G10K15/12Arrangements for producing a reverberation or echo sound using electronic time-delay networks
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04SSTEREOPHONIC SYSTEMS 
    • H04S2420/00Techniques used stereophonic systems covered by H04S but not provided for in its groups
    • H04S2420/11Application of ambisonics in stereophonic audio systems

Definitions

  • the present invention provides for systems and methods for the input of an audio soundfield signal and the creation of a reverberant acoustic equivalent soundfield signal.
  • Multi-channel audio signals are used to store or transport a listening experience, for an end listener, that may include the impression of a very complex acoustic scene.
  • the multi-channel signals may carry the information that describes the acoustic scene using a number of common conventions including, but not limited to, the following:
  • Discrete Speaker Channels The audio scene may have been rendered in some way, to form speaker channels which, when played back on the appropriate arrangement of loudspeakers, create the illusion of the desired acoustic scene.
  • Examples of Discrete Speaker Formats include stereo, 5.1 or 7.1 speaker signals, as used in many sound formats today.
  • the audio scene may be represented as one or more object audio channels which, when rendered by the listener's playback equipment, can re-create the acoustic scene.
  • each audio object will be accompanied by metadata (implicit or explicit) that is used by the renderer to pan the object to the appropriate "location" in the listener's playback environment.
  • Audio Object Formats include Dolby Atmos (Trade Mark), which is used in the carriage of rich sound-tracks on Blu-Ray Disc and other motion picture delivery formats.
  • the audio scene may be represented by a Soundfield Format - a set of two or more audio signals that collectively contain one or more audio objects with the spatial location of each object "encoded" in the Spatial Format in the form of panning gains.
  • Soundfield Formats include Ambisonics, and Higher Order Ambisonics (both of which are well known in the art). Example systems are described in Gerzon, M.A., Periphony: With-Height Sound Reproduction. J. Audio Eng. Soc, 1973. 21(1): p. 2-10, and 3D Sound Field Recording with Higher Order Ambisonics-Objective Measurements and Validation of Spherical Microphone S Bertet, J Daniel, S Moreau - Audio Engineering Society Convention 120, 2006
  • a method for creating an output soundfield signal from an input soundfield signal including the steps of: (a) forming at least one delayed signals from the input soundfield signal, (b) for each of the delayed signals, creating an acoustically transformed delayed signal, by an acoustic transformation process, and (c) combining together the acoustically transformed delayed signals and the input soundfield signal to produce the output soundfield signal
  • the acoustic transformation process utilises a multi-channel matrix mixer.
  • the multi-channel matrix mixer can be formed by combining one or more spatial operations, including a spatial rotation operation.
  • the multi-channel matrix mixer can be formed by combining one or more spatial operations, including a spatial mirror operation.
  • the multi-channel matrix mixer can be formed by combining one or more spatial operations, including a directional gain operation.
  • the multi-channel matrix mixer can be formed by combining one or more spatial operations, including a directional permutation operation.
  • the acoustic transformation process preferably can include frequency-dependant filtering.
  • each simulated echo can comprise a delayed and rotated copy of the input sound field signal.
  • each simulated echo preferably can include substantially the same delay.
  • the alternative direction of arrival can comprise a geometric transformation of the first direction of arrival.
  • a system for processing of soundfield signals to simulate the presence of reverberance including: an input unit for the input of a soundfield encoded signal; a tapped delay line for interconnected to the input unit and providing a series of tapped delays of the soundfield encoded signal; a series of acoustic transformation units interconnected to the output taps of the tapped delay line, for applying an acoustic transformation to the output taps to produce transformed delayed outputs; and a combining unit for combining the transformed delayed outputs into an output soundfield signal.
  • the acoustic transformation units can include: a multi channel matrix multiplier for applying a geometric transformation to an output tap to produce a geometric transformed output; and a series of linear audio filters applied to each channel of the geometric transformed output.
  • Fig. 1 illustrates schematically an audio object, at direction ⁇ /) m , and an echo at direction ⁇ ' .
  • Fig. 2 is a schematic block diagram of a tapped delay line.
  • Fig. 3 is a schematic block diagram of an echo processor.
  • Fig. 4 is a schematic block diagram of an echo processor with direction-dependant filtering.
  • Fig. 5 illustrates an alternative form of an echo processor.
  • the preferred embodiments provide for a system and method which, given that an input soundfield signal contains audio components that are encoded with different directions of arrival, produces an output soundfield signal that will contain simulated echoes, such that each simulated echo will have a direction of arrival that is a function of the direction of arrival of the original audio component as it appeared in the input signal.
  • the output soundfield signal thereby provides for reverberance and other simulated audio effects.
  • N -channel Soundfield Format is often defined by it's panning function, P N (0) .
  • G— ⁇ ⁇ ( ⁇ ) where G is an [N X l] column vector of gain values, and defines the spatial location of the object, i.e:
  • a set of M objects (represented by the M audio signals ⁇ ⁇ ) , o 2 (t) , ⁇ , o M (t) ) can be encoded into a N -channel Spatial Format signal X N (t) as per Equation 2 below (where object m is "located" at the position defined by (f ) m )
  • the signal X N (t) can be referred to as an Anechoic Mixture of the audio objects.
  • any sound emitted by the audio object will reach the listener via multiple paths.
  • This phenomenon is well known in the art, and the resulting sound, received at the listening position, is said to be reverberant.
  • the number of acoustic paths, formed by the propagation of sound from the object and reflected off acoustic surfaces to reach the listener, may be infinite, but a reasonably close estimate of the sound received at the listening position may be formed by considering a finite number ( E ) of echoes.
  • Fig. 2 illustrates an example of reverberance, where the sound from audio object m , 20, is received at the listening position from direction ⁇ ⁇ , along with one echo (echo e ) being received at the listening position from direction ⁇ ⁇ ' e .
  • d m e the delay (in samples) of echo e from object m (7)
  • Equation 2 shows how an N -channel soundfield signal, X N (t) , may be created by combining M audio objects, based on the assumption that each audio object has a location ( ⁇ ⁇ ) and an audio signal ( o m (t) ). [0036] It is possible to devise a more complex acoustic soundfield signal,
  • R N (t) X N (t) + Y N (t) , intended to contain all of the M audio objects, combined together in a way that includes a simulation of an acoustic space (by including E echoes for each object). This is shown in Equation 10 below:
  • R N (t) X N (t) + Y N (t) (9)
  • the signal Y N (t) can be referred to as the Reverberant Mixture of the audio objects.
  • the complete acoustic-simulation is created by summing together the Anechoic Mixture, X N (t) , and the Reverberant Mixture, Y N (t) .
  • Equation 10 the terminology [o m ®h me ](t) is used to indicate the convolution of the object audio signal o m (t) with the impulse response h me (t), and hence [ m ®h me ](t— ⁇ ) indicates the convolved signal with an additional delay of d me samples (where F s is the sample frequency).
  • Equation 11 may be written in terms of the frequency domain equation in Equation 12 below:
  • the N -channel soundfield signal format is defined by the panning function, ⁇ ( ) .
  • Equation 14 tells us that, if we wish to apply a 3x3 matrix transformation, A , to the ( , y, z) coordinates of an object location, prior to the computation of the panning function, we can instead achieve this transformation as a 4 x 4 matrix operation, applied to the panning-gain vector, after the computation of the panning function.
  • Equation 14 The result shown in in Equation 14 can be applied to Equation 2, in order to manipulate the location of all objects in audio scene, as per Equation 17 below.
  • a transformed soundfield signal, X N ' (t) is created from X N (t) , achieving the same result that would have occured if all of the objects had their ( , y, z) locations modified by the 3x3 matrix A .
  • Rotation The locations of all objects within a soundfield can be rotated around the listening position.
  • the manipulation of the (x, y, z) coordinates of each object may be defined in terms of a 3x3 matrix, A , and the manipulation of the 4-channel soundfield signal may be carried out according to Equation 17.
  • the locations of all objects within a soundfield may be mirrored about a plane that passes through the listening position.
  • the manipulation of the (x, y, z) coordinates of each object may be defined in terms of a 3x3 matrix, A , and the manipulation of the 4-channel soundfield signal may be carried out according to Equation 17.
  • Dominance A transformation of the 4-channel soundfield signal (known as the Lorentz transformation) may be applied by multiplying the 4 channels of the signal by the following 4 x 4 matrix:
  • Echo Time Simplification It will be recalled that the original reverberation calculation (as per Equation 10) treats the reverberation for each object as a series of echoes, wherein for object m, echo e, has a time delay (relative to the direct-path) equal to d m e (so, the echo times are different for each object).
  • a delay d k ' is defined to be the arrival time (relative to the direct sound) of echo k, and this delay is the same for every object (and hence, the echo delay, d k ' , is no longer dependant on the object identifier, m ).
  • Echo Direction Simplification The original reverberation calculation (as per Equation 10) treats the reverberation for each object as a series of echoes, wherein for object m , echo e has a direction of arrival, ⁇ ⁇ ' e (so, the echo arrival directions are different for each object).
  • Fig. 2 shows one method that may be used to achieve this, with the corresponding z - domain transfer function being shown in Equation 18 below:
  • the processing chain 100 includes a Delay Line, 3, with K taps (and, in the following explanation, the variable k can be used to refer to a specific tap number, so that k € ⁇ 1 ,2,...,K ⁇ ).
  • the input, 2, to the Delay Line 3 is the N -channel input signal, X N (t) .
  • an N -channel delayed signal e.g. 5, is taken from the Delay Line, and processed via an acoustic transformation process, 200, to produce an acoustically transformed delayed signal, 6.
  • the set of K acoustically transformed delayed signals are added together 7 to produce the output soundfield signal, 8.
  • Fig. 3 illustrates one example form of implementation of an Echo Processor 200 which applies an acoustic transformation process.
  • the input N -channel delayed signal 5 is processed, to produce the N -channel acoustically transformed delayed signal 6.
  • two operations are performed by the acoustic transformation process, a multi-channel matrix mixer (represented by the N N matrix R k ) 11, and a linear time- invariant filter, H k (z) e.g. 12, applied to each of the N channels of the soundfield signal.
  • a multi-channel matrix mixer represented by the N N matrix R k
  • H k (z) linear time- invariant filter
  • the intention of the acoustic transformation process is to create a simulation of the k' h acoustic echo according to the following operating principles: [0074] Echo Delay: The time delay of echo k is defined by use of the Delay Line so that input to the Delay Line 2 (of Fig. 2), is delayed by d k ' samples to give the input 5, to the k' h acoustic transformation process (referring to Figure 2).
  • Equation 17 substitution A k in place of A in Equation 17.
  • Echo Amplitude and Frequency Response The amplitude and frequency response of echo k are provided by the filter, H k (z) e.g. 12, applied to each of the N channels as per Fig. 3.
  • Equation 20 defines the 4 x 4 matrix, BtoA that is the inverse of AtoB .
  • R' and R" are arbitrary 3 x 3 rotation matrices.
  • Equation 21 Equation 25
  • EchoProcess k C k xH h ' xB k (25)
  • a processing train for implementing the method of Equation 25 is also shown in Fig. 4, with the matrix processing Bk and Ck being separately implemented 21, 23.
  • an acoustic transformation process can be implemented as a 4x4 matrix of arbitrary filter operations 200.
  • the methods described above may also be combined with alternative reverberation processes, which may be known in the art, to produce a reverberant mixture that contains some echoes generated according to the above described methods, along with additional echoes and reverberation that are generated by the alternative methods.
  • any one of the terms comprising, comprised of or which comprises is an open term that means including at least the elements/features that follow, but not excluding others. Thus, the term comprising, when used in the claims, should not be interpreted as being limitative to the means or elements or steps listed thereafter.
  • a device comprising A and B should not be limited to devices consisting only of elements A and B. Any one of the terms including or which includes or that includes as used herein is also an open term that also means including at least the elements/features that follow the term, but not excluding others. Thus, including is synonymous with and means comprising.
  • exemplary is used in the sense of providing examples, as opposed to indicating quality. That is, an "exemplary embodiment” is an embodiment provided as an example, as opposed to necessarily being an embodiment of exemplary quality.
  • Coupled when used in the claims, should not be interpreted as being limited to direct connections only.
  • the terms “coupled” and “connected,” along with their derivatives, may be used. It should be understood that these terms are not intended as synonyms for each other.
  • the scope of the expression a device A coupled to a device B should not be limited to devices or systems wherein an output of device A is directly connected to an input of device B. It means that there exists a path between an output of A and an input of B which may be a path including other devices or means.
  • Coupled may mean that two or more elements are either in direct physical or electrical contact, or that two or more elements are not in direct contact with each other but yet still co-operate or interact with each other.

Landscapes

  • Physics & Mathematics (AREA)
  • Engineering & Computer Science (AREA)
  • Acoustics & Sound (AREA)
  • General Physics & Mathematics (AREA)
  • Mathematical Analysis (AREA)
  • Mathematical Optimization (AREA)
  • Mathematical Physics (AREA)
  • Pure & Applied Mathematics (AREA)
  • Theoretical Computer Science (AREA)
  • Algebra (AREA)
  • Signal Processing (AREA)
  • Multimedia (AREA)
  • Stereophonic System (AREA)
PCT/US2016/044286 2015-07-29 2016-07-27 System and method for spatial processing of soundfield signals Ceased WO2017019781A1 (en)

Priority Applications (5)

Application Number Priority Date Filing Date Title
CN201680043670.9A CN107851432B (zh) 2015-07-29 2016-07-27 用于声场信号的空间处理的系统和方法
EP16747709.0A EP3329485B1 (de) 2015-07-29 2016-07-27 System und verfahren zur räumlichen verarbeitung von schallfeldsignalen
CN202111507803.2A CN114302315B (zh) 2015-07-29 2016-07-27 用于声场信号的空间处理的系统和方法
US15/746,787 US10932078B2 (en) 2015-07-29 2016-07-27 System and method for spatial processing of soundfield signals
US17/166,162 US11381927B2 (en) 2015-07-29 2021-02-03 System and method for spatial processing of soundfield signals

Applications Claiming Priority (4)

Application Number Priority Date Filing Date Title
US201562198440P 2015-07-29 2015-07-29
US62/198,440 2015-07-29
EP15185913.9 2015-09-18
EP15185913 2015-09-18

Related Child Applications (2)

Application Number Title Priority Date Filing Date
US15/746,787 A-371-Of-International US10932078B2 (en) 2015-07-29 2016-07-27 System and method for spatial processing of soundfield signals
US17/166,162 Division US11381927B2 (en) 2015-07-29 2021-02-03 System and method for spatial processing of soundfield signals

Publications (1)

Publication Number Publication Date
WO2017019781A1 true WO2017019781A1 (en) 2017-02-02

Family

ID=54150339

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/US2016/044286 Ceased WO2017019781A1 (en) 2015-07-29 2016-07-27 System and method for spatial processing of soundfield signals

Country Status (3)

Country Link
EP (1) EP3329485B1 (de)
CN (2) CN107851432B (de)
WO (1) WO2017019781A1 (de)

Cited By (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US10149082B2 (en) 2015-02-12 2018-12-04 Dolby Laboratories Licensing Corporation Reverberation generation for headphone virtualization
US10390166B2 (en) 2017-05-31 2019-08-20 Qualcomm Incorporated System and method for mixing and adjusting multi-input ambisonics
CN110800048A (zh) * 2017-05-09 2020-02-14 杜比实验室特许公司 多通道空间音频格式输入信号的处理

Families Citing this family (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN107851432B (zh) * 2015-07-29 2022-01-28 杜比实验室特许公司 用于声场信号的空间处理的系统和方法

Citations (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
GB2255884A (en) * 1991-04-04 1992-11-18 Michael Anthony Gerzon Producing simulated sound distance effects
WO2014159376A1 (en) * 2013-03-12 2014-10-02 Dolby Laboratories Licensing Corporation Method of rendering one or more captured audio soundfields to a listener

Family Cites Families (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2007010771A1 (ja) * 2005-07-15 2007-01-25 Matsushita Electric Industrial Co., Ltd. 信号処理装置
US8705757B1 (en) * 2007-02-23 2014-04-22 Sony Computer Entertainment America, Inc. Computationally efficient multi-resonator reverberation
DK3550859T3 (da) * 2015-02-12 2021-11-01 Dolby Laboratories Licensing Corp Hovedtelefonsvirtualisering
CN107851432B (zh) * 2015-07-29 2022-01-28 杜比实验室特许公司 用于声场信号的空间处理的系统和方法

Patent Citations (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
GB2255884A (en) * 1991-04-04 1992-11-18 Michael Anthony Gerzon Producing simulated sound distance effects
WO2014159376A1 (en) * 2013-03-12 2014-10-02 Dolby Laboratories Licensing Corporation Method of rendering one or more captured audio soundfields to a listener

Non-Patent Citations (2)

* Cited by examiner, † Cited by third party
Title
JOSEPH ANDERSON ET AL: "ADAPTING ARTIFICIAL REVERBERATION ARCHITECTURES FOR B-FORMAT SIGNAL PROCESSING", AMBISONICS SYMPOSIUM 2009, 27 June 2009 (2009-06-27), pages 1 - 5, XP055246673, Retrieved from the Internet <URL:http://ambisonics.iem.at/symposium2009/proceedings/ambisym09-josephandersonseancostello-ambireverbarch.pdf/@@download/file/AmbiSym09_JosephAndersonSeanCostello_AmbiReverbArch.pdf> [retrieved on 20160202] *
LOPEZ-LEZCANO ET AL: "An Architecture for Reverberation in High Order Ambisonics", AES CONVENTION 137; OCTOBER 2014, AES, 60 EAST 42ND STREET, ROOM 2520 NEW YORK 10165-2520, USA, 8 October 2014 (2014-10-08), XP040639018 *

Cited By (9)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US10149082B2 (en) 2015-02-12 2018-12-04 Dolby Laboratories Licensing Corporation Reverberation generation for headphone virtualization
US10382875B2 (en) 2015-02-12 2019-08-13 Dolby Laboratories Licensing Corporation Reverberation generation for headphone virtualization
US10750306B2 (en) 2015-02-12 2020-08-18 Dolby Laboratories Licensing Corporation Reverberation generation for headphone virtualization
US11140501B2 (en) 2015-02-12 2021-10-05 Dolby Laboratories Licensing Corporation Reverberation generation for headphone virtualization
US11671779B2 (en) 2015-02-12 2023-06-06 Dolby Laboratories Licensing Corporation Reverberation generation for headphone virtualization
US12143797B2 (en) 2015-02-12 2024-11-12 Dolby Laboratories Licensing Corporation Reverberation generation for headphone virtualization
CN110800048A (zh) * 2017-05-09 2020-02-14 杜比实验室特许公司 多通道空间音频格式输入信号的处理
CN110800048B (zh) * 2017-05-09 2023-07-28 杜比实验室特许公司 多通道空间音频格式输入信号的处理
US10390166B2 (en) 2017-05-31 2019-08-20 Qualcomm Incorporated System and method for mixing and adjusting multi-input ambisonics

Also Published As

Publication number Publication date
EP3329485A1 (de) 2018-06-06
CN107851432B (zh) 2022-01-28
CN114302315A (zh) 2022-04-08
EP3329485B1 (de) 2020-08-26
CN114302315B (zh) 2023-10-31
CN107851432A (zh) 2018-03-27

Similar Documents

Publication Publication Date Title
CN105900457B (zh) 用于设计和应用数值优化的双耳房间脉冲响应的方法和系统
KR101346490B1 (ko) 오디오 신호 처리 방법 및 장치
JP2025123226A (ja) 少なくとも一つのフィードバック遅延ネットワークを使ったマルチチャネル・オーディオに応答したバイノーラル・オーディオの生成
JP6130599B2 (ja) 第1および第2の入力チャネルを少なくとも1個の出力チャネルにマッピングするための装置及び方法
AU699647B2 (en) Method and apparatus for efficient presentation of high-quality three-dimensional audio
EP3406088B1 (de) Synthese von signalen für immersive audiowiedergabe
US11950078B2 (en) Binaural dialogue enhancement
JP2013211906A (ja) 音声空間化及び環境シミュレーション
KR20180075610A (ko) 사운드 스테이지 향상을 위한 장치 및 방법
CN110326310A (zh) 串扰消除的动态均衡
EP3329485B1 (de) System und verfahren zur räumlichen verarbeitung von schallfeldsignalen
JP2023070650A (ja) 音場の少なくとも一部の位置決めによる空間オーディオ再生
US9794717B2 (en) Audio signal processing apparatus and audio signal processing method
US11381927B2 (en) System and method for spatial processing of soundfield signals
CN101278597B (zh) 生成空间声音的方法和装置
Hofmann et al. Generalized wave-domain transforms for listening room equalization with azimuthally irregularly spaced loudspeaker arrays
CN119278636A (zh) 信号处理方法
Saari Modulaarisen arkkitehtuurin toteuttaminen Directional Audio Coding-menetelmälle
Pulkki Implementing a modular architecture for virtual-world Directional Audio Coding
McGrath et al. Creation, manipulation and playback of sound field
HK1196738A (en) Audio spatialization and environment simulation

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 16747709

Country of ref document: EP

Kind code of ref document: A1

NENP Non-entry into the national phase

Ref country code: DE