WO2019183271A1 - Multi-channel subband spatial processing for loudspeakers - Google Patents
- Publication number
- WO2019183271A1 (PCT/US2019/023243)
- Authority
- WO
- WIPO (PCT)
- Prior art keywords
- channel
- input channel
- peripheral input
- crosstalk
- channels
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Ceased
Classifications
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04S—STEREOPHONIC SYSTEMS
- H04S3/00—Systems employing more than two channels, e.g. quadraphonic
- H04S3/002—Non-adaptive circuits, e.g. manually adjustable or static, for enhancing the sound image or the spatial distribution
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04S—STEREOPHONIC SYSTEMS
- H04S3/00—Systems employing more than two channels, e.g. quadraphonic
- H04S3/008—Systems employing more than two channels, e.g. quadraphonic in which the audio signals are in digital form, i.e. employing more than two discrete digital channels
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04R—LOUDSPEAKERS, MICROPHONES, GRAMOPHONE PICK-UPS OR LIKE ACOUSTIC ELECTROMECHANICAL TRANSDUCERS; ELECTRIC HEARING AIDS; PUBLIC ADDRESS SYSTEMS
- H04R3/00—Circuits for transducers
- H04R3/12—Circuits for transducers for distributing signals to two or more loudspeakers
- H04R3/14—Cross-over networks
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04S—STEREOPHONIC SYSTEMS
- H04S2400/00—Details of stereophonic systems covered by H04S but not provided for in its groups
- H04S2400/01—Multi-channel, i.e. more than two input channels, sound reproduction with two speakers wherein the multi-channel information is substantially preserved
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04S—STEREOPHONIC SYSTEMS
- H04S2400/00—Details of stereophonic systems covered by H04S but not provided for in its groups
- H04S2400/03—Aspects of down-mixing multi-channel audio to configurations with lower numbers of playback channels, e.g. 7.1 -> 5.1
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04S—STEREOPHONIC SYSTEMS
- H04S2400/00—Details of stereophonic systems covered by H04S but not provided for in its groups
- H04S2400/05—Generation or adaptation of centre channel in multi-channel audio systems
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04S—STEREOPHONIC SYSTEMS
- H04S2400/00—Details of stereophonic systems covered by H04S but not provided for in its groups
- H04S2400/13—Aspects of volume control, not necessarily automatic, in stereophonic sound systems
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04S—STEREOPHONIC SYSTEMS
- H04S2420/00—Techniques used stereophonic systems covered by H04S but not provided for in its groups
- H04S2420/01—Enhancing the perception of the sound image or of the spatial distribution using head related transfer functions [HRTF's] or equivalents thereof, e.g. interaural time difference [ITD] or interaural level difference [ILD]
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04S—STEREOPHONIC SYSTEMS
- H04S2420/00—Techniques used stereophonic systems covered by H04S but not provided for in its groups
- H04S2420/07—Synergistic effects of band splitting and sub-band processing
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04S—STEREOPHONIC SYSTEMS
- H04S7/00—Indicating arrangements; Control arrangements, e.g. balance control
- H04S7/30—Control circuits for electronic adaptation of the sound field
- H04S7/302—Electronic adaptation of stereophonic sound system to listener position or orientation
- H04S7/303—Tracking of listener position or orientation
Definitions
- Embodiments of the present disclosure generally relate to the field of audio signal processing and, more particularly, to spatially enhanced multi-channel audio.
- Surround sound refers to sound reproduction of an audio signal including multiple channels with loudspeakers positioned around a listener.
- 5.1 surround sound uses six channels for a front speaker, left and right speakers, a subwoofer, and rear (or “surround”) left and rear right speakers.
- 7.1 surround sound uses eight channels by separating the rear left and right speakers of the 5.1 surround sound configuration into four separate speakers, such as a left surround speaker, a right surround speaker, a left rear surround speaker, and a right rear surround speaker.
- Audio channels of the multi-channel audio signal may be associated with an angular position that corresponds with the location of the speaker to which the audio channels are output.
- the multi-channel audio signals allow a listener to perceive a spatial sense in the sound field when the audio signals are output to speakers at different locations.
- the spatial sense may be lost when the multi-channel audio signals for surround sound are output to stereo (e.g., left and right) loudspeakers or head-mounted speakers.
- Example embodiments relate to processing a multi-channel (e.g., surround sound) input audio signal into a stereo output signal for left and right speakers, while preserving or enhancing the spatial sense of the sound field of the multi-channel input audio signal.
- the processing results in a listening experience whereby each channel of the audio signal is perceived as originating from the same or similar direction as would occur if the audio signal were rendered on a surround sound system (e.g., 5.1, 7.1, etc.).
- a multi-channel input audio signal including a left input channel, a right input channel, a left peripheral input channel, and a right peripheral input channel is received.
- a subband spatial processing is performed on the left input channel, the right input channel, the left peripheral input channel, and the right peripheral input channel to create spatially enhanced channels.
- the subband spatial processing may include gain adjusting mid and side subband components of the left input channel, the right input channel, the left peripheral input channel, and the right peripheral input channel.
- Crosstalk cancellation is performed on the spatially enhanced channels to create a left crosstalk cancelled channel and a right crosstalk cancelled channel.
- a left output channel is generated from the left crosstalk cancelled channel and a right output channel is generated from the right crosstalk cancelled channel.
- the left and right peripheral channels may include a left surround input channel and a right surround input channel, and/or a left surround rear input channel and a right surround rear input channel.
- the multi-channel input audio signal may further include a center channel and a low frequency channel that may be combined with the output of the crosstalk cancellation.
- the subband spatial processing is performed on each of the corresponding pairs of left and right channels.
- subband spatial processing may be performed by gain adjusting the mid subband components and the side subband components of the left input channel and the right input channel, gain adjusting the mid subband components and the side subband components of the left peripheral input channel and the right peripheral input channel, and combining the gain adjusted mid subband components and the gain adjusted side subband components of the left input channel, the right input channel, the left peripheral input channel, and the right peripheral input channel into a left combined channel and a right combined channel.
- the crosstalk cancellation is performed on the left and right combined channels to generate the output channels.
- the subband spatial processing is performed on combined left and right channels.
- the subband spatial processing may include combining the left input channel and the left peripheral input channel into a left combined channel, combining the right input channel and the right peripheral input channel into a right combined channel, and gain adjusting mid subband components and the side subband components of the left combined channel and the right combined channel to create a left spatially enhanced channel and a right spatially enhanced channel.
- the crosstalk cancellation is performed on the left and right spatially enhanced channels to generate the output channels.
- a binaural filter is applied to at least a portion of the input channels.
- a binaural filter is applied to the peripheral input channels to adjust for angular positions associated with the peripheral input channels.
- a binaural filter is applied to any input channel as suitable to adjust for the angular positions associated with the input channel, including the left or right input channels.
- Some embodiments may include a system for processing a multi-channel input audio signal.
- the system includes circuitry configured to: receive the multi-channel input audio signal including a left input channel, a right input channel, a left peripheral input channel, and a right peripheral input channel; perform subband spatial processing on the left input channel, the right input channel, the left peripheral input channel, and the right peripheral input channel to create spatially enhanced channels, the subband spatial processing including gain adjusting mid and side subband components of the left input channel, the right input channel, the left peripheral input channel, and the right peripheral input channel; perform crosstalk cancellation on the spatially enhanced channels to create a left crosstalk cancelled channel and a right crosstalk cancelled channel; and generate a left output channel from the left crosstalk cancelled channel and a right output channel from the right crosstalk cancelled channel.
- Some embodiments may include a non-transitory computer readable medium storing program code.
- the program code may be software comprised of executable instructions.
- the program code may be executed by one or more processors.
- the program code, when executed by a processor, causes the processor to receive a multi-channel input audio signal including a left input channel, a right input channel, a left peripheral input channel, and a right peripheral input channel. When executed by the processor, the program code may cause the processor to perform subband spatial processing on the left input channel, the right input channel, the left peripheral input channel, and the right peripheral input channel to create spatially enhanced channels.
- the subband spatial processing may include gain adjusting mid and side subband components of the left input channel, the right input channel, the left peripheral input channel, and the right peripheral input channel.
- the program code, when executed by the processor, may cause the processor to perform crosstalk cancellation on the spatially enhanced channels to create a left crosstalk cancelled channel and a right crosstalk cancelled channel.
- the program code, when executed by the processor, also may cause the processor to generate a left output channel from the left crosstalk cancelled channel and a right output channel from the right crosstalk cancelled channel.
- FIG. 1 illustrates an example of a surround sound stereo audio reproduction system, according to one embodiment.
- FIG. 2 illustrates an example of an audio system, according to one embodiment.
- FIG. 3 illustrates an example of a subband spatial processor, according to one embodiment.
- FIG. 4 illustrates an example of a crosstalk cancellation processor, according to one embodiment.
- FIG. 5 illustrates an example of a method for enhancing an audio signal with the audio system shown in FIG. 2, according to one embodiment.
- FIG. 6 illustrates an example of an audio system, according to one embodiment.
- FIG. 7 illustrates an example of a method for enhancing an audio signal with the audio system shown in FIG. 6, according to one embodiment.
- FIG. 8 illustrates an example of a computer system, according to one embodiment.
- the audio systems discussed herein provide crosstalk processing and spatial enhancement for a multi-channel surround sound audio signal for output to stereo (e.g., left and right) speakers.
- the signal processing preserves or enhances the spatial sense of the sound field encoded in the multi-channel surround sound audio signal.
- the spatial sense normally achieved using multi-speaker surround sound systems is thereby achieved using stereo loudspeakers.
- FIG. 1 illustrates an example of a surround sound stereo audio reproduction system 100, according to one embodiment.
- the system 100 is an example of a 7.1 surround sound system that provides audio signal reproduction to a listener 140.
- the system 100 includes a left speaker 110L, a right speaker 110R, a center speaker 115, a subwoofer 125, a left surround speaker 120L, a right surround speaker 120R, a left surround rear speaker 130L, and a right surround rear speaker 130R.
- the center speaker 115 and subwoofer 125 may be positioned in front of the listener 140, which defines a forward axis at 0°.
- the left speaker 110L may be positioned at an angle between -20° and -30° relative to the forward axis, and the right speaker 110R may be positioned at an angle between 20° and 30° relative to the forward axis.
- the left surround speaker 120L may be positioned at an angle between -90° and -110° relative to the forward axis, and the right surround speaker 120R may be positioned at an angle between 90° and 110° relative to the forward axis.
- the left surround rear speaker 130L may be positioned at an angle between -135° and -150° relative to the forward axis, and the right surround rear speaker 130R may be positioned at an angle between 135° and 150° relative to the forward axis.
- the system 100 may be configured to receive an audio signal including channels for each of the speakers 110, 115, 120, and 130 and the subwoofer 125.
- the multiple speakers and their positional arrangement provide a spatial sense in the sound field that can be perceived by the listener 140.
- the audio system may be configured to process a multi-channel input audio signal for the surround sound system 100 into an enhanced stereo signal for left and right speakers (e.g., speakers 110L and 110R) that reproduces or simulates the spatial sense in the sound field generated by the surround sound system 100 using the multi-channel audio signal.
- FIG. 2 illustrates an example of an audio system 200, according to one embodiment.
- the audio system 200 receives an input audio signal including a left input channel 210A, a right input channel 210B, a center input channel 210C, a low frequency input channel 210D, a left surround input channel 210E, a right surround input channel 210F, a left surround rear input channel 210G, and a right surround rear input channel 210H.
- the channels 210E, 210F, 210G, and 210H are examples of peripheral channels for surround speakers.
- Peripheral channels may include channels other than the left and right input channels.
- Peripheral channels may include channel pairs, such as left-right pairs, or front-back pairs, or other pair arrangements.
- the left surround speaker 120L receives the left surround input channel 210E
- the right surround speaker 120R receives the right surround input channel 210F
- the left surround rear speaker 130L receives the left surround rear input channel 210G
- the right surround rear speaker 130R receives the right surround rear input channel 210H.
- In some embodiments, the input audio signal has fewer or more peripheral channels.
- an audio input signal for a 5.1 surround sound system may include only two peripheral channels, such as left and right surround input channels that may be output to left and right surround speakers.
- the left speaker 110L may receive the left input channel 210A
- the right speaker 110R may receive the right input channel 210B
- the center speaker 115 may receive the center input channel 210C
- the subwoofer 125 may receive the low frequency input channel 210D.
- the input audio signal provides a spatial sense of the sound field when output by the surround sound stereo audio reproduction system 100.
- the audio system 200 receives the input audio signal and generates an output signal including a left output channel 290L and a right output channel 290R.
- the audio system 200 may combine the input channels of the input audio signal, and may further provide enhancements such as subband spatial processing and crosstalk cancellation, to generate the output audio signal.
- the left output channel 290L may be provided to a left speaker and the right output channel 290R may be output to a right speaker.
- the output audio signal provides a spatial sense of the sound field using the left and right speakers (e.g., left speaker 110L and right speaker 110R) that is typically achieved by outputting the input audio signal using a surround sound system including multiple (e.g., peripheral) speakers.
- the audio system 200 includes gains 215A, 215B, 215C, 215D, 215E, 215F, 215G, and 215H, subband spatial processors 230A, 230B, and 230C, a high shelf filter 220, a divider 240, binaural filters 250A, 250B, 250C, and 250D, a left channel combiner 260A, a right channel combiner 260B, a crosstalk cancellation processor 270, a left channel combiner 260C, a right channel combiner 260D, and an output gain 280.
- Each of the gains 215A through 215H may receive a respective input channel 210A through 210H, and may apply a gain to that input channel.
- the gains 215A through 215H may be different to adjust gains of the input channels with respect to each other, or may be the same.
- positive gains are applied to the left and right peripheral input channels 210E, 210F, 210G, and 210H, and a negative gain is applied to the center channel 210C.
- the gain 215A may apply a 0 dB gain
- the gain 215B may apply a 0 dB gain
- the gain 215C may apply a -3 dB gain
- the gain 215D may apply a 0 dB gain
- the gain 215E may apply a 3 dB gain
- the gain 215F may apply a 3 dB gain
- the gain 215G may apply a 3 dB gain
- the gain 215H may apply a 3 dB gain.
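- As a minimal sketch of how the example gains listed above might be applied, the following converts each dB value to a linear amplitude factor and scales the samples of a channel. The function names and the dictionary layout are hypothetical and are not part of the disclosure; only the dB values come from the text.

```python
# Example per-channel gains in dB, keyed by the gain reference numbers above.
GAINS_DB = {
    "215A": 0.0, "215B": 0.0, "215C": -3.0, "215D": 0.0,
    "215E": 3.0, "215F": 3.0, "215G": 3.0, "215H": 3.0,
}


def db_to_linear(gain_db):
    """Convert a gain in decibels to a linear amplitude factor."""
    return 10.0 ** (gain_db / 20.0)


def apply_gain(samples, gain_db):
    """Scale every sample of one channel by the given gain in dB."""
    factor = db_to_linear(gain_db)
    return [factor * x for x in samples]


# Hypothetical usage: attenuate a few center-channel samples by 3 dB.
center = apply_gain([1.0, -0.5, 0.25], GAINS_DB["215C"])
```

- With these values the center channel is scaled by roughly 0.71 and each peripheral channel by roughly 1.41, while the left, right, and low frequency channels pass through unchanged.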
- the gain 215A and gain 215B are coupled to the subband spatial processor 230A.
- the gains 215E and 215F are coupled to the subband spatial processor 230B
- the gains 215G and 215H are coupled to the subband spatial processor 230C.
- the subband spatial processors 230A, 230B, and 230C each apply subband spatial processing to corresponding left and right channel pairs.
- Each subband spatial processor 230 performs subband spatial processing on a left and right input channel by gain adjusting mid and side subband components of the left and right input channels to generate left and right spatially enhanced channels.
- the subband spatial processor 230A performs the subband spatial processing on the left and right input channels, while the other subband spatial processors 230B and 230C each perform the subband spatial processing on corresponding left and right peripheral channels.
- the audio system 200 may include more or fewer subband spatial processors.
- channels without left/right counterparts can bypass the subband spatial processing.
- the subband spatial processor 230B is coupled to the binaural filters 250A and 250B.
- the subband spatial processor 230B provides a left spatially enhanced channel to the binaural filter 250A, and provides a right spatially enhanced channel to the binaural filter 250B.
- the subband spatial processor 230C is coupled to the binaural filters 250C and 250D.
- the subband spatial processor 230C provides a left spatially enhanced channel to the binaural filter 250C, and provides a right spatially enhanced channel to the binaural filter 250D.
- Additional details regarding a subband spatial processor 230 are shown in FIG. 3 and discussed below.
- Each of the binaural filters 250A, 250B, 250C, and 250D applies a head-related transfer function (HRTF) that describes the target source location from which the listener should perceive the sound of the input channel.
- Each binaural filter receives an input channel and generates a left and right output channel by applying an HRTF that adjusts for an angular position associated with the input channel.
- the angular position may include an angle defined in an X-Y “azimuthal” plane relative to the listener 140 as shown in FIG. 1, and may further include an angle defined in the Z axis, such as for an ambisonics signal or a channel-based format containing signals intended to be rendered above or below the X-Y plane relative to the listener 140.
- the binaural filter 250A may be configured to apply a filter based on the left surround input channel 210E being associated with the angle (defined in the X-Y plane) between -90° and -110° relative to the forward axis of the left surround speaker 120L.
- the binaural filter 250B may be configured to apply a filter based on the right surround input channel 210F being associated with the angle between 90° and 110° relative to the forward axis of the right surround speaker 120R.
- the binaural filter 250C may be configured to apply a filter based on the left surround rear input channel 210G being associated with the angle between -135° and -150° relative to the forward axis of the left surround rear speaker 130L.
- the binaural filter 250D may be configured to apply a filter based on the right surround rear input channel 210H being associated with the angle between 135° and 150° relative to the forward axis of the right surround rear speaker 130R. In some embodiments, the binaural processing may be bypassed entirely in order to preserve inter-channel spectral uniformity. One or more of the binaural filters 250A, 250B, 250C, and 250D may be omitted from the audio system 200. However, the binaural filters 250A, 250B, 250C, and 250D may be used to enhance spatial imaging. In some embodiments, binaural filtering may be applied to channels other than peripheral input channels.
- a binaural filter may be applied to each of the left and right spatially enhanced channels that are output from the subband spatial processor 230A to adjust for different left and right output speaker location.
- In some embodiments, the input audio signal includes channels associated with other speaker locations (e.g., overhead, rear-center, etc.).
- binaural processing may be applied to the other input channels. In that sense, binaural processing may be applied to one or more of the left input channel 210A, the right input channel 210B, the center input channel 210C, or the low frequency input channel 210D.
- In some embodiments, HRTFs are not applied, and one or more of the binaural filters 250A, 250B, 250C, and 250D may be bypassed or omitted from the system 200.
- An example binaural filter may be defined by Equation 1:
- $S_o(z) = H(\theta, z)\,S_i(z)$   Eq. (1)
- $S_o$ and $S_i$ are the output and input signals, respectively.
- the argument $\theta$ encodes the angle of each channel in $S_i$ and $S_o$.
- the value $z$ is an arbitrary complex number encoding frequency, of which the solution is a function.
- $H(\theta, z)$ is therefore a function of both the angle $\theta$ and $z$, returning a transfer function, itself a function of $z$, which may be selected or interpolated among a collection of transfer functions, perhaps derived from an anthropometric database.
- the angle $\theta$, as well as $S$ and $H(\theta)$ as functions of $z$, may evaluate to vectors if multichannel processing is desired. In this case, each coefficient in $S(z)$ and $H(\theta, z)$ corresponds to a channel, and each coefficient in $\theta$ associates an angle to each channel.
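- The relation in Equation 1 can be sketched in the time domain, where multiplying by a transfer function corresponds to convolving with its impulse response. The example below is only an illustration under stated assumptions: the impulse-response table is made-up toy data (not measured HRTFs), the nearest-angle lookup stands in for the "selected or interpolated" step, and all names are hypothetical.

```python
def convolve(signal, impulse_response):
    """Direct-form convolution of two finite sequences."""
    out = [0.0] * (len(signal) + len(impulse_response) - 1)
    for n, x in enumerate(signal):
        for k, h in enumerate(impulse_response):
            out[n + k] += x * h
    return out


# Hypothetical toy impulse responses keyed by azimuth in degrees:
# (left-ear response, right-ear response). These are NOT real HRTF data.
HRIR_TABLE = {
    -140.0: ([0.8, 0.2], [0.0, 0.0, 0.0, 0.3]),
    -100.0: ([1.0, 0.3], [0.0, 0.0, 0.4]),
    100.0: ([0.0, 0.0, 0.4], [1.0, 0.3]),
    140.0: ([0.0, 0.0, 0.0, 0.3], [0.8, 0.2]),
}


def select_hrir(angle_deg):
    """Pick the table entry whose angle is closest to the requested angle."""
    nearest = min(HRIR_TABLE, key=lambda a: abs(a - angle_deg))
    return HRIR_TABLE[nearest]


def binaural_filter(channel, angle_deg):
    """Return (left, right) outputs for one input channel at angle_deg."""
    left_ir, right_ir = select_hrir(angle_deg)
    return convolve(channel, left_ir), convolve(channel, right_ir)


# Hypothetical usage: a unit impulse on a channel associated with -95 degrees.
left_out, right_out = binaural_filter([1.0, 0.0], -95.0)
```

- A practical system would more likely interpolate between measured responses and handle angle wrap-around near ±180°, neither of which this sketch attempts.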
- In some embodiments, the input audio signal is an ambisonics audio signal defining a speaker-independent representation of a sound field.
- the ambisonics audio signal may be decoded into a multi-channel audio signal for a surround sound system.
- the channels may be associated with speaker locations at various locations, including locations that are above or below the listener.
- a binaural filter may be applied to each decoded input channel of the ambisonics audio signal to adjust for the associated position of the decoded input audio channel.
- In some embodiments, the binaural filtering is performed prior to subband spatial processing. For example, a binaural filter may be applied to one or more of the input channels as suitable to adjust for angular positions associated with the channels.
- the left output channels of the binaural filters may be combined, and right output channels of the binaural filters may be combined, and the subband spatial processing may be applied to the combined left and right channels.
- In some embodiments, binaural filters are applied to the center input channel 210C or the low frequency input channel 210D. In some embodiments, binaural filters are applied to each input channel except the low frequency input channel 210D.
- the left channel combiner 260A is coupled to the subband spatial processor 230A, and the binaural filters 250A, 250B, 250C, and 250D.
- the left channel combiner 260A receives the left output channels of the subband spatial processor 230A and the binaural filters 250A, 250B, 250C, and 250D, and combines these channels into a left combined channel.
- the right channel combiner 260B is also coupled to the subband spatial processor 230A, and the binaural filters 250A, 250B, 250C, and 250D.
- the right channel combiner 260B receives the right output channels of the subband spatial processor 230A and the binaural filters 250A, 250B, 250C, and 250D, and combines these channels into a right combined channel.
- the crosstalk cancellation processor 270 receives left and right input channels and performs a crosstalk cancellation to generate left and right crosstalk cancelled channels.
- the crosstalk cancellation processor is coupled to the left channel combiner 260A to receive a left combined channel, and the right channel combiner 260B to receive a right combined channel.
- the left and right combined channels processed by the crosstalk cancellation processor 270 represent mixed down left and right counterpart input channels. Additional details regarding the crosstalk cancellation processor 270 are shown in FIG. 4 and discussed below.
- the high shelf filter 220 receives the center input channel 210C and applies a high frequency shelving or peaking filter.
- the high shelf filter 220 provides a “voice-lift” on the center input channel 210C.
- In some embodiments, the high shelf filter 220 is bypassed, or omitted from the audio system 200.
- the high shelf filter 220 may attenuate or amplify frequencies above a corner frequency.
- the high shelf filter 220 is coupled to the left channel combiner 260C and the right channel combiner 260D.
- the high shelf filter 220 is defined by a 750 Hz corner frequency, a +3 dB gain, and 0.8 Q factor.
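- The disclosure gives the example parameters (750 Hz corner frequency, +3 dB gain, 0.8 Q factor) but not a coefficient formula. As one possible realization, the sketch below uses the widely used high-shelf design from Robert Bristow-Johnson's Audio EQ Cookbook, which is a swapped-in technique and not necessarily the method intended here. The 48 kHz sample rate and all function names are assumptions.

```python
import math


def high_shelf_coefficients(corner_hz, gain_db, q, sample_rate_hz):
    """Return normalized biquad coefficients (b, a) for a high shelf filter."""
    amp = 10.0 ** (gain_db / 40.0)  # square root of the linear shelf gain
    w0 = 2.0 * math.pi * corner_hz / sample_rate_hz
    cos_w0 = math.cos(w0)
    alpha = math.sin(w0) / (2.0 * q)
    k = 2.0 * math.sqrt(amp) * alpha
    b0 = amp * ((amp + 1.0) + (amp - 1.0) * cos_w0 + k)
    b1 = -2.0 * amp * ((amp - 1.0) + (amp + 1.0) * cos_w0)
    b2 = amp * ((amp + 1.0) + (amp - 1.0) * cos_w0 - k)
    a0 = (amp + 1.0) - (amp - 1.0) * cos_w0 + k
    a1 = 2.0 * ((amp - 1.0) - (amp + 1.0) * cos_w0)
    a2 = (amp + 1.0) - (amp - 1.0) * cos_w0 - k
    # Normalize so that the leading denominator coefficient is 1.
    return [b0 / a0, b1 / a0, b2 / a0], [1.0, a1 / a0, a2 / a0]


def biquad_filter(samples, b, a):
    """Apply a normalized biquad (direct form I) to a list of samples."""
    x1 = x2 = y1 = y2 = 0.0
    out = []
    for x in samples:
        y = b[0] * x + b[1] * x1 + b[2] * x2 - a[1] * y1 - a[2] * y2
        x2, x1 = x1, x
        y2, y1 = y1, y
        out.append(y)
    return out


# Example parameters from the text; the 48 kHz sample rate is an assumption.
b, a = high_shelf_coefficients(750.0, 3.0, 0.8, 48000.0)
dc_gain = sum(b) / sum(a)                                    # response at z = 1
nyquist_gain = (b[0] - b[1] + b[2]) / (a[0] - a[1] + a[2])   # response at z = -1
```

- With this design the response is unity at DC and approaches the full +3 dB shelf gain toward the Nyquist frequency, which matches the idea of lifting the upper part of the center channel.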
- the high shelf filter 220 generates a left center channel and a right center channel as output, such as by separating the center input channel into two separate left and right center channels.
- the divider 240 receives the low frequency input channel 210D, and separates the low frequency input channel 210D into left and right low frequency channels.
- the divider 240 is coupled to the left channel combiner 260C and the right channel combiner 260D, and provides the left low frequency channel to the left channel combiner 260C and the right low frequency channel to the right channel combiner 260D.
- the left channel combiner 260C is coupled to the crosstalk cancellation processor 270, the high shelf filter 220, and the divider 240.
- the left channel combiner 260C receives the left crosstalk cancelled channel from the crosstalk cancellation processor 270, the left center channel from the high shelf filter 220, and the left low frequency channel from the divider 240, and combines these channels into a left output channel.
- the right channel combiner 260D is coupled to the crosstalk cancellation processor 270, the high shelf filter 220, and the divider 240.
- the right channel combiner 260D receives the right crosstalk cancelled channel from the crosstalk cancellation processor 270, the right center channel from the high shelf filter 220, and the right low frequency channel from the divider 240, and combines these channels into a right output channel.
- In some embodiments, the left center channel from the high shelf filter 220 and the left low frequency channel from the divider 240 are combined by the left channel combiner 260A with the left spatially enhanced channel from the subband spatial processor 230A and the left output channels of the binaural filters 250A, 250B, 250C, and 250D to generate the left combined channel.
- the right center channel from the high shelf filter 220 and the right low frequency channel from the divider 240 are combined by the right channel combiner 260B with the right spatially enhanced channel from the subband spatial processor 230A and the right output channels of the binaural filters 250A, 250B, 250C, and 250D to generate the right combined channel.
- the left and right combined channels are input into the crosstalk cancellation processor 270.
- Here, the center and low frequency channels receive the crosstalk cancellation operation.
- the left channel combiner 260C and right channel combiner 260D may be omitted. In some embodiments, one of the center or low frequency channels receives the crosstalk cancellation operation.
- the output gain 280 is coupled to left channel combiner 260C and the right channel combiner 260D.
- the output gain 280 applies a gain to the left output channel from the left channel combiner 260C, and applies a gain to the right output channel from the right channel combiner 260D.
- the output gain 280 may apply the same gain to the left and right output channels, or may apply different gains.
- the output gain 280 outputs the left output channel 290L and the right output channel 290R which represent the channels of the output signal of the audio system 200.
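- The final stage described above (divider 240, channel combiners 260C and 260D, and output gain 280) can be sketched as follows. This is an illustration under assumptions, not the disclosed implementation: the combiners are modeled as sample-wise sums, the divider is modeled as producing two identical copies with a hypothetical `scale` factor, the center input is assumed to be already shelf-filtered, and all names are made up for the example.

```python
def combine(*channels):
    """Sample-wise sum of equally long channels (models a channel combiner)."""
    return [sum(samples) for samples in zip(*channels)]


def divide(channel, scale=1.0):
    """Split a mono channel into left and right copies.

    The scale factor is a hypothetical parameter; the text does not say
    how the divided channels are weighted.
    """
    left = [scale * x for x in channel]
    return left, list(left)


def output_stage(xtc_left, xtc_right, center, lfe, output_gain_db=0.0):
    """Mix crosstalk cancelled channels with center and low frequency channels."""
    center_l, center_r = divide(center)
    lfe_l, lfe_r = divide(lfe)
    gain = 10.0 ** (output_gain_db / 20.0)
    left = [gain * x for x in combine(xtc_left, center_l, lfe_l)]
    right = [gain * x for x in combine(xtc_right, center_r, lfe_r)]
    return left, right


# Hypothetical usage with two samples per channel.
out_l, out_r = output_stage([0.1, 0.2], [0.3, 0.4], [0.5, 0.5], [0.25, 0.25])
```

- In this arrangement the center and low frequency content bypasses crosstalk cancellation; the alternative described above would instead add those channels before the crosstalk cancellation processor.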
- FIG. 3 illustrates an example of a subband spatial processor 230, according to one embodiment.
- the subband spatial processor 230 is an example of the subband spatial processors 230A, 230B, or 230C of the audio system 200.
- the subband spatial processor 230 includes a spatial frequency band divider 340, a spatial frequency band processor 345, and a spatial frequency band combiner 350.
- the spatial frequency band divider 340 is coupled to the spatial frequency band processor 345, and the spatial frequency band processor 345 is coupled to the spatial frequency band combiner 350.
- the spatial frequency band divider 340 includes an L/R to M/S converter 312 that receives a left input channel X_L and a right input channel X_R, and converts these inputs into a nonspatial component X_m and a spatial component X_s.
- the spatial component X_s may be generated by taking the difference between the left input channel X_L and the right input channel X_R.
- the nonspatial component X_m may be generated by adding the left input channel X_L and the right input channel X_R.
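- The L/R to M/S conversion described above can be sketched as a sample-wise sum and difference. The sign convention for the spatial component (left minus right) and the factor of 0.5 in the inverse conversion are assumptions chosen so that the round trip returns the original channels; the function names are hypothetical.

```python
def lr_to_ms(left, right):
    """Convert left/right channels to nonspatial (mid) and spatial (side)."""
    mid = [l + r for l, r in zip(left, right)]
    side = [l - r for l, r in zip(left, right)]  # assumed sign: left minus right
    return mid, side


def ms_to_lr(mid, side):
    """Inverse conversion; the 0.5 factor undoes the unscaled sum/difference."""
    left = [0.5 * (m + s) for m, s in zip(mid, side)]
    right = [0.5 * (m - s) for m, s in zip(mid, side)]
    return left, right


# Hypothetical usage: content identical in both channels lands only in mid.
mid, side = lr_to_ms([1.0, 0.5], [0.5, 0.5])
left, right = ms_to_lr(mid, side)
```

- Signal content that is identical in both channels appears only in the nonspatial component, while content that differs between the channels appears in the spatial component, which is why the two can be equalized separately.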
- the spatial frequency band processor 345 receives the nonspatial component X_m and applies a set of subband filters to generate the enhanced nonspatial subband component E_m.
- the spatial frequency band processor 345 also receives the spatial subband component X_s and applies a set of subband filters to generate the enhanced spatial subband component E_s.
- the subband filters can include various combinations of peak filters, notch filters, low pass filters, high pass filters, low shelf filters, high shelf filters, bandpass filters, bandstop filters, and/or all pass filters.
- the spatial frequency band processor 345 includes a subband filter for each of n frequency subbands of the nonspatial component X_m and a subband filter for each of the n frequency subbands of the spatial component X_s.
- the spatial frequency band processor 345 includes a series of subband filters for the nonspatial component X_m including a mid equalization (EQ) filter 362(1) for the subband (1), a mid EQ filter 362(2) for the subband (2), a mid EQ filter 362(3) for the subband (3), and a mid EQ filter 362(4) for the subband (4).
- Each mid EQ filter 362 applies a filter to a frequency subband portion of the nonspatial component X_m to generate the enhanced nonspatial component E_m.
- the spatial frequency band processor 345 further includes a series of subband filters for the frequency subbands of the spatial component X_s, including a side equalization (EQ) filter 364(1) for the subband (1), a side EQ filter 364(2) for the subband (2), a side EQ filter 364(3) for the subband (3), and a side EQ filter 364(4) for the subband (4).
- Each side EQ filter 364 applies a filter to a frequency subband portion of the spatial component X_s to generate the enhanced spatial component E_s.
- Each of the n frequency subbands of the nonspatial component X_m and the spatial component X_s may correspond with a range of frequencies.
- the frequency subband (1) may correspond to 0 to 300 Hz.
- the frequency subband (2) may correspond to 300 to 510 Hz.
- the frequency subband (3) may correspond to 510 to 2700 Hz.
- the frequency subband (4) may correspond to 2700 Hz to the Nyquist frequency.
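As an illustration of the example band edges above, the following sketch maps a frequency to its subband number. The function name, the 48 kHz default sampling rate, and the choice to assign a frequency that falls exactly on an edge to the upper subband are all assumptions made for this example.

```python
import bisect

# Upper edges (Hz) of subbands 1 to 3 from the example ranges; subband 4 runs to Nyquist.
SUBBAND_EDGES_HZ = [300.0, 510.0, 2700.0]


def subband_index(freq_hz, sample_rate_hz=48000.0):
    """Return the 1-based subband number that contains freq_hz."""
    nyquist = sample_rate_hz / 2.0
    if not 0.0 <= freq_hz <= nyquist:
        raise ValueError("frequency must lie between 0 Hz and the Nyquist frequency")
    # bisect_right counts how many edges are <= freq_hz, so an edge frequency
    # is placed in the upper subband (an assumption, the text does not specify this).
    return bisect.bisect_right(SUBBAND_EDGES_HZ, freq_hz) + 1
```

Because the text notes that the ranges and the number of subbands may be adjustable, the edge list is kept as data rather than hard-coded into the lookup.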
- the n frequency subbands are a consolidated set of critical bands.
- the critical bands may be determined using a corpus of audio samples from a wide variety of musical genres. A long term average energy ratio of mid to side components over the 24 Bark scale critical bands is determined from the samples. Contiguous frequency bands with similar long term average ratios are then grouped together to form the set of critical bands.
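The grouping of contiguous bands with similar long term mid to side ratios can be sketched as below. The text does not say how "similar" is measured, so the tolerance test against the first band of each group, the function name, and the sample ratios are hypothetical and used only for illustration.

```python
def consolidate_bands(ratios_db, tolerance_db=3.0):
    """Group contiguous bands whose ratio stays within tolerance_db of the group's first band.

    Returns a list of (first_band, last_band) index pairs, inclusive.
    """
    if not ratios_db:
        return []
    groups = []
    start = 0
    for i in range(1, len(ratios_db)):
        # Start a new group once the ratio drifts too far from the group's first band.
        if abs(ratios_db[i] - ratios_db[start]) > tolerance_db:
            groups.append((start, i - 1))
            start = i
    groups.append((start, len(ratios_db) - 1))
    return groups


# Made-up long term mid/side energy ratios (dB) for 8 bands, for illustration only.
example_ratios = [12.0, 11.0, 10.5, 6.0, 5.5, 1.0, 0.5, 0.0]
example_groups = consolidate_bands(example_ratios)
```

With real data there would be 24 Bark band ratios as input, and each resulting group would become one consolidated subband.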
- the range of the frequency subbands, as well as the number of frequency subbands, may be adjustable.
- the mid EQ filters 362 or side EQ filters 364 may include a biquad filter, having a transfer function defined by Equation 2: $H(z) = \frac{b_0 + b_1 z^{-1} + b_2 z^{-2}}{a_0 + a_1 z^{-1} + a_2 z^{-2}}$, where z is a complex variable.
- the filter may be implemented using a direct form I topology as defined by Equation 3: $Y[n] = \frac{b_0}{a_0} X[n] + \frac{b_1}{a_0} X[n-1] + \frac{b_2}{a_0} X[n-2] - \frac{a_1}{a_0} Y[n-1] - \frac{a_2}{a_0} Y[n-2]$
- the biquad can then be used to implement any second-order filter with real-valued inputs and outputs.
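A direct form I biquad computes each output sample as y[n] = (b0 x[n] + b1 x[n-1] + b2 x[n-2] - a1 y[n-1] - a2 y[n-2]) / a0. The sketch below is one straightforward way to write that recurrence; the function name and the tuple layout of the coefficients are assumptions for this example.

```python
def biquad_df1(x, b, a):
    """Filter the sample list x with a biquad in direct form I.

    b = (b0, b1, b2) are feedforward coefficients, a = (a0, a1, a2) are feedback coefficients.
    """
    # Normalize every coefficient by a0 so the recurrence needs no division per sample.
    b0, b1, b2 = (c / a[0] for c in b)
    a1, a2 = a[1] / a[0], a[2] / a[0]
    x1 = x2 = y1 = y2 = 0.0  # previous two inputs and previous two outputs
    y = []
    for xn in x:
        yn = b0 * xn + b1 * x1 + b2 * x2 - a1 * y1 - a2 * y2
        x2, x1 = x1, xn
        y2, y1 = y1, yn
        y.append(yn)
    return y
```

Feeding a unit impulse through the function returns the filter's impulse response, which is a convenient way to check a coefficient set by hand.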
- to design a discrete-time filter, a continuous-time filter is designed and then transformed into discrete time via a bilinear transform. Furthermore, compensation for any resulting shifts in center frequency and bandwidth may be achieved using frequency warping.
- a peaking filter may include an S-plane transfer function defined by Equation 4: $H(s) = \frac{s^2 + s(A/Q) + 1}{s^2 + s/(AQ) + 1}$, where s is a complex variable, A is the amplitude of the peak, and Q is the filter “quality”.
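One common way to turn a peaking prototype into biquad coefficients with a frequency-warped bilinear transform is the set of formulas from the widely used Audio EQ Cookbook by R. Bristow-Johnson. The sketch below uses those formulas as an assumption; the patent text shown here does not give coefficient expressions. In the cookbook convention the parameter A is 10^(gain_dB/40), so the gain at the center frequency is A squared. The function names and the 48 kHz default are also assumptions.

```python
import cmath
import math


def peaking_biquad(f0_hz, gain_db, q, fs_hz=48000.0):
    """Return (b, a) biquad coefficients for a peaking EQ (Audio EQ Cookbook form)."""
    amp = 10.0 ** (gain_db / 40.0)       # cookbook "A": square root of the linear peak gain
    w0 = 2.0 * math.pi * f0_hz / fs_hz   # center frequency in radians per sample
    alpha = math.sin(w0) / (2.0 * q)
    cos_w0 = math.cos(w0)
    b = (1.0 + alpha * amp, -2.0 * cos_w0, 1.0 - alpha * amp)
    a = (1.0 + alpha / amp, -2.0 * cos_w0, 1.0 - alpha / amp)
    return b, a


def magnitude(b, a, f_hz, fs_hz=48000.0):
    """Magnitude of the biquad's frequency response at f_hz."""
    z1 = cmath.exp(-1j * 2.0 * math.pi * f_hz / fs_hz)  # z^-1 evaluated on the unit circle
    num = b[0] + b[1] * z1 + b[2] * z1 * z1
    den = a[0] + a[1] * z1 + a[2] * z1 * z1
    return abs(num / den)


b, a = peaking_biquad(1000.0, 6.0, 0.7)
peak_db = 20.0 * math.log10(magnitude(b, a, 1000.0))
```

Evaluating the response at the center frequency and at 0 Hz is a quick sanity check: the first should equal the requested gain and the second should be unity.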
- the spatial frequency band combiner 350 receives mid and side components, applies gains to each of the components, and converts the mid and side components into left and right channels.
- the spatial frequency band combiner 350 receives the enhanced nonspatial component E m and the enhanced spatial component E s , and performs global mid and side gains before converting the enhanced nonspatial component E m and the enhanced spatial component E s into the left spatially enhanced channel EL and the right spatially enhanced channel ER.
- the spatial frequency band combiner 350 includes a global mid gain 322, a global side gain 324, and an M/S to L/R converter 326 coupled to the global mid gain 322 and the global side gain 324.
- the global mid gain 322 receives the enhanced nonspatial component E m and applies a gain.
- the global side gain 324 receives the enhanced spatial component E s and applies a gain.
- the M/S to L/R converter 326 receives the enhanced nonspatial component E m from the global mid gain 322 and the enhanced spatial component E s from the global side gain 324, and converts these inputs into the left spatially enhanced channel EL and the right spatially enhanced channel ER.
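The global gains followed by the M/S to L/R conversion can be sketched as below. The function name and the decibel parameters are assumptions, and the conversion left = mid + side, right = mid - side assumes the forward conversion used mid = (L + R) / 2 and side = (L - R) / 2.

```python
def ms_to_lr(mid, side, mid_gain_db=0.0, side_gain_db=0.0):
    """Apply global mid/side gains, then convert back to left/right sample lists."""
    g_m = 10.0 ** (mid_gain_db / 20.0)   # global mid gain 322 as a linear factor
    g_s = 10.0 ** (side_gain_db / 20.0)  # global side gain 324 as a linear factor
    left = [g_m * m + g_s * s for m, s in zip(mid, side)]
    right = [g_m * m - g_s * s for m, s in zip(mid, side)]
    return left, right


e_l, e_r = ms_to_lr([1.0, 0.0], [0.0, 0.5])
```

Raising the side gain relative to the mid gain widens the stereo image, and lowering it narrows the image.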
- FIG. 4 illustrates a crosstalk cancellation processor 270, according to one example embodiment.
- the crosstalk cancellation processor 270 receives the left spatially enhanced channel EL as input from the left channel combiner 260A and the right spatially enhanced channel ER as input from the right channel combiner 260B, and performs crosstalk cancellation on the channels EL, ER to generate the left output channel OL, and the right output channel OR.
- the crosstalk cancellation processor 270 includes an in-out band divider 410, inverters 420 and 422, contralateral estimators 430 and 440, combiners 450 and 452, and an in-out band combiner 460. These components operate together to divide the input channels EL, ER into in-band components and out-of-band components, and perform a crosstalk cancellation on the in-band components to generate the output channels OL, OR.
- crosstalk cancellation can be performed for a particular frequency band while obviating degradations in other frequency bands. If crosstalk cancellation is performed without dividing the input audio signal E into different frequency bands, the audio signal after such crosstalk cancellation may exhibit significant attenuation or amplification in the nonspatial and spatial components in low frequency (e.g., below 350 Hz), higher frequency (e.g., above 12000 Hz), or both.
- the in-out band divider 410 separates the input channels EL, ER into in-band channels EL,in, ER,in and out-of-band channels EL,out, ER,out, respectively. Particularly, the in-out band divider 410 divides the left enhanced compensation channel EL into a left in-band channel EL,in and a left out-of-band channel EL,out. Similarly, the in-out band divider 410 separates the right enhanced compensation channel ER into a right in-band channel ER,in and a right out-of-band channel ER,out. Each in-band channel may encompass a portion of a respective input channel corresponding to a frequency range including, for example, 250 Hz to 14 kHz. The range of frequency bands may be adjustable, for example according to speaker parameters.
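A minimal sketch of an in-band/out-of-band split follows. The text does not specify the divider's filters, so the one-pole low-pass filters used here are an assumption chosen for brevity; a practical divider would likely use steeper crossover filters. The out-of-band signal is defined as the input minus the in-band signal, so the two parts always sum back to the input.

```python
import math


def one_pole_lowpass(x, cutoff_hz, fs_hz):
    """Very gentle first-order low-pass filter (illustrative only)."""
    k = 1.0 - math.exp(-2.0 * math.pi * cutoff_hz / fs_hz)
    y, state = [], 0.0
    for xn in x:
        state += k * (xn - state)
        y.append(state)
    return y


def split_in_out_band(x, low_hz=250.0, high_hz=14000.0, fs_hz=48000.0):
    """Split x into an in-band part (roughly low_hz..high_hz) and the remainder."""
    below_high = one_pole_lowpass(x, high_hz, fs_hz)         # content under about 14 kHz
    below_low = one_pole_lowpass(below_high, low_hz, fs_hz)  # content under about 250 Hz
    in_band = [h - l for h, l in zip(below_high, below_low)]
    out_band = [xn - b for xn, b in zip(x, in_band)]         # everything not in band
    return in_band, out_band


dc_input = [1.0] * 2000
dc_in_band, dc_out_band = split_in_out_band(dc_input)
```

For a constant (0 Hz) input the in-band part decays to zero, which matches the intent of keeping very low frequencies out of the crosstalk cancellation path.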
- the inverter 420 and the contralateral estimator 430 operate together to generate a left contralateral cancellation component SL to compensate for a contralateral sound component due to the left in-band channel EL,in.
- the inverter 422 and the contralateral estimator 440 operate together to generate a right contralateral cancellation component SR to compensate for a contralateral sound component due to the right in-band channel ER,in.
- the inverter 420 receives the in-band channel EL,in and inverts a polarity of the received in-band channel EL,in to generate an inverted in-band channel EL,in’.
- the contralateral estimator 430 receives the inverted in-band channel EL,in’, and extracts a portion of the inverted in-band channel EL,in’ corresponding to a contralateral sound component through filtering. Because the filtering is performed on the inverted in-band channel EL,in’, the portion extracted by the contralateral estimator 430 becomes an inverse of a portion of the in-band channel EL,in attributing to the contralateral sound component.
- the portion extracted by the contralateral estimator 430 becomes a left contralateral cancellation component SL, which can be added to a counterpart in-band channel ER,in to reduce the contralateral sound component due to the in-band channel EL,in.
- in some embodiments, the inverter 420 and the contralateral estimator 430 are implemented in a different sequence.
- the contralateral estimator 430 includes a filter 432, an amplifier 434, and a delay unit 436.
- the filter 432 receives the inverted in-band channel EL,in’ and extracts a portion of the inverted in-band channel EL,in’ corresponding to a contralateral sound component through a filtering function.
- An example filter implementation is a Notch or Highshelf filter with a center frequency selected between 5000 and 10000 Hz, and Q selected between 0.5 and 1.0.
- Gain in decibels (G dB) may be derived from Equation 5:
- $G_{dB} = -3.0 - \log_{1.333}(D)$ Eq. (5)
- where D is a delay amount applied by the delay units 436 and 446, in samples, for example, at a sampling rate of 48 kHz.
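Reading Equation 5 as G_dB = -3.0 - log base 1.333 of D, the gain can be computed directly from the delay in samples. The function name is an assumption, and D must be positive for the logarithm to be defined.

```python
import math


def contralateral_gain_db(delay_samples):
    """Gain in dB from Equation 5: -3.0 minus the base-1.333 logarithm of the delay D."""
    if delay_samples <= 0:
        raise ValueError("delay must be a positive number of samples")
    return -3.0 - math.log(delay_samples, 1.333)


gain_at_one_sample = contralateral_gain_db(1)
```

A delay of one sample gives exactly -3 dB, and the gain decreases further as the delay grows.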
- An alternate implementation is a Lowpass filter with a corner frequency selected between 5000 and 10000 Hz, and Q selected between 0.5 and 1.0.
- the amplifier 434 amplifies the extracted portion by a corresponding gain coefficient GL,in, and the delay unit 436 delays the amplified output from the amplifier 434 according to a delay function D to generate the left contralateral cancellation component SL.
- the contralateral estimator 440 includes a filter 442, an amplifier 444, and a delay unit 446 that perform similar operations on the inverted in-band channel ER,in’ to generate the right contralateral cancellation component SR.
- the contralateral estimators 430, 440 generate the left and right contralateral cancellation components SL, SR according to the equations below, where F[ ] is the filter function and D[ ] is the delay function: $S_L = D[G_{L,in} \cdot F[E_{L,in}']]$ and $S_R = D[G_{R,in} \cdot F[E_{R,in}']]$
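The inverter, filter, amplifier, and delay chain that produces a contralateral cancellation component can be sketched as follows. The function name is hypothetical, the filter is passed in as a callable because the text allows several filter types, and the gain is a linear factor rather than a decibel value.

```python
def contralateral_cancellation(in_band, filt, gain, delay_samples):
    """Produce a cancellation component from one in-band channel.

    filt is any callable mapping a sample list to a sample list of the same length.
    """
    inverted = [-s for s in in_band]             # inverter 420 / 422: flip polarity
    extracted = filt(inverted)                   # filter 432 / 442: isolate contralateral portion
    amplified = [gain * s for s in extracted]    # amplifier 434 / 444: scale by linear gain
    delayed = [0.0] * delay_samples + amplified  # delay unit 436 / 446: shift by whole samples
    return delayed[:len(in_band)]                # keep the original block length


# Identity filter used purely to make the arithmetic easy to follow.
s_example = contralateral_cancellation([1.0, 2.0, 3.0, 4.0], lambda s: list(s), 0.5, 2)
```

In practice the callable would be a notch, high shelf, or low-pass filter with parameters chosen from the speaker geometry.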
- the configurations of the crosstalk cancellation can be determined by the speaker parameters.
- filter center frequency, delay amount, amplifier gain, and filter gain can be determined according to an angle formed between two output speakers of the output signal with respect to a listener, or other features of the speakers such as relative position, power, etc.
- values between the speaker angles are used to interpolate other values.
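Interpolating a crosstalk parameter between known speaker angles can be sketched with simple linear interpolation. The table values below are invented for illustration and do not come from the patent, and linear interpolation itself is an assumption since the text does not name the interpolation method.

```python
import bisect

# Hypothetical lookup table: speaker angle in degrees -> delay in samples.
ANGLE_TO_DELAY = [(10.0, 2.0), (20.0, 4.0), (30.0, 7.0)]


def interpolate_param(angle_deg, table=ANGLE_TO_DELAY):
    """Linearly interpolate a parameter value for angle_deg from a sorted table."""
    angles = [angle for angle, _ in table]
    if angle_deg <= angles[0]:
        return table[0][1]    # clamp below the first tabulated angle
    if angle_deg >= angles[-1]:
        return table[-1][1]   # clamp above the last tabulated angle
    i = bisect.bisect_right(angles, angle_deg)
    (a0, v0), (a1, v1) = table[i - 1], table[i]
    t = (angle_deg - a0) / (a1 - a0)
    return v0 + t * (v1 - v0)
```

The same helper could be reused for filter center frequency or gain by supplying a different table.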
- the combiner 450 combines the right contralateral cancellation component SR with the left in-band channel EL,in to generate a left in-band compensation channel UL, and the combiner 452 combines the left contralateral cancellation component SL with the right in-band channel ER,in to generate a right in-band compensation channel UR.
- the in-out band combiner 460 combines the left in-band compensation channel UL with the out-of-band channel EL,out to generate the left output channel OL, and combines the right in-band compensation channel UR with the out-of-band channel ER,out to generate the right output channel OR.
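The two combining stages described above amount to sample-wise additions: each in-band channel is summed with the cancellation component derived from the opposite channel, and the result is summed with the untouched out-of-band channel. The function and argument names below are assumptions for this sketch.

```python
def crosstalk_combine(e_l_in, e_r_in, e_l_out, e_r_out, s_l, s_r):
    """Combine in-band channels with cancellation components, then restore out-of-band content."""
    u_l = [e + s for e, s in zip(e_l_in, s_r)]   # combiner 450: left in-band + S_R
    u_r = [e + s for e, s in zip(e_r_in, s_l)]   # combiner 452: right in-band + S_L
    o_l = [u + e for u, e in zip(u_l, e_l_out)]  # combiner 460: add left out-of-band channel
    o_r = [u + e for u, e in zip(u_r, e_r_out)]  # combiner 460: add right out-of-band channel
    return o_l, o_r


o_l, o_r = crosstalk_combine(
    e_l_in=[1.0, 1.0], e_r_in=[0.5, 0.5],
    e_l_out=[0.25, 0.25], e_r_out=[0.0, 0.0],
    s_l=[0.0, -0.125], s_r=[0.0, -0.25],
)
```

Because the cancellation components already carry inverted polarity, plain addition is enough to subtract the estimated crosstalk.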
- the left output channel OL includes the right contralateral cancellation component SR corresponding to an inverse of a portion of the in-band channel ER,in attributable to the contralateral sound.
- the right output channel OR includes the left contralateral cancellation component SL corresponding to an inverse of a portion of the in-band channel EL,in attributable to the contralateral sound.
- a wavefront of an ipsilateral sound component output by a right speaker (e.g., speaker 110R) according to the right output channel OR arrived at the right ear can cancel a wavefront of a contralateral sound component output by a left speaker (e.g., speaker 110L) according to the left output channel OL.
- a wavefront of an ipsilateral sound component output by the left speaker according to the left output channel OL arrived at the left ear can cancel a wavefront of a contralateral sound component output by the right speaker according to right output channel OR.
- contralateral sound components can be reduced to enhance spatial detectability.
- FIG. 5 illustrates an example of a method 500 for enhancing an audio signal with the audio system 200 shown in FIG. 2, according to one embodiment.
- the method 500 may include different and/or additional steps, or some steps may be in different orders.
- the audio system 200 receives 505 a multi-channel input audio signal.
- the multi-channel audio signal may be a surround sound audio signal including a left input channel, a right input channel, at least one left peripheral input channel, and at least one right peripheral input channel.
- the multi-channel audio signal may further include the center input channel 210C and the low frequency input channel 210D.
- the input audio signal may be for a 7.1 surround sound system including the left input channel 210A and the right input channel 210B, and peripheral channels including the left surround input channel 210E, the right surround input channel 210F, the left surround rear input channel 210G, and the right surround rear input channel 210H.
- the peripheral channels may include a single left peripheral channel and a single right peripheral channel.
- the audio system 200 applies 510 gains to the channels of the multi-channel input audio signal.
- the gains 215A through 215H may vary to control the contribution of particular input channels to the output signal generated by the audio system 200.
- the center channel 210C receives a negative gain while the peripheral input channels receive a positive gain.
- the audio system 200 (e.g., subband spatial processor 230A) generates 515 a left spatially enhanced channel and a right spatially enhanced channel by performing subband spatial processing on the left input channel and the right input channel.
- the subband spatial processor 230A generates the spatially enhanced channels by adjusting gains of n subbands of the mid component and the side component of the left input channel 210A and the right input channel 210B.
- the audio system 200 (e.g., subband spatial processor 230B and/or 230C) generates 520 a left spatially enhanced peripheral channel and a right spatially enhanced peripheral channel by performing subband spatial processing on the left peripheral input channel and the right peripheral input channel.
- the subband spatial processor 230B adjusts gains of n subbands of the mid component and the side component of the left surround channel 210E and the right surround channel 210F to generate left and right spatially enhanced peripheral channels.
- the subband spatial processor 230C adjusts gains of the n subbands of the mid component and the side component of the left surround rear channel 210G and the right surround rear channel 210H to generate left and right spatially enhanced peripheral channels.
- the audio system 200 applies 525 a binaural filter to each of the left and right spatially enhanced peripheral channels.
- the binaural filter 250A generates a left and right output channel from the left spatially enhanced peripheral channel output from the subband spatial processor 230B by applying a head-related transfer function (HRTF).
- the binaural filter 250B generates a left and right output channel from the spatially enhanced right channel output from the subband spatial processor 230B by applying a HRTF.
- the binaural filter 250C generates a left and right output channel from the spatially enhanced left channel output from the subband spatial processor 230C by applying a HRTF.
- the binaural filter 250D generates a left and right output channel from the spatially enhanced right channel output from the subband spatial processor 230C by applying a HRTF.
- in some embodiments, the binaural filtering is bypassed.
- the audio system 200 applies 530 a high shelf filter to the center input channel 210C.
- a gain is applied to the center input channel 210C.
- the high shelf filter 220 separates the center input channel 210C into a left center channel and a right center channel.
- the audio system 200 separates 535 the low frequency input channel into left and right low frequency channels.
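- the audio system 200 (e.g., left channel combiner 260A) combines the left spatially enhanced channel with the left output channels of the binaural filters to generate a left combined channel.
- the left spatially enhanced channel may be added with the left output channels.
- the audio system 200 (e.g., right channel combiner 260B) combines the right spatially enhanced channel with the right output channels of the binaural filters to generate a right combined channel.
- the right spatially enhanced channel may be added with the right output channels.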
- the audio system 200 (e.g., crosstalk cancellation processor 270) performs 550 a crosstalk cancellation on the left combined channel and the right combined channel to generate a left crosstalk cancelled channel and a right crosstalk cancelled channel.
- the audio system 200 (e.g., left channel combiner 260C and right channel combiner 260D) combines 555 the left crosstalk cancelled channel from the crosstalk cancellation processor 270 with the left low frequency channel from the divider 240 and the left center channel from the high shelf filter 220 to generate a left output channel, and combines the right crosstalk cancelled channel from the crosstalk cancellation processor 270 with the right low frequency channel from the divider 240 and the right center channel from the high shelf filter 220 to generate a right output channel.
- the audio system 200 (e.g., output gain 280) may apply gains to each of the left and right output channels.
- the audio system 200 outputs an output audio signal including the left and right output channels 290L and 290R.
- FIG. 6 illustrates an example of an audio system 600, according to one embodiment.
- the audio system 600 may be similar to the audio system 200, but may differ from the audio system 200 at least in that the left and right input channels are combined with the left and right peripheral channels prior to subband spatial processing for the audio system 600.
- a single subband spatial processor and corresponding subband spatial processing step may be used rather than separate subband spatial processors for left-right speaker pairs as shown for the audio system 200.
- the audio system 600 receives an input audio signal.
- the input audio signal may include a left input channel 610A, a right input channel 610B, a center input channel 610C, a low frequency input channel 610D, a left surround input channel 610E, a right surround input channel 610F, a left surround rear input channel 610G, and a right surround rear input channel 610H.
- the channels 610E, 610F, 610G, and 610H are examples of peripheral channels that may be provided to surround speakers.
- the audio system 600 may receive and process an input audio signal having fewer or more channels.
- the audio system 600 generates an output signal including a left output channel 690L and a right output channel 690R using enhancements such as subband spatial processing and crosstalk cancellation on the input audio signal.
- the left output channel 690L may be provided to a left speaker and the right output channel 690R may be output to a right speaker.
- the output audio signal provides a spatial sense of the sound field associated with the surround sound input audio signal using left and right speakers (e.g., left speaker 110L and right speaker 110R).
- the audio system 600 includes gains 615A, 615B, 615C, 615D, 615E, 615F, 615G, and 615H, a high shelf filter 620, a divider 640, binaural filters 650A, 650B, 650C, and 650D, a left channel combiner 660A, a right channel combiner 660B, a subband spatial processor 630, a crosstalk cancellation processor 670, a left channel combiner 660C, a right channel combiner 660D, and an output gain 680.
- Each of the gains 615A through 615H may receive a respective input channel 610A through 610H, and may apply a gain to that input channel.
- the gains 615A through 615H may be different to adjust gains of the input channels with respect to each other, or may be the same.
- positive gains are applied to the left and right peripheral input channels 610E, 610F, 610G, and 610H, and a negative gain is applied to the center channel 610C.
- the gain 615A may apply a 0 dB gain.
- the gain 615B may apply a 0 dB gain.
- the gain 615C may apply a -3 dB gain.
- the gain 615D may apply a 0 dB gain.
- the gain 615E may apply a 3 dB gain.
- the gain 615F may apply a 3 dB gain.
- the gain 615G may apply a 3 dB gain.
- the gain 615H may apply a 3 dB gain.
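The example decibel gains listed above translate to linear amplitude factors through 10^(dB/20). The sketch below keys the example values by reference numeral; the dictionary and function names are assumptions for illustration.

```python
# Example gains in dB from the list above, keyed by gain reference numeral.
GAINS_DB = {
    "615A": 0.0, "615B": 0.0, "615C": -3.0, "615D": 0.0,
    "615E": 3.0, "615F": 3.0, "615G": 3.0, "615H": 3.0,
}


def db_to_linear(gain_db):
    """Convert a gain in decibels to a linear amplitude factor."""
    return 10.0 ** (gain_db / 20.0)


def apply_gain(samples, gain_db):
    """Scale every sample of a channel by the given dB gain."""
    g = db_to_linear(gain_db)
    return [g * s for s in samples]


center_scaled = apply_gain([1.0, -1.0], GAINS_DB["615C"])
```

A -3 dB gain scales amplitude by roughly 0.708, and a +3 dB gain scales it by roughly 1.413.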
- the gain 615A for the left input channel 610A is coupled to the left channel combiner 660A.
- the gain 615B for the right input channel 610B is coupled to the right channel combiner 660B.
- the gain 615C is coupled to the high shelf filter 620.
- the gain 615D is coupled to the divider 640.
- the gains 615E, 615F, 615G, and 615H of the peripheral input channels are each coupled to a binaural filter 650.
- the gain 615E is coupled to the binaural filter 650A
- the gain 615F is coupled to the binaural filter 650B
- the gain 615G is coupled to the binaural filter 650C
- the gain 615H is coupled to the binaural filter 650D.
- Each of the binaural filters 650A, 650B, 650C, and 650D applies a head-related transfer function (HRTF) that describes the target source location from which the listener should perceive the sound of the input channel.
- Each binaural filter receives an input channel and generates a left and right output channel by applying the HRTF.
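In the time domain, applying an HRTF is commonly done by convolving the channel with a pair of head-related impulse responses, one per ear. The sketch below takes that approach as an assumption; the function names are hypothetical and the impulse responses in the example are placeholders, not measured data.

```python
def convolve(signal, impulse_response):
    """Direct-form linear convolution of two sample lists."""
    if not signal or not impulse_response:
        return []
    out = [0.0] * (len(signal) + len(impulse_response) - 1)
    for i, s in enumerate(signal):
        for j, h in enumerate(impulse_response):
            out[i + j] += s * h
    return out


def binaural_filter(channel, hrir_left, hrir_right):
    """Return (left_output, right_output) for one input channel."""
    return convolve(channel, hrir_left), convolve(channel, hrir_right)


# Placeholder impulse responses: the right ear hears a delayed, quieter copy.
out_left, out_right = binaural_filter([1.0, 0.0], [1.0], [0.0, 0.5])
```

Measured impulse responses for the target angle of each surround channel would replace the placeholders in a real system.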
- the discussion of the binaural filters 250A, 250B, 250C, and 250D of the audio system 200 may be applicable to the binaural filters 650A, 650B, 650C, and 650D.
- each of the binaural filters 650A through 650D may apply an adjustment for the angular positions associated with their respective input channel.
- one or more of the binaural filters 650A through 650D may be bypassed, or omitted from the audio system 600.
- the left channel combiner 660A is coupled to the gain 615A and the binaural filters 650A through 650D.
- the left channel combiner 660A receives the left output channels of the binaural filters 650A through 650D, and combines the left output channels with the output of the gain 615A.
- the right channel combiner 660B is coupled to the gain 615B and the binaural filters 650A through 650D.
- the right channel combiner 660B receives the right output channels of the binaural filters 650A through 650D, and combines the right output channels with the output of the gain 615B.
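Each channel combiner can be thought of as a sample-wise sum of its inputs. The sketch below assumes all inputs have already been brought to the same length; the function name `mix` and the example values are hypothetical.

```python
def mix(channels):
    """Sample-wise sum of a list of equal-length channels."""
    return [sum(samples) for samples in zip(*channels)]


# Example: the gained left input plus the left outputs of two binaural filters.
left_combined = mix([
    [1.0, 2.0],      # output of the left input gain
    [0.5, 0.5],      # left output of one binaural filter
    [0.25, -0.25],   # left output of another binaural filter
])
```

The right combiner would use the same function with the gained right input and the right outputs of the binaural filters.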
- in some embodiments, the binaural filtering is performed subsequent to subband spatial processing.
- a binaural filter may be applied to the left and right outputs of the subband spatial processor 630 as suitable to adjust for angular positions associated with the channels.
- in some embodiments, binaural filters are applied to the peripheral input channels as shown in FIG. 6.
- in some embodiments, binaural filters are also applied to the center input channel 610C or the low frequency input channel 610D.
- in some embodiments, binaural filters are applied to each input channel except the low frequency input channel 610D.
- the subband spatial processor 630 performs subband spatial processing on a left and right input channel by gain adjusting mid and side subband components of the left and right input channels to generate left and right spatially enhanced channels as output.
- the subband spatial processor 630 is coupled to the left channel combiner 660A to receive a left combined channel from the left channel combiner 660A and is coupled to the right channel combiner 660B to receive a right combined channel from the right channel combiner 660B.
- the subband spatial processor 630 processes the left and right channels after combination into the left and right combined channels.
- the audio system 600 may include only a single subband spatial processor 630.
- the subband spatial processor 230 shown in FIG. 3 is an example of the subband spatial processor 630.
- the crosstalk cancellation processor 670 performs crosstalk cancellation on the output of the subband spatial processor 630, which may represent a mixed down stereo signal of the input audio signal.
- the crosstalk cancellation processor 670 receives left and right input channels from the subband spatial processor 630, and performs a crosstalk cancellation to generate left and right crosstalk cancelled channels.
- the crosstalk cancellation processor 670 is coupled to the left channel combiner 660C and the right channel combiner 660D.
- the crosstalk cancellation processor 270 shown in FIG. 4 is an example of the crosstalk cancellation processor 670.
- the high shelf filter 620 receives the center input channel 610C and applies a high frequency shelving or peaking filter.
- the high shelf filter 620 provides a “voice-lift” on the center input channel 610C.
- in some embodiments, the high shelf filter 620 is bypassed, or omitted from the audio system 600.
- the high shelf filter 620 may attenuate frequencies above a corner frequency.
- the high shelf filter 620 is coupled to the left channel combiner 660C and the right channel combiner 660D.
- the high shelf filter 620 is defined by a 750 Hz corner frequency, a +3 dB gain, and 0.8 Q factor.
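The example high shelf parameters (750 Hz, +3 dB, Q of 0.8) can be turned into biquad coefficients with the high shelf formulas from the widely used Audio EQ Cookbook by R. Bristow-Johnson. Using those formulas is an assumption, as are the function names and the 48 kHz sampling rate; the patent text does not state how the coefficients are derived.

```python
import cmath
import math


def high_shelf_biquad(f0_hz, gain_db, q, fs_hz=48000.0):
    """Return (b, a) biquad coefficients for a high shelf (Audio EQ Cookbook form)."""
    amp = 10.0 ** (gain_db / 40.0)
    w0 = 2.0 * math.pi * f0_hz / fs_hz
    cos_w0 = math.cos(w0)
    k = 2.0 * math.sqrt(amp) * math.sin(w0) / (2.0 * q)  # 2 * sqrt(A) * alpha
    b = (amp * ((amp + 1.0) + (amp - 1.0) * cos_w0 + k),
         -2.0 * amp * ((amp - 1.0) + (amp + 1.0) * cos_w0),
         amp * ((amp + 1.0) + (amp - 1.0) * cos_w0 - k))
    a = ((amp + 1.0) - (amp - 1.0) * cos_w0 + k,
         2.0 * ((amp - 1.0) - (amp + 1.0) * cos_w0),
         (amp + 1.0) - (amp - 1.0) * cos_w0 - k)
    return b, a


def gain_db_at(b, a, f_hz, fs_hz=48000.0):
    """Gain of the biquad in dB at f_hz."""
    z1 = cmath.exp(-1j * 2.0 * math.pi * f_hz / fs_hz)  # z^-1 on the unit circle
    num = b[0] + b[1] * z1 + b[2] * z1 * z1
    den = a[0] + a[1] * z1 + a[2] * z1 * z1
    return 20.0 * math.log10(abs(num / den))


shelf_b, shelf_a = high_shelf_biquad(750.0, 3.0, 0.8)
```

With these coefficients the response is flat at 0 Hz and rises to the full +3 dB toward the Nyquist frequency.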
- the high shelf filter 620 generates a left center channel and a right center channel as output.
- the divider 640 receives the low frequency input channel 610D, and separates the low frequency input channel 610D into left and right low frequency channels.
- the divider 640 is coupled to the left channel combiner 660C and the right channel combiner 660D, and provides the left low frequency channel to the left channel combiner 660C and the right low frequency channel to the right channel combiner 660D.
- the left channel combiner 660C is coupled to the crosstalk cancellation processor 670, the high shelf filter 620, and the divider 640.
- the left channel combiner 660C receives the left crosstalk cancelled channel from the crosstalk cancellation processor 670, the left center channel from the high shelf filter 620, and the left low frequency channel from the divider 640, and combines these channels into a left output channel.
- the right channel combiner 660D is coupled to the crosstalk cancellation processor 670, the high shelf filter 620, and the divider 640.
- the right channel combiner 660D receives the right crosstalk cancelled channel from the crosstalk cancellation processor 670, the right center channel from the high shelf filter 620, and the right low frequency channel from the divider 640, and combines these channels into a right output channel.
- in some embodiments, the left center channel from the high shelf filter 620 and the left low frequency channel from the divider 640 are combined by the left channel combiner 660A with the left output channels of the binaural filters 650A through 650D and the output of the gain 615A to generate a left combined channel.
- in such embodiments, the right center channel from the high shelf filter 620 and the right low frequency channel from the divider 640 are combined by the right channel combiner 660B with the right output channels of the binaural filters 650A through 650D and the output of the gain 615B to generate a right combined channel.
- the left and right combined channels are input into the subband spatial processor 630 and the crosstalk cancellation processor 670.
- the center and low frequency channels thereby receive the subband spatial processing and crosstalk cancellation operations.
- the left channel combiner 660C and right channel combiner 660D may be omitted.
- in some embodiments, only one of the center or low frequency channels receives the subband spatial processing and crosstalk cancellation operations.
- the output gain 680 is coupled to left channel combiner 660C and the right channel combiner 660D.
- the output gain 680 applies a gain to the left output channel from the left channel combiner 660C, and applies a gain to the right output channel from the right channel combiner 660D.
- the output gain 680 may apply the same gain to the left and right output channels, or may apply different gains.
- the output gain 680 outputs the left output channel 690L and the right output channel 690R which represent the channels of the output signal of the audio system 600.
- FIG. 7 illustrates an example of a method 700 for enhancing an audio signal with the audio system 600 shown in FIG. 6, according to one embodiment.
- the method 700 may include different and/or additional steps, or some steps may be in different orders.
- the audio system 600 receives 705 a multi-channel input audio signal.
- the input audio signal may include a left input channel 610A, a right input channel 610B, at least one left peripheral input channel, and at least one right peripheral input channel.
- the multi-channel audio signal may further include the center input channel 610C and the low frequency input channel 610D.
- the audio system 600 applies 710 gains to the channels of the multi-channel input audio signal.
- the gains 615A through 615H may vary to control the contribution of particular input channels to the output signal generated by the audio system 600.
- the audio system 600 applies 715 a binaural filter to each of the left and right peripheral channels.
- the binaural filter 650A generates a left and right output channel from the left surround input channel 610E by applying a head-related transfer function (HRTF).
- the binaural filter 650B generates a left and right output channel from the right surround input channel 610F by applying a HRTF.
- the binaural filter 650C generates a left and right output channel from the left surround rear input channel 610G by applying a HRTF.
- the binaural filter 650D generates a left and right output channel from the right surround rear input channel 610H by applying a HRTF.
- the audio system 600 applies 720 a high shelf filter to the center input channel 610C.
- a gain is applied to the center input channel 610C.
- the high shelf filter 620 separates the center input channel 610C into a left center channel and a right center channel.
- the audio system 600 separates 725 the low frequency input channel into left and right low frequency channels.
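- the audio system 600 (e.g., left channel combiner 660A) combines the left input channel with the left output channels of the binaural filters to generate a left combined channel.
- the audio system 600 (e.g., right channel combiner 660B) combines the right input channel with the right output channels of the binaural filters to generate a right combined channel.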
- the audio system 600 (e.g., subband spatial processor 630) generates 740 a left spatially enhanced channel and a right spatially enhanced channel by performing subband spatial processing on the left combined channel and the right combined channel.
- the subband spatial processor 630 receives the left and right combined channels from the left channel combiner 660A and the right channel combiner 660B, and generates the spatially enhanced channels by adjusting gains of n subbands of the mid component and the side component of the left and right combined channels.
- the audio system 600 (e.g., crosstalk cancellation processor 670) performs 745 a crosstalk cancellation on the left and right spatially enhanced channels from the subband spatial processor 630 to generate a left crosstalk cancelled channel and a right crosstalk cancelled channel.
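- the audio system 600 (e.g., left channel combiner 660C and right channel combiner 660D) combines 750 the left crosstalk cancelled channel from the crosstalk cancellation processor 670 with the left low frequency channel from the divider 640 and the left center channel from the high shelf filter 620 to generate a left output channel, and combines the right crosstalk cancelled channel from the crosstalk cancellation processor 670 with the right low frequency channel from the divider 640 and the right center channel from the high shelf filter 620 to generate a right output channel.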
- the audio system 600 may apply gains to each of the left and right output channels.
- the audio system 600 outputs an output audio signal including the left and right output channels 690L and 690R.
- systems and processes described herein may be embodied in an embedded electronic circuit or electronic system.
- the systems and processes also may be embodied in a computing system that includes one or more processing systems (e.g., a digital signal processor) and a memory (e.g., programmed read only memory or programmable solid state memory), or some other circuitry such as an application specific integrated circuit (ASIC) or field-programmable gate array (FPGA) circuit.
- FIG. 8 illustrates an example of a computer system 800, according to one embodiment.
- the audio systems 200 and 600 may be implemented on the system 800.
- the chipset 804 includes a memory controller hub 820 and an input/output (I/O) controller hub 822.
- a memory 806 and a graphics adapter 812 are coupled to the memory controller hub 820, and a display device 818 is coupled to the graphics adapter 812.
- a storage device 808, keyboard 810, pointing device 814, and network adapter 816 are coupled to the I/O controller hub 822.
- Other embodiments of the computer 800 have different architectures.
- the memory 806 is directly coupled to the processor 802 in some embodiments.
- the storage device 808 includes one or more non-transitory computer-readable storage media such as a hard drive, compact disk read-only memory (CD-ROM), DVD, or a solid-state memory device.
- the memory 806 holds instructions and data used by the processor 802.
- the memory 806 may store instructions that, when executed by the processor 802, cause or configure the processor 802 to perform the methods discussed herein, such as the method 500 or 700.
- the pointing device 814 is used in combination with the keyboard 810 to input data into the computer system 800.
- the graphics adapter 812 displays images and other information on the display device 818.
- the display device 818 includes a touch screen capability for receiving user input and selections.
- the network adapter 816 couples the computer system 800 to a network.
- Some embodiments of the computer 800 have different and/or other components than those shown in FIG. 8.
- the computer system 800 may be a server that lacks a display device, keyboard, and other components.
- the computer 800 is adapted to execute computer program modules for providing functionality described herein.
- as used herein, the term "module" refers to computer program instructions and/or other logic used to provide the specified functionality.
- a module can be implemented in hardware, firmware, and/or software.
- program modules formed of executable computer program instructions are stored on the storage device 808, loaded into the memory 806, and executed by the processor 802.
- the disclosed configuration may include a number of benefits and/or advantages.
- a multi-channel input signal can be output to stereo loudspeakers while preserving or enhancing a spatial sense of the sound field.
- a high quality listening experience can be achieved without requiring expensive multi-speaker sound systems, such as on mobile devices, sound bars, or smart speakers.
- a software module is implemented with a computer program product comprising a computer readable medium (e.g., non-transitory computer readable medium) containing computer program code, which can be executed by a computer processor for performing any or all of the steps, operations, or processes described.
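As a rough illustration of the final output stage described above for the audio system 600 (applying a gain to each of the left and right output channels and then emitting them as one output audio signal), the following minimal Python sketch may help. It is not taken from the patent: the function names `apply_output_gains` and `interleave`, the use of decibel-valued gains, and the interleaved L/R output layout are all assumptions made for the example.

```python
from typing import List, Sequence, Tuple


def apply_output_gains(
    left: Sequence[float],
    right: Sequence[float],
    gain_left_db: float = 0.0,
    gain_right_db: float = 0.0,
) -> Tuple[List[float], List[float]]:
    """Scale each output channel by its own gain (hypothetical helper).

    Gains are given in decibels here purely as an assumption; the source
    only says that gains may be applied to each output channel.
    """
    if len(left) != len(right):
        raise ValueError("left and right channels must have the same length")
    # Convert decibels to a linear amplitude factor: g = 10 ** (dB / 20).
    g_left = 10.0 ** (gain_left_db / 20.0)
    g_right = 10.0 ** (gain_right_db / 20.0)
    out_left = [g_left * sample for sample in left]
    out_right = [g_right * sample for sample in right]
    return out_left, out_right


def interleave(left: Sequence[float], right: Sequence[float]) -> List[float]:
    """Pack two channels into one L/R-interleaved stream (assumed layout)."""
    return [sample for pair in zip(left, right) for sample in pair]


if __name__ == "__main__":
    # Attenuate the left channel by 6 dB and leave the right channel as is.
    out_l, out_r = apply_output_gains([0.5, -0.5], [0.25, 0.75], -6.0, 0.0)
    print(interleave(out_l, out_r))
```

In a real-time implementation the same scaling would typically run per audio block on a digital signal processor; the list-based form here is only meant to make the arithmetic explicit.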
Landscapes
- Engineering & Computer Science (AREA)
- Physics & Mathematics (AREA)
- Acoustics & Sound (AREA)
- Signal Processing (AREA)
- Multimedia (AREA)
- Health & Medical Sciences (AREA)
- General Health & Medical Sciences (AREA)
- Otolaryngology (AREA)
- Stereophonic System (AREA)
Priority Applications (6)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| EP19771968.5A EP3769541A4 (en) | 2018-03-22 | 2019-03-20 | MULTI-CHANNEL SUBBAND SPATIAL TREATMENT FOR LOUDSPEAKERS |
| JP2020550867A JP7323544B2 (en) | 2018-03-22 | 2019-03-20 | Multichannel subband spatial processing for loudspeakers |
| CN201980020001.3A CN111869234B (en) | 2018-03-22 | 2019-03-20 | A system, method, and computer-readable medium for processing a multi-channel input audio signal |
| KR1020207030276A KR102195586B1 (en) | 2018-03-22 | 2019-03-20 | Multi-channel subband spatial processing technique for loudspeakers |
| JP2022144496A JP7553522B2 (en) | 2018-03-22 | 2022-09-12 | Multichannel Subband Spatial Processing for Loudspeakers |
| JP2024153106A JP2024161241A (en) | 2018-03-22 | 2024-09-05 | Multichannel Subband Spatial Processing for Loudspeakers |
Applications Claiming Priority (2)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| US15/933,207 US10764704B2 (en) | 2018-03-22 | 2018-03-22 | Multi-channel subband spatial processing for loudspeakers |
Publications (1)
| Publication Number | Publication Date |
|---|---|
| WO2019183271A1 (en) | 2019-09-26 |
Family
ID=67983865
Family Applications (1)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| PCT/US2019/023243 Ceased WO2019183271A1 (en) | 2018-03-22 | 2019-03-20 | Multi-channel subband spatial processing for loudspeakers |
Country Status (7)
| Country | Link |
|---|---|
| US (1) | US10764704B2 (en) |
| EP (1) | EP3769541A4 (en) |
| JP (3) | JP7323544B2 (en) |
| KR (1) | KR102195586B1 (en) |
| CN (1) | CN111869234B (en) |
| TW (1) | TWI744615B (en) |
| WO (1) | WO2019183271A1 (en) |
Cited By (1)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| WO2021071608A1 (en) | 2019-10-10 | 2021-04-15 | Boomcloud 360, Inc | Multi-channel crosstalk processing |
Families Citing this family (6)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| EP3900394A1 (en) * | 2018-12-21 | 2021-10-27 | FRAUNHOFER-GESELLSCHAFT zur Förderung der angewandten Forschung e.V. | Sound reproduction/simulation system and method for simulating a sound reproduction |
| CN113366865B (en) * | 2019-02-13 | 2023-03-21 | 杜比实验室特许公司 | Adaptive loudness normalization for audio object clustering |
| CN112351379B (en) * | 2020-10-28 | 2021-07-30 | 歌尔光学科技有限公司 | Control method of audio component and smart head mounted device |
| GB2600943A (en) * | 2020-11-11 | 2022-05-18 | Sony Interactive Entertainment Inc | Audio personalisation method and system |
| BE1029638B1 (en) * | 2021-07-30 | 2023-02-27 | Areal | Method for processing an audio signal |
| WO2023039096A1 (en) | 2021-09-09 | 2023-03-16 | Dolby Laboratories Licensing Corporation | Systems and methods for headphone rendering mode-preserving spatial coding |
Citations (5)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| US20070223708A1 (en) * | 2006-03-24 | 2007-09-27 | Lars Villemoes | Generation of spatial downmixes from parametric representations of multi channel signals |
| US20090086982A1 (en) * | 2007-09-28 | 2009-04-02 | Qualcomm Incorporated | Crosstalk cancellation for closely spaced speakers |
| US20160249151A1 (en) | 2013-10-30 | 2016-08-25 | Huawei Technologies Co., Ltd. | Method and mobile device for processing an audio signal |
| US20170208411A1 (en) * | 2016-01-18 | 2017-07-20 | Boomcloud 360, Inc. | Subband spatial and crosstalk cancellation for audio reproduction |
| US20170230777A1 (en) * | 2016-01-19 | 2017-08-10 | Boomcloud 360, Inc. | Audio enhancement for head-mounted speakers |
Family Cites Families (59)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| DE2244162C3 (en) | 1972-09-08 | 1981-02-26 | Eugen Beyer Elektrotechnische Fabrik, 7100 Heilbronn | "system |
| JP2993418B2 (en) * | 1996-01-19 | 1999-12-20 | ヤマハ株式会社 | Sound field effect device |
| GB9622773D0 (en) | 1996-11-01 | 1997-01-08 | Central Research Lab Ltd | Stereo sound expander |
| JP3368836B2 (en) | 1998-07-31 | 2003-01-20 | オンキヨー株式会社 | Acoustic signal processing circuit and method |
| JP2002191099A (en) | 2000-09-26 | 2002-07-05 | Matsushita Electric Ind Co Ltd | Signal processing device |
| FI113147B (en) | 2000-09-29 | 2004-02-27 | Nokia Corp | Method and signal processing apparatus for transforming stereo signals for headphone listening |
| JP4735920B2 (en) * | 2001-09-18 | 2011-07-27 | ソニー株式会社 | Sound processor |
| TWI230024B (en) * | 2001-12-18 | 2005-03-21 | Dolby Lab Licensing Corp | Method and audio apparatus for improving spatial perception of multiple sound channels when reproduced by two loudspeakers |
| EP1532734A4 (en) | 2002-06-05 | 2008-10-01 | Sonic Focus Inc | Acoustical virtual reality engine and advanced techniques for enhancing delivered sound |
| FI118370B (en) | 2002-11-22 | 2007-10-15 | Nokia Corp | Equalization of output from a stereo expansion network |
| JP4521549B2 (en) | 2003-04-25 | 2010-08-11 | 財団法人くまもとテクノ産業財団 | A method for separating a plurality of sound sources in the vertical and horizontal directions, and a system therefor |
| US7949141B2 (en) * | 2003-11-12 | 2011-05-24 | Dolby Laboratories Licensing Corporation | Processing audio signals with head related transfer function filters and a reverberator |
| US20050265558A1 (en) | 2004-05-17 | 2005-12-01 | Waves Audio Ltd. | Method and circuit for enhancement of stereo audio reproduction |
| US7634092B2 (en) | 2004-10-14 | 2009-12-15 | Dolby Laboratories Licensing Corporation | Head related transfer functions for panned stereo audio content |
| GB2419265B (en) | 2004-10-18 | 2009-03-11 | Wolfson Ltd | Improved audio processing |
| KR100636248B1 (en) | 2005-09-26 | 2006-10-19 | 삼성전자주식회사 | Vocal Removal Device and Method |
| US8811627B2 (en) | 2005-10-26 | 2014-08-19 | Nec Corporation | Echo suppressing method and apparatus |
| JP4940671B2 (en) * | 2006-01-26 | 2012-05-30 | ソニー株式会社 | Audio signal processing apparatus, audio signal processing method, and audio signal processing program |
| KR100754220B1 (en) | 2006-03-07 | 2007-09-03 | 삼성전자주식회사 | Binaural decoder for MPE surround and its decoding method |
| US8045719B2 (en) | 2006-03-13 | 2011-10-25 | Dolby Laboratories Licensing Corporation | Rendering center channel audio |
| US8619998B2 (en) | 2006-08-07 | 2013-12-31 | Creative Technology Ltd | Spatial audio enhancement processing method and apparatus |
| JP4841324B2 (en) | 2006-06-14 | 2011-12-21 | アルパイン株式会社 | Surround generator |
| WO2008035227A2 (en) | 2006-09-14 | 2008-03-27 | Lg Electronics Inc. | Dialogue enhancement techniques |
| JP2008228225A (en) * | 2007-03-15 | 2008-09-25 | Victor Co Of Japan Ltd | Sound signal processing equipment |
| US8612237B2 (en) | 2007-04-04 | 2013-12-17 | Apple Inc. | Method and apparatus for determining audio spatial quality |
| US8705748B2 (en) | 2007-05-04 | 2014-04-22 | Creative Technology Ltd | Method for spatially processing multichannel signals, processing module, and virtual surround-sound systems |
| WO2009022463A1 (en) | 2007-08-13 | 2009-02-19 | Mitsubishi Electric Corporation | Audio device |
| CN101884065B (en) | 2007-10-03 | 2013-07-10 | 创新科技有限公司 | Spatial audio analysis and synthesis for binaural reproduction and format conversion |
| JP4655098B2 (en) | 2008-03-05 | 2011-03-23 | ヤマハ株式会社 | Audio signal output device, audio signal output method and program |
| US8295498B2 (en) | 2008-04-16 | 2012-10-23 | Telefonaktiebolaget Lm Ericsson (Publ) | Apparatus and method for producing 3D audio in systems with closely spaced speakers |
| US9445213B2 (en) | 2008-06-10 | 2016-09-13 | Qualcomm Incorporated | Systems and methods for providing surround sound using speakers and headphones |
| US9247369B2 (en) | 2008-10-06 | 2016-01-26 | Creative Technology Ltd | Method for enlarging a location with optimal three-dimensional audio perception |
| UA101542C2 (en) * | 2008-12-15 | 2013-04-10 | Долби Лабораторис Лайсензин Корпорейшн | Surround sound virtualizer and method with dynamic range compression |
| US8000485B2 (en) * | 2009-06-01 | 2011-08-16 | Dts, Inc. | Virtual audio processing for loudspeaker or headphone playback |
| WO2011005479A2 (en) | 2009-06-22 | 2011-01-13 | SoundBeam LLC | Optically coupled bone conduction systems and methods |
| JP2011101284A (en) * | 2009-11-09 | 2011-05-19 | Canon Inc | Sound signal processing apparatus and method |
| JP5850216B2 (en) | 2010-04-13 | 2016-02-03 | ソニー株式会社 | Signal processing apparatus and method, encoding apparatus and method, decoding apparatus and method, and program |
| US9107021B2 (en) | 2010-04-30 | 2015-08-11 | Microsoft Technology Licensing, Llc | Audio spatialization using reflective room model |
| US20110288860A1 (en) | 2010-05-20 | 2011-11-24 | Qualcomm Incorporated | Systems, methods, apparatus, and computer-readable media for processing of speech signals using head-mounted microphone pair |
| CN102907120B (en) | 2010-06-02 | 2016-05-25 | 皇家飞利浦电子股份有限公司 | For the system and method for acoustic processing |
| CN103222187B (en) | 2010-09-03 | 2016-06-15 | 普林斯顿大学托管会 | Spectrally uncolored optimal crosstalk cancellation for audio through loudspeakers |
| US8660271B2 (en) | 2010-10-20 | 2014-02-25 | Dts Llc | Stereo image widening system |
| KR101785379B1 (en) | 2010-12-31 | 2017-10-16 | 삼성전자주식회사 | Method and apparatus for controlling distribution of spatial sound energy |
| JP5955862B2 (en) | 2011-01-04 | 2016-07-20 | ディーティーエス・エルエルシーDts Llc | Immersive audio rendering system |
| JP2013013042A (en) | 2011-06-02 | 2013-01-17 | Denso Corp | Three-dimensional sound apparatus |
| JP5772356B2 (en) | 2011-08-02 | 2015-09-02 | ヤマハ株式会社 | Acoustic characteristic control device and electronic musical instrument |
| EP2560161A1 (en) | 2011-08-17 | 2013-02-20 | Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. | Optimal mixing matrices and usage of decorrelators in spatial audio processing |
| SG11201407255XA (en) * | 2012-05-29 | 2014-12-30 | Creative Tech Ltd | Stereo widening over arbitrarily-configured loudspeakers |
| US9351073B1 (en) | 2012-06-20 | 2016-05-24 | Amazon Technologies, Inc. | Enhanced stereo playback |
| CN102737647A (en) | 2012-07-23 | 2012-10-17 | 武汉大学 | Encoding and decoding method and encoding and decoding device for enhancing dual-track voice frequency and tone quality |
| US20150036826A1 (en) | 2013-05-08 | 2015-02-05 | Max Sound Corporation | Stereo expander method |
| US9338570B2 (en) | 2013-10-07 | 2016-05-10 | Nuvoton Technology Corporation | Method and apparatus for an integrated headset switch with reduced crosstalk noise |
| TW201532035A (en) | 2014-02-05 | 2015-08-16 | Dolby Int Ab | Prediction-based FM stereo radio noise reduction |
| CN103928030B (en) | 2014-04-30 | 2017-03-15 | 武汉大学 | Based on the scalable audio coding system and method that subband spatial concern is estimated |
| RU2685041C2 (en) * | 2015-02-18 | 2019-04-16 | Хуавэй Текнолоджиз Ко., Лтд. | Device of audio signal processing and method of audio signal filtering |
| CN106303821A (en) | 2015-06-12 | 2017-01-04 | 青岛海信电器股份有限公司 | Cross-talk cancellation method and system |
| KR102580502B1 (en) | 2016-11-29 | 2023-09-21 | 삼성전자주식회사 | Electronic apparatus and the control method thereof |
| US10623883B2 (en) | 2017-04-26 | 2020-04-14 | Hewlett-Packard Development Company, L.P. | Matrix decomposition of audio signal processing filters for spatial rendering |
| US10547927B1 (en) | 2018-07-27 | 2020-01-28 | Mimi Hearing Technologies GmbH | Systems and methods for processing an audio signal for replay on stereo and multi-channel audio devices |
2018
- 2018-03-22: US application US15/933,207, published as US10764704B2 (en), status: Active
2019
- 2019-03-20: EP application EP19771968.5A, published as EP3769541A4 (en), status: Pending
- 2019-03-20: JP application JP2020550867A, published as JP7323544B2 (en), status: Active
- 2019-03-20: KR application KR1020207030276A, published as KR102195586B1 (en), status: Active
- 2019-03-20: WO application PCT/US2019/023243, published as WO2019183271A1 (en), status: Ceased
- 2019-03-20: CN application CN201980020001.3A, published as CN111869234B (en), status: Active
- 2019-03-22: TW application TW108109941A, published as TWI744615B (en), status: active
2022
- 2022-09-12: JP application JP2022144496A, published as JP7553522B2 (en), status: Active
2024
- 2024-09-05: JP application JP2024153106A, published as JP2024161241A (en), status: Pending
Non-Patent Citations (1)
| Title |
|---|
| See also references of EP3769541A4 |
Cited By (8)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| WO2021071608A1 (en) | 2019-10-10 | 2021-04-15 | Boomcloud 360, Inc | Multi-channel crosstalk processing |
| KR20220078687A (en) * | 2019-10-10 | 2022-06-10 | 붐클라우드 360 인코포레이티드 | Multi-channel crosstalk processing |
| CN114731482A (en) * | 2019-10-10 | 2022-07-08 | 博姆云360公司 | Multi-channel crosstalk processing |
| JP2022551871A (en) * | 2019-10-10 | 2022-12-14 | ブームクラウド 360 インコーポレイテッド | Multi-channel crosstalk processing |
| EP4042720A4 (en) * | 2019-10-10 | 2023-11-01 | Boomcloud 360 Inc. | MULTI-CHANNEL CROSSTALK PROCESSING |
| JP7531584B2 (en) | 2019-10-10 | 2024-08-09 | ブームクラウド 360 インコーポレイテッド | Multi-Channel Crosstalk Processing |
| KR102712921B1 (en) | 2019-10-10 | 2024-10-04 | 붐클라우드 360 인코포레이티드 | Multi-channel crosstalk processing |
| CN114731482B (en) * | 2019-10-10 | 2025-05-23 | 博姆云360公司 | Multichannel crosstalk processing |
Also Published As
| Publication number | Publication date |
|---|---|
| JP2021510992A (en) | 2021-04-30 |
| EP3769541A4 (en) | 2021-12-22 |
| JP2024161241A (en) | 2024-11-15 |
| US10764704B2 (en) | 2020-09-01 |
| US20190297447A1 (en) | 2019-09-26 |
| TW201941622A (en) | 2019-10-16 |
| KR102195586B1 (en) | 2020-12-28 |
| EP3769541A1 (en) | 2021-01-27 |
| CN111869234A (en) | 2020-10-30 |
| TWI744615B (en) | 2021-11-01 |
| JP7553522B2 (en) | 2024-09-18 |
| JP2022168213A (en) | 2022-11-04 |
| CN111869234B (en) | 2022-05-10 |
| JP7323544B2 (en) | 2023-08-08 |
| KR20200126429A (en) | 2020-11-06 |
Similar Documents
| Publication | Title |
|---|---|
| JP7553522B2 (en) | Multichannel Subband Spatial Processing for Loudspeakers |
| JP7811628B2 (en) | Crosstalk Processing b-Chain |
| EP3718318A1 (en) | Enhanced virtual stereo reproduction for unmatched transaural loudspeaker systems |
| US11284213B2 (en) | Multi-channel crosstalk processing |
| EP3718313A1 (en) | Crosstalk cancellation for opposite-facing transaural loudspeaker systems |
Legal Events
| Code | Title | Description |
|---|---|---|
| 121 | EP: the EPO has been informed by WIPO that EP was designated in this application | Ref document number: 19771968; Country of ref document: EP; Kind code of ref document: A1 |
| ENP | Entry into the national phase | Ref document number: 2020550867; Country of ref document: JP; Kind code of ref document: A |
| NENP | Non-entry into the national phase | Ref country code: DE |
| ENP | Entry into the national phase | Ref document number: 20207030276; Country of ref document: KR; Kind code of ref document: A |
| WWE | WIPO information: entry into national phase | Ref document number: 2019771968; Country of ref document: EP |
| ENP | Entry into the national phase | Ref document number: 2019771968; Country of ref document: EP; Effective date: 20201022 |