EP3820161A1 - Audiosignalverarbeitungsvorrichtung und -verfahren, impulsantworterzeugungsvorrichtung und -verfahren und programm - Google Patents
Audiosignalverarbeitungsvorrichtung und -verfahren, impulsantworterzeugungsvorrichtung und -verfahren und programm Download PDFInfo
- Publication number
- EP3820161A1 EP3820161A1 EP19831112.8A EP19831112A EP3820161A1 EP 3820161 A1 EP3820161 A1 EP 3820161A1 EP 19831112 A EP19831112 A EP 19831112A EP 3820161 A1 EP3820161 A1 EP 3820161A1
- Authority
- EP
- European Patent Office
- Prior art keywords
- impulse response
- characteristic
- phase characteristic
- fft
- audio signal
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Ceased
Links
Images
Classifications
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04R—LOUDSPEAKERS, MICROPHONES, GRAMOPHONE PICK-UPS OR LIKE ACOUSTIC ELECTROMECHANICAL TRANSDUCERS; ELECTRIC HEARING AIDS; PUBLIC ADDRESS SYSTEMS
- H04R3/00—Circuits for transducers
- H04R3/04—Circuits for transducers for correcting frequency response
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04R—LOUDSPEAKERS, MICROPHONES, GRAMOPHONE PICK-UPS OR LIKE ACOUSTIC ELECTROMECHANICAL TRANSDUCERS; ELECTRIC HEARING AIDS; PUBLIC ADDRESS SYSTEMS
- H04R5/00—Stereophonic arrangements
- H04R5/033—Headphones for stereophonic communication
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04S—STEREOPHONIC SYSTEMS
- H04S1/00—Two-channel systems
- H04S1/002—Non-adaptive circuits, e.g. manually adjustable or static, for enhancing the sound image or the spatial distribution
- H04S1/005—For headphones
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04S—STEREOPHONIC SYSTEMS
- H04S2420/00—Techniques used stereophonic systems covered by H04S but not provided for in its groups
- H04S2420/01—Enhancing the perception of the sound image or of the spatial distribution using head related transfer functions [HRTF's] or equivalents thereof, e.g. interaural time difference [ITD] or interaural level difference [ILD]
Definitions
- the present technology relates to an audio signal processing device and method, an impulse response generation device and method, and a program, and particularly to an audio signal processing device and method, an impulse response generation device and method, and a program that enable a desired phase characteristic to be obtained.
- Patent Document 1 a technology of changing an amplitude characteristic of an audio signal so as to bring about a bass enhancement effect by combining a plurality of filters has been proposed (see, for example, Patent Document 1).
- An audio signal processing method or program of the first aspect of the present technology includes steps of acquiring an impulse response having a flat or substantially flat amplitude characteristic and a predetermined phase characteristic, and convolving the impulse response into an input audio signal.
- an impulse response having a flat or substantially flat amplitude characteristic and a predetermined phase characteristic is acquired, and the impulse response is convolved into an input audio signal.
- An impulse response generation method or program of the second aspect of the present technology includes a step of generating a target characteristic impulse response having a flat or substantially flat amplitude characteristic and a predetermined phase characteristic.
- the amplitude characteristic being flat or substantially flat means that, for example, a value of amplitude (gain) at each frequency of the amplitude characteristic is 1 or substantially 1.
- An amplitude characteristic and a phase characteristic can be obtained by such an FFT, the amplitude (gain) value at each frequency of the amplitude characteristic is set to 1 so that the amplitude characteristic becomes flat, and inverse fast Fourier transform (IFFT) is performed on the basis of the flat amplitude characteristic and the phase characteristic obtained by the FFT. Then, the subsequent stage of the impulse response obtained by the IFFT is fade-processed with an appropriate time constant to obtain a desired impulse response.
- IFFT inverse fast Fourier transform
- the impulse response obtained as described above functions as an infinite impulse response (IIR) filter that changes only the phase characteristic while maintaining the amplitude characteristic. Therefore, only the phase characteristic can be adjusted by convolving such an impulse response into the audio signal.
- IIR infinite impulse response
- the impulse response to be obtained by the reconstruction is an impulse response in which only the phase characteristic shown in Fig. 1 is added to the audio signal without changing the amplitude characteristic, that is, an impulse response in which only a desired phase characteristic is added without changing the amplitude characteristic.
- an impulse response that functions as a filter that adds a desired phase characteristic without changing the amplitude characteristic will be also referred to as a target phase characteristic impulse response in particular.
- 0-filling processing of adding 0 data which is a sample having a sample value of 0, to the rear side (end) of the impulse response in the time direction is performed so that the total length (number of samples) of the impulse response is 4096 samples.
- the IFFT is performed on the frequency characteristic including the flat amplitude characteristic obtained by the amplitude adjustment and the phase characteristic obtained by the FFT.
- the impulse response is reconstructed by such fade processing, and as a result, the target phase characteristic impulse response shown by an arrow Q25 is obtained.
- an impulse response having a length of 4096 samples is obtained as the target phase characteristic impulse response.
- the target phase characteristic impulse response shown by the arrow Q25 should ideally have a flat or substantially flat amplitude characteristic and the same phase characteristic as that of the original HPF.
- the frequency characteristic of the target phase characteristic impulse response shown by the arrow Q25 is as shown in Fig. 3 .
- a curve L11 shows the amplitude characteristic of the target phase characteristic impulse response shown by the arrow Q25 in Fig. 2
- a curve L12 shows the amplitude characteristic of the original HPF shown by the arrow Q21 in Fig. 2 . From the curve L11, it can be seen that, in the amplitude characteristic of the target phase characteristic impulse response, the gain of the low frequency portion, that is, the portion shown by an arrow W11 is reduced, although not as much as the original HPF, and the amplitude characteristic is not flat.
- the curve L13 is substantially the same as the curve L14, and it can be seen that the target characteristic is obtained with respect to the phase characteristic in the target phase characteristic impulse response.
- the impulse response in general, in a case of being opposite from the target phase characteristic impulse response, that is, a case where the phase characteristic is flat (a straight line) and the amplitude (gain) changes, it is known that the impulse response basically has a symmetrical shape.
- the impulse response of the original HPF is subjected to 0-filling processing at least on the front side (past side) in the time direction so that the impulse response has a substantially symmetrical shape, and then the FFT and the IFFT are performed to generate a target phase characteristic impulse response.
- the impulse response is reconstructed and the target phase characteristic impulse response is generated.
- Fig. 4 the portion shown by an arrow Q41 shows the impulse response of the HPF shown by the arrow Q13 in Fig. 1 , and this impulse response converges in approximately 1024 samples.
- the impulse response of the HPF shown by the arrow Q41 is subjected to 0-filling processing as shown by an arrow Q42.
- 0 data is added not only to the rear side (end side) in the time direction of the impulse response but also to the front side (head side) according to the length of the impulse response.
- 0 data is added to the front side of the impulse response in the time direction by the amount of 8192 samples, and 0 data is also added to the rear side of the impulse response in the time direction so that the impulse length itself becomes 8192 samples. Due to such 0-filling processing, the impulse response shown by the arrow Q42 has a substantially symmetrical shape, and the total length is 16384 samples.
- the target phase characteristic impulse response has a flat amplitude characteristic, so the amplitude (gain) value of each frequency in the amplitude characteristic obtained by the FFT is adjusted to "1" so that a flat amplitude characteristic is obtained.
- phase characteristic obtained by the FFT should be the target phase characteristic, no particular phase adjustment is performed on the phase characteristic obtained by the FFT.
- the IFFT is performed on the frequency characteristic including the flat amplitude characteristic obtained by the amplitude adjustment and the phase characteristic obtained by the FFT, and the impulse response obtained as a result is subjected to the fade processing in a similar manner to in the case of the arrow Q24 in Fig. 2 .
- the impulse response obtained by the fade processing is used as the target phase characteristic impulse response.
- the target phase characteristic impulse response shown by an arrow Q45 is obtained, and the target phase characteristic impulse response has a shape close to symmetry. Furthermore, the length of the target phase characteristic impulse response is 16384 samples.
- the frequency characteristic of the target phase characteristic impulse response shown by the arrow Q45 thus obtained is as shown in Fig. 5 .
- Fig. 5 the portion shown by an arrow Q51 shows the amplitude characteristic, and the portion shown by an arrow Q52 shows the phase characteristic.
- the vertical axis indicates the gain (amplitude)
- the horizontal axis indicates the frequency
- the vertical axis indicates the phase
- the horizontal axis indicates the frequency.
- a curve L31 shows the amplitude characteristic of the target phase characteristic impulse response shown by the arrow Q45 in Fig. 4
- a curve L32 shows the amplitude characteristic of the original HPF shown by the arrow Q41 in Fig. 4 .
- a curve L33 shows the phase characteristic of the target phase characteristic impulse response shown by the arrow Q45 in Fig. 4
- a curve L34 shows the phase characteristic of the original HPF shown by the arrow Q41 in Fig. 4 , that is, the target phase characteristic
- a curve L35 shows the phase characteristic of a simple impulse with only 8192 samples delayed, that is, the linear phase.
- the curve L33 and the curve L34 almost overlap each other, and it can be seen that a characteristic that is substantially equivalent to the target characteristic as the phase characteristic of the target phase characteristic impulse response is obtained.
- the curve L35 is shown for comparison. Since the curve L35 shows the phase characteristic of a simple impulse that is a linear phase, if the difference between the curve L33 and the curve L35 at each frequency is the phase value at each frequency of the phase characteristic shown by the arrow Q12 in Fig. 1 , the target characteristic is obtained as the phase characteristic of the target phase characteristic impulse response. Note that the phase characteristic of the original HPF shown by the arrow Q41 in Fig. 4 is the same as the phase characteristic shown by the arrow Q12 in Fig. 1
- the method of generating the target phase characteristic impulse response described with reference to Fig. 4 as described above is the above-mentioned method A1.
- the length of the impulse response that has been subjected to the 0-filling processing is infinite samples, the error between the frequency characteristic of the target phase characteristic impulse response and the target characteristic becomes infinitely close to zero.
- the processing amount may be reduced even if an error from the target characteristic is allowed to some extent. For example, if the length of the target phase characteristic impulse response is shortened, the processing amount is reduced both at the time of generation and at the time of convolution after generation.
- the number of pieces of 0 data added to the impulse response in the 0-filling processing may be reduced so that the processing amount is reduced and the target phase characteristic impulse response having a sufficient characteristic can be obtained.
- Fig. 6 the portion shown by an arrow Q61 shows the impulse response of the HPF shown by the arrow Q13 in Fig. 1 , and this impulse response converges in approximately 1024 samples.
- the impulse response of the HPF shown by the arrow Q61 is subjected to 0-filling processing as shown by an arrow Q62.
- 0 data is added to the front side of the impulse response in the time direction by the amount of 384 samples, and 0 data is also added to the rear side of the impulse response in the time direction so that the total length of the impulse response becomes 4096 samples.
- the number of pieces of 0 data added to the front side in the time direction in the impulse response is small, so that the impulse response after the 0-filling processing shown by the arrow Q62 does not have a symmetrical shape.
- the value of the amplitude (gain) of each frequency in the amplitude characteristic obtained by the FFT is adjusted to "1" to obtain a flat amplitude characteristic, and no particular phase adjustment is performed on the phase characteristic obtained by the FFT.
- the IFFT is performed on the frequency characteristic including the flat amplitude characteristic obtained by the amplitude adjustment and the phase characteristic obtained by the FFT, and the impulse response obtained as a result is subjected to the fade processing in a similar manner to in the case of the arrow Q24 in Fig. 2 .
- the impulse response obtained by the fade processing is used as the target phase characteristic impulse response.
- the target phase characteristic impulse response shown by an arrow Q65 is obtained, and the length of the target phase characteristic impulse response is 4096 samples.
- the target phase characteristic impulse response shown by the arrow Q65 does not have a symmetrical shape.
- the frequency characteristic of the target phase characteristic impulse response shown by the arrow Q65 thus obtained is as shown in Fig. 7 .
- Fig. 7 the portion shown by an arrow Q71 shows the amplitude characteristic, and the portion shown by an arrow Q72 shows the phase characteristic.
- the vertical axis indicates the gain (amplitude)
- the horizontal axis indicates the frequency
- the vertical axis indicates the phase
- the horizontal axis indicates the frequency.
- a curve L51 shows the amplitude characteristic of the target phase characteristic impulse response shown by the arrow Q65 in Fig. 6
- a curve L52 shows the amplitude characteristic of the original HPF shown by the arrow Q61 in Fig. 6 .
- the amplitude characteristic of the target phase characteristic impulse response shown in the curve L51 is within ⁇ 1 dB of the amplitude (gain) value at each frequency, and it can be seen that a substantially flat characteristic is obtained. That is, it can be seen that a sufficient amplitude characteristic is obtained.
- the amplitude characteristic shown in the curve L51 here has a larger error from the target characteristic than the amplitude characteristic shown in the curve L31 in Fig. 5 , but it can be seen that the error is within a sufficiently small range.
- a curve L53 shows the phase characteristic of the target phase characteristic impulse response shown by the arrow Q65 in Fig. 6
- a curve L54 shows the phase characteristic of the original HPF shown by the arrow Q61 in Fig. 6 , that is, the target phase characteristic
- a curve L55 shows the phase characteristic of the delayed simple impulse as similar to the curve L35 of Fig. 5 .
- the number of pieces of 0 data to be added to the front side of the impulse response in the time direction has a trade-off relationship with the tolerance with the target characteristic and the processing amount, the number of pieces of 0 data to be added is only required to be adjusted as necessary.
- the 0-filling processing may be performed on a simple impulse as shown in the curve L35 in Fig. 5 to generate the target phase characteristic impulse response, for example.
- Such a method of generating the target phase characteristic impulse response is the above-mentioned method A2.
- 0-filling processing of adding 0 data to the front side of the simple impulse in the time direction is performed, and the FFT is performed on the simple impulse that has been subjected to the 0-filling processing.
- phase characteristic of the frequency characteristic obtained by the FFT with respect to the simple impulse after 0-filling processing will be also referred to as the phase characteristic of the simple impulse in particular.
- the 0-filling processing is not performed on the impulse response having the target phase characteristic, and the FFT is performed on the impulse response as it is.
- the phase characteristic of the frequency characteristic obtained by the FFT on the impulse response having the target phase characteristic will be also referred to as the target phase characteristic in particular.
- phase characteristic of the simple impulse and the target phase characteristic are obtained by the FFT as described above, the phase characteristic of the simple impulse and the target phase characteristic are added, and the frequency characteristic including the phase characteristic obtained by the addition and the flat amplitude characteristic is subjected to the IFFT.
- the target phase characteristic impulse response thus obtained is an impulse response having a flat or substantially flat amplitude characteristic and a target phase characteristic.
- the phase characteristic obtained by performing the FFT on the impulse response of a predetermined HPF without performing the 0-filling processing is subtracted from the phase characteristic of the simple impulse, and the IFFT is performed on the frequency characteristic including the phase characteristic obtained as a result and the flat amplitude characteristic. Then, the fade processing is performed on the impulse response obtained by the IFFT, and the impulse response obtained as a result is used as the target phase characteristic impulse response.
- the 0-filling processing is performed on the impulse response having the target phase characteristic, and then the FFT, the IFFT, and the fade processing are performed to generate the target phase characteristic impulse response.
- the simple impulse is subjected to 0-filling processing, and then the FFT, the IFFT, and the fade processing are performed to generate the target phase characteristic impulse response.
- the phase characteristic of the headphones or speaker on the reproduction side can be canceled for the audio signal of the content.
- the audio signal in which the phase characteristic of the headphones or speaker on the reproduction side is canceled is referred to as a corrected audio signal.
- the listener reproduces the sound of content with headphones on the reproduction side
- the head transmission characteristic that is, the head related transfer function (HRTF)
- HRTF head related transfer function
- the HRTF is a function indicating the sound transmission characteristic from the sound source to the listener's ear, and more specifically, to the vicinity of the listener's eardrum or the entrance of the ear canal.
- An impulse response generation device 11 shown in Fig. 8 has a 0-filling processing part 21, an FFT processing part 22, an IFFT processing part 23, and a fade processing part 24.
- step S11 0-filling processing is performed in which 0 data is added to the rear side and the front side in the time direction in the input impulse response.
- 0 data is added at least to the front side in the time direction in the input impulse response.
- the impulse response generation device 11 performs 0-filling processing of adding 0 data to the front side in the time direction at least in the input impulse response, and the FFT, the IFFT, and the fade processing is performed on the input impulse response that has been subjected to the 0-filling processing, so that the target phase characteristic impulse response is generated.
- the FFT processing part 61 is supplied with an impulse response having the target phase characteristic used for generating the target phase characteristic impulse response, that is the input impulse response.
- step S42 the 0-filling processing part 62 performs 0-filling processing on the supplied simple impulse and supplies the result to the FFT processing part 63.
- 0 data is added to the front side of the simple impulse in the time direction, and the simple impulse is appropriately delayed.
- step S44 the operation processing part 64 performs operation processing based on the phase characteristic supplied from the FFT processing part 61 and the phase characteristic supplied from the FFT processing part 63, and supplies the phase characteristic obtained as a result to the IFFT processing part 23.
- the reproduction device used for reproducing the content is configured as shown in Fig. 13 , for example.
- a reproduction device 121 includes at least a portable player, a smartphone, a personal computer, or the like capable of controlling reproduction of audio content, and headphones 122 are connected to the reproduction device 121.
- the audio signal of the content obtained by mastering by the creator M11 is supplied to the speaker phase characteristic convolution part 132.
- the acquisition part 131 acquires and holds the target phase characteristic impulse response from an external device such as the impulse response generation device 11 or the impulse response generation device 51 at an arbitrary timing. Furthermore, the acquisition part 131 supplies the held target phase characteristic impulse response to the speaker phase characteristic convolution part 132.
- the target phase characteristic impulse response acquired by the acquisition part 131 is generated by the impulse response generation device 11 or the impulse response generation device 51 using the input impulse response having the phase characteristic of the speaker 91 used for mastering. That is, the target phase characteristic impulse response is an impulse response having the same phase characteristic as the phase characteristic of the speaker 91.
- the target phase characteristic impulse response may not be acquired by the acquisition part 131 at an arbitrary timing, and may be held in advance by the acquisition part 131.
- the speaker phase characteristic convolution part 132 convolves the speaker characteristic impulse response supplied from the acquisition part 131 into the supplied audio signal, and supplies the audio signal obtained as a result to the reproduction control part 133.
- the reproduction control part 133 supplies the audio signal supplied from the speaker phase characteristic convolution part 132 to the headphones 122, and reproduces the sound of content. In other words, the reproduction control part 133 controls the reproduction of the sound of content on the headphones 122.
- the headphones 122 reproduce the sound of content on the basis of the audio signal supplied from the reproduction control part 133.
- the reproduction device 121 is not provided with the headphones 122 here, the headphones 122 may be provided in the reproduction device 121, or the acquisition part 131 to the reproduction control part 133 may be provided inside the headphones 122.
- the operation of the reproduction device 121 will be described. That is, the reproduction processing by the reproduction device 121 will be described below with reference to the flowchart of Fig. 14 . Note that at the timing when this reproduction processing is started, the speaker characteristic impulse response has already been acquired by the acquisition part 131.
- step S71 the speaker phase characteristic convolution part 132 convolves the speaker characteristic impulse response supplied from the acquisition part 131 into the supplied audio signal, and supplies the audio signal obtained as a result to the reproduction control part 133.
- phase characteristic of the speaker characteristic impulse response that is, the phase characteristic of the speaker 91 can be added to the sound of content based on the audio signal.
- step S72 the reproduction control part 133 supplies the audio signal supplied from the speaker phase characteristic convolution part 132 to the headphones 122, and reproduces the sound of content, and the reproduction processing ends.
- the sound of content reproduced by the headphones 122 has the same characteristic as the phase characteristic of the speaker 91, the listener listening to the sound of content hear the sound with almost the same sound quality as the sound of content that the creator M11 was listening to in the studio. Moreover, since, with the speaker characteristic impulse response, only the desired phase characteristic can be added to the sound of content without changing the amplitude characteristic, the gain of the sound of content does not change.
- the reproduction device 121 reproduces the sound of content after convolving the speaker characteristic impulse response into the audio signal of the content. As a result, even in a case where the sound of content is reproduced by the headphones 122, the phase characteristic of the speaker 91 used for mastering can be added to the sound of content. That is, a desired phase characteristic can be obtained.
- the same characteristic as the phase characteristic of the speaker 91 is added to the sound of content.
- the phase characteristic of the headphones 122 is also added to the sound.
- phase characteristic of the headphones 122 may be canceled (removed) to allow the listener to hear the sound closer to the sound of content that the creator M11 was listening to in the studio.
- the reproduction device is configured as shown in Fig. 15 , for example.
- the same reference numerals are given to the parts corresponding to the case in Fig. 13 , and the description thereof will be omitted as appropriate.
- Headphones 122 are connected to a reproduction device 161 shown in Fig. 15 . Furthermore, the reproduction device 161 has an acquisition part 131, a headphone inverse characteristic convolution part 171, a speaker phase characteristic convolution part 132, and a reproduction control part 133.
- the reproduction device 161 is configured such that the headphone inverse characteristic convolution part 171 is provided in the preceding stage of the speaker phase characteristic convolution part 132 in the reproduction device 121.
- the reproduction device 161 not only the above-mentioned speaker characteristic impulse response but also the target phase characteristic impulse response having the inverse characteristic of the phase characteristic of the headphones 122 is acquired by the acquisition part 131 from an external device such as the impulse response generation device 11 and the impulse response generation device 51 and is held.
- the target phase characteristic impulse response having the inverse characteristic of the phase characteristic of the headphones 122 will also be referred to as a headphone inverse characteristic impulse response.
- This headphone inverse characteristic impulse response is a target phase characteristic impulse response generated by the impulse response generation device 51 by, for example, using an input impulse response having the phase characteristic of the headphones 122, and performing subtraction as the operation processing in the operation processing part 64.
- the headphone inverse characteristic impulse response may not be acquired by the acquisition part 131, but may be held in advance by the acquisition part 131.
- the acquisition part 131 supplies the held headphone inverse characteristic impulse response to the headphone inverse characteristic convolution part 171.
- the headphone inverse characteristic convolution part 171 convolves the headphone inverse characteristic impulse response supplied from the acquisition part 131 into the audio signal of the supplied content, and supplies the audio signal obtained as a result to the speaker phase characteristic convolution part 132.
- step S101 the headphone inverse characteristic convolution part 171 convolves the headphone inverse characteristic impulse response supplied from the acquisition part 131 into the supplied audio signal of content, and supplies the audio signal obtained as a result to the speaker phase characteristic convolution part 132.
- the phase characteristic of the headphones 122 which is added when the sound of content is reproduced with the headphones 122, is canceled.
- the phase characteristic of the headphones 122 which is added when the sound of content is reproduced with the headphones 122, is canceled.
- the phase characteristic can be adjusted without changing the amplitude (gain) of the sound of content.
- step S102 and step S103 are performed thereafter to end the reproduction processing, and these pieces of processing are similar to those in step S71 and step S72 of Fig. 14 , and therefore, the description thereof will be omitted.
- the phase characteristic of the headphones 122 is first canceled with respect to the sound of content, and then the phase characteristic of the speaker 91, which is a characteristic to be added, is added.
- a target phase characteristic impulse response to which the inverse characteristic of the phase characteristic of the headphones 122 can be added and simultaneously, the phase characteristic of the speaker 91 can be added may be generated, and the target phase characteristic impulse response may be convolved into the audio signal of the content.
- the phase characteristic added to the sound of content can be freely changed. That is, for example, in the reproduction device 161, it is possible to select an arbitrary speaker 91 from a plurality of speakers 91 of different manufacturers or the like, and convolve the speaker characteristic impulse response having the phase characteristic of the selected speaker 91.
- the reproduction device 161 convolves the headphone inverse characteristic impulse response into the audio signal of content, further convolves the speaker characteristic impulse response into the audio signal, and then reproduces the sound of content.
- the phase characteristic added by the headphones 122 can be canceled and the phase characteristic of the speaker 91 used for mastering can be added to the sound of the content. That is, a desired phase characteristic can be obtained.
- the reproduction processing described with reference to Fig. 16 it is possible to hear sound closer to the sound of content that the creator M11 was listening to in the studio than in the case of reproduction processing described with reference to Fig. 14 .
- the reproduction device is configured, for example, as shown in Fig. 17 .
- the same reference numerals are given to the parts corresponding to the case in Fig. 15 , and the description thereof will be omitted as appropriate.
- Headphones 122 are connected to a reproduction device 201 shown in Fig. 17 . Furthermore, the reproduction device 201 has an acquisition part 131, a headphone inverse characteristic convolution part 171, a speaker phase characteristic convolution part 132, an HRTF convolution part 211, and a reproduction control part 133.
- the reproduction device 201 is configured such that the HRTF convolution part 211 is provided in the subsequent stage of the speaker phase characteristic convolution part 132 in the reproduction device 161.
- the HRTF is acquired from an external device by the acquisition part 131 and is held. Note that the HRTF may not be acquired by the acquisition part 131, but may be held in advance by the acquisition part 131.
- the acquisition part 131 supplies the held HRTF to the HRTF convolution part 211.
- the HRTF convolution part 211 convolves the HRTF supplied from the acquisition part 131 into the audio signal supplied from the speaker phase characteristic convolution part 132, and supplies the audio signal obtained as a result to the reproduction control part 133.
- reproduction device 121 shown in Fig. 13 may be provided with the HRTF convolution part 211.
- the operation of the reproduction device 201 will be described. That is, the reproduction processing by the reproduction device 201 will be described below with reference to the flowchart of Fig. 18 . Note that at the timing when this reproduction processing is started, the speaker characteristic impulse response, the headphone inverse characteristic impulse response and the HRTF have already been acquired by the acquisition part 131.
- step S131 and step S132 are performed, and these pieces of processing are similar to those in step S101 and step S102 of Fig. 16 , and therefore, the description thereof will be omitted.
- step S133 the HRTF convolution part 211 convolves the HRTF supplied from the acquisition part 131 into the audio signal supplied from the speaker phase characteristic convolution part 132, and supplies the audio signal obtained as a result to the reproduction control part 133.
- step S134 the reproduction control part 133 supplies the audio signal supplied from the HRTF convolution part 211 to the headphones 122, and reproduces the sound of the content, and the reproduction processing ends. Therefore, when reproducing the sound of content, the phase characteristic of the headphones 122 is canceled, and the phase characteristic of the speaker 91 and the sound transmission characteristic in the studio are added.
- the reproduction device 201 convolves the headphone inverse characteristic impulse response, the speaker characteristic impulse response, and the HRTF into the audio signal, and then reproduces the sound of content.
- the headphones 122 even in a case where the sound of content is reproduced by the headphones 122, it is possible to add the desired phase characteristic and the transmission characteristic in a desired listening environment such as a studio, and allow the listener to hear almost the same sound as the sound of content that the creator M11 was listening to in the studio.
- a generation part that generates a target phase characteristic impulse response may be provided inside the reproduction device 121, the reproduction device 161, and the reproduction device 201.
- the reproduction device 161 is configured as shown in Fig. 19 .
- the same reference numerals are given to the parts corresponding to the case in Fig. 15 , and the description thereof will be omitted as appropriate.
- the reproduction device 161 shown in Fig. 19 has a generation part 241, an acquisition part 131, a headphone inverse characteristic convolution part 171, a speaker phase characteristic convolution part 132, and a reproduction control part 133.
- the configuration of the reproduction device 161 shown in Fig. 19 is such that the reproduction device 161 shown in Fig. 15 is further provided with the generation part 241.
- the generation part 241 corresponds to the impulse response generation device 11 and the impulse response generation device 51. That is, the generation part 241 performs similar processing to the impulse response generation processing described with reference to Figs. 9 and 11 to generate the headphone inverse characteristic impulse response and the speaker characteristic impulse response, and supplies the result to the acquisition part 131.
- the phase characteristic of any speaker used for mastering in music creation especially the low-frequency phase characteristic can be added to the sound source while the amplitude characteristic remains flat. Therefore, even when the sound is reproduced using headphones, the equivalent effect to that obtained in a mastering studio can be obtained as a low-frequency sound quality effect.
- the impulse response of any general IIR filter that imitates the phase characteristic of the speaker is used as the above-mentioned input impulse response, it is possible to use the obtained speaker characteristic impulse response to add a low-frequency phase characteristic equivalent to that of a speaker without changing the amplitude characteristic.
- phase characteristic of the headphones particularly the inverse characteristic of the low-frequency phase characteristic
- the phase characteristic of the headphones is added by the headphone inverse characteristic impulse response
- the phase characteristic of the speaker particularly the low frequency characteristic
- the speaker characteristic impulse response an effect closer to the low frequency sound quality effect in the mastering studio can be obtained.
- an impulse response having the phase characteristic of the speaker may be used as the input impulse response.
- an impulse response such as an IIR type HPF that imitates the phase characteristic of the speaker may be used as the input impulse response.
- the low-frequency phase characteristic of the listening environment in the mastering studio can be simulated with headphones.
- the series of processing described above can be also performed by hardware or can be performed by software.
- a program constituting the software is installed in a computer.
- the computer includes a computer incorporated in dedicated hardware and a general-purpose personal computer capable of executing various functions by installing various programs, for example, and the like.
- Fig. 20 is a block diagram showing a configuration example of a hardware of a computer that executes the above-described series of processing by a program.
- An input and output interface 505 is further connected to the bus 504.
- An input part 506, an output part 507, a recording part 508, a communication part 509, and a drive 510 are connected to the input and output interface 505.
- the CPU 501 loads the program recorded in the recording part 508 into the RAM 503 via the input and output interface 505 and the bus 504, and executes the program, so that the above-described series of processing is performed.
- each step described in the above-described flowchart can be executed by one device or shared by a plurality of devices.
- a plurality of processes included in the one step can be executed by one device or shared and executed by a plurality of devices.
Landscapes
- Physics & Mathematics (AREA)
- Engineering & Computer Science (AREA)
- Acoustics & Sound (AREA)
- Signal Processing (AREA)
- Stereophonic System (AREA)
- Circuit For Audible Band Transducer (AREA)
Applications Claiming Priority (2)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| JP2018127664 | 2018-07-04 | ||
| PCT/JP2019/024440 WO2020008889A1 (ja) | 2018-07-04 | 2019-06-20 | オーディオ信号処理装置および方法、インパルス応答生成装置および方法、並びにプログラム |
Publications (2)
| Publication Number | Publication Date |
|---|---|
| EP3820161A1 true EP3820161A1 (de) | 2021-05-12 |
| EP3820161A4 EP3820161A4 (de) | 2021-11-24 |
Family
ID=69060578
Family Applications (1)
| Application Number | Title | Priority Date | Filing Date |
|---|---|---|---|
| EP19831112.8A Ceased EP3820161A4 (de) | 2018-07-04 | 2019-06-20 | Audiosignalverarbeitungsvorrichtung und -verfahren, impulsantworterzeugungsvorrichtung und -verfahren und programm |
Country Status (3)
| Country | Link |
|---|---|
| EP (1) | EP3820161A4 (de) |
| JP (1) | JP7359146B2 (de) |
| WO (1) | WO2020008889A1 (de) |
Families Citing this family (1)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| JP7447533B2 (ja) * | 2020-02-19 | 2024-03-12 | ヤマハ株式会社 | 音信号処理方法および音信号処理装置 |
Family Cites Families (8)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| JP2681349B2 (ja) * | 1986-08-08 | 1997-11-26 | ヤマハ株式会社 | スピーカ再生装置 |
| JP2002171589A (ja) | 2000-11-30 | 2002-06-14 | Sony Corp | オーディオ装置 |
| WO2008090897A1 (ja) * | 2007-01-22 | 2008-07-31 | Toa Corporation | フィルタ |
| CN102804808B (zh) * | 2009-06-30 | 2015-05-27 | 诺基亚公司 | 用于呈现空间音频的方法及装置 |
| US20120033829A1 (en) * | 2010-08-04 | 2012-02-09 | Lewis Ivan Lawayne | Audio phase corrector |
| JP2012054863A (ja) * | 2010-09-03 | 2012-03-15 | Mitsubishi Electric Corp | 音響再生装置 |
| JP6155698B2 (ja) * | 2013-02-28 | 2017-07-05 | 株式会社Jvcケンウッド | オーディオ信号処理装置、オーディオ信号処理方法、オーディオ信号処理プログラムおよびヘッドホン |
| US9992573B1 (en) * | 2013-10-29 | 2018-06-05 | Meyer Sound Laboratories, Incorporated | Phase inversion filter for correcting low frequency phase distortion in a loudspeaker system |
-
2019
- 2019-06-20 JP JP2020528786A patent/JP7359146B2/ja active Active
- 2019-06-20 EP EP19831112.8A patent/EP3820161A4/de not_active Ceased
- 2019-06-20 WO PCT/JP2019/024440 patent/WO2020008889A1/ja not_active Ceased
Also Published As
| Publication number | Publication date |
|---|---|
| WO2020008889A1 (ja) | 2020-01-09 |
| EP3820161A4 (de) | 2021-11-24 |
| JPWO2020008889A1 (ja) | 2021-07-08 |
| JP7359146B2 (ja) | 2023-10-11 |
Similar Documents
| Publication | Publication Date | Title |
|---|---|---|
| US9407993B2 (en) | Latency reduction in transposer-based virtual bass systems | |
| US20170373656A1 (en) | Loudspeaker-room equalization with perceptual correction of spectral dips | |
| JP7726972B2 (ja) | スペクトル直交オーディオ成分を用いたサブバンド空間処理およびクロストーク処理 | |
| KR20200080344A (ko) | 크로스토크 처리 b 체인 | |
| JP7113920B2 (ja) | 空間オーディオ信号のクロストーク処理のためのスペクトル欠陥補償 | |
| US8295508B2 (en) | Processing an audio signal | |
| US9324334B2 (en) | Signal processing apparatus, signal processing method, and program | |
| EP2907324A1 (de) | System und verfahren zur reduzierung der latenzzeit in transposerbasierten virtuellen basssystemen | |
| CN109791773B (zh) | 音频输出产生系统、音频通道输出方法和计算机可读介质 | |
| EP3820161A1 (de) | Audiosignalverarbeitungsvorrichtung und -verfahren, impulsantworterzeugungsvorrichtung und -verfahren und programm | |
| JP5651813B1 (ja) | 音声信号処理装置、および音声信号処理方法 | |
| EP4525486A1 (de) | Informationsverarbeitungsvorrichtung, informationsverarbeitungsverfahren und programm | |
| WO2023009377A1 (en) | A method of processing audio for playback of immersive audio | |
| JP7820540B2 (ja) | スピーカを介して仮想音響を再生するためのシステムにおけるスペクトル歪みを低減するための装置及び方法 | |
| US20130195286A1 (en) | Equalization of an Audio Signal | |
| Norcross | Evaluation and alternative regularization of audio equalization techniques | |
| WO2024134805A1 (ja) | 再生音補正装置、再生音補正方法、プログラム | |
| BR112018071019B1 (pt) | Aparelho e método para fornecer zonas individuais de som |
Legal Events
| Date | Code | Title | Description |
|---|---|---|---|
| STAA | Information on the status of an ep patent application or granted ep patent |
Free format text: STATUS: THE INTERNATIONAL PUBLICATION HAS BEEN MADE |
|
| PUAI | Public reference made under article 153(3) epc to a published international application that has entered the european phase |
Free format text: ORIGINAL CODE: 0009012 |
|
| STAA | Information on the status of an ep patent application or granted ep patent |
Free format text: STATUS: REQUEST FOR EXAMINATION WAS MADE |
|
| 17P | Request for examination filed |
Effective date: 20210204 |
|
| AK | Designated contracting states |
Kind code of ref document: A1 Designated state(s): AL AT BE BG CH CY CZ DE DK EE ES FI FR GB GR HR HU IE IS IT LI LT LU LV MC MK MT NL NO PL PT RO RS SE SI SK SM TR |
|
| RAP3 | Party data changed (applicant data changed or rights of an application transferred) |
Owner name: SONY GROUP CORPORATION |
|
| RIC1 | Information provided on ipc code assigned before grant |
Ipc: H04R 3/04 20060101AFI20210719BHEP Ipc: H04R 5/033 20060101ALI20210719BHEP Ipc: H04S 1/00 20060101ALI20210719BHEP |
|
| DAV | Request for validation of the european patent (deleted) | ||
| DAX | Request for extension of the european patent (deleted) | ||
| REG | Reference to a national code |
Ref country code: DE Ref legal event code: R079 Free format text: PREVIOUS MAIN CLASS: H04R0003000000 Ipc: H04R0003040000 |
|
| A4 | Supplementary search report drawn up and despatched |
Effective date: 20211026 |
|
| RIC1 | Information provided on ipc code assigned before grant |
Ipc: H04S 1/00 20060101ALI20211020BHEP Ipc: H04R 5/033 20060101ALI20211020BHEP Ipc: H04R 3/04 20060101AFI20211020BHEP |
|
| STAA | Information on the status of an ep patent application or granted ep patent |
Free format text: STATUS: EXAMINATION IS IN PROGRESS |
|
| 17Q | First examination report despatched |
Effective date: 20230821 |
|
| REG | Reference to a national code |
Ref country code: DE Ref legal event code: R003 |
|
| STAA | Information on the status of an ep patent application or granted ep patent |
Free format text: STATUS: THE APPLICATION HAS BEEN REFUSED |
|
| 18R | Application refused |
Effective date: 20251107 |