WO2023092505A1 - 一种立体声音频信号处理方法、装置、编码设备、解码设备及存储介质 - Google Patents
一种立体声音频信号处理方法、装置、编码设备、解码设备及存储介质 Download PDFInfo
- Publication number
- WO2023092505A1 WO2023092505A1 PCT/CN2021/133722 CN2021133722W WO2023092505A1 WO 2023092505 A1 WO2023092505 A1 WO 2023092505A1 CN 2021133722 W CN2021133722 W CN 2021133722W WO 2023092505 A1 WO2023092505 A1 WO 2023092505A1
- Authority
- WO
- WIPO (PCT)
- Prior art keywords
- channel signal
- decorrelation
- processing
- correlation coefficient
- cross
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Ceased
Links
Images
Classifications
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L19/00—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
- G10L19/008—Multichannel audio signal coding or decoding using interchannel correlation to reduce redundancy, e.g. joint-stereo, intensity-coding or matrixing
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F3/00—Input arrangements for transferring data to be processed into a form capable of being handled by the computer; Output arrangements for transferring data from processing unit to output unit, e.g. interface arrangements
- G06F3/16—Sound input; Sound output
- G06F3/162—Interface to dedicated audio devices, e.g. audio drivers, interface to CODECs
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L19/00—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
- G10L19/0017—Lossless audio signal coding; Perfect reconstruction of coded audio signal by transmission of coding error
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L19/00—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
- G10L19/04—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using predictive techniques
- G10L19/16—Vocoder architecture
- G10L19/18—Vocoders using multiple modes
- G10L19/22—Mode decision, i.e. based on audio signal content versus external parameters
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L25/00—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
- G10L25/03—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 characterised by the type of extracted parameters
- G10L25/06—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 characterised by the type of extracted parameters the extracted parameters being correlation coefficients
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04S—STEREOPHONIC SYSTEMS
- H04S2420/00—Techniques used stereophonic systems covered by H04S but not provided for in its groups
- H04S2420/03—Application of parametric coding in stereophonic audio systems
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04S—STEREOPHONIC SYSTEMS
- H04S3/00—Systems employing more than two channels, e.g. quadraphonic
- H04S3/008—Systems employing more than two channels, e.g. quadraphonic in which the audio signals are in digital form, i.e. employing more than two discrete digital channels
Definitions
- the present disclosure relates to the field of communication technologies, and in particular to a stereo audio signal processing method, device, encoding device, decoding device and storage medium.
- lossless coding can meet the demands of high-quality audio playback and lossless storage, it is widely used.
- lossless encoding is performed on stereo audio signals, it is necessary to perform decorrelation processing on the stereo audio signals first, so as to improve the encoding compression rate.
- the main method of de-correlation processing is: setting a threshold value between 0 and 1, and calculating the correlation coefficient between the left channel signal and the right channel signal of the current frame of the stereo audio signal, when When the correlation coefficient is greater than the threshold, it indicates that the left channel signal and the right channel signal of the current frame are correlated, that is, the current frame is a correlated signal, and then the left channel signal and the right channel signal of the current frame are subjected to decorrelation processing;
- the correlation coefficient is less than or equal to the threshold, the system considers that the left and right channel signals of the current frame are irrelevant, and processes them as uncorrelated signals, that is, directly determines the current frame of the stereo audio signal as the two-channel signals after decorrelation processing.
- the current frame when it is a related signal, it specifically includes two types of correlation, that is, a partial positive-phase signal and a partial negative-phase signal.
- two types of correlation that is, a partial positive-phase signal and a partial negative-phase signal.
- the decorrelation processing adopted by different correlation forms is different, and the decorrelation processing in the correlation technology can only improve the encoding compression ratio of the partial positive phase signal, but cannot improve the partial reverse phase signal.
- the encoding compression rate of the signal in order to achieve the purpose of improving the compression ratio, the decorrelation processing adopted by different correlation forms is different, and the decorrelation processing in the correlation technology can only improve the encoding compression ratio of the partial positive phase signal, but cannot improve the partial reverse phase signal.
- the encoding compression rate of the signal in order to achieve the purpose of improving the compression ratio, the decorrelation processing adopted by different correlation forms is different, and the decorrelation processing in the correlation technology can only improve the encoding compression ratio of the partial positive phase signal, but cannot improve the partial reverse phase signal
- the present disclosure proposes a stereo audio signal processing method, device, user equipment, network side equipment and storage medium to solve the technical problem of low coding compression rate in the decorrelation processing method in the related art.
- the stereo audio signal processing method proposed in an embodiment of the present disclosure is applied to a coding device, including:
- the first cross-correlation coefficient being less than the first threshold
- the second cross-correlation coefficient of the two-channel signals in response to the first cross-correlation coefficient being smaller than the second cross-correlation coefficient, determining that the flag bit is the first value, and encoding the two-channel signals based on the decorrelation processing code stream and write the flag bit into the encoded code stream, and the value range of the first threshold is (-1, 0).
- the stereo audio signal processing method proposed in another embodiment of the present disclosure is applied to a decoding device, including:
- decorrelation reconstruction is performed on the two-channel signals after decorrelation processing by using a first decorrelation reconstruction manner, and a decorrelation reconstructed audio signal is output.
- the stereo audio signal processing device proposed by the embodiment includes:
- a determination module configured to determine the first cross-correlation coefficient of the left channel signal and the right channel signal of the current frame of the stereo audio signal
- a processing module configured to, in response to the first cross-correlation coefficient being smaller than a first threshold, perform decorrelation processing on the current frame of the stereo audio signal in a first decorrelation processing manner to obtain two-channel signals after decorrelation processing, and calculate the obtained
- the second cross-correlation coefficient of the two-channel signals after the decorrelation processing in response to the first cross-correlation coefficient being smaller than the second cross-correlation coefficient, determine that the flag bit is the first value, and, based on the two after the decorrelation processing
- the coded code stream is obtained from the channel signal and the flag bit is written into the coded code stream, and the value range of the first threshold is (-1, 0).
- the stereo audio signal processing device proposed by the embodiment includes:
- An acquisition module configured to acquire the encoded code stream sent by the encoding device
- a determining module configured to determine the two-channel signal and the flag bit after decorrelation processing based on the encoded code stream
- the processing module is configured to, in response to the first value of the flag bit, perform decorrelation reconstruction on the two-channel signals after decorrelation processing in a first decorrelation reconstruction manner, and output a decorrelation reconstructed audio signal.
- an embodiment provides a communication device, the device includes a processor and a memory, a computer program is stored in the memory, and the processor executes the computer program stored in the memory, so that the The device executes the method provided in the embodiment of the foregoing aspect.
- an embodiment provides a communication device, the device includes a processor and a memory, a computer program is stored in the memory, and the processor executes the computer program stored in the memory, so that the The device executes the method provided in the above embodiment of another aspect.
- a communication device provided by an embodiment of another aspect of the present disclosure includes: a processor and an interface circuit;
- the interface circuit is used to receive code instructions and transmit them to the processor
- the processor is configured to run the code instructions to execute the method provided in one embodiment.
- a communication device provided by an embodiment of another aspect of the present disclosure includes: a processor and an interface circuit;
- the interface circuit is used to receive code instructions and transmit them to the processor
- the processor is configured to run the code instructions to execute the method provided in another embodiment.
- a computer-readable storage medium provided by another embodiment of the present disclosure is used to store instructions, and when the instructions are executed in response to the instructions, the method provided by the first embodiment is implemented.
- a computer-readable storage medium provided by an embodiment of another aspect of the present disclosure is used for storing instructions, and when the instructions are executed in response to the instructions, the method provided by another embodiment is implemented.
- the first step is to determine the left channel signal and the right channel signal of the current frame of the stereo audio signal.
- a cross-correlation coefficient and, in response to when the first cross-correlation coefficient is less than the first threshold, use the first decorrelation processing method to perform decorrelation processing on the current frame of the stereo audio signal to obtain the two-channel signal after decorrelation processing, after that,
- the second cross-correlation coefficient of the two-channel signal after decorrelation processing will be calculated, and when the first cross-correlation coefficient is smaller than the second cross-correlation coefficient, the flag bit is determined to be the first value, and the two-channel signal after decorrelation processing Obtain the encoded code stream and write the flag bit into the encoded code stream.
- the value range of the first threshold is (-1, 0), thus, when the first cross-correlation coefficient is smaller than the first threshold, it indicates that the current frame of the stereo audio signal is out of phase signal, at this time, the first de-correlation processing method will be adopted correspondingly for the partial anti-phase signal, so as to ensure the subsequent compression rate, then the embodiment of the present disclosure provides a judgment method and de-correlation processing method for the partial anti-phase signal Correlation processing method greatly improves the coding compression rate of partial anti-phase signal.
- Fig. 1a is a schematic flowchart of a stereo audio signal processing method provided by an embodiment of the present disclosure
- Fig. 1b is a flow chart of obtaining an encoded code stream based on two-channel signals after decorrelation processing provided by an embodiment of the present disclosure
- FIG. 2 is a schematic flowchart of a stereo audio signal processing method provided by an embodiment of the present disclosure
- Fig. 3a is a schematic flowchart of a stereo audio signal processing method provided by an embodiment of the present disclosure
- Fig. 3b is a schematic flowchart of a stereo audio signal processing method provided by an embodiment of the present disclosure
- FIG. 3c is a schematic flowchart of a stereo audio signal processing method provided by an embodiment of the present disclosure.
- Fig. 4a is a schematic flowchart of a stereo audio signal processing method provided by an embodiment of the present disclosure
- Fig. 4b is a flow chart of determining a two-channel signal after decorrelation processing based on an encoded code stream provided by an embodiment of the present disclosure
- FIG. 5 is a schematic flowchart of a stereo audio signal processing method provided by an embodiment of the present disclosure
- FIG. 6 is a schematic flowchart of a stereo audio signal processing method provided by an embodiment of the present disclosure
- FIG. 7 is a schematic structural diagram of a stereo audio signal processing device provided by an embodiment of the present disclosure.
- FIG. 8 is a schematic structural diagram of a stereo audio signal processing device provided by an embodiment of the present disclosure.
- Fig. 9 is a block diagram of a user equipment provided by an embodiment of the present disclosure.
- Fig. 10 is a block diagram of a network side device provided by an embodiment of the present disclosure.
- first, second, third, etc. may use the terms first, second, third, etc. to describe various information, the information should not be limited to these terms. These terms are only used to distinguish information of the same type from one another. For example, without departing from the scope of the embodiments of the present disclosure, first information may also be called second information, and similarly, second information may also be called first information.
- first information may also be called second information
- second information may also be called first information.
- the words "if” and "if” as used herein may be interpreted as “at” or "when” or "in response to a determination.”
- Fig. 1a is a schematic flowchart of a method for processing a stereo audio signal provided by an embodiment of the present disclosure. The method is executed by an encoding device. As shown in Fig. 1a, the method for processing a stereo audio signal may include the following steps:
- Step 101 Determine a first cross-correlation coefficient between a left channel signal and a right channel signal of a current frame of a stereo audio signal.
- a cross-correlation analysis may be performed on the current frame of the stereo audio signal to obtain the left channel signal and the right channel signal of the current frame The first cross-correlation coefficient of the signal.
- the method for determining the first cross-correlation coefficient of the left channel signal and the right channel signal of the current frame of the stereo audio signal may include:
- ⁇ (LR) is the cross-correlation coefficient of current frame left channel signal and right channel signal
- L (n) is the nth sampling point of current frame left channel signal, is the average value of all samples of the left channel signal of the current frame
- R(n) is the nth sample point of the right channel signal of the current frame, is the average value of all samples of the right channel signal of the current frame
- N is the total number of samples of the left channel signal or the right channel signal of the current frame, that is, the frame length of the current frame.
- Step 102 In response to the fact that the first cross-correlation coefficient is smaller than the first threshold, the first decorrelation processing method is used to perform decorrelation processing on the current frame of the stereo audio signal to obtain the two-channel signal after decorrelation processing, and calculate the two-channel signal after decorrelation processing
- the second cross-correlation coefficient of the signal in response to the first cross-correlation coefficient being smaller than the second cross-correlation coefficient, determine the flag bit as the first value, and obtain the coded code stream based on the two-channel signal after decorrelation processing and set the flag bits into the coded stream.
- the first threshold may be preset, and the value range of the first threshold is (-1, 0).
- the first threshold may be between [-0.5, -0.1].
- the first threshold may be -0.3.
- the first decorrelation processing method may be used to perform decorrelation processing on the current frame of the stereo audio signal to obtain the two-channel signal after decorrelation processing.
- the first decorrelation processing manner may be first sum difference downmix processing.
- the first sum and difference downmix processing may include: processing the left channel signal and the right channel signal based on a formula to obtain a main channel signal and a secondary channel signal; Formula 1 is:
- Mid(n) is the main channel signal in the two-channel signal after decorrelation processing
- Sid(n) is the sub-channel signal in the two-channel signal after decorrelation processing
- L(n) is the left channel signal
- R(n) is the right channel signal.
- the embodiment of the present disclosure will determine whether the current frame of the stereo audio signal is a partial anti-phase signal, and, in response to determining that the current frame of the stereo audio signal is a partial anti-phase signal, adopt the first decorrelation processing method corresponding to the partial anti-phase signal.
- the decorrelation process is performed on the current frame of the stereo audio signal to obtain the two-channel signal after the decorrelation process, thereby greatly improving the coding compression rate of the partial anti-phase signal.
- the correlation of the two-channel signals after decorrelation processing obtained after decorrelation processing on the audio signal is greater than or equal to the correlation of the two-channel signals before decorrelation processing.
- Correlation that is, the de-correlation process does not achieve the purpose of "de-correlation”. Therefore, in one embodiment of the present disclosure, after performing the first decorrelation processing on the current frame to obtain the two-channel signals after decorrelation processing, the second correlation between the two-channel signals after de-correlation processing can be further calculated.
- Correlation coefficient and determine the first correlation coefficient by determining the size relationship between the second correlation coefficient (that is, the correlation coefficient after decorrelation processing) and the first correlation coefficient (that is, the correlation coefficient before decorrelation processing). Whether the de-correlation process achieves the purpose of "de-correlation".
- the method for calculating the second cross-correlation coefficient of the two-channel signals after de-correlation processing may include:
- the Formula 4 may be:
- ⁇ (MS) is the second cross-correlation coefficient
- Mid (n) is the nth sampling point of the main channel signal in the two-channel signal after decorrelation processing, is the average value of all sample points of the main channel signal in the two-channel signal after decorrelation processing
- Sid(n) is the nth sample point of the sub-channel signal in the two-channel signal after decorrelation processing, is the average value of all samples of the sub-channel signal in the two-channel signal after decorrelation processing
- N is the total number of samples of the left channel signal or the right channel signal of the current frame, that is, the frame length of the current frame.
- the first decorrelation processing manner in response to the fact that the first correlation coefficient is smaller than the second correlation coefficient, it is considered that the first decorrelation processing manner achieves the purpose of "decrelation".
- the magnitude of the correlation between signals is positively correlated with the absolute value of the correlation coefficient, and, for a partial anti-phase signal with a negative correlation coefficient, the smaller the value of the correlation coefficient, the greater the correlation coefficient The larger the absolute value of , the better the negative correlation.
- the first cross-correlation coefficient is smaller than the second cross-correlation coefficient, it means that the first decorrelation processing method is executed.
- the negative correlation of the first two channel signals is higher than the negative correlation of the two channel signals after the first decorrelation processing is performed, so it can be determined that the first decorrelation processing has achieved the purpose of "decorrelation".
- Fig. 1b is a flow chart of obtaining an encoded code stream based on the two-channel signal after decorrelation processing provided by the embodiment of the present disclosure. As shown in Fig. 1b, based on the decorrelation
- the method to obtain the coded code stream after processing the two-channel signal can be as follows:
- the two-channel signals after decorrelation processing are divided into sub-bands by using integer lifting wavelet decomposition to obtain the sub-band signals, and the LPC (Linear Prediction Coefficient, linear prediction coefficient) parameters are calculated and quantized for the two-channel signals after decorrelation processing to obtain Quantify the LPC parameters, and then use the linear predictor to predict each sub-band signal based on the quantized LPC parameters, generate a prediction residual signal, use the preprocessor to normalize the prediction residual signal, and generate a normalized output signal, LSB (Least Significant Bit, least significant bit) signal and signal sign bit.
- LPC Linear Prediction Coefficient, linear prediction coefficient
- entropy encoder uses the entropy encoder to perform entropy encoding on the normalized output signals corresponding to each sub-band signal to generate an encoded bit stream, and then perform code stream multiplexing on the encoded bit stream, LSB signal, signal symbol bit, quantized LPC parameters, and wavelet edge information Get the encoded code stream.
- the flag bit in response to determining the encoded code stream, it may be determined that the flag bit is a first value (for example, it may be 0), and the first value may be used to indicate the removal method adopted by the encoding device.
- the correlation processing is the first de-correlation processing mode, and the flag bit can be written into the encoded code stream and sent to the decoding device, so that the decoding device can perform decorrelation reconstruction based on the flag bit using a corresponding de-correlation reconstruction mode.
- the first thresholds corresponding to different frames of the stereo audio signal may be different.
- the first cross-correlation coefficient of the left channel signal and the right channel signal of the current frame is compared with the first threshold corresponding to the current frame.
- the stereo audio signal is processed based on the cross-correlation between the left channel signal and the right channel signal of the current frame of the stereo audio signal.
- the stereo audio signal can also be processed based on the phase between the left channel signal and the right channel signal of the current frame of the stereo audio signal. Specifically, the left channel signal of the current frame can be determined first. The first phase between the channel signal and the right channel signal.
- the first phase between the left channel signal and the right channel signal of the current frame is within the first interval, it is determined that the current frame of the stereo audio signal is a partial anti-phase signal , using the first de-correlation processing method to perform de-correlation processing on the current frame of the stereo audio signal to obtain the two-channel signal after the de-correlation processing, calculate the second phase of the two-channel signal after the de-correlation processing, and respond to the fact that the first phase is greater than the second Phase, indicating that the first de-correlation processing method achieves the purpose of "de-correlation", and the flag bit is determined as the first value, and the encoded code stream is obtained based on the two-channel signal after the decorrelation processing, and the flag bit is written into the coded code The stream is sent to the decoding device.
- the first interval may be [135°, 180°].
- the first correlation coefficient of the left channel signal and the right channel signal of the current frame of the stereo audio signal will be determined first, and when the first correlation coefficient When the relationship coefficient is less than the first threshold, the first decorrelation processing method is used to perform decorrelation processing on the current frame of the stereo audio signal to obtain the two-channel signal after decorrelation processing.
- the second Two cross-correlation coefficients when the first cross-correlation coefficient is less than the second cross-correlation coefficient, determine the flag bit as the first value, and obtain the coded stream based on the two-channel signal after decorrelation processing and write the flag bit into the coded stream sent to the decoding device.
- the value range of the first threshold is (-1, 0), thus, when the first cross-correlation coefficient is smaller than the first threshold, it indicates that the current frame of the stereo audio signal is a partial anti-phase signal , at this time, the first de-correlation processing method will be adopted correspondingly for the partial anti-phase signal, so as to ensure the subsequent compression rate, then the embodiment of the present disclosure provides a judgment method and decorrelation for the partial anti-phase signal The processing method greatly improves the encoding compression rate of the partial anti-phase signal.
- Fig. 2 is a schematic flowchart of a method for processing a stereo audio signal provided by an embodiment of the present disclosure. The method is executed by an encoding device. As shown in Fig. 2 , the method for processing a stereo audio signal may include the following steps:
- Step 201 Determine a first cross-correlation coefficient between the left channel signal and the right channel signal of the current frame of the stereo audio signal.
- step 201 For the relevant introduction of step 201, reference may be made to the description of the foregoing embodiments, and details are not described in this embodiment of the present disclosure.
- Step 202 In response to the fact that the first cross-correlation coefficient is greater than the second threshold, use the second decorrelation processing method to perform decorrelation processing on the current frame of the stereo audio signal to obtain the two-channel signal after decorrelation processing, and calculate the two-channel signal after decorrelation processing
- the third cross-correlation coefficient of the signal in response to the first cross-correlation coefficient being greater than the third cross-correlation coefficient, determine that the flag bit is the second value, and obtain the encoded code stream based on the two-channel signal after decorrelation processing and write the flag bit to Enter the encoded code stream.
- the second threshold may be preset, and the value range of the second threshold may be (0, 1).
- the second threshold may be between [0.1, 0.5].
- the second threshold may be 0.3.
- the second decorrelation processing method in response to the second cross-correlation coefficient being greater than the first threshold, it indicates that there is a positive correlation between the left channel signal and the right channel signal in the current frame, that is, the current frame is a partial positive phase signal,
- the second decorrelation processing method may be used to perform decorrelation processing on the current frame of the stereo audio signal to obtain the two-channel signal after decorrelation processing.
- the second decorrelation processing manner may be second sum difference downmix processing.
- the second sum and difference downmix processing may include: processing the left channel signal and the right channel signal based on Formula 2 to obtain the main channel signal and the secondary channel signal;
- Formula 2 is:
- Mid(n) is the main channel signal in the two-channel signal after decorrelation processing
- Sid(n) is the sub-channel signal in the two-channel signal after decorrelation processing
- L(n) is the left channel signal
- R(n) is the right channel signal.
- the embodiment of the present disclosure will determine whether the current frame of the stereo audio signal is a positive phase signal, and, in response to determining that the current frame of the stereo audio signal is a positive phase signal, adopt the second decorrelation processing method corresponding to the positive phase signal
- the decorrelation process is performed on the current frame of the stereo audio signal to obtain the two-channel signal after the decorrelation process, thereby greatly improving the coding compression rate of the positive-phase signal.
- the correlation of the two-channel signals obtained after decorrelation processing on the audio signal is greater than or equal to the correlation before the decorrelation processing
- the de-correlation processing does not achieve the purpose of "decrelation”. Therefore, in an embodiment of the present disclosure, after performing the second decorrelation processing on the current frame to obtain the two-channel signals after decorrelation processing, the third correlation between the two-channel signals after de-correlation processing can be further calculated.
- Correlation coefficient and determine the second correlation coefficient by determining the size relationship between the third correlation coefficient (that is, the correlation coefficient after decorrelation processing) and the first correlation coefficient (that is, the correlation coefficient before decorrelation processing) Whether the de-correlation process achieves the purpose of "de-correlation".
- the method for calculating the third cross-correlation coefficient of the two-channel signals after de-correlation processing may include:
- the Formula 4 may be:
- ⁇ (MS) is the third cross-correlation coefficient
- Mid (n) is the nth sampling point of the main channel signal in the two-channel signal after decorrelation processing, is the average value of all sample points of the main channel signal in the two-channel signal after decorrelation processing
- Sid(n) is the nth sample point of the sub-channel signal in the two-channel signal after decorrelation processing, is the average value of all samples of the sub-channel signal in the two-channel signal after decorrelation processing
- N is the total number of samples of the left channel signal or the right channel signal of the current frame, that is, the frame length of the current frame.
- the second decorrelation processing manner in response to the fact that the first correlation coefficient is greater than the third correlation coefficient, it is considered that the second decorrelation processing manner achieves the purpose of "decrelation".
- the magnitude of the correlation between signals is positively correlated with the absolute value of the correlation coefficient, and, for a partial positive-phase signal with a positive correlation coefficient, the smaller the correlation coefficient value, the greater the correlation coefficient. The smaller the absolute value of the coefficient, the worse the positive correlation.
- the current frame of the stereo audio signal is a positive-phase signal.
- the first cross-correlation coefficient is greater than the third cross-correlation coefficient, it means that the positive correlation of the two-channel signals before the second decorrelation processing is higher than that of the two-channel signals after the second decorrelation processing. Positive correlation, so that it can be determined that the second decorrelation processing method achieves the purpose of "decrelation".
- the coded stream in response to determining that the second decorrelation processing method achieves the goal of "decorrelation", the coded stream may be obtained based on the two-channel signal after decorrelation processing, and the flag bit is determined as The second value (for example, it can be 1), which can be used to indicate that the de-correlation processing adopted by the encoding end is the second de-correlation processing method, and the flag bit can be written into the encoded code stream and sent to the decoding device , so that the decoding device can use the corresponding decorrelation reconstruction method to perform decorrelation reconstruction based on the flag bit.
- the second value for example, it can be 1
- the second thresholds corresponding to different frames of the stereo audio signal may be different.
- the first cross-correlation coefficient of the left channel signal and the right channel signal of the current frame may be compared with the second threshold corresponding to the current frame.
- the stereo audio signal is processed based on the cross-correlation between the left channel signal and the right channel signal of the current frame of the stereo audio signal.
- the stereo audio signal can also be processed based on the phase between the left channel signal and the right channel signal of the current frame of the stereo audio signal. Specifically, the left channel signal of the current frame can be determined first. The first phase between the channel signal and the right channel signal.
- the second de-correlation processing method uses the second de-correlation processing method to perform de-correlation processing on the current frame of the stereo audio signal to obtain the two-channel signal after the de-correlation processing, and calculate the third phase of the two-channel signal after the de-correlation processing, and respond to the fact that the first phase is greater than the third Phase, indicating that the second de-correlation processing method achieves the purpose of "de-correlation", and the flag bit is determined as the second value, and the encoded code stream is obtained based on the two-channel signal after the decorrelation processing, and the flag bit is written into the coded code The stream is sent to the decoding device.
- the second interval may be [0°, 45°].
- the first correlation coefficient of the left channel signal and the right channel signal of the current frame of the stereo audio signal will be determined first, and when the first correlation coefficient When the relationship coefficient is less than the first threshold, the first decorrelation processing method is used to perform decorrelation processing on the current frame of the stereo audio signal to obtain the two-channel signal after decorrelation processing.
- the second Two cross-correlation coefficients when the first cross-correlation coefficient is less than the second cross-correlation coefficient, determine the flag bit as the first value, and obtain the coded stream based on the two-channel signal after decorrelation processing and write the flag bit into the coded stream sent to the decoding device.
- the value range of the first threshold is (-1, 0), thus, when the first cross-correlation coefficient is smaller than the first threshold, it indicates that the current frame of the stereo audio signal is a partial anti-phase signal , at this time, the first de-correlation processing method will be adopted correspondingly for the partial anti-phase signal, so as to ensure the subsequent compression rate, then the embodiment of the present disclosure provides a judgment method and decorrelation for the partial anti-phase signal The processing method greatly improves the encoding compression rate of the partial anti-phase signal.
- Fig. 3a is a schematic flowchart of a stereo audio signal processing method provided by an embodiment of the present disclosure, the method is executed by an encoding device, as shown in Fig. 3a, the stereo audio signal processing method may include the following steps:
- Step 301a Determine a first cross-correlation coefficient between the left channel signal and the right channel signal of the current frame of the stereo audio signal.
- step 301a For the relevant introduction of step 301a, reference may be made to the description of the foregoing embodiments, and details are not described in this embodiment of the present disclosure.
- Step 302a In response to the first cross-correlation coefficient being greater than or equal to the first threshold and less than or equal to the second threshold, directly determine the current frame of the stereo audio signal as the two-channel signal after decorrelation processing, and determine the flag bit as the third value, and, The coded code stream is obtained based on the two-channel signals after decorrelation processing, and the flag bit is written into the coded code stream.
- the first threshold may be preset, and the value range of the first threshold is (-1, 0).
- the first threshold may be between [-0.5, -0.1].
- the first threshold may be -0.3.
- the second threshold may be preset, and the value range of the second threshold may be (0, 1).
- the second threshold may be between [0.1, 0.5].
- the second threshold may be 0.3.
- the first cross-correlation coefficient when the first cross-correlation coefficient is greater than or equal to the first threshold and less than or equal to the second threshold, it indicates that there is no correlation between the left channel signal and the right channel signal of the current frame.
- the flag bit is the third value (for example, it can be 2)
- the third value can be used to indicate that the encoder does not use decorrelation processing, and the flag bit can be written into the encoded code stream and sent to the decoding device, so that the decoding device can be based on this The flag bit is used to perform corresponding decorrelation reconstruction.
- the stereo audio signal is processed based on the cross-correlation between the left channel signal and the right channel signal of the current frame of the stereo audio signal.
- the stereo audio signal can also be processed based on the phase between the left channel signal and the right channel signal of the current frame of the stereo audio signal. Specifically, the left channel signal of the current frame can be determined first.
- the first phase between the channel signal and the right channel signal if the first phase between the left channel signal and the right channel signal of the current frame is in the third interval, it is determined that the current frame of the stereo audio signal is an irrelevant signal, Directly determine the current frame of the stereo audio signal as the two-channel signal after decorrelation processing, determine the flag bit as the third value, and obtain the coded stream based on the two-channel signal after decorrelation processing, and write the flag bit into the code
- the code stream is sent to the decoding device.
- the third interval may be (45°, 135°).
- the first cross-correlation coefficient of the left channel signal and the right channel signal of the current frame of the stereo audio signal will be determined first, and, in response to the first
- the first decorrelation processing method is used to perform decorrelation processing on the current frame of the stereo audio signal to obtain the two-channel signal after decorrelation processing, and then the first decorrelation processing of the two-channel signal after decorrelation processing is calculated.
- Two cross-correlation coefficients in response to the first cross-correlation coefficient being smaller than the second cross-correlation coefficient, determine the flag bit as the first value, and obtain the encoded code stream based on the two-channel signal after decorrelation processing, and write the flag bit into the coded code stream sent to the decoding device.
- the value range of the first threshold is (-1, 0), thus, in response to the first cross-correlation coefficient being less than the first threshold, it indicates that the current frame of the stereo audio signal is a partial anti-phase signal , at this time, the first de-correlation processing method will be adopted correspondingly for the partial anti-phase signal, so as to ensure the subsequent compression rate, then the embodiment of the present disclosure provides a judgment method and decorrelation for the partial anti-phase signal The processing method greatly improves the encoding compression rate of the partial anti-phase signal.
- Fig. 3b is a schematic flowchart of a method for processing a stereo audio signal provided by an embodiment of the present disclosure. The method is executed by an encoding device. As shown in Fig. 3b, the method for processing a stereo audio signal may include the following steps:
- Step 301b Determine a first cross-correlation coefficient between the left channel signal and the right channel signal of the current frame of the stereo audio signal.
- step 301b For the related introduction of step 301b, reference may be made to the description of the foregoing embodiments, and details are not described here in the embodiments of the present disclosure.
- Step 302b In response to the fact that the first cross-correlation coefficient is less than the first threshold, the first decorrelation processing method is used to perform decorrelation processing on the current frame of the stereo audio signal to obtain the two-channel signal after decorrelation processing, and calculate the two-channel signal after decorrelation processing
- the second cross-correlation coefficient of the signal in response to the first cross-correlation coefficient being greater than or equal to the second cross-correlation coefficient, directly determine the current frame of the stereo audio signal as the two-channel signal after decorrelation processing, and determine the flag bit as the third value, and , based on the two-channel signal after decorrelation processing, the coded code stream is obtained, and the flag bit is written into the coded code stream.
- the first decorrelation processing method in response to the fact that the first correlation coefficient is less than the first threshold and the first correlation coefficient is greater than or equal to the second correlation coefficient, it is considered that the first decorrelation processing method has not achieved "decrelation "the goal of.
- the first cross-correlation coefficient when the first cross-correlation coefficient is smaller than the first threshold, it indicates that the current frame of the stereo audio signal is a partial anti-phase signal (that is, the smaller the correlation coefficient, the higher the negative correlation).
- the first cross-correlation coefficient that is, the cross-correlation coefficient before decorrelation processing
- the second cross-correlation coefficient that is, the cross-correlation coefficient after decorrelation processing
- the current frame of the stereo audio signal can be directly determined as the two-channel signal after decorrelation processing, and the flag bit can be determined as the third value (for example, it can be 2), and, based on the two-channel signal after decorrelation processing, the coded code stream is obtained, and the flag bit is written into the coded code stream and sent to the decoding device, so as to ensure the subsequent coding compression rate.
- the stereo audio signal is processed based on the cross-correlation between the left channel signal and the right channel signal of the current frame of the stereo audio signal.
- the stereo audio signal can also be processed based on the phase between the left channel signal and the right channel signal of the current frame of the stereo audio signal. Specifically, the left channel signal of the current frame can be determined first. The first phase between the channel signal and the right channel signal.
- the first phase between the left channel signal and the right channel signal of the current frame is within the first interval, it is determined that the current frame of the stereo audio signal is a partial anti-phase signal , using the first de-correlation processing method to perform de-correlation processing on the current frame of the stereo audio signal to obtain the two-channel signal after the de-correlation processing, and calculate the second phase of the two-channel signal after the de-correlation processing, and respond to the first phase being less than or equal to the second phase
- Two phases, indicating that the first de-correlation processing method does not achieve the purpose of "decorrelation” then directly determine the current frame of the stereo audio signal as the two-channel signal after the de-correlation process, determine the flag bit as the third value, and, based on the de-correlation process After correlation processing, the two-channel signals are coded streams, and flag bits are written into the coded streams and sent to the decoding device.
- the first interval may be [135°, 180°].
- the first correlation coefficient of the left channel signal and the right channel signal of the current frame of the stereo audio signal will be determined first, and when the first correlation coefficient When the relationship coefficient is less than the first threshold, the first decorrelation processing method is used to perform decorrelation processing on the current frame of the stereo audio signal to obtain the two-channel signal after decorrelation processing.
- the second Two cross-correlation coefficients in response to the first cross-correlation coefficient being smaller than the second cross-correlation coefficient, determine the flag bit as the first value, and obtain the encoded code stream based on the two-channel signal after decorrelation processing, and write the flag bit into the coded code stream sent to the decoding device.
- the value range of the first threshold is (-1, 0), thus, in response to the first cross-correlation coefficient being less than the first threshold, it indicates that the current frame of the stereo audio signal is a partial anti-phase signal , at this time, the first de-correlation processing method will be adopted correspondingly for the partial anti-phase signal, so as to ensure the subsequent compression rate, then the embodiment of the present disclosure provides a judgment method and decorrelation for the partial anti-phase signal The processing method greatly improves the encoding compression rate of the partial anti-phase signal.
- Fig. 3c is a schematic flowchart of a method for processing a stereo audio signal provided by an embodiment of the present disclosure. The method is executed by an encoding device. As shown in Fig. 3c, the method for processing a stereo audio signal may include the following steps:
- Step 301c Determine a first cross-correlation coefficient between the left channel signal and the right channel signal of the current frame of the stereo audio signal.
- step 301c For the related introduction of step 301c, reference may be made to the description of the foregoing embodiments, and the embodiments of the present disclosure are not described in detail here.
- Step 302c In response to the fact that the first cross-correlation coefficient is greater than the second threshold, use the second decorrelation processing method to perform decorrelation processing on the current frame of the stereo audio signal to obtain the two-channel signal after decorrelation processing, and calculate the two-channel signal after decorrelation processing
- the third cross-correlation coefficient of the signal in response to the first cross-correlation coefficient being less than or equal to the third cross-correlation coefficient, directly determine the current frame of the stereo audio signal as the two-channel signal after decorrelation processing, and determine the flag bit as the third value, and , based on the two-channel signal after decorrelation processing, the coded code stream is obtained, and the flag bit is written into the coded code stream.
- the first decorrelation processing method in response to when the first correlation coefficient is greater than the second threshold and the first correlation coefficient is less than or equal to the third correlation coefficient, it is considered that the first decorrelation processing method has not reached the "de-correlation relevant" purposes.
- the first cross-correlation coefficient is greater than the second threshold, it indicates that the current frame of the stereo audio signal is a positive-phase signal (that is, the larger the correlation coefficient, the higher the positive correlation).
- the first cross-correlation coefficient that is, the cross-correlation coefficient before decorrelation processing
- the third cross-correlation coefficient that is, the cross-correlation coefficient after decorrelation processing
- the current frame of the stereo audio signal can be directly determined as the two-channel signal after decorrelation processing, and the flag bit can be determined as the third value (for example, it can be is 2), and, based on the two-channel signal after decorrelation processing, the coded code stream is obtained, and the flag bit is written into the coded code stream and sent to the decoding device.
- the stereo audio signal is processed based on the cross-correlation between the left channel signal and the right channel signal of the current frame of the stereo audio signal.
- the stereo audio signal can also be processed based on the phase between the left channel signal and the right channel signal of the current frame of the stereo audio signal. Specifically, the left channel signal of the current frame can be determined first. The first phase between the channel signal and the right channel signal.
- the first phase between the left channel signal and the right channel signal of the current frame is within the second interval, it is determined that the current frame of the stereo audio signal is a partial positive phase signal , using the second decorrelation processing method to perform decorrelation processing on the current frame of the stereo audio signal to obtain the two-channel signal after decorrelation processing, calculate the third phase of the two-channel signal after decorrelation processing, and respond to the first phase being less than or equal to the first phase
- Three phases indicating that the second de-correlation processing method does not achieve the purpose of "decorrelation”
- directly determine the current frame of the stereo audio signal as the two-channel signal after the de-correlation process determine the flag bit as the third value, and, based on de-correlation After correlation processing, the two-channel signals are coded streams, and flag bits are written into the coded streams and sent to the decoding device.
- the first interval may be [0°, 45°].
- the first cross-correlation coefficient of the left channel signal and the right channel signal of the current frame of the stereo audio signal will be determined first, and, in response to the first
- the first decorrelation processing method is used to perform decorrelation processing on the current frame of the stereo audio signal to obtain the two-channel signal after decorrelation processing, and then the first decorrelation processing of the two-channel signal after decorrelation processing is calculated.
- Two cross-correlation coefficients in response to the first cross-correlation coefficient being smaller than the second cross-correlation coefficient, determine the flag bit as the first value, and obtain the encoded code stream based on the two-channel signal after decorrelation processing, and write the flag bit into the coded code stream sent to the decoding device.
- the value range of the first threshold is (-1, 0), thus, when the first cross-correlation coefficient is smaller than the first threshold, it indicates that the current frame of the stereo audio signal is out of phase signal, at this time, the first de-correlation processing method will be adopted correspondingly for the partial anti-phase signal, so as to ensure the subsequent compression rate, then the embodiment of the present disclosure provides a judgment method and de-correlation processing method for the partial anti-phase signal Correlation processing method greatly improves the coding compression rate of partial anti-phase signal.
- Fig. 4a is a schematic flowchart of a method for processing a stereo audio signal provided by an embodiment of the present disclosure. The method is executed by a decoding device. As shown in Fig. 4a, the method for processing a stereo audio signal may include the following steps:
- Step 401 Obtain an encoded code stream sent by an encoding device.
- Step 402 Determine the two-channel signal and the flag bit after decorrelation processing based on the coded code stream.
- FIG. 4b is a flow chart of determining the two-channel signal after decorrelation processing based on the encoded code stream provided by the embodiment of the present disclosure. As shown in FIG. 4b, the two-channel signal after the decorrelation processing is determined based on the encoded code stream.
- the method can be:
- the coded stream After obtaining the coded stream, first parse the coded stream to obtain coded bit stream, flag bit, LSB signal, sign bit signal, quantized LPC parameters and wavelet edge information, and then use entropy decoder to entropy decode the coded bit stream to obtain The decoded signal is then processed by a post-processor based on the LSB signal and the sign bit signal to generate a prediction residual. Afterwards, the linear predictor is used to reconstruct the prediction residual according to the quantized LPC parameters to generate the sub-band signals, and then the integer lifting wavelet is used to reconstruct the sub-band signals based on the wavelet edge information to obtain the two-channel after decorrelation processing Signal.
- Step 403 in response to the first value of the flag bit, perform decorrelation reconstruction on the two-channel signals after decorrelation processing by using the first decorrelation reconstruction method, and output the decorrelation reconstructed audio signal.
- the flag bit is the first value (for example, it may be 0)
- the first de-correlation reconstruction mode corresponding to the first de-correlation processing mode may be used to perform de-correlation reconstruction on the two-channel signals after the de-correlation processing, and output the de-correlation reconstructed audio signal.
- the two-channel signal after decorrelation processing may include a main channel signal and a sub-channel signal.
- the first de-correlation reconstruction method may include: performing de-correlation reconstruction on the two-channel signals after de-correlation processing based on Formula 5; Formula 5 is:
- Mid(n) is the main channel signal in the two-channel signal after decorrelation processing
- Sid(n) is the sub-channel signal in the two-channel signal after decorrelation processing
- L(n) is the left channel signal
- R(n) is the right channel signal.
- the first cross-correlation coefficient of the left channel signal and the right channel signal of the current frame of the stereo audio signal will be determined first, and, in response to the first
- the first decorrelation processing method is used to perform decorrelation processing on the current frame of the stereo audio signal to obtain the two-channel signal after decorrelation processing, and then the first decorrelation processing of the two-channel signal after decorrelation processing is calculated.
- Two cross-correlation coefficients in response to the first cross-correlation coefficient being smaller than the second cross-correlation coefficient, determine the flag bit as the first value, and obtain the encoded code stream based on the two-channel signal after decorrelation processing, and write the flag bit into the coded code stream sent to the decoding device.
- the value range of the first threshold is (-1, 0), thus, in response to the first cross-correlation coefficient being less than the first threshold, it indicates that the current frame of the stereo audio signal is a partial anti-phase signal , at this time, the first de-correlation processing method will be adopted correspondingly for the partial anti-phase signal, so as to ensure the subsequent compression rate, then the embodiment of the present disclosure provides a judgment method and decorrelation for the partial anti-phase signal The processing method greatly improves the encoding compression rate of the partial anti-phase signal.
- Fig. 5 is a schematic flowchart of a stereo audio signal processing method provided by an embodiment of the present disclosure, the method is executed by a decoding device, as shown in Fig. 5, the stereo audio signal processing method may include the following steps:
- Step 501 Obtain an encoded stream sent by an encoding device.
- Step 502 Determine the two-channel signal and the flag bit after decorrelation processing based on the coded code stream.
- Step 503 in response to the flag being the second value, perform decorrelation reconstruction on the two-channel signals after decorrelation processing by using a second decorrelation reconstruction manner, and output the decorrelation reconstructed audio signal.
- the flag bit is the second value (for example, it may be 1)
- the second de-correlation reconstruction mode corresponding to the second de-correlation processing mode may be used to perform de-correlation reconstruction on the two-channel signals after the de-correlation processing, and output the de-correlation reconstructed audio signal.
- the two-channel signal after decorrelation processing may include a main channel signal and a sub-channel signal.
- the second de-correlation reconstruction method may include: performing de-correlation reconstruction on the two-channel signals after the de-correlation processing based on Formula 6;
- Formula 6 is:
- Mid(n) is the main channel signal of the two-channel signal after decorrelation processing
- Sid(n) is the sub-channel signal of the two-channel signal after decorrelation processing
- L(n) is the left channel signal
- R(n) is the right channel signal.
- the first cross-correlation coefficient of the left channel signal and the right channel signal of the current frame of the stereo audio signal will be determined first, and, in response to the first
- the first decorrelation processing method is used to perform decorrelation processing on the current frame of the stereo audio signal to obtain the two-channel signal after decorrelation processing, and then the first decorrelation processing of the two-channel signal after decorrelation processing is calculated.
- Two cross-correlation coefficients in response to the first cross-correlation coefficient being smaller than the second cross-correlation coefficient, determine the flag bit as the first value, and obtain the encoded code stream based on the two-channel signal after decorrelation processing, and write the flag bit into the coded code stream sent to the decoding device.
- the value range of the first threshold is (-1, 0), thus, in response to the first cross-correlation coefficient being less than the first threshold, it indicates that the current frame of the stereo audio signal is a partial anti-phase signal , at this time, the first de-correlation processing method will be adopted correspondingly for the partial anti-phase signal, so as to ensure the subsequent compression rate, then the embodiment of the present disclosure provides a judgment method and decorrelation for the partial anti-phase signal The processing method greatly improves the encoding compression rate of the partial anti-phase signal.
- FIG. 6 is a schematic flowchart of a method for processing a stereo audio signal provided by an embodiment of the present disclosure. The method is executed by a decoding device. As shown in FIG. 6 , the method for processing a stereo audio signal may include the following steps:
- Step 601. Obtain an encoded stream sent by an encoding device.
- Step 602 Determine the two-channel signal and the flag bit after decorrelation processing based on the coded code stream.
- Step 603 In response to the flag bit being the third value, directly determine the two-channel signal after decorrelation processing as the audio signal after decorrelation and reconstruction.
- the flag bit is a third value (for example, it may be 2), it indicates that the encoding device has not performed decorrelation processing. Based on this, the two-channel signal after decorrelation processing can be directly determined as the audio signal after decorrelation and reconstruction.
- the first cross-correlation coefficient of the left channel signal and the right channel signal of the current frame of the stereo audio signal will be determined first, and, in response to the first
- the first decorrelation processing method is used to perform decorrelation processing on the current frame of the stereo audio signal to obtain the two-channel signal after decorrelation processing, and then the first decorrelation processing of the two-channel signal after decorrelation processing is calculated.
- Two cross-correlation coefficients in response to the first cross-correlation coefficient being smaller than the second cross-correlation coefficient, determine the flag bit as the first value, and obtain the encoded code stream based on the two-channel signal after decorrelation processing, and write the flag bit into the coded code stream sent to the decoding device.
- the value range of the first threshold is (-1, 0), thus, in response to the first cross-correlation coefficient being less than the first threshold, it indicates that the current frame of the stereo audio signal is a partial anti-phase signal , at this time, the first de-correlation processing method will be adopted correspondingly for the partial anti-phase signal, so as to ensure the subsequent compression rate, then the embodiment of the present disclosure provides a judgment method and decorrelation for the partial anti-phase signal The processing method greatly improves the encoding compression rate of the partial anti-phase signal.
- FIG. 7 is a schematic structural diagram of a stereo audio signal processing device provided by an embodiment of the present disclosure, which is applied to the encoding end. As shown in FIG. 7 , the device 700 may include:
- Determining module 701 for determining the first cross-correlation coefficient of the left channel signal and the right channel signal of the current frame of the stereo audio signal
- the processing module 702 is configured to, in response to the first cross-correlation coefficient being smaller than the first threshold, perform decorrelation processing on the current frame of the stereo audio signal in a first decorrelation processing manner to obtain two-channel signals after decorrelation processing, and calculate The second cross-correlation coefficient of the two-channel signals after the decorrelation processing, in response to the first cross-correlation coefficient being smaller than the second cross-correlation coefficient, determine that the flag bit is the first value, and, based on the de-correlation processing An encoded code stream is obtained from the two-channel signal, and the flag bit is written into the encoded code stream and sent to the decoding device, and the value range of the first threshold is (-1, 0).
- the first cross-correlation coefficient of the left channel signal and the right channel signal of the current frame of the stereo audio signal will be determined first, and, in response to the first
- the first decorrelation processing method is used to perform decorrelation processing on the current frame of the stereo audio signal to obtain the two-channel signal after decorrelation processing, and then the first decorrelation processing of the two-channel signal after decorrelation processing is calculated.
- Two cross-correlation coefficients in response to the first cross-correlation coefficient being smaller than the second cross-correlation coefficient, determine the flag bit as the first value, and obtain the encoded code stream based on the two-channel signal after decorrelation processing, and write the flag bit into the coded code stream sent to the decoding device.
- the value range of the first threshold is (-1, 0), thus, in response to the first cross-correlation coefficient being less than the first threshold, it indicates that the current frame of the stereo audio signal is a partial anti-phase signal , at this time, the first de-correlation processing method will be adopted correspondingly for the partial anti-phase signal, so as to ensure the subsequent compression rate, then the embodiment of the present disclosure provides a judgment method and decorrelation for the partial anti-phase signal The processing method greatly improves the encoding compression rate of the partial anti-phase signal.
- the first decorrelation processing manner includes first sum difference downmix processing.
- the first sum and difference downmix processing includes:
- Mid(n) is the main channel signal
- Sid(n) is the secondary channel signal
- L(n) is the left channel signal
- R(n) is the right channel signal.
- the device is also used for:
- the second decorrelation processing manner includes second sum and difference downmix processing.
- the second sum and difference downmix processing includes:
- the left channel signal and the right channel signal are processed based on Formula 2 to obtain the main channel signal;
- the Formula 2 is:
- Mid(n) is the main channel signal
- Sid(n) is the secondary channel signal
- L(n) is the left channel signal
- R(n) is the right channel signal.
- the device is also used for:
- Two cross-correlation coefficients or the first cross-correlation coefficient is greater than the second threshold and the first cross-correlation coefficient is less than or equal to the third cross-correlation coefficient, directly determine the current frame of the stereo audio signal as after decorrelation processing For the two-channel signal, determine that the flag bit is a third value, and obtain an encoded code stream based on the two-channel signal after decorrelation processing, and write the flag bit into the encoded code stream.
- the determination module is further configured to:
- ⁇ (LR) is the cross-correlation coefficient of current frame left channel signal and right channel signal
- L (n) is the nth sampling point of current frame left channel signal, is the average value of all samples of the left channel signal of the current frame
- R(n) is the nth sample point of the right channel signal of the current frame, is the average value of all samples of the right channel signal of the current frame
- N is the total number of samples of the left channel signal or the right channel signal of the current frame, that is, the frame length of the current frame.
- the two-channel signal after decorrelation processing includes a main channel signal and a secondary channel signal
- the device is also used for:
- n (MS) is the second cross-correlation coefficient or the third cross-correlation coefficient
- Mid (n) is the nth sampling point of the main channel signal in the two-channel signal after decorrelation processing, is the average value of all sample points of the main channel signal in the two-channel signal after decorrelation processing
- Sid(n) is the nth sample point of the sub-channel signal in the two-channel signal after decorrelation processing, is the average value of all samples of the sub-channel signal in the two-channel signal after decorrelation processing
- N is the total number of samples of the left channel signal or the right channel signal of the current frame, that is, the frame length of the current frame.
- the first cross-correlation coefficient of the left channel signal and the right channel signal of the current frame of the stereo audio signal will be determined first, and, in response to the first
- the first decorrelation processing method is used to perform decorrelation processing on the current frame of the stereo audio signal to obtain the two-channel signal after decorrelation processing, and then the first decorrelation processing of the two-channel signal after decorrelation processing is calculated.
- Two cross-correlation coefficients in response to the first cross-correlation coefficient being smaller than the second cross-correlation coefficient, determine the flag bit as the first value, and obtain the encoded code stream based on the two-channel signal after decorrelation processing, and write the flag bit into the coded code stream sent to the decoding device.
- the value range of the first threshold is (-1, 0), thus, in response to the first cross-correlation coefficient being less than the first threshold, it indicates that the current frame of the stereo audio signal is a partial anti-phase signal , at this time, the first de-correlation processing method will be adopted correspondingly for the partial anti-phase signal, so as to ensure the subsequent compression rate, then the embodiment of the present disclosure provides a judgment method and decorrelation for the partial anti-phase signal The processing method greatly improves the encoding compression rate of the partial anti-phase signal.
- Fig. 8 is a schematic structural diagram of a stereo audio signal processing device provided by an embodiment of the present disclosure, which is applied to the decoding end. As shown in Fig. 8, the device 800 may include:
- An acquisition module 801 configured to acquire an encoded stream sent by an encoding device
- a determination module 802 configured to determine the two-channel signal and the flag bit after decorrelation processing based on the encoded code stream
- the processing module 803 is configured to, in response to the flag bit being a first value, perform decorrelation reconstruction on the two-channel signal after decorrelation processing by using a first decorrelation reconstruction manner, and output a decorrelation reconstructed audio signal.
- the first decorrelation reconstruction manner includes:
- Mid(n) is the main channel signal in the two-channel signal after decorrelation processing
- Sid(n) is the sub-channel signal in the two-channel signal after decorrelation processing
- L(n) is the left channel signal
- R(n) is the right channel signal.
- the device is also used for:
- decorrelation reconstruction is performed on the two-channel signals after decorrelation processing by using a second decorrelation reconstruction manner, and a decorrelation reconstructed audio signal is output.
- the second decorrelation reconstruction manner includes:
- Mid(n) is the main channel signal of the two-channel signal after decorrelation processing
- Sid(n) is the sub-channel signal of the two-channel signal after decorrelation processing
- L(n) is the left channel signal
- R(n) is the right channel signal.
- the device is also used for:
- Fig. 9 is a block diagram of a user equipment UE900 provided by an embodiment of the present disclosure.
- the UE 900 may be a mobile phone, a computer, a digital broadcast terminal device, a messaging device, a game console, a tablet device, a medical device, a fitness device, a personal digital assistant, and the like.
- UE900 may include at least one of the following components: a processing component 902, a memory 904, a power supply component 906, a multimedia component 908, an audio component 910, an input/output (I/O) interface 912, a sensor component 913, and a communication component 916.
- a processing component 902 a memory 904
- a power supply component 906 a multimedia component 908, an audio component 910, an input/output (I/O) interface 912, a sensor component 913, and a communication component 916.
- I/O input/output
- the processing component 902 generally controls the overall operations of the UE 900, such as those associated with display, phone calls, data communications, camera operations, and recording operations.
- the processing component 902 may include at least one processor 920 to execute instructions to complete all or part of the steps of the above-mentioned method.
- processing component 902 can include at least one module to facilitate interaction between processing component 902 and other components.
- processing component 902 may include a multimedia module to facilitate interaction between multimedia component 908 and processing component 902 .
- the memory 904 is configured to store various types of data to support operations at the UE 900 . Examples of such data include instructions for any application or method operating on UE900, contact data, phonebook data, messages, pictures, videos, etc.
- the memory 904 can be implemented by any type of volatile or non-volatile storage device or their combination, such as static random access memory (SRAM), electrically erasable programmable read-only memory (EEPROM), erasable Programmable Read Only Memory (EPROM), Programmable Read Only Memory (PROM), Read Only Memory (ROM), Magnetic Memory, Flash Memory, Magnetic or Optical Disk.
- SRAM static random access memory
- EEPROM electrically erasable programmable read-only memory
- EPROM erasable Programmable Read Only Memory
- PROM Programmable Read Only Memory
- ROM Read Only Memory
- Magnetic Memory Flash Memory
- Magnetic or Optical Disk Magnetic Disk
- the power supply component 906 provides power to various components of the UE 900 .
- Power component 906 may include a power management system, at least one power supply, and other components associated with generating, managing, and distributing power for UE 900 .
- the multimedia component 908 includes a screen providing an output interface between the UE 900 and the user.
- the screen may include a liquid crystal display (LCD) and a touch panel (TP). If the screen includes a touch panel, the screen may be implemented as a touch screen to receive input signals from a user.
- the touch panel includes at least one touch sensor to sense touch, slide, and gestures on the touch panel. The touch sensor may not only sense a boundary of a touch or slide action, but also detect a wake-up time and pressure related to the touch or slide operation.
- the multimedia component 908 includes a front camera and/or a rear camera. When the UE900 is in an operation mode, such as a shooting mode or a video mode, the front camera and/or the rear camera can receive external multimedia data. Each front camera and rear camera can be a fixed optical lens system or have focal length and optical zoom capability.
- the audio component 910 is configured to output and/or input audio signals.
- the audio component 910 includes a microphone (MIC), which is configured to receive an external audio signal when the UE 900 is in an operation mode, such as a call mode, a recording mode, and a voice recognition mode. Received audio signals may be further stored in memory 904 or sent via communication component 916 .
- the audio component 910 also includes a speaker for outputting audio signals.
- the I/O interface 912 provides an interface between the processing component 902 and a peripheral interface module.
- the peripheral interface module may be a keyboard, a click wheel, a button, and the like. These buttons may include, but are not limited to: a home button, volume buttons, start button, and lock button.
- the sensor component 913 includes at least one sensor for providing various aspects of state assessment for the UE 900 .
- the sensor component 913 can detect the open/closed state of the device 900, the relative positioning of components, such as the display and the keypad of the UE900, the sensor component 913 can also detect the position change of the UE900 or a component of the UE900, and the user and Presence or absence of UE900 contact, UE900 orientation or acceleration/deceleration and temperature change of UE900.
- the sensor assembly 913 may include a proximity sensor configured to detect the presence of nearby objects in the absence of any physical contact.
- the sensor assembly 913 may also include an optical sensor, such as a CMOS or CCD image sensor, for use in imaging applications.
- the sensor component 913 may also include an acceleration sensor, a gyroscope sensor, a magnetic sensor, a pressure sensor or a temperature sensor.
- Communication component 916 is configured to facilitate wired or wireless communications between UE 900 and other devices.
- UE900 can access wireless networks based on communication standards, such as WiFi, 2G or 3G, or a combination thereof.
- the communication component 916 receives broadcast signals or broadcast related information from an external broadcast management system via a broadcast channel.
- the communication component 916 also includes a near field communication (NFC) module to facilitate short-range communication.
- NFC near field communication
- the NFC module may be implemented based on Radio Frequency Identification (RFID) technology, Infrared Data Association (IrDA) technology, Ultra Wideband (UWB) technology, Bluetooth (BT) technology and other technologies.
- RFID Radio Frequency Identification
- IrDA Infrared Data Association
- UWB Ultra Wideband
- Bluetooth Bluetooth
- UE 900 may be powered by at least one Application Specific Integrated Circuit (ASIC), Digital Signal Processor (DSP), Digital Signal Processing Device (DSPD), Programmable Logic Device (PLD), Field Programmable Gate Array ( FPGA), controller, microcontroller, microprocessor or other electronic components for implementing the above method.
- ASIC Application Specific Integrated Circuit
- DSP Digital Signal Processor
- DSPD Digital Signal Processing Device
- PLD Programmable Logic Device
- FPGA Field Programmable Gate Array
- controller microcontroller, microprocessor or other electronic components for implementing the above method.
- Fig. 10 is a block diagram of a network side device 1000 provided by an embodiment of the present disclosure.
- the network side device 1000 may be provided as a network side device.
- the network side device 1000 includes a processing component 1011, which further includes at least one processor, and a memory resource represented by a memory 1032 for storing instructions executable by the processing component 1022, such as an application program.
- the application program stored in memory 1032 may include one or more modules each corresponding to a set of instructions.
- the processing component 1010 is configured to execute instructions, so as to execute any of the aforementioned methods applied to the network side device, for example, the method shown in FIG. 1 .
- the network side device 1000 may also include a power supply component 1026 configured to perform power management of the network side device 1000, a wired or wireless network interface 1050 configured to connect the network side device 1000 to the network, and an input and output (I/O ) interface 1058.
- the network side device 1000 can operate based on the operating system stored in the memory 1032, such as Windows ServerTM, Mac OS XTM, UnixTM, LinuxTM, Free BSDTM or similar.
- the methods provided in the embodiments of the present disclosure are introduced from the perspectives of the network side device and the UE respectively.
- the network side device and the UE may include a hardware structure and a software module, and implement the above functions in the form of a hardware structure, a software module, or a hardware structure plus a software module.
- a certain function among the above-mentioned functions may be implemented in the form of a hardware structure, a software module, or a hardware structure plus a software module.
- the methods provided in the embodiments of the present disclosure are introduced from the perspectives of the network side device and the UE respectively.
- the network side device and the UE may include a hardware structure and a software module, and implement the above functions in the form of a hardware structure, a software module, or a hardware structure plus a software module.
- a certain function among the above-mentioned functions may be implemented in the form of a hardware structure, a software module, or a hardware structure plus a software module.
- the communication device may include a transceiver module and a processing module.
- the transceiver module may include a sending module and/or a receiving module, the sending module is used to realize the sending function, the receiving module is used to realize the receiving function, and the sending and receiving module can realize the sending function and/or the receiving function.
- the communication device may be a terminal device (such as the terminal device in the foregoing method embodiments), may also be a device in the terminal device, and may also be a device that can be matched and used with the terminal device.
- the communication device may be a network device, or a device in the network device, or a device that can be matched with the network device.
- the communication device may be a network device, or a terminal device (such as the terminal device in the foregoing method embodiments), or a chip, a chip system, or a processor that supports the network device to implement the above method, or it may be a terminal device that supports A chip, a chip system, or a processor for realizing the above method.
- the device can be used to implement the methods described in the above method embodiments, and for details, refer to the descriptions in the above method embodiments.
- a communications device may include one or more processors.
- the processor may be a general purpose processor or a special purpose processor or the like.
- it can be a baseband processor or a central processing unit.
- the baseband processor can be used to process communication protocols and communication data
- the central processor can be used to control communication devices (such as network side equipment, baseband chips, terminal equipment, terminal equipment chips, DU or CU, etc.)
- a computer program that processes data for a computer program.
- the communication device may further include one or more memories, on which computer programs may be stored, and the processor executes the computer programs, so that the communication device executes the methods described in the foregoing method embodiments.
- data may also be stored in the memory.
- the communication device and the memory can be set separately or integrated together.
- the communication device may further include a transceiver and an antenna.
- the transceiver may be referred to as a transceiver unit, a transceiver, or a transceiver circuit, etc., and is used to implement a transceiver function.
- the transceiver may include a receiver and a transmitter, and the receiver may be called a receiver or a receiving circuit for realizing a receiving function; the transmitter may be called a transmitter or a sending circuit for realizing a sending function.
- the communication device may further include one or more interface circuits.
- the interface circuit is used to receive code instructions and transmit them to the processor.
- the processor executes the code instructions to enable the communication device to execute the methods described in the foregoing method embodiments.
- the communication device is a terminal device (such as the terminal device in the foregoing method embodiments): the processor is configured to execute any of the methods shown in FIG. 1-FIG. 4a.
- the communication device is a network device: the transceiver is used to execute the method shown in any one of Fig. 5-Fig. 7 .
- the processor may include a transceiver for implementing receiving and transmitting functions.
- the transceiver may be a transceiver circuit, or an interface, or an interface circuit.
- the transceiver circuits, interfaces or interface circuits for realizing the functions of receiving and sending can be separated or integrated together.
- the above-mentioned transceiver circuit, interface or interface circuit may be used for reading and writing code/data, or the above-mentioned transceiver circuit, interface or interface circuit may be used for signal transmission or transmission.
- the processor may store a computer program, and the computer program runs on the processor to enable the communication device to execute the methods described in the foregoing method embodiments.
- a computer program may be embedded in a processor, in which case the processor may be implemented by hardware.
- the communication device may include a circuit, and the circuit may implement the function of sending or receiving or communicating in the foregoing method embodiments.
- the processors and transceivers described in this disclosure can be implemented on integrated circuits (integrated circuits, ICs), analog ICs, radio frequency integrated circuits (RFICs), mixed signal ICs, application specific integrated circuits (ASICs), printed circuit boards ( printed circuit board, PCB), electronic equipment, etc.
- the processor and transceiver can also be fabricated using various IC process technologies such as complementary metal oxide semiconductor (CMOS), nMetal-oxide-semiconductor (NMOS), P-type Metal oxide semiconductor (positive channel metal oxide semiconductor, PMOS), bipolar junction transistor (bipolar junction transistor, BJT), bipolar CMOS (BiCMOS), silicon germanium (SiGe), gallium arsenide (Gas), etc.
- CMOS complementary metal oxide semiconductor
- NMOS nMetal-oxide-semiconductor
- PMOS bipolar junction transistor
- BJT bipolar CMOS
- SiGe silicon germanium
- Gas gallium arsenide
- the communication device described in the above embodiments may be a network device or a terminal device (such as the terminal device in the foregoing method embodiments), but the scope of the communication device described in this disclosure is not limited thereto, and the structure of the communication device may not be limited limits.
- a communication device may be a stand-alone device or may be part of a larger device.
- the communication device may be:
- a set of one or more ICs may also include storage components for storing data and computer programs;
- ASIC such as modem (Modem);
- the communications device may be a chip or system-on-a-chip
- the chip includes a processor and an interface.
- the number of processors may be one or more, and the number of interfaces may be more than one.
- the chip also includes a memory, which is used to store necessary computer programs and data.
- An embodiment of the present disclosure also provides a system for determining the duration of a side link, the system includes a communication device as a terminal device (such as the first terminal device in the method embodiment above) in the foregoing embodiments and a communication device as a network device, Alternatively, the system includes the communication device as the terminal device in the foregoing embodiments (such as the first terminal device in the foregoing method embodiment) and the communication device as a network device.
- the present disclosure also provides a readable storage medium on which instructions are stored, and when the instructions are executed by a computer, the functions of any one of the above method embodiments are realized.
- the present disclosure also provides a computer program product, which implements the functions of any one of the above method embodiments when executed by a computer.
- all or part of them may be implemented by software, hardware, firmware or any combination thereof.
- software When implemented using software, it may be implemented in whole or in part in the form of a computer program product.
- the computer program product comprises one or more computer programs. When the computer program is loaded and executed on the computer, all or part of the processes or functions according to the embodiments of the present disclosure will be generated.
- the computer can be a general purpose computer, a special purpose computer, a computer network, or other programmable devices.
- the computer program can be stored in or transmitted from one computer-readable storage medium to another computer-readable storage medium, for example, the computer program can be downloaded from a website, computer, server or data center Transmission to another website site, computer, server or data center by wired (such as coaxial cable, optical fiber, digital subscriber line (DSL)) or wireless (such as infrared, wireless, microwave, etc.).
- the computer-readable storage medium may be any available medium that can be accessed by a computer, or a data storage device such as a server or a data center integrated with one or more available media.
- the available medium may be a magnetic medium (for example, a floppy disk, a hard disk, a magnetic tape), an optical medium (for example, a high-density digital video disc (digital video disc, DVD)), or a semiconductor medium (for example, a solid state disk (solid state disk, SSD)) etc.
- a magnetic medium for example, a floppy disk, a hard disk, a magnetic tape
- an optical medium for example, a high-density digital video disc (digital video disc, DVD)
- a semiconductor medium for example, a solid state disk (solid state disk, SSD)
- At least one in the present disclosure can also be described as one or more, and a plurality can be two, three, four or more, and the present disclosure is not limited.
- the technical feature is distinguished by "first”, “second”, “third”, “A”, “B”, “C” and “D”, etc.
- the technical features described in the “first”, “second”, “third”, “A”, “B”, “C” and “D” have no sequence or order of magnitude among the technical features described.
Landscapes
- Engineering & Computer Science (AREA)
- Physics & Mathematics (AREA)
- Multimedia (AREA)
- Health & Medical Sciences (AREA)
- Audiology, Speech & Language Pathology (AREA)
- Human Computer Interaction (AREA)
- Signal Processing (AREA)
- Acoustics & Sound (AREA)
- Computational Linguistics (AREA)
- Mathematical Physics (AREA)
- Theoretical Computer Science (AREA)
- General Physics & Mathematics (AREA)
- General Engineering & Computer Science (AREA)
- General Health & Medical Sciences (AREA)
- Stereophonic System (AREA)
- Compression, Expansion, Code Conversion, And Decoders (AREA)
Abstract
Description
Claims (22)
- 一种立体声音频信号处理方法,其特征在于,应用于编码设备,包括:确定立体声音频信号当前帧左声道信号和右声道信号的第一互相关系数;响应于所述第一互相关系数小于第一阈值,采用第一去相关处理方式对所述立体声音频信号当前帧进行去相关处理获得去相关处理后两声道信号,计算所述去相关处理后两声道信号的第二互相关系数,响应于所述第一互相关系数小于所述第二互相关系数,确定标志位为第一值,以及,基于所述去相关处理后两声道信号得到编码码流并将所述标志位写入所述编码码流,所述第一阈值的取值范围为(-1,0)。
- 如权利要求1所述的方法,其特征在于,所述第一去相关处理方式包括第一和差下混处理。
- 如权利要求1所述的方法,其特征在于,所述方法还包括:响应于所述第一互相关系数大于第二阈值,采用第二去相关处理方式对所述立体声音频信号当前帧进行去相关处理获得去相关处理后两声道信号,计算所述去相关处理后两声道信号的第三互相关系数,响应于所述第一互相关系数大于所述第三互相关系数,确定标志位为第二值,以及,基于所述去相关处理后两声道信号得到编码码流并将所述标志位写入所述编码码流,所述第二阈值的取值范围(0,1)。
- 如权利要求4所述的方法,其特征在于,所述第二去相关处理方式包括第二和差下混处理。
- 如权利要求4所述的方法,其特征在于,所述方法还包括:响应于所述第一互相关系数大于等于所述第一阈值且小于等于所述第二阈值,或者所述第一互相关系数小于所述第一阈值且所述第一互相关系数大于等于所述第二互相关系数,或者所述第一互相关系数大于所述第二阈值且所述第一互相关系数小于等于所述第三互相关系数,直接将所述立体声音频信号当前帧确定为去相关处理后两声道信号,确定标志位为第三值,以及,基于所述去相关处理后两声道信号得到编码码流并将所述标志位写入所述编码码流。
- 一种立体声音频信号处理方法,其特征在于,应用于解码设备,包括:获取编码设备发送的编码码流;基于所述编码码流确定去相关处理后两声道信号和标志位;响应于所述标志位为第一值,采用第一去相关重建方式对所述去相关处理后两声道信号进行去相关重建,并输出去相关重建后的音频信号。
- 如权利要求10所述的方法,其特征在于,所述方法还包括:响应于所述标志位为第二值,采用第二去相关重建方式对所述去相关处理后两声道信号进行去相关重建,并输出去相关重建后的音频信号。
- 如权利要求10所述的方法,其特征在于,所述方法还包括:响应于所述标志位为第三值,直接将所述去相关处理后两声道信号确定为去相关重建后的音频信号。
- 一种立体声音频信号处理装置,其特征在于,包括:确定模块,用于确定立体声音频信号当前帧左声道信号和右声道信号的第一互相关系数;处理模块,用于响应于所述第一互相关系数小于第一阈值,采用第一去相关处理方式对所述立体声音频信号当前帧进行去相关处理获得去相关处理后两声道信号,计算所述去相关处理后两声道信号的第二互相关系数,响应于所述第一互相关系数小于所述第二互相关系数,确定标志位为第一值,以及,基于所述去相关处理后两声道信号得到编码码流并将所述标志位写入所述编码码流,所述第一阈值的取值范围为(-1,0)。
- 一种立体声音频信号处理装置,其特征在于,包括:获取模块,用于获取编码设备发送的编码码流;确定模块,用于基于所述编码码流确定去相关处理后两声道信号和标志位;处理模块,用于响应于所述标志位为第一值,采用第一去相关重建方式对所述去相关处理后两声道信号进行去相关重建,并输出去相关重建后的音频信号。
- 一种通信装置,其特征在于,所述装置包括处理器和存储器,所述存储器中存储有计算机程序,所述处理器执行所述存储器中存储的计算机程序,以使所述装置执行如权利要求1至9中任一项所述的方法。
- 一种通信装置,其特征在于,所述装置包括处理器和存储器,所述存储器中存储有计算机程序,所述处理器执行所述存储器中存储的计算机程序,以使所述装置执行如权利要求10至14中任一项所述的方法。
- 一种通信装置,其特征在于,包括:处理器和接口电路;所述接口电路,用于接收代码指令并传输至所述处理器;所述处理器,用于运行所述代码指令以执行如权利要求1至9中任一项所述的方法。
- 一种通信装置,其特征在于,包括:处理器和接口电路;所述接口电路,用于接收代码指令并传输至所述处理器;所述处理器,用于运行所述代码指令以执行如权利要求10至14任一所述的方法。
- 一种计算机可读存储介质,用于存储有指令,响应于所述指令被执行时,使如权利要求1至9中任一项所述的方法被实现。
- 一种计算机可读存储介质,用于存储有指令,响应于所述指令被执行时,使如权利要求10至14中任一项所述的方法被实现。
Priority Applications (4)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| EP21965238.5A EP4440151A4 (en) | 2021-11-26 | 2021-11-26 | METHOD AND DEVICE FOR STEREO AUDIO SIGNAL PROCESSING, ENCODING DEVICE, DECODING DEVICE AND STORAGE MEDIUM |
| US18/713,343 US20250022474A1 (en) | 2021-11-26 | 2021-11-26 | Stereo audio signal processing method, communication apparatus, and storage medium |
| PCT/CN2021/133722 WO2023092505A1 (zh) | 2021-11-26 | 2021-11-26 | 一种立体声音频信号处理方法、装置、编码设备、解码设备及存储介质 |
| CN202180004116.0A CN114258568B (zh) | 2021-11-26 | 2021-11-26 | 一种立体声音频信号处理方法、装置、编码设备、解码设备及存储介质 |
Applications Claiming Priority (1)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| PCT/CN2021/133722 WO2023092505A1 (zh) | 2021-11-26 | 2021-11-26 | 一种立体声音频信号处理方法、装置、编码设备、解码设备及存储介质 |
Publications (1)
| Publication Number | Publication Date |
|---|---|
| WO2023092505A1 true WO2023092505A1 (zh) | 2023-06-01 |
Family
ID=80796643
Family Applications (1)
| Application Number | Title | Priority Date | Filing Date |
|---|---|---|---|
| PCT/CN2021/133722 Ceased WO2023092505A1 (zh) | 2021-11-26 | 2021-11-26 | 一种立体声音频信号处理方法、装置、编码设备、解码设备及存储介质 |
Country Status (4)
| Country | Link |
|---|---|
| US (1) | US20250022474A1 (zh) |
| EP (1) | EP4440151A4 (zh) |
| CN (1) | CN114258568B (zh) |
| WO (1) | WO2023092505A1 (zh) |
Families Citing this family (1)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| CN116434760A (zh) * | 2023-04-14 | 2023-07-14 | 北京小米移动软件有限公司 | 一种音频编码方法、装置、电子设备及存储介质 |
Citations (3)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| US20080226085A1 (en) * | 2007-03-12 | 2008-09-18 | Noriyuki Takashima | Audio Apparatus |
| CN102368385A (zh) * | 2011-09-07 | 2012-03-07 | 中科开元信息技术(北京)有限公司 | 后向块自适应Golomb-Rice编解码方法及装置 |
| CN110495105A (zh) * | 2017-04-12 | 2019-11-22 | 华为技术有限公司 | 多声道信号的编解码方法和编解码器 |
Family Cites Families (10)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| CN100588288C (zh) * | 2005-12-09 | 2010-02-03 | 华南理工大学 | 双通路立体声信号模拟5.1通路环绕声的信号处理方法 |
| US20090164223A1 (en) * | 2007-12-19 | 2009-06-25 | Dts, Inc. | Lossless multi-channel audio codec |
| CN101946526B (zh) * | 2008-02-14 | 2013-01-02 | 杜比实验室特许公司 | 声音再现方法和系统以及立体声扩展方法 |
| CN101609677B (zh) * | 2009-03-13 | 2012-01-04 | 华为技术有限公司 | 一种预处理方法、装置及编码设备 |
| CN101944362B (zh) * | 2010-09-14 | 2012-05-30 | 北京大学 | 一种基于整形小波变换的音频无损压缩编码、解码方法 |
| EP2830053A1 (en) * | 2013-07-22 | 2015-01-28 | Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. | Multi-channel audio decoder, multi-channel audio encoder, methods and computer program using a residual-signal-based adjustment of a contribution of a decorrelated signal |
| TWI587286B (zh) * | 2014-10-31 | 2017-06-11 | 杜比國際公司 | 音頻訊號之解碼和編碼的方法及系統、電腦程式產品、與電腦可讀取媒體 |
| US10319385B2 (en) * | 2015-09-25 | 2019-06-11 | Voiceage Corporation | Method and system for encoding left and right channels of a stereo sound signal selecting between two and four sub-frames models depending on the bit budget |
| CN109300480B (zh) * | 2017-07-25 | 2020-10-16 | 华为技术有限公司 | 立体声信号的编解码方法和编解码装置 |
| CN109389984B (zh) * | 2017-08-10 | 2021-09-14 | 华为技术有限公司 | 时域立体声编解码方法和相关产品 |
-
2021
- 2021-11-26 EP EP21965238.5A patent/EP4440151A4/en active Pending
- 2021-11-26 CN CN202180004116.0A patent/CN114258568B/zh active Active
- 2021-11-26 WO PCT/CN2021/133722 patent/WO2023092505A1/zh not_active Ceased
- 2021-11-26 US US18/713,343 patent/US20250022474A1/en active Pending
Patent Citations (3)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| US20080226085A1 (en) * | 2007-03-12 | 2008-09-18 | Noriyuki Takashima | Audio Apparatus |
| CN102368385A (zh) * | 2011-09-07 | 2012-03-07 | 中科开元信息技术(北京)有限公司 | 后向块自适应Golomb-Rice编解码方法及装置 |
| CN110495105A (zh) * | 2017-04-12 | 2019-11-22 | 华为技术有限公司 | 多声道信号的编解码方法和编解码器 |
Non-Patent Citations (1)
| Title |
|---|
| See also references of EP4440151A4 * |
Also Published As
| Publication number | Publication date |
|---|---|
| CN114258568B (zh) | 2024-12-03 |
| EP4440151A4 (en) | 2024-11-27 |
| CN114258568A (zh) | 2022-03-29 |
| EP4440151A1 (en) | 2024-10-02 |
| US20250022474A1 (en) | 2025-01-16 |
Similar Documents
| Publication | Publication Date | Title |
|---|---|---|
| RU2377735C2 (ru) | Способ, устройство и система для эффективного кодирования и декодирования видеоданных | |
| JP7745100B2 (ja) | 信号の符号化および復号化方法、装置、ユーザイクイップメント、ネットワーク側デバイス並びに記憶媒体 | |
| WO2023051368A1 (zh) | 编解码方法、装置、设备、存储介质及计算机程序产品 | |
| CN114127844B (zh) | 一种信号编解码方法、装置、编码设备、解码设备及存储介质 | |
| WO2018141164A1 (zh) | 下行控制信息的传输方法及装置 | |
| CN116368460A (zh) | 音频处理方法、装置 | |
| WO2023092505A1 (zh) | 一种立体声音频信号处理方法、装置、编码设备、解码设备及存储介质 | |
| CN113225644B (zh) | 低延迟蓝牙音频系统及其音频源端和音频接收端 | |
| WO2021213128A1 (zh) | 音频信号编码方法和装置 | |
| KR100931871B1 (ko) | 비디오 데이터의 효과적인 fgs 부호화 및 복호화를 위한방법, 장치, 시스템 | |
| CN114365509B (zh) | 一种立体声音频信号处理方法及设备/存储介质/装置 | |
| WO2022126342A1 (zh) | 传输块大小的确定方法、装置及通信设备 | |
| RU2841476C2 (ru) | Способ и устройство для кодирования и декодирования сигналов, а также пользовательское оборудование, сетевое устройство и носитель информации | |
| CN114125790A (zh) | 蓝牙通信方法、装置及介质 | |
| WO2023082194A1 (zh) | 一种波束处理方法、装置、用户设备、ris阵列、基站及存储介质 | |
| WO2023092602A1 (zh) | 一种预编码方法、装置、用户设备、ris阵列、基站及存储介质 | |
| CN116406525A (zh) | 一种信息更新方法、装置、用户设备、基站及存储介质 | |
| CN115696279B (zh) | 信息处理方法、装置、电子设备及存储介质 | |
| CN113810721B (zh) | 视频流误码掩盖方法、装置、终端设备和可读存储介质 | |
| CN117203985A (zh) | 音频回放方法/装置/设备及存储介质 | |
| EP4543025A1 (en) | Audio signal format determination method and apparatus | |
| CN107645733A (zh) | 通信方法、装置及存储介质 | |
| WO2023130283A1 (zh) | 一种映射方式确定方法/装置/设备及存储介质 | |
| WO2023193276A1 (zh) | 一种上报方法/装置/设备及存储介质 |
Legal Events
| Date | Code | Title | Description |
|---|---|---|---|
| 121 | Ep: the epo has been informed by wipo that ep was designated in this application |
Ref document number: 21965238 Country of ref document: EP Kind code of ref document: A1 |
|
| WWE | Wipo information: entry into national phase |
Ref document number: 18713343 Country of ref document: US |
|
| WWE | Wipo information: entry into national phase |
Ref document number: 202417047919 Country of ref document: IN |
|
| WWE | Wipo information: entry into national phase |
Ref document number: 2021965238 Country of ref document: EP |
|
| NENP | Non-entry into the national phase |
Ref country code: DE |
|
| ENP | Entry into the national phase |
Ref document number: 2021965238 Country of ref document: EP Effective date: 20240626 |


















