WO2015165264A1 - 处理信号的方法及设备 - Google Patents

处理信号的方法及设备 Download PDF

Info

Publication number
WO2015165264A1
WO2015165264A1 PCT/CN2014/092695 CN2014092695W WO2015165264A1 WO 2015165264 A1 WO2015165264 A1 WO 2015165264A1 CN 2014092695 W CN2014092695 W CN 2014092695W WO 2015165264 A1 WO2015165264 A1 WO 2015165264A1
Authority
WO
WIPO (PCT)
Prior art keywords
subbands
sub
bands
energy
original envelope
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Ceased
Application number
PCT/CN2014/092695
Other languages
English (en)
French (fr)
Inventor
王宾
苗磊
刘泽新
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Huawei Technologies Co Ltd
Original Assignee
Huawei Technologies Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Priority to BR112016015855A priority Critical patent/BR112016015855B1/pt
Priority to EP19181139.7A priority patent/EP3637417B1/en
Priority to KR1020197013008A priority patent/KR102052487B1/ko
Priority to EP14891093.8A priority patent/EP3079150B1/en
Priority to UAA201608914A priority patent/UA117395C2/uk
Priority to KR1020187023421A priority patent/KR101977383B1/ko
Priority to KR1020187004298A priority patent/KR101890346B1/ko
Priority to SG11201605353YA priority patent/SG11201605353YA/en
Priority to MX2018009564A priority patent/MX389400B/es
Priority to JP2016556272A priority patent/JP6231699B2/ja
Priority to EP23213863.6A priority patent/EP4372738A3/en
Priority to KR1020167019856A priority patent/KR101830855B1/ko
Priority to AU2014392351A priority patent/AU2014392351B2/en
Priority to ES14891093T priority patent/ES2755152T3/es
Priority to MX2016010690A priority patent/MX358255B/es
Priority to CA2935084A priority patent/CA2935084C/en
Priority to NZ721875A priority patent/NZ721875A/en
Application filed by Huawei Technologies Co Ltd filed Critical Huawei Technologies Co Ltd
Priority to RU2016134014A priority patent/RU2656812C2/ru
Publication of WO2015165264A1 publication Critical patent/WO2015165264A1/zh
Priority to IL246427A priority patent/IL246427B/en
Priority to ZA2016/04465A priority patent/ZA201604465B/en
Priority to US15/335,449 priority patent/US9837088B2/en
Anticipated expiration legal-status Critical
Priority to US15/686,175 priority patent/US10186271B2/en
Priority to AU2017272204A priority patent/AU2017272204B2/en
Priority to US16/205,246 priority patent/US10347264B2/en
Priority to US16/409,290 priority patent/US10546591B2/en
Priority to US16/725,678 priority patent/US11081121B2/en
Priority to US17/376,813 priority patent/US11580996B2/en
Priority to US18/157,518 priority patent/US11881226B2/en
Priority to US18/538,381 priority patent/US12249339B2/en
Priority to US19/027,147 priority patent/US20250239266A1/en
Ceased legal-status Critical Current

Links

Images

Classifications

    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L19/00Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
    • G10L19/002Dynamic bit allocation
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L19/00Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
    • G10L19/005Correction of errors induced by the transmission channel, if related to the coding algorithm
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L19/00Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
    • G10L19/008Multichannel audio signal coding or decoding using interchannel correlation to reduce redundancy, e.g. joint-stereo, intensity-coding or matrixing
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L19/00Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
    • G10L19/02Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using spectral analysis, e.g. transform vocoders or subband vocoders
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L19/00Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
    • G10L19/02Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using spectral analysis, e.g. transform vocoders or subband vocoders
    • G10L19/0204Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using spectral analysis, e.g. transform vocoders or subband vocoders using subband decomposition
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L19/00Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
    • G10L19/04Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using predictive techniques
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L19/00Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
    • G10L19/04Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using predictive techniques
    • G10L19/08Determination or coding of the excitation function; Determination or coding of the long-term prediction parameters
    • G10L19/12Determination or coding of the excitation function; Determination or coding of the long-term prediction parameters the excitation function being a code excitation, e.g. in code excited linear prediction [CELP] vocoders
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L19/00Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
    • G10L19/04Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using predictive techniques
    • G10L19/16Vocoder architecture
    • G10L19/18Vocoders using multiple modes
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L19/00Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
    • G10L19/04Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using predictive techniques
    • G10L19/16Vocoder architecture
    • G10L19/18Vocoders using multiple modes
    • G10L19/20Vocoders using multiple modes using sound class specific coding, hybrid encoders or object based coding
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L19/00Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
    • G10L19/04Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using predictive techniques
    • G10L19/16Vocoder architecture
    • G10L19/18Vocoders using multiple modes
    • G10L19/22Mode decision, i.e. based on audio signal content versus external parameters
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L19/00Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
    • G10L19/04Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using predictive techniques
    • G10L19/16Vocoder architecture
    • G10L19/18Vocoders using multiple modes
    • G10L19/24Variable rate codecs, e.g. for generating different qualities using a scalable representation such as hierarchical encoding or layered encoding
    • HELECTRICITY
    • H03ELECTRONIC CIRCUITRY
    • H03MCODING; DECODING; CODE CONVERSION IN GENERAL
    • H03M7/00Conversion of a code where information is represented by a given sequence or number of digits to a code where the same, similar or subset of information is represented by a different sequence or number of digits
    • H03M7/30Compression; Expansion; Suppression of unnecessary data, e.g. redundancy reduction

Definitions

  • the present invention relates to the field of signal processing and, in particular, to a method and apparatus for processing signals.
  • each subband of the signal is usually allocated directly according to the size of the frequency domain envelope, and then each subband is encoded by the allocated number of bits.
  • the subbands of the low frequency band have a great influence on the signal coding quality. Therefore, the subbands of the low frequency band often become the bottleneck of the signal coding performance, and the above bit allocation mode cannot be well adapted.
  • the bit requirements of each sub-band, especially the sub-bands of the low frequency band result in poor coding quality of the signal. Accordingly, the decoding performance of the signal is also poor.
  • Embodiments of the present invention provide a method and a device for processing a signal, which can improve coding and decoding performance of a signal.
  • a method for processing a signal includes: selecting M subbands from N subbands, the N subbands being obtained by dividing a spectral coefficient of a current frame of the signal, the frequency bands of the M subbands a frequency band lower than K sub-bands other than the M sub-bands, N is a positive integer greater than 1, M and K are positive integers, and a sum of M and K is N; Performing a correction operation on the original envelope values of the M subbands, the performance information is used to indicate energy characteristics and spectral characteristics of the M subbands; respectively, for the M subbands Correcting an original envelope value to obtain a modified envelope value of the M subbands; performing the N subbands according to the modified envelope value of the M subbands and the original envelope value of the K subbands One bit allocation.
  • the determining, according to the performance information of the M subbands, performing a correcting operation on the original envelope values of the M subbands including:
  • the first parameter belongs to the first range and the second parameter belongs to the second range, it is determined that a correction operation is performed on the original envelope values of the M sub-bands.
  • the determining, according to the original envelope value of the N subbands, the determining the first parameter including: according to the M subbands The original envelope value determines the total energy of the M sub-bands; determining the total energy of the K sub-bands according to the original envelope values of the K sub-bands; and summing the total energy of the M sub-bands with the K The ratio of the total energy of the sub-bands is determined as the first parameter.
  • the determining the second parameter according to the original envelope value of the M subbands includes: determining, according to an original envelope value of the M subbands, total energy of the M subbands and energy of a first subband, wherein the energy of the first subband is the largest among the M subbands; The ratio of the energy of the first sub-band to the total energy of the M sub-bands is determined as the second parameter.
  • the original envelope values of the M subbands are respectively modified to obtain a modified envelope value of the M subbands
  • the method includes: determining, according to an original envelope value of the M subbands, total energy of the M subbands and energy of the first subband, wherein the energy of the first subband is the largest among the M subbands; Determining a correction factor according to the total energy of the M sub-bands and the energy of the first sub-band; and correcting, by using the correction factor, the original envelope values of the M sub-bands respectively to obtain the M sub-sub Corrected envelope value with band.
  • the modified envelope value of each of the M subbands is greater than the original envelope value of the same subband.
  • the method further includes: determining, according to the number of bits allocated to the N subbands in the primary bit allocation, the N subbands The number of redundant bits per subband, the number of redundant bits of each of the N subbands being smaller than the number of bits required for encoding a single information unit in the same subband; according to each of the N subbands Redundant number of bits, determining a total number of redundant bits; and for the N subbands according to the modified envelope value of the M subbands, the original envelope value of the K subbands, and the total number of redundant bits Perform a second bit allocation.
  • a second aspect provides an apparatus for processing a signal, including: a selecting unit, configured to select M subbands from N subbands, where the N subbands are obtained by dividing a spectral coefficient of a current frame of the signal, The frequency bands of the M sub-bands are lower than the frequency bands of the K sub-bands other than the M sub-bands in the N sub-bands, N is a positive integer greater than 1, and M and K are positive integers, and the sum of M and K is a determining unit, configured to perform a correcting operation on the original envelope values of the M subbands according to the performance information of the M subbands, where the performance information is used to indicate energy characteristics and a spectrum of the M subbands a correction unit, configured to respectively correct original envelope values of the M subbands to obtain modified envelope values of the M subbands, and an allocation unit, configured to modify the envelope according to the M subbands And a value of the original envelope value of the K subbands, and performing bit allocation on the N subbands.
  • the determining unit is specifically configured to: determine, according to an original envelope value of the N subbands, a first parameter, where the first parameter indicates the a degree of concentration of the spectral energy of the signal on the M subbands; determining a second parameter according to an original envelope value of the M subbands, wherein the second parameter represents a degree of spectral fluctuation of the M subbands; In a case where the first parameter belongs to the first range and the second parameter belongs to the second range, it is determined that a correction operation is performed on the original envelope values of the M sub-bands.
  • the determining unit is specifically configured to: determine, according to an original envelope value of the M subbands, the M subbands a total energy; determining a total energy of the K sub-bands according to an original envelope value of the K sub-bands; determining a ratio of a total energy of the M sub-bands to a total energy of the K sub-bands as the One parameter.
  • the determining unit is specifically configured to: according to the original envelope of the M subbands a value, a total energy of the M sub-bands and an energy of the first sub-band, wherein the energy of the first sub-band is the largest in the M sub-bands; and the energy of the first sub-band and the M sub-bands The ratio of the total energy is determined as the second parameter.
  • the modifying unit is specifically configured to: determine, according to an original envelope value of the M subbands, a total of the M subbands Energy and energy of the first sub-band, wherein the energy of the first sub-band is the largest among the M sub-bands; determining the correction factor according to the total energy of the M sub-bands and the energy of the first sub-band Using the correction factor, the original envelope values of the M subbands are respectively corrected to obtain the modified envelope values of the M subbands.
  • a modified envelope value of each of the M subbands is greater than an original envelope value of the same subband.
  • the determining unit is further configured to determine, according to the number of bits allocated to the N subbands respectively in the one bit allocation a number of redundant bits of each of the N subbands, the number of redundant bits of each of the N subbands being smaller than a number of bits required for encoding a single information unit in the same subband; the determining unit further And determining, according to the number of redundant bits of each of the N subbands, a total number of redundant bits; the allocation unit is further configured to: according to the modified envelope value of the M subbands, the K subbands The original envelope value and the total number of redundant bits are subjected to secondary bit allocation for the N subbands.
  • the present invention by selecting M subbands of the low frequency band from the N subbands, determining, according to the energy characteristics and spectral characteristics of the M subbands, performing a correction operation on the original envelope values of the M subbands, according to the original of the M subbands.
  • the envelope value is corrected for the original envelope values of the M subbands, and the N subbands are allocated once according to the modified envelope values of the M subbands and the original envelope values of the K subbands, instead of directly according to N
  • the original envelope values of the subbands are bit-allocated, so that the bit allocation is more in line with the bit requirements of each sub-band, thereby improving the coding and decoding performance of the signal.
  • FIG. 1 is a schematic flow chart of a method of processing a signal according to an embodiment of the present invention.
  • FIG. 2 is a schematic flow diagram of a process of a method of processing a signal in accordance with one embodiment of the present invention.
  • FIG. 3 is a schematic block diagram of an apparatus for processing signals in accordance with one embodiment of the present invention.
  • FIG. 4 is a schematic block diagram of an apparatus for processing a signal in accordance with another embodiment of the present invention.
  • Signal coding technology and decoding technology are widely used in various electronic devices, such as mobile phones, wireless devices, personal data assistants (PDAs), handheld or portable computers, and Global Positioning System (Global Positioning System, GPS) Receiver/Navigator, camera, audio/video player, camera, VCR, monitoring equipment, etc.
  • electronic devices include voice or audio encoders, voice or audio decoders, voice or audio encoders, and voice or audio decoders that can be directly processed by digital circuits or chips such as digital signals (Digital Signal Processor). , DSP) chip implementation, or by software code to drive the processor to execute the process in the software code.
  • DSP Digital Signal Processor
  • FIG. 1 is a schematic flow chart of a method of processing a signal according to an embodiment of the present invention.
  • the method of Figure 1 is performed by an encoding end, such as the speech or audio encoder described above.
  • the method of Figure 1 can also be performed by a decoding end, such as the speech or audio decoder described above.
  • the encoding end may first convert the time domain signal into a frequency domain signal, for example, a fast Fourier transform (FFT) or a modified Discrete Cosine Transform (MDCT) algorithm may be used for time-frequency. Transform. Then, the spectral coefficients of the frequency domain signal can be normalized by the global gain, and the normalized spectral coefficients are band-divided to obtain a plurality of sub-bands.
  • FFT fast Fourier transform
  • MDCT modified Discrete Cosine Transform
  • the decoding end can decode the code stream received from the encoding end to obtain normalized spectral coefficients, and divide the normalized spectral coefficients to obtain a plurality of sub-bands.
  • N is a positive integer greater than 1
  • M and K are positive integers
  • the sum of M and K is N.
  • the signal may be a voice signal or an audio signal.
  • the above K subbands are all subbands of the N subbands other than the M subbands.
  • the present invention by selecting M subbands of the low frequency band from the N subbands, determining, according to the energy characteristics and spectral characteristics of the M subbands, performing a correction operation on the original envelope values of the M subbands, according to the original of the M subbands.
  • the envelope value is corrected for the original envelope values of the M subbands, and the N subbands are allocated once according to the modified envelope values of the M subbands and the original envelope values of the K subbands, instead of directly according to N
  • the original envelope values of the subbands are bit-allocated, so that the bit allocation is more in line with the bit requirements of each sub-band, thereby improving the coding and decoding performance of the signal.
  • each sub-band of the signal is allocated according to the size of the frequency domain envelope, so that the allocated number of bits does not satisfactorily satisfy certain low-band sub-bands. Bit requirements.
  • the M subbands of the low frequency band are first selected from the N subbands, and the correcting operation is performed on the original envelope values of the M subbands according to the energy characteristics and the spectral characteristics of the M subbands, according to the N subbands.
  • the original envelope value corrects the original envelope values of the M subbands, and then performs bit allocation on the N subbands according to the modified envelope values of the M subbands and the original envelope values of the K subbands.
  • the N subbands are not directly allocated according to the original envelope values of the N subbands, but the energy characteristics and the spectral characteristics of the M subbands are taken into consideration to determine that the M subbands need to be performed. Correction, thereby obtaining the modified envelope values of the M sub-bands, and performing bit allocation according to the modified envelope value of the low-band sub-band and the original envelope value of other sub-bands, so that the bit allocation of each sub-band is more reasonable, especially It is the M subbands of the low frequency band, which can improve the coding and decoding performance of the signal.
  • each subband After dividing the spectral coefficients into N subbands, the envelope of each subband can be calculated and quantized. So each subband has a quantized envelope value.
  • the original envelope value is relative to the modified envelope value, and the original envelope value may refer to the original envelope value of the subband, that is, the quantized envelope value calculated after dividing the subband.
  • the envelope value obtained by correcting the original envelope value of the subband is called the modified envelope value. Therefore, in the embodiment of the present invention, both the original envelope value and the modified envelope value mentioned refer to the quantized envelope value.
  • M subbands may be selected from the N subbands according to subband harmonicity and subband energy.
  • the harmonic intensity of each subband may be greater than a preset intensity threshold and the energy of the subband occupies N subbands
  • the ratio of total energy is greater than a preset energy threshold.
  • the low-band sub-bands are often the bottleneck of signal coding performance.
  • the sub-bands with strong harmonics and a certain proportion of the total energy of all sub-bands are especially the bottleneck of coding performance.
  • M subbands are selected from N subbands, so that the original envelope values of the M subbands are corrected, so that the bit allocation for the M subbands is more reasonable, thereby enabling Effectively improve the encoding and decoding performance of the signal.
  • the N sub-bands may be arranged in a sequence from a small to a large frequency band.
  • the top M subbands can be selected from the N subbands.
  • the M sub-bands are selected in the order of the frequency bands from small to large, which can simplify the operation and improve the signal processing efficiency.
  • a first parameter may be determined according to an original envelope value of the N subbands, where the first parameter may indicate a concentration of the spectral energy of the signal in the M subbands.
  • the second parameter may be determined according to the original envelope values of the M subbands, wherein the second parameter represents the degree of spectral fluctuation of the M subbands.
  • the correcting operation may be performed on the original envelope values of the M subbands in a case where the first parameter belongs to the first range and the second parameter belongs to the second range.
  • the energy characteristics of the M sub-bands may be the concentration of the spectral energy of the signal in the M sub-bands, and the spectral characteristics of the M sub-bands may be the degree of spectral fluctuation of the M sub-bands.
  • the first range is related to the energy of the sub-bands and may be preset.
  • the spectral energy of the signal is less concentrated in the M subbands, it can be said that the proportion of the M subbands in the N subbands is small, and the encoding performance is not greatly affected, so there is no need to correct the M subbands.
  • Original envelope value When the spectral energy of the signal is concentrated in the M subbands, it indicates that the original envelope values of the M subbands are also relatively large, and the number of bits allocated by the M subbands is sufficient for encoding, and need not be used. Correct the original envelope value of these M subbands.
  • Experimental simulation can be performed to predetermine the first range.
  • the first range can be preset to [1/6, 2/3].
  • the second range is related to the degree of spectral fluctuation of the sub-band and may be preset.
  • the spectrum fluctuation degree of the M sub-bands is small, and even if the number of bits allocated by the M sub-bands is small, the coding performance is not greatly affected, so that it is not necessary to correct the original envelope values of the M sub-bands. Therefore, the second range indicates that the spectrum of the sub-band is highly volatile.
  • An experimental simulation can be performed to predetermine the second range.
  • the second range can be preset to or Generally, if the bandwidth that can be encoded in the signal is 0 to 4 KHz, the second range can be preset to If the bandwidth that can be encoded in the signal is 0 to 8 KHz, the second range can be preset to
  • the original envelope values of the M subbands may be modified to make the bits allocated by the M subbands more consistent with the bit requirements of the M subbands in the primary bit allocation of the M subbands. For example, for each of the M subbands, the modified envelope value is greater than the original envelope value. Then, compared to the original packet value according to the M subbands, the bit allocation is performed according to the modified envelope values of the M subbands, so that the M subbands are allocated more bits, thereby improving M.
  • the coding performance of the subbands may be modified to make the bits allocated by the M subbands more consistent with the bit requirements of the M subbands in the primary bit allocation of the M subbands. For example, for each of the M subbands, the modified envelope value is greater than the original envelope value. Then, compared to the original packet value according to the M subbands, the bit allocation is performed according to the modified envelope values of the M subbands, so that the M subbands are allocated more bits, thereby improving M.
  • the first parameter and the second parameter determined according to the original envelope values of the N subbands may reflect the characteristics of each frequency band, so that the first parameter belongs to the first range and the second parameter belongs to the second range.
  • the original envelope value correction operation performed on the M sub-bands is determined, so that when the bit allocation is performed according to the modified envelope values of the M sub-bands, the number of bits allocated by the M sub-bands is more consistent with the bit requirements of the M sub-bands. Thereby the encoding and decoding performance of the signal can be improved.
  • the total energy of the M sub-bands may be determined according to the original envelope values of the M sub-bands.
  • the total energy of the K sub-bands can be determined from the original envelope values of the K sub-bands.
  • the ratio of the total energy of the M sub-bands to the total energy of the K sub-bands may be determined as the first parameter.
  • the ratio of the total energy of the M sub-bands to the total energy of the K sub-bands may be determined as the first parameter.
  • the first parameter can be calculated according to the following equation.
  • the first parameter can be represented by a.
  • band_energy i can represent the original envelope value of the ith subband.
  • the original envelope value band_energy i of the i-th sub-band can be obtained according to the spectral coefficient of the i-th sub-band.
  • band_energy i can be obtained according to the following equation.
  • the total energy of the M sub-bands may be determined according to the original envelope values of the M sub-bands, and the energy of the first sub-band is determined, where, in the M sub-bands The energy of the first sub-band is the largest.
  • the ratio of the energy of the first sub-band to the total energy of the M sub-bands may be determined as a second parameter.
  • the degree of spectral fluctuation of the M sub-bands can be expressed by the degree of fluctuation of the original envelope values of the M sub-bands.
  • the second parameter can be calculated according to the following equation.
  • the second parameter can be represented by ⁇ .
  • the total energy of the M subbands and the energy of the first subband may be determined according to the original envelope values of the M subbands, wherein the first sub-band in the M subbands
  • the belt has the most energy.
  • the correction factor can be determined based on the total energy of the M subbands and the energy of the first subband.
  • the original envelope value of the M sub-bands can be corrected by using the correction factor to obtain the modified envelope values of the M sub-bands.
  • the correction factor can be determined according to the following equation.
  • the correction factor can be represented by ⁇ .
  • the original envelope value of each of the M subbands can be corrected.
  • the original envelope value of each subband can be multiplied by a correction factor to obtain a modified envelope value of the subband.
  • the modified envelope value of each of the M subbands may be greater than the original envelope value of the subband.
  • the corrected envelope value of each of the M subbands is obtained by correcting the original envelope values of each of the M subbands.
  • the modified envelope value of each subband may be greater than the original envelope value of the subband.
  • the modified envelope value of each of the M subbands is greater than the original envelope value of the subband, then in step 140, the bit allocation is performed according to the modified envelope values of the M subbands, and each of the M subbands The number of allocated bits will increase, making the bit allocation more in line with its bit requirements, thereby improving the encoding and decoding performance of the signal.
  • the modified envelope value of each of the M subbands may be smaller than the original envelope value of the subband.
  • the modified envelope value of each of the M subbands may be smaller than the original envelope value of the subband, then in step 140, according to the modified envelope value of the M subbands and the original envelope value of the K subband For bit allocation, each of the M subbands will have fewer bits allocated, and accordingly the number of bits allocated by the K subbands will increase, so that the bit allocation is more in line with its bit requirements, thereby improving Signal encoding and decoding performance.
  • the N subbands may be allocated once in a sequence of envelope values from large to small.
  • the correction factor may be determined according to the second parameter. Then, the original envelope value of the M sub-bands can be corrected by using the correction factor to obtain the modified envelope values of the M sub-bands.
  • the correction factor may be determined according to the second parameter, and according to the correction factor, the original envelope value of each of the M sub-bands may be corrected, for example, the original package of each sub-band may be The network value is multiplied by the correction factor to obtain the modified envelope value of the subband, so that the number of bits allocated by the M subbands more closely matches the bit requirements of the M subbands, thereby improving signal encoding and decoding performance.
  • the number of redundant bits per subband cannot encode one unit of information for that subband. Therefore, the number of redundant bits of each sub-band can be counted to obtain the total number of redundant bits, and then the second bit allocation is performed.
  • the number of redundant bits of each of the N subbands may be determined according to the number of bits allocated to the N subbands in one bit allocation, in the N subbands.
  • the number of redundant bits per subband is less than the number of bits required for encoding a single unit of information in the same subband.
  • the total number of redundant bits can be determined based on the number of redundant bits of each of the N subbands.
  • the secondary bit allocation can then be made to the N subbands based on the total number of redundant bits.
  • the total redundant bits can be equally distributed to N subbands. In this way, redundant bits can be utilized twice, bit waste is avoided, and the encoding and decoding performance of the signal is further improved.
  • the encoding end can quantize the spectral coefficients of the respective sub-bands by using the bits allocated in the two bit allocation processes of the respective sub-bands, and quantize the spectrum.
  • the index of the coefficients and the index of the original envelope values of the individual subbands are written to the code stream, which can then be sent to the decoder.
  • the decoding end can decode the quantized spectral coefficients by using the number of bit bits allocated by each sub-band in the two-bit allocation process to obtain recovery. signal of.
  • FIG. 2 is a schematic flow diagram of a process of a method of processing a signal in accordance with one embodiment of the present invention.
  • the encoding end performs time-frequency transform on the time domain signal.
  • the coding end divides the spectral coefficients of the frequency domain signal into N subbands, where N is a positive integer greater than 1.
  • the encoding end can calculate the global gain, and use the global gain to enter the original spectral coefficient.
  • the lines are normalized, and then the normalized spectral coefficients are banded to obtain the respective sub-bands.
  • the encoding end obtains the original envelope value of each sub-band through calculation and quantization operations.
  • the encoding end selects M subbands from the N subbands, where M is a positive integer.
  • the frequency bands of the M sub-bands are lower than the frequency bands of the K sub-bands other than the M sub-bands in the N sub-bands, K is a positive integer, and the sum of K and M is N.
  • the encoding end determines the first parameter according to the original envelope value of the M subbands and the original envelope value of the K subbands.
  • the first parameter may indicate the degree of concentration of the spectral energy of the signal over the M subbands.
  • the ratio of the total energy of the M subbands to the total energy of the K subbands may be used to represent the first parameter.
  • the calculation method of the first parameter reference may be made to the calculation method of the first parameter in the embodiment of FIG. 1 , and details are not described herein again.
  • the encoding end determines the second parameter according to the original envelope value of the M subbands.
  • the second parameter can represent the degree of spectral fluctuation of the M subbands.
  • a second parameter can be represented using a ratio of the energy of the first sub-band to the total energy of the M sub-bands, wherein the energy of the first sub-band is the largest among the M sub-bands.
  • the encoding end determines whether the first parameter belongs to the first range and the second parameter belongs to the second range.
  • the first range and the second range may be preset.
  • the first range can be preset to [1/6, 2/3].
  • the second range can be preset to or
  • step 207 If the encoding end determines in step 207 that the first parameter belongs to the first range and the second parameter belongs to the second range, correct the original envelope values of the M sub-bands according to the original envelope values of the M sub-bands to obtain M sub-respectors respectively. Corrected envelope value with band.
  • the encoding end may determine the correction factor according to the original envelope values of the M subbands. For the calculation of the correction factor, reference may be made to the process of the embodiment of FIG. 1, and details are not described herein again.
  • the encoding end may correct the original envelope value of each of the M subbands by using a correction factor to obtain a modified envelope value of the M subbands. For example, the corrected envelope value for each subband may be greater than the original envelope value for the subband.
  • the encoding end performs bit allocation on the N subbands according to the modified envelope value of the M subbands and the original envelope value of the K subbands.
  • the encoding end may perform bit allocation on the N sub-bands in descending order of the envelope value. For the M subbands, since the modified envelope value of each subband is greater than the subband The original envelope value, so the number of bits allocated to each of the M subbands is increased before the correction, so that the bit allocation is more in line with the bit requirements of each subband, thereby improving the encoding and decoding performance of the signal.
  • the encoding end performs secondary bit allocation on the N subbands.
  • the encoding end may determine the number of redundant bits of each subband according to the number of bits allocated to each subband of the N subbands after one bit allocation and the bandwidth of each subband. Thereby determining the total number of redundant bits of the N subbands. The total redundant bits are then evenly distributed to the N subbands based on the total number of redundant bits.
  • the encoding end quantizes the spectral coefficients of each subband according to the number of bits allocated by each of the N subbands.
  • the encoding end writes the code stream according to the quantized spectral coefficients in step 211 and the original envelope value of each subband.
  • the encoding end may write an index of the quantized spectral coefficients and the original envelope values of the respective subbands into the code stream.
  • the specific process can refer to the prior art, and details are not described herein again.
  • the encoding end determines in step 207 that the first parameter does not belong to the first range or the second parameter does not belong to the second range, the encoding end performs bit allocation on the N sub-bands according to the original envelope value of the N sub-bands.
  • the encoding end may perform bit allocation on the N sub-bands in descending order of the envelope value.
  • the encoding end performs secondary bit allocation on the N subbands.
  • the encoding end may determine the number of redundant bits of each subband according to the number of bits allocated to each subband of the N subbands after one bit allocation and the bandwidth of each subband. Thereby determining the total number of redundant bits of the N subbands. The total redundant bits are then evenly distributed to the N subbands based on the total number of redundant bits.
  • the encoding end quantizes the spectral coefficients of each subband according to the number of bits allocated by each of the N subbands.
  • the encoding end writes the code stream according to the spectral coefficients quantized in step 215 and the original envelope value of each subband.
  • the encoding end may write an index of the quantized spectral coefficients and the original envelope values of the respective subbands into the code stream.
  • the specific process can refer to the prior art, and details are not described herein again.
  • the first parameter belongs to the first range and the second parameter belongs to the second range.
  • the original envelope values of the M subbands in the low frequency band are corrected according to the original envelope values of the M subbands, and the N subbands are performed according to the modified envelope values of the M subbands and the original envelope values of the K subbands.
  • One bit allocation makes the bit allocation more in line with the bit requirements of each sub-band, thereby improving the encoding and decoding performance of the signal.
  • the device 300 of FIG. 3 is a schematic block diagram of an apparatus for processing signals in accordance with one embodiment of the present invention.
  • the device 300 of FIG. 3 may be an encoding end or a decoding end device.
  • the apparatus 300 of FIG. 3 includes a selection unit 310, a determination unit 320, a correction unit 330, and an allocation unit 340.
  • the selecting unit 310 selects M subbands from the N subbands, and the N subbands are obtained by dividing the spectral coefficients of the current frame of the signal, and the frequency bands of the M subbands are lower than the K subbands of the N subbands other than the M subbands.
  • the frequency band, N is a positive integer greater than 1
  • M and K are positive integers
  • the sum of M and K is N.
  • the determining unit 320 determines, according to the performance information of the M subbands, performing a correcting operation on the original envelope values of the M subbands, and the performance information is used to indicate the energy characteristics and the spectral characteristics of the M subbands.
  • the correcting unit 330 respectively corrects the original envelope values of the M sub-bands to obtain the corrected envelope values of the M sub-bands.
  • the allocating unit 340 performs bit allocation on the N subbands according to the modified envelope value of the M subbands and the original envelope value of the K subbands.
  • the present invention by selecting M subbands of the low frequency band from the N subbands, determining, according to the energy characteristics and spectral characteristics of the M subbands, performing a correction operation on the original envelope values of the M subbands, according to the original of the M subbands.
  • the envelope value is corrected for the original envelope values of the M subbands, and the N subbands are allocated once according to the modified envelope values of the M subbands and the original envelope values of the K subbands, instead of directly according to N
  • the original envelope values of the subbands are bit-allocated, so that the bit allocation is more in line with the bit requirements of each sub-band, thereby improving the coding and decoding performance of the signal.
  • the determining unit 320 may determine the first parameter according to the original envelope value of the N subbands, where the first parameter indicates a concentration degree of the spectral energy of the signal on the M subbands.
  • the determining unit 320 may determine the second parameter according to the original envelope values of the M subbands, wherein the second parameter represents the degree of spectral fluctuation of the M subbands.
  • the determining unit 320 may determine to perform a correcting operation on the original envelope values of the M subbands in a case where the first parameter belongs to the first range and the second parameter belongs to the second range.
  • the determining unit 320 may determine the total energy of the M sub-bands according to the original envelope values of the M sub-bands, and determine the total energy of the K sub-bands according to the original envelope values of the K sub-bands.
  • the ratio of the total energy of the M subbands to the total energy of the K subbands can be determined For the first parameter.
  • the determining unit 320 may determine the total energy of the M subbands and the energy of the first subband according to the original envelope values of the M subbands, where the first subbands of the M subbands The energy is the biggest.
  • the determining unit 320 may determine the ratio of the energy of the first sub-band to the total energy of the M sub-bands as the second parameter.
  • the correcting unit 330 may determine the total energy of the M subbands and the energy of the first subband according to the original envelope values of the M subbands, where the first subbands of the M subbands The energy is the biggest.
  • the correcting unit 330 may determine the correction factor according to the total energy of the M sub-bands and the energy of the first sub-band, and may correct the original envelope values of the M sub-bands by using the correction factor to obtain the correction packets of the M sub-bands. Network value.
  • the modified envelope value of each of the M subbands may be greater than the original envelope value of the same subband.
  • the determining unit 320 may further determine, according to the number of bits allocated to the N subbands at the time of one bit allocation, the number of redundant bits of each of the N subbands, and each of the N subbands The number of redundant bits of a subband is less than the number of bits required for encoding a single information unit in the same subband.
  • the determining unit 320 may also determine the total number of redundant bits according to the number of redundant bits of each of the N subbands.
  • the allocating unit 340 may also perform secondary bit allocation on the N subbands according to the modified envelope value of the M subbands, the original envelope value of the K subbands, and the total number of redundant bits.
  • the device 400 of Figure 4 is a schematic block diagram of an apparatus for processing a signal in accordance with another embodiment of the present invention.
  • the device 400 of Figure 4 can be an encoding end or a decoding end device.
  • the device 400 of FIG. 4 includes a memory 410 and a processor 420.
  • Memory 410 can include random access memory, flash memory, read only memory, programmable read only memory, nonvolatile memory or registers, and the like.
  • the processor 420 can be a Central Processing Unit (CPU).
  • the memory 410 is used to store executable instructions.
  • the processor 420 may execute executable instructions stored in the memory 410, to select M subbands from the N subbands, where the N subbands are obtained by dividing the spectral coefficients of the current frame of the signal, and the frequency bands of the M subbands are lower than
  • the frequency bands of K subbands other than M subbands in the N subbands, N is a positive integer greater than 1
  • M and K are positive integers, and the sum of M and K is N; determining the pair according to the performance information of the M subbands Original envelope value of M subbands Performing a correction operation, the performance information is used to indicate the energy characteristics and the spectral characteristics of the M sub-bands; the original envelope values of the M sub-bands are respectively corrected to obtain the corrected envelope values of the M sub-bands; the correction packets according to the M sub-bands
  • the network value and the original envelope value of the K subbands are allocated once for the N subbands.
  • the present invention by selecting M subbands of the low frequency band from the N subbands, determining, according to the energy characteristics and spectral characteristics of the M subbands, performing a correction operation on the original envelope values of the M subbands, according to the original of the M subbands.
  • the envelope value is corrected for the original envelope values of the M subbands, and the N subbands are allocated once according to the modified envelope values of the M subbands and the original envelope values of the K subbands, instead of directly according to N
  • the original envelope values of the subbands are bit-allocated, so that the bit allocation is more in line with the bit requirements of each sub-band, thereby improving the coding and decoding performance of the signal.
  • the processor 420 may determine the first parameter according to the original envelope values of the N subbands, where the first parameter indicates the degree of concentration of the spectral energy of the signal on the M subbands.
  • the processor 420 may determine the second parameter according to the original envelope values of the M subbands, wherein the second parameter represents a degree of spectral fluctuation of the M subbands.
  • the processor 420 may determine to perform a correcting operation on the original envelope values of the M subbands if the first parameter belongs to the first range and the second parameter belongs to the second range.
  • the processor 420 may determine the total energy of the M subbands according to the original envelope values of the M subbands, and determine the total energy of the K subbands according to the original envelope values of the K subbands.
  • the ratio of the total energy of the M sub-bands to the total energy of the K sub-bands may be determined as the first parameter.
  • the processor 420 may determine the total energy of the M subbands and the energy of the first subband according to the original envelope values of the M subbands, where the first subbands of the M subbands The energy is the biggest.
  • the processor 420 can determine the ratio of the energy of the first sub-band to the total energy of the M sub-bands as the second parameter.
  • the processor 420 may determine the total energy of the M subbands and the energy of the first subband according to the original envelope values of the M subbands, where the first subbands of the M subbands The energy is the biggest.
  • the processor 420 may determine the correction factor according to the total energy of the M subbands and the energy of the first subband, and may correct the original envelope values of the M subbands by using the correction factor to obtain the correction packets of the M subbands. Network value.
  • the modified envelope value of each of the M subbands may be greater than The original envelope value of the same subband.
  • the processor 420 may further determine the number of redundant bits of each of the N subbands according to the number of bits allocated to the N subbands in one bit allocation, and each of the N subbands The number of redundant bits of a subband is less than the number of bits required for encoding a single information unit in the same subband.
  • the processor 420 can also determine the total number of redundant bits based on the number of redundant bits of each of the N subbands.
  • the processor 420 can also perform secondary bit allocation on the N subbands according to the modified envelope value of the M subbands, the original envelope value of the K subbands, and the total number of redundant bits.
  • the disclosed systems, devices, and methods may be implemented in other manners.
  • the device embodiments described above are merely illustrative.
  • the division of the unit is only a logical function division.
  • there may be another division manner for example, multiple units or components may be combined or Can be integrated into another system, or some features can be ignored or not executed.
  • the mutual coupling or direct coupling or communication connection shown or discussed may be an indirect coupling or communication connection through some interface, device or unit, and may be in an electrical, mechanical or other form.
  • the units described as separate components may or may not be physically separated, and the components displayed as units may or may not be physical units, that is, may be located in one place, or may be distributed to multiple network units. Some or all of the units may be selected according to actual needs to achieve the purpose of the solution of the embodiment.
  • each functional unit in each embodiment of the present invention may be integrated into one processing unit, or each unit may exist physically separately, or two or more units may be integrated into one unit.
  • the functions may be stored in a computer readable storage medium if implemented in the form of a software functional unit and sold or used as a standalone product.
  • the technical solution of the present invention which is essential or contributes to the prior art, or a part of the technical solution, may be embodied in the form of a software product, which is stored in a storage medium, including
  • the instructions are used to cause a computer device (which may be a personal computer, server, or network device, etc.) to perform all or part of the steps of the methods described in various embodiments of the present invention.
  • the foregoing storage medium includes: a U disk, a mobile hard disk, a read-only memory (ROM), a random access memory (RAM), a magnetic disk, or an optical disk, and the like. .

Landscapes

  • Engineering & Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Computational Linguistics (AREA)
  • Signal Processing (AREA)
  • Health & Medical Sciences (AREA)
  • Human Computer Interaction (AREA)
  • Acoustics & Sound (AREA)
  • Multimedia (AREA)
  • Spectroscopy & Molecular Physics (AREA)
  • Mathematical Physics (AREA)
  • Quality & Reliability (AREA)
  • Theoretical Computer Science (AREA)
  • Compression, Expansion, Code Conversion, And Decoders (AREA)
  • Transmission Systems Not Characterized By The Medium Used For Transmission (AREA)

Abstract

提供了一种处理信号的方法及设备。该方法包括:从N个子带中选择M个子带(110),N个子带是对信号的当前帧的频谱系数划分得到的,M个子带的频段低于N个子带中除M个子带之外的K个子带的频段;根据M个子带的性能信息,确定对M个子带的原始包络值执行修正操作(120),性能信息用于指示M个子带的能量特性和频谱特性;分别对M个子带的原始包络值进行修正,以获取M个子带的修正包络值(130);根据M个子带的修正包络值以及K个子带的原始包络值,对N个子带进行一次比特分配(140)。该处理信号的方法及设备使得比特分配更为符合各个子带的比特需求,从而能够提升信号的编码以及解码性能。

Description

处理信号的方法及设备
本申请要求于2014年4月29日提交中国专利局、申请号为201410177234.3、发明名称为“处理信号的方法及设备”的中国专利申请的优先权,其全部内容通过引用结合在本申请中。
技术领域
本发明涉及信号处理领域,并且具体地,涉及处理信号的方法及设备。
背景技术
目前的通信传输越来越重视语音或音频信号的质量,因此对信号编码和解码的要求也越来越高。在现有频域编码算法中,通常直接根据频域包络的大小对信号的各个子带进行比特分配,然后利用所分配的比特数对各个子带进行编码。然而,实践表明,在现有的这些编码算法中,低频段的子带对信号编码质量影响较大,因此低频段的子带往往成为信号编码性能瓶颈,而上述比特分配方式不能很好地适配各个子带的比特需求,尤其是低频段的子带,从而导致信号的编码质性能较差。相应地,信号的解码性能也较差。
发明内容
本发明实施例提供处理信号的方法及设备,能够提升信号的编码以及解码性能。
第一方面,提供了一种处理信号的方法,包括:从N个子带中选择M个子带,所述N个子带是对信号的当前帧的频谱系数划分得到的,所述M个子带的频段低于所述N个子带中除所述M个子带之外的K个子带的频段,N为大于1的正整数,M和K均为正整数,M与K之和为N;根据所述M个子带的性能信息,确定对所述M个子带的原始包络值执行修正操作,所述性能信息用于指示所述M个子带的能量特性和频谱特性;分别对所述M个子带的原始包络值进行修正,以获取所述M个子带的修正包络值;根据所述M个子带的修正包络值以及所述K个子带的原始包络值,对所述N个子带进行一次比特分配。
结合第一方面,在第一种可能的实现方式中,所述根据所述M个子带的性能信息,确定对所述M个子带的原始包络值执行修正操作,包括:
根据所述N个子带的原始包络值,确定第一参数,其中所述第一参数表示所述信号的频谱能量在所述M个子带上的集中程度;
根据所述M个子带的原始包络值,确定第二参数,其中所述第二参数表示所述M个子带的频谱波动程度;
在所述第一参数属于第一范围并且所述第二参数属于第二范围的情况下,确定对所述M个子带的原始包络值执行修正操作。
结合第一方面的第一种可能的实现方式,在第二种可能的实现方式中,所述根据所述N个子带的原始包络值,确定第一参数,包括:根据所述M个子带的原始包络值确定所述M个子带的总能量;根据所述K个子带的原始包络值,确定所述K个子带的总能量;将所述M个子带的总能量与所述K个子带的总能量的比值确定为所述第一参数。
结合第一方面的第一种可能的实现方式或第二种可能的实现方式,在第三种可能的实现方式中,所述根据所述M个子带的原始包络值,确定第二参数,包括:根据所述M个子带的原始包络值,确定所述M个子带的总能量和第一子带的能量,其中在所述M个子带中所述第一子带的能量最大;将第一子带的能量与所述M个子带的总能量的比值确定为所述第二参数。
结合第一方面或上述任一实现方式,在第四种可能的实现方式中,所述分别对所述M个子带的原始包络值进行修正,以获取所述M个子带的修正包络值,包括:根据所述M个子带的原始包络值,确定所述M个子带的总能量和第一子带的能量,其中在所述M个子带中所述第一子带的能量最大;根据所述M个子带的总能量和所述第一子带的能量,确定修正因子;利用所述修正因子,分别对所述M个子带的原始包络值进行修正,以获取所述M个子带的修正包络值。
结合第一方面或上述任一实现方式,在第五种可能的实现方式中,所述M个子带中每个子带的修正包络值大于同一子带的原始包络值。
结合第一方面或上述任一实现方式,在第六种可能的实现方式中,还包括:根据在所述一次比特分配时分别对所述N个子带分配的比特数,确定所述N个子带中每个子带的冗余比特数,所述N个子带中每个子带的冗余比特数小于同一子带中单个信息单位编码所需要的比特数;根据所述N个子带中每个子带的冗余比特数,确定冗余比特总数;根据所述M个子带的修正包络值、所述K个子带的原始包络值以及冗余比特总数,对所述N个子带 进行二次比特分配。
第二方面,提供了一种处理信号的设备,包括:选择单元,用于从N个子带中选择M个子带,所述N个子带是对信号的当前帧的频谱系数划分得到的,所述M个子带的频段低于所述N个子带中除所述M个子带之外的K个子带的频段,N为大于1的正整数,M和K均为正整数,M与K之和为N;确定单元,用于根据所述M个子带的性能信息,确定对所述M个子带的原始包络值执行修正操作,所述性能信息用于指示所述M个子带的能量特性和频谱特性;修正单元,用于分别对所述M个子带的原始包络值进行修正,以获取所述M个子带的修正包络值;分配单元,用于根据所述M个子带的修正包络值以及所述K个子带的原始包络值,对所述N个子带进行一次比特分配。
结合第二方面,在第一种可能的实现方式中,所述确定单元,具体用于:根据所述N个子带的原始包络值,确定第一参数,其中所述第一参数表示所述信号的频谱能量在所述M个子带上的集中程度;根据所述M个子带的原始包络值,确定第二参数,其中所述第二参数表示所述M个子带的频谱波动程度;在所述第一参数属于第一范围并且所述第二参数属于第二范围的情况下,确定对所述M个子带的原始包络值执行修正操作。
结合第二方面的第一种可能的实现方式,在第二种可能的实现方式中,所述确定单元,具体用于:根据所述M个子带的原始包络值确定所述M个子带的总能量;根据所述K个子带的原始包络值,确定所述K个子带的总能量;将所述M个子带的总能量与所述K个子带的总能量的比值确定为所述第一参数。
结合第二方面的第一种可能的实现方式或第二种可能的实现方式,在第三种可能的实现方式中,所述确定单元,具体用于:根据所述M个子带的原始包络值,确定所述M个子带的总能量和第一子带的能量,其中在所述M个子带中所述第一子带的能量最大;将第一子带的能量与所述M个子带的总能量的比值确定为所述第二参数。
结合第二方面或上述任一实现方式,在第四种可能的实现方式中,所述修正单元,具体用于:根据所述M个子带的原始包络值,确定所述M个子带的总能量和第一子带的能量,其中在所述M个子带中所述第一子带的能量最大;根据所述M个子带的总能量和所述第一子带的能量,确定修正因 子;利用所述修正因子,分别对所述M个子带的原始包络值进行修正,以获取所述M个子带的修正包络值。
结合第二方面或上述任一实现方式,在第五种可能的实现方式中,所述M个子带中每个子带的修正包络值大于同一子带的原始包络值。
结合第二方面或上述任一实现方式,在第六种可能的实现方式中,所述确定单元,还用于根据在所述一次比特分配时分别对所述N个子带分配的比特数,确定所述N个子带中每个子带的冗余比特数,所述N个子带中每个子带的冗余比特数小于同一子带中单个信息单位编码所需要的比特数;所述确定单元,还用于根据所述N个子带中每个子带的冗余比特数,确定冗余比特总数;所述分配单元,还用于根据所述M个子带的修正包络值、所述K个子带的原始包络值以及冗余比特总数,对所述N个子带进行二次比特分配。
本发明实施例中,通过从N个子带中选择低频段的M个子带,根据M个子带的能量特性和频谱特性确定对M个子带的原始包络值执行修正操作,根据M个子带的原始包络值分别对M个子带的原始包络值进行修正,并根据M个子带的修正包络值以及K个子带的原始包络值,对N个子带进行一次比特分配,而非直接根据N个子带的原始包络值进行比特分配,使得比特分配更为符合各个子带的比特需求,从而能够提升信号的编码以及解码性能。
附图说明
为了更清楚地说明本发明实施例的技术方案,下面将对本发明实施例中所需要使用的附图作简单地介绍,显而易见地,下面所描述的附图仅仅是本发明的一些实施例,对于本领域普通技术人员来讲,在不付出创造性劳动的前提下,还可以根据这些附图获得其他的附图。
图1是根据本发明实施例的处理信号的方法的示意性流程图。
图2是根据本发明一个实施例的处理信号的方法的过程的示意性流程图。
图3是根据本发明一个实施例的处理信号的设备的示意框图。
图4是根据本发明另一实施例的处理信号的设备的示意框图。
具体实施方式
下面将结合本发明实施例中的附图,对本发明实施例中的技术方案进行清楚、完整地描述,显然,所描述的实施例是本发明的一部分实施例,而不是全部实施例。基于本发明中的实施例,本领域普通技术人员在没有做出创造性劳动的前提下所获得的所有其他实施例,都应属于本发明保护的范围。
信号的编码技术和解码技术,广泛应用于各种电子设备中,例如:移动电话,无线装置,个人数据助理(Personal Data Assistant,PDA),手持式或便携式计算机,全球定位系统(Global Positioning System,GPS)接收机/导航器,照相机,音频/视频播放器,摄像机,录像机,监控设备等。通常,这类电子设备中包括语音或音频编码器,还可以包括语音或音频解码器,语音或音频编码器,以及语音或音频解码器可以直接由数字电路或芯片例如数字信号处理(Digital Signal Processor,DSP)芯片实现,或者由软件代码驱动处理器执行软件代码中的流程而实现。
图1是根据本发明实施例的处理信号的方法的示意性流程图。图1的方法由编码端执行,例如上述语音或音频编码器。图1的方法还可以由解码端执行,例如上述语音或音频解码器。
在编码过程中,编码端可以先将时域信号变换为频域信号,例如可以采用快速傅立叶变换(Fast Fourier Transform,FFT)或改进离散余弦变换(Modified Discrete Cosine Transform,MDCT)等算法进行时频变换。然后,可以利用全局增益对频域信号的频谱系数进行归一化,将归一化的频谱系数进行分带从而得到多个子带。
在解码过程中,解码端可以对从编码端接收的码流进行解码,得到归一化的频谱系数,将归一化的频谱系数进行分带从而得到多个子带。
110,从N个子带中选择M个子带,N个子带是对信号的当前帧的频谱系数划分得到的,M个子带的频段低于N个子带中除M个子带之外的K个子带的频段,N为大于1的正整数,M和K均为正整数,M与K之和为N。
本发明实施例中,信号可以是语音信号,也可以是音频信号。上述K个子带是N个子带中除M个子带之外的所有子带。
120,根据M个子带的性能信息,确定对M个子带的原始包络值执行修正操作,性能信息用于指示M个子带的能量特性和频谱特性。
130,分别对M个子带的原始包络值进行修正,以获取M个子带的修正 包络值。
140,根据M个子带的修正包络值以及K个子带的原始包络值,对N个子带进行一次比特分配。
本发明实施例中,通过从N个子带中选择低频段的M个子带,根据M个子带的能量特性和频谱特性确定对M个子带的原始包络值执行修正操作,根据M个子带的原始包络值分别对M个子带的原始包络值进行修正,并根据M个子带的修正包络值以及K个子带的原始包络值,对N个子带进行一次比特分配,而非直接根据N个子带的原始包络值进行比特分配,使得比特分配更为符合各个子带的比特需求,从而能够提升信号的编码以及解码性能。
具体而言,在现有的频域编码算法中,直接根据频域包络的大小对信号的各个子带进行比特分配,使得所分配的比特数并不能很好地满足某些低频段子带的比特需求。而本发明实施例中,首先从N个子带中选择低频段的M个子带,根据M个子带的能量特性和频谱特性确定对M个子带的原始包络值执行修正操作,根据N个子带的原始包络值对M个子带的原始包络值进行修正,然后根据M个子带的修正包络值以及K个子带的原始包络值对N个子带进行一次比特分配。可见,本发明实施例中,并非直接根据N个子带的原始包络值对N个子带进行比特分配,而是将M个子带的能量特性和频谱特性作为考虑因素来确定需要对M个子带进行修正,从而得到M个子带各自的修正包络值,并根据低频段子带的修正包络值以及其它子带的原始包络值进行比特分配,使得对各个子带的比特分配更为合理,尤其是低频段的M个子带,从而能够提升信号的编码以及解码性能。
在对频谱系数划分得到N个子带后,可以计算并量化每个子带的包络。所以每个子带具有量化的包络值。应理解,原始包络值与修正包络值是相对而言的,原始包络值可以指子带原本的包络值,即在划分子带后计算得到的量化的包络值。在对子带原本的包络值进行修正后得到的包络值,称为修正包络值。因此,本发明实施例中,所提到的原始包络值和修正包络值均是指量化的包络值。
可选地,作为一个实施例,在步骤110中,可以根据子带谐波性以及子带能量,从N个子带中选择M个子带。例如,对于M个子带而言,每个子带的谐波性强度可以大于预设的强度阈值并且该子带的能量占N个子带的 总能量的比例大于预设的能量阈值。如上所述,低频带子带往往是信号编码性能的瓶颈,而这些子带中,谐波性较强并且能量占所有子带总能量一定比例的子带尤其是编码性能的瓶颈,因此,根据子带谐波性以及子带能量,从N个子带中选择M个子带,这样对这M个子带的原始包络值进行修正后,使得对于这M个子带的比特分配更为合理,从而能够更有效地提升信号的编码及解码性能。
可选地,作为另一实施例,N个子带可以是可以按照频段从小到大的顺序排列的。这样,在步骤110中,可以从N个子带中选择前M个子带。本实施例中,按照频段从小到大的顺序选择M个子带,能够简化操作,提升信号处理效率。
可选地,作为另一实施例,在步骤120中,可以根据N个子带的原始包络值,确定第一参数,其中第一参数可以表示信号的频谱能量在M个子带的集中程度。可以根据M个子带的原始包络值,确定第二参数,其中第二参数表示M个子带的频谱波动程度。可以在所述第一参数属于第一范围并且所述第二参数属于第二范围的情况下,确定对M个子带的原始包络值执行修正操作。
具体地,M个子带的能量特性可以是信号的频谱能量在M个子带的集中程度,M个子带的频谱特性可以是M个子带的频谱波动程度。
第一范围与子带的能量相关,可以是预先设定的。当信号的频谱能量在除M个子带的集中程度较小时,可以说明这M个子带在N个子带中的比例小,不会对编码性能造成很大的影响,因此无需修正这M个子带的原始包络值。当信号的频谱能量在M个子带的集中程度较大时,说明这M个子带的原始包络值也会比较大,那么这M个子带所分配的比特数是足够用于编码的,也无需修正这M个子带的原始包络值。可以进行实验仿真预先确定第一范围。例如,第一范围可以预设为[1/6,2/3]。
第二范围与子带的频谱波动程度相关,可以是预先设定的。M个子带的频谱波动程度小,则即使M个子带所分配的比特数少也不会对编码性能造成很大的影响,这样就无需修正M个子带的原始包络值。因此,第二范围表示子带的频谱波动程度较大。可以进行实验仿真预先确定第二范围。例如,第二范围可以预设为
Figure PCTCN2014092695-appb-000001
或者
Figure PCTCN2014092695-appb-000002
通常,如果信号中能够编 码的带宽为0~4KHz,第二范围可以预设为
Figure PCTCN2014092695-appb-000003
如果信号中能够编码的带宽为0~8KHz,第二范围可以预设为
Figure PCTCN2014092695-appb-000004
当第一参数属于第一范围并且第二参数属于第二范围时,意味着信号的频谱能量在M个子带上的集中程度不是特别大也不是特别小,而且M个子带的频谱波动程度较大,可以对M个子带的原始包络值进行修正,以便在M个子带的一次比特分配中使得M个子带所分配到的比特更符合M个子带的比特需求。例如,对于M个子带中每个子带,修正包络值大于原始包络值。那么,相比根据M个子带的原始包络值进行一次比特分配而言,根据M个子带的修正包络值进行一次比特分配使得M个子带分配到的比特数要多一些,从而能够提升M个子带的编码性能。
可见,本实施例中,根据N个子带的原始包络值确定的第一参数和第二参数可以反映各频段的特点,因此在第一参数属于第一范围并且第二参数属于第二范围的情况下确定对M个子带执行的原始包络值修正操作,使得后续根据M个子带的修正包络值进行比特分配时,M个子带所分配到的比特数更符合M个子带的比特需求,从而能够提升信号的编码和解码性能。
可选地,作为另一实施例,在步骤120中,可以根据M个子带的原始包络值确定M个子带的总能量。可以根据K个子带的原始包络值确定K个子带的总能量。可以将M个子带的总能量与K个子带的总能量的比值确定为第一参数。
具体地,可以将M个子带的总能量与K个子带的总能量的比值确定为第一参数。这样,能够
例如,可以按照如下等式计算得到第一参数。第一参数可以由α表示。
Figure PCTCN2014092695-appb-000005
Figure PCTCN2014092695-appb-000006
Figure PCTCN2014092695-appb-000007
其中,
Figure PCTCN2014092695-appb-000008
可以表示M个子带的总能量,
Figure PCTCN2014092695-appb-000009
可以表示K个子带的总能 量,band_widthi可以表示第i个子带的带宽,band_energyi可以表示第i个子带的能量。band_energyi可以表示第i个子带的原始包络值。例如,可以根据第i个子带的频谱系数,得到第i个子带的原始包络值band_energyi。比如,可以按照下述等式得到band_energyi
band_energyi=log2Ei
Figure PCTCN2014092695-appb-000010
应理解,本领域技术人员根据上述等式,显然可以进行各种等价的修改或变化来得到第一参数,这样的修改或变化也落入本发明实施例的范围内。
可选地,作为另一实施例,在步骤120中,可以根据M个子带的原始包络值,确定M个子带的总能量,并确定第一子带的能量,其中,在M个子带中第一子带的能量最大。可以将第一子带的能量与M个子带的总能量的比值确定为第二参数。
具体地,M个子带的频谱波动程度可以使用M个子带的原始包络值的波动程度来表示。比如,第二参数可以按照如下等式计算得到。第二参数可以由β表示。
Figure PCTCN2014092695-appb-000011
其中,
Figure PCTCN2014092695-appb-000012
Figure PCTCN2014092695-appb-000013
的计算方式可以参照上面的等式。
应理解,本领域技术人员根据上述等式,显然可以进行各种等价的修改或变化来得到第二参数,这样的修改或变化也落入本发明实施例的范围内。
可选地,作为另一实施例,在步骤130中,可以根据M个子带的原始包络值,确定M个子带的总能量和第一子带的能量,其中在M个子带中第一子带的能量最大。可以根据M个子带的总能量和第一子带的能量,确定修正因子。然后可以利用修正因子,分别对M个子带的原始包络值进行修正,以获取M个子带的修正包络值。
例如,修正因子可以按照如下等式确定。修正因子可以由γ表示。
Figure PCTCN2014092695-appb-000014
其中,
Figure PCTCN2014092695-appb-000015
Figure PCTCN2014092695-appb-000016
的计算方式可以参照上面的等式。
根据修正因子γ,可以对M个子带中每个子带的原始包络值进行修正,例如,可以将每个子带的原始包络值乘以修正因子,从而得到该子带的修正包络值。
应理解,本领域技术人员根据上述等式,显然可以进行各种等价的修改或变化来得到修正因子,这样的修改或变化也落入本发明实施例的范围内。
可选地,作为另一实施例,在步骤130中,M个子带中每个子带的修正包络值可以大于该子带的原始包络值。
具体地,通过对M个子带中每个子带的原始包络值进行修正,得到M个子带中每个子带的修正包络值。而每个子带的修正包络值可以大于该子带的原始包络值。M个子带中每个子带的修正包络值大于该子带的原始包络值,那么在步骤140中,根据M个子带的修正包络值进行比特分配,这M个子带中每个子带所分配到的比特数将会增大,使得比特分配更符合其比特需求,从而能够提升信号的编码及解码性能。
可选地,作为另一实施例,在步骤130中,M个子带中每个子带的修正包络值可以小于该子带的原始包络值。
具体地,M个子带中每个子带的修正包络值可以小于该子带的原始包络值,那么在步骤140中,根据M个子带的修正包络值以及K子带的原始包络值进行比特分配,这M个子带中每个子带所分配到的比特数将会较少,相应地K个子带分别分配到的比特数会增加,这样使得比特分配更符合其比特需求,从而能够提升信号的编码及解码性能。
可选地,作为另一实施例,在步骤130中,可以按照包络值从大到小的顺序,对N个子带进行一次比特分配。
可选地,作为另一实施例,在步骤120中,可以根据第二参数确定修正因子。然后可以利用修正因子,分别对M个子带的原始包络值进行修正,以获取M个子带的修正包络值。
具体地,可以根据第二参数确定修正因子,根据修正因子,可以对M个子带中每个子带的原始包络值进行修正,例如,可以将每个子带的原始包 络值乘以修正因子,从而得到该子带的修正包络值,这样使得M个子带所分配的比特数更符合M个子带的比特需求,从而能够提升信号编码和解码性能。
在进行第一次比特分配后,每个子带所分配的比特数中一般会有冗余的比特。每个子带的冗余比特数无法编码该子带的一个信息单位。因此,可以统计各个子带的冗余比特数得到冗余比特总数,然后进行二次比特分配。
可选地,作为另一实施例,在步骤140之后,可以根据在一次比特分配时分别对N个子带分配的比特数,确定N个子带中每个子带的冗余比特数,N个子带中每个子带的冗余比特数小于同一子带中单个信息单位编码所需要的比特数。可以根据N个子带中每个子带的冗余比特数,确定冗余比特总数。然后可以根据冗余比特总数,向N个子带进行二次比特分配。
具体而言,可以将总的冗余比特平均分配给N个子带。这样,可以二次利用冗余比特,避免比特浪费,并进一步提升信号的编码及解码性能。
上面描述了关于一次比特分配和第二比特分配的过程。如果上述图1的方法是由编码端执行的,在二次比特分配之后,编码端可以利用各个子带两次比特分配过程中所分配的比特数量化各个子带的频谱系数,将量化的频谱系数的索引和各个子带的原始包络值的索引写入码流,然后可以将码流发送至解码端。
如果上述图1的方法是由解码端执行的,在二次比特分配之后,解码端可以利用各个子带在两次比特分配过程中所分配的比特比特数对量化的频谱系数进行解码,得到恢复的信号。
下面将结合具体例子详细描述本发明实施例。应理解,这些例子只是为了帮助本领域技术人员更好地理解本发明实施例,而非限制本发明实施例的范围。
在下面的例子中,将以编码端为例进行描述。
图2是根据本发明一个实施例的处理信号的方法的过程的示意性流程图。
201,编码端对时域信号进行时频变换。
202,编码端对频域信号的频谱系数划分成N个子带,N为大于1的正整数。
具体地,编码端可以计算全局增益,使用全局增益对原始的频谱系数进 行归一化,然后对归一化后的频谱系数进行分带,从而得到各个子带。
203,编码端通过计算及量化操作,得到各个子带的原始包络值。
204,编码端从N个子带中选择M个子带,M为正整数。
M个子带的频段低于N个子带中除M个子带之外的K个子带的频段,K为正整数,K与M之和为N。
205,编码端根据M个子带的原始包络值和K个子带的原始包络值,确定第一参数。
第一参数可以表示信号的频谱能量在M个子带上的集中程度。例如,可以使用M个子带的总能量与K个子带的总能量的比值来表示第一参数。第一参数的计算方式可以参照图1的实施例中第一参数的计算方式,不再赘述。
206,编码端根据M个子带的原始包络值,确定第二参数。
第二参数可以表示M个子带的频谱波动程度。例如,可以使用第一子带的能量与所述M个子带的总能量的比值来表示第二参数,其中在M个子带中第一子带的能量最大。第二参数的计算方式可以参照图1的实施例中第二参数的计算方式,不再赘述。
207,编码端确定是否第一参数属于第一范围且第二参数属于第二范围。
第一范围和第二范围可以是预先设定的。例如,第一范围可以预设为[1/6,2/3]。第二范围可以预设为
Figure PCTCN2014092695-appb-000017
或者
Figure PCTCN2014092695-appb-000018
208,如果步骤207中编码端确定第一参数属于第一范围且第二参数属于第二范围,根据M个子带的原始包络值,修正M个子带的原始包络值,以分别得到M个子带的修正包络值。
具体而言,编码端可以根据M个子带的原始包络值确定修正因子。修正因子的计算方式可以参照图1的实施例的过程,不再赘述。编码端可以利用修正因子对M个子带中每个子带的原始包络值进行修正,得到M个子带的修正包络值。例如,每个子带的修正包络值可以大于该子带的原始包络值。
209,编码端根据M个子带的修正包络值以及K个子带的原始包络值,对N个子带进行一次比特分配。
例如,编码端可以按照包络值从大到小的顺序,对N个子带进行一次比特分配。对于M个子带中而言,由于每个子带的修正包络值大于该子带的 原始包络值,所以相比修正之前,M个子带中每个子带所分配到的比特数增多,这样使得比特分配更符合各个子带的比特需求,从而使得信号的编码和解码性能提升。
210,编码端对N个子带进行二次比特分配。
具体地,编码端可以根据一次比特分配之后N个子带中每个子带所分配到的比特数以及每个子带的带宽,确定每个子带的冗余比特数。从而确定N个子带的冗余比特总数。然后根据冗余比特总数,向N个子带平均分配总的冗余比特。
211,编码端根据N个子带中每个子带所分配的比特数,量化每个子带的频谱系数。
212,编码端根据步骤211中量化的频谱系数以及每个子带的原始包络值,写码流。
具体地,编码端可以将量化的频谱系数以及各个子带的原始包络值等的索引写入码流。具体过程可以参照现有技术,此处不再赘述。
213,如果在步骤207中编码端确定第一参数不属于第一范围或者第二参数不属于第二范围,则编码端根据N个子带的原始包络值,对N个子带进行一次比特分配。
例如,编码端可以按照包络值从大到小的顺序,对N个子带进行一次比特分配。
214,编码端对N个子带进行二次比特分配。
具体地,编码端可以根据一次比特分配之后N个子带中每个子带所分配到的比特数以及每个子带的带宽,确定每个子带的冗余比特数。从而确定N个子带的冗余比特总数。然后根据冗余比特总数,向N个子带平均分配总的冗余比特。
215,编码端根据N个子带中每个子带所分配的比特数,量化每个子带的频谱系数。
216,编码端根据步骤215中量化的频谱系数以及每个子带的原始包络值,写码流。
具体地,编码端可以将量化的频谱系数以及各个子带的原始包络值等的索引写入码流。具体过程可以参照现有技术,此处不再赘述。
本发明实施例中,在第一参数属于第一范围且第二参数属于第二范围的 情况下,根据M个子带的原始包络值对低频段的M个子带的原始包络值进行修正,根据M个子带的修正包络值以及K个子带的原始包络值对N个子带进行一次比特分配,使得比特分配更符合各个子带的比特需求,从而能够提升信号的编码和解码性能。
图3是根据本发明一个实施例的处理信号的设备的示意框图。图3的设备300可以是编码端或解码端设备。图3的设备300包括选择单元310、确定单元320、修正单元330和分配单元340。
选择单元310从N个子带中选择M个子带,N个子带是对信号的当前帧的频谱系数划分得到的,M个子带的频段低于N个子带中除M个子带之外的K个子带的频段,N为大于1的正整数,M和K均为正整数,M与K之和为N。确定单元320根据M个子带的性能信息确定对M个子带的原始包络值执行修正操作,性能信息用于指示M个子带的能量特性和频谱特性。修正单元330分别对M个子带的原始包络值进行修正,以获取M个子带的修正包络值。分配单元340根据M个子带的修正包络值以及K个子带的原始包络值,对N个子带进行一次比特分配。
本发明实施例中,通过从N个子带中选择低频段的M个子带,根据M个子带的能量特性和频谱特性确定对M个子带的原始包络值执行修正操作,根据M个子带的原始包络值分别对M个子带的原始包络值进行修正,并根据M个子带的修正包络值以及K个子带的原始包络值,对N个子带进行一次比特分配,而非直接根据N个子带的原始包络值进行比特分配,使得比特分配更为符合各个子带的比特需求,从而能够提升信号的编码以及解码性能。
可选地,作为一个实施例,确定单元320可以根据N个子带的原始包络值,确定第一参数,其中第一参数表示信号的频谱能量在M个子带上的集中程度。确定单元320可以根据M个子带的原始包络值,确定第二参数,其中第二参数表示M个子带的频谱波动程度。确定单元320可以在第一参数属于第一范围并且第二参数属于第二范围的情况下,确定对M个子带的原始包络值执行修正操作。
可选地,作为另一实施例,确定单元320可以根据M个子带的原始包络值确定M个子带的总能量,可以根据K个子带的原始包络值,确定K个子带的总能量,可以将M个子带的总能量与K个子带的总能量的比值确定 为第一参数。
可选地,作为另一实施例,确定单元320可以根据M个子带的原始包络值,确定M个子带的总能量和第一子带的能量,其中在M个子带中第一子带的能量最大。确定单元320可以将第一子带的能量与M个子带的总能量的比值确定为第二参数。
可选地,作为另一实施例,修正单元330可以根据M个子带的原始包络值,确定M个子带的总能量和第一子带的能量,其中在M个子带中第一子带的能量最大。修正单元330可以根据M个子带的总能量和第一子带的能量,确定修正因子,并可以利用修正因子,分别对M个子带的原始包络值进行修正,以获取M个子带的修正包络值。
可选地,作为另一实施例,M个子带中每个子带的修正包络值可以大于同一子带的原始包络值。
可选地,作为另一实施例,确定单元320还可以根据在一次比特分配时分别对N个子带分配的比特数,确定N个子带中每个子带的冗余比特数,N个子带中每个子带的冗余比特数小于同一子带中单个信息单位编码所需要的比特数。确定单元320还可以根据N个子带中每个子带的冗余比特数,确定冗余比特总数。分配单元340还可以根据M个子带的修正包络值、K个子带的原始包络值以及冗余比特总数,对N个子带进行二次比特分配。
设备300的其它功能和操作可以参照图1和图2的方法实施例的过程,为了避免重复,此处不再赘述。
图4是根据本发明另一实施例的处理信号的设备的示意框图。图4的设备400可以是编码端或解码端设备。图4的设备400包括存储器410和处理器420。
存储器410可以包括随机存储器、闪存、只读存储器、可编程只读存储器、非易失性存储器或寄存器等。处理器420可以是中央处理器(Central Processing Unit,CPU)。
存储器410用于存储可执行指令。处理器420可以执行存储器410中存储的可执行指令,用于:从N个子带中选择M个子带,N个子带是对信号的当前帧的频谱系数划分得到的,M个子带的频段低于N个子带中除M个子带之外的K个子带的频段,N为大于1的正整数,M和K均为正整数,M与K之和为N;根据M个子带的性能信息确定对M个子带的原始包络值 执行修正操作,性能信息用于指示M个子带的能量特性和频谱特性;分别对M个子带的原始包络值进行修正,以获取M个子带的修正包络值;根据M个子带的修正包络值以及K个子带的原始包络值,对N个子带进行一次比特分配。
本发明实施例中,通过从N个子带中选择低频段的M个子带,根据M个子带的能量特性和频谱特性确定对M个子带的原始包络值执行修正操作,根据M个子带的原始包络值分别对M个子带的原始包络值进行修正,并根据M个子带的修正包络值以及K个子带的原始包络值,对N个子带进行一次比特分配,而非直接根据N个子带的原始包络值进行比特分配,使得比特分配更为符合各个子带的比特需求,从而能够提升信号的编码以及解码性能。
可选地,作为一个实施例,处理器420可以根据N个子带的原始包络值,确定第一参数,其中第一参数表示信号的频谱能量在M个子带上的集中程度。处理器420可以根据M个子带的原始包络值,确定第二参数,其中第二参数表示M个子带的频谱波动程度。处理器420可以在第一参数属于第一范围并且第二参数属于第二范围的情况下,确定对M个子带的原始包络值执行修正操作。
可选地,作为另一实施例,处理器420可以根据M个子带的原始包络值确定M个子带的总能量,可以根据K个子带的原始包络值,确定K个子带的总能量,可以将M个子带的总能量与K个子带的总能量的比值确定为第一参数。
可选地,作为另一实施例,处理器420可以根据M个子带的原始包络值,确定M个子带的总能量和第一子带的能量,其中在M个子带中第一子带的能量最大。处理器420可以将第一子带的能量与M个子带的总能量的比值确定为第二参数。
可选地,作为另一实施例,处理器420可以根据M个子带的原始包络值,确定M个子带的总能量和第一子带的能量,其中在M个子带中第一子带的能量最大。处理器420可以根据M个子带的总能量和第一子带的能量,确定修正因子,并可以利用修正因子,分别对M个子带的原始包络值进行修正,以获取M个子带的修正包络值。
可选地,作为另一实施例,M个子带中每个子带的修正包络值可以大于 同一子带的原始包络值。
可选地,作为另一实施例,处理器420还可以根据在一次比特分配时分别对N个子带分配的比特数,确定N个子带中每个子带的冗余比特数,N个子带中每个子带的冗余比特数小于同一子带中单个信息单位编码所需要的比特数。处理器420还可以根据N个子带中每个子带的冗余比特数,确定冗余比特总数。处理器420还可以根据M个子带的修正包络值、K个子带的原始包络值以及冗余比特总数,对N个子带进行二次比特分配。
设备400的其它功能和操作可以参照图1和图2的方法实施例的过程,为了避免重复,此处不再赘述。
本领域普通技术人员可以意识到,结合本文中所公开的实施例描述的各示例的单元及算法步骤,能够以电子硬件、或者计算机软件和电子硬件的结合来实现。这些功能究竟以硬件还是软件方式来执行,取决于技术方案的特定应用和设计约束条件。专业技术人员可以对每个特定的应用来使用不同方法来实现所描述的功能,但是这种实现不应认为超出本发明的范围。
所属领域的技术人员可以清楚地了解到,为描述的方便和简洁,上述描述的系统、装置和单元的具体工作过程,可以参考前述方法实施例中的对应过程,在此不再赘述。
在本申请所提供的几个实施例中,应该理解到,所揭露的系统、装置和方法,可以通过其它的方式实现。例如,以上所描述的装置实施例仅仅是示意性的,例如,所述单元的划分,仅仅为一种逻辑功能划分,实际实现时可以有另外的划分方式,例如多个单元或组件可以结合或者可以集成到另一个系统,或一些特征可以忽略,或不执行。另一点,所显示或讨论的相互之间的耦合或直接耦合或通信连接可以是通过一些接口,装置或单元的间接耦合或通信连接,可以是电性,机械或其它的形式。
所述作为分离部件说明的单元可以是或者也可以不是物理上分开的,作为单元显示的部件可以是或者也可以不是物理单元,即可以位于一个地方,或者也可以分布到多个网络单元上。可以根据实际的需要选择其中的部分或者全部单元来实现本实施例方案的目的。
另外,在本发明各个实施例中的各功能单元可以集成在一个处理单元中,也可以是各个单元单独物理存在,也可以两个或两个以上单元集成在一个单元中。
所述功能如果以软件功能单元的形式实现并作为独立的产品销售或使用时,可以存储在一个计算机可读取存储介质中。基于这样的理解,本发明的技术方案本质上或者说对现有技术做出贡献的部分或者该技术方案的部分可以以软件产品的形式体现出来,该计算机软件产品存储在一个存储介质中,包括若干指令用以使得一台计算机设备(可以是个人计算机,服务器,或者网络设备等)执行本发明各个实施例所述方法的全部或部分步骤。而前述的存储介质包括:U盘、移动硬盘、只读存储器(ROM,Read-Only Memory)、随机存取存储器(RAM,Random Access Memory)、磁碟或者光盘等各种可以存储程序代码的介质。
以上所述,仅为本发明的具体实施方式,但本发明的保护范围并不局限于此,任何熟悉本技术领域的技术人员在本发明揭露的技术范围内,可轻易想到变化或替换,都应涵盖在本发明的保护范围之内。因此,本发明的保护范围应以所述权利要求的保护范围为准。

Claims (14)

  1. 一种处理信号的方法,其特征在于,包括:
    从N个子带中选择M个子带,所述N个子带是对信号的当前帧的频谱系数划分得到的,所述M个子带的频段低于所述N个子带中除所述M个子带之外的K个子带的频段,N为大于1的正整数,M和K均为正整数,M与K之和为N;
    根据所述M个子带的性能信息,确定对所述M个子带的原始包络值执行修正操作,所述性能信息用于指示所述M个子带的能量特性和频谱特性;
    分别对所述M个子带的原始包络值进行修正,以获取所述M个子带的修正包络值;
    根据所述M个子带的修正包络值以及所述K个子带的原始包络值,对所述N个子带进行一次比特分配。
  2. 根据权利要求1所述的方法,其特征在于,所述根据所述M个子带的性能信息,确定对所述M个子带的原始包络值执行修正操作,包括:
    根据所述N个子带的原始包络值,确定第一参数,其中所述第一参数表示所述信号的频谱能量在所述M个子带上的集中程度;
    根据所述M个子带的原始包络值,确定第二参数,其中所述第二参数表示所述M个子带的频谱波动程度;
    在所述第一参数属于第一范围并且所述第二参数属于第二范围的情况下,确定对所述M个子带的原始包络值执行修正操作。
  3. 根据权利要求2所述的方法,其特征在于,所述根据所述N个子带的原始包络值,确定第一参数,包括:
    根据所述M个子带的原始包络值确定所述M个子带的总能量;
    根据所述K个子带的原始包络值,确定所述K个子带的总能量;
    将所述M个子带的总能量与所述K个子带的总能量的比值确定为所述第一参数。
  4. 根据权利要求2或3所述的方法,其特征在于,所述根据所述M个子带的原始包络值,确定第二参数,包括:
    根据所述M个子带的原始包络值,确定所述M个子带的总能量和第一子带的能量,其中在所述M个子带中所述第一子带的能量最大;
    将第一子带的能量与所述M个子带的总能量的比值确定为所述第二参 数。
  5. 根据权利要求1至4中任一项所述的方法,其特征在于,所述分别对所述M个子带的原始包络值进行修正,以获取所述M个子带的修正包络值,包括:
    根据所述M个子带的原始包络值,确定所述M个子带的总能量和第一子带的能量,其中在所述M个子带中所述第一子带的能量最大;
    根据所述M个子带的总能量和所述第一子带的能量,确定修正因子;
    利用所述修正因子,分别对所述M个子带的原始包络值进行修正,以获取所述M个子带的修正包络值。
  6. 根据权利要求1至5中任一项所述的方法,其特征在于,所述M个子带中每个子带的修正包络值大于同一子带的原始包络值。
  7. 根据权利要求1至6中任一项所述的方法,其特征在于,还包括:
    根据在所述一次比特分配时分别对所述N个子带分配的比特数,确定所述N个子带中每个子带的冗余比特数,所述N个子带中每个子带的冗余比特数小于同一子带中单个信息单位编码所需要的比特数;
    根据所述N个子带中每个子带的冗余比特数,确定冗余比特总数;
    根据所述M个子带的修正包络值、所述K个子带的原始包络值以及冗余比特总数,对所述N个子带进行二次比特分配。
  8. 一种处理信号的设备,其特征在于,包括:
    选择单元,用于从N个子带中选择M个子带,所述N个子带是对信号的当前帧的频谱系数划分得到的,所述M个子带的频段低于所述N个子带中除所述M个子带之外的K个子带的频段,N为大于1的正整数,M和K均为正整数,M与K之和为N;
    确定单元,用于根据所述M个子带的性能信息,确定对所述M个子带的原始包络值执行修正操作,所述性能信息用于指示所述M个子带的能量特性和频谱特性;
    修正单元,用于分别对所述M个子带的原始包络值进行修正,以获取所述M个子带的修正包络值;
    分配单元,用于根据所述M个子带的修正包络值以及所述K个子带的原始包络值,对所述N个子带进行一次比特分配。
  9. 根据权利要求8所述的设备,其特征在于,所述确定单元,具体用 于:
    根据所述N个子带的原始包络值,确定第一参数,其中所述第一参数表示所述信号的频谱能量在所述M个子带上的集中程度;
    根据所述M个子带的原始包络值,确定第二参数,其中所述第二参数表示所述M个子带的频谱波动程度;
    在所述第一参数属于第一范围并且所述第二参数属于第二范围的情况下,确定对所述M个子带的原始包络值执行修正操作。
  10. 根据权利要求9所述的设备,其特征在于,所述确定单元,具体用于:
    根据所述M个子带的原始包络值确定所述M个子带的总能量;
    根据所述K个子带的原始包络值,确定所述K个子带的总能量;
    将所述M个子带的总能量与所述K个子带的总能量的比值确定为所述第一参数。
  11. 根据权利要求9或10所述的设备,其特征在于,所述确定单元,具体用于:
    根据所述M个子带的原始包络值,确定所述M个子带的总能量和第一子带的能量,其中在所述M个子带中所述第一子带的能量最大;
    将第一子带的能量与所述M个子带的总能量的比值确定为所述第二参数。
  12. 根据权利要求8至11中任一项所述的设备,其特征在于,所述修正单元,具体用于:
    根据所述M个子带的原始包络值,确定所述M个子带的总能量和第一子带的能量,其中在所述M个子带中所述第一子带的能量最大;
    根据所述M个子带的总能量和所述第一子带的能量,确定修正因子;
    利用所述修正因子,分别对所述M个子带的原始包络值进行修正,以获取所述M个子带的修正包络值。
  13. 根据权利要求8至12中任一项所述的设备,其特征在于,所述M个子带中每个子带的修正包络值大于同一子带的原始包络值。
  14. 根据权利要求8至13中任一项所述的设备,其特征在于,
    所述确定单元,还用于根据在所述一次比特分配时分别对所述N个子带分配的比特数,确定所述N个子带中每个子带的冗余比特数,所述N个子 带中每个子带的冗余比特数小于同一子带中单个信息单位编码所需要的比特数;
    所述确定单元,还用于根据所述N个子带中每个子带的冗余比特数,确定冗余比特总数;
    所述分配单元,还用于根据所述M个子带的修正包络值、所述K个子带的原始包络值以及冗余比特总数,对所述N个子带进行二次比特分配。
PCT/CN2014/092695 2014-04-29 2014-12-01 处理信号的方法及设备 Ceased WO2015165264A1 (zh)

Priority Applications (30)

Application Number Priority Date Filing Date Title
NZ721875A NZ721875A (en) 2014-04-29 2014-12-01 Signal processing method and device
KR1020197013008A KR102052487B1 (ko) 2014-04-29 2014-12-01 신호 처리 방법 및 디바이스
EP14891093.8A EP3079150B1 (en) 2014-04-29 2014-12-01 Signal processing method and device
UAA201608914A UA117395C2 (uk) 2014-04-29 2014-12-01 Спосіб і пристрій для обробки сигналів
KR1020187023421A KR101977383B1 (ko) 2014-04-29 2014-12-01 신호 처리 방법 및 디바이스
KR1020187004298A KR101890346B1 (ko) 2014-04-29 2014-12-01 신호 처리 방법 및 디바이스
SG11201605353YA SG11201605353YA (en) 2014-04-29 2014-12-01 Signal processing method and device
MX2018009564A MX389400B (es) 2014-04-29 2014-12-01 Método y dispositivo de procesamiento de señales.
JP2016556272A JP6231699B2 (ja) 2014-04-29 2014-12-01 信号処理方法及び装置
RU2016134014A RU2656812C2 (ru) 2014-04-29 2014-12-01 Способ и устройство обработки сигналов
KR1020167019856A KR101830855B1 (ko) 2014-04-29 2014-12-01 신호 처리 방법 및 디바이스
AU2014392351A AU2014392351B2 (en) 2014-04-29 2014-12-01 Signal processing method and device
ES14891093T ES2755152T3 (es) 2014-04-29 2014-12-01 Método y dispositivo de procesamiento de señal
MX2016010690A MX358255B (es) 2014-04-29 2014-12-01 Metodo y dispositivo de procesamiento de señales.
CA2935084A CA2935084C (en) 2014-04-29 2014-12-01 Signal processing method and device
BR112016015855A BR112016015855B1 (pt) 2014-04-29 2014-12-01 método e dispositivo de processamento de sinal
EP19181139.7A EP3637417B1 (en) 2014-04-29 2014-12-01 Signal processing method and device
EP23213863.6A EP4372738A3 (en) 2014-04-29 2014-12-01 Signal processing mthod and device
IL246427A IL246427B (en) 2014-04-29 2016-06-23 Method and apparatus for signal processing
ZA2016/04465A ZA201604465B (en) 2014-04-29 2016-06-30 Signal processing method and device
US15/335,449 US9837088B2 (en) 2014-04-29 2016-10-27 Signal processing method and device
US15/686,175 US10186271B2 (en) 2014-04-29 2017-08-25 Signal processing method and device
AU2017272204A AU2017272204B2 (en) 2014-04-29 2017-12-06 Signal processing method and device
US16/205,246 US10347264B2 (en) 2014-04-29 2018-11-30 Signal processing method and device
US16/409,290 US10546591B2 (en) 2014-04-29 2019-05-10 Signal processing method and device
US16/725,678 US11081121B2 (en) 2014-04-29 2019-12-23 Signal processing method and device
US17/376,813 US11580996B2 (en) 2014-04-29 2021-07-15 Signal processing method and device
US18/157,518 US11881226B2 (en) 2014-04-29 2023-01-20 Signal processing method and device
US18/538,381 US12249339B2 (en) 2014-04-29 2023-12-13 Signal processing method and device
US19/027,147 US20250239266A1 (en) 2014-04-29 2025-01-17 Signal Processing Method and Device

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
CN201410177234.3 2014-04-29
CN201410177234.3A CN105096957B (zh) 2014-04-29 2014-04-29 处理信号的方法及设备

Related Child Applications (1)

Application Number Title Priority Date Filing Date
US15/335,449 Continuation US9837088B2 (en) 2014-04-29 2016-10-27 Signal processing method and device

Publications (1)

Publication Number Publication Date
WO2015165264A1 true WO2015165264A1 (zh) 2015-11-05

Family

ID=54358124

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/CN2014/092695 Ceased WO2015165264A1 (zh) 2014-04-29 2014-12-01 处理信号的方法及设备

Country Status (20)

Country Link
US (9) US9837088B2 (zh)
EP (3) EP3079150B1 (zh)
JP (4) JP6231699B2 (zh)
KR (4) KR102052487B1 (zh)
CN (2) CN105096957B (zh)
AU (2) AU2014392351B2 (zh)
BR (1) BR112016015855B1 (zh)
CA (1) CA2935084C (zh)
CL (1) CL2016002106A1 (zh)
ES (1) ES2755152T3 (zh)
IL (2) IL246427B (zh)
MX (2) MX389400B (zh)
NZ (1) NZ721875A (zh)
PL (1) PL3637417T3 (zh)
PT (1) PT3079150T (zh)
RU (3) RU2688259C2 (zh)
SG (2) SG10201801095QA (zh)
UA (1) UA117395C2 (zh)
WO (1) WO2015165264A1 (zh)
ZA (1) ZA201604465B (zh)

Families Citing this family (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN107452390B (zh) 2014-04-29 2021-10-26 华为技术有限公司 音频编码方法及相关装置
CN105096957B (zh) 2014-04-29 2016-09-14 华为技术有限公司 处理信号的方法及设备
RU2763480C1 (ru) * 2021-06-16 2021-12-29 Федеральное государственное казенное военное образовательное учреждение высшего образования "Военный учебно-научный центр Военно-Морского Флота "Военно-морская академия имени Адмирала флота Советского Союза Н.Г. Кузнецова" Устройство для восстановления речевого сигнала
CN117476013A (zh) * 2022-07-27 2024-01-30 华为技术有限公司 音频信号的处理方法、装置、存储介质及计算机程序产品

Citations (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2007041789A1 (en) * 2005-10-11 2007-04-19 National Ict Australia Limited Front-end processing of speech signals
JP2008309955A (ja) * 2007-06-13 2008-12-25 Toshiba Corp ノイズサプレス装置
CN101727906A (zh) * 2008-10-29 2010-06-09 华为技术有限公司 高频带信号的编解码方法及装置
CN101770775A (zh) * 2008-12-31 2010-07-07 华为技术有限公司 信号处理方法及装置
CN102436820A (zh) * 2010-09-29 2012-05-02 华为技术有限公司 高频带信号编码方法及装置、高频带信号解码方法及装置
CN103366750A (zh) * 2012-03-28 2013-10-23 北京天籁传音数字技术有限公司 一种声音编解码装置及其方法

Family Cites Families (33)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
KR100188912B1 (ko) * 1992-09-21 1999-06-01 윤종용 서브밴드코딩의 비트재할당 방법
JP3283413B2 (ja) * 1995-11-30 2002-05-20 株式会社日立製作所 符号化復号方法、符号化装置および復号装置
JP2003280698A (ja) * 2002-03-22 2003-10-02 Sanyo Electric Co Ltd 音声圧縮方法および音声圧縮装置
DE10345995B4 (de) 2003-10-02 2005-07-07 Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. Vorrichtung und Verfahren zum Verarbeiten eines Signals mit einer Sequenz von diskreten Werten
DE102004007200B3 (de) 2004-02-13 2005-08-11 Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. Audiocodierung
JP4287840B2 (ja) * 2005-06-06 2009-07-01 パナソニック株式会社 符号化装置
US7590523B2 (en) 2006-03-20 2009-09-15 Mindspeed Technologies, Inc. Speech post-processing using MDCT coefficients
US8392198B1 (en) * 2007-04-03 2013-03-05 Arizona Board Of Regents For And On Behalf Of Arizona State University Split-band speech compression based on loudness estimation
US7885819B2 (en) * 2007-06-29 2011-02-08 Microsoft Corporation Bitstream syntax for multi-process audio decoding
KR20090110244A (ko) 2008-04-17 2009-10-21 삼성전자주식회사 오디오 시맨틱 정보를 이용한 오디오 신호의 부호화/복호화 방법 및 그 장치
JP2009288560A (ja) 2008-05-29 2009-12-10 Sanyo Electric Co Ltd 音声符号化装置、音声復号装置、及びプログラム
WO2010028299A1 (en) * 2008-09-06 2010-03-11 Huawei Technologies Co., Ltd. Noise-feedback for spectral envelope quantization
US8352279B2 (en) * 2008-09-06 2013-01-08 Huawei Technologies Co., Ltd. Efficient temporal envelope coding approach by prediction between low band signal and high band signal
CN101770776B (zh) 2008-12-29 2011-06-08 华为技术有限公司 瞬态信号的编码方法和装置、解码方法和装置及处理系统
EP2645367B1 (en) 2009-02-16 2019-11-20 Electronics and Telecommunications Research Institute Encoding/decoding method for audio signals using adaptive sinusoidal coding and apparatus thereof
PL2234103T3 (pl) 2009-03-26 2012-02-29 Fraunhofer Ges Forschung Urządzenie i sposób manipulacji sygnałem audio
CN101853663B (zh) * 2009-03-30 2012-05-23 华为技术有限公司 比特分配方法、编码装置及解码装置
US8498874B2 (en) 2009-09-11 2013-07-30 Sling Media Pvt Ltd Audio signal encoding employing interchannel and temporal redundancy reduction
CN102081927B (zh) 2009-11-27 2012-07-18 中兴通讯股份有限公司 一种可分层音频编码、解码方法及系统
CN102081926B (zh) * 2009-11-27 2013-06-05 中兴通讯股份有限公司 格型矢量量化音频编解码方法和系统
US8560330B2 (en) * 2010-07-19 2013-10-15 Futurewei Technologies, Inc. Energy envelope perceptual correction for high band coding
US9047875B2 (en) * 2010-07-19 2015-06-02 Futurewei Technologies, Inc. Spectrum flatness control for bandwidth extension
ES2484795T3 (es) 2010-07-19 2014-08-12 Dolby International Ab Procesamiento de señales de audio durante la reconstrucción de alta frecuencia
KR101826331B1 (ko) * 2010-09-15 2018-03-22 삼성전자주식회사 고주파수 대역폭 확장을 위한 부호화/복호화 장치 및 방법
CN102741831B (zh) * 2010-11-12 2015-10-07 宝利通公司 多点环境中的可伸缩音频
RU2464649C1 (ru) * 2011-06-01 2012-10-20 Корпорация "САМСУНГ ЭЛЕКТРОНИКС Ко., Лтд." Способ обработки звукового сигнала
CN102208188B (zh) * 2011-07-13 2013-04-17 华为技术有限公司 音频信号编解码方法和设备
RU2505921C2 (ru) * 2012-02-02 2014-01-27 Корпорация "САМСУНГ ЭЛЕКТРОНИКС Ко., Лтд." Способ и устройство кодирования и декодирования аудиосигналов (варианты)
CN103544957B (zh) * 2012-07-13 2017-04-12 华为技术有限公司 音频信号的比特分配的方法和装置
EP2717262A1 (en) * 2012-10-05 2014-04-09 Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. Encoder, decoder and methods for signal-dependent zoom-transform in spatial audio object coding
PL3232437T3 (pl) * 2012-12-13 2019-05-31 Fraunhofer Ges Forschung Urządzenie do kodowania głosowego audio, urządzenie do dekodowania głosowego audio, sposób kodowania głosowego audio i sposób dekodowania głosowego audio
CN104282312B (zh) * 2013-07-01 2018-02-23 华为技术有限公司 信号编码和解码方法以及设备
CN105096957B (zh) 2014-04-29 2016-09-14 华为技术有限公司 处理信号的方法及设备

Patent Citations (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2007041789A1 (en) * 2005-10-11 2007-04-19 National Ict Australia Limited Front-end processing of speech signals
JP2008309955A (ja) * 2007-06-13 2008-12-25 Toshiba Corp ノイズサプレス装置
CN101727906A (zh) * 2008-10-29 2010-06-09 华为技术有限公司 高频带信号的编解码方法及装置
CN101770775A (zh) * 2008-12-31 2010-07-07 华为技术有限公司 信号处理方法及装置
CN102436820A (zh) * 2010-09-29 2012-05-02 华为技术有限公司 高频带信号编码方法及装置、高频带信号解码方法及装置
CN103366750A (zh) * 2012-03-28 2013-10-23 北京天籁传音数字技术有限公司 一种声音编解码装置及其方法

Non-Patent Citations (1)

* Cited by examiner, † Cited by third party
Title
See also references of EP3079150A4 *

Also Published As

Publication number Publication date
BR112016015855B1 (pt) 2019-09-10
HK1212501A1 (zh) 2016-06-10
JP2017507366A (ja) 2017-03-16
JP7144499B2 (ja) 2022-09-29
AU2014392351A1 (en) 2016-07-21
NZ721875A (en) 2018-03-23
CA2935084C (en) 2019-03-12
CL2016002106A1 (es) 2016-12-16
US20190108848A1 (en) 2019-04-11
RU2016134014A (ru) 2018-02-22
SG11201605353YA (en) 2016-08-30
US11881226B2 (en) 2024-01-23
AU2014392351B2 (en) 2017-09-14
US20200135218A1 (en) 2020-04-30
EP4372738A2 (en) 2024-05-22
AU2017272204B2 (en) 2019-09-12
RU2018118576A (ru) 2018-10-31
US10546591B2 (en) 2020-01-28
ES2755152T3 (es) 2020-04-21
CN106409303A (zh) 2017-02-15
IL248575B (en) 2022-02-01
KR101890346B1 (ko) 2018-08-21
IL248575A0 (en) 2016-12-29
CN105096957A (zh) 2015-11-25
EP3079150B1 (en) 2019-09-11
US10186271B2 (en) 2019-01-22
PL3637417T3 (pl) 2024-06-10
US20210343298A1 (en) 2021-11-04
US20250239266A1 (en) 2025-07-24
KR20180095120A (ko) 2018-08-24
RU2688259C2 (ru) 2019-05-21
CN106409303B (zh) 2019-09-20
US20240185867A1 (en) 2024-06-06
US20170047074A1 (en) 2017-02-16
CA2935084A1 (en) 2015-11-05
CN105096957B (zh) 2016-09-14
US10347264B2 (en) 2019-07-09
UA117395C2 (uk) 2018-07-25
IL246427B (en) 2019-03-31
ZA201604465B (en) 2017-09-27
KR20190049954A (ko) 2019-05-09
US20190267015A1 (en) 2019-08-29
JP6790166B2 (ja) 2020-11-25
EP3637417A1 (en) 2020-04-15
US20170352358A1 (en) 2017-12-07
EP3637417B1 (en) 2024-02-07
EP4372738A3 (en) 2024-07-24
US11580996B2 (en) 2023-02-14
KR20180019761A (ko) 2018-02-26
JP2019152871A (ja) 2019-09-12
JP2018041091A (ja) 2018-03-15
EP3079150A4 (en) 2016-12-07
US12249339B2 (en) 2025-03-11
SG10201801095QA (en) 2018-03-28
US20230238010A1 (en) 2023-07-27
US9837088B2 (en) 2017-12-05
KR101977383B1 (ko) 2019-05-10
MX389400B (es) 2025-03-20
KR101830855B1 (ko) 2018-02-21
RU2018118576A3 (zh) 2019-03-05
KR20160099714A (ko) 2016-08-22
US11081121B2 (en) 2021-08-03
MX2018009564A (es) 2022-01-14
JP6517300B2 (ja) 2019-05-22
JP2021043453A (ja) 2021-03-18
JP6231699B2 (ja) 2017-11-15
KR102052487B1 (ko) 2019-12-05
PT3079150T (pt) 2019-10-29
RU2702265C1 (ru) 2019-10-07
AU2017272204A1 (en) 2018-01-04
MX2016010690A (es) 2016-11-23
MX358255B (es) 2018-08-09
EP3079150A1 (en) 2016-10-12
RU2656812C2 (ru) 2018-06-06

Similar Documents

Publication Publication Date Title
JP7144499B2 (ja) 信号処理方法及び装置
WO2015000373A1 (zh) 信号编码和解码方法以及设备
HK1230780B (zh) 处理信号的方法及设备
HK1230780A1 (zh) 处理信号的方法及设备
HK1230780A (zh) 處理信號的方法及設備
HK1212501B (zh) 处理信号的方法及设备

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 14891093

Country of ref document: EP

Kind code of ref document: A1

WWE Wipo information: entry into national phase

Ref document number: 246427

Country of ref document: IL

ENP Entry into the national phase

Ref document number: 2935084

Country of ref document: CA

REEP Request for entry into the european phase

Ref document number: 2014891093

Country of ref document: EP

WWE Wipo information: entry into national phase

Ref document number: 2014891093

Country of ref document: EP

WWE Wipo information: entry into national phase

Ref document number: 122018010264

Country of ref document: BR

REG Reference to national code

Ref country code: BR

Ref legal event code: B01A

Ref document number: 112016015855

Country of ref document: BR

ENP Entry into the national phase

Ref document number: 20167019856

Country of ref document: KR

Kind code of ref document: A

ENP Entry into the national phase

Ref document number: 2014392351

Country of ref document: AU

Date of ref document: 20141201

Kind code of ref document: A

WWE Wipo information: entry into national phase

Ref document number: MX/A/2016/010690

Country of ref document: MX

ENP Entry into the national phase

Ref document number: 2016134014

Country of ref document: RU

Kind code of ref document: A

WWE Wipo information: entry into national phase

Ref document number: A201608914

Country of ref document: UA

Ref document number: IDP00201605470

Country of ref document: ID

ENP Entry into the national phase

Ref document number: 2016556272

Country of ref document: JP

Kind code of ref document: A

NENP Non-entry into the national phase

Ref country code: DE

ENP Entry into the national phase

Ref document number: 112016015855

Country of ref document: BR

Kind code of ref document: A2

Effective date: 20160707