JPH0736493A - Variable rate speech coder

Info

Publication number: JPH0736493A
Application number: JP5181125A
Authority: JP
Inventors: Norio Nomura; 規雄野村
Original assignee: Matsushita Electric Industrial Co Ltd
Current assignee: Panasonic Holdings Corp
Priority date: 1993-07-22
Filing date: 1993-07-22
Publication date: 1995-02-07

Abstract

(57) [Summary]
[Purpose] To make the order of the linear analysis filter variable for each frame so that the number of quantization bits for the linear prediction coefficients can easily be varied.
[Structure] A frame cutout unit 11 cuts out one frame of speech data from the input speech, and a linear prediction coefficient calculation unit 12 calculates the PACOR (partial autocorrelation, commonly written PARCOR) coefficients for that frame. An order determination unit 13 then determines the order p of the linear prediction analysis. The linear prediction coefficient calculation unit 12 sends the linear prediction coefficients α_1 to α_p corresponding to order p to a linear analysis filter 14 and a quantization unit 15. The linear analysis filter 14 calculates the residual signal, and the quantization unit 15 quantizes the linear prediction coefficients into codes and sends them out. As a result, the order p of the linear prediction coefficients is variable for each frame, and the linear prediction coefficients are processed at a variable rate.

Description

Detailed Description of the Invention

[0001]

[Field of Industrial Application] The present invention relates to a variable rate speech coding apparatus that is used in digital speech communication equipment, digital speech storage equipment, and the like, and that transmits coded data at a variable rate.

[0002]

[Prior Art] In recent years, variable rate speech coding has been studied as a speech coding technique. Unlike conventional fixed rate speech coding, in which the amount of transmitted information is fixed, variable rate speech coding changes the amount of transmitted information over time. This can reduce the total amount of transmitted information compared with fixed rate speech coding. In both variable rate and fixed rate speech coding, the mainstream approach uses a fixed order for the linear prediction analysis of speech.

[0003] FIG. 3 is a block diagram showing the configuration of a conventional variable rate speech coding apparatus. In FIG. 3, this variable rate speech coding apparatus has a frame cutout unit 6 that cuts out speech data of one frame length from the input speech (signal), a linear prediction coefficient calculation unit 7 that calculates linear prediction coefficients, a linear analysis filter 8 that calculates a residual signal, and a quantization unit 9 that quantizes the linear prediction coefficients into codes. The order of the linear analysis filter 8 here is a fixed value.

[0004] Next, the operation of this conventional configuration will be described. The frame cutout unit 6 cuts out speech data of one frame length. The linear prediction coefficient calculation unit 7 calculates the linear prediction coefficients α_1 to α_p that minimize the output power of the linear analysis filter 8. The linear analysis filter 8 calculates the residual signal, and the quantization unit 9 quantizes the linear prediction coefficients α_1 to α_p into codes.
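The conventional fixed-order operation described above can be sketched as follows. This is a minimal illustration and not the patent's implementation: it assumes the autocorrelation method with the Levinson-Durbin recursion (the text only says that the coefficients minimize the filter's output power), it omits windowing, and the names `autocorrelation`, `levinson_durbin`, and `residual` are hypothetical.

```python
def autocorrelation(frame, max_lag):
    """Return r[0..max_lag] for one frame (no windowing, for brevity)."""
    n = len(frame)
    return [sum(frame[t] * frame[t - lag] for t in range(lag, n))
            for lag in range(max_lag + 1)]


def levinson_durbin(r, order):
    """Return (alpha, parcor) for the given fixed order.

    The predictor is x_hat[t] = sum(alpha[j] * x[t - 1 - j]).
    The sign convention of the reflection (PARCOR) coefficients varies
    between texts; this one makes parcor[i - 1] equal to alpha_i at step i.
    """
    alpha, parcor = [], []
    err = r[0]                      # prediction error power at order 0
    for i in range(1, order + 1):
        if err <= 0.0:              # degenerate frame (e.g. all zeros)
            k = 0.0
        else:
            acc = r[i] - sum(alpha[j] * r[i - 1 - j] for j in range(i - 1))
            k = acc / err
        # Update the lower-order coefficients, then append the new one.
        alpha = [alpha[j] - k * alpha[i - 2 - j] for j in range(i - 1)] + [k]
        parcor.append(k)
        err *= 1.0 - k * k
    return alpha, parcor


def residual(frame, alpha):
    """Analysis filter output: e[t] = x[t] - x_hat[t], with zero history."""
    out = []
    for t in range(len(frame)):
        pred = sum(a * frame[t - 1 - j]
                   for j, a in enumerate(alpha) if t - 1 - j >= 0)
        out.append(frame[t] - pred)
    return out


frame = [0.9 ** t for t in range(50)]   # toy decaying signal, not real speech
r = autocorrelation(frame, 2)
alpha, parcor = levinson_durbin(r, 2)   # fixed order p = 2
e = residual(frame, alpha)
```

For this toy frame the first coefficient comes out close to 0.9 and the residual carries far less power than the input, which is the behaviour the paragraph attributes to units 7 and 8.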

[0005]

[Problems to be Solved by the Invention] In the conventional speech coding described above, the order p of the linear analysis filter 8 is fixed, so it is difficult to vary the number of bits of the quantization codes for the linear prediction coefficients α_1 to α_p.

[0006] The present invention solves this conventional problem. Its object is to provide an excellent variable rate speech coding apparatus in which the order of the linear analysis filter is made variable for each frame so that the number of quantization bits for the linear prediction coefficients can easily be varied.

[0007]

[Means for Solving the Problems] To achieve the above object, the variable rate speech coding apparatus of the present invention has frame cutout means for cutting out one frame of speech data from the input speech, linear prediction coefficient calculation means for calculating coefficients for the one frame of speech data, order determination means for determining the order of the linear prediction analysis, linear prediction coefficient calculation means for sending out the linear prediction coefficients corresponding to the order, a variable-order linear analysis filter for calculating a residual signal, and quantization means for quantizing the linear prediction coefficients into codes and sending them out. The order of the linear prediction coefficients is varied for each frame, and the linear prediction coefficients are processed at a variable rate.

[0008] In addition, the linear prediction coefficient calculation means calculates PACOR coefficients, and the linear analysis filter uses the linear prediction coefficients for a given order to calculate a prediction gain indicating the ratio of output power to input power.

[0009]

[Operation] With this configuration, in the variable rate speech coding apparatus of the present invention, the order of the linear prediction coefficients, that is, the number of linear prediction coefficients, becomes variable for each frame, and the number of bits of the code produced by the quantization unit for the linear prediction coefficients is easily made variable.

[0010]

[Embodiment] An embodiment of the variable rate speech coding apparatus of the present invention will be described in detail below with reference to the drawings.

[0011] FIG. 1 is a block diagram showing the configuration of an embodiment of the variable rate speech coding apparatus of the present invention. In FIG. 1, this variable rate speech coding apparatus has a frame cutout unit 11 that cuts out one frame of speech data from the input speech (signal) and a linear prediction coefficient calculation unit 12 that calculates PACOR coefficients. It further has an order determination unit 13 that determines the order p, a variable-order linear analysis filter 14 that calculates a residual signal, and a quantization unit 15 that quantizes the linear prediction coefficients into codes.

[0012] Next, the operation of this embodiment will be described. The frame cutout unit 11 cuts out one frame of speech data from the input speech, which is continuous speech, and the linear prediction coefficient calculation unit 12 calculates the PACOR coefficients k_1 to k_pmax for that frame. Here, pmax is the maximum order. The order determination unit 13 then determines the order p through the following procedure.
(1) From the input speech, obtain the PACOR coefficient k_i for each order i (1 ≤ i ≤ pmax).
(2) Calculate the prediction gain u_i (0 ≤ i ≤ pmax) of the linear analysis filter 14 at each order using [Equation 1] below. The prediction gain u_i is the ratio of output power to input power of the linear analysis filter 14 when it uses the linear prediction coefficients for order i.

[0013]

[Equation 1]

[0014]
(3) Determine the order p using the following equation (2). The order p is the smallest value of i (0 ≤ i ≤ pmax) that satisfies u_i < u_t, where
u_t = u_pmax × (1 + du/100) ... (2)
du: parameter (%) that governs the behavior of the order determination process.
(4) The linear prediction coefficient calculation unit 12 calculates the linear prediction coefficients α_1 to α_p corresponding to the order p.
(5) Using the linear prediction coefficients α_1 to α_p calculated by the linear prediction coefficient calculation unit 12, the linear analysis filter 14 calculates the output residual.
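Steps (2) and (3) of the procedure can be sketched as below. The image for Equation 1 is not reproduced in this text, so the gain formula here is an assumption: it uses the standard identity that, with the Levinson-Durbin recursion, the residual-to-input power ratio at order i is the product of (1 - k_j^2) for j = 1..i, with u_0 = 1. The function names and the fallback to pmax when no order qualifies (which happens when du = 0, because the comparison is strict) are also hypothetical.

```python
def prediction_gains(parcor):
    """Return [u_0, ..., u_pmax].

    Assumed form of Equation 1: u_i = prod_{j=1..i} (1 - k_j**2), u_0 = 1.
    """
    gains = [1.0]
    for k in parcor:
        gains.append(gains[-1] * (1.0 - k * k))
    return gains


def decide_order(parcor, du_percent):
    """Smallest i with u_i < u_t, where u_t = u_pmax * (1 + du/100)."""
    gains = prediction_gains(parcor)
    pmax = len(parcor)
    u_t = gains[pmax] * (1.0 + du_percent / 100.0)
    for i, u in enumerate(gains):
        if u < u_t:
            return i
    return pmax  # assumption: use the maximum order if no i qualifies


# Hypothetical PACOR coefficients for one frame, pmax = 10.
k = [0.9, -0.5, 0.1, 0.05, 0.01, 0.0, 0.0, 0.0, 0.0, 0.0]
p_loose = decide_order(k, 5.0)   # larger du tolerates a larger residual
p_tight = decide_order(k, 0.5)   # smaller du demands a closer match to pmax
```

In this sketch a larger du accepts a lower order, since u_t moves further above u_pmax; the chosen p then drives steps (4) and (5).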

[0015] Depending on the input speech, there are signals for which increasing the order p of the linear analysis filter 14 yields no coding benefit. For example, for white noise the linear prediction coefficients are all close to 0, and the linear analysis filter 14 serves no purpose. This is because the linear analysis filter 14 flattens the spectral envelope of the input speech. The linear analysis filter 14 therefore cannot be used for a signal whose spectrum is already flat, such as white noise. This means that the order p of the linear analysis filter 14 may be reduced depending on the nature of the input speech.
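The white noise case can be checked numerically with a small sketch. It assumes Gaussian pseudo-random samples as a stand-in for white noise and uses only the order-1 predictor, for which α_1 = r_1 / r_0; the helper name `normalized_autocorrelation`, the frame length, and the seed are hypothetical choices.

```python
import random


def normalized_autocorrelation(x, max_lag):
    """Return r[lag] / r[0] for lag = 1..max_lag."""
    r0 = sum(v * v for v in x)
    return [sum(x[t] * x[t - lag] for t in range(lag, len(x))) / r0
            for lag in range(1, max_lag + 1)]


rng = random.Random(0)                     # fixed seed so the run is repeatable
white = [rng.gauss(0.0, 1.0) for _ in range(4000)]

rho = normalized_autocorrelation(white, 4)
first_order_alpha = rho[0]                 # order-1 predictor coefficient
first_order_gain = 1.0 - rho[0] ** 2       # residual-to-input power ratio
```

Because the normalized autocorrelation at every nonzero lag is small, the predictor coefficient is near 0 and the residual power is almost equal to the input power, so raising the order buys nothing for such a frame.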

[0016] Accordingly, the order determination process described above finds the order p at which the prediction gain is close to the prediction gain at the maximum order. The output residual obtained by applying the linear analysis filter 14 at this order p then has a waveform similar to the output residual obtained by applying it at the maximum order.

[0017] FIG. 2 is a waveform diagram showing an example of actual speech waveforms. For the input speech shown in FIG. 2(a), FIG. 2(b) shows the waveform when the order p is 2. FIGS. 2(c), (d), (e), and (f) show the waveforms when the order p is 4, 5, 8, and 10, respectively.

[0018] By changing the order p for each frame in this way, the number of linear prediction coefficients transmitted by this speech coding process becomes smaller than the number of linear prediction coefficients in fixed rate speech coding.
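The saving can be illustrated with simple arithmetic. Everything numeric here is hypothetical: the per-frame orders, the 5 bits per coefficient, and the idea of sending the order itself as a small per-frame field (the text does not say how the quantizer allocates bits or how a decoder would learn p).

```python
def coefficient_bits(orders, bits_per_coeff):
    """Total bits spent on linear prediction coefficients over all frames."""
    return sum(p * bits_per_coeff for p in orders)


pmax = 10
bits_per_coeff = 5                      # hypothetical uniform allocation
orders = [2, 4, 10, 0, 5, 8]            # hypothetical order p chosen per frame

variable_bits = coefficient_bits(orders, bits_per_coeff)
fixed_bits = coefficient_bits([pmax] * len(orders), bits_per_coeff)

# Assumption: p (0..pmax) is also sent once per frame as side information.
order_field_bits = pmax.bit_length()    # 4 bits cover the values 0..10
variable_bits_with_order = variable_bits + len(orders) * order_field_bits
```

With these made-up numbers the variable-order frames spend fewer coefficient bits than the fixed-order frames even after paying for the order field; real savings would depend on the signal and on du.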

[0019]

[Effects of the Invention] As is clear from the above description, the variable rate speech coding apparatus of the present invention makes the order of the linear prediction coefficients, that is, the number of linear prediction coefficients, variable for each frame. It therefore has the effect that the number of bits of the code produced by the quantization unit for the linear prediction coefficients can easily be varied.

[Brief description of drawings]

[FIG. 1] A block diagram showing the configuration of an embodiment of the variable rate speech coding apparatus of the present invention.

[FIG. 2] A waveform diagram showing an example of actual speech waveforms in the embodiment.

[FIG. 3] A block diagram showing the configuration of a conventional variable rate speech coding apparatus.

[Explanation of symbols]

11 Frame cutout unit
12 Linear prediction coefficient calculation unit
13 Order determination unit
14 Linear analysis filter
15 Quantization unit

Claims

[Claims]

1. A variable rate speech coding apparatus comprising: frame cutout means for cutting out one frame of speech data from input speech; linear prediction coefficient calculation means for calculating coefficients for the one frame of speech data; order determination means for determining the order of linear prediction analysis; linear prediction coefficient calculation means for sending out linear prediction coefficients corresponding to the order; a variable-order linear analysis filter for calculating a residual signal; and quantization means for quantizing the linear prediction coefficients into codes and sending them out, wherein the order of the linear prediction coefficients is varied for each frame and the linear prediction coefficients are processed at a variable rate.

2. The variable rate speech coding apparatus according to claim 1, wherein the linear prediction coefficient calculation means calculates PACOR coefficients, and the linear analysis filter uses the linear prediction coefficients for an order to calculate a prediction gain indicating the ratio of output power to input power.