JPH0367375B2

JPH0367375B2 -

Info

Publication number: JPH0367375B2
Application number: JP57204849A
Authority: JP
Inventors: Takehiro Morya; Masaaki Yoda
Original assignee: Nippon Telegraph and Telephone Corp
Current assignee: NTT Inc
Priority date: 1982-11-22
Filing date: 1982-11-22
Publication date: 1991-10-22
Also published as: JPS5994936A

Description

【発明の詳細な説明】この発明は一つのベクトルあたりの量子化情報
量が大きい場合に適するベクトル量子化法に関す
る。DETAILED DESCRIPTION OF THE INVENTION The present invention relates to a vector quantization method suitable for cases where the amount of quantization information per vector is large.

＜従来技術＞一般にベクトル量子化はサンプル値を複数個ご
とにベクトルE₁＝（e₁₁e₁₂……e_1p）、E₂＝（e₂₁e₂₂…
…e_2p）………E_o＝（e_o1e_o2……e_op）とし、その各
ベクトルごとに予め求めておいた代表ベクトル
F₁＝（f₁₁f₁₂……f_1p）、F₂＝（f₂₁f₂₂……f_2p）………
F_n＝（f_n1f_n2……f_np）の辞書から歪が最小となる
もの、即ち最も近いものを選び、その番号を符号
とするものである。こおように一つのベクトルを
一つの代表値例えば番号で符号化するものである
から、一つのベクトルあたりの符号化情報量（ビ
ツト数）Ｂが少ない場合でも効率のよい符号化が
可能である。また、2^B個の重心ベクトル（代表ベ
クトル）から成る辞書（符号帳）を重心ベクトル
数の10倍以上の学習サンプル（学習ベクトル）か
ら成る辞書を学習サンプルからデータの統計的分
布を反映するように作つておけば、特異な分布や
偏りのある分布を持つデータをも能率よく符号化
できる。そして符号化による平均歪はベクトルの
次元をＰとする時、2^-(2B/P)に比例して小さくする
ことができる。<Prior art> In general, vector quantization converts multiple sample values into vectors E ₁ = (e ₁₁ e ₁₂ ... e _1p ), E ₂ = (e ₂₁ e ₂₂ ...
…e _2p )……E _o = (e _o1 e _o2 …e _op ), and the representative vector calculated in advance for each vector is
F ₁ = (f ₁₁ f ₁₂ ...f _1p ), F ₂ = (f ₂₁ f ₂₂ ... f _2p ) ......
From the dictionary of F _n =(f _n1 f _n2 . . . f _np ), the one with the minimum distortion, that is, the one closest to it, is selected, and that number is used as the code. In this way, one vector is encoded using one representative value, such as a number, so efficient encoding is possible even when the amount of encoded information (number of bits) B per one vector is small. . In addition, we created a dictionary (codebook) consisting of ^2B centroid vectors (representative vectors) and a dictionary consisting of learning samples (learning vectors) that are 10 times more than the number of centroid vectors to reflect the statistical distribution of data from the learning samples. If created in advance, even data with unusual or biased distributions can be encoded efficiently. The average distortion due to encoding can be reduced in proportion to 2 ^{- (2B/P)} , where P is the dimension of the vector.

しかし与えられた情報量Ｂに対して2^B個のベク
トルの辞書とその10倍以上の学習サンプルが必要
であり、辞書製作に要する記憶容量や計算量を考
慮すれば情報量Ｂは10以下が現実的である。例え
ばＢ≧20の場合には現存のいかなる電子計算機を
もつてしても辞書の作製は不可能と言つてよい。
即ち一般のベクトル量子化においては１ベクトル
あたりの情報量Ｂの制約により、符号化に伴う平
均歪を小さくすることには限界があつた。またＢ
が異なる場合にはＢの値毎に別の辞書を作つてお
く必要があつた。 However, for a given amount of information B, a dictionary of 2 ^B vectors and more than 10 times as many learning samples are required, and considering the storage capacity and calculation amount required for dictionary creation, the amount of information B should be less than 10. Be realistic. For example, if B≧20, it is impossible to create a dictionary using any existing electronic computer.
That is, in general vector quantization, there is a limit to reducing the average distortion associated with encoding due to the restriction on the amount of information B per vector. Also B
If the values of B were different, it was necessary to create a separate dictionary for each value of B.

一方、スカラ量子化は１サンプルごと独立の量
子化で処理は極めて簡単であるが、サンプル当り
のビツト数が少ないときや入力データの分布や相
関に偏りがあるときの歪は、ベクトル量子化の場
合より非常に大きくなつてしまう。 On the other hand, scalar quantization is an independent quantization process for each sample, and processing is extremely simple. It becomes much larger than usual.

この発明は分布に偏りがあるデータでも能率よ
く量子化できるというベクトル量子化の利点をそ
のまま生かし、かつ符号化情報量Ｂが20以上の場
合でも歪をそれに応じて小さくできるようにした
ベクトル量子化法を提供するものである。 This invention utilizes the advantage of vector quantization that even data with a biased distribution can be efficiently quantized, and at the same time, vector quantization is capable of reducing distortion accordingly even when the amount of encoded information B is 20 or more. It provides law.

＜実施例＞第１図はこの発明のベクトル量子化法を適用し
た情報伝送系の例を示す。符号器１１は通信路１
２を介して復号器１３と接続され、符号器１１及
び復号器１３にはそれぞれ辞書１４及び１５が接
続されている。辞書１４には代表値を表す重心ベ
クトルが記憶された重心ベクトル部１６と、各重
心ベクトルが属する各集落（領域）に属する学習
ベクトルの広がりを示す分散ベクトルが記憶され
た分散ベクトル部１７とを具備する。復号器側の
辞書１５は符号器側の辞書１４と全く同一の構成
であつて、重心ベクトル部１８及び分散ベクトル
部１９を備えている。符号器１１は重心ベクトル
部１６及び分散ベクトル部１７をそれぞれ使うベ
クトル量子化部２１及びスカラ量子化部２２より
成る。<Embodiment> FIG. 1 shows an example of an information transmission system to which the vector quantization method of the present invention is applied. Encoder 11 is communication path 1
2, and dictionaries 14 and 15 are connected to the encoder 11 and the decoder 13, respectively. The dictionary 14 includes a centroid vector section 16 in which centroid vectors representing representative values are stored, and a dispersion vector section 17 in which dispersion vectors indicating the spread of learning vectors belonging to each village (region) to which each centroid vector belongs are stored. Be equipped. The dictionary 15 on the decoder side has exactly the same configuration as the dictionary 14 on the encoder side, and includes a centroid vector section 18 and a dispersion vector section 19. The encoder 11 includes a vector quantizer 21 and a scalar quantizer 22, each using a centroid vector section 16 and a variance vector section 17.

第２図に示す分布をもつ２次元（Ｐ＝２）のデ
ータを、Ｂ＝６ビツト及びB₀＝２で量子化する
場合を例にとり、この発明のベクトル量子化法を
説明する。第２図において閉曲線２３で囲まれた
領域内にデータが分布しているとする。通常の
X₁、X₂座標独立の量子化や座標変換後の量子化
ではスカラ量子化である限り、各データごとに量
子化を行う限り、一定の情報量に対する歪は大き
くなつてしまう。 The vector quantization method of the present invention will be explained by taking as an example the case where two-dimensional (P=2) data having the distribution shown in FIG. 2 is quantized with B=6 bits and B ₀ =2. It is assumed that data is distributed within a region surrounded by a closed curve 23 in FIG. normal
In quantization independent of X ₁ and X ₂ coordinates or quantization after coordinate transformation, as long as scalar quantization is used and quantization is performed for each data item, distortion for a certain amount of information will increase.

しかしこの発明では二段階でベクトル量子化を
行う。まず第１の量子化段では、サンプル値系列
がブロツクごとにそのサンプル値を要素とするベ
クトルとされた入力データは入力端子２４よりベ
クトル量子化部２１に入力されてB₀＝２でベク
トル量子化される。即ち領域２３の先験的分布に
合うように全体が2^B0個の領域２５ａ〜２５ｄに
分割され、その各領域即ち各集落２５ａ〜２５ｄ
の重心点C₁〜C₄が辞書として作製されている。
ベクトル量子化部２１では入力のベクトルが2^B0
個のどの集落２５ａ〜２５ｄに属するかが決めら
れる。例えば入力ベクトルＳは重心点のC₁の集
落２５ａに属するとする。 However, in this invention, vector quantization is performed in two stages. First, in the first quantization stage, the input data in which the sample value series is converted into a vector whose element is the sample value for each block is input to the vector quantization unit 21 from the input terminal 24, and is converted into a vector quantizer with B ₀ = 2. be converted into That is, the entire area is divided into 2 ^B0 areas 25a to 25d in accordance with the a priori distribution of the area 23, and each area, that is, each village 25a to 25d.
The center of gravity points C ₁ to _{C 4} are created as a dictionary.
In the vector quantization unit 21, the input vector is 2 ^B0
It is determined which of the villages 25a to 25d the person belongs to. For example, it is assumed that the input vector S belongs to the village 25a at the center of gravity _C1 .

以上の第１ベクトル量子化段の結果に基づい
て、次の第２のスカラ量子化段を行なう。 Based on the results of the first vector quantization stage, the next second scalar quantization stage is performed.

集落２５ａの分散ベクトルはその重心C₁を示
す重心ベクトルで代表される。この分散ベクトル
を利用して作られた（Ｂ−B₀）ビツトで表現さ
れる各座標独立の格子点２６から歪が最小となる
ものを選択する。即ち、同図の例では前述の仮定
よりＢ−B₀＝４であるから、各座標（X₁、X₂）
あたり２ビツトをとることになり、これによつて
計16個（2²×2²）の格子点２６を重心C₁のまわり
に設けてある。このように、第１のベクトル量子
化段で得られるB₀ビツトを使つて領域２５の番
号を伝える。次に、残りの（Ｂ−B₀）ビツトを
使つて格子点２６の番号を符号器１１の出力とし
て端子２７，２８より復号器１３へ伝送する。 The dispersion vector of the village 25a is represented by the center of gravity vector indicating its center of gravity _C1 . The one with the minimum distortion is selected from the coordinate-independent grid points 26 expressed by (B-B ₀ ) bits created using this dispersion vector. That is, in the example of the same figure, since B-B ₀ = 4 from the above assumption, each coordinate (X ₁ , X ₂ )
Thus, a total of 16 (2 ² × 2 ² ) grid points 26 are provided around the center of gravity C ₁ . In this way, the _B0 bit obtained in the first vector quantization stage is used to convey the number of region 25. Next, the number of the grid point 26 is transmitted to the decoder 13 from the terminals 27 and 28 as the output of the encoder 11 using the remaining (B-B ₀ ) bits.

復号器１３では送られたＢビツトの符号中の
B₀ビツトの集落番号及びＢ−B₀ビツトの格子点
番号によりそれぞれ重心ベクトル部１８及び分散
ベクトル部１９を参照して、この例では入力ベク
トルＳに近い格子点２６ａのベクトルＳ＾を復号し
て端子３１へ出力する。 In the decoder 13, the code of the sent B bits is
In this example, the vector S^ at the grid point 26a near the input vector S is decoded by referring to the center of gravity vector section 18 and the dispersion vector section 19 using the B0-bit village number and the B- _B0 _- bit grid point number, respectively. and outputs to terminal 31.

第１段階のベクトル量子化の際に複数の集落を
候補として選んでおいて、各候補ごとに歪が最小
となる格子点を選び、最終的に歪が最小となるも
のの組合せを送るようにすれば同じ情報量でさら
に平均の歪を小さくできる。 During the first stage of vector quantization, multiple villages are selected as candidates, and for each candidate, the grid point with the minimum distortion is selected, and finally the combination of the ones with the minimum distortion is sent. In other words, the average distortion can be further reduced with the same amount of information.

また分散ベクトル部１７，１９は各重心点ごと
に各座標ごとに値を持つているが、それぞれ平均
値におきかえても全体の歪はあまり大きくならな
いので分散ベクトルの辞書を必要に応じて簡単な
ものにすることが可能である。 In addition, the dispersion vector parts 17 and 19 have values for each coordinate for each center of gravity, but even if each is replaced with an average value, the overall distortion will not become very large, so a dictionary of dispersion vectors can be easily used as needed. It is possible to make it into something.

＜効果＞以上述べたようにこの発明によれば2^B0×２個
のベクトル辞書を用いた２段階の量子化法によ
り、同じ情報量でスカラ量子化より平均歪を小さ
くできる。音声のサンプル値の場合の比較例を第
３図に示す。横軸はサンプル当りのビツト数Ｂ
を、縦軸はSN比を示し、曲線３２は１次元最適
量子化の場合（従来の場合）、曲線３３は６次元
ベクトル及びガウス量子化の場合、曲線３４は12
次元ベクトル及びガウス量子化の場合である。こ
れよりビツト数が多くなるとベクトル量子化は困
難になるが、この発明ではベクトル量子化がで
き、しかもSN比もよいものとなる。特異な分布
をもつデータについてはさらに差が大きくなる。
そしてこの発明の量子化法によれば歪は2^B個のベ
クトル辞書を使つたベクトル量子化の歪と同程度
となることが期待される。符号化情報量Ｂが20以
上では、2^Bの辞書が非現実的であることから、Ｂ
が20以上の場合の量子化ではこの発明のベクトル
量子化法がきわめて有効となる。<Effects> As described above, according to the present invention, the two-stage quantization method using 2 ^B0 ×2 vector dictionaries allows the average distortion to be smaller than that of scalar quantization with the same amount of information. A comparative example in the case of audio sample values is shown in FIG. The horizontal axis is the number of bits per sample B
, the vertical axis shows the SN ratio, curve 32 is for one-dimensional optimal quantization (conventional case), curve 33 is for six-dimensional vector and Gaussian quantization, and curve 34 is 12
This is the case for dimensional vectors and Gaussian quantization. Vector quantization becomes difficult when the number of bits becomes larger than this, but vector quantization is possible with this invention, and the signal-to-noise ratio is also good. The difference becomes even larger for data with unique distributions.
According to the quantization method of this invention, it is expected that the distortion will be on the same level as the distortion of vector quantization using ^2B vector dictionaries. When the amount of encoded information B is 20 or more, a dictionary of 2 ^B is unrealistic, so B
The vector quantization method of the present invention is extremely effective for quantization when is 20 or more.

さらにＢが変化する場合に、この発明のベクト
ル量子化法では格子点の設定の規則を定めておけ
ば辞書を変更する必要がなく、サンプル値あたり
の情報量（Ｂ／Ｐ）は細かく制御できる。通常の
スカラ量子化ではサンプル値あたりの情報量は基
本的に整数値に限定されるし、通常のベクトル量
子化においてはＢの変化ごとに別の辞書を用意し
なければならない。 Furthermore, when B changes, the vector quantization method of this invention eliminates the need to change the dictionary as long as rules for setting grid points are established, and the amount of information per sample value (B/P) can be finely controlled. . In normal scalar quantization, the amount of information per sample value is basically limited to an integer value, and in normal vector quantization, a separate dictionary must be prepared for each change in B.

[Brief explanation of drawings]

第１図はこの発明のベクトル量子化法を適用し
た入力データを符号化して伝送する情報伝送系の
例を示すブロツク図、第２図は２次元で偏りのあ
る分布を持つデタの量子化例を示す図、第３図は
音声の周波数領域でのサンプル値を使つたスカラ
量子化法との比較例を示す図である。１１：符号器、１２：通信路、１３：復号器、
１４，１５：辞書、１６，１８：重心ベクトル
部、１７，１９：分散ベクトル部、２１：ベクト
ル量子化部、２２：スカラ量子化部。 Figure 1 is a block diagram showing an example of an information transmission system that encodes and transmits input data to which the vector quantization method of the present invention is applied, and Figure 2 is an example of quantization of data with a two-dimensional biased distribution. FIG. 3 is a diagram showing an example of comparison with a scalar quantization method using sample values in the audio frequency domain. 11: encoder, 12: communication path, 13: decoder,
14, 15: Dictionary, 16, 18: Centroid vector section, 17, 19: Dispersion vector section, 21: Vector quantization section, 22: Scalar quantization section.

Claims

[Claims] 1. In a vector quantization method in which a series of sample values is quantized in batches, each input vector is quantized by the number of bits B (B is an integer).
When quantizing it, part of it B ₀ (B ₀ is 2 or more and
N (N is a positive integer greater than or equal to 2 and less than or equal to 2 ^B0 ) in descending order of quantization distortion between the input vector and the input vector from a dictionary consisting of 2 ^B0 centroid vectors. The first vector quantization stage selects candidate vectors, and the remaining (B-B ₀ ) is calculated based on a dictionary of variance vectors corresponding to each of the candidate vectors.
A second scalar quantization stage that uses bits to quantize the difference between the input vector and each of the candidate vectors for each coordinate axis, thereby determining the encoded information that minimizes distortion as a whole. Vector quantization method.