JPH0367375B2 - - Google Patents

Info

Publication number
JPH0367375B2
JPH0367375B2 JP57204849A JP20484982A JPH0367375B2 JP H0367375 B2 JPH0367375 B2 JP H0367375B2 JP 57204849 A JP57204849 A JP 57204849A JP 20484982 A JP20484982 A JP 20484982A JP H0367375 B2 JPH0367375 B2 JP H0367375B2
Authority
JP
Japan
Prior art keywords
vector
quantization
dictionary
vectors
distortion
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Expired - Lifetime
Application number
JP57204849A
Other languages
Japanese (ja)
Other versions
JPS5994936A (en
Inventor
Takehiro Morya
Masaaki Yoda
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
NTT Inc
Original Assignee
Nippon Telegraph and Telephone Corp
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Nippon Telegraph and Telephone Corp filed Critical Nippon Telegraph and Telephone Corp
Priority to JP57204849A priority Critical patent/JPS5994936A/en
Publication of JPS5994936A publication Critical patent/JPS5994936A/en
Publication of JPH0367375B2 publication Critical patent/JPH0367375B2/ja
Granted legal-status Critical Current

Links

Classifications

    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L19/00Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
    • G10L19/02Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using spectral analysis, e.g. transform vocoders or subband vocoders
    • G10L19/032Quantisation or dequantisation of spectral components
    • G10L19/038Vector quantisation, e.g. TwinVQ audio

Landscapes

  • Physics & Mathematics (AREA)
  • Engineering & Computer Science (AREA)
  • Spectroscopy & Molecular Physics (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Signal Processing (AREA)
  • Health & Medical Sciences (AREA)
  • Computational Linguistics (AREA)
  • Human Computer Interaction (AREA)
  • Acoustics & Sound (AREA)
  • Multimedia (AREA)
  • Transmission Systems Not Characterized By The Medium Used For Transmission (AREA)
  • Reduction Or Emphasis Of Bandwidth Of Signals (AREA)
  • Image Processing (AREA)
  • Analogue/Digital Conversion (AREA)
  • Compression, Expansion, Code Conversion, And Decoders (AREA)

Description

【発明の詳細な説明】 この発明は一つのベクトルあたりの量子化情報
量が大きい場合に適するベクトル量子化法に関す
る。
DETAILED DESCRIPTION OF THE INVENTION The present invention relates to a vector quantization method suitable for cases where the amount of quantization information per vector is large.

<従来技術> 一般にベクトル量子化はサンプル値を複数個ご
とにベクトルE1=(e11e12……e1p)、E2=(e21e22
…e2p)………Eo=(eo1eo2……eop)とし、その各
ベクトルごとに予め求めておいた代表ベクトル
F1=(f11f12……f1p)、F2=(f21f22……f2p)………
Fn=(fn1fn2……fnp)の辞書から歪が最小となる
もの、即ち最も近いものを選び、その番号を符号
とするものである。こおように一つのベクトルを
一つの代表値例えば番号で符号化するものである
から、一つのベクトルあたりの符号化情報量(ビ
ツト数)Bが少ない場合でも効率のよい符号化が
可能である。また、2B個の重心ベクトル(代表ベ
クトル)から成る辞書(符号帳)を重心ベクトル
数の10倍以上の学習サンプル(学習ベクトル)か
ら成る辞書を学習サンプルからデータの統計的分
布を反映するように作つておけば、特異な分布や
偏りのある分布を持つデータをも能率よく符号化
できる。そして符号化による平均歪はベクトルの
次元をPとする時、2-(2B/P)に比例して小さくする
ことができる。
<Prior art> In general, vector quantization converts multiple sample values into vectors E 1 = (e 11 e 12 ... e 1p ), E 2 = (e 21 e 22 ...
…e 2p )……E o = (e o1 e o2 …e op ), and the representative vector calculated in advance for each vector is
F 1 = (f 11 f 12 ...f 1p ), F 2 = (f 21 f 22 ... f 2p ) ......
From the dictionary of F n =(f n1 f n2 . . . f np ), the one with the minimum distortion, that is, the one closest to it, is selected, and that number is used as the code. In this way, one vector is encoded using one representative value, such as a number, so efficient encoding is possible even when the amount of encoded information (number of bits) B per one vector is small. . In addition, we created a dictionary (codebook) consisting of 2B centroid vectors (representative vectors) and a dictionary consisting of learning samples (learning vectors) that are 10 times more than the number of centroid vectors to reflect the statistical distribution of data from the learning samples. If created in advance, even data with unusual or biased distributions can be encoded efficiently. The average distortion due to encoding can be reduced in proportion to 2 - (2B/P) , where P is the dimension of the vector.

しかし与えられた情報量Bに対して2B個のベク
トルの辞書とその10倍以上の学習サンプルが必要
であり、辞書製作に要する記憶容量や計算量を考
慮すれば情報量Bは10以下が現実的である。例え
ばB≧20の場合には現存のいかなる電子計算機を
もつてしても辞書の作製は不可能と言つてよい。
即ち一般のベクトル量子化においては1ベクトル
あたりの情報量Bの制約により、符号化に伴う平
均歪を小さくすることには限界があつた。またB
が異なる場合にはBの値毎に別の辞書を作つてお
く必要があつた。
However, for a given amount of information B, a dictionary of 2 B vectors and more than 10 times as many learning samples are required, and considering the storage capacity and calculation amount required for dictionary creation, the amount of information B should be less than 10. Be realistic. For example, if B≧20, it is impossible to create a dictionary using any existing electronic computer.
That is, in general vector quantization, there is a limit to reducing the average distortion associated with encoding due to the restriction on the amount of information B per vector. Also B
If the values of B were different, it was necessary to create a separate dictionary for each value of B.

一方、スカラ量子化は1サンプルごと独立の量
子化で処理は極めて簡単であるが、サンプル当り
のビツト数が少ないときや入力データの分布や相
関に偏りがあるときの歪は、ベクトル量子化の場
合より非常に大きくなつてしまう。
On the other hand, scalar quantization is an independent quantization process for each sample, and processing is extremely simple. It becomes much larger than usual.

この発明は分布に偏りがあるデータでも能率よ
く量子化できるというベクトル量子化の利点をそ
のまま生かし、かつ符号化情報量Bが20以上の場
合でも歪をそれに応じて小さくできるようにした
ベクトル量子化法を提供するものである。
This invention utilizes the advantage of vector quantization that even data with a biased distribution can be efficiently quantized, and at the same time, vector quantization is capable of reducing distortion accordingly even when the amount of encoded information B is 20 or more. It provides law.

<実施例> 第1図はこの発明のベクトル量子化法を適用し
た情報伝送系の例を示す。符号器11は通信路1
2を介して復号器13と接続され、符号器11及
び復号器13にはそれぞれ辞書14及び15が接
続されている。辞書14には代表値を表す重心ベ
クトルが記憶された重心ベクトル部16と、各重
心ベクトルが属する各集落(領域)に属する学習
ベクトルの広がりを示す分散ベクトルが記憶され
た分散ベクトル部17とを具備する。復号器側の
辞書15は符号器側の辞書14と全く同一の構成
であつて、重心ベクトル部18及び分散ベクトル
部19を備えている。符号器11は重心ベクトル
部16及び分散ベクトル部17をそれぞれ使うベ
クトル量子化部21及びスカラ量子化部22より
成る。
<Embodiment> FIG. 1 shows an example of an information transmission system to which the vector quantization method of the present invention is applied. Encoder 11 is communication path 1
2, and dictionaries 14 and 15 are connected to the encoder 11 and the decoder 13, respectively. The dictionary 14 includes a centroid vector section 16 in which centroid vectors representing representative values are stored, and a dispersion vector section 17 in which dispersion vectors indicating the spread of learning vectors belonging to each village (region) to which each centroid vector belongs are stored. Be equipped. The dictionary 15 on the decoder side has exactly the same configuration as the dictionary 14 on the encoder side, and includes a centroid vector section 18 and a dispersion vector section 19. The encoder 11 includes a vector quantizer 21 and a scalar quantizer 22, each using a centroid vector section 16 and a variance vector section 17.

第2図に示す分布をもつ2次元(P=2)のデ
ータを、B=6ビツト及びB0=2で量子化する
場合を例にとり、この発明のベクトル量子化法を
説明する。第2図において閉曲線23で囲まれた
領域内にデータが分布しているとする。通常の
X1、X2座標独立の量子化や座標変換後の量子化
ではスカラ量子化である限り、各データごとに量
子化を行う限り、一定の情報量に対する歪は大き
くなつてしまう。
The vector quantization method of the present invention will be explained by taking as an example the case where two-dimensional (P=2) data having the distribution shown in FIG. 2 is quantized with B=6 bits and B 0 =2. It is assumed that data is distributed within a region surrounded by a closed curve 23 in FIG. normal
In quantization independent of X 1 and X 2 coordinates or quantization after coordinate transformation, as long as scalar quantization is used and quantization is performed for each data item, distortion for a certain amount of information will increase.

しかしこの発明では二段階でベクトル量子化を
行う。まず第1の量子化段では、サンプル値系列
がブロツクごとにそのサンプル値を要素とするベ
クトルとされた入力データは入力端子24よりベ
クトル量子化部21に入力されてB0=2でベク
トル量子化される。即ち領域23の先験的分布に
合うように全体が2B0個の領域25a〜25dに
分割され、その各領域即ち各集落25a〜25d
の重心点C1〜C4が辞書として作製されている。
ベクトル量子化部21では入力のベクトルが2B0
個のどの集落25a〜25dに属するかが決めら
れる。例えば入力ベクトルSは重心点のC1の集
落25aに属するとする。
However, in this invention, vector quantization is performed in two stages. First, in the first quantization stage, the input data in which the sample value series is converted into a vector whose element is the sample value for each block is input to the vector quantization unit 21 from the input terminal 24, and is converted into a vector quantizer with B 0 = 2. be converted into That is, the entire area is divided into 2 B0 areas 25a to 25d in accordance with the a priori distribution of the area 23, and each area, that is, each village 25a to 25d.
The center of gravity points C 1 to C 4 are created as a dictionary.
In the vector quantization unit 21, the input vector is 2 B0
It is determined which of the villages 25a to 25d the person belongs to. For example, it is assumed that the input vector S belongs to the village 25a at the center of gravity C1 .

以上の第1ベクトル量子化段の結果に基づい
て、次の第2のスカラ量子化段を行なう。
Based on the results of the first vector quantization stage, the next second scalar quantization stage is performed.

集落25aの分散ベクトルはその重心C1を示
す重心ベクトルで代表される。この分散ベクトル
を利用して作られた(B−B0)ビツトで表現さ
れる各座標独立の格子点26から歪が最小となる
ものを選択する。即ち、同図の例では前述の仮定
よりB−B0=4であるから、各座標(X1、X2
あたり2ビツトをとることになり、これによつて
計16個(22×22)の格子点26を重心C1のまわり
に設けてある。このように、第1のベクトル量子
化段で得られるB0ビツトを使つて領域25の番
号を伝える。次に、残りの(B−B0)ビツトを
使つて格子点26の番号を符号器11の出力とし
て端子27,28より復号器13へ伝送する。
The dispersion vector of the village 25a is represented by the center of gravity vector indicating its center of gravity C1 . The one with the minimum distortion is selected from the coordinate-independent grid points 26 expressed by (B-B 0 ) bits created using this dispersion vector. That is, in the example of the same figure, since B-B 0 = 4 from the above assumption, each coordinate (X 1 , X 2 )
Thus, a total of 16 (2 2 × 2 2 ) grid points 26 are provided around the center of gravity C 1 . In this way, the B0 bit obtained in the first vector quantization stage is used to convey the number of region 25. Next, the number of the grid point 26 is transmitted to the decoder 13 from the terminals 27 and 28 as the output of the encoder 11 using the remaining (B-B 0 ) bits.

復号器13では送られたBビツトの符号中の
B0ビツトの集落番号及びB−B0ビツトの格子点
番号によりそれぞれ重心ベクトル部18及び分散
ベクトル部19を参照して、この例では入力ベク
トルSに近い格子点26aのベクトルS^を復号し
て端子31へ出力する。
In the decoder 13, the code of the sent B bits is
In this example, the vector S^ at the grid point 26a near the input vector S is decoded by referring to the center of gravity vector section 18 and the dispersion vector section 19 using the B0-bit village number and the B- B0 - bit grid point number, respectively. and outputs to terminal 31.

第1段階のベクトル量子化の際に複数の集落を
候補として選んでおいて、各候補ごとに歪が最小
となる格子点を選び、最終的に歪が最小となるも
のの組合せを送るようにすれば同じ情報量でさら
に平均の歪を小さくできる。
During the first stage of vector quantization, multiple villages are selected as candidates, and for each candidate, the grid point with the minimum distortion is selected, and finally the combination of the ones with the minimum distortion is sent. In other words, the average distortion can be further reduced with the same amount of information.

また分散ベクトル部17,19は各重心点ごと
に各座標ごとに値を持つているが、それぞれ平均
値におきかえても全体の歪はあまり大きくならな
いので分散ベクトルの辞書を必要に応じて簡単な
ものにすることが可能である。
In addition, the dispersion vector parts 17 and 19 have values for each coordinate for each center of gravity, but even if each is replaced with an average value, the overall distortion will not become very large, so a dictionary of dispersion vectors can be easily used as needed. It is possible to make it into something.

<効果> 以上述べたようにこの発明によれば2B0×2個
のベクトル辞書を用いた2段階の量子化法によ
り、同じ情報量でスカラ量子化より平均歪を小さ
くできる。音声のサンプル値の場合の比較例を第
3図に示す。横軸はサンプル当りのビツト数B
を、縦軸はSN比を示し、曲線32は1次元最適
量子化の場合(従来の場合)、曲線33は6次元
ベクトル及びガウス量子化の場合、曲線34は12
次元ベクトル及びガウス量子化の場合である。こ
れよりビツト数が多くなるとベクトル量子化は困
難になるが、この発明ではベクトル量子化がで
き、しかもSN比もよいものとなる。特異な分布
をもつデータについてはさらに差が大きくなる。
そしてこの発明の量子化法によれば歪は2B個のベ
クトル辞書を使つたベクトル量子化の歪と同程度
となることが期待される。符号化情報量Bが20以
上では、2Bの辞書が非現実的であることから、B
が20以上の場合の量子化ではこの発明のベクトル
量子化法がきわめて有効となる。
<Effects> As described above, according to the present invention, the two-stage quantization method using 2 B0 ×2 vector dictionaries allows the average distortion to be smaller than that of scalar quantization with the same amount of information. A comparative example in the case of audio sample values is shown in FIG. The horizontal axis is the number of bits per sample B
, the vertical axis shows the SN ratio, curve 32 is for one-dimensional optimal quantization (conventional case), curve 33 is for six-dimensional vector and Gaussian quantization, and curve 34 is 12
This is the case for dimensional vectors and Gaussian quantization. Vector quantization becomes difficult when the number of bits becomes larger than this, but vector quantization is possible with this invention, and the signal-to-noise ratio is also good. The difference becomes even larger for data with unique distributions.
According to the quantization method of this invention, it is expected that the distortion will be on the same level as the distortion of vector quantization using 2B vector dictionaries. When the amount of encoded information B is 20 or more, a dictionary of 2 B is unrealistic, so B
The vector quantization method of the present invention is extremely effective for quantization when is 20 or more.

さらにBが変化する場合に、この発明のベクト
ル量子化法では格子点の設定の規則を定めておけ
ば辞書を変更する必要がなく、サンプル値あたり
の情報量(B/P)は細かく制御できる。通常の
スカラ量子化ではサンプル値あたりの情報量は基
本的に整数値に限定されるし、通常のベクトル量
子化においてはBの変化ごとに別の辞書を用意し
なければならない。
Furthermore, when B changes, the vector quantization method of this invention eliminates the need to change the dictionary as long as rules for setting grid points are established, and the amount of information per sample value (B/P) can be finely controlled. . In normal scalar quantization, the amount of information per sample value is basically limited to an integer value, and in normal vector quantization, a separate dictionary must be prepared for each change in B.

【図面の簡単な説明】[Brief explanation of drawings]

第1図はこの発明のベクトル量子化法を適用し
た入力データを符号化して伝送する情報伝送系の
例を示すブロツク図、第2図は2次元で偏りのあ
る分布を持つデタの量子化例を示す図、第3図は
音声の周波数領域でのサンプル値を使つたスカラ
量子化法との比較例を示す図である。 11:符号器、12:通信路、13:復号器、
14,15:辞書、16,18:重心ベクトル
部、17,19:分散ベクトル部、21:ベクト
ル量子化部、22:スカラ量子化部。
Figure 1 is a block diagram showing an example of an information transmission system that encodes and transmits input data to which the vector quantization method of the present invention is applied, and Figure 2 is an example of quantization of data with a two-dimensional biased distribution. FIG. 3 is a diagram showing an example of comparison with a scalar quantization method using sample values in the audio frequency domain. 11: encoder, 12: communication path, 13: decoder,
14, 15: Dictionary, 16, 18: Centroid vector section, 17, 19: Dispersion vector section, 21: Vector quantization section, 22: Scalar quantization section.

Claims (1)

【特許請求の範囲】 1 サンプル値の系列を複数個ごとにまとめて量
子化するベクトル量子化法において、 各入力べクトルを量子化ビツト数B(Bは整数)
で量子化するとき、その一部B0(B0は2以上でB
より小さい正の整数)を使つて2B0個の重心ベク
トルから成る辞書の中から入力ベクトルとの間の
量子化歪が小さい順にN(Nは2以上で2B0以下の
正の整数)個の候補ベクトルを選択する第1のベ
クトル量子化段と、前記各候補ベクトルに対応す
る分散ベクトルの辞書をもとに残りの(B−B0
ビツトを使つて入力ベクトルと前記各候補ベクト
ルとの差を各座標軸ごとに量子化する第2のスカ
ラ量子化段とにより、全体として歪が最小となる
符号化情報を決定することを特徴とするベクトル
量子化法。
[Claims] 1. In a vector quantization method in which a series of sample values is quantized in batches, each input vector is quantized by the number of bits B (B is an integer).
When quantizing it, part of it B 0 (B 0 is 2 or more and
N (N is a positive integer greater than or equal to 2 and less than or equal to 2 B0 ) in descending order of quantization distortion between the input vector and the input vector from a dictionary consisting of 2 B0 centroid vectors. The first vector quantization stage selects candidate vectors, and the remaining (B-B 0 ) is calculated based on a dictionary of variance vectors corresponding to each of the candidate vectors.
A second scalar quantization stage that uses bits to quantize the difference between the input vector and each of the candidate vectors for each coordinate axis, thereby determining the encoded information that minimizes distortion as a whole. Vector quantization method.
JP57204849A 1982-11-22 1982-11-22 Vector quantizing method Granted JPS5994936A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
JP57204849A JPS5994936A (en) 1982-11-22 1982-11-22 Vector quantizing method

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
JP57204849A JPS5994936A (en) 1982-11-22 1982-11-22 Vector quantizing method

Publications (2)

Publication Number Publication Date
JPS5994936A JPS5994936A (en) 1984-05-31
JPH0367375B2 true JPH0367375B2 (en) 1991-10-22

Family

ID=16497408

Family Applications (1)

Application Number Title Priority Date Filing Date
JP57204849A Granted JPS5994936A (en) 1982-11-22 1982-11-22 Vector quantizing method

Country Status (1)

Country Link
JP (1) JPS5994936A (en)

Families Citing this family (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JP2683734B2 (en) * 1987-09-11 1997-12-03 日本電信電話株式会社 Audio coding method
KR19980022377A (en) * 1996-09-21 1998-07-06 김광호 Video signal coding and / or decoding method using adaptive lattice quantization

Family Cites Families (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JPS5282064A (en) * 1975-12-27 1977-07-08 Fujitsu Ltd Analog-to-digital converter

Also Published As

Publication number Publication date
JPS5994936A (en) 1984-05-31

Similar Documents

Publication Publication Date Title
US20030103573A1 (en) Method and apparatus for encoding and decoding data
EP1209917A3 (en) System and method for encoding/decoding data using multi-stage vector quantisation
US20060171533A1 (en) Method and apparatus for encoding and decoding key data
US5444488A (en) Method and apparatus for coding digital data using vector quantizing techniques
CN112398484A (en) A coding method and related equipment
Boo et al. Fixed-point optimization of transformer neural network
US5748839A (en) Quantization of input vectors and without rearrangement of vector elements of a candidate vector
Frey et al. Efficient stochastic source coding and an application to a Bayesian network source model
JPH0367375B2 (en)
Slyz et al. A nonlinear VQ-based predictive lossless image coder
CN118244993B (en) Data storage method, data processing method and device, electronic equipment and medium
CN100459460C (en) Method and apparatus for encoding and decoding data
Aksu et al. Multistage trellis coded quantisation (MS-TCQ) design and performance
Hwang et al. Genetic entropy-constrained vector quantizer design algorithm
CN117544176A (en) Radar data lossy compression method and system, computer equipment and medium
JPS6054581A (en) Difference encoding device
Effros Robustness to channel variation in source coding for transmission across noisy channels
KR20110033154A (en) How to count vectors in a network of regular points
Hwang et al. Fuzzy channel-optimized vector quantization
Somasundaram A novel codebook initialisation technique for generalized Lloyd algorithm using cluster density
US20260030484A1 (en) Information processing method, information processing system, and information processing program
JPH03145887A (en) Prediction coder
JPH01277300A (en) Learning type vector quantizer
JPH06112836A (en) Quantizer
KR19980034878A (en) Vector quantization device and method with weight