JPH0449720B2

Info

Publication number: JPH0449720B2
Application number: JP58034979A
Authority: JP
Inventors: Fumio Maehara; Juichi Taniguchi; Hisayo Kusuhara; Ryoji Sagara
Original assignee: Matsushita Electric Industrial Co Ltd
Current assignee: Panasonic Holdings Corp
Priority date: 1983-03-02
Filing date: 1983-03-02
Publication date: 1992-08-12
Also published as: JPS59160199A

Description

DETAILED DESCRIPTION OF THE INVENTION

Field of the Invention

The present invention relates to a speech recognition device intended for unspecified speakers.

Prior Art and Its Problems

Conventionally, a speech recognition device takes the $N$-dimensional feature vector sequence $\{a_1, a_2, \ldots, a_I\}$ obtained by analyzing the input speech signal, compares it with the $P$ standard pattern vector sequences $\{b^1_1, b^1_2, \ldots, b^1_J\}, \ldots, \{b^P_1, b^P_2, \ldots, b^P_K\}$ registered in the device in advance as a dictionary, and takes the sequence at the smallest distance, or with the greatest similarity, as the recognition result. In most such devices, when the input vector sequence $\{a_1, a_2, \ldots, a_I\}$ is compared with one of the standard pattern vector sequences, for example $\{b^l_1, b^l_2, \ldots, b^l_M\}$ (where $l = 1$ to $P$), the city-block distance or the Euclidean distance is calculated between an element vector $a_i$ of $\{a_1, a_2, \ldots, a_I\}$ and an element vector $b^l_n$ of $\{b^l_1, b^l_2, \ldots, b^l_M\}$, and the total distance between the two vector sequences is then calculated from these values using a technique such as dynamic programming or linear expansion and contraction of the time axis.

Here, the city-block distance and the Euclidean distance are given by the following expressions. With

$$a_i = \{a_{i,1}, a_{i,2}, \ldots, a_{i,N}\}, \qquad b^l_n = \{b^l_{n,1}, b^l_{n,2}, \ldots, b^l_{n,N}\},$$

$$c_{l,n} = \sum_{r=1}^{N} \left| a_{i,r} - b^l_{n,r} \right| \quad \text{(city-block distance)}$$

$$y_{l,n} = \sum_{r=1}^{N} \left( a_{i,r} - b^l_{n,r} \right)^2 \quad \text{(Euclidean distance)}$$

However, with the city-block distance or the Euclidean distance, a sufficient recognition rate has not been obtained in so-called speaker-independent recognition, where the speaker from whom the registered standard patterns were extracted differs from the speaker who is actually to be recognized. This is due to fine variations in the structure of the spectrum from speaker to speaker.
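As a minimal sketch of the two expressions above, the following Python computes both measures for one pair of frames. The function names and the sample values are illustrative assumptions, not taken from the patent; note that the second expression, as written in the text, is the sum of squared differences without a square root.

```python
from typing import Sequence


def city_block(a: Sequence[float], b: Sequence[float]) -> float:
    """c_{l,n}: sum over r of |a_{i,r} - b_{n,r}|."""
    if len(a) != len(b):
        raise ValueError("vectors must have the same dimension N")
    return sum(abs(x - y) for x, y in zip(a, b))


def euclidean_sq(a: Sequence[float], b: Sequence[float]) -> float:
    """y_{l,n}: sum over r of (a_{i,r} - b_{n,r})^2, as written in the text."""
    if len(a) != len(b):
        raise ValueError("vectors must have the same dimension N")
    return sum((x - y) ** 2 for x, y in zip(a, b))


# Hypothetical 4-channel frames (N = 4), chosen only for illustration.
a_i = [1.0, 4.0, 2.0, 0.5]
b_n = [1.5, 3.0, 2.0, 1.5]

print(city_block(a_i, b_n))    # 2.5
print(euclidean_sq(a_i, b_n))  # 2.25
```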

When a short segment of a speech signal, for example about 10 ms, is cut out and frequency-analyzed by means such as a Fourier transform or a filter bank, peaks appear in several frequency bands. These are called formants and are important parameters that characterize a phoneme. A formant corresponds to a pole, that is, a resonance point, of the filter obtained when the human vocal tract is regarded as a filter with a certain transfer function. In order of increasing resonance frequency they are called the first formant, the second formant, ..., the $n$-th formant. It is generally known that the relatively low-order formants, the first and second in particular, play a very important role in characterizing phonemes.
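To make the idea of "peaks in several frequency bands" concrete, here is a small sketch that picks local maxima from the outputs of a filter bank. The channel energies and the 250 Hz channel spacing are invented for illustration, and a local maximum is only a formant candidate; the patent does not describe a peak-picking procedure.

```python
from typing import List, Sequence


def spectral_peaks(energies: Sequence[float], centers_hz: Sequence[float]) -> List[float]:
    """Return the center frequencies of channels that are local maxima."""
    peaks = []
    for c in range(1, len(energies) - 1):
        # A channel is a peak if it rises above its lower neighbour
        # and is not exceeded by its upper neighbour.
        if energies[c] > energies[c - 1] and energies[c] >= energies[c + 1]:
            peaks.append(centers_hz[c])
    return peaks


# Hypothetical 10-channel filter bank, channels centered at 250, 500, ..., 2500 Hz.
centers_hz = [250 * (c + 1) for c in range(10)]
energies = [1, 3, 7, 4, 2, 6, 3, 1, 2, 1]

print(spectral_peaks(energies, centers_hz))  # [750, 1500, 2250]
```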

Once the formant frequencies and bandwidths are determined, the phoneme is almost determined. However, these values also vary from person to person, and this variation is a cause of the reduced recognition rate in speaker-independent recognition.

For example, if part of the speech waveform of an utterance of /a/ ("ア") is cut out and frequency-analyzed with a bank of relatively wideband band-pass filters, wide enough that the pitch component does not appear, two peaks form around the kHz range, as shown in FIG. 1A. These correspond to the first and second formants (F1, F2). The third formant (F3) appears near 3 kHz.

For /i/ ("イ"), by contrast, F1 is 300 Hz, F2 is 2.5 kHz, and F3 is 3 kHz (FIG. 1B). However, the values of F1, F2, F3, ... differ subtly from person to person. That is, even for the same sound uttered as /a/, the formant positions of speaker A and speaker B differ somewhat, as shown in FIGS. 1C and 1D. This variation in formant position between speakers has been the cause of the drop in recognition rate when conventional speech recognition devices are applied to unspecified speakers.

Object of the Invention

In view of the above drawbacks, an object of the present invention is to provide a speech recognition device that reduces the drop in recognition rate that individual differences in formant frequency cause in speaker-independent recognition.

Structure of the Invention

The present invention is a speech recognition device comprising: frequency analysis means for outputting a sequence of feature vectors; storage means for storing standard pattern vector sequences that have been frequency-analyzed in advance; comparison means for comparing the output of the frequency analysis means with each of the standard pattern vector sequences; and decision means for taking, as the recognition result, the standard pattern vector sequence that gives the minimum distance in that comparison. Each vector of the input pattern vector sequence and each vector of the standard pattern vector sequence are divided into groups of adjacent frequencies, each group is compared while being translated along the frequency axis, the correspondence that minimizes the distance is found, and the sum of the distances at that point is taken as the distance between the two vectors. This reduces the effect of individual differences in formant position between speakers and improves the recognition rate in speaker-independent recognition.
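Read as a formula, the distance between one input vector $a_i$ and one standard pattern vector $b^l_n$ described above might be written as follows. This is only a sketch of the prose: the symbol $D$, the index set $G_k$ of the elements in the $k$-th group, and the shift range $-D_1 \le d \le D_2$ are notation assumed here for illustration, and the city-block form of the inner sum is one possible choice of measure.

$$D(a_i, b^l_n) = \sum_{k=1}^{K} \; \min_{-D_1 \le d \le D_2} \; \sum_{v \in G_k} \left| a_{i,v} - b^l_{n,v+d} \right|$$

Because the minimum is taken separately inside each group, each group of adjacent frequencies is free to settle on its own shift, which is what lets the measure absorb a different displacement for each formant.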

Description of the Embodiment

An embodiment of the present invention is described below with reference to the drawings.

FIG. 2 is a block diagram of a speech recognition device according to one embodiment of the present invention. In the figure, 1 is a parameter analysis unit that analyzes the input speech and successively converts it into a sequence of $N$-dimensional parameter vectors $\{a_1, a_2, \ldots, a_I\}$; it consists of a frequency analyzer such as a filter bank or a Fourier transformer. 2 is a switch, which is set to side B when standard patterns are created and to side A when patterns are compared. 3 is a pattern storage unit, which stores the sequences of $N$-dimensional parameter vectors produced by the parameter analysis unit 1 as the standard patterns $\{b^1_1, b^1_2, \ldots, b^1_J\}, \ldots, \{b^P_1, b^P_2, \ldots, b^P_K\}$.

4 is a phase shift unit consisting of $K$ phase shifters. When one vector $b^l_n$ belonging to a standard pattern vector sequence is written $b^l_n = \{b^l_{n,1}, b^l_{n,2}, \ldots, b^l_{n,N}\}$, the vector is divided into $K$ groups, and each group is read into the corresponding phase shifter and shifted. 5 is a partial distance calculation unit consisting of $K$ partial distance calculators. It calculates the distance between the output of each phase shifter, which is output in turn as it is shifted, and the corresponding group of one input parameter vector, likewise divided into $K$ groups:

$$a_i = \{a_{i,1}, a_{i,2}, \ldots, a_{i,S}\} \; \ldots \; \{a_{i,t}, a_{i,t+1}, \ldots, a_{i,N}\}$$
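The grouping step can be sketched as below. The text does not give the group boundaries explicitly, so this sketch assumes contiguous, nearly equal groups; `split_groups` and its 0-based half-open index pairs are hypothetical conventions chosen for illustration.

```python
from typing import List, Tuple


def split_groups(n: int, k: int) -> List[Tuple[int, int]]:
    """Split channel indices 0..n-1 into k contiguous groups.

    Returns (start, stop) pairs with stop exclusive. When n is not a
    multiple of k, the first groups each get one extra channel
    (an assumption; the patent does not cover this case).
    """
    if not 1 <= k <= n:
        raise ValueError("need 1 <= k <= n")
    base, extra = divmod(n, k)
    bounds = []
    start = 0
    for g in range(k):
        size = base + (1 if g < extra else 0)
        bounds.append((start, start + size))
        start += size
    return bounds


print(split_groups(8, 4))   # [(0, 2), (2, 4), (4, 6), (6, 8)]
print(split_groups(10, 4))  # [(0, 3), (3, 6), (6, 8), (8, 10)]
```

The same boundaries would be applied to both the input vector and the standard pattern vector, so that group $k$ of one is always compared with group $k$ of the other.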

6 is a partial decision unit consisting of $K$ partial deciders; for each phase shifter output it selects and outputs the minimum of the distances calculated as that output is shifted in turn. 7 is a total distance calculation unit, which obtains the sum of the $K$ values supplied by the partial decision unit 6 and successively accumulates the results of performing the above operation for $i = 1$ to $I$ of the input parameter vector sequence. 8 is an overall decision unit, which performs the above operations for $l = 1$ to $P$ of the standard pattern vector sequences and outputs the one with the minimum resulting distance to signal line 9 as the recognition result.

Next, the operation of the device configured as described above is explained separately for standard pattern creation and for pattern comparison.

First, when a standard pattern is created, the switch 2 is connected to side B, and the input speech signal is successively converted by the parameter analysis unit 1 into a sequence of $N$-dimensional parameter vectors $\{a_1, a_2, \ldots, a_I\}$, which is then stored in the pattern storage unit 3. Repeating this operation $P$ times stores the standard pattern vector sequences $\{b^1_1, b^1_2, \ldots, b^1_J\}, \ldots, \{b^P_1, b^P_2, \ldots, b^P_K\}$ in the pattern storage unit 3.
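The role of the switch can be pictured with a very small sketch. `PatternStore`, its `route` method, and the word labels are hypothetical; the frames are assumed to have already been produced by the parameter analysis unit, which is not modelled here.

```python
from typing import List, Optional, Sequence, Tuple

Frame = List[float]


class PatternStore:
    """Toy model of switch 2 and pattern storage unit 3."""

    def __init__(self) -> None:
        self.patterns: List[Tuple[str, List[Frame]]] = []

    def route(self, switch: str, frames: Sequence[Sequence[float]],
              label: Optional[str] = None) -> Optional[List[Frame]]:
        copied = [list(f) for f in frames]
        if switch == "B":
            # Side B: register the analyzed sequence as a standard pattern.
            self.patterns.append((label or "", copied))
            return None
        if switch == "A":
            # Side A: pass the analyzed sequence on for comparison.
            return copied
        raise ValueError("switch must be 'A' or 'B'")


store = PatternStore()
store.route("B", [[0, 5, 0, 0], [0, 4, 1, 0]], label="word-1")
store.route("B", [[5, 0, 0, 0], [4, 1, 0, 0]], label="word-2")
print(len(store.patterns))  # 2 (P = 2 registered patterns)
```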

Next, pattern comparison is described. For pattern comparison, the switch 2 is connected to side A, and the parameter analysis unit 1 converts the input speech into the input parameter vector sequence $\{a_1, a_2, \ldots, a_I\}$ and feeds it to the partial distance calculation unit 5. Meanwhile, the pattern storage unit 3 divides each vector of one of the standard pattern vector sequences, $\{b^l_1, b^l_2, \ldots, b^l_M\}$, into $K$ groups and feeds them to the $K$ phase shifters of the phase shift unit 4. That is, with $b^l_n$ one vector belonging to $\{b^l_1, b^l_2, \ldots, b^l_M\}$ and $b^l_n = \{b^l_{n,1}, b^l_{n,2}, \ldots, b^l_{n,N}\}$, the vector is divided into the groups of adjacent elements

$$\{b^l_{n,1}, b^l_{n,2}, \ldots, b^l_{n,S}\} \; \ldots \; \{b^l_{n,t}, b^l_{n,t+1}, \ldots, b^l_{n,N}\}$$

and these are used as the inputs to the phase shift unit 4.

Each phase shifter in the phase shift unit 4 shifts its output every time the partial distance calculation unit 5 in the next stage finishes calculating a partial distance, so that the distance to each vector of the input parameter vector sequence, grouped in the same way, is calculated while the pattern is displaced.

That is, the elements of one vector $a_i$ ($i = 1$ to $I$) of the input parameter vector sequence $\{a_1, a_2, \ldots, a_I\}$ are likewise divided into $K$ groups. Each of these groups is fed to one input of the corresponding partial distance calculator in the partial distance calculation unit 5, and the corresponding group of the standard pattern vector is fed to the other input. Expressed as a city-block distance, the distance for the first group is then

$$c_{1,d} = \sum_{v=1}^{S} \left| a_{i,v} - b^l_{n,(v+d)} \right| \qquad (1)$$

where $d$ is the shift amount applied in the phase shift unit 4. The distances $c_{k,d}$ ($k = 1$ to $K$) are defined in the same way up to the $K$-th group.
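Equation (1) can be sketched in Python as follows. The patent does not say what happens when the shifted index $v + d$ falls outside the vector, so this sketch assumes the index is clamped to the nearest edge channel; that choice, the function name, and the toy spectra are all illustrative assumptions. Indices are 0-based here, unlike the 1-based indices in the text.

```python
from typing import Sequence


def partial_distance(a: Sequence[float], b: Sequence[float],
                     start: int, stop: int, d: int) -> float:
    """c_{k,d}: city-block distance between group [start, stop) of the input
    vector a and the same group of the standard vector b read at offset d."""
    last = len(b) - 1
    total = 0.0
    for v in range(start, stop):
        w = min(max(v + d, 0), last)  # ASSUMPTION: clamp at the vector edges
        total += abs(a[v] - b[w])
    return total


# Hypothetical 4-channel group: same peak shape, one channel apart.
a_i = [0, 5, 1, 0]
b_n = [0, 1, 5, 0]

print(partial_distance(a_i, b_n, 0, 4, -1))  # 10.0
print(partial_distance(a_i, b_n, 0, 4, 0))   # 8.0
print(partial_distance(a_i, b_n, 0, 4, 1))   # 2.0
```

With no shift the two peaks miss each other and the distance is large; at $d = 1$ the peaks line up and only the residual shape difference remains.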

The partial decision unit 6 determines the minimum, $c_{k,d_{\min}}$, of the distances $c_{k,d}$ supplied in turn by the partial distance calculation unit 5 (where $-D_1 \le d \le D_2$, and $D_1$ and $D_2$ are constants that set the amount of shift), and passes it to the total distance calculation unit 7. The total distance calculation unit 7 obtains the sum of the $K$ partial distances supplied by the partial decision unit 6, accumulates this sum over $m = 1$ to $M$ of the standard pattern vector sequence $\{b^l_1, b^l_2, \ldots, b^l_n, \ldots, b^l_M\}$, and outputs the result to the overall decision unit 8 as the distance $c_l$ from the input pattern vector sequence $\{a_1, a_2, \ldots, a_I\}$. That is,

$$c_l = \sum_{m=1}^{M} \sum_{k=1}^{K} c_{k,d_{\min}} \qquad (2)$$

The overall decision unit 8 performs the above operation for $l = 1$ to $P$ of the standard pattern vector sequences and outputs the standard pattern vector sequence that gives the minimum distance to signal line 9 as the recognition result.
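Putting the partial decision, Equation (2), and the final decision together gives roughly the following. This is a sketch under stated assumptions rather than the patented circuit: shifted indices are clamped at the vector edges, the input and the template are assumed to have the same number of frames ($I = M$) so that frames are paired one to one, and the labels and toy spectra are invented.

```python
from typing import Dict, List, Sequence, Tuple

Frame = Sequence[float]


def partial_distance(a: Frame, b: Frame, start: int, stop: int, d: int) -> float:
    """c_{k,d} for one group; shifted index clamped at the edges (assumption)."""
    last = len(b) - 1
    return float(sum(abs(a[v] - b[min(max(v + d, 0), last)])
                     for v in range(start, stop)))


def frame_distance(a: Frame, b: Frame, bounds: List[Tuple[int, int]],
                   d1: int, d2: int) -> float:
    """Sum over groups of the minimum c_{k,d} for -d1 <= d <= d2."""
    return sum(min(partial_distance(a, b, s, e, d) for d in range(-d1, d2 + 1))
               for s, e in bounds)


def sequence_distance(seq_a: Sequence[Frame], seq_b: Sequence[Frame],
                      bounds: List[Tuple[int, int]], d1: int, d2: int) -> float:
    """c_l of Equation (2), pairing frame m of the input with frame m of the template."""
    if len(seq_a) != len(seq_b):
        raise ValueError("this sketch assumes I == M (no time alignment)")
    return sum(frame_distance(a, b, bounds, d1, d2) for a, b in zip(seq_a, seq_b))


def recognize(seq_a: Sequence[Frame], templates: Dict[str, Sequence[Frame]],
              bounds: List[Tuple[int, int]], d1: int, d2: int) -> str:
    """Return the label of the template with the minimum c_l."""
    return min(templates,
               key=lambda label: sequence_distance(seq_a, templates[label], bounds, d1, d2))


# Two hypothetical one-frame templates with 8 channels, split into K = 2 groups.
templates = {
    "a": [[0, 5, 0, 0, 0, 0, 5, 0]],
    "i": [[5, 0, 0, 0, 0, 0, 0, 5]],
}
bounds = [(0, 4), (4, 8)]
# Input: the "a" shape with its lower peak moved up and its upper peak moved down.
utterance = [[0, 0, 5, 0, 0, 5, 0, 0]]

print(sequence_distance(utterance, templates["a"], bounds, 1, 1))  # 0.0
print(sequence_distance(utterance, templates["i"], bounds, 1, 1))  # 10.0
print(recognize(utterance, templates, bounds, 1, 1))               # a
```

In this toy case an unshifted city-block distance gives 20 for both templates, so it cannot separate them, whereas the per-group shifts recover the intended match.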

Next, the shift operation of the phase shift unit 4 and the pattern comparison are explained further with reference to FIG. 3.

FIG. 3A shows one vector $a_i$ of the input pattern vector sequence uttered by a speaker A, where $a_i = \{a_{i,1}, a_{i,2}, \ldots, a_{i,N}\}$.

FIG. 3C shows the vector $b^l_n$ in the standard pattern vector sequence that corresponds to the input pattern vector, where $b^l_n = \{b^l_{n,1}, \ldots, b^l_{n,N}\}$. The $K$ blocks into which it is divided are denoted B1 to B4.

The operation of calculating the partial distance described above,

$$c_{1,d} = \sum_{v=1}^{S} \left| a_{i,v} - b^l_{n,(v+d)} \right| \qquad (-D_1 \le d \le D_2),$$

is then nothing other than shifting block B1 of the standard pattern from left to right one sample at a time, as shown in FIG. 3B, and calculating the distance at each position in turn.

As described above, this embodiment provides the phase shift unit 4, which divides the standard pattern vector into $K$ equal parts and shifts them in turn; the partial distance calculation unit 5, which calculates in turn the distance between the input pattern, likewise divided into $K$ equal parts, and the output of the phase shift unit 4; and the partial decision unit 6, which determines the minimum of those distances. By comparing each part of the standard pattern with the input pattern while translating it, the embodiment realizes a pattern comparison method that compensates for the displacement of formant positions caused by differences between speakers.

The city-block distance $c_{k,d}$ given by Equation (1) in the text may be replaced by another distance measure, such as the Euclidean distance or an LPC distance, and the device can be realized in the same way.
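One way to picture this interchangeability is to pass the per-element cost in as a function, as in the sketch below. The function name, the edge clamping of shifted indices, and the toy vectors are assumptions for illustration; an LPC distance is not shown, since it is defined on LPC coefficient sets rather than element by element.

```python
from typing import Callable, List, Sequence, Tuple


def shifted_group_distance(a: Sequence[float], b: Sequence[float],
                           bounds: List[Tuple[int, int]], d1: int, d2: int,
                           cost: Callable[[float, float], float]) -> float:
    """Sum over groups of the minimum shifted distance, with a pluggable cost."""
    last = len(b) - 1
    total = 0.0
    for start, stop in bounds:
        total += min(
            # ASSUMPTION: shifted indices are clamped to the vector edges.
            sum(cost(a[v], b[min(max(v + d, 0), last)]) for v in range(start, stop))
            for d in range(-d1, d2 + 1)
        )
    return total


def abs_cost(x: float, y: float) -> float:
    return abs(x - y)          # city-block term, as in Equation (1)


def sq_cost(x: float, y: float) -> float:
    return (x - y) ** 2        # squared-difference (Euclidean) term


a_i = [0, 2, 0, 0]
b_n = [0, 0, 4, 0]
one_group = [(0, 4)]

print(shifted_group_distance(a_i, b_n, one_group, 1, 1, abs_cost))  # 2.0
print(shifted_group_distance(a_i, b_n, one_group, 1, 1, sq_cost))   # 4.0
```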

In addition, the calculation of the cumulative distance $c_l$ given by Equation (2) in the total distance calculation unit may be combined with techniques such as linear expansion and contraction of the time axis or DP matching.
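As a rough illustration of combining the frame distance with DP matching, the sketch below uses a textbook dynamic-programming recursion with three allowed predecessors. The patent does not specify path constraints or weights, so the recursion, the function names, and the one-channel toy sequences are assumptions; any frame distance, including a group-shifted one, could be passed in as `frame_dist`.

```python
from typing import Callable, Sequence

Frame = Sequence[float]


def dp_matching(seq_a: Sequence[Frame], seq_b: Sequence[Frame],
                frame_dist: Callable[[Frame, Frame], float]) -> float:
    """Cumulative distance along the best time alignment of two sequences."""
    n_a, n_b = len(seq_a), len(seq_b)
    inf = float("inf")
    # g[i][m] = best cumulative distance aligning the first i input frames
    # with the first m template frames.
    g = [[inf] * (n_b + 1) for _ in range(n_a + 1)]
    g[0][0] = 0.0
    for i in range(1, n_a + 1):
        for m in range(1, n_b + 1):
            cost = frame_dist(seq_a[i - 1], seq_b[m - 1])
            g[i][m] = cost + min(g[i - 1][m], g[i][m - 1], g[i - 1][m - 1])
    return g[n_a][n_b]


def city_block(a: Frame, b: Frame) -> float:
    return sum(abs(x - y) for x, y in zip(a, b))


# The input holds its middle frame one step longer than the template.
seq_a = [[0], [1], [1], [2]]
seq_b = [[0], [1], [2]]

print(dp_matching(seq_a, seq_b, city_block))  # 0.0
```

Unlike the frame-by-frame accumulation of Equation (2), this form does not require the input and the template to have the same number of frames.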

Effects of the Invention

As described above, the speech recognition device of the present invention divides the standard pattern and the input pattern into $K$ groups, translates each group individually, and takes the sum of the distances at the positions where each group's distance is smallest as the distance between the two patterns. This reduces the distance error caused by individual differences in formants and improves the recognition rate in speaker-independent speech recognition, and its industrial value is considerable.

[Brief explanation of the drawing]

FIGS. 1A to 1D are waveform diagrams for explaining differences in spectral shape, FIG. 2 is a block diagram of a speech recognition device according to one embodiment of the present invention, and FIG. 3 is a waveform diagram for explaining the pattern comparison method of the same embodiment.

1: parameter analysis unit; 2: switch; 3: pattern storage unit; 4: phase shift unit; 5: partial distance calculation unit; 6: partial decision unit; 7: total distance calculation unit; 8: overall decision unit.

Claims

[Claims]

1. A speech recognition device comprising: frequency analysis means that frequency-analyzes an input speech signal and outputs an $N$-dimensional feature vector sequence $\{a_1, a_2, \ldots, a_I\}$; storage means that stores $P$ standard pattern vector sequences $\{b^1_1, b^1_2, \ldots, b^1_J\}, \ldots, \{b^P_1, b^P_2, \ldots, b^P_K\}$ that have been frequency-analyzed in advance; comparison means that, for one element vector $a_i$ ($i = 1$ to $I$) of the input feature vector sequence $\{a_1, a_2, \ldots, a_I\}$ and one element vector $b^l_n$ ($l = 1$ to $P$) of the standard pattern vector sequences $\{b^1_1, b^1_2, \ldots, b^1_J\}, \ldots, \{b^P_1, b^P_2, \ldots, b^P_K\}$, divides the elements $\{b^l_{n,1}, b^l_{n,2}, \ldots, b^l_{n,N}\}$ of $b^l_n$ into groups of adjacent frequencies $\{b^l_{n,1}, b^l_{n,2}, \ldots, b^l_{n,S}\} \ldots \{b^l_{n,t}, b^l_{n,t+1}, \ldots, b^l_{n,N}\}$, translates each group around the corresponding frequency band of $a_i$ divided in the same way, finds the correspondence that minimizes the distance, takes the sum of the distances of the groups at that point as the distance between the vectors $a_i$ and $b^l_n$, and uses this measure to compare the output $\{a_1, a_2, \ldots, a_I\}$ of the frequency analysis means with each of the standard pattern vector sequences $\{b^1_1, b^1_2, \ldots, b^1_J\}, \ldots, \{b^P_1, b^P_2, \ldots, b^P_K\}$; and decision means that takes, as the recognition result, the standard pattern vector sequence $\{b^l_1, b^l_2, \ldots, b^l_M\}$ that gives the minimum distance as a result of the comparison.