JPH0449718B2

Info

Publication number: JPH0449718B2
Application number: JP58048105A
Authority: JP
Inventors: Seiichi Nakagawa; Hidekazu Tsuboka
Original assignee: Matsushita Electric Industrial Co Ltd
Current assignee: Panasonic Holdings Corp
Priority date: 1983-03-22
Filing date: 1983-03-22
Publication date: 1992-08-12
Also published as: JPS59173883A

Description

[Detailed description of the invention]

Field of industrial application

The present invention relates to a pattern comparison device, and in particular to a pattern comparison device applicable to speech recognition.

Conventional configuration and its problems

A speech recognition device based on pattern matching generally has the following configuration. It consists of: feature extraction means that converts the input speech signal into a sequence of feature vectors by means of a filter bank, frequency analysis, LPC analysis, or the like; standard pattern storage means in which the feature vector sequences extracted by the feature extraction means from utterances made in advance are registered as standard patterns for all words to be recognized; pattern comparison means that calculates the similarity or distance, as feature vector sequences, between the input pattern extracted by the feature extraction means from an utterance to be recognized and every standard pattern stored in the standard pattern storage means; and determination means that outputs, as the recognition result, the word corresponding to the standard pattern found by the pattern comparison to have the highest similarity (the smallest distance).

Even when the same speaker utters the same word, the duration of the utterance differs each time. When the pattern comparison means compares a standard pattern with the input pattern, the time axes of the two must therefore be expanded or contracted so that the patterns are compared at equal length. Because the change in duration is not uniform across the parts of the uttered word, each part must be expanded or contracted non-uniformly. The best results are obtained when this warping is chosen so that the similarity between the two patterns is maximized (that is, the distance is minimized; the description below is given in terms of distance).
Devices that use dynamic programming to perform this kind of matching efficiently are common (this matching is hereinafter called DP matching).

DP matching can be explained with a lattice graph. Fig. 1 is such a graph: the horizontal axis is the i coordinate corresponding to the input pattern $T = a_1 a_2 \cdots a_I$, and the vertical axis is the j coordinate corresponding to the standard pattern $R^n = b^n_1 b^n_2 \cdots b^n_{J^n}$. Matching the input pattern T against a standard pattern with nonlinear warping of the time axis means determining on this lattice graph, by some evaluation criterion, a path 1 that gives the correspondence between the feature vectors of the two patterns, and evaluating the distance between the two patterns along that path. Constraints that reflect the nature of speech are imposed when the path is determined. Fig. 2a is one example of such constraints. In this example the path reaching point (i, j) can only be one of the following: path 2, from point (i−2, j−1) through point (i−1, j); path 3, from point (i−1, j−1); or path 4, from point (i−1, j−2) through point (i, j−1). If it is further required that the start points of the input pattern and the standard pattern correspond, the matching path is confined to the shaded region of Fig. 1. This restriction prevents extreme correspondences, on the grounds that however much the time axis is warped, the same word should not be stretched or compressed that drastically.

Let $d^n(i, j)$ be the distance between the vectors $a_i$ and $b^n_j$. The distance between the input pattern T and the standard pattern $R^n$ along a path is defined as the weighted average of $d^n(i, j)$ along that path. The labels a, b, c, d and e on the paths of Fig. 2 are the weights applied when the corresponding path is chosen. For DP matching to be applicable, the weights must be chosen so that the sum of the weights along the path is the same whichever path is selected on the lattice graph under the above constraints. With a = c = e = 2 and b = d = 1 the weight sum is $I + J^n$; with a = b = c = 1 and d = e = 0.5 the weight sum is $J^n$. In both cases it is constant regardless of how the path is chosen.
Both of these choices are commonly used. Furthermore, provided the weight sum remains constant, the weight can be made a function of j, which makes it possible to weight more heavily those parts of the path that should be emphasized in the matching.

The distance between the input pattern T and the standard pattern $R^n$ is defined as the minimum, under the above constraints, of the weighted average of the inter-vector distances $d^n(i, j)$. That is, the minimum of the weighted average, and the path that gives it, can be determined by solving the following recurrence:

$$
g^n(i, j) = \min
\begin{cases}
g^n(i-2, j-1) + a\,d^n(i-1, j) + b\,d^n(i, j) \\
g^n(i-1, j-1) + c\,d^n(i, j) \\
g^n(i-1, j-2) + e\,d^n(i, j-1) + d\,d^n(i, j)
\end{cases}
\qquad (1)
$$

with the initial condition $g^n(1, 1) = d^n(1, 1)$ and $D(T, R^n) = g^n(I, J^n) / (\text{sum of weights})$. Here $g^n(i, j)$ is the minimum weighted sum of the inter-vector distances $d^n(i, j)$ from the start point to point (i, j), and $D(T, R^n)$ is the distance between the input pattern T and the standard pattern $R^n$.

Many other path selection conditions are conceivable. Figs. 2b to 2j are further examples, and other variations are possible. The recurrence is rewritten to match whichever path selection conditions are used.
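Recurrence (1) can be sketched in a few lines of Python. This is a minimal illustration under stated assumptions rather than the device itself: the names `city_block` and `dp_match` are hypothetical, the default weights are the symmetric choice a = c = e = 2, b = d = 1, and the default normalization by I + J is the weight sum quoted in the text for that choice. The weights are called `wa` to `we` so that they do not clash with the distance d.

```python
import math

def city_block(u, v):
    """City-block (L1) distance between two feature vectors."""
    return sum(abs(x - y) for x, y in zip(u, v))

def dp_match(T, R, wa=2.0, wb=1.0, wc=2.0, wd=1.0, we=2.0,
             weight_sum=None, dist=city_block):
    """Distance D(T, R) obtained from recurrence (1).

    Returns math.inf when the end point (I, J) cannot be reached
    under the path constraints of Fig. 2a.
    """
    I, J = len(T), len(R)
    # Index 0 is left unused so the code keeps the 1-based indices of the text.
    d = [[0.0] * (J + 1) for _ in range(I + 1)]
    for i in range(1, I + 1):
        for j in range(1, J + 1):
            d[i][j] = dist(T[i - 1], R[j - 1])
    g = [[math.inf] * (J + 1) for _ in range(I + 1)]
    g[1][1] = d[1][1]  # initial condition given with (1)
    for i in range(2, I + 1):        # every branch needs i >= 2
        for j in range(2, J + 1):    # every branch needs j >= 2
            best = g[i - 1][j - 1] + wc * d[i][j]                 # path 3
            if i >= 3:                                            # path 2
                best = min(best, g[i - 2][j - 1] + wa * d[i - 1][j] + wb * d[i][j])
            if j >= 3:                                            # path 4
                best = min(best, g[i - 1][j - 2] + we * d[i][j - 1] + wd * d[i][j])
            g[i][j] = best
    if weight_sum is None:
        weight_sum = I + J  # assumption: symmetric weights
    return g[I][J] / weight_sum

word = [[0.0], [1.0], [2.0]]
print(dp_match(word, word))  # identical patterns give 0.0
```

Because every branch advances i by at least one and j by at least one, a pattern that is more than about twice as long as the other cannot be aligned, and the sketch reports this as an infinite distance.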
As noted above, the weight can be made a function of j so that the matching result on part of the matching path is emphasized; the weights on the path may then be set, for example, as shown in Fig. 3. In case a of that figure, the sum of the weights along the path from the start point to the end point of the matching is $I + \sum_{j=1}^{J^n} W^n_j$, and in case b it is $\sum_{j=1}^{J^n} W^n_j$. That is, the weight sum along the path depends on both the input pattern length and the standard pattern length in case a, and only on the standard pattern length in case b. As explained above, $\sum_{j=1}^{J^n} W^n_j$ is constant regardless of how the path is chosen.

For case a, for example, the recurrence for the cumulative distance is equation (1) with $a = 1 + \tfrac{1}{2}W^n_j$, $b = 1 + \tfrac{1}{2}W^n_j$, $c = 1 + W^n_j$, $d = \tfrac{1}{2}W^n_j$ and $e = 1 + \tfrac{1}{2}W^n_j$. The distance between the input pattern T and the standard pattern $R^n$ is then

$$
D(T, R^n) = \frac{g^n(I, J^n)}{I + \sum_{j=1}^{J^n} W^n_j} \qquad (2)
$$

The index $\hat{n} = \arg\min_n D(T, R^n)$ is found, and the word corresponding to the standard pattern $R^{\hat{n}}$ is taken as the recognition result. The notation $\arg\min_x f(x)$ denotes the x that minimizes f(x). If matching that emphasizes the consonant part of a word is wanted, it suffices to make the weights $W^n_j$ large for the frames j that correspond to the consonant part of the standard pattern. Since $W^n_j$ can be set frame by frame, the weighting best suited to each standard pattern can be applied in fine detail. If $\sum_{j=1}^{J^n} W^n_j$ is made constant regardless of n, equation (2) can be replaced by $D(T, R^n) = g^n(I, J^n)$.

Weighted DP matching thus has advantages over ordinary DP matching, which evaluates all frames equally. It nevertheless has the following problem. The matching path obtained when, for example, the consonant part is emphasized generally differs from the path obtained when all frames are evaluated equally, and the recognition results can be expected to differ as well. Evaluating all frames equally yields the distance for the best match of the word as a whole, whereas matching that emphasizes the consonant part yields the distance for the best local match; the standard pattern that is closest overall and the standard pattern that is closest locally are in general different. Merely introducing a weighting method therefore does not necessarily improve the recognition rate.

Object of the invention

The object of the present invention is to solve the above drawback and to provide a pattern comparison device that can obtain highly accurate recognition results.

Structure of the invention

The pattern comparison device of the present invention performs matching under a plurality of kinds of weighting and performs recognition by judging the respective matching results comprehensively. The matching path may be obtained independently for each weighting, with recognition based on the matching results along the respective paths; or a path may be obtained for one particular weighting method, with recognition based on the distances between the standard pattern and the input pattern computed along that path under the various weighting methods. For the comprehensive evaluation of the plural matching results, one method takes the weighted average of the results obtained with the various weighting methods as the final distance and takes the standard pattern with the smallest such distance as the recognition result; another selects several candidate recognition results with one particular weighting method and then, for each candidate obtained, arrives at the final recognition result using the other weighting methods. By computing the cumulative distances for all weighting methods at every input frame, the inter-vector distance $d^n(i, j)$ needs to be calculated only once for each lattice point, and the recognition result is available as soon as the input ends.

Description of embodiments

Fig. 4 shows a first embodiment of the present invention. In the figure, 5 is an input terminal for the speech signal, and 6 is a feature extraction unit that converts the input speech signal into a sequence of feature vectors. 7 is a standard pattern storage unit, in which the feature vector sequences obtained by the feature extraction unit 6 for the words to be recognized are stored as standard patterns before recognition. 8 is an inter-vector distance calculation unit, which at the i-th input frame obtains the inter-vector distance $d^n(i, j)$ for n = 1, 2, ..., N and j = 1, 2, ..., $J^n$. The simplest choice for $d^n(i, j)$ is the city-block distance: with $a_i = (a_{i1}, a_{i2}, \ldots, a_{in})$ and $b^n_j = (b^n_{j1}, b^n_{j2}, \ldots, b^n_{jn})$,

$$
d^n(i, j) = \sum_{k=1}^{n} \left| a_{ik} - b^n_{jk} \right|
$$

(in this expression the subscript n is the dimension of the feature vectors and k is the component index).

9 is an inter-vector distance storage unit, which holds the inter-vector distances $d^n(i, j)$ calculated by the inter-vector distance calculation unit 8, for n = 1, 2, ..., N and j = 1, 2, ..., $J^n$, until they are no longer needed. When the path constraints are chosen as in Fig. 3, it holds the inter-vector distances for two frames. The inter-vector distance storage unit 9 consists of two storage areas, VDM1 and VDM2: VDM1 holds the inter-vector distances of the current frame i and VDM2 those of the previous frame. Each time the input advances by one frame, the contents of VDM1 are moved to VDM2 and the new inter-vector distances are stored in VDM1.

10 is a weighting coefficient storage unit. This embodiment is described for the case of weighting as in Fig. 3a. Let K be the number of kinds of weighting used in matching against one standard pattern, and let $W^n_{kj}$ be the k-th weight for the j-th frame of the standard pattern of the n-th word. The weighting coefficient storage unit 10 stores the weighting coefficients $W^n_{kj}$ for n = 1, 2, ..., N; k = 1, 2, ..., K; j = 1, 2, ..., $J^n$.

11 to 13 are cumulative distance calculation units, one for each of the K kinds of weighting coefficients. Cumulative distance calculation unit k (k = 1, 2, ..., K) calculates, for n = 1, 2, ..., N, the weighted sum $g^n_k(i, j)$ of the inter-vector distances from the start point (1, 1) to (i, j) under the weighting coefficients $W^n_{kj}$. That is, under the constraints of Fig. 3a it computes the recurrence of equation (3).

Fig. 5 shows the detailed configuration of cumulative distance calculation unit k. The recurrence calculation unit 103 is the part that computes equation (3). 101 and 102 are terminals that receive the contents of the inter-vector distance storage unit 9, and 100 is a terminal that receives the contents of the weighting coefficient storage unit 10. 104 is a cumulative distance storage unit, which holds the recurrence values needed by the recurrence calculation unit 103 until they are no longer required.
ADM1 holds the cumulative distances $g(i, j)$ (n = 1, 2, ..., N; j = 1, 2, ..., $J^n$) for the current frame i, and ADM2 holds the cumulative distances $g(i-1, j)$ (n = 1, 2, ..., N; j = 1, 2, ..., $J^n$) for the previous frame i−1. Each time the input advances by one frame, the contents of ADM1 are moved to ADM2 and the newly computed cumulative distances are stored in ADM1. The recurrence calculation unit 103 evaluates the recurrence of equation (3) from the cumulative distances held in ADM1 and ADM2 and the inter-vector distances held in VDM1 and VDM2.

In this way, the inter-vector distances in the inter-vector distance calculation unit 8 and the cumulative distances in the cumulative distance calculation units 11 to 13 are computed frame by frame for n = 1, 2, ..., N and j = 1, 2, ..., $J^n$. As soon as the input is complete, the final cumulative distances $g^n_k(I, J^n)$ under the K kinds of weighting, for n = 1, 2, ..., N, are held in ADM1 of the cumulative distance storage unit 104. In Fig. 5, 105 is a cumulative distance normalization unit that normalizes the $g^n_k(I, J^n)$ obtained in this way. When the end of the speech input is signalled at terminal 107 and the total number of frames is supplied at terminal 108, the contents of ADM1 are normalized and the normalized result is output from terminal 106 to the determination unit 14 of the next stage. The normalized result is

$$
D^n_k(I, J^n) = \frac{g^n_k(I, J^n)}{I + \sum_{j=1}^{J^n} W^n_{kj}}
$$

In Fig. 4, 16 is a speech interval detection unit, which detects the start and end of the input speech; a known method based on the power of the input speech or the like can be applied. 17 is a frame counting unit, which counts once per frame from the start of the speech interval and finally yields the length of the speech interval.

14 is a determination unit that obtains the final recognition result from the normalized cumulative distances $D^n_k(I, J^n)$ obtained as described above. The following determination methods are possible.

(i) The distance $D^n$ of the input signal from the standard pattern $R^n$ is taken to be the weighted average of $D^n_k(I, J^n)$ over k, with second weighting coefficients $a_k$, and the word corresponding to the standard pattern $R^n$ for the n that minimizes $D^n$ is taken as the recognition result.

(ii) For the normalized cumulative distances $D^n_{k_0}(I, J^n)$ obtained with the weighting coefficients $W^n_{k_0}(j)$, the standard patterns $R^{q(1)}, R^{q(2)}, \ldots, R^{q(l)}$ that give the smallest through the l-th smallest values are found. For these standard patterns, the weighted average of $D^n_k(I, J^n)$ is formed as in (i), either including or excluding $D^n_{k_0}(I, J^n)$, and the word corresponding to the standard pattern for which it is smallest is taken as the recognition result.

In the embodiment above, the matching path for each weight $W^n_{kj}$ applied to the standard pattern $R^n$ was obtained independently for each k.
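The two determination methods (i) and (ii) described above can be sketched as follows. The function names, the 0-based word and weighting indices, and the division by the sum of the second weighting coefficients are assumptions made for this example; the text only says that a weighted average with coefficients a_k is used. `D[n][k]` stands for the normalized cumulative distance of word n under weighting k.

```python
def weighted_average(row, alpha, ks):
    """Weighted average of row[k] over the weightings in ks."""
    total = sum(alpha[k] for k in ks)
    if total == 0:
        raise ValueError("no weighting left to average over")
    return sum(alpha[k] * row[k] for k in ks) / total

def decide_by_weighted_average(D, alpha):
    """Method (i): word index n that minimizes the weighted average over k."""
    ks = range(len(alpha))
    return min(range(len(D)), key=lambda n: weighted_average(D[n], alpha, ks))

def decide_two_stage(D, alpha, k0, l, include_k0=True):
    """Method (ii): preselect the l best words under weighting k0, then
    choose among them by the weighted average, with or without k0."""
    candidates = sorted(range(len(D)), key=lambda n: D[n][k0])[:l]
    ks = [k for k in range(len(alpha)) if include_k0 or k != k0]
    return min(candidates, key=lambda n: weighted_average(D[n], alpha, ks))

# Three words, two weightings (for example uniform and consonant-emphasized).
D = [[0.30, 0.90],
     [0.40, 0.50],
     [0.35, 0.20]]
alpha = [1.0, 1.0]
print(decide_by_weighted_average(D, alpha))    # 2
print(decide_two_stage(D, alpha, k0=0, l=1))   # 0
print(decide_two_stage(D, alpha, k0=0, l=2))   # 2
```

In this toy data word 0 is best under the first weighting alone, but word 2 wins once both weightings are combined, which is the situation the comprehensive judgment is meant to handle.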
Instead of obtaining the paths independently, it is also possible, for each standard pattern n, to compute the cumulative distances under the other weights $W^n_{kj}$ along the matching path obtained for the weight $W^n_{k_0 j}$. In that case the recurrence of equation (3) is changed as follows.

[Table]

In this case, the configuration of Fig. 4 is modified so that the cumulative distance calculation unit 11 computes equation (4), a signal line 18 is added to notify the path obtained there to the other cumulative distance calculation units 12 to 13, and the cumulative distance calculation units 12 to 13 compute equation (5). The determination processing is the same as described above.

Fig. 6 expresses the operation described above as a program, for the case in which the cumulative distances are computed independently for each kind of weighting coefficient. A software implementation can simply follow this program. In the program listing, the loop notation means that B is executed while A is true.

Step 200 performs the initialization that precedes the recurrence calculation in the recurrence calculation unit 103. Step 201 matches the input pattern against all the standard patterns and yields the cumulative distance between the input pattern and each standard pattern for each kind of weighting. Step 202 normalizes the cumulative distances by the sum of the weighting coefficients along the path, and corresponds to the processing performed by the cumulative distance normalization unit 105 of the embodiment above.

Step 203 is the processing performed at every input frame: for all standard patterns n = 1, 2, ..., N, it obtains the inter-vector distances at all frames of the standard pattern and the cumulative distances for each kind of weighting coefficient. This is the processing performed by the inter-vector distance calculation unit 8 and the cumulative distance calculation units 11 to 13 of the embodiment above.

Fig. 7 shows the case in which, for each standard pattern, the matching path is fixed to the one obtained with a certain weighting coefficient and the cumulative distances are computed while the weighting coefficients are varied. In this example, the steps that carry the same numbers as in Fig. 6 have the same functions as in Fig. 6. The only difference is the content of step 203, which has already been explained.

In this embodiment the cumulative distance normalization unit 105 is provided, but if the weighting coefficients are chosen so that $\sum_{j=1}^{J^n} W^n_j$ is constant, this normalization is unnecessary.

Effects of the invention

The pattern comparison device of the present invention introduces various weighting coefficients into the matching path and judges the results comprehensively. Depending on the word, the parts that should be emphasized locally are emphasized while the matching result for the word as a whole is also taken into account, so that more accurate recognition results can be obtained. In addition, because the calculations for all words to be recognized and all weighting coefficients are completed at every input frame, real-time processing is possible.
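The two ways of obtaining the matching path described in the embodiments (independently for each weighting, as in Fig. 6, or fixed by one reference weighting k0 with the other weightings accumulated along that path, as in Fig. 7) can be compared with the following Python sketch for a single standard pattern. Equations (3) to (5) and the coefficients of Fig. 3 are not reproduced in the text, so the step weights in `moves` are an assumption: they reuse the three branches of recurrence (1) and are chosen so that every admissible path has the weight sum I + sum(W[k]), which requires starting from (1 + W[k][0])·d(1, 1) rather than d(1, 1). The names `moves` and `normalized_distances` are hypothetical, indices are 0-based, and the full lattice is kept in memory instead of two frames.

```python
import math

def moves(d, w, i, j):
    """Admissible branches into lattice point (i, j) as tuples
    (previous i, previous j, weighted increment). Assumed step weights."""
    out = []
    if i >= 2 and j >= 1:   # from (i-2, j-1) through (i-1, j)
        out.append((i - 2, j - 1, (1 + w[j]) * d[i - 1][j] + d[i][j]))
    if i >= 1 and j >= 1:   # from (i-1, j-1)
        out.append((i - 1, j - 1, (1 + w[j]) * d[i][j]))
    if i >= 1 and j >= 2:   # from (i-1, j-2) through (i, j-1)
        out.append((i - 1, j - 2,
                    (1 + w[j - 1]) * d[i][j - 1] + w[j] * d[i][j]))
    return out

def normalized_distances(d, W, k0=None):
    """Normalized cumulative distance for each of the K weightings in W.

    d[i][j] is the inter-vector distance, computed once and shared by all
    weightings. With k0=None each weighting picks its own best branch;
    otherwise weighting k0 picks the branch and the others follow it.
    """
    I, J, K = len(d), len(d[0]), len(W)
    g = [[[math.inf] * J for _ in range(I)] for _ in range(K)]
    for k in range(K):
        g[k][0][0] = (1 + W[k][0]) * d[0][0]
    for i in range(1, I):
        for j in range(1, J):
            for k in range(K):
                ref = k if k0 is None else k0
                ref_moves = moves(d, W[ref], i, j)
                own_moves = moves(d, W[k], i, j)  # same branches, own weights
                best = min(range(len(ref_moves)),
                           key=lambda m: g[ref][ref_moves[m][0]][ref_moves[m][1]]
                           + ref_moves[m][2])
                pi, pj, inc = own_moves[best]
                g[k][i][j] = g[k][pi][pj] + inc
    return [g[k][I - 1][J - 1] / (I + sum(W[k])) for k in range(K)]

T = [0.0, 1.0, 3.0, 4.0]                       # input pattern (scalar features)
R = [0.0, 2.0, 3.0, 4.0]                       # one standard pattern
d = [[abs(t - r) for r in R] for t in T]
W = [[1, 1, 1, 1], [1, 3, 1, 1]]               # uniform, and frame 2 emphasized
independent = normalized_distances(d, W)
fixed_path = normalized_distances(d, W, k0=0)
```

Under these assumptions the fixed-path distance for a weighting can never be smaller than its independently optimized distance, since the fixed path is just one of the admissible paths; the two coincide for the reference weighting itself.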

[Brief explanation of the drawing]

Fig. 1 is a diagram explaining DP matching. Figs. 2a to 2j are diagrams showing examples of constraints on the matching path in DP matching. Fig. 3 is a diagram showing an example of weighting of the matching path for DP matching with local emphasis. Fig. 4 is a block diagram showing the configuration of a pattern comparison device according to one embodiment of the present invention. Fig. 5 is a block diagram showing details of the cumulative distance calculation unit in the same embodiment. Fig. 6 is a diagram showing the operation of the same embodiment as a program. Fig. 7 is a diagram showing the operation of another embodiment as a program.

6: feature extraction unit; 7: standard pattern storage unit; 8: inter-vector distance calculation unit; 9: inter-vector distance storage unit; 10: weighting coefficient storage unit; 11 to 13: cumulative distance calculation units; 14: determination unit; 103: recurrence calculation unit; 104: cumulative distance storage unit.

Claims

[Claims]

1. A pattern comparison device characterized by comprising: feature extraction means for converting an input signal into an input pattern T consisting of a series of feature vectors $a_1, a_2, \ldots, a_I$; standard patterns $R^n$ (where n = 1, 2, ..., N), each consisting of a series of feature vectors $b^n_1, b^n_2, \ldots, b^n_j, \ldots, b^n_{J^n}$; a plurality of types of weighting coefficients $W^n_1(1), W^n_1(2), \ldots, W^n_1(J^n)$; $W^n_2(1), W^n_2(2), \ldots, W^n_2(J^n)$; ...; $W^n_k(1), \ldots, W^n_k(J^n)$; inter-vector distance calculation means for calculating the distance $d^n(i, j)$ between the vectors $a_i$ and $b^n_j$ on a lattice graph with the frames of the input pattern T on the horizontal axis and the frames of the standard pattern $R^n$ on the vertical axis; inter-vector distance storage means for storing this inter-vector distance; recurrence formula calculation means for calculating cumulative distances; cumulative distance storage means for storing the plurality of types of cumulative distances obtained by the recurrence formula calculation means; and determination means for obtaining a final recognition result from the plurality of cumulative distances.

2. The pattern comparison device according to claim 1, characterized in that the inter-vector distance calculation means calculates the inter-vector distance between the input pattern and the standard pattern for each frame of the input pattern and for each frame of all the standard patterns, and the recurrence formula calculation means obtains the distance between the input pattern and the standard pattern for each frame of the input pattern and for each frame of all the standard patterns.