JPH0465396B2

JPH0465396B2 -

Info

Publication number: JPH0465396B2
Application number: JP62061735A
Authority: JP
Inventors: Hiroaki Sekoe
Original assignee: Nippon Electric Co Ltd
Current assignee: NEC Corp
Priority date: 1987-03-16
Filing date: 1987-03-16
Publication date: 1992-10-19
Also published as: JPS63226696A

Description

【発明の詳細な説明】（産業上の利用分野）本発明は人間が発声した音声を自動認識する音
声認識等の主要処理であるパタースマツチング方
式に関する。DETAILED DESCRIPTION OF THE INVENTION (Field of Industrial Application) The present invention relates to a pattern matching method, which is a main processing such as speech recognition, which automatically recognizes speech uttered by a human being.

（従来の技術）音声認識のパターンマツチングに関しては種々
の技術が開発されているが、それらの中で最も重
用されているものの一つとして「日本音響学会誌
第42巻９号（昭和61年９月発行の第725頁」に記
載される如きDPマツチング法がある。これは音
声の時間軸歪を整合する手法として極めて有効と
されている。また、DPマツチング法を連続単語
認識に拡張したものとして、特願昭56−199098号
明細書に記載されるが如きクロツクワイズDP法
がある。この手法は構文制御を有する連続単語認
識法として説明されているが、その特殊形として
当然離散単語認識をも包含している。ここでは簡
単のため離散単語認識の形式で、クロツクワイズ
DP法の要部を説明する。(Prior art) Various technologies have been developed regarding pattern matching for speech recognition, but one of the most heavily used among them is the one published in ``Journal of the Acoustical Society of Japan, Vol. 42, No. 9 (1986). There is a DP matching method as described in "Page 725 of the September issue. This is considered to be extremely effective as a method for matching time axis distortion of speech. Also, the DP matching method was extended to continuous word recognition. One such method is the Clockwise DP method as described in the specification of Japanese Patent Application No. 199098.This method is described as a continuous word recognition method with syntactic control, but its special form is of course discrete word recognition. For simplicity, we will use the form of discrete word recognition.
Explain the main parts of the DP method.

単語名を番号ｎで指定することとして｛ｎ｜ｎ＝１、２、…Ｎ｝なる単語セツトを認識対象とする。各単語に標準
パターン Bⁿ＝〓₁ ⁿ、〓₂ ⁿ、…〓_j ⁿ…〓ⁿ _Jo を考える。ここにｊは時刻を示し、〓_j ⁿは標準パ
ターンBⁿの時刻ｊの特徴を意味する。入力音声
パターンを同様にＡ＝ａ｜₁、ａ｜₂…ａ｜_i…ａ｜_I と示す。 Assuming that word names are designated by numbers n, a set of words {n|n=1, 2, . . . N} is to be recognized. Consider the standard pattern B ⁿ =〓 ₁ ⁿ , 〓 ₂ ⁿ , …〓 _j ⁿ …〓 ⁿ _Jo for each word. Here, j indicates time, and 〓 _j ⁿ means the characteristic of standard pattern B ⁿ at time j. The input speech patterns are similarly expressed as A=a| ₁ , a| ₂ ...a| _i ...a| _I.

音声認識は、入力パターンＡと標準パターン
Bⁿとのパターン間距離Ｄ（Ａ、Bⁿ）を求め、それ
が最小となるｎを定め、認識結果とすることによ
つて行なわれる。 Speech recognition uses input pattern A and standard pattern
This is done by finding the inter-pattern distance D (A, B ⁿ ) with respect to B ⁿ , determining n at which it is the minimum, and using this as the recognition result.

DPマツチングではこのパターン間距離の計算
を一例として次のような動的計画法計算によつて
行なう。 In DP matching, the distance between patterns is calculated by the following dynamic programming calculation, for example.

Γ初期条件 gⁿ（１、１）＝dⁿ（１、１） ……(1) Γ漸化式 gⁿ（ｉ、ｊ）＝dⁿ（ｉ、ｊ）＋mingⁿ（ｉ−１
、ｊ） gⁿ（ｉ−１、ｊ−１） gⁿ（ｉ−１、ｊ−２） ……(2) ｉ＝１、２、…Ｉｊ＝１、２、…Ｊ Γパターン間距離Ｄ（Ａ、Bⁿ）＝gⁿ（Ｉ、Jⁿ） ……(3) ここにdⁿ（ｉ、ｊ）は特徴ａ｜_iと〓_j ⁿの距離dⁿ
（ｉ、ｊ）＝‖ａ｜_i−〓_j ⁿ‖である。これを積分した
形式となる。gⁿ（ｉ、ｊ）を最適累積距離と呼ぶ。Γ initial condition g ⁿ (1, 1) = d ⁿ (1, 1) ...(1) Γ recurrence formula g ⁿ (i, j) = d ⁿ (i, j) + ming ⁿ (i-1
, j) g ⁿ (i-1, j-1) g ⁿ (i-1, j-2) ...(2) i=1, 2, ...I j=1, 2, ...J Γ pattern distance D (A, B ⁿ ) = g ⁿ (I, J ⁿ ) ...(3) Here, d ⁿ (i, j) is the distance d ⁿ between feature a | _i and 〓 _j ⁿ
(i, j)=‖a| _i −〓 _j ⁿ ‖. This is the integral form. g ⁿ (i, j) is called the optimal cumulative distance.

このDPマツチング処理は当初、単語ごとに実
行されていたが、クロツクワイズDP法では各単
語に対して並列的に実行される形式に改良され
た。すなわち、第１図のような、ｉ，ｊ，ｎが張
る空間において入力パターンの各時刻ｉにおい
て、各標準パターンBⁿの指定ｎと、それらの中
のｊのすべての組み合わせで指定されるｎ、に対
してgⁿ（ｉ、ｊ）なる最適累積値を計算し、しか
る後に時刻ｉを進めて処理を実行するという方式
になつている。 Initially, this DP matching process was executed for each word, but in the Crotwise DP method, it has been improved to a format in which it is executed for each word in parallel. That is, at each time i of the input pattern in the space spanned by i, j, and n as shown in Figure 1, the n specified by the specification n of each standard pattern B ⁿ and all the combinations of j among them. , the optimal cumulative value g ⁿ (i, j) is calculated for , and the process is then executed by advancing time i.

実際の計算においては図の空間のすべてのワー
クエリアを用意する必要はなく、ｉ方向に関して
は、時刻ｉとｉ−１の２時刻分があれば(2)式の計
算を進めることができる。このような方法は、入
力パターンの特徴ａ｜_iの入力に同期して処理を進
めることができるので、発声と並行して認識のた
めの計算を進行することができ、実時間性が良い
とされている。 In the actual calculation, it is not necessary to prepare the entire work area in the space shown in the figure, and in the i direction, the calculation of equation (2) can proceed as long as there are two times, i and i-1. In this method, processing can proceed in synchronization with the input of the input pattern feature a _| has been done.

（発明が解決しようとする問題点）しかし、この方法を大語いの認識に適用しよう
とすると計算量が大となるという問題がある。(2)
式の漸化式はｉの１サイクル内で、ｎとｊのすべ
ての組合せについて実行しなくてはならない。標
準パターン長がJⁿ＝30で、1000語を認識しようと
すると、３×10⁴の点で(2)式を計算することにな
る。１点あたり10μsで実行したとしても300ｍｓ
を要する。通常の音声認識ではｉの量子化は20μs
程度で行なわれるので、このような大語いでは実
時間実行はとても不可能である。(Problems to be Solved by the Invention) However, when this method is applied to recognizing large words, there is a problem in that the amount of calculation becomes large. (2)
The recurrence formula must be executed for all combinations of n and j within one cycle of i. If the standard pattern length is J ⁿ =30 and an attempt is made to recognize 1000 words, equation (2) will be calculated using 3×10 ⁴ points. Even if it is executed at 10μs per point, it will take 300ms
It takes. In normal speech recognition, the quantization of i is 20μs
Real-time execution is quite impossible with such a large term.

本発明はクロツクワイズ型DPマツチングの有
する計算量が多いという上記欠点を改良して、高
速で大語い認識が可能でありながら低価格な音声
認識装置のパターンマツチング方式を提供するこ
とを目的とする。 An object of the present invention is to provide a pattern matching method for a speech recognition device that is capable of recognizing large words at high speed and at low cost, by improving the above-mentioned drawback that the clockwise type DP matching has a large amount of calculation. do.

（問題点を解決するための手段）本発明によるパターンマツチング方式は、上記
クロツクワイズ型のDPマツチングの(2)式の漸化
式計算を実行するに当り、過去に計算された最適
累積値に基づいて新たな最適累積値gⁿ（ｉ、ｊ）
を計算する点（ｎ、ｊ）を制限し、かつ各（ｎ、
ｊ）点における漸化式計算処理を、その近傍で計
算が実行された点（n′、j′）との相互関係に基づ
いて制御することを特徴とする。(Means for Solving the Problems) The pattern matching method according to the present invention uses the optimal cumulative value calculated in the past when executing the recurrence formula calculation of equation (2) of the above clockwise type DP matching. Based on the new optimal cumulative value g ⁿ (i, j)
We limit the points (n, j) at which we calculate and each (n,
The method is characterized in that the recurrence formula calculation process at point j) is controlled based on the mutual relationship with points (n', j') for which calculations were performed in the vicinity.

（作用・原理）元来DPマツチングは第１図の如きｎ、ｉ、ｊ
が張る空間において、各単語ごとに、（１、１）
点から（Ｉ、Jⁿ）点に至る経路でdⁿ（ｉ、ｊ）の
総和、すなわち累積値が最小となるものを探索す
るものである。この過程で計算される最適累積値
gⁿ（ｉ、ｊ）は、単語ｎの（１、１）点から（ｉ、
ｊ）点に至る距離dⁿ（ｉ、ｊ）の累積値を与えて
いる。したがつてgⁿ（ｉ、ｊ）の値が大であると
いうことはこの点（ｉ、ｊ）が最適経路上にある
可能性が低いことを意味する。本発明の第１の特
徴はgⁿ（ｉ、ｊ）が大となると予測される場合に
は、DPの漸化式計算を省略することによつて高
速化を図る点にある。(Operation/Principle) Originally, DP matching was based on n, i, j as shown in Figure 1.
For each word in the space spanned by (1, 1)
This is a search for a path from a point to a point (I, J ⁿ ) that minimizes the sum of d ⁿ (i, j), that is, the cumulative value. The optimal cumulative value calculated in this process
g ⁿ (i, j) is from the (1, 1) point of word n to (i,
j) The cumulative value of the distance d ⁿ (i, j) to the point is given. Therefore, a large value of g ⁿ (i, j) means that this point (i, j) is unlikely to be on the optimal route. The first feature of the present invention is that when g ⁿ (i, j) is predicted to be large, speeding up is achieved by omitting the calculation of the DP recurrence formula.

具体的には第２図に示すように、過去のクロツ
ク（ｉ−１）で計算された最適累積値gⁿ（ｉ、ｊ）
を所定の基準で検定し、その値が小である（ｎ、
ｊ）の点の集合ｗ（図に○で表示）を定め、新た
な最適累積値gⁿ（ｉ、ｊ）を算出するための(2)式
の漸化式計算は、これらの点の近傍のみで行なう
ものとする。 Specifically, as shown in Figure 2, the optimal cumulative value g ⁿ (i, j) calculated at the past clock (i-1)
is tested according to a predetermined standard, and the value is small (n,
j), and calculate the new optimal cumulative value g ⁿ (i, j) using the recurrence formula of equation (2). It shall be carried out by only one person.

しかし、この方法をこのまま実行しようとする
第３図のような問題が残る。この図は単語ｎの
（ｉ、ｊ）点の近傍を拡大した図である。参照数
字１で示す孤立した点gⁿ（ｉ−１、ｊ）が集合ｗ
に含まれていたとする。(2)式の漸化式計算を行な
うとすると、このgⁿ（ｉ−１、ｊ）は参照数字２，
３，４で示す３点の最適累積値、すなわちgⁿ（ｉ、
ｊ）、gⁿ（ｉ、ｊ＋１）、gⁿ（ｉ、ｊ＋２）に影響を
及ぼす。したがつて、これら３点での漸化式計算
を行なう必要があるが、(2)式をそのまま実行した
のでは効率が悪い。なぜならば（ｉ−１、ｊ）の
近傍ではこの点だけが集合ｗに含まれていること
から gⁿ（ｉ−１、ｊ）＜gⁿ（ｉ−１、ｋ） ……(4) ｋ＝ｊ−２、ｊ−１、ｊ＋１、ｊ＋２であり、(2)式の漸化式計算結果が gⁿ（ｉ、ｊ）＝dⁿ（ｉ、ｊ）＋gⁿ（ｉ−
１、ｊ） gⁿ（ｉ、ｊ）＝dⁿ（ｉ、ｊ）＋gⁿ（ｉ−
１、ｊ） gⁿ（ｉ、ｊ＋１）＝dⁿ（ｉ、ｊ＋１）＋gⁿ（ｉ−１
、ｊ） gⁿ（ｉ、ｊ）＝dⁿ（ｉ、ｊ）＋gⁿ（ｉ−
１、ｊ） gⁿ（ｉ、ｊ＋１）＝dⁿ（ｉ、ｊ＋１）＋gⁿ（ｉ−１
、ｊ） gⁿ（ｉ、ｊ＋２）＝dⁿ（ｉ、ｊ＋２）＋gⁿ（ｉ−１
、ｊ）……(5) となるのは自明であるからである。それにもかか
わらず、(2)式をそのまま計算するのは不利であ
り、特に参照数字５，６，１，７，８のgⁿ（ｉ−
１、ｋ）に対する３×３＝９回のメモリアクセス
は処理速度を低下させる。 However, if this method is attempted to be carried out as is, the problem as shown in FIG. 3 remains. This figure is an enlarged view of the vicinity of point (i, j) of word n. Isolated points g ⁿ (i-1, j) indicated by reference number 1 are set w
Suppose that it was included in When calculating the recurrence formula of equation (2), this g ⁿ (i-1, j) is the reference number 2,
The optimal cumulative value of the three points indicated by 3 and 4, that is, g ⁿ (i,
j), g ⁿ (i, j+1), and g ⁿ (i, j+2). Therefore, it is necessary to calculate the recurrence formula at these three points, but it is inefficient to execute equation (2) as is. This is because in the vicinity of (i-1, j), only this point is included in the set w, so g ⁿ (i-1, j) < g ⁿ (i-1, k) ...(4) k = j-2, j-1, j+1, j+2, and the recurrence formula calculation result of equation (2) is g ⁿ (i, j) = d ⁿ (i, j) + g ⁿ (i-
1, j) g ⁿ (i, j) = d ⁿ (i, j) + g ⁿ (i-
1, j) g ⁿ (i, j+1)=d ⁿ (i, j+1)+g ⁿ (i-1
, j) g ⁿ (i, j) = d ⁿ (i, j) + g ⁿ (i-
1, j) g ⁿ (i, j+1)=d ⁿ (i, j+1)+g ⁿ (i-1
, j) g ⁿ (i, j+2)=d ⁿ (i, j+2)+g ⁿ (i-1
, j)...(5) is obvious. Nevertheless, it is disadvantageous to calculate equation (2) as is, especially g ⁿ (i−
1,k) 3×3=9 memory accesses reduces processing speed.

以上では集合ｗに含まれる点がその近傍で完全
に孤立している場合の例を上げたが、同様のこと
は、集合ｗに含まれる点の近傍の点との関係にお
いて、程度の差こそあれ生じる。本発明は集合ｗ
に含まれる点の近傍の相互関係によつて漸化式計
算を制御することによつて効率良くクロツクワイ
ズ型のDPマツチングを実行することを第２の特
徴とする。 In the above, we have given an example where a point included in the set w is completely isolated in its vicinity, but the same thing can be said to be that there is a difference in degree in the relationship between a point included in the set w and its neighboring points. That happens. The present invention is a set w
The second feature is that clockwise-type DP matching can be efficiently executed by controlling recurrence formula calculations based on the mutual relationships in the vicinity of points included in the method.

DP漸化式の例とし(2)式を考える。（ｎ、ｊ）∈
ｗとし、その直前に処理を行なつた点を（n′、j′）
∈ｗとする。いま漸化式計算を実行するプロセツ
サに密に結合されたレジスタR0、R1とR2をワー
クエリアとして考える。このとき（ｎ、ｊ）と
（n′、j′）との相互関係によつて制御される計算処
理の例は次のごとくである。 Consider equation (2) as an example of a DP recurrence equation. (n,j)∈
Let w and the point processed immediately before that point be (n′, j′)
Let ∈w. Now consider the registers R0, R1, and R2, which are tightly coupled to the processor that executes the recurrence formula calculation, as work areas. An example of calculation processing controlled by the mutual relationship between (n, j) and (n', j') is as follows.

(A) ｎ≠n′のとき (A) ｎ≠n′のとき min（R1、R2）＋dⁿ′（ｉ、j′＋１）→gⁿ′（ｉ、
j′＋１） R1＋dⁿ′（ｉ、j′＋２）→gⁿ′（ｉ、j′＋２） gⁿ（ｉ−１、ｊ）→R1 R1＋dⁿ（ｉ、ｊ）→gⁿ（ｉ、ｊ） ∞→R2 ｎ→n′、ｊ→j′ ……（６−１） (B) ｎ−n′、ｊ−j′＞２のとき (B) ｎ−n′、ｊ−j′＞２のとき min（R1、R2）＋dⁿ（ｉ、j′＋１）→gⁿ′（ｉ、j
′＋１） R1＋dⁿ（ｉ、j′＋２）→gⁿ′（ｉ、j′＋２） gⁿ′（ｉ−１、ｊ）→R1 R1＋dⁿ（ｉ、ｊ）→gⁿ（ｉ、ｊ） ∞→R2 ｊ→j′ ……（６−２） (C) ｎ＝n′、ｊ−j′＝２のとき (C) ｎ＝n′、ｊ−j′＝２のとき min（R1、R2）＋dⁿ（ｉ、j′＋１）→gⁿ（ｉ、j′
＋１） gⁿ（ｉ−ｊ、ｊ）→R0 min（R0、R1）＋dⁿ（ｉ、ｊ）→gⁿ（ｉ、ｊ） R0→R1 ∞→R2 ｊ→j′ ……（６−３） (D) ｊ−j′＝１のとき gⁿ（ｉ−１、ｊ）→R0 min（R0、R1、R2）＋dⁿ（ｉ、ｊ）→gⁿ（ｉ、ｊ） R1→R2 R0→R1 ｊ→j′ ……（６−４）以上の各処理の始まる時点では、手続（６−
１）における、、、手続（６−２）におけ
る、、、手続（６−３）における、、
、手続（６−４）における、、の処理か
ら分るように、n′、j′には前回処理を行なつた
（n′、j′）点の情報が含まれ、R1にはgⁿ（ｉ−１、
j′）、R2にはgⁿ′（ｉ−１、j′−１）が記憶された状
態になつている。 (A) When n≠n′ (A) When n≠n′ min(R1, R2)+d ⁿ ′(i, j′+1)→g ⁿ ′(i,
j′+1) R1+d ⁿ ′(i, j′+2)→g ⁿ ′(i, j′+2) g ⁿ (i−1, j)→R1 R1+d ⁿ (i, j)→g ⁿ (i, j ) ∞→R2 n→n′, j→j′ ……(6-1) (B) When n−n′, j−j′>2 (B) n−n′, j−j′>2 When min(R1, R2) + d ⁿ (i, j′+1) → g ⁿ ′(i, j
′+1) R1+d ⁿ (i, j′+2)→g ⁿ ′(i, j′+2) g ⁿ ′(i−1, j)→R1 R1+d ⁿ (i, j)→g ⁿ (i, j) ∞→R2 j→j′ ……(6-2) (C) When n=n′, j−j′=2 (C) When n=n′, j−j′=2 min(R1, R2) + d ⁿ (i, j'+1) → g ⁿ (i, j'
+1) g ⁿ (i-j, j) → R0 min (R0, R1) + d ⁿ (i, j) → g ⁿ (i, j) R0 → R1 ∞ → R2 j → j' ... (6-3 ) (D) When j-j'=1 g ⁿ (i-1, j) → R0 min (R0, R1, R2) + d ⁿ (i, j) → g ⁿ (i, j) R1 → R2 R0 →R1 j→j′ ...(6-4) At the start of each of the above processes, procedure (6-
In 1), in procedure (6-2), in procedure (6-3),
As can be seen from the processing of , in procedure (6-4), n', j' contain information about the previously processed point (n', j'), and R1 contains g ⁿ (i-1,
j') and R2 stores g ⁿ '(i-1, j'-1).

上記(A)は単語が切り替つた場合の処理で、直前
に処理していた単語n′に対して手続（６−１）の
、の処理を行なう。このとき、R1にはgⁿ（ｉ
−１、j′）が、R2にはgⁿ（ｉ−１、j′−１）が含ま
れている。(2)式に照し合わせて、これらのデータ
から計算可能なのはgⁿ′（ｉ、j′＋１）とgⁿ′（ｉ、
j′＋２）であることが分かる。それゆえ、これら
のレジスタ内のデータを基にして、 gⁿ′（ｉ、j′＋１）＝dⁿ（ｉ、j′＋１）＋
mingⁿ′（ｉ、j′） gⁿ′（ｉ、j′−１） ……(7) と、 gⁿ′(i、j′+2)＝dⁿ(i、j′+2)＋gⁿ′(i、j′) ……(8) なる形で(2)式を簡略化して計算を行なう。続いて
（ｎ、ｊ）点に対する処理を行なう。R1にgⁿ（ｉ
−１、を読み出す。（ｎ、ｊ−１）、（ｎ、ｊ−２）
は集合ｗに含まれていないので、gⁿ（ｉ、ｊ）は
このデータのみで確定するとしてが実行され
る。やはり、（ｎ、ｊ−１）が集合ｗに含まれて
いないことからgⁿ（ｉ−１、ｊ−１）＝∞とみなし
てR2には∞をセツトしておく。 The above (A) is the process when the word is switched, and the process of procedure (6-1) is performed for the word n' that was being processed immediately before. At this time, R1 has g ⁿ (i
-1, j'), and R2 contains g ⁿ (i-1, j'-1). According to equation (2), what can be calculated from these data are g ⁿ ′(i, j′+1) and g ⁿ ′(i, j′+1).
j′+2). Therefore, based on the data in these registers, g ⁿ ′(i, j′+1)=d ⁿ (i, j′+1)+
ming ⁿ ′(i, j′) g ⁿ ′(i, j′−1) …(7) and g ⁿ ′(i, j′+2)=d ⁿ (i, j′+2)+g Calculation is performed by simplifying equation (2) in the form ⁿ ′(i, j′) ……(8). Subsequently, processing is performed on point (n, j). In R1 g ⁿ (i
-1, is read out. (n, j-1), (n, j-2)
is not included in the set w, so it is assumed that g ⁿ (i, j) is determined only by this data. Again, since (n, j-1) is not included in the set w, it is assumed that g ⁿ (i-1, j-1) = ∞, and ∞ is set in R2.

上記(B)は同一単語でｊがj′より２以上離れてい
る場合であるが、処理の内容は(a)の場合と類似し
ている。（ｎ＝n′とすればまつたく同じ）ので説
明を省略する。 The above (B) is a case where j is two or more away from j' for the same word, but the content of the processing is similar to the case (a). (If n=n', it is the same again), so the explanation will be omitted.

上記の(C)は同一単語でｊがj′と２だけ離れてい
る場合である。このときR1にはgⁿ（ｉ−１、j′）、
R2にはgⁿ（ｉ−１、j′−１）が記憶されている。
（ｎ、j′＋１）が集合ｗに含まれないことから、
gⁿ（ｉ、j′＋１）はこれらの２データより決定さ
れるゆえによつて gⁿ（ｉ、j′＋１）＝mingⁿ（ｉ−１、j′） gⁿ（ｉ−１、j′−１）
……(9) なる形で(2)式を簡略化して実行する。次いでR0
にgⁿ（ｉ、ｊ）を読み出す。（ｎ、ｊ−１）が集合
ｗに含まれていないこと、R1にgⁿ（ｉ−１、j′）
としてgⁿ（ｉ−１、ｊ−２）が記憶されているこ
とからによつて gⁿ（ｉ、ｊ）＝mingⁿ（ｉ−１、ｊ） gⁿ（ｉ−１、ｊ−２） ……(10) なる(2)式の簡略形を実行する。によつてR1に
gⁿ（ｉ−１、ｊ）をセツトし、によつて集合ｗ
に含まれないgⁿ（ｉ−１、ｊ−１）に代わるもの
として∞をR2にセツトする。 (C) above is the case where j is the same word and j is separated by 2 from j'. At this time, R1 has g ⁿ (i-1, j'),
G ⁿ (i-1, j'-1) is stored in R2.
Since (n, j′+1) is not included in the set w,
Since g ⁿ (i, j'+1) is determined from these two data, g ⁿ (i, j'+1)=ming ⁿ (i-1, j') g ⁿ (i-1, j' -1)
...(9) Simplify and execute equation (2) in the form. Then R0
Read out g ⁿ (i, j). (n, j-1) is not included in the set w, and in R1 g ⁿ (i-1, j')
Since g ⁿ (i-1, j-2) is stored as g ⁿ (i, j) = ming ⁿ (i-1, j) g ⁿ (i-1, j-2) ...(10) Execute the simplified form of equation (2). to R1 by
Set g ⁿ (i-1, j) and set w by
∞ is set in R2 as a replacement for g ⁿ (i-1, j-1) which is not included in .

最後の(D)は、同一単語でｊとj′が１だけ離れて
いる場合、すなわち連続している場合である。
によつてR0にgⁿ（ｉ−１、ｊ）を読み出した後、
によつて(2)式をそのまま実行する。、によ
つてR1にはgⁿ（ｉ−１、ｊ）が、R2にはgⁿ（ｉ−
１、ｊ−１）がセツトされる。 The last case (D) is the case where j and j' are the same word and are separated by 1, that is, they are continuous.
After reading g ⁿ (i-1, j) into R0 by
Expression (2) is executed as is. , R1 has g ⁿ (i-1, j) and R2 has g ⁿ (i-
1, j-1) is set.

以上述べた方法によると、多くの場合漸化式(2)
を(7)〜(10)式のように簡略化して計算することがで
き、第３図のような不利を避けることができる。
しかもgⁿ（ｉ−１、ｊ）のメモリへのアクセスは
上記(A)〜(D)のいずれのケースでも各１回であり、
実効的な高速化が可能となる。 According to the method described above, in many cases the recurrence formula (2)
can be calculated by simplifying it as shown in equations (7) to (10), and the disadvantages shown in FIG. 3 can be avoided.
Moreover, the memory of g ⁿ (i-1, j) is accessed once in each case of (A) to (D) above.
Effective speeding up becomes possible.

（実施例）第４図は本発明によるパターンマツチング方式
に基づいた離散単語型の音声認識装置の構成例を
示すブロツク図であり、第５図はその動作を示す
フローチヤートである。マイクロホン１０より入
力された音声信号は分析部１０によつて周波数分
析され、マイクロプロセツサ３０に入力される。
マイクロプロセツサ３０には前記の手続（６−
１）〜（６−４）で使用するためのレジスタR0、
R1、R2が内蔵されている。また外部には標準パ
ターンBⁿ＝〓₁ ⁿ、…〓ⁿ _j、…〓ｎ_Joを記憶するため
の標準パターン記憶部４０と、gⁿ（ｉ、ｊ）のワ
ークメモりとなるｇメモリ５０とが接続されてい
る。(Embodiment) FIG. 4 is a block diagram showing a configuration example of a discrete word type speech recognition device based on the pattern matching method according to the present invention, and FIG. 5 is a flowchart showing its operation. The audio signal inputted from the microphone 10 is subjected to frequency analysis by the analysis section 10 and inputted to the microprocessor 30.
The microprocessor 30 performs the above procedure (6-
1) Register R0 for use in (6-4),
Built-in R1 and R2. Also, externally there is a standard pattern storage unit 40 for storing standard patterns B ⁿ =〓 ₁ ⁿ , ...〓 ⁿ _j , ...〓n _Jo , and a g memory 50 that serves as a work memory for g ⁿ (i, j). are connected.

このｇメモリ５０は、gⁿ（ｉ−１、ｊ）とgⁿ
（ｉ、ｊ）のための２段分用意され、各単語ｎご
とにｊ＝１、２、…Jⁿ、Jⁿ＋１、Jⁿ＋２のアドレ
スを有している。この最後の２個は、手続（６−
１）の処理においてj′−Jⁿ′であつたとき、が
空回りするためのエリアとなるものである。ま
た、ｎ＝０に対してはｊ＝０、ｊ＝−１の２アド
レスが余分に用意されている。これについては後
で説明する。 This g memory 50 has g ⁿ (i-1, j) and g ⁿ
Two stages are prepared for (i, j), and each word n has an address of j=1, 2, . . . J ⁿ , J ⁿ +1, J ⁿ +2. These last two are the procedure (6-
In the process of 1), when j' - J ⁿ ', becomes the area for idle rotation. Furthermore, for n=0, two extra addresses, j=0 and j=-1, are prepared. This will be explained later.

最初の入力ベクトルａ｜₁が与えらえると、ｇメ
モリ５０内のgⁿ（ｉ−１、ｊ）に対して次のよう
な初期設定が行なわれる。 When the first input vector a| ₁ is given, g ⁿ (i-1, j) in the g memory 50 is initialized as follows.

gⁿ（１、１）＝dⁿ（１、１） gⁿ（１、ｊ）＝∞（ｊ≠１） ……（11）これらは特願56−199098号明細書第６図ａの場
合と同様である。g ⁿ (1, 1) = d ⁿ (1, 1) g ⁿ (1, j) = ∞ (j≠1) ... (11) These are the cases shown in Figure 6 a of the specification of Japanese Patent Application No. 56-199098. It is similar to

一般的に時刻ｉでは第５図に示す処理が実行さ
れる。まずａ｜_iが入力されるとブロツク100により
ｇメモリ５０内のgⁿ（ｉ、ｊ）のテーブルを総て
∞でリセツトする。これは虫喰い的に漸化式計算
を行なうことにより生じる未定義の累積距離gⁿ
（ｉ、ｊ）が次の時刻ｉ＋１で不都合を生じさせ
ないようにするためである。次にｎ＝１、n′＝
０、j′＝−２なる初期設定がなされる。n′＝０、
j′＝−２とするのは、このｉサイクルで最初に手
続（６−１）の処理が実行されるときとの処
理が、先に説明したg⁰（ｉ、−１）とg⁰（ｉ、０）
で空回りできるようにするためである。 Generally, at time i, the process shown in FIG. 5 is executed. First, when a| _i is input, the block 100 resets all the tables of g ⁿ (i, j) in the g memory 50 to ∞. This is the undefined cumulative distance g ⁿ that is generated by performing recurrence formula calculations
This is to prevent (i, j) from causing any inconvenience at the next time i+1. Then n=1, n′=
The initial settings are 0 and j'=-2. n′=0,
The reason for setting j'=-2 is that the processing when the process of procedure (6-1) is executed for the first time in this i cycle is the same as g ⁰ (i, -1) and g ⁰ ( i, 0)
This is so that you can spin freely.

一般的な（ｎ、ｊ）に対しては、ブロツク120
でgⁿ（ｉ−１、ｊ）をR0に読み出し、130で閾値
θ(i)との比較を行なう。これは先に述べた集合ｗ
にこの（ｎ、ｊ）が含まれるか否かのテストであ
る。閾値θ(i)はｉの単調増加関数として予かじめ
与えられている。R0＞θ(i)のときは、このｊに
対する処理はすべて省略される。R0＜θ(i)のと
きはｎとn′、ｊとj′の関係がテストされ、それぞ
れに応じて（６−１）、（６−３）、（６−４）のい
ずれかの処理がなされる。なお、（６−２）の処
理は本質的に同等な（６−１）とまとめてブロツ
ク140に示している。また、gⁿ（ｉ−１、ｊ）は
120のブロツクでR0に読み出されているので、
（６−１）式の等はR0からの転送でよく、（６
−３）の等は省略してもよい。 For general (n,j), block 120
At step 130, g ⁿ (i-1, j) is read out to R0 and compared with the threshold value θ(i). This is the set mentioned earlier lol
This is a test to see whether this (n, j) is included in . The threshold value θ(i) is given in advance as a monotonically increasing function of i. When R0>θ(i), all processing for this j is omitted. When R0<θ(i), the relationship between n and n' and j and j' is tested, and one of (6-1), (6-3), and (6-4) is performed depending on each. will be done. Note that the processing of (6-2) is collectively shown in block 140 with (6-1), which is essentially equivalent. Also, g ⁿ (i-1, j) is
Since it is read to R0 in block 120,
Equation (6-1) can be transferred from R0, and (6
-3) etc. may be omitted.

以上の処理をｎ、ｊの２重ループとして回すこ
とによつて時刻ｉのサイクルは終了する。このサ
イクルで計算されたgⁿ（ｉ、ｊ）を過去の最適累
積値gⁿ（ｉ−１、ｊ）として切り替えて次のｉ＋
１のサイクルに移行する。 The cycle at time i is completed by repeating the above processing as a double loop of n and j. Switch g ⁿ (i, j) calculated in this cycle as the past optimal cumulative value g ⁿ (i-1, j) and use it for the next i+
Shift to cycle 1.

かくして時刻Ｉまでの処理が行なわれ、入力音
声が終了しｉ＝Ｉ＋１となつた時点では、各単語
ｎごとにｇメモリ５０内にgⁿ（ｉ−１、Jⁿ）とし
てパターン間距離Ｄ（Ａ、Bⁿ）が得られる。これ
らを比較して最小となる単語ｎ＝n^として認識結
果を定め出力する。 In this way, the processing up to time I is completed, and when the input voice ends and i=I+1, the inter-pattern distance ^D ⁽ A, B ⁿ ) are obtained. These are compared and the recognition result is determined and output as the minimum word n=n^.

以上、本発明の原理を実施例に基づいて述べた
が、これらは本発明の範囲を限定するものではな
い。特に、第３図におけるブロツク130の判定処
理には種々の変形が考えられる。閾値θ(i)の定め
方に関しても、予じめ人手によつて定義しておく
方法の他に、gⁿ（ｉ−１、ｊ）の最小値にリンク
させて設定するなどの変形が考えられ、本発明の
権利範囲に属するものである。 Although the principle of the present invention has been described above based on examples, these do not limit the scope of the present invention. In particular, various modifications can be made to the determination process of block 130 in FIG. Regarding the method of determining the threshold value θ(i), in addition to the method of manually defining it in advance, variations such as setting it by linking it to the minimum value of g ⁿ (i-1, j) are considered. and falls within the scope of the present invention.

また以上の説明では、基本的な漸化式として(2)
式を用いたが、「日経エレクトロニクスの1983年
11月７日号第184頁の表１」に記載されるが如き、
種々の変形の漸化式についても本発明の原理は適
用される。さらに本発明は特願昭56−199098記載
のクロツクワイズDP法と同様連続単語認識に利
用できるものである。 In addition, in the above explanation, we use (2) as the basic recurrence formula.
``Nikkei Electronics' 1983
As stated in “Table 1” on page 184 of the November 7th issue,
The principles of the present invention are also applicable to various deformed recurrence formulas. Furthermore, the present invention can be used for continuous word recognition similar to the clockwise DP method described in Japanese Patent Application No. 56-199098.

（発明の効果）以上述べた本発明の原理によるとDP漸化式の
計算を、必要な（ｎ、ｊ）点のみで、極めて無駄
なく実行することができ、安価かつ高速な音声認
識装置を実現・提供できる。(Effects of the Invention) According to the principle of the present invention described above, the calculation of the DP recurrence formula can be executed with only the necessary (n, j) points without waste, and an inexpensive and high-speed speech recognition device can be realized. Can be realized and provided.

[Brief explanation of drawings]

第１図、第２図、第３図は本発明の原理説明
図、第４図は実施例ブロツク図、第５図はその動
作を説明するフローチヤートである。１０……マイクロホン、２０……分析部、３０
……マイクロプロセツサ、４０……標準パターン
記憶部、５０……ｇメモリ。 1, 2, and 3 are diagrams explaining the principle of the present invention, FIG. 4 is a block diagram of an embodiment, and FIG. 5 is a flowchart explaining its operation. 10...Microphone, 20...Analysis department, 30
...Microprocessor, 40...Standard pattern storage unit, 50...g memory.

Claims

[Claims]

1 Characterize the standard pattern of each word n〓 Time series of _j ⁿ
means for storing as B ⁿ =〓 ₁ ⁿ …〓 _j ⁿ …〓 ⁿ _Jo ; means for temporarily storing the feature a| _i of the input voice pattern;
Corresponding to each word n, the optimal cumulative value g ⁿ (i, _j ⁾ of the distance d ⁿ (i, j) between the feature a| _i and 〓 j n is determined by the recurrence formula of dynamic programming. and calculates a new optimal cumulative value g ⁿ (i, j) at each time i based on the past optimal cumulative value (n,
j), and the recurrence formula calculation process at each (n, j) point is controlled by the mutual relationship between the point (n', j') of n and j calculated immediately before. Highly efficient pattern matching method.