JPH0136151B2

Info

Publication number: JPH0136151B2
Application number: JP58143288A
Authority: JP
Inventors: Hirozo Yamada; Kazuhiko Yamamoto; Ryuichi Oka; Noboru Funakubo
Original assignee: Agency of Industrial Science and Technology
Current assignee: National Institute of Advanced Industrial Science and Technology AIST
Priority date: 1983-08-05
Filing date: 1983-08-05
Publication date: 1989-07-28
Also published as: JPS6033676A

Description

DETAILED DESCRIPTION OF THE INVENTION

[Field of Industrial Application]

The present invention relates to a pattern matching method for determining the degree of dissimilarity between patterns in pattern recognition, particularly in the recognition of two-dimensional figures such as characters.

[Background technology and its problems]

In pattern recognition of two-dimensional figures, speech, and the like, the core problem is to evaluate the degree of similarity or dissimilarity (hereinafter called the dissimilarity) between two sets of feature vectors extracted from the patterns:

$$A = \{A_1, A_2, \ldots, A_I\} \tag{1}$$

$$B = \{B_1, B_2, \ldots, B_J\} \tag{2}$$

For example, in the pattern matching method by superposition used for character recognition, the set $A = \{p(Z_i)\}$ of density values at each point $Z_i = (x_i, y_i)$ of the standard pattern (mask) is superposed on the set $B = \{q(Z_i)\}$ of density values of the unknown input pattern, and the superposition difference

$$D_1(A, B) = \sum_{Z_i} |p(Z_i) - q(Z_i)| \tag{3}$$

is used as the dissimilarity to evaluate how close the input character is to each character class. The correlation value $\sum p(Z_i) \cdot q(Z_i)$ is sometimes used as a similarity, and this correlation value or the value of equation (3) is sometimes normalized by the average densities of A and B, but all of these take the same basic position in that they measure the amount of overlap. Because its logic is simple, the superposition method has been widely used for recognizing printed characters. However, it superposes the patterns on the assumption that the positions of the sample points are fixed, so it is difficult to apply to objects, such as handwritten characters, whose sample point positions vary. Such objects call for the so-called structural analysis approach, which establishes a correspondence between clusters of features in the input and in the mask.
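Equation (3) can be illustrated with a minimal Python sketch. The function name `overlay_dissimilarity` and the flat list representation of the density values are assumptions made for illustration only; the description does not prescribe an implementation.

```python
from typing import Sequence


def overlay_dissimilarity(p: Sequence[float], q: Sequence[float]) -> float:
    """Equation (3): sum of absolute density differences at fixed sample points."""
    if len(p) != len(q):
        raise ValueError("mask and input must be sampled at the same points")
    return sum(abs(pi - qi) for pi, qi in zip(p, q))


# A one-pixel shift of the same stroke already produces a non-zero difference,
# which is the weakness of fixed sample points noted above.
mask = [0, 1, 1, 0]
shifted = [1, 1, 0, 0]
print(overlay_dissimilarity(mask, mask))     # 0
print(overlay_dissimilarity(mask, shifted))  # 2
```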

As one such structural analysis approach, in the field of speech recognition, where the features form a time series, methods based on DP (dynamic programming: R. Bellman, "Dynamic Programming", Princeton University Press, 1957) have become established as the standard matching technique. These methods find the optimal correspondence by successively stretching and compressing the sample points (the time axis) in a nonlinear manner. That is, when the features are serialized, as in speech, the pairing of sample point $i$ of A in equation (1) with sample point $j$ of B in equation (2), used to establish the nonlinear correspondence shown in FIG. 1, is written

$$C(k) = (i(k), j(k)) \tag{4}$$

the distance between the features at arbitrary sample points $i$ and $j$ is written

$$d(C) = d(i, j) = |A_i - B_j| \tag{5}$$

and the dissimilarity between A and B is defined by equation (6), the formula of which is missing from this text. Equation (6) considers the various possible correspondences between A and B and takes the one with the smallest sum as the overall measure. $w(k)$ is a non-negative weighting coefficient introduced to make the system flexible.
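The formula of equation (6) is not legible in this text. Assuming it follows the usual DP matching form of the Sakoe and Chiba paper cited later in this description, and that it is consistent with the normalization by $\sum l_i$ in equation (8), it would read roughly as follows. This is a reconstruction under those assumptions, not the original formula.

```latex
D_2(A, B) = \min_{C} \left[ \frac{\sum_{k} d(C(k))\, w(k)}{\sum_{k} w(k)} \right] \tag{6}
```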

Here, take the so-called asymmetric position, in which correspondences between A and B of up to one-to-two are allowed and the warping $C = (i, j)$ is made a function of the A side only, $(i, j(i))$. If the sampling interval at point $i$ (the segment length, in the contour segment example described later) is $l_i$, we can write $w(k) = w(i) = l_i$, and the problem of finding $D_2$ reduces to a DP problem in which the recurrence

$$g(i, j) = \min \begin{cases} g(i-1, j-1) + l_i \cdot d_1(i, j) \\ g(i-1, j-2) + l_i \cdot d_2(i, j-1, j) \\ g(i-2, j-1) + (l_{i-1} + l_i) \cdot d_3(i-1, i, j) \end{cases} \tag{7}$$

is evaluated successively to obtain the recursive evaluation quantity $g(i, j)$ at stage $i$, under the following constraints (1), (2), and (3) on $j(i)$:

(1) $j(i)$ is a continuous, monotonically increasing function;
(2) $j(1) = 1$, $j(I) = J$;
(3) the value of $j(i)$ lies near $i$.

The final dissimilarity $D_2$ is obtained as

$$D_2(A, B) = \frac{1}{\sum_i l_i}\, g(I, J) \tag{8}$$

That is, through the successive optimization of equation (7), the global optimum of the target equation (6) is obtained at high speed as the dissimilarity $D_2$ given by equation (8).

Note that the case of equally spaced sampling ($l_i = 1$), with the distances between features taken as

$$\begin{aligned} d_1(i, j) &= d(i, j) \\ d_2(i, j-1, j) &= \bigl(d(i, j-1) + d(i, j)\bigr)/2 \\ d_3(i-1, i, j) &= \bigl(d(i-1, j) + d(i, j)\bigr)/2 \end{aligned} \tag{9}$$

is described in H. Sakoe and S. Chiba, "Dynamic programming algorithm optimization for spoken word recognition", IEEE Transactions on Acoustics, Speech, and Signal Processing, Vol. ASSP-26, No. 1, pp. 43-49 (1978).
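The recurrence of equation (7) with the equally spaced distances of equation (9) can be sketched in Python as below. The function name `asymmetric_dp`, the scalar features, and the boundary value g(0, 0) = 0 are assumptions for illustration; constraint (3), which keeps j(i) near i, is left out to keep the sketch short.

```python
import math
from typing import Sequence


def asymmetric_dp(a: Sequence[float], b: Sequence[float]) -> float:
    """Dissimilarity D2 of equation (8) for scalar features with l_i = 1."""
    I, J = len(a), len(b)

    def d(i: int, j: int) -> float:
        # Equation (5) with 1-based sample indices.
        return abs(a[i - 1] - b[j - 1])

    # g[i][j] is the recursive evaluation quantity; g[0][0] = 0 is an assumed
    # boundary value that forces the path to start at (1, 1) or cover it.
    g = [[math.inf] * (J + 1) for _ in range(I + 1)]
    g[0][0] = 0.0
    for i in range(1, I + 1):
        for j in range(1, J + 1):
            candidates = [g[i - 1][j - 1] + d(i, j)]            # one-to-one
            if j >= 2:                                          # one-to-two
                candidates.append(g[i - 1][j - 2] + (d(i, j - 1) + d(i, j)) / 2)
            if i >= 2:                                          # two-to-one
                candidates.append(g[i - 2][j - 1] + 2 * (d(i - 1, j) + d(i, j)) / 2)
            g[i][j] = min(candidates)
    return g[I][J] / I  # the sum of l_i equals I when l_i = 1


print(asymmetric_dp([1, 2, 3], [1, 1, 2, 2, 3, 3]))  # 0.0
```

In the usage line, every sample of the first series is matched one-to-two against the stretched second series, so the time-axis distortion costs nothing.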

In the field of character recognition as well, DP is readily applied to online recognition, where the features form a time series. In offline character recognition, however, the features are distributed in two dimensions, so applying DP is not easy. For the features of two-dimensional figures it is important to preserve two-dimensional proximity relationships, and it is difficult to serialize the features in one dimension while preserving those relationships.

Before describing this problem concretely, the feature vectors, terms, and notation used in the DP matching of two-dimensional figures according to this invention are defined. First, the black points on the boundary of a black-and-white binary figure are called contour points, and a closed loop of connected contour points is called a contour. Next, as shown in FIG. 2(a), when a contour is approximated by straight lines while being traversed in the direction that keeps the white points on the left, each line element is called a contour segment, or simply a line segment (shown by arrows and the numerals 1 to 12), and the first contour point (the end without the arrowhead) and the last contour point (the end with the arrowhead) of each line segment are called its start point and end point. The end point of one line segment and the start point of the next are the same point. Further, the first and last line segments of one circuit of a contour are called the start segment and the end segment. The start point of the start segment coincides with the end point of the end segment.

When a contour is approximated by line segments in this way, every line segment $i$ has a line segment $i'$ whose end point is the start point of $i$. This $i'$ is called the line segment immediately preceding $i$ and is written

$$i' = P(i) \tag{10}$$

As shown in FIG. 3, which defines the feature vector of line segment $i$, the attributes of line segment $i$ are the coordinates of its start point and end point, $Z^b_i$ and $Z^e_i$, the direction $a_i$ from the start point to the end point, and the length $l_i$, each defined by equation (11).
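The attributes of a line segment can be held in a small Python structure such as the one below. Equation (11) is not legible in this text, so the definitions here are assumptions: the direction is taken as the `atan2` angle from start point to end point, and the length uses the sum of absolute coordinate differences, which is what the later remark about a rhombus-shaped neighborhood suggests. The class name `Segment` is hypothetical.

```python
import math
from dataclasses import dataclass


@dataclass(frozen=True)
class Segment:
    """One contour line segment: start point Z^b and end point Z^e."""
    begin: tuple[float, float]
    end: tuple[float, float]

    @property
    def direction(self) -> float:
        # Assumed definition of a_i: angle of the vector from begin to end.
        return math.atan2(self.end[1] - self.begin[1], self.end[0] - self.begin[0])

    @property
    def length(self) -> float:
        # Assumed definition of l_i: city-block (L1) length of the vector.
        return abs(self.end[0] - self.begin[0]) + abs(self.end[1] - self.begin[1])


seg = Segment((0, 0), (3, 4))
print(seg.length)  # 7
```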

Finally, the set $Q(i)$ of line segments that can "precede" a given line segment $i$ is defined as follows.

$$Q(i) = \{\, i' \mid \text{conditions (1), (2), and (3) are all satisfied} \,\} \tag{12}$$

Condition (1): $|Z^b_i - Z^e_{i'}| \le \delta$ (for example, 3).

Condition (2): $|Z^b_i - Z^e_{i'}| \le \beta$ (for example, 32), or $|a_i \sim a_{i'}| > \pi/2$, or $(Z^b_i - Z^e_{i'}) \cdot (Z^e_i - Z^b_{i'}) \ge 0$.

Condition (3): the distance $|Z^b_i - Z^e_{i'}|$ is a local minimum when compared with $|Z^b_i - Z^e_{P(i')}|$ and with $|Z^b_i - Z^e_{i''}|$, where $i' = P(i'')$.

That is, conditions (1) and (2) select line segments whose end point falls inside the hatched region in FIG. 3, or whose end point lies within the circle of radius $\beta$ and whose direction differs by $\pi/2$ or more. Because of the definition of the absolute value of a vector in equation (11), the region is in fact not a circle but a rhombus (a square rotated by 45°). Condition (3) means that, among the line segments satisfying conditions (1) and (2), only those whose distance to $i$ is a local minimum relative to the preceding and following segments are retained.
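A deliberately simplified Python sketch of building such a candidate set is shown below. It keeps only a distance test between the start point of segment j and the end points of the other segments, using the city-block distance that gives the rhombus-shaped region mentioned above. The angle test, the inner-product test, and the local-minimum filter of condition (3) are omitted, and the name `precedable`, the tuple representation, and the exclusion of the segment itself are assumptions.

```python
def l1(p, q):
    """City-block distance, whose 'circle' is a square rotated by 45 degrees."""
    return abs(p[0] - q[0]) + abs(p[1] - q[1])


def precedable(segments, j, radius):
    """Indices k whose end point lies within `radius` of the start point of j."""
    begin_j = segments[j][0]
    return [k for k, (_, end_k) in enumerate(segments)
            if k != j and l1(begin_j, end_k) <= radius]


# Each segment is (start point, end point); the three segments form a closed loop.
triangle = [((0, 0), (10, 0)), ((10, 0), (10, 10)), ((10, 10), (0, 0))]
print(precedable(triangle, 1, 3))   # [0]
print(precedable(triangle, 1, 32))  # [0, 2]
```

With the small radius only the segment that actually joins segment 1 is kept; with the larger radius a more distant segment also becomes a candidate, which is why the full definition adds further conditions.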

This set exists to allow line segment $i$ to be connected after the end point of line segment $i'$ when establishing the correspondence, and it plays an important role in the pattern matching method of this invention.

Under the above definitions, consider establishing the correspondence between the mask-side features $A = \{A_i\}$ and the input-side features $B = \{B_j\}$ by DP. First, a sequence is given to the mask-side features $\{A_i\}$. For this, the order of the contour segments described above is used as is.

The input side is also serialized in the sense of the tracing order, but this sequence cannot be used as is for the correspondence with the mask side. The reason is that, as shown in FIG. 2(b), when a break occurs in a line, the sequence of the contour changes (shown by line segments 1 to 14). This phenomenon occurs easily between the start or end point of one stroke and another line, particularly in characters composed of many strokes, such as handwritten kanji. For example, for the handwritten kanji "田", the standard number of white regions enclosed by lines (the number of loops) is considered to be 4. However, an analysis of ETL8, the handwritten educational kanji database of the Electrotechnical Laboratory that is currently used as a standard experimental database, has reported that the number of loops is distributed roughly evenly from 0 to 4.

This shows that, in evaluating the dissimilarity of two-dimensional figures that are complex and heavily deformed, such as handwritten kanji, the assumption of a correspondence between serialized feature vectors does not hold.

[Purpose of the invention]

This invention was made to solve the above problem, and provides a pattern matching method in which the similarity between sets of feature vectors that are not serialized can be calculated at high speed using the DP technique.

[Summary of the invention]

To achieve the above object, this invention is a pattern matching method having the following functions (1), (2), and (3).

(1) On both the input side and the mask side, a straight-line approximation of the boundary between white and black (contour segments) is used. This distorts the information less than a thinned figure does and reduces the number of candidate points for correspondence compared with the original figure. The directionality of the contour segments is used in the distance evaluation to improve the accuracy of the evaluation function.

(2) The mask-side features are serialized (using the ordering of the contour as is), whereas the input-side features are basically treated as having no ordering, which makes the matching robust against breaks in lines. At the same time, a set of line segments that can "precede" each input-side contour segment is defined, which preserves the high processing speed of DP and prevents unnatural correspondences.

(3) Not only the distance between contour segments but also the way the preceding and following contour segments are connected is evaluated, so that the evaluation includes not only the amount of overlap but also the amount of distortion incurred when space is warped nonlinearly.

[Principle of the invention]

Next, the principle of the pattern matching method of this invention is described.

First, the mask-side features are written as in equation (11), and for the input-side features the symbols $\omega^b_j$ (start point coordinates), $\omega^e_j$ (end point coordinates), $\alpha_j$ (direction), and $\lambda_j$ (length) are used. In the explanation above, the set of line segments that can precede was defined for the mask side, but in the actual matching this set $Q$ is defined only on the input side.

With these preparations, the recurrence corresponding to equation (7) above is expressed as

$$g(i, j) = \min\{h_1(i, j),\ h_2(i, j),\ h_3(i, j)\} \tag{13}$$

Here $h_1$, $h_2$, and $h_3$ in equation (13) are the correspondence evaluation quantities, given by equation (14), for cases a, b, and c in FIG. 4, which shows the correspondences between mask segment $i$ and input segment $j$. Case a is one-to-one ($i' = i$ with $j' = j$), case b is one-to-two ($i' = i$ with $j' = j-1,\ j$), and case c is two-to-one ($i' = i-1,\ i$ with $j' = j$). In the figure, M denotes the mask and N denotes the input.
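The formula of equation (14) is not legible in this text. A form consistent with the rearranged recurrence given in variation (3) near the end of this description, and with the inputs listed for calculation blocks $h_1$, $h_2$, and $h_3$ in the explanation of FIG. 5, would be the following. This is a reconstruction from those two passages, offered as an assumption rather than as the original formula.

```latex
\begin{aligned}
h_1(i, j) &= \min_{j'' \in Q(j)} \bigl\{ g(i-1, j'') + \gamma(i-1, i;\, j'', j) \bigr\} + l_i\, d_1(i, j) \\
h_2(i, j) &= \min_{j'' \in Q(P(j))} \bigl\{ g(i-1, j'') + \gamma(i-1, i;\, j'', P(j)) \bigr\} + l_i\, d_2(i, P(j), j) \\
h_3(i, j) &= \min_{j'' \in Q(j)} \bigl\{ g(i-2, j'') + \gamma(i-2, i-1;\, j'', j) \bigr\} + (l_{i-1} + l_i)\, d_3(i-1, i, j)
\end{aligned} \tag{14}
```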

Comparing the first term inside the min of equation (7) with the expression for $h_1$ in equation (14), the first difference is that equation (14) performs a minimization over $j'' \in Q(j')$ when computing $h_1$, whereas equation (7) has only the single term for $j'' = j-1$. This is because, when the features are serialized in one dimension, it is guaranteed that $i$ follows $i-1$ and that $j$ follows $j-1$, so that when $j$ corresponds to $i$, the only partner that needs to be considered for $i-1$ is $j-1$.

When the features are not serialized, on the other hand, every $j''$ that could be the partner of $i-1$ when $j$ corresponds to $i$ must be considered, which is why the condition $j'' \in Q(j')$ is needed. Here $Q(j')$ is the set of line segments that can precede, as described above. Using such a subset for $Q(j')$, rather than the set of all line segments, speeds up the calculation and also eliminates unnatural correspondences.

The second difference is the presence of the $\gamma$ term. Equation (7) evaluates the distance between the features that have been put into correspondence, but it does not evaluate the distortion of $C = (i, j)$. Even if warping $C = (i, j)$ yields a perfect match, it is natural to consider that the evaluation should fall when the amount of distortion is large. Therefore, at each step of the evaluation, a function $\gamma$ is defined of the difference between the transition from $i'-1$ to $i'$ on the mask side and the corresponding transition from $j''$ to $j'$ on the input side, and the sum of this distortion and the distance between features is used as the overall evaluation value.

Next, the distance between the above features (contour segments) is defined by equation (15).

In equation (15), the symbol $\sim$ in $d_1$ denotes the angular difference operation, the result of which lies between $-\pi$ and $\pi$. The coefficient $2/\pi$ makes an angular difference of $\pi/2$ count the same as a difference of one unit of length. For the length, a difference of up to a factor of two is tolerated with an evaluation of 0. For $d_2$ and $d_3$, the above distance is apportioned in proportion to length.

The $\gamma$ term evaluates the amount of distortion from the positional relationship between the start point of the line segment being matched ($i'$ on the A side, $j'$ on the B side) and the end point of the line segment one before it ($i'-1$ on the A side, $j''$ on the B side), and is defined as

$$\gamma(i'-1, i';\ j'', j') = \Bigl|\, |\omega^b_{j'} - \omega^e_{j''}| - |Z^b_{i'} - Z^e_{i'-1}| \,\Bigr| \tag{16}$$

Normally the mask-side line segments are continuous, so $|Z^b_{i'} - Z^e_{i'-1}| = 0$, but the value is not 0 when moving from one contour to the next.
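The distortion term of equation (16) can be sketched in Python as follows. The city-block norm for the absolute value of a vector is an assumption based on the rhombus remark made for equation (12), and the function name `gamma` and its argument order are chosen here for illustration.

```python
def l1(p, q):
    """Assumed vector norm: sum of absolute coordinate differences."""
    return abs(p[0] - q[0]) + abs(p[1] - q[1])


def gamma(mask_prev_end, mask_begin, input_prev_end, input_begin):
    """Equation (16): difference between the input-side and mask-side gaps."""
    input_gap = l1(input_begin, input_prev_end)  # |w^b_j' - w^e_j''|
    mask_gap = l1(mask_begin, mask_prev_end)     # |Z^b_i' - Z^e_i'-1|
    return abs(input_gap - mask_gap)


# Continuous mask contour (gap 0) matched against an input stroke broken by 3 units.
print(gamma((4, 0), (4, 0), (4, 0), (5, 2)))  # 3
```

When both sides are continuous the term is 0, so it only penalizes correspondences that jump across a gap the mask does not have, or the reverse.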

The initial conditions are

$$\begin{aligned} h_3(i, j) &= \infty && (i = 1) \\ g(i'-1, j'') &= 0 && (i' = 1) \\ \gamma(i'-1, i';\ j'', j') &= 0 && (i' = 1) \end{aligned} \tag{17}$$

The dissimilarity under the overall optimal correspondence between the mask side A and the input side B, corresponding to equation (8) above, is obtained as

$$D_3(A, B) = \frac{1}{\sum_i l_i} \min_j \{ g(I, j) \} \tag{18}$$

When the dissimilarity $D_3$ of equation (18) is calculated in this way, a correspondence with the mask side can be established even for an input whose sequence order has changed, and an evaluation quantity that takes the size of line breaks into account can be obtained at high speed using the DP technique.
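The whole procedure of equations (13), (16), (17), and (18) can be sketched in Python as below. Several parts are assumptions, because equations (11), (14), and (15) are not legible in this text: the city-block norm, a direction-only distance d = (2/π)|angle difference| with the averages of equation (9) for d_2 and d_3, and a form of h_1, h_2, h_3 inferred from variation (3) and the FIG. 5 explanation. The names `dissimilarity`, `pred`, and `q` are hypothetical, and the mask is treated as a single contour given in tracing order.

```python
import math


def l1(p, q):
    return abs(p[0] - q[0]) + abs(p[1] - q[1])


def direction(seg):
    (x0, y0), (x1, y1) = seg
    return math.atan2(y1 - y0, x1 - x0)


def d(mask_seg, in_seg):
    """Assumed distance: (2/pi) times the wrapped angular difference."""
    diff = (direction(in_seg) - direction(mask_seg) + math.pi) % (2 * math.pi) - math.pi
    return (2 / math.pi) * abs(diff)


def dissimilarity(mask, inp, pred, q):
    """mask, inp: lists of (start, end); pred[j] = P(j); q[j] = Q(j) as index lists."""
    length = [l1(b, e) for b, e in mask]

    def best(g_old, i_to, j_to):
        # min over j'' in Q(j_to) of g(i_to - 1, j'') + gamma(i_to - 1, i_to; j'', j_to)
        if i_to == 0:
            return 0.0  # initial conditions (17): g = 0 and gamma = 0
        mask_gap = l1(mask[i_to][0], mask[i_to - 1][1])
        return min((g_old[k] + abs(l1(inp[j_to][0], inp[k][1]) - mask_gap)
                    for k in q[j_to]), default=math.inf)

    g1 = [0.0] * len(inp)  # g(i-1, .)
    g2 = [0.0] * len(inp)  # g(i-2, .)
    for i in range(len(mask)):
        g0 = []
        for j in range(len(inp)):
            pj = pred[j]
            h1 = best(g1, i, j) + length[i] * d(mask[i], inp[j])
            h2 = best(g1, i, pj) + length[i] * (d(mask[i], inp[pj]) + d(mask[i], inp[j])) / 2
            h3 = math.inf if i == 0 else (
                best(g2, i - 1, j) + (length[i - 1] + length[i])
                * (d(mask[i - 1], inp[j]) + d(mask[i], inp[j])) / 2)
            g0.append(min(h1, h2, h3))  # equation (13)
        g2, g1 = g1, g0                 # shift the stage buffers
    return min(g1) / sum(length)        # equation (18)


square = [((0, 0), (4, 0)), ((4, 0), (4, 4)), ((4, 4), (0, 4)), ((0, 4), (0, 0))]
reordered = square[2:] + square[:2]     # same contour, traced from another segment
ring = [3, 0, 1, 2]
print(dissimilarity(square, reordered, ring, [[k] for k in ring]))  # 0.0
```

The usage lines match a square mask against the same square traced from a different starting segment. Because the input side is not assumed to be in mask order, the result is still 0.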

[Embodiments of the invention]

FIG. 5 is a block diagram for explaining the matching procedure of one embodiment of this invention.

In the figure, $h_1$, $h_2$, and $h_3$ are the calculation blocks for the corresponding expressions of equation (14). The data input to calculation block $h_1$ are the features $a_i$, $l_i$, and $|Z^b_i - Z^e_{i-1}|$ from the mask storage unit A, the features $\alpha_j$, $\lambda_j$, and $|\omega^b_j - \omega^e_{j''}|$ from the input buffer unit B, and the recursive evaluation quantities $g(i-1, j'')$, $j'' \in Q(j)$, of stage $i-1$. The values $g(i-1, j'')$ are held in the temporary storage block $g_{-1}$.

The data input to calculation block $h_2$ are $a_i$, $l_i$, $|Z^b_i - Z^e_{i-1}|$, $\alpha_{P(j)}$, $\lambda_{P(j)}$, $\alpha_j$, $\lambda_j$, $|\omega^b_{P(j)} - \omega^e_{j''}|$ with $j'' \in Q(P(j))$, and $g(i-1, j'')$ from block $g_{-1}$.

The data input to calculation block $h_3$ are the mask-side features $a_{i-1}$, $l_{i-1}$, $a_i$, $l_i$, and $|Z^b_{i-1} - Z^e_{i-2}|$, the input-side features $\alpha_j$, $\lambda_j$, and $|\omega^b_j - \omega^e_{j''}|$, and the recursive evaluation quantities $g(i-2, j'')$, $j'' \in Q(j)$, of stage $i-2$. The values $g(i-2, j'')$ are held in the temporary storage block $g_{-2}$.

The values of equation (14) calculated in calculation blocks $h_1$, $h_2$, and $h_3$ are sent to the minimum value calculation block $\min_1$ for equation (13), where the recursive evaluation quantity for each $j$ at stage $i$ is calculated and sent to the temporary storage block $g_0$, completing the calculation of stage $i$.

Next, in preparation for the calculation of stage $i+1$, the entire contents of $g_{-1}$ are first sent to $g_{-2}$, and then the entire contents of $g_0$ are sent to $g_{-1}$. At this point storage block $g_{-1}$ holds the values $g(i, j)$ and storage block $g_{-2}$ holds the values $g(i-1, j)$, which completes the preparation for the calculation of stage $i+1$.

The above is the operation of the iterative DP calculation that forms the main part of this invention. To describe the whole process, the initial value setting of equation (17) and the dissimilarity of equation (18) must also be covered, so the process is described below in order from the first stage. Each processing step is identified by a step numeral, and a subrange of these steps is executed repeatedly.

First, before processing begins, the initial value setting block INIT sets all values of $g_{-1}$ to zero. This corresponds to the second equation of (17). In the calculation for the first mask segment $i = 1$, zero is supplied to blocks $h_1$ and $h_2$ as the value of $\gamma$, and $\infty$ is output from $h_3$. That is, block $h_1$ outputs

$$h_1(1, j) = l_1 \cdot d_1(1, j)$$

block $h_2$ outputs

$$h_2(1, j) = l_1 \cdot d_2(1, P(j), j)$$

and their minimum is calculated in block $\min_1$. Block $h_3$ outputs $\infty$ during this step so that the value of $h_3$ is never selected as the output of $\min_1$. Next, the first-stage recursive evaluation quantities $g(1, j)$ calculated for each $j$ are all sent to block $g_0$ and stored, which completes the calculation of the first stage; $g_{-1}$ is then sent to $g_{-2}$, and $g_0$ is sent to $g_{-1}$.

In the second-stage calculation, for the second mask segment $i = 2$, blocks $h_1$ and $h_2$ calculate the same values as in the normal iteration described above. In block $h_3$, on the other hand, the values set in block $g_{-1}$ before the first stage have been moved to block $g_{-2}$, so the values in block $g_{-2}$ are all zero (the middle equation of (17)). In addition, by the third equation of (17), the values of $\gamma$ are all taken to be zero. The results of $h_1$, $h_2$, and $h_3$ are then sent to block $\min_1$, where the second-stage recursive evaluation quantity $g(2, j)$ for each $j$ is calculated and stored in block $g_0$, completing the second stage. The data are then moved from $g_{-1}$ to $g_{-2}$ and from $g_0$ to $g_{-1}$, completing the preparation for the third stage.

The iterative processing from the third stage onward is as described above. Finally, $g(I, j)$, $j = 1$ to $J$, at stage $I$ is calculated by repeating the steps over $j$ and sent to block $\min_2$, where the minimum over $j$ in equation (18) is found and divided by $\sum l_i$ to give the overall dissimilarity $D_3(A, B)$ under the optimal correspondence.

The above is one embodiment of this invention. The modifications described in (1) to (5) below, and combinations of them, can easily be made.

(1) In equation (14), the range of the min is taken as $j'' \in Q(j')$ on the assumption that every line segment may be disconnected from its predecessor. At points where no disconnection is possible, calculating with $j'' = P(j')$ removes the need for the minimization in equation (14) and speeds up the calculation. Concretely, if FIG. 2(a) is regarded as the mask, $j'' \in Q(j')$ is used for $i' = 2, 4, 9$, where disconnection is possible, and $j'' = P(j')$ is used for the other line segments.

(2) In equations (14) and (15), the length $l_i$ of each line segment is variable, so the weight $l_i$ appears in the formulas. If the line segments are made of fixed length, or if each contour point is used as is without line-segment approximation, the length no longer needs to be considered. That is, $d_1$, $d_2$, and $d_3$ of equation (15) can be rewritten in the form of equation (9), with $d$ written as

$$d(i, j) = \frac{2}{\pi}\, |\alpha_j \sim a_i|$$

which simplifies the calculation.
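The simplified direction distance of variation (2) can be sketched in Python as follows. The wrapping of the angular difference into the range from −π to π uses a modulo formula chosen here for illustration; the function names `angle_diff` and `direction_distance` are assumptions.

```python
import math


def angle_diff(alpha: float, a: float) -> float:
    """Angular difference alpha ~ a, wrapped into the range [-pi, pi)."""
    return (alpha - a + math.pi) % (2 * math.pi) - math.pi


def direction_distance(alpha: float, a: float) -> float:
    """d(i, j) = (2 / pi) * |alpha_j ~ a_i|; a right angle counts as about 1."""
    return (2 / math.pi) * abs(angle_diff(alpha, a))


print(direction_distance(0.0, 0.0))  # 0.0
```

Directions of 3.0 and −3.0 radians are numerically far apart but nearly parallel; the wrapped difference treats them as close, which a plain subtraction would not.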

(3) Equations (13) and (14) can be rearranged as

$$g(i, j) = \min \begin{cases} g'(i, j) + l_i\, d_1(i, j) \\ g'(i, P(j)) + l_i\, d_2(i, j', j) \\ g'(i-1, j) + (l_{i'} + l_i)\, d_3(i', i, j) \end{cases}$$

$$g'(i', j') = \min_{j'' \in Q(j')} \{ g(i'-1, j'') + \gamma(i'-1, i';\ j'', j') \}$$

where $j' = P(j)$ in the second line and $i' = i-1$ in the third. That is, by calculating $g'(i', j')$ as an intermediate result between stage $i-1$ and stage $i$, the calculation of $\gamma$ can be separated from the calculation of $d$.

(4) Something other than the contour, for example the center line, is used as the feature. In addition, the evaluation $d$ between features in equation (15) is carried out with information other than the angular difference added, for example position information or higher-order features such as relationships between lines.

(5) The DP processing described in the above embodiment is used only to establish the correspondence between line segments, and discrimination between patterns is performed with a separate evaluation function. Alternatively, the DP processing is applied not to the whole contour but to each part of it.

[Effect of the invention]

As described in detail above, according to this invention, a dissimilarity that tolerates topological deformations such as line breaks and performs a globally optimal evaluation can be obtained at high speed, using the DP technique, for feature vectors that are not serialized. Furthermore, by adding not only the distance between features but also the spatial distortion to the evaluation quantity, and by defining a set of line segments that can "precede", unnatural correspondences are excluded while the processing is sped up. This invention is not limited to use as a dissimilarity between contour feature vectors in character recognition, and can be applied widely to pattern recognition.

[Brief explanation of drawings]

FIG. 1 is a diagram explaining the principle of DP matching between serialized feature vectors, as in speech. FIGS. 2(a) and 2(b) are diagrams explaining the contour segment features used in this invention and how the sequence of line segments changes when a line is broken. FIG. 3 is a diagram explaining the features of a contour segment and the contour segments that can "precede" it. FIGS. 4(a), 4(b), and 4(c) are diagrams explaining the operation at each stage of the DP matching of this invention. FIG. 5 is a block diagram for explaining the matching procedure of one embodiment of this invention.

In the figures, M is the mask, N is the input, $h_1$, $h_2$, and $h_3$ are calculation blocks, $g_0$, $g_{-1}$, and $g_{-2}$ are temporary storage blocks for the recursive evaluation quantities, INIT is the initial value setting block, and $\min_1$ and $\min_2$ are minimum value calculation blocks.

Claims

[Claims]

1. A pattern matching method comprising, for a set of features of a standard pattern, $A = \{A_1, A_2, \ldots, A_i, \ldots, A_I\}$, and a set of features of an unknown input pattern, $B = \{B_1, B_2, \ldots, B_j, \ldots, B_J\}$:
means for calculating an evaluation quantity d of the distance between the features of one point i' = i, or two points i' = i-1 and i, on the A side and the features of one point j' = j, or two points consisting of a point j' preceding j and j, on the B side;
means for defining in advance, for each j', a part of B as the j'' that corresponds to i'-1 when j' corresponds to i'; and
means for calculating an evaluation quantity r concerning the difference between the transition from i'-1 to i' and the corresponding transition from j'' to j';
wherein, in calculating the recursive evaluation quantity by dynamic programming at the i-th stage (i = 1, 2, ..., I), the quantity d, the quantity r for the i' and j' used in calculating d and for a j'' defined in advance by the defining means, and the recursive evaluation quantity g(i'-1, j'') for the i' and j'' used in calculating d and r are added together, and the minimum of the added values over the range of i', j', and j'' permitted by the above means and definitions is taken as the recursive evaluation quantity g(i, j) of the optimal correspondence for i.