JPH0253830B2

JPH0253830B2 -

Info

Publication number: JPH0253830B2
Application number: JP57108776A
Authority: JP
Inventors: Yoshihisa Fujii; Hiroshi Kamata
Original assignee: Fujitsu Ltd
Current assignee: Fujitsu Ltd
Priority date: 1982-06-24
Filing date: 1982-06-24
Publication date: 1990-11-19
Also published as: JPS58225489A

Description

【発明の詳細な説明】 (A) 発明の技術分野本発明は、漢字認識装置、特に例えば手書き漢
字に対する認識処理を行うに当つて、文字輪郭の
左・右・上・下の各線分についての系列を抽出
し、辞書にもたせた上記各線分についての冗長度
をもたせた系列と系列に沿つて照合してゆくよう
にした漢字認識装置に関するものである。[Detailed Description of the Invention] (A) Technical Field of the Invention The present invention relates to a kanji recognition device, in particular, when performing recognition processing for handwritten kanji. The present invention relates to a kanji recognition device that extracts a sequence and matches it with a redundant sequence of each line segment stored in a dictionary along the sequence.

(B) 技術の背景と問題点漢字特に手書き漢字に対する認識処理は、現在
の所仲々困難な段階にある。このような認識処理
の１つとして、漢字の背景部分に注目して大局的
な判断を行うようにする方式が考慮されている
が、当該背景（あるいは文字内部の空間）の２次
元的な特徴をとらえようとすると処理がきわめて
複雑となり易い。このことから、従来、手書き片
仮名などに適用されていた輪郭線分の利用に着目
することが考慮された。(B) Technical background and problems Recognition processing for kanji, especially handwritten kanji, is currently at a very difficult stage. As one such recognition process, a method is being considered that focuses on the background part of the kanji and makes a global judgment, but the two-dimensional characteristics of the background (or the space inside the character) are considered. If you try to capture this, the processing tends to become extremely complicated. For this reason, consideration has been given to focusing on the use of contour line segments, which has traditionally been applied to handwritten katakana.

(C) 発明の目的と構成本発明は上記の点を解決することを目的として
おり、本発明の漢字認識装置は、認識対象漢字文
字を走査して特徴を抽出し、標準漢字文字に対応
した特徴が格納されている辞書の内容と照合し
て、上記認識対象漢字文字のカテゴリを決定する
漢字認識装置において、上記認識対象漢字文字
を、少なくとも、水平右から左方向へ向う探索に
よつて得られる文字左輪郭と、水平左から右方向
へ向う探索によつて得られる文字右輪郭と、上方
から下方向へ向う探索によつて得られる文字上輪
郭と、下方から上方向へ向う探索によつて得られ
る文字下輪郭とにもとづいて、上記文字左輪郭に
沿う輪郭左線分の系列と、上記文字右輪郭に沿う
輪郭右線分の系列と、上記文字左輪郭に沿う輪郭
上線分の系列と、上記文字下輪郭に沿う輪郭下線
分の系列とを抽出すると共に、上記辞書中に、上
記各輪郭線分系列を当該系列に冗長度をもたせて
夫々格納してなり、上記認識対象漢字文字から得
られた上記各輪郭線分系列と上記辞書から読出さ
れた上記各輪郭線分系列とを系列に沿つて照合し
てゆくようにしたことを特徴としている。以下図
面を参照しつつ説明する。(C) Purpose and Structure of the Invention The purpose of the present invention is to solve the above-mentioned problems, and the kanji recognition device of the present invention scans the kanji characters to be recognized and extracts features, In a kanji recognition device that determines the category of the kanji character to be recognized by comparing it with the contents of a dictionary in which features are stored, the kanji character to be recognized is obtained by at least a horizontal search from the right to the left. The left contour of the character obtained by searching from the horizontal left to the right, the upper contour of the character obtained by searching from the top to the bottom, and the top contour of the character obtained by searching from the bottom to the top. Based on the lower contour of the character obtained by and a series of contour underline segments along the lower contour of the character, and each of the contour line segment series is stored in the dictionary with redundancy in the series, and the recognition target kanji character is extracted. The present invention is characterized in that each of the contour line segment series obtained from the above is compared with each of the contour line segment series read out from the dictionary along the series. This will be explained below with reference to the drawings.

(D) 発明の実施例第１図は本発明にいう輪郭線分の系列を説明す
る説明図、第２図は本発明において辞書中に格納
される系列を説明する説明図、第３図は本発明の
一実施例構成を示す。(D) Embodiments of the Invention FIG. 1 is an explanatory diagram for explaining the series of contour line segments according to the present invention, FIG. 2 is an explanatory diagram for explaining the series stored in the dictionary according to the present invention, and FIG. 1 shows the configuration of an embodiment of the present invention.

第１図Ａに例示する漢字１例えば手書き漢字
「資」が与えられたとき、図示矢印方向に走査２
を行い、 (i) 背景の左白領域から最初に黒領域に達した点
Ａを抽出する。 When the kanji 1 illustrated in Figure 1A is given, for example, the handwritten kanji ``shi'', scan 2 in the direction of the arrow shown in the figure.
(i) Extract the point A that first reaches the black area from the left white area of the background.

(ii) 次に黒領域から白領域に達した点αを抽出す
る。(ii) Next, extract the point α that reaches the white area from the black area.

(iii) 次に白領域から黒領域に達した点Ｂを抽出す
る。(iii) Next, extract point B that reaches the black area from the white area.

(iv) 次に黒領域から白領域に達した点ｂを抽出す
る。(iv) Next, extract the point b that reaches the white area from the black area.

(v) 最後に黒領域から背景の右白領域に達した点
ｂを抽出する。(v) Finally, extract the point b that reaches the right white area of the background from the black area.

ようにする。do it like this.

そして、上記点Ａに対応する各走査毎の点につ
いて例えば上下方向に連らねて輪郭左線分を、第
１図Ｂ図示Ｌ１，Ｌ２，……の如く抽出する。な
お、このとき、上下に並ぶ２つの走査に対応して
得られた上記点Ａに対応する点の水平位置が閾値
以上離れていれば、線分が不連続であるとみる。
また各線分の始端や終端が文字の黒領域によつて
封さがれている場合（図示黒丸）と封さがれてい
ない場合（図示白丸）とを区別して抽出する。 Then, for each point corresponding to the point A in each scan, left contour line segments are extracted in series in the vertical direction, for example, as shown in L1, L2, . . . in FIG. 1B. Note that at this time, if the horizontal positions of the points corresponding to the above-mentioned point A obtained in correspondence with the two vertically arranged scans are separated by a threshold value or more, the line segment is considered to be discontinuous.
In addition, cases in which the start and end of each line segment are sealed by a black area of a character (black circles in the figure) and cases in which they are not sealed (white circles in the figure) are extracted separately.

上記と同様に第１図図示点ｎに対応する点を連
らねて輪郭右線分を、第１図Ｂ図示Ｒ１，Ｒ２，
……の如く抽出する。勿論、この抽出に当つて、
改めて右側から左方向へ向う走査をやり直しても
よい。 Similarly to the above, the right line segment of the contour is created by connecting the points corresponding to the point n shown in FIG.
Extract as follows. Of course, in this extraction,
The scan from the right side to the left may be performed again.

更に必要に応じて、文字のストロークによつて
挾まれている白領域をコード化すべく、第１図Ａ
図示の点ａと点Ｂとの中央点、点ｂと点Ｎとの中
央点……を抽出し、これら夫々の中央点を上下方
向に連らねて、本発明にいう輪郭中線分を抽出
し、その系列を抽出することができる。 Furthermore, if necessary, in order to code the white area between the strokes of the character,
The center point between point a and point B shown in the figure, the center point between point b and point N, etc. are extracted, and these respective center points are connected in the vertical direction to form the contour midline segment according to the present invention. You can extract and extract the series.

第１図Ｂに示す如く抽出された輪郭左線分Ｌ
１，Ｌ２，……や輪郭右線分Ｒ１，Ｒ２，……に
ついて、第２図Ａ図示の如く、矢印３の方向に輪
郭左線分系列４や輪郭右線分系列５をつくる。こ
の系列は、認識対象漢字１の輪郭特徴を代表して
いることは言うまでもない。 Extracted contour left line segment L as shown in Figure 1B
1, L2, . . . and right contour line segments R1, R2, . It goes without saying that this series represents the outline features of the kanji to be recognized 1.

このようにして得られた輪郭線分系列が、本発
明の場合、辞書中の輪郭線分系列と矢印３の方向
に順次照合されてゆく。第２図Ｂは、文字「資」
に対応する所の辞書中の輪郭左線分系列６を示し
ている。図示の符号７，８はいずれでも可を示
し、符号９，１０は省略化を示し、符号１１は系
列終点を示している。辞書中の系列６において、
符号７，８や符号９，１０の如く冗長度を与えた
のは、文字の変形に対処するためと考えてよい。 In the case of the present invention, the contour line segment series obtained in this manner is sequentially compared with the contour line segment series in the dictionary in the direction of arrow 3. Figure 2 B shows the character “Shi”
The left contour line segment series 6 in the dictionary corresponding to is shown. In the figure, numerals 7 and 8 indicate either OK, numerals 9 and 10 indicate abbreviation, and numeral 11 indicates a series end point. In series 6 in the dictionary,
The reason why redundancy is given as in symbols 7 and 8 and symbols 9 and 10 can be considered to be to cope with deformation of characters.

上記特に輪郭左線分系列を利用して説明した如
き、輪郭線分系列が、例えば(i)輪郭左線分系列、
(ii)輪郭右線分系列、(iii)輪郭上線分系列、(iv)輪郭
下
線分系列の４通り抽出される。また文字ストロー
ク間に存在する線分系列として、(v)水平方向走査
時の輪郭中線分系列、(vi)垂直方向走査時の輪郭央
線分系列の２通りが抽出される。そして、夫々に
ついて上記と同様な照合が行われる。 The contour line segment series as explained above using the contour left line segment series is, for example, (i) the contour left line segment series,
Four types are extracted: (ii) contour right line segment series, (iii) contour upper line segment series, and (iv) contour lower line segment series. Furthermore, two types of line segment series existing between character strokes are extracted: (v) a contour center line segment series during horizontal direction scanning, and (vi) a contour center line segment series during vertical direction scanning. Then, the same verification as above is performed for each.

第３図は本発明の一実施例構成を示している。
図中の符号１２は水平方向線分抽出回路、１３は
垂直方向線分抽出回路、１４，１５は夫々特徴線
分系列（線分系列）連結回路であつて第１図Ｂ図
示の線分Ｌ１，Ｌ２，……の如き系列を得るも
の、１６は輪郭左線分系列バツフア、１７は輪郭
右線分系列バツフア、１８は輪郭中線分系列バツ
フア、１９は輪郭上線分系列バツフア、２０は輪
郭下線分系列バツフア、２１は輪郭央線分系列バ
ツフア、２２は辞書、２３ないし２８は夫々照合
判定回路、２９は決定回路を表わしている。 FIG. 3 shows the configuration of an embodiment of the present invention.
In the figure, reference numeral 12 is a horizontal line segment extraction circuit, 13 is a vertical line segment extraction circuit, and 14 and 15 are characteristic line segment series (line segment series) connection circuits, which are the line segment L1 shown in FIG. 1B. , L2, ..., 16 is a contour left line segment series buffer, 17 is a contour right line segment series buffer, 18 is a contour middle line segment series buffer, 19 is a contour line segment series buffer, and 20 is a contour line segment series buffer. 21 is a contour center line segment series buffer; 22 is a dictionary; 23 to 28 are matching judgment circuits; and 29 is a determining circuit.

図示特徴線分連結回路１４は、第１図Ｂに関連
して説明した線分Ｌ１，Ｌ２，……やＲ１，Ｒ
２，……の連結状態を調べ、夫々の対応するバツ
フア１６，１７，１８に格納する。特徴線分連結
回路１５は、垂直方向走査に対応するものであつ
て、回路１４と同様に動作する。 The illustrated characteristic line segment connection circuit 14 includes the line segments L1, L2, . . . and R1, R
2, . . . are checked and stored in the corresponding buffers 16, 17, 18, respectively. The feature line segment connection circuit 15 corresponds to vertical scanning and operates in the same manner as the circuit 14.

各バツフア１６ないし２１の内容は、辞書２２
から読出された各線分系列と、照合判定回路２３
ないし２８によつて照合される。この場合、第２
図Ａ図示の矢印３の方向に順次照合されてゆくも
のと考えてよい。 The contents of each buffer 16 to 21 can be found in the dictionary 22.
Each line segment series read out from the matching judgment circuit 23
to 28. In this case, the second
It may be considered that the verification is performed sequentially in the direction of arrow 3 shown in Figure A.

各照合判定回路２３ないし２８からの判定結果
は決定回路２９に導びかれ、それらを綜合的に調
べて、決定回路２９が認識対象漢字文字のカテゴ
リを決定する。 The determination results from each of the comparison determination circuits 23 to 28 are led to a determination circuit 29, which comprehensively examines them and determines the category of the Chinese character to be recognized.

(E) 発明の効果以上説明した如く、本発明によれば、文字の輪
郭線分系列を手書き漢字の認識などに利用でき
る。認識対象文字の２次元的特徴を抽出すること
が容易であり、かつ辞書中に多少の冗長度をもつ
ものを用意しておくことによつて十分な認識率を
得ることが可能となる。(E) Effects of the Invention As explained above, according to the present invention, a series of character outline segments can be used for recognition of handwritten kanji characters, etc. It is easy to extract two-dimensional features of characters to be recognized, and by preparing a dictionary with some redundancy, it is possible to obtain a sufficient recognition rate.

[Brief explanation of the drawing]

第１図は本発明にいう輪郭線分の系列を説明す
る説明図、第２図は本発明において辞書中に格納
される系列を説明する説明図、第３図は本発明の
一実施例構成を示す。図中、１は認識対象漢字、２は走査線、４，
５，６は夫々輪郭線分系列、１２，１３は夫々線
分抽出回路、１４，１５は夫々線分系列連結回
路、１６ないし２１は夫々線分系列バツフア、２
２は辞書、２３ないし２８は夫々照合判定回路、
２９は決定回路を表わしている。 FIG. 1 is an explanatory diagram for explaining the series of contour line segments according to the present invention, FIG. 2 is an explanatory diagram for explaining the series stored in the dictionary according to the present invention, and FIG. 3 is an explanatory diagram for explaining the configuration of an embodiment of the present invention. shows. In the figure, 1 is the kanji to be recognized, 2 is the scanning line, 4,
5 and 6 are contour line segment series, 12 and 13 are line segment extraction circuits, 14 and 15 are each line segment series concatenation circuits, 16 to 21 are line segment series buffers, respectively.
2 is a dictionary; 23 to 28 are respective matching judgment circuits;
29 represents a decision circuit.

Claims

[Claims]

1 Scan the kanji characters to be recognized and extract the features,
In a kanji recognition device that determines the category of the kanji character to be recognized by comparing it with the contents of a dictionary that stores features corresponding to standard kanji characters, the kanji character to be recognized is at least horizontally moved from right to left. The left contour of the character obtained by searching backwards, the right contour of the character obtained by searching horizontally from left to right, the top contour of the character obtained by searching from top to bottom, and the top contour of the character obtained by searching from bottom to top. Based on the lower contour of the character obtained by searching in the direction, a series of contour left line segments along the left contour of the character, a series of right contour line segments along the right contour of the character, and a series of contour right line segments along the left contour of the character, and the upper contour of the character. A series of contour upper line segments that follow the character lower contour and a series of contour lower line segments that follow the character lower contour are extracted, and each of the contour line segment series is stored in the dictionary with redundancy added to the series. , a kanji recognition device characterized in that each of the contour segment series obtained from the kanji character to be recognized is compared with each of the contour segment series read from the dictionary along the series. .