JPH0253830B2 - - Google Patents

Info

Publication number
JPH0253830B2
Authority
JP
Japan
Prior art keywords
contour
character
series
kanji
line segment
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Expired
Application number
JP57108776A
Other languages
Japanese (ja)
Other versions
JPS58225489A (en)
Inventor
Yoshihisa Fujii
Hiroshi Kamata
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Fujitsu Ltd
Original Assignee
Fujitsu Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Fujitsu Ltd
Priority to JP57108776A
Publication of JPS58225489A
Publication of JPH0253830B2
Application granted

Classifications

    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06VIMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V30/00Character recognition; Recognising digital ink; Document-oriented image-based pattern recognition
    • G06V30/10Character recognition
    • G06V30/28Character recognition specially adapted to the type of the alphabet, e.g. Latin alphabet
    • G06V30/287Character recognition specially adapted to the type of the alphabet, e.g. Latin alphabet of Kanji, Hiragana or Katakana characters

Landscapes

  • Engineering & Computer Science (AREA)
  • Computer Vision & Pattern Recognition (AREA)
  • Physics & Mathematics (AREA)
  • General Physics & Mathematics (AREA)
  • Multimedia (AREA)
  • Theoretical Computer Science (AREA)
  • Character Discrimination (AREA)

Description

[Detailed Description of the Invention]

(A) Technical Field of the Invention
The present invention relates to a kanji recognition device, and in particular to a device that, when recognizing handwritten kanji for example, extracts a series of line segments for each of the left, right, upper, and lower contours of a character and matches each series, in order along the series, against the corresponding series stored in a dictionary with built-in redundancy.

(B) Technical Background and Problems
Recognition of kanji, and of handwritten kanji in particular, is still quite difficult at present. One approach under consideration is to focus on the background portion of a kanji and make a global judgment. However, trying to capture the two-dimensional features of the background (or of the space inside the character) tends to make the processing extremely complicated. For this reason, attention turned to the use of contour line segments, a technique previously applied to handwritten katakana and the like.

(C) Object and Configuration of the Invention
The present invention aims to solve the above problem. The kanji recognition device of the invention scans a kanji character to be recognized, extracts its features, and matches them against the contents of a dictionary storing the features of standard kanji characters, thereby determining the category of the character to be recognized. The device is characterized as follows. Based on at least the character left contour obtained by a horizontal search directed from right to left, the character right contour obtained by a horizontal search directed from left to right, the character upper contour obtained by a search directed from top to bottom, and the character lower contour obtained by a search directed from bottom to top, it extracts a series of left contour line segments along the left contour, a series of right contour line segments along the right contour, a series of upper contour line segments along the upper contour, and a series of lower contour line segments along the lower contour. Each of these contour line segment series is stored in the dictionary with redundancy given to the series, and each series obtained from the character to be recognized is matched, along the series, against the corresponding series read from the dictionary. The invention is described below with reference to the drawings.

(D) Embodiments of the Invention
FIG. 1 is an explanatory diagram of the contour line segment series referred to in the present invention, FIG. 2 is an explanatory diagram of the series stored in the dictionary, and FIG. 3 shows the configuration of one embodiment of the present invention.

When a kanji 1 such as the one illustrated in FIG. 1A is given, for example the handwritten kanji 「資」, scans 2 are performed in the direction of the arrows shown, and on each scan the following points are extracted:

(i) point A, where the scan first passes from the left white background region into a black region;

(ii) next, point a, where the scan passes from the black region into a white region;

(iii) next, point B, where the scan passes from the white region into a black region;

(iv) next, point b, where the scan passes from the black region into a white region;

(v) finally, point n, where the scan passes from a black region into the right white background region.
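The per-scan extraction of points A, a, B, b, ..., n can be sketched as follows. This is a minimal illustration rather than the patent's circuit: it assumes a binarized scan line given as a list of 0 (white) and 1 (black) pixels, and the function name `scan_transitions` is hypothetical.

```python
def scan_transitions(row):
    """Return (enter, leave) column lists for one left-to-right scan line.

    row is a sequence of 0 (white) and 1 (black) pixels (an assumption;
    the patent does not specify the image representation).
    enter[k] is the first black column of the k-th stroke (points A, B, ...).
    leave[k] is the last black column of the k-th stroke (points a, b, ..., n).
    """
    enter, leave = [], []
    prev = 0  # the background to the left of the character is white
    for x, pixel in enumerate(row):
        if pixel and not prev:
            enter.append(x)      # white -> black transition
        elif prev and not pixel:
            leave.append(x - 1)  # black -> white transition
        prev = pixel
    if prev:  # a stroke touching the right edge ends at the last column
        leave.append(len(row) - 1)
    return enter, leave


row = [0, 0, 1, 1, 0, 0, 1, 0, 1, 1, 1, 0]
enter, leave = scan_transitions(row)
print(enter)  # [2, 6, 8]
print(leave)  # [3, 6, 10]
```

Here `enter[0]` plays the role of point A (left contour) and `leave[-1]` the role of point n (right contour) for this scan line.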

The points corresponding to point A on each scan are then linked, for example in the vertical direction, to extract left contour line segments such as L1, L2, ... shown in FIG. 1B. If the horizontal positions of the points corresponding to point A on two vertically adjacent scans differ by a threshold or more, the line segment is regarded as discontinuous at that place. In addition, for the start and end of each line segment, the case where the end is closed off by a black region of the character (black circles in the figure) is distinguished from the case where it is not (white circles in the figure).
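The linking rule with a discontinuity threshold can be sketched as below. This is an illustrative sketch under assumptions: the input is one contour column per scan (or `None` where the scan meets no black pixel), the name `link_segments` and the threshold value are hypothetical, and the open/closed endpoint distinction is not modeled.

```python
def link_segments(points, threshold):
    """Group per-scan contour columns into vertically connected segments.

    points[y] is the column of the contour point (e.g. point A) on scan y,
    or None when the scan meets no black pixel. A new segment starts when
    two consecutive scans differ horizontally by `threshold` or more.
    Returns a list of segments, each a list of (y, x) pairs.
    """
    segments, current = [], []
    for y, x in enumerate(points):
        if x is None:
            if current:
                segments.append(current)
            current = []
            continue
        if current and abs(x - current[-1][1]) >= threshold:
            segments.append(current)  # horizontal jump: close the segment
            current = []
        current.append((y, x))
    if current:
        segments.append(current)
    return segments


print(link_segments([5, 5, 6, 2, 2, None, 3], threshold=3))
# [[(0, 5), (1, 5), (2, 6)], [(3, 2), (4, 2)], [(6, 3)]]
```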

In the same way, the points corresponding to point n shown in FIG. 1 are linked to extract right contour line segments such as R1, R2, ... shown in FIG. 1B. Of course, for this extraction the scan may be redone from the right side toward the left.

Furthermore, if necessary, in order to encode the white regions enclosed between strokes of the character, the midpoint between point a and point B shown in FIG. 1A, the midpoint between point b and point N, and so on are extracted. These midpoints are linked in the vertical direction to extract what the present invention calls middle contour line segments, and a series of these can be extracted as well.
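The midpoint of each white gap between strokes on one scan can be computed directly from the stroke entry and exit columns. The sketch below assumes those columns are already available as two lists (entry columns for points A, B, ... and exit columns for points a, b, ...); the name `gap_midpoints` is hypothetical.

```python
def gap_midpoints(enter, leave):
    """Midpoint column of each white gap between consecutive strokes.

    enter[k] / leave[k] are the first and last black columns of the k-th
    stroke on one scan line. The gap after stroke k lies between leave[k]
    (point a, b, ...) and enter[k + 1] (point B, ..., N).
    """
    return [(leave[k] + enter[k + 1]) / 2 for k in range(len(enter) - 1)]


print(gap_midpoints([2, 6, 8], [3, 6, 10]))  # [4.5, 7.0]
```

Linking these midpoints across scans would follow the same threshold rule used for the left and right contour points.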

From the left contour line segments L1, L2, ... and the right contour line segments R1, R2, ... extracted as shown in FIG. 1B, a left contour line segment series 4 and a right contour line segment series 5 are formed in the direction of arrow 3, as shown in FIG. 2A. Needless to say, these series represent the contour features of the kanji 1 to be recognized.

In the present invention, the contour line segment series obtained in this way is matched sequentially, in the direction of arrow 3, against the contour line segment series in the dictionary. FIG. 2B shows the left contour line segment series 6 stored in the dictionary for the character 「資」. In the figure, reference numerals 7 and 8 indicate alternatives, either of which is acceptable; reference numerals 9 and 10 indicate elements that may be omitted; and reference numeral 11 indicates the end of the series. The redundancy given to series 6 in the dictionary, as at 7 and 8 or at 9 and 10, can be understood as a way of coping with deformation of the character.
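One way to read this redundancy is as a pattern whose elements may list alternatives or be skipped. The sketch below is an assumption-laden illustration, not the patent's matching circuit: the symbols `"S"`, `"C"`, `"O"`, `"L"` are hypothetical segment codes, and the `(allowed, optional)` encoding of a dictionary entry is invented for the example.

```python
def matches(series, pattern):
    """Match an observed segment series against a dictionary pattern, in order.

    pattern is a list of (allowed, optional) pairs (hypothetical encoding):
    `allowed` is a set of acceptable symbols (the "either is acceptable"
    case), and `optional` marks an element that may be omitted. The match
    succeeds only if the series and the pattern end together.
    """
    def step(i, j):
        if j == len(pattern):
            return i == len(series)  # series end point reached together
        allowed, optional = pattern[j]
        if i < len(series) and series[i] in allowed and step(i + 1, j + 1):
            return True
        return optional and step(i, j + 1)  # skip an omittable element

    return step(0, 0)


# Hypothetical dictionary entry: S, then C or O, then an omittable S, then L.
pattern = [({"S"}, False), ({"C", "O"}, False), ({"S"}, True), ({"L"}, False)]
print(matches(["S", "C", "L"], pattern))       # True
print(matches(["S", "O", "S", "L"], pattern))  # True
print(matches(["S", "L"], pattern))            # False
```

A single dictionary entry thus accepts several deformed variants of the same contour without storing each variant separately.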

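As explained above using the left contour line segment series in particular, four kinds of contour line segment series are extracted: (i) the left contour line segment series, (ii) the right contour line segment series, (iii) the upper contour line segment series, and (iv) the lower contour line segment series. In addition, two kinds of series are extracted for the line segments lying between character strokes: (v) the middle contour line segment series obtained during horizontal scanning and (vi) the center contour line segment series obtained during vertical scanning. Matching of the kind described above is performed for each of them.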
Four types are extracted: (ii) contour right line segment series, (iii) contour upper line segment series, and (iv) contour lower line segment series. Furthermore, two types of line segment series existing between character strokes are extracted: (v) a contour center line segment series during horizontal direction scanning, and (vi) a contour center line segment series during vertical direction scanning. Then, the same verification as above is performed for each.

FIG. 3 shows the configuration of one embodiment of the present invention. In the figure, reference numeral 12 denotes a horizontal line segment extraction circuit; 13 denotes a vertical line segment extraction circuit; 14 and 15 denote feature line segment series (line segment series) linking circuits, which obtain series such as the line segments L1, L2, ... shown in FIG. 1B; 16 denotes a left contour line segment series buffer; 17 denotes a right contour line segment series buffer; 18 denotes a middle contour line segment series buffer; 19 denotes an upper contour line segment series buffer; 20 denotes a lower contour line segment series buffer; 21 denotes a center contour line segment series buffer; 22 denotes a dictionary; 23 to 28 denote matching judgment circuits; and 29 denotes a decision circuit.

The feature line segment linking circuit 14 shown in the figure examines how the line segments L1, L2, ... and R1, R2, ... described with reference to FIG. 1B are connected, and stores the results in the corresponding buffers 16, 17, and 18. The feature line segment linking circuit 15 handles vertical scanning and operates in the same way as circuit 14.

The contents of each of the buffers 16 to 21 are matched by the matching judgment circuits 23 to 28 against the corresponding line segment series read from the dictionary 22. This matching can be thought of as proceeding sequentially in the direction of arrow 3 shown in FIG. 2A.

The judgment results from the matching judgment circuits 23 to 28 are passed to the decision circuit 29, which examines them together and determines the category of the kanji character to be recognized.
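The text does not say how the decision circuit combines the six judgment results. As one purely illustrative possibility, the sketch below assumes each matching circuit reports a set of candidate categories and the decision is a simple majority vote; the voting rule and the name `decide` are assumptions, not the patent's method.

```python
from collections import Counter


def decide(candidates_per_series):
    """Pick the category accepted by the most series matchers.

    candidates_per_series holds one collection of candidate categories per
    matching circuit (six in the embodiment). Majority voting is an
    assumption made for illustration. Returns None when there are no
    candidates or the top two categories tie.
    """
    votes = Counter()
    for candidates in candidates_per_series:
        votes.update(set(candidates))  # one vote per matcher per category
    if not votes:
        return None
    ranked = votes.most_common(2)
    if len(ranked) == 2 and ranked[0][1] == ranked[1][1]:
        return None  # ambiguous result: reject rather than guess
    return ranked[0][0]


results = [{"資", "貨"}, {"資"}, {"資", "賃"}, {"貨"}, {"資"}, {"資"}]
print(decide(results))  # 資
```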

(E) Effects of the Invention
As explained above, according to the present invention, contour line segment series of characters can be used for tasks such as recognizing handwritten kanji. The two-dimensional features of the character to be recognized are easy to extract, and by preparing dictionary entries with some redundancy, a sufficient recognition rate can be obtained.

[Brief Description of the Drawings]

FIG. 1 is an explanatory diagram of the contour line segment series referred to in the present invention, FIG. 2 is an explanatory diagram of the series stored in the dictionary, and FIG. 3 shows the configuration of one embodiment of the present invention.

In the figures, 1 is the kanji to be recognized; 2 is a scan line; 4, 5, and 6 are contour line segment series; 12 and 13 are line segment extraction circuits; 14 and 15 are line segment series linking circuits; 16 to 21 are line segment series buffers; 22 is a dictionary; 23 to 28 are matching judgment circuits; and 29 is a decision circuit.

Claims (1)

[Claims]

1. A kanji recognition device that scans a kanji character to be recognized to extract features and matches them against the contents of a dictionary storing features corresponding to standard kanji characters, thereby determining the category of the kanji character to be recognized, characterized in that: on the basis of at least a character left contour obtained by a horizontal search directed from right to left, a character right contour obtained by a horizontal search directed from left to right, a character upper contour obtained by a search directed from top to bottom, and a character lower contour obtained by a search directed from bottom to top, the device extracts from the kanji character to be recognized a series of left contour line segments along the character left contour, a series of right contour line segments along the character right contour, a series of upper contour line segments along the character upper contour, and a series of lower contour line segments along the character lower contour; each of the contour line segment series is stored in the dictionary with redundancy given to the series; and each contour line segment series obtained from the kanji character to be recognized is matched, along the series, against the corresponding contour line segment series read from the dictionary.
JP57108776A 1982-06-24 1982-06-24 Chinese character recognizing device Granted JPS58225489A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
JP57108776A JPS58225489A (en) 1982-06-24 1982-06-24 Chinese character recognizing device

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
JP57108776A JPS58225489A (en) 1982-06-24 1982-06-24 Chinese character recognizing device

Publications (2)

Publication Number Publication Date
JPS58225489A JPS58225489A (en) 1983-12-27
JPH0253830B2 true JPH0253830B2 (en) 1990-11-19

Family

ID=14493183

Family Applications (1)

Application Number Title Priority Date Filing Date
JP57108776A Granted JPS58225489A (en) 1982-06-24 1982-06-24 Chinese character recognizing device

Country Status (1)

Country Link
JP (1) JPS58225489A (en)

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JPH0588787U (en) * 1991-08-16 1993-12-03 孝尚 越智 Steel backing metal

Also Published As

Publication number Publication date
JPS58225489A (en) 1983-12-27

Similar Documents

Publication Publication Date Title
US4757551A (en) Character recognition method and system capable of recognizing slant characters
US5668892A (en) Table recognition apparatus
US4811412A (en) Method of a system for analyzing characters
KR100205726B1 (en) Handwriting recognition device
Bai et al. An approach to extracting the target text line from a document image captured by a pen scanner
JPH0253830B2 (en)
US11270146B2 (en) Text location method and apparatus
JP2675303B2 (en) Character recognition method
JPH0246988B2 (en)
JPH03142691A (en) Table format document recognizing system
JPH0253831B2 (en)
JP2606816B2 (en) Character reader
JPH0436432B2 (en)
JPS61188679A (en) Character recognition equipment by stroke approximate linearity extraction
KR100248384B1 (en) Individual character extraction method in multilingual document recognition and its recognition system
JP2575402B2 (en) Character recognition method
JP2797523B2 (en) Drawing follower
JPH0514952B2 (en)
EP0067236A1 (en) Character and figure isolating and extracting system
JPH06150056A (en) Table recognition device
JPS60110089A (en) Character recognizer
JPH02166583A (en) Character recognizing device
JPS63229586A (en) Character recognition device
JPH0664628B2 (en) Character recognition device
JPH03217995A (en) Handwritten character recognizer