JPS6330665B2

JPS6330665B2 -

Info

Publication number: JPS6330665B2
Application number: JP55187607A
Authority: JP
Inventors: Akira Inoe; Masumi Yoshida
Original assignee: Fujitsu Ltd
Current assignee: Fujitsu Ltd
Priority date: 1980-12-29
Filing date: 1980-12-29
Publication date: 1988-06-20
Also published as: JPS57111783A

Description

【発明の詳細な説明】本発明は文字分離方式に関し、特に複数の文字
が一定の形状のマスクによつても個々の文字に分
離できないように図面等に書入されている場合で
も、これらの各文字を個々の文字に分離すること
ができるようにした文字分離方式に関する。[Detailed Description of the Invention] The present invention relates to a character separation method, and in particular, even when a plurality of characters are written on a drawing etc. in such a way that they cannot be separated into individual characters even with a mask of a certain shape, these characters can be separated. This invention relates to a character separation method that allows each character to be separated into individual characters.

例えば自由に手書きされた手書き文字を認識す
る場合、まずその第一段階として文字を一字一字
分離して抽出し、それからこの抽出した文字を識
別している。このように文字を分離して抽出する
場合、第１図イに示す如きマスクＭを使用し、こ
れを同ロの如く文字列上に走査して、個々の文字
を抽出していた。この場合、第１図ロに示すよう
に、複数の文字が互に離れてしかもある大きさの
文字で記載されているような場合、これをマスク
Ｍのような一定の形状（例えば矩形）のマスクに
より各文字毎に分離することができるので、これ
らをそれぞれ互に分離して抽出することが可能で
ある。 For example, when recognizing freely handwritten characters, the first step is to separate and extract each character, and then identify the extracted characters. When separating and extracting characters in this way, a mask M as shown in FIG. In this case, as shown in FIG. Since each character can be separated using a mask, it is possible to separate and extract these characters from each other.

しかしながら第１図ハに示すように、複数の文
字が互に接近して記載されている場合、もはや第
１図イに示す如きマスクを走査する方法ではこれ
らの文字を正確に分離することが不可能であり、
したがつてこのような場合には文字を正確に認識
することが困難である。 However, when multiple characters are written close to each other as shown in Figure 1C, it is no longer possible to accurately separate these characters using the method of scanning a mask as shown in Figure 1B. It is possible and
Therefore, in such cases, it is difficult to accurately recognize characters.

したがつて本発明の目的は、このように複数の
文字が近接して記載されている場合でも各文字を
正確に分離することができる文字分離方式を提供
することを目的とするものである。そしてそのた
め本発明における文字分離方式では、入力手段か
ら入力された文字の画像情報を保持する情報保持
手段と、上記画像情報を第１の方向に走査してそ
の白黒変化点を求め、該変化点の間の領域である
第１領域を検出する第１特徴抽出手段と、上記画
像情報を、前記第１の方向と略直交する第２の方
向に走査してその白黒変化点を求め、該変化点間
の領域である第２領域を検出する第２特徴抽出手
段と、上記第１領域と第２領域との重畳しない領
域を前記第１の方向に走査したときの中間点を検
出する検出手段と、前記得られた中間点から前記
第２の方向に第１の線を引くとともに該第１の線
が文字情報に一致したときこれに応じて前記第１
の方向に第２の線を引く引線手段とを備え、前記
得られた第１と第２の線を個々の文字情報の存在
する領域を分離する分離線とすることを特徴とす
る。 Therefore, an object of the present invention is to provide a character separation method that can accurately separate each character even when a plurality of characters are written close to each other. Therefore, the character separation method according to the present invention includes an information holding means that holds image information of characters inputted from an input means, and an information holding means for holding image information of characters inputted from an input means, and scanning the image information in a first direction to obtain black and white change points. a first feature extracting means for detecting a first region that is a region between the two; a second feature extraction means for detecting a second region that is an area between points; and a detection means for detecting an intermediate point when scanning an area where the first area and the second area do not overlap in the first direction. Then, a first line is drawn in the second direction from the obtained intermediate point, and when the first line matches the character information, the first line is drawn in the second direction.
and a drawing line means for drawing a second line in the direction of , and the obtained first and second lines are used as separation lines that separate areas where individual character information exists.

以下本発明を具体的に説明するに先立ち、本発
明の原理を第２図〜９図にもとづき説明する。 Before explaining the present invention in detail below, the principle of the present invention will be explained based on FIGS. 2 to 9.

(a) まず手書き文字の記入された原稿あるいは図
面を読むことにより得られた画像データm₀の
記入されたメモリm₁，m₂を用意する。そして
メモリm₁を第２図イに示すように、ｘ方向に
走査する。このとき文字が画かれている黒点を
「１」とし、文字の画かれていない白紙領域を
「０」として画像データm₀を得る。このとき、
第２図ロに示すように変化点すなわち「０」→
「１」および「１」→「０」に変化する点を求
める。例えば第２図ロに示すように、ラインｌ
上を走査するとき、x₁では「０」→「１」に変
化し、x₂では「１」→「０」に変化し、x₃では
「０」→「１」に変化し、x₄では「１」→「０」
に変化するので、これらのx₁〜x₄はいずれも変
化点である。この場合変化点P₁（「１」→
「０」）およびP₂（「０」→「１」）の対を検出
し、メモリm₂上のこの変化点P₁〜P₂間の領域
をすべて「１」とする。(a) First, memories m ₁ and m ₂ containing image data m ₀ obtained by reading a manuscript or drawing containing handwritten characters are prepared. Then, the memory _m1 is scanned in the x direction as shown in FIG. 2A. At this time, image data _m0 is obtained by setting the black dot where the characters are drawn as "1" and the blank area where no characters are drawn as "0". At this time,
As shown in Figure 2 (b), the change point is "0" →
Find "1" and the point where "1" changes to "0". For example, as shown in Figure 2 (b), the line l
When scanning above, x ₁ changes from "0" to "1", x ₂ changes from "1" to "0", x ₃ changes from "0" to "1", x ₄ Then "1" → "0"
Therefore, all of these x ₁ to x ₄ are changing points. In this case, the change point P ₁ (“1” →
0) and P ₂ (“0”→“1”), and all areas between the change points P ₁ and _{P 2} on the memory m ₂ are set to “1”.

(b) 次にメモリm₁上の画像データm₀を、、第２
図イに示すようにｙ方向に走査して、同様に変
化点Q₁（「１」→「０」ただしｙ方向）および
Q₂（「０」→「１」）の対を検出し、これによ
り、該Q₁〜Q₂間のメモリm₂上の領域を、もし「０」ならば「１」に、もし「１」ならび「０」に反転させる。これにより第３図に示す如く、二
重線領域が「０」となり、１線領域が「１」と
なり、メモリm₂には第４図に示すデータが記
入されることになる。このとき文字の部分は
「１」が連続しているので、そのまま残る。(b) Next, the image data m ₀ on the memory m ₁ is transferred to the second
As shown in Figure A, scan in the y direction and similarly change the change point Q ₁ (“1” → “0” in the y direction) and
The pair Q ₂ (“0” → “1”) is detected, and the area on the memory m ₂ between Q ₁ and _{Q 2} is changed to “1” if “0” and “1” if “1”. ” and invert it to “0”. As a result, as shown in FIG. 3, the double line area becomes "0", the single line area becomes "1", and the data shown in FIG. 4 is written in the memory _m2 . At this time, since "1" is consecutive in the character part, it remains as is.

(c) 今度は、この第４図に示す状態のデータが記
入されているメモリm₂をｘ方向に走査し、斜
線領域内での変化点R₁（「０」→「１」）、R₂
（「１」→「０」）の対を検出する。これはその
斜線前の領域が「０」が連続しているかどうか
と、「１」が文字幅よりも多く連続しているこ
と等により識別して検出できる。(c) Next, scan the memory m ₂ in which the data in the state shown in _FIG . ₂
Detect the pair (“1” → “0”). This can be detected by checking whether the area before the diagonal line has consecutive "0"s and whether there are "1"s consecutively more than the character width.

(d) いま、第５図に示すように上記R₁の座標を
（x₁、y₁）とし、R₂の座標を（x₂、y₂）とした
とき、その中央部の中心点Ｍ（xm、ym）を求
める。このときxm＝（x₁＋x₂）／２、ym＝y₁
（y₂）である。このようにして第６図に示すよ
うにM₁〜M₄を求めることができる。そしてこ
れらの中心点M₁〜M₄を始点としてｙ方向に直
線を引き、これが文字情報点（「１」が連続し
て存在する）に接触したとき、その点から左右
両側に再び文字情報点接触するまで横線を引
く。この場合M₄からは横線が引けないことは、
図より明らかである。(d) Now, as shown in Figure 5, when the coordinates of R ₁ are (x ₁ , y ₁ ) and the coordinates of R ₂ are (x ₂ , y ₂ ), the center point M Find (xm, ym). In this case, xm = (x ₁ + x ₂ )/2, ym = y ₁
(y ₂ ). In this way, M ₁ to _{M 4} can be determined as shown in FIG. Then, a straight line is drawn in the y direction starting from these center points M ₁ to _{M 4} , and when it touches a character information point (“1” exists continuously), character information points are drawn again on both the left and right sides from that point. Draw horizontal lines until they touch. In this case, the fact that a horizontal line cannot be drawn from M ₄ is
It is clear from the figure.

(e) 次に第７図に示す如く、メモリm₂を下より
ｘ方向に走査する。そして第８図に示す如く、、
上記(d)と同様に中心点S₁〜S₄を求めてこれより
ｙ方向に直線を引き、これが文字情報点に接触
したとき同様に横線を引く。この場合S₁および
S₄からは横線が引けない。(e) Next, as shown in FIG. 7, the memory m ₂ is scanned from below in the x direction. And as shown in Figure 8,
Similarly to (d) above, find the center points S ₁ to _{S 4} and draw a straight line from these in the y direction, and when this comes into contact with a character information point, draw a horizontal line in the same way. In this case S ₁ and
You cannot draw a horizontal line from S ₄ .

(f) このとき、第８図に示すように、中心点M₁
からの垂直線が２本の水平線p₂，l₁と交るよう
なときは、水平線p₂とl₁の中間に水平線lmを引
く。そして水平線がｘ方向でオーバラツプする
ときは、そのオーバラツプ領域で垂線v₁，v₂，
v₃を引く。(f) At this time, as shown in Figure 8, the center point M ₁
If a vertical line from 2 intersects two horizontal lines p ₂ and l ₁ , draw a horizontal line lm between horizontal lines p ₂ and l ₁ . When the horizontal lines overlap in the x direction, the perpendicular lines v ₁ , v ₂ ,
v subtract ₃ .

(g) このように線を引くことにより、第９図に示
す如く、文字Ａ、Ｂ、…………Ｅを中心点M₁，
M₂，M₃，S₂，S₃、水平線lm，p₂，l₂，p₃，l₃、
垂線v₁，v₂，v₃………等により単一文字として
区別することが可能になる。(g) By drawing lines in this way, as shown in Figure 9, we can move the letters A, B, ......E to the center point M ₁ ,
M ₂ , M ₃ , S ₂ , S ₃ , horizontal line lm, p ₂ , l ₂ , p ₃ , l ₃ ,
Perpendicular lines v ₁ , v ₂ , v ₃ , etc. make it possible to distinguish them as a single character.

次に本発明の一実施例を第１０図にもとづき説
明する。 Next, one embodiment of the present invention will be described based on FIG. 10.

図中、１は入力部、２は出力メモリ、３は第１
画像メモリ、４はバツフア、５は第１特徴抽出
部、６は第１アドレス・テーブル、７は第１アド
レス発生部、８は制御部、９は第２アドレス発生
部、１０はバツフア、１１は第２特徴抽出部、１
２は第２アドレス・テーブル、１３は第２画像メ
モリ、１４はエクスクルシーブ・オア回路、１５
は第３アドレス発生部、１６，１７はバツフア、
１８は第３特徴抽出部、１９は第３アドレス・テ
ーブル、２０は境界線抽出部、２１は境界線テー
ブル、２２は第４アドレス発生部、２３，２４は
バツフア、２５は第４特徴抽出部である。 In the figure, 1 is the input section, 2 is the output memory, and 3 is the first
Image memory, 4 is a buffer, 5 is a first feature extraction section, 6 is a first address table, 7 is a first address generation section, 8 is a control section, 9 is a second address generation section, 10 is a buffer, 11 is a Second feature extraction unit, 1
2 is a second address table, 13 is a second image memory, 14 is an exclusive OR circuit, 15
is the third address generation part, 16 and 17 are buffers,
18 is a third feature extractor, 19 is a third address table, 20 is a boundary line extractor, 21 is a boundary line table, 22 is a fourth address generator, 23 and 24 are buffers, and 25 is a fourth feature extractor. It is.

入力部１は、手書き原稿等を例えば光電変換部
で変換する電気信号発生部である。出力メモリ２
は、手書き原稿等から入力された情報を、第９図
に示す如く区分けして得られる単独文字情報が出
力されるためにセツトされるメモリである。 The input unit 1 is an electrical signal generation unit that converts a handwritten manuscript or the like using, for example, a photoelectric conversion unit. Output memory 2
is a memory set for outputting single character information obtained by dividing information input from a handwritten manuscript or the like as shown in FIG.

第１画像メモリ３は、入力部１から入力された
画像データが保持されるメモリである。バツフア
４は、上記第１画像メモリ３に保持された画像デ
ータが送出保持されるものであつて、上記(a)の如
き処理を行なうための作業用のバツフア・メモリ
である。 The first image memory 3 is a memory in which image data input from the input section 1 is held. The buffer 4 is a working buffer memory for transmitting and holding the image data held in the first image memory 3, and is used to perform the processing as described in (a) above.

第１特徴抽出部５は、バツフア４に保持された
画像データから、上記(a)の如く、変化点P₁およ
びP₂の対を求め、そのP₁〜P₂の領域を「１」と
して読出すものであり、第１アドレス・テーブル
６には上記変化点P₁〜P₂の間の「１」の領域の
アドレスが保持されるテーブルである。 The first feature extraction unit 5 obtains a pair of change points P ₁ and P ₂ from the image data held in the buffer 4, as shown in (a) above, and sets the region of P ₁ to _{P 2} as “1”. The first address table 6 is a table in which the addresses of the areas of "1" between the change points _P1 and _P2 are held.

第１アドレス発生部７は、バツフア４に保持さ
れた画像データを、上記(a)に示す如く、ｘ方向に
走査するためのアドレスを発生するものである。 The first address generating section 7 generates an address for scanning the image data held in the buffer 4 in the x direction as shown in (a) above.

制御部８は、第１画像メモリ３に入力された画
像データを上記(a)〜(g)の手順にしたがつて処理
し、個々の文字領域を作成するための各種制御を
行なうものであつて、例えばバツフア４を走査す
るための第１アドレス発生部７を制御したり、第
１特徴抽出部５を制御するものである。 The control unit 8 processes the image data input to the first image memory 3 according to the steps (a) to (g) above, and performs various controls for creating individual character areas. For example, it controls the first address generation section 7 for scanning the buffer 4 or the first feature extraction section 5.

第２アドレス発生部９は、バツフア１０に保持
された画像データを上記(b)に示すようにｙ方向に
走査するためのアドレスを発生するものである。
またバツフア１０は上記第１画像メモリ３に保持
された画像データが送出され、これが保持される
メモリであつて、上記(b)の如き処理を行なうため
の作業用のバツフア・メモリである。 The second address generating section 9 generates an address for scanning the image data held in the buffer 10 in the y direction as shown in (b) above.
The buffer 10 is a memory to which the image data held in the first image memory 3 is sent and held, and is a working buffer memory for performing the processing as in (b) above.

第２特徴抽出部１１はバツフア１０に保持され
た画像データから、上記(b)の如く変化点Q₁，Q₂
の対を検出し、そのQ₁，Q₂の領域を「１」とし
て読出すものであり、第２アドレス・テーブル１
２には上記変化点Q₁〜Q₂間の「１」の領域のア
ドレスが保持されるテーブルである。 The second feature extraction unit 11 extracts the change points Q ₁ and Q ₂ from the image data held in the buffer 10 as shown in (b) above.
, and reads out the Q ₁ and Q ₂ areas as "1".
2 is a table in which addresses of areas of "1" between the change points Q ₁ and Q ₂ are held.

第２画像メモリ１３は、上記(a)、(b)の結果得ら
れた第４図に示す画像データがセツトされるメモ
リである。 The second image memory 13 is a memory in which the image data shown in FIG. 4 obtained as a result of the above (a) and (b) is set.

第３アドレス発生部１５はバツフア１６および
１７を文字の上方より順次ｘ方向に走査するため
のアドレスを発生するものである。バツフア１６
は、第２画像メモリ１３にセツトされている第４
図に示す如き画像データがセツトされるバツフ
ア・メモリであり、またバツフア１７は第１画像
メモリ３にセツトされている画像データがセツト
されるバツフア・メモリである。そしてこれらの
バツフア１６，１７は、上記(c)および(d)に示す処
理を行なつて第６図に示す如き中心点M₁〜M₄お
よびそれから発生される垂直線、水平線等を得る
ために、文字の上方方向より順次ｘ方向に走査さ
れる。 The third address generating section 15 generates an address for sequentially scanning the buffers 16 and 17 in the x direction from above the character. Batsuhua 16
is the fourth image set in the second image memory 13.
This is a buffer memory into which image data as shown in the figure is set, and the buffer 17 is a buffer memory into which the image data set in the first image memory 3 is set. These buffers 16 and 17 undergo the processing shown in (c) and (d) above to obtain center points M ₁ to M ₄ and vertical lines, horizontal lines, etc. generated therefrom as shown in FIG. Then, the character is sequentially scanned in the x direction starting from the top.

第３特徴抽出部１８は、バツフア１６，１７に
セツトされた画像データにもとづき上記(c)および
(d)の処理を行なうものである。すなわち、第４図
に示す画像データのセツトされているバツフア１
６および第１画像メモリ３から伝達された画像デ
ータのセツトされているバツフア１７は、第３ア
ドレス発生部１５から発生されたアドレス情報に
もとづき、その文字の上方よりｘ方向に順次走査
され、それらの出力データを順次第３特徴抽出部
１８に伝達する。そしてバツフア１６から出力さ
れたデータにより第４図斜線領域内の上記変曲点
R₁，R₂を検出する。そしてその中心点を求める
これをM₂とする。このようにバツフア１６をｘ
方向に順次走査することにより上記(c)に記載した
ような手法で、中心点M₁，M₃，M₄を得る。こ
のときバツフア１７からは文字情報が伝達される
ので、上記(c)に説明した例とは異なり、これによ
り文字位置を識別するものである。このようにし
て中心点M₁〜M₄を得たのちに、第３特徴抽出部
１８は、その引線回路１８−０にてこれらの中心
点M₁〜M₄よりｙ方向に直線を下方に引く。そし
てこれが文字に接触したとき（勿論この文字位置
はバツフア１７から伝達される文字情報より得
る）、今度はその接触点より左右のｘ方向に直線
を引く。これらの各直線は文字と接触するまで引
く。したがつて中心点M₄はｙ方向の垂直線のみ
が引かれることになる。 The third feature extraction unit 18 extracts the above (c) and
This process performs the process (d). That is, the buffer 1 in which the image data shown in FIG.
6 and the buffer 17 in which the image data transmitted from the first image memory 3 is set are sequentially scanned in the x direction from above the character based on the address information generated from the third address generation section 15. The output data of is sequentially transmitted to the three feature extraction sections 18. Then, based on the data output from the buffer 16, the above-mentioned inflection point within the shaded area in Fig.
Detect R ₁ and R ₂ . Then find the center point and let it be M ₂ . In this way, convert the buffer 16 to x
Center points M ₁ , M ₃ , and M ₄ are obtained by scanning sequentially in the direction as described in (c) above. At this time, character information is transmitted from the buffer 17, so unlike the example described in (c) above, character positions are identified using this information. After obtaining the center points M ₁ to _{M 4} in this way, the third feature extraction unit 18 uses its drawing line circuit 18-0 to draw a straight line downward in the y direction from these center points M ₁ to _{M 4} . Pull. When this contacts a character (of course, the character position is obtained from the character information transmitted from the buffer 17), a straight line is drawn from the contact point in the left and right x directions. Draw each of these lines until they touch the letters. Therefore, only a vertical line in the y direction is drawn at the center point _M4 .

第３アドレス・テーブル１９は、上記(c)、(d)、
(e)により得た中心点M₁〜M₄、S₁〜S₄および各直
線の文字と接触する座標および各直線の交点座標
等が記入されるテーブルである。 The third address table 19 includes the above (c), (d),
This is a table in which the center points M ₁ to _{M 4} and S ₁ to _{S 4} obtained in (e), the coordinates of each straight line in contact with the characters, the coordinates of the intersection of each straight line, etc. are entered.

境界線抽出部２０は、上記第３アドレス・テー
ブル１９から伝達されたデータにもとづき、上記
(f)および(g)の処理を行ない、第９図に示す如き各
文字間の境界を作成するものである。そして、境
界線テーブル２１は上記境界線抽出部２０により
作成された各文字間の境界位置のデータが記入さ
れるテーブルである。 Based on the data transmitted from the third address table 19, the boundary line extraction unit 20 extracts the
By performing the processing in (f) and (g), boundaries between characters as shown in FIG. 9 are created. The boundary line table 21 is a table in which data of the boundary positions between each character created by the boundary line extraction section 20 is entered.

第４アドレス発生部２２はバツフア２２および
２３を、上記(e)において説明した如く、文字の下
方より順次ｘ方向に走査するためのアドレスを発
生するものである。バツフア２３は、第２画像メ
モリ１３にセツトされている第４図に示す如き画
像データがセツトされるバツフア・メモリであ
り、またバツフア２４は第１画像メモリ３にセツ
トされている画像データがセツトされるバツフ
ア・メモリである。そしてこれらのバツフア２
３，２４は、上記(e)に示す処理が行なわれて第８
図に示す如き中心点S₁〜S₄およびそれから発生さ
れる垂直線、水平線等を得るために、文字の下方
位置より順次ｘ方向に走査される。 The fourth address generating section 22 generates addresses for sequentially scanning the buffers 22 and 23 in the x direction starting from the bottom of the character, as explained in (e) above. The buffer 23 is a buffer memory into which the image data as shown in FIG. 4 set in the second image memory 13 is set, and the buffer 24 is into which the image data set in the first image memory 3 is set. buffer memory. And these batshua 2
3 and 24 are the 8th after the process shown in (e) above is performed.
In order to obtain the center points S ₁ to S ₄ and the vertical lines, horizontal lines, etc. generated therefrom as shown in the figure, the characters are sequentially scanned in the x direction from the lower position of the character.

第４特徴抽出部２５は、バツフア２３，２４に
セツトされた画像データにもとづき上記(e)の処理
を行なうものである。すなわち第４図に示す画像
データのセツトされているバツフア２３および第
１画像メモリ３から伝達された画像データのセツ
トされているバツフア２４は、第４アドレス発生
部２２から発生されたアドレス情報にもとづき、
その文字の下方よりｘ方向に順次走査され、その
出力データを順次第４特徴抽出部１８に伝達す
る。この第４特徴抽出部１８はバツフア２３から
伝達されたデータにより、上記中心点S₁〜S₄を
得、その引線回路２５−０によりこれらの中心点
S₁〜S₄からｙ方向に直線を上方に引く。そしてこ
れが文字に接触したとき（勿論この文字位置はバ
ツフア２４から伝達される文字情報より得る）、
今度はその接触点より左右のｘ方向に直線を引
く。これらの各直線は文字と接触するまで引くの
で、結局、第８図に図示の如く、中心点S₁とS₄に
対するｙ方向の直線からはｘ方向の直線は得られ
ず、中心点S₂については直線p₂が、中心点S₃につ
いては直線p₃がそれぞれ引かれることになる。 The fourth feature extractor 25 performs the process (e) above based on the image data set in the buffers 23 and 24. That is, the buffer 23 in which the image data shown in FIG. ,
The character is sequentially scanned in the x direction from below, and the output data is sequentially transmitted to the four feature extraction sections 18. This fourth feature extraction unit 18 obtains the center points S ₁ to _{S 4} based on the data transmitted from the buffer 23, and uses the drawing line circuit 25-0 to extract these center points.
Draw a straight line upward in the y direction from S ₁ to _{S 4} . When this comes into contact with a character (of course, this character position is obtained from the character information transmitted from the buffer 24),
Next, draw a straight line in the x direction to the left and right from the contact point. Each of these straight lines is drawn until it touches _the characters, so as shown in _FIG _. A straight line p ₂ is drawn for the center point S ₃ , and a straight line p ₃ is drawn for the center point S 3 .

以下第１０図の動作について簡単に説明する。 The operation shown in FIG. 10 will be briefly explained below.

(1) まず手書き原稿等の画像情報が入力部１で電
気信号に変換されて「１」、「０」の画像データ
となり、第１画像メモリ３にセツトされる。そ
してこの画像データはバツフア４およびバツフ
ア１０に送出されこれらにもセツトされる。(1) First, image information such as a handwritten manuscript is converted into an electrical signal by the input section 1 to become image data of "1" and "0", and is set in the first image memory 3. This image data is then sent to the buffers 4 and 10 and set there as well.

(2) このようにバツフア４およびバツフア１０に
画像データがセツトされた後、制御部８は第１
アドレス発生部７に対してはバツフア１０をｘ
方向に走査するように、またバツフア１０に対
してはｙ方向に走査するようにアドレスを発生
すべく制御する。これにより第１特徴抽出部５
は上記(a)に示した変化点対P₁，P₂を検出して
この変化点対P₁〜P₂の領域を「１」となし、
この「１」としたアドレス領域を第１アドレ
ス・テーブル６にセツトする。一方第２特徴抽
出部１１は上記(b)に示した変化点Q₁，Q₂の対
を検出してこの変化点対P₁〜P₂の領域を「１」
となし、この「１」としたアドレス領域を第２
アドレス・テーブル１２にセツトする。(2) After the image data is set in the buffers 4 and 10 in this way, the control section 8
For the address generation section 7, add a buffer of 10 x
The buffer 10 is controlled to generate an address so as to scan in the y-direction. As a result, the first feature extraction unit 5
detects the pair of changing points P ₁ and P ₂ shown in (a) above and sets the area of this pair of changing points P ₁ to _{P 2} as “1”,
This address area set to "1" is set in the first address table 6. On the other hand, the second feature extraction unit 11 detects the pair of changing points Q ₁ and Q ₂ shown in (b) above, and sets the area of this pair of changing points P ₁ to P 2 _to “1”.
and set this address area as "1" to the second address area.
Set in address table 12.

(3) このようにして変化点P₁〜P₂およびQ₁〜Q₂
の間の領域を「１」にした後、エクスクルシー
ブ・オア回路１４で、第１アドレス・テーブル
６および第２アドレス・テーブル１２の「１」
の領域のエクスクルシーブ・オアをとり、これ
により第４図に示す如き画像データが得られ、
これが第２画像メモリ１３にセツトされる。(3) In this way, the change points P ₁ ~ P ₂ and Q ₁ ~ Q ₂
After setting the area between them to "1", the exclusive OR circuit 14 sets "1" in the first address table 6 and the second address table 12.
Taking the exclusive OR of the area, image data as shown in Fig. 4 is obtained.
This is set in the second image memory 13.

(4) このようにして得られた第４図に示す画像デ
ータは、第２画像メモリ１３からバツフア１６
および２３に転送され、またバツフア１７およ
びバツフア２４には第１画像メモリ３にセツト
されているオリジナルの画像データが転送され
る。(4) The image data shown in FIG. 4 obtained in this way is transferred from the second image memory 13 to the buffer 16.
and 23, and the original image data set in the first image memory 3 is transferred to the buffers 17 and 24.

(5) それから制御部８は第３アドレス発生部１５
に対し、バツフア１６および１７を文字の上方
よりｘ方向に順次走査するという通常の走査を
行なうためのアドレスを発生させる。これによ
り出力されたデータにもとづき、第３特徴抽出
部１８は、上記(c)に示した如く変化点R₁，R₂
の対を検出し、第６図に示す如く、中心点M₁
〜M₄を求め、これよりｙ方向に垂直線を引き、
これが文字に接触したとき、その点よりｘ方向
に水平に直線を引く。そしてこの水平の直線も
文字に接触するところまで引く。このとき文字
情報はバツフア１７より得ることができる。(5) Then, the control unit 8 controls the third address generation unit 15
On the other hand, an address is generated for normal scanning in which the buffers 16 and 17 are sequentially scanned from above the character in the x direction. Based on the data thus output, the third feature extraction unit 18 extracts the change points R ₁ and R ₂ as shown in (c) above.
As shown in Fig. 6, the center point M ₁
Find ~M ₄ and draw a vertical line in the y direction from this,
When this touches a character, draw a straight line horizontally in the x direction from that point. Then draw this horizontal line until it touches the letters. At this time, character information can be obtained from the buffer 17.

(6) 同時に制御部８は第４アドレス発生部２２に
対し、バツフア２３および２４を、第７図に示
すように、文字の下方よりｘ方向に順次走査す
るという下からの走査を行なうためのアドレス
を発生させる。これにより出力されたデータに
もとづき、第４特徴抽出部２５は、同様の変化
点を求めこれにより第８図に示す如き中心点S₁
〜S₄を得る。そしてこれらの中心点S₁〜S₄から
ｙ方向に直線を上方に引く。そしてこれが文字
と接触したとき今度はその接触点より左右のｘ
方向に直線を引く。この水平の直線も文字に接
触するところまで引く。そしてこのときの文字
情報はバツフア２４より得ることができる。(6) At the same time, the control unit 8 instructs the fourth address generation unit 22 to scan the buffers 23 and 24 from below in order to sequentially scan in the x direction from below the character, as shown in FIG. Generate an address. Based on the data outputted from this, the fourth feature extracting section 25 finds a similar change point and thereby obtains the center point S ₁ as shown in FIG.
~Get _S4 . Then, straight lines are drawn upward in the y direction from these center points S ₁ to S ₄ . And when this comes into contact with the character, this time the x to the left and right of that contact point
Draw a straight line in the direction. Draw this horizontal straight line until it touches the letters. The character information at this time can be obtained from the buffer 24.

(7) このようにして、第８図に示すように、中心
点M₁〜M₄、S₁〜S₄よびびそれらよりｙ方向に
引かれた直線、およびｘ方向に引かれた直線l₁
〜l₃，p₂，p₃等のデータが第３アドレス・テー
ブル１９に送出される。 ₍ ₇ ₎ In this way _, as shown in FIG. ₁
Data such as ~l ₃ , p ₂ , p ₃ etc. are sent to the third address table 19.

(8) 境界線抽出部２０は、この第３アドレス・テ
ーブル１９から送出されたデータにもとづき、
中心点M₁からのｙ方向の直線のように、水平
方向の直線p₂，l₁と交叉するものについてはそ
れらの中央に水平方向の直線lmを引き、また
直線l₂，p₂；l₂，p₃およびl₃，p₃のように互にｘ
方向にオーバラツプしている領域のあるものに
ついてはその領域のところで垂直に線v₁，v₂，
v₃を引く。そしてこのようにして第９図に示す
如く、文字Ａ〜Ｅを個々の文字領域に区分けす
る。(8) Based on the data sent from this third address table 19, the boundary line extraction unit 20
For lines that intersect the horizontal lines p ₂ and l ₁ , such as the line in the y direction from the center point M ₁ , a horizontal line lm is drawn in the center of them, and the lines l ₂ , p ₂ ; ₂ , p ₃ and l ₃ , p ₃ mutually x
For areas with overlapping directions, vertical lines v ₁ , v ₂ ,
v subtract ₃ . In this manner, the characters A to E are divided into individual character areas as shown in FIG.

(9) 上記の如く区分けされた座標情報は境界線テ
ーブル２１にセツトされ、これにもとづき第１
画像メモリにセツトされた画像データが１文字
分ずつ抽出されて出力メモリ２に送出され、こ
れより例えば文字認識装置等に送出され、その
文字の識別が行なわれることになる。(9) The coordinate information divided as above is set in the boundary line table 21, and based on this, the first
The image data set in the image memory is extracted one character at a time and sent to the output memory 2, from which it is sent to, for example, a character recognition device, where the characters are identified.

以上説明の如く、本発明によれば、複数の文字
が接近して書込された場合でも、これを単独に区
分けすることが可能になる。したがつて原稿用紙
の所定の枠内に記載されていない手書き文字でも
これを単独に分離することが可能になる。それ
故、このような手書き文字でも文字を１文字ずつ
分離して抽出できるので、例えば手書き文字に対
する文字認識等を行なう場合に大きな効果を発揮
することができる。 As described above, according to the present invention, even when a plurality of characters are written close to each other, it is possible to separate them into individual characters. Therefore, even handwritten characters that are not written within a predetermined frame on the manuscript paper can be separated individually. Therefore, since even such handwritten characters can be separated and extracted character by character, great effects can be exerted when, for example, character recognition of handwritten characters is performed.

[Brief explanation of the drawing]

第１図は文字読取用のマスクおよび、該マスク
により文字分離が可能な場合および不可能な場合
の説明図、第２図は走査状態説明図、第３図〜第
９図は本発明の動作状態説明図、第１０図は本発
明の一実施例構成図である。図中、１は入力部、２は出力メモリ、３は第１
画像メモリ、４はバツフア、５は第１特徴抽出
部、６は第１アドレス・テーブル、７は第１アド
レス発生部、８は制御部、９は第２アドレス発生
部、１０はバツフア、１１は第２特徴抽出部、１
２は第２アドレス・テーブル、１３は第２画像メ
モリ、１４はエクスクルシーブ・オア回路、１５
は第３アドレス発生部、１６，１７はバツフア、
１８は第３特徴抽出部、１９は第３アドレス・テ
ーブル、２０は境界線抽出部、２１は境界線テー
ブル、２２は第４アドレス発生部、２３，２４は
バツフア、２５は第４特徴抽出部をそれぞれ示
す。 Fig. 1 is an explanatory diagram of a mask for character reading and cases in which character separation is possible and not possible using the mask, Fig. 2 is an explanatory diagram of the scanning state, and Figs. 3 to 9 are diagrams illustrating the operation of the present invention. The state explanatory diagram, FIG. 10, is a configuration diagram of an embodiment of the present invention. In the figure, 1 is the input section, 2 is the output memory, and 3 is the first
Image memory, 4 is a buffer, 5 is a first feature extraction section, 6 is a first address table, 7 is a first address generation section, 8 is a control section, 9 is a second address generation section, 10 is a buffer, 11 is a Second feature extraction unit, 1
2 is a second address table, 13 is a second image memory, 14 is an exclusive OR circuit, 15
is the third address generation part, 16 and 17 are buffers,
18 is a third feature extractor, 19 is a third address table, 20 is a boundary line extractor, 21 is a boundary line table, 22 is a fourth address generator, 23 and 24 are buffers, and 25 is a fourth feature extractor. are shown respectively.

Claims

[Claims]

1. Information holding means for holding image information of characters inputted from an input means, and a first area that scans the image information in a first direction to obtain points of black and white change, and is an area between the points of change. a first feature extraction means for detecting a black and white change point by scanning the image information in a second direction substantially orthogonal to the first direction, and a second feature extracting means for detecting a black and white change point of the image information; a second feature extraction means for detecting, a detection means for detecting an intermediate point when scanning an area where the first area and the second area do not overlap in the first direction, and a second feature extracting means for detecting an intermediate point from the obtained intermediate point. drawing means for drawing a first line in the second direction and drawing a second line in the first direction when the first line matches character information; A character separation method characterized in that the first and second lines are used as separation lines that separate areas where individual character information exists.