JPH03210688A - Line detecting device - Google Patents

Line detecting device

Info

Publication number
JPH03210688A
JPH03210688A JP2005384A JP538490A JPH03210688A JP H03210688 A JPH03210688 A JP H03210688A JP 2005384 A JP2005384 A JP 2005384A JP 538490 A JP538490 A JP 538490A JP H03210688 A JPH03210688 A JP H03210688A
Authority
JP
Japan
Prior art keywords
area
line
row
candidate
division
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
JP2005384A
Other languages
Japanese (ja)
Inventor
Toshiyuki Itezono
射手園 敏行
Koji Ito
伊東 晃治
Yoshiyuki Yamashita
山下 義征
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Oki Electric Industry Co Ltd
Original Assignee
Oki Electric Industry Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Oki Electric Industry Co Ltd filed Critical Oki Electric Industry Co Ltd
Priority to JP2005384A priority Critical patent/JPH03210688A/en
Publication of JPH03210688A publication Critical patent/JPH03210688A/en
Pending legal-status Critical Current

Links

Landscapes

  • Character Input (AREA)

Abstract

PURPOSE:To make processing simple, correct, and high-speed, and to make a device small-scale by repeating the generation of marginal distribution, and detecting a character line based on black block position sequence obtained from the marginal distribution. CONSTITUTION:A device is constituted of a marginal distribution generating part 12, a line candidate detecting part 14, and a line detecting part 16. In this case, the starting end and the terminal end in a row direction of a line candidate area are determined by a first marginal distribution obtained by scanning the prescribed processing area of document picture data in a line direction, and the starting end and the terminal end in a line direction of the line candidate area are determined on the basis of a second marginal distribution obtained by scanning the line candidate area in the row direction, and the presence of a divided element in the line candidate area is decided. When there is the divided element, a divided candidate position in the line direction of the line candidate area is set, and the line candidate area is partitioned at the divided candidate position, and division information to show the feature of the character line in a partitioned area is extracted by third marginal distribution obtained by scanning the partitioned area in the line direction, and the position of a line area is determined according to the division information. Thus, the processing can be made simple, correct, and high-speed, and the device can be made small-scale.

Description

【発明の詳細な説明】 (産業上の利用分野) この発明は行検出装置に間する。[Detailed description of the invention] (Industrial application field) The present invention relates to a row detection device.

(従来の技術) 従来より名刺の画像処理に関する種々の研究及び検討が
なされている。
(Prior Art) Various studies and studies have been conducted regarding image processing of business cards.

例えば文献工:情報処理学会菓37回(昭和63年後期
)全国大会 6W−4p、1652〜1653に開示さ
れている従来技術では、名刺長手方向に名刺画像を走査
し行領域を検出し行領域を長手方向と直交する列方向に
走査し黒ブロックを検出し黒ブロックを長手方向及び列
方向に走査して小ブロックを検出する。そして処理対象
の名刺に間する特徴的事項例えば−文字を構成する複数
の小プロツウは正方形枠内にお?!まる、同じ属性の項
目の文字はほぼ等しい大きざを有する等に基づいて、同
一項目の文字行を構成するとみなぜる小ブロックの統合
を行ない、項目別に文字行を検出する。
For example, in the conventional technology disclosed in the 37th National Conference of the Information Processing Society of Japan (late 1986), 6W-4p, 1652-1653, the business card image is scanned in the longitudinal direction of the business card to detect the line area. is scanned in the column direction perpendicular to the longitudinal direction to detect black blocks, and the black blocks are scanned in the longitudinal direction and column direction to detect small blocks. And characteristic items on the business card to be processed, for example - Are the multiple small letters that make up the letters placed within a square frame? ! Based on the fact that the characters of items with the same attribute have approximately the same size, etc., small blocks that are considered to constitute a character line of the same item are integrated, and character lines are detected for each item.

また、文献II:電子通信学会論文誌  81/11 
 Vol、J64−D  No、11  p、997〜
10o4に開示されでいる従来技術では、名刺の2値画
像中から抽出した黒画素の連結領域の外接矩形を求め、
重なり合う外接矩形の連結鎖vtを互いにマージ(統合
)したのち外接矩形の大きざ、位H1r参照してマーク
部の連結領域を分離する。そしてマーク部を除く連結領
域を文字行方向に投影しで得た周辺分布から各文字行を
検出する。さらに姓名部を含む可能性の高い文字行から
外接矩形の位置、間隔を参照して肩書き部の連結領域を
分離し、また各文字行毎に外接矩形の長辺を一辺とする
正方形枠内に含まれる外接矩形の連結領域を一文字とし
で統合する。そして文字行方向にあいで重なり合う項目
別の文字行を姓名部との相対的値MIWl係、文字の大
きざ、文字行中の文字数等を参照し分離する。
Also, Document II: Journal of the Institute of Electronics and Communication Engineers 81/11
Vol, J64-D No, 11 p, 997~
In the prior art disclosed in 10o4, a circumscribed rectangle of a connected region of black pixels extracted from a binary image of a business card is obtained,
After the connecting chains vt of the overlapping circumscribed rectangles are merged (integrated) with each other, the connected regions of the mark portions are separated with reference to the size and position H1r of the circumscribed rectangles. Then, each character line is detected from the peripheral distribution obtained by projecting the connected region excluding the mark portion in the character line direction. Furthermore, by referring to the position and spacing of the circumscribed rectangle, separate the connected area of the title from the character line that is likely to contain the first and last name, and for each character line, divide it into a square frame whose sides are the long sides of the circumscribed rectangle. Integrate the connected areas of the included circumscribed rectangles into one character. Then, the character lines for each item that overlap in the character line direction are separated by referring to the relative value MIW1 with the first and last name, the size of the characters, the number of characters in the character line, etc.

(発明か解決しようとする課題) しかしながら上述した文献■、Hの従来技術では、−文
字を構成する小ブロック、連結鎖tfiを検出しさらに
同一項目とみなせる文字行を検出するために小ブロック
、連結領域の統合及び再分割を繰返すボトムアップ的な
処理となるため処理が複雑となり、これがため行検出を
高速に行なえる小規模な装置を実現することが困難であ
る。
(Problem to be solved by the invention) However, in the prior art of the above-mentioned documents (■) and (H), in order to detect small blocks and connecting chains tfi constituting the - character, and further detect character lines that can be regarded as the same item, small blocks, small blocks, The process is complicated because it is a bottom-up process that repeats the integration and re-division of connected regions, which makes it difficult to realize a small-scale device that can perform line detection at high speed.

この発明の目的は上述した従来の問題点を解決するため
、文字行領域を検出するため小ブロック等の統合及び分
割を繰返すのではなく周辺分布の作成を繰返すようにし
た行検出部Mを提供することにある。
SUMMARY OF THE INVENTION In order to solve the above-mentioned conventional problems, it is an object of the present invention to provide a line detection unit M that repeatedly creates a marginal distribution to detect a character line area, instead of repeating the integration and division of small blocks. It's about doing.

(課題を解決するための手段) この目的の達成を図るため、この発明の行検出装置は、 指定領域内の文書画像データを指定方向に走査して周辺
分布を作成する周辺分布作成部と、指定領域を所定の行
検出対象領域とし及び指定方向を行方向とする第一の指
示情報に従って得られた第一の周辺分布に基づいて行候
補領域の列方向の位置を決定する行候補検出部と、 指定領域を行候補領域とし及び指定方向を列方向とする
第二の指示情報に従って得られた第二の周辺分布に基づ
いて、行候補領域の行方向の位置を決定すると共に行候
補領域内の分割要素の存否を判定し、 分割要素が存在する場合、行候補領域の行方向の分割候
補値Mを設定し、指定領域を行候補領域の分割要素で区
分された区分領域とし及び指定方向を行方向とする第三
の指示情報に従って得られた第三の周辺分布から分割情
報を抽出し、分割情報に応じて行領域の位11ヲ決定し
、 分割要素か存在しない場合、行候補領域の位置を行領域
の位置として決定する行検出部とを具えて成ることを特
徴とする。
(Means for Solving the Problems) In order to achieve this object, the line detection device of the present invention includes a marginal distribution creation unit that scans document image data in a designated area in a designated direction to create a peripheral distribution; a row candidate detection unit that determines the position of the row candidate area in the column direction based on a first marginal distribution obtained according to first instruction information in which the specified area is a predetermined row detection target area and the specified direction is the row direction; and determining the position of the row candidate region in the row direction based on the second marginal distribution obtained according to the second instruction information in which the designated region is the row candidate region and the designated direction is the column direction. Determine whether there is a dividing element in the row candidate area, and if there is a dividing element, set a division candidate value M in the row direction of the row candidate area, set the designated area as a divided area divided by the dividing element of the row candidate area, and specify Extract division information from the third marginal distribution obtained according to the third instruction information whose direction is the row direction, determine the position 11 of the row area according to the division information, and if there is no division element, select a row candidate. The method is characterized by comprising a row detection unit that determines the position of the area as the position of the row area.

(作用) このような構成の行検出部M1こよれば、文書画像デー
タの行検出対象領域を行方向(文字行に沿う方向)に走
査しで得た藁−の周辺分布に基づいて、行候補領域の列
方向の始端及び終端位置を決定する。
(Function) According to the line detection unit M1 having such a configuration, the line detection unit M1 detects the line based on the peripheral distribution of straw obtained by scanning the line detection target area of document image data in the line direction (direction along the character lines). Determine the starting and ending positions of the candidate area in the column direction.

そして行候補領域を行方向と直交する列方向に走査して
得た第二の周辺分布に基づいて、行候補領域の行方向の
始端及び終端位置を決定すると共に行候補領域内の分割
要素の存否を判定する0分割要素は文字行を区分する要
素(例えば空白や罫線)であり、従って分割要素が存在
する場合、行候補領域は複数の文字行を含む。
Then, based on the second marginal distribution obtained by scanning the row candidate area in the column direction orthogonal to the row direction, the starting and ending positions of the row candidate area in the row direction are determined, and the dividing elements in the row candidate area are determined. The zero division element whose existence is determined is an element (for example, a blank space or a ruled line) that divides a character line. Therefore, if a division element exists, the line candidate area includes a plurality of character lines.

分割要素が存在しない場合には、行候補領域の位置を行
領域の位置として検出する。
If there is no dividing element, the position of the line candidate area is detected as the position of the line area.

また分割要素が存在する場合には、行候補領域の行方向
の分割候補値Mを設定する。そして行候補領域の始端、
終端及び分割候補位置で区分された区分lN域を行方向
に走査し、これにより得た第三の周辺分布から分割情報
を抽出し、この分割情報に応じて行領域の位置を決定す
る。
If a dividing element exists, a dividing candidate value M in the row direction of the row candidate area is set. And the starting end of the row candidate area,
The division IN area divided by the end and division candidate positions is scanned in the row direction, division information is extracted from the third peripheral distribution obtained thereby, and the position of the row area is determined according to this division information.

分割情報は、区分領域内の文字行の特徴(例えば文字行
数、文字行を構成する文字の大きざ、位置等)を表す情
報を含む。文字行の特徴は例えば区分領域内の黒ブロッ
クの個数、幅、位置等で表せる0区分領域内の対応する
文字行同士につきその特徴を比較し、特徴か同しである
とみなせる場合にはこれら文字行同士をひとつの文字行
とし、また特徴か異なるとみなぜる場合にはこれら文字
行をそれぞれ異なる文字行として検出する。
The division information includes information representing the characteristics of the character lines within the segmented area (for example, the number of character lines, the size and position of the characters constituting the character line, etc.). Characteristics of character lines can be expressed, for example, by the number, width, position, etc. of black blocks within a segmented area. Compare the characteristics of corresponding character lines within the 0 segmented area, and if the features can be considered to be the same, these Character lines are considered to be one character line, and if they are considered to have different characteristics, these character lines are detected as different character lines.

(実施例) 以下、図面を参照し、この発明の一実施例につき説明す
る。尚、図面はこの発明か理解できる程度に概略的に示
しであるにすぎず、従ってこの発明は図示例に限定され
るものではない。
(Example) Hereinafter, an example of the present invention will be described with reference to the drawings. It should be noted that the drawings are only schematic representations to the extent that the invention can be understood, and therefore, the invention is not limited to the illustrated examples.

第1図はこの発明の一実施例の構成の説明に供する機能
ブロック図である。
FIG. 1 is a functional block diagram for explaining the configuration of an embodiment of the present invention.

同図にも示すように、この実施例の行検出装置10は、
指定領域内の文書画像データを指定方向に走査して周辺
分布を作成する周辺分布作成部12と、指定領域を所定
の行検出対象領域とし及び指定方向を行方向とする第一
の指示情報に従って得られた第一の周辺分布に基づいて
行候補領域の列方向の位置を決定する行候補検出部14
と、指定領域を行候補領域とし及び指定方向を列方向と
する第二の指示情報に従って得られた第二の周辺分布に
基づいて、行候補領域の行方向の位置を決定すると共に
行候補領域内の分割要素の存否を判定し、分割要素が存
在する場合には行候補領域の行方向の分割候補位置を設
定し、指定領域を行候補領域の分割要素で区分された区
分領域とし及び指定方向を行方向とする第三の指示情報
に従って得られた第三の周辺分布から分割情報を抽出し
、この分割情報に応じて行領域の位置’Fr決定し、分
割要素が存在しない場合には行候補領域の位置を行領域
の位置として決定する行検出部16とを具えで成る。
As shown in the figure, the row detection device 10 of this embodiment is
A peripheral distribution creation unit 12 that scans document image data in a specified area in a specified direction to create a peripheral distribution, and according to first instruction information that sets the specified area as a predetermined line detection target area and that the specified direction is the line direction. A row candidate detection unit 14 that determines the position of the row candidate area in the column direction based on the obtained first marginal distribution.
Then, the position of the row candidate region in the row direction is determined based on the second marginal distribution obtained according to the second instruction information in which the designated region is the row candidate region and the designated direction is the column direction. Determine whether or not there is a dividing element within, and if there is a dividing element, set the dividing candidate position in the row direction of the row candidate area, set the designated area as a divided area divided by the dividing element of the row candidate area, and specify The division information is extracted from the third marginal distribution obtained according to the third instruction information whose direction is the row direction, and the position 'Fr of the row area is determined according to this division information, and if there is no division element, The line detecting section 16 determines the position of the line candidate area as the position of the line area.

行位置検出装M10は名刺、帳票等の文書画像デーク上
での行値Nを検出し次段の外部装置例えば文字切出し装
置へ出力する0文書画像データは画像入力部18により
生成されて画像データ記憶部20に格納される。
The line position detection device M10 detects the line value N on a document image database such as a business card or a form, and outputs it to the next stage external device, such as a character cutting device.The document image data is generated by the image input unit 18 and is converted into image data. It is stored in the storage unit 20.

以下、この実施例につきより詳細に説明する。This example will be explained in more detail below.

(画像入力部、画像データ記憶部) 画像入力部18は文書の行検出対象領域からの光信号を
画素分解した電気信号に変換し、この電気信号から成る
文書画像データを出力する0画像データ記憶部20は画
像入力部18からの文書画像データを走査順次に格納す
る6文書画像データは例えば文字部を黒ビット及び文字
背景部を白ビットで表現した2値画像データである。
(Image input unit, image data storage unit) The image input unit 18 converts the optical signal from the line detection target area of the document into an electrical signal that is divided into pixels, and outputs document image data consisting of this electrical signal. The unit 20 scans and stores the document image data from the image input unit 18.6 The document image data is, for example, binary image data in which the text portion is expressed with black bits and the text background portion is expressed with white bits.

第2図は文N画像データの例を示す図である。FIG. 2 is a diagram showing an example of sentence N image data.

同図に示す文書画像データ22は名刺の例である0行検
出対象領域を処理対象の名刺の広さよりもやや広くなる
ように設定し、画像データ記憶部2oのメモリに行検出
対象領域の文1画像データを記憶する。行検出対象領域
の文書より広い部分の画像デーク22を白ビットで表現
するように成している。
The document image data 22 shown in the figure is an example of a business card, in which the 0 line detection target area is set to be slightly wider than the width of the business card to be processed, and the text of the line detection target area is stored in the memory of the image data storage unit 2o. 1 image data is stored. A portion of the image disk 22 that is wider than the line detection target area of the document is expressed with white bits.

メモリ上にはx−y座標系を設定しており、この座標系
で表される画素位1の画像データの読出しを自在に行な
えるようにしている0行検出対象領域をO≦X≦xEか
つ0≦y≦YEの1囲としX軸方向を文字行に18つ方
向(行方向)とする。
An x-y coordinate system is set on the memory, and the 0-line detection target area, which allows the image data of pixel position 1 represented by this coordinate system to be read freely, is defined as O≦X≦xE. And the X-axis direction is defined as 18 character lines (line direction), with 0≦y≦YE.

(周辺分布作成部) 周辺分布作成]2は行候補検出部14からの第一の指示
情報、或は行検出部16からの第二又は第三の指示情報
を入力する。そして指示情報により指示された指定領域
及び指定方向に基づき、指定領域内の文書画像データを
指定方向に走査し、各副走査位置毎に走査線上の黒画素
数を累積し、指定禦域に関する周辺分布を作成する0周
辺分布は各副走査位置毎の累積黒画素数から成り、周辺
分布作成部12は作成した周辺分布を図示しない周辺分
布記憶部に格納する。
(Marginal distribution creation unit) Marginal distribution creation] 2 inputs the first instruction information from the row candidate detection unit 14 or the second or third instruction information from the line detection unit 16. Then, based on the specified area and specified direction specified by the instruction information, the document image data in the specified area is scanned in the specified direction, the number of black pixels on the scanning line is accumulated for each sub-scanning position, and the surrounding area related to the specified area is scanned. The zero peripheral distribution for creating the distribution is made up of the cumulative number of black pixels for each sub-scanning position, and the peripheral distribution creation section 12 stores the created peripheral distribution in a peripheral distribution storage section (not shown).

(行候補検出部) この実施例の行候補検出部14は、指定領域を文書全体
を包含する行検出対象領域としかつ指定方向を行方向と
する第一の指示情報を、周辺分布作成部12に対し出力
する。
(Line Candidate Detection Unit) The line candidate detection unit 14 of this embodiment sends first instruction information in which the specified area is a line detection target area that includes the entire document and the specified direction is the line direction to the peripheral distribution creation unit 12. Output for.

行候補検出部14は指定領域を指示する情報としてX軸
方向の走査開始位!XS  (この実施例ではXS =
O)及び走査終了位置Xεと、y軸方向の走査開始位x
Ys  (この実施例ではYS =O)及び走査終了位
置YEとを出力し、また指定方向を指示する情報として
X軸方向を出力する。
The line candidate detection unit 14 uses the information indicating the specified area as the scanning start position in the X-axis direction! XS (in this example, XS =
O), scanning end position Xε, and scanning starting position x in the y-axis direction
Ys (YS=O in this embodiment) and the scan end position YE, and also outputs the X-axis direction as information indicating the designated direction.

周辺分布作成部12はこれら第一の指示情報に基づき第
一の周辺分布を作成する。第一の周辺分布の例を藁3図
に示す。第3図の縦軸は第2図のy軸に対応する座標軸
及び横軸は副走査位11yにおける走査線上の累積黒画
素数LX (y)を示す。しX (y)は次式(7)の
ように表せる。
The marginal distribution creation unit 12 creates a first marginal distribution based on the first instruction information. An example of the first marginal distribution is shown in Figure 3. The vertical axis in FIG. 3 corresponds to the y-axis in FIG. 2, and the horizontal axis represents the cumulative number of black pixels LX (y) on the scanning line at the sub-scanning position 11y. X (y) can be expressed as in the following equation (7).

但し、○≦y≦Y6、またP (x、y)は座標(x、
y)の画素位置の画像データが白ビットであればP (
x、y)=O及び黒ビットであればP(x、y)=1で
ある。
However, ○≦y≦Y6, and P (x, y) is the coordinate (x,
If the image data at the pixel position of y) is a white bit, P (
If x, y)=O and black bit, then P(x, y)=1.

周辺分布作成部12による菓−の周辺分布の作成が終了
すると、行候補検出部14は第一の周辺分布を参照し、
所定の閾値LTHL以上の累積黒画素数が連続する領域
(黒ブロック)を行候補領域とみなして各行候補領域の
列方向の始端及び終端値Mを検出する。第3図中、原点
○側がらX座標の小ざい順に数えて第n番目の行候補領
域の列方向の始端値IIを゛符号PLT (n)で及び
終端値Mを符号PL8(n)で表した(但しn=1.2
、−)。
When the marginal distribution creation unit 12 finishes creating the marginal distribution of sweets, the row candidate detection unit 14 refers to the first marginal distribution,
An area (black block) in which the cumulative number of black pixels equal to or greater than a predetermined threshold LTHL is continuous is regarded as a row candidate area, and the start and end values M in the column direction of each row candidate area are detected. In Fig. 3, the starting end value II in the column direction of the n-th row candidate area counting from the origin ○ side in ascending order of the X coordinate is indicated by the symbol PLT (n), and the terminal value M is indicated by the symbol PL8 (n). (however, n=1.2
,-).

藁4図に、行候補検出部14により検出した行候補領域
を符号K (n)を付した点線の矩形で示す。同図にも
示すように、第一の周辺分布から検出した行候補領域K
 (n)の行方向の始端及び終点位Nを仮に座標X、及
びXεと設定する。
In Figure 4, the row candidate area detected by the row candidate detection unit 14 is shown by a dotted rectangle with the symbol K (n). As shown in the figure, the row candidate area K detected from the first marginal distribution
The starting and ending points N in the row direction of (n) are temporarily set as coordinates X and Xε.

(行検出部) この実施例の行検出部16は分割候補位置決定部22、
分割判定部24及び行位置決定部26から成る。
(Line Detection Unit) The line detection unit 16 of this embodiment includes a division candidate position determination unit 22,
It consists of a division determining section 24 and a line position determining section 26.

本分割候補位置決定部 行候補検出部14による行候補領域に(n)の検出が終
了すると、分割候補位置決定部22は第二の周辺分布を
得るべく、指定端t1iを指示する第二の指示情報とし
て行候補領域K (n)のX軸方向の位WXs、Xt及
びy軸方向の位![PLT(n)、PLB (n)を出
力し、また指定方向を指示する第二の指示情報としてy
軸方向を出力する。
Main division candidate position determination unit When the row candidate detection unit 14 completes the detection of (n) in the row candidate area, the division candidate position determination unit 22 uses a second As instruction information, the position of the row candidate area K (n) in the X-axis direction WXs, Xt, and the position in the y-axis direction! [Outputs PLT(n), PLB(n), and also outputs y as second instruction information that instructs the specified direction.
Outputs the axial direction.

周辺分布作成部12はこれら第二の指示情報に基づいて
行候補領域K (n)内の文書画像データをy軸方向に
走査し第二の周辺分布を作成する。
The peripheral distribution creation unit 12 scans the document image data in the row candidate area K (n) in the y-axis direction based on the second instruction information to create a second peripheral distribution.

第5図及び第6図に、行候補911域K (n)内の文
書画像データの例及びこの領域K (n)の第二の周辺
分布を示す、第6図の横軸は第5図のX軸に対応する座
標軸及び縦軸は副走査位置x(こおける走査線上の累積
黒画素数LKY(x)を示す。
5 and 6 show an example of document image data in the line candidate area K (n) and the second peripheral distribution of this area K (n). The horizontal axis in FIG. The coordinate axis corresponding to the X axis and the vertical axis indicate the cumulative number of black pixels LKY(x) on the scanning line at the sub-scanning position x.

LK”l’ (X)は次式(2)のように表せる。LK"l' (X) can be expressed as in the following equation (2).

但し、0≦X≦×6である。However, 0≦X≦×6.

周辺分布作成部12による第二の周辺分布の作成が終了
すると、分割候補位置決定部22は第二の周辺分布を参
照し、所定の閾値XTHL以上の累積黒画素数か連続す
る領域(黒ブロック)を検出し、各黒プロ・ンクの行方
向の始端及び終端位置を検出する。第6図中、原点○側
からX座標の小さい順に数えて寛m番目の黒ブロックの
行方向の始端及び終端値aを符号BXL(m)及びBX
R(m)で表した(但しm:=1.2、−)。
When the creation of the second peripheral distribution by the peripheral distribution creation unit 12 is completed, the division candidate position determination unit 22 refers to the second peripheral distribution and divides the continuous area (black block ) is detected, and the starting and ending positions of each black line in the row direction are detected. In Fig. 6, the starting and ending values a in the row direction of the m-th black block counting from the origin ○ side in descending order of the X coordinate are denoted by BXL (m) and BX.
It was expressed as R(m) (where m:=1.2, -).

そして行候補検出部14は行候補領域K (n)の黒ブ
ロックの行方向の最小始端位置(第5図の例ではBXL
(1))−を行候補領域K (n)の行方向の始端値1
1KXL (n)としで決定し及び行方向の最大終端値
ml(第5図の例ではBXR(5))を行候補l!域K
 (n)の行方向の終端値11KXR(n)として決定
する。
Then, the row candidate detection unit 14 detects the minimum starting end position (BXL in the example of FIG. 5) of the black block in the row candidate area K (n) in the row direction.
(1)) - is the starting end value 1 in the row direction of the row candidate area K (n)
1KXL (n) and determine the maximum terminal value ml in the row direction (BXR(5) in the example of FIG. 5) as a row candidate l! Area K
The terminal value in the row direction of (n) is determined as 11KXR(n).

これと共に分割要素の存否判定のために行候補領域K 
(n)内においてlII接する黒ブロツク間のWI域(
以下、白ブロックと称する)のX軸方向の幅を算出する
。第6図中、蔦m番目の黒ブロックの行方向の終端位置
に#接する白ブロックの幅を符号BBW(i)を付しで
示す(但しi=7.2、−・)、白ブロツク幅BBW(
i)は次式(3)により表せる。
Along with this, the row candidate area K is used to determine the presence or absence of a dividing element.
In (n), the WI region (
The width of the white block (hereinafter referred to as a white block) in the X-axis direction is calculated. In Fig. 6, the width of the white block that is # in contact with the end position of the m-th black block in the row direction is indicated by the symbol BBW(i) (however, i = 7.2, -.), white block width. BBW(
i) can be expressed by the following equation (3).

BBW(i)=BXL (m+1)−BXR(m)+1
       ・・・−(3) 但し、行候補領域K (n)内で検出した黒ブロックの
総個数をM個とすればmは1≦m≦M−1なる自然数で
ある。
BBW(i)=BXL(m+1)-BXR(m)+1
...-(3) However, if the total number of black blocks detected in the row candidate area K (n) is M, then m is a natural number satisfying 1≦m≦M-1.

そして分割候補位置決定部22はひとつの行候補領域K
 (n)内における全部の白ブロツク幅に間しその平均
値幅ABBWと最大値幅MXBBWとを検出する。
Then, the division candidate position determination unit 22 selects one row candidate area K.
The average value width ABBW and maximum value width MXBBW are detected for all white block widths in (n).

平均値幅ABBWは、ひとつの行候補領域K(n)内に
おける白ブロックの総個数をl個とすれば次式(4)に
より表せる。
The average value width ABBW can be expressed by the following equation (4), assuming that the total number of white blocks in one row candidate area K(n) is l.

工 AB8W=  (1/I)  ・Σ  BBW(i)i
=1 ・・・・・・ (4) そして次式(5)を満足する最大幅MX88Wを有する
白ブロックを分割要素とみなし、この分割要素の行方向
の始端位?IIMXL及び終端位置MX日をそれぞれ分
割候補位置DPVI及びDVP2としで設定すると共に
分割要素か行候補領域K (n)内に存在することを表
す情報Pを出力する。第5図IL 示す例ではDVP 
1 =BXR(1)及びDVP2=8XL (2)であ
る、尚、この実施例では、最大幅MXBBWの白ブロッ
クのみを分割要素とみなし、行候補領域K (n) 1
Fr2つの領域に区分することを考える。
Engineering AB8W= (1/I) ・Σ BBW(i)i
=1 ...... (4) Then, a white block having a maximum width MX88W that satisfies the following formula (5) is regarded as a dividing element, and the starting position of this dividing element in the row direction? IIMXL and end position MX are set as division candidate positions DPVI and DVP2, respectively, and information P indicating that the division element exists within the row candidate area K (n) is output. Figure 5 IL In the example shown, DVP
1 = BXR (1) and DVP2 = 8XL (2). In this embodiment, only the white block with the maximum width MXBBW is regarded as a dividing element, and the row candidate area K (n) 1
Consider dividing Fr into two regions.

また次式(5)!満足する最大幅MX 8 BWの白ブ
ロックが存在しない場合、行候補領域K(n)内には分
割要素が存在しないとみなし、行候補領域K (n)の
行方向の決定された位置KXL (n)、KXRCn>
と列方向の決定された位?IIPLT (n)、PLB
 (n)とを出力スルと共に分割要素が行候補領域K 
(n)内に存在しないことを表す情報Qを出力する。
Also, the following formula (5)! If there is no white block with a satisfying maximum width MX 8 BW, it is assumed that no dividing element exists within the row candidate area K(n), and the determined position KXL ( n), KXRCn>
And the determined position of the column direction? IIPLT (n), PLB
(n) and the division element is the row candidate area K.
Outputs information Q indicating that it does not exist in (n).

MXB8W>ABBW*α1   ・・・・−(5)但
し、α1は任意好適に設定される所定値であり例えばα
1:3とする。
MXB8W>ABBW*α1 ・・・・−(5) However, α1 is a predetermined value that is arbitrarily set, for example, α
The ratio is 1:3.

本分割判定部 分割判定部24は行候補領域K (n)に分割要素か存
在するか否かの情報P又はQを分割候補位置決定部22
から入力し、この情報P又はQに応じて分割判定処理を
行なう。
The main division determination partial division determination unit 24 sends information P or Q as to whether or not a division element exists in the row candidate area K (n) to the division candidate position determining unit 22.
The division determination process is performed according to this information P or Q.

分割判定部24は行候補領域K (n)に分割要素か存
在しない情報Qを入力した場合には、分割判定結果とし
て行候補領域K (n)を分割しないことを表す情報q
を出力する。
When the division determination unit 24 inputs information Q indicating that a division element does not exist in the row candidate area K (n), the division determination unit 24 inputs information q indicating that the row candidate area K (n) is not divided as a division determination result.
Output.

また分割判定部24は分割要素が存在する情報Pと共に
分割候補位11fDVP1、DVP2を入力した場合に
は、行候補領域K (n)の分割要素で区分された区分
領域を第三の指示情報の指定領域として行候補頃域K 
(n)の列方向の位1fPLT(n) 、PLB (n
)と行方向の位置KXL(n) 、KXR(n)と分割
候補位11DVPI、DVP2とを出力し、ざらに第三
の指示情報の指定方向として列方向を出力する。
Further, when the division determination unit 24 inputs the division candidate positions 11fDVP1 and DVP2 together with the information P that a division element exists, the division determination unit 24 selects the division area divided by the division element of the row candidate area K (n) according to the third instruction information. Line candidate area K as specified area
(n) column direction 1fPLT(n), PLB(n
), the row direction positions KXL(n), KXR(n), and the division candidate positions 11DVPI and DVP2, and roughly output the column direction as the designated direction of the third instruction information.

周辺分布作成部12はこれら第三の指示情報に基づいて
行候補領tlK(n)の行方向の決定されたから始端位
11KXL (n)から分割候補位置DVP1まての間
の領域R1及び分割候補位1DVP2から行方向の決定
された終端位1fKXR(n)までの間の領域R2を、
それぞれ行方向に走査し、各副走査位11y毎に得た走
査線上の累積黒画素数からなる第三の周辺分布LKXL
 (V)及びLKXR(y)!作成する。これら第三の
周辺分布は次式(6)及び(7)で表せる。
The peripheral distribution creation unit 12 generates a region R1 and division candidates between the starting point 11KXL (n) and the division candidate position DVP1 after the row direction of the row candidate region tlK(n) is determined based on the third instruction information. The area R2 from position 1DVP2 to the determined terminal position 1fKXR(n) in the row direction is
A third peripheral distribution LKXL consisting of the cumulative number of black pixels on the scanning line obtained at each sub-scanning position 11y by scanning in the row direction.
(V) and LKXR(y)! create. These third marginal distributions can be expressed by the following equations (6) and (7).

但し、(6)及び(7)式においでPLT (n)≦y
≦PLB(n)である。
However, in equations (6) and (7), PLT (n)≦y
≦PLB(n).

第7図及び藁8図に、第三の周辺分布LKXL(y)及
びしKXR(y)の例を示す。
FIGS. 7 and 8 show examples of the third marginal distributions LKXL(y) and KXR(y).

周辺分布作成部12による第三の周辺分布の作成が終了
すると、分割判定部24は区分領域R1に関する第三の
周辺分布89照し、閾値YLTHL以上の累積黒画素数
LKXL CV)が連続する領域(黒ブロック)を検出
すると共にこれら黒フロックの列方向の始端及び終端値
Mを検出する。
When the creation of the third peripheral distribution by the peripheral distribution creation unit 12 is completed, the division determination unit 24 checks the third peripheral distribution 89 regarding the segmented region R1, and determines the area in which the cumulative number of black pixels LKXLCV) equal to or greater than the threshold value YLTHL is continuous. (black blocks), and also detect the starting and ending values M of these black blocks in the column direction.

ここで区分領域R1の原点側からy座標の小ざい順に数
えて第1番目の黒プロ・ツクの列方向の始端及び終端値
m+それぞれBLT (j)及びBLB(j)と表す。
Here, the starting and ending values m+ in the column direction of the first black block counted in ascending order of the y coordinate from the origin side of the segmented area R1 are expressed as BLT (j) and BLB (j), respectively.

同様に分割判定部24は区分禦域R2に間する第三の周
辺分布を参照し閾値YRTHL以上の累積黒画素数LK
X日(y)が連続する領域(黒ブロック)を検出すると
共にこれら黒ブロックの列方向の始端及び終端値11を
検出する。区分領域R2の原点側からy座標の小ざい順
に数えで藁に番目の黒ブロックの列方向の始端及び終端
位IIヲそれぞれBRT (k)及びBRB (k)と
表す。
Similarly, the division determination unit 24 refers to the third peripheral distribution between the segmented areas R2 and calculates the cumulative number of black pixels LK equal to or greater than the threshold value YRTHL.
Areas (black blocks) where X days (y) are consecutive are detected, and the starting and ending values 11 in the column direction of these black blocks are detected. The starting and ending positions II in the column direction of the first black block counted from the origin side of the segmented area R2 in ascending order of the y-coordinate are expressed as BRT (k) and BRB (k), respectively.

この結果、区分領域R1及びR2に関する黒ブロック位
11系列BLT (j)、BLB (j)及びBRT 
(k) 、BRB (k)8得る。
As a result, 11 black block sequences BLT (j), BLB (j) and BRT regarding segmented regions R1 and R2 are obtained.
(k), BRB (k)8 is obtained.

分割判定部24は、行候補領域K (n)内の隣接する
区分領域臼1及びR2に着目し、これら領域に間して黒
ブロツク位置系列を利用して分割情報(例えば黒ブロッ
クの幅や中心位N)を作成し下記■〜■の判定基準に従
って分割判定を行なう。判定基準■〜■のいずれがひと
つを満足する場合1こは着目区分領域内円1、R2の文
字行は互いに文字行の特徴が異なるとみなす情報pを出
力し、また判定基準■〜■のすべてを満足しない場合に
は着目区分領域R1、R2の文字行は互いに文字行の特
徴が同しであるとみなすことを表す情報qを出力する。
The division determination unit 24 focuses on the adjacent segmented regions 1 and R2 in the row candidate region K (n), and uses the black block position series to determine division information (for example, the width of the black block, etc.) between these regions. A center position N) is created and a division judgment is made according to the judgment criteria ① to ② below. If one of the judgment criteria ■~■ is satisfied, information p is output that indicates that the character lines of circle 1 and R2 in the focused segment area have different character lines from each other, and the judgment criteria ■~■ If all of the conditions are not satisfied, information q indicating that the character lines in the target segmented regions R1 and R2 are considered to have the same characteristics is output.

■ 1目した2つの区分領域内の黒ブロツク総個数が等
しくない。
■ The total number of black blocks in the first two segmented areas is not equal.

■二着目した2つの区分領域内の黒ブロツク総個数が等
しい場合、両領域の原点側からy座標の小ざい順に数え
た番号が同し黒ブロックの列方向における幅の差が所定
値以上であり、このような黒ブロックか1組以上存在す
る。
■If the total number of black blocks in the two divided areas of interest is the same, the numbers counted from the origin side of both areas in ascending order of the y coordinate are the same, and the difference in width in the column direction of the black blocks is greater than or equal to a predetermined value. Yes, there are at least one set of black blocks like this.

■:着目した2つの区分領域内の黒ブロツク総個数が等
しい場合、両領域の原点側からy座標の小さい順に数え
た番号が同し黒ブロックの列方向における中心位置が列
方向において所定範囲以上すれでおり、このような黒プ
ロ・ツクか1組以上存在する。
■: When the total number of black blocks in the two divided areas of interest is the same, the numbers counted in descending order of the y-coordinate from the origin side of both areas are the same, and the center position of the black blocks in the column direction is greater than or equal to a predetermined range in the column direction. By the way, there are at least one pair of black professional Tsuku like this.

以下、第9図を参照し分割判定処理の流れにつき一例を
挙げで説明する。第9図は分割判定処理の流れを示す図
である。尚、以下の説明では区分領域R1及びR2内の
黒ブロツク総個数を9個及びに個とする。
Hereinafter, with reference to FIG. 9, the flow of the division determination process will be explained by giving an example. FIG. 9 is a diagram showing the flow of division determination processing. In the following description, the total number of black blocks in the divided regions R1 and R2 is assumed to be nine and two.

周辺分布作成部12か第三の周辺分布の作成を終了する
と、分割判定部24は処理を開始しく開始)、ます着目
区分領t!1iR1及び日2の黒ブロツク総個数J及び
Kが等しいか否かを判定し、判定基準■のチエツクを行
なう(Sl)。
When the marginal distribution creation unit 12 finishes creating the third marginal distribution, the division determination unit 24 immediately starts processing), and then selects the focused segment area t! It is determined whether the total numbers J and K of black blocks in 1iR1 and day 2 are equal, and the criterion (2) is checked (Sl).

総個数J、Kが等しくなければ着目区分領域R1内の文
字行と、日2内の文字行とは特徴が異なる文字行である
とみなし行候補領域K (n)を着目区分領域日1、R
2に分割することを表す情報pを出力しくS 13) 
、そののち処理を終了する(S 13) 。
If the total numbers J and K are not equal, it is assumed that the character lines in the target segmented region R1 and the character lines in Day 2 have different characteristics, and the line candidate region K (n) is divided into the focused segmented region Day 1, R
Please output the information p indicating that it is divided into two.S13)
, and then ends the process (S13).

総個数J、にが等しい場合には、制御変数βに初期値=
0を設定したのち、まず33〜S6で判定基準■のチエ
ツクを行なう。総個数J、Kが等しいので、以後の処理
ではKをJで表す。
If the total number J, is equal, the control variable β has an initial value =
After setting the value to 0, first, in steps 33 to S6, the criterion (2) is checked. Since the total numbers J and K are equal, K will be represented by J in the subsequent processing.

83〜S6では、制御変数βに1を加算しくR3)、次
いて着目区分領域日1及び日2の第1番目の黒ブロック
の位置BLT l)、BLB(β)及びBRT(β)、
BRB(β)から、領域81及び日2のMβ番目の列方
向tこおける黒ブロツク幅BLW及びBRWを求める(
R4) 、これら黒ブロツク幅は次式(8)及び(9)
で表せる。
In 83 to S6, 1 is added to the control variable β (R3), and then the position of the first black block of the focused segmentation area day 1 and day 2 BLT l), BLB (β) and BRT (β),
From BRB(β), find the black block widths BLW and BRW in the Mβ-th column direction t in area 81 and day 2 (
R4), these black block widths are expressed by the following equations (8) and (9).
It can be expressed as

BLW=BLB 1)−BLT (42)+ 1−(8
) 8RW=8R8(β)−BRT(A)+1−(9) 次にこれら黒ブロツク幅BLW、BRWの差を所定値β
、と比較する(R5)、尚、この例ではR5で黒ブロツ
ク幅の差IBLW−BRWI7i!絶対値的な値β、と
比較するようにしたが、これにかえで差IBLW−8R
WIIFr例えば次式(10)で表されるような黒ブロ
ツク幅に依存した相対的な値βI *Min (BLW
、BRW)と比較しでもよい。
BLW=BLB 1)-BLT (42)+1-(8
) 8RW=8R8(β)-BRT(A)+1-(9) Next, the difference between these black block widths BLW and BRW is set to a predetermined value β.
, (R5), and in this example, the difference in black block width at R5 is IBLW-BRWI7i! I tried to compare it with the absolute value β, but the difference IBLW-8R
WIIFr is a relative value βI *Min (BLW
, BRW).

BLW−8RWl>8+  *Ml n (8LW、B
RW)              ・・・・・・(1
0)但し、Min (8LW、BRW)はBLW、SR
Wのうち値の小さいほうを表す。
BLW-8RWl>8+ *Ml n (8LW, B
RW) ・・・・・・(1
0) However, Min (8LW, BRW) is BLW, SR
Represents the smaller value of W.

黒ブロツク幅の差か所定値β、を越えるときS13の処
理を行ない、またこれらの差が所定値81以下であると
き番号βは総個数Jと等しいが否か判定する(B6)。
When the difference in black block width exceeds the predetermined value β, the process of S13 is performed, and when the difference is less than the predetermined value 81, it is determined whether the number β is equal to the total number J (B6).

番号βか総個数Jと等しくな1すれば他の組の黒プロ・
ツクに間し判定基準■のチエツクを行なうべく83の処
理に戻り、番号lが総個数Jと等しい場合には次に判定
基準■のチエツクのために87〜S11の処理を行なう
If the number β is 1 equal to the total number J, then the other group's black professional
After that, the process returns to step 83 to check the criterion (2), and if the number l is equal to the total number J, then the processes 87 to S11 are performed to check the criterion (2).

87〜S11では、まず制御変数1に初期値=0を設定
しくS7)、次に制御変数βに1を加算する(B8)。
In steps 87 to S11, the control variable 1 is first set to an initial value of 0 (S7), and then 1 is added to the control variable β (B8).

次いで着目区分wI域日1及びB2の第1番目の黒ブロ
ックの位11BLT(β)、81B(I2)及びBRT
 (β)、′BRB(Il)から、領域R1及びR2の
第β番目の列方向にあける黒ブロツク中心位11BLC
及びBRCt求める(B9)、c:れら中心位置は次式
(11)及び(12)で表せる。
Next, the first black block digits 11BLT (β), 81B (I2) and BRT of the attention division wI area day 1 and B2
(β), 'BRB (Il), black block center position 11BLC opened in the direction of the β-th column of regions R1 and R2
and BRCt (B9), c: These center positions can be expressed by the following equations (11) and (12).

BLG= (BLB (β)+BLT l))/2−(
11) BRW=  (BRB (β) +BRT (,1り)
 /2・・・(12) 次にこれら中心位置8LC,BRCの差を所定値B2と
比較する(S 10) 、尚、この例では810で黒ブ
ロツク幅の差I BLC−BFICli結対値的な値β
2と比較するようにしたが、これにかえで差+ 8LC
−BRClj&例えば次式(13)で表されるような中
心位置に依存した相対的な値82 *Mi n (BL
C,BRC)と比較してもよい。
BLG= (BLB (β)+BLT l))/2−(
11) BRW= (BRB (β) +BRT (,1ri)
/2...(12) Next, the difference between these center positions 8LC and BRC is compared with a predetermined value B2 (S10). In this example, the difference in black block width I is 810. value β
I tried to compare it with 2, but there is a difference + 8LC
−BRClj & For example, a relative value depending on the center position as expressed by the following equation (13) 82 *Min (BL
C, BRC).

I BLC−8日CI>B2 *Mi n (BLCl
BRC)             −−−−−−(1
3)中心位置の差が所定値β2’Fr越えるときS13
の処理を行ない、またこれらの差が所定値B2以下であ
るとき番号βは総個数Jと等しいか否か判定する(31
1)。
I BLC-8th CI>B2 *Min (BLCl
BRC) --------(1
3) When the difference in center position exceeds the predetermined value β2'Fr, S13
and when the difference between them is less than a predetermined value B2, it is determined whether the number β is equal to the total number J (31
1).

番号βが総個数Jと等しくなければ他の組の黒ブロック
に関し判定基準■のチエツクを行なうべくB8の処理に
戻り、番号βが総個数Jと等しい場合には処理を終了す
る(終了)。
If the number .beta. is not equal to the total number J, the process returns to B8 to check the criterion (2) for other sets of black blocks, and if the number .beta. is equal to the total number J, the process is terminated (end).

車行位置決定部 行位置決定部26は分割判定部24からの情報p又はq
に応じて行値Wlヲ決定する。
Vehicle position determination section The row position determination section 26 receives information p or q from the division determination section 24.
The row value Wl is determined accordingly.

ここで着目した2つの区分領域R1及び日2において、
区分Il域R1の第三の周辺分布から検出されy座標の
小さい順に数えて第3番目の黒ブロックを符号81(j
)で表せば、黒ブロック81(j)の列方向の始端及び
終端位置はBLT (j)及びBLB(j)となり、ま
た黒プロ・ツク81 (j)の行方向の始端及び終端位
置は区分領域R1の始端値@KXL (n)及び終端値
@DVPIとなる。尚、〕=1.2、−Jである。
In the two divided regions R1 and Day 2 that we focused on here,
The third black block detected from the third peripheral distribution of section Il area R1 and counted in descending order of y coordinate is designated by code 81(j
), the starting and ending positions of the black block 81 (j) in the column direction are BLT (j) and BLB (j), and the starting and ending positions of the black block 81 (j) in the row direction are classified as The starting value @KXL (n) and the ending value @DVPI of the region R1 are obtained. Note that ]=1.2, -J.

同様に、区分領域R2の第三の周辺分布から検出されy
座標の小ざい順に数えて第に番目の黒ブロックを符号8
2(j)で表せば、黒ブロックB2(j)の列方向の始
端及び終端位置はBRT(j)及びBRB(])となり
、また黒プロ・ツク82(j)の行方向の始端及び終端
位置は区分領域R2の始端値11DVP2及び終端位置
KXR(n)となる、尚、k=1.2、・・・、Kであ
る。
Similarly, y detected from the third peripheral distribution of segmented region R2
Counting the coordinates from smallest to smallest, the first black block is coded 8.
2(j), the starting and ending positions of the black block B2(j) in the column direction are BRT(j) and BRB(]), and the starting and ending positions of the black block B2(j) in the row direction are BRT(j) and BRB(]). The positions are the starting edge value 11DVP2 and the ending edge position KXR(n) of the segmented area R2, where k=1.2, . . . , K.

情報p!大入力た場合には、区分領域R1及びR2内そ
れぞれの黒ブロックの総個数は等しく従ってJ=にであ
るのでj=にとして、行位置決定部26はIEj番目の
黒ブロック81 (j)、82(j)同士をひとつの文
字行Zp(j)とする、この場合、文字行領域Zp (
j)の行方向の始端位置及び終端位置をこれら黒ブロッ
クB1(j)の行方向の始端値WKXL (n)及び黒
ブロック82(j)の行方向の終端値11KXR(n)
とし、かつ文字行領域Zp (j)の列方向の始端位置
を黒ブロック81 (j)、82 (j)の列方向の始
端値11BLT (j)、BRT (j)のうち最小の
始端位置とし及びこのW4域Zp(コ)の列方向の終端
値i!If!黒ブロックB1(j)、82 (j)の列
方向の終端位置BLB(j)、8RB (j)のうち最
大の終端位置とする。
Information p! In the case of a large input, the total number of black blocks in each of the segmented regions R1 and R2 is equal, so J=, so with j=, the row position determination unit 26 selects the IEj-th black block 81 (j), 82(j) as one character line Zp(j). In this case, the character line area Zp (
The starting and ending positions of black block B1(j) in the row direction are the starting end value WKXL (n) of the black block B1(j) and the ending value 11KXR(n) of the black block 82(j) in the row direction.
And let the starting end position of the character line area Zp (j) in the column direction be the minimum starting end position of the starting end values 11BLT (j) and BRT (j) of the black blocks 81 (j) and 82 (j) in the column direction. and the terminal value i in the column direction of this W4 area Zp(k)! If! This is the maximum end position among the end positions BLB(j) and 8RB(j) of the black blocks B1(j) and 82(j) in the column direction.

また情報qを入力した場合には、一方の区分領域R1の
Mj番目の黒ブロック81(j)をひとつの文字行91
t*Za1(j)とし、この領域Zql(j)の行方向
の始端及び終端値Mを黒プロ・ンク81 (j)の行方
向の始端位置KXL(n)及び終端値@DVP1とし、
かつこの領域ZQ1(j)の列方向の始端及び終端位置
を黒プロ・ンクBl(j)の列方向の始端位置BLT(
j)及び終端値WBLB(j)とする、同様に、他方の
区分領域臼2の第に番目の黒ブロックB2 (k)をひ
とつの文字行領域ZQ2 (k)とし、この領域Zq2
 (k)の行方向の始端及び終端値Wl&黒ブロック8
2 (k)の行方向の始端値1fDVP2及び終端位置
KXR(n)とし、かつこの1ItfiZc+2(k)
の列方向の始端及び終端値Nを黒ブロック82 (k)
の列方向の始端値1fBRT (k)及び終端値11B
RB (k)とする。尚、情報qを入力した場合にはJ
=K又はJ≠にである。
In addition, when information q is input, the Mj-th black block 81(j) in one segmented area R1 is divided into one character line 91.
Let t*Za1(j) be the starting end position and end value M in the row direction of this area Zql(j) as the starting end position KXL(n) and end value @DVP1 in the row direction of the black prong 81 (j),
In addition, the starting and ending positions of this area ZQ1(j) in the column direction are the starting end positions of the black block Bl(j) in the column direction BLT(
j) and the terminal value WBLB(j).Similarly, the second black block B2(k) of the other segmented area 2 is defined as one character line area ZQ2(k), and this area Zq2
Starting and ending values Wl in the row direction of (k) & black block 8
2 (k) in the row direction starting end value 1fDVP2 and ending position KXR(n), and this 1ItfiZc+2(k)
The starting and ending values N in the column direction of the black block 82 (k)
The starting end value 1fBRT (k) and the ending value 11B in the column direction of
Let it be RB (k). In addition, if you enter information q, enter J
=K or J≠.

蔦]0図〜第13図に、この実施例による文字行領域の
検出結果例を示す。第10図は藁2図の画像データ22
に間する文字行領域の検出結果、篤11図は判定基準■
を満足しない行候補領域から検出された文字行領域ZQ
 1 (j)、Zq2(k)の例、篤12図は判定基準
■を満足しない行候補領域から検出された文字行領域Z
QI(j)、ZQ2 (k)の例、及び第13図には判
定基準■を満足しない行候補領域から検出された文字行
領域Zql (j)、ZQ2 (k)の例を示す図であ
る。
Figures 0 to 13 show examples of detection results of character line areas according to this embodiment. Figure 10 is image data 22 of Figure 2
The detection result of the character line area between
Character line area ZQ detected from the line candidate area that does not satisfy
1 (j) and Zq2(k), Figure 12 shows the character line area Z detected from the line candidate area that does not satisfy the criterion ■.
Examples of QI(j) and ZQ2(k), and FIG. 13 are diagrams showing examples of character line areas Zql(j) and ZQ2(k) detected from line candidate areas that do not satisfy the criterion ■. .

これら図において、検出された文字行領域を二点鎖線の
矩形枠で示し、情報0%入力した場合の文字行領域Zp
 (j)の検出結果例を符号Zp(])を付して表し、
同様に情報qを入力した場合の文字行領域ZQ 1 (
j) 、ZQ2 (k)の検出結果例に符号ZQ 1 
(j)、ZQ2 (k)を付しで示した。
In these figures, the detected character line area is indicated by a rectangular frame with two-dot chain lines, and the character line area Zp when 0% information is input.
An example of the detection result of (j) is expressed with the symbol Zp (]),
Similarly, when information q is input, the character line area ZQ 1 (
j), ZQ2 (k) has the code ZQ 1 in the detection result example.
(j) and ZQ2 (k) are shown as appendixes.

第10図からも理解できるように、第2図の画像データ
22においでは氏名記載領域に対応する行候補領域を除
く他の行候補領域は全て上述した判定基準■により分割
される。また氏名記載領域に対応する行候補領域は、上
述の(5)式を満足し分割候補値11DVP1、DVP
2が設定されるが、上述の判定基準■〜■のいずれにも
該当しないため分割されず、区分領tlJR1、R2の
対応する黒ブロック81 (j)及び82(j)の双方
を含む文字行領域Zp (j)が検出されることとなる
As can be understood from FIG. 10, in the image data 22 of FIG. 2, all row candidate regions other than the row candidate region corresponding to the name writing region are divided according to the above-mentioned criterion (2). In addition, the line candidate area corresponding to the name writing area satisfies the above equation (5) and has division candidate values 11DVP1, DVP
2 is set, but it is not divided because it does not meet any of the above criteria ■ to ■, and the character line that includes both the corresponding black blocks 81 (j) and 82 (j) in the segmented areas tlJR1 and R2. Region Zp (j) will be detected.

この発明は上述した実施例にのみ限定されるものではな
く、従って各構成成分の動作条件、動作の流れ、入出力
信号、数値的条件及びその(よかの条件を任意好適に変
更できる。
The present invention is not limited to the above-described embodiments; therefore, the operating conditions, flow of operation, input/output signals, numerical conditions, and other conditions of each component can be changed as desired.

例えば上述の式(5)にかえて次式(14)に従って分
割要素の検出及び分割候補位置の設定を行なうようにし
でもよい。
For example, instead of the above equation (5), the division elements may be detected and the division candidate positions may be set according to the following equation (14).

BBW(i+1)−88W(i)l>02・・・(14
) 但し行候補領域内の纂二周辺分布から検出された黒ブロ
ックにはさまれた白ブロックの総個数を1個とすれば、
1は1≦i≦工なる自然数である。
BBW(i+1)-88W(i)l>02...(14
) However, if the total number of white blocks sandwiched between black blocks detected from the Soji marginal distribution in the row candidate area is 1, then
1 is a natural number such that 1≦i≦k.

この場合、式(14)によつX座標の小ざい順に白ブロ
ックの行方向の幅の変動を調べ、所定値02以上の変動
のあった白ブロツク幅88W(i+1)を分割要素とし
で検出し、この分割要素に基づき分割候補位置DVP1
、DVP2を設定する。
In this case, the variation in the width of the white block in the row direction is checked in order of decreasing X coordinate according to equation (14), and the white block width 88W (i+1), which has a variation of more than a predetermined value of 02, is detected as a dividing element. Based on this dividing element, dividing candidate position DVP1 is determined.
, set DVP2.

(発明の効果) 上述した説明からも明らかなように、この発明の行検出
装置によれば、文書画像データの所定の処理領域を行方
向に走査しで得た第一の周辺分布に基づいて行候補領域
の列方向の始端及び終端位置を決定する0次いで行候補
′ga壇内を列方向に走査して得た第二の周辺分布に基
づいて行候補領域の行方向の始端及び終端位8%決定す
ると共に行候補領域内の分割要素の存否を判定する。
(Effects of the Invention) As is clear from the above description, according to the line detection device of the present invention, based on the first marginal distribution obtained by scanning a predetermined processing area of document image data in the line direction, Determine the starting and ending positions of the row candidate area in the column direction. Next, determine the starting and ending positions of the row candidate area in the row direction based on the second peripheral distribution obtained by scanning the row candidate'ga stage in the column direction. 8% is determined, and the presence or absence of a dividing element within the row candidate area is also determined.

そして分割要素が存在しない場合には行候補領域の位置
を行領域の位置としで検出する。
If there is no dividing element, the position of the line candidate area is detected as the position of the line area.

また分割要素か存在する場合には、行候補領域の行方向
の分割候補位置を設宇し、ざらに行候補領域を分割候補
位置で区分し、この区分領域内を行方向に走査しで得た
第三の周辺分布に基づいて区分領域内の文字行の特徴を
表す分割情報を抽出し、この分割情報に応じて行領域の
位置を決定する。この位置決定では着目する2つの区分
領域の対応する文字行同士につきその特徴を比較し、特
徴が同しであるとみなせる場合にはこれら文字行同士を
ひとつの文字行とし、また特徴が異なるとみなせる場合
にはこれら文字行をそれぞれ異なる文字行としで検出す
る。
In addition, if there is a dividing element, a dividing candidate position in the row direction of the row candidate area is set, the row candidate area is roughly divided by the dividing candidate position, and this divided area is scanned in the row direction. Based on the third marginal distribution, division information representing the characteristics of the character lines within the divided area is extracted, and the position of the line area is determined according to this division information. In this position determination, the characteristics of corresponding character lines in the two divided areas of interest are compared, and if the characteristics can be considered to be the same, these character lines are treated as one character line, and if the characteristics are different, then the character lines are treated as one character line. If possible, these character lines are detected as different character lines.

従ってこの発明装置によれば例えば第10図に示すよう
な行装置が入り組んた文書画像についても、周辺分布の
作成を繰返し周辺分布から得た黒ブロツク位置系列に基
づいて文字行を検出できる。これかため、文書画像から
小ブロック等を検出し小ブロック等の外接矩形の大きさ
や位置と処理対象の文書画像に間するルールとに基づい
て小ブロック等を統合し或は再分割することを繰返す従
来装置と比較して、行装置が入り組んた文書画像からの
行検出をより簡単な処理で正確に行なうことができる。
Therefore, according to the apparatus of the present invention, character lines can be detected based on the black block position series obtained from the peripheral distribution by repeatedly creating the peripheral distribution even for a document image with a complicated line arrangement as shown in FIG. 10, for example. For this reason, it is possible to detect small blocks etc. from a document image and integrate or re-divide the small blocks etc. based on the size and position of the circumscribed rectangle of the small blocks etc. and the rules between the document images to be processed. Compared to conventional devices that repeatedly perform line detection from complicated document images, the line device can accurately perform line detection with simpler processing.

また行検出をより簡単な処理で行なえるので、装置の小
規模化を図れるとと共に処理の高速化を図れる。
Furthermore, since line detection can be performed through simpler processing, the apparatus can be made smaller and the processing speed can be increased.

【図面の簡単な説明】[Brief explanation of drawings]

藁1図はこの発明の一実施例の構成の説明に供する機能
ブロック図、 第2図は文書画像データの例を示す図、爾3図は第2図
の文書画像データに関する第一の周辺分布を示す図、 藁4図は行検出部により第3図の第一の周辺分布から検
出した行候補領域を示す図、 第5図は第2図の文書画像データに闇する行候補領域の
画像データを示す図、 第6図は第5図の画像データに間する第二0周辺分布を
示す図、 第7図及び第8図は第5図の画像データに関する第三の
周辺分布を示す図、 第9図は分割判定液処理の流れの一例を示す図、 第10図は第2図の文書画像データに間する行検出結果
を示す図、 第11図〜菓13図は行検出結果の他の例を示す図であ
る。 10・・・行検出装置、 12−・周辺分布作成部14
−・・行候補検出部、16・・・行検出部20・・・画
像データ記憶部
Figure 1 is a functional block diagram for explaining the configuration of an embodiment of the present invention, Figure 2 is a diagram showing an example of document image data, and Figure 3 is a first marginal distribution regarding the document image data in Figure 2. Figure 4 is a diagram showing the line candidate area detected from the first peripheral distribution in Figure 3 by the line detection unit, Figure 5 is an image of the line candidate area hidden in the document image data in Figure 2. Figure 6 is a diagram showing the 20th marginal distribution between the image data in Figure 5. Figures 7 and 8 are diagrams showing the third marginal distribution regarding the image data in Figure 5. , Figure 9 is a diagram showing an example of the flow of division determination liquid processing, Figure 10 is a diagram showing the line detection results between the document image data of Figure 2, and Figures 11 to 13 are diagrams showing the line detection results. It is a figure which shows another example. 10... Row detection device, 12-. Marginal distribution creation unit 14
---Line candidate detection unit, 16...Line detection unit 20...Image data storage unit

Claims (1)

【特許請求の範囲】[Claims] (1)指定領域内の文書画像データを指定方向に走査し
て周辺分布を作成する周辺分布作成部と、前記指定領域
を所定の行検出対象領域とし及び前記指定方向を行方向
とする第一の指示情報に従って得られた第一の周辺分布
に基づいて行候補領域の列方向の位置を決定する行候補
検出部と、 前記指定領域を行候補領域とし及び前記指定方向を列方
向とする第二の指示情報に従って得られた第二の周辺分
布に基づいて、行候補領域の行方向の位置を決定すると
共に行候補領域内の分割要素の存否を判定し、 前記分割要素が存在する場合、前記行候補領域の行方向
の分割候補位置を設定し、前記指定領域を前記行候補領
域の分割要素で区分された区分領域とし及び前記指定方
向を行方向とする第三の指示情報に従って得られた第三
の周辺分布から分割情報を抽出し、該分割情報に応じて
行領域の位置を決定し、 前記分割要素が存在しない場合、前記行候補領域の位置
を行領域の位置として決定する行検出部とを具えて成る
ことを特徴とする行検出装置。
(1) A peripheral distribution creation unit that scans document image data in a designated area in a designated direction to create a peripheral distribution; a row candidate detection unit that determines the position of the row candidate area in the column direction based on the first marginal distribution obtained according to the instruction information; Based on the second marginal distribution obtained according to the second instruction information, the position of the row candidate area in the row direction is determined, and the presence or absence of a division element in the row candidate area is determined, and if the division element exists, A dividing candidate position in the row direction of the row candidate area is set, the specified area is set as a segmented area divided by dividing elements of the row candidate area, and the designated direction is set as the row direction. extracting division information from the third marginal distribution, determining the position of the row area according to the division information, and determining the position of the row candidate area as the position of the row area if the division element does not exist. 1. A row detection device comprising: a detection section.
JP2005384A 1990-01-12 1990-01-12 Line detecting device Pending JPH03210688A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
JP2005384A JPH03210688A (en) 1990-01-12 1990-01-12 Line detecting device

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
JP2005384A JPH03210688A (en) 1990-01-12 1990-01-12 Line detecting device

Publications (1)

Publication Number Publication Date
JPH03210688A true JPH03210688A (en) 1991-09-13

Family

ID=11609677

Family Applications (1)

Application Number Title Priority Date Filing Date
JP2005384A Pending JPH03210688A (en) 1990-01-12 1990-01-12 Line detecting device

Country Status (1)

Country Link
JP (1) JPH03210688A (en)

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JP2005302011A (en) * 2004-03-24 2005-10-27 Microsoft Corp Method and apparatus for populating electronic forms from scanned documents

Citations (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JPS6453281A (en) * 1987-05-01 1989-03-01 Ricoh Kk Area extraction method
JPH01295383A (en) * 1987-11-05 1989-11-29 Ricoh Co Ltd Region extraction method

Patent Citations (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JPS6453281A (en) * 1987-05-01 1989-03-01 Ricoh Kk Area extraction method
JPH01295383A (en) * 1987-11-05 1989-11-29 Ricoh Co Ltd Region extraction method

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JP2005302011A (en) * 2004-03-24 2005-10-27 Microsoft Corp Method and apparatus for populating electronic forms from scanned documents

Similar Documents

Publication Publication Date Title
EP0621553A2 (en) Methods and apparatus for inferring orientation of lines of text
JP3452774B2 (en) Character recognition method
US6834121B2 (en) Apparatus for rough classification of words, method for rough classification of words, and record medium recording a control program thereof
US20110043869A1 (en) Information processing system, its method and program
JP2011048816A (en) Discrimination method, discrimination device and computer program
JPH0713995A (en) Automatic text feature determination device
US5502777A (en) Method and apparatus for recognizing table and figure having many lateral and longitudinal lines
Dori et al. Segmentation and recognition of dimensioning text from engineering drawings
JPH0821057B2 (en) Document image analysis method
JPH08320914A (en) Table recognition method and device
JP3898645B2 (en) Form format editing device and form format editing program
JP3852218B2 (en) Image processing method and image processing apparatus
JPH03210688A (en) Line detecting device
JP4418726B2 (en) Character string search device, search method, and program for this method
JPH03142691A (en) Table format document recognizing system
JPH03225579A (en) Character pattern cutting device
JPH08161432A (en) Character cutting method and character cutting device
JP2993533B2 (en) Information processing device and character recognition device
JPS6154569A (en) Document image processing method
JPH0528310A (en) Form type document identification device
JP3435193B2 (en) String extraction method
JPH03268181A (en) Document reader
JP3074210B2 (en) Paper document image processing device
JPH11242716A (en) Image processing method and recording medium
JPH03240184A (en) Attribute decision device