JPH04112282A

JPH04112282A - Word area extracting system

Info

Publication number: JPH04112282A
Application number: JP2410749A
Authority: JP
Inventors: Hiroshi Nakayama; 寛中山
Original assignee: Ricoh Co Ltd
Current assignee: Ricoh Co Ltd
Priority date: 1990-07-27
Filing date: 1990-12-14
Publication date: 1992-04-14

Abstract

(57)【要約】本公報は電子出願前の出願データであるた
め要約のデータは記録されません。(57) [Summary] This bulletin contains application data before electronic filing, so abstract data is not recorded.

Description

[Detailed description of the invention]

［０００１］ [0001]

[Industrial application field]

本発明は、ハンディスキャナー装置によって読み込んだ
文書画像やデイスプレィ画面に表示された文書画像の部
分画像より、英単語のようなスペースで区切られた文字
列（単語と総称する）の領域を抽出する単語領域抽出方
式に関する。［０００２］The present invention is a word search system that extracts regions of character strings (generally referred to as words) separated by spaces, such as English words, from a document image read by a handy scanner device or a partial image of a document image displayed on a display screen. Regarding area extraction method. [0002]

[Conventional technology]

文字認識システムにおいて、ハンディスキャナー装置を
用いて原稿画像を入力する場合、読み取り領域が不適切
となって単語の切り出しく単語入力）を正しく行なうこ
とができないことがあった。それは従来のハンディスキ
ャナー装置が、そのような単語入力を前提としていなか
ったため、読み取り範囲を覗き窓から確認できるように
はなっていたが、原稿上の目的としている単語に対して
読み取り位置が適切であるか否かを簡単に確認するため
の手段を備えていなかったからである。［０００３］このような不都合を解決するため、原稿読み取り窓と原
稿上の単語の先頭及び行中心との相対的位置をそれぞれ
合わせるための二つのガイドライン（スタートラインと
センターライン）を有し、読み取り領域と単語の先頭及
び行中心との相対的位置がほぼ一定した読み取り画像を
得られるハンディスキャナー装置と、このガイドライン
付きハンディスキャナー装置によって読み込んだ画像よ
り、ガイドライン位置を利用して簡単な処理により単語
領域を抽出する単語領域抽出装置とを考案し、同一出願
人により「ハンディスキャナー装置及び単語領域抽出装
置」なる発明の名称にて平成２年７月１９日付で特許出
願がなされている。［０００４］また、デイスプレィ画面上において、マウス（ポインテ
ィングデバイス）などを用いて必要な単語領域を囲む矩
形の二つの対角頂点を指定し、この矩形内を単語領域と
して抽出するという方式がある。しかし、文書の行間が
狭い場合や単語間スペースが狭い場合、あるいは単語に
接近して罫線がある場合に、単語領域のみを囲む矩形の
指定が容易でないという問題がある。［０００５］この問題を解決するため、前記ハンディスキャナー装置
と同様のガイドラインを持つ、単語領域より十分大きな
一定の大きさの枠を定義し、この枠（全体、ガイドライ
ンのみ、あるいはガイドラインの交点部分のみ）を文書
画像とともにデイスプレィ画面に表示し、ガイドライン
を目的の単語領域の先頭及び行とほぼ一致させるように
枠を移動させ、枠内の部分画像を入力画像として、ガイ
ドラインの位置情報（基準位置情報）を利用し簡単な処
理によって目的の単語領域を抽出する単語領域抽出方式
を考案し、やはり同一出願人により「単語領域抽出方式
」なる発明の名称にて平成２年７月１９日付で特許出願
がなされている。［０００６］In a character recognition system, when inputting a document image using a handy scanner device, the reading area may become inappropriate and word extraction (word input) may not be performed correctly. This is because conventional handheld scanner devices were not designed for such word input, and although the reading range could be checked through a viewing window, it was difficult to find the correct reading position for the target word on the manuscript. This is because there was no means for easily confirming whether or not this was the case. [0003] In order to solve such inconveniences, two guidelines (start line and center line) are provided to align the relative positions of the document reading window and the beginning of words and line centers on the document, respectively. A handy scanner device that can obtain scanned images in which the relative position of the region, the beginning of a word, and the center of a line is almost constant, and a handheld scanner device with a guideline that can obtain a read image that uses the guideline position to easily process words. A patent application was filed on July 19, 1990 under the title of "Handy scanner device and word region extraction device" by the same applicant. [0004] There is also a method in which two diagonal vertices of a rectangle surrounding a desired word area are specified on a display screen using a mouse (pointing device) or the like, and the inside of this rectangle is extracted as a word area. However, there is a problem in that it is not easy to specify a rectangle that encloses only the word area when the document has narrow line spacing, narrow inter-word space, or when there are ruled lines close to words. [0005] In order to solve this problem, we define a frame of a certain size that is sufficiently larger than the word area and has a guideline similar to that of the handheld scanner device. ) on the display screen together with the document image, move the frame so that the guideline almost matches the beginning and line of the target word area, use the partial image inside the frame as the input image, and input the position information of the guideline (reference position information). ) and devised a word region extraction method that extracts the target word region through simple processing, and filed a patent application on July 19, 1990 under the title of "Word region extraction method" by the same applicant. is being done. [0006]

[Problem to be solved by the invention]

本発明の目的は、前記特許出願に係る単語領域抽出装置
あるいは単語領域抽出方式を改良し、ガイドライン付き
ハンディスキャナー装置によって読み込まれた文書画像
あるいはデイスプレィ画面上でガイドライン付き粋によ
り指定された領域の文書画像を人力画像として、人力画
像のスキュー　が大きい場合にも、より確実に単語領域
を抽出可能な単語領域抽出方式を提供することにある。［０００７］An object of the present invention is to improve the word region extraction device or word region extraction method related to the above patent application, and to improve the word region extraction device or the word region extraction method according to the above patent application, and to improve the word region extraction device or the word region extraction method according to the patent application. An object of the present invention is to provide a word region extraction method that can more reliably extract word regions even when the image is a human image and the skew of the human image is large. [0007]

[Means to solve the problem]

特許請求の範囲の請求項１記載の発明は、ガイドライン
付きハンディスキャナー装置によって読み込まれた文書
画像あるいはデイスプレィ画面上でガイドライン付き枠
により指定された領域の文書画像のような、抽出すべき
目的単語領域の先頭基準位置及び行基準位置の情報を有
する入力画像について、行基準位置より水平射影を調べ
ることにより目的単語領域の行を切り出し、切り出した
行の範囲について、先頭基準位置の近傍の垂直射影を調
べることによって目的単語領域の先頭位置を検出し、検
出した先頭位置の後の垂直射影を調べることによって目
的単語領域の末尾位置を検出する単語領域抽出方式にお
いて、入力画像を行方向に一定の重なりを持つ複数の領
域に分割し、先頭の分割領域から末尾の分割領域へ向か
って分割領域毎に、直前の分割領域の行切り出しの結果
を参照しながら目的単語領域の行の切り出しを行なうこ
とを特徴とする。［０００８］請求項２記載の発明は、前記請求項１記載の発明による
各分割領域の行切り出しにおいて、行の上端及び下端の
探索限界を設定し、それぞれの探索限界を越えても、行
の上端または下端に対応した水平射影値が閾値以上から
閾値未満へ変化する位置を検出できないときは、上端ま
たは下端で行接続が起こったと判断し、水平射影値を利
用した接続行を強制分離する処理によって目的単語領域
の行の上端または下端の位置を決定することを特徴とす
る。［０００９］請求項３記載の発明は、前記請求項１記載の発明による
各分割領域の行切り出しにおいて、二つ以上の行が切り
出された場合に、一定の条件を満たすときに二つ以上の
行を併合することを特徴とする。［００１０］The invention recited in claim 1 provides a target word area to be extracted, such as a document image read by a handy scanner device with a guideline or a document image in an area specified by a frame with a guideline on a display screen. For an input image that has information on the start reference position and line reference position, extract the line of the target word area by examining the horizontal projection from the line reference position, and calculate the vertical projection of the vicinity of the start reference position for the range of the cut out line. In the word region extraction method, which detects the beginning position of the target word region by examining the target word region and detects the end position of the target word region by examining the vertical projection after the detected beginning position, the input image is The target word area is divided into multiple areas with Features. [0008] The invention according to claim 2 sets search limits for the upper and lower ends of the row in the row extraction of each divided area according to the invention according to claim 1, and even if each search limit is exceeded, the row is still If the position where the horizontal projection value corresponding to the top or bottom edge changes from above the threshold to below the threshold cannot be detected, it is determined that a line connection has occurred at the top or bottom edge, and the process of forcibly separating the connected lines using the horizontal projection value. It is characterized by determining the position of the upper end or lower end of the line of the target word area. [0009] The invention according to claim 3 provides that when two or more rows are cut out in the row cutting of each divided area according to the invention according to claim 1, two or more rows are cut out when a certain condition is satisfied. It is characterized by merging rows. [0010]

[Effect]

請求項１ないし３記載の発明によれば、人力画像の持つ
先頭基準位置（スタートライン位置）及び行基準位置（
センターライン位置）を利用するため単語領域を簡単な
処理によって効率的に抽出可能であるとともに、入力画
像の行方向の分割領域毎に、直前の分割領域での行切り
出し結果を参照しつつ行切り出しを行なうので、入力画
像全体について一括して行切り出しを行なう方式に比べ
、入力画像のスキューがあっても、スキューによって行
接続が起こりやすい、行間が狭い文書や単語と罫線が接
近したような文書中の目的単語領域の行をより確実に切
り出すことが可能である。［００１１］また請求項２の発明によれば、行接続を判断しその強制
分離を行なうため、入力画像のスキューがかなり大きい
場合においても、目的単語の行を適切に切り出すことが
可能となるため、ハンディスキャナー装置によって入力
画像を読み込む場合のようにスキューが大きくなりゃす
い場合においても、確実な単語領域抽出が可能である。［００１２］さらに請求項３の発明によれば、一つであるべき行が二
つ以上の行に分離する場合、例えばｉやｊのような文字
の上の黒丸部分が一つの行として切り出されたような場
合に、それを修正して、より的確な単語領域抽出を達成
できる。［００１３］According to the invention described in claims 1 to 3, the first reference position (start line position) and the row reference position (
By using the center line position), word regions can be extracted efficiently through simple processing, and lines can be extracted for each divided region in the row direction of the input image while referring to the line clipping results of the immediately preceding divided region. Therefore, compared to a method that performs line cutting for the entire input image at once, even if there is a skew in the input image, line connections are likely to occur due to skew, and this method is useful for documents with narrow line spacing or documents where words and ruled lines are close together. It is possible to more reliably cut out lines in the target word area. [0011] Furthermore, according to the invention of claim 2, since the line connections are determined and the forced separation is performed, it is possible to appropriately cut out the line of the target word even when the skew of the input image is quite large. It is possible to reliably extract word regions even when the skew is likely to be large, such as when an input image is read by a handy scanner device. [0012] Furthermore, according to the invention of claim 3, when a line that should be one line is separated into two or more lines, for example, a black circle above a character such as i or j is cut out as one line. In such cases, it can be corrected to achieve more accurate word region extraction. [0013]

【Example】

図１は本発明を実施するためのシステムの一例を示すブ
ロック図である。１は処理や各種制御を実行するＣＰＵ
、２はオペレーティングシステムや本発明に係る単語領
域抽出処理、その他アプリケーションのプログラムなど
を記憶するためのプログラムメモリ、３は処理の中間デ
ータや処理結果データなどを記憶するためのデータメモ
リ、４と５はＣＲＴデイスプレィとそのインタフェース
、６は画像データを記憶するための画像メモリ、７と８
はマウスとキーボード、９はそのインタフェース、１０
は画像データファイルなどのデータファイルやオペレー
ティングシステム、各種プログラムを保存するための補
助記憶装置、１１はそのインタフェース、１２と１３は
ガイドライン付きハンディスキャナー装置とそのインタ
フェースである。［００１４］単語領域抽出処理の対象となる入力画像は、ハンディス
キャナー装置１２より直接入力され画像メモリ６に取り
込まれた画像であるか、あるいは、補助記憶装置１０よ
り画像メモリ６に取り込まれた画像のＣＲＴデイスプレ
ィ３の画固上で領域指定された部分画像である。［００１５］いずれにしても、図２に示すように、単語領域抽出処理
の対象となる人力画像２０は、目的単語領域の先頭基準
位置であるスタートライン２１と目的単語領域の行基準
位置であるセンターライン２２の情報を持っている。［００１６］ここで、ハンディスキャナー装置１２は、図３に示すよ
うな読み取り窓３０を通して原稿の画像を読み取るもの
で、この読み取り窓３０と原稿上の目的単語領域との位
置合わせのためのガイドラインとしてスタートライン３
１とセンターライン３２とを持っている。図２のスター
トライン２１とセンターライン２２はスタートライン３
１とセンターライン３２の中心線にそれぞれ対応する。このようなハンディスキャナー装置１２によって文書画
像を読み込む場合、原稿上の目的単語領域゛の先頭（左
端）にスタートライン３１を、また目的単語領域の行の
中心にセンターライン３２の中心を、はぼ一致させるよ
うに読み取り窓３０．の位置合わせをし、読み取り窓３
０の領域の画像を読み込む。［００１７］また、大型イメージスキャナー装置などによってページ
単位で読み取られて補助記憶装置１０に格納されている
文書画像中の単語領域抽出を行なう場合、必要ページの
文書画像を画像メモリ６に読み込み、これをＣＲＴデイ
スプレィ６の画面に表示させ、入力画像の領域を指定す
る。図４は、この人力画像領域指定の説明図であり、画
面４０に図３に示した読み取り窓３０と同様の、単語領
域より大きな枠４１を文書画像と重ねて表示し、スター
トライン４２を目的の単語（ここでは’Ｒｅｃｏｇｎｉ
ｔｉｏｎ”　）の領域の先頭と、かつセンターライン４
３を目的単語領域の行と、はぼ一致させるように枠４１
を移動させる。この枠４１内の部分画像が単語領域抽出
処理の入力画像となる。図２のスタートライン２１とセ
ンターライン２２はスタートライン４２とセンターライ
ン４３の中心線にそれぞれ対応する。［００１８］なお、枠４０は図４に示したように全体を画面に表示し
てもよいカミ枠線を表示せず、スタートライン４２とセ
ンターライン４３だけを表示させたり、あるいはスター
トライン４２とセンターライン４３の交点部分だけを表
示させてもよい。要は、二つのガイドラインと目的単語
領域との位置合わせができればよい。［００１９］次に、入力画像（その画像データは画像メモリ６上にあ
る）より目的の単語領域を抽出する処理を説明する。こ
の処理の全体の流れは図５に示す通りである。以下、各処理段階について具体的に説明する。［００２０］分副領埃ｑ設定上まず、図２に示した基準位置情報付きの入力画像２０を
行方向に複数の領域に分割する。隣合う分割領域は一部
重なりを持たせる。入力画像２０を３分割した例を図６
に示す。なお、この例のように左端の分割領域にスター
トライン゛２１が含まれるように分割領域の大きさを選
ぶ。［，００２１］へ判領域′位の′テ七り出し２一つの分割領域についての処理の概要を図７に示す。な
お、この行切り出し処理は左端（先端）の分割領域から
右端（末尾）の分割領域へ向かって順に実行する。（ｏｏｚｚ７位置である。［００２３］始位置とする。［００２４］ただし、行端探索開始位置のｙ座標Ｙは次の条件画像高
さＸｉ／４＜Ｙ＜画像高さ×３／４を満足しなければな
らない。この条件を満足する行端探索開始位置を設定で
きなＴデイスプレィ４の画面上での枠位置設定のエラー
と判断し、域抽出処理は異常終了となる。［００２５］この段階で単語領ン位置に代えて、左隣の分割領域において抽出された現
在性（目的の単語領域の行とみなされた行）の中心のラ
イン位置を起点として、左端の分割領域の場合と同様の
手順により行端探索開始位置を求める。［００２６］ただし、行端探索開始位置のｙ座標Ｙの満足すべき条件
は、左隣分割領域の現在行上端十ノイズ高さくＹかつＹ〈左隣分割領域の現在性下端−ノイズ高さである。こ
のような条件を満足する行端探索開始位置が設定できな
い場合、行切り出し処理は正常終了となる。目的の単語
領域の後（右）に十分な空白が存在しその空白部分の分
割領域（図５の空白分割領域）の場合に、このような正
常終了となる。［００２７］環ステップ１０２　′一端探索打ち切り位置の計定行端
探索開始位置から上向きまたは下向きに行端を探索する
力へ行端探索開始位置から十分能れた位置まで探索を進
めても行端を発見できないときは、隣合った行が接続し
ていると考えられるので、ある位置までで行端探索を打
ち切るべきである。ここでは、そのような探索限界であ
る行端探索打ち切り位置を設定する［００２８］まず左端の分割領域について説明する。行の高さ（ｙ方
向幅）を推定する情報がないため、処理対象とする文字
の高さの最大値より多少大きな高さだけ行端探索開始位
置から離れた位置を行端探索打ち切り位置とする。［００２９］次に左端の分割領域以外の分割領域について説明する。左隣の分割領域で現在行だけが検出され、その前（上）
の行も次（下）の行も検出されていなない場合図８に示
すように、左隣の分割領域における現在行の高さをｈｃ
として、同現在行の上端からｈｃだけ上の位置を上側の
行端探索打ち切り位置とし、現在行の下端からｈｃだけ
下の位置を下側の行端探索打ち切り位置とする。ただし
、このようにして求めた打ち切り位置が画像の上端また
は下端を越えている場合は、画像の上端または下端を打
ち切り位置とする。［００３０１他方、左隣の分割領域で現在行のみならず前の行または
次の行も検出されている場合、左隣の分割領域の現在行
の上端または下端からｈｃだけ上または下の位置カミ前
の行の下端より下または次の行の上端より上ならば、そ
れぞれの位置を上側または下側の行端探索打ち切り位置
とする。しかし、図９に示すように、現在行の上端から
ｈｃだけ上の位置が前の行の下端より上のときは、両位
置の中点位置を上側の打ち切り位置とし、同様に現在行
の下端がらｈｃだけ下の位置が次行の上端より下のとき
は、両位置の中点位置を下側の打ち切り位置とする。FIG. 1 is a block diagram showing an example of a system for implementing the present invention. 1 is a CPU that executes processing and various controls
, 2 is a program memory for storing the operating system, word area extraction processing according to the present invention, and other application programs, 3 is a data memory for storing processing intermediate data, processing result data, etc., 4 and 5 is a CRT display and its interface, 6 is an image memory for storing image data, 7 and 8
is the mouse and keyboard, 9 is its interface, 10
1 is an auxiliary storage device for storing data files such as image data files, an operating system, and various programs; 11 is an interface thereof; and 12 and 13 are handheld scanner devices with guidelines and their interfaces. [0014] The input image to be subjected to the word region extraction process is either an image directly input from the handy scanner device 12 and imported into the image memory 6, or an image imported into the image memory 6 from the auxiliary storage device 10. This is a partial image where an area is designated on the image frame of the CRT display 3. [0015] In any case, as shown in FIG. 2, the human image 20 that is the target of the word region extraction process has a start line 21 that is the first reference position of the target word region and a line reference position of the target word region. I have information about center line 22. [0016] Here, the handy scanner device 12 reads an image of a document through a reading window 30 as shown in FIG. starting line 3
1 and a center line 32. The starting line 21 and center line 22 in Figure 2 are the starting line 3.
1 and center line 32, respectively. When reading a document image with such a handy scanner device 12, the start line 31 is placed at the beginning (left end) of the target word area on the document, the center line 32 is placed at the center of the row of the target word area, and Reading window 30 to match. and read window 3.
Load the image of the 0 area. [0017] Furthermore, when extracting word regions from a document image that has been read page by page by a large image scanner or the like and stored in the auxiliary storage device 10, the document image of the required page is read into the image memory 6, is displayed on the screen of the CRT display 6, and the area of the input image is designated. FIG. 4 is an explanatory diagram of this manual image area designation, in which a frame 41 larger than the word area, similar to the reading window 30 shown in FIG. word (here 'Recogni)
tion") and the center line 4
Draw a frame 41 to match 3 with the row of the target word area.
move. The partial image within this frame 41 becomes the input image for the word area extraction process. The start line 21 and center line 22 in FIG. 2 correspond to the center lines of the start line 42 and center line 43, respectively. [0018] Furthermore, as shown in FIG. 4, the frame 40 may be displayed entirely on the screen without displaying the frame line, and only the start line 42 and center line 43 may be displayed, or the start line 42 and center line 43 may be displayed. Only the intersection of the center lines 43 may be displayed. In short, it is sufficient to align the two guidelines and the target word area. [0019] Next, a process for extracting a target word area from an input image (the image data of which is on the image memory 6) will be described. The overall flow of this process is as shown in FIG. Each processing step will be specifically explained below. [0020] In setting the subregion q, first, the input image 20 with reference position information shown in FIG. 2 is divided into a plurality of regions in the row direction. Adjacent divided areas are made to partially overlap. Figure 6 shows an example of dividing the input image 20 into three parts.
Shown below. Note that, as in this example, the size of the divided area is selected so that the leftmost divided area includes the start line 21. [,0021] 2. Extracting 2 of the 2nd half of the divided region An outline of the processing for one divided region is shown in FIG. Note that this line cutting process is performed in order from the left end (tip) divided region to the right end (tail) divided region. (oozz7 is the position. [0023] It is the starting position. [0024] However, the y-coordinate Y of the line end search starting position satisfies the following condition: image height Xi/4<Y<image height x 3/4 It is determined that there is an error in setting the frame position on the screen of the T-display 4 where the line end search start position that satisfies this condition cannot be set, and the area extraction process ends abnormally. [0025] At this stage Instead of the word area position, the center line position of the currentness extracted in the left adjacent divided area (the line considered to be the line of the target word area) is used as the starting point, as in the case of the leftmost divided area. Find the line end search start position using the following procedure. [0026] However, the conditions that the y-coordinate Y of the line end search start position must satisfy are that the current line upper end of the left adjacent divided area is 10 noise high, and Y<left neighbor The current lower end of the divided area - the noise height.If a line end search start position that satisfies these conditions cannot be set, the line cutting process ends normally. If there is a blank space and the divided area of the blank part (blank divided area in FIG. 5), this is the normal end. If you cannot find the end of a row even if you proceed from the start position of the end of the row search to a position where you can find the end of the row upward or downward, it is likely that adjacent rows are connected. , the line end search should be discontinued up to a certain position.Here, the line end search discontinuation position, which is the search limit, is set [0028] First, the left end divided area will be explained. Since there is no information for estimating the line-end search start position, the line-end search abort position is set at a position that is slightly larger than the maximum height of the character to be processed from the line-end search start position. [0029] Next The divided areas other than the leftmost divided area are explained below. Only the current line is detected in the leftmost divided area, and the line before it (above) is detected.
If neither the row nor the next (bottom) row has been detected, as shown in Figure 8, the height of the current row in the left adjacent divided area is hc
, the position hc above the top of the current line is the upper line end search abort position, and the position hc below the bottom of the current line is the lower line end search abort position. However, if the cutoff position obtained in this way exceeds the top or bottom edge of the image, the top or bottom edge of the image is set as the cutoff position. [00301 On the other hand, if not only the current line but also the previous or next line is detected in the divided area to the left, the position of the divided area to the left is hc above or below from the top or bottom of the current line. If it is below the bottom edge of the previous line or above the top edge of the next line, the respective position is set as the top or bottom line end search abort position. However, as shown in Figure 9, when the position hc above the top of the current line is higher than the bottom of the previous line, the midpoint of both positions is set as the upper truncation position, and similarly the bottom of the current line is When the position below hc is below the top end of the next line, the midpoint position between both positions is set as the lower truncation position.

【００３月行端探索開始位置から下向きに水平射影がオンからオフ
へ変化する行端の探索を打ち切り位置まで行なう　（ス
テップ１０３）。［００３２］行端が検出された場合は、この行端を現在行の下たんと
し、さらに下向きに水平射影が再びオンする位置すなわ
ち現在行の次（下）の行の上端を探索する（ステップ１
０４）。［００３３］ステップ１０３の処理で行端が検出されなかった場合、
現在行と次行とが接続していると判断されるので、接続
行の強制分離の処理（ステップ１ｏ５）を行なつ。［００３４］すなわち、左端の分割領域においては、図１０に示すよ
うに、行端探索開始位置と下側の行端探索打ち切り位置
とを１：３に内分する位置から、下側の行端探索打ち切
り位置までの範囲内で水平射影の値が最小の位置を、現
在行の下端とする。［００３５］左端の分割領域以外の分割領域においては、図１１に示
すように、左隣の分割領域での現在行の下端からｈｅ／
４（ｈｅは現在行の高さ）だけ上の位置より下側の打ち
切り位置までの範囲内で水平射影の値が最小の位置を、
当該分割領域における現在行の下端とする。【００３６】ステップ１０６ないし１０８　上向き′〒”探索・１続
行の強制分離行端探索開始位置より上向きに水平射影が
オンからオフへ変化する行端の探索を行端探索打ち切り
位置まで行なう（ステップ１０６）。［００３７］行端が検出された場合は、この行端を現在行の上端とし
、さらに上向きに水平射影が再びオンする位置すなわち
現在行の前（上）の行の下端を探索する（ステップ１０
７）。［００３８］ステップ１０６の処理で行端が検出されなかった場合、
現在行と次行とが接続していると判断し、接続行の強制
分離の処理（ステップ１０８）を行なう。［００３９］すなわち、左端の分割領域においては、図１２に示すよ
うに、行端探索開始位置と上側の行端探索打ち切り位置
とを１：３に内分する位置から打ち切り位置までの範囲
内で水平射影の値が最小の位置を、現在行の上端とする
。［００４０］左端の分割領域以外の分割領域においては、図１３に示
すように、左隣の分割領域での現在行の上端からｈｃ／
４（ｈｃは現在行の高さ）だけ下の位置より上側の打ち
切り位置までの範囲内で水平射影値が最小の位置を、当
該分割領域におけ現在行の上端とする。［００４１］ステップ１０９　′−分離のチエツクと′合ステップ１
０５及び１０８における行の強制分離の妥当性を行の高
さや左隣の分割領域での現在行の行端などからチエツク
し、分離位置が不適当で統合すべきときは統合を行なっ
て現在行の範囲を修正する。行間が狭い文書や目的の単
語に接近して罫線がある文書などは、その画像のスキュ
ーが大きい場合などに行接続が起こりやすい。このよう
な行接続が起こった場合でも、以上のような分割領域単
位の行切り出し処理により、適切な行切り出しが可能で
あるため、次の単語先頭及び末尾の検出処理を正常に行
ない目的の単語領域を抽出可能となる。［００４２］単紐先販Ｏ検量主処理の内容は図１４に示す通りである。この単語先頭検
出は、スタートライン２１の位置を基準として、先に切
り出された現在性の範囲内の垂直射影を調べることによ
り単語の先頭文字の左端を検出するもので、先頭文字と
スタートラインとの位置関係としては図１５の（ａ）、
　　（ｂ）または（Ｃ）に示すような場合がある。［００４３］まずスタートライン位置の垂直射影値を閾値と比較しく
ステップ２０１）　閾値未満であれば図１５（ａ’）の
ような場合である可能性が大きいので、スタートライン
位置より右側に垂直射影値が閾値以上となる位置を探し
、その位置を先頭文字の左端すなわち単語領域の先頭位
置として仮決定しくステップ２０２）　その位置より右
に垂直射影値が閾値以上の範囲がある値以上続いている
か調べる（ステップ２０３）。垂直射影値が閾値以上の
範囲が一定値以上続いているときは、図１５（ａ）のよ
うな場合であるので、仮決定した先頭位置を本決定して
データメモリ３に書き込む（ステップ２０４）。［００４４］ただし、図１５（ａ）の場合でも、スタートラインと先
頭文字との間のペース内のノイズがステップ２０２で先
頭位置として検出されることがある。この場合法のステ
ップ２０３で条件を満足しないので、ステップ２０２に
戻り、さらに右へ先頭文字を探すことにより、最終的に
先頭文字の左端すなわち単語領域の先頭位置を検出する
。［００４５］図１５（ｂ）または（Ｃ）の場合のように、スタートラ
インがある文字に重なっている場合、ステップ２０１で
垂直射影値が閾値以上と判定されるので、スタートライ
ンから左と右に垂直射影値が閾値未満となる位置を探し
、その位置をスタートラインと重なった文字の左端と右
端とする（ステップ２０５）。そして、注目している文
字の幅が一定値以上であるか調べる（ステップ２０６）
。文字幅が一定値以上であれば、間違いなくスタートラ
インの位置は文字内であるので、ステップ２０５で検出
した左端と右端のいずれがスタートラインに近いか調べ
る（ステップ２０７）。左端のほうがスタートラインに
近いならば、図１５（ｃ）の場合であるので、ステップ
２０５で検出した左端の位置を単語領域の先頭位置に決
定し、データメモリ３に書き込む。［００４６］ステップ２０７で右端のほうがスタートラインに近いと
判定したときは、図１５（ｂ）のように先頭文字より前
の文字がスタートラインと重なっている場合であるので
、ステップ２０２に分岐し、さらに右に単語領域の先頭
を探す。［００４７］また、ステップ２０６で文字幅が一定値未満と判定され
たときは、図１５（ａ）のような場合であるが、これは
スタートラインの直下にノイズがあると判断されるので
、ステップ２０２に分岐して先頭位置を検出する。［００４８］単眼末尾桧山処理↓ 処理内容は図１６に示す通りである。この処理は、単語
の先頭文字を起点として、隣合う文字の間のスペースを
調べ、スペース幅が狭い文字は単語内と判断し、広いス
ペースが見つかったときは、その前の方の文字を末尾単
語とし、その右端を単語領域の末尾位置として検出する
ものである。図１７はその説明図である［００４９］まず、スペースが広いか狭いかの判定のためのスペース
幅閾値を求める（ステップ３０１）。すなわち、先に検
出された行の高さ（または垂直射影値の最大値）と予め
設定された文字の縦横比とから基準文字幅を計算する。この基準文字幅の１．５倍の値と、先に検出された先頭
文字の前のスペースの幅とを比較し、そのスペース幅の
ほうが大きいときは、先頭文字の前のスペースを無効と
する。同様に、基準文字幅の０．　５倍の値と、先頭文
字の前のスペースとを比較し、そのスペース幅のほうが
小さいときも先頭文字の前のスペースを無効とする。［００５０］そして、先頭文字の前のスペースが無効とされた場合は
、計算により求めた基準文字幅の４分の１の値をスペー
ス幅閾値に決定する。先頭文字の前のスペースが有効の
場合は、そのスペース幅の２分の１の値と基準文字幅の
４分の１の値の平均をスペース幅閾値とする。［００５１］このようにしてスペース幅閾値を決定すると、先頭文字
を注目文字に設定しくステップ３０２）　注目文字の右
の垂直射影を調べることにより、注目文字の右に隣接す
る文字を探す（ステップ３０３）。右に隣接する文字が
見つからないときは、その時の注目文字を単語の末尾文
字とし、その右端位置を単語末尾位置として検出し、こ
れをデータメモリ３に書き込む（ステップ３０４）。［００５２］注目文字の右隣接文字が見つかったときは、両文字間の
スペースの幅を求めてスペース幅閾値と比較しくステッ
プ３０５）　スペース幅がスペース幅閾値以上であれば
注目文字を末尾文字とし、その右端を単語領域末尾位置
として検出する（ステップ３０４）。スペース幅がスペ
ース幅閾値より小さいときは、右側の文字を改めて注目
文字に設定しくステップ３０６）　ステップ３０３から
の処理を行なう。［００’５３］以上の処理により単語の行範囲の先頭から末尾までの領
域が単語領域として抽出される。［００５４］次に、分割領域における行切り出し処理（図５の処理２
）の変形例を、図１９に示すフローチャートを用い説明
する。この変形例は、分離性の併合処理に特徴がある。［００５５］ステップ４０１は図７のステップ１０１と同様の行端探
索開始位置の設定を行なう処理ステップである。ステッ
プ４０２は図７のステップ１０２と同様の行端探索打ち
切り位置の設定を行なう処理ステップである。ステップ
４０３゛は、図７のステップ１０３からステップ１０７
までの処理と同様に、行端探索開始位置より上下に水平
射影を行端探索打ち切り位置まで調べて行を切り出す処
理ステ・ノブであるが、切り出すのは現在行だけである
。ステップ４０８は図７のステ・ノブ１０８と同様の接
続行強制分離を行なう処理ステップである。［００５６］ステップ４０４においては、現在行の下端より下向きに
、画像の下端を行端探索打ち切り位置として、水平射影
を調べることにより現在行の次（下）の行を切り出す。ステップ４０６においては、現在行の上端より上向きに
、画像の上端を行端探索打ち切り位置として、水平射影
を調べることにより現在行の前（上）の行を切り出す。［００５７］ステップ４０．５，４０７は行分離のチエツクと分離性
併合を行なう処理ステ・ノブである力板その説明に先だ
って、ここでの処理の対象となる行分離の様態について
図１９及び図２０により説明する。［０’０５８］併合を考慮すべき行分離は２種類に分類できるが、ここ
では、それぞれを連続性分離と非連続行分離と呼ぶ。［００５９］現在処理中の分割領域内の分離行が、左隣の分割領域内
の現在行と重なりを持っている場合に、この分離行が連
続性分離を起こした行である。図１９に、処理中の分割
領域内の現在行の前の行が連続性分離を起こした例を示
す。［００６０１他方、左隣の分割領域内の現在行と重なりを持たない状
態で、現在行から分離した行がある場合、この分離行は
非連続行分離を起こした行である。図２０Ｆ、現在処理
中の分割領域内における現在行の前の行カミ非連続行分
離を起こした例を示す。［００６１］まず連続性分離を起こした行の併合について説明する。この併合処理は、一定の条件を満たす間は、前の行、次
の行ともに繰り返し実行される。また、これはステップ
４０７による「上の連続分離行に対する併合処理」とス
テ・ノブ４０５による「下の連続分離行に対する併合処
理」とがある。［００６２］「上の連続分離行に対する併合処理」について説明する
。この併合を実行するための条件は「現在行の前（上）の分離行（注目分離行）カミ左隣の
分割領域内の現在行と重なりを持ち、カリ、左隣領域内
の現在行の上端から、該現在行の高さの４分の１の値だ
け上の位置より、注目分離行の上端が下にある。」とい
うことである。この条件を満たす注目分離行は現在行に
併合する。［００６３］「下の連続分離行に対する併合処理」は、連続性分離を
起こした前の行を現在行に併合する処理で、その併合条
件は「上の連続分離行に対する併合処理」の併合条件の
上下関係を入れ替えたものである。［００６４］次に、非連続行分離を起こした行の併合処理について説
明する。これは現在行の前（上）の分離行に対してのみ
ステップ４０７で実行される。ｉやｊのような文字の上
の黒丸部分が分離行になる場合を考慮すれば、一般に十
分であるからである。次のａからｅの条件がすべて満た
される場合に、前の行を現在行に併合する。ａ、注目する前の行の下端力板左隣の分割領域内の現在
行の上端より上にある（非連続行分離の条件）。ｂ、注目する前の行と現在行との間隔カミ現在行の高さ
の３分の１未満である（行間と行高さの関係）。Ｃ０左隣の分割領域で前の行が検出されている場合にお
いて、現在処理中の分割領域内の現在行の高さの３分の
１だけ左隣の分割領域内の前の行の下端から上方の位置
より下に、現在処理中の分割領域内の注目している前の
行の上端がある。ｄ、現在処理中の分割領域において、現在行の高さが、
注目する前の行の高さの３倍よりも大きい（行高さの比
）。ｅ、注目する前の行の垂直射影より求めた最大文字幅が
、当該前の行の高さの３分の２より／卦かい（文字幅）
。［００６５］003 The search for the line end where the horizontal projection changes from on to off is performed downward from the line end search start position to the abort position (step 103). [0032] If a line end is detected, this line end is set as the bottom of the current line, and the position where horizontal projection is turned on again downwards, that is, the top end of the next (lower) line of the current line is searched (step 1
04). [0033] If no line end is detected in the process of step 103,
Since it is determined that the current line and the next line are connected, the process of forcibly separating the connected lines (step 1o5) is performed. [0034] In other words, in the left end divided area, as shown in FIG. The position with the minimum horizontal projection value within the range up to the search abort position is set as the lower end of the current line. [0035] In the divided areas other than the leftmost divided area, as shown in FIG.
The position where the horizontal projection value is the minimum within the range from the position above 4 (he is the height of the current row) to the cutoff position below,
The lower end of the current line in the divided area. Steps 106 to 108 Upward ``〒'' Search - A search for the line end where the horizontal projection changes from on to off is performed upward from the line end search start position for one continuation up to the line end search abort position (step 106 ). [0037] If a line end is detected, this line end is set as the top edge of the current line, and the search is performed upward to find the position where horizontal projection is turned on again, that is, the bottom edge of the line before (above) the current line ( Step 10
7). [0038] If no line end is detected in the process of step 106,
It is determined that the current line and the next line are connected, and processing for forcibly separating the connected lines (step 108) is performed. [0039] In other words, in the leftmost divided area, as shown in FIG. The position with the minimum horizontal projection value is the top of the current line. [0040] In the divided areas other than the leftmost divided area, as shown in FIG. 13, hc/
The position where the horizontal projection value is minimum within the range from the position below 4 (hc is the height of the current line) to the cutoff position above is set as the upper end of the current line in the divided area. [0041] Step 109'-Separation Check and Match Step 1
Check the validity of the forced separation of lines in 05 and 108 from the line height and the line end of the current line in the divided area on the left, and if the separation position is inappropriate and should be merged, merge and replace the current line. Modify the range of. In documents with narrow line spacing or documents with ruled lines close to the target word, line connections are likely to occur if the image skew is large. Even if such a line connection occurs, it is possible to properly cut out the line by performing the line cutting process for each divided area as described above, so that the next word start and end detection process can be performed normally and the desired word can be found. It becomes possible to extract the area. [0042] The contents of the single string pre-sale O weighing main process are as shown in FIG. This word start detection detects the left end of the first character of a word by checking the vertical projection within the currentity range cut out earlier, using the position of the start line 21 as a reference, and detects the left end of the first character of the word using the position of the start line 21 as a reference. The positional relationship of FIG. 15(a),
There are cases as shown in (b) or (C). [0043] First, compare the vertical projection value of the starting line position with the threshold value (step 201). If it is less than the threshold value, there is a high possibility that the case is as shown in FIG. Find a position where the value is equal to or greater than the threshold value, and tentatively determine that position as the left end of the first character, that is, the first position of the word area.Step 202) Does the range of vertical projection values equal to or greater than the threshold value continue to the right of that position? Check (step 203). When the range in which the vertical projection value is equal to or greater than the threshold value continues for a certain value or more, the case is as shown in FIG. . [0044] However, even in the case of FIG. 15(a), noise within the pace between the start line and the first character may be detected as the first position in step 202. Since the condition is not satisfied in step 203 of this case, the process returns to step 202 and searches further to the right for the first character, thereby finally detecting the left end of the first character, that is, the first position of the word area. [0045] If the start line overlaps a certain character as in the case of FIG. 15(b) or (C), it is determined in step 201 that the vertical projection value is greater than the threshold, so A search is made for a position where the vertical projection value is less than a threshold value, and these positions are determined as the left and right ends of the character overlapping the start line (step 205). Then, check whether the width of the character of interest is greater than a certain value (step 206)
. If the character width is equal to or greater than a certain value, the start line is definitely within the character, so it is checked whether the left end or the right end detected in step 205 is closer to the start line (step 207). If the left end is closer to the start line, this is the case as shown in FIG. [0046] If it is determined in step 207 that the right end is closer to the start line, this means that the characters before the first character overlap with the start line, as shown in FIG. 15(b), so the process branches to step 202. , find the beginning of the word area further to the right. [0047] Furthermore, when the character width is determined to be less than a certain value in step 206, which is the case as shown in FIG. 15(a), it is determined that there is noise directly below the start line. The process branches to step 202 to detect the leading position. [0048] Monocular trailing Hiyama processing↓ The processing details are as shown in FIG. This process starts from the first character of a word and examines the spaces between adjacent characters. Characters with narrow spaces are determined to be within the word, and if a wide space is found, the previous character is moved to the end. The right end of the word is detected as the end position of the word area. FIG. 17 is an explanatory diagram thereof [0049] First, a space width threshold value for determining whether the space is wide or narrow is determined (step 301). That is, the reference character width is calculated from the previously detected line height (or the maximum vertical projection value) and a preset character aspect ratio. This value of 1.5 times the standard character width is compared with the width of the space before the first character detected earlier, and if the space width is larger, the space before the first character is invalidated. . Similarly, the standard character width is 0. The 5 times value is compared with the space before the first character, and even if the space width is smaller, the space before the first character is invalidated. [0050] If the space before the first character is invalidated, a value of 1/4 of the calculated reference character width is determined as the space width threshold. If the space before the first character is valid, the average of the value of 1/2 of the space width and the value of 1/4 of the reference character width is set as the space width threshold. [0051] Once the space width threshold is determined in this way, the first character is set as the character of interest (step 302). By examining the vertical projection to the right of the character of interest, the character adjacent to the right of the character of interest is searched (step 303). ). If the adjacent character on the right is not found, the character of interest at that time is determined as the last character of the word, its right end position is detected as the word end position, and this is written in the data memory 3 (step 304). [0052] When a character adjacent to the right of the character of interest is found, the width of the space between both characters is determined and compared with the space width threshold (step 305). If the space width is greater than or equal to the space width threshold, the character of interest is treated as the last character. , its right end is detected as the end position of the word area (step 304). If the space width is smaller than the space width threshold, the character on the right is set as the character of interest again (step 306), and the processing from step 303 is performed. [00'53] Through the above processing, the area from the beginning to the end of the word line range is extracted as a word area. [0054] Next, line cutting processing in the divided area (processing 2 in FIG.
) will be explained using the flowchart shown in FIG. This modification is characterized by separable merging processing. [0055] Step 401 is a processing step for setting a line end search start position similar to step 101 in FIG. Step 402 is a processing step similar to step 102 in FIG. 7 to set the line end search abort position. Step 403'' is from step 103 to step 107 in FIG.
Similar to the previous processing, the processing step knob examines the horizontal projection above and below the line end search start position to the line end search abort position and cuts out the line, but only the current line is cut out. Step 408 is a processing step for forcibly separating connected lines similar to step knob 108 in FIG. [0056] In step 404, the next (below) line of the current line is cut out by checking the horizontal projection downward from the bottom of the current line, with the bottom edge of the image as the line end search termination position. In step 406, the line before (above) the current line is cut out by checking the horizontal projection upward from the top of the current line, with the top of the image as the line end search abort position. [0057] Steps 40.5 and 407 are processing knobs that check line separation and perform separable merging. 20 will be explained. [0'058] Row separations that should be considered for merging can be classified into two types, which are herein referred to as continuity separation and non-continuous line separation. [0059] If a separated line in the divided area currently being processed overlaps the current line in the adjacent divided area on the left, this separated line is the line in which continuity separation has occurred. FIG. 19 shows an example in which continuity separation occurs in the line before the current line in the divided area being processed. [00601] On the other hand, if there is a line separated from the current line without overlapping with the current line in the adjacent divided area on the left, this separated line is a line in which discontinuous line separation has occurred. FIG. 20F shows an example in which a non-continuous line separation occurs between the lines before the current line in the divided area currently being processed. [0061] First, merging of rows that have caused continuity separation will be explained. This merging process is repeatedly executed for both the previous row and the next row as long as a certain condition is met. Furthermore, this includes "merging processing for upper continuous separated rows" by step 407 and "merging processing for lower continuous separated rows" by step knob 405. [0062] The "merging process for the above continuous separated rows" will be explained. The conditions for executing this merge are ``The separating line (separated line of interest) before (above) the current line overlaps with the current line in the divided area on the left side, and the current line in the area on the left side overlaps. The upper end of the separated row of interest is below the position above the upper end by a value equal to one-fourth of the height of the current row. The separated line of interest that satisfies this condition is merged with the current line. [0063] "Merge processing for continuous separated rows below" is a processing for merging the previous row that caused continuity separation into the current row, and its merging condition is the merging condition of "merging processing for continuous separated rows above". The upper and lower relationships are reversed. [0064] Next, the process of merging rows that have caused non-continuous row separation will be described. This is performed in step 407 only for the separate line before (above) the current line. This is because it is generally sufficient to consider the case where a black circle above a character such as i or j becomes a separate line. If all of the following conditions a to e are satisfied, merge the previous row with the current row. a, The lower end of the previous row of interest is above the upper end of the current row within the divided area to the left of the force plate (condition for discontinuous row separation). b. The distance between the previous line of interest and the current line is less than one third of the height of the current line (relationship between line spacing and line height). C0 When the previous row is detected in the divided area on the left, the height of the current line in the divided area currently being processed is increased by one-third from the bottom of the previous line in the divided area on the left. Below the upper position is the upper end of the previous row of interest in the divided area currently being processed. d. In the divided area currently being processed, the height of the current line is
Greater than three times the height of the previous row of interest (row height ratio). e, the maximum character width determined from the vertical projection of the previous line of interest is less than two-thirds of the height of the previous line / trigram (character width)
. [0065]

【Effect of the invention】

以上説明したように、請求項１または２記載の発明によ
れば、人力画像の持つ先頭基準位置（スタートライン位
置）及び行基準位置（センターライン位置）を利用して
簡単かつ効率的に単語領域が抽出可能となるとともに、
行方向の分割領域毎の行切り出し処理さらには接続行分
離処理によって、画像人力にハンディスキャナー装置を
用いる場合のように人力画像のスキューが大きいときで
も、スキューによる行接続が起こりやすい行間の狭い文
書や単語と罫線が接近したような文書中の目的単語領域
を確実に抽出することができる。As explained above, according to the invention described in claim 1 or 2, word regions can be easily and efficiently created by using the head reference position (start line position) and line reference position (center line position) of the human image. can be extracted, and
Line cutting processing for each divided area in the row direction as well as connecting line separation processing enables documents with narrow line spacing that are likely to cause line connections due to skew, even when the skew of the human image is large, such as when using a handheld scanner for image processing. It is possible to reliably extract a target word area in a document where words and ruled lines are close to each other.

[Brief explanation of drawings]

【図１】本発明を実施するためのシステムの一例を示すブロック
図である。FIG. 1 is a block diagram showing an example of a system for implementing the present invention.

【図２】入力画像の説明図である。[Figure 2] FIG. 3 is an explanatory diagram of an input image.

【図３】ガイドライン付きハンディスキャナー装置の読み取り窓
及びガイドラインの説明図である。FIG. 3 is an explanatory diagram of a reading window and a guideline of a handy scanner device with a guideline.

【図４】デイスプレィ画面上での入力画像領域の指定方法及びガ
イドライン付き枠の説明図である。FIG. 4 is an explanatory diagram of a method of specifying an input image area on a display screen and a frame with a guideline.

【図５】単語領域抽出処理の全体的流れを示すフローチャートで
ある。FIG. 5 is a flowchart showing the overall flow of word region extraction processing.

【図６】入力画像の領域分割の一例の説明図である。[Figure 6] FIG. 3 is an explanatory diagram of an example of region division of an input image.

【図７】分割領域毎の行切り出し処理のフローチャートである。[Figure 7] 12 is a flowchart of line cutting processing for each divided area.

【図８】行端探索打ち切り位置の設定方法の説明図である。[Figure 8] FIG. 6 is an explanatory diagram of a method of setting a line end search abort position.

【図９】行端探索打ち切り位置の設定方法の説明図である。[Figure 9] FIG. 6 is an explanatory diagram of a method of setting a line end search abort position.

【図１０】行下端側の接続強制分離の説明図である。[Figure 10] FIG. 7 is an explanatory diagram of forced connection/separation on the lower end side of a row.

【図１１】行下端側の接続強制分離の説明図である。[Figure 11] FIG. 7 is an explanatory diagram of forced connection/separation on the lower end side of a row.

【図１２】打上端側の接続強制分離の説明図である。[Figure 12] FIG. 3 is an explanatory diagram of forced connection and separation on the launch end side.

【図１３】打上端側の接続強制分離の説明図である。[Figure 13] FIG. 3 is an explanatory diagram of forced connection and separation on the launch end side.

【図１４】単語先頭検出処理のフローチャートである。[Figure 14] It is a flowchart of word head detection processing.

【図１５】スタートラインと単語との位置関係の説明図である。[Figure 15] FIG. 2 is an explanatory diagram of the positional relationship between a start line and words.

【図１６】単語末尾検出処理のフローチャートである。[Figure 16] It is a flowchart of word end detection processing.

【図１７】単語末尾検出の説明図である。[Figure 17] It is an explanatory diagram of word end detection.

[Explanation of symbols]

　　ＣＰＵ２　プログラムメモリ３　データメモリ４　　ＣＲＴデイスプレィ６　画像メモリ７　マウス８　キーボード１０　補助記憶装置１２　ガイドライン付きハンディスキャナー装置２０　
人力装置スタートラインセンターライン読み取り窓スタートラインセンターライン画面枠スタートラインセンターラインCPU 2 Program memory 3 Data memory 4 CRT display 6 Image memory 7 Mouse 8 Keyboard 10 Auxiliary storage device 12 Handy scanner device with guideline 20
Human power device start line center line reading window start line center line screen frame start line center line

[Document core]

図面 drawing

【図１】[Figure 1]

【図２】[Figure 2]

【図５】[Figure 5]

【図６】[Figure 6]

【図７】[Figure 7]

【図８】[Figure 8]

【図９】[Figure 9]

【図１１】[Figure 11]

【図１２】[Figure 12]

【図１３】[Figure 13]

【図１４】[Figure 14]

【図１５１【図１６】［図１７】[Figure 151 [Figure 16] [Figure 17]

【図１９】[Figure 19]

【図２０】[Figure 20]

【図１８】[Figure 18]

【Document name】【Filing date】【address】 [Display of incident] 【application number】 [Name of the invention] [Person making the correction] [Relationship with the incident] 【Identification number】 [Name or title] [Agent] 【Identification number】【patent attorney】 [Name or title] [Shipping number]

【手続補正　１】[Procedural amendment 1]

[Name of document to be corrected] [Correction target item name] [Correction method] [Contents of correction] [Agent] 【Identification number】【patent attorney】 [Name or title]

【手続補正　２】[Procedural amendment 2]

【補正対象書類名】[Name of document to be corrected]

[Correction target item name] [Correction method] [Contents of correction]

手続補正書平成３年４月２２日 Procedural amendment April 22, 1991

Claims

[Claims]

Claim 1: For an input image having information on the start reference position and line reference position of the target word area to be extracted, cut out the line of the target word area by examining the horizontal projection above and below the line reference position, and extract the extracted line. For the range of , the starting position of the target word area is detected by examining the vertical projection in the vicinity of the starting reference position, and the end position of the target word area is detected by examining the vertical projection after the detected starting position. In the extraction method, the input image is divided into multiple regions with a certain overlap in the row direction, and for each divided region from the first divided region to the last divided region, the row cutting results of the previous divided region are referred to. A word region extraction method characterized by cutting out lines in a target word region.

[Claim 2] In line extraction of each divided area, search limits are set for the upper and lower ends of the line, and even if each search limit is exceeded, the horizontal projection value corresponding to the upper or lower edge of the line is greater than or equal to the threshold. If a position that changes to less than or equal to less than 2. The word region extraction method according to claim 1.

3. The method according to claim 1, wherein when two or more lines are cut out in each divided area, the two or more lines are merged when a certain condition is satisfied. Word region extraction method.