JPH0578068B2

Info

Publication number: JPH0578068B2
Application number: JP60077633A
Authority: JP
Inventors: Mariko Takenochi; Masahiro Shimizu
Original assignee: Matsushita Electric Industrial Co Ltd
Current assignee: Panasonic Holdings Corp
Priority date: 1985-04-12
Filing date: 1985-04-12
Publication date: 1993-10-28
Also published as: JPS61235990A

Description

DETAILED DESCRIPTION OF THE INVENTION

Field of Industrial Application

The present invention relates to a character recognition device that recognizes printed and handwritten characters, such as those in newspapers and magazines, and converts them into coded information such as JIS codes.

Conventional Technology

Conventional character recognition devices have targeted documents whose format (vertical or horizontal writing, line spacing, character spacing, and so on) is clearly defined, that is, documents in which the absolute position on the page of each character to be read is known in advance. This restricts the documents that such devices can handle. To remove this restriction, a method has been used that detects the line spacing with a two-dimensional Fourier transform of the input image, extracts the line direction (vertical or horizontal writing), and determines the order of the recognition candidate characters so that the meaning of the text can be understood even for documents of unknown format (for example, Hase and Hoshino, "Periodic characteristics of printed character strings," IEICE (D), J65-D, 2, pp. 298-299).

Problems to Be Solved by the Invention

However, the conventional technique of detecting the line direction of an input document by a two-dimensional Fourier transform of the input image is slow, because the two-dimensional Fourier transform requires a large amount of computation.

The present invention has been made in view of this point. Its object is to provide a character recognition device that detects the line direction of an input image by a simple method and can edit the recognition candidate characters so that the meaning of the text can be understood even for documents of unknown format.

Means for Solving the Problems

To solve the above problem, the present invention scans the input image in the vertical and horizontal directions to obtain histograms of the pixels forming character portions, and compares the average character spacing lengths in the vertical and horizontal directions obtained from those histograms. In this way the line direction of the document is extracted simply, and the recognition candidate characters are edited accordingly.

Operation

With the technical means described above, the present invention can extract the line direction of a document of unknown format at high speed and edit the recognition candidate characters so that the meaning of the text can be understood.

Embodiments

An embodiment of the present invention is described below with reference to the drawings.

FIG. 1 is a block diagram of an embodiment of the character recognition device according to the present invention. Reference numeral 1 denotes an image input unit, which scans an image containing the characters to be recognized and stores it in image memory 2 as a binary signal. Reference numeral 3 denotes a line direction determination unit, which scans image memory 2 to determine whether the input image is written vertically or horizontally and, at the same time, detects the line addresses. Reference numeral 4 denotes a character extraction unit, which scans image memory 2 line by line using the line direction and line addresses detected by line direction determination unit 3, cuts out the character images to be recognized one character at a time using the projection of each line image, and detects the character address of each character on the input image. Reference numeral 5 denotes a recognition unit, which obtains feature quantities, such as strokes, of each character cut out by character extraction unit 4, compares them with the feature quantities of the characters registered in advance in dictionary 6, and takes the most similar character as the recognition candidate character. Reference numeral 7 denotes an editing unit, which uses the line direction and line addresses obtained by line direction determination unit 3 and the character addresses obtained by character extraction unit 4 to arrange the recognition candidate characters extracted by recognition unit 5 in an order in which the meaning of the text can be understood, and stores them in text memory 8 as character codes.

The operation of the character recognition device configured as described above is explained using the input image P shown in FIG. 2 as an example.

The image P input from image input unit 1 is stored in image memory 2 as binary data, with 1 for character pixels and 0 for white pixels. First, line direction determination unit 3 scans the input image P stored in image memory 2 and obtains the vertical histogram $H_v$ and the horizontal histogram $H_h$ of the pixels forming character portions over the entire input image, as shown in FIG. 2. To separate character portions from inter-character portions, each of the histograms $H_v$ and $H_h$ is divided into inter-character portions, where the histogram value is 0 pixels or less, and character portions, where it is greater than 0 pixels, and the start address of each portion is obtained. In FIG. 2, $y_{s1}, y_{s2}, \ldots, y_{si}, \ldots$ and $x_{s1}, x_{s2}, \ldots, x_{si}, \ldots$ are the start addresses of the character portions, and $y_{e1}, y_{e2}, \ldots, y_{ei}, \ldots$ and $x_{e1}, x_{e2}, \ldots, x_{ei}, \ldots$ are the start addresses of the inter-character portions. From these addresses, the average $\overline{y_{s(i+1)} - y_{ei}}$ of the vertical character spacing length $y_{s(i+1)} - y_{ei}$ is compared with the average $\overline{x_{s(i+1)} - x_{ei}}$ of the horizontal character spacing length $x_{s(i+1)} - x_{ei}$. The vertical value is larger than the horizontal value, which shows that the line direction of the input image P is horizontal writing. Furthermore, since the line direction of the input image P has been determined to be horizontal, the start addresses $y_{s1}, y_{e1}, \ldots, y_{si}, y_{ei}, \ldots$ of the character portions and inter-character portions of histogram $H_v$ become the line addresses of the input image P.
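The line direction decision described above can be sketched in a few lines of Python. This is a minimal illustration, not the patented implementation: the function names (`projection_histograms`, `runs`, `mean_gap`, `line_direction`), the `threshold` parameter, the list-of-lists image layout, and the handling of a tie (reported as vertical) are all assumptions made for the example.

```python
def projection_histograms(image):
    """Return (h_v, h_h) for a binary image given as a list of rows of 0/1.

    h_v holds one black-pixel count per y address (row),
    h_h holds one black-pixel count per x address (column).
    """
    h_v = [sum(row) for row in image]
    h_h = [sum(col) for col in zip(*image)]
    return h_v, h_h


def runs(hist, threshold=0):
    """Split a histogram into character parts and inter-character parts.

    starts[i] is the start address of the i-th character part,
    ends[i] is the start address of the gap that follows it.
    """
    starts, ends = [], []
    inside = False
    for addr, value in enumerate(hist):
        if value > threshold and not inside:
            starts.append(addr)
            inside = True
        elif value <= threshold and inside:
            ends.append(addr)
            inside = False
    return starts, ends


def mean_gap(hist, threshold=0):
    """Average spacing length between consecutive character parts."""
    starts, ends = runs(hist, threshold)
    # Gap i runs from ends[i] up to the next character start, starts[i + 1].
    gaps = [nxt - end for end, nxt in zip(ends, starts[1:])]
    return sum(gaps) / len(gaps) if gaps else 0.0


def line_direction(image, threshold=0):
    """Return "horizontal" if the mean vertical gap exceeds the mean
    horizontal gap, otherwise "vertical" (ties included, by assumption)."""
    h_v, h_h = projection_histograms(image)
    if mean_gap(h_v, threshold) > mean_gap(h_h, threshold):
        return "horizontal"
    return "vertical"
```

For a page judged horizontal, `runs(h_v)` gives the start addresses that the description treats as line addresses. The sketch costs one pass over the pixels plus one pass over each histogram, which is the sense in which it is simpler than a two-dimensional Fourier transform.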

Next, character extraction unit 4 uses the line addresses to extract the line image L shown in FIG. 3 from image memory 2. Projecting the extracted line image L gives the histogram $H_1$ shown in FIG. 3. From histogram $H_1$, the horizontal addresses $(z_{s1}, z_{e1}), \ldots, (z_{si}, z_{ei}), \ldots$ of the individual characters are obtained, the character images to be recognized are cut out one character at a time, and each character address is determined by combining these addresses with the line address.
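The character cutting step for a horizontally written line can be illustrated as follows. This is a hedged sketch: `cut_characters`, the half-open address convention (start inclusive, end exclusive), and the tuple layout of the returned character address are assumptions chosen for the example, not details given in the description.

```python
def cut_characters(image, line_top, line_bottom):
    """Cut one text line of a binary image into per-character boxes.

    image is a list of rows of 0/1; the line occupies rows
    line_top (inclusive) to line_bottom (exclusive).
    Returns a list of (x_start, x_end, line_top, line_bottom) tuples,
    i.e. the horizontal address of each character combined with the
    line address.
    """
    line_image = image[line_top:line_bottom]
    # Project the line image onto the x axis: black pixels per column.
    hist = [sum(col) for col in zip(*line_image)]

    boxes = []
    start = None
    for x, value in enumerate(hist):
        if value > 0 and start is None:
            start = x                      # a character part begins
        elif value == 0 and start is not None:
            boxes.append((start, x, line_top, line_bottom))
            start = None                   # an inter-character gap begins
    if start is not None:                  # character touching the right edge
        boxes.append((start, len(hist), line_top, line_bottom))
    return boxes
```

Note that a purely projection-based cut like this splits any character whose left and right parts are separated by a blank column; the description does not say how such cases are handled, so the sketch leaves them as is.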

The rectangles R of the extracted characters to be recognized are input sequentially to recognition unit 5. FIG. 4a shows the extracted character 松 to be recognized. For each pixel of the extracted 松, a direction code is set by checking whether M or more pixels (M is set in advance), including the pixel of interest, are connected in each direction indicated by the arrows in FIG. 4b. For each direction code, the connectivity of the pixels is examined to extract strokes, and feature quantities such as the number, position, and length of the strokes are extracted. FIG. 4a shows the stroke extraction result for the character 松. The extracted feature quantities are compared with the feature quantities of the characters registered in dictionary 6, and the most similar character, 松, is taken as the recognition candidate character.
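The direction-code step can be sketched as below. The description does not list the directions drawn in FIG. 4b, so the four directions used here (horizontal, vertical, and the two diagonals), their labels, and the choice to count the run on both sides of the pixel of interest are assumptions. Only the direction-code assignment is shown; stroke extraction and dictionary matching are not.

```python
# Hypothetical direction set: (dy, dx) steps for each assumed direction code.
DIRECTIONS = {"H": (0, 1), "V": (1, 0), "D1": (1, 1), "D2": (1, -1)}


def run_length(image, y, x, dy, dx):
    """Number of connected black pixels through (y, x) along (dy, dx),
    counting the pixel itself and both sides of it."""
    height, width = len(image), len(image[0])
    count = 1
    for sign in (1, -1):
        yy, xx = y + sign * dy, x + sign * dx
        while 0 <= yy < height and 0 <= xx < width and image[yy][xx]:
            count += 1
            yy += sign * dy
            xx += sign * dx
    return count


def direction_codes(image, m):
    """Map each black pixel (y, x) to the set of direction codes along
    which at least m connected black pixels pass through it."""
    codes = {}
    for y, row in enumerate(image):
        for x, value in enumerate(row):
            if value:
                codes[(y, x)] = {
                    name
                    for name, (dy, dx) in DIRECTIONS.items()
                    if run_length(image, y, x, dy, dx) >= m
                }
    return codes
```

On a plus-shaped glyph with M = 3, the pixels of the horizontal bar receive the code "H", those of the vertical bar receive "V", and the crossing pixel receives both. Grouping connected pixels that share a code would then yield candidate strokes.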

The recognition candidate characters for the characters cut out of the input image P, which recognition unit 5 outputs sequentially, are arranged by editing unit 7 horizontally from top left to bottom right as 「松」「下」「電」「器」…, based on the fact that the input image P is written horizontally and on the respective character addresses. They are thus edited so that the meaning of the text can be understood, and are stored in text memory 8 as character codes.
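The editing step amounts to sorting the recognition candidates by their addresses. The sketch below assumes a hypothetical candidate format `(character, line_address, character_address)` and, for vertical writing, assumes that columns are read from right to left, as is usual for Japanese text; the description itself only works through the horizontal case.

```python
def edit_order(candidates, direction):
    """Arrange recognition candidates into reading order.

    candidates: list of (char, line_address, char_address) tuples.
      - horizontal writing: line_address is the y address of the line,
        char_address the x address within it.
      - vertical writing: line_address is the x address of the column,
        char_address the y address within it.
    """
    if direction == "horizontal":
        # Lines top to bottom, characters left to right.
        key = lambda c: (c[1], c[2])
    else:
        # Assumed: columns right to left, characters top to bottom.
        key = lambda c: (-c[1], c[2])
    return "".join(c[0] for c in sorted(candidates, key=key))


# Hypothetical layout: two horizontal lines, given in arbitrary order.
sample = [("器", 5, 3), ("松", 0, 0), ("電", 5, 0), ("下", 0, 3)]
text = edit_order(sample, "horizontal")
```

The resulting string can then be encoded into whatever character code the text memory uses.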

In the character recognition device configured as described above, the line direction and line addresses obtained by a simple method are used to cut out the characters to be recognized and then to edit the recognition candidate characters, so that a character string whose meaning can be understood is produced. Furthermore, connecting the text memory to a document processing device or the like makes new document editing possible.

In this embodiment the histograms used for line direction determination were obtained over the entire input image. To cope with skewed lines and similar conditions, the line direction can instead be determined by dividing the input image into blocks and obtaining the histograms for each block.
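A block-wise variant might look like the following sketch. The description does not say how the per-block results are combined, so the majority vote used here is purely an assumption, as are the function names, the block size parameters, and the decision to let blocks with no clear result abstain.

```python
from collections import Counter


def mean_gap(hist):
    """Average length of the zero runs lying between non-zero runs."""
    gaps, gap, seen_ink = [], 0, False
    for value in hist:
        if value > 0:
            if seen_ink and gap:
                gaps.append(gap)   # a gap between two character parts ended
            seen_ink, gap = True, 0
        elif seen_ink:
            gap += 1               # inside a gap after some character part
    return sum(gaps) / len(gaps) if gaps else 0.0


def block_direction(block):
    """Direction for one block, or None when the two averages are equal."""
    mean_v = mean_gap([sum(row) for row in block])
    mean_h = mean_gap([sum(col) for col in zip(*block)])
    if mean_v == mean_h:
        return None
    return "horizontal" if mean_v > mean_h else "vertical"


def blockwise_direction(image, block_h, block_w):
    """Majority vote (assumed rule) over block_h x block_w tiles."""
    votes = Counter()
    for top in range(0, len(image), block_h):
        for left in range(0, len(image[0]), block_w):
            block = [row[left:left + block_w]
                     for row in image[top:top + block_h]]
            result = block_direction(block)
            if result is not None:
                votes[result] += 1
    return votes.most_common(1)[0][0] if votes else None
```

Because each block covers only a short stretch of every line, a slight skew shifts the text by fewer pixels within a block than across the full page width, so the blank rows between lines are more likely to survive in the per-block histogram.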

Effects of the Invention

According to the present invention, the input image is scanned in the vertical and horizontal directions to obtain histograms of the pixels forming character portions, and the average character spacing lengths in the vertical and horizontal directions obtained from those histograms are compared. Because this simple method extracts the line direction of the input image at high speed, the recognition candidate characters of a document of unknown format can be edited so that the meaning of the document can be understood.

[Brief explanation of drawings]

FIG. 1 is a block diagram of a character recognition device according to an embodiment of the present invention; FIG. 2 is an explanatory diagram of an input image and of the method for determining its line direction and line addresses; FIG. 3 is an explanatory diagram of the method for cutting out the characters to be recognized from the input image; and FIG. 4 is an explanatory diagram of the character recognition method.

1: image input unit; 2: image memory; 3: line direction determination unit; 4: character extraction unit; 5: recognition unit; 6: dictionary; 7: editing unit; 8: text memory.

Claims

1. A character recognition device comprising: an image input unit that inputs an image containing characters to be recognized; a line direction determination unit that scans the input image in the vertical and horizontal directions to obtain histograms of the pixels forming character portions, and that determines vertical or horizontal writing by comparing, between the vertical direction and the horizontal direction, the average value of the character spacing length, defined as the number of consecutive scanning lines on which the histogram value is N pixels or less; a character extraction unit that extracts recognition target characters from the input image; a recognition unit that extracts recognition candidate characters by comparing the recognition target characters with a dictionary; and an editing unit that edits the recognition candidate characters.