JPH0850631A

JPH0850631A - Character recognition device

Info

Publication number: JPH0850631A
Application number: JP6184740A
Authority: JP
Inventors: Shiori Ooaku; 志緒理大阿久
Original assignee: Ricoh Co Ltd
Current assignee: Ricoh Co Ltd
Priority date: 1994-08-05
Filing date: 1994-08-05
Publication date: 1996-02-20

Abstract

(57)【要約】【構成】誤認判定部１０３は、文字認識処理部１０２
による認識結果の信頼度が低く誤認の可能性の高い文字
を判別する。誤認領域判定部１０４は、誤認の可能性が
高いと判定された文字、その前後各ｎ文字を含めた範
囲、あるいは、半数以上の文字が誤認の恐れが高いと判
断さた行の範囲を誤認領域と判定する。誤認領域抽出部
１０５は、誤認領域内の文字については、認識結果に加
えてイメージと候補文字群を認識結果ファイル１０８に
書き込む。【効果】認識結果ファイルのサイズを小さくでき、文
字認識装置のメモリ容量・ファイル容量が小さい場合に
も不都合がない。認識結果ファイルのデータを用いて誤
認文字の修正を行なう際に、修正に必要な文字イメージ
及び候補文字群を表示させることができるため、効率的
な修正作業が可能である。 (57) [Summary] [Configuration] The misidentification determination unit 103 includes a character recognition processing unit 102.
Characters with low reliability of recognition result by and highly likely to be misidentified are identified. The erroneous recognition area determination unit 104 erroneously recognizes a character that is determined to have a high possibility of being erroneously recognized, a range including each n characters before and after the character, or a range of a line in which more than half of the characters are likely to be erroneous. Judge as an area. The false positive area extraction unit 105 writes an image and a candidate character group in the recognition result file 108 in addition to the recognition result for the characters in the false positive area. [Effect] The size of the recognition result file can be reduced, and there is no inconvenience even when the memory capacity and file capacity of the character recognition device are small. When the erroneously recognized character is corrected using the data of the recognition result file, the character image and the candidate character group necessary for the correction can be displayed, so that the effective correction work can be performed.

Description

Detailed Description of the Invention

【０００１】[0001]

【産業上の利用分野】本発明は、文字認識装置に関す
る。BACKGROUND OF THE INVENTION 1. Field of the Invention The present invention relates to a character recognition device.

【０００２】[0002]

【従来の技術】従来、文字認識と誤認文字の修正に関し
ては、同一装置上において、原稿単位に文字認識と誤認
文字修正とを連続的に行なうというのが一般的であっ
た。この誤認文字修正時に、ディスプレイ上に原稿イメ
ージあるいは文字イメージを表示させることにより、原
稿に視点を移すことなく、ディスプレイ上で原稿と認識
結果とを突き合わせる方法が採用されることが多い（例
えば特開平４−３４６７１号）。また、効率的な修正を
可能にするため、ディスプレイ上に候補文字群を表示さ
せ、その中から正解文字を選択できるようにする等、さ
まざまな方法が考案されている。2. Description of the Related Art Heretofore, with respect to character recognition and correction of erroneously recognized characters, it has generally been common to perform character recognition and erroneously recognized character correction successively for each document on the same apparatus. When correcting this erroneous character, a method is often employed in which the original image or the character image is displayed on the display to match the original with the recognition result on the display without shifting the viewpoint to the original (for example, Kaihei 4-34671). Further, in order to enable efficient correction, various methods have been devised, such as displaying a group of candidate characters on a display so that the correct character can be selected from them.

【０００３】また、一般的に、文字認識と誤認文字修正
は同一のアプリケーションとして行なわれるため、文字
認識結果の候補文字群や文字イメージを保存する必要が
なく、また、保存しない場合が多い。文字イメージを保
存する場合でも、文書全体あるいは認識処理対象範囲の
イメージをそっくり保存するのが一般的であった。Further, since character recognition and correction of erroneously recognized characters are generally performed as the same application, it is not necessary or necessary to save a candidate character group or character image of the character recognition result. Even when a character image is saved, it is common to save the entire document or the image of the recognition processing target range.

【０００４】[0004]

【発明が解決しようとする課題】さて、文字認識装置
は、単独装置として実現されることが多かったが、スキ
ャナーや複写機等の機器に組み込まれる形態も増加しつ
つある。The character recognition device has often been realized as a single device, but the number of forms incorporated into devices such as scanners and copying machines is increasing.

【０００５】この組み込み形態の場合、機器のメモリ容
量もしくはファイル容量が制約されたり、あるいはキー
ボード等の文字入力手段が存在しないか、存在してもそ
の能力が弱体であることが少なくない。このような機器
では、文字認識だけを実行し、その誤認文字の修正は他
の装置上で別のアプリケーションとして行なうのが得策
であり、また、そうせざるを得ない場合が多いであろ
う。In the case of this built-in form, the memory capacity or file capacity of the device is restricted, or there is no character input means such as a keyboard, or even if it exists, its ability is often weak. In such a device, it is a good idea to perform only character recognition, and to correct the erroneously recognized character as another application on another device, and in many cases, it will be unavoidable.

【０００６】このように文字認識と誤認文字修正とを分
離する場合に、前述のような誤認文字の効率的な修正を
可能にするためには、文字認識装置側で認識結果ととも
に、候補文字群及び文字領域のイメージをファイルとし
て保存するようにすると好都合であることは明らかであ
る。しかし、文字認識装置側でメモリ容量もしくはファ
イル容量を多くとれない場合、すべての文字について、
イメージと候補文字群を保存することは能力的に無理で
あったり、また、誤認文字の修正を実行する装置側の環
境にも負荷がかかる可能性がある。When the character recognition and the erroneous character correction are separated as described above, in order to enable the efficient correction of the erroneously recognized character as described above, the character recognition device side recognizes the recognition result and the candidate character group. Obviously, it is convenient to save the image of the character area as a file. However, if the memory capacity or file capacity cannot be increased on the character recognition device side, for all characters,
It may be impossible to save the image and the candidate character group, and the environment of the device side that corrects the misidentified character may be burdened.

【０００７】また、同じ文字認識装置上で文字認識と誤
認文字修正とを別アプリケーションとして実行する形態
においても、認識結果のみならず候補文字群とイメージ
とを保存すると好都合であるが、文字認識装置のメモリ
容量もしくはファイル容量を大きくとれない場合には、
すべての文字についてのイメージと候補文字群を保存す
ることは困難である。Further, even in a mode in which character recognition and erroneous character correction are executed as different applications on the same character recognition device, it is convenient to store not only the recognition result but also the candidate character group and the image. If the memory capacity or file capacity of is not large,
It is difficult to store images and candidate characters for all characters.

【０００８】本発明の目的は、文字認識と誤認文字修正
とを文字認識装置と他の装置とに分離して行なう処理形
態において、あるいは、同一の文字認識装置上で文字認
識と誤認文字修正とを別アプリケーションとして行なう
処理形態において、文字認識装置のメモリ容量もしくは
ファイル容量を多くとれない場合にも、効率的な誤認文
字修正を可能にしようとするものである。より具体的に
は、本発明の目的は、文字認識を行なって認識結果ファ
イルを生成する文字認識装置において、誤認文字修正作
業の効率を犠牲にすることなく、認識結果ファイルのサ
イズを削減することにある。An object of the present invention is to perform character recognition and erroneous character correction separately in a character recognition device and another device, or in the same character recognition device. In a processing mode in which is performed as a separate application, even if the memory capacity or file capacity of the character recognition device cannot be increased, it is intended to enable efficient erroneous character correction. More specifically, an object of the present invention is to reduce the size of a recognition result file in a character recognition device that performs character recognition to generate a recognition result file without sacrificing the efficiency of the operation of correcting a misidentified character. It is in.

【０００９】[0009]

【課題を解決するための手段】請求項１ないし５の各項
の発明は文字認識装置の発明であり、上記目的を達成す
るため、それぞれ以下に述べる構成を有するものであ
る。The invention according to each of claims 1 to 5 is an invention of a character recognition device, and has the following configurations to achieve the above object.

【００１０】請求項１の発明は、文字入力画像上の文字
イメージに対する文字認識を行なう手段と、該手段によ
る認識結果を認識結果ファイルに保存する段と、認識結
果の信頼度の低い文字のイメージを該認識結果ファイル
に保存する手段とを有することを特徴とする。According to a first aspect of the present invention, a means for performing character recognition for a character image on a character input image, a step of storing the recognition result by the means in a recognition result file, and an image of a character with low reliability of the recognition result. Is stored in the recognition result file.

【００１１】請求項２の発明は、入力画像上の文字イメ
ージに対する文字認識を行なう手段と、該手段による認
識結果を認識結果ファイルに保存する手段と、認識結果
の信頼度の低い文字とその前後各ｎ文字（ただしｎは正
整数）のイメージを該認識結果ファイルに保存する手段
とを有することを特徴とする。According to a second aspect of the present invention, means for recognizing a character image on an input image, means for storing the recognition result by the means in a recognition result file, characters with low reliability of the recognition result, and the preceding and following characters. Means for storing an image of each n characters (where n is a positive integer) in the recognition result file.

【００１２】請求項３の発明は、請求項１または２の発
明の構成において、認識結果の信頼度の低い文字が所定
割合以上を占める行について、該行の全文字のイメージ
が認識結果ファイルに保存されることを特徴とする。According to a third aspect of the present invention, in the configuration of the first or second aspect of the invention, for a line in which characters with low reliability in the recognition result occupy a predetermined proportion or more, an image of all characters in the line is stored in the recognition result file. It is characterized by being saved.

【００１３】請求項４の発明は、請求項１または２の発
明の構成に加え、認識結果ファイルにイメージの保存さ
れる文字に対する候補文字群を該認識結果ファイルに保
存する手段を有することを特徴とする。The invention of claim 4 is characterized in that, in addition to the structure of the invention of claim 1 or 2, it has means for storing a candidate character group for a character whose image is stored in the recognition result file in the recognition result file. And

【００１４】請求項５の発明は、請求項３の発明の構成
に加えて、認識結果ファイルにイメージの保存される文
字に対する候補文字群を該認識結果ファイルに保存する
手段を有することを特徴とする。According to a fifth aspect of the present invention, in addition to the configuration of the third aspect of the invention, there is provided means for storing a candidate character group for a character whose image is stored in the recognition result file in the recognition result file. To do.

【００１５】[0015]

【作用】認識結果の信頼度の低い文字は、誤認の可能性
が高く修正が必要となることが予想される。請求項１の
発明によれば、このような文字のイメージも認識結果と
ともに認識結果ファイルに保存されるので、この認識結
果ファイルのデータを用いて同じ文字認識装置上で文字
認識とは別アプリケーションとして誤認文字の修正を行
なう際に、あるいは別の装置上で誤認文字の修正を行な
う際に、修正の必要が予想される文字のイメージを表示
させることにより、原稿を参照することなく、効率的に
認識結果の確認と必要な修正を行なうことができる。し
かも、イメージが保存されるのは認識結果の信頼度の低
い文字のものだけであるので、すべての文字のイメージ
を無条件に保存する場合に比べ、認識結果ファイルのサ
イズを大幅に減らすことができる。[Function] It is expected that a character whose recognition result has low reliability has a high possibility of being erroneously recognized and needs to be corrected. According to the invention of claim 1, since the image of such a character is also stored in the recognition result file together with the recognition result, the data of the recognition result file is used as an application different from the character recognition on the same character recognition device. By displaying the image of the characters that need to be corrected when correcting the erroneous characters or when correcting the erroneous characters on another device, you can efficiently refer to the manuscript without referring to the original. You can check the recognition result and make necessary corrections. Moreover, the image is saved only for the characters with low reliability in the recognition result, so the size of the recognition result file can be significantly reduced compared to the case where the image of all characters is unconditionally saved. it can.

【００１６】文字単位の認識結果の信頼度のみでは、必
ずしも正確に誤認を判断できないことがある。信頼度の
低い文字の前後の文字で、その信頼度が高くとも誤認が
起きていることが少なくない。特に、文字切り出しや言
語処理でバックトラックをしない文字認識処理方式を採
用している場合には、そのようなケースが多い。かかる
点に着目したのが請求項２の発明であり、同発明によれ
ば、認識結果の信頼度の低い文字の前後一定範囲内の文
字まで、イメージの保存範囲が拡張される。したがっ
て、請求項１の発明による場合に比べイメージ保存範囲
の拡張された分だけ認識結果ファイルのサイズは増加す
るが、それでも全文字のイメージを保存する場合に比べ
認識結果ファイルのサイズは十分に小さくなる。しか
も、誤認文字修正の際に、認識結果の信頼度の低い文字
のみならず、その前後一定範囲の文字のイメージをも表
示させることができるので、前後の文字の誤認が生じて
いる場合でも的確かつ効率的な修正作業が可能になる。In some cases, it may not be possible to accurately determine misidentification only by the reliability of the recognition result in character units. Characters before and after a character with low reliability often have misidentification even if the character has high reliability. In particular, such a case often occurs when a character recognition processing method that does not perform backtracking in character extraction or language processing is adopted. The invention of claim 2 focuses on such a point. According to the invention, the storage range of the image is expanded to a character within a certain range before and after the character with low reliability of the recognition result. Therefore, the size of the recognition result file increases by an amount corresponding to the expansion of the image storage range as compared with the case of the invention of claim 1, but the size of the recognition result file is still sufficiently small as compared with the case of saving the image of all characters. Become. In addition, when correcting erroneous characters, not only the characters with low reliability in the recognition result but also the image of a certain range of characters before and after that can be displayed, so even if the characters before and after are erroneously recognized, they can be accurately identified. And efficient correction work becomes possible.

【００１７】文書中の一部の行の認識率が極端に悪いこ
とがある。例えば、仕様外のポイント数の文字が用いら
れた見出しや注釈などの行である。このような行の認識
結果の修正を行なう場合には、行全体の文字イメージを
確認できることが望まれる。しかして、請求項３の発明
によれば、行の一定割合以上の文字の信頼度が低い場合
には、文字イメージの保存範囲を行全体まで拡張するた
め、そのような認識率が極端に悪い見出しや注釈等の行
の誤認文字修正を容易かつ的確に行なうことができるよ
うになる。The recognition rate of some lines in a document may be extremely poor. For example, it is a line such as a heading or an annotation in which characters with the number of points out of the specification are used. When correcting the recognition result of such a line, it is desirable to be able to confirm the character image of the entire line. According to the third aspect of the invention, when the reliability of the character of a certain line or more is low, the storage range of the character image is expanded to the entire line, so that the recognition rate is extremely bad. It becomes possible to easily and accurately correct a misidentified character in a line such as a heading or an annotation.

【００１８】請求項４または５の発明によれば、誤認の
予想される文字について、イメージのみならず候補文字
群も認識結果ファイルに保存される。したがって、同じ
文字認識装置上あるいは他の装置上で、認識結果ファイ
ルのデータを用いて誤認文字修正を行なう場合に、文字
イメージを表示させて参照できるほか、候補文字群を表
示させ、その中から正解文字を選択することができるた
め、効率的な修正作業が可能になる。しかも、イメージ
と候補文字群（キャラクタデータ）が保存されるのは誤
認の可能性の高い文字だけであるので、全文字について
それを保存する場合に比べ、認識結果ファイルのサイズ
を十分に小さくできる。According to the invention of claim 4 or 5, not only an image but also a candidate character group is stored in the recognition result file for a character that is expected to be misidentified. Therefore, on the same character recognition device or on another device, when you use the data in the recognition result file to correct erroneously recognized characters, you can display the character image and refer to it, or display a candidate character group from which Since correct characters can be selected, efficient correction work is possible. Moreover, the image and the candidate character group (character data) are saved only for the characters that are likely to be misidentified, so the size of the recognition result file can be made sufficiently smaller than when saving it for all characters. .

【００１９】[0019]

【実施例】図１は本発明による文字認識装置の一例を示
す概略ブロック図である。画像入力部１０１より文書等
の画像データを入力する。この入力画像データは画像メ
モリ１０７に格納される。文字認識処理部１０２におい
て、画像メモリ１０７に記憶された画像データより文字
イメージを切り出し文字認識するが、この文字認識処理
は従来と同様の手法によってよいので、その詳細説明は
割愛する。1 is a schematic block diagram showing an example of a character recognition apparatus according to the present invention. Image data such as a document is input from the image input unit 101. This input image data is stored in the image memory 107. In the character recognition processing unit 102, a character image is cut out from the image data stored in the image memory 107 and character recognition is performed. Since this character recognition processing may be performed by the same method as the conventional one, detailed description thereof will be omitted.

【００２０】文字認識処理部１０２は、文字毎に、認識
結果（第一候補文字）の文字コード、第二候補以下の候
補文字群の文字コード、文字イメージの座標値（例え
ば、文字イメージの外接矩形の左上位置と右下位置の座
標値）を認識結果情報格納部１０６に（本例では文字毎
の配列の形で）格納する。なお、認識結果情報格納部１
０６に格納される情報としては、上述の情報のほか、認
識結果の信頼度または誤認識別フラグ、領域切り出しフ
ラグがある。The character recognition processing unit 102 determines, for each character, the character code of the recognition result (first candidate character), the character code of the candidate character group of the second candidate and below, the coordinate value of the character image (for example, the circumscribed character image). The coordinate values of the upper left position and the lower right position of the rectangle) are stored in the recognition result information storage unit 106 (in the form of an array for each character in this example). The recognition result information storage unit 1
The information stored in 06 includes, in addition to the above-described information, the reliability of the recognition result or the misrecognition flag, and the area cutout flag.

【００２１】文字認識処理部１０２による認識結果（第
一候補文字）の妥当性あるいは認識誤りの度合い（信頼
度）の判定を、誤認文字判定部１０３で行なう。この判
定に関する従来技術は様々なものがあるが、本発明で
は、その方法は問わない。誤認文字判定部１０３は判定
の結果を出力するが、その出力の内容は、例えば信頼度
の数値または、信頼度がある閾値より低く誤認の予想さ
れることを示す誤認識別フラグである。この出力内容は
認識結果情報格納部１０６内の該当文字の配列に格納さ
れる。なお、信頼度の最も単純な例は、認識結果と辞書
との距離の逆数である。The misrecognized character determination unit 103 determines the validity of the recognition result (first candidate character) by the character recognition processing unit 102 or the degree of recognition error (reliability). There are various conventional techniques for this determination, but the method of the present invention does not matter. The misidentified character determination unit 103 outputs a result of the determination, and the content of the output is, for example, a numerical value of reliability or a misrecognition-specific flag indicating that misrecognition is expected to be lower than a certain threshold. The output content is stored in the array of the corresponding characters in the recognition result information storage unit 106. Note that the simplest example of the reliability is the reciprocal of the distance between the recognition result and the dictionary.

【００２２】次に、誤認領域判定部による誤認領域判定
処理について説明する。図２は、その処理の全体的な流
れを示すフローチャートである。この処理の処理単位は
改行コードで区切られた１行である。図３は、図２中の
処理ステップ２０３の詳細を示すフローチャートであ
る。Next, the erroneous area determining process by the erroneous area determining unit will be described. FIG. 2 is a flowchart showing the overall flow of the processing. The processing unit of this processing is one line separated by a line feed code. FIG. 3 is a flowchart showing details of the processing step 203 in FIG.

【００２３】誤認領域判定部１０４において、認識結果
情報格納部１０６より、改行コードまでの１行分の認識
結果とその付帯情報を読み出して内部のバッファに格納
する（ステップ２０１）。このバッファ内の１行分につ
いて誤認領域の選定の処理（ステップ２０３）を行なう
が、その詳細は図３に示すとおりである。In the erroneous recognition area determination unit 104, the recognition result for one line up to the line feed code and its incidental information are read from the recognition result information storage unit 106 and stored in an internal buffer (step 201). The process of selecting the erroneous region (step 203) is performed for one line in this buffer, the details of which are as shown in FIG.

【００２４】ステップ３０１において、バッファ内の信
頼度または誤認識別フラグを参照して、信頼度の低い文
字の数（Ｎ）をカウントする。信頼度が数値である場合
には、信頼度がある閾値より低い文字を誤認が予想され
るものとしてカウントする。信頼度が誤認識別フラグの
形で表示されている場合には、誤認と予想されることを
示すフラグ値をカウントする。また、対象としている行
内の文字数（Ｍ）をカウントする。In step 301, the number (N) of characters with low reliability is counted by referring to the reliability or the misrecognition flag in the buffer. When the reliability is a numerical value, characters whose reliability is lower than a certain threshold value are counted as being expected to be misidentified. When the reliability is displayed in the form of a flag for each misrecognition, a flag value indicating that misidentification is expected is counted. Also, the number of characters (M) in the target line is counted.

【００２５】つぎにステップ３０２において、対象行の
誤認率＝Ｎ／Ｍを算出する。そして、ステップ３０３に
おいて、誤認率が５０％以上であるかを判定する。Next, in step 302, the misidentification rate of the target line = N / M is calculated. Then, in step 303, it is determined whether the false positive rate is 50% or more.

【００２６】誤認率が５０％以上の場合、対象行全体の
認識結果の信頼度が低いと判断される。したがって、ス
テップ３０４において対象行内のすべての文字に対す
る、認識結果情報格納部１０６内の領域切り出しフラグ
を”１”（ＯＮ）にする。このような例を図４に示す。
図４中の”×”は信頼度が低いことを意味している。When the false recognition rate is 50% or more, it is determined that the reliability of the recognition result of the entire target line is low. Therefore, in step 304, the area cut-out flag in the recognition result information storage unit 106 for all the characters in the target line is set to "1" (ON). Such an example is shown in FIG.
"X" in FIG. 4 means that the reliability is low.

【００２７】誤認率が５０％未満の場合、ステップ３０
５において、信頼度の低い文字と、その前後各１文字に
対する、認識結果情報格納部１０６内の領域切り出しフ
ラグを”１”（ＯＮ）にする。それ以外の文字に対する
領域切り出しフラグは”０”（ＯＦＦ）にする。このよ
うな例を図５に示す。”×”は信頼度が低いことを意味
している。If the false positive rate is less than 50%, step 30
In 5, the region cut-out flag in the recognition result information storage unit 106 for the character with low reliability and each character before and after the character is set to "1" (ON). The area cutout flags for the other characters are set to "0" (OFF). Such an example is shown in FIG. "X" means low reliability.

【００２８】なお、一般的には、認識結果の信頼度の低
い文字と、その前後各ｎ文字の領域切り出しフラグを”
１”にする。ｎは正整数である。ここに示す例はｎ＝１
としたケースであるが、通常、このように設定するのが
最も効率的でる。また、ｎ＝０に選んだ場合には、信頼
度の低い文字の領域切り出しフラグだけが”１”に設定
される。In general, the character with low reliability of the recognition result and the area cut-out flag of each n characters before and after the character are set to "
1 ". N is a positive integer. The example shown here is n = 1.
However, it is usually most efficient to set this way. Further, when n = 0 is selected, only the region cutout flag of the character having low reliability is set to "1".

【００２９】以上の処理を最終の行まで繰り返すと、ス
テップ２０１で終了と判断し、次の誤認領域抽出部１０
５による誤認領域抽出処理に移行する。この処理におい
て、認識結果情報格納部１０６内の領域切り出しフラグ
を参照し、認識結果及び付帯情報、並びに誤認識文字の
修正のために役立つイメージを認識結果ファイル１０８
に書き出す。図６は、この誤認領域抽出処理の流れを示
すフローチャートであり、以下、各ステップの処理内容
を説明する。When the above processing is repeated up to the final line, it is judged that the process is finished in step 201, and the next false recognition area extraction unit 10
The processing shifts to the false recognition area extraction processing by 5. In this process, the recognition result file 108 refers to the area cut-out flag in the recognition result information storage unit 106, and recognizes the recognition result and the incidental information and the image useful for correcting the misrecognized character.
Export to. FIG. 6 is a flowchart showing the flow of this false recognition area extraction processing, and the processing contents of each step will be described below.

【００３０】ステップ４０２において、認識結果情報格
納部１０６内の対象文字の領域切り出しフラグを調べ
る。領域切り出しフラグが”１”（ＯＮ）の場合には、
認識結果情報格納部１０６内のイメージデータの座標値
を使って対象文字のイメージを画像メモリ１０７より切
り出し、認識結果情報格納部１０６より読み出した対象
文字の認識結果（第一候補文字）の文字コード及び信頼
度と、候補文字群フラグ（この場合はＯＮに設定）、イ
メージフラグ（この場合はＯＮに設定）、第二位以下の
候補文字群の文字コード、切り出したイメージ、イメー
ジと候補文字群のデータのバイト数を設定した可変長バ
イト数を認識結果ファイル１０８に書き込む（ステップ
４０３，４０４）。In step 402, the area cutting flag of the target character in the recognition result information storage unit 106 is checked. When the area cutting flag is "1" (ON),
The character code of the recognition result (first candidate character) of the target character read out from the recognition result information storage unit 106 by cutting out the image of the target character from the image memory 107 using the coordinate values of the image data in the recognition result information storage unit 106. And reliability, a candidate character group flag (set to ON in this case), an image flag (set to ON in this case), a character code of a second or lower candidate character group, an extracted image, an image and a candidate character group The number of variable length bytes in which the number of bytes of the data is set is written in the recognition result file 108 (steps 403 and 404).

【００３１】一方、対象文字の領域切り出しフラグが”
０”（ＯＦＦ）の場合には、認識結果の文字コードと信
頼度、候補文字群フラグ（この場合はＯＦＦに設定）、
イメージフラグ（この場合はＯＦＦに設定）、可変長バ
イト数（この場合はゼロに設定）を認識結果ファイル１
０８に書き出す（ステップ４０４）。On the other hand, the area cutting flag of the target character is "
If 0 "(OFF), the character code and reliability of the recognition result, the candidate character group flag (set to OFF in this case),
Image flag (set to OFF in this case), variable length byte count (set to zero in this case) Recognition result file 1
(Step 404).

【００３２】このような処理を１文字単位に繰り返し、
最後の文字の処理終了をステップ４０１で判断すると、
一連の処理を完了する。Such processing is repeated for each character,
When the processing end of the last character is judged in step 401,
A series of processing is completed.

【００３３】以上の処理によって、認識結果の信頼度が
低く修正作業が必要と予想される文字、または、その文
字と周辺の一定範囲内の文字、あるいは、全体として認
識結果の信頼度が低いとみなされる行の範囲に限定し
て、イメージデータ及び候補文字群が認識結果ファイル
１０８に保存される。したがって、すべての文字につい
て、イメージや候補文字群を保存する場合にくらべて、
認識結果ファイル１０８のサイズを大幅に減らすことが
でき、同ファイル用の記憶スペースが少なくて済む。こ
のことは、文字認識装置（文字認識機能）を、メモリ容
量の小さいスキャナや複写機といった機器に組み込む場
合に極めて有利である。By the above processing, when the reliability of the recognition result is low and the character is expected to be corrected, or the character and a character within a certain range around the character, or the reliability of the recognition result as a whole is low. The image data and the candidate character group are stored in the recognition result file 108 only in the range of the considered line. Therefore, compared to saving images and candidate character groups for all characters,
The size of the recognition result file 108 can be significantly reduced, and the storage space for the file can be reduced. This is extremely advantageous when the character recognition device (character recognition function) is incorporated in a device such as a scanner or a copying machine having a small memory capacity.

【００３４】また、誤認の疑いのある文字については、
イメージと候補文字群が保存されるので、文字認識装置
以外のパソコン等の装置上で容易に修正することが可能
であり、文字認識装置は誤認文字修正のための機能を備
える必要がない。このことは、キーボード等の文字入力
手段を持たない、あるいは文字入力機能の貧弱な機器、
例えばスキャナや複写機といった機器への文字認識装置
（文字認識機能）の組み込みを容易にするという効果を
有する。Regarding the characters that are suspected of being misidentified,
Since the image and the candidate character group are stored, they can be easily corrected on a device such as a personal computer other than the character recognition device, and the character recognition device does not need to have a function for correcting a misidentified character. This means that devices that do not have a character input means such as a keyboard or have a poor character input function,
For example, it has an effect of facilitating the incorporation of the character recognition device (character recognition function) into a device such as a scanner or a copying machine.

【００３５】前述の認識結果ファイル１０８のデータを
用いて、パソコン等の装置上で認識結果の修正を行なう
ことができる。当然のことながら、文字認識装置上にお
いて、文字認識とは別のアプリケーションとして誤認文
字の修正を行なうような構成にすることも可能である。
この構成の場合においても、認識結果ファイル１０８の
サイズが小さいということはメモリ容量あるいはファイ
ル容量の面で有利であることは前述したとおりである。Using the data of the recognition result file 108 described above, the recognition result can be corrected on a device such as a personal computer. As a matter of course, on the character recognition device, it is also possible to have a configuration for correcting the erroneously recognized character as an application different from the character recognition.
As described above, even in the case of this configuration, the small size of the recognition result file 108 is advantageous in terms of memory capacity or file capacity.

【００３６】ここで、一般的なパソコン上でアプリケー
ションの一つとして誤認文字修正処理を行なう例を簡単
に説明する。誤認文字修正のためのアプリケーションプ
ログラムは、認識結果ファイル１０８のデータを読み込
み、まず、認識結果の文字コードを、例えば１行分だけ
ディスプレイ上に文字表示する。この際に、信頼度が高
いか低いかによって、文字の表示色や明るさを変えた
り、反転表示をさせる。つぎに、アプリケーションプロ
グラムは、当該行中のイメージフラグがＯＮの文字のイ
メージを、ディスプレイ上に文字と対応付けて表示す
る。これで、オペレータは疑わしい文字を容易に識別で
き、かつ、そのイメージを参照することによって容易に
正解文字を認識できる。Here, an example of performing the erroneously recognized character correction process as one of applications on a general personal computer will be briefly described. The application program for correcting the misidentified characters reads the data of the recognition result file 108, and first displays the character code of the recognition result on the display, for example, for one line. At this time, the display color and brightness of the character are changed or the display is reversed depending on whether the reliability is high or low. Next, the application program displays the image of the character whose image flag is ON in the line in association with the character on the display. With this, the operator can easily identify the suspicious character and can easily recognize the correct character by referring to the image.

【００３７】オペレータがマウス操作等によって修正し
たい文字を指定すると、アプリケーションプログラム
は、その指定文字に対する候補文字群フラグを調べ、そ
れがＯＮであれば、対応の候補文字群をディスプレイに
表示する。オペレータは、表示された候補文字群中に正
解文字が見つかった場合には、その文字をマウス操作あ
るいはキーボードからの番号指定等によって選択する。
正解文字が見つからない場合には、オペレータはキーボ
ードによって正解文字を入力する。アプリケーションプ
ログラムは、選択あるいは入力された正解文字に従って
ディスプレイの表示を更新し、かつメモリ上の認識結果
データを修正する。When the operator designates a character to be corrected by a mouse operation or the like, the application program checks the candidate character group flag for the designated character, and if it is ON, displays the corresponding candidate character group on the display. When the correct character is found in the displayed candidate character group, the operator selects the character by operating the mouse or designating the number from the keyboard.
If the correct answer character is not found, the operator inputs the correct answer character using the keyboard. The application program updates the display on the display and corrects the recognition result data on the memory according to the correct character selected or input.

【００３８】なお、信頼度の低い文字が行中の文字の過
半数を占めた場合には、行中の全文字のイメージと候補
文字群が認識結果ファイル１０８に保存される。このよ
うなケースは、仕様外のポイント数の文字が用いられた
行（例えば見出しや注釈のような行）で見られる。この
場合、修正時に行全体の文字イメージが表示されるため
正解文字の判断が容易になる。また、候補文字群を表示
させることができるので、正解文字の選択入力も容易に
なる。When the characters with low reliability occupy the majority of the characters in the line, the image of all the characters in the line and the candidate character group are stored in the recognition result file 108. Such a case can be seen in a line (for example, a line such as a headline or a comment) in which a character having a number of points out of the specification is used. In this case, since the character image of the entire line is displayed at the time of correction, it is easy to determine the correct character. In addition, since the candidate character group can be displayed, selection and input of the correct answer character becomes easy.

【００３９】[0039]

【発明の効果】請求項１、２または３の発明によれば、
修正が必要となることが予想される範囲に限って認識結
果とともに文字イメージも認識結果ファイルに保存され
るので、この認識結果ファイルのデータを用いて、同じ
文字認識装置上で文字認識とは別アプリケーションとし
て誤認文字の修正を行なう際に、あるいは別の装置上で
誤認文字の修正を行なう際に、修正の必要が予想される
文字のイメージを表示させることにより、原稿を参照す
ることなく、効率的に認識結果の確認と必要な修正を行
なうことができる。請求項２または３の発明によれば、
請求項１の発明によるよりも認識結果ファイルのサイズ
は若干増大する反面、１文字単位の認識結果の信頼度か
らは判断できないような誤認文字のイメージも保存され
るので、それらの文字も、イメージを表示させて効率的
な修正が可能である。According to the invention of claim 1, 2 or 3,
The character image is saved in the recognition result file together with the recognition result only to the extent that correction is expected to be necessary.Therefore, using the data of this recognition result file, different from character recognition on the same character recognition device. By displaying the image of the characters that need to be corrected when correcting the erroneous characters as an application or when correcting the erroneous characters on another device, the efficiency can be improved without referring to the manuscript. The recognition result can be confirmed and necessary correction can be performed. According to the invention of claim 2 or 3,
Although the size of the recognition result file is slightly larger than that according to the invention of claim 1, since the image of the misrecognized character that cannot be judged from the reliability of the recognition result of each character is also stored, those characters are also imaged. Is displayed to enable efficient correction.

【００４０】しかも、イメージが保存されるのは修正が
必要となることが予想される範囲の文字のものだけであ
るので、すべての文字のイメージを無条件に保存する場
合に比べ、認識結果ファイルのサイズを大幅に減らすこ
とができる。そのため、メモリ容量もしくはファイル容
量を多くとれない文字認識装置、例えばスキャナや複写
機等の機器に組み込まれるような文字認識装置において
もイメージを含む認識結果ファイルの保存が可能にな
り、また、このようなメモリ容量もしくはファイル容量
を多くとれない文字認識装置においても、文字認識と切
り離して、誤認文字修正を効率的に行なうことが可能に
なる。Moreover, since the images are saved only for the characters in the range in which correction is expected to be necessary, the recognition result file is not compared to the case where the images of all the characters are unconditionally saved. Can significantly reduce the size of. Therefore, it is possible to save a recognition result file including an image even in a character recognition device that does not have a large memory capacity or file capacity, such as a character recognition device incorporated in a device such as a scanner or a copying machine. Even in a character recognition device that does not have a large memory capacity or file capacity, it is possible to efficiently perform erroneous character correction separately from character recognition.

【００４１】請求項４または５の発明によれば、文字イ
メージを保存した文字について候補文字群も認識結果フ
ァイルに保存されるため、同じ文字認識装置上で文字認
識とは別アプリケーションとして誤認文字修正を行なう
場合、あるいは他の装置上で誤認文字修正を行なう場合
に、認識結果ファイルに保存されたイメージとともに候
補文字群をも表示させ、その中から正解文字を選択する
ことができるため、効率的な修正作業が可能になる。し
かも、候補文字群（キャラクタデータ）が保存される文
字の範囲は限定されるので、認識結果ファイルのサイズ
の増大を抑えることができるため、請求項１乃至３の発
明に関して述べた利益が損なわれることもない。According to the invention of claim 4 or 5, since the candidate character group for the character in which the character image is stored is also stored in the recognition result file, the character recognition device corrects the misidentified character as a separate application from the character recognition. When you perform the error correction or correct the erroneous characters on other devices, you can display the candidate character group together with the image saved in the recognition result file and select the correct character from among them. Correction work becomes possible. Moreover, since the range of characters in which the candidate character group (character data) is stored is limited, it is possible to suppress an increase in the size of the recognition result file, and thus the benefits described with respect to the inventions of claims 1 to 3 are impaired. Nothing.

[Brief description of drawings]

【図１】本発明による文字認識装置の一例を示すブロッ
ク図である。FIG. 1 is a block diagram showing an example of a character recognition device according to the present invention.

【図２】誤認領域判定部の処理の全体的フローを示すフ
ローチャートである。FIG. 2 is a flowchart showing an overall flow of processing of a false positive area determination unit.

【図３】図２中の一部処理ステップの詳細を示すフロー
チャートである。FIG. 3 is a flowchart showing details of some processing steps in FIG.

【図４】誤認領域判定部による処理結果の一例を示す図
である。FIG. 4 is a diagram illustrating an example of a processing result by a false positive area determination unit.

【図５】誤認領域判定部による処理結果の他の一例を示
す図である。FIG. 5 is a diagram showing another example of the processing result by the false positive area determination unit.

【図６】誤認領域抽出部の処理フローを示すフローチャ
ートである。FIG. 6 is a flowchart showing a processing flow of a false positive area extracting unit.

[Explanation of symbols]

１０１画像入力部１０２文字認識処理部１０３誤認文字判定部１０４誤認領域判定部１０５誤認領域抽出部１０６認識結果情報格納部１０７画像メモリ１０８認識結果ファイル Reference Signs List 101 image input unit 102 character recognition processing unit 103 misidentified character determination unit 104 misidentified region determination unit 105 misidentified region extraction unit 106 recognition result information storage unit 107 image memory 108 recognition result file

Claims

[Claims]

1. A means for performing character recognition on a character image on an input image, a means for storing a recognition result by the means in a recognition result file, and an image of a character with low reliability of the recognition result in the recognition result file. A character recognition device comprising: a storing unit.

2. A means for performing character recognition for a character image on an input image, a means for storing a recognition result by the means in a recognition result file, a character having a low reliability of the recognition result and n characters before and after the character (however). a character recognition device for storing an image of n being a positive integer) in the recognition result file.

3. For a line in which characters with low reliability of recognition result occupy a predetermined proportion or more, an image of all characters in the line is stored in a recognition result file. Character recognizer.

4. The character recognition device according to claim 1, further comprising means for storing, in the recognition result file, a candidate character group for a character whose image is stored in the recognition result file.

5. The character recognition device according to claim 3, further comprising means for storing a candidate character group for a character whose image is stored in the recognition result file in the recognition result file.