JPH0962773A

JPH0962773A - Character recognition device

Info

Publication number: JPH0962773A
Application number: JP7233521A
Authority: JP
Inventors: Keiji Kojima; 啓嗣小島
Original assignee: Ricoh Co Ltd
Current assignee: Ricoh Co Ltd
Priority date: 1995-08-19
Filing date: 1995-08-19
Publication date: 1997-03-07

Abstract

(57)【要約】【課題】認識結果の誤り検出精度の向上を図ることに
より、装置利用者の認識結果の修正作業の効率化を図る
こと。【解決手段】未知の文字についてパターンマッチング
処理等の複数の処理により最終的な認識結果を得るとと
もに、複数の処理中の特定の処理結果によって得られた
情報に基づき総合的に認識結果の確からしさである確信
度を決定する。次に、この決定された確信度と閾値との
比較に先立って、その閾値を所定の条件（例えば認識結
果の文字の種類）に応じて所定値に随時設定する（Ｓ３
１）。次に、確信度を随時設定される閾値と比較し（Ｓ
３２）、その比較の結果により「認識結果は怪しい」ま
たは「認識結果は正しい」と判定する（Ｓ３３、Ｓ３
４）。その判定結果の情報を認識結果の文字に付加して
表示、また印刷する（Ｓ３５）。 (57) 【Abstract】 PROBLEM TO BE SOLVED: To improve the efficiency of error detection accuracy of a recognition result to improve the efficiency of the correction work of the recognition result of a device user. A final recognition result is obtained by a plurality of processes such as a pattern matching process for an unknown character, and a certainty of the recognition result is comprehensively determined based on information obtained by a specific process result among the plurality of processes. Determines the certainty factor. Next, prior to the comparison between the determined certainty factor and the threshold value, the threshold value is set to a predetermined value at any time according to a predetermined condition (for example, the character type of the recognition result) (S3).
1). Next, the certainty factor is compared with a threshold value set at any time (S
32), based on the result of the comparison, it is determined that "the recognition result is suspicious" or "the recognition result is correct" (S33, S3).
4). The information of the determination result is added to the character of the recognition result and displayed or printed (S35).

Description

Detailed Description of the Invention

【０００１】[0001]

【発明の属する技術分野】本発明は、光学的文字読取装
置（ＯＣＲ）などの文字認識装置に関し、特に装置の認
識結果の誤り検出精度の向上を図ったものであり、音声
認識のみならず画像処理にも適用可能なものである。BACKGROUND OF THE INVENTION 1. Field of the Invention The present invention relates to a character recognition device such as an optical character reading device (OCR), and more particularly to improving the accuracy of error detection of the recognition result of the device. It is also applicable to processing.

【０００２】[0002]

【従来の技術】従来、この種の装置としては、例えば、
特開平５−１０１２１６号公報に記載の文字認識装置
や、特開平４−２１１８８３号公報に記載の文字認識装
置が知られている。特開平５−１０１２１６号公報に記
載の文字認識装置は、文書画像から切り出した１文字ず
つの画像の特徴量と、認識辞書内に記憶されている認識
可能な各文字の特徴量との類似度を算出する。そして、
類似度の大きい順に第１候補文字、第２候補文字・・・
を求め、第１候補文字と第２候補文字との類似度の差が
所定値未満の場合には、認識結果が誤認識の確率が高い
として第１候補文字を強調表示する。2. Description of the Related Art Conventionally, as this type of apparatus, for example,
The character recognition device described in Japanese Patent Application Laid-Open No. 5-101216 and the character recognition device described in Japanese Patent Application Laid-Open No. 4-218183 are known. The character recognition device described in Japanese Patent Laid-Open No. 5-101216 discloses a similarity between the feature amount of each character cut out from a document image and the feature amount of each recognizable character stored in the recognition dictionary. To calculate. And
First candidate character, second candidate character in descending order of similarity ...
When the difference in similarity between the first candidate character and the second candidate character is less than a predetermined value, the first candidate character is highlighted because the recognition result has a high probability of erroneous recognition.

【０００３】特開平４−２１１８８３号公報に記載の文
字認識装置は、図１０および図１１のフローチャートに
示すような文字認識処理を行うものである。すなわち、
まず原稿画像を読み取って得られた２値画像から文字画
像を切り出す切り出し処理を行ったのち（ステップ
１）、この切り出された文字画像の歪みを補正するため
に正規化処理を行う（ステップ２）。次に、ステップ３
のパターンマッチング処理では、正規化後の文字画像の
特徴量を抽出し、抽出した特徴量を辞書に登録する標準
的な特徴量と比較して類似度また相違度を算出し、類似
度が大きいまたは相違度が小さい１つ以上の認識候補文
字を決定する。そして、全ての文字について、これらス
テップ１からステップ３までの処理が終了すると（ステ
ップ４；Ｙ）、次のステップ５に進む。The character recognition device described in Japanese Patent Application Laid-Open No. 4-211883 performs character recognition processing as shown in the flowcharts of FIGS. 10 and 11. That is,
First, a character image is cut out from a binary image obtained by reading an original image (step 1), and then a normalization process is performed to correct the distortion of the cut out character image (step 2). . Next, step 3
In the pattern matching process, the feature amount of the normalized character image is extracted, the extracted feature amount is compared with the standard feature amount registered in the dictionary, and the similarity or difference is calculated. Alternatively, one or more recognition candidate characters having a small dissimilarity are determined. Then, when the processing from step 1 to step 3 is completed for all characters (step 4; Y), the process proceeds to the next step 5.

【０００４】ステップ５のパス選択処理では、例えば日
本語の場合には、全角で１文字なのか半角で２文字なの
かなど、最適な文字の組み合わせを選択する。ステップ
６のルール処理では、文字の大きさの情報（正規化する
ので、例えば「Ｃ」と「ｃ」などの大小文字の区別に関
係）、文字種の情報（例えばカタカナ文字列の中に漢数
字の「一」があったら、これはカタカナの長音記号
「ー」に修正する）などのルールによって、パターンマ
ッチング処理による認識結果候補を修正する。ステップ
７の言語処理では、ルール処理によって修正後の認識結
果の文字列を形態素解析し、辞書と照合することにより
認識の誤りを修正する。この言語処理の結果が最終的な
文字認識の結果であり、装置利用者による認識結果の修
正の対象になる。In the path selection process of step 5, for example, in the case of Japanese, an optimum combination of characters such as whether it is one full-width character or two half-width characters is selected. In the rule process of step 6, character size information (which is normalized so that it is related to case sensitivity such as “C” and “c”) and character type information (eg, katakana in a katakana character string) If there is a "one", the recognition result candidate by the pattern matching process is corrected according to a rule such as that which is corrected to the katakana long-sound symbol "-"). In the language processing of step 7, the character string of the corrected recognition result is morphologically analyzed by the rule processing, and the recognition error is corrected by collating with the dictionary. The result of this language processing is the final result of character recognition, and is the target of correction of the recognition result by the device user.

【０００５】ステップ８の確信度決定処理では、パター
ンマッチング処理の第１候補の評価値（例えば辞書との
距離を総輪郭数で除した値など）または第１候補と第２
候補との評価値の差、パス選択処理でパスを決めるとき
の評価値、ルール処理で修正のために適用されたルール
の情報、言語処理で言語修正の結果を表す情報を集め、
これら情報を証拠として後述するデンプスター・シェー
ファーの確率理論を使って総合的に、最終認識の認識結
果の確信度を決定する。そして、全ての文字について、
これらステップ５からステップ８までの処理が終了する
と（ステップ９；Ｙ）、文字認識処理を終了する。次
に、図１１のフローチャートに示すように、その決定さ
れた確信度を予め設定されている閾値と比較し（ステッ
プ１０）、その結果、確信度が閾値よりも小さいとき
（ステップ１０；Ｙ）には「認識結果は怪しい」と判定
し（ステップ１１）、確信度が閾値よりも大きいとき
（ステップ１０；Ｎ）には「認識結果は正しい」と判定
する（ステップ１２）。そして、この判定結果を、認識
結果とともに表示装置などに出力する（ステップ１
３）。In the confidence determination process of step 8, the first candidate evaluation value of the pattern matching process (for example, a value obtained by dividing the distance from the dictionary by the total number of contours) or the first candidate and the second candidate.
The difference of the evaluation value with the candidate, the evaluation value when determining the path in the path selection process, the information of the rule applied for the correction in the rule process, the information showing the result of the language correction in the language process,
The certainty factor of the recognition result of the final recognition is comprehensively determined by using the above-mentioned information as evidence and using the probability theory of Dempster-Schafer described later. And for every character,
When the processes from step 5 to step 8 are completed (step 9; Y), the character recognition process is completed. Next, as shown in the flowchart of FIG. 11, the determined certainty factor is compared with a preset threshold value (step 10), and as a result, when the certainty factor is smaller than the threshold value (step 10; Y). Is determined to be "suspicious" (step 11), and when the certainty factor is greater than the threshold value (step 10; N), "recognition result is correct" (step 12). Then, this determination result is output to a display device or the like together with the recognition result (step 1
3).

【０００６】次に、上述したデンプスター・シェーファ
ーの確率理論を使って総合的に確信度を決定する具体例
について説明する。このデンプスター・シェーファーの
確率理論は、無知量を表すことができ、断片的な情報
（独立な証拠）を集め合成できるという特徴があり、確
率を「信用」、「無知」、「不信用」の領域に分けて表
し、「信用」を下界確率、「信用＋無知」を上界確率と
呼ぶ。具体的には、この文字は正しいという仮説Ｈ１、
この文字は間違いであるという仮説Ｈ２をたて、集まっ
た各証拠について疑似可能性分布（各仮説に割り当てら
れた、その仮説が真である可能性の分布）を求める。例
えば、パターンマッチング処理により集めた証拠である
第１候補の評価値に関する疑似可能性分布、同じく第１
候補と第２候補との評価値の差に関する疑似可能性分
布、パス選択処理により集めた証拠であるパス選択時の
評価値の差に関する疑似可能性分布、ルール処理から集
めた証拠内容に関する疑似可能性分布、言語処理より集
めた修正内容に関する疑似可能性分布を生成する。Next, a specific example in which the certainty factor is comprehensively determined by using the above-described Dempster-Schaefer probability theory will be described. This Dempster-Schafer theory of probability has the characteristic that it can represent ignorance and can collect and synthesize fragmentary information (independent evidence), and the probability of "trust", "ignorance", and "untrust" It is divided into areas and "credibility" is called the lower bound probability, and "credit + ignorance" is called the upper bound probability. Specifically, the hypothesis H1 that this character is correct,
The hypothesis H2 that this character is wrong is set, and the pseudo-potential distribution (the distribution of the probability that the hypothesis is true assigned to each hypothesis) is obtained for each piece of evidence gathered. For example, the pseudo-possibility distribution regarding the evaluation value of the first candidate, which is the evidence gathered by the pattern matching process, also the first possibility
Pseudo-possibility distribution regarding difference in evaluation value between candidate and second candidate, pseudo-possibility distribution regarding difference in evaluation value at the time of path selection which is evidence collected by path selection processing, pseudo-possibility regarding evidence content collected from rule processing Generates a pseudo-possibility distribution related to the correction content collected from the sex distribution and language processing.

【０００７】証拠が数値情報の場合、この疑似可能性分
布は、例えば次のような一次方程式により求める。ｙ＝−ｘ／３＋１００式中、ｙは疑似可能性分布、ｘは数値情報の証拠であ
る。この式で、評価値が１００の場合には、仮説Ｈ１の
可能性６７％、仮説Ｈ２の可能性３３％が求められる。
証拠が数値情報でない場合は、ヒューリスティックに与
える。例えば、あるルールを満足しなかったという証拠
の場合、仮説Ｈ１の可能性は２０％、仮説Ｈ２の可能性
は８０％というように予め決めておく。When the evidence is numerical information, this pseudo-possibility distribution is obtained by, for example, the following linear equation. y = -x / 3 + 100 In the formula, y is a pseudo-possibility distribution and x is evidence of numerical information. If the evaluation value is 100, the probability 67% of the hypothesis H1 and the probability 33% of the hypothesis H2 are calculated by this formula.
If the evidence is not numerical, give it a heuristic. For example, in the case of evidence that a certain rule is not satisfied, the probability of the hypothesis H1 is 20% and the probability of the hypothesis H2 is 80%.

【０００８】このようにして集めた各証拠についての疑
似可能性分布から、次に基本確率割り当てを求める。す
なわち、疑似可能性分布を「正しい」、「無知」、「間
違い」の各確率に振り分ける。例えば、仮説Ｈ１の可能
性が６７％、仮説Ｈ２の可能性が３３％の場合は、３４％（＝６７−３３）：正しい６６％（＝１００−３４）：無知０％：間違いのようにする。また、例えば、仮説Ｈ１の可能性が２０
％、仮説Ｈ２の可能性が８０％の場合は、０％：正しい３０％（＝１００−７０）：無知７０％（＝９０−２０）：間違いのようにする。From the pseudo-potential distribution for each piece of evidence thus collected, the basic probability allocation is obtained. That is, the pseudo possibility distribution is assigned to each probability of “correct”, “ignorance”, and “error”. For example, if the probability of hypothesis H1 is 67% and the probability of hypothesis H2 is 33%, 34% (= 67-33): correct 66% (= 100-34): ignorance 0%: like To do. Also, for example, the probability of hypothesis H1 is 20
%, If the possibility of hypothesis H2 is 80%, 0%: correct 30% (= 100-70): ignorance 70% (= 90-20): make an error.

【０００９】次に、デンプスター（Ｄｅｍｐｓｔｅｒ）
の統合規則により、基本確率を合成する。さらに、下界
確率と上界確率を求める。次の陪審員のモデルを例に説
明する。この例では、有罪の下界確率は１５％、上界確率は７４
％（＝１５＋５９）、無罪の下界確率は２６％、上界確
率は８５％（２６＋５９）となる。最後に、下界確率と
上界確率から、確信度のランクを決定する。例えば、仮
説Ｈ２の上界確率から次のような閾値によりＡ、Ｂ、Ｃ
のランクを決める。Ｈ２の上界確率が０〜１９正しい（Ａランク）Ｈ２の上界確率が２０〜７９怪しい（Ｂランク）Ｈ２の上界確率が８０〜１００間違い（Ｃランク）Next, Dempster
The basic probability is synthesized by the integration rule of. Furthermore, lower bound probability and upper bound probability are calculated. Take the following jury model as an example. In this example, the lower bound probability of guilty is 15% and the upper bound probability is 74.
% (= 15 + 59), the innocence lower bound probability is 26%, and the upper bound probability is 85% (26 + 59). Finally, the confidence level is determined from the lower bound probability and the upper bound probability. For example, from the upper bound probability of hypothesis H2, A, B, C
Decide the rank of. The upper bound probability of H2 is 0 to 19 correct (A rank) The upper bound probability of H2 is 20 to 79 Suspicious (B rank) The upper bound probability of H2 is 80 to 100 False (C rank)

【００１０】[0010]

【発明が解決しようとする課題】ところで、特開平５−
１０１２１６号公報に記載の文字認識装置は、上述のよ
うに、第１候補文字と第２候補文字との類似度の差が所
定値未満の場合には、認識結果が誤認識の確率が高いと
して第１候補文字を強調表示できるにすぎず、認識結果
の確からしさである確信度を閾値と比較することはでき
ない。一方、特開平４−２１１８８３号公報に記載の文
字認識装置は、確信度を閾値と比較して認識結果の誤り
を検出できるが、その閾値が固定であって必ずしも適正
ではない。そのため、認識対象の書体や文字種が増えて
きたことに加え、認識対象の原稿が多種多様化してきて
いる近年では、認識結果の誤りの検出精度が不十分であ
り、もって装置利用者の認識結果の修正作業の効率も不
十分であるという問題が生じていた。SUMMARY OF THE INVENTION Incidentally, Japanese Patent Application Laid-Open No.
As described above, the character recognition device described in Japanese Patent No. 101216 determines that the recognition result has a high probability of erroneous recognition when the difference in the similarity between the first candidate character and the second candidate character is less than a predetermined value. Only the first candidate character can be highlighted, and the certainty factor, which is the certainty of the recognition result, cannot be compared with the threshold value. On the other hand, the character recognition device described in Japanese Patent Application Laid-Open No. 4-211883 can detect an error in the recognition result by comparing the certainty factor with a threshold value, but the threshold value is fixed and not necessarily appropriate. For this reason, in addition to the increase in the typefaces and character types to be recognized, in recent years the number of manuscripts to be recognized has been diversified, the accuracy of error detection in recognition results is insufficient, and the recognition results of device users There was a problem that the efficiency of the correction work of was insufficient.

【００１１】そこで、本発明の目的は、認識結果の誤り
検出精度の向上を図ることにより、装置利用者の認識結
果の修正作業の効率化を図ることができる文字認識装置
を提供することにある。Therefore, an object of the present invention is to provide a character recognition device which can improve the efficiency of error detection accuracy of the recognition result, thereby improving the efficiency of the correction work of the recognition result of the device user. .

【００１２】[0012]

【課題を解決するための手段】請求項１記載の発明で
は、入力された未知の文字について認識結果を得るとと
もに、その認識結果の確からしさである確信度を決定で
きる文字認識装置に、前記の決定された確信度を閾値と
比較して前記認識結果の誤りを検出する検出手段と、こ
の検出手段の検出の際に、前記閾値を所定の条件に応じ
て随時変更させる閾値変更手段と、前記検出手段の検出
した認識結果の誤りを、前記認識結果とともに出力する
出力手段とを、具備させて前記目的を達成する。請求項
２記載の発明では、請求項１記載の文字認識装置におい
て、前記閾値変更手段は、認識された文字の種類に応じ
て前記閾値を随時変更させることにより前記目的を達成
する。請求項３記載の発明では、請求項１記載の文字認
識装置において、前記閾値変更手段は、認識された文字
の修正時における修正者の見逃し易さの程度に応じて、
前記閾値を随時変更させることにより前記目的を達成す
る。請求項４記載の発明では、請求項１記載の文字認識
装置において、前記閾値変更手段は、認識対象である文
字の誤認識のし易さの程度に応じて前記閾値を随時変更
させることにより前記目的を達成する。According to a first aspect of the present invention, there is provided a character recognizing device capable of obtaining a recognition result of an input unknown character and determining a certainty factor which is the certainty of the recognition result. Detecting means for detecting the error in the recognition result by comparing the determined certainty factor with a threshold value, threshold value changing means for changing the threshold value at any time according to a predetermined condition when the detecting means detects the error, The object is achieved by providing an output means for outputting an error in the recognition result detected by the detection means together with the recognition result. According to a second aspect of the present invention, in the character recognition apparatus according to the first aspect, the threshold changing means achieves the object by changing the threshold according to the type of the recognized character. According to a third aspect of the present invention, in the character recognition apparatus according to the first aspect, the threshold value changing means is configured to change the recognized character according to the degree of ease of overlooking by a corrector.
The object is achieved by changing the threshold value at any time. According to a fourth aspect of the present invention, in the character recognition device according to the first aspect, the threshold value changing means changes the threshold value at any time according to the degree of easiness of erroneous recognition of a character to be recognized. Achieve the purpose.

【００１３】[0013]

【発明の実施の形態】以下、本発明の文字認識装置にお
ける好適な実施の形態について、図１ないし図９を参照
して詳細に説明する。図１は本発明の第１の実施の形態
である文字認識装置の構成を表したものである。この文
字認識装置は、図１に示すように、原稿の画像を光学的
に読み取るスキャナ１と、認識された文字の修正作業な
どに用いるキーボードなどの入力装置２と、後述の各要
素からなる文字認識部３と、ＣＲＴなどのディスプレイ
４と、レーザプリンタなどの印字装置５とを備え、これ
ら各構成要素がバスに接続されている。文字認識部３
は、画像を記憶する画像メモリ３１と、後述のような文
字認識処理などを所定の手順で行うために各部を制御す
る中央演算処理装置３２と、中央演算処理装置３２が行
う後述の文字認識処理などの各種のプログラムを予め格
納するＲＯＭ（リード・オンリー・メモリ）３３と、文
字の標準的な特徴量が登録されている辞書３４と、文字
認識処理中の各種のデータを一時的に記憶するワークエ
リアＲＡＭ（ランダム・アクセス・メモリ）３５とを備
えている。BEST MODE FOR CARRYING OUT THE INVENTION Preferred embodiments of the character recognition device of the present invention will be described in detail below with reference to FIGS. FIG. 1 shows a configuration of a character recognition device according to a first embodiment of the present invention. As shown in FIG. 1, this character recognition device includes a scanner 1 for optically reading an image of an original document, an input device 2 such as a keyboard used for correcting a recognized character, and a character composed of each element described later. A recognition unit 3, a display 4 such as a CRT, and a printing device 5 such as a laser printer are provided, and these components are connected to a bus. Character recognition unit 3
Is an image memory 31 that stores images, a central processing unit 32 that controls each unit to perform a character recognition process, which will be described later, in a predetermined procedure, and a character recognition process that will be described later performed by the central processing unit 32. A ROM (Read Only Memory) 33 that stores various programs such as the following in advance, a dictionary 34 in which standard feature amounts of characters are registered, and various data during character recognition processing are temporarily stored. A work area RAM (random access memory) 35 is provided.

【００１４】次に、このような構成からなる文字認識装
置の動作について、図２および図３のフローチャートを
参照して説明する。いま、スキャナ１により原稿画像が
読み取られると２値画像化されて（ステップ２１）、画
像メモリ３１に格納されたのち、その画像メモリ３１に
格納された２値画像から文字画像を切り出す切り出し処
理を行う（ステップ２２）。次に、この切り出された文
字画像の歪みを補正するために正規化処理を行う（ステ
ップ２３）。ステップ２４のパターンマッチング処理で
は、正規化後の文字画像の特徴量を抽出し、抽出した特
徴量を辞書３４に登録する標準的な特徴量と比較して類
似度また相違度を算出し、類似度が大きいまたは相違度
が小さい１つ以上の認識候補文字を決定する。そして、
全ての文字について、これらステップ２２からステップ
２４までの処理が終了すると（ステップ２５；Ｙ）、次
のステップ２６に進む。Next, the operation of the character recognition apparatus having such a configuration will be described with reference to the flowcharts of FIGS. Now, when the document image is read by the scanner 1, it is converted into a binary image (step 21), stored in the image memory 31, and then a cutting process for cutting out a character image from the binary image stored in the image memory 31 is performed. Perform (step 22). Next, a normalization process is performed to correct the distortion of the cut out character image (step 23). In the pattern matching process of step 24, the feature amount of the normalized character image is extracted, the extracted feature amount is compared with the standard feature amount registered in the dictionary 34 to calculate the similarity or the difference, and the similarity is calculated. One or more recognition candidate characters having a high degree or a low degree of difference are determined. And
When the processing from step 22 to step 24 is completed for all characters (step 25; Y), the process proceeds to the next step 26.

【００１５】ステップ２６のパス選択処理では、例えば
日本語の場合には、全角で１文字なのか半角で２文字な
のかなど、最適な文字の組み合わせを選択する。ステッ
プ２７のルール処理では、文字の大きさの情報（正規化
するので、例えば「Ｃ」と「ｃ」などの大小文字の区別
に関係）、文字種の情報（例えばカタカナ文字列の中に
漢数字の「一」があったら、これはカタカナの長音記号
「ー」に修正する）などのルールによって、ステップ２
４のパターンマッチング処理による認識結果候補を修正
する。ステップ２８の言語処理では、ステップ２７のル
ール処理によって修正後の認識結果文字列を形態素解析
し、辞書３４と照合することにより認識の誤りを修正す
る。この言語処理の結果が最終的な文字認識の結果であ
り、装置利用者による認識結果の修正の対象になる。In the path selection process of step 26, for example, in the case of Japanese, an optimum combination of characters such as one full-width character or two half-width characters is selected. In the rule process of step 27, character size information (normalized, so it is related to case sensitivity such as “C” and “c”) and character type information (eg, katakana in a katakana character string). If there is a “one” in the above, this is corrected to the katakana long-sound “–”)
The recognition result candidate by the pattern matching process of No. 4 is corrected. In the language processing of step 28, the corrected recognition result character string is subjected to morphological analysis by the rule processing of step 27 and collated with the dictionary 34 to correct the recognition error. The result of this language processing is the final result of character recognition, and is the target of correction of the recognition result by the device user.

【００１６】ステップ２９の確信度決定処理では、例え
ばステップ２４におけるパターンマッチング処理の第１
候補の評価値（例えば辞書との距離を総輪郭数で除した
値など）または第１候補と第２候補との評価値の差、ス
テップ２６におけるパス選択処理でパスを決めるときの
評価値、ステップ２７におけるルール処理で修正のため
に適用されたルールの情報、ステップ２８における言語
処理で言語修正の結果を表す情報を集め、これら情報を
証拠としてデンプスター・シェーファーの確率理論を使
って総合的に、最終認識の認識結果の確信度を決定す
る。この確信度の具体的な求め方は従来技術で説明した
ので、省略する。ここで、確信度とは、認識結果がどの
程度確からしいかを表すものであり、例えば、０〜１０
０の整数値で表現され、数値が高いほど確からしいこと
を意味する。そして、全ての文字について、これらステ
ップ２６からステップ２９までの処理が終了すると（ス
テップ３０；Ｙ）、文字認識処理を終了する。In the confidence determination process of step 29, for example, the first pattern matching process of step 24 is performed.
An evaluation value of a candidate (for example, a value obtained by dividing the distance from the dictionary by the total number of contours) or a difference between the evaluation values of the first candidate and the second candidate, an evaluation value when a path is determined by the path selection processing in step 26, Information on the rules applied for correction in the rule processing in step 27 and information representing the result of the language correction in the language processing in step 28 are collected, and the information is used as evidence to comprehensively use the probability theory of Dempster-Schaefer. , Determining the certainty factor of the recognition result of the final recognition. Since the specific method of obtaining the certainty factor has been described in the related art, it will be omitted. Here, the certainty factor indicates how likely the recognition result is, for example, 0 to 10
Expressed as an integer value of 0, the higher the number, the more likely it is. Then, when the processing from step 26 to step 29 is completed for all characters (step 30; Y), the character recognition processing is ended.

【００１７】次に、以上のようにして決定された確信度
を閾値と比較し、その比較結果から認識結果の誤りを検
出する認識結果の誤り検出処理について、図３のフロー
チャートを参照して説明する。この処理では、まず確信
度の閾値との比較に先立って、その閾値を文字の認識結
果等の所定の条件に応じて所定値に随時設定する（ステ
ップ３１）。換言すれば、文字の認識結果等の所定の条
件に応じて閾値を随時変更する。そして、決定されてい
る確信度を随時設定される閾値と比較し（ステップ３
２）、その比較の結果、確信度が閾値よりも小さいとき
（ステップ３２；Ｙ）には「認識結果は怪しい」と判定
し（ステップ３３）、確信度が閾値よりも大きいとき
（ステップ３２；Ｎ）には「認識結果は正しい」と判定
する（ステップ３４）。そして、その判定結果の情報を
認識結果の文字に付加してディスプレイ４に表示、また
は印刷装置５により印刷する（ステップ３５）。Next, a recognition result error detection process of comparing the certainty factor determined as described above with a threshold value and detecting an error in the recognition result from the comparison result will be described with reference to the flowchart of FIG. To do. In this processing, first, prior to the comparison with the certainty factor threshold value, the threshold value is set at any time to a predetermined value in accordance with a predetermined condition such as a character recognition result (step 31). In other words, the threshold value is changed at any time according to a predetermined condition such as a character recognition result. Then, the determined certainty factor is compared with a threshold value set at any time (step 3
2) As a result of the comparison, when the certainty factor is smaller than the threshold value (step 32; Y), it is determined that the recognition result is doubtful (step 33), and when the certainty factor is larger than the threshold value (step 32; In N), it is determined that the recognition result is correct (step 34). Then, the information of the determination result is added to the character of the recognition result and displayed on the display 4 or printed by the printing device 5 (step 35).

【００１８】このディスプレイ４の表示例、または印字
装置５による印刷例としては、「認識結果は正しい」場
合の文字は黒、「認識結果は怪しい」場合の文字は背景
は黒で白抜きにしたり、認識結果の正否により文字の輝
度を変化させたりする。このような表示により、装置利
用者は、認識結果の確信度を容易に認識して、修正可能
な文字を素早く且つ的確に見つけ出し、認識結果の修正
作業を効率的に行うことができる。As a display example of the display 4 or a printing example by the printing device 5, a character when "recognition result is correct" is black, and a character when "recognition result is suspicious" is black with a white background. , The brightness of the character is changed depending on whether the recognition result is correct or not. With such a display, the device user can easily recognize the certainty factor of the recognition result, quickly and accurately find the correctable character, and efficiently perform the correction work of the recognition result.

【００１９】このように第１の実施の形態によれば、確
信度を比較する際の閾値を、文字の認識結果などの所定
の条件に応じて所定値に随時設定するようにしたので、
認識結果の誤り検出精度の向上を図ることができ、もっ
て装置利用者の認識結果の修正作業の効率化が図れる。As described above, according to the first embodiment, the threshold value for comparing the certainty factors is set to a predetermined value at any time in accordance with a predetermined condition such as a character recognition result.
The accuracy of error detection of the recognition result can be improved, and the efficiency of the work of correcting the recognition result of the device user can be improved.

【００２０】次に、本発明の第２の実施の形態である文
字認識装置について、図４を参照して説明する。図４
は、本発明の第２の実施の形態である文字認識装置の動
作の要部を示すフローチャートである。この第２の実施
の形態は、第１の実施の形態の図３に示す認識結果の誤
り検出処理を、図４に示す認識結果の誤り検出処理とし
てより具体化したものである。Next, a character recognition device according to a second embodiment of the present invention will be described with reference to FIG. FIG.
FIG. 8 is a flow chart showing a main part of the operation of the character recognition device according to the second embodiment of the present invention. In the second embodiment, the error detection process of the recognition result shown in FIG. 3 of the first embodiment is further embodied as the error detection process of the recognition result shown in FIG.

【００２１】すなわち、第２の実施の形態では、図４に
示すように、確信度と閾値との比較に先立って、その閾
値を認識結果により得られた文字の種類に応じて、図５
に示すような値に随時設定する（ステップ４１）。換言
すれば、認識結果により得られた文字の種類に応じて閾
値を随時変更する。そして、確信度を随時設定される閾
値と比較し（ステップ４２）、その比較の結果、確信度
が閾値よりも小さいとき（ステップ４２；Ｙ）には「認
識結果は怪しい」と判定し（ステップ４３）、確信度が
閾値よりも大きいとき（ステップ４２；Ｎ）には「認識
結果は正しい」と判定する（ステップ４４）。そして、
その判定結果の情報を、第１の実施の形態と同様に認識
結果の文字に付加してディスプレイ４に表示、または印
刷装置５により印刷する（ステップ４５）。なお、第２
の実施の形態の文字認識処理やその構成は第１の実施の
形態と同様であるので、その説明は省略する。That is, in the second embodiment, as shown in FIG. 4, prior to the comparison between the certainty factor and the threshold value, the threshold value is changed according to the type of the character obtained from the recognition result.
The value as shown in (4) is set at any time (step 41). In other words, the threshold is changed as needed according to the type of character obtained from the recognition result. Then, the certainty factor is compared with a threshold value set at any time (step 42), and when the certainty factor is smaller than the threshold value as a result of the comparison (step 42; Y), it is determined that the “recognition result is suspicious” (step 42). 43), when the certainty factor is larger than the threshold value (step 42; N), it is determined that "the recognition result is correct" (step 44). And
Information of the determination result is added to the character of the recognition result and displayed on the display 4 or printed by the printing device 5 as in the first embodiment (step 45). The second
Since the character recognition processing and its configuration of the second embodiment are similar to those of the first embodiment, the description thereof will be omitted.

【００２２】このように第２の実施の形態によれば、確
信度を比較する際の閾値を、認識結果により得られた文
字の種類に応じて随時設定するようにしたので、文字の
種類にによる認識結果の誤り検出精度のばらつきを防止
でき、もって装置利用者の認識結果の修正作業の効率化
が図れる。As described above, according to the second embodiment, the threshold value for comparing the certainty factors is set at any time according to the character type obtained from the recognition result. It is possible to prevent the error detection accuracy from being varied due to the recognition result, and to improve the efficiency of the correction work of the recognition result by the device user.

【００２３】次に、本発明の第３の実施の形態である文
字認識装置について、図６を参照して説明する。図６
は、本発明の第３の実施の形態である文字認識装置の動
作の要部を示すフローチャートである。この第３の実施
の形態は、第１の実施の形態の図３に示す認識結果の誤
り検出処理を、図６に示す認識結果の誤り検出処理とし
てより具体化したものである。Next, a character recognition device according to a third embodiment of the present invention will be described with reference to FIG. Figure 6
FIG. 9 is a flowchart showing a main part of the operation of the character recognition device according to the third embodiment of the present invention. In the third embodiment, the error detection process of the recognition result shown in FIG. 3 of the first embodiment is further embodied as the error detection process of the recognition result shown in FIG.

【００２４】すなわち、第３の実施の形態では、図６に
示すように、確信度と閾値との比較に先立って、認識結
果により得られた文字が、人間が認識結果の修正時に見
逃し易い文字か否かに応じて、閾値を随時設定する（ス
テップ６１）。換言すれば、認識結果により得られた文
字の修正時における見逃し易さの程度に応じて、閾値を
随時変更する。ここで、人間が認識結果の修正時に見逃
し易い文字か否かという意味は、例えば、「Ｏ（オウ）
と０（ゼロ）」、「タと夕（ゆう）」、「へ（ひらが
な）とヘ（カタカナ）」のように、目視では違いがわか
り難いものを指し、これらは、認識結果の修正時には見
逃し易いからである。従って、この種の文字は他の文字
とは異なって、なるべく誤りとして検出し易くする必要
がある。That is, in the third embodiment, as shown in FIG. 6, the character obtained by the recognition result prior to the comparison between the certainty factor and the threshold value is a character that is easily overlooked by a human when correcting the recognition result. A threshold value is set at any time depending on whether or not (step 61). In other words, the threshold value is changed at any time according to the degree of ease of overlooking when correcting the character obtained from the recognition result. Here, the meaning of whether or not a character is easily missed by a person when correcting the recognition result is, for example, “O (oh)”.
And 0 (zero), "Ta and Yu (Yu)", "He (Hiragana) and He (Katakana)", which are hard to see the difference visually. These are overlooked when the recognition result is corrected. Because it is easy. Therefore, unlike other characters, this kind of character should be detected as an error as easily as possible.

【００２５】そこで、例えば図７に示すように、カテゴ
リによって閾値を変更する。カテゴリ「１」は、「Ｏ
（オウ）と０（ゼロ）」、「１（いち）とｌ（エル）」
のような英字−数字関係、カテゴリ「２」は、「タと夕
（ゆう）」、「トと卜（ぼく）」のような漢字−カタカ
ナ関係、カテゴリ「３」は、「へ（ひらがな）とヘ（カ
タカナ）」、「り（ひらがな）とリ（カタカナ）」のよ
うなひらがな−カタカナ関係、それ以外の文字は、その
他のカテゴリといった具合である。そして、確信度を上
述のように随時設定される閾値と比較し（ステップ６
２）、その結果、確信度が閾値よりも小さいとき（ステ
ップ６２；Ｙ）には「認識結果は怪しい」と判定し（ス
テップ６３）、確信度が閾値よりも大きいとき（ステッ
プ６２；Ｎ）には「認識結果は正しい」と判定する（ス
テップ６４）。次に、その判定結果の情報を、第１の実
施の形態と同様に認識結果の文字に付加してディスプレ
イ４に表示、または印刷装置５により印刷する（ステッ
プ６５）。なお、第３の実施の形態の文字認識処理やそ
の構成は第１の実施の形態と同様であるので、その説明
は省略する。Therefore, for example, as shown in FIG. 7, the threshold is changed according to the category. Category "1" is "O
(Oh) and 0 (zero) "," 1 (one) and l (el) "
English-numeric relations such as, category "2" is Kanji-katakana relations such as "Ta and Yu", "To and me", and category "3" is "He (Hiragana)" Hiragana-Katakana relations such as "and f (katakana)", "ri (hiragana) and ri (katakana)", and other characters are other categories. Then, the certainty factor is compared with the threshold value set at any time as described above (step 6).
2) As a result, when the certainty factor is smaller than the threshold value (step 62; Y), it is determined that the recognition result is doubtful (step 63), and when the certainty factor is larger than the threshold value (step 62; N). Is determined to be "correct" (step 64). Next, the information of the determination result is added to the character of the recognition result and displayed on the display 4 or printed by the printing device 5 as in the first embodiment (step 65). Since the character recognition processing and its configuration of the third embodiment are the same as those of the first embodiment, the description thereof will be omitted.

【００２６】このように第３の実施の形態によれば、確
信度を比較する際の閾値を、認識結果により得られた文
字が、人間が認識結果の修正時に見逃し易い文字か否か
に応じて随時設定するようにしたので、人間が認識結果
の修正時に見逃し易い文字について認識結果の誤り検出
精度の向上を図ることができ、もって装置利用者の認識
結果の修正作業の効率化が図れる。As described above, according to the third embodiment, the threshold value used for comparing the certainty factors is determined according to whether the character obtained from the recognition result is a character that is easily overlooked by humans when correcting the recognition result. Since it is set as needed, the accuracy of error detection of the recognition result can be improved for characters that are easily overlooked by humans when correcting the recognition result, and the efficiency of the work of correcting the recognition result of the device user can be improved.

【００２７】次に、本発明の第４の実施の形態である文
字認識装置について、図８を参照して説明する。図８
は、本発明の第４の実施の形態である文字認識装置の動
作の要部を示すフローチャートである。この第４の実施
の形態は、第１の実施の形態の図３に示す認識結果の誤
り検出処理を、図８に示す認識結果の誤り検出処理とし
てより具体化したものである。Next, a character recognition device according to a fourth embodiment of the present invention will be described with reference to FIG. FIG.
FIG. 9 is a flow chart showing a main part of the operation of the character recognition device according to the fourth embodiment of the present invention. In the fourth embodiment, the error detection process of the recognition result shown in FIG. 3 of the first embodiment is further embodied as the error detection process of the recognition result shown in FIG.

【００２８】すなわち、第４の実施の形態は、図８に示
すように、確信度と閾値との比較に先立って、認識され
る文字が文字認識部３で誤認識し易い文字か否かに応じ
て、閾値を随時設定する（ステップ８１）。換言すれ
ば、認識される文字の文字認識部３での誤認識のし易さ
の程度に応じて、閾値を随時変更する。ここで、認識さ
れた文字が誤認識し易い文字か否かという意味は、特徴
量空間で近いかどうかという意味である。従って、文字
認識がどのような特徴量を採用しているかで当然その閾
値に対応するカテゴリの内容は変わる。特徴量空間で近
いかどうかは、全ての文字の特徴量をクラスタリング
し、それによって決まった幾つかのクラスタの中から十
分大きいものを誤認識し易い文字のカテゴリとして閾値
を決める。例えば、図９に示すように、閾値をカテゴリ
によって変える。That is, in the fourth embodiment, as shown in FIG. 8, it is determined whether the recognized character is a character that is likely to be erroneously recognized by the character recognition unit 3 prior to the comparison of the certainty factor and the threshold value. Accordingly, the threshold value is set at any time (step 81). In other words, the threshold is changed at any time according to the degree of ease of erroneous recognition of the recognized character in the character recognition unit 3. Here, the meaning of whether the recognized character is a character that is easily erroneously recognized is that it is close in the feature amount space. Therefore, the content of the category corresponding to the threshold naturally changes depending on what kind of feature amount the character recognition adopts. Whether or not they are close to each other in the feature amount space is determined by clustering the feature amounts of all characters and determining a threshold value as a category of characters that is easily erroneously recognized as a sufficiently large one among several determined clusters. For example, as shown in FIG. 9, the threshold is changed depending on the category.

【００２９】そして、確信度を上述のように随時設定さ
れる閾値と比較し（ステップ８２）、その結果、確信度
が閾値よりも小さいとき（ステップ８２；Ｙ）には「認
識結果は怪しい」と判定し（ステップ８３）、確信度が
閾値よりも大きいとき（ステップ８２；Ｎ）には「認識
結果は正しい」と判定する（ステップ８４）。次に、そ
の判定結果の情報を、第１の実施の形態と同様に認識結
果の文字に付加してディスプレイ４に表示、または印刷
装置５により印刷する（ステップ８５）。なお、第４の
実施の形態の文字認識処理やその構成は第１の実施の形
態と同様であるので、その説明は省略する。Then, the certainty factor is compared with the threshold value set at any time as described above (step 82). As a result, when the certainty factor is smaller than the threshold value (step 82; Y), "the recognition result is suspicious". When the certainty factor is larger than the threshold value (step 82; N), it is determined that the "recognition result is correct" (step 84). Next, the information of the judgment result is added to the character of the recognition result and displayed on the display 4 or printed by the printing device 5 as in the first embodiment (step 85). Since the character recognition processing and its configuration of the fourth embodiment are the same as those of the first embodiment, the description thereof will be omitted.

【００３０】このように第４の実施の形態によれば、確
信度を比較する際の閾値を、認識された文字が誤認識し
やすい文字か否かに応じて随時設定するようにしたの
で、誤認識し易い文字の認識結果の誤り検出の精度向上
を図ることができ、もって装置利用者の認識結果の修正
作業の効率化が図れる。As described above, according to the fourth embodiment, the threshold value for comparing the certainty factors is set at any time according to whether the recognized character is a character that is easily misrecognized. It is possible to improve the accuracy of the error detection of the recognition result of the character that is easily erroneously recognized, so that the efficiency of the correction work of the recognition result of the device user can be improved.

【００３１】なお、以上述べた第２の実施の形態は、文
字の種類に応じて閾値を随時変更するものであり、第３
の実施の形態は、認識結果により得られた文字が、人間
が認識結果の修正時に見逃し易い文字か否かに応じて随
時変更するものである。しかし、閾値の変更条件は、こ
れら２つの条件を考慮してもよい。In the second embodiment described above, the threshold is changed as needed according to the type of character.
In this embodiment, the character obtained from the recognition result is changed at any time depending on whether or not the character is a character that is easily missed by a person when correcting the recognition result. However, the threshold changing condition may take these two conditions into consideration.

【００３２】[0032]

【発明の効果】請求項１記載の発明によれば、認識結果
の確信度と閾値を比較し、認識結果の誤りを検出する際
に、その閾値を所定の条件に応じて随時変更するように
したので、認識結果の誤り検出精度が向上し、もって装
置利用者の認識結果の修正作業の効率化を図ることがで
きる。請求項２記載の発明では、確信度を比較する際の
閾値を、認識結果により得られた文字の種類に応じて随
時変更するようにしたので、文字の種類にによる認識結
果の誤り検出精度のばらつきを防止でき、もって装置利
用者の認識結果の修正作業の効率化を図ることができ
る。According to the first aspect of the present invention, the certainty factor of the recognition result is compared with the threshold value, and when the error in the recognition result is detected, the threshold value is changed according to a predetermined condition. Therefore, the error detection accuracy of the recognition result is improved, and the efficiency of the work of correcting the recognition result by the device user can be improved. According to the second aspect of the present invention, the threshold value used when comparing the certainty factors is changed at any time according to the character type obtained from the recognition result. Therefore, the error detection accuracy of the recognition result depending on the character type can be improved. It is possible to prevent variations, and thus to improve the efficiency of the work of correcting the recognition result of the device user.

【００３３】請求項３記載の発明では、確信度を比較す
る際の閾値を、認識された文字の修正時における修正者
の見逃し易さの程度に応じて随時変更させるようにした
ので、人間が認識結果の修正時に見逃し易い文字につい
て認識結果の誤り検出精度の向上を図ることができ、も
って装置利用者の認識結果の修正作業の効率化を図るこ
とができる。請求項４記載の発明では、確信度を比較す
る際の閾値を、認識される文字の誤認識し易さの程度に
応じて随時変更させるようにしたので、誤認識し易い文
字の認識結果の誤り検出精度の向上を図ることができ、
もって装置利用者の認識結果の修正作業の効率化を図る
ことができる。According to the third aspect of the present invention, the threshold value for comparing the certainty factors is changed at any time according to the degree of ease of overlooking by the corrector when correcting the recognized character. It is possible to improve the accuracy of error detection of the recognition result for a character that is easily overlooked when correcting the recognition result, and thus to improve the efficiency of the work of correcting the recognition result by the device user. In the invention according to claim 4, the threshold value for comparing the certainty factors is changed at any time according to the degree of easiness of erroneously recognizing the recognized character. It is possible to improve the accuracy of error detection,
Therefore, the efficiency of the work of correcting the recognition result of the device user can be improved.

[Brief description of drawings]

【図１】本発明の第１の実施の形態の文字認識装置の基
本構成を示すブロック図である。FIG. 1 is a block diagram showing a basic configuration of a character recognition device according to a first embodiment of the present invention.

【図２】同文字認識装置による文字認識処理を示すフロ
ーチャートである。FIG. 2 is a flowchart showing a character recognition process by the character recognition device.

【図３】同文字認識装置による認識結果の誤り検出処理
を示すフローチャートである。FIG. 3 is a flowchart showing a recognition result error detection process by the character recognition device.

【図４】本発明の第２の実施の形態の文字認識装置によ
る認識結果の誤り検出処理を示すフローチャートであ
る。FIG. 4 is a flowchart showing a recognition result error detection process by the character recognition device according to the second embodiment of the present invention.

【図５】各文字種における閾値の例を示す図表である。FIG. 5 is a table showing an example of threshold values for each character type.

【図６】本発明の第３の実施の形態の文字認識装置によ
る認識結果の誤り検出処理を示すフローチャートであ
る。FIG. 6 is a flowchart showing a recognition result error detection process by the character recognition device according to the third embodiment of the present invention.

【図７】人間から見た各カテゴリにおける閾値の例を示
す図表である。FIG. 7 is a table showing an example of threshold values in each category as viewed from a human.

【図８】本発明の第４の実施の形態の文字認識装置によ
る認識結果の誤り検出処理を示すフローチャートであ
る。FIG. 8 is a flowchart showing a recognition result error detection process by the character recognition device in the fourth embodiment of the present invention.

【図９】文字認識から見た各カテゴリにおける閾値の例
を示す図表である。FIG. 9 is a table showing an example of thresholds in each category as seen from character recognition.

【図１０】従来装置の文字認識処理を示すフローチャー
トである。FIG. 10 is a flowchart showing a character recognition process of a conventional device.

【図１１】従来装置の認識結果の誤り検出処理を示すフ
ローチャートである。FIG. 11 is a flowchart showing an error detection process of a recognition result of a conventional device.

[Explanation of symbols]

１スキャナ２入力装置３文字認識部４ディスプレイ５印字装置３１画像メモリ３２中央演算処理装置３３ＲＯＭ３４辞書３５ワークエリアＲＡＭ 1 Scanner 2 Input Device 3 Character Recognition Unit 4 Display 5 Printing Device 31 Image Memory 32 Central Processing Unit 33 ROM 34 Dictionary 35 Work Area RAM

Claims

[Claims]

1. A character recognition device capable of obtaining a recognition result for an unknown character that has been input and determining a certainty factor, which is the certainty of the recognition result, by comparing the determined certainty factor with a threshold value. A detection unit that detects an error in the recognition result, a threshold value changing unit that changes the threshold value at any time according to a predetermined condition when the detection unit detects the error in the recognition result detected by the detection unit, A character recognition device comprising: an output unit that outputs the recognition result together with the recognition result.

2. The character recognition device according to claim 1, wherein the threshold value changing means changes the threshold value at any time according to the type of the recognized character.

3. The character recognition device according to claim 1, wherein the threshold value changing means changes the threshold value at any time in accordance with the degree of easiness of being missed by the corrector when correcting the recognized character. .

4. The character recognition device according to claim 1, wherein the threshold value changing unit changes the threshold value at any time according to the degree of easiness of erroneous recognition of a character to be recognized.