JPH0223906B2

Info

Publication number: JPH0223906B2
Application number: JP56070761A
Authority: JP
Inventors: Masaki Komya
Original assignee: Tokyo Shibaura Electric Co Ltd
Current assignee: Toshiba Corp
Priority date: 1981-05-13
Filing date: 1981-05-13
Publication date: 1990-05-25
Also published as: JPS57187780A

Description

[Detailed description of the invention]

The present invention relates to an optical character reader (hereinafter abbreviated as OCR) adapted to multiple fonts.

In recent years, a growing number of printed-character OCRs can read multiple character types, but as the number of character types increases, so does the number of similar-looking characters, and the reading rate increasingly tends to fall. In addition, building a dictionary for mix-multi (simultaneous mixed-font) reading requires running a large-capacity computer for a long time, which is also a technical problem.

An object of the present invention is to provide an OCR that resolves the above problems.

The present invention uses a multi-dictionary in which a dictionary is configured independently for each font. The first few characters are read using all of the dictionaries, the most compatible dictionary (the one with the highest average similarity) is selected automatically, and subsequent reading is performed using the selected dictionary. In this case the device appears, from the outside, to be reading mix-multi, but what it actually does is quite different.

The invention is described in detail below with reference to an embodiment. The figure is a block diagram showing the functions of one embodiment of the present invention. In the figure, 1 is a selective multi-font dictionary holding dictionaries configured independently for each character type (font), and 2 is a character recognition unit. For the first fixed number of characters on a form, the recognition unit reads each character by consulting all of the dictionaries, adds the highest similarity value obtained in each dictionary for that character to that dictionary's score independently, and stores each cumulative score in score storage register 3. After repeating this for the fixed number of characters, it uses comparator 4 to detect the highest score and automatically selects the dictionary to be used for that form.
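The selection step described above can be sketched as follows. This is only an illustrative sketch: the patent does not say how similarity is computed, so the normalized inner product used here is an assumption, and every name (`similarity`, `best_match`, `select_dictionary`, the feature-vector representation of a character) is hypothetical.

```python
from typing import Dict, List, Sequence, Tuple

Pattern = Sequence[float]              # assumed: a character as a feature vector
FontDictionary = Dict[str, Pattern]    # assumed: label -> reference pattern


def similarity(a: Pattern, b: Pattern) -> float:
    """Normalized inner product (an assumed measure; the patent names none)."""
    dot = sum(x * y for x, y in zip(a, b))
    norm = (sum(x * x for x in a) * sum(y * y for y in b)) ** 0.5
    return dot / norm if norm else 0.0


def best_match(pattern: Pattern, dictionary: FontDictionary) -> Tuple[str, float]:
    """Return the (label, similarity) of the closest entry in one dictionary."""
    return max(
        ((label, similarity(pattern, ref)) for label, ref in dictionary.items()),
        key=lambda pair: pair[1],
    )


def select_dictionary(
    samples: List[Pattern], dictionaries: Dict[str, FontDictionary]
) -> Tuple[str, Dict[str, float]]:
    """Accumulate each dictionary's best similarity per sample, pick the top."""
    scores = {name: 0.0 for name in dictionaries}    # role of score register 3
    for pattern in samples:
        for name, dictionary in dictionaries.items():
            _, best = best_match(pattern, dictionary)
            scores[name] += best
    chosen = max(scores, key=lambda name: scores[name])   # role of comparator 4
    return chosen, scores
```

Because the scores are kept per dictionary, a font whose references fit the samples only moderately well on every character can still lose to a font that fits them closely, which is the behavior the text describes.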

The reading of the first fixed number of characters described above is the dictionary selection stage. At this point no answer is output; instead, the highest similarity value obtained by each dictionary is added up per dictionary. When the fixed number of characters has been processed, the cumulative similarity scores of the dictionaries are compared, the dictionary with the highest cumulative score is adopted for that form, and recognition restarts from the first character.
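The selection rule can be written compactly as below. The symbols are introduced here for illustration and do not appear in the patent: $N$ is the fixed number of sample characters, $x_i$ the $i$-th character pattern, $D_k$ the dictionary for font $k$, and $s$ the (unspecified) similarity measure.

```latex
S_k = \sum_{i=1}^{N} \max_{c \in D_k} s(x_i, c),
\qquad
k^{*} = \arg\max_{k} S_k
```

Since every dictionary is scored over the same $N$ characters, choosing the largest cumulative score $S_k$ is equivalent to choosing the highest average similarity $S_k / N$.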

If rejects occur frequently during reading after dictionary selection and their count exceeds a fixed value, the number of samples used for dictionary selection is increased and dictionary selection is performed again.
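One way this re-selection loop might look is sketched below. The patent gives no concrete values or interfaces, so the parameters `initial_samples`, `reject_limit`, and `sample_step`, and the caller-supplied `select` and `recognize` callables, are all assumptions made for illustration.

```python
from typing import Callable, List, Optional, Sequence, Tuple, TypeVar

P = TypeVar("P")  # whatever type represents one character pattern


def read_form(
    patterns: Sequence[P],
    select: Callable[[Sequence[P]], str],
    recognize: Callable[[P, str], Optional[str]],
    initial_samples: int = 5,
    reject_limit: int = 3,
    sample_step: int = 5,
) -> Tuple[str, List[Optional[str]]]:
    """Read a form with one dictionary; reselect it if rejects pile up.

    `select` picks a dictionary name from sample patterns; `recognize`
    returns a label, or None for a reject. Both are hypothetical hooks.
    """
    n = min(initial_samples, len(patterns))
    while True:
        chosen = select(patterns[:n])
        results: List[Optional[str]] = []
        rejects = 0
        for pattern in patterns:           # restart from the first character
            answer = recognize(pattern, chosen)
            if answer is None:
                rejects += 1
                # Retry only while more samples remain to be added.
                if rejects > reject_limit and n < len(patterns):
                    break
            results.append(answer)
        else:
            return chosen, results
        n = min(n + sample_step, len(patterns))   # enlarge the sample set
```

The sample count grows on every retry and is capped at the form length, so the loop always terminates; once all characters have been used as samples, the remaining rejects are simply returned as `None`.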

In an OCR whose structure does not allow going back to re-read, the same result can be obtained by saving the answers of every dictionary during the dictionary selection stage as well, and adopting the answers of the chosen dictionary as the official answers once the dictionary is decided.
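A minimal sketch of this single-pass variant follows. The `match` callable, which returns a label and a similarity for one pattern against one named dictionary, is a hypothetical interface; the patent does not define one.

```python
from typing import Callable, Dict, List, Sequence, Tuple, TypeVar

P = TypeVar("P")  # whatever type represents one character pattern


def select_with_saved_answers(
    samples: Sequence[P],
    dictionary_names: Sequence[str],
    match: Callable[[P, str], Tuple[str, float]],
) -> Tuple[str, List[str]]:
    """Choose a dictionary in one pass, keeping every dictionary's answers.

    Each sample is matched against every dictionary exactly once. The
    provisional answers are stored per dictionary so that, once the best
    dictionary is known, its answers can be adopted without re-reading.
    """
    scores: Dict[str, float] = {name: 0.0 for name in dictionary_names}
    answers: Dict[str, List[str]] = {name: [] for name in dictionary_names}
    for pattern in samples:
        for name in dictionary_names:
            label, score = match(pattern, name)
            scores[name] += score
            answers[name].append(label)
    chosen = max(scores, key=lambda name: scores[name])
    return chosen, answers[chosen]
```

The cost is extra storage proportional to the number of dictionaries times the number of sample characters, in exchange for never having to rewind the input.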

The present invention is configured as described above and provides the following effects: (i) all dictionaries are used for the first few characters, but reading thereafter uses a single dictionary, so higher accuracy and higher speed are possible than with mix-multi; (ii) the dictionaries are easy to develop.

[Brief explanation of drawings]

The figure is a block diagram showing the functions of one embodiment of the present invention.

1: multi-font dictionary, 2: recognition unit, 3: score storage register, 4: comparator.

Claims

[Claims]

1. A multi-font character reading device characterized by comprising: a group of dictionaries configured independently for each font; means for first reading a fixed number of characters using these dictionaries, adding, for each dictionary, the highest similarity value obtained from that dictionary for each character, comparing the addition results, and selecting the dictionary to be used; and means for reading characters based on the selected dictionary and causing dictionary selection to be performed again when reading rejects exceed a predetermined limit.