JPS59189481A - Character recognizing device - Google Patents

Character recognizing device

Info

Publication number
JPS59189481A
Authority
JP
Japan
Prior art keywords
character
picture
black
pixels
pixel
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
JP58066125A
Other languages
Japanese (ja)
Inventor
Minoru Nagao
永尾 実
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Omron Corp
Original Assignee
Tateisi Electronics Co
Omron Tateisi Electronics Co
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Tateisi Electronics Co, Omron Tateisi Electronics Co
Priority to JP58066125A
Publication of JPS59189481A

Landscapes

  • Character Discrimination (AREA)

Abstract

PURPOSE: To improve the processing efficiency of character recognition and to keep the recognition speed constant by counting contour black pixels while scanning the character image and judging, for each constituent pixel, whether it is a black pixel and whether it is a contour black pixel. CONSTITUTION: An unknown character 2 printed on a sheet of paper 1 is read by a pattern reading device 3 consisting of a CCD or the like, and its optical pattern is converted into a time-series digital signal by an A/D converter 4. A preprocessing circuit 5 removes noise from the digital signal, binarizes it at a specified threshold level, and stores the resulting character image in an image memory 6. The image memory 6 is made up of rows and columns of pixels, and each pixel holds the image data of the binarized character image. A dictionary matching circuit 8 matches the features of the character image obtained by a feature extraction circuit 7 against the features of reference patterns stored beforehand in a dictionary 9, and outputs the matching result.

Description

[Detailed Description of the Invention] <Technical Field of the Invention> The present invention relates to a character recognition device that reads the optical pattern of an unknown character, symbol, or the like (hereinafter "unknown character"), binarizes it into black and white to obtain a character image, and then recognizes the unknown character by matching the character image against standard patterns. In particular, it relates to a character recognition device that, when an unknown character cannot be recognized because its lines are too thick or too thin, compares the total number of black pixels in the character image with the total number of black pixels forming its outline (hereinafter "contour black pixels") in order to determine and adjust the line width of the character.

<Background of the Invention> In a conventional character recognition device, when the optical pattern of an unknown character is binarized, the line width of the character image grows or shrinks with the threshold level. As a result, the comparison with the corresponding standard pattern may fail to match, and the character may be judged unrecognizable. In that case, the threshold level is changed to adjust the line width of the character, and the pattern matching process is executed again. A commonly adopted method of judging the line width is to obtain the total number of black pixels in the character image and the total number of contour black pixels, calculate the ratio of the two, and judge the line width of the character image from the size of that ratio. To calculate the total number of contour black pixels, the conventional approach scans the constituent pixels of the image memory holding the character image one after another. When it detects a black pixel that belongs to a character outline, it traces that outline and counts its contour black pixels, repeats the same tracing for every other outline, and then totals the contour black pixels over all outlines. With this kind of method, calculating the total number of contour black pixels takes a particularly long time, which greatly reduces the processing efficiency of character recognition. In addition, because the number of outlines differs from character to character, the time needed to calculate the total varies, so the character recognition speed is not constant.

<Object of the Invention> The present invention abandons the method of tracing character outlines. Its object is to provide a new character recognition device that improves the processing efficiency of character recognition, and whose recognition speed does not fluctuate, by counting contour black pixels while scanning the character image and judging, for each constituent pixel, whether it is a black pixel and whether it is a contour black pixel.

<Structure and Effects of the Invention> To achieve the above object, the present invention scans an image data storage means, which holds black-and-white binarized image data, with a mask having a fixed field of view. For the center pixel of the mask it performs black/white discrimination and contour discrimination to obtain a black pixel detection signal and a contour black pixel detection signal, and it counts these signals to calculate the total number of black pixels and the total number of contour black pixels in the character image.

According to the present invention, the total number of black pixels and the total number of contour black pixels are both obtained in a single sequential scan of the image data storage means. Compared with the conventional method of tracing character outlines, this greatly shortens the processing time for line width determination, and hence for character recognition, and it prevents the recognition time from varying from character to character. The object of the invention is thereby achieved.

<Description of Embodiments> Fig. 1 shows the overall configuration of the character recognition device.

In the illustrated example, an unknown character 2 printed on a sheet of paper 1 is read by a pattern reading device 3 consisting of a CCD or the like, and its optical pattern is converted into a time-series digital signal by an A/D converter 4. A preprocessing circuit 5 removes noise from the digital signal, binarizes it at a predetermined threshold level, and stores the resulting character image in an image memory 6. The image memory 6 is composed of rows and columns of pixels, and each pixel holds the image data of the binarized character image. A feature extraction circuit 7 extracts from the character image stored in the image memory 6 several features needed for character recognition (intersections, branch points, number of loops, stroke lengths, and so on).
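
The binarization step performed by the preprocessing circuit 5 can be illustrated with a minimal Python sketch. The function name `binarize` and the sample values are hypothetical, and the sketch assumes that each sample measures darkness (a higher value means a darker point), which the text does not state explicitly.

```python
def binarize(gray, threshold):
    """Return a 0/1 image in which 1 marks a black pixel.

    Assumption: each sample measures darkness, so a sample at or above
    the threshold becomes black. The source does not state the polarity.
    """
    return [[1 if value >= threshold else 0 for value in row] for row in gray]


# Hypothetical 2 x 3 grey-level samples from the reading device.
gray = [
    [10, 200, 30],
    [20, 180, 90],
]
image = binarize(gray, threshold=128)
```

Under this assumption, lowering the threshold turns more samples black and thickens the strokes, which is the effect the later line width adjustment relies on.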

A dictionary matching circuit 8 matches the features of the character image obtained by the feature extraction circuit 7 against the features of standard patterns stored in advance in a dictionary 9, and outputs the matching result. If the character cannot be recognized because of its line width, the threshold level is changed and the same processing is executed again. In the figure, a CPU (Central Processing Unit) 10 reads and writes the various data needed for character recognition to and from a RAM (Random Access Memory) 11, controls the operation of each of the above circuits, and executes various calculations and processes.

Fig. 2 shows the character line width determination circuit. The three RAMs 12A, 12B, and 12C in the figure take in three rows of image data stored in the image memory 6, and three pixels of image data from each of the RAMs 12A to 12C are set in the shift registers 13A, 13B, and 13C, respectively.

Counters 14A, 14B, and 14C set the addresses of the RAMs 12A to 12C. As they count, image data is read out sequentially from each of the RAMs 12A to 12C, and the shift registers 13A to 13C all shift to the right at the same time. Each of the shift registers 13A to 13C therefore holds three pixels of image data from one of the three rows, and the simultaneous shifting of the registers has the same effect as scanning the constituent pixels of the image memory 6 in the row direction with a mask W (shown in Fig. 3) whose field of view is 3 pixels high by 3 pixels wide.
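
The way three line buffers and three 3-bit shift registers reproduce a sliding 3 x 3 mask can be sketched in Python as follows. This is only an illustration under assumptions: the function name `scan_windows` is hypothetical, pixels outside the image are treated as white, and the order in which bits enter the registers is chosen for convenience rather than taken from Fig. 2.

```python
def scan_windows(image):
    """Yield (row, col, window) for every pixel of a 0/1 image.

    window is a 9-tuple (a, b, c, d, e, f, g, h, i) read row by row,
    with e the center pixel. Pixels outside the image count as white
    (an assumption; the source does not describe the border).
    """
    height, width = len(image), len(image[0])
    blank = [0] * width
    for r in range(height):
        # Three "line RAMs": the row above, the current row, the row below.
        lines = [
            image[r - 1] if r > 0 else blank,
            image[r],
            image[r + 1] if r + 1 < height else blank,
        ]
        # Three 3-bit shift registers, cleared at the start of each row.
        regs = [[0, 0, 0] for _ in range(3)]
        for k in range(width + 1):
            # Shift all three registers at once, feeding in column k
            # (or white padding once the row is exhausted).
            for reg, line in zip(regs, lines):
                reg.pop(0)
                reg.append(line[k] if k < width else 0)
            if k >= 1:
                # The registers now hold columns k-2, k-1, k,
                # so the center pixel is at column k-1.
                yield r, k - 1, tuple(regs[0] + regs[1] + regs[2])


image = [
    [1, 0],
    [0, 1],
]
windows = list(scan_windows(image))
```

Each step costs one shift per register regardless of the image content, which is why the scan time does not depend on the shape of the character.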

The bit outputs of the shift registers 13A to 13C are input to a table ROM (Read Only Memory) 15 in which pixel determination data is stored. The table ROM 15 is used to determine whether the center pixel e of the mask W is a black pixel and, further, whether it is a contour black pixel, and it outputs a black pixel detection signal i and a contour black pixel detection signal j based on the pixel determination data.

Fig. 4 illustrates the principle for determining contour black pixels. The center pixel e in the mask W is taken to be a contour black pixel when e is black and one to three of its four neighboring pixels b, d, f, and h are black. In this embodiment, the nine bit outputs of the shift registers 13A to 13C (corresponding to pixels a to i) form the 9-bit address of the table ROM 15. Addresses whose bit pattern corresponds to such a black pixel arrangement hold data indicating a contour black pixel, and addresses whose bit pattern has the center pixel e black hold data indicating a black pixel. For example, if the shaded pixels in Fig. 4 are assumed to be black, the address "X0X 011 X1X", in which pixels e, h, and f are "1" (X may be either "1" or "0"), holds data indicating both a black pixel and a contour black pixel.
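
The contents of such a 512-entry table can be generated from the rule above, as in the following Python sketch. Several details are assumptions made for illustration: pixel a is taken as the most significant address bit and pixel i as the least significant (matching the written order of "X0X 011 X1X"), and each entry is encoded as two flag bits. The names `BLACK_FLAG`, `CONTOUR_FLAG`, and `build_table` are hypothetical.

```python
BLACK_FLAG = 0b01    # hypothetical encoding: center pixel e is black
CONTOUR_FLAG = 0b10  # hypothetical encoding: e is a contour black pixel


def build_table():
    """Build the 512-entry lookup table addressed by pixels a..i.

    Assumption: pixel a is address bit 8 and pixel i is address bit 0.
    """
    table = [0] * 512
    for address in range(512):
        a, b, c, d, e, f, g, h, i = [(address >> (8 - n)) & 1 for n in range(9)]
        if e:
            entry = BLACK_FLAG
            # Contour rule from the text: 1 to 3 of b, d, f, h are black.
            if 1 <= b + d + f + h <= 3:
                entry |= CONTOUR_FLAG
            table[address] = entry
    return table


TABLE = build_table()

# The example address "X0X 011 X1X" with every X set to 0, then to 1.
example_low = 0b000_011_010
example_high = 0b101_011_111
```

The corner pixels a, c, g, and i never affect the result, which is why they appear as "X" in the example address.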

The table ROM 15 thus outputs the black pixel detection signal i based on the black pixel data and the contour black pixel detection signal j based on the contour black pixel data, and the occurrences of the signals i and j are counted by counters 16 and 17, respectively.

The outputs of the counters 16 and 17 are connected to the data bus of the CPU 10. When scanning of the image memory 6 is complete, an I/O instruction of the CPU 10 reads the final count data N1 and N2 of the counters 16 and 17 into the CPU 10 via the data bus. One of these, N1, corresponds to the number of contour black pixels in the character image, and the other, N2, corresponds to the number of black pixels. The CPU 10 calculates the ratio S = N1/N2 and judges from the size of S whether the lines of the character are thick or thin. For example, for the character image of the letter "F" shown in Fig. 5, scanning with the mask W detects the contour black pixels marked ○ in the figure and the other black pixels marked ×. The line width of the character is then judged from the ratio of the number of ○ pixels to the number of ○ and × pixels combined.
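
A software equivalent of the two counters and the ratio calculation might look like the Python sketch below. It applies the contour rule directly instead of going through a lookup table, treats pixels outside the image as white (an assumption), and uses the hypothetical names `count_pixels` and `line_width_ratio`.

```python
def count_pixels(image):
    """Return (n1, n2): contour black pixel count and black pixel count."""
    height, width = len(image), len(image[0])

    def pixel(r, c):
        # Pixels outside the image are treated as white (an assumption).
        return image[r][c] if 0 <= r < height and 0 <= c < width else 0

    n1 = n2 = 0
    for r in range(height):
        for c in range(width):
            if not pixel(r, c):
                continue
            n2 += 1
            neighbours = (pixel(r - 1, c) + pixel(r, c - 1)
                          + pixel(r, c + 1) + pixel(r + 1, c))
            if 1 <= neighbours <= 3:
                n1 += 1
    return n1, n2


def line_width_ratio(image):
    """Return S = N1 / N2, or 0.0 for an image with no black pixels."""
    n1, n2 = count_pixels(image)
    return n1 / n2 if n2 else 0.0


# A one-pixel-wide vertical stroke and a solid 5 x 5 block.
thin = [[0, 1, 0], [0, 1, 0], [0, 1, 0]]
thick = [[1] * 5 for _ in range(5)]
```

In this sketch every pixel of the thin stroke is a contour pixel, so S is 1.0, while the solid block has 16 contour pixels out of 25 and S is 0.64. This suggests that a smaller S indicates thicker lines, although the text itself only says that the judgement depends on the size of S.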

If the lines of the character are thick, the threshold level of the preprocessing circuit 5 is reset to a higher value; if they are thin, the threshold level is reset to a lower value. This adjusts the line width of the character image stored in the image memory 6, and the same character matching process is then executed again.
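
The adjustment rule can be written as a small Python function. Everything numeric here is hypothetical: the text gives neither the limits on S nor the step size, and the assumption that a small S means thick lines is inferred from the contour rule rather than stated. The name `adjust_threshold` and its parameters are illustrative only.

```python
def adjust_threshold(threshold, s, thick_below=0.5, thin_above=0.9, step=8):
    """Return a new threshold level for the next binarization pass.

    Assumptions: a small ratio S means thick lines and a large S means
    thin lines; thick_below, thin_above, and step are placeholder values.
    """
    if s < thick_below:
        return threshold + step  # lines too thick: raise the threshold
    if s > thin_above:
        return threshold - step  # lines too thin: lower the threshold
    return threshold             # line width acceptable: keep the level
```

For instance, `adjust_threshold(128, 0.3)` raises the level to 136 and `adjust_threshold(128, 0.95)` lowers it to 120, after which the image would be binarized and matched again.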

[Brief Description of the Drawings]

Fig. 1 is a block diagram showing the overall circuit configuration of the character recognition device, Fig. 2 is a block diagram showing the character line width determination circuit, Fig. 3 is a diagram explaining the mask set for the image memory, Fig. 4 is a diagram showing the range of pixels within the field of view of the mask, and Fig. 5 is a diagram explaining the arrangement of contour black pixels and other black pixels in an example character image.

6: image memory
10: CPU
13A, 13B, 13C: shift registers
15: table ROM
16, 17: counters
W: mask

Patent applicant: Tateisi Electronics Co.

Claims (1)

[Claims] A character recognition device having a function of determining the line width of a character by comparing the total number of black pixels in a character image with the total number of contour black pixels, the device comprising: image data storage means for storing black-and-white binarized image data in a plurality of pixels; scanning detection means for scanning the image data storage means with a mask having a fixed field of view, performing black/white discrimination and contour discrimination on the center pixel of the mask, and outputting a black pixel detection signal and a contour black pixel detection signal; and counting means for counting each signal output of the scanning detection means to calculate the total number of black pixels and the total number of contour black pixels in the character image.
JP58066125A 1983-04-13 1983-04-13 Character recognizing device Pending JPS59189481A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
JP58066125A JPS59189481A (en) 1983-04-13 1983-04-13 Character recognizing device

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
JP58066125A JPS59189481A (en) 1983-04-13 1983-04-13 Character recognizing device

Publications (1)

Publication Number Publication Date
JPS59189481A true JPS59189481A (en) 1984-10-27

Family

ID=13306837

Family Applications (1)

Application Number Title Priority Date Filing Date
JP58066125A Pending JPS59189481A (en) 1983-04-13 1983-04-13 Character recognizing device

Country Status (1)

Country Link
JP (1) JPS59189481A (en)

Similar Documents

Publication Publication Date Title
US4750209A (en) System for processing an image having both letter and photographic information
JPS63261486A (en) Writing style identifying device
JPS62254282A (en) Method and apparatus for separating overlapped pattern
JPS59189481A (en) Character recognizing device
JPH02293989A (en) character recognition device
JP2503208B2 (en) Business card image processing method
JP2590099B2 (en) Character reading method
JP3064391B2 (en) Character recognition method
JPS6037649Y2 (en) character reading device
JPH0535921A (en) Character matching device and method therefor
JPH0660226A (en) Character reader
JP2980636B2 (en) Character recognition device
JPH0586585B2 (en)
JPH02166583A (en) Character recognizing device
JPS5914078A (en) Reader of business form
CN114612339A (en) Image preprocessing method, device, equipment and storage medium
JP2972443B2 (en) Character recognition device
JPH0677270B2 (en) Character recognition device
JP2902904B2 (en) Character recognition device
JPS6020785B2 (en) Character pattern buffer
JPH01201788A (en) Character reading method
JPH04291479A (en) Ruled line recognition method and table processing method
JPS60140487A (en) Character feature extraction method
JPH0433074B2 (en)
JPS59205681A (en) Character reader