JPS60201480A - Character read system - Google Patents

Character read system

Info

Publication number
JPS60201480A
JPS60201480A JP5793984A JP5793984A JPS60201480A JP S60201480 A JPS60201480 A JP S60201480A JP 5793984 A JP5793984 A JP 5793984A JP 5793984 A JP5793984 A JP 5793984A JP S60201480 A JPS60201480 A JP S60201480A
Authority
JP
Japan
Prior art keywords
character
block
reading
katakana
recognition
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
JP5793984A
Other languages
Japanese (ja)
Inventor
Fumiaki Taira
平 二三章
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
NEC Corp
Original Assignee
NEC Corp
Nippon Electric Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by NEC Corp, Nippon Electric Co Ltd filed Critical NEC Corp
Priority to JP5793984A priority Critical patent/JPS60201480A/en
Publication of JPS60201480A publication Critical patent/JPS60201480A/en
Pending legal-status Critical Current

Links

Landscapes

  • Character Discrimination (AREA)

Abstract

PURPOSE:To improve the reading accuracy by separating a read area for designating plural character kinds with a space of a >one character so as to divide a data at each character kind set for the recognition again. CONSTITUTION:A read area on a form is scanned, inputted to a line buffer 11 as a white/black binary-coding image data, and a character image is extracted (12) at each character, a character kind 18 extracted by a parameter and one read area is recognized (13) and the data is separated to each character block. Then the character block is checked for the content by character kind set information 19 designated by the parameter, an optimum character kind set is decided (15), the data is recognized (16) again at each character block and a read area on the form is read. The read area designated for plural character kinds is a block of KATAKANA (square form of Japanese syllabary) and alphanumeric characters where the character kinds are mixed with, e.g., numerals, English letters and KATAKANA. This block is read by a recognition block 13 by a dictionary reading the mixture of numeral, English letter and KATAKANA, the result of recognition is outputted while the KATAKANA is taken as A, the English letter is taken as B by a discrimination output block 14 as to each character kind set.

Description

【発明の詳細な説明】 (技術分野) 本発明は文字読み取り方式に関し、複数の文字種で指定
された帳票上の読み取りエリアを文字種セットごとにス
ペースで分離することにより文字読み取り精度を同上さ
せることができる文字読み取り方式に関する。
[Detailed Description of the Invention] (Technical Field) The present invention relates to a character reading method, and it is possible to improve character reading accuracy by separating the reading area on a form specified by multiple character types with a space for each character type set. Regarding possible character reading methods.

(従来技術) 従来光学文字読み取り装置においては複数の文字種で1
つの読み取りエリアを読み取る場合、混在の認識辞書で
判定しなければならず単独の認識辞書で読むのに比較し
て読み取り精度が低下するという欠点を有していた。
(Prior art) In conventional optical character reading devices, one
When reading two reading areas, judgment must be made using a mixed recognition dictionary, which has the disadvantage that reading accuracy is lower than when reading with a single recognition dictionary.

また従来の文字読み取り装置では混在読み取りの場合混
在する文字種が増えるほどに読み取り精度が低下する欠
点を有していた。
In addition, conventional character reading devices have the disadvantage that in mixed reading, the reading accuracy decreases as the number of mixed character types increases.

(発明の目的) 本発明の目的は従来の光学文字読み取り装置におけるか
かる欠点を除去すると共に読み取りエリアに複数の文字
種を冶する混在読み取りにおける読み取り!′#度を向
上せしめる文字読み取り方式を提供することにめる。
(Object of the Invention) The object of the present invention is to eliminate such drawbacks in conventional optical character reading devices, and to read in mixed reading by providing a plurality of character types in the reading area! '# We aim to provide a method for reading characters that improves reading comprehension.

(発明の構成) 本発明によれは、光学文字読み取り装置における文字読
み取り方式に於いて、複数の又字種を記入する少なくと
も1つの読み取りエリアを有する帳票と、前記読み取り
エリアを複数の文字種で読み、文字種セ9トごとに1文
字以上のスペースで文字ブロックに分離する手段と、前
記文字ブロックの文字種セットする手段と、該判定手段
により判定された文字種セ・ント情報を判定された文字
種にて、再度認識する手段とt有し、文字種ごとに文字
を読み取ることにより文字読み取り精度を向上させるよ
うにしたことを特徴とする文字読み取り方式が得られる
(Structure of the Invention) According to the present invention, in a character reading method in an optical character reading device, there is provided a form having at least one reading area in which a plurality of character types are written, and a form that reads the reading area in a plurality of character types. , means for separating each character type set into character blocks with one or more spaces, means for setting the character type of the character block, and character type set information determined by the determining means based on the determined character type. , a re-recognition means, and t, and a character reading method is obtained which is characterized in that character reading accuracy is improved by reading characters for each character type.

(実施例) 次に本発明の実施例について図面を参照して説明する。(Example) Next, embodiments of the present invention will be described with reference to the drawings.

m1図は本発明の〜実施例會示す。第1図において、本
発明一実施例は光学文字読み取り装置における文字読み
取り方式において、帳票に複数の文字種を記入し得る少
なくとも1つの読み取りエリアを有し、この読み取りエ
リア倉光学的に読み、白黒の2値化イメージデータを一
時記憶するラインバッファ11.!:、該ラインバッフ
ァ11のデータから1文字毎に文字イメージを抽出する
文字抽出s12と、前記文字抽出部12の文字イメージ
をパラメータで指定される文字種18により読み取りエ
リアを認識する認識ブロック13と、該認識ブロック1
3の出力信号により各読み取りエリアを文字プロ、り毎
にスペースで分離し、前記文字プロ・ツクの文字種を判
定する判定出力ブロック14と、該判定出刃ブロック1
4からのスペースで分離された文字ブロックの内容をパ
ラメータにより指定される文字種セット情報19により
チェックし、最適な文字種セットに判定する文字種ブロ
ック判定部15と、前記ブロックごとに再腿認#!を行
う文字種ブロック認識部16と、前記認識結果にもとう
@最終的に出力する最終判定プロ、ツク17と奮含む。
Figure m1 shows embodiments of the present invention. In FIG. 1, one embodiment of the present invention is a character reading method in an optical character reading device, which has at least one reading area in which a plurality of character types can be written on a form, and this reading area is optically readable and black and white. Line buffer 11 for temporarily storing binarized image data. ! :, a character extraction s12 that extracts a character image for each character from the data of the line buffer 11; a recognition block 13 that recognizes a reading area of the character image of the character extraction unit 12 according to a character type 18 specified by a parameter; The recognition block 1
A judgment output block 14 separates each reading area by a space for each character block according to the output signal of 3, and judges the character type of the character block 1;
A character type block determination unit 15 checks the contents of character blocks separated by spaces from 4 to 4 using character type set information 19 specified by parameters, and determines the optimal character type set. It also includes a character type block recognition unit 16 that performs the above recognition, and a final judgment processor 17 that finally outputs the recognition result.

すなわち、本実施例は帳票上の読み取りエリアを走査し
、白黒の2値化イメージデータとしてラインバッファl
に入力し、更に1文字毎に文字イメージ全抽出する。次
にパラメータで指定される文字種18を使用して1つの
読み取りエリアを認識して各文字ブロックに分離する。
That is, in this embodiment, the reading area on the form is scanned and the line buffer l is scanned as black and white binary image data.
, and then extract all character images for each character. Next, one reading area is recognized using the character type 18 specified by the parameter and separated into each character block.

次に文字ブロックはパラメータにより指定される文字種
セット情報19により内容がチェックされ、最適な文字
種セラ)1−判定し、文字ブロックごとに再度認識を行
い帳票上の読取りエリアを読み取るものである。
Next, the contents of the character blocks are checked using the character type set information 19 specified by the parameters, and the optimum character type is determined, and each character block is recognized again to read the reading area on the form.

第2図は本実施例に用いられる複数文字種指定の読み取
りエリアを示す。第2図において、このブロックは字種
が、たとえは数字、英字、およびカタカナが混在した(
カメカナ)と(数字、英字ンのブロックである。このブ
ロックは数字、英字。
FIG. 2 shows a reading area for specifying multiple character types used in this embodiment. In Figure 2, this block contains a mixture of character types, such as numbers, alphabets, and katakana (
This block consists of Kamekana) and (numbers and alphabets.) This block consists of numbers and alphabets.

カタカナの混在読み取り辞書で認識ブロック13により
胱与取り、判定出力ブロック14に、J:り認R紹果が
第3図に示すようにカタカナがA、数字。
In the katakana mixed reading dictionary, the recognition block 13 shows that the recognition block 13 shows that the judgment output block 14 shows that J: ri ren R shogu is katakana as A and numbers as shown in Figure 3.

英字がBとして文字種セットごとに指定され、出力され
る。文字種ブロヅク判定部15では又字極セ、シトに人
がカタカナ、Bが数字および英字のブ 5− ロックであることが分るので、A全カタカナにて再度文
字種ブロック認識部16により判定を行い、まfcBを
数字、英字にて再度文字種プロ・ツク認識s6により判
定を行うことにより最終的には第4図の結果が最終判定
ブロック17に出力される。
The alphabetic character is designated as B for each character type set and output. The character type block recognition unit 15 determines that ``A'' is a block of numbers and alphabetic characters, and ``A'' is a block of numbers and letters. , and fcB as numbers and alphabets are again judged by the character type recognition process s6, and finally the result shown in FIG. 4 is output to the final judgment block 17.

第5図は本方式で指定可能な文字種セットを示す。B1
指定時は(数字、英字)、(カタカナ)。
FIG. 5 shows character type sets that can be specified using this method. B1
When specified: (numbers, alphabetic characters), (katakana).

(記号)の文字種セ、ソト指定となる。同様にB2−B
6指定時においても各々の文字種セット指定となる。な
お、Nは数字とマイナス、Aは英字とマイナス、Kはカ
タカナとマイナス、Sは記号とマイナスである。字種セ
ットは第5図以外にも第6図の組合せも可能でおる。第
6図はたとえは(数字、英字、カタカナ)と(記号)を
文字種セットとする場合を示す。第6図において、文字
種セリ) td、 ” BI B2 ”以外に”BIB
6”、 ”B2B6”。
(symbol) character type se, soto specification. Similarly B2-B
Even when 6 is specified, each character type set is specified. Note that N is a number and a minus, A is an alphabetic character and a minus, K is a katakana and a minus, and S is a symbol and a minus. In addition to the character type set shown in Fig. 5, combinations shown in Fig. 6 are also possible. FIG. 6 shows a case where (numbers, alphabets, katakana) and (symbols) are used as a character type set. In Figure 6, in addition to character types td, ``BI B2'', ``BIB''
6”, “B2B6”.

’BIB2B6″′のいずれでも指定可能である。Either 'BIB2B6''' can be specified.

(発明の効果ン 本発明は以上説明したようにla数の文字種指定の読み
取りエリア’に1文字以上のスペースで分離6一 することにより文字種セットごとに分割して再認識を行
うため読み取!ll梢度を向上させることができる。
(Effects of the Invention) As explained above, the present invention separates each character type set by one or more spaces in the reading area designated by the number of character types, thereby re-recognizing each character type set. It can improve the degree of treetopness.

【図面の簡単な説明】[Brief explanation of drawings]

第1図は本発明の一実施例を部分的にブロック図で示し
た図、第2図は複数文字種指定の読み取りエリア例を示
す図、第3図は第2図?C第1図の認識ブロックにより
判定された判定出力ブロックの結果の例を示す図、第4
図は第1図の文字種ブロック認識部により判定された最
終判定ブロックの結果の例會示す図、第5図および第6
図は本発明の実施例における指定可能な文字種セ・ソト
ヲ示す図である。 11・・・・・・ラインパ9ファ、12・・・・・・文
字抽出部、13・・・・・・認識ブロック、14・・・
・・・判定出力ブロック、15・・・・・・文字種ブロ
ック判定部、16・・・・・・文字柚ブロック認識都、
17・・・・・・最終判定ブロック、18・・・・・・
複数の文字種パラメータ、19・・・・・・文字図 区
 区 CXJ(ホ)寸 保 殻 象
FIG. 1 is a partial block diagram showing an embodiment of the present invention, FIG. 2 is a diagram showing an example of a reading area for specifying multiple character types, and FIG. C A diagram showing an example of the result of the judgment output block judged by the recognition block in Fig. 1, 4th
The figures show examples of the results of the final judgment block judged by the character type block recognition unit in Fig. 1, and Figs. 5 and 6.
The figure is a diagram showing character types that can be specified in an embodiment of the present invention. 11... line parr 9, 12... character extraction section, 13... recognition block, 14...
... Judgment output block, 15... Character type block judgment section, 16... Character Yuzu block recognition capital,
17...Final judgment block, 18...
Multiple character type parameters, 19...Character diagram Ku Ku CXJ (E) Dimension Shell Elephant

Claims (1)

【特許請求の範囲】[Claims] 光学文字読み取り装置における文字読み取り方式に於い
て、複数の文字種を記入する少なくとも1つの読み取り
エリアを有する帳票と、前記読み取りエリアを複数の文
字種で読み文字種セットごとにL文字以上のスペースで
文字ブロックに分離する手段と、前記文字プロ・ツクの
文字fiiヲ判定する手段と、該判定手段により判定さ
れた文字種セット情報を判定された文字種にて再度認識
する手段とを有し、文字種ごとに文字を読み取ることに
より文字読み取り精度を向上させるようにしたことt特
徴とする文字睨み取v方式。
In a character reading method in an optical character reading device, there is provided a form having at least one reading area in which a plurality of character types are written, and the reading area is read with a plurality of character types into a character block with a space of L characters or more for each character type set. Separating means, means for determining the character fiiwo of the character program, and means for re-recognizing the character type set information determined by the determining means using the determined character type. The character staring method is characterized by improving character reading accuracy by reading the characters.
JP5793984A 1984-03-26 1984-03-26 Character read system Pending JPS60201480A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
JP5793984A JPS60201480A (en) 1984-03-26 1984-03-26 Character read system

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
JP5793984A JPS60201480A (en) 1984-03-26 1984-03-26 Character read system

Publications (1)

Publication Number Publication Date
JPS60201480A true JPS60201480A (en) 1985-10-11

Family

ID=13069998

Family Applications (1)

Application Number Title Priority Date Filing Date
JP5793984A Pending JPS60201480A (en) 1984-03-26 1984-03-26 Character read system

Country Status (1)

Country Link
JP (1) JPS60201480A (en)

Similar Documents

Publication Publication Date Title
JP3139521B2 (en) Automatic language determination device
JP3253356B2 (en) Document image area identification method
US4811412A (en) Method of a system for analyzing characters
US5502777A (en) Method and apparatus for recognizing table and figure having many lateral and longitudinal lines
JPH0452510B2 (en)
JPS60201480A (en) Character read system
JPH0991371A (en) Character display device
JPH0514952B2 (en)
JPS59158482A (en) Character recognizing device
JPS58125183A (en) Method for displaying unrecognizable character in optical character reader
JPH0564396B2 (en)
JPH01255986A (en) Preparation of multi-font dictionary
KR100210492B1 (en) Character recognition device and method
JPH05217017A (en) Optical character reader
JPH0715702B2 (en) Character pattern cutting device
JPS6160189A (en) Optical character reader
JPS60110089A (en) Character recognizer
JP2578767B2 (en) Image processing method
JPH01277989A (en) Character string pattern reader
JPS5953984A (en) Character recognizing device
JPH08129608A (en) Character recognition device
JPH0576674B2 (en)
JPS59205679A (en) Character segmentation device
JPS58101378A (en) Manuscript document reading method
JP2000020638A (en) Character string direction discriminating method