JPS60201486A - Handwritten document reading method - Google Patents

Handwritten document reading method

Info

Publication number
JPS60201486A
JPS60201486A JP59057420A JP5742084A JPS60201486A JP S60201486 A JPS60201486 A JP S60201486A JP 59057420 A JP59057420 A JP 59057420A JP 5742084 A JP5742084 A JP 5742084A JP S60201486 A JPS60201486 A JP S60201486A
Authority
JP
Japan
Prior art keywords
character
designation
dictionary
handwritten
character type
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
JP59057420A
Other languages
Japanese (ja)
Other versions
JPH0330191B2 (en
Inventor
Shigeru Goto
茂 後藤
Yoshiyuki Yamashita
山下 義征
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Oki Electric Industry Co Ltd
Original Assignee
Oki Electric Industry Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Oki Electric Industry Co Ltd filed Critical Oki Electric Industry Co Ltd
Priority to JP59057420A priority Critical patent/JPS60201486A/en
Publication of JPS60201486A publication Critical patent/JPS60201486A/en
Publication of JPH0330191B2 publication Critical patent/JPH0330191B2/ja
Granted legal-status Critical Current

Links

Classifications

    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06VIMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V10/00Arrangements for image or video recognition or understanding
    • G06V10/10Image acquisition

Landscapes

  • Engineering & Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • General Physics & Mathematics (AREA)
  • Multimedia (AREA)
  • Theoretical Computer Science (AREA)
  • Character Input (AREA)
  • Character Discrimination (AREA)

Abstract

PURPOSE:To read a handwritten Japanese document with high speed and high precision by detecting a character kind designation in advance, detecting the character lines of a character without designation so as to select the character kind and using a dictionary proper to the character kind so as to identify the character. CONSTITUTION:A character type designation detecting sections 5 divides a pattern register 2 into four areas corresponding to a character kind designation describing frame 22, number of black points in each area is counted, compared with a threshold value, the presence or absence of the designation of the character kind is detected and the presence or absence of designation of the character kind is transmitted to an identification section 6. A character line amount detection section 4 detects the character lines from the character pattern in the pattern register 2 at the same time, the complicated degree D of the character is detected by normalizing the result with the size of the character, and when the complicated degree D of the character is detected, the D is fed to the character kind designation detection section 5. When the character kind designation is not detected on the character kind designation area at the character kind designation detection section 5, the complicated degree D is used, the character kind is decided and the result is fed to the identification section. In the identification section 6, the feature fed from the feature extraction section 3 and the dictionary are collated by using the dictionary memory of the character kind having the designation of character kind corresponding to each character in advance.

Description

【発明の詳細な説明】 (技術分野) 本発明は高速で精度の良い手書文書の読取方法に関する
ものである。
DETAILED DESCRIPTION OF THE INVENTION (Technical Field) The present invention relates to a method for reading handwritten documents at high speed and with high accuracy.

(背景技術) これまでに手書文書の読取方法として記入した文字の文
字種を指定する記入枠内の指定の有無を検出し、指定を
検出したカラムの文字の読取を指定された文字種の辞書
のみを参照して行う方法が提案されている。しかしなが
ら、この方法では、文字種の指定がなかった場合、全て
の辞書を参照しなければならず処理速度が遅くなるとい
う問題があった。
(Background technology) As a method for reading handwritten documents, the presence or absence of a specification in a writing frame that specifies the character type of the written character is detected, and the characters in the column where the specification is detected are read only in a dictionary of the specified character type. A method has been proposed that refers to However, this method has the problem that if no character type is specified, all dictionaries must be referenced, which slows down the processing speed.

(発明の目的および概要) 本発明の目的は従来の技術の上記欠点を改善して高速で
精度のよい手書文書の読取方法を提供することにあり、
その特徴は、文字の文字線量を検出しそれを当該文字の
複雑度とし、字種の指定がなかった場合、その複雑度に
より当該文字の含まれる文字様の辞書を選択して識別を
行うことにある。
(Objective and Summary of the Invention) An object of the present invention is to improve the above-mentioned drawbacks of the conventional technology and provide a method for reading handwritten documents at high speed and with high accuracy.
Its feature is that it detects the character dose of a character and uses it as the complexity of the character, and if the character type is not specified, it selects the dictionary of the character style that contains the character based on the complexity and performs identification. It is in.

(発明の実施例) 第1図は本発明手書日本語文書読取方法における一実施
例を示す構成図である。図において、1は光電変換部、
2はバタンレジスタ、3は特徴抽出部、4は文字線量検
出部、5は文字種検出部、6は識別部、7は文字名出力
、8はひらがな辞書メモリ、9はカタカナ辞書メモリ、
10は英字数字記号辞書メモリ、11は漢字辞書メモリ
である。
(Embodiment of the Invention) FIG. 1 is a block diagram showing an embodiment of the handwritten Japanese document reading method of the present invention. In the figure, 1 is a photoelectric conversion unit;
2 is a button register, 3 is a feature extraction unit, 4 is a character dose detection unit, 5 is a character type detection unit, 6 is an identification unit, 7 is a character name output, 8 is a hiragana dictionary memory, 9 is a katakana dictionary memory,
10 is an alphanumeric symbol dictionary memory, and 11 is a kanji dictionary memory.

また、第2図は本実施例に使用した帳票例を示す図で、
21は帳票、22は字種指定記入枠で、その中の23は
漢字指定欄、24はひらがな指定欄、25はカタカナ指
定欄、26は英字数字記号指定欄、27は文字記入枠、
28は字種指定記入行、29は文字記入行で、例えば第
3図の帳票記入例のように記入しておく。
Furthermore, Figure 2 is a diagram showing an example of the form used in this example.
21 is a form, 22 is a character type specification entry frame, 23 is a kanji specification column, 24 is a hiragana specification column, 25 is a katakana specification column, 26 is an alphanumeric symbol specification column, 27 is a character entry frame,
Reference numeral 28 is a character type specification entry line, and 29 is a character entry line, in which entries are made, for example, as in the form entry example shown in FIG.

以下、この帳票例を用いて本発明の動作を次に説明する
The operation of the present invention will be explained below using this example of the form.

まず第2図の帳票21の字種指定記入行28の行につい
て文字記入行29の各文字に対応した字種指定の有無を
検出し、識別部6へ字種指定の有無を送出する。その動
作は光電変換部1により字種指定記入行28について光
電変換を行ない2値の量子化された電気信号に変換し、
1文字分の領域を切り出して・ぐターンレジスタ2に格
納する。
First, the presence or absence of a character type designation corresponding to each character in the character entry line 29 is detected for the character type designation entry line 28 of the form 21 in FIG. The operation is such that the photoelectric conversion unit 1 performs photoelectric conversion on the character type specification entry line 28 and converts it into a binary quantized electric signal.
Cut out an area for one character and store it in the turn register 2.

字種指定検出部5は・やターンレジスタ2を字種指定記
入枠22に対応する様に4個の領域に分割し、各領域内
の黒点数(文字線部を黒点とする。〕を計数し、閾値と
比較してそれぞれの文種の指定の有無を検出し、識別部
6へ前記字種即ち漢字、ひらがな、カタカナ、記号等の
指定の有無を送出する。以上の動作により文字記入行2
9の各文字に対応した字種指定を検出する。次に、第2
図の文字記入行29の読取りを行なう。その動作は光電
変換部1により文字記入行29について光電変換を行な
い、2値の量子化した電気信号に変換し、1文字分の領
域を切出して・やターンレジスタ2に格納する。特徴抽
出部3は・やターンレジスタ2内の文字パターンより各
種特徴を抽出し、該特徴を識別部6へ送出する。
The character type designation detection unit 5 divides the turn register 2 into four areas corresponding to the character type designation entry frame 22, and counts the number of black dots in each area (character line parts are black dots). Then, it compares it with the threshold value to detect whether each character type has been specified, and sends the presence or absence of the character type, ie, kanji, hiragana, katakana, symbol, etc., to the identification unit 6. By the above operation, the character entry line is 2
The character type designation corresponding to each character of 9 is detected. Next, the second
The character entry line 29 in the figure is read. In this operation, the photoelectric converter 1 performs photoelectric conversion on the character entry line 29, converts it into a binary quantized electric signal, cuts out an area for one character, and stores it in the turn register 2. The feature extraction section 3 extracts various features from the character patterns in the turn register 2 and sends the features to the identification section 6.

同時に文字線量検出部4ではパターンレジスタ2内の文
字・ぐターンより文字線量を検出して文字の大きさで正
規化することに文字の複雑度りとする。複雑度は次式に
よって表わされる。
At the same time, the character dose detection unit 4 detects the character dose from the characters and patterns in the pattern register 2 and normalizes it by the size of the character, depending on the complexity of the character. The complexity is expressed by the following equation.

但しKはDを整数化するための定数、Aは文字枠内の全
黒点数、PBは文字の外接枠のうち高さ方向の大きさ、
同様にPRは幅方向の大きさを示すものである。WLは
文字の線幅で次式によってめる。
However, K is a constant for converting D into an integer, A is the total number of black dots within the character frame, PB is the size of the circumscribed frame of the character in the height direction,
Similarly, PR indicates the size in the width direction. WL is the line width of the character and is determined by the following formula.

但しQは、文字枠内を2×2の窓で全点観測し、4点と
も黒点である個数を表わす。
However, Q represents the number of points in which all four points are black points when observing all points within the character frame using a 2×2 window.

文字の複雑度りが検出されたら、字種指定検出部5へ複
雑度りを送出する。字種指定検出部5では文字種指定領
域で文字種指定が検出できなかった場合、前記複雑度り
を用い以下の条件を判定し、文字種を決定し識別部へ送
出する。
When the degree of complexity of a character is detected, the degree of complexity is sent to the character type designation detection section 5. If the character type designation detecting unit 5 cannot detect a character type designation in the character type designation area, it uses the degree of complexity to determine the following conditions, determines the character type, and sends it to the identification unit.

D (a 全ての辞書を参照する。D (a) Refer to all dictionaries.

D≧a 字種は漢字であるとし漢字の辞書を−り。D≧a Assuming that the character type is a kanji, look up a kanji dictionary.

但し本実施例においてはa = 10、K=5とした。However, in this example, a=10 and K=5.

(5) 識別部6は特徴抽出部3よシ送出された特徴と辞書とを
照合し、最終的に1文字のカラゴリ名を文字泡出カフへ
出力する。
(5) The identification unit 6 compares the features sent from the feature extraction unit 3 with the dictionary, and finally outputs a one-character color name to the character bubble cuff.

識別部6において使用する辞書メモリは、ひらがな辞書
メモリ8、カタカナ辞書メモリ9、英字数字記号辞書メ
モリ10及び漢字辞書メモリ11の4種が用意されてい
るが、前記特徴抽出部3より送出された特徴と辞書との
照合は、前記あらかじめ各文字に対応する字種指定があ
った文字種の辞書メモリを使用して行う。
There are four types of dictionary memories used in the identification unit 6: a hiragana dictionary memory 8, a katakana dictionary memory 9, an alphanumeric symbol dictionary memory 10, and a kanji dictionary memory 11. The feature is compared with the dictionary using the dictionary memory of the character types in which the character types corresponding to each character have been designated in advance.

(発明の効果) 本発明は以−ヒ詳細に説明したようにあらかじめ字種指
定の検出を行い、前記指定のない文字については、文字
の文字線量を検出して、字種の選択を行い字種に適した
辞書により文字の識別を行っているので高速で精度の高
い読取が出来、従って高速で精度の良い手書日本語文書
の読取が可能となる効果がある。
(Effects of the Invention) As described in detail below, the present invention detects the character type designation in advance, and for characters without the above designation, the character type is selected by detecting the character dose of the character. Since characters are identified using a dictionary appropriate for the species, reading can be performed at high speed and with high accuracy, and therefore, handwritten Japanese documents can be read at high speed and with high accuracy.

【図面の簡単な説明】[Brief explanation of drawings]

第1図は本発明による手書文書読取方法の一実(6) 流側を示す構成図、第2図は本発明の実施例で使用した
帳票例を示す図、第3図はその帳票記入例を示す図であ
る。 1・・・光電変換部、2・・・パターンレジスタ、3・
・特徴抽出部、4・・・文字線量検出部、5・・・字種
指定検出部、6・・・識別部、7・・・文字名出力、8
・・・ひらがな辞書メモリ、9・・・カタカナ辞書メモ
リ、lO・・・英字数字記号辞書メモリ、11・・・漢
字辞書メモリ、21・・・帳票、22・・・字種指定記
入枠、23・・・ひらがな指定欄、24・・・カタカナ
指定欄、25・・・英字数字記号指定欄、26・・・漢
字指定欄、27・・・文字記入枠、28・・字種指定記
入枠、29・・・文字記入行。 特許出願人 沖電気工業株式会社 特許出願代理人 弁理士 山 本 恵 − (7) 第1図 手続補正書(自発) 昭和59年8月14−日 特許庁長官 志 賀 学 殿 1、事件の表示 昭和59年 特許願第57420号 2、発明の名称 手書文書読取方法 3、補正をする者 事件との関係 特許出願人 名 称 (029)沖電気工業株式会社5、補正の対象 明細書の特許請求の範囲、発明の詳細な説明及び図面の
簡単な説明の各欄並びに図面6、補正の内容 (11特許請求の範囲を別紙のとおり補正する。 (2)明細書第2頁第20行〜同第3頁第1行の「本発
明・・・一実施例」を「本発明による手書日本語文書読
取方法の一実施例」と補正する。 (3)同第3頁第8行、第11行、第16行、第19〜
20行、同第4頁第3行、第3〜4行、同第5頁第12
行、第13行及び同第7頁第5行、第9行、第12行の
「字種指定」を1文字種指定」と補正する。 (4)同第4頁第2行、第3行、第14行、第15行、
第16行、第18行、第19行及び同第7頁第4行の1
パターン」を「バタン」と補正する。 (5)同第4頁第6行の「文種」を「文字種」と補正す
る。 (6)同第4頁第7行の「識別部6へ前記字種」を「前
記文字種」と補正する。 (7)同第4頁第8〜9行の「送出する。」を1文字種
検出部5内の文字種指定メモリに格納する。」と補正す
る。 (8)同第4頁第10行の「字種指定を」を「字種指定
の有無を」と補正する。 (9)同第4頁第20行の1に文字」を1により文字」
と補正する。 (10)同第5頁第14行の「文字種指定領域で文字種
指定」を「前記文字種指定メモリを順次参照し前記文字
種指定を識別部6へ送出し文字種指定領域で第3図の2
2に示すごとく文字種指定」と補正する。 (11)同第5頁第16行の「識別部へ」を「識別部6
へ」と補正する。 (12)図面の第3図を別紙のとおり補正する。 以 上 (3) 特許請求の範囲 (1) 手書き日本語文書において文字を記入する文字
枠と、該文字枠の近傍にもうけられ文字種を指定する文
字種指定領域を有し、文字種指定領域で指定された文字
種の辞書により文字枠に記入された手書文字を認識する
手書文書読取方法において、文字種の指定がない場合に
、手書文字の文字線量を文字の複雑度としてめ、該複雑
度に対応する文字種の辞書を選択し、該選択された辞書
により手書文字を認識することを特徴とする手書文書読
取方法。 (2)前記辞書がひらがな、かたかな、英数字、及び漢
字に対し各々もうけられることを特徴とする特許請求の
範囲第1項記載の手書文書読取方法。
Figure 1 is a block diagram showing an example of the handwritten document reading method (6) according to the present invention, the flow side, Figure 2 is a diagram showing an example of a form used in an embodiment of the present invention, and Figure 3 is the entry of the form. It is a figure which shows an example. 1... Photoelectric conversion unit, 2... Pattern register, 3...
・Feature extraction unit, 4... Character dose detection unit, 5... Character type specification detection unit, 6... Identification unit, 7... Character name output, 8
...Hiragana dictionary memory, 9...Katakana dictionary memory, lO...Alphabet/numeric symbol dictionary memory, 11...Kanji dictionary memory, 21...Form, 22...Character type specification entry frame, 23 ...Hiragana specification field, 24...Katakana specification field, 25...Alphabet, numeric symbol specification field, 26...Kanji specification field, 27...Character entry box, 28...Character type specification entry box, 29...Character entry line. Patent Applicant Oki Electric Industry Co., Ltd. Patent Application Agent Megumi Yamamoto - (7) Figure 1 Procedural Amendment (Voluntary) August 14, 1980 - Japan Patent Office Commissioner Manabu Shiga 1, Indication of Case 1982 Patent Application No. 57420 2 Title of the invention Handwritten document reading method 3 Relationship with the case of the person making the amendment Patent applicant name (029) Oki Electric Industry Co., Ltd. 5 Patent claim for the specification subject to amendment scope, detailed description of the invention, and brief description of the drawings, drawing 6, and contents of the amendment (11) Claims are amended as shown in the attached sheet. (2) Page 2 of the specification, line 20 to "One embodiment of the present invention..." in the first line of page 3 is amended to read "an embodiment of the handwritten Japanese document reading method according to the present invention." (3) Page 3, line 8, Line 11, line 16, line 19~
Line 20, page 4, line 3, lines 3-4, page 5, line 12
Correct the "Character type designation" in line 1, line 13, and lines 5, 9, and 12 of page 7 to "1 character type designation." (4) Page 4, lines 2, 3, 14, and 15,
Lines 16, 18, 19 and 1 of the 4th line of page 7
Correct the pattern with a bang. (5) Correct "text type" in line 6 of page 4 to "character type". (6) Correct "the character type to the identification unit 6" on the seventh line of the fourth page to "the character type". (7) Store "Send." in the 8th to 9th lines of the 4th page in the character type designation memory in the 1 character type detection unit 5. ” he corrected. (8) On page 4, line 10, "specify character type" is corrected to "specify character type". (9) On the 4th page, line 20, the letter 1 is replaced by the letter 1.
and correct it. (10) "Specify the character type in the character type specification area" on the 14th line of page 5 is changed to "Sequentially refer to the character type specification memory and send the character type specification to the identification unit 6.
2. Specify the character type as shown in Figure 2. (11) Change “To identification section” to “Identification section 6” on page 5, line 16.
"To," he corrected. (12) Figure 3 of the drawings will be amended as shown in the attached sheet. Above (3) Claims (1) A handwritten Japanese document has a character frame in which characters are written, and a character type designation area provided near the character frame to designate a character type, and a character type designation area for specifying a character type. In a handwritten document reading method that recognizes handwritten characters written in a character frame using a dictionary of character types, when the character type is not specified, the character dose of the handwritten character is considered as the complexity of the character, and the complexity is A handwritten document reading method comprising selecting a dictionary of corresponding character types and recognizing handwritten characters using the selected dictionary. (2) The handwritten document reading method according to claim 1, wherein the dictionary is created for each of hiragana, katakana, alphanumeric characters, and kanji.

Claims (2)

【特許請求の範囲】[Claims] (1) 手書き日本語文書の文字を記入する文字枠と、
該文字枠の近傍にもうけられ文字種を指定する文字種指
定領域を有し、文字種指定領域で指定された文字種の辞
書により文字枠に記入された手書文字を認識する手書文
書読取方法において、文字種の指定がない場合に、手書
文字の文字線量を文字の複雑度としてめ、該複雑度に対
応する文字種の辞書を選択し、該選択された辞書により
手書文字を認識することを特徴とする手書文書読取方法
(1) A character frame in which to write the characters of a handwritten Japanese document,
In a handwritten document reading method that has a character type specification area provided near the character frame and specifies the character type, and recognizes handwritten characters written in the character frame using a dictionary of character types specified in the character type specification area, the character type is specified. is not specified, the character dose of handwritten characters is taken as the character complexity, a dictionary of character types corresponding to the complexity is selected, and the handwritten characters are recognized using the selected dictionary. A handwritten document reading method.
(2)前記辞書がひらがな、かたかな、英数字、及び漢
字に対し各々もうけられることを特徴とする特許請求の
範囲第1項記載の手書文書読取方法。
(2) The handwritten document reading method according to claim 1, wherein the dictionaries are respectively created for hiragana, katakana, alphanumeric characters, and kanji.
JP59057420A 1984-03-27 1984-03-27 Handwritten document reading method Granted JPS60201486A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
JP59057420A JPS60201486A (en) 1984-03-27 1984-03-27 Handwritten document reading method

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
JP59057420A JPS60201486A (en) 1984-03-27 1984-03-27 Handwritten document reading method

Publications (2)

Publication Number Publication Date
JPS60201486A true JPS60201486A (en) 1985-10-11
JPH0330191B2 JPH0330191B2 (en) 1991-04-26

Family

ID=13055154

Family Applications (1)

Application Number Title Priority Date Filing Date
JP59057420A Granted JPS60201486A (en) 1984-03-27 1984-03-27 Handwritten document reading method

Country Status (1)

Country Link
JP (1) JPS60201486A (en)

Cited By (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JPS62214486A (en) * 1986-03-17 1987-09-21 Sanyo Electric Co Ltd Character recognizing device
JPS63782A (en) * 1986-06-20 1988-01-05 Ricoh Co Ltd Pattern recognizing device

Cited By (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JPS62214486A (en) * 1986-03-17 1987-09-21 Sanyo Electric Co Ltd Character recognizing device
JPS63782A (en) * 1986-06-20 1988-01-05 Ricoh Co Ltd Pattern recognizing device

Also Published As

Publication number Publication date
JPH0330191B2 (en) 1991-04-26

Similar Documents

Publication Publication Date Title
JPS60201486A (en) Handwritten document reading method
Pal On the developement of an optical character recognition (ocr) system for printed bangla script
JPS63146187A (en) Character recognizing device
JPH03225579A (en) Device for segmenting character pattern
JPS6336389A (en) Character reader
JPS58101378A (en) Manuscript document reading method
JP3492442B2 (en) Document Content Characterization Using Word Shape Tokens
JPH07271911A (en) Character recognition device
JP2578767B2 (en) Image processing method
JPS63188284A (en) Character reader
JPH055144B2 (en)
JPS61109183A (en) Character recognizer
JPS62231389A (en) Character recognizing device
JP2953162B2 (en) Character recognition device
JPH06119497A (en) Character recognition method
JPH0475557B2 (en)
JPS5668880A (en) Character reader
Holstege et al. Visual parsing: an aid to text understanding
JPH08297720A (en) General document reader
JPS60160484A (en) Character reader
JPS60110089A (en) Character recognizer
JPS60160481A (en) Reader of character
JPH01265378A (en) European character recognizing system
JPS6081688A (en) Information recognition method
JPS63220383A (en) Character input device