JPS60217487A - Character recognition device - Google Patents

Character recognition device

Info

Publication number
JPS60217487A
JPS60217487A JP59073412A JP7341284A JPS60217487A JP S60217487 A JPS60217487 A JP S60217487A JP 59073412 A JP59073412 A JP 59073412A JP 7341284 A JP7341284 A JP 7341284A JP S60217487 A JPS60217487 A JP S60217487A
Authority
JP
Japan
Prior art keywords
pattern
character
dictionary
input
recognition
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
JP59073412A
Other languages
Japanese (ja)
Other versions
JPH0634259B2 (en
Inventor
Yoshiaki Kurosawa
由明 黒沢
Yoshikatsu Nakamura
中村 好勝
Katsunori Oi
大井 勝則
Yutaka Hitai
比田井 裕
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Toshiba Corp
Original Assignee
Toshiba Corp
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Toshiba Corp filed Critical Toshiba Corp
Priority to JP59073412A priority Critical patent/JPH0634259B2/en
Publication of JPS60217487A publication Critical patent/JPS60217487A/en
Publication of JPH0634259B2 publication Critical patent/JPH0634259B2/en
Anticipated expiration legal-status Critical
Expired - Lifetime legal-status Critical Current

Links

Landscapes

  • Character Discrimination (AREA)

Abstract

PURPOSE:To lighten the burden imposed on a writer by preparing an input pattern such as an abbreviation from as a special dictionary pattern with respect to the prescribed character string and by outputting the prescribed character string as a recognition result of the input pattern concerned. CONSTITUTION:A character pattern inputted with a pen 2 on a tablet device 1 is segmented by a detection and segmenting part 3 at every one character, and its features are collated with those of a recognition dictionary 5 in a recognition part 4, whereby a character recognition is carried out. In said dictionary 5 an additionary registration of a dictionary pattern can be performed. This means that an operation is set to a dictionary registration mode, a character string to be registered is inputted as a character code, etc., through a keyboard, and after this a registration dictionary pattern (abbreviated form, etc.) with respect to the character string is inputted through the tablet device. A written pattern of the input registration dictionary parttern is feature-extracted, and additionally registered as a special dictionary pattern. With the aid of the special dictionary pattern (abbreviated form, etc.) the desired character string can be efficiently inputted.

Description

【発明の詳細な説明】 〔発明の技術分野〕 本発明は入カバターンと辞書Ω録された辞書パターンと
を照合して文字認識する文字認識装置に係り、特に簡易
に文字データの入力効率の向上を図り得るようにした文
字認識装置に関する。
[Detailed Description of the Invention] [Technical Field of the Invention] The present invention relates to a character recognition device that recognizes characters by comparing an input cover pattern with a dictionary pattern recorded in a dictionary. The present invention relates to a character recognition device capable of achieving the following.

〔発明の技術的背景とその問題点〕[Technical background of the invention and its problems]

情報処理技術の発達に伴い、計算観て取扱われるデータ
量が膨大化している。これ故、各種データを如何に効率
良く計算機に入力するかが大きな課題となつCいる。
With the development of information processing technology, the amount of data handled in calculations is increasing. Therefore, how to efficiently input various data into a computer is a major issue.

これに対する解答として、近時、帳票等に印刷あるいは
手書きされた文字・記号を光学的に読取って文字認識し
てデータ入力するOCRや、タブレット等の座標入力装
置を介して筆記入力される文字ストロークの情報から実
時間的に文字認識してデータ入力する文字認識装置が各
種開発されている。
As an answer to this, in recent years OCR, which optically reads characters and symbols printed or handwritten on forms, recognizes the characters, and inputs data, and character strokes that are input by hand via a coordinate input device such as a tablet. Various character recognition devices have been developed that recognize characters from information in real time and input data.

ところで、この種の文字認識装置は、基本的には入力文
字パターンを1文字毎に切出し、その文字パターンの特
徴を検出して認識辞書に予め登録された標準辞書パター
ンの特徴と照合し、この照合結果から上記文字パターン
に対する認識結果を(qる如く構成されている。この為
、例えば原稿の下書やメモ書き等で多く利用されている
略字や略記号等を入力することができないと云う不具合
があった。そして、必然的に正式な書体による文字・記
号の筆記が要求されることになり、筆記者に大きな負担
を掛けることのみならず、文字データ入力rf間の増大
、入力ミスの増大等を招く問題があった。
By the way, this type of character recognition device basically cuts out an input character pattern character by character, detects the characteristics of that character pattern, and compares it with the characteristics of a standard dictionary pattern registered in advance in a recognition dictionary. From the matching results, the recognition results for the above character patterns are configured as shown in (q).For this reason, it is not possible to input abbreviations and abbreviations that are often used in drafting manuscripts and writing memos. There was a problem.Inevitably, characters and symbols were required to be written in a formal font, which not only placed a heavy burden on the scribe, but also increased the time required for inputting character data and caused input errors. There was a problem that caused an increase in the amount of water.

〔発明の目的〕[Purpose of the invention]

本発明はこのような事情を考慮してなされたもので、そ
の目的とするところは、効率良く文字データの入力を行
うことができ、しかも文字データ入力の簡易化を図って
筆記者の負担を軽減することのできる実用性の高い文字
認識装置を提供することにある。
The present invention has been made in consideration of these circumstances, and its purpose is to be able to input character data efficiently, and to simplify the input of character data to reduce the burden on the scribe. An object of the present invention is to provide a highly practical character recognition device that can reduce the amount of work required.

〔発明の概要〕[Summary of the invention]

本発明は指定された入カバターンを指定された所定の文
字または文字列に対する特殊辞書パターンとして作成し
、この特殊辞書パターンを上記所定の文字または文字列
に対応付けて認識辞書に追加登録するようにし、上記特
殊辞書パターンに該当する入カバターンが与えられたと
き、前記特殊辞書パターンに対応する前記所定の文字ま
たは文字列を上記入カバターンに対する認識結果として
出力することによって、特殊な記号等で表現される文字
または文字列の効率の良い入力を可能としたものである
The present invention creates a specified input cover pattern as a special dictionary pattern for a specified predetermined character or character string, and further registers this special dictionary pattern in a recognition dictionary in association with the predetermined character or character string. , when an input cover pattern corresponding to the special dictionary pattern is given, the predetermined character or character string corresponding to the special dictionary pattern is output as a recognition result for the input cover pattern, so that the input cover pattern is expressed with a special symbol, etc. This enables efficient input of characters or character strings.

〔発明の効果〕〔Effect of the invention〕

かくして本発明によれば、文字データ入力に際し、出現
頻度の高い文字または文字列、特に文字譚の多い文章等
を特殊な記号パターンとして辞書登録しておくことによ
り、この特殊な記号パターンの入力によって前記出現頻
度の高い文字または文字列、特に文字数の多い文章等を
短詩間に効率良く入力することが可能となる。従−って
、原稿の下書等で良く用いられる略字等によって文字デ
ータ入力を行うことも可能となり、筆記者に対する負担
を大幅に軽減することが可能となる等の効果が奏せられ
る。
Thus, according to the present invention, when inputting character data, by registering frequently appearing characters or character strings, especially sentences with many character stories, etc. in the dictionary as special symbol patterns, the input of this special symbol pattern allows It becomes possible to efficiently input the frequently occurring characters or character strings, especially sentences with a large number of characters, between short poems. Therefore, character data can be input using abbreviations that are often used in drafts of manuscripts, etc., and the burden on the scribe can be significantly reduced.

(発明の実施例) 以下、図面を参照して本発明の一実施例につき説明する
(Embodiment of the Invention) Hereinafter, an embodiment of the present invention will be described with reference to the drawings.

第1図は実施例装置の概略構成図である。この実施例゛
装置はタブレット装置1の座標面上にペン2を用いて筆
記入力された文字・記号パターンを第記ストO−りの時
系列な座標データとして検出し、その特徴を抽出して上
記入力文字・記号パターンを文字認識するものである。
FIG. 1 is a schematic configuration diagram of an embodiment device. This embodiment device detects a character/symbol pattern written on the coordinate plane of a tablet device 1 using a pen 2 as time-series coordinate data according to the first line, and extracts its characteristics. This is to recognize the input character/symbol pattern mentioned above.

しかして、前記タブレット装置1から時系列座標データ
として入力される文字・記号パターンの情報は、検切部
3に導かれて所定の前処理がなされたのち、1文字毎に
切出される。上記前処理は、例えば筆記ストロークデー
タの中から雑音成分を除去したり、また入力文字・記号
パターン毎にその大きざを正規化する等して行われる。
The character/symbol pattern information input as time-series coordinate data from the tablet device 1 is guided to the cutter 3 and subjected to predetermined preprocessing, and then cut out character by character. The above preprocessing is performed, for example, by removing noise components from the written stroke data, or by normalizing the size of each input character/symbol pattern.

その後、上記前処理が施された入カバターンについて、
例えば各筆記ストロークの標本点を特徴情報として抽出
する等して上記入カバターンを表現する特徴データがめ
られる。認識部4は、このようにしてめられた前記入カ
バターンの特徴と、認識辞書5に予め登録された標準辞
書パターンの特徴とを照合し、その類似度をめる等して
入カバターンに最も類似している標準辞書パターンを検
出し、この標準辞書パターンを前記入カバターンの文字
認識結果としてめている。
After that, regarding the input cover pattern that has been subjected to the above pretreatment,
For example, sample points of each writing stroke are extracted as feature information to obtain feature data representing the above-mentioned cover pattern. The recognition unit 4 compares the characteristics of the input cover pattern determined in this way with the characteristics of the standard dictionary pattern registered in advance in the recognition dictionary 5, and calculates the degree of similarity between them to find the most suitable one for the input cover pattern. A similar standard dictionary pattern is detected, and this standard dictionary pattern is accepted as the character recognition result of the input cover pattern.

尚、認識部4における認識処理方式は、従来より知られ
ている種々の方式を適宜用いれば良いものであり、また
前記認識辞書5の構成もその認識方式に応じて定められ
ることは云うまでもない。
It should be noted that the recognition processing method in the recognition unit 4 may be appropriately selected from various conventionally known methods, and it goes without saying that the configuration of the recognition dictionary 5 is also determined according to the recognition method. do not have.

ところで、前記認識辞書5には予め複数の認識対象文字
の各標準辞書パターンが登録されているが、この認識辞
書5に新たな辞書パターンの追加登録ができるようにな
っている。この認識辞書5に追加登録される辞書パター
ンは、辞書作成部6にて作成されるもので、辞書登録モ
ード設定時に前記タブレット装置1を介して入力される
入カバターンの特徴データを新たな辞書パターンとする
等して行われる。
By the way, although each standard dictionary pattern of a plurality of characters to be recognized is registered in advance in the recognition dictionary 5, new dictionary patterns can be additionally registered in this recognition dictionary 5. The dictionary pattern that is additionally registered in the recognition dictionary 5 is created by the dictionary creation section 6, and the characteristic data of the input pattern input through the tablet device 1 when setting the dictionary registration mode is used as a new dictionary pattern. This is done as follows.

即ち、辞書登録モード設定時には、先ず辞書登録すべき
文字または文字列が、例えばキーボード装置(図示甘ず
)を介して文字コード等として入力される。尚、前記タ
ブレット装置1を介して上記文字または文字列を入力す
るようにしても良く、或いは既に入力された文字または
文字列を選択指定して与えるようにしても良い。しかる
後、この指定された文字または文字列に対する登録辞書
パターンを前記タブレット装置1を介して入力する。
That is, when setting the dictionary registration mode, first, a character or character string to be registered in the dictionary is inputted as a character code or the like via, for example, a keyboard device (not shown). Note that the above characters or character strings may be input via the tablet device 1, or characters or character strings that have already been input may be selectively designated and provided. Thereafter, a registered dictionary pattern for the designated character or character string is input via the tablet device 1.

この登録辞書パターンは、例えば第2図に示すように、
文字列「電話」に対しては、その略称である[teJJ
の筆記体パターンとして、或いは文字列「オンライン手
書文字認識装置」に対しては特殊記号パターン等として
与えられる。このようにして入力される登録辞書パター
ンに対して、例えば第3図に示すようにその筆記ストロ
ークを9つの標本座標位置データとして特徴抽出し、こ
れを新たな辞書パターン(特殊辞書パターン)とする。
This registered dictionary pattern is, for example, as shown in Figure 2.
For the character string “telephone”, its abbreviation [teJJ
It is given as a cursive pattern, or as a special symbol pattern for the character string "online handwritten character recognition device". For the registered dictionary pattern input in this way, for example, as shown in Fig. 3, the characteristics of the writing stroke are extracted as nine sample coordinate position data, and this is used as a new dictionary pattern (special dictionary pattern). .

この特殊辞書パターンを前記指定された所定の文字また
は文字列に対応付けて前記H1辞書5に追加登録するこ
とによって、上記指定された文字または文字列に対する
辞書登録が完了する。
By additionally registering this special dictionary pattern in the H1 dictionary 5 in association with the specified character or character string, dictionary registration for the specified character or character string is completed.

かくしてこのような特殊辞書パターンが認識辞書5に追
加登録されると、それ以降に前記タブレット装置1を介
して上記特殊辞書パターンに該当する入カバターンが入
力されると、前述した&Σ識処理によって認識部4は上
記入カバターンが前記追加登録された特殊辞書パターン
であることを認識する。この認識結果を得て、認識部4
は上記特殊辞書パターンに対応して記憶された前記所定
の文字または文字列を前記入カバターンに対する認識結
果として出力する。
In this way, when such a special dictionary pattern is additionally registered in the recognition dictionary 5, when an input cover pattern corresponding to the special dictionary pattern is inputted later via the tablet device 1, it is recognized by the &Σ recognition process described above. The unit 4 recognizes that the entered cover pattern is the additionally registered special dictionary pattern. After obtaining this recognition result, the recognition unit 4
outputs the predetermined character or character string stored in correspondence with the special dictionary pattern as a recognition result for the input cover pattern.

かくして本装置によれば、前記特殊辞書パターンを有効
に利用して所定の文字または文字列を極めて簡易に入力
することが可能となる。つまり、日常使用している略字
や記号を利用して所望とする文字また゛は文字列を効率
良く入力することが可能となり、筆記者に対する負担を
大幅に軽減することができる。またこの特殊辞書パター
ンを利用して文字数の多い文章等を簡易に入力すること
も可能となり、文字データ入力の高速化を図り得る等の
効果も奏せられる。
Thus, according to the present device, it is possible to input a predetermined character or character string extremely easily by effectively utilizing the special dictionary pattern. In other words, it becomes possible to efficiently input a desired character or character string by using abbreviations and symbols that are used on a daily basis, and the burden on the scribe can be significantly reduced. Further, by using this special dictionary pattern, it becomes possible to easily input sentences with a large number of characters, and effects such as speeding up character data input can also be achieved.

尚、本発明は上述した実施例に限定されるものではない
。実施例ではタブレット装置を介して文字パターンを筆
記入力するものについて説明したが、帳票等に記載され
た文字パターンを光学的に入力して文字認識処理する文
字認識装置にも同様に適用して実施することができる。
Note that the present invention is not limited to the embodiments described above. In the embodiment, a character pattern is input by hand through a tablet device, but the present invention can also be applied to a character recognition device that optically inputs a character pattern written on a form etc. and performs character recognition processing. can do.

また文字認識の方式や、この文字認識で用いる入カバタ
ーンの特徴情報等も装置仕様に応じて種々変形すること
ができる。要するに本発明はその要旨を逸脱しない範囲
で種々変形して実施することができる。
Furthermore, the method of character recognition, the characteristic information of the input pattern used in this character recognition, etc. can be variously modified according to the device specifications. In short, the present invention can be implemented with various modifications without departing from the gist thereof.

【図面の簡単な説明】[Brief explanation of drawings]

第1図は本発明の一実施例装置の概略構成図、第2図は
辞書登録する指定の文字列とこの文字列に対して辞書登
録される特殊辞書パターンとの例を示す図、第3図は入
カバターンに対する特殊辞書パターンデータの抽出例を
示す図である。 1・・・タブレフ1−装置、2・・・ペン、3・・・検
切部、4・・・認識部、5・・・認識辞書、6・・・辞
書作成部。 出願人代理人 弁理士 鈴江武彦
FIG. 1 is a schematic configuration diagram of an apparatus according to an embodiment of the present invention, FIG. 2 is a diagram showing an example of a specified character string to be registered in a dictionary and a special dictionary pattern registered in the dictionary for this character string, and FIG. The figure is a diagram showing an example of extraction of special dictionary pattern data for an input cover pattern. DESCRIPTION OF SYMBOLS 1...Talev 1-device, 2... Pen, 3... Examination part, 4... Recognition part, 5... Recognition dictionary, 6... Dictionary creation part. Applicant's agent Patent attorney Takehiko Suzue

Claims (3)

【特許請求の範囲】[Claims] (1) 入力された文字パターンと認識辞書に予め辞書
登録された辞書パターンとを照合して上記入カバターン
に対する認識結果をめる文字認識装置において、指定さ
れた入カバターンを指定された所定の文字または文字列
に対する特殊辞書パターンとして作成し、この特殊辞書
パターンを上記所定の文字または文字列に対応付けて前
記認識辞書に追加登録する手段と、上記特殊辞書パター
ンに該当する入カバターンが与えられたとき、前記特殊
辞書パターンに対応する前記所定の文字または文字列を
上記入カバターンに対する認識結果として出力する手段
とを具備したことを特徴とする文字認識装置。
(1) In a character recognition device that compares an input character pattern with a dictionary pattern registered in advance in a recognition dictionary and obtains a recognition result for the above-mentioned cover pattern, a specified input cover pattern is used as a specified character. or a means for creating a special dictionary pattern for a character string, and additionally registering this special dictionary pattern in the recognition dictionary in association with the predetermined character or character string, and an input cover pattern corresponding to the special dictionary pattern. and means for outputting the predetermined character or character string corresponding to the special dictionary pattern as a recognition result for the cover pattern.
(2)所定の文字または文字列対応付けて設定される特
殊辞書パターンは、上記所定の文字または文字列を該特
殊辞書パターンの属性データとして付加して認識辞書に
登録されるものである特許請求の範囲第1項記載の文字
認識装置。
(2) A patent claim in which a special dictionary pattern that is set in association with a predetermined character or character string is registered in a recognition dictionary by adding the predetermined character or character string as attribute data of the special dictionary pattern. The character recognition device according to item 1.
(3)入カバターンは、座標入力装置を介して篭記入力
される文字・記号パターンの筆記ストロークを示す時系
列な座標データとして入力されるものである特許請求の
範囲第1項記載の文字認識装置。
(3) Character recognition according to claim 1, wherein the input cover pattern is input as time-series coordinate data indicating the writing strokes of the character/symbol pattern inputted in the basket via a coordinate input device. Device.
JP59073412A 1984-04-12 1984-04-12 Character recognition device Expired - Lifetime JPH0634259B2 (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
JP59073412A JPH0634259B2 (en) 1984-04-12 1984-04-12 Character recognition device

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
JP59073412A JPH0634259B2 (en) 1984-04-12 1984-04-12 Character recognition device

Related Child Applications (1)

Application Number Title Priority Date Filing Date
JP7130644A Division JP2549831B2 (en) 1995-05-29 1995-05-29 Character recognition device input pattern / character string registration method

Publications (2)

Publication Number Publication Date
JPS60217487A true JPS60217487A (en) 1985-10-31
JPH0634259B2 JPH0634259B2 (en) 1994-05-02

Family

ID=13517452

Family Applications (1)

Application Number Title Priority Date Filing Date
JP59073412A Expired - Lifetime JPH0634259B2 (en) 1984-04-12 1984-04-12 Character recognition device

Country Status (1)

Country Link
JP (1) JPH0634259B2 (en)

Cited By (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JPS6293775A (en) * 1985-10-21 1987-04-30 Canon Inc Information processing method and device
JPS6293774A (en) * 1985-10-21 1987-04-30 Canon Inc Information processing method

Families Citing this family (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2015004787A1 (en) * 2013-07-11 2015-01-15 Tanaka Shunichi Input assistance device

Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JPS56135282A (en) * 1980-03-25 1981-10-22 Fujitsu Ltd Real-time handwritten character recognition device
JPS57206988A (en) * 1981-06-15 1982-12-18 Fujitsu Ltd Data processor
JPS5878260A (en) * 1981-11-04 1983-05-11 Toshiba Corp Optical character reader
JPS58149574A (en) * 1982-03-02 1983-09-05 Nec Corp Registering device of standard pattern
JPS58219685A (en) * 1982-06-14 1983-12-21 Canon Inc character processing device

Patent Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JPS56135282A (en) * 1980-03-25 1981-10-22 Fujitsu Ltd Real-time handwritten character recognition device
JPS57206988A (en) * 1981-06-15 1982-12-18 Fujitsu Ltd Data processor
JPS5878260A (en) * 1981-11-04 1983-05-11 Toshiba Corp Optical character reader
JPS58149574A (en) * 1982-03-02 1983-09-05 Nec Corp Registering device of standard pattern
JPS58219685A (en) * 1982-06-14 1983-12-21 Canon Inc character processing device

Cited By (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JPS6293775A (en) * 1985-10-21 1987-04-30 Canon Inc Information processing method and device
JPS6293774A (en) * 1985-10-21 1987-04-30 Canon Inc Information processing method

Also Published As

Publication number Publication date
JPH0634259B2 (en) 1994-05-02

Similar Documents

Publication Publication Date Title
JP2713622B2 (en) Tabular document reader
JP3452774B2 (en) Character recognition method
KR20010093764A (en) Retrieval of cursive chinese handwritten annotations based on radical model
Amin Arabic character recognition
Alghamdi et al. Printed Arabic script recognition: A survey
US6567548B2 (en) Handwriting recognition system and method using compound characters for improved recognition accuracy
Hakro et al. A study of sindhi related and arabic script adapted languages recognition
JPS60217487A (en) Character recognition device
AbdulKader A two-tier arabic offline handwriting recognition based on conditional joining rules
JP2549831B2 (en) Character recognition device input pattern / character string registration method
Deshpande et al. Recognition of hand written devnagari characters with percentage component regular expression matching and classification tree
Sarnacki et al. Character Recognition Based on Skeleton Analysis
Amin Recognition of printed Arabic text using machine learning
Malik et al. Recognition of printed Devnagari characters with regular expression in finite state models
Gong et al. Tibetan character recognition based on machine learning of K-means algorithm
Emon et al. Recognition (OCR) Techniques
JP3151866B2 (en) English character recognition method
JP3015137B2 (en) Handwritten character recognition device
Kandasamy et al. Enhanced Handwritten Text Recognition With Spell Checking by Building a Small Language Model (SLM) With Jaro-Winkler Algorithm
Tan Retrieval of machine-printed Latin documents through Word Shape Coding
JPS60217488A (en) Character recognition device
KR900005141B1 (en) Character Recognition Device
Gangania et al. Handwriting Recognition System Using Optical Character Recognition
JPS58169679A (en) After-processing system of sentence reader
JPH06195508A (en) Character cutting method