JPH0330190B2

JPH0330190B2 -

Info

Publication number: JPH0330190B2
Application number: JP59052935A
Authority: JP
Priority date: 1984-03-19
Filing date: 1984-03-19
Publication date: 1991-04-26
Also published as: JPS60196886A

Description

【発明の詳細な説明】〔発明の技術分野〕この発明は、用紙等に筆記具等で記入される文
字等を認識し、当該文字を表す文字コードを決定
する文字認識装置に関するものであり、特に記入
者毎に特有な略字等の変形文字の認識に関するも
のである。[Detailed Description of the Invention] [Technical Field of the Invention] The present invention relates to a character recognition device that recognizes characters written on paper or the like with a writing instrument, etc., and determines a character code representing the character. This relates to the recognition of modified characters such as abbreviations that are unique to each person.

[Prior art]

従来の文字認識装置の多くは、不特定多数の記
入者によつて記入される文字を認識の主対象とし
ているため、多数の記入者が記入する文字パター
ンの平均的な変形を表現すると考えられる１種類
の認識辞書を用いて文字認識を行つていた。この
ため記入者ごとに特有な略字や大きく変形された
文字に対しては誤認識又は認識拒否となる確率が
大きかつた。これを防止するためにはJISで定め
られた標準字形等に倣つて記入することを記入者
に要望していた。 Most conventional character recognition devices mainly recognize characters written by an unspecified number of people, so they are thought to represent the average deformation of character patterns written by many people. Character recognition was performed using one type of recognition dictionary. For this reason, there is a high probability that abbreviations unique to each person or characters that are significantly deformed will be misrecognized or rejected. In order to prevent this, the person filling out the form was requested to follow the standard character form specified by JIS.

しかし、標準字形に倣つて記入しなければなら
ないという制限に対しては記入者側の抵抗が少な
くなく、文字認識装置の普及をさまたげる原因と
なつていた。 However, there was considerable resistance on the part of those filling out the form, which required them to fill in the form in imitation of the standard character form, and this was a cause of hindering the widespread use of character recognition devices.

これを取除くために提案された方法は記入者限
定方式とも称すべきもので、話者限定式の音声認
識装置に類似し、記入者ごとに定められた認識辞
書を使用するものである。この方式の一例として
は特公昭58−53393号公報に開示されており、複
数の認識辞書を備えていて、記入者コードを入力
することによつて当該記入者用の認識辞書を選び
出して使用するものである。この方式では記入者
に依存した字形の変形に対処することはできる
が、以下に述べるような欠点があるために、実用
的とは言い難い。 The method proposed to eliminate this problem is also called the filler-limited method, which is similar to a speaker-limited speech recognition device and uses a recognition dictionary determined for each filler. An example of this method is disclosed in Japanese Patent Publication No. 58-53393, which is equipped with a plurality of recognition dictionaries, and by inputting a filler code, the recognition dictionary for the filler is selected and used. It is something. Although this method can deal with deformation of the character shape depending on the person writing it, it is difficult to say that it is practical because it has the following drawbacks.

すなわち、第１の欠点としては、多数の記入者
に対し使用可能とするためには多数の種類の認識
辞書を必要とし、これを格納するための膨大な記
憶容量を必要とすることである。漢字を認識する
ための認識辞書は１種類で数メガバイトの記憶容
量を必要とし、100種類の認識辞書を格納する場
合、磁気デイスク装置等の外部記憶装置を用いる
にしても、記憶装置が甚だ高価なものになる。 That is, the first drawback is that in order to be usable by a large number of fillers, many types of recognition dictionaries are required, and a huge amount of storage capacity is required to store them. One type of recognition dictionary for recognizing kanji requires several megabytes of storage capacity, and when storing 100 types of recognition dictionaries, even if an external storage device such as a magnetic disk device is used, the storage device is extremely expensive. Become something.

比較的低価格なフレキシブルデイスク装置等の
外部記憶装置に多数の種類の認識辞書を記憶し、
そのうちから記入者コードによつて選択される１
種類の認識辞書だけを高速メモリに転送して使用
することもできるが、この場合には転送に必要な
時間が大きくなるという欠点がある。 Many types of recognition dictionaries are stored in a relatively low-cost external storage device such as a flexible disk device,
1 selected from among them by the filler code
Although it is also possible to transfer only the type recognition dictionary to a high-speed memory for use, this has the disadvantage that the time required for transfer increases.

第２の欠点は、記入者毎の認識辞書を作成する
ことの困難性にある。文字認識装置自体が、記入
者の記入した文字パターンから学習して、当該記
入者に対する認識辞書を作成することは、現在の
技術においては未だに達成されておらず、認識辞
書の作成には多少とも人間の介在を必要とするた
め、多数の種類の認識辞書を作成するには、多数
の技術者の多大の時間を必要とする。 The second drawback lies in the difficulty of creating a recognition dictionary for each person who fills in the information. Current technology has not yet achieved the ability of the character recognition device itself to learn from character patterns written by a person and create a recognition dictionary for that person. Since human intervention is required, creating many types of recognition dictionaries requires a large amount of time from many engineers.

また、認識辞書を作成する場合には、１カテゴ
リ当り（１つの文字コード当り）少くとも数十種
類以上の文字パターンを必要とするとされてお
り、数千個以上のカテゴリを対象とする漢字に関
しては、認識辞書作成に必要なすべての文字パタ
ーンを記入者に書かせることはほとんど不可能で
ある。 In addition, when creating a recognition dictionary, it is said that at least several dozen character patterns are required per category (per character code), and for kanji that target more than several thousand categories. It is almost impossible to have the person write all the character patterns necessary to create a recognition dictionary.

第３の欠点としては、認識辞書が登録されてい
ない記入者は使用できないという点がある。この
ため、登録されていない記入者は標準的な文字パ
ターンを記入し、標準的な文字パターンを対象と
する認識辞書を使用せねばならぬという欠点があ
つた。 A third drawback is that the recognition dictionary cannot be used by a person who has not registered it. For this reason, there was a drawback in that an unregistered filler had to fill in standard character patterns and use a recognition dictionary for standard character patterns.

[Summary of the invention]

この発明は上記のような従来のものの欠点を除
去するためになされたもので、この発明では標準
的な文字パターンを対象とした認識辞書と、記入
者に対応して更新可能な認識結果修正テーブルを
備え、記入者と装置との間のマンマシンインタフ
エイスを構成するブラウン管表示装置（一般的に
は文字表示手段）及びキーボード（一般的には文
字入力手段）を備え、このマンマシンインタフエ
イスを介して記入者が文字認識結果を修正するこ
とにより認識結果修正テーブルに当該記入者特有
の修正情報を蓄積し、この修正情報が蓄積された
認識結果修正テーブルを用いて修正を行うように
したものである。 This invention was made in order to eliminate the drawbacks of the conventional ones as described above, and this invention provides a recognition dictionary for standard character patterns and a recognition result correction table that can be updated according to the person who wrote it. It is equipped with a cathode ray tube display device (generally a character display means) and a keyboard (generally a character input means) which constitute a man-machine interface between the person writing the data and the device, and this man-machine interface When a user corrects character recognition results through the system, correction information specific to the filler is stored in a recognition result correction table, and corrections are made using the recognition result correction table in which this correction information is stored. It is.

[Embodiments of the invention]

以下この発明の実施例を図面について説明す
る。第１図はこの発明の一実施例を示すブロツク
図であつて、図において、１は用紙、２は走査手
段、３は認識手段、４は認識辞書、５は制御手
段、６は記憶手段、７は文字表示手段、８は文字
入力手段である。認識辞書４は標準的な文字パタ
ーンを対象とする認識辞書である。したがつて
１，２，３，４の動作は標準的な文字パターンを
対象とする認識辞書を備えた従来の装置の動作と
同様であり、よく知られているのでその説明は省
略する。 Embodiments of the present invention will be described below with reference to the drawings. FIG. 1 is a block diagram showing an embodiment of the present invention, in which 1 is a sheet of paper, 2 is a scanning means, 3 is a recognition means, 4 is a recognition dictionary, 5 is a control means, 6 is a storage means, 7 is a character display means, and 8 is a character input means. The recognition dictionary 4 is a recognition dictionary that targets standard character patterns. Therefore, the operations 1, 2, 3, and 4 are similar to those of a conventional device equipped with a recognition dictionary for standard character patterns, and are well known, so a description thereof will be omitted.

記憶手段６には認識結果修正テーブル（後節で
説明する）が格納されており、制御手段５は汎用
のマイクロプロセツサ等で構成され、認識手段３
から文字の認識結果として出力される文字コード
を記憶手段６に記憶される認識結果修正テーブル
の内容と比較して修正し、最終認識結果とすべき
文字コードを決定し、この決定した文字コードを
文字表示手段７へ入力する。 The storage means 6 stores a recognition result correction table (described in a later section), the control means 5 is composed of a general-purpose microprocessor, etc., and the recognition means 3
The character code output as the character recognition result is corrected by comparing it with the contents of the recognition result correction table stored in the storage means 6, the character code to be the final recognition result is determined, and the determined character code is input to the character display means 7.

文字表示手段７は文字コードをアドレスとして
当該文字のドツトパターンが格納されているフオ
ントメモリを内蔵し、入力された文字コードによ
つてフオントメモリから当該文字のドツトパター
ンを読出してこれをブラウン管表示面に表示す
る。 The character display means 7 has a built-in font memory in which the dot pattern of the character is stored using the character code as an address, and reads out the dot pattern of the character from the font memory according to the input character code and displays it on the cathode ray tube display screen. to be displayed.

記入者は文字表示手段７に表示された文字配列
を観察し、その文字配列中に誤認識文字があつた
場合には、その誤認識文字を文字入力手段８から
の入力によつて正しい文字に置換する。入力すべ
き文字がローマ字やかな文字等の場合は、文字入
力手段８のキーボードのキーに当該文字が書いて
あるキーを操作すればその文字の文字コードを入
力することができ、入力すべき文字が漢字の場合
は、かな漢字変換技術等を利用し、かな文字をキ
ーインすることにより、所望の漢字を入力するこ
とができる。制御手段５の中には文字表示手段７
に表示される文字の配列に対応した位置に文字コ
ードの配列を記憶するリフレツシユメモリが内蔵
されており、文字表示手段７の表示面のカーソル
位置を文字入力手段８からの入力によつて所望の
位置に設定した上で、所望の文字をキーインする
と、上記カーソル位置に対応するリフレツシユメ
モリの位置の文字コードがキーインされた文字の
文字コードに修正され、この修正された文字コー
ドに対応する文字のドツトパターンが文字表示手
段７上に表示される。すべての誤認識文字に対す
る修正を終ると、記入者は文字入力手段８を用い
て制御手段５に終了信号を送る。 The filler observes the character arrangement displayed on the character display means 7, and if there is an incorrectly recognized character in the character arrangement, the incorrectly recognized character is changed to the correct character by inputting from the character input means 8. Replace. If the character to be input is a Roman character, etc., the character code of that character can be input by operating the key on the keyboard of the character input means 8 with the character written on it, and the character to be input can be input. If the character is a kanji, the desired kanji can be input by using kana-kanji conversion technology and keying in the kana characters. In the control means 5, there is a character display means 7.
A refresh memory is built-in to store a character code arrangement at a position corresponding to the character arrangement displayed on the screen. When you key in the desired character after setting the cursor position, the character code at the refresh memory location corresponding to the cursor position will be corrected to the character code of the keyed-in character, and the character code will correspond to this corrected character code. A dot pattern of characters is displayed on the character display means 7. When all the incorrectly recognized characters have been corrected, the filler uses the character input means 8 to send a completion signal to the control means 5.

制御手段５はこの終了信号を受けた時点のリフ
レツシユメモリの内容を正しい認識として、記憶
手段６内の認識結果修正テーブルの内容を更新す
る。 The control means 5 recognizes the contents of the refresh memory at the time of receiving this end signal as correct, and updates the contents of the recognition result correction table in the storage means 6.

第２図はこの発明の動作による認識結果の一例
を示す図で、９，１０，１１，１２は用紙１に記
入された文字、１３，１４，１５，１６は９，１
０，１１，１２がそれぞれ認識された文字コード
を表す。入力文字９では問を略字で記入したため
「同」と誤認識され、入力文字１０では層を略字
で記入したため「乃」と誤認識され、入力文字１
１では機を略字で記入したため「桟」（サン）と
認識され、入力文字１２では訣が認識辞書４中に
存在しない（すなわち訣は外字である）ため
「訳」（ヤク）と誤認識された例を示す。 FIG. 2 is a diagram showing an example of the recognition results obtained by the operation of the present invention, where 9, 10, 11, and 12 are characters written on paper 1, and 13, 14, 15, and 16 are characters that are written on paper 1.
0, 11, and 12 represent recognized character codes, respectively. In input character 9, the question was written in an abbreviation, so it was misrecognized as "same", and in input character 10, the layer was written in an abbreviation, so it was misrecognized as "乃", and input character 1
In input character 1, ``ki'' was entered as an abbreviation, so it was recognized as ``san'', and in input character 12, ``tip'' did not exist in the recognition dictionary 4 (that is, ``tip'' is a foreign character), so it was misrecognized as ``yaku'' (translation). Here is an example.

第３図は記憶装置６に格納されている認識結果
修正テーブルの一部の初期状態を示す図である。
図において、１３ａは認識結果、１７ａは最終認
識結果、１８ａは頻度である。初期状態であるた
め認識結果と最終認識結果が一致し、その頻度は
１であつて、最終認識結果の他のらんは空らんと
なつている。第３図の認識結果のらんは第２図に
関連した説明の便宜のために仮に集めて配列した
ものであつて、認識結果修正テーブル内の配列は
アドレスの容易なように文字コードのビツトパタ
ーンに従い、あるいはJISの漢字コードの順に配
列されている。 FIG. 3 is a diagram showing the initial state of a part of the recognition result correction table stored in the storage device 6. As shown in FIG.
In the figure, 13a is the recognition result, 17a is the final recognition result, and 18a is the frequency. Since this is the initial state, the recognition result and the final recognition result match, the frequency is 1, and the other fields of the final recognition result are empty. The list of recognition results in Figure 3 has been temporarily collected and arranged for the convenience of explanation related to Figure 2, and the arrangement in the recognition result correction table is based on the bit pattern of the character code for easy addressing. They are arranged according to the JIS kanji code.

ある記入者が、初めて第１図に示す装置を使用
する場合には、初期状態の認識結果修正テーブル
が格納されているフレキシブルシートをフレキシ
ブルデイスク装置等の記憶手段に差込み、用紙１
上に手書きによつて入力文字９〜１２（第２図の
場合）を記入したとすると、従来の装置における
と同様の動作により、認識手段３からは第２図１
３〜１６に示す認識結果が出力される。制御手段
５は認識結果修正テーブルを参照して修正を行う
のであるが、この場合、第３図に示すように認識
結果１３ａと最終認識結果１７ａとは同一である
ため、何等修正されることなく第２図１３〜１６
に示す認識結果がそのまま文字表示手段７に表示
される。 When a person uses the device shown in FIG. 1 for the first time, he or she inserts the flexible sheet containing the initial recognition result correction table into a storage device such as a flexible disk device, and then inserts the sheet 1.
If input characters 9 to 12 (in the case of Fig. 2) are written by hand, the recognition means 3 will input the characters 9 to 12 (in the case of Fig. 2) by the same operation as in the conventional device.
The recognition results shown in 3 to 16 are output. The control means 5 makes corrections by referring to the recognition result correction table, but in this case, as shown in FIG. 3, since the recognition result 13a and the final recognition result 17a are the same, no correction is made. Figure 2 13-16
The recognition results shown in are displayed as they are on the character display means 7.

記入者は、先に説明したように、文字入力手段
８を用いて誤認識文字を正しい文字に置換する。
この置換操作によつて、文字表示手段７に表示さ
れた文字パターンが修正され、この文字に対応す
る制御手段５のリフレツシユメモリ内の文字コー
ドが修正され、認識結果修正テーブルが更新され
る。文字表示手段７の表示とリフレツシユメモリ
の文字コードの修正については既に説明した所で
あるが、たとえば、認識結果１３として文字表示
手段７に表示される「同」を「間」に修正した場
合は、認識結果修正テーブルの修正は次のように
して行われる。 As explained above, the person filling in the information uses the character input means 8 to replace the misrecognized characters with correct characters.
By this replacement operation, the character pattern displayed on the character display means 7 is modified, the character code in the refresh memory of the control means 5 corresponding to this character is modified, and the recognition result modification table is updated. The display on the character display means 7 and the modification of the character code in the refresh memory have already been explained, but for example, when "same" displayed on the character display means 7 as the recognition result 13 is modified to "between". The recognition result modification table is modified as follows.

すなわち、第２図の認識結果１３〜１６が認識
手段３から制御手段５に入力されると、この認識
結果１３〜１６に対応する認識結果修正テーブル
の部分が記憶手段６から読出されて制御手段５内
のデータメモリ領域に一時書込まれる。第３図は
その書込まれた記憶であると見ることができる。
但し、第３図中「間」「層」「機」の行は説明の参
考のために併記した部分で制御手段５内に書込む
必要はない。認識結果修正テーブルから制御手段
５内に書込んだ部分を用いて、認識手段３からの
入力を修正するが、第３図に示すように、認識結
果修正テーブルが初期状態の時は、先に説明した
ように、修正が行われず、文字表示手段７には、
たとえば、「同題」１３として表示される。記入
者が「同」を「問」と修正したとき、制御手段５
内のリフレツシユメモリの文字コード（同）が文
字コード（問）と修正されると同時に第３図の認
識結果「同」の行の最終認識結果の第１列が
「問」の文字コードに修正されその頻度らんに１
が記入され、第１列に前から記憶されていたデー
タは第２列にシフトされる。 That is, when the recognition results 13 to 16 shown in FIG. The data is temporarily written to the data memory area in 5. FIG. 3 can be seen as the written memory.
However, in FIG. 3, the lines ``interval'', ``layer'', and ``machine'' are written together for reference purposes and do not need to be written in the control means 5. The input from the recognition means 3 is corrected using the part written in the control means 5 from the recognition result correction table, but as shown in FIG. As explained, no correction is made and the character display means 7 shows the following:
For example, it is displayed as "same title" 13. When the filler corrects "same" to "question", control means 5
The character code (same) in the refresh memory is corrected to the character code (question), and at the same time, the first column of the final recognition result in the row with the recognition result "same" in Figure 3 becomes the character code "question". Corrected its frequency 1
is written, and the data previously stored in the first column is shifted to the second column.

正しい文字への置換が終了すると、記入者は文
字入力手段８を用いて制御部５に終了信号を送
る。その時点において制御部５内の認識結果修正
テーブルの１部は第４図に示すとおりになつてい
る。第４図において「題」１３ｃ、「高」１４ａ、
「械」１５ｃ、「別」１６ｂは修正を受けなかつた
ので頻度らんに１が加算されて２になつている。
最終認識結果の配列は頻度の高いものを前の列に
し、同一頻度のものは後から出て来た文字コード
を前の列にする。 When the replacement with the correct character is completed, the person filling in the information sends a completion signal to the control unit 5 using the character input means 8. At that point, part of the recognition result correction table in the control section 5 is as shown in FIG. In Figure 4, "Title" 13c, "High" 14a,
Since "machine" 15c and "another" 16b were not modified, 1 was added to the frequency, making it 2.
In the final recognition result array, those with higher frequencies are placed in the front column, and those with the same frequency are placed in the previous column with character codes that appear later.

制御部５は終了信号を受けると制御部５内で修
正された内容に従つて記憶手段６内の認識結果修
正テーブルを修正する。 When the control section 5 receives the end signal, it corrects the recognition result correction table in the storage means 6 according to the contents corrected in the control section 5.

また、認識手段３は１個の文字パターンに対し
複数の文字コードと、その確からしさの順位を付
して出力し、この複数の文字コードをそれぞれの
文字のドツトパターンとして文字表示手段７に表
示し、記入者がいずれかの文字を選択するように
構成することもできる。 Further, the recognition means 3 outputs a plurality of character codes for one character pattern and ranks them according to their likelihood, and displays the plurality of character codes as a dot pattern of each character on the character display means 7. However, it can also be configured so that the person filling in the form selects one of the characters.

以上のようにして、フレキシブルシート中には
当該記入者に対応する認識結果修正テーブルの情
報が順次蓄積されてゆくが、当該記入者が第１図
の装置の使用を終了したときは、自分の用いるフ
レキシブルシートを記憶手段６から取り出して別
に保管し、次に第１図の装置を使用するときに再
び記憶手段６に装着すればよい。 As described above, the information of the recognition result correction table corresponding to the person who filled in the information is sequentially accumulated in the flexible sheet, but when the person who filled in the information finishes using the device shown in Fig. It is sufficient to take out the flexible sheet to be used from the storage means 6, store it separately, and attach it to the storage means 6 again the next time the apparatus of FIG. 1 is used.

たとえば、認識結果修正テーブルの内容が第５
図のようになつている場合、第２図入力文字９〜
１２が再び入力されると、文字表示手段７には最
終認識結果の正しい文字が表示され、記入者はこ
れを確認して終了信号を送るだけでよく、この場
合認識結果修正テーブルでは頻度のらんが更新さ
れる。第５図はこの更新された認識結果修正テー
ブルを示す。第３図乃至第５図において左のらん
外に斜線を施した行は第３図について説明したと
おり、第２図の９〜１２に示す入力文字には直接
関係のない部分である。 For example, if the contents of the recognition result correction table are
If it is as shown in the figure, input characters 9 to 9 in Figure 2
12 is input again, the correct character of the final recognition result is displayed on the character display means 7, and the person filling in only needs to confirm this and send an end signal.In this case, the recognition result correction table shows the correct character of the final recognition result. is updated. FIG. 5 shows this updated recognition result correction table. In FIGS. 3 to 5, the lines marked with diagonal lines outside the left frame are portions that are not directly related to the input characters shown at 9 to 12 in FIG. 2, as explained with reference to FIG.

第６図は特定の記入者が、この発明の装置を多
数回使用した場合、その記入者に対する認識結果
修正テーブルの内容の一部の一例を示す図であ
る。この記入者は第２図９〜１２に示すような略
字を使用するため、認識結果が「問」１３ａ、
「層」１４ｂ、「機」１５ａとなるのは他の文字の
文字パターンが誤認識された場合が大部分であ
り、「題」１３ｃ、「高」１４ａ、「械」１５ｃ、
「別」１６ｂは大部分が正しく認識されたことを
示す。また、「乃」１４ｃと「桟」１５ｂ（サン）
はこの記入者によつてほとんど使用されず、「層」
と「機」に修正されたことを示す。また「同」１
３ｂは「同」、「問」、「間」から「同」１３ｂと認
識されることが多く、最終認識結果はこれらの文
字の使用頻度で決定される。「訳」１６ａも使用
頻度が高く、「訳」と最終認識されることが多い
が、外字の「訣」が「訳」と認識されたことがあ
る事実を示す。 FIG. 6 is a diagram showing an example of a part of the contents of a recognition result correction table for a particular person who has used the apparatus of the present invention many times. The person filling this out uses abbreviations as shown in Figure 2 9-12, so the recognition result is "Question" 13a,
"Layer" 14b, "machine" 15a are mostly caused when character patterns of other characters are misrecognized, and "title" 13c, "taka" 14a, "machine" 15c,
“Another” 16b indicates that most of the images were correctly recognized. Also, “No” 14c and “Zan” 15b (Sun)
is rarely used by this author, and is referred to as "layer"
and "machine" to indicate that it has been modified. Also “same” 1
3b is often recognized as "same" 13b from "same", "question", and "ma", and the final recognition result is determined by the frequency of use of these characters. ``Translator'' 16a is also frequently used and is often finally recognized as ``translator'', but this shows the fact that the external character ``tei'' has been recognized as ``translator''.

第７図は第１図の文字表示手段７の表示例を示
す図であつて、１９は制御手段５から出力される
最終認識結果の文字列、２０，２１はそれぞれ文
字列１９の文字「訳」、「別」がそれぞれ第１列目
に存在する認識結果修正テーブル（第６図１６
ａ，１６ｂ）の各行の内容を示す。記入者は２
０，２１の表示中から選択して１９の表示を修正
することができるので、修正すべき文字の文字コ
ードの入力が容易となる。 FIG. 7 is a diagram showing an example of the display of the character display means 7 in FIG. ” and “another” are present in the first column (Fig. 6, 16).
The contents of each line of a and 16b) are shown. There are 2 people who filled in the information.
Since the display of 19 can be corrected by selecting from the display of 0 and 21, it becomes easy to input the character code of the character to be corrected.

以上のようにして、この発明によれば記入者個
有の略字も認識できるようになるが、第６図に示
すとおり、たとえば「乃」及び「桟」（サン）を
それぞれ「層」及び「機」と誤認識することがあ
る。しかし、この記入者は「乃」及び「桟」（サ
ン）は第６図に示すとおり殆んど使用していない
ので問題は少い。 As described above, according to the present invention, it is possible to recognize abbreviations unique to the person who filled in the information. For example, as shown in FIG. It may be mistakenly recognized as "machine". However, this person hardly uses ``no'' and ``san'' as shown in Figure 6, so this is not a problem.

個々の記入者が実際に手書き文字として使用す
る字種は比較的限られた数である。たとえば、漢
字を対象とする文字認識装置においては認識辞書
４内の文字コードの種類はJIS第１水準の約3000
字程度であるとし、仮に認識結果修正テーブルの
認識結果列には3000字の文字コードを配列したと
しても、個々の記入者に限定すれば800字程度が
使用されるものであり、認識結果修正テーブルか
らも上記800字程度の字種に対し読出し書込みが
行われるのである。たとえば、電子工学の技術者
が作成する文章には「乃」とか「桟」（サン）と
かは殆んど含まれておらず、このような文字を誤
認識しても、略字を認識できる方が遥かに効果が
ある。 The number of character types that each person actually uses as handwritten characters is relatively limited. For example, in a character recognition device that targets kanji, the number of character codes in the recognition dictionary 4 is about 3000, which is the JIS first level.
Even if a 3000-character character code were arranged in the recognition result column of the recognition result correction table, about 800 characters would be used if limited to each person who entered it, and the recognition result correction table would be about 800 characters. The table also reads and writes the 800 or so character types mentioned above. For example, sentences created by electronics engineers almost never contain characters such as ``ノ'' or ``san''. is far more effective.

また、たとえば、「桟」（サン）を認識辞書４か
ら除外しても、「機」に対する略字「桟」を記入
した場合、これが「機」と認識される確率は少
く、他の文字に認識されることが多いので、認識
手段３では「桟」（サン）と認識させておいて、
制御手段５により「機」に修正した方がよい。 For example, even if you exclude ``san'' from the recognition dictionary 4, if you write the abbreviation for ``ki'', the probability that this will be recognized as ``ki'' is low, and other characters will be recognized. Since this is often the case, recognition means 3 recognizes it as "san".
It is better to use the control means 5 to correct it to "machine".

文字認識に頻度を利用することは従来の装置に
おいても行われたが、この発明における頻度の利
用とは本質的に異るものである。たとえば、従来
の装置では、認識手段から複数の認識候補文字の
文字コードとその文字パターンの類似度とを出力
し、この類似度と当該文字の使用頻度との関数に
よつて最も確からしい文字コードを決定するもの
であり、認識候補文字中に正しい文字が含まれて
いない場合は、正しく認識されることがない。 Although the use of frequency in character recognition has been done in conventional devices, the use of frequency in this invention is essentially different. For example, in conventional devices, the recognition means outputs the character codes of multiple recognition candidate characters and the degree of similarity of their character patterns, and the most likely character code is determined by a function of this degree of similarity and the frequency of use of the character. If the correct characters are not included in the recognition candidate characters, they will not be recognized correctly.

また、かな漢字変換では、同音語の選択基準の
１つとして使用頻度が利用されることがあるが、
これは、同一読みから選択され得る文字の種類に
変化を生ずることはなく、この発明の認識結果修
正テーブルように最終認識結果が変化するように
利用されるのとは異なる。 In addition, in Kana-Kanji conversion, frequency of use is sometimes used as one of the criteria for selecting homophones;
This does not cause a change in the types of characters that can be selected from the same reading, and is different from the recognition result correction table of the present invention, which is used to change the final recognition result.

なお、第６図において認識結果らんの「同」１
３ｂの最終認識結果の頻度が「同」６５、「問」
６０、「間」５５で３者の間の混同は避け難いが、
これはこの発明に係る文字認識装置の限界であつ
て、マンマシンインタフエイスを介して修正しな
ければならぬ問題である。 In addition, in Figure 6, the recognition result is "same" 1.
The frequency of the final recognition result of 3b is "same" 65, "question"
60. Although it is difficult to avoid confusion between the three parties in "between" 55,
This is a limitation of the character recognition device according to the present invention, and is a problem that must be corrected through the man-machine interface.

なお、以上の説明で、文字表示手段７を観察し
文字入力手段８を操作する操作者は、用紙１へ文
字を記入した記入者と同一人と想定して説明した
が、操作者と記入者が同一人でなくても差支えな
いことは申すまでもない。 In the above explanation, it is assumed that the operator who observes the character display means 7 and operates the character input means 8 is the same person who wrote the characters on the form 1. Needless to say, there is no problem even if they are not the same person.

更に、記入者別の認識結果修正テーブルが、そ
れぞれ別のフレキシブルシートに格納されるとし
て説明したが、すべての記入者に対する認識結果
修正テーブルを記憶手段６にまとめて格納し、記
入者に対応した識別コード等を文字入力手段８か
ら入力することによつて当該記入者に対応する認
識結果修正テーブルを選択し、又は用紙１上の所
定の位置に記入者を識別する文字又は識別コード
を記入しておき、これを認識して当該記入者に対
応する認識結果修正テーブルを選択することがで
きる。認識結果修正テーブルは文字コードを記憶
するだけでよく、比較的小容量の記憶装置に格納
することができる。 Furthermore, although it has been explained that the recognition result correction tables for each filler are stored in separate flexible sheets, it is possible to store the recognition result correction tables for all fillers in the storage means 6 and to correspond to each filler. By inputting an identification code etc. from the character input means 8, the recognition result correction table corresponding to the person filling in the form is selected, or a character or identification code identifying the filler is written in a predetermined position on the sheet 1. By recognizing this, it is possible to select the recognition result correction table corresponding to the person who filled in the information. The recognition result correction table only needs to store character codes, and can be stored in a relatively small capacity storage device.

記入者の同定が不可能な場合は、認識手段３の
出力をそのまま制御手段５を介し文字表示手段７
に表示すればよい。 If it is not possible to identify the person who filled in the information, the output of the recognition means 3 is sent directly to the character display means 7 via the control means 5.
It should be displayed in .

また、以上の説明では、認識手段３からは、１
つの文字パターンに対応して１つの文字コードだ
けを出力したが、これを、１つの文字パターンに
対応して複数の文字コードを、当該文字コードの
基準パターンに対する類似度と共に出力してもよ
いとし、制御手段５においても各文字コードに対
する類似度をも参照して最終認識結果を決定する
こともできる。 In addition, in the above explanation, from the recognition means 3, 1
Although only one character code was output in response to one character pattern, it is also possible to output multiple character codes in response to one character pattern along with the degree of similarity of the character code to the reference pattern. The control means 5 can also refer to the degree of similarity for each character code to determine the final recognition result.

更には、記入者がタブレツト等に手書きする文
字を実時間で認識するオンライン認識にも適用す
ることができる。オンライン認識では記入者毎の
特性が表われやすい筆順情報を利用することが多
く、この発明の効果も発揮されやすい。 Furthermore, it can be applied to online recognition in which characters handwritten by a person writing on a tablet or the like are recognized in real time. Online recognition often uses stroke order information that easily shows the characteristics of each person writing the information, and the effects of this invention are also likely to be exhibited.

〔Effect of the invention〕

以上のようにこの発明によれば、記入者毎に備
えられる認識結果修正テーブルによつて最終認識
結果を決定し、かつ認識結果修正テーブルは使用
するたびに増補改訂が行われるので、各記入者の
記入した文字を正確に文字認識することができ
る。 As described above, according to the present invention, the final recognition result is determined by the recognition result correction table prepared for each filler, and the recognition result correction table is expanded and revised each time it is used. It is possible to accurately recognize the written characters.

[Brief explanation of the drawing]

第１図はこの発明の一実施例を示すブロツク
図、第２図はこの発明の動作による認識結果の一
例を示す図、第３図、第４図、第５図、第６図は
それぞれこの発明の認識結果修正テーブルの一部
の内容を示す図、第７図は第１図の文字表示手段
の表示例を示す図である。１……用紙、２……走査手段、３……認識手
段、４……認識辞書、５……制御手段、６……記
憶手段、７……文字表示手段、８……文字入力手
段。 FIG. 1 is a block diagram showing an embodiment of this invention, FIG. 2 is a diagram showing an example of recognition results obtained by the operation of this invention, and FIGS. FIG. 7 is a diagram showing a part of the contents of the recognition result correction table of the invention, and FIG. 7 is a diagram showing an example of display on the character display means of FIG. 1. DESCRIPTION OF SYMBOLS 1...Paper, 2...Scanning means, 3...Recognition means, 4...Recognition dictionary, 5...Control means, 6...Storage means, 7...Character display means, 8...Character input means.

Claims

[Scope of Claims] 1. A scanning means for photoelectrically converting characters, etc. written on paper, etc. and outputting an electric signal representing a character pattern of the characters, etc., a standard pattern of characters to be recognized, etc., and each standard pattern. a recognition dictionary that stores correspondences with character codes of characters, etc. corresponding to characters, and compares the character pattern output from the scanning means with a reference pattern in the recognition dictionary to match the character pattern output from the scanning means. A recognition means for determining character codes, which arranges all the character codes in the recognition dictionary into a recognition result list, and writes a plurality of character codes and numerical values indicating their frequency of appearance for each character code in the recognition result list. A recognition result correction table is provided for each person who may fill in the above form, etc., and the recognition result correction table is provided with the final recognition result and frequency information of the recognition result correction table in the initial state. Means for writing the character code of the corresponding recognition result RAN in the first position of the frequency RAN and writing the number 1 as the frequency, and selecting and using the recognition result correction table corresponding to the person filling out the form, etc. for each person filling out the form, etc.; Input the character code determined by the recognition means, refer to the above recognition result correction table, and calculate the final recognition result corresponding to the recognition result RAN in which the same character code as the input character code is stored, and the first character of the frequency RAN. A control means for reading out a code, a character display means for converting the character code read by this control means into a dot pattern of the character and displaying it on a display screen, and a character display means for observing the dot pattern of the character displayed on the character display means. and, if necessary, a character input means for inputting the character code of the character to be corrected and correcting the characters displayed on the character display means, after the necessary corrections have been made by this character input means. Write to the recognition result correction table according to the display content of the character display means, and in response to the recognition result, write the character code of the displayed character on the character display means to the final recognition result and the frequency. If the character code is already stored, add the numerical value 1 to the frequency corresponding to the character code, and if the character code of the character displayed on the character display means does not match the frequency with the final recognition result, add the value 1 to the frequency corresponding to the character code. Write the character code and the number 1 as the frequency in the empty space in the result and frequency column, and in the same final recognition result column, place the character code with the highest frequency at the beginning, and the character code with the highest frequency A character recognition device comprising a recognition result correction table augmentation and revision means for arranging a character code that has recently reached a certain frequency among the plural types at the beginning.