JPS5882373A

JPS5882373A - Online character recognizing method

Info

Publication number: JPS5882373A
Application number: JP56180363A
Authority: JP
Inventors: Shuzo Owaku; 大和久　修三; Akio Nagano; 長野　昭夫; Katsuhide Tanoshima; 田野島　克秀; 「まん」木　正義; Masayoshi Yurugi
Original assignee: Oki Electric Industry Co Ltd
Current assignee: Oki Electric Industry Co Ltd
Priority date: 1981-11-12
Filing date: 1981-11-12
Publication date: 1983-05-17
Also published as: JPH0258664B2

Abstract

PURPOSE: To reduce the storage capacity of a recognition dictionary and to improve the recognition rate by subdividing KANJI (Chinese characters), which are difficult to recognize directly, recognizing the resulting pseudo radicals, and identifying each KANJI as a set of pseudo radicals. CONSTITUTION: Stroke information on a handwritten character is extracted by a tablet 1 and compared by a recognition part 2 with the contents of a pseudo radical dictionary 3, which stores the features of the partial-set patterns obtained by subdividing KANJI and the features of the whole patterns of characters other than KANJI; the comparison result is stored in an input register 4. A selecting circuit 5 compares the contents of the input register 4 with a character dictionary 6, which stores the character code corresponding to each set of KANJI partial sets and to each character other than KANJI, and thereby identifies the character. Because KANJI are identified by comparison for every partial set, the storage capacity required for the recognition dictionary is reduced, and the recognition rate is also improved because what is recognized is at the KANA level, i.e. a pseudo radical.

Description

DETAILED DESCRIPTION OF THE INVENTION

The present invention relates to a recognition method for an online handwritten character recognition device used as an input device for information processing equipment.

Conventional handwritten-character-input word processors use online handwritten character recognition technology to recognize handwritten characters. When digits, alphabetic letters, hiragana, and kanji are to be recognized, for example, the 10 digits, 26 alphabetic letters, and 46 hiragana, together with the voiced and semi-voiced sound marks and other symbols, amount to about 200 characters, and even when the kanji are limited to the JIS C 6226 level-1 kanji set there are 2,965 of them.

Even if the kanji are restricted to the Joyo kanji there are 1,945 of them, so the total exceeds 2,000 characters.

Handwritten-input Japanese word processors that recognize a little over 2,000 characters in total, with the kanji limited to the Toyo kanji, have been announced. Their recognition processing, however, attempts to recognize kanji directly, for example by the K-L (Karhunen-Loève) expansion method, and has the drawback that the amount of hardware becomes too large. More generally, and not only in that example, handwritten kanji are recognized directly with the same algorithm as digits, alphabetic letters, hiragana, and so on. The amount of computation needed for recognition therefore becomes enormous and processing takes a long time, and shortening the processing time requires still more hardware. From the standpoint of practical use this is a serious drawback.

The object of the present invention is to eliminate these drawbacks. Kanji are subdivided, the subsets of each subdivided character are recognized, and a kanji is identified as a collection of the recognized subsets. This equivalently reduces the number of kanji that must be recognized and also reduces the storage capacity required for the dictionary. Because the characters to be recognized can be registered in the dictionary as sequences of these subsets, the growth in storage capacity that accompanies an increase in the number of recognizable characters is kept small.

An embodiment is described below with reference to the drawings. FIG. 1 is a block diagram showing one embodiment of the present invention, in which 1 is a tablet; 2 is a recognition unit; 3 is a pseudo radical dictionary that stores the features and pseudo radical codes of the subsets of characters obtained by subdividing kanji and of characters other than kanji (hereinafter collectively called pseudo radicals); 4 is an input register that stores the pseudo radical codes recognized by recognition unit 2; 5 is a selection circuit that selects a character from the one or more pseudo radical codes stored in input register 4; and 6 is a character dictionary that stores pseudo radical codes and character codes. Stroke information input from tablet 1 is sent to recognition unit 2.

Recognition unit 2 recognizes each pseudo radical by a well-known method suited to online handwriting, such as the stroke analysis method or the K-L expansion method. For this recognition it uses pseudo radical dictionary 3, which stores the features and codes of the pseudo radicals. The output of recognition unit 2 takes the form of a pseudo radical code, which is output to and stored in input register 4 each time a pseudo radical is recognized. The pseudo radical codes stored in input register 4 are output sequentially and input to selection circuit 5. Selection circuit 5 selects a character from the contents of character dictionary 6 and the pseudo radical codes output by input register 4, and outputs the resulting character code.

FIG. 2 shows an example of pseudo radical dictionary 3. The hexadecimal numbers running from 000 are pseudo radical codes, and each pseudo radical is shown to the right of its code. In an actual dictionary, the position of each pseudo radical would hold feature data based on the recognition algorithm of recognition unit 2; here the pseudo radicals themselves are shown for ease of explanation. Characters other than kanji, such as hiragana, digits, and alphabetic letters, are stored in pseudo radical dictionary 3 directly, without subdivision.
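The lookup that recognition unit 2 performs against pseudo radical dictionary 3 can be pictured with a minimal sketch. The feature keys below are hypothetical placeholders (the source says only that real entries hold algorithm-specific feature data), and the function name is an assumption made for illustration; the codes 176 and 0FB and the undefined code * come from the description.

```python
# Hypothetical sketch of pseudo radical dictionary 3 (FIG. 2).
# Real entries hold feature data tied to the recognition algorithm;
# plain strings stand in for that data here.
UNDEFINED = "*"  # the undefined code used in the description

# feature key -> pseudo radical code (three hexadecimal digits, from 000)
PSEUDO_RADICAL_DICTIONARY = {
    "features-of-立": 0x176,
    "features-of-日": 0x0FB,
}


def lookup_pseudo_radical(features):
    """Return the code as a 3-digit hex string, or '*' if unregistered."""
    code = PSEUDO_RADICAL_DICTIONARY.get(features)
    return UNDEFINED if code is None else f"{code:03X}"


print(lookup_pseudo_radical("features-of-日"))        # registered entry
print(lookup_pseudo_radical("features-of-unknown"))   # unregistered entry
```

An unregistered shape yields the undefined code rather than an error, which mirrors how the walk-through below keeps unrecognized strokes in the register until later strokes complete them.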

FIG. 3 shows the details of input register 4. Reference numeral 9 is the output from recognition unit 2, 10 to 17 are the I0 register through the I7 register in input register 4, 18 is a switching circuit, and 19 is the output of input register 4.
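As a rough software analogue of FIG. 3, the sketch below models input register 4 as eight slots and switching circuit 18 as a generator that emits the stored codes in order. The class and method names are assumptions introduced here, not terms from the patent, and the hardware timing is not modeled.

```python
class InputRegister:
    """Hypothetical model of input register 4 with registers I0..I7."""

    SIZE = 8  # I0 register 10 through I7 register 17

    def __init__(self):
        self.slots = [None] * self.SIZE  # None means the slot is empty

    def store(self, index, code):
        """Store a pseudo radical code (or '*') in register I<index>."""
        self.slots[index] = code

    def reset(self, index):
        """Clear register I<index>."""
        self.slots[index] = None

    def read_out(self):
        """Model of switching circuit 18: emit stored codes from I0 upward."""
        for code in self.slots:
            if code is not None:
                yield code


register = InputRegister()
for i, code in enumerate([0x176, 0x0FB, 0x045, 0x065]):
    register.store(i, code)
print([f"{c:03X}" for c in register.read_out()])
```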

FIG. 4 shows part of character dictionary 6. The third row of FIG. 4 indicates that the pseudo radical 立, with pseudo radical code 176, and the pseudo radical 日, with pseudo radical code 0FB, together form the character 音, and that the character code of 音 in JIS C 6226 is 323B. The characters shown in parentheses in FIG. 4 are included for ease of explanation; the actual dictionary consists only of pseudo radical codes and character codes.
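Character dictionary 6 can be sketched as a mapping from a sequence of pseudo radical codes to a character code. The sketch below is illustrative only: the two entries use the codes given in the description for 音 and 彰, with 323B assumed to be the code FIG. 4 lists for 音 (it is the JIS X 0208 code of that character), and the helper `jis_to_char` is an addition for checking the codes, relying on the fact that EUC-JP encodes a JIS X 0208 code by setting the high bit of each byte.

```python
# Hypothetical sketch of character dictionary 6 (FIG. 4):
# sequence of pseudo radical codes -> JIS C 6226 character code.
CHARACTER_DICTIONARY = {
    (0x176, 0x0FB): 0x323B,                # 立 + 日 -> 音
    (0x176, 0x0FB, 0x045, 0x065): 0x3E34,  # four pseudo radicals -> 彰
}


def select_character(codes):
    """Return the character code for a code sequence, or None if absent."""
    return CHARACTER_DICTIONARY.get(tuple(codes))


def jis_to_char(jis_code):
    """Decode a two-byte JIS code via EUC-JP (high bit set on each byte)."""
    raw = bytes([(jis_code >> 8) | 0x80, (jis_code & 0xFF) | 0x80])
    return raw.decode("euc_jp")


code = select_character([0x176, 0x0FB])
print(f"{code:04X}", jis_to_char(code))
```

Storing only codes on both sides is what keeps each entry small: adding a character costs one short code sequence and one character code, with no per-character feature data.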

FIG. 5 illustrates the processing according to the present invention when the character 彰 is input, showing the pseudo radical codes entered into the I0 register 10 through the I7 register 17 of input register 4.

The online handwritten character recognition method of the present invention is described in detail below, mainly with reference to FIG. 5 and taking the character 彰 as an example.

First, at timing T1, the operator inputs the first stroke on tablet 1. The stroke is output to recognition unit 2, which uses pseudo radical dictionary 3 (hereinafter, dictionary 3) to test whether it is registered as a pseudo radical. Because this stroke is not registered in dictionary 3, the undefined code * is registered in the I0 register 10.

Next, at timing T2, the second stroke is input from tablet 1. Recognition unit 2 tests whether the pseudo radical formed by this stroke together with the undefined stroke from timing T1 is in dictionary 3. As shown in FIG. 2, this pseudo radical is registered with pseudo radical code 050, so the code 050 is set in the I0 register 10.

The stroke input at timing T3 is not registered in dictionary 3 as a pseudo radical, so the I0 register 10 is left as it is and the undefined code * is registered in the I1 register 11.

The stroke input at timing T4 is not registered in dictionary 3 by itself, but combined with the undefined stroke from timing T3 it forms a pseudo radical that is registered in dictionary 3 with pseudo radical code 014. The undefined code in the I1 register 11 is therefore erased and the code 014 is set in its place. Dictionary 3 is also used to test whether pseudo radical codes 050 and 014 together form a new pseudo radical, but the combined shape does not exist in dictionary 3 as an independent pseudo radical, so the contents of the I0 register 10 and the I1 register 11 are kept unchanged.

At timing T5 the stroke 一 is input. Dictionary 3 shows it to be the pseudo radical with code 004, so 004 is set in the I2 register 12. Recognition unit 2 then tests whether pseudo radical codes are registered in dictionary 3 for the shape formed with the most recent strokes and for 立, the shape formed by all the strokes so far. That is, the test is performed so that all the strokes in the character are expressed with the minimum number of pseudo radical codes. In this case the former is not registered as a pseudo radical, but 立 is registered in the dictionary with pseudo radical code 176. The I0 register 10, the I1 register 11, and the I2 register 12 are therefore reset, and 176 is registered in the I0 register 10. The character 立 is thus represented as the single pseudo radical with pseudo radical code 176 shown in FIG. 2.

In the same way, as shown in FIG. 5, the character 彰 is finally recognized as the character represented by pseudo radical codes 176, 0FB, 045, and 065. Just as at timing T5 the input consisting of three pseudo radical codes was re-tested and identified as the single pseudo radical code 176, at timing T9 the pseudo radical 日, with code 0FB, is identified from the pseudo radical with code 021 and the strokes entered after it, and at timings T11, T13, and T14 a single pseudo radical code is likewise identified from input that had been identified as two pseudo radical codes.
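The stroke-by-stroke procedure walked through above can be sketched in a few lines. This is a simplified model, not the patent's circuit: strokes are represented by hypothetical labels "a" to "e" standing for the five strokes of 立, the dictionary contains only the entries the walk-through mentions (050, 014, 004, 176), and after each stroke the sketch tries the longest run of trailing register entries first, whereas the description does not spell out the exact order of its tests.

```python
# Hypothetical stroke labels "a".."e" stand for the five strokes of 立.
PSEUDO_RADICALS = {
    ("a", "b"): 0x050,                 # strokes 1-2
    ("c", "d"): 0x014,                 # strokes 3-4
    ("e",): 0x004,                     # stroke 5 alone
    ("a", "b", "c", "d", "e"): 0x176,  # all five strokes: 立
}


def add_stroke(segments, stroke):
    """Append one stroke, then merge trailing segments if they form a radical.

    segments is a list of (strokes, code) pairs; code None means undefined.
    """
    segments = segments + [((stroke,), PSEUDO_RADICALS.get((stroke,)))]
    # Try the longest run of trailing segments first (fewest codes overall).
    for start in range(len(segments) - 1):
        merged = tuple(s for strokes, _ in segments[start:] for s in strokes)
        code = PSEUDO_RADICALS.get(merged)
        if code is not None:
            return segments[:start] + [(merged, code)]
    return segments


def register_contents(segments):
    """Render the register as in FIG. 5: hex codes, '*' for undefined."""
    return ["*" if code is None else f"{code:03X}" for _, code in segments]


segments = []
for timing, stroke in enumerate("abcde", start=1):
    segments = add_stroke(segments, stroke)
    print(f"T{timing}:", register_contents(segments))
```

Under these assumptions the register passes through *, 050, (050, *), (050, 014) and finally 176, matching the states described for timings T1 to T5.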

In this way the pseudo radical codes are tested by a kind of longest-match method. With a longest-match method it is usual to look for a match only after the entire input has been entered: all the input strokes are tested, and if they are not recognized as a pseudo radical, the test is repeated with the last stroke removed. Here, however, the test is performed in input order, stroke by stroke, because the number of pseudo radicals registered in dictionary 3 is only a little over 600 even including digits, alphabetic letters, hiragana, and symbols, which is not a large number, and because people input characters slowly.
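For contrast with the stroke-by-stroke test, the conventional batch form of longest match mentioned here (test everything, then drop the last stroke and retry) might look like the following sketch. The stroke labels and the assignment of strokes to codes 021 and 0FB are hypothetical; only the code values themselves appear in the description.

```python
# Hypothetical stroke labels: "a".."e" for 立, "f".."i" for 日.
DICTIONARY = {
    ("a", "b", "c", "d", "e"): 0x176,  # 立
    ("f", "g"): 0x021,                 # assumed partial shape inside 日
    ("f", "g", "h", "i"): 0x0FB,       # 日
}


def batch_longest_match(strokes, dictionary):
    """Segment a complete stroke list, always taking the longest match."""
    codes = []
    pos = 0
    while pos < len(strokes):
        # Start from all remaining strokes and drop the last one each time.
        for end in range(len(strokes), pos, -1):
            code = dictionary.get(tuple(strokes[pos:end]))
            if code is not None:
                codes.append(code)
                pos = end
                break
        else:
            raise ValueError(f"no pseudo radical starts at stroke {pos}")
    return codes


print([f"{c:03X}" for c in batch_longest_match(list("abcdefghi"), DICTIONARY)])
```

The batch form needs the whole character before it can start, while the incremental form spreads the same dictionary lookups across the time the writer spends entering strokes.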

As described above, inputting 彰 from tablet 1 causes the pseudo radical codes 176, 0FB, 045, and 065 to be stored in the I0 register 10 through the I3 register 13 of input register 4. The contents of the I0 register 10 through the I7 register 17 are led out sequentially to output 19 by switching circuit 18 and input to selection circuit 5. Selection circuit 5 uses the input pseudo radical codes and character dictionary 6, shown in FIG. 4, to select the JIS C 6226 character code.

That is, because the pseudo radical codes stored in the I0 register 10 through the I7 register 17 of input register 4 are 176, 0FB, 045, and 065, selection circuit 5 searches character dictionary 6 and finds, as shown in FIG. 4, that the character whose pseudo radical codes are 176, 0FB, 045, and 065 is the kanji 彰, with character code 3E34.

Selection circuit 5 thus outputs the JIS C 6226 character code 3E34, and the handwritten character input from tablet 1 is recognized as the kanji 彰.

As described in detail above, the embodiment shows a method in which a kanji is subdivided, the subsets of the subdivided character, named pseudo radicals, are recognized, and the kanji is identified as a collection of the recognized pseudo radicals. Recognizing the pseudo radicals requires only a recognition unit 2 with a simple algorithm and a pseudo radical dictionary 3 containing very few pseudo radicals compared with the number of kanji. For example, the pseudo radicals needed for digits, alphabetic letters, hiragana, symbols, and the 2,965 characters of the JIS C 6226 level-1 kanji set number a little over 600, of which a little over 400 serve the 2,965 kanji. The present invention thus has the effect of equivalently reducing the number of kanji, 2,965, at recognition time only. It will also be readily understood by those skilled in the art that a simple algorithm such as the well-known stroke analysis method suffices for recognition unit 2 to recognize the contents of pseudo radical dictionary 3 shown in FIG. 2.

Furthermore, character dictionary 6, which is used to identify characters including kanji as collections of recognized pseudo radicals, can be built from nothing but pseudo radical codes and JIS C 6226 character codes, as shown in FIG. 4. It is therefore another advantage of the invention that memory grows only very slightly even when the number of characters, kanji included, increases. Even taken together, pseudo radical dictionary 3 and character dictionary 6 are very small, unlike conventional dictionaries, which were large because they recorded the features of characters, kanji included, directly. Online handwritten character recognition can therefore be provided easily even when its scope is extended to the JIS C 6226 level-1 kanji set of 2,965 kanji, and an input device suitable for information processing in general can be provided at low cost.

The embodiment above described the basic elements. A better online handwritten character recognition method can be provided by implementing various improvements, which are described below.

First, the embodiment above provides only one standard pseudo radical dictionary 3. If, in addition to the standard dictionary, one or more pseudo radical correlation dictionaries are provided for each individual user, so that the pseudo radicals of characters containing hard-to-recognize pseudo radicals can be registered later, the recognition rate is improved further and the recognition algorithm itself can be simpler.

Second, in the embodiment above the contents of character dictionary 6 are simply combinations of pseudo radical codes and a character code. As the example of FIG. 4 makes clear, however, the same pseudo radical 立 may form a character by itself, may sit at the top of the character as in 妾, 音, 章, 意, and 童, or may sit at the upper left as in 彰 and 韻. The embodiment treats all of these occurrences of 立 identically. This causes no problem with a character set of about the size used in the embodiment, but if the number of characters is to be increased further, the recognition rate can be improved by including in character dictionary 6 information on the position of 立 within the character. The glyph composition given in item 4 of the JIS C 6226 glyph index is sufficient as this position information.
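One way to picture this second improvement is to extend each dictionary key from a bare pseudo radical code to a (code, position) pair. The position labels below are invented for illustration; the text proposes the glyph composition classes of the JIS C 6226 glyph index instead, and 323B is assumed here as the character code of 音.

```python
# Hypothetical position labels; the patent suggests using the glyph
# composition classes of the JIS C 6226 glyph index for this purpose.
CHARACTER_DICTIONARY_WITH_POSITION = {
    ((0x176, "top"), (0x0FB, "bottom")): 0x323B,  # 音: 立 at the top
    ((0x176, "upper-left"), (0x0FB, "middle-left"),
     (0x045, "lower-left"), (0x065, "right")): 0x3E34,  # 彰: 立 at upper left
}


def select_character(parts):
    """Look up a character by its (pseudo radical code, position) sequence."""
    return CHARACTER_DICTIONARY_WITH_POSITION.get(tuple(parts))


hit = select_character([(0x176, "top"), (0x0FB, "bottom")])
miss = select_character([(0x176, "upper-left"), (0x0FB, "bottom")])
print(f"{hit:04X}", miss)
```

With positions in the key, a candidate whose pseudo radicals match but whose layout does not is rejected, which is the extra discrimination the text expects to matter as the character set grows.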

Third, in the embodiment above, as explained with reference to FIG. 5, each time a new stroke is input from tablet 1, the strokes are re-examined back to the first stroke to test whether the strokes up to and including the new one form a single pseudo radical. For example, the three pseudo radicals with codes 050, 014, and 004 together form 立, which is itself a pseudo radical; its code, 176, was obtained by consulting pseudo radical dictionary 3 with the strokes of 立.

However, if a pseudo radical correlation dictionary that indicates the correlation between pseudo radicals, such as (pseudo radical 050) + (pseudo radical 014) + (pseudo radical 004) = (pseudo radical 176), is provided as a dictionary referred to by recognition unit 2 in addition to pseudo radical dictionary 3, recognition processing becomes much faster.
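A pseudo radical correlation dictionary of this kind can be sketched as a table keyed by sequences of already-recognized codes, so that merging needs no second pass over the raw strokes. The single entry below is the relation given in the text; the function name and the choice to test the longest trailing run first are assumptions for illustration.

```python
# Correlation dictionary: sequence of pseudo radical codes -> merged code.
# The single entry is the relation stated in the description.
CORRELATION_DICTIONARY = {
    (0x050, 0x014, 0x004): 0x176,
}


def merge_with_correlation(codes):
    """Replace the longest trailing run of codes that has a merged code."""
    for start in range(len(codes) - 1):
        merged = CORRELATION_DICTIONARY.get(tuple(codes[start:]))
        if merged is not None:
            return codes[:start] + [merged]
    return codes


print([f"{c:03X}" for c in merge_with_correlation([0x050, 0x014, 0x004])])
```

Because the keys are short code tuples rather than stroke features, each test is a table lookup, which is consistent with the speed-up the text claims.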

As described in detail above, the present invention subdivides kanji, which are difficult to recognize, recognizes the resulting pseudo radicals, and identifies each kanji as a collection of pseudo radicals. The storage capacity required for the recognition dictionary is therefore very small, and the growth in storage capacity as the number of recognizable characters increases is kept very low. Moreover, because what is recognized is a pseudo radical, which as explained above is at the kana level of complexity, the recognition rate itself can be kept high. A favorable online handwritten character recognition method can thus be provided at low cost.

[Brief explanation of drawings]

FIG. 1 is a block diagram of one embodiment of the present invention, FIG. 2 shows an example of the pseudo radical dictionary, FIG. 3 is a detailed diagram of the input register, FIG. 4 is a partial view of the character dictionary, and FIG. 5 illustrates the recognition method according to the present invention.

1: tablet; 2: recognition unit; 3: pseudo radical dictionary; 4: input register; 5: selection circuit; 6: character dictionary; 9: output from recognition unit 2; 10 to 17: the I0 register through the I7 register in input register 4; 18: switching circuit; 19: output of input register 4.

Patent applicant: Oki Electric Industry Co., Ltd. Patent attorney for the applicant: Keiichi Yamamoto (山本 恵一)

Claims

[Claims]

An online character recognition method for identifying handwritten characters using a tablet that extracts stroke information of handwritten characters and a dictionary that stores character features, by comparing the information from the tablet with the features in the dictionary, the method being characterized by providing: a first dictionary that stores the features of subset patterns obtained by subdividing kanji and of the whole patterns of characters other than kanji; a second dictionary that stores character codes corresponding to sets of kanji subsets and to characters other than kanji; a recognition unit that compares the information from the tablet with the output of the first dictionary; an input register that stores the recognition result; and a selection circuit that compares the contents of the input register with the second dictionary to identify the character; whereby kanji are compared and identified subset by subset.