JPH0573540A

JPH0573540A - Kana-Kanji converter

Info

Publication number: JPH0573540A
Application number: JP3236297A
Authority: JP
Inventors: Yoko Oike; 陽子大池
Original assignee: Brother Industries Ltd
Current assignee: Brother Industries Ltd
Priority date: 1991-09-17
Filing date: 1991-09-17
Publication date: 1993-03-26

Abstract

(57)【要約】【目的】規則辞書を用いるかな漢字変換装置におい
て、規則辞書を作成する労力低減と、規則辞書のメモリ
容量低減を図り変換効率を向上する。【構成】かな読み文字列は入力装置から入力され、か
な漢字変換プログラムは単語の読みに対する表記を記憶
した基本辞書と品詞情報部と接続テーブルとを参照し、
入力されたかな読み文字列をかな漢字変換する。規則検
索プログラムは、前記基本辞書の単語列のパターンとそ
の書き換え情報を持つ規則を記憶した規則辞書と品詞情
報に対応した品詞ＩＤをまとめた品詞情報部とを参照し
前記規則辞書中の規則と一致するものを検索する。規則
書き換えプログラムは、一致した規則が検索されたと
き、該当のかな漢字変換結果の内容を規則辞書に基づい
て書き換える。 (57) [Abstract] [Purpose] In a kana-kanji conversion device that uses a rule dictionary, it is possible to reduce the labor of creating the rule dictionary and the memory capacity of the rule dictionary to improve the conversion efficiency. [Structure] A kana-reading character string is input from an input device, and a kana-kanji conversion program refers to a basic dictionary storing a notation for reading a word, a part-of-speech information section, and a connection table,
Converts the input kana-reading character string into kana-kanji. The rule search program refers to a rule dictionary that stores a pattern of a word string of the basic dictionary and rules having rewriting information thereof, and a part-of-speech information section that collects a part-of-speech ID corresponding to the part-of-speech information, and refers to the rules in the rule dictionary. Search for a match. The rule rewriting program rewrites the content of the corresponding kana-kanji conversion result based on the rule dictionary when a matching rule is retrieved.

Description

Detailed Description of the Invention

【０００１】[0001]

【産業上の利用分野】本発明は、日本語ワードプロセッ
サ等のかな漢字変換処理装置に関する。BACKGROUND OF THE INVENTION 1. Field of the Invention The present invention relates to a kana-kanji conversion processing device such as a Japanese word processor.

【０００２】[0002]

【従来の技術】従来、この種のかな漢字変換装置として
は、特開平３−１４２６５８号公報で開示されているよ
うに、かなとそれに対応する漢字等の表記が一対になっ
て記憶されている基本辞書と、特定の複数の単語列にお
ける表記方法が各事例別に記憶されている規則辞書とを
備えているものが知られている。2. Description of the Related Art Conventionally, as a kana-kanji conversion device of this kind, as disclosed in Japanese Patent Laid-Open No. 3-142658, a kana and a corresponding kanji character are basically stored as a pair. It is known to have a dictionary and a rule dictionary in which a notation method in a plurality of specific word strings is stored for each case.

【０００３】このようなかな漢字変換装置では、まず、
基本辞書を参照して通常のかな漢字変換が実行され、変
換結果記憶部にそのかな漢字変換の結果が記憶される。
そして、記憶されたかな漢字変換結果と規則辞書中のパ
ターンとが一致すれば、その規則辞書の書き換え情報に
従ってかな漢字変換結果の表記を書き換え、表示装置に
より表示するようにしていた。また、このような一連の
作用を規則変換と称していた。In such a kana-kanji conversion device, first,
Normal kana-kanji conversion is executed with reference to the basic dictionary, and the result of the kana-kanji conversion is stored in the conversion result storage section.
Then, if the stored kana-kanji conversion result matches the pattern in the rule dictionary, the kana-kanji conversion result is rewritten according to the rewriting information of the rule dictionary and displayed on the display device. Moreover, such a series of actions was called rule conversion.

【０００４】例えば、「あかちゃんがたつ」と入力した
とき、まず、基本辞書を参照した通常のかな漢字変換を
行い、「赤ちゃんが建つ」と変換される。次に、規則辞
書を検索し該当するパターンがあれば、その書き換え情
報に従って「赤ちゃんが立つ」と書き換えていた。For example, when "Aka-chan ga tatsu" is input, first, the normal kana-kanji conversion with reference to the basic dictionary is performed, and "babies build" is converted. Next, he searched the rule dictionary and, if there was a corresponding pattern, rewrites it as "baby stands" according to the rewriting information.

【０００５】[0005]

【発明が解決しようとする課題】しかしながら、上記方
法では、例えば、「赤ちゃんがゆっくり建つ」、「赤ち
ゃんがしっかり建つ」などのような「赤ちゃんが＋副詞
＋建つ」という誤変換結果に対して、誤変換された「建
つ」を正しい表記の「立つ」に書き換えるための規則
を、それぞれの副詞ごとに別々にパターンを作って規則
辞書に記憶していた。従って、副詞が異なるだけ、同じ
ようなパターンを多く記憶する必要があり、このため、
規則辞書を作るための作業量も多くなり、さらに規則辞
書のメモリ容量も大きくなっていた。However, in the above method, for example, for a mistranslation result of "baby + adverb + build" such as "baby slowly builds" or "baby firmly builds", The rules for rewriting the wrongly converted "standing" into the correct notation "standing" were created in separate patterns for each adverb and stored in the rule dictionary. Therefore, it is necessary to memorize many similar patterns as the adverbs are different.
The amount of work for creating the rule dictionary has increased, and the memory capacity of the rule dictionary has also increased.

【０００６】本発明は、上記問題点を解決するためにな
されたものであり、規則辞書に任意の品詞を表す品詞情
報を記憶させ、ひとつの規則で文の構造が同じいくつも
のパターンに対応させることにより、規則辞書の容量を
少なくし、かつ正確な変換が可能なかな漢字変換装置を
提供することを目的とする。The present invention has been made in order to solve the above-mentioned problems, and stores part-of-speech information representing an arbitrary part-of-speech in a rule dictionary so that one rule corresponds to several patterns having the same sentence structure. Thus, an object of the present invention is to provide a kana-kanji conversion device capable of reducing the capacity of the rule dictionary and performing accurate conversion.

【０００７】[0007]

【課題を解決するための手段】この問題を解決するため
に本発明のかな漢字変換装置は、図１に示すように、か
な読み文字列を入力するための入力手段と、単語の読み
に対する表記等を記憶した基本辞書と、基本辞書を参照
し、かな漢字変換を行うかな漢字変換手段と、かな漢字
変換手段によるかな漢字変換結果を記憶する変換結果記
憶手段と、かな漢字変換の結果を出力するための出力手
段とを備え、更に、基本辞書の特定の単語の品詞等の情
報と、当該単語を含む単語列のパターンと、その単語列
に対しての書き換え情報とを記憶した規則辞書と、変換
結果記憶手段の内容について規則辞書中から品詞情報を
参照して一致するパターンを検索する規則検索手段と、
規則検索手段により一致したパターンが検索されたと
き、該当の変換結果記憶手段の内容を規則辞書の書き換
え情報に基づいて書き換える規則書き換え手段とを備え
ている。In order to solve this problem, a kana-kanji conversion device of the present invention, as shown in FIG. 1, has an input means for inputting a kana-reading character string, a notation for reading a word, etc. And a kana-kanji conversion means for performing kana-kanji conversion by referring to the basic dictionary, a conversion result storage means for storing kana-kanji conversion results by the kana-kanji conversion means, and an output means for outputting the kana-kanji conversion result. And a rule dictionary storing information such as a part of speech of a specific word in the basic dictionary, a pattern of a word string including the word, and rewriting information for the word string, and a conversion result storage unit. Rule searching means for searching a matching pattern by referring to the part-of-speech information from the rule dictionary for contents,
And a rule rewriting unit that rewrites the contents of the corresponding conversion result storing unit based on the rewriting information of the rule dictionary when the matching pattern is searched by the rule searching unit.

【０００８】[0008]

【作用】上記の構成を有する本発明のかな漢字変換装置
では、まず、入力手段から入力されたかな読み文字列を
かな漢字変換手段にて、基本辞書を参照してかな漢字変
換を行う。そして、変換結果記憶手段はかな漢字変換さ
れた結果を記憶する。規則辞書には、任意の品詞情報を
含む特定の複数の単語列に対応する書き換え規則が記憶
されており、規則検索手段によりかな漢字変換の結果の
中に規則辞書に一致するパターンがあれば、規則書き換
え手段はその規則に従って表記を書き換え、出力手段に
より出力する。In the kana-kanji conversion device of the present invention having the above-described structure, first, the kana-kanji conversion string input from the input means is converted by the kana-kanji conversion means by referring to the basic dictionary. Then, the conversion result storage means stores the result of the kana-kanji conversion. The rule dictionary stores rewriting rules corresponding to a plurality of specific word strings including arbitrary part-of-speech information, and if there is a pattern in the result of kana-kanji conversion by the rule searching means that matches the rule dictionary, the rule The rewriting means rewrites the notation according to the rule and outputs it by the output means.

【０００９】[0009]

【実施例】以下、本発明を具体化した一実施例を図面を
参照して説明する。DETAILED DESCRIPTION OF THE PREFERRED EMBODIMENTS An embodiment of the present invention will be described below with reference to the drawings.

【００１０】まず、図２を参照してかな漢字変換装置全
体の構成を説明する。First, the configuration of the entire kana-kanji conversion device will be described with reference to FIG.

【００１１】かな漢字変換する文字列を入力するための
入力装置１０は、装置全体を制御するためのＣＰＵ（中
央処理装置）１２に接続されている。記憶手段としての
ＲＡＭ２０はＣＰＵ１２に接続されており、ＲＡＭ２０
には、かな漢字変換された結果を記憶するための変換結
果記憶領域２２と、入力されたかな読み文字列を記憶す
るための読み入力バッファ領域２４と、変換結果記憶領
域２２の内容をかな漢字文字列にしたものを記憶するた
めの出力バッファ領域２６と、ポインタ情報を記憶する
ワークエリア２８とが設けられている。An input device 10 for inputting a character string for kana-kanji conversion is connected to a CPU (central processing unit) 12 for controlling the entire device. The RAM 20 serving as a storage unit is connected to the CPU 12, and the RAM 20
Is a conversion result storage area 22 for storing the result of kana-kanji conversion, a reading input buffer area 24 for storing the input kana reading character string, and the contents of the conversion result storage area 22 for kana-kanji character string. An output buffer area 26 for storing the stored information and a work area 28 for storing pointer information are provided.

【００１２】変換結果記憶領域２２は、図３に示すよう
に、変換結果全体が単語単位で情報を付して記憶されて
おり、単語順位を表すデータ２２ａ、後述する基本辞書
４２中に記憶されているそれぞれの単語を識別するため
の固有の数値である単語ＩＤ２２ｂ、単語の読み２２
ｃ、単語の表記２２ｄ、後述する品詞情報部４８中に記
憶されている各々の品詞情報を識別するための固有の記
号である品詞ＩＤ２２ｅ、同音語先頭ＩＤ２２ｆ、同音
語末尾ＩＤ２２ｇがデータとして各単語ごとに記憶され
ている。In the conversion result storage area 22, as shown in FIG. 3, the entire conversion result is stored by adding information on a word-by-word basis, and is stored in the data 22a representing the word rank and a basic dictionary 42 described later. Word ID 22b, which is a unique numerical value for identifying each word that is present, and word reading 22
c, word notation 22d, part-of-speech ID 22e, which is a unique symbol for identifying each part-of-speech information stored in the part-of-speech information section 48 described later, homonym start ID 22f, and homonym end ID 22g, as data It is remembered for each.

【００１３】尚、同音語先頭ＩＤ２２ｆは、入力装置１
０から入力された文字列（単語）と同音であり、基本辞
書４２に記憶された単語ＩＤ２２ｂの数値が最も小さい
ものを示すものである。また、同音語末尾ＩＤ２２ｇ
は、入力装置１０から入力された文字列（単語）と同音
であり、基本辞書４２に記憶された単語ＩＤ２２ｂの数
値が最も大きいものを示すものである。The same-sound word head ID 22f corresponds to the input device 1
This is the same as the character string (word) input from 0, and indicates the smallest numerical value of the word ID 22b stored in the basic dictionary 42. Also, the same phoneme ending ID 22g
Indicates a character having the same sound as a character string (word) input from the input device 10 and having the largest numerical value of the word ID 22b stored in the basic dictionary 42.

【００１４】プログラムや辞書等を格納するＲＯＭ３０
はＣＰＵ１２と接続され、プログラム部３２と辞書部４
０とからなる。プログラム部３２は、かな漢字変換プロ
グラム３４と、規則検索プログラム３６と、規則書き換
えプログラム３８とを格納している。ROM 30 for storing programs, dictionaries, etc.
Is connected to the CPU 12, and the program section 32 and the dictionary section 4
It consists of 0 and. The program section 32 stores a kana-kanji conversion program 34, a rule retrieval program 36, and a rule rewriting program 38.

【００１５】また、辞書部４０は基本辞書４２と、接続
テーブル４４と、規則辞書４６と、品詞情報部４８とか
ら構成されている。基本辞書４２は、図４に示すよう
に、その単語の固有の識別識別番号たる単語ＩＤ２２ｂ
と、その単語の読み２２ｃと、その単語の表記２２ｄ
と、品詞ＩＤ２２ｅとが記憶されている。The dictionary unit 40 is composed of a basic dictionary 42, a connection table 44, a rule dictionary 46, and a part-of-speech information unit 48. As shown in FIG. 4, the basic dictionary 42 has a word ID 22b that is a unique identification number of the word.
And reading 22c of the word and notation 22d of the word
And the part-of-speech ID 22e are stored.

【００１６】接続テーブル４４は、単語同士の接続関係
を品詞情報により規定するデータを記憶している。The connection table 44 stores data that defines the connection relationship between words by means of part-of-speech information.

【００１７】規則辞書４６は、図５に示すように、複数
の規則が格納されており、１規則の内容は、複数の項目
４６ａ〜４６ｃから構成されている。一般に規則の１項
目の内容は、大きく三つに分けられ、＜検索因子−検索
情報：書き換え情報＞という形で書かれており、この項
目の組合せで一つの規則がつくられる仕組みになってい
る。検索因子とは、規則検索の方法の種類を示すもの
で、Ｕ因子、Ｄ因子、Ｙ因子、Ｈ因子の４種類がある。As shown in FIG. 5, the rule dictionary 46 stores a plurality of rules, and the content of one rule is composed of a plurality of items 46a to 46c. Generally, the content of one item of the rule is roughly divided into three, and is written in the form of <search factor-search information: rewrite information>, and one rule is created by combining these items. .. The search factor indicates the type of rule search method, and there are four types, U factor, D factor, Y factor, and H factor.

【００１８】Ｕ因子は変換結果記憶領域２２に記憶され
ている単語についての読み及び表記との完全一致を意味
し、Ｄ因子は変換結果記憶領域２２に記憶されている単
語についての読みつまりその単語と同音での一致を意味
する。また、Ｙ因子は変換結果記憶領域２２の中に記憶
されている単語列（一単語でもよい）についての読みと
の一致を意味し、Ｈ因子は変換結果記憶領域２２中に記
憶されている品詞ＩＤとの一致を意味する。検索情報４
６ａ〜４６ｃとは、その規則の各々の項目の該当する単
語のことを示し、ここには単語の具体的な読み及び表記
または品詞情報が入る。The U factor means a perfect match with the reading and notation of the word stored in the conversion result storage area 22, and the D factor is the reading or the word of the word stored in the conversion result storage area 22. Means the same tone. Further, the Y factor means a match with the reading of the word string (may be one word) stored in the conversion result storage area 22, and the H factor is the part of speech stored in the conversion result storage area 22. It means a match with the ID. Search information 4
6a to 46c represent the corresponding words of each item of the rule, and the specific reading and notation of the word or the part of speech information is entered here.

【００１９】書き換え情報は、無変化マーカーＮ、削除
マーカーＤ、または単語列のいずれかであり、無変化マ
ーカーＮが格納されているときは規則辞書４６の項目に
一致した変換結果記憶領域２２中の単語をそのままに
し、書き換えないことを表す。また、削除マーカーＤが
格納されているときはその項目を削除することを表し、
単語列のときは規則辞書４６の項目と一致した変換結果
記憶領域２２中の単語をその単語列に書き換えることを
表している。ここでいう単語列とは、１つ以上の単語が
規則辞書４６に単語ＩＤで記憶されたものであり、この
単語ＩＤに基づいて基本辞書４２を参照することによ
り、変換結果記憶領域２２の単語読み２２ｃ、同音語先
頭ＩＤ２２ｆ、同音語末尾ＩＤ２２ｇ等の設定が可能で
ある。尚、図５においては、理解しやすいように単語Ｉ
Ｄの部分を数値ではなく読み、または表記で表してい
る。The rewriting information is either the unchanged marker N, the deleted marker D, or the word string. When the unchanged marker N is stored, the rewriting information matches the item of the rule dictionary 46 in the conversion result storage area 22. Indicates that the word is left as it is and is not rewritten. When the delete marker D is stored, it means to delete the item,
When it is a word string, it means that the word in the conversion result storage area 22 that matches the item of the rule dictionary 46 is rewritten to the word string. The word string referred to here is one or more words stored in the rule dictionary 46 by word ID, and by referring to the basic dictionary 42 based on this word ID, the words in the conversion result storage area 22 are It is possible to set the reading 22c, the same-sound word start ID 22f, the same-sound word end ID 22g, and the like. In FIG. 5, the word I is used for easy understanding.
The part of D is read or expressed not by a numerical value.

【００２０】品詞情報部４８には、図６に示すように、
基本辞書４２中の品詞ＩＤ４２ａを介して参照する品詞
情報４８ｂが記憶されている。In the part-of-speech information section 48, as shown in FIG.
Part-of-speech information 48b referred to via the part-of-speech ID 42a in the basic dictionary 42 is stored.

【００２１】そして、出力バッファ領域２６の内容を表
示するための出力装置としての出力装置５０はＣＰＵ１
２に接続されている。The output device 50 as an output device for displaying the contents of the output buffer area 26 is the CPU 1
Connected to 2.

【００２２】次に、このように構成されたかな漢字変換
装置の動作を図７のフローチャートを参照して説明す
る。Next, the operation of the kana-kanji conversion device configured as described above will be described with reference to the flowchart of FIG.

【００２３】例えば、読み文字列「はっきりとしない」
が入力装置１０より入力されると、そのかな文字コード
がＲＡＭ２０の読み入力バッファ領域２４に記憶される
（Ｓ１０）。その後、ＲＯＭ３０のかな漢字変換プログ
ラム３４により、基本辞書４２と品詞情報部４８と接続
テーブル４４とを参照して、読み入力バッファ領域２４
に記憶されているかな文字コードが漢字かな混じり文に
変換され、読み入力バッファ領域２４にかな文字コード
で記憶される。例えば、漢字かな混じり文「はっきり都
市内」と変換され、変換結果記憶領域２２に記憶される
（Ｓ１２）。そして、ＲＯＭ３０の規則検索プログラム
３６及び規則書き換えプログラム３８により、変換結果
記憶領域２２の内容と規則辞書４６を参照して規則変換
処理が行われる（Ｓ１４）。For example, the reading character string "not clear"
Is input from the input device 10, the kana character code is stored in the reading input buffer area 24 of the RAM 20 (S10). After that, the kana-kanji conversion program 34 of the ROM 30 refers to the basic dictionary 42, the part-of-speech information section 48, and the connection table 44 to refer to the reading input buffer area 24.
The kana character code stored in is converted into a kanji / kana mixed sentence and stored in the reading input buffer area 24 as the kana character code. For example, the kanji / kana mixed sentence “clearly in the city” is converted and stored in the conversion result storage area 22 (S12). Then, the rule search program 36 and the rule rewrite program 38 of the ROM 30 refer to the contents of the conversion result storage area 22 and the rule dictionary 46 to perform the rule conversion process (S14).

【００２４】次に、規則変換処理の具体的な処理につい
て図８、図９、図１０のフローチャートを参照して説明
する。Next, a specific process of the rule conversion process will be described with reference to the flow charts of FIGS. 8, 9 and 10.

【００２５】まず、前記変換結果記憶領域２２に記憶し
た「はっきり都市内」の先頭の単語「はっきり」にポイ
ンタＰ１を設定し、ポインタＰ１のポインタ情報がＲＡ
Ｍ２０のワークエリア２８に記憶される（Ｓ３０）。First, the pointer P1 is set to the first word "clear" in the "clear city" stored in the conversion result storage area 22, and the pointer information of the pointer P1 is RA.
It is stored in the work area 28 of M20 (S30).

【００２６】次に、ポインタＰ１の指す単語と一致する
規則が規則辞書４６中にあるか否かを検索する（Ｓ３
２）。図１０に示す規則検索サブルーチンにおいては、
まず、ポインタＰ５を規則辞書４６中の先頭の規則６−
１に設定し、ポインタ情報をワークエリア２８に記億す
る（Ｓ３２０）。次いで、ポインタＰ６をポインタＰ５
の指す規則６−１の先頭の項目４６ａに設定し、ポイン
タ情報がワークエリア２８に記憶される（Ｓ３２２）。
次いで、ポインタＰ７をポインタＰ１の指す変換結果記
憶領域２２中のかな漢字変換結果の先頭の単語に設定す
る（Ｓ３２４）。Next, it is searched whether or not the rule matching the word pointed by the pointer P1 exists in the rule dictionary 46 (S3).
2). In the rule search subroutine shown in FIG.
First, the pointer P5 is set to the first rule 6- in the rule dictionary 46.
1 is set and the pointer information is stored in the work area 28 (S320). Next, the pointer P6 is changed to the pointer P5.
Is set in the first item 46a of the rule 6-1 pointed to by and the pointer information is stored in the work area 28 (S322).
Next, the pointer P7 is set to the leading word of the kana-kanji conversion result in the conversion result storage area 22 pointed to by the pointer P1 (S324).

【００２７】そして、ポインタＰ６の指す項目の検索情
報が品詞情報であるか否かを判別する（Ｓ３２６）。こ
こでは、ポインタＰ６の指す項目４６ａは品詞情報の＜
Ｈ−「と」続く副詞：Ｎ＞であるので（Ｓ３２６・ＹＥ
Ｓ）、図６に示す品詞情報部４８を規則検索プログラム
３６により、はじめから順次検索し、一致する品詞情報
「と」に続く副詞４８ｃがあるので、ポインタＰ８を品
詞ＩＤ（Ｈ０６）に設定する。次にポインタＰ８の示す
品詞ＩＤとポインタＰ７の示す変換結果記憶部２２中の
単語品詞ＩＤが同じか比較をする（３４０）。この場
合、品詞ＩＤは一致するので（Ｓ３４０・ＹＥＳ）Ｓ３
３０に進む。Then, it is determined whether or not the search information of the item pointed by the pointer P6 is part-of-speech information (S326). Here, the item 46a pointed to by the pointer P6 is the part of speech information <
H- "to" is an adverb that follows: N> (S326 ・ YE
S), the part-of-speech information part 48 shown in FIG. 6 is sequentially searched by the rule search program 36 from the beginning, and there is an adverb 48c following the matching part-of-speech information "to". Therefore, the pointer P8 is set to the part-of-speech ID (H06). .. Next, the part-of-speech ID indicated by the pointer P8 and the word part-of-speech ID in the conversion result storage unit 22 indicated by the pointer P7 are compared (340). In this case, since the part-of-speech IDs match (S340 / YES), S3
Proceed to 30.

【００２８】次に、ポインタＰ６が末尾項目を示してい
るか否かを判断し（Ｓ３３０）、ポインタＰ６が末尾項
目ではないので（Ｓ３３０・ＮＯ）、ポインタＰ７を変
換結果記憶領域２２中に記憶されている「はっきり」の
次の単語「都市」に移動する。そして、ポインタＰ６も
規則６−１中の次の項目＜Ｙ−とし：と，し＞（４６
ｂ）に移動し、そのポインタ情報をワークエリア２８に
記憶し（Ｓ３３２）、前記３２６に戻る。ここで、ポイ
ンタＰ６の示す項目＜Ｙ−とし：と，し＞（４６ｂ）
は、品詞情報ではなく（Ｓ３２６・ＮＯ）、ポインタＰ
６の示す項目＜Ｙ−とし：と，し＞（４６ｂ）とポイン
タＰ７の示す変換結果記憶領域２２中の単語「都市」が
一致するので（Ｓ３２８・ＹＥＳ）、Ｓ３３０に進む。
ポインタＰ６が示す項目は末尾項目ではないので（Ｓ３
３０・ＮＯ）、ポインタＰ７を変換結果記憶領域２２中
に記憶されている「都市」の次の単語「内」に移動し、
ポインタＰ６も規則６−１中の次の項目＜Ｙ−ない：
な，い＞（４６ｃ）に移動し、そのポインタ情報をワー
クエリア２８に記憶し（Ｓ３３２）、前記３２６に戻
る。Next, it is determined whether or not the pointer P6 indicates the last item (S330). Since the pointer P6 is not the last item (S330.NO), the pointer P7 is stored in the conversion result storage area 22. Move to the next word "city" after "clear". The pointer P6 is also the next item <Y- in rule 6-1: and,> (46
Then, the pointer information is stored in the work area 28 (S332), and the process returns to the step 326. Here, the item indicated by the pointer P6 <Y-:: ,, >> (46b)
Is not the part-of-speech information (S326 / NO), and the pointer P
Since the item <Y-::, >> (46b) indicated by 6 and the word “city” in the conversion result storage area 22 indicated by the pointer P7 match (YES in S328), the process proceeds to S330.
The item indicated by the pointer P6 is not the last item (S3
30 · NO), the pointer P7 is moved to the word “in” next to “city” stored in the conversion result storage area 22,
The pointer P6 also has the next item <Y-not in rule 6-1:
If not, move to (46c), store the pointer information in the work area 28 (S332), and return to 326.

【００２９】以下同様の手順により処理を行う（Ｓ３２
６〜Ｓ３３２）。ポインタＲ６が末尾項目となったとき
（Ｓ３３０・ＹＥＳ）、規則フラグをＯＮとし、その情
報をワークエリア２８に記憶し（Ｓ３４２）、図８に示
すＳ３２に戻る。Thereafter, the same procedure is performed (S32).
6-S332). When the pointer R6 is the last item (YES in S330), the rule flag is turned on, the information is stored in the work area 28 (S342), and the process returns to S32 shown in FIG.

【００３０】尚、一致する規則が検索されないとき（Ｓ
３２８・ＮＯ）は、ポインタＰ５の指す規則が規則辞書
中で最後の規則になるまで（Ｓ３４４・ＮＯ）、ポイン
タＰ５を順次、次の規則へ移動し（Ｓ３４６）、一致す
る規則を検索する（Ｓ３２２〜Ｓ３４６）。ポインタＰ
５が示す規則が、規則辞書で最後の規則となったときは
（Ｓ３４４・ＹＥＳ）、一致する規則がなかったことを
示す規則フラグＯＦＦをワークエリア２８に記憶する
（Ｓ３４８）。When a matching rule is not retrieved (S
328.NO) moves the pointer P5 to the next rule in sequence until the rule pointed to by the pointer P5 becomes the last rule in the rule dictionary (S344.NO) (S346), and searches for a matching rule (S346). S322-S346). Pointer P
When the rule indicated by 5 is the last rule in the rule dictionary (YES in S344), the rule flag OFF indicating that there is no matching rule is stored in the work area 28 (S348).

【００３１】ここでは、ワークエリア２８の規則フラグ
がＯＮとなっているので（Ｓ３４・ＹＥＳ）、規則書き
換えプログラム３８による規則書き換え処理に入る。ま
ず、ポインタＰ２をポインタＰ１の指す単語「はっき
り」に設定し、ポインタＰ２のポインタ情報をワークエ
リア２８に記憶する（Ｓ３６）。次にポインタＰ３をポ
インタＰ７が指す変換結果記憶領域２２に記憶した単語
列「はっきり都市内」の末尾の「内」に設定し、ポイン
タＰ３の情報をワークエリア２８に記憶する（Ｓ３
８）。次にポインタＰ４を規則辞書４２中マッチした規
則６−１（４６ａ）の先頭項目に設定し、ポインタＰ４
のポインタ情報をワークエリア２８に記憶する（Ｓ４
０）。規則辞書４６中の規則６−１においてポインタＰ
４の指す先頭項目＜Ｈ−「と」に続く副詞：Ｎ＞（４６
ａ）の書き換え情報は無変化マーカーＮがついている
（Ｓ４４・ＹＥＳ）ので、単語「はっきり」は書き換え
ずそのままにし、Ｓ４６に進む。Here, since the rule flag of the work area 28 is ON (S34, YES), the rule rewriting process by the rule rewriting program 38 starts. First, the pointer P2 is set to the word "clear" pointed by the pointer P1, and the pointer information of the pointer P2 is stored in the work area 28 (S36). Next, the pointer P3 is set to "in" at the end of the word string "clearly in the city" stored in the conversion result storage area 22 pointed to by the pointer P7, and the information of the pointer P3 is stored in the work area 28 (S3).
8). Next, the pointer P4 is set to the head item of the matched rule 6-1 (46a) in the rule dictionary 42, and the pointer P4 is set.
Information of the pointer is stored in the work area 28 (S4
0). Pointer P in rule 6-1 in rule dictionary 46
4 <H-Adverb following "to": N> (46
Since the rewriting information of a) is attached with the unchanged marker N (S44, YES), the word "clear" is left as it is without rewriting, and the process proceeds to S46.

【００３２】尚、ポインタＰ４の指す項目４６ｂ中の書
き換え情報の場所に削除マーカーＤが立っているときは
（Ｓ４４・ＮＯ、Ｓ４８・ＹＥＳ）、変換結果記憶領域
２２のポインタＰ２の指す単語を削除し（Ｓ５０）、Ｓ
４６に進む。さらにポインタＰ４の指す項目４６ｂ中の
書き換え情報の場所に単語列があるときは（Ｓ４４・Ｎ
Ｏ、Ｓ４８・ＮＯ、Ｓ５２・ＹＥＳ）、変換結果記憶領
域２２のポインタＰ２の指す単語をその単語列と書き換
え（Ｓ５４）、Ｓ４６に進む。When the deletion marker D is set at the location of the rewriting information in the item 46b pointed to by the pointer P4 (S44, NO, S48, YES), the word pointed by the pointer P2 in the conversion result storage area 22 is deleted. (S50), S
Proceed to 46. Furthermore, when there is a word string at the location of the rewriting information in the item 46b pointed by the pointer P4 (S44.N
O, S48, NO, S52, YES), the word pointed to by the pointer P2 in the conversion result storage area 22 is rewritten with the word string (S54), and the process proceeds to S46.

【００３３】Ｓ４６においては、ポインタＰ２の指す単
語とポインタＰ３の指す単語が同じか比較する。つま
り、ポインタＰ２指す単語とポインタＰ３の指す単語が
等しいときは、ポインタＰ２が最後の単語まで進んだこ
とを意味する。ここで、ポインタＰ２の指す「はっき
り」は変換結果記憶領域２２に記憶されている変換結果
の最後の単語ではないので（Ｓ４６・ＮＯ）、ポインタ
Ｐ２を次の単語「都市」に移動し、ポインタＰ４を規則
辞書４６中の次の項目＜Ｙ−とし：とし＞に移動する
（Ｓ５６）、そして、前記Ｓ４４に戻る。In S46, the word pointed by the pointer P2 is compared with the word pointed by the pointer P3. That is, when the word pointed by the pointer P2 and the word pointed by the pointer P3 are equal, it means that the pointer P2 has advanced to the last word. Here, since "clearly" pointed by the pointer P2 is not the last word of the conversion result stored in the conversion result storage area 22 (S46, NO), the pointer P2 is moved to the next word "city", and the pointer is moved. P4 is moved to the next item in the rule dictionary 46 <Set as Y-: Set as> (S56), and the process returns to S44.

【００３４】ここではポインタＰ４の指す項目＜Ｙ−と
し：と，し＞に単語列があるので（Ｓ４４・ＮＯ、Ｓ４
８・ＮＯ、Ｓ５２・ＹＥＳ）、「都市」を「とし」と書
き換え（Ｓ５４）、Ｓ４６に進む。このようにして、順
次変換結果記憶領域２２の内容を規則の書き換え指示に
従って処理していく（Ｓ４４〜Ｓ５６）。Since there is a word string in the item <Y-::, shi> indicated by the pointer P4 (S44, NO, S4).
8. NO, S52, YES), "city" is rewritten as "to" (S54), and the process proceeds to S46. In this way, the contents of the sequential conversion result storage area 22 are processed according to the rule rewriting instruction (S44 to S56).

【００３５】ポインタＰ２とポインタＰ３の指す単語が
同じになったとき、つまり、ポインタＰ２が指す単語
が、変換結果記憶領域２２に記憶されている末尾の単語
になったとき（Ｓ４６・ＹＥＳ）、ポインタＰ１が変換
結果記憶領域２２に記憶されている単語列の末尾の単語
になければ（Ｓ５８・ＮＯ）、ポインタＰ１を次の単語
へ一つずつ移動させ（Ｓ６０）、再び同様の規則検索処
理を規則検索プログラム３６により行う。この場合は、
ポインタＰ１を変換結果記憶領域２２に記憶された文字
列「はっきり都市内」の二番目の単語「都市」に移動す
る（Ｓ５８・ＮＯ、Ｓ６０）。そして、Ｓ３２〜Ｓ６０
の処理を繰り返す。最後にポインタＰ１が変換結果記憶
領域２２中の末尾の単語のとき（Ｓ５８・ＹＥＳ）、規
則変換処理を終了する。When the words pointed to by the pointers P2 and P3 become the same, that is, when the word pointed by the pointer P2 becomes the last word stored in the conversion result storage area 22 (S46, YES), If the pointer P1 is not at the end word of the word string stored in the conversion result storage area 22 (S58, NO), the pointer P1 is moved to the next word one by one (S60), and the same rule search process is performed again. Is performed by the rule search program 36. in this case,
The pointer P1 is moved to the second word "city" of the character string "clearly in the city" stored in the conversion result storage area 22 (S58, NO, S60). And S32 to S60
The process of is repeated. Finally, when the pointer P1 is the last word in the conversion result storage area 22 (YES in S58), the rule conversion process ends.

【００３６】この結果が再度変換結果記憶領域２２に格
納され、そして、前記変換結果記憶領域２２の内容が出
力バッファ領域２６に格納され、出力装置５０に表示さ
れる（図７・Ｓ１６）。その後、ユーザーから確定キー
が入力されれば終了である（Ｓ１８・ＹＥＳ）。確定キ
ー以外のものが入力されたときは（Ｓ１８・ＮＯ）、候
補変更処理に進む（Ｓ２０）。その結果は再度、変換結
果記憶領域２２に格納される。そして前記変換結果２２
の内容が出力バッファ領域２６に格納され、出力装置５
０に表示される（Ｓ１６）。その後、ユーザーから確定
キーが入力されれば（Ｓ１８・ＹＥＳ）終了する。This result is stored again in the conversion result storage area 22, and the contents of the conversion result storage area 22 are stored in the output buffer area 26 and displayed on the output device 50 (S16 in FIG. 7). After that, if the confirmation key is input by the user, the process ends (S18, YES). When a key other than the enter key is input (S18, NO), the process proceeds to the candidate changing process (S20). The result is stored again in the conversion result storage area 22. And the conversion result 22
Of the output device 5 is stored in the output buffer area 26.
0 is displayed (S16). After that, if the confirmation key is input by the user (S18, YES), the process ends.

【００３７】従来は「はっきりとしない」「びくびくと
しない」など「『と』に続く副詞＋としない」という単
語列に関しての規則をいちいち、ひとつひとつの副詞に
ついて作らなければならなかったが、品詞情報を検索因
子として用いることにより、規則辞書作成の労力の削減
と規則辞書の占めるメモリの低減を図ることができる。
また、規則変換を効率的におこなうことも可能である。Conventionally, rules for word strings such as "not clear" and "not jerk" do not say "adverb +" do not follow adverb "" must be made for each adverb. By using as a search factor, it is possible to reduce the labor for creating the rule dictionary and the memory occupied by the rule dictionary.
It is also possible to efficiently perform rule conversion.

【００３８】同様に、図５の規則６―２の例のように、
「いがいときれいだった」が「以外ときれいだった」、
「いがいとかんようなひと」が「以外と寛容な人」とな
るなど「以外と＋形容動詞」という変換結果になってし
まった場合に、本規則変換において「意外と＋形容動
詞」と書き換えられるので、やはり、形容動詞ひとつひ
とつについて、いちいち同じ書き換えをする規則を作ら
なくても済み、上記同様の効果を得ることができる。Similarly, as in the example of rule 6-2 in FIG.
"It was beautiful when I was young," but "It was beautiful except."
If the conversion result of "other than + adjective verb" such as "Iigaitokantohito" becoming "other than forgiving person" is rewritten as "unexpected and + adjective verb" in this rule conversion Therefore, again, it is not necessary to make a rule to rewrite the same adjectives one by one, and the same effect as described above can be obtained.

【００３９】また、図５の規則６−３の「あかちゃんが
ゆっくりたつ」「あかちゃんがしっかりたつ」などの例
においても、「赤ちゃんが（副詞）建つ（絶つ、経つ、
…）」という、通常のかな漢字変換結果がなされたとき
に、いちいち、「ゆっくり」「しっかり」などの副詞を
入れ換えただけの同じような規則を作らなくて済み、上
記同様の効果を得ることができる。Also, in the example of "Aka-chan is slowly standing" and "Aka-chan is firmly standing" in Rule 6-3 of FIG. 5, "Baby (adverb) stands (cut, passed,
It is not necessary to make a similar rule just by replacing adverbs such as "slow" and "firm" when the usual kana-kanji conversion result is made, and the same effect as above can be obtained. it can.

【００４０】なお、本発明は、上記の例に示すような品
詞情報を用いた規則変換に限らず、基本辞書から参照で
きる情報を用いた規則変換に適用することができる。The present invention can be applied not only to the rule conversion using the part-of-speech information as shown in the above example, but also to the rule conversion using the information that can be referred to from the basic dictionary.

【００４１】また、規則辞書の規則数や項目数は本実施
例の数に限ったものではない。The number of rules and the number of items in the rule dictionary are not limited to those in this embodiment.

【００４２】[0042]

【発明の効果】以上説明したことから明らかなように、
本発明のかな漢字変換装置は規則変換処理において、こ
れまで、同じ品詞の単語を入れ換えただけの同じような
規則をいくつも作らなければならなかったが、検索情報
として品詞情報を用いることにより、それらをまとめて
規則を作ることができるようになったため、規則辞書作
成の労力低減を図ることができ、規則辞書のメモリ容量
も低減することができる。また、ひとつの規則によっ
て、いくつもの文例に適用することができるようにな
る。As is clear from the above description,
In the rule conversion process, the kana-kanji conversion device of the present invention had to make many similar rules by simply replacing words of the same part of speech, but by using part-of-speech information as search information, Since it is now possible to make rules together, it is possible to reduce the labor for creating the rule dictionary and also reduce the memory capacity of the rule dictionary. Also, one rule can be applied to many sentence examples.

[Brief description of drawings]

【図１】本発明の構成図である。FIG. 1 is a configuration diagram of the present invention.

【図２】本実施例によるかな漢字変換装置の制御部のブ
ロック図である。FIG. 2 is a block diagram of a control unit of the kana-kanji conversion device according to the present embodiment.

【図３】本実施例による変換結果記憶領域の概念図であ
る。FIG. 3 is a conceptual diagram of a conversion result storage area according to the present embodiment.

【図４】本実施例の基本辞書の内容の概念図である。FIG. 4 is a conceptual diagram of contents of a basic dictionary of this embodiment.

【図５】本実施例の規則辞書の内容の概念図である。FIG. 5 is a conceptual diagram of contents of a rule dictionary of this embodiment.

【図６】本実施例の品詞情報部の内容の概念図である。FIG. 6 is a conceptual diagram of contents of a part-of-speech information unit of the present embodiment.

【図７】本実施例のかな漢字変換装置のフローチャート
である。FIG. 7 is a flowchart of the kana-kanji conversion device of the present embodiment.

【図８】本実施例の規則変換のフローチャートである。FIG. 8 is a flowchart of rule conversion of this embodiment.

【図９】本実施例の規則変換のフローチャートである。FIG. 9 is a flowchart of rule conversion of this embodiment.

【図１０】本実施例の規則検索のフローチャートであ
る。FIG. 10 is a flowchart of a rule search according to this embodiment.

[Explanation of symbols]

１０入力装置１２ＣＰＵ２０ＲＡＭ２２変換結果記憶領域２４読み入力バッファ領域２６出力バッファ領域３０ＲＯＭ３１プログラム部３４かな漢字変換プログラム３６規則検索プログラム３８規則書き換えプログラム４０辞書部４２基本辞書４４接続テーブル４６規則辞書４８品詞情報部５０出力装置 10 Input Device 12 CPU 20 RAM 22 Conversion Result Storage Area 24 Reading Input Buffer Area 26 Output Buffer Area 30 ROM 31 Program Section 34 Kana-Kanji Conversion Program 36 Rule Search Program 38 Rule Rewriting Program 40 Dictionary Section 42 Basic Dictionary 44 Connection Table 46 Rule Dictionary 48 part-of-speech information section 50 output device

Claims

[Claims]

1. A kana-kanji conversion unit for inputting a kana-yomi character string, a basic dictionary storing notations for reading the words, kana-kanji conversion means for converting kana-kanji by referring to the basic dictionary, and the kana-kanji conversion. A kana-kanji conversion device comprising conversion result storage means for storing conversion results by means, and output means for outputting the kana-kanji conversion result, including part-of-speech information of a specific word in the basic dictionary, and the word. A rule dictionary that stores a pattern of a word string and rewriting information for the word string, and a rule search for searching for a matching pattern in the rule dictionary for the contents of the conversion result storage means by referring to part-of-speech information Means, and when the matching pattern is searched by the rule search means, the contents of the corresponding conversion result storage means are converted into rewriting information of the rule dictionary. Kana-kanji conversion apparatus being characterized in that a rule rewriting means for rewriting Zui.