JPH0619880A

JPH0619880A - Kana/kanji converter

Info

Publication number: JPH0619880A
Application number: JP4175581A
Authority: JP
Inventors: Yamahiko Ito; 山彦伊藤
Original assignee: Mitsubishi Electric Corp
Current assignee: Mitsubishi Electric Corp
Priority date: 1992-07-02
Filing date: 1992-07-02
Publication date: 1994-01-28

Abstract

(57)【要約】【目的】文節の途中や、共起関係のある語と語の間で
変換が確定された場合でも文章の流れに合い、共起関係
にかなったかな漢字変換ができるかな漢字変換装置を得
る。【構成】入力手段１よりかな文字列が入力されると、
第２の文字列記憶手段７に記憶されているかな文字列を
読み出して、読み出されたかな文字列の後に、新たに入
力されたかな文字列を連結し、連結された一連のかな文
字列を変換文字列格納手段１１に格納して変換の対象と
する。この文字列を文字列区切手段１３が文節に区切
り、変換手段１５がかな漢字交じり文字列に変換する。
その際、第１の文字列記憶手段５に記憶されている確定
済みのかな漢字文字列の最後の文節や、共起情報格納手
段９中の共起情報を参照する。変換後、表示手段１９が
変換結果から第１の文字列記憶手段５に記憶されている
かな漢字文字列を除外したものを表示する。 (57) [Summary] [Purpose] Kana-Kanji conversion that enables Kana-Kanji conversion that matches the co-occurrence relationship, according to the flow of the sentence even when the conversion between the words that have a co-occurrence relationship or between words is confirmed in the middle of a phrase. Get the device. [Configuration] When a kana character string is input by the input means 1,
The kana character string stored in the second character string storage means 7 is read, and the newly input kana character string is connected after the read kana character string, and a series of connected kana character strings. Is stored in the conversion character string storage means 11 and is converted. The character string delimiter 13 divides the character string into phrases, and the converter 15 converts the kana-kanji mixed character string.
At that time, the last clause of the confirmed kana-kanji character string stored in the first character string storage means 5 and the co-occurrence information in the co-occurrence information storage means 9 are referred to. After the conversion, the display unit 19 displays the conversion result excluding the Kana-Kanji character string stored in the first character string storage unit 5.

Description

Detailed Description of the Invention

【０００１】[0001]

【産業上の利用分野】本発明は、かな文字列を辞書に従
って対応するかな漢字交じり文字列に変換するかな漢字
変換装置に関し、さらに詳しくは確定済みの文節と適切
につながるかな漢字変換ができるかな漢字変換装置に関
する。BACKGROUND OF THE INVENTION 1. Field of the Invention The present invention relates to a kana-kanji conversion device for converting a kana-character string into a corresponding kana-kanji mixed character string according to a dictionary, and more particularly to a kana-kanji conversion device capable of appropriately connecting kana-kanji characters to a fixed phrase. .

【０００２】[0002]

【従来の技術】日本語ワードプロセッサはかなによって
単語の読み情報を入力し、入力された読み情報に対応す
る漢字を辞書部から読み出すが、この場合に、文法情
報、共起関係、頻度情報などが用いられるのが一般的で
ある。文法情報とは品詞と品詞の接続情報であり、例え
ば名詞の後に助詞は続くが助動詞は続かないという規則
に基づいて、文節の区切りや変換の候補を決定するもの
である。共起関係とは、どの語とどの語が共に現れやす
いかを表した関係である。例えば「あかんぼうがなく」
というかな文字列に対しては「赤ん坊が泣く」、「とり
がなく」というかな文字列に対しては「鳥が鳴く」と変
換されるように、同じ「なく」というかな文字列に対し
て、前に来る語が「赤ん坊が」の場合には「泣く」が、
「鳥が」の場合には「鳴く」がそれぞれ共起関係によっ
て現れやすいのである。頻度情報とは、同じ読みに対し
てどの候補が現れやすいかを過去の入力状況に基づいて
表した情報であり、この頻度情報に基づいて出現頻度が
高い候補を先に出現させることによって、変換の効率を
上げることができる。2. Description of the Related Art A Japanese word processor inputs word reading information by kana and reads out Kanji corresponding to the input reading information from a dictionary section. In this case, grammatical information, co-occurrence relations, frequency information, etc. It is generally used. The grammar information is connection information between parts of speech and parts of speech, and determines, for example, punctuation and conversion candidates based on the rule that a particle follows a noun but not an auxiliary verb. The co-occurrence relationship is a relationship that represents which word and which word is likely to appear together. For example, “There is no kanbou”
For the same kana character string, "Baby cries" for the kana character string, and "birds cry" for the kana character string. , If the previous word is "baby", "cry",
In the case of "bird", "crowing" tends to appear depending on the co-occurrence relationship. Frequency information is information that shows which candidates are likely to appear for the same reading based on past input situations. The efficiency of can be improved.

【０００３】また、べた書き文の自動文節分かち書きの
方法については、例えば情報処理学会論文誌Ｖｏｌ．２
０，Ｎｏ．４，Ｊｕｌｙ１９７９，ｐｐ．３３７−３４
５に掲載された論文「べた書き文の分かち書きとかな漢
字変換」に開示されている。図５はこの論文に開示され
た分かち書きおよびかな漢字変換装置の動作を示したフ
ローチャートである。図に基づいて動作を概説すると、
入力文を読み込み（ステップ５０）、読み込んだ入力文
を句読点によって分割した文字列（以下「区分」とい
う）に分解し（ステップ５１）、分解された区分の中か
ら１つの区分を取り出す（ステップ５２）。取り出した
区分に対して接頭語処理、自立語処理、接尾語処理を行
なって文節形を抽出する（ステップ５５）。抽出した文
節形に対して二文節最長一致法を適用して適切な区切り
を行い（ステップ５７）、区切られた文節に対して漢字
化を行う（ステップ５８）。文節の区切りから連続して
２文節が見出だされない場合は、文節の区切りからの最
長の文節形を文節として、付属語分かち書きによって後
続の文字列の区切りを求め（ステップ５）、処理を続行
する。A method of automatically segmenting a solid written sentence is described in, for example, the IPSJ Journal Vol. Two
0, No. 4, July 1979, pp. 337-34
It is disclosed in the paper “Divided text and kana-kanji conversion” published in No. 5. FIG. 5 is a flow chart showing the operation of the segmentation and kana-kanji conversion device disclosed in this paper. The outline of the operation based on the figure is
The input sentence is read (step 50), the read input sentence is decomposed into character strings (hereinafter referred to as "divisions") divided by punctuation marks (step 51), and one division is extracted from the decomposed divisions (step 52). ). Prefix processing, independent word processing, and suffix processing are performed on the extracted section to extract the phrase form (step 55). The two-phrase longest matching method is applied to the extracted bunsetsu to make an appropriate delimitation (step 57), and the demarcated bunsetsu is converted to kanji (step 58). If two bunsetsu are not found in succession from the bunsetsu delimiter, the longest bunsetsu form from the bunsetsu delimiter is used as a bunsetsu to find the delimiter of the subsequent character string by appending word segmentation (step 5), and the process is continued. To do.

【０００４】[0004]

【発明が解決しようとする課題】上記のような従来のか
な漢字変換装置では、変換が確定された文字列は、以降
の変換処理に参照されないので、文節の途中や、共起関
係のある語と語の間でユーザが変換を確定した場合に
は、確定した語に続いて入力されるかな文字列に対して
は確定済みの文字列に適切に対応するかな漢字交じり文
の候補を出現させることができなかった。In the conventional kana-kanji conversion device as described above, since the character string whose conversion has been confirmed is not referred to in the subsequent conversion processing, it is not included in the middle of a phrase or in a word having a co-occurrence relationship. When the user confirms the conversion between words, for the kana character string that is input after the confirmed word, a candidate kana-kanji mixed sentence that appropriately corresponds to the confirmed character string may appear. could not.

【０００５】本発明は上記のような課題を解決するため
になされたもので、文節の途中や、共起関係のある語と
語の間で変換が確定された場合でも文章の流れに合い、
共起関係にかなったかな漢字変換ができるかな漢字変換
装置を得ることを目的としている。The present invention has been made in order to solve the above problems, and fits the flow of a sentence even in the middle of a bunsetsu or even when a conversion between words having a co-occurrence relationship is decided.
The purpose is to obtain a kana-kanji conversion device that can convert kana-kanji according to a co-occurrence relationship.

【０００６】[0006]

【課題を解決するための手段】本発明に係るかな漢字変
換装置は、入力されたかな文字列の直前に位置する確定
済みのかな漢字交じり文字列の最後の文節を記憶する第
１の文字列記憶手段と、該第１の文字列記憶手段に記憶
されたかな漢字交じり文字列に対応するかな文字列を記
憶する第２の文字列記憶手段と、該第２の文字列記憶手
段に記憶されているかな文字列を読み出して、読み出さ
れたかな文字列の後に前記入力されたかな文字列を連結
して格納する変換文字列格納手段と、該変換文字列格納
手段に格納されたかな文字列を第１文節が前記第２の文
字列記憶手段に記憶されているかな文字列を部分文字列
として含むような文節に区切る文字列区切手段と、該文
字列区切手段によって区切られたかな文字列のうち第１
文節は前記第１の文字列記憶手段に記憶されているかな
漢字交じり文字列を部分文字列として含むかな漢字交じ
り文字列に変換し、第２文節以降は該第１文節に適切に
続くかな漢字交じり文字列に変換する変換手段と、該変
換手段によって変換されたかな漢字交じり文字列から前
記第１の文字列記憶手段に記憶されているかな漢字文字
列を除外したかな漢字交じり文字列を表示する表示手段
とを備えたものである。A kana-kanji conversion device according to the present invention is a first character string storage means for storing the last phrase of a fixed kana-kanji mixed character string located immediately before an input kana character string. A second character string storage means for storing a kana character string corresponding to the kana-kanji mixed character string stored in the first character string storage means; and a kana stored in the second character string storage means. A character string is read out, a converted character string storage means for connecting and storing the input kana character string after the read kana character string, and a kana character string stored in the converted character string storage means Of the kana character string that is divided by the character string delimiter means, the character string delimiter that divides the kana character string stored in the second character string storage means as a partial character string into one clause First
The phrase is converted into a kana-kanji mixed character string that includes the kana-kanji mixed character string stored in the first character string storage means as a partial character string, and the kana-kanji mixed character string that appropriately follows the first phrase after the second phrase. And a display unit for displaying a kana-kanji mixed character string excluding the kana-kanji mixed character string stored in the first character string storage means from the kana-kanji mixed character string converted by the converting means. It is a thing.

【０００７】また、変換手段は語と語の共起情報を格納
した共起情報格納手段に格納された共起情報に基づい
て、変換文字列格納手段に格納されたかな文字列の第２
文節以降をかな漢字文字列に変換するようにしたもので
ある。Further, the conversion means, based on the co-occurrence information stored in the co-occurrence information storage means in which the word-to-word co-occurrence information is stored, the second kana character string stored in the converted character string storage means.
It is designed to convert the text after the phrase into a kana-kanji character string.

【０００８】[0008]

【作用】本発明におけるかな漢字変換装置においては、
かな文字列が入力されると第２の文字列記憶手段に記憶
されているかな文字列を読み出して、読み出されたかな
文字列の後に、新たに入力されたかな文字列を連結し、
連結された一連のかな文字列を変換の対象にし、変換後
第１の文字列記憶手段に記憶されているかな漢字文字列
を除外して変換結果を表示する。In the kana-kanji conversion device of the present invention,
When the kana character string is input, the kana character string stored in the second character string storage means is read, and the newly input kana character string is connected after the read kana character string,
A series of concatenated kana character strings are subjected to conversion, the kana-kanji character string stored in the first character string storage means after conversion is excluded and the conversion result is displayed.

【０００９】また、変換文字列格納手段に格納されたか
な文字列の第２文節以降をかな漢字文字列に変換するに
際して、共起情報を参照して共起関係にかなった候補を
出現させる。Further, when converting the second and subsequent clauses of the kana character string stored in the converted character string storage means into a kana-kanji character string, the co-occurrence information is referred to so that a candidate matching the co-occurrence relationship appears.

【００１０】[0010]

【Example】

実施例１．図１は本発明に係るかな漢字変換装置の一実
施例を示すブロック図である。図において、１はかなキ
ー、変換キー、確定キー等を備えたキーボードなどから
なる入力手段であり、かな文字列の他表示単語の変更指
示を制御部３へ入力する。制御部３はマイクロプロセッ
サからなり、図示しないメモリに書き込まれている制御
プログラムに従い後述するデータ処理を行う。Example 1. FIG. 1 is a block diagram showing an embodiment of a kana-kanji conversion device according to the present invention. In the figure, reference numeral 1 denotes an input means such as a keyboard provided with a kana key, a conversion key, a decision key, etc., which inputs a change instruction of a display word other than a kana character string to the control unit 3. The control unit 3 is composed of a microprocessor and performs data processing described later according to a control program written in a memory (not shown).

【００１１】５は確定済みのかな漢字文字列の最後の文
節のかな漢字交じり表記を記憶する第１の文字列記憶手
段、７は第１の文字列記憶手段５に記憶された確定済み
のかな漢字文字列に対応するかな表記を記憶する第２の
文字列記憶手段、９は語と語の共起関係の情報を格納し
た共起情報格納手段である。１１は第２の文字列記憶手
段７に記憶されているかな文字列を読み出して、この読
み出されたかな文字列の後に入力手段１から入力された
かな文字列を結合し、この結合された一連のかな文字列
を格納する変換文字列格納手段である。１３は変換文字
列格納手段に格納されているかな文字列を文節ごとに区
切るかな文字列区切手段、１５はかな文字列区切手段１
３によって区切られた文節に対してかな漢字変換を行う
変換手段である。Reference numeral 5 is a first character string storage means for storing the kana-kanji mixed notation of the last phrase of the confirmed kana-kanji character string, and 7 is a confirmed kana-kanji character string stored in the first character string storage means 5. Second character string storage means for storing kana notation corresponding to, and 9 is co-occurrence information storage means for storing information on the co-occurrence relationship between words. Reference numeral 11 reads the kana character string stored in the second character string storage means 7, combines the kana character string input from the input means 1 after the read kana character string, and combines the kana character strings. It is a conversion character string storage means for storing a series of kana character strings. Reference numeral 13 is a kana character string delimiter that divides the kana character string stored in the converted character string storage means into clauses, and 15 is a kana character string delimiter 1.
It is a conversion means for performing Kana-Kanji conversion for the clauses delimited by 3.

【００１２】１７は照合手段であり、以下の動作を行
う。即ち、第２の文字列記憶手段７に記憶されているか
な文字列を読み出して、この読み出されたかな文字列と
かな文字列区切手段１３によって区切られたかな文字列
の第１文節のかな文字列とを照合すること、及び第１の
文字列記憶手段５に記憶されているかな漢字交じり文字
列を読み出して、この読み出されたかな漢字交じり文字
列と変換手段１５によってかな漢字交じり文に変換され
たかな漢字交じり文の第１文節のかな漢字文字列とを照
合することである。１９は変換手段１５によって変換さ
れたかな漢字交じり文から第１の文字列記憶手段に記憶
されているかな漢字交じり文字列を除外したかな漢字交
じり文を表示する表示手段である。Reference numeral 17 is a collating means, which performs the following operations. That is, the kana character string stored in the second character string storage means 7 is read out, and the kana character string read out by this kana character string and the kana character string first segment kana separated by the kana character string delimiter 13 are read. The kana-kanji mixed character string stored in the first character string storage means 5 is checked and the read kana-kanji mixed character string and the conversion means 15 are converted into kana-kanji mixed sentences. It is to collate with the Kana-Kanji character string in the first bunsetsu of the Takana-Kanji mixed sentence. Reference numeral 19 is a display means for displaying a kana-kanji mixed sentence in which the kana-kanji mixed character string stored in the first character string storage means is excluded from the kana-kanji mixed sentence converted by the converting means 15.

【００１３】図２は本実施例の動作を示すフローチャー
ト、図３はキー操作と表示手段１９に表示される表示結
果との関係を説明するための説明図である。図３におい
て（ａ）〜（ｅ）はキー操作の各段階における表示結果
を示している。以下制御部３の動作を図２及び図３に従
って説明する。先ず「わたしはいとうです」という文字
列の変換を例にして説明すると、入力手段１よりかな文
字列「わたし」を入力し（表示結果は図３の（ａ））、
入力手段１の変換キーの操作によってかな漢字の候補と
して「私」を出現させ（図３の（ｂ））、さらに確定キ
ーの操作によって「私」を確定する（図３の（ｃ））。
以上は従来と同様の動作である。この時点で、第１の文
字列記憶手段５には「私」というかな漢字文字列が、第
２の文字列記憶手段７には「わたし」というかな文字列
がそれぞれ記憶される。FIG. 2 is a flow chart showing the operation of this embodiment, and FIG. 3 is an explanatory diagram for explaining the relationship between key operations and the display results displayed on the display means 19. In FIG. 3, (a) to (e) show display results at each stage of key operation. The operation of the control unit 3 will be described below with reference to FIGS. First, the conversion of the character string "I am Ito" is taken as an example, and the kana character string "I" is input from the input means 1 (the display result is (a) in FIG. 3).
By operating the conversion key of the input means 1, "I" appears as a kana-kanji candidate ((b) in FIG. 3), and by operating the enter key, "I" is confirmed ((c) in FIG. 3).
The above is the same operation as the conventional one. At this point, the kana character string "I" is stored in the first character string storage means 5, and the kana character string "I" is stored in the second character string storage means 7.

【００１４】次ぎに、入力手段１から「はいとうです」
というかな文字列が入力されと（ステップ３０）、表示
手段１９には「私はいとうです」という文字列が表示さ
れる（図３の（ｄ））。このとき第２の文字列記憶手段
７に記憶されている「わたし」というかな文字列が読み
出され、この読み出された「わたし」というかな文字列
の後に新しく入力された「はいとうです」が連結されて
「わたしはいとうです」という一連のかな文字列が変換
文字列格納手段１１に格納される（ステップ３１）。[0014] Next, from the input means 1, "I'm happy".
When a kana character string is input (step 30), the character string "I am my mother" is displayed on the display means 19 ((d) in FIG. 3). At this time, the kana character string "I" stored in the second character string storage means 7 is read out, and the newly input "Kana" character string is newly input "Haitoto". Are concatenated and a series of kana character strings "I am Itou" are stored in the converted character string storage means 11 (step 31).

【００１５】変換文字列格納手段１１に格納された「わ
たしはいとうです」というかな文字列は文字列区切手段
１３によって「わたしは」と「いとうです」に区切られ
る（ステップ３２）。照合手段１３が文字列区切手段１
３によって区切られた第１文節である「わたしは」と第
２の文字列記憶手段７に記憶されている「わたし」とい
うかな文字列を照合し、文字列区切手段１３によって区
切られた第１文節である「わたしは」が、第２の文字列
記憶手段７のかな文字列である「わたし」を部分文字列
として含んでいるか否かを調べ（ステップ３３）、含ん
でいれば第１文節である「わたしは」に対して変換手段
１３がかな漢字変換処理を行う（ステップ３４）。The kana character string "I am Ito" stored in the conversion character string storage unit 11 is divided into "I" and "Ito" by the character string delimiter unit 13 (step 32). The collating means 13 is the character string delimiting means 1
The first phrase separated by 3 is collated with the first phrase "Iwa" and the kana character string "I" stored in the second character string storage means 7, and the first character string is separated by the character string separation means 13. It is checked whether or not the phrase "Iwa" includes the kana character string "I" of the second character string storage means 7 as a partial character string (step 33), and if it does, the first phrase. The conversion means 13 performs kana-kanji conversion processing for "Iwa" (step 34).

【００１６】文字列区切手段１３によって区切られた第
１文節が第２の文字列記憶手段７中の文字列を部分文字
列として含んでいない場合は別の区切りを行い（ステッ
プ３２）、新たに区切られた第１文節が第２の文字列記
憶手段７に記憶されているかな文字列を部分文字列とし
て含んでいるか否かを調べる（ステップ３３）。部分文
字列として含んでいない場合には、別の区切りができる
かどうかを判断し（ステップ３３ａ）、別の区切りがで
きる場合はステップ３２に戻り同じ動作を行う。以上の
動作を繰り返し、可能な全ての区切りを行っても、区切
られた文節の第１文節が第２の文字列記憶手段７中のか
な文字列を部分文字列として含むような区切りが存在し
ない場合には、変換文字列格納手段１１に格納されてい
るかな文字列のうち第２の文字列記憶手段７に記憶され
てるかな文字列と同一のかな文字列については第１の文
字列格納手段５中のかな漢字文字列と同一のかな漢字文
字列に変換して確定し（ステップ３５）、新たに入力さ
れたかな文字列について変換処理を行う（ステップ３
７）。If the first phrase delimited by the character string delimiter 13 does not include the character string in the second character string storage 7 as a partial character string, another delimiter is performed (step 32) and a new character is newly added. It is checked whether or not the delimited first clause contains the kana character string stored in the second character string storage means 7 as a partial character string (step 33). If it is not included as a partial character string, it is determined whether another delimiter is possible (step 33a). If another delimiter is possible, the process returns to step 32 and the same operation is performed. Even if the above operation is repeated to perform all possible delimiters, there is no delimiter such that the first bunsetsu of the delimited bunsetsu contains the kana character string in the second character string storage means 7 as a partial character string. In this case, of the kana character strings stored in the converted character string storage means 11, the same kana character string as the kana character string stored in the second character string storage means 7 is the first character string storage means. The kana-kanji character string identical to the kana-kanji character string in 5 is converted and confirmed (step 35), and conversion processing is performed on the newly input kana-character string (step 3).
7).

【００１７】ところで、本実施例の場合には文字列区切
手段１３によって区切られた第１文節である「わたし
は」が、第２の文字列記憶手段７のかな文字列である
「わたし」を部分文字列として含んでいるので、「わた
しは」に対してかな漢字変換処理を行う（ステップ３
４）。かな漢字変換処理の結果「私は」が候補として出
現すると、照合手段１７が変換された第１文節である
「私は」と第１の文字列記憶手段５に記憶されているか
な漢字文字列である「私」を照合し、変換された第１文
節である「私は」が、第１の文字列記憶手段５に記憶さ
れているかな漢字文字列である「私」を部分文字列とし
て含んでいるか否かを調べる（ステップ３６）。By the way, in the case of the present embodiment, the first phrase "Iwa" delimited by the character string delimiter 13 is replaced by the kana character string "I" in the second character string storage unit 7. Since it is included as a partial character string, kana-kanji conversion processing is performed on "Iwa" (step 3).
4). When "Iwa" appears as a candidate as a result of the Kana-Kanji conversion processing, the matching unit 17 is the converted first phrase "Iwa" and the Kana-Kanji character string stored in the first character string storage unit 5. Is "I", which is the first phrase converted by collating "I", included "I", which is the Kana-Kanji character string stored in the first character string storage means 5, as a partial character string? It is checked whether or not (step 36).

【００１８】部分文字列として含まない場合には、他の
候補が在るかどうかを判断し（ステップ３６ａ）、他の
候補が在る場合にはステップ３４に戻り再変換を行い、
同様の照合を繰り返す。全ての変換結果において、第１
文節のかな漢字文字列が第１の文字列記憶手段５中の文
字列を部分文字列として含まない場合はステップ３２に
戻り、第１文節の区切り処理をやり直す。部分文字列と
して含んでいる場合または上述の再変換によって該当す
る候補が出現した場合には、変換文字列格納手段１１に
格納されているかな文字列のうち第１文節については変
換後のかな漢字文字列で確定しておき、この第１文節を
除いた部分のかな文字列に対して、かな文字列の区切り
が行われ（ステップ３７）、変換キーの操作によってか
な漢字変換が行われ、所望のかな漢字文字列が出現すれ
ば確定する（ステップ３８）。If it is not included as a partial character string, it is judged whether or not there is another candidate (step 36a). If there is another candidate, the process returns to step 34 to perform re-conversion,
The same collation is repeated. First of all conversion results
If the kana-kanji character string of the bunsetsu does not include the character string in the first character string storage means 5 as a partial character string, the process returns to step 32 and the first bunsetsu delimitation process is performed again. When it is included as a partial character string or when a corresponding candidate appears by the above-mentioned re-conversion, the kana-kanji character after conversion is performed for the first phrase of the kana character string stored in the converted character string storage means 11. Kana-kanji is delimited by the row, and the kana-character string of the part excluding the first clause is delimited (step 37), and kana-kanji conversion is performed by operating the conversion key to obtain the desired kana-kanji. If the character string appears, it is confirmed (step 38).

【００１９】本実施例の場合には、変換された第１文節
である「私は」が第１の文字列記憶手段５に記憶されて
いるかな漢字文字列である「私」を部分文字列として含
んでいるので、変換文字列格納手段１１に格納されてい
るかな文字列である「わたしはいとうです」のうち第１
文節である「わたしは」を「私は」として確定し、これ
を除いた部分のかな文字列である「いとうです」に対し
て、かな文字列の区切りが行われ（本例の場合には「い
とうです」がこれ以上文節に区切ることができないので
この動作は省略される）（ステップ３７）、変換キーの
操作によってかな漢字変換が行われ、所望のかな漢字文
字列である「伊藤です」が出現すれば確定する（ステッ
プ３８）。In the case of this embodiment, the converted first phrase "Iwa" is the kana-kanji character string "I" stored in the first character string storage means 5 as a partial character string. Since it contains, the first character in the kana character string “I am Ito” stored in the conversion character string storage means 11.
The phrase "Iwa" is fixed as "Iwa", and the kana character string except for this is separated from the kana character string (in the case of this example, This operation is omitted because "Itosu" cannot be further divided into clauses (step 37). Kana-Kanji conversion is performed by the operation of the conversion key, and the desired Kana-Kanji character string "Ito is" appears. If so, it is confirmed (step 38).

【００２０】このとき変換文字列格納手段１１内での変
換結果は「私は伊藤です」となり、このかな漢字文字列
から第１の文字列記憶手段５に記憶されているかな漢字
文字列である「私」を除いた「は伊藤です」が表示対象
となる。従って、このときの表示手段１９における表示
結果は「私は伊藤です」となる（ステップ３９）（図３
の（ｅ））。At this time, the conversion result in the conversion character string storage means 11 becomes "I am Ito", and the kana-kanji character string stored in the first character string storage means 5 from this kana-kanji character string "I "Is Ito" excluding "is displayed. Therefore, the display result on the display means 19 at this time is "I am Ito" (step 39) (FIG. 3).
(E)).

【００２１】従来のかな漢字変換装置では、変換が確定
した文節である「私」を参照しないで確定後新たに入力
したかな文字列のみを対象にして変換を行っていたの
で、例えば本実施例の新たに入力されたかな文字列であ
る「はいとうです」に対しては「配当です」と変換して
いた。しかし、本発明によれば、確定済みの文字列であ
る「私」を参照して新たに入力したかな文字列の変換候
補を出現させるようにしたので、「は伊藤です」と
「私」に正しく続くかな漢字文字列への変換が容易にで
きる。In the conventional kana-kanji conversion device, the conversion is performed only for the kana character string newly input after the confirmation without referring to the phrase "I" which is the confirmed conversion. The newly entered kana character string "Haitouto" was converted into "dividend." However, according to the present invention, the conversion candidate of the newly input kana character string is made to appear by referring to the fixed character string "I", so that "is Ito" and "I" You can easily convert to the correct Kana-Kanji character string.

【００２２】次ぎに、「とりがなく」という文字列の変
換を例にして説明する。図４はキー操作と表示手段１９
に表示される表示結果との関係を説明するための説明図
であり、図において（ｆ）〜（ｊ）はキー操作の各段階
における表示結果を示している。入力手段１によって
「とりが」というかな文字列が入力され（表示結果は図
４の（ｆ））、変換キーの操作によって変換が行われ
（図４の（ｇ））、「鳥が」がというかな漢字文字列が
確定する（図４の（ｈ））。この時点で、第１の文字列
記憶手段５には「鳥が」という文字列が入り、第２の文
字列記憶手段７には「とりが」という文字列が入る。Next, conversion of the character string "Torashiri" will be described as an example. FIG. 4 shows key operation and display means 19.
FIG. 4 is an explanatory diagram for explaining the relationship with the display result displayed in FIG. 3, in which (f) to (j) show the display result at each stage of the key operation. A kana character string "toriga" is input by the input means 1 (display result is (f) in FIG. 4), and conversion is performed by operating the conversion key ((g) in FIG. 4). The kana-kanji character string is fixed ((h) in FIG. 4). At this point, the character string "bird" is stored in the first character string storage means 5, and the character string "toriga" is stored in the second character string storage means 7.

【００２３】この後、「なく」というかな文字列が入力
されると（図４の（ｉ））、第２の文字列記憶手段７中
の文字列である「とりが」と新たに入力された「なく」
が連結され「とりがなく」というかな文字列が変換文字
列格納手段１１に格納され、変換の対象となる。そし
て、このかな文字列は第１文節が「とりが」、第２文節
が「なく」と区切られ、区切られた第１文節の「とり
が」が第２の文字列記憶手段７中の文字列である「とり
が」を部分文字列として含んでいるかどうか調べる。本
例の場合には含んでいるので、第１文節である「とり
が」に対する変換が行われ、「鳥が」というかな漢字文
字列が候補として出現する。候補として出現した「鳥
が」が第１の文字列記憶手段５中の「鳥が」を部分文字
列として含んでいるかどうかを調べる。本例の場合は含
んでいるので第１文節を「鳥が」としてを確定した後、
第２文節である「なく」に対する変換が行われる。After that, when a kana character string "nashi" is input ((i) in FIG. 4), "toriga" which is the character string in the second character string storage means 7 is newly input. "Without"
Are concatenated and the kana character string “Torashiri” is stored in the conversion character string storage means 11 and becomes a conversion target. In this kana character string, the first clause is delimited as "toriga" and the second clause is delimited, and the delimited "toriga" of the first clause is a character in the second character string storage means 7. Check to see if it contains the string "Toriga" as a substring. In the case of this example, since it is included, the conversion for the first phrase "Toriga" is performed, and the Kana-kanji character string "Toriga" appears as a candidate. It is checked whether or not "bird" that appears as a candidate includes "bird" in the first character string storage means 5 as a partial character string. In the case of this example, it is included, so after confirming that the first phrase is "bird",
The conversion for the second clause, “null”, is performed.

【００２４】このとき、第１文節として確定した「鳥
が」という文字列が共起情報として参照され、これと共
起関係にある「鳴く」が変換候補として出現する（図４
の（ｊ））。従来「鳥が」を確定した後に「なく」とい
うかな文字列を入力して変換した場合には、確定したか
な漢字文字列である「鳥が」が参照されないので、「無
く」「泣く」「鳴く」という変換候補が過去の使用頻度
によって出現し、変換効率が悪かった。しかし、本発明
によれば確定したかな漢字文字列である「鳥が」を参照
にして、新たに入力された「なく」の変換を行うので
「鳥が」と共起関係にある「鳴く」が変換候補として最
初に出現し、効率の良い変換ができる。At this time, the character string "bird" determined as the first clause is referred to as co-occurrence information, and "crow" having a co-occurrence relationship with this appears as a conversion candidate (FIG. 4).
(J)). Conventionally, when a character string "Kan" is input and converted after "Toriga" is confirmed, "Kana", which is the confirmed Kana-Kanji character string, is not referenced, so "No", "Cry", and "Cry" A conversion candidate appeared according to the frequency of use in the past, and the conversion efficiency was poor. However, according to the present invention, the newly input “n” is converted with reference to the fixed kana-kanji character string “bird”, so that “calling” that has a co-occurrence relationship with “bird” does not occur. It appears first as a conversion candidate and allows efficient conversion.

【００２５】[0025]

【発明の効果】以上説明したのように、本発明によれば
確定済みのかな漢字文字列を参照して新たに入力したか
な文字列の変換候補を出現させるようにしたので、確定
済みのかな漢字文字列に正しく続くかな漢字文字列への
変換が容易にできる。As described above, according to the present invention, a conversion candidate for a newly input kana character string is made to appear by referring to a fixed kana kanji character string, and thus a fixed kana kanji character is determined. You can easily convert to a Kana-Kanji character string that follows the sequence correctly.

[Brief description of drawings]

【図１】本発明に係るかな漢字変換装置の一実施例を示
すブロック図である。FIG. 1 is a block diagram showing an embodiment of a kana-kanji conversion device according to the present invention.

【図２】実施例１の動作を示すフローチャートである。FIG. 2 is a flowchart showing the operation of the first embodiment.

【図３】キー操作と表示手段１９に表示される表示結果
との関係を説明するための説明図である。FIG. 3 is an explanatory diagram for explaining a relationship between a key operation and a display result displayed on the display means 19.

【図４】キー操作と表示手段１９に表示される表示結果
との関係を説明するための説明図である。FIG. 4 is an explanatory diagram for explaining a relationship between a key operation and a display result displayed on the display means 19.

【図５】従来の分かち書きおよびかな漢字変換装置の動
作を示したフローチャートである。FIG. 5 is a flowchart showing the operation of a conventional space-dividing and kana-kanji conversion device.

[Explanation of symbols]

１入力手段５第１の文字列記憶手段７第２の文字列記憶手段９共起情報格納手段１１変換文字列格納手段１３文字列区切手段１５変換手段１９表示手段 DESCRIPTION OF SYMBOLS 1 input means 5 1st character string storage means 7 2nd character string storage means 9 co-occurrence information storage means 11 converted character string storage means 13 character string delimitation means 15 conversion means 19 display means

Claims

[Claims]

1. A kana-kanji conversion device for inputting a kana character string, searching a dictionary for a kanji corresponding to the input kana, and converting the kana into a kana-kanji mixed sentence. First character string storage means for storing the last phrase of the kana-kanji mixed character string, and a second character for storing a kana character string corresponding to the kana-kanji mixed character string stored in the first character string storage means A column storage means and a conversion character string for reading out the kana character string stored in the second character string storage means and concatenating and storing the input kana character string after the read kana character string. A storage unit and a character that divides the kana character string stored in the converted character string storage unit into phrases such that the first phrase includes the Kana character string stored in the second character string storage unit as a partial character string. Row And a kana-kanji character string delimited by the character string delimiter means, the first clause is converted to a kana-kanji character string containing the kana-kanji character string stored in the first character string storage means as a partial character string. However, the second and subsequent phrases are converted into a kana-kanji mixed character string that appropriately follows the first phrase, and kana-kanji mixed character strings converted by the conversion means are stored in the first character string storage means. A kana-kanji conversion device comprising: display means for displaying kana-kanji mixed character strings excluding the kana-kanji character strings.

2. The conversion means, based on the co-occurrence information stored in the co-occurrence information storage means storing the word-to-word co-occurrence information, the second clause of the kana character string stored in the converted character string storage means. 2. The kana-kanji conversion device according to claim 1, wherein the subsequent characters are converted into kana-kanji character strings.