JPH05108705A

JPH05108705A - Machine translation device

Info

Publication number: JPH05108705A
Application number: JP3269240A
Authority: JP
Inventors: Yuji Ito; 雄二伊藤
Original assignee: Matsushita Electric Industrial Co Ltd
Current assignee: Panasonic Holdings Corp
Priority date: 1991-10-17
Filing date: 1991-10-17
Publication date: 1993-04-30

Abstract

(57)【要約】【目的】原言語から相手言語への翻訳を行う際に、語
の多義性による訳し分けに、語と語の関係を集めた知識
ベースを利用することによってより正確に、また効率的
に翻訳処理を行うことを目的とする。【構成】入力手段１により入力された原言語文字列を
相手言語に翻訳するにあたり、語の多義性による訳し分
けの必要が生じた場合に、６の多義性解消部が、５の知
識ベースに記載されている、一般の文章中に現われる語
と語の関係を集めたデータを参照して多義性の解消を行
い、訳し分けを行う。 (57) [Summary] [Purpose] When translating from a source language into a partner language, it is more accurate by using a knowledge base that collects the relationships between words to distinguish between words based on polysemy. Moreover, it aims at performing translation processing efficiently. When the source language character string input by the input means 1 is translated into a partner language and it is necessary to perform transcribing based on the polysemy of words, the polysemy resolution unit 6 makes the knowledge base 5 Refer to the data that describes the relations between words that appear in ordinary sentences to resolve polysemy and perform translation.

Description

Detailed Description of the Invention

【０００１】[0001]

【産業上の利用分野】本発明は、原言語の文字列を相手
言語の文字列に翻訳する機械翻訳装置に関するものであ
る。BACKGROUND OF THE INVENTION 1. Field of the Invention The present invention relates to a machine translation device for translating a source language character string into a partner language character string.

【０００２】[0002]

【従来の技術】近年、機械翻訳に代表されるように自然
言語処理技術への要請が高まってきている。2. Description of the Related Art In recent years, there has been an increasing demand for natural language processing technology as represented by machine translation.

【０００３】２言語間の翻訳を考えるとき、多義性の解
消が大きな問題となる。例えば、日英間の翻訳を考える
とき、「問題を解く」「荷物を解く」を英語に翻訳しよ
うとする場合、上の例では「解く」にあたる英語はｓｏ
ｌｖｅであり、下の例ではｕｎｔｉｅが適当である。
「解く」の多義性によるこのような訳し分けを行うため
に、従来は動詞（この場合は「解く」）の目的語（この
場合は「問題」と「荷物」）を意味的に分類しておき、
ある意味範疇に入る場合はある訳語を、また他の意味範
疇に入る場合は他の訳語を当てる、というような処理を
行っていた。When considering translation between two languages, resolution of polysemy becomes a big problem. For example, when thinking about translating between Japanese and English, when trying to translate "solve a problem" and "unravel a package" into English, in the above example, the word "solve" is so
lve, and untie is suitable in the example below.
In order to perform such translation based on the polysemy of "solve," the object of the verb (in this case, "solve") (in this case, "problem" and "luggage") is semantically classified. Every
When a certain semantic category is entered, a certain translated word is applied, and when another semantic category is entered, another translated word is applied.

【０００４】[0004]

【発明が解決しようとする課題】上記のように意味分類
により多義性を解消し、適切な語を選ぶという方法で
は、非常に多くの語について意味の分類を行わなければ
ならず、また、矛盾を起こさないようにその意味分類の
基準をつくることにも困難が伴うと考えられる。さら
に、そのような意味分類ですべての多義性が解消できる
わけではない。In the method of eliminating polysemy by means of semantic classification and selecting an appropriate word as described above, it is necessary to classify meanings for a very large number of words, and inconsistencies. It may be difficult to create a criterion for the semantic classification so that the above does not occur. Moreover, such semantic classification does not eliminate all ambiguities.

【０００５】[0005]

【課題を解決するための手段】上記のような問題を解決
するために、単語間の共起関係を集めた知識ベースを利
用することを考える。すなわち、実際に使われている文
の中から、語と語の共起の情報（例えば、上記の例のよ
うな「問題を解く」，「荷物を解く」など）を取り出
し、それに対応する相手言語での表現（この例では“ｓ
ｏｌｖｅａｐｒｏｂｌｅｍ”，“ｕｎｔｉｅａ
ｐａｃｋａｇｅ”など）を含めたデータを大量に準備し
ておく。そして、多義性の問題が生じた場合、この蓄え
られたデータを参照することによって適切な訳し分けを
行うことができる。先の例では、「解く」の多義性に対
して、「問題を解く」という文ならば、ｓｏｌｖｅが訳
語として適切であるということがわかる。[Means for Solving the Problems] In order to solve the above problems, consider using a knowledge base that collects co-occurrence relationships between words. In other words, information about co-occurrence of words (for example, “solve a problem”, “unravel a package”, etc.) is extracted from the sentence that is actually used, and the corresponding partner Expression in language (in this example, "s
"olve a problem", "untie a"
A large amount of data including "package" and the like) is prepared. When a problem of polysemy occurs, appropriate translation can be performed by referring to the stored data. Then, with respect to the polysemy of “solve”, it can be seen that if the sentence “solve a problem”, solve is appropriate as a translation.

【０００６】[0006]

【作用】２言語間の翻訳を行うにあたり、語と語の関係
を集めた知識ベースを準備しておき、語の多義性による
訳し分けに利用することにより、効率的な翻訳処理を行
うことができるようになる。When a translation between two languages is performed, a knowledge base that collects the relations between words is prepared and used for differentiating according to the polysemy of words, so that efficient translation processing can be performed. become able to.

【０００７】[0007]

【実施例】以下、図面を参照しながら説明を行う。DESCRIPTION OF THE PREFERRED EMBODIMENTS A description will be given below with reference to the drawings.

【０００８】図１は、本発明の一実施例における機械翻
訳装置の機能ブロック図である。同図において、１は原
言語の文字列の入力手段である。２は入力手段１から入
力される原言語の文字列を記憶する入力文字列記憶部、
３はそれぞれの言語に関する情報及び２言語間の対訳情
報等を持つ翻訳辞書部、４は翻訳辞書から解析対象の単
語を検索する辞書検索部である。５は原言語の語と語の
関係、及びその対訳を集めた知識ベース部、６は知識ベ
ース５を用いて、語の多義性による訳し分けを行う多義
性解消部である。７は解析結果を出力する解析結果出力
部、８は翻訳処理を制御する翻訳処理制御部である。FIG. 1 is a functional block diagram of a machine translation device according to an embodiment of the present invention. In the figure, reference numeral 1 is an input means for inputting a character string in the source language. Reference numeral 2 denotes an input character string storage unit that stores a character string in the source language input from the input means 1.
Reference numeral 3 is a translation dictionary unit having information about each language and bilingual translation information, and the like, and 4 is a dictionary search unit that searches a translation dictionary for a word to be analyzed. Reference numeral 5 is a knowledge base unit that collects relations between words in the source language and their translations, and 6 is a ambiguity resolution unit that uses the knowledge base 5 to perform translation according to the polysemy of the words. Reference numeral 7 is an analysis result output unit that outputs an analysis result, and 8 is a translation processing control unit that controls translation processing.

【０００９】図２は本発明の一実施例における処理の流
れを表わすフローチャートである。以下、本発明の一実
施例として日英の翻訳を取り上げ、このフローチャート
に従って本装置の動作について説明する。FIG. 2 is a flow chart showing the flow of processing in one embodiment of the present invention. A Japanese-English translation will be taken up as an embodiment of the present invention, and the operation of the present apparatus will be described with reference to this flowchart.

【００１０】まずステップ１で、入力された日本語文字
列に対し、翻訳処理制御部８が辞書検索部４により、翻
訳辞書部３を公知の検索法を使って検索して語に関する
情報を獲得し、翻訳処理を進めていく。First, in step 1, the translation processing control unit 8 searches the translation dictionary unit 3 for the input Japanese character string by the dictionary search unit 4 using a known search method to obtain information about words. And proceed with the translation process.

【００１１】次にステップ２では、翻訳処理の過程で、
ある語の多義性のために訳し分けを行う必要のあるとこ
ろがあるかどうかを判断する。訳し分けを行うべきとこ
ろがあればステップ３の処理を行い、なければそのまま
ステップ７の処理に進む。Next, in step 2, in the process of translation processing,
Determine whether there is a need for translation due to the polysemy of a word. If there is a place to be translated, the process of step 3 is performed, and if not, the process proceeds to step 7 as it is.

【００１２】ステップ３では多義性解消部６が、訳し分
けの必要な部分に対して図４の知識ベース部５に記載さ
れた知識データの中に、問題となっている語の訳し分け
に利用できるものがあるかどうかを判断する。ここで適
用できるデータがない場合はそのままステップ５に進
む。In step 3, the disambiguation unit 6 is used for translating the word in question in the knowledge data described in the knowledge base unit 5 of FIG. Determine if there is something you can do. If there is no applicable data here, the process directly proceeds to step 5.

【００１３】適用可能な知識があれば、ステップ４で多
義性の解消処理を行う。次に、ステップ６で多義性解消
の処理がすべて終ったかどうかを判断し、終っていれば
ステップ７へ進み、終っていなければ、ステップ３の処
理を繰り返す。If there is applicable knowledge, ambiguousness resolution processing is performed in step 4. Next, in step 6, it is judged whether or not the processing for disambiguation has been completed. If completed, the processing proceeds to step 7, and if not completed, the processing in step 3 is repeated.

【００１４】ステップ５では、翻訳処理制御部８が、使
用頻度などの情報を使って訳語をひとつに決定する。In step 5, the translation processing control unit 8 determines one translated word by using information such as the frequency of use.

【００１５】さらに、ステップ６では残りの処理を行
い、次のステップ７で翻訳結果を翻訳結果出力部７に出
力する。Further, in step 6, the remaining processing is performed, and in the next step 7, the translation result is output to the translation result output unit 7.

【００１６】さらに具体例を挙げて実際の処理を詳細に
説明する。次の例文を考える。「彼は、昨日、その問題
を解いた。」まずステップ１で辞書検索部４により翻訳
辞書部３を単語単位に検索しながら翻訳処理を進める。
この例では図３のように、「解いた」の部分に多義性に
よる訳し分けの必要が生じる（ステップ２）。The actual processing will be described in more detail with reference to specific examples. Consider the following example sentence. "He solved the problem yesterday." First, in step 1, the dictionary search unit 4 searches the translation dictionary unit 3 word by word to proceed with the translation process.
In this example, as shown in FIG. 3, it is necessary to separately translate the "solved" portion due to polysemy (step 2).

【００１７】次にステップ３で図４の知識ベース部の中
で「解く」に関するデータを検索する。ここで、「解
く」に関する知識データがみつかるので、ステップ４
で、多義性解消部６はそのデータを使って、この部分の
多義性解消処理を行う。この場合は、「解く」に対して
は「問題を解く」，「荷物を解く」の２つのデータがあ
り、入力文は「問題を解く」に該当するので、「解く」
の対訳としてはｓｏｌｖｅが適当であると判断する。こ
こで、適当な知識データが無い場合は、前述のように使
用頻度などの情報によりどれかひとつを選択することに
なる（ステップ５）。次に、ステップ６で多義性の解消
が必要な部分が他にあるかどうかを判断する。この例で
は、他にないので次のステップ７に進み、残った処理を
行う。Next, in step 3, the data regarding "solve" are searched in the knowledge base section of FIG. Here, since knowledge data about "solving" is found, step 4
Then, the ambiguity resolution unit 6 uses the data to perform the ambiguity resolution processing of this portion. In this case, there are two data for “solve” and “solve luggage” for “solve”, and the input sentence corresponds to “solve the problem”, so “solve”
It is determined that solve is appropriate as a parallel translation of. Here, if there is no appropriate knowledge data, one of them is selected according to the information such as the frequency of use as described above (step 5). Next, in step 6, it is determined whether or not there is another portion that needs to be disambiguated. In this example, since there is nothing else, the process proceeds to the next step 7, and the remaining processing is performed.

【００１８】さらに次のステップで、翻訳の最終的な結
果を翻訳結果出力部７に出力する。この例では図５のよ
うな結果が出力されることになる。In the next step, the final result of the translation is output to the translation result output unit 7. In this example, the result shown in FIG. 5 is output.

【００１９】今は日英の翻訳を例として多義性解消の動
作を説明したが、逆に英日の場合でも同様に行うことが
できる。図３のｔａｋｅの例のように目的語が“ａｃｔ
ｉｏｎ”，“ａｎｅｘａｍｉｎａｔｉｏｎ”によって
訳が変わってくるが、このような場合にも知識データを
使って訳語選択を行うことができる。Although the operation of disambiguation has been described by taking a Japanese-English translation as an example, it can be similarly performed in the case of English-Japanese. As in the example of take in FIG. 3, the object is “act.
Although the translation varies depending on "ion" and "an examination", the translated word can be selected using the knowledge data even in such a case.

【００２０】[0020]

【発明の効果】上記のように、２言語間の翻訳を行うに
あたり、語と語の関係を集めた知識ベースを準備してお
き、語の多義性による訳し分けに利用することにより、
効率的な翻訳処理を行うことができるようになる。As described above, in translating between two languages, a knowledge base that collects relations between words is prepared and used for transcribing according to the polysemy of words.
It becomes possible to perform efficient translation processing.

[Brief description of drawings]

【図１】本発明の一実施例における、機械翻訳装置の機
能ブロック図FIG. 1 is a functional block diagram of a machine translation device according to an embodiment of the present invention.

【図２】本発明の一実施例における、翻訳処理の動作を
表わすフローチャートFIG. 2 is a flowchart showing an operation of a translation process in the embodiment of the present invention.

【図３】本発明の一実施例における、多義性が生じる例
を示す図FIG. 3 is a diagram showing an example in which polysemy occurs in one embodiment of the present invention.

【図４】本発明の一実施例における、知識ベースのデー
タの例を示す図FIG. 4 is a diagram showing an example of knowledge base data according to an embodiment of the present invention.

【図５】本発明の一実施例における、多義性解消の結果
を示す図FIG. 5 is a diagram showing a result of disambiguation according to an embodiment of the present invention.

[Explanation of symbols]

１入力手段２入力文字列記憶部３翻訳辞書部４辞書検索部５知識ベース部６多義性解消部７解析結果出力部８翻訳処理制御部 DESCRIPTION OF SYMBOLS 1 Input means 2 Input character string storage unit 3 Translation dictionary unit 4 Dictionary search unit 5 Knowledge base unit 6 Ambiguity resolution unit 7 Analysis result output unit 8 Translation processing control unit

Claims

[Claims]

1. An input means for inputting a source language character string, an input character string storage section for storing the input character string, a morpheme information of a source language word and a partner language, a bilingual information of a source language and a partner language, and the like. A translation dictionary unit that holds the translation dictionary, a dictionary search unit that searches the translation dictionary, a co-occurrence relationship between words in the source language, and a knowledge base unit that collects corresponding expressions in the other language, and the knowledge base. Section, the ambiguity resolution unit that resolves the ambiguity that occurs when translating an input character string into the partner language, the translation result output unit that outputs the translation processing result, and the translation process between the source language and the partner language. A machine translation device comprising a translation process execution control unit for controlling.