JPH05128146A - English morphological analyzer - Google Patents

English morphological analyzer

Info

Publication number
JPH05128146A
JPH05128146A JP3291146A JP29114691A JPH05128146A JP H05128146 A JPH05128146 A JP H05128146A JP 3291146 A JP3291146 A JP 3291146A JP 29114691 A JP29114691 A JP 29114691A JP H05128146 A JPH05128146 A JP H05128146A
Authority
JP
Japan
Prior art keywords
analysis
morpheme
word
english
stored
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
JP3291146A
Other languages
Japanese (ja)
Inventor
Shigeko Akiyama
薫子 秋山
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Panasonic Holdings Corp
Original Assignee
Matsushita Electric Industrial Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Matsushita Electric Industrial Co Ltd filed Critical Matsushita Electric Industrial Co Ltd
Priority to JP3291146A priority Critical patent/JPH05128146A/en
Publication of JPH05128146A publication Critical patent/JPH05128146A/en
Pending legal-status Critical Current

Links

Landscapes

  • Machine Translation (AREA)

Abstract

PURPOSE:To eliminate morpheme analysis failure due to morpheme division error by outputting plural analyzing results without selecting single result relating to an English text which contains a complex word, duplicated complex words, etc., and which is expected to obtain plural analyzing results. CONSTITUTION:A morpheme analysis control part 4 executes morpheme analysis with the English text inputted by an input means 1. A word dictionary 3 is referred to for each word (morpheme) constituting the inputted text during the morpheme analyzing process. At that time, if the word-string such as a complex word, etc., which permits plural morpheme divisions exists, a re-analysis position storing part 5 stores that position. Then, analysis is continued with one of plural division patterns. the analyzing result is stored in an analyzing pattern storage part 6. Then, analysis is performed again with another pattern starting with the position stored in the re-analysis position storage part 5, and that result is also stored in the storage part 6 and then collectively displayed on a display part 7.

Description

【発明の詳細な説明】Detailed Description of the Invention

【0001】[0001]

【産業上の利用分野】本発明は、英語を日本語に翻訳す
る際に行なう英語形態素解析処理に関する。
BACKGROUND OF THE INVENTION 1. Field of the Invention The present invention relates to an English morphological analysis process performed when translating English into Japanese.

【0002】[0002]

【従来の技術】形態素解析処理において、形態素分割の
方法次第で、複合語を含む英文や見かけ上重複して複合
語を形成しているような英文を解析すると、この英文に
ついて複合語にはなり得ないパターンを選択し、形態素
解析を誤ってしまう場合がある。
2. Description of the Related Art In a morphological analysis process, when an English sentence including a compound word or an English sentence that apparently forms a compound word depending on the method of morpheme division is analyzed, this English sentence becomes a compound word. There is a case in which a pattern that cannot be obtained is selected and the morphological analysis is mistaken.

【0003】[0003]

【発明が解決しようとする課題】上記のように、英文が
見かえ上重複して複合語を形成しているような英文を解
析すると、重複している複合語の内どれか一つのパター
ンのみしか解析することができず、そのパターンがこの
英文中においては複合語にはなり得なかった場合、これ
は形態素分割の誤りで、形態素解析の結果はその複合語
のパターンを選んだ時点で既に誤りということになり、
この形態素解析結果を用いて翻訳処理を行なうと誤りの
訳文しか出せなくなる。
As described above, when an English sentence in which the English sentences apparently overlap to form a compound word is analyzed, only one of the patterns of the overlapping compound words is analyzed. However, if the pattern could not be a compound word in this English sentence, this is an error in morpheme division, and the result of the morpheme analysis is already at the time when the pattern of the compound word is selected. It ’s an error,
If a translation process is performed using this morphological analysis result, only an incorrect translated sentence can be issued.

【0004】また、句動詞を含む文でも同様のことが言
える。英文の意味解析を行なわない形態素解析では、句
動詞としてまとめて形態素分割をしたものと、しなかっ
たものとでは、どちらが正しい解釈であるのか判断でき
ず形態素分割で誤り、形態素解析に失敗してしまうこと
もある。
The same applies to a sentence including a phrasal verb. In morphological analysis that does not perform semantic analysis of English sentences, it is not possible to determine which is the correct interpretation, that is, the morpheme splitting that is combined as a phrasal verb, and the one that is not done. It may be lost.

【0005】本発明は上記課題を解決するもので、形態
素分割の誤りによる形態素解析の失敗を無くす英語形態
素解析装置の提供を目的とする。
The present invention solves the above problems, and an object of the present invention is to provide an English morpheme analysis apparatus which eliminates the failure of morpheme analysis due to an error in morpheme division.

【0006】[0006]

【課題を解決するための手段】上記目的を達成するため
に本発明は、重複する複合語など解析結果が複数考えら
れる場合は、一意に解析結果を決めずに、複数の解析結
果を出力する構成を有する。
In order to achieve the above object, the present invention outputs a plurality of analysis results without uniquely determining an analysis result when a plurality of analysis results such as overlapping compound words are considered. Have a configuration.

【0007】[0007]

【作用】本発明は上記した構成で複数の形態素解析結果
を出力することによって、形態素分割のための解析誤り
を無くすことができる。
According to the present invention, by outputting a plurality of morpheme analysis results with the above configuration, it is possible to eliminate an analysis error for morpheme division.

【0008】[0008]

【実施例】以下、本発明の一実施例について図面を参照
しながら説明する。図1は本発明の一実施例における形
態素解析装置の機能ブロック図である。同図において、
1は英文の文字列を入力する入力手段、2は入力手段1
から入力された英文を記憶する入力英文記憶部、3は見
出し語と形態素情報を記憶している単語辞書部、4は3
の単語辞書部を用いて形態素解析処理を行なう形態素解
析制御部、5は解析途中で別の形態素分割の解釈が考え
られる場合にその形態素の位置を記憶しておく再解析位
置記憶部、6は形態素解析結果を記憶する解析結果記憶
部、7は解析結果を表示する表示部である。
DETAILED DESCRIPTION OF THE PREFERRED EMBODIMENTS An embodiment of the present invention will be described below with reference to the drawings. FIG. 1 is a functional block diagram of a morphological analysis apparatus according to an embodiment of the present invention. In the figure,
1 is an input means for inputting an English character string, 2 is an input means 1
An input English sentence storage unit for storing English sentences input from 3 is a word dictionary unit for storing headwords and morpheme information, 4 is 3
The morpheme analysis control unit 5 that performs morpheme analysis processing using the word dictionary unit of 5 is a reanalysis position storage unit 6 that stores the position of the morpheme when another interpretation of morpheme division can be considered during analysis. An analysis result storage unit for storing the morphological analysis result, and a display unit 7 for displaying the analysis result.

【0009】上記構成要素よりなる装置の各構成要素の
関係と動作を述べる。まず、入力手段1によって入力さ
れた英文に対し、形態素解析制御部4を動作させ、形態
素解析を実行する。この形態素解析の処理過程において
は、入力された英文を構成するそれぞれの単語(形態
素)について単語辞書部3を検索する。この際、複合語
など、複数の形態素分割が可能な単語列が存在している
場合は再解析位置記憶部5にその位置を記憶しておき、
複数の分割パターンの内の一つのパターンで解析を続行
し、その解析結果が得られた時点で、解析結果記憶部6
に解析結果を記憶しておき、再解析位置記憶部5に記憶
された位置からまた新たに別のパターンの解釈で解析を
し直し、その結果も解析結果記憶部6に記憶される。こ
のように、一つ以上の解析結果が解析結果記憶部6に記
憶され、表示部7で表示される。
The relation and operation of each constituent element of the apparatus composed of the above constituent elements will be described. First, the morphological analysis control unit 4 is operated for the English sentence input by the input means 1 to execute the morphological analysis. In the morphological analysis process, the word dictionary unit 3 is searched for each word (morpheme) forming the input English sentence. At this time, if there are a plurality of morpheme-dividable word strings such as compound words, their positions are stored in the reanalysis position storage unit 5,
The analysis is continued for one of the plurality of divided patterns, and when the analysis result is obtained, the analysis result storage unit 6
The analysis result is stored in the memory, the analysis is performed again from the position stored in the reanalysis position storage unit 5 again by interpreting another pattern, and the result is also stored in the analysis result storage unit 6. In this way, one or more analysis results are stored in the analysis result storage unit 6 and displayed on the display unit 7.

【0010】以下、図2のフローチャートを参照しなが
ら複数の解析結果が得られる場合の処理について説明す
る。
The processing when a plurality of analysis results are obtained will be described below with reference to the flowchart of FIG.

【0011】まず、ステップAで形態素解析処理を始め
る。ステップBで形態素分割の処理が行なわれ、分割対
象単語列が複合語であったなどの理由で複数の分割が可
能な場合、ステップCで現在の形態素分割位置を再解析
位置として記憶しておく。複数の分割パターンのうちの
一つを用いて解析処理を続行し(ステップD)、一つの
解析結果が得られると、ステップEにおいて形態素解析
結果を記憶する。解析が終了したら、再解析位置を確認
し、再解析の必要があれば、ステップFで再解析位置を
求め再解析位置において先程とは別のパターンで形態素
分割し(ステップB)、形態素解析の処理を行ない(ス
テップC,D)、解析が終了したら、ステップEにおい
て解析結果を記憶する。こうして、一つ以上の形態素解
析結果が記憶され、ステップGで表示される。
First, in step A, the morphological analysis process is started. If a plurality of divisions are possible because the morpheme division process is performed in step B and the division target word string is a compound word, the current morpheme division position is stored as a reanalysis position in step C. .. The analysis process is continued using one of the plurality of division patterns (step D), and when one analysis result is obtained, the morpheme analysis result is stored in step E. After the analysis is completed, the reanalysis position is confirmed, and if reanalysis is necessary, the reanalysis position is obtained in step F, and the reanalysis position is morpheme-divided in a pattern different from the above (step B). When the processing is performed (steps C and D) and the analysis is completed, the analysis result is stored in step E. Thus, one or more morphological analysis results are stored and displayed in step G.

【0012】さらに、具体例を挙げて説明する。次のよ
うな英文に対し、形態素解析の要求があったとする。
Further, a specific example will be described. Suppose there is a request for morphological analysis for the following English sentences.

【0013】The sun set in the west.この英文につい
て形態素分割を行なうと、setの単語でsetとset inの二
つの分割パターンが存在することがわかる。
When the morpheme division is performed on this English sentence, it is found that there are two division patterns of set and set in in the word set.

【0014】まず、setと分割するパターンで解析を行
なうと、 The / sun / set / in / the/ west /
. に分割され、形態素情報を付加した形態素解析結果は、
(実際は、形態素情報として色々なものを付加するがこ
こではそれらを代表して品詞のみを示す。) The / sun / set / in / the / west / . 冠詞 名詞 名詞 前置詞 冠詞 名詞 記号 動詞 副詞 形容詞 が得られる。
First, when analysis is performed with a pattern that is divided into sets, The / sun / set / in / the / west /
The morpheme analysis result with morpheme information divided into.
(In fact, various kinds of morpheme information are added, but only the part of speech is shown here as a representative.) The / sun / set / in / the / west /. Article noun noun preposition article noun symbol verb adverb adjective can get.

【0015】次に、set inという句動詞で形態素分割を
行ない解析すると、 The / sun / set in / the / west / . に分割され、形態素情報を付加した形態素解析結果は、 The / sun / set in / the / west / . 冠詞 名詞 動詞 冠詞 名詞 記号 となり、,の2つのパターンの形態素解析結果が得
られる。
Next, when the morpheme is divided by the phrase verb set in and analyzed, it is divided into The / sun / set in / the / west /., And the morpheme analysis result with morpheme information added is The / sun / set in / the / west /. Article Noun Verb Article Noun It becomes a symbol, and the morphological analysis results of two patterns are obtained.

【0016】このように本発明の実施例の英語形態素解
析装置によれば、複合語などの複数の形態素分割が可能
な単語列を再解析位置記憶部に記憶しておき、複数の分
割パターンについて解析し、表示するので、形態素分割
の処理による形態素解析の失敗は無くなる。
As described above, according to the English morphological analysis apparatus of the embodiment of the present invention, a plurality of morpheme-separable word strings such as compound words are stored in the reanalysis position storage unit, and a plurality of division patterns are stored. Since it is analyzed and displayed, morpheme analysis failure due to morpheme division processing is eliminated.

【0017】[0017]

【発明の効果】以上の実施例から明らかなように本発明
によれば、複合語を含んでいる英文や見掛け上重複して
いる複合語などの、解析結果が複数考えられるような英
文の解析は、一意に解析結果を決めずに、複数の解析結
果を出力するので、形態素分割のための解析誤りの無い
英語形態素解析装置を提供できる。
As is apparent from the above embodiments, according to the present invention, analysis of English sentences containing compound words, compound words apparently overlapping, etc., in which multiple analysis results can be considered. Since a plurality of analysis results are output without uniquely determining the analysis result, it is possible to provide an English morpheme analysis device having no analysis error for morpheme division.

【図面の簡単な説明】[Brief description of drawings]

【図1】本発明の一実施例における形態素解析装置の機
能ブロック図
FIG. 1 is a functional block diagram of a morphological analysis apparatus according to an embodiment of the present invention.

【図2】同実施例における複数の解析結果を得るための
処理を表わすフローチャート
FIG. 2 is a flowchart showing a process for obtaining a plurality of analysis results in the same embodiment.

【符号の説明】[Explanation of symbols]

1 入力手段 2 入力英文記憶部 3 単語辞書部 4 形態素解析制御部 5 再解析位置記憶部 6 解析結果記憶部 7 表示部 1 input means 2 input English storage unit 3 word dictionary unit 4 morphological analysis control unit 5 reanalysis position storage unit 6 analysis result storage unit 7 display unit

Claims (1)

【特許請求の範囲】[Claims] 【請求項1】 原文の文字列を入力する入力手段と、前
記入力手段によって入力された文字列を記憶する入力英
文記憶部と、単語の見出し語や形態素情報などを記憶し
た単語辞書部と、前記単語辞書部を用いて形態素解析処
理を行なう形態素解析制御部と、解析途中で別の形態素
分割の解釈が考えられる場合にその形態素の位置を記憶
しておく角解析位置記憶部と、複数の形態素解析結果を
記憶する解析結果記憶部と、解析結果を表示する表示部
とを備え、形態素解析結果を一つに決めず、複数のパタ
ーンの解析結果を出力するようにした英語形態素解析装
置。
1. An input unit for inputting a character string of an original sentence, an input English storage unit for storing the character string input by the input unit, and a word dictionary unit for storing a headword of a word, morpheme information and the like. A morphological analysis control unit that performs a morphological analysis process using the word dictionary unit, an angular analysis position storage unit that stores the position of the morphological element when interpretation of another morphological division is considered during analysis, An English morphological analyzer that includes an analysis result storage unit that stores a morpheme analysis result and a display unit that displays the analysis result, and outputs the analysis results of a plurality of patterns without determining one morpheme analysis result.
JP3291146A 1991-11-07 1991-11-07 English morphological analyzer Pending JPH05128146A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
JP3291146A JPH05128146A (en) 1991-11-07 1991-11-07 English morphological analyzer

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
JP3291146A JPH05128146A (en) 1991-11-07 1991-11-07 English morphological analyzer

Publications (1)

Publication Number Publication Date
JPH05128146A true JPH05128146A (en) 1993-05-25

Family

ID=17765040

Family Applications (1)

Application Number Title Priority Date Filing Date
JP3291146A Pending JPH05128146A (en) 1991-11-07 1991-11-07 English morphological analyzer

Country Status (1)

Country Link
JP (1) JPH05128146A (en)

Similar Documents

Publication Publication Date Title
US4502128A (en) Translation between natural languages
US4833611A (en) Machine translation system
US4953088A (en) Sentence translator with processing stage indicator
GB2241094A (en) Translation machine
GB2194084A (en) Translation system
JP3136973B2 (en) Language analysis system and method
JPH05128146A (en) English morphological analyzer
JPS6190269A (en) Translation system
JPS61260366A (en) Mechanical translating system having learning function
JP2632806B2 (en) Language analyzer
JP2715419B2 (en) Translation equipment
JP2807586B2 (en) Machine translation equipment
JP2871300B2 (en) Machine translation equipment
JPS6320567A (en) Translation device
JPH0628396A (en) Electronic dictionary
JPS61260367A (en) Mechanical translating system
JPH03282874A (en) Document preparation device
JP3340124B2 (en) Kana-Kanji conversion device
JPS63136264A (en) Mechanical translating device
JPH07182342A (en) Machine translation device
JPH0239357A (en) Automatic checking/correcting device for japanese sentence
JPH07200588A (en) Translation device
JPH0816599A (en) Translation support device
JPS62272358A (en) Display system for interruption of translation in translating device
JPH01316863A (en) Automatic qualifying and correcting device for error in japanese language text