JPH05120330A - Syntax analysis control system - Google Patents

Syntax analysis control system

Info

Publication number
JPH05120330A
Authority
JP
Japan
Prior art keywords
phrase
word
infinitive
sentence
interest
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
JP3297804A
Other languages
Japanese (ja)
Inventor
Hidenori Aosawa
秀憲 青沢
Toshihisa Tashiro
敏久 田代
Akira Takagi
朗 高木
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
CSK Corp
Original Assignee
CSK Corp
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by CSK Corp
Priority to JP3297804A
Publication of JPH05120330A

Landscapes

  • Machine Translation (AREA)

Abstract

PURPOSE: To perform analysis accurately and efficiently by simultaneously comparing and evaluating, during analysis, the multiple candidates that a to-infinitive phrase can modify, and selecting the best candidate. CONSTITUTION: A syntax analysis unit 1 receives an English word sequence that has undergone morphological analysis, focuses on its phrases in order from the head of the sentence, judges their modification relations, and determines the phrase structure or dependency structure of the sentence. When the phrase in focus is a to-infinitive phrase, to-infinitive phrase modification target determining means, consisting of a dependency relation determination unit 20 and a semantic relation determination rule set 3, extracts and compares all modification relations between the to-infinitive phrase and the words that it can modify within the phrase adjacent to the phrase in focus on the sentence-head side. The most suitable of the extracted relations is selected globally, in consideration of the overall syntactic and semantic relations of the sentence, and the modification target and semantic usage of the phrase in focus are determined.

Description

Detailed Description of the Invention

[0001]

[Field of Industrial Application] The present invention relates to a syntax analysis control method for a syntax analysis system that mechanically analyzes the phrase structure or dependency structure of English sentences, and in particular to a syntax analysis control method characterized by how it determines the modification target and related properties when a sentence contains a to-infinitive phrase.

[0002]

[Prior Art] When the phrase structure or dependency structure of a sentence is analyzed mechanically, a morphological analysis is performed first: the input sentence is split into words, and a dictionary is consulted to attach syntactic and semantic information to each word. A syntax analysis is then performed, which uses the syntactic and semantic information attached to the words by the morphological analysis to analyze the phrase structure or dependency structure according to predetermined rules.

[0003] When the phrase structure or dependency structure of an English sentence is analyzed and the sentence contains a to-infinitive phrase, the variety of usages and semantic relations of to-infinitives means that the sentence may contain several words that the to-infinitive phrase could modify. For such sentences, the analysis performed by a conventional syntax analysis system has no direct interaction among the multiple rules that determine phrase structure and the like, so each rule can judge only local modification relations and phrase structure. The correct modification target of the to-infinitive phrase therefore cannot always be determined at the point where an individual rule is applied.

[0004] Moreover, in conventional syntax analysis systems, the rules that determine dependency structure and phrase structure were originally designed to judge these relations only for the top node of the parse tree representing an already-built phrase. That is, in phrase structure analysis, a rule decides whether a higher-level phrase can be formed whose constituents are the phrasal or lexical categories at the top of already-built phrase markers; in dependency structure analysis, a rule decides whether a dependency relation holds between the words at the top of already-built dependency trees.

[0005] Conventionally, therefore, the analysis proceeded while treating every rule application result as valid, and at the end of the syntax analysis, or at some specially designated point during it, one valid result had to be chosen semi-forcibly from the multiple analysis results output, either by consulting examples or by human judgment.

[0006]

[Problems to Be Solved by the Invention] However, for an English sentence containing a constituent that can modify several words as described above, the conventional syntax analysis system performs many redundant analysis steps before the correct modification target of that constituent is determined. Analysis is therefore inefficient, and it is difficult to determine the modification target accurately. In addition, because the conventional rules for determining dependency structure and phrase structure consider only the top node of the parse tree, when a modifying constituent in the sentence has several candidate targets, it is difficult under conventional control methods to compare and evaluate those candidates simultaneously during analysis and select the best one.

[0007] The present invention aims to solve these drawbacks and to determine efficiently, from a global viewpoint, the correct modification target of the to-infinitive in an English sentence containing a to-infinitive phrase that locally has several possible modification targets.

[0008]

[Means for Solving the Problems] To achieve the above object, the present invention provides a syntax analysis control method for a syntax analysis system that performs the following syntax analysis: it receives the word sequence of an input English sentence that has undergone morphological analysis and focuses on the phrases of the sequence in order from the head of the sentence; when it determines that the phrase in focus and the phrase adjacent to it on the sentence-head side do not combine into a single phrase, it stores both on a stack and moves the focus one position toward the end of the sentence to obtain the new phrase in focus; when it determines that the two do combine into a single phrase, it merges them into one and makes the result the new phrase in focus; and it recursively judges the modification relation between the adjacent phrase taken from the stack and the phrase in focus, thereby determining the phrase structure or dependency structure of the input English sentence. The method is characterized by comprising to-infinitive phrase modification target determining means that, when the phrase in focus is a to-infinitive phrase, extracts and compares all modification relations between the to-infinitive phrase and the words that it can modify within the adjacent phrase on the sentence-head side, globally selects the most appropriate of the extracted relations in consideration of the overall syntactic and semantic relations of the sentence, and determines the modification target and semantic usage of the phrase in focus.

[0009]

[Embodiment] An embodiment of the present invention is described below with reference to the drawings. FIG. 1 is a block diagram showing the configuration of a syntax analysis system that uses a syntax analysis control method according to one embodiment of the present invention.

[0010] As shown in the figure, the syntax analysis system of this embodiment comprises a syntax analysis unit 1, which receives an English word sequence that has undergone morphological analysis, focuses on its phrases in order from the head of the sentence, judges their modification relations, and determines the phrase structure or dependency structure of the sentence; and, as rule bases, a completeness determination rule set 2 and a semantic relation determination rule set 3.

[0011] The syntax analysis unit 1 consists of a completeness judgment unit 10, which judges whether the phrase in focus and the phrase adjacent to it on the sentence-head side combine into a single phrase (hereinafter, two phrases combining into a single phrase is referred to as being "complete"); a dependency relation determination unit 20, which judges whether a modification relation exists between the two phrases; a parse tree creation unit 30, which creates a parse tree based on the judgment of the dependency relation determination unit 20; and an analysis control unit 40, which controls these units.

[0012] The completeness judgment unit 10 has a completeness determination table 11 such as that shown in FIG. 3, and checks the phrase in focus and the phrase adjacent to it on the sentence-head side against this table. In FIG. 3, A denotes the phrase adjacent to the phrase in focus on the sentence-head side (hereinafter, phrase A), and B denotes the phrase in focus (hereinafter, phrase B).

[0013] The completeness determination table 11 specifies, for phrases A and B, whether modification is possible on the basis of phrase completeness, and the completeness judgment unit 10 uses this table to judge whether modification between phrases A and B is possible. That is, when phrases A and B match a combination marked ◎ in FIG. 3, phrase B is either a required constituent that phrase A is waiting to be modified by in order to complete a phrase, or a constituent that can optionally modify an already complete phrase A. The completeness judgment unit 10 then judges that phrase B may modify phrase A.

[0014] When phrases A and B match an unmarked combination, phrase B is neither a constituent that phrase A is waiting to be modified by in order to complete a phrase nor a constituent that can optionally modify phrase A, and the completeness judgment unit 10 judges that phrase B does not modify phrase A.

[0015] When phrases A and B match a combination marked ★, whether modification is possible depends on the types of phrases A and B, on whether they have modification relations with other phrases, and so on. The completeness judgment unit 10 therefore judges case by case whether phrase B may modify phrase A, based on the completeness determination rules stored in the completeness determination rule set 2.

[0016] The completeness determination rule set 2 stores, for each of the combinations marked ★, completeness determination rules that state whether modification between phrases A and B is possible on the basis of the types of phrases A and B, whether they have modification relations with other phrases, and so on. The table below shows examples of completeness determination rules.

[0017]

[Table 1]

[0018] The completeness judgment unit 10 likewise judges whether modification is possible in the case where phrase A modifies phrase B.

[0019] After judging phrases A and B against the completeness determination table 11, the completeness judgment unit 10 passes phrases A and B to the dependency relation determination unit 20 if it judges that a modification relation between them may hold, and to the analysis control unit 40 if it judges that none can hold.
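The three-way table lookup described in paragraphs [0013] to [0019] can be sketched roughly as follows. This is only an illustration: the actual contents of FIG. 3 and Table 1 are not reproduced in this text, so the table entries, category names, and function names below are hypothetical.

```python
from enum import Enum


class Mark(Enum):
    POSSIBLE = "◎"       # phrase B may modify phrase A
    NONE = ""            # unmarked: phrase B does not modify phrase A
    CASE_BY_CASE = "★"   # consult the completeness determination rules


# Hypothetical excerpt of a completeness determination table,
# keyed by (category of phrase A, category of phrase B).
COMPLETENESS_TABLE = {
    ("verb_phrase", "to_infinitive"): Mark.POSSIBLE,
    ("determiner", "noun"): Mark.POSSIBLE,
    ("noun_phrase", "verb"): Mark.CASE_BY_CASE,
}


def may_modify(cat_a, cat_b, rules=None):
    """Return True if a modification relation between A and B may hold."""
    mark = COMPLETENESS_TABLE.get((cat_a, cat_b), Mark.NONE)
    if mark is Mark.POSSIBLE:
        return True
    if mark is Mark.NONE:
        return False
    # ★ case: defer to an individual rule, if one is registered (assumption).
    rule = (rules or {}).get((cat_a, cat_b))
    return bool(rule and rule(cat_a, cat_b))


def route(cat_a, cat_b, rules=None):
    """Name the unit that receives phrases A and B next."""
    if may_modify(cat_a, cat_b, rules):
        return "dependency relation determination unit 20"
    return "analysis control unit 40"
```

Keeping the marks in a table and the ★ cases in separate rules mirrors the split between table 11 and rule set 2 in the text.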

[0020] The dependency relation determination unit 20 has a dependency relation determination table 21 such as that shown in FIG. 4, and checks the phrases A and B passed from the completeness judgment unit 10 against this table. A and B in FIG. 4 have the same meaning as A and B in FIG. 3.

[0021] The dependency relation determination table 21 specifies whether a modification relation exists between phrases A and B, and the dependency relation determination unit 20 uses this table to determine the modification relation between them. That is, when phrases A and B match a combination marked ◎ in FIG. 4, phrase B always modifies phrase A, and the dependency relation determination unit 20 accordingly determines the modification relation between phrases A and B to be the prescribed relation.

[0022] When phrases A and B match an unmarked combination, phrase B never modifies phrase A, and the dependency relation determination unit 20 determines that no modification relation holds between them.

[0023] When phrases A and B match a combination marked ★, deciding whether phrase B modifies phrase A, or whether phrase A modifies phrase B, requires considering not only the combination of phrases A and B but also the overall syntactic and semantic relations involving other phrases. The modification relation is therefore determined case by case according to the combination of phrases A and B.

[0024] When the dependency relation determination unit 20 determines that a modification relation holds between phrases A and B, the parse tree creation unit 30 creates a parse tree based on that result and passes the parse tree to the analysis control unit 40.

[0025] The analysis control unit 40 passes the parse tree received from the parse tree creation unit 30 to the completeness judgment unit 10 as the new phrase B. When the completeness judgment unit 10 or the dependency relation determination unit 20 determines that no modification relation holds between phrases A and B, the analysis control unit 40 pushes phrase A onto the analysis stack and passes phrase B to the completeness judgment unit 10 as the new phrase A.
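The control flow among units 10, 20, 30 and 40 resembles a shift-and-merge loop over an analysis stack. The sketch below is one possible reading of it, not the patented implementation: `can_combine` stands in for the combined judgment of units 10 and 20, `combine` stands in for the parse tree creation unit 30, and both names are hypothetical.

```python
def parse(phrases, can_combine, combine):
    """Process phrases from the sentence head; return the trees left at the end.

    phrases     -- morphologically analyzed input, in sentence order
    can_combine -- callable(a, b) -> bool, True if a modification relation holds
    combine     -- callable(a, b) -> merged parse tree
    """
    stack = []                 # analysis stack of phrases set aside as phrase A
    remaining = list(phrases)
    if not remaining:
        return []
    a = remaining.pop(0)       # phrase A: neighbour on the sentence-head side
    while remaining:
        b = remaining.pop(0)   # phrase B: the phrase in focus
        # While A and B combine, the merged tree becomes the new phrase B
        # and is retried against the phrase popped from the stack.
        while a is not None and can_combine(a, b):
            b = combine(a, b)
            a = stack.pop() if stack else None
        if a is not None:
            stack.append(a)    # no relation: set A aside on the stack
        a = b                  # B becomes the new phrase A
    stack.append(a)
    return stack
```

With string phrases, `combine=lambda a, b: a + b` and a `can_combine` that accepts only the pairs `("b", "c")` and `("a", "bc")`, the input `["a", "b", "c"]` first sets `"a"` aside, merges `"b"` and `"c"`, and then merges the result with `"a"` taken back from the stack.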

[0026] The method of determining the modification target of a to-infinitive phrase when the input English sentence contains one is now described in detail. As usages that modify other words, a to-infinitive phrase can modify a verb, a noun, an adjective, an adverb, or a preposition. The semantic relations within each usage are also varied. When modifying a verb, for example, the phrase may modify it nominally as an object, adverbially, or as a complement. When modifying a noun adjectivally, the phrase may, for example, specify the content attribute of the noun, or modify a noun that has a purpose attribute.

[0027] Because the usages of a to-infinitive phrase and its semantic relations with the modified word are so varied, an English sentence may contain several words that the to-infinitive phrase can modify (hereinafter simply "modified words"). In such cases, the to-infinitive phrase modification target determining means of the syntax analysis control method of this embodiment determines the correct modification target of the to-infinitive phrase.

[0028] The to-infinitive phrase modification target determining means of this embodiment consists of the dependency relation determination unit 20 of the syntax analysis unit 1 and the semantic relation determination rule set 3.

[0029] The semantic relation determination rule set 3 stores rules for determining the modification relation between phrases, namely with what usage and in what semantic relation a to-infinitive phrase modifies a given word. Rules used in conventional syntax analysis systems can be used for this purpose.

[0030] The dependency relation determination unit 20 first assigns to each modified word a score, preset for each modification relation, corresponding to the modification relation between the to-infinitive phrase and that modified word as obtained from the semantic relation determination rule set 3. The table below shows an example of the score settings for each modification relation. The entry 'adjective of type "..."' under C in the table indicates that adjectives are classified into several types according to their relations with other words in the sentence, and that a score is set for each type.

[0031]

[Table 2]

[0032] The scores shown in the table above serve to establish the relative priority of the modified words that hold each modification relation with the to-infinitive phrase. The numerical values themselves therefore need not be those shown in Table 2, as long as the same priority order is obtained.

[0033] For example, consider determining the modification target of a to-infinitive phrase that follows a phrase in which a noun modifies a verb, and suppose the result is that the to-infinitive phrase would modify the verb as a to-infinitive of purpose and would modify the noun in an objective-case relation. Checking these against Table 2 gives:
  • modifying the verb as a to-infinitive of purpose: 4 points
  • modifying the noun in an objective-case relation: 20 points
Since the objective-case relation has the higher score, the result is "prefer the objective-case relation to the to-infinitive of purpose." Alternatively, a rule base storing many such rules may be built, and when a to-infinitive has several modified words, the rules in that rule base may be applied to determine the correct modification target.

[0034] The priority among modified words is determined empirically and may be changed according to the field to which the content of the English text being analyzed belongs. In this respect, if the priorities were encoded as a rule base, the rules would have to be changed as a whole, whereas keeping them in a form like Table 2 makes partial changes easy. When the applicable modification relations of several modified words have the same score, that is, when the priority among the modified words cannot be decided, the modification target of the to-infinitive is determined from other information, such as information based on the structure of the English sentence. Such information includes: 1. when deciding priority between verbs, prefer the more distant one; 2. when deciding priority between nouns or between adjectives, prefer the nearer one; 3. when deciding priority between different parts of speech, prefer the verb (default).
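The three tie-breaking heuristics in paragraph [0034] can be expressed compactly. The sketch below is illustrative only: the `Candidate` record, its field names, and the fallback for cases the text does not cover (different parts of speech with no verb involved) are assumptions.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Candidate:
    word: str
    pos: str        # "verb", "noun", "adjective", ...
    distance: int   # distance in words from the to-infinitive phrase
    score: int      # score assigned from a table such as Table 2


def break_tie(x, y):
    """Choose between two candidates whose scores are equal."""
    if x.pos == y.pos == "verb":
        return x if x.distance > y.distance else y   # 1. farther verb wins
    if x.pos == y.pos and x.pos in ("noun", "adjective"):
        return x if x.distance < y.distance else y   # 2. nearer noun/adjective wins
    if x.pos == "verb":
        return x                                     # 3. verb wins by default
    if y.pos == "verb":
        return y
    return x  # not covered by the text: keep the earlier candidate (assumption)


def select_target(candidates):
    """Return the highest-scoring candidate, applying break_tie on equal scores."""
    best = candidates[0]
    for c in candidates[1:]:
        if c.score > best.score:
            best = c
        elif c.score == best.score:
            best = break_tie(best, c)
    return best
```

Because the heuristics are consulted only on equal scores, changing a single score in the table changes the outcome for that relation without touching the tie-breaking logic.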

[0035] Next, the dependency relation determination unit 20 selects the modified word with the highest assigned score as the correct modification target of the to-infinitive phrase. The flowchart of FIG. 2 shows this modification target determining operation of the to-infinitive phrase modification target determining means.

[0036] The to-infinitive phrase modification target determining means described above can be used in exactly the same way whether the modified word is on the sentence-head side of the to-infinitive phrase (that is, the modified word is in phrase A and the to-infinitive phrase is phrase B) or on the sentence-tail side (that is, the modified word is in phrase B and the to-infinitive phrase is phrase A).

[0037] Needless to say, when the English sentence contains only one word that the to-infinitive phrase can modify, the modification target is uniquely determined without any selection by the to-infinitive phrase modification target determining means.

[0038] Next, the analysis control method of this embodiment is described in detail using a concrete analysis example. FIG. 5 shows phrase A, phrase B, and the phrases in the analysis stack 31 when the syntax analysis system of this embodiment analyzes the English sentence "The company withdrew a proposal to increase the number of authorized shares." After morphological analysis, the example sentence is input to the syntax analysis unit 1, and modification relations are judged word by word from the head of the sentence.

[0039] The judgment of modification relations for each individual word is omitted here. As shown in FIG. 5, the description covers the judgment of the modification relation between the complete phrase "The company withdrew a proposal" (phrase A) and the to-infinitive phrase (phrase B).

[0040] Here, phrase A is a verb phrase and phrase B is a to-infinitive phrase, so the completeness judgment unit 10, checking the completeness determination table 11, judges that a modification relation may hold and passes phrases A and B to the dependency relation determination unit 20.

[0041] The dependency relation determination unit 20 checks phrases A and B against the dependency relation determination table 21 and makes a case-by-case judgment. First, it extracts all modification relations between phrase B and every word in phrase A that the to-infinitive of phrase B can modify. Here, the to-infinitive phrase of phrase B modifies either "withdrew" or "proposal" in phrase A, so the semantic relation determination rules yield two relations: 'the to-infinitive phrase may modify "withdrew" in the adverbial usage (purpose)' and 'the to-infinitive phrase may modify "proposal" in the adjectival usage (derived from a verb)'.

[0042] Next, the dependency relation determination unit 20 selects the best of the extracted modification relations. According to Table 2, the relation with "withdrew", modification in the adverbial usage (purpose), scores 4 points, and the relation with "proposal", modification in the adjectival usage (derived from a verb), scores 10 points. The relation in which the to-infinitive phrase modifies "proposal" in the adjectival usage therefore takes priority over the relation in which it modifies "withdrew" in the adverbial usage, and phrase B is judged to modify "proposal" in phrase A in the adjectival usage. These results are passed to the parse tree creation unit 30. The correct modification target of the to-infinitive phrase is determined in this way.
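The worked example can be reproduced with a few lines of code. The two scores (4 and 10) are those quoted from Table 2 in the text; the data layout and the function name `choose` are assumptions made for illustration.

```python
# Scores quoted in the text for the two relations in the example.
SCORES = {
    "adverbial (purpose)": 4,
    "adjectival (derived from a verb)": 10,
}

# Candidate modified words in phrase A, each paired with its relation
# to the to-infinitive phrase "to increase the number of authorized shares".
CANDIDATES = [
    ("withdrew", "adverbial (purpose)"),
    ("proposal", "adjectival (derived from a verb)"),
]


def choose(candidates, scores):
    """Return the (word, relation) pair whose relation has the highest score."""
    return max(candidates, key=lambda c: scores[c[1]])


target, usage = choose(CANDIDATES, SCORES)
print(target, "/", usage)   # prints: proposal / adjectival (derived from a verb)

# Only the ordering of the scores matters: doubling every score
# leaves the selected target unchanged.
doubled = {relation: 2 * s for relation, s in SCORES.items()}
assert choose(CANDIDATES, doubled) == (target, usage)
```

The final assertion illustrates the point made about Table 2: any set of values that preserves the same priority order yields the same modification target.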

[0043]

[Effects of the Invention] As described above, the syntax analysis control method of the syntax analysis system of the present invention allows analysis to proceed while local word-to-word modification relations are always judged under a global, comprehensive evaluation. Therefore, even when the input English sentence contains a to-infinitive phrase that has several candidate modification targets in the sentence, the candidates can easily be compared and evaluated simultaneously during analysis and the best one selected, which makes accurate and efficient analysis possible.

[Brief Description of the Drawings]

FIG. 1 is a block diagram showing the configuration of a syntax analysis system that uses the syntax analysis control method of the present invention.

FIG. 2 is a flowchart showing the to-infinitive phrase modification target determining means of the syntax analysis control method of FIG. 1.

FIG. 3 is a diagram showing the completeness determination table of FIG. 1.

FIG. 4 is a diagram showing the dependency relation determination table of FIG. 1.

[Explanation of Reference Numerals]

1 Syntax analysis unit
2 Completeness judgment rule set
3 Semantic relation judgment rule set
10 Completeness judgment unit
20 Dependency relation determining unit
30 Parse tree creating unit
40 Analysis control unit

─────────────────────────────────────────────────────

[Procedural Amendment]

[Date of submission] October 19, 1992

[Procedural Amendment 1]

[Name of document to be amended] Specification

[Name of item to be amended] Brief Description of the Drawings

[Method of amendment] Change

[Content of amendment]

[Brief Description of the Drawings]

FIG. 1 is a block diagram showing the configuration of a syntax analysis system that uses the syntax analysis control method of the present invention.

FIG. 2 is a flowchart showing the to-infinitive phrase modification target determining means of the syntax analysis control method of FIG. 1.

FIG. 3 is a diagram showing the completeness determination table of FIG. 1.

FIG. 4 is a diagram showing the dependency relation determination table of FIG. 1.

FIG. 5 is a diagram showing an analysis example according to the present embodiment.

[Explanation of Reference Numerals]
1 Syntax analysis unit
2 Completeness judgment rule set
3 Semantic relation judgment rule set
10 Completeness judgment unit
20 Dependency relation determining unit
30 Parse tree creating unit
40 Analysis control unit

Claims (1)

[Claims] [Claim 1] A syntax analysis control method for a syntax analysis system that performs syntax analysis processing in which: a word string of an input English sentence that has undergone morphological analysis is input, and the phrases of the word string are attended to in order from the sentence-head side; when it is judged that the phrase under attention and the phrase adjacent to it on the sentence-head side do not combine into a single phrase, the phrase under attention and the adjacent sentence-head-side phrase are stored on a stack, and the point of attention is moved one position toward the end of the sentence to give a new phrase under attention; when it is judged that the phrase under attention and the phrase adjacent to it on the sentence-head side do combine into a single phrase, the two are merged into one, which becomes the new phrase under attention; and the modification relation between the phrase under attention and the sentence-head-side adjacent phrase, taken recursively from the stack, is judged so as to determine the phrase structure or dependency structure of the input English sentence; the syntax analysis control method being characterized by comprising to-infinitive phrase modification target determining means which, when the phrase under attention is a to-infinitive phrase, extracts and compares all modification relations between the to-infinitive phrase and the words that are present in the phrase adjacent to the phrase under attention on the sentence-head side and that the to-infinitive phrase can modify, selects the most appropriate of the extracted modification relations globally in consideration of the overall syntactic and semantic relations of the English sentence, and determines the modification target and the semantic usage of the phrase under attention.
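The scan-and-merge control recited in the claim resembles a shift-reduce loop, and a much simplified sketch may help in reading it. Everything specific here is an assumption: the category names, the `COMBINE` rule table, the example sentence, and the choice to keep only already-scanned phrases on the stack. The sketch covers only the merging of neighbouring phrases and leaves the choice of modification target to a separate step.

```python
# Hypothetical combination rules: (left category, right category) -> merged category.
COMBINE = {
    ("det", "noun"): "np",
    ("to", "verb"): "to_inf",
    ("to_inf", "noun"): "to_inf",
}


def scan(phrases):
    """Scan phrases from the sentence head, merging neighbours that form one phrase."""
    stack = []
    for current in phrases:
        # Merge while the phrase under attention combines with its left neighbour.
        while stack and (stack[-1][0], current[0]) in COMBINE:
            left = stack.pop()
            merged_category = COMBINE[(left[0], current[0])]
            current = (merged_category, f"{left[1]} {current[1]}")
        # Otherwise keep the phrase on the stack and move attention toward the sentence end.
        stack.append(current)
    return stack


# Hypothetical input: (category, text) pairs produced by morphological analysis.
tokens = [
    ("verb", "withdrew"),
    ("det", "the"),
    ("noun", "proposal"),
    ("to", "to"),
    ("verb", "raise"),
    ("noun", "taxes"),
]
result = scan(tokens)
for category, text in result:
    print(category, text)
```

In this sketch the to-infinitive phrase ends up on the stack next to the phrases that precede it, which is the point at which the modification target determining means would compare the candidate words.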
JP3297804A 1991-10-19 1991-10-19 Syntax analysis control system Pending JPH05120330A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
JP3297804A JPH05120330A (en) 1991-10-19 1991-10-19 Syntax analysis control system

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
JP3297804A JPH05120330A (en) 1991-10-19 1991-10-19 Syntax analysis control system

Publications (1)

Publication Number Publication Date
JPH05120330A true JPH05120330A (en) 1993-05-18

Family

ID=17851386

Family Applications (1)

Application Number Title Priority Date Filing Date
JP3297804A Pending JPH05120330A (en) 1991-10-19 1991-10-19 Syntax analysis control system

Country Status (1)

Country Link
JP (1) JPH05120330A (en)

Similar Documents

Publication Publication Date Title
US5890103A (en) Method and apparatus for improved tokenization of natural language text
JP4024861B2 (en) Natural language parser with dictionary-based part-of-speech probabilities
Woliński Morfeusz—a practical tool for the morphological analysis of Polish
JP4625178B2 (en) Automatic recognition of discourse structure of text body
US5680628A (en) Method and apparatus for automated search and retrieval process
JP3476237B2 (en) Parser
US20060241944A1 (en) Method and system for generating spelling suggestions
US20070233460A1 (en) Computer-Implemented Method for Use in a Translation System
WO1997004405A9 (en) Method and apparatus for automated search and retrieval processing
JPH0411906B2 (en)
KR20160138077A (en) Machine translation system and method
JP2004246440A (en) Morphemic analyzing device, natural language processor, and morphemic analyzing method and program
KR100327115B1 (en) Device and method for generating translated sentences based on partial translation patterns
JPH1185766A (en) Keyword extraction method and apparatus, and storage medium storing keyword extraction program
JPH05120330A (en) Syntax analysis control system
JP2632806B2 (en) Language analyzer
KR100420474B1 (en) Apparatus and method of long sentence translation using partial sentence frame
JPH09146951A (en) System and method for language analysis
JPH0581315A (en) Syntax analysis control system
JP3197110B2 (en) Natural language analyzer and machine translator
JPH09185623A (en) Language processing apparatus and method
JPS63136269A (en) Automatic translating device
JPH064570A (en) Syntax analysis system and syntax analysis control system
JPH09231217A (en) System and method for language analysis
Paggio Syntactic Analysis and Error Correction for Danish in the SCARRIE project