JPH03144831A - System recovery method - Google Patents

System recovery method

Info

Publication number
JPH03144831A
Authority
JP
Japan
Prior art keywords
recovery
unit
failure
knowledge base
observation
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
JP1283610A
Other languages
Japanese (ja)
Inventor
Miwaka Takahashi
美和夏 高橋
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Panasonic Holdings Corp
Original Assignee
Matsushita Electric Industrial Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Matsushita Electric Industrial Co Ltd
Priority to JP1283610A
Publication of JPH03144831A
Legal status: Pending

Landscapes

  • Devices For Executing Special Programs (AREA)

Abstract

(57) [Abstract] This publication contains application data from before electronic filing, so no abstract data is recorded.

Description

[Detailed Description of the Invention]

Industrial Field of Application: The present invention relates to a system recovery method that performs automatic recovery when a system fails.

Prior Art: Conventionally, when a failure occurred, the work content and work environment were in many cases restored by a specialist relying on the specialist's own knowledge. With this approach, however, at least one specialist must be stationed on site for recovery work every time a failure occurs, and if no specialist is on site, recovery has to wait until one arrives. For this reason, with the development of computers, computer-based failure diagnosis methods have come into use. Computer-based failure diagnosis uses an expert system: an expert's knowledge base is stored in the computer in advance, and when a failure occurs in the system, the knowledge base is used to trace the cause of the failure from the observed results. The user then recovers from the failure by following the system's instructions.

Problems to be Solved by the Invention: In recovery work using such a conventional failure-diagnosis expert system, the system questions the user as needed, so the user cannot freely enter the conditions he or she has observed. The order of the questions also depends on the order in which the rules were written and is therefore not necessarily natural, so entering the situation is a burden on the user. Furthermore, once the cause of the failure has been traced, the recovery work is carried out under the system's instructions, but with user-level knowledge this work is difficult, complex, and burdensome.

The present invention has been made in view of these points. Its object is to provide a system recovery method that has a simple configuration and can perform failure recovery work automatically without burdening the user.

Means for Solving the Problems: The present invention is a system recovery method comprising: a knowledge base unit that stores the operation of the system in its normal state as empirical rules and the recovery procedures for failures as recovery rules; an operation observation unit that constantly observes the operating status of the system; a failure determination unit that compares the empirical rules stored in the knowledge base unit with the results observed by the operation observation unit and determines whether there is a contradiction; and a recovery unit that performs failure recovery work using the recovery rules stored in the knowledge base unit.
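The four units named above can be sketched in a few lines of Python. This is only an illustrative sketch, not the implementation disclosed in the patent: the class and function names (KnowledgeBase, judge, recover, monitor_once) and the representation of rules as dictionaries are assumptions made for the example.

```python
from dataclasses import dataclass, field
from typing import Callable, Dict, List, Set

# An observation maps each observation item to the result seen for it.
Observation = Dict[str, str]


@dataclass
class KnowledgeBase:
    """Knowledge base unit (hypothetical layout): two kinds of rules."""
    # Empirical rules: for each item, the set of results regarded as normal.
    empirical_rules: Dict[str, Set[str]] = field(default_factory=dict)
    # Recovery rules: for each item, the action to run when it is abnormal.
    recovery_rules: Dict[str, Callable[[], None]] = field(default_factory=dict)


def judge(kb: KnowledgeBase, observation: Observation) -> List[str]:
    """Failure determination unit: list items that contradict the rules."""
    return [
        item
        for item, result in observation.items()
        if item in kb.empirical_rules and result not in kb.empirical_rules[item]
    ]


def recover(kb: KnowledgeBase, failed_items: List[str]) -> List[str]:
    """Recovery unit: run the recovery rule registered for each failed item."""
    recovered = []
    for item in failed_items:
        action = kb.recovery_rules.get(item)
        if action is not None:
            action()
            recovered.append(item)
    return recovered


def monitor_once(kb: KnowledgeBase,
                 observe: Callable[[], Observation]) -> List[str]:
    """One cycle: observe, compare with the empirical rules, then recover."""
    return recover(kb, judge(kb, observe()))
```

Here observe stands in for the operation observation unit; calling monitor_once repeatedly would approximate constant observation. Keeping the rules as plain data separate from the judging and recovery code mirrors the split between the knowledge base unit and the other units.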

Operation: With the configuration described above, the present invention analyzes the cause of a failure and performs the recovery work automatically, without human intervention, which makes system maintenance easier.

Embodiments: The present invention is described in detail below with reference to the drawings. FIG. 1 is a functional block diagram of the system recovery method of the present invention. The knowledge base unit 11 stores system-dependent observations as empirical rules 12 (knowledge); that is, it stores the normal operation of the system as experience.

The knowledge base unit 11 also stores the recovery procedures for failures as recovery rules 13. The operation observation unit 14 observes operation at all times, and the failure determination unit 15 successively compares each observed operation with the empirical rules 12 of the knowledge base unit 11 to recognize whether it is an abnormal operation (a failure). If a failure is recognized, the recovery unit 16, which is connected to the operation observation unit 22 by a dedicated connector, performs the recovery work based on the recovery rules 13 of the knowledge base unit 11. FIG. 2 is a configuration diagram showing the system recovery method in one embodiment of the present invention.

The operation observation unit 22 is the part that observes the status and operation of the system while the user 21 is using it.

The observations 26 obtained by the operation observation unit 22 are successively compared by the failure determination unit 25 with the operating principles 27 of the knowledge base unit 33. If an observation 26 does not match the operating principles 27, control is transferred to the recovery unit 24.

For example, whenever the user 21 is using the system, the operation observation unit 22 observes the system's status and operation. Suppose that observation 26 yields result A for observation item X and result D for observation item Y. Suppose also that the operating principles 27 of the knowledge base unit 23 hold the knowledge that results A through C are normal for item X and that the result for item Y is Z. When the failure determination unit 25 compares the observation results with the knowledge in the operating principles 27, result A for observation item X matches that knowledge, so operation with respect to item X can be said to be normal. Result D for observation item Y, on the other hand, does not match the knowledge that item Y is Z, so the operation with respect to item Y is recognized as a failure. For this failure, the recovery rules 28, the other body of knowledge in the knowledge base unit 23, state that operation a is to be performed when result Z is not obtained for Y, and the recovery unit 24 is requested to carry out the recovery work accordingly. When the recovery work is finished, the user 21 is notified that recovery is complete.

Next, a concrete example of operation for an LSI mask layout editor system is described with reference to FIG. 3. When the user 33 starts the layout editor system, the four processes registered in the operating principles 34 of the knowledge base unit 35 as startup processes are created so that the system always operates normally. These processes exist until the layout editor system is terminated. The operation observation unit 36, which constantly observes the operating status of the system, checks the operating status of these processes at fixed intervals. The failure determination unit 39 continually compares this operation observation result 38 with the operating principles 34 for the normal state of the system stored in the knowledge base unit, and judges whether the current operating status of the system is normal or abnormal. If the operation observation unit 36 observes that one of the four processes has terminated abnormally for some reason, the operation of the layout editor system is recognized as a failure. The method for recovering the process observed to have terminated abnormally is then looked up in the recovery rules 37, the other body of knowledge in the knowledge base unit 35, so as to return the system to normal, and the recovery unit 38 automatically performs the recovery work accordingly.
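The layout editor example can be sketched as a small watchdog. This is a simulation under stated assumptions rather than the patented implementation: the process names, the ProcessTable stand-in for the real system, the restart-on-failure recovery rule, and the notify callback are all hypothetical, since the patent does not name the four processes or spell out the recovery action.

```python
from typing import Callable, Dict, List

# Operating principle (assumed form): processes that must exist from startup
# until the editor exits. The four names are hypothetical placeholders.
REQUIRED_PROCESSES = ["display", "command", "database", "plotter"]


class ProcessTable:
    """Stand-in for the real system: tracks which processes are alive."""

    def __init__(self, names: List[str]) -> None:
        self.alive: Dict[str, bool] = {name: True for name in names}

    def start(self, name: str) -> None:
        self.alive[name] = True

    def crash(self, name: str) -> None:
        # Simulates an abnormal termination of one process.
        self.alive[name] = False


def observe(table: ProcessTable) -> Dict[str, bool]:
    """Operation observation unit: snapshot of process liveness."""
    return dict(table.alive)


def find_failed(required: List[str], observation: Dict[str, bool]) -> List[str]:
    """Failure determination unit: required processes that are not running."""
    return [name for name in required if not observation.get(name, False)]


def watchdog_cycle(table: ProcessTable,
                   required: List[str],
                   notify: Callable[[str], None]) -> List[str]:
    """One observation cycle followed by recovery of any failed process."""
    failed = find_failed(required, observe(table))
    for name in failed:
        table.start(name)            # assumed recovery rule: restart it
        notify(f"restarted {name}")  # assumed: report the recovery
    return failed
```

In a real deployment, watchdog_cycle would be called at a fixed interval, and ProcessTable would be replaced by calls that query and launch actual operating system processes.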

Effects of the Invention: As described above, by interfacing the system used by the user with a knowledge base, the present invention provides an automatic recovery method that can recover from failures automatically, and it is extremely useful in practice.

[Brief Description of the Drawings]

FIG. 1 is a functional block diagram of the system recovery method of the present invention. FIG. 2 is a configuration diagram showing the system recovery method in one embodiment of the present invention. FIG. 3 is a configuration diagram of a specific example of the present invention.

Claims (2)

[Claims]
(1) A system recovery method comprising: a knowledge base unit that stores the operation of the system in its normal state as empirical rules and the recovery procedures for failures as recovery rules; an operation observation unit that constantly observes the operating status of the system; a failure determination unit that compares the empirical rules stored in the knowledge base unit with the results observed by the operation observation unit and determines whether there is a contradiction; and a recovery unit that performs failure recovery work using the recovery rules stored in the knowledge base unit.
(2) The system recovery method according to claim 1, wherein the operation observation unit and the failure determination unit are connected by a dedicated connector.
JP1283610A 1989-10-31 1989-10-31 System recovery method Pending JPH03144831A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
JP1283610A JPH03144831A (en) 1989-10-31 1989-10-31 System recovery method

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
JP1283610A JPH03144831A (en) 1989-10-31 1989-10-31 System recovery method

Publications (1)

Publication Number Publication Date
JPH03144831A true JPH03144831A (en) 1991-06-20

Family

ID=17667734

Family Applications (1)

Application Number Title Priority Date Filing Date
JP1283610A Pending JPH03144831A (en) 1989-10-31 1989-10-31 System recovery method

Country Status (1)

Country Link
JP (1) JPH03144831A (en)

Cited By (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US6041425A (en) * 1996-09-03 2000-03-21 Hitachi, Ltd. Error recovery method and apparatus in a computer system
JP2009076103A (en) * 2008-12-22 2009-04-09 Nec Corp Fault restoration device, fault restoration method, and program
US7620849B2 (en) 2003-07-16 2009-11-17 Nec Corporation Fault recovery system and method for adaptively updating order of command executions according to past results

Similar Documents

Publication Publication Date Title
JPH04139544A (en) Data restoring method
CN114116493A (en) Test data processing method and device, computer equipment and storage medium
JPH03144831A (en) System recovery method
CN106970877A (en) Control the device and data processor of debugging request
JPS62249259A (en) Computer system
JPH04369735A (en) Backup system for computer system
JP3134878B2 (en) Programmable controller
JP2705478B2 (en) Remote controller
JP2652989B2 (en) File transfer device
JPS62135901A (en) Programmable controller
JPS6238945A (en) Information process system
JPH04184551A (en) Data restoring system for electronic disk device
JPS62284440A (en) Software resource maintenance system for terminal equipment
JPH01310455A (en) Method for restoring fault
JPS638834A (en) Operating condition control system for automatic trouble recovery in computer system
JPH0756793A (en) Automatic restoration system for file fault
JPS61235925A (en) Operating system for electronic computer system
JPH03100836A (en) Fault diagnostic processing system
JPH0756759A (en) Information processor
JPS63217927A (en) Automatic restoration system against power system failure
JPS6376027A (en) Fault analyzing system in computer
JPH05282167A (en) Fault handling method
JPS61208554A (en) Trouble recovery system for data transfer
JPH03244057A (en) Program transfer maintenance system
JPH04152435A (en) Maintenance diagnosis method