JPH01217651A - Automatic fault informing system - Google Patents
Automatic fault informing systemInfo
- Publication number
- JPH01217651A JPH01217651A JP63043691A JP4369188A JPH01217651A JP H01217651 A JPH01217651 A JP H01217651A JP 63043691 A JP63043691 A JP 63043691A JP 4369188 A JP4369188 A JP 4369188A JP H01217651 A JPH01217651 A JP H01217651A
- Authority
- JP
- Japan
- Prior art keywords
- failure
- information
- fault
- intermittent
- faults
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Pending
Links
- 238000012423 maintenance Methods 0.000 claims abstract description 25
- 230000010365 information processing Effects 0.000 claims description 22
- 230000003449 preventive effect Effects 0.000 abstract description 6
- 238000003745 diagnosis Methods 0.000 description 6
- 238000000034 method Methods 0.000 description 3
- 238000010586 diagram Methods 0.000 description 2
- 230000000694 effects Effects 0.000 description 2
- 238000005516 engineering process Methods 0.000 description 1
- 230000000737 periodic effect Effects 0.000 description 1
- 238000011084 recovery Methods 0.000 description 1
Landscapes
- Debugging And Monitoring (AREA)
- Test And Diagnosis Of Digital Computers (AREA)
Abstract
Description
【発明の詳細な説明】
〔産業上の利用分野〕
本発明は情報処理装置の保守1診断を回線を経由して遠
隔保守センタから可能ならしめる保守診断装置に関し、
特に情報処理装置の障害発生時における保守診断装置か
ら遠隔保守センタへの障害自動通報方式の改良に関する
。[Detailed Description of the Invention] [Industrial Application Field] The present invention relates to a maintenance diagnosis device that enables maintenance diagnosis of an information processing device from a remote maintenance center via a line.
In particular, the present invention relates to an improvement in an automatic failure reporting method from a maintenance diagnosis device to a remote maintenance center when a failure occurs in an information processing device.
情報処理装置を構成している装置に障害が発生し、その
装置やシステムが使用できなくなれば迅速にこれを復旧
する必要がある。また発生した障害が情報処理装置の持
つ自動訂正機能により自動修復されても後の予防保守の
為には発生した障害に関する障害情報を保存しておく必
要がある。このため、従来より、回線を介して遠隔保守
センタと通信を行なう機能を有すると共に情報処理装置
に障害が発生したときに障害時の運転を制御する機能を
有する保守診断装置を情報処理装置に接続し、情報処理
装置の障害発生時、保守診断装置により情報処理装置か
ら障害情報を収集して解析し、回復を行なえなかった障
害すなわち訂正不可能な障害であり一部の装置やシステ
ムのダウンに至った障害(以下、固定障害と称す)のと
きは、その障害情報を記憶手段に記憶すると共に迅速な
復旧を可能にする為に回線を介して遠隔保守センタに通
報を発し、また回復を行なえた障害すなわちlビア)障
害のような自動訂正機能により処理の続行ができた障害
(以下、間欠障害と称す)のときは、後の予防保守の為
にその障害情報および同一箇所の障害件数を前記記憶手
段に記憶させている。2. Description of the Related Art If a failure occurs in a device constituting an information processing device and the device or system becomes unusable, it is necessary to quickly restore the device or system. Furthermore, even if a fault that has occurred is automatically repaired by the automatic correction function of the information processing device, it is necessary to save fault information regarding the fault that has occurred for future preventive maintenance. For this reason, conventionally, maintenance diagnostic equipment has been connected to information processing equipment, which has the function of communicating with a remote maintenance center via a line and also has the function of controlling operation in the event of a failure in the information processing equipment. However, when a failure occurs in an information processing device, a maintenance diagnostic device collects and analyzes failure information from the information processing device, and identifies failures that cannot be recovered from, that is, uncorrectable failures, and may cause some devices or systems to go down. In the event of a major failure (hereinafter referred to as a fixed failure), the failure information is stored in the storage means, and a notification is sent to the remote maintenance center via the line to enable quick recovery. In the case of a failure (hereinafter referred to as an intermittent failure) that allows processing to continue due to the automatic correction function, such as an intermittent failure (intermittent failure), the failure information and the number of failures at the same location are recorded for future preventive maintenance. The information is stored in the storage means.
遠隔保守センタでは、通報を受けることにより固定障害
に対し迅速な対応が可能となり、また記憶手段に記憶さ
れた間欠障害に関する障害情報および同一箇所の障害件
数を定期保守により調べることで、予防保守が可能とな
る。At the remote maintenance center, it is possible to quickly respond to fixed failures by receiving reports, and preventive maintenance can be carried out by checking the failure information about intermittent failures stored in the storage means and the number of failures at the same location during regular maintenance. It becomes possible.
ところで、間欠障害であってもそれが多発するときは、
重大な固定障害に発展し、最悪時はシステムダウンに至
ることがある。従って、そのような状況を−早く知るこ
とが必要となるが、従来は間欠障害については通報の対
象としていなかったので、遠隔保守センタでは次回の定
期保守までその事実を見逃すことが多かった。そのため
、固定障害に発展しシステムダウン等を招来する危険性
が高いという欠点があった。By the way, even if it is an intermittent disorder, if it occurs frequently,
This can develop into a serious fixed failure, and in the worst case scenario, it can lead to system failure. Therefore, it is necessary to know about such situations as soon as possible, but since intermittent failures were not reported in the past, remote maintenance centers often overlooked the fact until the next scheduled maintenance. Therefore, there is a drawback that there is a high risk of developing into a fixed failure and causing a system failure.
本発明は、間欠障害の多発を遠隔保守センタにおいて速
やかに知ることができるようにすることを目的とする。SUMMARY OF THE INVENTION An object of the present invention is to enable a remote maintenance center to quickly detect frequent occurrences of intermittent failures.
本発明は上記目的を達成するために、
情報処理装置の障害発生時、前記情報処理装置から障害
情報を収集して解析し、前記障害情報が固定障害に関す
るものであるときは前記障害情報を記憶手段に記憶する
と共に回線を介して遠隔保守センタに通報を発し、前記
障害情報が間欠障害に関するものであるときは前記障害
情報及び同一箇所の障害件数を前記記憶手段に登録する
保守診断装置において、
前記記憶手段に登録されている同一箇所の障害件数と予
め設定された判定基準件数とを比較する比較手段を設け
、
該比較手段による比較の結果、前記同一箇所の障害件数
が前記判定基準件数を越えているときは、前記遠隔保守
センタに通報を発するようにしている。In order to achieve the above object, the present invention collects and analyzes fault information from the information processing device when a fault occurs in the information processing device, and stores the fault information when the fault information is related to a fixed fault. A maintenance diagnostic device that stores information in a means and sends a report to a remote maintenance center via a line, and when the fault information relates to an intermittent fault, registers the fault information and the number of faults at the same location in the storage means, Comparing means is provided for comparing the number of failures at the same location registered in the storage means with a preset determination reference number, and as a result of the comparison by the comparison means, the number of failures at the same location exceeds the determination reference number. When it exceeds the limit, a notification is sent to the remote maintenance center.
情報処理装置の同一箇所で間欠障害が多発し、同一箇所
での間欠障害の件数が判定基準件数を越えると、これが
比較手段で検出され、固定障害発生時と同様に遠隔保守
センタに通報が発せられる。If intermittent failures occur frequently at the same location in the information processing equipment and the number of intermittent failures at the same location exceeds the criterion number, this will be detected by the comparison means and a notification will be sent to the remote maintenance center in the same way as when a fixed failure occurs. It will be done.
次に本発明の実施例について図面を参照して説明する。 Next, embodiments of the present invention will be described with reference to the drawings.
第1図は本発明の障害自動通報方式を適用した情報処理
システムの一例を示すブロック図であり、演算処理装置
、主記憶装置等から構成された情報処理装置lと、これ
に接続されると共に回線4を介して遠隔保守センタ3と
も接続される保守診断装置2とで構成されている。保守
診断装置2は、障害情報収集手段21.記憶手段22.
障害情報登録手段23.障害件数比較手段24.自動通
報手段25を含み、記憶手段22は固定障害情報部22
0、間欠障害情報部2219間欠障害件数部222を含
んでいる。FIG. 1 is a block diagram showing an example of an information processing system to which the automatic failure notification system of the present invention is applied, and includes an information processing device L consisting of an arithmetic processing unit, a main storage device, etc., and a The maintenance diagnosis device 2 is also connected to a remote maintenance center 3 via a line 4. The maintenance diagnosis device 2 includes a failure information collection means 21. Storage means 22.
Fault information registration means 23. Means for comparing number of failures 24. The storage means 22 includes an automatic notification means 25 and a fixed fault information section 22.
0, an intermittent failure information section 2219 and an intermittent failure number section 222 are included.
第2図は障害情報収集手段21の処理例の流れ図、第3
図は障害情報登録手段23の処理例の流れ図、第4図は
障害件数比較手段24の処理例の流れ図、第5図は自動
通報手段25の処理例の流れ図であり、以下、各図を参
照して本実施例の動作を説明する。FIG. 2 is a flowchart of an example of processing by the fault information collection means 21;
Figure 4 is a flowchart of an example of processing by the fault information registration means 23, Figure 4 is a flowchart of an example of processing by the failure number comparison means 24, and Figure 5 is a flowchart of an example of processing by the automatic notification means 25. Please refer to each figure below. The operation of this embodiment will now be explained.
情報処理装置1に障害が発生し、その旨が保守診断装置
2に通知されると、障害情報収集手段21は第2図に示
すように情報処理装置1から今回の障害に関する障害情
報を読出しく211)、障害情報登録手段23を起動す
る(212)。When a fault occurs in the information processing device 1 and the maintenance diagnostic device 2 is notified of this, the fault information collection means 21 reads fault information regarding the current fault from the information processing device 1 as shown in FIG. 211), and activates the failure information registration means 23 (212).
障害情報登録手段23は起動されると、第3図に示すよ
うに障害情報収集手段21で読出された今回の障害情報
を解析して固定障害に関する障害情報か或いは間欠障害
に関する障害情報かを、例えば障害情報中に自動訂正し
た旨の記述があるか否かで判定しく230)、固定障害
であれば今回の障害情報を記憶手段22の固定障害情報
部220に登録しく234>、自動通報手段25を起動
する(235)、自動通報手段25はこれに応答して第
5図に示すように自動ダイヤル発信処理等により回線4
を介して遠隔保守センタ3と接続しく251)、例えば
固定障害が発生した旨および障害箇所情報を含む通報を
遠隔保守センタ3に送出する(252)。When the fault information registration means 23 is activated, as shown in FIG. 3, it analyzes the current fault information read by the fault information collection means 21 and determines whether the fault information is a fixed fault or an intermittent fault. For example, it is determined based on whether or not there is a description of automatic correction in the fault information (230), and if the fault is a fixed fault, the current fault information is registered in the fixed fault information section 220 of the storage means 22 (234), and the automatic notification means 25 (235), and the automatic notification means 25 responds to this by automatically dialing the line 4 as shown in FIG.
251), and sends a report to the remote maintenance center 3 including, for example, the occurrence of a fixed fault and information on the location of the fault (252).
他方、今回の障害が間欠障害のときは、障害情報登録手
段23はその障害情報を記憶手段22の間欠障害情報部
221に登録しく231)、次いでその障害情報から判
明する障害箇所に関する障害件数に1を加算する(23
2)、この障害件数の加算は、例えば今回の障害情報で
判明する障害箇所に対応する障害件数が間欠障害件数部
222中になければ今回の障害箇所に対応する障害件数
格納エリアを間欠障害件数部222中に確保してその内
容を「1」とし、既にあればそのエリアの障害件数を+
1するものである。そして、障害情報登録手段23は障
害件数比較手段24を起動する(233)。On the other hand, if the current failure is an intermittent failure, the failure information registration means 23 registers the failure information in the intermittent failure information section 221 of the storage means 22 (231), and then calculates the number of failures related to the failure location identified from the failure information. Add 1 (23
2) When adding the number of failures, for example, if the number of failures corresponding to the failure point identified by the current failure information is not in the intermittent failure number section 222, the failure number storage area corresponding to the current failure point is added to the number of intermittent failures. 222 and set its content to "1", and if it already exists, increase the number of failures in that area.
1. Then, the failure information registration means 23 activates the failure number comparison means 24 (233).
障害件数比較手段24は起動されると、第4図に示すよ
うに、障害情報登録手段23によって今回更新された間
欠障害件数部222中の障害件数(Nとする)を読出し
く241)、その障害件数Nと予め設定された判定基準
件数N mayとを比較しく242)、N≦N霧axな
らば処理を終える。When the failure number comparison means 24 is activated, as shown in FIG. The number N of failures is compared with the preset determination standard number N may (242), and if N≦N ax, the process ends.
しかし、N>Na+ax即ち同一箇所で判定基準件数N
IIIaxを越えるほど間欠障害が多発しているとき
は、自動通報手段25を起動する(243)、自動通報
手段25は起動されると、第5図に示したように自動ダ
イヤル発信処理等により回線4を介して遠隔保守センタ
3と接続しく251)、例えば間欠障害が多発した旨お
よび障害箇所情報を含む通報を発する(252)。However, N>Na+ax, that is, the number of judgment criteria at the same location N
When intermittent failures occur frequently enough to exceed IIIax, the automatic notification means 25 is activated (243). When activated, the automatic notification means 25 disconnects the line by automatic dialing processing etc. as shown in FIG. The remote maintenance center 3 is connected to the remote maintenance center 3 via 4 (251), and a report containing, for example, information indicating that intermittent failures have occurred frequently and failure location information is issued (252).
遠隔保守センタ3では、上記の通報により情報処理装置
1に間欠障害が多発していることを知ることができるの
で、保守診断装置2内の公知の機能を使用して情報処理
装置1の遠隔保守を開始することにより、速やかに予防
保守を行なうことができる。The remote maintenance center 3 can learn from the above notification that intermittent failures are occurring frequently in the information processing device 1, and therefore performs remote maintenance of the information processing device 1 using a known function in the maintenance diagnosis device 2. By starting, preventive maintenance can be carried out promptly.
以上説明したように、本発明は、情報処理装置に自動訂
正可能な間欠障害が発生し、その障害が同一箇所で予め
設定された判定基準件数を越えた場合、保守診断装置か
ら遠隔保守センタに自動的に通報が発せられるので、自
動訂正されたが故に次の定期保守まで従来見逃すことが
多かった間欠障害の多発を速やかに遠隔保守センタ側で
知ることができる。このため、迅速な予防保守が可能と
なり、間欠障害が重大な固定障害に発展しシステムダウ
ン等になる事態を未然に防ぐことができる。As explained above, the present invention enables automatic correction of intermittent faults that occur in information processing equipment, and when the number of faults exceeds a predetermined criterion number at the same location, the maintenance diagnostic equipment sends information to a remote maintenance center. Since notifications are automatically issued, the remote maintenance center can quickly become aware of frequent occurrences of intermittent failures that were conventionally often overlooked until the next periodic maintenance because they were automatically corrected. Therefore, prompt preventive maintenance can be performed, and a situation where an intermittent failure develops into a serious fixed failure and the system goes down can be prevented.
第1図は本発明の障害自動通報方式を適用した情報処理
システムの一例を示すブロック図、第2図は障害情報収
集手段21の処理例の流れ図、
第3図は障害情報登録手段23の処理例の流れ図、
第4図は障害件数比較手段24の処理例の流れ図および
、
第5図は自動通報手段25の処理例の流れ図である。
図において、
1・・・情報処理装置
2・・・保守診断装置
3・・・遠隔保守センタ
4・・・回線
21・・・障害情報収集手段
22・・・記憶手段
23・・・障害情報登録手段
24・・・障害件数比較手段
25・・・自動通報手段
220・・・固定障害情報部
221・・・間欠障害情報部
222・・・間欠障害件数部
特許出願人 日本電気株式会社外1名FIG. 1 is a block diagram showing an example of an information processing system to which the automatic failure notification system of the present invention is applied, FIG. 2 is a flowchart of an example of processing by the failure information collection means 21, and FIG. 3 is a process by the failure information registration means 23. Example Flowchart: FIG. 4 is a flowchart of a processing example of the failure number comparison means 24, and FIG. 5 is a flowchart of a processing example of the automatic notification means 25. In the figure, 1... Information processing device 2... Maintenance diagnostic device 3... Remote maintenance center 4... Line 21... Fault information collection means 22... Storage means 23... Fault information registration Means 24...Failure number comparison means 25...Automatic reporting means 220...Fixed fault information section 221...Intermittent fault information section 222...Intermittent fault number section Patent applicant: 1 person other than NEC Corporation
Claims (1)
害情報を収集して解析し、前記障害情報が回復を行なえ
なかった障害に関するものであるときは前記障害情報を
記憶手段に記憶すると共に回線を介して遠隔保守センタ
に通報を発し、前記障害情報が回復を行なえた障害に関
するものであるときは前記障害情報及び同一箇所の障害
件数を前記記憶手段に登録する保守診断装置において、
前記記憶手段に登録されている同一箇所の障害件数と予
め設定された判定基準件数とを比較する比較手段を設け
、 該比較手段による比較の結果、前記同一箇所の障害件数
が前記判定基準件数を越えているときは、前記遠隔保守
センタに通報を発することを特徴とする障害自動通報方
式。[Scope of Claims] When a failure occurs in an information processing device, failure information is collected from the information processing device and analyzed, and if the failure information relates to a failure that cannot be recovered, the failure information is stored in a storage means. a maintenance diagnostic device that stores information in the storage means and sends a report to a remote maintenance center via a line, and when the fault information relates to a fault that has been recovered, registers the fault information and the number of faults at the same location in the storage means; In,
Comparing means is provided for comparing the number of failures at the same location registered in the storage means with a preset determination reference number, and as a result of the comparison by the comparison means, the number of failures at the same location exceeds the determination reference number. An automatic failure notification system characterized in that when the fault exceeds the limit, a notification is sent to the remote maintenance center.
Priority Applications (1)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| JP63043691A JPH01217651A (en) | 1988-02-26 | 1988-02-26 | Automatic fault informing system |
Applications Claiming Priority (1)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| JP63043691A JPH01217651A (en) | 1988-02-26 | 1988-02-26 | Automatic fault informing system |
Publications (1)
| Publication Number | Publication Date |
|---|---|
| JPH01217651A true JPH01217651A (en) | 1989-08-31 |
Family
ID=12670856
Family Applications (1)
| Application Number | Title | Priority Date | Filing Date |
|---|---|---|---|
| JP63043691A Pending JPH01217651A (en) | 1988-02-26 | 1988-02-26 | Automatic fault informing system |
Country Status (1)
| Country | Link |
|---|---|
| JP (1) | JPH01217651A (en) |
Cited By (3)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| JPH0395641A (en) * | 1989-09-07 | 1991-04-22 | Fujitsu Ltd | System down preventing system |
| JPH03123945A (en) * | 1989-10-06 | 1991-05-27 | Nec Corp | Information processor |
| JP2010170462A (en) * | 2009-01-26 | 2010-08-05 | Nec Computertechno Ltd | Fault handling device and method |
-
1988
- 1988-02-26 JP JP63043691A patent/JPH01217651A/en active Pending
Cited By (3)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| JPH0395641A (en) * | 1989-09-07 | 1991-04-22 | Fujitsu Ltd | System down preventing system |
| JPH03123945A (en) * | 1989-10-06 | 1991-05-27 | Nec Corp | Information processor |
| JP2010170462A (en) * | 2009-01-26 | 2010-08-05 | Nec Computertechno Ltd | Fault handling device and method |
Similar Documents
| Publication | Publication Date | Title |
|---|---|---|
| US7055062B2 (en) | Method, system and program product for establishing a self-diagnosing and self-repairing automated system | |
| US4462075A (en) | Job processing method utilizing a plurality of information processing devices | |
| JPH01217651A (en) | Automatic fault informing system | |
| JPH06175887A (en) | Fault monitoring / notification method | |
| JPH06175934A (en) | One bit error processing system | |
| JP3025573B2 (en) | Building remote monitoring device | |
| JP3099140B2 (en) | Remote monitoring and control device | |
| JPH1074108A (en) | Fault detection system | |
| JPS62236056A (en) | Input/output controller for information processing system | |
| JP2842718B2 (en) | Processor bus fault identification apparatus and method | |
| JPH08249212A (en) | Fault monitoring method in multiplexed computer system | |
| JPH03263238A (en) | Service processor | |
| JPS63250746A (en) | Automatic fault informing system | |
| JPH08194879A (en) | Remote monitoring device | |
| JP2000089981A (en) | Automatic fault occurrence determination method | |
| JPH07114489A (en) | Failure information automatic contact method | |
| JPH05334128A (en) | Automatic fault reporting system | |
| JPH086445A (en) | Fault information noticing method for monitor of copying machine | |
| JPH0430229A (en) | Automatic notice system for fault | |
| JPS6040056B2 (en) | Failure determination method | |
| JPH08161207A (en) | Network system | |
| JPS5841537B2 (en) | Fault detection identification method | |
| CN119201519A (en) | Fault detection method, device, equipment and medium | |
| JPH05244309A (en) | Communication terminal | |
| JPH05334205A (en) | I/o time-out fault recovery system for computer system |