WO2022037332A1 - Fault alarm method and apparatus for virtual machine, and electronic device and computer-readable storage medium - Google Patents
Fault alarm method and apparatus for virtual machine, and electronic device and computer-readable storage medium
- Publication number
- WO2022037332A1 (PCT/CN2021/106346)
- Authority
- WO
- WIPO (PCT)
- Prior art keywords
- virtual machine
- operation module
- module
- designated operation
- migration
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Ceased
Classifications
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F21/00—Security arrangements for protecting computers, components thereof, programs or data against unauthorised activity
- G06F21/50—Monitoring users, programs or devices to maintain the integrity of platforms, e.g. of processors, firmware or operating systems
- G06F21/52—Monitoring users, programs or devices to maintain the integrity of platforms, e.g. of processors, firmware or operating systems during program execution, e.g. stack integrity ; Preventing unwanted data erasure; Buffer overflow
- G06F21/53—Monitoring users, programs or devices to maintain the integrity of platforms, e.g. of processors, firmware or operating systems during program execution, e.g. stack integrity ; Preventing unwanted data erasure; Buffer overflow by executing in a restricted environment, e.g. sandbox or secure virtual machine
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F9/00—Arrangements for program control, e.g. control units
- G06F9/06—Arrangements for program control, e.g. control units using stored programs, i.e. using an internal store of processing equipment to receive or retain programs
- G06F9/44—Arrangements for executing specific programs
- G06F9/455—Emulation; Interpretation; Software simulation, e.g. virtualisation or emulation of application or operating system execution engines
- G06F9/45533—Hypervisors; Virtual machine monitors
- G06F9/45558—Hypervisor-specific management and integration aspects
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F9/00—Arrangements for program control, e.g. control units
- G06F9/06—Arrangements for program control, e.g. control units using stored programs, i.e. using an internal store of processing equipment to receive or retain programs
- G06F9/44—Arrangements for executing specific programs
- G06F9/455—Emulation; Interpretation; Software simulation, e.g. virtualisation or emulation of application or operating system execution engines
- G06F9/45533—Hypervisors; Virtual machine monitors
- G06F9/45558—Hypervisor-specific management and integration aspects
- G06F2009/4557—Distribution of virtual machine instances; Migration and load balancing
Definitions
- Migration is an important virtual machine function: a running virtual machine is moved completely from one host to another without stopping its operation. Migration usually includes disk migration, memory migration, and CPU state migration. A migration may fail because of various unexpected situations, and if the failure is not detected in time, the normal operation of the entire system is affected. Timely fault detection is therefore indispensable in virtual machine migration.
- The related art relies on qemu logs and libvirt logs to discover migration failures.
- However, qemu logs and libvirt logs only report common migration failures that can be predicted in advance (for example, a virtual machine that cannot be powered on), so other, unpredictable failures are difficult to find.
- the purpose of the present application is to provide a fault alarm method, device, electronic device and computer-readable storage medium for a virtual machine, which can timely detect the fault in the running of the virtual machine and issue an alarm in real time.
- A timer closing instruction is configured at the end position of the designated operation module; the method further includes: in response to the process of the virtual machine running to the end position of the designated operation module, reading the timer closing instruction and closing the timer corresponding to the designated operation module.
- The designated operation module includes a module corresponding to a virtual machine migration operation; the virtual machine migration operation includes a disk migration operation and a memory migration operation; the disk migration operation includes a disk migration preparation operation and a disk migration execution operation; the memory migration operation includes a memory migration preparation operation and a memory migration execution operation.
- An embodiment of the present application further provides a fault alarm device for a virtual machine, where the virtual machine includes at least one designated operation module. The device includes: a timer start module configured to start the timer corresponding to the designated operation module in response to the process of the virtual machine running to the initial position of the designated operation module; a process judgment module configured to judge, in response to the event that the timing duration of the timer reaches the set duration, whether the process of the virtual machine has run to the end position of the designated operation module; and a fault alarm module configured to issue a fault alarm for the designated operation module of the virtual machine when the process judgment module judges that the process of the virtual machine has not run to the end position of the designated operation module.
- The designated operation module includes a module corresponding to a virtual machine migration operation; the virtual machine migration operation includes a disk migration operation and a memory migration operation; the disk migration operation includes a disk migration preparation operation and a disk migration execution operation; the memory migration operation includes a memory migration preparation operation and a memory migration execution operation.
- the virtual machine includes a plurality of designated operation modules, timers are respectively set in the plurality of designated operation modules, and the time duration of the timer of each designated operation module is set according to the designated operation module.
- embodiments of the present application further provide an electronic device, including a processor and a memory, where the memory stores computer-executable instructions that can be executed by the processor, and the processor executes the computer-executable instructions to implement the above method.
- Embodiments of the present application further provide a computer-readable storage medium that stores computer-executable instructions; when the computer-executable instructions are invoked and executed by a processor, they cause the processor to implement the above method.
- FIG. 2 is a schematic diagram of modules corresponding to a migration operation of a virtual machine according to an embodiment of the present application
- FIG. 3 is a structural block diagram of a fault alarm device for a virtual machine provided by an embodiment of the present application.
- FIG. 4 is a schematic structural diagram of an electronic device according to an embodiment of the present application.
- the virtual machine includes at least one designated operation module, and the above method includes the following steps:
- The designated operation module may be a module corresponding to the virtual machine migration operation. For example, the virtual machine migration operation includes a disk migration operation and a memory migration operation; the disk migration operation includes a disk migration preparation operation and a disk migration execution operation; the memory migration operation includes a memory migration preparation operation and a memory migration execution operation.
- The designated operation module may therefore be the module corresponding to the disk migration preparation operation, the disk migration execution operation, the memory migration preparation operation, or the memory migration execution operation.
- the specified operation module may also be a module corresponding to other running operations, which is not limited here.
- the setting code of the timer is pre-added to the code corresponding to each designated operation module, and the time of the timer is set to the operation reference time of the designated operation module.
- The running reference time can be determined from multiple actual running times of the designated operation module. For example, the maximum of the actual running times, or the maximum after removing erroneous values, can be used as the running reference time; other determination methods may also be used and are not limited here.
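The two example rules for the running reference time can be illustrated with a short Python sketch. The source does not say how an erroneous value is identified, so the rule used here (discard samples larger than `factor` times the median) is purely an assumption, as are the function and parameter names.

```python
from statistics import median


def reference_time(run_times, drop_outliers=False, factor=3.0):
    """Return a running reference time (seconds) from observed run times.

    drop_outliers=False: the plain maximum of the samples.
    drop_outliers=True: the maximum after discarding samples larger than
    factor * median (an assumed outlier rule, not taken from the source).
    """
    samples = list(run_times)
    if not samples:
        raise ValueError("at least one observed run time is required")
    if not drop_outliers:
        return max(samples)
    limit = factor * median(samples)
    # The smallest sample never exceeds the median, so `kept` is non-empty
    # for positive run times and factor >= 1.
    kept = [t for t in samples if t <= limit]
    return max(kept)


observed = [10.0, 11.0, 12.0, 95.0]   # hypothetical measurements
print(reference_time(observed))                      # plain maximum
print(reference_time(observed, drop_outliers=True))  # maximum without the 95 s sample
```

A looser rule gives fewer false alarms but detects a hung module later; the right trade-off depends on how much the running time of each module varies.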
- When the process of the virtual machine runs to the initial position of the designated operation module, the timer corresponding to the designated operation module can be started automatically to begin timing.
- When the timing duration of the timer reaches the set duration, that is, the running reference time, it is determined whether the process of the virtual machine has run to the end position of the designated operation module.
- Step S106: if the process of the virtual machine has not run to the end position of the designated operation module, a fault alarm is issued for the designated operation module of the virtual machine.
- a fault alarm can be performed on the specified operation module of the virtual machine.
- the timer can directly trigger the alarm operation, or directly call the alarm program to perform the alarm operation.
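The start, check, and alarm logic described above can be sketched with explicit timestamps instead of real timers, which keeps the behaviour easy to follow. This is a minimal Python illustration under stated assumptions, not the application's implementation: `ModuleWatch`, its method names, and the module name are hypothetical.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass
class ModuleWatch:
    """State kept for one designated operation module (illustrative names)."""
    name: str
    set_duration: float                 # running reference time, in seconds
    started_at: Optional[float] = None
    reached_end: bool = False

    def on_start(self, now: float) -> None:
        # The process reached the initial position: start timing.
        self.started_at = now
        self.reached_end = False

    def on_end(self) -> None:
        # The process reached the end position of the module.
        self.reached_end = True

    def check(self, now: float) -> Optional[str]:
        """Return an alarm message if the set duration elapsed before the end."""
        if self.started_at is None or now - self.started_at < self.set_duration:
            return None                 # timer not started or not yet expired
        if self.reached_end:
            return None                 # module finished in time
        return f"fault alarm: module '{self.name}' has not reached its end position"


watch = ModuleWatch("disk_migration_prepare", set_duration=5.0)
watch.on_start(now=100.0)
print(watch.check(now=103.0))   # timer still running, so no alarm yet
print(watch.check(now=105.0))   # set duration reached without on_end(): alarm
```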
- The migration operation of the virtual machine is first divided into parts.
- The virtual machine migration operation may include a disk migration operation and a memory migration operation; the disk migration operation may include a disk migration preparation operation and a disk migration execution operation; the memory migration operation may include a memory migration preparation operation and a memory migration execution operation. The virtual machine migration operation can therefore include multiple designated operation modules: the module corresponding to the disk migration preparation operation, the module corresponding to the disk migration execution operation, the module corresponding to the memory migration preparation operation, and the module corresponding to the memory migration execution operation. As shown in FIG. 2, the programs of the four modules are executed sequentially.
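As an illustration of this division, the four sequential modules can be written as an ordered table, each with its own set duration. The Python sketch below is only an assumption-laden example: the module identifiers and the durations in seconds are invented for illustration and do not come from the application.

```python
# Ordered as executed: disk preparation, disk execution,
# memory preparation, memory execution. Durations are hypothetical.
MIGRATION_MODULES = [
    ("disk_migration_prepare", 10.0),
    ("disk_migration_execute", 600.0),
    ("memory_migration_prepare", 10.0),
    ("memory_migration_execute", 120.0),
]


def overrunning_modules(actual_seconds):
    """Return the modules whose actual duration exceeded their set duration.

    `actual_seconds` maps a module name to its measured duration. A module
    missing from the mapping was never reached (for example because an
    earlier module hung), so the walk stops there.
    """
    alarms = []
    for name, set_duration in MIGRATION_MODULES:
        spent = actual_seconds.get(name)
        if spent is None:
            break
        if spent > set_duration:
            alarms.append(name)
    return alarms


measured = {
    "disk_migration_prepare": 4.0,
    "disk_migration_execute": 700.0,   # exceeds its 600 s set duration
    "memory_migration_prepare": 1.0,
    "memory_migration_execute": 50.0,
}
print(overrunning_modules(measured))
```

Giving each module its own duration means a short preparation step that hangs is reported after seconds rather than after the much longer budget of a full migration.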
- a corresponding timer is set in each designated operation module, and the set time corresponding to the timer is the reference time of the operation process of the designated operation module.
- Timers can be implemented using timers built into QEMU (a software set up to run a virtual machine).
- the start instruction of the timer is set at the initial position of each designated operation module. When the process of the virtual machine runs to the initial position of the designated operation module, the timer start instruction is read to start the timer corresponding to the designated operation module.
- After the timer is started, it begins timing. When the timing reaches the set time, it is judged whether the process of the virtual machine has run to the end position of the designated operation module. If it has not, a fault alarm operation is performed for the designated operation module.
- The timer is further provided with a callback function; the alarm function implemented inside the callback function can be set in different ways as required.
- the timer timeout is triggered, and the callback function in the timer is called to realize the fault alarm of the specified operation module of the virtual machine.
- A timer closing instruction is configured at the end position of each designated operation module; when the process of the virtual machine runs to the end position of the designated operation module, the timer closing instruction is read, and the timer corresponding to the designated operation module is closed.
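The pairing of a timer start instruction at the initial position, a timer closing instruction at the end position, and a callback on timeout can be sketched as a Python context manager built on the standard `threading.Timer`. This is not QEMU's built-in timer facility; `watched_module`, the module names, and the durations are assumptions chosen for illustration.

```python
import threading
import time
from contextlib import contextmanager


@contextmanager
def watched_module(name, set_duration, alarm):
    """Run a designated operation module under a timeout timer.

    `alarm` is the callback invoked with the module name if the set
    duration elapses before the end position is reached.
    """
    timer = threading.Timer(set_duration, alarm, args=(name,))
    timer.daemon = True
    timer.start()      # initial position: timer start instruction
    yield              # body of the designated operation module
    # End position: timer closing instruction. If the body raises, this
    # line is never reached, so the timer still fires and raises the alarm.
    timer.cancel()


alarms = []
with watched_module("memory_migration_prepare", 5.0, alarms.append):
    pass                   # finishes in time: timer closed, no alarm
with watched_module("memory_migration_execute", 0.05, alarms.append):
    time.sleep(0.3)        # overruns the set duration: callback fires
print(alarms)
```

The callback runs on the timer's own thread while the module is still executing, so the alarm is raised as soon as the set duration is exceeded rather than after the module eventually returns.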
- the above-mentioned step of alarming the failure of the designated operation module of the virtual machine may further include: displaying prompt information that the designated operation module of the virtual machine has a failure.
- The displayed prompt information can be accompanied by an alarm mark, such as "!!!".
- the related art can only discover the faults that are clearly known and may exist, and the existence of such faults and the possibility of such faults must be known in advance, so that such faults can be identified, discovered and alarmed.
- The fault alarm method for a virtual machine provided by the embodiment of the present application does not consider the details or type of a fault. It considers only the consequence of the fault, namely that program execution exceeds the normal time: whatever the fault, an alarm can be raised as long as it causes program execution to exceed the normal time. The embodiments of the present application can therefore discover and alarm on unexpected and unknown faults.
- the embodiments of the present application further provide a fault alarm device for a virtual machine.
- the virtual machine includes at least one designated operation module.
- The device includes: a timer start module 32 configured to start the timer corresponding to the designated operation module in response to the process of the virtual machine running to the initial position of the designated operation module; a process judgment module 34 configured to judge, in response to the event that the timing duration of the timer reaches the set duration, whether the process of the virtual machine has run to the end position of the designated operation module; and a fault alarm module 36 configured to issue a fault alarm for the designated operation module of the virtual machine when the judgment result of the process judgment module is negative.
- A timer start instruction is configured at the initial position of the designated operation module; the timer start module 32 is further configured to: in response to the process of the virtual machine running to the initial position of the designated operation module, read the timer start instruction and start the timer corresponding to the designated operation module.
- A timer closing instruction is configured at the end position of the designated operation module; the apparatus further includes a timer closing module configured to: in response to the process of the virtual machine running to the end position of the designated operation module, read the timer closing instruction and close the timer corresponding to the designated operation module.
- FIG. 4 is a schematic structural diagram of the electronic device. The electronic device includes a processor 41 and a memory 40; the memory 40 stores computer-executable instructions that can be executed by the processor 41, and the processor 41 executes the computer-executable instructions to implement the above-mentioned fault alarm method for a virtual machine.
- the electronic device further includes a bus 42 and a communication interface 43 , wherein the processor 41 , the communication interface 43 and the memory 40 are connected through the bus 42 .
- The memory 40 may include a high-speed random access memory (RAM) and may also include a non-volatile memory, such as at least one disk memory.
- the communication connection between the network element of the system and at least one other network element is realized through at least one communication interface 43 (which may be wired or wireless), which may use the Internet, a wide area network, a local network, a metropolitan area network, and the like.
- The bus 42 may be an ISA (Industry Standard Architecture) bus, a PCI (Peripheral Component Interconnect) bus, an EISA (Extended Industry Standard Architecture) bus, or the like.
- the bus 42 can be divided into an address bus, a data bus, a control bus, and the like. For ease of representation, only one bidirectional arrow is shown in Figure 4, but it does not mean that there is only one bus or one type of bus.
- the processor 41 may be an integrated circuit chip with signal processing capability. In the implementation process, each step of the above-mentioned method can be completed by a hardware integrated logic circuit in the processor 41 or an instruction in the form of software.
- The processor 41 can be a general-purpose processor, including a central processing unit (CPU), a network processor (NP), and the like; it can also be a digital signal processor (DSP), an application-specific integrated circuit (ASIC), a field-programmable gate array (FPGA) or other programmable logic device, a discrete gate or transistor logic device, or a discrete hardware component.
- a general purpose processor may be a microprocessor or the processor may be any conventional processor or the like.
- the steps of the method disclosed in conjunction with the embodiments of the present application may be directly embodied as executed by a hardware decoding processor, or executed by a combination of hardware and software modules in the decoding processor.
- the software modules may reside in storage media mature in the art, such as random access memory, flash memory, read-only memory, programmable read-only memory, electrically erasable programmable memory, and registers.
- the storage medium is located in the memory, and the processor 41 reads the information in the memory, and completes the steps of the method for alarming the fault of the virtual machine in the foregoing embodiment in combination with its hardware.
- the virtual machine includes at least one designated operation module, and each designated operation module is preset with a corresponding timer.
- When the process of the virtual machine runs to the initial position of the designated operation module, the timer corresponding to the designated operation module is started; when the timing duration of the timer reaches the set duration, it is judged whether the process of the virtual machine has run to the end position of the designated operation module; if it has not, a fault alarm is issued for the designated operation module of the virtual machine.
- the operation process of the operation module is over, it can be determined whether there is a fault in the designated operation module, and if there is a fault, a real-time alarm will be issued.
- the present application can be applied to the technical field of data detection, and realizes a fault alarm method, device, electronic device and computer-readable storage medium for a virtual machine that can timely detect faults in virtual machine operation and perform real-time alarming.
Landscapes
- Engineering & Computer Science (AREA)
- Software Systems (AREA)
- Theoretical Computer Science (AREA)
- Computer Security & Cryptography (AREA)
- Physics & Mathematics (AREA)
- General Engineering & Computer Science (AREA)
- General Physics & Mathematics (AREA)
- Computer Hardware Design (AREA)
- Debugging And Monitoring (AREA)
Abstract
Disclosed are a fault alarm method and apparatus for a virtual machine, as well as an electronic device and a computer-readable storage medium, the virtual machine comprising at least one designated operation module. The method comprises: in response to the process of a virtual machine running to an initial position of a designated operation module, starting a timer corresponding to the designated operation module; in response to the event that a timing duration of the timer reaches a set duration, determining whether the process of the virtual machine has run to an end position of the designated operation module; and when it is determined that the process of the virtual machine has not run to the end position of the designated operation module, issuing a fault alarm for the designated operation module of the virtual machine. By means of the present invention, a fault occurring while a virtual machine is running can be detected in a timely manner, and a real-time alarm can be issued.
Applications Claiming Priority (2)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| CN202010844384.0A CN111984969B (zh) | 2020-08-20 | 2020-08-20 | Fault alarm method and apparatus for virtual machine, and electronic device |
| CN202010844384.0 | 2020-08-20 | | |
Publications (1)
| Publication Number | Publication Date |
|---|---|
| WO2022037332A1 (fr) | 2022-02-24 |
Family
ID=73442671
Family Applications (1)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| PCT/CN2021/106346 Ceased WO2022037332A1 (fr) | 2020-08-20 | 2021-07-14 | Fault alarm method and apparatus for virtual machine, and electronic device and computer-readable storage medium |
Country Status (2)
| Country | Link |
|---|---|
| CN (1) | CN111984969B (fr) |
| WO (1) | WO2022037332A1 (fr) |
Families Citing this family (1)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| CN111984969B (zh) * | 2020-08-20 | 2024-12-27 | 北京金山云网络技术有限公司 | Fault alarm method and apparatus for virtual machine, and electronic device |
Citations (4)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| US20110047543A1 (en) * | 2009-08-21 | 2011-02-24 | Preet Mohinder | System and Method for Providing Address Protection in a Virtual Environment |
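| CN102521098A (zh) * | 2011-11-23 | 2012-06-27 | 中兴通讯股份有限公司 | Processing method and apparatus for CPU crash monitoring |
| CN104794016A (zh) * | 2015-04-23 | 2015-07-22 | 惠州Tcl移动通信有限公司 | Method and system for detecting and handling mobile terminal crashes |
| CN111984969A (zh) * | 2020-08-20 | 2020-11-24 | 北京金山云网络技术有限公司 | Fault alarm method and apparatus for virtual machine, and electronic device |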
Family Cites Families (9)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| EP1962192A1 (fr) * | 2007-02-21 | 2008-08-27 | Deutsche Telekom AG | Method and system for the transparent migration of virtual machine storage |
| CN102902599B (zh) * | 2012-09-17 | 2016-08-24 | 华为技术有限公司 | Method, apparatus, and system for handling internal faults of a virtual machine |
| CN107179957B (zh) * | 2016-03-10 | 2020-08-25 | 阿里巴巴集团控股有限公司 | Physical machine fault classification and handling method and apparatus, and virtual machine recovery method and system |
| WO2018010068A1 (fr) * | 2016-07-11 | 2018-01-18 | 华为技术有限公司 | Method and device for providing an alert in a network function virtualization environment |
| WO2018101736A1 (fr) * | 2016-11-30 | 2018-06-07 | 제주대학교 산학협력단 | Method and apparatus for virtual machine migration |
| CN107704314B (zh) * | 2017-11-09 | 2023-09-12 | 北京百度网讯科技有限公司 | Method and apparatus for migrating a virtual machine |
| CN108334401B (zh) * | 2018-01-31 | 2020-07-28 | 武汉噢易云计算股份有限公司 | System and method for dynamic allocation of logical volumes with support for live migration of virtual machines |
| US11074095B2 (en) * | 2019-01-04 | 2021-07-27 | International Business Machines Corporation | Event-based virtual machine that hosts microservices for handling program faults |
| CN110389838B (zh) * | 2019-07-24 | 2021-06-25 | 北京邮电大学 | Real-time scheduling and online migration management and control method for virtual resources |
2020
- 2020-08-20 CN CN202010844384.0A patent/CN111984969B/zh Active
2021
- 2021-07-14 WO PCT/CN2021/106346 patent/WO2022037332A1/fr Ceased
Also Published As
| Publication number | Publication date |
|---|---|
| CN111984969B (zh) | 2024-12-27 |
| CN111984969A (zh) | 2020-11-24 |
Similar Documents
| Publication | Publication Date | Title |
|---|---|---|
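| CN113157504A (zh) | | Touch screen device testing method and touch screen device testing system |
| CN107924356A (zh) | | Zero-overhead code coverage analysis |
| WO2016173353A1 (fr) | | Washing machine fault detection and processing method |
| WO2022228315A1 (fr) | | Method and apparatus for configuring an MMIO base address of a server system |
| WO2022037332A1 (fr) | | Fault alarm method and apparatus for virtual machine, and electronic device and computer-readable storage medium |
| CN114721886A (zh) | | Fault detection method, apparatus, device, and readable storage medium |
| CN110287115A (zh) | | Test report generation method, apparatus, and server |
| CN116302694A (zh) | | Fault handling method, apparatus, communication device, and storage medium |
| CN111427777B (zh) | | SOL testing method, system, device, and medium |
| CN114924945A (zh) | | Test-driven service monitoring method, apparatus, device, and medium |
| CN113612661B (zh) | | Method, apparatus, computing device, and storage medium for verifying program stability |
| CN117851234A (zh) | | Method for determining the initial error-reporting version in software testing, and related products |
| CN117349833A (zh) | | Virus monitoring method, apparatus, electronic device, and storage medium |
| CN116095048A (zh) | | MAC address detection method, system, apparatus, and computer-readable storage medium |
| CN116756037A (zh) | | Abnormal code locating system, method, device, and computer-readable storage medium |
| CN113127277A (zh) | | Device testing method, apparatus, electronic device, and readable storage medium |
| CN112130886B (zh) | | Method and apparatus for upgrading the main control program of a traffic signal controller |
| CN115525501A (zh) | | Method and apparatus for reporting crash information, electronic device, and storage medium |
| CN113626233B (zh) | | Method, apparatus, and device for automatically testing the BIOS watchdog function |
| CN113687869A (zh) | | Method and apparatus for compatibility between the TXT function and the ASD function |
| CN114297009B (zh) | | Server testing method and apparatus, and electronic device |
| JPS58181160A (ja) | | Emergency operation control system |
| CN117493292A (zh) | | Method and apparatus for controlling app log recording of an energy controller |
| CN117389863A (zh) | | Self-check method, system, medium, and electronic device for an ECC fault policy |
| JP2002116926A (ja) | | Program processing device and program processing method |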
Legal Events
| Date | Code | Title | Description |
|---|---|---|---|
| | 121 | Ep: the epo has been informed by wipo that ep was designated in this application | Ref document number: 21857427; Country of ref document: EP; Kind code of ref document: A1 |
| | NENP | Non-entry into the national phase | Ref country code: DE |
| | 122 | Ep: pct application non-entry in european phase | Ref document number: 21857427; Country of ref document: EP; Kind code of ref document: A1 |