Checkpoint/Restart of Virtual Machines Based on Xen
- ORNL
System-level virtualization provides several advantages: (i) customization is eased, since virtual machines may be based on different systems; (ii) virtual machines are isolated from the hardware, and applications are in turn isolated via the virtual machines; (iii) basic fault tolerance mechanisms are available, namely proactive fault tolerance through virtual machine migration and virtual machine snapshot/restore; and (iv) basic load balancing mechanisms are available, namely the capability to move and stop virtual machines running in the system. However, the current Xen implementation does not natively provide mechanisms for virtual machine checkpoint/restart. This document presents the design of a reactive fault-tolerant system based on a checkpoint/restart mechanism for Xen virtual machines. We present the infrastructure for managing virtual machine checkpoint data, as well as the challenges in implementing a virtual machine checkpoint/restart mechanism based on Xen.
- Research Organization:
- Oak Ridge National Lab. (ORNL), Oak Ridge, TN (United States)
- Sponsoring Organization:
- USDOE Laboratory Directed Research and Development (LDRD) Program
- DOE Contract Number:
- DE-AC05-00OR22725
- OSTI ID:
- 931386
- Resource Relation:
- Conference: High Availability and Performance Computing Workshop 2006, Santa Fe, NM, USA, October 17, 2006
- Country of Publication:
- United States
- Language:
- English
Similar Records
Proactive Fault Tolerance for HPC with Xen Virtualization
Transparent System-level Migration of PGAS Applications using Xen on InfiniBand