U.S. Department of Energy, Office of Scientific and Technical Information (OSTI.GOV)

Title: Enabling Diverse Software Stacks on Supercomputers using High Performance Virtual Clusters.

Abstract

While large-scale simulations have been the hallmark of the High Performance Computing (HPC) community for decades, Large Scale Data Analytics (LSDA) workloads are gaining attention within the scientific community not only as a processing component of large HPC simulations, but also as standalone scientific tools for knowledge discovery. On the path toward exascale, new HPC runtime systems are also emerging that differ from classical distributed computing models. However, system software on the latest extreme-scale DOE supercomputers needs to be enhanced to better support these emerging software ecosystems.

In this paper, we propose the use of Virtual Clusters on advanced supercomputing resources to enable systems to support not only HPC workloads, but also emerging big data stacks. Specifically, we have deployed the KVM hypervisor within Cray's Compute Node Linux on an XC-series supercomputer testbed. We also use libvirt and QEMU to manage and provision VMs directly on compute nodes, leveraging Ethernet-over-Aries network emulation. To our knowledge, this is the first known use of KVM on a true MPP supercomputer.

We investigate the overhead of our solution using HPC benchmarks, evaluating both single-node performance and weak scaling of a 32-node virtual cluster. Overall, we find that the single-node performance of our KVM solution on a Cray is very efficient, with near-native performance. However, overhead increases by up to 20% as virtual cluster size grows, due to limitations of the Ethernet-over-Aries bridged network. Furthermore, we deploy Apache Spark with large data analysis workloads in a Virtual Cluster, effectively demonstrating how diverse software ecosystems can be supported by High Performance Virtual Clusters.
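The provisioning path described in the abstract, libvirt driving QEMU/KVM directly on each Cray compute node with guests attached to a bridged Ethernet-over-Aries network, can be sketched in a few lines. Below is a minimal, hypothetical example using the libvirt Python bindings; the report itself does not include a provisioning script, and the domain name, vCPU/memory sizes, image path, and bridge name "br0" here are illustrative assumptions, not values from the paper.

# Hypothetical sketch: start one virtual-cluster guest with QEMU/KVM via
# libvirt on a compute node. All names, sizes, and paths are illustrative.
import libvirt

DOMAIN_XML = """
<domain type='kvm'>
  <name>vcluster-node0</name>
  <memory unit='GiB'>16</memory>
  <vcpu>8</vcpu>
  <os><type arch='x86_64'>hvm</type></os>
  <devices>
    <disk type='file' device='disk'>
      <driver name='qemu' type='qcow2'/>
      <source file='/scratch/images/vcluster-node0.qcow2'/>
      <target dev='vda' bus='virtio'/>
    </disk>
    <!-- Bridged NIC: 'br0' stands in for a host bridge backed by the
         Ethernet-over-Aries emulation device on the compute node. -->
    <interface type='bridge'>
      <source bridge='br0'/>
      <model type='virtio'/>
    </interface>
  </devices>
</domain>
"""

def start_guest() -> None:
    # Connect to the node-local libvirt daemon that manages QEMU/KVM.
    conn = libvirt.open("qemu:///system")
    try:
        # createXML() defines and boots a transient guest in one step;
        # repeating this across N compute nodes yields an N-node
        # virtual cluster.
        dom = conn.createXML(DOMAIN_XML, 0)
        print(f"started guest: {dom.name()}")
    finally:
        conn.close()

if __name__ == "__main__":
    start_guest()

The bridged virtio NIC is the piece that ties a guest into the emulated Ethernet-over-Aries network; per the abstract, this bridged path is also the source of the up-to-20% overhead observed at larger virtual cluster sizes.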

Authors:
Younge, Andrew J. [1]; Pedretti, Kevin [1]; Grant, Ryan [1]; Brightwell, Ron [1]
  1. Sandia National Lab. (SNL-NM), Albuquerque, NM (United States)
Publication Date:
May 2017
Research Org.:
Sandia National Lab. (SNL-NM), Albuquerque, NM (United States)
Sponsoring Org.:
USDOE National Nuclear Security Administration (NNSA)
OSTI Identifier:
1367280
Report Number(s):
SAND-2017-5325
653446
DOE Contract Number:  
AC04-94AL85000
Resource Type:
Technical Report
Country of Publication:
United States
Language:
English
Subject:
97 MATHEMATICS AND COMPUTING

Citation Formats

Younge, Andrew J., Pedretti, Kevin, Grant, Ryan, and Brightwell, Ron. Enabling Diverse Software Stacks on Supercomputers using High Performance Virtual Clusters. United States: N. p., 2017. Web. doi:10.2172/1367280.
Younge, Andrew J., Pedretti, Kevin, Grant, Ryan, & Brightwell, Ron. Enabling Diverse Software Stacks on Supercomputers using High Performance Virtual Clusters. United States. doi:10.2172/1367280.
Younge, Andrew J., Pedretti, Kevin, Grant, Ryan, and Brightwell, Ron. 2017. "Enabling Diverse Software Stacks on Supercomputers using High Performance Virtual Clusters." United States. doi:10.2172/1367280. https://www.osti.gov/servlets/purl/1367280.
@article{osti_1367280,
title = {Enabling Diverse Software Stacks on Supercomputers using High Performance Virtual Clusters.},
author = {Younge, Andrew J. and Pedretti, Kevin and Grant, Ryan and Brightwell, Ron},
abstractNote = {While large-scale simulations have been the hallmark of the High Performance Computing (HPC) community for decades, Large Scale Data Analytics (LSDA) workloads are gaining attention within the scientific community not only as a processing component of large HPC simulations, but also as standalone scientific tools for knowledge discovery. On the path toward exascale, new HPC runtime systems are also emerging that differ from classical distributed computing models. However, system software on the latest extreme-scale DOE supercomputers needs to be enhanced to better support these emerging software ecosystems. In this paper, we propose the use of Virtual Clusters on advanced supercomputing resources to enable systems to support not only HPC workloads, but also emerging big data stacks. Specifically, we have deployed the KVM hypervisor within Cray's Compute Node Linux on an XC-series supercomputer testbed. We also use libvirt and QEMU to manage and provision VMs directly on compute nodes, leveraging Ethernet-over-Aries network emulation. To our knowledge, this is the first known use of KVM on a true MPP supercomputer. We investigate the overhead of our solution using HPC benchmarks, evaluating both single-node performance and weak scaling of a 32-node virtual cluster. Overall, we find that the single-node performance of our KVM solution on a Cray is very efficient, with near-native performance. However, overhead increases by up to 20% as virtual cluster size grows, due to limitations of the Ethernet-over-Aries bridged network. Furthermore, we deploy Apache Spark with large data analysis workloads in a Virtual Cluster, effectively demonstrating how diverse software ecosystems can be supported by High Performance Virtual Clusters.},
doi = {10.2172/1367280},
place = {United States},
year = {2017},
month = {5}
}
