OSTI.GOV, U.S. Department of Energy
Office of Scientific and Technical Information

Title: YARNsim: Simulating Hadoop YARN

Conference

Despite the popularity of the Apache Hadoop system, its success has been limited by issues such as single points of failure, centralized job/task management, and lack of support for programming models other than MapReduce. The next generation of Hadoop, Apache Hadoop YARN, is designed to address these issues. In this paper, we propose YARNsim, a simulation system for Hadoop YARN. YARNsim is based on parallel discrete event simulation and provides protocol-level accuracy in simulating key components of YARN. YARNsim provides a virtual platform on which system architects can evaluate the design and implementation of Hadoop YARN systems. Also, application developers can tune job performance and understand the tradeoffs between different configurations, and Hadoop YARN system vendors can evaluate system efficiency under limited budgets. To demonstrate the validity of YARNsim, we use it to model two real systems and compare the experimental results from YARNsim and the real systems. The experiments include standard Hadoop benchmarks, synthetic workloads, and a bioinformatics application. The results show that the error rate is within 10% for the majority of test cases. The experiments prove that YARNsim can provide what-if analysis for system designers in a timely manner and at minimal cost compared with testing and evaluating on a real system.
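The abstract names two technical ideas: discrete event simulation and validation by comparing simulated results against a real system. The sketch below illustrates both in miniature. It is not YARNsim's implementation: it is sequential rather than parallel, it models only a fixed pool of containers running tasks in FIFO order, and every name in it (`simulate`, `relative_error`, the task durations, the "measured" runtime) is a hypothetical chosen for illustration, not taken from the paper.

```python
import heapq


def simulate(task_durations, num_containers):
    """Return the simulated makespan for tasks run FIFO on a fixed container pool.

    Illustrative assumption: each task occupies exactly one container for a
    known duration. This is a toy model, not the YARN scheduling protocol.
    """
    if num_containers < 1:
        raise ValueError("num_containers must be at least 1")

    pending = list(task_durations)  # tasks waiting for a container, FIFO
    events = []                     # min-heap of (finish_time, task_id)
    clock = 0.0                     # current simulated time
    free = num_containers           # containers currently idle
    next_id = 0

    while pending or events:
        # Launch waiting tasks on every idle container.
        while pending and free > 0:
            duration = pending.pop(0)
            heapq.heappush(events, (clock + duration, next_id))
            next_id += 1
            free -= 1
        # Jump the clock straight to the next task-completion event.
        clock, _ = heapq.heappop(events)
        free += 1

    return clock


def relative_error(simulated, measured):
    """Relative error of a simulated value against a measured one."""
    return abs(simulated - measured) / measured
```

For example, `simulate([4, 3, 2, 1], 2)` yields a makespan of 5.0 time units. Against a made-up measured runtime of 5.5 units, `relative_error(5.0, 5.5)` is about 0.09, which would count as falling within a 10% error bound of the kind the abstract reports.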

Research Organization:
Argonne National Lab. (ANL), Argonne, IL (United States)
Sponsoring Organization:
USDOE Office of Science - Office of Advanced Scientific Computing Research
DOE Contract Number:
AC02-06CH11357
OSTI ID:
1335904
Resource Relation:
Conference: 15th IEEE/ACM International Symposium on Cluster, Cloud and Grid Computing, May 4-7, 2015, Shenzhen, Guangdong, China
Country of Publication:
United States
Language:
English

Similar Records

Large-scale seismic waveform quality metric calculation using Hadoop
Journal Article · May 27, 2016 · Computers and Geosciences

An overview of the Hadoop/MapReduce/HBase framework and its current applications in bioinformatics
Journal Article · December 21, 2010 · BMC Bioinformatics, 11(Suppl 12):S1

MARIANE: MApReduce Implementation Adapted for HPC Environments
Conference · July 6, 2011