skip to main content
OSTI.GOV title logo U.S. Department of Energy
Office of Scientific and Technical Information

Title: AsyncStageOut: Distributed User Data Management for CMS Analysis

Conference · · J.Phys.Conf.Ser.

AsyncStageOut (ASO) is a new component of the distributed data analysis system of CMS, CRAB, designed for managing users' data. It addresses a major weakness of the previous model, namely that mass storage of output data was part of the job execution resulting in inefficient use of job slots and an unacceptable failure rate at the end of the jobs. ASO foresees the management of up to 400k files per day of various sizes, spread worldwide across more than 60 sites. It must handle up to 1000 individual users per month, and work with minimal delay. This creates challenging requirements for system scalability, performance and monitoring. ASO uses FTS to schedule and execute the transfers between the storage elements of the source and destination sites. It has evolved from a limited prototype to a highly adaptable service, which manages and monitors the user file placement and bookkeeping. To ensure system scalability and data monitoring, it employs new technologies such as a NoSQL database and re-uses existing components of PhEDEx and the FTS Dashboard. We present the asynchronous stage-out strategy and the architecture of the solution we implemented to deal with those issues and challenges. The deployment model for the high availability and scalability of the service is discussed. The performance of the system during the commissioning and the first phase of production are also shown, along with results from simulations designed to explore the limits of scalability.

Research Organization:
Fermi National Accelerator Lab. (FNAL), Batavia, IL (United States)
Sponsoring Organization:
USDOE Office of Science (SC), High Energy Physics (HEP)
DOE Contract Number:
AC02-07CH11359
OSTI ID:
1247517
Report Number(s):
FERMILAB-CONF-15-605-CD; 1413970
Journal Information:
J.Phys.Conf.Ser., Vol. 664, Issue 6; Conference: 21st International Conference on Computing in High Energy and Nuclear Physics, Okinawa, Japan, 04/13-04/17/2015
Country of Publication:
United States
Language:
English

Similar Records

A comparison of different database technologies for the CMS AsyncStageOut transfer database
Journal Article · Thu Nov 23 00:00:00 EST 2017 · Journal of Physics. Conference Series · OSTI ID:1247517

A gLite FTS based solution for managing user output in CMS
Conference · Sun Jan 01 00:00:00 EST 2012 · J.Phys.Conf.Ser. · OSTI ID:1247517

CMS distributed data analysis with CRAB3
Journal Article · Wed Dec 23 00:00:00 EST 2015 · Journal of Physics. Conference Series · OSTI ID:1247517

Related Subjects