skip to main content
OSTI.GOV title logo U.S. Department of Energy
Office of Scientific and Technical Information

Title: A gLite FTS based solution for managing user output in CMS

Conference · · J.Phys.Conf.Ser.

The CMS distributed data analysis workflow assumes that jobs run in a different location from where their results are finally stored. Typically the user output must be transferred across the network from one site to another, possibly on a different continent or over links not necessarily validated for high bandwidth/high reliability transfer. This step is named stage-out and in CMS was originally implemented as a synchronous step of the analysis job execution. However, our experience showed the weakness of this approach both in terms of low total job execution efficiency and failure rates, wasting precious CPU resources. The nature of analysis data makes it inappropriate to use PhEDEx, the core data placement system for CMS. As part of the new generation of CMS Workload Management tools, the Asynchronous Stage-Out system (AsyncStageOut) has been developed to enable third party copy of the user output. The AsyncStageOut component manages glite FTS transfers of data from the temporary store at the site where the job ran to the final location of the data on behalf of that data owner. The tool uses python daemons, built using the WMCore framework, and CouchDB, to manage the queue of work and FTS transfers. CouchDB also provides the platform for a dedicated operations monitoring system. In this paper, we present the motivations of the asynchronous stage-out system. We give an insight into the design and the implementation of key features, describing how it is coupled with the CMS workload management system. Finally, we show the results and the commissioning experience.

Research Organization:
Fermi National Accelerator Lab. (FNAL), Batavia, IL (United States)
Sponsoring Organization:
USDOE Office of Science (SC), High Energy Physics (HEP)
DOE Contract Number:
AC02-07CH11359
OSTI ID:
1405168
Report Number(s):
FERMILAB-CONF-12-844-CD; 1210958
Journal Information:
J.Phys.Conf.Ser., Vol. 396; Conference: 19th International Conference on Computing in High Energy and Nuclear Physics, New York, USA, 05/21-05/25/2012
Country of Publication:
United States
Language:
English

Similar Records

AsyncStageOut: Distributed User Data Management for CMS Analysis
Conference · Wed Dec 23 00:00:00 EST 2015 · J.Phys.Conf.Ser. · OSTI ID:1405168

A comparison of different database technologies for the CMS AsyncStageOut transfer database
Journal Article · Thu Nov 23 00:00:00 EST 2017 · Journal of Physics. Conference Series · OSTI ID:1405168

The WorkQueue project: A task queue for the CMS workload management system
Conference · Sun Jan 01 00:00:00 EST 2012 · J.Phys.Conf.Ser. · OSTI ID:1405168

Related Subjects