Title: Producing Madgraph5_aMC@NLO gridpacks and using TensorFlow GPU resources in the CMS HTCondor Global Pool

Conference · EPJ Web Conf.

The CMS experiment has an HTCondor Global Pool, composed of more than 200K CPU cores available for Monte Carlo production and the analysis of data. The submission of user jobs to this pool is handled either by CRAB, the standard workflow management tool used by CMS users to submit analysis jobs requiring event processing of large amounts of data, or by CMS Connect, a service focused on final-stage, Condor-like analysis jobs and applications that already have a workflow job manager in place. The latter scenario can bring cases in which workflows need further adjustments in order to work efficiently in a globally distributed pool of resources. For instance, the generation of matrix elements for high energy physics processes via Madgraph5_aMC@NLO and the usage of tools not (yet) fully supported by the CMS software, such as TensorFlow with GPU support, are tasks with particular requirements. A special adaptation, either at the pool factory level (advertising GPU resources) or at the execute level (e.g., to handle special parameters that describe certain needs for the remote execute nodes during submission), is needed for these tasks to work adequately in the CMS Global Pool. This contribution describes the challenges and the efforts made towards adapting such workflows so that they can properly profit from the Global Pool via CMS Connect.

Research Organization:
Fermi National Accelerator Lab. (FNAL), Batavia, IL (United States)
Sponsoring Organization:
USDOE Office of Science (SC), High Energy Physics (HEP)
DOE Contract Number:
AC02-07CH11359
OSTI ID:
1843235
Report Number(s):
FERMILAB-CONF-19-833-SCD; oai:inspirehep.net:1760909
Journal Information:
EPJ Web Conf., Vol. 214
Country of Publication:
United States
Language:
English
