Efficient monitoring of CRAB jobs at CMS
Abstract
CRAB is a tool used for distributed analysis of CMS data. Users can submit sets of jobs with similar requirements (tasks) with a single request. CRAB uses a client-server architecture, where a lightweight client, a server, and ancillary services work together and are maintained by CMS operators at CERN. As with most complex software, good monitoring tools are crucial for efficient use and longterm maintainability. This work gives an overview of the monitoring tools developed to ensure the CRAB server and infrastructure are functional, help operators debug user problems, and minimize overhead and operating cost. This work also illustrates the design choices and gives a report on our experience with the tools we developed and the external ones we used.
- Authors:
-
- Univ. Estadual Paulista, Sao Paulo (Brazil)
- California Inst. of Technology (CalTech), Pasadena, CA (United States)
- Istituto Nazionale di Fisica Nucleare (INFN), Trieste (Italy)
- Istituto Nazionale di Fisica Nucleare (INFN), Perugia (Italy)
- Fermi National Accelerator Lab. (FNAL), Batavia, IL (United States)
- Vilnius Univ. (Lithuania)
- Univ. of Sofia (Bulgaria)
- Research Centre for Energy, Environment and Technology (CIEMAT), Madrid (Spain)
- Publication Date:
- Research Org.:
- Fermi National Accelerator Lab. (FNAL), Batavia, IL (United States)
- Sponsoring Org.:
- USDOE Office of Science (SC), High Energy Physics (HEP)
- OSTI Identifier:
- 1415638
- Report Number(s):
- FERMILAB-CONF-17-578-CD
Journal ID: ISSN 1742-6588; 1638628
- Grant/Contract Number:
- AC02-07CH11359
- Resource Type:
- Accepted Manuscript
- Journal Name:
- Journal of Physics. Conference Series
- Additional Journal Information:
- Journal Volume: 898; Journal Issue: 9; Conference: 22nd International Conference on Computing in High Energy and Nuclear Physics, San Francisco, CA, 10/10-10/14/2016; Journal ID: ISSN 1742-6588
- Publisher:
- IOP Publishing
- Country of Publication:
- United States
- Language:
- English
- Subject:
- 72 PHYSICS OF ELEMENTARY PARTICLES AND FIELDS; 97 MATHEMATICS AND COMPUTING
Citation Formats
Silva, J. M. D., Balcas, J., Belforte, S., Ciangottini, D., Mascheroni, M., Rupeika, E. A., Ivanov, T. T., Hernandez, J. M., and Vaandering, E. Efficient monitoring of CRAB jobs at CMS. United States: N. p., 2017.
Web. doi:10.1088/1742-6596/898/9/092036.
Silva, J. M. D., Balcas, J., Belforte, S., Ciangottini, D., Mascheroni, M., Rupeika, E. A., Ivanov, T. T., Hernandez, J. M., & Vaandering, E. Efficient monitoring of CRAB jobs at CMS. United States. https://doi.org/10.1088/1742-6596/898/9/092036
Silva, J. M. D., Balcas, J., Belforte, S., Ciangottini, D., Mascheroni, M., Rupeika, E. A., Ivanov, T. T., Hernandez, J. M., and Vaandering, E. Wed .
"Efficient monitoring of CRAB jobs at CMS". United States. https://doi.org/10.1088/1742-6596/898/9/092036. https://www.osti.gov/servlets/purl/1415638.
@article{osti_1415638,
title = {Efficient monitoring of CRAB jobs at CMS},
author = {Silva, J. M. D. and Balcas, J. and Belforte, S. and Ciangottini, D. and Mascheroni, M. and Rupeika, E. A. and Ivanov, T. T. and Hernandez, J. M. and Vaandering, E.},
abstractNote = {CRAB is a tool used for distributed analysis of CMS data. Users can submit sets of jobs with similar requirements (tasks) with a single request. CRAB uses a client-server architecture, where a lightweight client, a server, and ancillary services work together and are maintained by CMS operators at CERN. As with most complex software, good monitoring tools are crucial for efficient use and longterm maintainability. This work gives an overview of the monitoring tools developed to ensure the CRAB server and infrastructure are functional, help operators debug user problems, and minimize overhead and operating cost. This work also illustrates the design choices and gives a report on our experience with the tools we developed and the external ones we used.},
doi = {10.1088/1742-6596/898/9/092036},
journal = {Journal of Physics. Conference Series},
number = 9,
volume = 898,
place = {United States},
year = {Wed Nov 22 00:00:00 EST 2017},
month = {Wed Nov 22 00:00:00 EST 2017}
}