CRAB3: Establishing a new generation of services for distributed analysis at CMS
- CERN
- INFN, Bologna
- Madrid, CIEMAT
- INFN, Perugia
- Fermilab
In CMS Computing the highest priorities for analysis tools are the improvement of the end users ability to produce and publish reliable samples and analysis results as well as a transition to a sustainable development and operations model. To achieve these goals CMS decided to incorporate analysis processing into the same framework as data and simulation processing. This strategy foresees that all workload tools (TierO, Tier1, production, analysis) share a common core with long term maintainability as well as the standardization of the operator interfaces. The re-engineered analysis workload manager, called CRAB3, makes use of newer technologies, such as RESTFul based web services and NoSQL Databases, aiming to increase the scalability and reliability of the system. As opposed to CRAB2, in CRAB3 all work is centrally injected and managed in a global queue. A pool of agents, which can be geographically distributed, consumes work from the central services serving the user tasks. The new architecture of CRAB substantially changes the deployment model and operations activities. In this paper we present the implementation of CRAB3, emphasizing how the new architecture improves the workflow automation and simplifies maintainability. In particular, we will highlight the impact of the new design on daily operations.
- Research Organization:
- Fermi National Accelerator Lab. (FNAL), Batavia, IL (United States)
- Sponsoring Organization:
- USDOE Office of Science (SC), High Energy Physics (HEP)
- DOE Contract Number:
- AC02-07CH11359
- OSTI ID:
- 1405169
- Report Number(s):
- FERMILAB-CONF-12-845-CD; 1210957
- Journal Information:
- J.Phys.Conf.Ser., Vol. 396; Conference: 19th International Conference on Computing in High Energy and Nuclear Physics, New York, USA, 05/21-05/25/2012
- Country of Publication:
- United States
- Language:
- English
Similar Records
Use of DAGMan in CRAB3 to improve the splitting of CMS user jobs
AsyncStageOut: Distributed User Data Management for CMS Analysis