
Title: A grid job monitoring system

Abstract

This paper presents a web-based Job Monitoring framework for individual Grid sites that allows users to follow in detail their jobs in quasi-real time. The framework consists of several independent components: (a) a set of sensors that run on the site CE and worker nodes and update a database, (b) a simple yet extensible web services framework and (c) an Ajax powered web interface having a look-and-feel and control similar to a desktop application. The monitoring framework supports LSF, Condor and PBS-like batch systems. This is one of the first monitoring systems where an X.509 authenticated web interface can be seamlessly accessed by both end-users and site administrators. While a site administrator has access to all the possible information, a user can only view the jobs for the Virtual Organizations (VO) he/she is a part of. The monitoring framework design supports several possible deployment scenarios. For a site running a supported batch system, the system may be deployed as a whole, or existing site sensors can be adapted and reused with the web services components. A site may even prefer to build the web server independently and choose to use only the Ajax powered web interface. Finally, the system is being used to monitor a glideinWMS instance. This broadens the scope significantly, allowing it to monitor jobs over multiple sites.
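
The access rule stated in the abstract (a site administrator sees everything, a user sees only the jobs of the VOs he or she belongs to) can be illustrated with a minimal Python sketch. The `Job` record, its fields, the `visible_jobs` function, and the sample VO names are hypothetical and chosen only for illustration; they are not taken from the paper, and the X.509 authentication step that would establish the viewer's identity is not modeled here.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Job:
    """Hypothetical job record, as a sensor might store it in the database."""
    job_id: str
    owner: str
    vo: str       # Virtual Organization the job was submitted under
    status: str


def visible_jobs(jobs, user_vos, is_admin=False):
    """Return the jobs a viewer may see.

    Assumption: the viewer's VO memberships and admin flag have already
    been derived from an authenticated credential elsewhere.
    """
    if is_admin:
        # Site administrators have access to all jobs.
        return list(jobs)
    allowed = set(user_vos)
    # Ordinary users only see jobs belonging to their own VOs.
    return [job for job in jobs if job.vo in allowed]


# Illustrative sample data (not from the paper).
jobs = [
    Job("1001", "alice", "cms", "running"),
    Job("1002", "bob", "atlas", "queued"),
    Job("1003", "carol", "cms", "done"),
]
```

Calling `visible_jobs(jobs, ["cms"])` would return only the two jobs submitted under the `cms` VO, while `visible_jobs(jobs, [], is_admin=True)` would return all three.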

Authors:
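Dumitrescu, Catalin (Fermilab); Nowack, Andreas (Aachen, Tech. Hochsch.); Padhi, Sanjay (UC, San Diego); Sarkar, Subir (INFN, Pisa; Pisa, Scuola Normale Superiore)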
Publication Date:
2010
Research Org.:
Fermi National Accelerator Lab. (FNAL), Batavia, IL (United States)
Sponsoring Org.:
USDOE
OSTI Identifier:
983373
Report Number(s):
FERMILAB-CONF-10-229-CD
TRN: US1004459
DOE Contract Number:
AC02-07CH11359
Resource Type:
Conference
Resource Relation:
Journal Name: J.Phys.Conf.Ser.219:072051,2010; Conference: Prepared for 17th International Conference on Computing in High Energy and Nuclear Physics (CHEP 09), Prague, Czech Republic, 21-27 Mar 2009
Country of Publication:
United States
Language:
English
Subject:
99 GENERAL AND MISCELLANEOUS//MATHEMATICS, COMPUTING, AND INFORMATION SCIENCE; DESIGN; MONITORING; MONITORS; NUCLEAR PHYSICS; Computing

Citation Formats

Dumitrescu, Catalin, Nowack, Andreas, Padhi, Sanjay, and Sarkar, Subir. A grid job monitoring system. United States: N. p., 2010. Web. doi:10.1088/1742-6596/219/7/072051.
Dumitrescu, Catalin, Nowack, Andreas, Padhi, Sanjay, & Sarkar, Subir. A grid job monitoring system. United States. doi:10.1088/1742-6596/219/7/072051.
Dumitrescu, Catalin, Nowack, Andreas, Padhi, Sanjay, and Sarkar, Subir. 2010. "A grid job monitoring system". United States. doi:10.1088/1742-6596/219/7/072051. https://www.osti.gov/servlets/purl/983373.
@article{osti_983373,
title = {A grid job monitoring system},
author = {Dumitrescu, Catalin and Nowack, Andreas and Padhi, Sanjay and Sarkar, Subir},
abstractNote = {This paper presents a web-based Job Monitoring framework for individual Grid sites that allows users to follow in detail their jobs in quasi-real time. The framework consists of several independent components: (a) a set of sensors that run on the site CE and worker nodes and update a database, (b) a simple yet extensible web services framework and (c) an Ajax powered web interface having a look-and-feel and control similar to a desktop application. The monitoring framework supports LSF, Condor and PBS-like batch systems. This is one of the first monitoring systems where an X.509 authenticated web interface can be seamlessly accessed by both end-users and site administrators. While a site administrator has access to all the possible information, a user can only view the jobs for the Virtual Organizations (VO) he/she is a part of. The monitoring framework design supports several possible deployment scenarios. For a site running a supported batch system, the system may be deployed as a whole, or existing site sensors can be adapted and reused with the web services components. A site may even prefer to build the web server independently and choose to use only the Ajax powered web interface. Finally, the system is being used to monitor a glideinWMS instance. This broadens the scope significantly, allowing it to monitor jobs over multiple sites.},
doi = {10.1088/1742-6596/219/7/072051},
journal = {J.Phys.Conf.Ser.},
number = {7},
volume = {219},
pages = {072051},
place = {United States},
year = {2010}
}

