Skip to main content
U.S. Department of Energy
Office of Scientific and Technical Information

A grid job monitoring system

Conference · · J.Phys.Conf.Ser.219:072051,2010
This paper presents a web-based Job Monitoring framework for individual Grid sites that allows users to follow in detail their jobs in quasi-real time. The framework consists of several independent components: (a) a set of sensors that run on the site CE and worker nodes and update a database, (b) a simple yet extensible web services framework and (c) an Ajax powered web interface having a look-and-feel and control similar to a desktop application. The monitoring framework supports LSF, Condor and PBS-like batch systems. This is one of the first monitoring systems where an X.509 authenticated web interface can be seamlessly accessed by both end-users and site administrators. While a site administrator has access to all the possible information, a user can only view the jobs for the Virtual Organizations (VO) he/she is a part of. The monitoring framework design supports several possible deployment scenarios. For a site running a supported batch system, the system may be deployed as a whole, or existing site sensors can be adapted and reused with the web services components. A site may even prefer to build the web server independently and choose to use only the Ajax powered web interface. Finally, the system is being used to monitor a glideinWMS instance. This broadens the scope significantly, allowing it to monitor jobs over multiple sites.
Research Organization:
Fermi National Accelerator Laboratory (FNAL), Batavia, IL
Sponsoring Organization:
USDOE
DOE Contract Number:
AC02-07CH11359
OSTI ID:
983373
Report Number(s):
FERMILAB-CONF-10-229-CD
Conference Information:
Journal Name: J.Phys.Conf.Ser.219:072051,2010
Country of Publication:
United States
Language:
English

Similar Records

Software for batch farms
Conference · Mon Jan 31 23:00:00 EST 2000 · OSTI ID:839252

Level-2 Milestone 4468: Lorenz Simulation Interface Beta Release
Technical Report · Tue Dec 20 23:00:00 EST 2011 · OSTI ID:1093932

Scalability and interoperability within glideinWMS
Conference · Thu Dec 31 23:00:00 EST 2009 · J.Phys.Conf.Ser.219:062036,2010 · OSTI ID:986995