skip to main content
OSTI.GOV title logo U.S. Department of Energy
Office of Scientific and Technical Information

Title: Web Proxy Auto Discovery for the WLCG

Journal Article · · Journal of Physics. Conference Series
 [1];  [2];  [3];  [4];  [5];  [2]
  1. Fermi National Accelerator Lab. (FNAL), Batavia, IL (United States)
  2. European Organization for Nuclear Research (CERN), Geneva (Switzerland)
  3. Johns Hopkins Univ., Baltimore, MD (United States)
  4. Istituto Nazionale di Fisica Nucleare, Rome (Italy)
  5. Rutherford Appleton Lab., Oxfordshire (England)

All four of the LHC experiments depend on web proxies (that is, squids) at each grid site to support software distribution by the CernVM FileSystem (CVMFS). CMS and ATLAS also use web proxies for conditions data distributed through the Frontier Distributed Database caching system. ATLAS & CMS each have their own methods for their grid jobs to find out which web proxies to use for Frontier at each site, and CVMFS has a third method. Those diverse methods limit usability and flexibility, particularly for opportunistic use cases, where an experiment’s jobs are run at sites that do not primarily support that experiment. This paper describes a new Worldwide LHC Computing Grid (WLCG) system for discovering the addresses of web proxies. The system is based on an internet standard called Web Proxy Auto Discovery (WPAD). WPAD is in turn based on another standard called Proxy Auto Configuration (PAC). Both the Frontier and CVMFS clients support this standard. The input into the WLCG system comes from squids registered in the ATLAS Grid Information System (AGIS) and CMS SITECONF files, cross-checked with squids registered by sites in the Grid Configuration Database (GOCDB) and the OSG Information Management (OIM) system, and combined with some exceptions manually configured by people from ATLAS and CMS who operate WLCG Squid monitoring. WPAD servers at CERN respond to http requests from grid nodes all over the world with a PAC file that lists available web proxies, based on IP addresses matched from a database that contains the IP address ranges registered to organizations. Large grid sites are encouraged to supply their own WPAD web servers for more flexibility, to avoid being affected by short term long distance network outages, and to offload the WLCG WPAD servers at CERN. The CERN WPAD servers additionally support requests from jobs running at non-grid sites (particularly for LHC@Home) which it directs to the nearest publicly accessible web proxy servers. Furthermore, the responses to those requests are geographically ordered based on a separate database that maps IP addresses to longitude and latitude.

Research Organization:
Fermi National Accelerator Lab. (FNAL), Batavia, IL (United States)
Sponsoring Organization:
USDOE Office of Science (SC), High Energy Physics (HEP)
Grant/Contract Number:
AC02-07CH11359
OSTI ID:
1346930
Report Number(s):
FERMILAB-CONF-17-024-CD; 1517405
Journal Information:
Journal of Physics. Conference Series, Vol. 898; Conference: 22nd International Conference on Computing in High Energy and Nuclear Physics, San Francisco, CA (United States), 14-16 Oct 2016; ISSN 1742-6588
Publisher:
IOP PublishingCopyright Statement
Country of Publication:
United States
Language:
English

Similar Records

WLCG Web Proxy Auto Discovery for Dynamically Created Web Proxies
Conference · Mon Dec 02 00:00:00 EST 2019 · OSTI ID:1346930

The Open High Throughput Computing Content Delivery Network
Journal Article · Tue Sep 17 00:00:00 EDT 2019 · EPJ Web of Conferences · OSTI ID:1346930

Virtual machine provisioning, code management, and data movement design for the Fermilab HEPCloud Facility
Journal Article · Sun Oct 01 00:00:00 EDT 2017 · Journal of Physics. Conference Series · OSTI ID:1346930

Related Subjects