| | |
Summary: SC'03, November 15-21, 2003, Phoenix, Arizona, USA
Copyright 2003 ACM 1-58113-695-1/03/0011...$5.00
A Metadata Catalog Service for Data Intensive Applications
Gurmeet Singh, Shishir Bharathi, Ann Chervenak, Ewa Deelman, Carl Kesselman,
Mary Manohar, Sonal Patil, Laura Pearlman
Information Sciences Institute, University of Southern California
Marina Del Rey, CA 90292
{gurmeet, shishir, annc, deelman, carl, mmanohar, sonal, laura}@isi.edu
Abstract
Advances in computational, storage and network technologies as well as middleware such as the
Globus Toolkit allow scientists to expand the sophistication and scope of data-intensive applications.
These applications produce and analyze terabytes and petabytes of data that are distributed in millions
of files or objects. To manage these large data sets efficiently, metadata or descriptive information
about the data needs to be managed. There are various types of metadata, and it is likely that a range
of metadata services will exist in Grid environments that are specialized for particular types of
metadata cataloguing and discovery. In this paper, we present the design of a Metadata Catalog
Service (MCS) that provides a mechanism for storing and accessing descriptive metadata and allows
users to query for data items based on desired attributes. We describe our experience in using the
MCS with several applications and present a scalability study of the service.
1 Introduction
|