| | |
Summary: Enabling Information Integration and Workflows in a Grid
Environment with Automatic Wrapper Generation
Xuan Zhang Gagan Agrawal
Department of Computer Science and Engineering
Ohio State University, Columbus OH 43210 zhangx,agrawalĄ @cse.ohio-state.edu
ABSTRACT
With a growing trend towards grid-based data repositories and data
analysis services, scientific data analysis often involves accessing
multiple data sources, and analyzing the data using a variety of
analysis programs. One critical challenge in this, however, is that
data sources often hold the same type of data in a number of dif-
ferent formats, and also, the formats expected and generated by
various data analysis services are often distinct.
We believe that the traditional approach for dealing with this
problem, which is using hand-written wrappers, is not an effective
and scalable solution for a grid environment. This paper presents
a new approach, which involves generating wrappers automatically
for enabling grid-based information integration and workflows. In
this approach, a layout descriptor is used for describing the data
format for each data source, as well as the input and output format
|