Summary: DataCutter: Middleware for Filtering Very Large Scientific Datasets
on Archival Storage Systems
Michael Beynon†, Renato Ferreira†, Tahsin Kurc†, Alan Sussman†, Joel Saltz†‡
† Department of Computer Science, University of Maryland, College Park, MD 20742
‡ Department of Pathology, Johns Hopkins Medical Institutions, Baltimore, MD 21287
{beynon,renato,kurc,als,saltz}@cs.umd.edu
Abstract
In this paper we present a middleware infrastructure, called DataCutter, that
enables processing of scientific datasets stored in archival storage systems across
a wide-area network. DataCutter provides support for subsetting of datasets through
multi-dimensional range queries, and for application-specific aggregation on scientific
datasets stored in an archival storage system. We also present experimental results from a pro-