| | |
Summary: Supporting High Performance Bioinformatics Flat-File Data Processing Using
Indices
Xuan Zhang Gagan Agrawal
Department of Computer Science and Engineering
Ohio State University
Columbus, OH, 43220
{zhangx,agrawal}@cse.ohio-state.edu
Abstract
As an essential part of in vitro analysis, biological
database query has become more and more important in
the research process. A few challenges that are specific
to bioinformatics applications are data heterogeneity, large
data volume and exponential data growth, constant appear-
ance of new data types and data formats. We have devel-
oped an integration system that processes data in their flat
file formats. Its advantages include the reduction of over-
head and programming efforts. In the paper, we discuss the
usage of indicing techniques on top of this flat file query sys-
tem. Besides the advantage of processing flat files directly,
the system also improves its performance and functionality
|