Skip to main content
U.S. Department of Energy
Office of Scientific and Technical Information

Graphics Processing Unit Enhanced Parallel Document Flocking Clustering

Conference ·
OSTI ID:986787

Analyzing and clustering documents is a complex problem. One explored method of solving this problem borrows from nature, imitating the flocking behavior of birds. One limitation of this method of document clustering is its complexity O(n2). As the number of documents grows, it becomes increasingly difficult to generate results in a reasonable amount of time. In the last few years, the graphics processing unit (GPU) has received attention for its ability to solve highly-parallel and semi-parallel problems much faster than the traditional sequential processor. In this paper, we have conducted research to exploit this archi- tecture and apply its strengths to the flocking based document clustering problem. Using the CUDA platform from NVIDIA, we developed a doc- ument flocking implementation to be run on the NVIDIA GEFORCE GPU. Performance gains ranged from thirty-six to nearly sixty times improvement of the GPU over the CPU implementation.

Research Organization:
Oak Ridge National Laboratory (ORNL), Oak Ridge, TN (United States)
Sponsoring Organization:
ORNL LDRD Seed-Money; ORNL work for others
DOE Contract Number:
AC05-00OR22725
OSTI ID:
986787
Country of Publication:
United States
Language:
English

Similar Records

Flocking-based Document Clustering on the Graphics Processing Unit
Conference · Mon Dec 31 23:00:00 EST 2007 · OSTI ID:932628

FLOCKING-BASED DOCUMENT CLUSTERING ON THE GRAPHICS PROCESSING UNIT [Book Chapter]
Book · Mon Dec 31 23:00:00 EST 2007 · OSTI ID:1052106

Large-Scale Multi-Dimensional Document Clustering on GPU Clusters
Conference · Thu Dec 31 23:00:00 EST 2009 · OSTI ID:986781

Related Subjects