Graphics Processing Unit Enhanced Parallel Document Flocking Clustering
- ORNL
Analyzing and clustering documents is a complex problem. One explored method of solving this problem borrows from nature, imitating the flocking behavior of birds. One limitation of this method of document clustering is its complexity O(n2). As the number of documents grows, it becomes increasingly difficult to generate results in a reasonable amount of time. In the last few years, the graphics processing unit (GPU) has received attention for its ability to solve highly-parallel and semi-parallel problems much faster than the traditional sequential processor. In this paper, we have conducted research to exploit this archi- tecture and apply its strengths to the flocking based document clustering problem. Using the CUDA platform from NVIDIA, we developed a doc- ument flocking implementation to be run on the NVIDIA GEFORCE GPU. Performance gains ranged from thirty-six to nearly sixty times improvement of the GPU over the CPU implementation.
- Research Organization:
- Oak Ridge National Lab. (ORNL), Oak Ridge, TN (United States)
- Sponsoring Organization:
- USDOE Laboratory Directed Research and Development (LDRD) Program; Work for Others (WFO)
- DOE Contract Number:
- AC05-00OR22725
- OSTI ID:
- 986787
- Resource Relation:
- Conference: ICSI 2010 - International Conference on Swarm Intelligence, BEIJING, China, 20100612, 20100615
- Country of Publication:
- United States
- Language:
- English
Similar Records
Flocking-based Document Clustering on the Graphics Processing Unit
Parallel Latent Semantic Analysis using a Graphics Processing Unit