Skip to main content
U.S. Department of Energy
Office of Scientific and Technical Information

GPU-Accelerated Text Mining

Conference ·
OSTI ID:962625

Accelerating hardware devices represent a novel promise for improving the performance for many problem domains but it is not clear for which domains what accelerators are suitable. While there is no room in general-purpose processor design to significantly increase the processor frequency, developers are instead resorting to multi-core chips duplicating conventional computing capabilities on a single die. Yet, accelerators offer more radical designs with a much higher level of parallelism and novel programming environments. This present work assesses the viability of text mining on CUDA. Text mining is one of the key concepts that has become prominent as an effective means to index the Internet, but its applications range beyond this scope and extend to providing document similarity metrics, the subject of this work. We have developed and optimized text search algorithms for GPUs to exploit their potential for massive data processing. We discuss the algorithmic challenges of parallelization for text search problems on GPUs and demonstrate the potential of these devices in experiments by reporting significant speedups. Our study may be one of the first to assess more complex text search problems for suitability for GPU devices, and it may also be one of the first to exploit and report on atomic instruction usage that have recently become available in NVIDIA devices.

Research Organization:
Oak Ridge National Laboratory (ORNL)
Sponsoring Organization:
ORNL LDRD Seed-Money
DOE Contract Number:
AC05-00OR22725
OSTI ID:
962625
Country of Publication:
United States
Language:
English

Similar Records

Large-Scale Multi-Dimensional Document Clustering on GPU Clusters
Conference · Thu Dec 31 23:00:00 EST 2009 · OSTI ID:986781

Hands-on Performance Tuning of 3D Finite Difference Earthquake Simulation on GPU Fermi Chipset
Journal Article · Sat Jun 02 00:00:00 EDT 2012 · Procedia Computer Science · OSTI ID:1567289

Summer Student Internship working on the GPU
Technical Report · Thu Aug 15 00:00:00 EDT 2013 · OSTI ID:1090694