skip to main content
OSTI.GOV title logo U.S. Department of Energy
Office of Scientific and Technical Information

Title: MASSIVELY PARALLEL LATENT SEMANTIC ANALYSES USING A GRAPHICS PROCESSING UNIT

Journal Article · · Journal of Undergraduate Research
OSTI ID:1052114

Latent Semantic Analysis (LSA) aims to reduce the dimensions of large term-document datasets using Singular Value Decomposition. However, with the ever-expanding size of datasets, current implementations are not fast enough to quickly and easily compute the results on a standard PC. A graphics processing unit (GPU) can solve some highly parallel problems much faster than a traditional sequential processor or central processing unit (CPU). Thus, a deployable system using a GPU to speed up large-scale LSA processes would be a much more effective choice (in terms of cost/performance ratio) than using a PC cluster. Due to the GPU’s application-specifi c architecture, harnessing the GPU’s computational prowess for LSA is a great challenge. We presented a parallel LSA implementation on the GPU, using NVIDIA® Compute Unifi ed Device Architecture and Compute Unifi ed Basic Linear Algebra Subprograms software. The performance of this implementation is compared to traditional LSA implementation on a CPU using an optimized Basic Linear Algebra Subprograms library. After implementation, we discovered that the GPU version of the algorithm was twice as fast for large matrices (1 000x1 000 and above) that had dimensions not divisible by 16. For large matrices that did have dimensions divisible by 16, the GPU algorithm ran fi ve to six times faster than the CPU version. The large variation is due to architectural benefi ts of the GPU for matrices divisible by 16. It should be noted that the overall speeds for the CPU version did not vary from relative normal when the matrix dimensions were divisible by 16. Further research is needed in order to produce a fully implementable version of LSA. With that in mind, the research we presented shows that the GPU is a viable option for increasing the speed of LSA, in terms of cost/performance ratio.

Research Organization:
DOESC (USDOE Office of Science (SC) (United States))
Sponsoring Organization:
USDOE Office of Science (SC)
OSTI ID:
1052114
Journal Information:
Journal of Undergraduate Research, Vol. 9
Country of Publication:
United States
Language:
English

Similar Records

Massively Parallel Latent Semantic Analyzes using a Graphics Processing Unit
Journal Article · Thu Jan 01 00:00:00 EST 2009 · Journal of Undergraduate Research · OSTI ID:1052114

Parallel Latent Semantic Analysis using a Graphics Processing Unit
Conference · Thu Jan 01 00:00:00 EST 2009 · OSTI ID:1052114

FLOCKING-BASED DOCUMENT CLUSTERING ON THE GRAPHICS PROCESSING UNIT [Book Chapter]
Book · Tue Jan 01 00:00:00 EST 2008 · OSTI ID:1052114

Related Subjects