MASSIVELY PARALLEL LATENT SEMANTIC ANALYSES USING A GRAPHICS PROCESSING UNIT
Journal Article
·
· Journal of Undergraduate Research
OSTI ID:1052114
Latent Semantic Analysis (LSA) aims to reduce the dimensions of large term-document datasets using Singular Value Decomposition. However, with the ever-expanding size of datasets, current implementations are not fast enough to quickly and easily compute the results on a standard PC. A graphics processing unit (GPU) can solve some highly parallel problems much faster than a traditional sequential processor or central processing unit (CPU). Thus, a deployable system using a GPU to speed up large-scale LSA processes would be a much more effective choice (in terms of cost/performance ratio) than using a PC cluster. Due to the GPU’s application-specifi c architecture, harnessing the GPU’s computational prowess for LSA is a great challenge. We presented a parallel LSA implementation on the GPU, using NVIDIA® Compute Unifi ed Device Architecture and Compute Unifi ed Basic Linear Algebra Subprograms software. The performance of this implementation is compared to traditional LSA implementation on a CPU using an optimized Basic Linear Algebra Subprograms library. After implementation, we discovered that the GPU version of the algorithm was twice as fast for large matrices (1 000x1 000 and above) that had dimensions not divisible by 16. For large matrices that did have dimensions divisible by 16, the GPU algorithm ran fi ve to six times faster than the CPU version. The large variation is due to architectural benefi ts of the GPU for matrices divisible by 16. It should be noted that the overall speeds for the CPU version did not vary from relative normal when the matrix dimensions were divisible by 16. Further research is needed in order to produce a fully implementable version of LSA. With that in mind, the research we presented shows that the GPU is a viable option for increasing the speed of LSA, in terms of cost/performance ratio.
- Research Organization:
- DOESC (USDOE Office of Science (SC) (United States))
- Sponsoring Organization:
- USDOE Office of Science (SC)
- OSTI ID:
- 1052114
- Journal Information:
- Journal of Undergraduate Research, Journal Name: Journal of Undergraduate Research Vol. 9
- Country of Publication:
- United States
- Language:
- English
Similar Records
Massively Parallel Latent Semantic Analyzes using a Graphics Processing Unit
Parallel Latent Semantic Analysis using a Graphics Processing Unit
Computation of Large Covariance Matrices by SAMMY on Graphical Processing Units and Multicore CPUs
Journal Article
·
Wed Dec 31 23:00:00 EST 2008
· Journal of Undergraduate Research
·
OSTI ID:986774
Parallel Latent Semantic Analysis using a Graphics Processing Unit
Conference
·
Wed Dec 31 23:00:00 EST 2008
·
OSTI ID:962623
Computation of Large Covariance Matrices by SAMMY on Graphical Processing Units and Multicore CPUs
Conference
·
Fri Dec 31 23:00:00 EST 2010
·
OSTI ID:1018596