skip to main content
OSTI.GOV title logo U.S. Department of Energy
Office of Scientific and Technical Information

Title: Diverse Power Iteration Embeddings and Its Applications

Abstract

Abstract—Spectral Embedding is one of the most effective dimension reduction algorithms in data mining. However, its computation complexity has to be mitigated in order to apply it for real-world large scale data analysis. Many researches have been focusing on developing approximate spectral embeddings which are more efficient, but meanwhile far less effective. This paper proposes Diverse Power Iteration Embeddings (DPIE), which not only retains the similar efficiency of power iteration methods but also produces a series of diverse and more effective embedding vectors. We test this novel method by applying it to various data mining applications (e.g. clustering, anomaly detection and feature selection) and evaluating their performance improvements. The experimental results show our proposed DPIE is more effective than popular spectral approximation methods, and obtains the similar quality of classic spectral embedding derived from eigen-decompositions. Moreover it is extremely fast on big data applications. For example in terms of clustering result, DPIE achieves as good as 95% of classic spectral clustering on the complex datasets but 4000+ times faster in limited memory environment.

Authors:
; ; ;
Publication Date:
Research Org.:
Brookhaven National Lab. (BNL), Upton, NY (United States)
Sponsoring Org.:
USDOE SC OFFICE OF ADVANCED SCIENTIFIC COMPUTING RESEARCH
OSTI Identifier:
1164792
Report Number(s):
BNL-107039-2014-CP
DOE Contract Number:  
DE-AC02-98CH10886
Resource Type:
Conference
Resource Relation:
Conference: IEEE International Conference on Data Mining (ICDM) 2014; Shenzhen, China; 20141214 through 20141217
Country of Publication:
United States
Language:
English
Subject:
97 MATHEMATICS AND COMPUTING

Citation Formats

Huang, H., Yoo, S., Yu, D., and Qin, H. Diverse Power Iteration Embeddings and Its Applications. United States: N. p., 2014. Web.
Huang, H., Yoo, S., Yu, D., & Qin, H. Diverse Power Iteration Embeddings and Its Applications. United States.
Huang, H., Yoo, S., Yu, D., and Qin, H. 2014. "Diverse Power Iteration Embeddings and Its Applications". United States. https://www.osti.gov/servlets/purl/1164792.
@article{osti_1164792,
title = {Diverse Power Iteration Embeddings and Its Applications},
author = {Huang, H. and Yoo, S. and Yu, D. and Qin, H.},
abstractNote = {Abstract—Spectral Embedding is one of the most effective dimension reduction algorithms in data mining. However, its computation complexity has to be mitigated in order to apply it for real-world large scale data analysis. Many researches have been focusing on developing approximate spectral embeddings which are more efficient, but meanwhile far less effective. This paper proposes Diverse Power Iteration Embeddings (DPIE), which not only retains the similar efficiency of power iteration methods but also produces a series of diverse and more effective embedding vectors. We test this novel method by applying it to various data mining applications (e.g. clustering, anomaly detection and feature selection) and evaluating their performance improvements. The experimental results show our proposed DPIE is more effective than popular spectral approximation methods, and obtains the similar quality of classic spectral embedding derived from eigen-decompositions. Moreover it is extremely fast on big data applications. For example in terms of clustering result, DPIE achieves as good as 95% of classic spectral clustering on the complex datasets but 4000+ times faster in limited memory environment.},
doi = {},
url = {https://www.osti.gov/biblio/1164792}, journal = {},
number = ,
volume = ,
place = {United States},
year = {Sun Dec 14 00:00:00 EST 2014},
month = {Sun Dec 14 00:00:00 EST 2014}
}

Conference:
Other availability
Please see Document Availability for additional information on obtaining the full-text document. Library patrons may search WorldCat to identify libraries that hold this conference proceeding.

Save / Share: