Bipartite graph partitioning and data clustering

Zha, Hongyuan; He, Xiaofeng; Ding, Chris; Gu, Ming; Simon, Horst D

doi:10.2172/816202

Title: Bipartite graph partitioning and data clustering

Technical Report · Mon May 07 00:00:00 EDT 2001

DOI:https://doi.org/10.2172/816202· OSTI ID:816202

Zha, Hongyuan; He, Xiaofeng; Ding, Chris; Gu, Ming; Simon, Horst D

Many data types arising from data mining applications can be modeled as bipartite graphs, examples include terms and documents in a text corpus, customers and purchasing items in market basket analysis and reviewers and movies in a movie recommender system. In this paper, the authors propose a new data clustering method based on partitioning the underlying biopartite graph. The partition is constructed by minimizing a normalized sum of edge weights between unmatched pairs of vertices of the bipartite graph. They show that an approximate solution to the minimization problem can be obtained by computing a partial singular value decomposition (SVD) of the associated edge weight matrix of the bipartite graph. They point out the connection of their clustering algorithm to correspondence analysis used in multivariate analysis. They also briefly discuss the issue of assigning data objects to multiple clusters. In the experimental results, they apply their clustering algorithm to the problem of document clustering to illustrate its effectiveness and efficiency.

View Technical Report

Cite

Export

Save

Research Organization:: Lawrence Berkeley National Lab. (LBNL), Berkeley, CA (United States)

Sponsoring Organization:: USDOE Director, Office of Science. Office of Advanced Scientific Computing Research. Mathematical, Information, and Computational Sciences Division (US)

DOE Contract Number:: AC03-76SF00098

OSTI ID:: 816202

Report Number(s):: LBNL-47970; R&D Project: 365935; TRN: US0304939

Resource Relation:: Other Information: PBD: 7 May 2001

Country of Publication:: United States

Language:: English

Similar Records

Evolving bipartite authentication graph partitions

Journal Article · Mon Jan 16 00:00:00 EST 2017 · IEEE Transactions on Dependable and Secure Computing · OSTI ID:816202

Pope, Aaron Scott; Tauritz, Daniel Remy; Kent, Alexander D.

A matrix-algebraic formulation of distributed-memory maximal cardinality matching algorithms in bipartite graphs

Journal Article · Mon May 16 00:00:00 EDT 2016 · Parallel Computing · OSTI ID:816202

Azad, Ariful; Buluç, Aydın

The polytope of block diagonal matrices and complete bipartite partitionings

Conference · Sat Dec 31 00:00:00 EST 1994 · OSTI ID:816202

Oosten, M; Crama, Y

Related Subjects

12 MANAGEMENT OF RADIOACTIVE WASTES, AND NON-RADIOACTIVE WASTES FROM NUCLEAR FACILITIES
99 GENERAL AND MISCELLANEOUS//MATHEMATICS, COMPUTING, AND INFORMATION SCIENCE
ALGORITHMS
EFFICIENCY
MARKET
MINIMIZATION
MINING
MULTIVARIATE ANALYSIS

Title: Bipartite graph partitioning and data clustering

Citation Formats

Similar Records

Related Subjects