Visual Analytics of Heterogeneous Data Using Hypergraph Learning
- Stony Brook Univ., Stony Brook, NY (United States)
- Brookhaven National Lab. (BNL), Upton, NY (United States)
For real-world learning tasks (e.g., classification), graph-based models are commonly used to fuse the information distributed in diverse data sources, which can be heterogeneous, redundant, and incomplete. Thesemodels represent the relations in different datasets as pairwise links. However, these links cannot deal with high-order relations which connect multiple objects (e.g., in public health datasets, more than two patient groups admitted by the same hospital in 2014). In this article, we propose a visual analytics approach forthe classification on heterogeneous datasets using the hypergraph model. The hypergraph is an extension to traditional graphs in which a hyperedge connects multiple vertices instead of just two. We model various high-order relations in heterogeneous datasets as hyperedges and fuse different datasets with a unified hypergraph structure. We use the hypergraph learning algorithm for predicting missing labels in the datasets.To allow users to inject their domain knowledge into the model-learning process, we augment the traditional learning algorithm in a number of ways. In addition, we also propose a set of visualizations which enable the user to construct the hypergraph structure and the parameters of the learning model interactively during the analysis. Here, we demonstrate the capability of our approach via two real-world cases.
- Research Organization:
- Brookhaven National Laboratory (BNL), Upton, NY (United States)
- Sponsoring Organization:
- USDOE Office of Science (SC), Advanced Scientific Computing Research (SC-21)
- Grant/Contract Number:
- SC0012704
- OSTI ID:
- 1496572
- Report Number(s):
- BNL-211294-2019-JAAM
- Journal Information:
- ACM Transactions on Intelligent Systems and Technology, Vol. 10, Issue 1; ISSN 2157-6904
- Country of Publication:
- United States
- Language:
- English
Web of Science
Similar Records
Topological Analysis of Temporal Hypergraphs
Unsupervised Group Discovery and LInk Prediction in Relational Datasets: a nonparametric Bayesian approach