skip to main content
OSTI.GOV title logo U.S. Department of Energy
Office of Scientific and Technical Information

Title: Statistically significant relational data mining :

Technical Report ·
DOI:https://doi.org/10.2172/1204082· OSTI ID:1204082

This report summarizes the work performed under the project (3z(BStatitically significant relational data mining.(3y (BThe goal of the project was to add more statistical rigor to the fairly ad hoc area of data mining on graphs. Our goal was to develop better algorithms and better ways to evaluate algorithm quality. We concetrated on algorithms for community detection, approximate pattern matching, and graph similarity measures. Approximate pattern matching involves finding an instance of a relatively small pattern, expressed with tolerance, in a large graph of data observed with uncertainty. This report gathers the abstracts and references for the eight refereed publications that have appeared as part of this work. We then archive three pieces of research that have not yet been published. The first is theoretical and experimental evidence that a popular statistical measure for comparison of community assignments favors over-resolved communities over approximations to a ground truth. The second are statistically motivated methods for measuring the quality of an approximate match of a small pattern in a large graph. The third is a new probabilistic random graph model. Statisticians favor these models for graph analysis. The new local structure graph model overcomes some of the issues with popular models such as exponential random graph models and latent variable models.

Research Organization:
Sandia National Lab. (SNL-NM), Albuquerque, NM (United States)
Sponsoring Organization:
USDOE National Nuclear Security Administration (NNSA)
DOE Contract Number:
AC04-94AL85000
OSTI ID:
1204082
Report Number(s):
SAND2014-1105; 498870
Country of Publication:
United States
Language:
English

Similar Records

Decomposition of Large Scale Semantic Graphsvia an Efficient Communities Algorithm
Technical Report · Fri Feb 08 00:00:00 EST 2008 · OSTI ID:1204082

Effects of ray profile modeling on resolution recovery in clinical CT
Journal Article · Sat Feb 15 00:00:00 EST 2014 · Medical Physics · OSTI ID:1204082

Effects of ray profile modeling on resolution recovery in clinical CT
Journal Article · Sat Feb 15 00:00:00 EST 2014 · Medical Physics · OSTI ID:1204082

Related Subjects