Correcting evaluation bias of relational classifiers with network cross validation

Neville, Jennifer; Gallagher, Brian; Eliassi-Rad, Tina; Wang, Tao

doi:10.1007/s10115-010-0373-1

Title: Correcting evaluation bias of relational classifiers with network cross validation

Journal Article · Tue Jan 04 00:00:00 EST 2011 · Knowledge and Information Systems

DOI:https://doi.org/10.1007/s10115-010-0373-1· OSTI ID:1343835

Neville, Jennifer ^[1]; Gallagher, Brian ^[2]; Eliassi-Rad, Tina ^[3]; Wang, Tao ^[1]

Purdue Univ., West Lafayette, IN (United States)
Lawrence Livermore National Lab. (LLNL), Livermore, CA (United States)
Rutgers Univ., Piscataway, NJ (United States)

Recently, a number of modeling techniques have been developed for data mining and machine learning in relational and network domains where the instances are not independent and identically distributed (i.i.d.). These methods specifically exploit the statistical dependencies among instances in order to improve classification accuracy. However, there has been little focus on how these same dependencies affect our ability to draw accurate conclusions about the performance of the models. More specifically, the complex link structure and attribute dependencies in relational data violate the assumptions of many conventional statistical tests and make it difficult to use these tests to assess the models in an unbiased manner. In this work, we examine the task of within-network classification and the question of whether two algorithms will learn models that will result in significantly different levels of performance. We show that the commonly used form of evaluation (paired t-test on overlapping network samples) can result in an unacceptable level of Type I error. Furthermore, we show that Type I error increases as (1) the correlation among instances increases and (2) the size of the evaluation set increases (i.e., the proportion of labeled nodes in the network decreases). Lastly, we propose a method for network cross-validation that combined with paired t-tests produces more acceptable levels of Type I error while still providing reasonable levels of statistical power (i.e., 1–Type II error).

View Accepted Manuscript (DOE)

Cite

Export

Save

Research Organization:: Lawrence Livermore National Laboratory (LLNL), Livermore, CA (United States)

Sponsoring Organization:: USDOE

Grant/Contract Number:: AC52-07NA27344

OSTI ID:: 1343835

Report Number(s):: LLNL-JRNL-455699

Journal Information:: Knowledge and Information Systems, Vol. 30, Issue 1; ISSN 0219-1377

Publisher:: SpringerCopyright Statement

Country of Publication:: United States

Language:: English

Citation Metrics:

Cited by: 4 works

Citation information provided by
Web of Science

References (19)

Enhanced hypertext categorization using hyperlinks Chakrabarti, Soumen; Dom, Byron; Indyk, Piotr ACM SIGMOD Record, Vol. 27, Issue 2 https://doi.org/10.1145/276305.276332	journal	June 1998
Cautious Inference in Collective Classification McDowell, Luke K.; Gupta, Kalyan M.; Aha, David W. AAAI Conference on Artificial Intelligence https://doi.org/10.21236/ADA479429	report	July 2007
Learning relational probability trees Neville, Jennifer; Jensen, David; Friedland, Lisa Proceedings of the ninth ACM SIGKDD international conference on Knowledge discovery and data mining - KDD '03 https://doi.org/10.1145/956750.956830	conference	January 2003
Leveraging Relational Autocorrelation with Latent Group Models Neville, J.; Jensen, D. Fifth IEEE International Conference on Data Mining (ICDM'05) https://doi.org/10.1109/ICDM.2005.89	conference	January 2005
Distribution-based aggregation for relational learning with identifier attributes Perlich, Claudia; Provost, Foster Machine Learning, Vol. 62, Issue 1-2 https://doi.org/10.1007/s10994-006-6064-1	journal	January 2006
An Examination of Experimental Methodology for Classifiers of Relational Data Gallagher, Brian; Eliassi-Rad, Tina 2007 Seventh IEEE International Conference on Data Mining - Workshops (ICDM Workshops), Seventh IEEE International Conference on Data Mining Workshops (ICDMW 2007) https://doi.org/10.1109/ICDMW.2007.27	conference	October 2007
Exploiting Cluster-Structure to Predict the Labeling of a Graph Herbster, Mark Lecture Notes in Computer Science https://doi.org/10.1007/978-3-540-87987-9_9	book	January 2008
A Simple Relational Classifier Macskassy, Sofus A.; Provost, Foster https://doi.org/10.21236/ADA452802	report	January 2003
Using ghost edges for classification in sparsely labeled networks Gallagher, Brian; Tong, Hanghang; Eliassi-Rad, Tina Proceeding of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining - KDD 08 https://doi.org/10.1145/1401890.1401925	conference	January 2008
Approximate Statistical Tests for Comparing Supervised Classification Learning Algorithms Dietterich, Thomas G. Neural Computation, Vol. 10, Issue 7 https://doi.org/10.1162/089976698300017197	journal	October 1998
Simple estimators for relational Bayesian classifiers Neville, J.; Jensen, D.; Gallagher, B. Third IEEE International Conference on Data Mining https://doi.org/10.1109/ICDM.2003.1250989	conference	January 2003
Why collective inference improves relational classification Jensen, David; Neville, Jennifer; Gallagher, Brian Proceedings of the 2004 ACM SIGKDD international conference on Knowledge discovery and data mining - KDD '04 https://doi.org/10.1145/1014052.1014125	conference	January 2004
Collective Classification in Network Data Sen, Prithviraj; Namata, Galileo; Bilgic, Mustafa AI Magazine, Vol. 29, Issue 3 https://doi.org/10.1609/aimag.v29i3.2157	journal	September 2008
Evaluating Statistical Tests for Within-Network Classifiers of Relational Data Neville, Jennifer; Gallagher, Brian; Eliassi-Rad, Tina 2009 Ninth IEEE International Conference on Data Mining (ICDM) https://doi.org/10.1109/ICDM.2009.50	conference	December 2009
National Longitudinal Study of Adolescent to Adult Health (Add Health), 1994-2008 [Public Use] (ICPSR 21600). Version 18 Harris, Kathleen Mullan; Udry, J. Richard Data Sharing for Demographic Research , multi-year/versions https://doi.org/10.3886/icpsr21600.v18	dataset	October 2017
Enhanced hypertext categorization using hyperlinks Chakrabarti, Soumen; Dom, Byron; Indyk, Piotr Proceedings of the 1998 ACM SIGMOD international conference on Management of data - SIGMOD '98 https://doi.org/10.1145/276304.276332	conference	January 1998
Leveraging relational autocorrelation with latent group models Neville, Jennifer; Jensen, David Proceedings of the 4th international workshop on Multi-relational mining - MRDM '05 https://doi.org/10.1145/1090193.1090201	conference	January 2005
Leveraging relational autocorrelation with latent group models Neville, Jennifer; Jensen, David Schloss Dagstuhl - Leibniz-Zentrum für Informatik https://doi.org/10.4230/dagsemproc.05051.10	text	January 2006
Learning Probabilistic Relational Models Getoor, Lise; Friedman, Nir; Koller, Daphne Relational Data Mining https://doi.org/10.1007/978-3-662-04599-2_13	book	January 2001

Cited By (1)

Bayesian Model Selection on Random Networks Marios, Papamichalis arXiv https://doi.org/10.48550/arxiv.2004.13880	preprint	January 2020

Linked Research (24)

Similar Records

Final Technical Report - Applications of Machine Learning Techniques to Geothermal Play Fairway Analysis in the Great Basin Region, Nevada

Technical Report · Sat Feb 10 00:00:00 EST 2024 · OSTI ID:1343835

Faulds, James E; Smith, Connor M; Brown, Stephen; +13 more

Constructing the Nitrogen Flux Maps (NFMs) of Plants

Technical Report · Fri Mar 22 00:00:00 EDT 2024 · OSTI ID:1343835

Maeda, Hiroshi; Yoshikuni, Yasuo; Northen, Trent; +2 more

Definition of an Acceptable Glass composition Region (AGCR) via an Index System and a Partitioning Function

Technical Report · Sun Jun 26 00:00:00 EDT 2005 · OSTI ID:1343835

Peeler, D K; Taylor, A S; Edwards, T B

Related Subjects

99 GENERAL AND MISCELLANEOUS
97 MATHEMATICS, COMPUTING, AND INFORMATION SCIENCE
relational learning
collective classification
statistical tests
methodology