Iterative random forests to discover predictive and stable high-order interactions
Journal Article
·
· Proceedings of the National Academy of Sciences of the United States of America
- Department of Biological Statistics and Computational Biology, Cornell University, Ithaca, NY 14853,, Department of Statistical Science, Cornell University, Ithaca, NY 14853,, Data Driven Decisions Department, Preminon LLC, Antioch, CA 94531,
- Statistics Department, University of California, Berkeley, CA 94720,
- Data Driven Decisions Department, Preminon LLC, Antioch, CA 94531,, Statistics Department, University of California, Berkeley, CA 94720,, Centre for Computational Biology, School of Biosciences, University of Birmingham, Edgbaston B15 2TT, United Kingdom,, Molecular Ecosystems Biology Department, Biosciences Area, Lawrence Berkeley National Laboratory, Berkeley, CA 94720,
- Data Driven Decisions Department, Preminon LLC, Antioch, CA 94531,, Statistics Department, University of California, Berkeley, CA 94720,, Department of Electrical Engineering and Computer Sciences, University of California, Berkeley, CA 94720
Significance We developed a predictive, stable, and interpretable tool: the iterative random forest algorithm (iRF). iRF discovers high-order interactions among biomolecules with the same order of computational cost as random forests. We demonstrate the efficacy of iRF by finding known and promising interactions among biomolecules, of up to fifth and sixth order, in two data examples in transcriptional regulation and alternative splicing.
- Sponsoring Organization:
- USDOE
- Grant/Contract Number:
- SC0017069
- OSTI ID:
- 1417528
- Journal Information:
- Proceedings of the National Academy of Sciences of the United States of America, Journal Name: Proceedings of the National Academy of Sciences of the United States of America Journal Issue: 8 Vol. 115; ISSN 0027-8424
- Publisher:
- Proceedings of the National Academy of SciencesCopyright Statement
- Country of Publication:
- United States
- Language:
- English
Similar Records
Ranger-based Iterative Random Forest
A High-Performance Computing Implementation of Iterative Random Forest for the Creation of Predictive Expression Networks
iRF v2.0
Software
·
2019
·
OSTI ID:code-45760
A High-Performance Computing Implementation of Iterative Random Forest for the Creation of Predictive Expression Networks
Journal Article
·
2019
· Genes
·
OSTI ID:1576840
+3 more
iRF v2.0
Software
·
2021
·
OSTI ID:code-56049
+1 more