DOE PAGES · U.S. Department of Energy
Office of Scientific and Technical Information

Title: UNNT: A novel Utility for comparing Neural Net and Tree-based models

Journal Article · PLoS Computational Biology (Online)

Deep learning (DL) is steadily gaining traction in scientific challenges such as cancer research. Advances in data generation, machine learning algorithms, and compute infrastructure have accelerated the use of deep learning in various domains of cancer research, such as drug response problems. In our study, we explored tree-based models to improve the accuracy of a single drug response model and demonstrated that tree-based models such as XGBoost (eXtreme Gradient Boosting) have advantages over deep learning models, such as a convolutional neural network (CNN), for single drug response problems. However, comparing models is not a trivial task. To make training and comparing CNNs and XGBoost more accessible to users, we developed an open-source library called UNNT (A novel Utility for comparing Neural Net and Tree-based models). The case studies in this manuscript focus on cancer drug response datasets; however, the application can be used on datasets from other domains, such as chemistry.

Research Organization:
Univ. of Delaware, Newark, DE (United States)
Sponsoring Organization:
USDOE
OSTI ID:
2470381
Journal Information:
PLoS Computational Biology (Online), Vol. 20, Issue 4; ISSN 1553-7358
Publisher:
Public Library of Science
Country of Publication:
United States
Language:
English
