Title: Machine learning models for rat multigeneration reproductive toxicity prediction

Journal Article · Frontiers in Pharmacology
 [1];  [1];  [1];  [2];  [2];  [1];  [1]
  1. U.S. Food and Drug Administration (FDA), Jefferson, AR (United States)
  2. U.S. Food and Drug Administration (FDA), College Park, MD (United States)

Reproductive toxicity is one of the prominent endpoints in the risk assessment of environmental and industrial chemicals. Due to the complexity of the reproductive system, traditional reproductive toxicity testing in animals, especially guideline multigeneration reproductive toxicity studies, takes a long time and is expensive. Therefore, machine learning, as a promising alternative approach, should be considered when evaluating the reproductive toxicity of chemicals. We curated rat multigeneration reproductive toxicity testing data for 275 chemicals from ToxRefDB (Toxicity Reference Database) and developed predictive models using seven machine learning algorithms (decision tree, decision forest, random forest, k-nearest neighbors, support vector machine, linear discriminant analysis, and logistic regression). A consensus model was built from the seven individual models. An external validation set was curated from the COSMOS database and the literature. The performance of the individual and consensus models was evaluated using 500 iterations of 5-fold cross-validation and the external validation data set. The balanced accuracy of the models ranged from 58% to 65% in the 5-fold cross-validations and from 45% to 61% in the external validations. Prediction confidence analysis was conducted to provide additional information for more appropriate application of the developed models. These findings can increase confidence in machine learning models. We demonstrate the importance of using consensus models to harness the benefits of multiple machine learning models (i.e., using redundant systems to check the validity of outcomes). While we continue to build upon the models to better characterize weak toxicants, they already offer a way to save resources by screening out strong reproductive toxicants before investing in in vivo testing.
Machine learning models are offered as an approach for assessing the rat multigeneration reproductive toxicity of chemicals. Our results suggest that machine learning may be a promising alternative approach to evaluate the potential reproductive toxicity of chemicals.
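
The abstract names two quantities that are easy to make concrete: a consensus prediction over seven individual models and the balanced accuracy used to score it. The sketch below is illustrative only. The abstract does not state how the consensus was formed or how prediction confidence was computed, so simple majority voting and the fraction of agreeing models are assumptions here, and all labels and predictions are made-up toy values, not data from the study.

```python
from typing import List, Sequence, Tuple


def balanced_accuracy(y_true: Sequence[int], y_pred: Sequence[int]) -> float:
    """Mean of sensitivity and specificity for binary labels (1 = toxic, 0 = non-toxic)."""
    tp = sum(1 for t, p in zip(y_true, y_pred) if t == 1 and p == 1)
    tn = sum(1 for t, p in zip(y_true, y_pred) if t == 0 and p == 0)
    pos = sum(1 for t in y_true if t == 1)
    neg = sum(1 for t in y_true if t == 0)
    if pos == 0 or neg == 0:
        raise ValueError("both classes must be present in y_true")
    return 0.5 * (tp / pos + tn / neg)


def consensus_vote(model_preds: Sequence[Sequence[int]]) -> Tuple[List[int], List[float]]:
    """Majority vote per chemical (assumed scheme), plus the fraction of models
    that agree with the winning label as a rough confidence proxy."""
    n_models = len(model_preds)
    labels: List[int] = []
    agreement: List[float] = []
    for votes in zip(*model_preds):  # one tuple of votes per chemical
        positives = sum(votes)
        labels.append(1 if 2 * positives > n_models else 0)
        agreement.append(max(positives, n_models - positives) / n_models)
    return labels, agreement


# Hypothetical toy data: 4 chemicals, 7 models (one row per model).
y_true = [1, 1, 0, 0]
model_preds = [
    [1, 0, 0, 0],
    [1, 1, 0, 1],
    [1, 0, 0, 0],
    [1, 1, 1, 0],
    [0, 0, 0, 0],
    [1, 1, 0, 1],
    [1, 0, 0, 1],
]

labels, agreement = consensus_vote(model_preds)
ba = balanced_accuracy(y_true, labels)
print(labels, [round(a, 2) for a in agreement], ba)
```

With an odd number of binary classifiers, majority voting cannot tie, which is one practical reason to combine seven models this way. Low-agreement predictions (here 4 of 7 models) are the ones a confidence analysis would flag for caution.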

Research Organization:
Oak Ridge Institute for Science and Education (ORISE), Oak Ridge, TN (United States)
Sponsoring Organization:
USDOE Office of Science (SC); US Food and Drug Administration (FDA)
Grant/Contract Number:
SC0014664
OSTI ID:
1983097
Journal Information:
Frontiers in Pharmacology, Vol. 13; ISSN 1663-9812
Publisher:
Frontiers Research Foundation
Country of Publication:
United States
Language:
English