DOE PAGES title logo U.S. Department of Energy
Office of Scientific and Technical Information

Title: DORA-XGB: an improved enzymatic reaction feasibility classifier trained using a novel synthetic data approach

Journal Article · · Molecular Systems Design & Engineering
DOI: https://doi.org/10.1039/D4ME00118D · OSTI ID:2480755
 [1];  [1];  [1]; ORCiD logo [1];  [1]
  1. Department of Chemical and Biological Engineering, Northwestern University, Evanston, IL, USA, Center for Synthetic Biology, Northwestern University, Evanston, IL, USA

We outline a method for synthetically generating negative data by considering alternative reaction centers on small-molecule substrates that are known to participate in enzymatic reactions.

Sponsoring Organization:
USDOE
Grant/Contract Number:
NONE; SC0018249; AC02-05CH11231
OSTI ID:
2480755
Journal Information:
Molecular Systems Design & Engineering, Journal Name: Molecular Systems Design & Engineering Journal Issue: 2 Vol. 10; ISSN MSDEBG; ISSN 2058-9689
Publisher:
Royal Society of Chemistry (RSC)Copyright Statement
Country of Publication:
United Kingdom
Language:
English

References (53)

Expanding Metabolic Capabilities Using Novel Pathway Designs: Computational Tools and Case Studies journal July 2019
A deep learning approach to evaluate the feasibility of enzymatic reactions generated by retrobiosynthesis journal January 2021
Exploring De Novo metabolic pathways from pyruvate to propionic acid journal March 2016
Neural-Symbolic Machine Learning for Retrosynthesis and Reaction Prediction journal February 2017
Graph Neural Networks for Molecules book October 2023
Network representation and analysis of energy coupling mechanisms in cellular metabolism by a graph-theoretical approach journal May 2022
Automatic tuning of hyperparameters using Bayesian optimization journal May 2020
Channeling in native microbial pathways: Implications and challenges for metabolic engineering journal November 2017
Design of computational retrobiosynthesis tools for the design of de novo synthetic pathways journal October 2015
Coal resources under carbon peak: Segmentation of massive laser point clouds for coal mining in underground dusty environments using integrated graph deep learning model journal December 2023
The BRENDA enzyme information system–From a database to an expert system journal November 2017
Toward the energy efficiency of resource allocation algorithms for OFDMA downlink MIMO systems journal December 2019
Metabolic channeling: predictions, deductions, and evidence journal September 2021
Metabolon formation and metabolic channeling in the biosynthesis of plant natural products journal June 2005
Metabolic networks: enzyme function and metabolite structure journal June 2004
Application of message passing neural networks for molecular property prediction journal August 2023
Generation of an atlas for commodity chemical production in Escherichia coli and a novel pathway prediction algorithm, GEM-Path journal September 2014
RetroPath2.0: A retrosynthesis workflow for metabolic engineers journal January 2018
Curating a comprehensive set of enzymatic reaction rules for efficient novel biosynthetic pathway design journal May 2021
The Generation of a Unique Machine Description for Chemical Structures-A Technique Developed at Chemical Abstracts Service. journal May 1965
Atom pairs as molecular features in structure-activity studies: definition and applications journal May 1985
Reoptimization of MDL Keys for Use in Drug Discovery journal November 2002
Extended-Connectivity Fingerprints journal April 2010
Retropath: Automated Pipeline for Embedded Metabolic Circuits journal October 2013
Absolute metabolite concentrations and implied enzyme active site occupancy in Escherichia coli journal June 2009
Nontargeted in vitro metabolomics for high-throughput identification of novel enzymes in Escherichia coli journal December 2016
Pathway design using de novo steps through uncharted biochemical spaces journal January 2018
A comprehensive metabolic map for production of bio-based chemicals journal January 2019
Principles and functions of metabolic compartmentalization journal October 2022
Enzyme promiscuity prediction using hierarchy-informed multi-label classification journal January 2021
eQuilibrator 3.0: a database solution for thermodynamic constant estimation journal November 2021
KEGG: new perspectives on genomes, pathways, diseases and drugs journal November 2016
RetroRules: a database of reaction rules for engineering biology journal October 2018
The MetaCyc database of metabolic pathways and enzymes - a 2019 update journal October 2019
Energy coupling in Saccharomyces cerevisiae: selected opportunities for metabolic engineering journal April 2012
Manufacturing Molecules Through Metabolic Engineering journal December 2010
Evolutionary-scale prediction of atomic-level protein structure with a language model journal March 2023
Enzyme function prediction using contrastive learning journal March 2023
The EcoCyc Database journal February 2018
XGBoost: A Scalable Tree Boosting System conference January 2016
Bias in machine learning software: why? how? what to do?
  • Chakraborty, Joymallya; Majumder, Suvodeep; Menzies, Tim
  • Proceedings of the 29th ACM Joint Meeting on European Software Engineering Conference and Symposium on the Foundations of Software Engineering https://doi.org/10.1145/3468264.3468537
conference August 2021
Stepping on the Gas to a Circular Economy: Accelerating Development of Carbon-Negative Chemical Production from Gas Fermentation journal June 2021
The influence of negative training set size on machine learning-based virtual screening journal June 2014
Pickaxe: a Python library for the prediction of novel metabolic reactions journal March 2023
MINEs: open access databases of computationally predicted enzyme promiscuity products for untargeted metabolomics journal August 2015
Mordred: a molecular descriptor calculator journal February 2018
Automated reaction database and reaction network analysis: extraction of reaction templates using cheminformatics journal March 2018
One molecular fingerprint to rule them all: drugs, biomolecules, and the metabolome journal June 2020
Could graph neural networks learn better molecular representation for drug discovery? A comparison study of descriptor-based and graph-based models journal February 2021
Pathway Thermodynamics Highlights Kinetic Obstacles in Central Metabolism journal February 2014
Predictive classifier models built from natural products with antimalarial bioactivity using machine learning approach journal September 2018
An empirical evaluation of sampling methods for the classification of imbalanced data journal July 2022
SMOTE: Synthetic Minority Over-sampling Technique journal January 2002