EnZymClass: Substrate specificity prediction tool of plant acyl-ACP thioesterases based on ensemble learning
Journal Article · Current Research in Biotechnology
- Pennsylvania State Univ., University Park, PA (United States); University of Illinois
- Univ. of Wisconsin, Madison, WI (United States)
- Pennsylvania State Univ., University Park, PA (United States)
Experimentally characterizing the functional properties of plant acyl-ACP thioesterases (TEs), a key enzyme class used in the production of renewable oleochemicals in microbial hosts, can be expensive and time-consuming because it requires manually screening thousands of candidates in a database. Using amino acid sequence to computationally predict an enzyme’s function might accelerate this process; however, obtaining the amount of information on previously characterized enzymes and their sequences that standard machine learning (ML) approaches need to accurately infer sequence-function relationships can be prohibitive, especially with a low-throughput testing cycle. Experimental noise and unbalanced datasets, in which high sequence similarity does not always imply identical functional properties, further hinder robust prediction performance. Herein we present an ML method, Ensemble method for enZyme Classification (EnZymClass), that is specifically designed to address these issues. We used EnZymClass to classify TEs into short, long, and mixed free fatty acid substrate specificity categories. While general guidelines for inferring substrate specificity have been proposed before, prediction of chain-length preference from primary sequence has remained elusive for plant acyl-ACP TEs. By applying EnZymClass to a subset of TEs in the ThYme database, we identified two medium-chain TEs, ClFatB3 and CwFatB2, with previously uncharacterized activity in E. coli fatty acid production hosts.
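The abstract does not spell out how EnZymClass combines its models, so the following is only a generic sketch of ensemble classification by majority vote over sequence-derived features, not the authors' implementation. Everything in it is an assumption made for illustration: the k-mer frequency features, the 1-nearest-neighbor base learners, the helper names (`kmer_profile`, `NearestNeighborKmer`, `ensemble_predict`), and the toy sequences, which are made-up strings rather than real thioesterases.

```python
from collections import Counter
from math import sqrt


def kmer_profile(seq, k):
    """Return normalized k-mer frequencies of a sequence as a dict."""
    counts = Counter(seq[i:i + k] for i in range(len(seq) - k + 1))
    total = sum(counts.values())
    if total == 0:  # sequence shorter than k
        return {}
    return {kmer: n / total for kmer, n in counts.items()}


def cosine(a, b):
    """Cosine similarity between two sparse frequency dicts."""
    dot = sum(v * b.get(key, 0.0) for key, v in a.items())
    norm_a = sqrt(sum(v * v for v in a.values()))
    norm_b = sqrt(sum(v * v for v in b.values()))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0


class NearestNeighborKmer:
    """Hypothetical base learner: 1-nearest-neighbor on k-mer profiles."""

    def __init__(self, k):
        self.k = k
        self.train = []

    def fit(self, seqs, labels):
        self.train = [(kmer_profile(s, self.k), y) for s, y in zip(seqs, labels)]
        return self

    def predict(self, seq):
        profile = kmer_profile(seq, self.k)
        # Label of the most similar training sequence.
        return max(self.train, key=lambda item: cosine(profile, item[0]))[1]


def ensemble_predict(models, seq):
    """Majority vote over base learners; ties go to the label seen first."""
    votes = Counter(model.predict(seq) for model in models)
    return votes.most_common(1)[0][0]


# Toy data (made up): three specificity classes, as in the abstract.
train_seqs = ["AAAAGAAAAG", "AAAGAAAAGA", "LLLLKLLLLK",
              "LLLKLLLLKL", "AAAALLLLAK", "LLAAALLAAK"]
train_labels = ["short", "short", "long", "long", "mixed", "mixed"]

# One base learner per k-mer length, combined by voting.
models = [NearestNeighborKmer(k).fit(train_seqs, train_labels) for k in (1, 2, 3)]
prediction = ensemble_predict(models, "AAAAGAAAGA")
```

The point of the sketch is the design choice the abstract alludes to: with few, noisy, unbalanced labeled examples, combining several learners that see the sequence differently can be more robust than relying on any single one.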
- Research Organization: CABBI, Urbana, IL (United States)
- Sponsoring Organization: USDOE Office of Science (SC), Biological and Environmental Research (BER)
- Grant/Contract Number: SC0018420
- OSTI ID: 1855996
- Journal Information: Current Research in Biotechnology, Vol. 4; ISSN 2590-2628
- Publisher: Elsevier
- Country of Publication: United States
- Language: English
Similar Records
deeprob/ThioesteraseEnzymeSpecificity: EnZymClass-first-release
Dataset · Thu Jun 24 20:00:00 EDT 2021 · OSTI ID: 3014139
Computational Redesign of Acyl-ACP Thioesterase with Improved Selectivity toward Medium-Chain-Length Fatty Acids
Journal Article · Wed Apr 19 20:00:00 EDT 2017 · ACS Catalysis · OSTI ID: 1408279
Chimeric Fatty Acyl-Acyl Carrier Protein Thioesterases Provide Mechanistic Insight into Enzyme Specificity and Expression
Journal Article · Thu Mar 15 20:00:00 EDT 2018 · Applied and Environmental Microbiology · OSTI ID: 1503630