Biosystems Design by Machine Learning
Abstract
Biosystems such as enzymes, pathways, and whole cells have been increasingly explored for biotechnological applications. Yet, the intricate connectivity and complexity of biosystems pose a major hurdle in designing biosystems with desired features. As -omics and other high throughput technologies have been rapidly developed, the promise of applying machine learning (ML) techniques in biosystems design has started to become a reality. ML models enable the identification of patterns within complicated biological data across multiple scales of analysis and can augment biosystems design applications by predicting new candidates for optimized performance. ML is being used at every stage of biosystems design to help find non-obvious engineering solutions with fewer design iterations. In this review, we first describe commonly used models and modeling paradigms within ML. We then discuss some applications of these models that have already shown success in biotechnological applications. Moreover, we discuss successful applications at all scales of biosystems design, including nucleic acids, genetic circuits, proteins, pathways, genomes, and bioprocess. Lastly, we discuss some limitations of these methods and potential solutions as well as prospects of the combination of ML and biosystems design.
- Authors:
-
- Center for Advanced Bioenergy and Bioproducts Innovation (CABBI), Urbana, IL (United States); Univ. of Illinois at Urbana-Champaign, IL (United States). Carl R. Woese Inst. for Genomic Biology and Dept. of Chemical and Biomolecular Engineering
- Univ. of Illinois at Urbana-Champaign, IL (United States). Dept. of Computer Science; Center for Advanced Bioenergy and Bioproducts Innovation (CABBI), Urbana, IL (United States)
- Univ. of Illinois at Urbana-Champaign, IL (United States). Dept. of Chemistry and Carl R. Woese Inst. for Genomic Biology; Center for Advanced Bioenergy and Bioproducts Innovation (CABBI), Urbana, IL (United States)
- Center for Advanced Bioenergy and Bioproducts Innovation (CABBI), Urbana, IL (United States); Univ. of Illinois at Urbana-Champaign, IL (United States). Carl R. Woese Inst. for Genomic Biology, Dept. of Chemical and Biomolecular Engineering, and Dept. of Chemistry
- Publication Date:
- Research Org.:
- Center for Advanced Bioenergy and Bioproducts Innovation (CABBI), Urbana, IL (United States); Univ. of Illinois at Urbana-Champaign, IL (United States)
- Sponsoring Org.:
- USDOE Office of Science (SC), Biological and Environmental Research (BER); National Institutes of Health (NIH)
- OSTI Identifier:
- 1632114
- Grant/Contract Number:
- SC0018420; SC0018260; 1UM1HG009402; 1U54DK107965; AI144967
- Resource Type:
- Accepted Manuscript
- Journal Name:
- ACS Synthetic Biology
- Additional Journal Information:
- Journal Volume: 9; Journal Issue: 7; Journal ID: ISSN 2161-5063
- Publisher:
- American Chemical Society (ACS)
- Country of Publication:
- United States
- Language:
- English
- Subject:
- 59 BASIC BIOLOGICAL SCIENCES; Machine learning; Biosystems design; Synthetic biology; Metabolic engineering
Citation Formats
Volk, Michael Jeffrey, Lourentzou, Ismini, Mishra, Shekhar, Vo, Lam Tung, Zhai, Chengxiang, and Zhao, Huimin. Biosystems Design by Machine Learning. United States: N. p., 2020.
Web. doi:10.1021/acssynbio.0c00129.
Volk, Michael Jeffrey, Lourentzou, Ismini, Mishra, Shekhar, Vo, Lam Tung, Zhai, Chengxiang, & Zhao, Huimin. Biosystems Design by Machine Learning. United States. https://doi.org/10.1021/acssynbio.0c00129
Volk, Michael Jeffrey, Lourentzou, Ismini, Mishra, Shekhar, Vo, Lam Tung, Zhai, Chengxiang, and Zhao, Huimin. Tue .
"Biosystems Design by Machine Learning". United States. https://doi.org/10.1021/acssynbio.0c00129. https://www.osti.gov/servlets/purl/1632114.
@article{osti_1632114,
title = {Biosystems Design by Machine Learning},
author = {Volk, Michael Jeffrey and Lourentzou, Ismini and Mishra, Shekhar and Vo, Lam Tung and Zhai, Chengxiang and Zhao, Huimin},
abstractNote = {Biosystems such as enzymes, pathways, and whole cells have been increasingly explored for biotechnological applications. Yet, the intricate connectivity and complexity of biosystems pose a major hurdle in designing biosystems with desired features. As -omics and other high throughput technologies have been rapidly developed, the promise of applying machine learning (ML) techniques in biosystems design has started to become a reality. ML models enable the identification of patterns within complicated biological data across multiple scales of analysis and can augment biosystems design applications by predicting new candidates for optimized performance. ML is being used at every stage of biosystems design to help find non-obvious engineering solutions with fewer design iterations. In this review, we first describe commonly used models and modeling paradigms within ML. We then discuss some applications of these models that have already shown success in biotechnological applications. Moreover, we discuss successful applications at all scales of biosystems design, including nucleic acids, genetic circuits, proteins, pathways, genomes, and bioprocess. Lastly, we discuss some limitations of these methods and potential solutions as well as prospects of the combination of ML and biosystems design.},
doi = {10.1021/acssynbio.0c00129},
journal = {ACS Synthetic Biology},
number = 7,
volume = 9,
place = {United States},
year = {Tue Jun 02 00:00:00 EDT 2020},
month = {Tue Jun 02 00:00:00 EDT 2020}
}
Web of Science
Works referenced in this record:
Biosystems design by directed evolution
journal, July 2019
- Wang, Yajie; Yu, Xiaowei; Zhao, Huimin
- AIChE Journal
Complex systems in metabolic engineering
journal, December 2015
- Winkler, James D.; Erickson, Keesha; Choudhury, Alaksh
- Current Opinion in Biotechnology, Vol. 36
Synthetic biology and regulatory networks: where metabolic systems biology meets control engineering
journal, April 2016
- He, Fei; Murabito, Ettore; Westerhoff, Hans V.
- Journal of The Royal Society Interface, Vol. 13, Issue 117
Metabolic engineering, synthetic biology and systems biology
journal, January 2012
- Nielsen, Jens; Pronk, Jack T.
- FEMS Yeast Research, Vol. 12, Issue 2
Systems biology of yeast: enabling technology for development of cell factories for production of advanced biofuels
journal, August 2012
- de Jong, Bouke; Siewers, Verena; Nielsen, Jens
- Current Opinion in Biotechnology, Vol. 23, Issue 4
Machine learning techniques for protein function prediction
journal, October 2019
- Bonetta, Rosalin; Valentino, Gianluca
- Proteins: Structure, Function, and Bioinformatics, Vol. 88, Issue 3
Using deep learning to model the hierarchical structure and function of a cell
journal, March 2018
- Ma, Jianzhu; Yu, Michael Ku; Fong, Samson
- Nature Methods, Vol. 15, Issue 4
Representation Learning: A Review and New Perspectives
journal, August 2013
- Bengio, Y.; Courville, A.; Vincent, P.
- IEEE Transactions on Pattern Analysis and Machine Intelligence, Vol. 35, Issue 8
Building high-level features using large scale unsupervised learning
conference, May 2013
- Le, Quoc V.
- ICASSP 2013 - 2013 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)
Multi-Label Classification: An Overview
journal, July 2007
- Tsoumakas, Grigorios; Katakis, Ioannis
- International Journal of Data Warehousing and Mining, Vol. 3, Issue 3
Taking the Human Out of the Loop: A Review of Bayesian Optimization
journal, January 2016
- Shahriari, Bobak; Swersky, Kevin; Wang, Ziyu
- Proceedings of the IEEE, Vol. 104, Issue 1
The Elements of Statistical Learning
book, January 2009
- Hastie, Trevor; Tibshirani, Robert; Friedman, Jerome
- Springer Series in Statistics
Unsupervised Learning: Foundations of Neural Computation
January 1999
- Hinton, Geoffrey; Sejnowski, Terrence J.
- The MIT Press
Reinforcement Learning Based Adaptive Sampling: REAPing Rewards by Exploring Protein Conformational Landscapes
journal, August 2018
- Shamsi, Zahra; Cheng, Kevin J.; Shukla, Diwakar
- The Journal of Physical Chemistry B, Vol. 122, Issue 35
A deep learning based data driven soft sensor for bioprocesses
journal, August 2018
- Gopakumar, Vineet; Tiwari, Sarthak; Rahman, Imran
- Biochemical Engineering Journal, Vol. 136
Active Learning
journal, June 2012
- Settles, Burr
- Synthesis Lectures on Artificial Intelligence and Machine Learning, Vol. 6, Issue 1
Automated analysis of high‐content microscopy data with deep learning
journal, April 2017
- Kraus, Oren Z.; Grys, Ben T.; Ba, Jimmy
- Molecular Systems Biology, Vol. 13, Issue 4
A Survey on Transfer Learning
journal, October 2010
- Pan, Sinno Jialin; Yang, Qiang
- IEEE Transactions on Knowledge and Data Engineering, Vol. 22, Issue 10
Solving large scale linear prediction problems using stochastic gradient descent algorithms
conference, January 2004
- Zhang, Tong
- Twenty-first international conference on Machine learning - ICML '04
Data Analysis Using Regression and Multilevel/Hierarchical Models
book, January 2006
- Gelman, Andrew; Hill, Jennifer
An Introduction to Kernel and Nearest-Neighbor Nonparametric Regression
journal, August 1992
- Altman, N. S.
- The American Statistician, Vol. 46, Issue 3
Deep Residual Learning for Image Recognition
conference, June 2016
- He, Kaiming; Zhang, Xiangyu; Ren, Shaoqing
- 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)
Long Short-Term Memory
journal, November 1997
- Hochreiter, Sepp; Schmidhuber, Jürgen
- Neural Computation, Vol. 9, Issue 8
Unified rational protein engineering with sequence-based deep representation learning
journal, October 2019
- Alley, Ethan C.; Khimulya, Grigory; Biswas, Surojit
- Nature Methods, Vol. 16, Issue 12
A deep learning framework for modeling structural features of RNA-binding protein targets
journal, October 2015
- Zhang, Sai; Zhou, Jingtian; Hu, Hailin
- Nucleic Acids Research, Vol. 44, Issue 4
DeepSol: a deep learning framework for sequence-based protein solubility prediction
journal, March 2018
- Khurana, Sameer; Rawi, Reda; Kunji, Khalid
- Bioinformatics, Vol. 34, Issue 15
DeepLoc: prediction of protein subcellular localization using deep learning
journal, July 2017
- Almagro Armenteros, José Juan; Sønderby, Casper Kaae; Sønderby, Søren Kaae
- Bioinformatics, Vol. 33, Issue 21
Construction of precise support vector machine based models for predicting promoter strength
journal, March 2017
- Meng, Hailin; Ma, Yingfei; Mai, Guoqin
- Quantitative Biology, Vol. 5, Issue 1
iTerm-PseKNC: a sequence-based tool for predicting bacterial transcriptional terminators
journal, October 2018
- Feng, Chao-Qin; Zhang, Zhao-Yue; Zhu, Xiao-Juan
- Bioinformatics, Vol. 35, Issue 9
Tuning the Performance of Synthetic Riboswitches using Machine Learning
journal, December 2018
- Groher, Ann-Christin; Jager, Sven; Schneider, Christopher
- ACS Synthetic Biology, Vol. 8, Issue 1
Prediction of aptamer–protein interacting pairs based on sparse autoencoder feature extraction and an ensemble classifier
journal, May 2019
- Yang, Qing; Jia, Cangzhi; Li, Taoying
- Mathematical Biosciences, Vol. 311
A novel nucleic acid sequence encoding strategy for high-performance aptamer identification and the aid of sequence design and optimization
journal, November 2017
- Yang, Qin; Wang, Sui-Ping; Yu, Xin-Liang
- Chemometrics and Intelligent Laboratory Systems, Vol. 170
Prediction of Aptamer-Target Interacting Pairs with Pseudo-Amino Acid Composition
journal, January 2014
- Li, Bi-Qing; Zhang, Yu-Chao; Huang, Guo-Hua
- PLoS ONE, Vol. 9, Issue 1
Prediction of aptamer-protein interacting pairs using an ensemble classifier in combination with various protein sequence attributes
journal, May 2016
- Zhang, Lina; Zhang, Chengjin; Gao, Rui
- BMC Bioinformatics, Vol. 17, Issue 1
The new frontier of genome engineering with CRISPR-Cas9
journal, November 2014
- Doudna, Jennifer A.; Charpentier, Emmanuelle
- Science, Vol. 346, Issue 6213
Optimized sgRNA design to maximize activity and minimize off-target effects of CRISPR-Cas9
journal, January 2016
- Doench, John G.; Fusi, Nicolo; Sullender, Meagan
- Nature Biotechnology, Vol. 34, Issue 2
CHOPCHOP v3: expanding the CRISPR web toolbox beyond genome editing
journal, May 2019
- Labun, Kornel; Montague, Tessa G.; Krause, Maximilian
- Nucleic Acids Research, Vol. 47, Issue W1
A machine learning approach for predicting CRISPR-Cas9 cleavage efficiencies and patterns underlying its mechanism of action
journal, October 2017
- Abadi, Shiran; Yan, Winston X.; Amar, David
- PLOS Computational Biology, Vol. 13, Issue 10
Deep learning improves prediction of CRISPR–Cpf1 guide RNA activity
journal, January 2018
- Kim, Hui Kwon; Min, Seonwoo; Song, Myungjae
- Nature Biotechnology, Vol. 36, Issue 3
DeepCRISPR: optimized CRISPR guide RNA design by deep learning
journal, June 2018
- Chuai, Guohui; Ma, Hanhui; Yan, Jifang
- Genome Biology, Vol. 19, Issue 1
Prediction of off-target activities for the end-to-end design of CRISPR guide RNAs
journal, January 2018
- Listgarten, Jennifer; Weinstein, Michael; Kleinstiver, Benjamin P.
- Nature Biomedical Engineering, Vol. 2, Issue 1
Off-target predictions in CRISPR-Cas9 gene editing using deep learning
journal, September 2018
- Lin, Jiecong; Wong, Ka-Chun
- Bioinformatics, Vol. 34, Issue 17
General Conclusions: Teleonomic Mechanisms in Cellular Metabolism, Growth, and Differentiation
journal, January 1961
- Monod, J.; Jacob, F.
- Cold Spring Harbor Symposia on Quantitative Biology, Vol. 26, Issue 0
Circuit simulation of genetic networks
journal, August 1995
- McAdams, H.; Shapiro, L.
- Science, Vol. 269, Issue 5224
Writing DNA with GenoCADTM
journal, May 2009
- Czar, M. J.; Cai, Y.; Peccoud, J.
- Nucleic Acids Research, Vol. 37, Issue Web Server
iBioSim: a tool for the analysis and design of genetic circuits
journal, July 2009
- Myers, C. J.; Barker, N.; Jones, K.
- Bioinformatics, Vol. 25, Issue 21
Genetic circuit design automation
journal, March 2016
- Nielsen, A. A. K.; Der, B. S.; Shin, J.
- Science, Vol. 352, Issue 6281
An atlas of gene regulatory networks reveals multiple three‐gene mechanisms for interpreting morphogen gradients
journal, January 2010
- Cotterell, James; Sharpe, James
- Molecular Systems Biology, Vol. 6, Issue 1
Optimal Regulatory Circuit Topologies for Fold-Change Detection
journal, February 2017
- Adler, Miri; Szekely, Pablo; Mayo, Avi
- Cell Systems, Vol. 4, Issue 2
Reverse Engineering the Gap Gene Network of Drosophila melanogaster
journal, May 2006
- Perkins, Theodore J.; Jaeger, Johannes; Reinitz, John
- PLoS Computational Biology, Vol. 2, Issue 5
Efficient Reverse-Engineering of a Developmental Gene Regulatory Network
journal, July 2012
- Crombach, Anton; Wotton, Karl R.; Cicin-Sain, Damjan
- PLoS Computational Biology, Vol. 8, Issue 7
Evolving Robust Gene Regulatory Networks
journal, January 2015
- Noman, Nasimul; Monjo, Taku; Moscato, Pablo
- PLOS ONE, Vol. 10, Issue 1
Evolving phenotypic networks in silico
journal, November 2014
- François, Paul
- Seminars in Cell & Developmental Biology, Vol. 35
Designing synthetic networks in silico: a generalised evolutionary algorithm approach
journal, December 2017
- Smith, Robert W.; van Sluijs, Bob; Fleck, Christian
- BMC Systems Biology, Vol. 11, Issue 1
Adapting machine-learning algorithms to design gene circuits
journal, April 2019
- Hiscock, Tom W.
- BMC Bioinformatics, Vol. 20, Issue 1
The Synthetic Biology Open Language (SBOL) provides a community standard for communicating designs in synthetic biology
journal, June 2014
- Galdzicki, Michal; Clancy, Kevin P.; Oberortner, Ernst
- Nature Biotechnology, Vol. 32, Issue 6
The game of chess and searches in protein sequence space
journal, December 1998
- Mandecki, Wlodek
- Trends in Biotechnology, Vol. 16, Issue 5
Exploring protein fitness landscapes by directed evolution
journal, December 2009
- Romero, Philip A.; Arnold, Frances H.
- Nature Reviews Molecular Cell Biology, Vol. 10, Issue 12
AAindex: amino acid index database, progress report 2008
journal, December 2007
- Kawashima, S.; Pokarowski, P.; Pokarowska, M.
- Nucleic Acids Research, Vol. 36, Issue Database
ProFET: Feature engineering captures high-level protein functions
journal, June 2015
- Ofer, Dan; Linial, Michal
- Bioinformatics, Vol. 31, Issue 21
Continuous Distributed Representation of Biological Sequences for Deep Proteomics and Genomics
journal, November 2015
- Asgari, Ehsaneddin; Mofrad, Mohammad R. K.
- PLOS ONE, Vol. 10, Issue 11
Learned protein embeddings for machine learning
journal, March 2018
- Yang, Kevin K.; Wu, Zachary; Bedbrook, Claire N.
- Bioinformatics, Vol. 34, Issue 15
Amino acid substitution matrices from protein blocks.
journal, November 1992
- Henikoff, S.; Henikoff, J. G.
- Proceedings of the National Academy of Sciences, Vol. 89, Issue 22, p. 10915-10919
Deep mutational scanning: a new style of protein science
journal, July 2014
- Fowler, Douglas M.; Fields, Stanley
- Nature Methods, Vol. 11, Issue 8
Deep learning
journal, May 2015
- LeCun, Yann; Bengio, Yoshua; Hinton, Geoffrey
- Nature, Vol. 521, Issue 7553
Metabolic Burden: Cornerstones in Synthetic Biology and Metabolic Engineering Applications
journal, August 2016
- Wu, Gang; Yan, Qiang; Jones, J. Andrew
- Trends in Biotechnology, Vol. 34, Issue 8
Metabolic pathway engineering
journal, March 2018
- Alper, Hal S.; Avalos, José L.
- Synthetic and Systems Biotechnology, Vol. 3, Issue 1
A machine learning approach to predict metabolic pathway dynamics from time-series multiomics data
journal, May 2018
- Costello, Zak; Martin, Hector Garcia
- npj Systems Biology and Applications, Vol. 4, Issue 1
Approaches to Computational Strain Design in the Multiomics Era
journal, April 2019
- St. John, Peter C.; Bomble, Yannick J.
- Frontiers in Microbiology, Vol. 10
Succinate Overproduction: A Case Study of Computational Strain Design Using a Comprehensive Escherichia coli Kinetic Model
journal, January 2015
- Khodayari, Ali; Chowdhury, Anupam; Maranas, Costas D.
- Frontiers in Bioengineering and Biotechnology, Vol. 2
A genome-scale Escherichia coli kinetic metabolic model k-ecoli457 satisfying flux data for multiple mutant strains
journal, December 2016
- Khodayari, Ali; Maranas, Costas D.
- Nature Communications, Vol. 7, Issue 1
Machine learning methods for metabolic pathway prediction
journal, January 2010
- Dale, Joseph M.; Popescu, Liviu; Karp, Peter D.
- BMC Bioinformatics, Vol. 11, Issue 1
Machine Learning Methods for Analysis of Metabolic Data and Metabolic Pathway Modeling
journal, January 2018
- Cuperlovic-Culf, Miroslava
- Metabolites, Vol. 8, Issue 1
Predicting Essential Genes and Proteins Based on Machine Learning and Network Topological Features: A Comprehensive Review
journal, March 2016
- Zhang, Xue; Acencio, Marcio Luis; Lemke, Ney
- Frontiers in Physiology, Vol. 7
Mathematical Modeling and Dynamic Simulation of Metabolic Reaction Systems Using Metabolome Time Series Data
journal, May 2016
- Sriyudthsak, Kansuporn; Shiraishi, Fumihide; Hirai, Masami Yokota
- Frontiers in Molecular Biosciences, Vol. 3
Principal component analysis of proteomics (PCAP) as a tool to direct metabolic engineering
journal, March 2015
- Alonso-Gutierrez, Jorge; Kim, Eun-Mi; Batth, Tanveer S.
- Metabolic Engineering, Vol. 28
From in vivo to in silico biology and back
journal, October 2006
- Di Ventura, Barbara; Lemerle, Caroline; Michalodimitrakis, Konstantinos
- Nature, Vol. 443, Issue 7111
Improving Metabolic Pathway Efficiency by Statistical Model-Based Multivariate Regulatory Metabolic Engineering
journal, August 2016
- Xu, Peng; Rizzoni, Elizabeth Anne; Sul, Se-Yeong
- ACS Synthetic Biology, Vol. 6, Issue 1
Towards a fully automated algorithm driven platform for biosystems design
journal, November 2019
- HamediRad, Mohammad; Chao, Ran; Weisberg, Scott
- Nature Communications, Vol. 10, Issue 1
Customized optimization of metabolic pathways by combinatorial transcriptional engineering
journal, June 2012
- Du, Jing; Yuan, Yongbo; Si, Tong
- Nucleic Acids Research, Vol. 40, Issue 18
Machine Learning of Designed Translational Control Allows Predictive Pathway Optimization in Escherichia coli
journal, December 2018
- Jervis, Adrian J.; Carbonell, Pablo; Vinaixa, Maria
- ACS Synthetic Biology, Vol. 8, Issue 1
DeePromoter: Robust Promoter Predictor Using Deep Learning
journal, April 2019
- Oubounyt, Mhaned; Louadi, Zakaria; Tayara, Hilal
- Frontiers in Genetics, Vol. 10
Using Genome-scale Models to Predict Biological Capabilities
journal, May 2015
- O’Brien, Edward J.; Monk, Jonathan M.; Palsson, Bernhard O.
- Cell, Vol. 161, Issue 5
Current status and applications of genome-scale metabolic models
journal, June 2019
- Gu, Changdai; Kim, Gi Bae; Kim, Won Jun
- Genome Biology, Vol. 20, Issue 1
An expanded genome-scale model of Escherichia coli K-12 (iJR904 GSM/GPR)
journal, August 2003
- Reed, Jennifer L.; Vo, Thuy D.; Schilling, Christophe H.
- Genome Biology, Vol. 4, Issue 9, p. R54
Leveraging knowledge engineering and machine learning for microbial bio-manufacturing
journal, July 2018
- Oyetunde, Tolutola; Bao, Forrest Sheng; Chen, Jiung-Wen
- Biotechnology Advances, Vol. 36, Issue 4
Rapid Prediction of Bacterial Heterotrophic Fluxomics Using Machine Learning and Constraint Programming
journal, April 2016
- Wu, Stephen Gang; Wang, Yuxuan; Jiang, Wu
- PLOS Computational Biology, Vol. 12, Issue 4
An integrated approach to characterize genetic interaction networks in yeast metabolism
journal, May 2011
- Szappanos, Balázs; Kovács, Károly; Szamecz, Béla
- Nature Genetics, Vol. 43, Issue 7
Systematizing the generation of missing metabolic knowledge
journal, June 2010
- Orth, Jeffrey D.; Palsson, Bernhard Ø.
- Biotechnology and Bioengineering, Vol. 107, Issue 3
Machine and deep learning meet genome-scale metabolic modeling
journal, July 2019
- Zampieri, Guido; Vijayakumar, Supreeta; Yaneske, Elisabeth
- PLOS Computational Biology, Vol. 15, Issue 7
Machine Learning Predicts the Yeast Metabolome from the Quantitative Proteome of Kinase Knockouts
journal, September 2018
- Zelezniak, Aleksej; Vowinckel, Jakob; Capuano, Floriana
- Cell Systems, Vol. 7, Issue 3
Genome-Scale Identification of Legionella pneumophila Effectors Using a Machine Learning Approach
journal, July 2009
- Burstein, David; Zusman, Tal; Degtyar, Elena
- PLoS Pathogens, Vol. 5, Issue 7
MIReNA: finding microRNAs with high accuracy and no learning at genome scale and from deep sequencing data
journal, June 2010
- Mathelier, Anthony; Carbone, Alessandra
- Bioinformatics, Vol. 26, Issue 18
Facilitate Collaborations among Synthetic Biology, Metabolic Engineering and Machine Learning
journal, March 2016
- Wu, Stephen Gang; Shimizu, Kazuyuki; Tang, Joseph Kuo-Hsiang
- ChemBioEng Reviews, Vol. 3, Issue 2
The potential of random forest and neural networks for biomass and recombinant protein modeling in Escherichia coli fed-batch fermentations
journal, August 2015
- Melcher, Michael; Scharl, Theresa; Spangl, Bernhard
- Biotechnology Journal, Vol. 10, Issue 11
Boosted structured additive regression for Escherichia coli fed-batch fermentation modeling: STAR Fermentation Modeling
journal, August 2016
- Melcher, Michael; Scharl, Theresa; Luchner, Markus
- Biotechnology and Bioengineering, Vol. 114, Issue 2
Data Mining and Analytics in the Process Industry: The Role of Machine Learning
journal, January 2017
- Ge, Zhiqiang; Song, Zhihuan; Ding, Steven X.
- IEEE Access, Vol. 5
Analysis of the tendency for the electronic conductivity to change during alcoholic fermentation
journal, April 2019
- Li, Chongwei; Wang, Yue; Sha, Shuang
- Scientific Reports, Vol. 9, Issue 1
Qualitative discrimination of yeast fermentation stages based on an olfactory visualization sensor system integrated with a pattern recognition algorithm
journal, January 2019
- Xu, Weidong; Jiang, Hui; Liu, Tong
- Analytical Methods, Vol. 11, Issue 26
User-friendly optimization approach of fed-batch fermentation conditions for the production of iturin A using artificial neural networks and support vector machine
journal, July 2015
- Chen, Fudi; Li, Hao; Xu, Zhihan
- Electronic Journal of Biotechnology, Vol. 18, Issue 4
Machine learning framework for assessment of microbial factory performance
journal, January 2019
- Oyetunde, Tolutola; Liu, Di; Martin, Hector Garcia
- PLOS ONE, Vol. 14, Issue 1
Evaluating Factors That Influence Microbial Synthesis Yields by Linear Regression with Numerical and Ordinal Variables
journal, November 2010
- Colletti, Peter F.; Goyal, Yogesh; Varman, Arul M.
- Biotechnology and Bioengineering, Vol. 108, Issue 4
Statistics-based model for prediction of chemical biosynthesis yield from Saccharomyces cerevisiae
journal, January 2011
- Varman, Arul M.; Xiao, Yi; Leonard, Effendi
- Microbial Cell Factories, Vol. 10, Issue 1
Sequencing technologies — the next generation
journal, December 2009
- Metzker, Michael L.
- Nature Reviews Genetics, Vol. 11, Issue 1
Analytics for Metabolic Engineering
journal, September 2015
- Petzold, Christopher J.; Chan, Leanne Jade G.; Nhan, Melissa
- Frontiers in Bioengineering and Biotechnology, Vol. 3
Engineering Cellular Metabolism
journal, March 2016
- Nielsen, Jens; Keasling, Jay D.
- Cell, Vol. 164, Issue 6
Engineering biological systems using automated biofoundries
journal, July 2017
- Chao, Ran; Mishra, Shekhar; Si, Tong
- Metabolic Engineering, Vol. 42
Building biological foundries for next-generation synthetic biology
journal, May 2015
- Chao, Ran; Yuan, YongBo; Zhao, HuiMin
- Science China Life Sciences, Vol. 58, Issue 7
Automated multiplex genome-scale engineering in yeast
journal, May 2017
- Si, Tong; Chao, Ran; Min, Yuhao
- Nature Communications, Vol. 8, Issue 1
Machine Learning in Medicine
journal, April 2019
- Rajkomar, Alvin; Dean, Jeffrey; Kohane, Isaac
- New England Journal of Medicine, Vol. 380, Issue 14
Improving catalytic function by ProSAR-driven enzyme evolution
journal, February 2007
- Fox, Richard J.; Davis, S. Christopher; Mundorff, Emily C.
- Nature Biotechnology, Vol. 25, Issue 3
Q-learning
journal, May 1992
- Watkins, Christopher J. C. H.; Dayan, Peter
- Machine Learning, Vol. 8, Issue 3-4
Human-level control through deep reinforcement learning
journal, February 2015
- Mnih, Volodymyr; Kavukcuoglu, Koray; Silver, David
- Nature, Vol. 518, Issue 7540
Mastering the game of Go without human knowledge
journal, October 2017
- Silver, David; Schrittwieser, Julian; Simonyan, Karen
- Nature, Vol. 550, Issue 7676
A general reinforcement learning algorithm that masters chess, shogi, and Go through self-play
journal, December 2018
- Silver, David; Hubert, Thomas; Schrittwieser, Julian
- Science, Vol. 362, Issue 6419
Unsupervised word sense disambiguation rivaling supervised methods
conference, January 1995
- Yarowsky, David
- Proceedings of the 33rd annual meeting on Association for Computational Linguistics -
Effective self-training for parsing
conference, January 2006
- McClosky, David; Charniak, Eugene; Johnson, Mark
- Proceedings of the main conference on Human Language Technology Conference of the North American Chapter of the Association of Computational Linguistics -
Combining labeled and unlabeled data with co-training
conference, January 1998
- Blum, Avrim; Mitchell, Tom
- Proceedings of the eleventh annual conference on Computational learning theory - COLT' 98
Virtual Adversarial Training: A Regularization Method for Supervised and Semi-Supervised Learning
journal, August 2019
- Miyato, Takeru; Maeda, Shin-Ichi; Koyama, Masanori
- IEEE Transactions on Pattern Analysis and Machine Intelligence, Vol. 41, Issue 8
Dual Strategy Active Learning
book, January 2007
- Donmez, Pinar; Carbonell, Jaime G.; Bennett, Paul N.
- Machine Learning: ECML 2007
Active learning using pre-clustering
conference, January 2004
- Nguyen, Hieu T.; Smeulders, Arnold
- Twenty-first international conference on Machine learning - ICML '04
Inductive transfer learning for molecular activity prediction: Next-Gen QSAR Models with MolPMoFiT
journal, April 2020
- Li, Xinhao; Fourches, Denis
- Journal of Cheminformatics, Vol. 12, Issue 1
I-Mutant2.0: predicting stability changes upon mutation from the protein sequence or structure
journal, July 2005
- Capriotti, E.; Fariselli, P.; Casadio, R.
- Nucleic Acids Research, Vol. 33, Issue Web Server
Predicting protein stability changes from sequences using support vector machines
journal, September 2005
- Capriotti, E.; Fariselli, P.; Calabrese, R.
- Bioinformatics, Vol. 21, Issue Suppl 2
Prediction of protein stability changes for single-site mutations using support vector machines
journal, December 2005
- Cheng, Jianlin; Randall, Arlo; Baldi, Pierre
- Proteins: Structure, Function, and Bioinformatics, Vol. 62, Issue 4
In silico characterization of protein chimeras: Relating sequence and function within the same fold
journal, October 2009
- Buske, Fabian A.; Their, Ricarda; Gillam, Elizabeth M. J.
- Proteins: Structure, Function, and Bioinformatics, Vol. 77, Issue 1
ProTherm, Thermodynamic Database for Proteins and Mutants: developments in version 3.0
journal, January 2002
- Gromiha, M. M.
- Nucleic Acids Research, Vol. 30, Issue 1
Predicting changes in protein thermostability brought about by single- or multi-site mutations
journal, January 2010
- Tian, Jian; Wu, Ningfeng; Chu, Xiaoyu
- BMC Bioinformatics, Vol. 11, Issue 1
Grading amino acid properties increased accuracies of single point mutation on protein stability prediction
journal, March 2012
- Liu, Jianguo; Kang, Xianjiang
- BMC Bioinformatics, Vol. 13, Issue 1
PROTS-RF: A Robust Model for Predicting Mutation-Induced Protein Stability Changes
journal, October 2012
- Li, Yunqi; Fang, Jianwen
- PLoS ONE, Vol. 7, Issue 10
Structure Based Thermostability Prediction Models for Protein Single Point Mutations with Machine Learning Tools
journal, September 2015
- Jia, Lei; Yarlagadda, Ramya; Reed, Charles C.
- PLOS ONE, Vol. 10, Issue 9
mCSM: predicting the effects of mutations in proteins using graph-based signatures
journal, November 2013
- Pires, Douglas E. V.; Ascher, David B.; Blundell, Tom L.
- Bioinformatics, Vol. 30, Issue 3
Navigating the protein fitness landscape with Gaussian processes
journal, December 2012
- Romero, P. A.; Krause, A.; Arnold, F. H.
- Proceedings of the National Academy of Sciences, Vol. 110, Issue 3
mGPfusion: predicting protein stability changes with Gaussian process kernel learning and data fusion
journal, June 2018
- Jokinen, Emmi; Heinonen, Markus; Lähdesmäki, Harri
- Bioinformatics, Vol. 34, Issue 13
Fast and accurate predictions of protein stability changes upon mutations using statistical potentials and neural networks: PoPMuSiC-2.0
journal, August 2009
- Dehouck, Yves; Grosfils, Aline; Folch, Benjamin
- Bioinformatics, Vol. 25, Issue 19
NeEMO: a method using residue interaction networks to improve prediction of protein stability upon mutation
journal, January 2014
- Giollo, Manuel; Martin, Alberto JM; Walsh, Ian
- BMC Genomics, Vol. 15, Issue Suppl 4
PROSO II - a new method for protein solubility prediction: PROSO II
journal, May 2012
- Smialowski, Pawel; Doose, Gero; Torkler, Phillipp
- FEBS Journal, Vol. 279, Issue 12
Evaluation of methods for modeling transcription factor sequence specificity
journal, January 2013
- Weirauch, Matthew T.; Cote, Atina; Norel, Raquel
- Nature Biotechnology, Vol. 31, Issue 2
Predicting the sequence specificities of DNA- and RNA-binding proteins by deep learning
journal, July 2015
- Alipanahi, Babak; Delong, Andrew; Weirauch, Matthew T.
- Nature Biotechnology, Vol. 33, Issue 8
Convolutional neural network architectures for predicting DNA–protein binding
journal, June 2016
- Zeng, Haoyang; Edwards, Matthew D.; Liu, Ge
- Bioinformatics, Vol. 32, Issue 12
sc-PDB: a 3D-database of ligandable binding sites—10 years on
journal, October 2014
- Desaphy, Jérémy; Bret, Guillaume; Rognan, Didier
- Nucleic Acids Research, Vol. 43, Issue D1
DeepSite: protein-binding site predictor using 3D-convolutional neural networks
journal, May 2017
- Jiménez, J.; Doerr, S.; Martínez-Rosell, G.
- Bioinformatics, Vol. 33, Issue 19
Learning epistatic interactions from sequence-activity data to predict enantioselectivity
journal, December 2017
- Zaugg, Julian; Gumulya, Yosephine; Malde, Alpeshkumar K.
- Journal of Computer-Aided Molecular Design, Vol. 31, Issue 12
BRENDA, enzyme data and metabolic information
journal, January 2002
- Schomburg, I.
- Nucleic Acids Research, Vol. 30, Issue 1
Semisupervised Gaussian Process for Automated Enzyme Search
journal, March 2016
- Mellor, Joseph; Grigoras, Ioana; Carbonell, Pablo
- ACS Synthetic Biology, Vol. 5, Issue 6
Machine learning-assisted directed protein evolution with combinatorial libraries
journal, April 2019
- Wu, Zachary; Kan, S. B. Jennifer; Lewis, Russell D.
- Proceedings of the National Academy of Sciences, Vol. 116, Issue 18
Machine Learning Identifies Chemical Characteristics That Promote Enzyme Catalysis
journal, February 2019
- Bonk, Brian M.; Weis, James W.; Tidor, Bruce
- Journal of the American Chemical Society, Vol. 141, Issue 9
Global Topology Analysis of the Escherichia coli Inner Membrane Proteome
journal, May 2005
- Daley, D. O.
- Science, Vol. 308, Issue 5726
A statistical model for improved membrane protein expression using sequence-derived features
journal, March 2018
- Saladi, Shyam M.; Javed, Nauman; Müller, Axel
- Journal of Biological Chemistry, Vol. 293, Issue 13
Machine learning to design integral membrane channelrhodopsins for efficient eukaryotic expression and plasma membrane localization
journal, October 2017
- Bedbrook, Claire N.; Yang, Kevin K.; Rice, Austin J.
- PLOS Computational Biology, Vol. 13, Issue 10
UniProt: a worldwide hub of protein knowledge
November 2018
- Consortium, The UniProt
- Nucleic Acids Research
Machine-Learning-Guided Mutagenesis for Directed Evolution of Fluorescent Proteins
journal, August 2018
- Saito, Yutaka; Oikawa, Misaki; Nakazawa, Hikaru
- ACS Synthetic Biology, Vol. 7, Issue 9
The artificial neural network approach based on uniform design to optimize the fed-batch fermentation condition: application to the production of iturin A
journal, January 2014
- Peng, Wenjing; Zhong, Juan; Yang, Jie
- Microbial Cell Factories, Vol. 13, Issue 1
Predicting growth rate from gene expression
journal, December 2018
- Wytock, Thomas P.; Motter, Adilson E.
- Proceedings of the National Academy of Sciences, Vol. 116, Issue 2
Data denoising with transfer learning in single-cell transcriptomics
journal, August 2019
- Wang, Jingshu; Agarwal, Divyansh; Huang, Mo
- Nature Methods, Vol. 16, Issue 9
Single-cell RNA-seq denoising using a deep count autoencoder
journal, January 2019
- Eraslan, Gökcen; Simon, Lukas M.; Mircea, Maria
- Nature Communications, Vol. 10, Issue 1