DOE PAGES title logo U.S. Department of Energy
Office of Scientific and Technical Information

Title: Deep data analytics for genetic engineering of diatoms linking genotype to phenotype via machine learning

Journal Article · · npj Computational Materials
ORCiD logo [1];  [1]; ORCiD logo [1]; ORCiD logo [1]; ORCiD logo [1];  [2]; ORCiD logo [1];  [1];  [1]; ORCiD logo [1]; ORCiD logo [1]; ORCiD logo [1]; ORCiD logo [1]; ORCiD logo [1]; ORCiD logo [1]
  1. Oak Ridge National Lab. (ORNL), Oak Ridge, TN (United States)
  2. Univ. of California, San Diego, La Jolla, CA (United States)

Genome engineering for materials synthesis is a promising avenue for manufacturing materials with unique properties under ambient conditions. Biomineralization in diatoms, unicellular algae that use silica to construct micron-scale cell walls with nanoscale features, is an attractive candidate for functional synthesis of materials for applications including photonics, sensing, filtration, and drug delivery. Therefore, controllably modifying diatom structure through targeted genetic modifications for these applications is a very promising field. In this work, we used gene knockdown in Thalassiosira pseudonana diatoms to create modified strains with changes to structural morphology and linked genotype to phenotype using supervised machine learning. An artificial neural network (NN) was developed to distinguish wild and modified diatoms based on the SEM images of frustules exhibiting phenotypic changes caused by a specific protein (Thaps3_21880), resulting in 94% detection accuracy. Class activation maps visualized physical changes that allowed the NNs to separate diatom strains, subsequently establishing a specific gene that controls pores. A further NN was created to batch process image data, automatically recognize pores, and extract pore-related parameters. Class interrelationship of the extracted paraments was visualized using a multivariate data visualization tool, called CrossVis, and allowed to directly link changes in morphological diatom phenotype of pore size and distribution with changes in the genotype.

Research Organization:
Oak Ridge National Laboratory (ORNL), Oak Ridge, TN (United States)
Sponsoring Organization:
USDOE Office of Science (SC); USDOE
Grant/Contract Number:
AC05-00OR22725
OSTI ID:
1619640
Alternate ID(s):
OSTI ID: 1550758
Journal Information:
npj Computational Materials, Vol. 5, Issue 1; ISSN 2057-3960
Publisher:
Nature Publishing GroupCopyright Statement
Country of Publication:
United States
Language:
English

References (46)

scikit-image: image processing in Python journal January 2014
The plane with parallel coordinates journal August 1985
Marine diatoms as optical chemical sensors: A time-resolved study journal March 2008
Diatom silica biomineralization: Parallel development of approaches and understanding journal October 2015
Machine learning quantum phases of matter beyond the fermion sign problem journal August 2017
Deep Learning of Atomically Resolved Scanning Transmission Electron Microscopy Images: Chemical Identification and Tracking Local Transformations journal October 2016
Learning Deep Features for Discriminative Localization conference June 2016
Electroluminescence and Photoluminescence from Nanostructured Diatom Frustules Containing Metabolically Inserted Germanium journal July 2008
A theoretical investigation of the diatom cell size reduction–restitution cycle journal December 2015
Diverse and conserved nano- and mesoscale structures of diatom silica revealed by atomic force microscopy journal August 2009
Neural network models of potential energy surfaces journal September 1995
Nanoscale control of silica morphology and three-dimensional structure during diatom cell wall formation journal October 2006
Big data visual analytics for exploratory earth system simulation analysis journal December 2013
Classification of single particles by neural networks based on the computer-controlled scanning electron microscopy data journal August 1997
Architecture and material properties of diatom shells provide effective mechanical protection journal February 2003
Potential Energy Surfaces Fitted by Artificial Neural Networks journal March 2010
Learning surface molecular structures via machine vision journal August 2017
Diatom Frustule Morphogenesis and Function: a Multidisciplinary Survey journal October 2017
UV-shielding and wavelength conversion by centric diatom nanopatterned frustules journal November 2018
A Nested Molecule-Independent Neural Network Approach for High-Quality Potential Fits journal April 2006
Targeted drug delivery using genetically engineered diatom biosilica journal November 2015
Reconstituting the formation of hierarchically porous silica patterns using diatom biomolecules journal October 2018
Prediction of capillary gas chromatographic retention times of fatty acid methyl esters in human blood using MLR, PLS and back-propagation artificial neural networks journal January 2011
Empirical modeling of polymer electrolyte membrane fuel cell performance using artificial neural networks journal August 2004
A Performance evaluation of neural network models in traffic volume forecasting journal May 1998
Evidence for a Regulatory Role of Diatom Silicon Transporters in Cellular Silicon Responses journal November 2014
Merging Biological Self-Assembly with Synthetic Chemical Tailoring: The Potential for 3-D Genetically Engineered Micro/Nano-Devices (3-D GEMS) journal July 2005
Helium Ion Microscopy for Imaging and Quantifying Porosity at the Nanoscale journal December 2017
Photoluminescence Detection of Biomolecules by Antibody-Functionalized Diatom Biosilica journal March 2009
Interpolating moving least-squares methods for fitting potential energy surfaces: Computing high-density potential energy surface data from low-density ab initio data points journal May 2007
Dynamics of silica cell wall morphogenesis in the diatom Cyclotella cryptica: Substructure formation and the role of microfilaments journal January 2010
Microstructure provides insights into evolutionary design and resilience of Coscinodiscus sp. frustule journal February 2016
Influence of geometry on mechanical properties of bio-inspired silica-based hierarchical materials journal June 2012
Whole transcriptome analysis of the silicon response of the diatom Thalassiosira pseudonana journal January 2012
Prospects of Manipulating Diatom Silica Nanostructure journal January 2005
Temperature affects the silicate morphology in a diatom journal June 2015
Learning phase transitions by confusion journal February 2017
Detection of typhoid fever by diatom-based optical biosensor journal June 2017
Life Cycle, size Reduction Patterns, and Ultrastructure of the Pennate Planktonic Diatom Pseudo-Nitzschia Delicatissima (Bacillariophyceae)1: life Cycle of Pseudo-Nitzschia Delicatissima journal May 2005
Machine learning phases of matter journal February 2017
Back Propagation neural network modeling for warpage prediction and optimization of plastic products during injection molding journal April 2011
Using molecular dynamics to quantify the electrical double layer and examine the potential for its direct observation in the in-situ TEM journal March 2015
Characterization of a New Protein Family Associated With the Silica Deposition Vesicle Membrane Enables Genetic Manipulation of Diatom Silica journal October 2017
Machine learning phases of matter text January 2016
Learning phase transitions by confusion text January 2016
Deep Learning of Atomically Resolved Scanning Transmission Electron Microscopy Images: Chemical Identification and Tracking Local Transformations text January 2018