skip to main content
OSTI.GOV title logo U.S. Department of Energy
Office of Scientific and Technical Information

Title: CAMELOT: A machine learning approach for coarse-grained simulations of aggregation of block-copolymeric protein sequences

Journal Article · · Journal of Chemical Physics
DOI:https://doi.org/10.1063/1.4935066· OSTI ID:22493371
 [1];  [2]
  1. Computational and Systems Biology Program and Center for Biological Systems Engineering, Washington University in St. Louis, St. Louis, Missouri 63130-4899 (United States)
  2. Department of Physics and Center for Biological Systems Engineering, Washington University in St. Louis, St. Louis, Missouri 63130-4899 (United States)

We report the development and deployment of a coarse-graining method that is well suited for computer simulations of aggregation and phase separation of protein sequences with block-copolymeric architectures. Our algorithm, named CAMELOT for Coarse-grained simulations Aided by MachinE Learning Optimization and Training, leverages information from converged all atom simulations that is used to determine a suitable resolution and parameterize the coarse-grained model. To parameterize a system-specific coarse-grained model, we use a combination of Boltzmann inversion, non-linear regression, and a Gaussian process Bayesian optimization approach. The accuracy of the coarse-grained model is demonstrated through direct comparisons to results from all atom simulations. We demonstrate the utility of our coarse-graining approach using the block-copolymeric sequence from the exon 1 encoded sequence of the huntingtin protein. This sequence comprises of 17 residues from the N-terminal end of huntingtin (N17) followed by a polyglutamine (polyQ) tract. Simulations based on the CAMELOT approach are used to show that the adsorption and unfolding of the wild type N17 and its sequence variants on the surface of polyQ tracts engender a patchy colloid like architecture that promotes the formation of linear aggregates. These results provide a plausible explanation for experimental observations, which show that N17 accelerates the formation of linear aggregates in block-copolymeric N17-polyQ sequences. The CAMELOT approach is versatile and is generalizable for simulating the aggregation and phase behavior of a range of block-copolymeric protein sequences.

OSTI ID:
22493371
Journal Information:
Journal of Chemical Physics, Vol. 143, Issue 24; Other Information: (c) 2015 AIP Publishing LLC; Country of input: International Atomic Energy Agency (IAEA); ISSN 0021-9606
Country of Publication:
United States
Language:
English