skip to main content
OSTI.GOV title logo U.S. Department of Energy
Office of Scientific and Technical Information

Title: Multiresolution persistent homology for excessively large biomolecular datasets

Journal Article · · Journal of Chemical Physics
DOI:https://doi.org/10.1063/1.4931733· OSTI ID:22489667
;  [1];  [1]
  1. Department of Mathematics, Michigan State University, East Lansing, Michigan 48824 (United States)

Although persistent homology has emerged as a promising tool for the topological simplification of complex data, it is computationally intractable for large datasets. We introduce multiresolution persistent homology to handle excessively large datasets. We match the resolution with the scale of interest so as to represent large scale datasets with appropriate resolution. We utilize flexibility-rigidity index to access the topological connectivity of the data set and define a rigidity density for the filtration analysis. By appropriately tuning the resolution of the rigidity density, we are able to focus the topological lens on the scale of interest. The proposed multiresolution topological analysis is validated by a hexagonal fractal image which has three distinct scales. We further demonstrate the proposed method for extracting topological fingerprints from DNA molecules. In particular, the topological persistence of a virus capsid with 273 780 atoms is successfully analyzed which would otherwise be inaccessible to the normal point cloud method and unreliable by using coarse-grained multiscale persistent homology. The proposed method has also been successfully applied to the protein domain classification, which is the first time that persistent homology is used for practical protein domain analysis, to our knowledge. The proposed multiresolution topological method has potential applications in arbitrary data sets, such as social networks, biological networks, and graphs.

OSTI ID:
22489667
Journal Information:
Journal of Chemical Physics, Vol. 143, Issue 13; Other Information: (c) 2015 AIP Publishing LLC; Country of input: International Atomic Energy Agency (IAEA); ISSN 0021-9606
Country of Publication:
United States
Language:
English

Similar Records

Multiscale Persistent Functions for Biomolecular Structure Characterization
Journal Article · Thu Nov 02 00:00:00 EDT 2017 · Bulletin of Mathematical Biology · OSTI ID:22489667

Fast and anisotropic flexibility-rigidity index for protein flexibility and fluctuation analysis
Journal Article · Sat Jun 21 00:00:00 EDT 2014 · Journal of Chemical Physics · OSTI ID:22489667

Multiscale analysis of nonlinear systems using computational homology
Technical Report · Wed May 19 00:00:00 EDT 2010 · OSTI ID:22489667