Protein subcellular location pattern classification in cellular images using latent discriminative models

Journal Article · June 11, 2012 · Bioinformatics

DOI: https://doi.org/10.1093/bioinformatics/bts230 · OSTI ID: 1625279

Li, Jieyue [1]; Xiong, Liang [2]; Schneider, Jeff [3]; Murphy, R. F. [4]

[1] Carnegie Mellon Univ., Pittsburgh, PA (United States). Center for Bioimage Informatics; Carnegie Mellon Univ., Pittsburgh, PA (United States). Dept. of Biomedical Engineering
[2] Carnegie Mellon Univ., Pittsburgh, PA (United States). Machine Learning Dept.
[3] Carnegie Mellon Univ., Pittsburgh, PA (United States). Machine Learning Dept.; Carnegie Mellon Univ., Pittsburgh, PA (United States). The Robotics Inst.
[4] Carnegie Mellon Univ., Pittsburgh, PA (United States). Center for Bioimage Informatics, Dept. of Biomedical Engineering, Machine Learning Dept., Lane Center for Computational Biology, and Dept. of Biological Sciences; Albert Ludwig Univ. of Freiburg (Germany). Freiburg Inst. for Advanced Studies

Motivation: Knowledge of the subcellular location of a protein is crucial for understanding its functions. The subcellular pattern of a protein is typically represented as the set of cellular components in which it is located, and an important task is to determine this set from microscope images. In this article, we address this classification problem using confocal immunofluorescence images from the Human Protein Atlas (HPA) project. The HPA contains images of cells stained for many proteins; each is also stained for three reference components, but there are many other components that are invisible. Given one such cell, the task is to classify the pattern type of the stained protein. We first randomly select local image regions within the cells, and then extract various carefully designed features from these regions. This region-based approach enables us to explicitly study the relationship between proteins and different cell components, as well as the interactions between these components. To achieve these two goals, we propose two discriminative models that extend logistic regression with structured latent variables. The first model allows the same protein pattern class to be expressed differently according to the underlying components in different regions. The second model further captures the spatial dependencies between the components within the same cell so that we can better infer these components. To learn these models, we propose a fast approximate algorithm for inference, and then use gradient-based methods to maximize the data likelihood.

Results: In the experiments, we show that the proposed models help improve the classification accuracies on synthetic data and real cellular images. The best overall accuracy we report in this article for classifying 942 proteins into 13 classes of patterns is about 84.6%, which to our knowledge is the best so far. In addition, the dependencies learned are consistent with prior knowledge of cell organization.
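The idea behind the first model in the abstract (logistic regression extended with a latent cell component per image region) can be illustrated with a minimal sketch. This is not the authors' formulation: the weight layout `W[class, component, feature]`, the assumption that latent components are independent across regions, and the function names are all hypothetical choices made for illustration. The second model, with spatial dependencies and approximate inference, is not covered here.

```python
import numpy as np


def logsumexp(a, axis):
    """Numerically stable log(sum(exp(a))) along one axis."""
    m = np.max(a, axis=axis, keepdims=True)
    return np.squeeze(m, axis=axis) + np.log(np.sum(np.exp(a - m), axis=axis))


def class_log_probs(regions, W):
    """Log P(class | regions) for one cell (illustrative model, see lead-in).

    regions: (R, D) array, one feature vector per sampled image region.
    W:       (C, K, D) array, weights for class c and latent component k.
    Each region's latent component is summed out independently.
    """
    # scores[c, k, r] = W[c, k] . regions[r]
    scores = np.einsum("ckd,rd->ckr", W, regions)
    # Marginalize the latent component per region, then add up the regions.
    per_class = logsumexp(scores, axis=1).sum(axis=1)  # shape (C,)
    # Normalize over classes.
    return per_class - logsumexp(per_class, axis=0)


def log_likelihood_grad(regions, y, W):
    """Gradient of log P(y | regions) with respect to W, shape (C, K, D)."""
    scores = np.einsum("ckd,rd->ckr", W, regions)
    # q[c, k, r]: posterior over components for region r under class c.
    q = np.exp(scores - logsumexp(scores, axis=1)[:, None, :])
    # coeff[c] = 1[c == y] - P(c | regions), as in plain logistic regression.
    coeff = -np.exp(class_log_probs(regions, W))
    coeff[y] += 1.0
    return coeff[:, None, None] * np.einsum("ckr,rd->ckd", q, regions)


# One gradient-ascent step on a single synthetic cell (random data).
rng = np.random.default_rng(0)
regions = rng.normal(size=(5, 3))       # 5 regions, 3 features each
W = 0.1 * rng.normal(size=(4, 2, 3))    # 4 classes, 2 latent components
W_new = W + 0.01 * log_likelihood_grad(regions, 2, W)
```

With a single latent component (K = 1) the sketch reduces to multinomial logistic regression on the summed region features, which is the sense in which latent variables extend the baseline.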

Research Organization: Carnegie Mellon Univ., Pittsburgh, PA (United States)

Sponsoring Organization: USDOE Office of Science (SC), Biological and Environmental Research (BER). Biological Systems Science Division

Grant/Contract Number: SC0002607

OSTI ID: 1625279

Journal Information: Bioinformatics, Vol. 28, Issue 12; ISSN 1367-4803

Publisher: Oxford University Press

Country of Publication: United States

Language: English

Related Subjects

59 BASIC BIOLOGICAL SCIENCES
97 MATHEMATICS AND COMPUTING
Biochemistry & Molecular Biology
Biotechnology & Applied Microbiology
Computer Science
Mathematical & Computational Biology
Mathematics
