DOE PAGES title logo U.S. Department of Energy
Office of Scientific and Technical Information

Title: Immunophenotype Discovery, Hierarchical Organization, and Template-Based Classification of Flow Cytometry Samples

Abstract

We describe algorithms for discovering immunophenotypes from large collections of flow cytometry samples and using them to organize the samples into a hierarchy based on phenotypic similarity. The hierarchical organization is helpful for effective and robust cytometry data mining, including the creation of collections of cell populations’ characteristic of different classes of samples, robust classification, and anomaly detection. We summarize a set of samples belonging to a biological class or category with a statistically derived template for the class. Whereas individual samples are represented in terms of their cell populations (clusters), a template consists of generic meta-populations (a group of homogeneous cell populations obtained from the samples in a class) that describe key phenotypes shared among all those samples. We organize an FC data collection in a hierarchical data structure that supports the identification of immunophenotypes relevant to clinical diagnosis. A robust template-based classification scheme is also developed, but our primary focus is in the discovery of phenotypic signatures and inter-sample relationships in an FC data collection. This collective analysis approach is more efficient and robust since templates describe phenotypic signatures common to cell populations in several samples while ignoring noise and small sample-specific variations. We have applied the template-basedmore » scheme to analyze several datasets, including one representing a healthy immune system and one of acute myeloid leukemia (AML) samples. The last task is challenging due to the phenotypic heterogeneity of the several subtypes of AML. However, we identified thirteen immunophenotypes corresponding to subtypes of AML and were able to distinguish acute promyelocytic leukemia (APL) samples with the markers provided. Clinically, this is helpful since APL has a different treatment regimen from other subtypes of AML. Core algorithms used in our data analysis are available in the flowMatch package at www.bioconductor.org. It has been downloaded nearly 6,000 times since 2014.« less

Authors:
 [1];  [2];  [2]
  1. Lawrence Berkeley National Lab. (LBNL), Berkeley, CA (United States)
  2. Purdue Univ., West Lafayette, IN (United States)
Publication Date:
Research Org.:
Lawrence Berkeley National Laboratory (LBNL), Berkeley, CA (United States)
Sponsoring Org.:
USDOE Office of Science (SC), Advanced Scientific Computing Research (ASCR)
OSTI Identifier:
1379582
Grant/Contract Number:  
AC02-05CH11231
Resource Type:
Accepted Manuscript
Journal Name:
Frontiers in Oncology
Additional Journal Information:
Journal Volume: 6; Journal ID: ISSN 2234-943X
Publisher:
Frontiers Research Foundation
Country of Publication:
United States
Language:
English
Subject:
60 APPLIED LIFE SCIENCES; 59 BASIC BIOLOGICAL SCIENCES; flow cytometry; clusters; meta-clusters; template; matching; classification

Citation Formats

Azad, Ariful, Rajwa, Bartek, and Pothen, Alex. Immunophenotype Discovery, Hierarchical Organization, and Template-Based Classification of Flow Cytometry Samples. United States: N. p., 2016. Web. doi:10.3389/fonc.2016.00188.
Azad, Ariful, Rajwa, Bartek, & Pothen, Alex. Immunophenotype Discovery, Hierarchical Organization, and Template-Based Classification of Flow Cytometry Samples. United States. https://doi.org/10.3389/fonc.2016.00188
Azad, Ariful, Rajwa, Bartek, and Pothen, Alex. Wed . "Immunophenotype Discovery, Hierarchical Organization, and Template-Based Classification of Flow Cytometry Samples". United States. https://doi.org/10.3389/fonc.2016.00188. https://www.osti.gov/servlets/purl/1379582.
@article{osti_1379582,
title = {Immunophenotype Discovery, Hierarchical Organization, and Template-Based Classification of Flow Cytometry Samples},
author = {Azad, Ariful and Rajwa, Bartek and Pothen, Alex},
abstractNote = {We describe algorithms for discovering immunophenotypes from large collections of flow cytometry samples and using them to organize the samples into a hierarchy based on phenotypic similarity. The hierarchical organization is helpful for effective and robust cytometry data mining, including the creation of collections of cell populations’ characteristic of different classes of samples, robust classification, and anomaly detection. We summarize a set of samples belonging to a biological class or category with a statistically derived template for the class. Whereas individual samples are represented in terms of their cell populations (clusters), a template consists of generic meta-populations (a group of homogeneous cell populations obtained from the samples in a class) that describe key phenotypes shared among all those samples. We organize an FC data collection in a hierarchical data structure that supports the identification of immunophenotypes relevant to clinical diagnosis. A robust template-based classification scheme is also developed, but our primary focus is in the discovery of phenotypic signatures and inter-sample relationships in an FC data collection. This collective analysis approach is more efficient and robust since templates describe phenotypic signatures common to cell populations in several samples while ignoring noise and small sample-specific variations. We have applied the template-based scheme to analyze several datasets, including one representing a healthy immune system and one of acute myeloid leukemia (AML) samples. The last task is challenging due to the phenotypic heterogeneity of the several subtypes of AML. However, we identified thirteen immunophenotypes corresponding to subtypes of AML and were able to distinguish acute promyelocytic leukemia (APL) samples with the markers provided. Clinically, this is helpful since APL has a different treatment regimen from other subtypes of AML. Core algorithms used in our data analysis are available in the flowMatch package at www.bioconductor.org. It has been downloaded nearly 6,000 times since 2014.},
doi = {10.3389/fonc.2016.00188},
journal = {Frontiers in Oncology},
number = ,
volume = 6,
place = {United States},
year = {Wed Aug 31 00:00:00 EDT 2016},
month = {Wed Aug 31 00:00:00 EDT 2016}
}

Journal Article:
Free Publicly Available Full Text
Publisher's Version of Record

Citation Metrics:
Cited by: 6 works
Citation information provided by
Web of Science

Save / Share:

Works referenced in this record:

Inferring Phenotypic Properties from Single-Cell Characteristics
journal, May 2012


A Computational Framework to Emulate the Human Perspective in Flow Cytometric Data Analysis
journal, May 2012


Flow cytometer electronics
journal, January 2004


Flow cytometer electronics
journal, January 2004


Variance stabilization applied to microarray data calibration and to the quantification of differential expression
journal, July 2002


Automated high-dimensional flow cytometric data analysis
journal, May 2009

  • Pyne, S.; Hu, X.; Wang, K.
  • Proceedings of the National Academy of Sciences, Vol. 106, Issue 21
  • DOI: 10.1073/pnas.0903028106

Inferring Phenotypic Properties from Single-Cell Characteristics
journal, May 2012


Modeling of inter-sample variation in flow cytometric data with the joint clustering and matching procedure: Modeling of Inter-Sample Variation
journal, October 2015

  • Lee, Sharon X.; McLachlan, Geoffrey J.; Pyne, Saumyadipta
  • Cytometry Part A, Vol. 89, Issue 1
  • DOI: 10.1002/cyto.a.22789

An invariant form for the prior probability in estimation problems
journal, September 1946

  • Jeffreys, Harold
  • Proceedings of the Royal Society of London. Series A. Mathematical and Physical Sciences, Vol. 186, Issue 1007, p. 453-461
  • DOI: 10.1098/rspa.1946.0056

Data reduction for spectral clustering to analyze high throughput flow cytometry data
journal, January 2010


Aberrant expression of CD19 in AML with t(8;21) involves a poised chromatin structure and PAX5
journal, March 2010

  • Walter, K.; Cockerill, P. N.; Barlow, R.
  • Oncogene, Vol. 29, Issue 20
  • DOI: 10.1038/onc.2010.56

Aberrant expression of CD19 in AML with t(8;21) involves a poised chromatin structure and PAX5
journal, March 2010

  • Walter, K.; Cockerill, P. N.; Barlow, R.
  • Oncogene, Vol. 29, Issue 20
  • DOI: 10.1038/onc.2010.56

Fluorescence Spectral Overlap Compensation for Any Number of Flow Cytometry Parameters
journal, March 1993


Automated gating of flow cytometry data via robust model-based clustering
journal, April 2008

  • Lo, Kenneth; Brinkman, Ryan Remy; Gottardo, Raphael
  • Cytometry Part A, Vol. 73A, Issue 4
  • DOI: 10.1002/cyto.a.20531

Single-Cell Mass Cytometry of Differential Immune and Drug Responses Across a Human Hematopoietic Continuum
journal, May 2011


The immunophenotype of 177 adults with acute myeloid leukemia: proposal of a prognostic score
journal, August 2000


A new “Logicle” display method avoids deceptive effects of logarithmic scaling for low signals and compensated data
journal, January 2006

  • Parks, David R.; Roederer, Mario; Moore, Wayne A.
  • Cytometry Part A, Vol. 69A, Issue 6
  • DOI: 10.1002/cyto.a.20258

Partition-distance: A problem and class of perfect graphs arising in clustering
journal, May 2002


Increased CD38 expression is associated with favorable prognosis in adult acute leukemia
journal, February 2000


A dendrite method for cluster analysis
journal, January 1974

  • Calinski, T.; Harabasz, J.
  • Communications in Statistics - Theory and Methods, Vol. 3, Issue 1
  • DOI: 10.1080/03610927408827101

Generalized unmixing model for multispectral flow cytometry utilizing nonsquare compensation matrices
journal, March 2013

  • Novo, David; Grégori, Gérald; Rajwa, Bartek
  • Cytometry Part A, Vol. 83A, Issue 5
  • DOI: 10.1002/cyto.a.22272

The immunophenotype of 177 adults with acute myeloid leukemia: proposal of a prognostic score
journal, August 2000


Critical assessment of automated flow cytometry data analysis techniques
journal, February 2013

  • Aghaeepour, Nima; Finak, Greg; Hoos, Holger
  • Nature Methods, Vol. 10, Issue 3
  • DOI: 10.1038/nmeth.2365

Transformation Theory: How Normal is a Family of Distributions?
journal, June 1982


Data clustering: a review
journal, September 1999

  • Jain, A. K.; Murty, M. N.; Flynn, P. J.
  • ACM Computing Surveys, Vol. 31, Issue 3, p. 264-323
  • DOI: 10.1145/331499.331504

Flow cytometry CD45 gating for immunophenotyping of acute myeloid leukemia
journal, November 1997


Rapid cell population identification in flow cytometry data
journal, December 2010

  • Aghaeepour, Nima; Nikolic, Radina; Hoos, Holger H.
  • Cytometry Part A, Vol. 79A, Issue 1
  • DOI: 10.1002/cyto.a.21007

Rapid cell population identification in flow cytometry data
journal, December 2010

  • Aghaeepour, Nima; Nikolic, Radina; Hoos, Holger H.
  • Cytometry Part A, Vol. 79A, Issue 1
  • DOI: 10.1002/cyto.a.21007

Data clustering: a review
journal, September 1999

  • Jain, A. K.; Murty, M. N.; Flynn, P. J.
  • ACM Computing Surveys, Vol. 31, Issue 3, p. 264-323
  • DOI: 10.1145/331499.331504

Hyperlog?A flexible log-like transform for negative, zero, and positive valued data
journal, January 2005


Flow cytometry histograms: Transformations, resolution, and display
journal, August 2008


Data quality assessment of ungated flow cytometry data in high throughput experiments
journal, January 2007

  • Le Meur, Nolwenn; Rossini, Anthony; Gasparetto, Maura
  • Cytometry Part A, Vol. 71A, Issue 6
  • DOI: 10.1002/cyto.a.20396

Expression of cell-surface antigens in acute promyelocytic leukaemia
journal, September 2003


An invariant form for the prior probability in estimation problems
journal, September 1946

  • Jeffreys, Harold
  • Proceedings of the Royal Society of London. Series A. Mathematical and Physical Sciences, Vol. 186, Issue 1007, p. 453-461
  • DOI: 10.1098/rspa.1946.0056

Critical assessment of automated flow cytometry data analysis techniques
journal, February 2013

  • Aghaeepour, Nima; Finak, Greg; Hoos, Holger
  • Nature Methods, Vol. 10, Issue 3
  • DOI: 10.1038/nmeth.2365

Single-Cell Mass Cytometry of Differential Immune and Drug Responses Across a Human Hematopoietic Continuum
journal, May 2011


Bioconductor : open software development for computational biology and bioinformatics
text, January 2004

  • C., Gentleman, Robert; J., Carey, Vincent; M., Bates, Douglas
  • BioMed Central
  • DOI: 10.21256/zhaw-1907

Flow Cytometry Bioinformatics
journal, December 2013


The immunophenotype of acute myeloid leukemia: is there a relationship with prognosis?
journal, March 2006


Analysis of Flow Cytometry Data by Matrix Relevance Learning Vector Quantization
journal, March 2013


GenePattern flow cytometry suite
journal, January 2013

  • Spidlen, Josef; Barsky, Aaron; Breuer, Karin
  • Source Code for Biology and Medicine, Vol. 8, Issue 1
  • DOI: 10.1186/1751-0473-8-14

Analysis of Flow Cytometry Data by Matrix Relevance Learning Vector Quantization
journal, March 2013


Flow Cytometry Bioinformatics
journal, December 2013


GenePattern flow cytometry suite
journal, January 2013

  • Spidlen, Josef; Barsky, Aaron; Breuer, Karin
  • Source Code for Biology and Medicine, Vol. 8, Issue 1
  • DOI: 10.1186/1751-0473-8-14

Data analysis in flow cytometry: The future just started
journal, April 2010

  • Lugli, Enrico; Roederer, Mario; Cossarizza, Andrea
  • Cytometry Part A, Vol. 77A, Issue 7
  • DOI: 10.1002/cyto.a.20901

Optimizing transformations for automated, high throughput analysis of flow cytometry data
journal, November 2010


Immunophenotyping of leukemia
journal, September 2000


flowVS: channel-specific variance stabilization in flow cytometry
journal, July 2016


Data analysis in flow cytometry: The future just started
journal, April 2010

  • Lugli, Enrico; Roederer, Mario; Cossarizza, Andrea
  • Cytometry Part A, Vol. 77A, Issue 7
  • DOI: 10.1002/cyto.a.20901

Optimizing transformations for automated, high throughput analysis of flow cytometry data
journal, November 2010


Flow cytometry histograms: Transformations, resolution, and display
journal, August 2008


Amine reactive dyes: An effective tool to discriminate live and dead cells in polychromatic flow cytometry
journal, June 2006

  • Perfetto, Stephen P.; Chattopadhyay, Pratip K.; Lamoreaux, Laurie
  • Journal of Immunological Methods, Vol. 313, Issue 1-2
  • DOI: 10.1016/j.jim.2006.04.007

Data quality assessment of ungated flow cytometry data in high throughput experiments
journal, January 2007

  • Le Meur, Nolwenn; Rossini, Anthony; Gasparetto, Maura
  • Cytometry Part A, Vol. 71A, Issue 6
  • DOI: 10.1002/cyto.a.20396

Hyperlog?A flexible log-like transform for negative, zero, and positive valued data
journal, January 2005


On Clustering Validation Techniques
journal, January 2001

  • Halkidi, Maria; Batistakis, Yannis; Vazirgiannis, Michalis
  • Journal of Intelligent Information Systems, Vol. 17, Issue 2/3, p. 107-145
  • DOI: 10.1023/A:1012801612483

CD56 antigenic expression in acute myeloid leukemia identifies patients with poor clinical prognosis
journal, July 2001


Flow cytometry CD45 gating for immunophenotyping of acute myeloid leukemia
journal, November 1997


flowVS: channel-specific variance stabilization in flow cytometry
journal, July 2016


Web-Based Analysis and Publication of Flow Cytometry Experiments
journal, July 2010


On Information and Sufficiency
journal, March 1951

  • Kullback, S.; Leibler, R. A.
  • The Annals of Mathematical Statistics, Vol. 22, Issue 1
  • DOI: 10.1214/aoms/1177729694

The immunophenotype of acute myeloid leukemia: is there a relationship with prognosis?
journal, March 2006


flowCore: a Bioconductor package for high throughput flow cytometry
journal, April 2009


Transformation Theory: How Normal is a Family of Distributions?
journal, June 1982


A Dendrite Method for Cluster Analysis
journal, January 1974

  • Calinski, T.; Harabasz, J.
  • Communications in Statistics - Simulation and Computation, Vol. 3, Issue 1
  • DOI: 10.1080/03610917408548446

Modified box-cox transform for modulating the dynamic range of flow cytometry data
journal, November 1989


Matching phosphorylation response patterns of antigen-receptor-stimulated T cells via flow cytometry
journal, March 2012


Modified box-cox transform for modulating the dynamic range of flow cytometry data
journal, November 1989


Bioconductor: open software development for computational biology and bioinformatics
journal, September 2004

  • Gentleman, Robert C.; Carey, Vincent J.; Bates, Douglas M.
  • Genome Biology, Vol. 5, Issue 10, p. R80
  • DOI: 10.1186/gb-2004-5-10-r80

Automated gating of flow cytometry data via robust model-based clustering
journal, April 2008

  • Lo, Kenneth; Brinkman, Ryan Remy; Gottardo, Raphael
  • Cytometry Part A, Vol. 73A, Issue 4
  • DOI: 10.1002/cyto.a.20531

A new “Logicle” display method avoids deceptive effects of logarithmic scaling for low signals and compensated data
journal, January 2006

  • Parks, David R.; Roederer, Mario; Moore, Wayne A.
  • Cytometry Part A, Vol. 69A, Issue 6
  • DOI: 10.1002/cyto.a.20258

Amine reactive dyes: An effective tool to discriminate live and dead cells in polychromatic flow cytometry
journal, June 2006

  • Perfetto, Stephen P.; Chattopadhyay, Pratip K.; Lamoreaux, Laurie
  • Journal of Immunological Methods, Vol. 313, Issue 1-2
  • DOI: 10.1016/j.jim.2006.04.007

Web-Based Analysis and Publication of Flow Cytometry Experiments
journal, July 2010


Acute myeloid leukaemia
journal, November 2006


CD56 antigenic expression in acute myeloid leukemia identifies patients with poor clinical prognosis
journal, July 2001


The Earth Mover's Distance as a Metric for Image Retrieval
journal, November 2000


Data reduction for spectral clustering to analyze high throughput flow cytometry data
text, January 2010


Classifying Immunophenotypes With Templates From Flow Cytometry
conference, September 2013

  • Azad, Ariful; Khan, Arif; Rajwa, Bartek
  • BCB'13: ACM-BCB2013, Proceedings of the International Conference on Bioinformatics, Computational Biology and Biomedical Informatics
  • DOI: 10.1145/2506583.2506627

Works referencing / citing this record:

Guidelines for the use of flow cytometry and cell sorting in immunological studies (second edition)
journal, October 2019

  • Cossarizza, Andrea; Chang, Hyun‐Dong; Radbruch, Andreas
  • European Journal of Immunology, Vol. 49, Issue 10
  • DOI: 10.1002/eji.201970107

Improved Unsupervised Color Segmentation Using a Modified H S V Color Model and a Bagging Procedure in K -Means++ Algorithm
journal, January 2018

  • Chavolla, Edgar; Valdivia, Arturo; Diaz, Primitivo
  • Mathematical Problems in Engineering, Vol. 2018
  • DOI: 10.1155/2018/2786952