Immunophenotype Discovery, Hierarchical Organization, and Template-Based Classification of Flow Cytometry Samples
Abstract
We describe algorithms for discovering immunophenotypes from large collections of flow cytometry samples and using them to organize the samples into a hierarchy based on phenotypic similarity. The hierarchical organization is helpful for effective and robust cytometry data mining, including the creation of collections of cell populations’ characteristic of different classes of samples, robust classification, and anomaly detection. We summarize a set of samples belonging to a biological class or category with a statistically derived template for the class. Whereas individual samples are represented in terms of their cell populations (clusters), a template consists of generic meta-populations (a group of homogeneous cell populations obtained from the samples in a class) that describe key phenotypes shared among all those samples. We organize an FC data collection in a hierarchical data structure that supports the identification of immunophenotypes relevant to clinical diagnosis. A robust template-based classification scheme is also developed, but our primary focus is in the discovery of phenotypic signatures and inter-sample relationships in an FC data collection. This collective analysis approach is more efficient and robust since templates describe phenotypic signatures common to cell populations in several samples while ignoring noise and small sample-specific variations. We have applied the template-basedmore »
- Authors:
-
- Lawrence Berkeley National Lab. (LBNL), Berkeley, CA (United States)
- Purdue Univ., West Lafayette, IN (United States)
- Publication Date:
- Research Org.:
- Lawrence Berkeley National Laboratory (LBNL), Berkeley, CA (United States)
- Sponsoring Org.:
- USDOE Office of Science (SC), Advanced Scientific Computing Research (ASCR)
- OSTI Identifier:
- 1379582
- Grant/Contract Number:
- AC02-05CH11231
- Resource Type:
- Accepted Manuscript
- Journal Name:
- Frontiers in Oncology
- Additional Journal Information:
- Journal Volume: 6; Journal ID: ISSN 2234-943X
- Publisher:
- Frontiers Research Foundation
- Country of Publication:
- United States
- Language:
- English
- Subject:
- 60 APPLIED LIFE SCIENCES; 59 BASIC BIOLOGICAL SCIENCES; flow cytometry; clusters; meta-clusters; template; matching; classification
Citation Formats
Azad, Ariful, Rajwa, Bartek, and Pothen, Alex. Immunophenotype Discovery, Hierarchical Organization, and Template-Based Classification of Flow Cytometry Samples. United States: N. p., 2016.
Web. doi:10.3389/fonc.2016.00188.
Azad, Ariful, Rajwa, Bartek, & Pothen, Alex. Immunophenotype Discovery, Hierarchical Organization, and Template-Based Classification of Flow Cytometry Samples. United States. https://doi.org/10.3389/fonc.2016.00188
Azad, Ariful, Rajwa, Bartek, and Pothen, Alex. Wed .
"Immunophenotype Discovery, Hierarchical Organization, and Template-Based Classification of Flow Cytometry Samples". United States. https://doi.org/10.3389/fonc.2016.00188. https://www.osti.gov/servlets/purl/1379582.
@article{osti_1379582,
title = {Immunophenotype Discovery, Hierarchical Organization, and Template-Based Classification of Flow Cytometry Samples},
author = {Azad, Ariful and Rajwa, Bartek and Pothen, Alex},
abstractNote = {We describe algorithms for discovering immunophenotypes from large collections of flow cytometry samples and using them to organize the samples into a hierarchy based on phenotypic similarity. The hierarchical organization is helpful for effective and robust cytometry data mining, including the creation of collections of cell populations’ characteristic of different classes of samples, robust classification, and anomaly detection. We summarize a set of samples belonging to a biological class or category with a statistically derived template for the class. Whereas individual samples are represented in terms of their cell populations (clusters), a template consists of generic meta-populations (a group of homogeneous cell populations obtained from the samples in a class) that describe key phenotypes shared among all those samples. We organize an FC data collection in a hierarchical data structure that supports the identification of immunophenotypes relevant to clinical diagnosis. A robust template-based classification scheme is also developed, but our primary focus is in the discovery of phenotypic signatures and inter-sample relationships in an FC data collection. This collective analysis approach is more efficient and robust since templates describe phenotypic signatures common to cell populations in several samples while ignoring noise and small sample-specific variations. We have applied the template-based scheme to analyze several datasets, including one representing a healthy immune system and one of acute myeloid leukemia (AML) samples. The last task is challenging due to the phenotypic heterogeneity of the several subtypes of AML. However, we identified thirteen immunophenotypes corresponding to subtypes of AML and were able to distinguish acute promyelocytic leukemia (APL) samples with the markers provided. Clinically, this is helpful since APL has a different treatment regimen from other subtypes of AML. Core algorithms used in our data analysis are available in the flowMatch package at www.bioconductor.org. It has been downloaded nearly 6,000 times since 2014.},
doi = {10.3389/fonc.2016.00188},
journal = {Frontiers in Oncology},
number = ,
volume = 6,
place = {United States},
year = {Wed Aug 31 00:00:00 EDT 2016},
month = {Wed Aug 31 00:00:00 EDT 2016}
}
Web of Science
Works referenced in this record:
Inferring Phenotypic Properties from Single-Cell Characteristics
journal, May 2012
- Qiu, Peng
- PLoS ONE, Vol. 7, Issue 5
A Computational Framework to Emulate the Human Perspective in Flow Cytometric Data Analysis
journal, May 2012
- Ray, Surajit; Pyne, Saumyadipta
- PLoS ONE, Vol. 7, Issue 5
Flow cytometer electronics
journal, January 2004
- Snow, Christopher ?Kit?
- Cytometry, Vol. 57A, Issue 2
Flow cytometer electronics
journal, January 2004
- Snow, Christopher ?Kit?
- Cytometry, Vol. 57A, Issue 2
Variance stabilization applied to microarray data calibration and to the quantification of differential expression
journal, July 2002
- Huber, W.; von Heydebreck, A.; Sultmann, H.
- Bioinformatics, Vol. 18, Issue Suppl 1
Automated high-dimensional flow cytometric data analysis
journal, May 2009
- Pyne, S.; Hu, X.; Wang, K.
- Proceedings of the National Academy of Sciences, Vol. 106, Issue 21
Inferring Phenotypic Properties from Single-Cell Characteristics
journal, May 2012
- Qiu, Peng
- PLoS ONE, Vol. 7, Issue 5
Modeling of inter-sample variation in flow cytometric data with the joint clustering and matching procedure: Modeling of Inter-Sample Variation
journal, October 2015
- Lee, Sharon X.; McLachlan, Geoffrey J.; Pyne, Saumyadipta
- Cytometry Part A, Vol. 89, Issue 1
An invariant form for the prior probability in estimation problems
journal, September 1946
- Jeffreys, Harold
- Proceedings of the Royal Society of London. Series A. Mathematical and Physical Sciences, Vol. 186, Issue 1007, p. 453-461
Data reduction for spectral clustering to analyze high throughput flow cytometry data
journal, January 2010
- Zare, Habil; Shooshtari, Parisa; Gupta, Arvind
- BMC Bioinformatics, Vol. 11, Issue 1
Aberrant expression of CD19 in AML with t(8;21) involves a poised chromatin structure and PAX5
journal, March 2010
- Walter, K.; Cockerill, P. N.; Barlow, R.
- Oncogene, Vol. 29, Issue 20
Aberrant expression of CD19 in AML with t(8;21) involves a poised chromatin structure and PAX5
journal, March 2010
- Walter, K.; Cockerill, P. N.; Barlow, R.
- Oncogene, Vol. 29, Issue 20
Fluorescence Spectral Overlap Compensation for Any Number of Flow Cytometry Parameters
journal, March 1993
- Bagwell, C. Bruce; Adams, Earl G.
- Annals of the New York Academy of Sciences, Vol. 677, Issue 1 Clinical Flow
Automated gating of flow cytometry data via robust model-based clustering
journal, April 2008
- Lo, Kenneth; Brinkman, Ryan Remy; Gottardo, Raphael
- Cytometry Part A, Vol. 73A, Issue 4
Single-Cell Mass Cytometry of Differential Immune and Drug Responses Across a Human Hematopoietic Continuum
journal, May 2011
- Bendall, S. C.; Simonds, E. F.; Qiu, P.
- Science, Vol. 332, Issue 6030
The immunophenotype of 177 adults with acute myeloid leukemia: proposal of a prognostic score
journal, August 2000
- Legrand, Ollivier; Perrot, Jean-Yves; Baudard, Marion
- Blood, Vol. 96, Issue 3
A new “Logicle” display method avoids deceptive effects of logarithmic scaling for low signals and compensated data
journal, January 2006
- Parks, David R.; Roederer, Mario; Moore, Wayne A.
- Cytometry Part A, Vol. 69A, Issue 6
Partition-distance: A problem and class of perfect graphs arising in clustering
journal, May 2002
- Gusfield, Dan
- Information Processing Letters, Vol. 82, Issue 3
Increased CD38 expression is associated with favorable prognosis in adult acute leukemia
journal, February 2000
- Keyhani, Afsaneh; Huh, Yang O.; Jendiroba, David
- Leukemia Research, Vol. 24, Issue 2
A dendrite method for cluster analysis
journal, January 1974
- Calinski, T.; Harabasz, J.
- Communications in Statistics - Theory and Methods, Vol. 3, Issue 1
Generalized unmixing model for multispectral flow cytometry utilizing nonsquare compensation matrices
journal, March 2013
- Novo, David; Grégori, Gérald; Rajwa, Bartek
- Cytometry Part A, Vol. 83A, Issue 5
The immunophenotype of 177 adults with acute myeloid leukemia: proposal of a prognostic score
journal, August 2000
- Legrand, Ollivier; Perrot, Jean-Yves; Baudard, Marion
- Blood, Vol. 96, Issue 3
Critical assessment of automated flow cytometry data analysis techniques
journal, February 2013
- Aghaeepour, Nima; Finak, Greg; Hoos, Holger
- Nature Methods, Vol. 10, Issue 3
Transformation Theory: How Normal is a Family of Distributions?
journal, June 1982
- Efron, Bradley
- The Annals of Statistics, Vol. 10, Issue 2
Data clustering: a review
journal, September 1999
- Jain, A. K.; Murty, M. N.; Flynn, P. J.
- ACM Computing Surveys, Vol. 31, Issue 3, p. 264-323
Flow cytometry CD45 gating for immunophenotyping of acute myeloid leukemia
journal, November 1997
- Lacombe, F.; Durrieu, F.; Briais, A.
- Leukemia, Vol. 11, Issue 11
Rapid cell population identification in flow cytometry data
journal, December 2010
- Aghaeepour, Nima; Nikolic, Radina; Hoos, Holger H.
- Cytometry Part A, Vol. 79A, Issue 1
Rapid cell population identification in flow cytometry data
journal, December 2010
- Aghaeepour, Nima; Nikolic, Radina; Hoos, Holger H.
- Cytometry Part A, Vol. 79A, Issue 1
Data clustering: a review
journal, September 1999
- Jain, A. K.; Murty, M. N.; Flynn, P. J.
- ACM Computing Surveys, Vol. 31, Issue 3, p. 264-323
Hyperlog?A flexible log-like transform for negative, zero, and positive valued data
journal, January 2005
- Bagwell, C. Bruce
- Cytometry Part A, Vol. 64A, Issue 1
Criteria for the Diagnosis of Acute Leukemia of Megakaryocyte Lineage (M7): A Report of the French-American-British Cooperative Group
journal, September 1985
- Bennett, John M.
- Annals of Internal Medicine, Vol. 103, Issue 3
Elucidation of seventeen human peripheral blood B-cell subsets and quantification of the tetanus response using a density-based method for the automated identification of cell populations in multidimensional flow cytometry data
journal, January 2010
- Qian, Yu; Wei, Chungwen; Eun-Hyung Lee, F.
- Cytometry Part B: Clinical Cytometry, Vol. 78B, Issue S1
Flow cytometry histograms: Transformations, resolution, and display
journal, August 2008
- Novo, David; Wood, James
- Cytometry Part A, Vol. 73A, Issue 8
Data quality assessment of ungated flow cytometry data in high throughput experiments
journal, January 2007
- Le Meur, Nolwenn; Rossini, Anthony; Gasparetto, Maura
- Cytometry Part A, Vol. 71A, Issue 6
Expression of cell-surface antigens in acute promyelocytic leukaemia
journal, September 2003
- Paietta, Elisabeth
- Best Practice & Research Clinical Haematology, Vol. 16, Issue 3
An invariant form for the prior probability in estimation problems
journal, September 1946
- Jeffreys, Harold
- Proceedings of the Royal Society of London. Series A. Mathematical and Physical Sciences, Vol. 186, Issue 1007, p. 453-461
Critical assessment of automated flow cytometry data analysis techniques
journal, February 2013
- Aghaeepour, Nima; Finak, Greg; Hoos, Holger
- Nature Methods, Vol. 10, Issue 3
Single-Cell Mass Cytometry of Differential Immune and Drug Responses Across a Human Hematopoietic Continuum
journal, May 2011
- Bendall, S. C.; Simonds, E. F.; Qiu, P.
- Science, Vol. 332, Issue 6030
Bioconductor : open software development for computational biology and bioinformatics
text, January 2004
- C., Gentleman, Robert; J., Carey, Vincent; M., Bates, Douglas
- BioMed Central
Flow Cytometry Bioinformatics
journal, December 2013
- O'Neill, Kieran; Aghaeepour, Nima; Špidlen, Josef
- PLoS Computational Biology, Vol. 9, Issue 12
The immunophenotype of acute myeloid leukemia: is there a relationship with prognosis?
journal, March 2006
- Mason, Kylie D.; Juneja, Surender K.; Szer, Jeff
- Blood Reviews, Vol. 20, Issue 2
Analysis of Flow Cytometry Data by Matrix Relevance Learning Vector Quantization
journal, March 2013
- Biehl, Michael; Bunte, Kerstin; Schneider, Petra
- PLoS ONE, Vol. 8, Issue 3
GenePattern flow cytometry suite
journal, January 2013
- Spidlen, Josef; Barsky, Aaron; Breuer, Karin
- Source Code for Biology and Medicine, Vol. 8, Issue 1
Analysis of Flow Cytometry Data by Matrix Relevance Learning Vector Quantization
journal, March 2013
- Biehl, Michael; Bunte, Kerstin; Schneider, Petra
- PLoS ONE, Vol. 8, Issue 3
Flow Cytometry Bioinformatics
journal, December 2013
- O'Neill, Kieran; Aghaeepour, Nima; Špidlen, Josef
- PLoS Computational Biology, Vol. 9, Issue 12
GenePattern flow cytometry suite
journal, January 2013
- Spidlen, Josef; Barsky, Aaron; Breuer, Karin
- Source Code for Biology and Medicine, Vol. 8, Issue 1
Data analysis in flow cytometry: The future just started
journal, April 2010
- Lugli, Enrico; Roederer, Mario; Cossarizza, Andrea
- Cytometry Part A, Vol. 77A, Issue 7
Optimizing transformations for automated, high throughput analysis of flow cytometry data
journal, November 2010
- Finak, Greg; Perez, Juan-Manuel; Weng, Andrew
- BMC Bioinformatics, Vol. 11, Issue 1
Immunophenotyping of leukemia
journal, September 2000
- Campana, Dario; Behm, Frederick G.
- Journal of Immunological Methods, Vol. 243, Issue 1-2
flowVS: channel-specific variance stabilization in flow cytometry
journal, July 2016
- Azad, Ariful; Rajwa, Bartek; Pothen, Alex
- BMC Bioinformatics, Vol. 17, Issue 1
Data analysis in flow cytometry: The future just started
journal, April 2010
- Lugli, Enrico; Roederer, Mario; Cossarizza, Andrea
- Cytometry Part A, Vol. 77A, Issue 7
Optimizing transformations for automated, high throughput analysis of flow cytometry data
journal, November 2010
- Finak, Greg; Perez, Juan-Manuel; Weng, Andrew
- BMC Bioinformatics, Vol. 11, Issue 1
Flow cytometry histograms: Transformations, resolution, and display
journal, August 2008
- Novo, David; Wood, James
- Cytometry Part A, Vol. 73A, Issue 8
Amine reactive dyes: An effective tool to discriminate live and dead cells in polychromatic flow cytometry
journal, June 2006
- Perfetto, Stephen P.; Chattopadhyay, Pratip K.; Lamoreaux, Laurie
- Journal of Immunological Methods, Vol. 313, Issue 1-2
Data quality assessment of ungated flow cytometry data in high throughput experiments
journal, January 2007
- Le Meur, Nolwenn; Rossini, Anthony; Gasparetto, Maura
- Cytometry Part A, Vol. 71A, Issue 6
Hyperlog?A flexible log-like transform for negative, zero, and positive valued data
journal, January 2005
- Bagwell, C. Bruce
- Cytometry Part A, Vol. 64A, Issue 1
On Clustering Validation Techniques
journal, January 2001
- Halkidi, Maria; Batistakis, Yannis; Vazirgiannis, Michalis
- Journal of Intelligent Information Systems, Vol. 17, Issue 2/3, p. 107-145
CD56 antigenic expression in acute myeloid leukemia identifies patients with poor clinical prognosis
journal, July 2001
- Raspadori, D.; Damiani, D.; Lenoci, M.
- Leukemia, Vol. 15, Issue 8
Flow cytometry CD45 gating for immunophenotyping of acute myeloid leukemia
journal, November 1997
- Lacombe, F.; Durrieu, F.; Briais, A.
- Leukemia, Vol. 11, Issue 11
flowVS: channel-specific variance stabilization in flow cytometry
journal, July 2016
- Azad, Ariful; Rajwa, Bartek; Pothen, Alex
- BMC Bioinformatics, Vol. 17, Issue 1
Web-Based Analysis and Publication of Flow Cytometry Experiments
journal, July 2010
- Kotecha, Nikesh; Krutzik, Peter O.; Irish, Jonathan M.
- Current Protocols in Cytometry, Vol. 53, Issue 1
On Information and Sufficiency
journal, March 1951
- Kullback, S.; Leibler, R. A.
- The Annals of Mathematical Statistics, Vol. 22, Issue 1
The immunophenotype of acute myeloid leukemia: is there a relationship with prognosis?
journal, March 2006
- Mason, Kylie D.; Juneja, Surender K.; Szer, Jeff
- Blood Reviews, Vol. 20, Issue 2
flowCore: a Bioconductor package for high throughput flow cytometry
journal, April 2009
- Hahne, Florian; LeMeur, Nolwenn; Brinkman, Ryan R.
- BMC Bioinformatics, Vol. 10, Issue 1
Transformation Theory: How Normal is a Family of Distributions?
journal, June 1982
- Efron, Bradley
- The Annals of Statistics, Vol. 10, Issue 2
A Dendrite Method for Cluster Analysis
journal, January 1974
- Calinski, T.; Harabasz, J.
- Communications in Statistics - Simulation and Computation, Vol. 3, Issue 1
Modified box-cox transform for modulating the dynamic range of flow cytometry data
journal, November 1989
- Dvorak, James A.; Banks, Steven M.
- Cytometry, Vol. 10, Issue 6
Matching phosphorylation response patterns of antigen-receptor-stimulated T cells via flow cytometry
journal, March 2012
- Azad, Ariful; Pyne, Saumyadipta; Pothen, Alex
- BMC Bioinformatics, Vol. 13, Issue S2
Modified box-cox transform for modulating the dynamic range of flow cytometry data
journal, November 1989
- Dvorak, James A.; Banks, Steven M.
- Cytometry, Vol. 10, Issue 6
Bioconductor: open software development for computational biology and bioinformatics
journal, September 2004
- Gentleman, Robert C.; Carey, Vincent J.; Bates, Douglas M.
- Genome Biology, Vol. 5, Issue 10, p. R80
Automated gating of flow cytometry data via robust model-based clustering
journal, April 2008
- Lo, Kenneth; Brinkman, Ryan Remy; Gottardo, Raphael
- Cytometry Part A, Vol. 73A, Issue 4
A new “Logicle” display method avoids deceptive effects of logarithmic scaling for low signals and compensated data
journal, January 2006
- Parks, David R.; Roederer, Mario; Moore, Wayne A.
- Cytometry Part A, Vol. 69A, Issue 6
Amine reactive dyes: An effective tool to discriminate live and dead cells in polychromatic flow cytometry
journal, June 2006
- Perfetto, Stephen P.; Chattopadhyay, Pratip K.; Lamoreaux, Laurie
- Journal of Immunological Methods, Vol. 313, Issue 1-2
Elucidation of seventeen human peripheral blood B-cell subsets and quantification of the tetanus response using a density-based method for the automated identification of cell populations in multidimensional flow cytometry data
journal, January 2010
- Qian, Yu; Wei, Chungwen; Eun-Hyung Lee, F.
- Cytometry Part B: Clinical Cytometry, Vol. 78B, Issue S1
Web-Based Analysis and Publication of Flow Cytometry Experiments
journal, July 2010
- Kotecha, Nikesh; Krutzik, Peter O.; Irish, Jonathan M.
- Current Protocols in Cytometry, Vol. 53, Issue 1
Acute myeloid leukaemia
journal, November 2006
- Estey, Elihu; Döhner, Hartmut
- The Lancet, Vol. 368, Issue 9550
CD56 antigenic expression in acute myeloid leukemia identifies patients with poor clinical prognosis
journal, July 2001
- Raspadori, D.; Damiani, D.; Lenoci, M.
- Leukemia, Vol. 15, Issue 8
Criteria for the Diagnosis of Acute Leukemia of Megakaryocyte Lineage (M7): A Report of the French-American-British Cooperative Group
journal, September 1985
- Bennett, John M.
- Annals of Internal Medicine, Vol. 103, Issue 3
The Earth Mover's Distance as a Metric for Image Retrieval
journal, November 2000
- Rubner, Yossi
- International Journal of Computer Vision, Vol. 40, Issue 2, p. 99-121
Data reduction for spectral clustering to analyze high throughput flow cytometry data
text, January 2010
- Zare, Habil; Shooshtari, Parisa; Gupta, Arvind
- BioMed Central
Classifying Immunophenotypes With Templates From Flow Cytometry
conference, September 2013
- Azad, Ariful; Khan, Arif; Rajwa, Bartek
- BCB'13: ACM-BCB2013, Proceedings of the International Conference on Bioinformatics, Computational Biology and Biomedical Informatics
Works referencing / citing this record:
Guidelines for the use of flow cytometry and cell sorting in immunological studies (second edition)
journal, October 2019
- Cossarizza, Andrea; Chang, Hyun‐Dong; Radbruch, Andreas
- European Journal of Immunology, Vol. 49, Issue 10
Improved Unsupervised Color Segmentation Using a Modified Color Model and a Bagging Procedure in -Means++ Algorithm
journal, January 2018
- Chavolla, Edgar; Valdivia, Arturo; Diaz, Primitivo
- Mathematical Problems in Engineering, Vol. 2018