DOE PAGES title logo U.S. Department of Energy
Office of Scientific and Technical Information

Title: A Computational Framework for Proteome-Wide Pursuit and Prediction of Metalloproteins using ICP-MS and MS/MS Data

Abstract

Metal-containing proteins comprise a diverse and sizable category within the proteomes of organisms, ranging from proteins that use metals to catalyze reactions to proteins in which metals play key structural roles. Unfortunately, reliably predicting that a protein will contain a specific metal from its amino acid sequence is not currently possible. We recently developed a generally-applicable experimental technique for finding metalloproteins on a genome-wide scale. Applying this metal-directed protein purification approach (ICP-MS and MS/MS based) to the prototypical microbe Pyrococcus furiosus conclusively demonstrated the extent and diversity of the uncharacterized portion of microbial metalloproteomes since a majority of the observed metal peaks could not be assigned to known or predicted metalloproteins. However, even using this technique, it is not technically feasible to purify to homogeneity all metalloproteins in an organism. In order to address these limitations and complement the metal-directed protein purification, we developed a computational infrastructure and statistical methodology to aid in the pursuit and identification of novel metalloproteins. We demonstrate that our methodology enables predictions of metal-protein interactions using an experimental data set derived from a chromatography fractionation experiment in which 870 proteins and 10 metals were measured over 2,589 fractions. For each of the 10 metals, cobalt,more » iron, manganese, molybdenum, nickel, lead, tungsten, uranium, vanadium, and zinc, clusters of proteins frequently occurring in metal peaks (of a specific metal) within the fractionation space were defined. This resulted in predictions that there are from 5 undiscovered vanadium- to 13 undiscovered cobalt-containing proteins in Pyrococcus furiosus. Molybdenum and nickel were chosen for additional assessment producing lists of genes predicted to encode metalloproteins or metalloprotein subunits, 22 for nickel including seven from known nickel-proteins, and 20 for molybdenum including two from known molybdo-proteins. The uncharacterized proteins are prime candidates for metal-based purification or recombinant approaches to validate these predictions. We conclude that the largely uncharacterized extent of native metalloproteomes can be revealed through analysis of the co-occurrence of metals and proteins across a fractionation space. This can significantly impact our understanding of metallobiochemistry, disease mechanisms, and metal toxicity, with implications for bioremediation, medicine and other fields.« less

Authors:
 [1];  [1];  [1];  [1];  [1];  [1];  [2];  [1];  [3];  [3];  [3];  [3];  [4];  [1]
  1. Univ. of Georgia, Athens, GA (United States). Dept. of Biochemistry and Molecular Biology
  2. Univ. of Georgia, Athens, GA (United States). Dept. of Biochemistry and Molecular Biology; Philadelphia College of Osteopathic Medicine, Suwanee, GA (United States)
  3. The Scipps Research Inst., La Jolla, CA (United States). Depts. of Molecular Biology and Chemistry. Scripps Center for Mass Spectrometry
  4. Lawrence Berkeley National Lab. (LBNL), Berkeley, CA (United States). Life Sciences Division
Publication Date:
Research Org.:
Lawrence Berkeley National Laboratory (LBNL), Berkeley, CA (United States)
Sponsoring Org.:
USDOE Office of Science (SC), Biological and Environmental Research (BER). Biological Systems Science Division
OSTI Identifier:
1626284
Alternate Identifier(s):
OSTI ID: 1788451
Grant/Contract Number:  
AC02-05CH11231
Resource Type:
Accepted Manuscript
Journal Name:
BMC Bioinformatics
Additional Journal Information:
Journal Volume: 12; Journal Issue: 1; Journal ID: ISSN 1471-2105
Publisher:
BioMed Central
Country of Publication:
United States
Language:
English
Subject:
59 BASIC BIOLOGICAL SCIENCES; 97 MATHEMATICS AND COMPUTING; Biochemistry & Molecular Biology; Biotechnology & Applied Microbiology; Mathematical & Computational Biology; 37 INORGANIC, ORGANIC, PHYSICAL, AND ANALYTICAL CHEMISTRY

Citation Formats

Lancaster, W. Andrew, Praissman, Jeremy L., Poole, Farris L., Cvetkovic, Aleksandar, Menon, Angeli Lal, Scott, Joseph W., Jenney, Francis E., Thorgersen, Michael P., Kalisiak, Ewa, Apon, Junefredo V., Trauger, Sunia A., Siuzdak, Gary, Tainer, John A., and W Adams, Michael W. A Computational Framework for Proteome-Wide Pursuit and Prediction of Metalloproteins using ICP-MS and MS/MS Data. United States: N. p., 2011. Web. doi:10.1186/1471-2105-12-64.
Lancaster, W. Andrew, Praissman, Jeremy L., Poole, Farris L., Cvetkovic, Aleksandar, Menon, Angeli Lal, Scott, Joseph W., Jenney, Francis E., Thorgersen, Michael P., Kalisiak, Ewa, Apon, Junefredo V., Trauger, Sunia A., Siuzdak, Gary, Tainer, John A., & W Adams, Michael W. A Computational Framework for Proteome-Wide Pursuit and Prediction of Metalloproteins using ICP-MS and MS/MS Data. United States. https://doi.org/10.1186/1471-2105-12-64
Lancaster, W. Andrew, Praissman, Jeremy L., Poole, Farris L., Cvetkovic, Aleksandar, Menon, Angeli Lal, Scott, Joseph W., Jenney, Francis E., Thorgersen, Michael P., Kalisiak, Ewa, Apon, Junefredo V., Trauger, Sunia A., Siuzdak, Gary, Tainer, John A., and W Adams, Michael W. Mon . "A Computational Framework for Proteome-Wide Pursuit and Prediction of Metalloproteins using ICP-MS and MS/MS Data". United States. https://doi.org/10.1186/1471-2105-12-64. https://www.osti.gov/servlets/purl/1626284.
@article{osti_1626284,
title = {A Computational Framework for Proteome-Wide Pursuit and Prediction of Metalloproteins using ICP-MS and MS/MS Data},
author = {Lancaster, W. Andrew and Praissman, Jeremy L. and Poole, Farris L. and Cvetkovic, Aleksandar and Menon, Angeli Lal and Scott, Joseph W. and Jenney, Francis E. and Thorgersen, Michael P. and Kalisiak, Ewa and Apon, Junefredo V. and Trauger, Sunia A. and Siuzdak, Gary and Tainer, John A. and W Adams, Michael W.},
abstractNote = {Metal-containing proteins comprise a diverse and sizable category within the proteomes of organisms, ranging from proteins that use metals to catalyze reactions to proteins in which metals play key structural roles. Unfortunately, reliably predicting that a protein will contain a specific metal from its amino acid sequence is not currently possible. We recently developed a generally-applicable experimental technique for finding metalloproteins on a genome-wide scale. Applying this metal-directed protein purification approach (ICP-MS and MS/MS based) to the prototypical microbe Pyrococcus furiosus conclusively demonstrated the extent and diversity of the uncharacterized portion of microbial metalloproteomes since a majority of the observed metal peaks could not be assigned to known or predicted metalloproteins. However, even using this technique, it is not technically feasible to purify to homogeneity all metalloproteins in an organism. In order to address these limitations and complement the metal-directed protein purification, we developed a computational infrastructure and statistical methodology to aid in the pursuit and identification of novel metalloproteins. We demonstrate that our methodology enables predictions of metal-protein interactions using an experimental data set derived from a chromatography fractionation experiment in which 870 proteins and 10 metals were measured over 2,589 fractions. For each of the 10 metals, cobalt, iron, manganese, molybdenum, nickel, lead, tungsten, uranium, vanadium, and zinc, clusters of proteins frequently occurring in metal peaks (of a specific metal) within the fractionation space were defined. This resulted in predictions that there are from 5 undiscovered vanadium- to 13 undiscovered cobalt-containing proteins in Pyrococcus furiosus. Molybdenum and nickel were chosen for additional assessment producing lists of genes predicted to encode metalloproteins or metalloprotein subunits, 22 for nickel including seven from known nickel-proteins, and 20 for molybdenum including two from known molybdo-proteins. The uncharacterized proteins are prime candidates for metal-based purification or recombinant approaches to validate these predictions. We conclude that the largely uncharacterized extent of native metalloproteomes can be revealed through analysis of the co-occurrence of metals and proteins across a fractionation space. This can significantly impact our understanding of metallobiochemistry, disease mechanisms, and metal toxicity, with implications for bioremediation, medicine and other fields.},
doi = {10.1186/1471-2105-12-64},
journal = {BMC Bioinformatics},
number = 1,
volume = 12,
place = {United States},
year = {Mon Feb 28 00:00:00 EST 2011},
month = {Mon Feb 28 00:00:00 EST 2011}
}

Works referenced in this record:

Coordination of selenium to molybdenum in formate dehydrogenase H from Escherichia coli.
journal, August 1994

  • Gladyshev, V. N.; Khangulov, S. V.; Axley, M. J.
  • Proceedings of the National Academy of Sciences, Vol. 91, Issue 16
  • DOI: 10.1073/pnas.91.16.7708

Absolute protein expression profiling estimates the relative contributions of transcriptional and translational regulation
journal, December 2006

  • Lu, Peng; Vogel, Christine; Wang, Rong
  • Nature Biotechnology, Vol. 25, Issue 1
  • DOI: 10.1038/nbt1270

Metallomics: the concept and methodology
journal, January 2009

  • Mounicou, Sandra; Szpunar, Joanna; Lobinski, Ryszard
  • Chemical Society Reviews, Vol. 38, Issue 4
  • DOI: 10.1039/b713633c

How do bacterial cells ensure that metalloproteins get the correct metal?
journal, January 2009

  • Waldron, Kevin J.; Robinson, Nigel J.
  • Nature Reviews Microbiology, Vol. 7, Issue 1
  • DOI: 10.1038/nrmicro2057

Development of a generic approach to native metalloproteomics: application to the quantitative identification of soluble copper proteins in Escherichia coli
journal, February 2009

  • Sevcenco, Ana-Maria; Krijger, Gerard C.; Pinkse, Martijn W. H.
  • JBIC Journal of Biological Inorganic Chemistry, Vol. 14, Issue 4
  • DOI: 10.1007/s00775-009-0477-9

A General Framework for Weighted Gene Co-Expression Network Analysis
journal, January 2005

  • Zhang, Bin; Horvath, Steve
  • Statistical Applications in Genetics and Molecular Biology, Vol. 4, Issue 1
  • DOI: 10.2202/1544-6115.1128

Absolute Quantification of Proteins by LCMS E : A Virtue of Parallel ms Acquisition
journal, October 2005

  • Silva, Jeffrey C.; Gorenstein, Marc V.; Li, Guo-Zhong
  • Molecular & Cellular Proteomics, Vol. 5, Issue 1
  • DOI: 10.1074/mcp.M500230-MCP200

Metalloproteins and metal sensing
journal, August 2009

  • Waldron, Kevin J.; Rutherford, Julian C.; Ford, Dianne
  • Nature, Vol. 460, Issue 7257
  • DOI: 10.1038/nature08300

Dynameomics: a multi-dimensional analysis-optimized database for dynamic protein data
journal, April 2008

  • Kehl, C.; Simms, A. M.; Toofanny, R. D.
  • Protein Engineering Design and Selection, Vol. 21, Issue 6
  • DOI: 10.1093/protein/gzn015

Het-PDB Navi.: A Database for Protein-Small Molecule Interactions
journal, January 2004


Molybdenum cofactors, enzymes and pathways
journal, August 2009

  • Schwarz, Günter; Mendel, Ralf R.; Ribbe, Markus W.
  • Nature, Vol. 460, Issue 7257
  • DOI: 10.1038/nature08302

NCBI reference sequences (RefSeq): a curated non-redundant sequence database of genomes, transcripts and proteins
journal, January 2007

  • Pruitt, K. D.; Tatusova, T.; Maglott, D. R.
  • Nucleic Acids Research, Vol. 35, Issue Database
  • DOI: 10.1093/nar/gkl842

Metalloproteomes: A Bioinformatic Approach
journal, October 2009

  • Andreini, Claudia; Bertini, Ivano; Rosato, Antonio
  • Accounts of Chemical Research, Vol. 42, Issue 10
  • DOI: 10.1021/ar900015x

Novel Multiprotein Complexes Identified in the Hyperthermophilic Archaeon Pyrococcus furiosus by Non-denaturing Fractionation of the Native Proteome
journal, November 2008

  • Menon, Angeli Lal; Poole, Farris L.; Cvetkovic, Aleksandar
  • Molecular & Cellular Proteomics, Vol. 8, Issue 4
  • DOI: 10.1074/mcp.M800246-MCP200

Crystal Structure of HypA, a Nickel-Binding Metallochaperone for [NiFe] Hydrogenase Maturation
journal, December 2009

  • Watanabe, Satoshi; Arai, Takayuki; Matsumi, Rie
  • Journal of Molecular Biology, Vol. 394, Issue 3
  • DOI: 10.1016/j.jmb.2009.09.030

Metallomics: lessons for metalliferous soil remediation
journal, June 2010


InterPro: the integrative protein signature database
journal, January 2009

  • Hunter, S.; Apweiler, R.; Attwood, T. K.
  • Nucleic Acids Research, Vol. 37, Issue Database
  • DOI: 10.1093/nar/gkn785

History of biological metal utilization inferred through phylogenomic analysis of protein structures
journal, May 2010

  • Dupont, C. L.; Butcher, A.; Valas, R. E.
  • Proceedings of the National Academy of Sciences, Vol. 107, Issue 23
  • DOI: 10.1073/pnas.0912491107

Metal Binding Affinity and Selectivity in Metalloproteins: Insights from Computational Studies
journal, June 2008


Molybdenum and tungsten enzymes: the xanthine oxidase family
journal, April 2006

  • Brondino, Carlos D.; Romão, Maria João; Moura, Isabel
  • Current Opinion in Chemical Biology, Vol. 10, Issue 2
  • DOI: 10.1016/j.cbpa.2006.01.034

Defining clusters from a hierarchical cluster tree: the Dynamic Tree Cut package for R
journal, November 2007


A Greedy Heuristic for the Set-Covering Problem
journal, August 1979


Protein-folding location can regulate manganese-binding versus copper- or zinc-binding
journal, October 2008

  • Tottey, Steve; Waldron, Kevin J.; Firbank, Susan J.
  • Nature, Vol. 455, Issue 7216
  • DOI: 10.1038/nature07340

Crystal Structure of Methanobacterium thermoautotrophicum Phosphoribosyl-AMP Cyclohydrolase HisI ,
journal, August 2005

  • Sivaraman, J.; Myers, Rebecca S.; Boju, Lorena
  • Biochemistry, Vol. 44, Issue 30
  • DOI: 10.1021/bi050472w

Microbial metalloproteomes are largely uncharacterized
journal, July 2010

  • Cvetkovic, Aleksandar; Menon, Angeli Lal; Thorgersen, Michael P.
  • Nature, Vol. 466, Issue 7307
  • DOI: 10.1038/nature09265

Metal ions in biological catalysis: from enzyme databases to general principles
journal, July 2008

  • Andreini, Claudia; Bertini, Ivano; Cavallaro, Gabriele
  • JBIC Journal of Biological Inorganic Chemistry, Vol. 13, Issue 8
  • DOI: 10.1007/s00775-008-0404-5

The challenges of determining metal–protein affinities
journal, January 2010

  • Xiao, Zhiguang; Wedd, Anthony G.
  • Natural Product Reports, Vol. 27, Issue 5
  • DOI: 10.1039/b906690j

Metallomics: lessons for metalliferous soil remediation
journal, June 2010


Metal ions in biological catalysis: from enzyme databases to general principles
journal, July 2008

  • Andreini, Claudia; Bertini, Ivano; Cavallaro, Gabriele
  • JBIC Journal of Biological Inorganic Chemistry, Vol. 13, Issue 8
  • DOI: 10.1007/s00775-008-0404-5

Development of a generic approach to native metalloproteomics: application to the quantitative identification of soluble copper proteins in Escherichia coli
journal, February 2009

  • Sevcenco, Ana-Maria; Krijger, Gerard C.; Pinkse, Martijn W. H.
  • JBIC Journal of Biological Inorganic Chemistry, Vol. 14, Issue 4
  • DOI: 10.1007/s00775-009-0477-9

Molybdenum and tungsten enzymes: the xanthine oxidase family
journal, April 2006

  • Brondino, Carlos D.; Romão, Maria João; Moura, Isabel
  • Current Opinion in Chemical Biology, Vol. 10, Issue 2
  • DOI: 10.1016/j.cbpa.2006.01.034

Structure and function of molybdopterin containing enzymes
journal, September 1997

  • João Romão, Maria; Knäblein, Jörg; Huber, Robert
  • Progress in Biophysics and Molecular Biology, Vol. 68, Issue 2-3
  • DOI: 10.1016/s0079-6107(97)00022-9

Metalloproteomes: A Bioinformatic Approach
journal, October 2009

  • Andreini, Claudia; Bertini, Ivano; Rosato, Antonio
  • Accounts of Chemical Research, Vol. 42, Issue 10
  • DOI: 10.1021/ar900015x

Crystal Structure of Methanobacterium thermoautotrophicum Phosphoribosyl-AMP Cyclohydrolase HisI ,
journal, August 2005

  • Sivaraman, J.; Myers, Rebecca S.; Boju, Lorena
  • Biochemistry, Vol. 44, Issue 30
  • DOI: 10.1021/bi050472w

Metalloproteins and metal sensing
journal, August 2009

  • Waldron, Kevin J.; Rutherford, Julian C.; Ford, Dianne
  • Nature, Vol. 460, Issue 7257
  • DOI: 10.1038/nature08300

Microbial metalloproteomes are largely uncharacterized
journal, July 2010

  • Cvetkovic, Aleksandar; Menon, Angeli Lal; Thorgersen, Michael P.
  • Nature, Vol. 466, Issue 7307
  • DOI: 10.1038/nature09265

Absolute protein expression profiling estimates the relative contributions of transcriptional and translational regulation
journal, December 2006

  • Lu, Peng; Vogel, Christine; Wang, Rong
  • Nature Biotechnology, Vol. 25, Issue 1
  • DOI: 10.1038/nbt1270

How do bacterial cells ensure that metalloproteins get the correct metal?
journal, January 2009

  • Waldron, Kevin J.; Robinson, Nigel J.
  • Nature Reviews Microbiology, Vol. 7, Issue 1
  • DOI: 10.1038/nrmicro2057

Metallomics: the concept and methodology
journal, January 2009

  • Mounicou, Sandra; Szpunar, Joanna; Lobinski, Ryszard
  • Chemical Society Reviews, Vol. 38, Issue 4
  • DOI: 10.1039/b713633c

The challenges of determining metal–protein affinities
journal, January 2010

  • Xiao, Zhiguang; Wedd, Anthony G.
  • Natural Product Reports, Vol. 27, Issue 5
  • DOI: 10.1039/b906690j

History of biological metal utilization inferred through phylogenomic analysis of protein structures
journal, May 2010

  • Dupont, C. L.; Butcher, A.; Valas, R. E.
  • Proceedings of the National Academy of Sciences, Vol. 107, Issue 23
  • DOI: 10.1073/pnas.0912491107

Coordination of selenium to molybdenum in formate dehydrogenase H from Escherichia coli.
journal, August 1994

  • Gladyshev, V. N.; Khangulov, S. V.; Axley, M. J.
  • Proceedings of the National Academy of Sciences, Vol. 91, Issue 16
  • DOI: 10.1073/pnas.91.16.7708

Het-PDB Navi.: A Database for Protein-Small Molecule Interactions
journal, January 2004


NCBI Reference Sequence (RefSeq): a curated non-redundant sequence database of genomes, transcripts and proteins
journal, December 2004

  • Pruitt, K. D.
  • Nucleic Acids Research, Vol. 33, Issue Database issue
  • DOI: 10.1093/nar/gki025

InterPro: the integrative protein signature database
journal, January 2009

  • Hunter, S.; Apweiler, R.; Attwood, T. K.
  • Nucleic Acids Research, Vol. 37, Issue Database
  • DOI: 10.1093/nar/gkn785

Characterization of a Novel Zinc-Containing, Lysine-Specific Aminopeptidase from the Hyperthermophilic Archaeon Pyrococcus furiosus
journal, March 2005


Metal Binding Affinity and Selectivity in Metalloproteins: Insights from Computational Studies
journal, June 2008


Mass Spectrometry-Based Label-Free Quantitative Proteomics
journal, January 2010

  • Zhu, Wenhong; Smith, Jeffrey W.; Huang, Chun-Ming
  • Journal of Biomedicine and Biotechnology, Vol. 2010
  • DOI: 10.1155/2010/840518

A Greedy Heuristic for the Set-Covering Problem
journal, August 1979


A General Framework for Weighted Gene Co-Expression Network Analysis
journal, January 2005

  • Zhang, Bin; Horvath, Steve
  • Statistical Applications in Genetics and Molecular Biology, Vol. 4, Issue 1
  • DOI: 10.2202/1544-6115.1128

Works referencing / citing this record:

Role of Mn 2+ and Compatible Solutes in the Radiation Resistance of Thermophilic Bacteria and Archaea
journal, January 2012


Decreased Copper in Alzheimer’s Disease Brain Is Predominantly in the Soluble Extractable Fraction
journal, January 2013

  • Rembach, Alan; Hare, Dominic J.; Lind, Monica
  • International Journal of Alzheimer's Disease, Vol. 2013
  • DOI: 10.1155/2013/623241

Role of Mn 2+ and Compatible Solutes in the Radiation Resistance of Thermophilic Bacteria and Archaea
journal, January 2012


Decreased Copper in Alzheimer’s Disease Brain Is Predominantly in the Soluble Extractable Fraction
journal, January 2013

  • Rembach, Alan; Hare, Dominic J.; Lind, Monica
  • International Journal of Alzheimer's Disease, Vol. 2013
  • DOI: 10.1155/2013/623241

Archaeal Genome Guardians Give Insights into Eukaryotic DNA Replication and Damage Response Proteins
journal, January 2014

  • Shin, David S.; Pratt, Ashley J.; Tainer, John A.
  • Archaea, Vol. 2014
  • DOI: 10.1155/2014/206735

Metalloproteomics: principles, challenges and applications to neurodegeneration
journal, January 2013