Title: Classification of Complete Proteomes of Different Organisms and Protein Sets Based on Their Protein Distributions in Terms of Some Key Attributes of Proteins

Journal Article · International Journal of Genomics
DOI: https://doi.org/10.1155/2018/9784161 · OSTI ID: 1423699

The existence of complete genome sequences makes it important to develop different approaches for classifying large-scale data sets and for making the extraction of biological insights easier. Here, we propose an approach for classifying complete proteomes and protein sets based on protein distributions over some basic attributes. We demonstrate the usefulness of this approach by determining protein distributions in terms of two attributes: protein length (L) and protein intrinsic disorder content (ID). The protein distributions based on L and ID are surveyed for proteomes of representative organisms and for protein sets from the three domains of life. Two-dimensional maps (designated here as fingerprints) are then constructed from the protein distribution densities in the LD space defined by ln(L) and ID. The fingerprints for different organisms and protein sets are found to be distinct from each other, and they can therefore be used for comparative studies. As a test case, phylogenetic trees have been constructed based on the protein distribution densities in the fingerprints of proteomes of organisms, without performing any protein sequence comparison or alignment. The phylogenetic trees generated are biologically meaningful, demonstrating that the protein distributions in the LD space may serve as unique phylogenetic signals of the organisms at the proteome level.
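
The fingerprint idea in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the bin count, the ln(L) range, the treatment of ID as a fraction between 0 and 1, the Euclidean distance, and the function names fingerprint, fingerprint_distance and distance_matrix are all assumptions made for illustration, since the abstract does not specify them.

```python
import numpy as np


def fingerprint(lengths, disorder, n_bins=20, ln_len_range=(3.0, 10.0)):
    """Normalized 2D histogram of proteins in the (ln L, ID) plane.

    lengths  : protein lengths in residues (positive numbers)
    disorder : per-protein disorder content, ASSUMED to be a fraction in [0, 1]
    n_bins, ln_len_range : illustrative choices, not taken from the paper
    """
    ln_len = np.log(np.asarray(lengths, dtype=float))
    dis = np.asarray(disorder, dtype=float)
    hist, _, _ = np.histogram2d(
        ln_len, dis, bins=n_bins, range=[ln_len_range, (0.0, 1.0)]
    )
    total = hist.sum()
    # Proteins outside the chosen ranges are ignored by histogram2d.
    return hist / total if total > 0 else hist


def fingerprint_distance(fp_a, fp_b):
    """Euclidean distance between two fingerprints (an assumed metric)."""
    return float(np.sqrt(((fp_a - fp_b) ** 2).sum()))


def distance_matrix(fingerprints):
    """Symmetric matrix of pairwise distances between fingerprints."""
    n = len(fingerprints)
    dist = np.zeros((n, n))
    for i in range(n):
        for j in range(i + 1, n):
            d = fingerprint_distance(fingerprints[i], fingerprints[j])
            dist[i, j] = dist[j, i] = d
    return dist


if __name__ == "__main__":
    # Synthetic stand-in data only; these are NOT real proteomes.
    rng = np.random.default_rng(0)
    proteome_a = (rng.lognormal(5.6, 0.6, 2000), rng.beta(1.5, 8.0, 2000))
    proteome_b = (rng.lognormal(6.0, 0.7, 2000), rng.beta(2.5, 5.0, 2000))
    fps = [fingerprint(*proteome_a), fingerprint(*proteome_b)]
    print(distance_matrix(fps))
```

A pairwise distance matrix of this kind could be passed to any distance-based tree-building method. Which distance measure and tree-building method the authors actually used is not stated in the abstract.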

Research Organization:
Oak Ridge National Laboratory (ORNL), Oak Ridge, TN (United States)
Sponsoring Organization:
USDOE Office of Science (SC), Biological and Environmental Research (BER)
Grant/Contract Number:
SC0008834; AC05-00OR22725
OSTI ID:
1423699
Alternate ID(s):
OSTI ID: 1468038
Journal Information:
International Journal of Genomics, Vol. 2018; ISSN 2314-436X
Publisher:
Hindawi Publishing Corporation
Country of Publication:
Egypt
Language:
English
Citation Metrics:
Cited by: 3 works
Citation information provided by
Web of Science
