skip to main content
OSTI.GOV title logo U.S. Department of Energy
Office of Scientific and Technical Information

Title: MultiSeq: unifying sequence and structure data for evolutionary analysis

Journal Article · · BMC Bioinformatics
 [1];  [1];  [2];  [3]
  1. Univ. of Illinois at Urbana-Champaign, IL (United States). Center for Biophysics and Computational Biology
  2. Univ. of Illinois at Urbana-Champaign, IL (United States). Graduate School of Library and Information Sciences
  3. Univ. of Illinois at Urbana-Champaign, IL (United States). Center for Biophysics and Computational Biology; Univ. of Illinois at Urbana-Champaign, IL (United States). Dept. of Chemistry

Background: Since the publication of the first draft of the human genome in 2000, bioinformatic data have been accumulating at an overwhelming pace. Currently, more than 3 million sequences and 35 thousand structures of proteins and nucleic acids are available in public databases. Finding correlations in and between these data to answer critical research questions is extremely challenging. This problem needs to be approached from several directions: information science to organize and search the data; information visualization to assist in recognizing correlations; mathematics to formulate statistical inferences; and biology to analyze chemical and physical properties in terms of sequence and structure changes. Results: Here we present MultiSeq, a unified bioinformatics analysis environment that allows one to organize, display, align and analyze both sequence and structure data for proteins and nucleic acids. While special emphasis is placed on analyzing the data within the framework of evolutionary biology, the environment is also flexible enough to accommodate other usage patterns. The evolutionary approach is supported by the use of predefined metadata, adherence to standard ontological mappings, and the ability for the user to adjust these classifications using an electronic notebook. MultiSeq contains a new algorithm to generate complete evolutionary profiles that represent the topology of the molecular phylogenetic tree of a homologous group of distantly related proteins. The method, based on the multidimensional QR factorization of multiple sequence and structure alignments, removes redundancy from the alignments and orders the protein sequences by increasing linear dependence, resulting in the identification of a minimal basis set of sequences that spans the evolutionary space of the homologous group of proteins. Conclusion: MultiSeq is a major extension of the Multiple Alignment tool that is provided as part of VMD, a structural visualization program for analyzing molecular dynamics simulations. Both are freely distributed by the NIH Resource for Macromolecular Modeling and Bioinformatics and MultiSeq is included with VMD starting with version 1.8.5. The MultiSeq website has details on how to download and use the software: http://www.scs.uiuc.edu/~schulten/multiseq/

Research Organization:
Univ. of Illinois at Urbana-Champaign, IL (United States)
Sponsoring Organization:
USDOE Office of Science (SC), Biological and Environmental Research (BER). Biological Systems Science Division
Grant/Contract Number:
FG02-05ER64144
OSTI ID:
1626326
Journal Information:
BMC Bioinformatics, Vol. 7, Issue 1; ISSN 1471-2105
Publisher:
BioMed CentralCopyright Statement
Country of Publication:
United States
Language:
English

References (65)

SCOP database in 2004: refinements integrate structure and sequence family data journal January 2004
The ASTRAL Compendium in 2004 journal January 2004
Compilation of tRNA sequences and sequences of tRNA genes journal January 1998
CLUSTAL W: improving the sensitivity of progressive multiple sequence alignment through sequence weighting, position-specific gap penalties and weight matrix choice journal January 1994
Protein modelling for all journal September 1999
CATH – a hierarchic classification of protein domain structures journal August 1997
Gapped BLAST and PSI-BLAST: a new generation of protein database search programs journal September 1997
The relation between the divergence of sequence and structure in proteins. journal April 1986
tRNAscan-SE: A Program for Improved Detection of Transfer RNA Genes in Genomic Sequence journal March 1997
UCSF Chimera?A visualization system for exploratory research and analysis journal January 2004
CINEMA—a novel Colour INteractive Editor for Multiple Alignments journal October 1998
Evolutionary profiles from the QR factorization of multiple sequence alignments journal March 2005
NCBI Reference Sequence (RefSeq): a curated non-redundant sequence database of genomes, transcripts and proteins journal December 2004
DDBJ in preparation for overview of research activities behind data submissions journal January 2006
The evolutionary history of Cys-tRNACys formation journal December 2005
MEGA3: Integrated software for Molecular Evolutionary Genetics Analysis and sequence alignment journal January 2004
Basic local alignment search tool journal October 1990
Evolutionary information for specifying a protein fold journal September 2005
A graph-theory algorithm for rapid protein side-chain prediction journal September 2003
The Ribosomal Database Project (RDP-II): sequences and tools for high-throughput rRNA analysis journal December 2004
RASMOL: biomolecular graphics for all journal September 1995
The SWISS-PROT protein knowledgebase and its supplement TrEMBL in 2003 journal January 2003
The Diamond STING server journal July 2005
The Protein Data Bank journal January 2000
MrBayes 3: Bayesian phylogenetic inference under mixed models journal August 2003
Database resources of the National Center for Biotechnology Information journal January 2000
T-coffee: a novel method for fast and accurate multiple sequence alignment 1 1Edited by J. Thornton journal September 2000
Profile hidden Markov models journal October 1998
Cn3D: sequence and structure views for Entrez journal June 2000
The Comparative RNA Web (CRW) Site: an online database of comparative sequence and structure information for ribosomal, intron, and other RNAs journal January 2002
On the Evolution of Structure in Aminoacyl-tRNA Synthetases journal December 2003
VMD: Visual molecular dynamics journal February 1996
Multiple protein sequence alignment from tertiary structure comparison: Assignment of global and residue confidence levels journal October 1992
Multiple sequence alignments journal June 2005
EMBL Nucleotide Sequence Database: developments in 2005 journal January 2006
Evolutionary Profiles Derived from the QR Factorization of Multiple Structural Alignments Gives an Economy of Information journal February 2005
Protein family annotation in a multiple alignment viewer journal March 2003
Friend, an integrated analytical front-end application for bioinformatics journal August 2005
Knowledge-based protein secondary structure assignment journal December 1995
The CLUSTAL_X windows interface: flexible strategies for multiple sequence alignment aided by quality analysis tools journal December 1997
The Ribosomal Database Project (RDP-II): previewing a new autoaligner that allows regular updates and the new prokaryotic taxonomy journal January 2003
Aminoacyl-tRNA Synthetases, the Genetic Code, and the Evolutionary Process journal March 2000
A translation approach to portable ontology specifications journal June 1993
The Jalview Java alignment editor journal January 2004
The integrated microbial genomes (IMG) system journal January 2006
Assessment of Protein Distance Measures and Tree-Building Methods for Phylogenetic Tree Reconstruction journal July 2005
Multiple Alignment of protein structures and sequences for VMD journal December 2005
GenBank journal December 2004
Evolutionary trees from DNA sequences: A maximum likelihood approach journal November 1981
Summary: the modified nucleosides of RNA journal January 1994
Comparative Protein Structure Modeling of Genes and Genomes journal June 2000
MollDE: a homology modeling framework you can click with journal April 2005
Combining multiple structure and sequence alignments to improve sequence detection and alignment: Application to the SH2 domains of Janus kinases journal December 2001
Evaluating protein structure-prediction schemes using energy landscape theory journal May 2001
A standard reference frame for the description of nucleic acid base-pair geometry 1 1Edited by P. E. Wright 2 2This is a document of the Nomenclature Committee of IUBMB (NC-IUBMB)/IUPAC-IUBMB Joint Commission on Biochemical Nomenclature (JCBN), whose members are R. Cammack (chairman), A. Bairoch, H.M. Berman, S. Boyce, C.R. Cantor, K. Elliott, D. Horton, M. Kanehisa, A. Kotyk, G.P. Moss, N. Sharon and K.F. Tipton. journal October 2001
A Simple, Fast, and Accurate Algorithm to Estimate Large Phylogenies by Maximum Likelihood journal October 2003
The nucleic acid database. A comprehensive relational database of three-dimensional structures of nucleic acids journal September 1992
Comparative analyses of the mitochondrial genomes of the cattle tick Rhipicephalus microplus clades A and B from China journal April 2022
Molecular and biological characterization of an isolate of the potyvirus passiflora virus Y naturally infecting soybean (Glycine max) in Brazil journal September 2022
Which craft is best in bioinformatics? journal July 2001
Codon-optimized FAM132b gene therapy prevents dietary obesity by blockading adrenergic response and insulin action journal August 2022
Stepwise gating of the Sec61 protein-conducting channel by Sec63 and Sec62 journal January 2021
tRNAscan-SE: A Program for Improved Detection of Transfer RNA Genes in Genomic Sequence journal March 1997
Database resources of the National Center for Biotechnology Information journal January 2001
Cryo-EM structures reveal intricate Fe-S cluster arrangement and charging in Rhodobacter capsulatus formate dehydrogenase text January 2020

Cited By (109)

Single‐molecule imaging reveals the translocation and DNA looping dynamics of hepatitis C virus NS3 helicase journal March 2017
Diverse dynamics features of novel protein kinase C (PKC) isozymes determine the selectivity of a fluorinated balanol analogue for PKCε journal February 2019
All-atom molecular dynamics comparison of disease-associated zinc fingers text January 2017
Crystal Structure of Cytomegalovirus IE1 Protein Reveals Targeting of TRIM Family Member PML via Coiled-Coil Interactions journal November 2014
Mechanistic basis for the evolution of chalcone synthase catalytic cysteine reactivity in land plants journal September 2018
Mechanism of the electroneutral sodium/proton antiporter PaNhaP from transition-path shooting journal April 2019
Pathways to disease from natural variations in human cytoplasmic tRNAs journal April 2019
Dynamical networks in tRNA:protein complexes journal April 2009
Protein–protein interactions within photosystem II under photoprotection: the synergy between CP29 minor antenna, subunit S (PsbS) and zeaxanthin at all-atom resolution journal January 2018
Structural basis for the modulation of plant cytosolic triosephosphate isomerase activity by mimicry of redox‐based modifications journal June 2019
A novel virus genome discovered in an extreme environment suggests recombination between unrelated groups of RNA and DNA viruses journal January 2012
Structural dynamics of inosine triphosphate pyrophosphatase (ITPA) protein and two clinically relevant mutants: molecular dynamics simulations journal March 2020
The TATA‐binding Protein DNA‐binding domain of eukaryotic parasites is a potentially druggable target journal October 2019
Structure of pyrrolysyl-tRNA synthetase, an archaeal enzyme for genetic code innovation journal June 2007
Neutron and X-ray crystal structures of Lactobacillus brevis alcohol dehydrogenase reveal new insights into hydrogen-bonding pathways journal November 2018
GASP‐1 and GASP‐2, two closely structurally related proteins with a functional duality in antitrypsin inhibition specificity: a mechanistic point of view journal October 2019
A conserved molecular switch in Class F receptors regulates receptor activation and pathway selection journal February 2019
Molecular dynamics simulations of nucleotide release from the circadian clock protein KaiC reveal atomic-resolution functional insights journal November 2018
Coupling between an electrostatic network and the Zn2+ binding site modulates Hv1 activation journal May 2018
The high-affinity calcium sensor synaptotagmin-7 serves multiple roles in regulated exocytosis journal May 2018
Probing the impact of nairovirus genomic diversity on viral ovarian tumor domain protease (vOTU) structure and deubiquitinase activity journal January 2019
The Crystal Structure of Human Soluble CD14 Reveals a Bent Solenoid with a Hydrophobic Amino-Terminal Pocket journal December 2012
Bifunctional ADP-dependent phosphofructokinase/glucokinase activity in the order Methanococcales - biochemical characterization of the mesophilic enzyme from Methanococcus maripaludis journal March 2014
Functional divergence between evolutionary‐related LuxG and Fre oxidoreductases of luminous bacteria journal April 2019
Selenomodification of tRNA in archaea requires a bipartite rhodanese enzyme journal January 2012
Gld2 activity is regulated by phosphorylation in the N-terminal domain text January 2019
Gld2 activity is regulated by phosphorylation in the N-terminal domain journal May 2019
Structural insight into β-Clamp and its interaction with DNA Ligase in Helicobacter pylori journal August 2016
Gld2 activity is regulated by phosphorylation in the N-terminal domain text January 2020
Evidence for a Non-Catalytic Ion-Binding Site in Multiple RNA-Dependent RNA Polymerases journal July 2012
All-atom molecular dynamics comparison of disease-associated zinc fingers text January 2017
Structural and dynamic basis of substrate permissiveness in hydroxycinnamoyltransferase (HCT) journal October 2018
A QM/MM approach on the structural and stereoelectronic factors governing glycosylation by GTF-SI from Streptococcus mutans journal January 2018
Structure and function of Toll/interleukin-1 receptor/resistance protein (TIR) domains journal December 2014
Molecular model of human heparanase with proposed binding mode of a heparan sulfate oligosaccharide and catalytic amino acids journal July 2011
Structure of an archaeal non-discriminating glutamyl-tRNA synthetase: a missing link in the evolution of Gln-tRNAGln formation journal July 2010
The first plant acyl-CoA-binding protein structures: the close homologues OsACBP1 and OsACBP2 from rice journal April 2017
Effect of catalytic subunit phosphorylation on the properties of SnRK1 from Phaseolus vulgaris embryos journal July 2018
Structure functional insights into calcium binding during the activation of coagulation factor XIII A journal August 2019
Nucleotide-dependent conformations of FtsZ dimers and force generation observed through molecular dynamics simulations journal May 2012
Genome-Wide Identification and Evolution of HECT Genes in Soybean journal April 2015
Histone H2A and H4 N-terminal Tails Are Positioned by the MEP50 WD Repeat Protein for Efficient Methylation by the PRMT5 Arginine Methyltransferase journal April 2015
Characterization and evolutionary history of an archaeal kinase involved in selenocysteinyl-tRNA formation journal January 2008
Structural dynamics of inosine triphosphate pyrophosphatase (ITPA) protein and two clinically relevant mutants: molecular dynamics simulations text January 2020
Atomistic probing of aptameric binding of CD19 outer membrane domain reveals an “aptamer walking” mechanism journal January 2020
Profiling the interaction of 1‐phenylbenzimidazoles to cyclooxygenases
  • Gómez‐Castro, Carlos Z.; López‐Martínez, Margarita; Hernández‐Pineda, Jessica
  • Journal of Molecular Recognition, Vol. 32, Issue 11 https://doi.org/10.1002/jmr.2801
journal July 2019
Fine-tuned preparation of cross-linked laccase nanoaggregates journal May 2019
Insights into the Fold Organization of TIM Barrel from Interaction Energy Based Structure Networks journal May 2012
Mapping of heparin/heparan sulfate binding sites on αvβ3 integrin by molecular docking: HEPARIN - αvβ3 INTEGRIN COMPLEX journal January 2013
Mechanistic basis for the evolution of chalcone synthase catalytic cysteine reactivity in land plants journal November 2018
NetworkView: 3D display and analysis of protein{middle dot}RNA interaction networks journal September 2012
Structural Insights into the Polyphyletic Origins of Glycyl tRNA Synthetases text January 2016
Static Clathrin Assemblies at the Peripheral Vacuole-Plasma Membrane Interface of the Parasitic Protozoan Giardia lamblia. text January 2016
Cooperative recruitment of Yan via a high-affinity ETS supersite organizes repression to confer specificity and robustness to cardiac cell fate specification journal March 2018
Escherichia coli B2 strains prevalent in inflammatory bowel disease patients have distinct metabolic capabilities that enable colonization of intestinal mucosa journal June 2018
Long distance electron transfer through the aqueous solution between redox partner proteins journal December 2018
Identification and characterization of NF-YB family genes in tung tree journal June 2015
Crystal structure and its bearing towards an understanding of key biological functions of EpCAM journal August 2014
Approaches for Designing new Potent Inhibitors of Farnesyl Pyrophosphate Synthase journal February 2016
Electron cryo-microscopy structure of the canonical TRPC4 ion channel journal May 2018
Structural Insights into the Polyphyletic Origins of Glycyl tRNA Synthetases journal July 2016
Gld2 activity is regulated by phosphorylation in the N-terminal domain text January 2020
Measles Virus Bearing Measles Inclusion Body Encephalitis-Derived Fusion Protein Is Pathogenic after Infection via the Respiratory Route journal February 2019
Resurrection of efficient Precambrian endoglucanases for lignocellulosic biomass hydrolysis journal July 2019
All-atom molecular dynamics comparison of disease-associated zinc fingers journal August 2017
Structural dynamics of inosine triphosphate pyrophosphatase (ITPA) protein and two clinically relevant mutants: molecular dynamics simulations text January 2020
Structures of the prefusion form of measles virus fusion protein in complex with inhibitors journal February 2018
Static Clathrin Assemblies at the Peripheral Vacuole—Plasma Membrane Interface of the Parasitic Protozoan Giardia lamblia journal July 2016
Identification of sesquiterpene synthases from the Basidiomycota Coniophora puteana for the efficient and highly selective β-copaene and cubebol production in E. coli journal October 2018
Naturally occurring aminoacyl-tRNA synthetases editing-domain mutations that cause mistranslation in Mycoplasma parasites journal May 2011
Biogenesis of cytochrome c oxidase — in vitro approaches to study cofactor insertion into a bacterial subunit I journal July 2008
Structural Determinants of Cadherin-23 Function in Hearing and Deafness journal January 2010
Theoretical and Computational Investigation of Flagellin Translocation and Bacterial Flagellum Growth journal June 2011
Conformational Coupling of the Nucleotide-Binding and the Transmembrane Domains in ABC Transporters journal August 2011
Can all heritable biology really be reduced to a single dimension? journal March 2016
A structural analysis of the AAA+ domains in Saccharomyces cerevisiae cytoplasmic dynein journal June 2014
Regulation of Phosphoribosyl-Linked Serine Ubiquitination by Deubiquitinases DupA and DupB journal January 2020
Trimerization of dopamine transporter triggered by AIM-100 binding: Molecular mechanism and effect of mutations journal December 2019
Microsecond Molecular Dynamics Simulations of Influenza Neuraminidase Suggest a Mechanism for the Increased Virulence of Stalk-Deletion Mutants journal May 2016
Src activation by β-adrenoreceptors is a key switch for tumour metastasis journal January 2013
A pathway for protective quenching in antenna proteins of Photosystem II journal May 2017
Search for non-lactam inhibitors of mtb β-lactamase led to its open shape in apo state: new concept for antibiotic design journal July 2017
Comparative sequence analysis suggests a conserved gating mechanism for TRP channels journal June 2015
Boulder ALignment Editor (ALE): a web-based RNA alignment tool journal May 2011
Structural insights into RNA-dependent eukaryal and archaeal selenocysteine formation journal December 2007
The Molecular Dynamics ofTrypanosoma bruceiUDP‐Galactose 4′‐Epimerase: A Drug Target for African Sleeping Sickness journal May 2012
Flexible mapping of homology onto structure with Homolmapper journal April 2007
Horizontal gene transfer of zinc and non-zinc forms of bacterial ribosomal protein S4 journal January 2009
Graphical analysis of pH-dependent properties of proteins predicted using PROPKA journal January 2011
Quantifying Intramolecular Binding in Multivalent Interactions: A Structure-Based Synergistic Study on Grb2-Sos1 Complex journal October 2011
Membrane Sculpting by F-BAR Domains Studied by Molecular Dynamics Simulations journal January 2013
Utilizing a Dynamical Description of IspH to Aid in the Development of Novel Antimicrobial Drugs journal December 2013
X-Ray Structure Reveals a New Class and Provides Insight into Evolution of Alkaline Phosphatases journal July 2011
Conformational Stability Analyses of Alpha Subunit I Domain of LFA-1 and Mac-1 journal August 2011
Characterization of Danio rerio Mn2+-Dependent ADP-Ribose/CDP-Alcohol Diphosphatase, the Structural Prototype of the ADPRibase-Mn-Like Protein Family journal July 2012
Exploring PHD Fingers and H3K4me0 Interactions with Molecular Dynamics Simulations and Binding Free Energy Calculations: AIRE-PHD1, a Comparative Study journal October 2012
Crystal Structure, SAXS and Kinetic Mechanism of Hyperthermophilic ADP-Dependent Glucokinase from Thermococcus litoralis Reveal a Conserved Mechanism for Catalysis journal June 2013
Homology Modeling of the CheW Coupling Protein of the Chemotaxis Signaling Complex journal August 2013
Structural-Functional Analysis Reveals a Specific Domain Organization in Family GH20 Hexosaminidases journal May 2015
Characterization and expression analysis of Galnts in developing Strongylocentrotus purpuratus embryos journal April 2017
Linking epigenetic function to electrostatics: The DNMT2 structural model example journal June 2017
Comparative genomic analyses of the cyanobacterium, Lyngbya aestuarii BL J, a powerful hydrogen producer journal January 2013
Antifungal Activity against Filamentous Fungi of Ts1, a Multifunctional Toxin from Tityus serrulatus Scorpion Venom journal June 2017
Polyol specificity of recombinant Arabidopsis thaliana sorbitol dehydrogenase studied by enzyme kinetics and in silico modeling journal February 2015
A Comparative Study of the Structural Dynamics of Four Terminal Uridylyl Transferases journal June 2017
Studying RNA homology and conservation with Infernal: from single sequences to RNA families preprint January 2012
From Molecular Phylogenetics to Quantum Chemistry: Discovering Enzyme Design Principles Through Computation journal September 2012
Annotation of gene sequence and protein structure of brinjal EDS1 journal March 2017
Proton currents constrain structural models of voltage sensor activation journal August 2016