Skip to main content
U.S. Department of Energy
Office of Scientific and Technical Information

Large-Scale Trends in the Evolution of Gene Structures within 11 Animal Genomes

Journal Article · · PLoS Computational Biology (Online)
 [1];  [2];  [3];  [4];  [3];  [3];  [3];  [5]
  1. University of California, Berkeley, CA (United States); Lawrence Berkeley National Laboratory (LBNL), Berkeley, CA (United States); DOE/OSTI
  2. University of California, Berkeley, CA (United States)
  3. Lawrence Berkeley National Laboratory (LBNL), Berkeley, CA (United States)
  4. Lawrence Berkeley National Lab. (LBNL), Berkeley, CA (United States). Dept. of Genome Sciences
  5. University of California, Berkeley, CA (United States); Lawrence Berkeley National Laboratory (LBNL), Berkeley, CA (United States)
We have used the annotations of six animal genomes (Homo sapiens, Mus musculus, Ciona intestinalis, Drosophila melanogaster, Anopheles gambiae, and Caenorhabditis elegans) together with the sequences of five unannotated Drosophila genomes to survey changes in protein sequence and gene structure over a variety of timescales—from the less than 5 million years since the divergence of D. simulans and D. melanogaster to the more than 500 million years that have elapsed since the Cambrian explosion. To do so, we have developed a new open-source software library called CGL (for ‘‘Comparative Genomics Library’’). Our results demonstrate that change in intron–exon structure is gradual, clock-like, and largely independent of coding-sequence evolution. This means that genome annotations can be used in new ways to inform, corroborate, and test conclusions drawn from comparative genomics analyses that are based upon protein and nucleotide sequence similarities.
Research Organization:
Lawrence Berkeley National Laboratory (LBNL), Berkeley, CA (United States)
Sponsoring Organization:
Howard Hughes Medical Institute; National Institutes of Health (NIH); USDOE Office of Science (SC), Biological and Environmental Research (BER)
Grant/Contract Number:
AC02-05CH11231
OSTI ID:
1627180
Journal Information:
PLoS Computational Biology (Online), Journal Name: PLoS Computational Biology (Online) Journal Issue: 3 Vol. 2; ISSN 1553-7358
Publisher:
Public Library of ScienceCopyright Statement
Country of Publication:
United States
Language:
English

References (44)

Amino acid substitution matrices from an information theoretic perspective journal June 1991
Genomic clocks and evolutionary timescales journal April 2003
The evolution of spliceosomal introns journal December 2002
Initial sequencing and analysis of the human genome journal February 2001
Evidence for a clade of nematodes, arthropods and other moulting animals journal May 1997
Initial sequencing and comparative analysis of the mouse genome journal January 2002
Origins of recently gained introns in Caenorhabditis journal July 2004
Resolution of a deep animal divergence by the pattern of intron conservation journal March 2005
More on the sequencing of the human genome journal March 2003
Amino acid substitution matrices from protein blocks. journal November 1992
Selective Constraints on Intron Evolution in Drosophila journal December 2003
Temporal Patterns of Fruit Fly (Drosophila) Evolution Revealed by Mutation Clocks journal October 2003
The Evolutionary Gain of Spliceosomal Introns: Sequence and Phase Preferences journal March 2004
Ensembl 2004 journal January 2004
Prevalence of intron gain over intron loss in the evolution of paralogous gene families journal July 2004
FlyBase: genes and gene models journal December 2004
WormBase: a comprehensive data resource for Caenorhabditis biology and genomics journal December 2004
Estimating Synonymous and Nonsynonymous Substitution Rates Under Realistic Evolutionary Models journal January 2000
Conservation, Regulation, Synteny, and Introns in a Large-scale C. briggsae-C. elegans Genomic Alignment journal August 2000
Indel-Based Evolutionary Distance and Mouse-Human Divergence journal August 2004
The Bioperl Toolkit: Perl Modules for the Life Sciences journal October 2002
A Computer Program for Aligning a cDNA Sequence with a Genomic DNA Sequence journal September 1998
The Genome Sequence of the Malaria Mosquito Anopheles gambiae journal October 2002
Genome Sequence of the Nematode C. elegans: A Platform for Investigating Biology journal December 1998
Comparative Genomics of the Eukaryotes journal March 2000
Amino acid substitution matrices from an information theoretic perspective journal June 1991
Basic local alignment search tool journal October 1990
Genomic clocks and evolutionary timescales journal April 2003
The evolution of spliceosomal introns journal December 2002
Molecular Evolution: Introns Fall into Place journal May 2004
Analysis of the genome sequence of the flowering plant Arabidopsis thaliana journal December 2000
Nonlinear partial differential equations and applications: The signal of ancient introns is obscured by intron density and homolog number journal November 2002
Bootstrap confidence levels for phylogenetic trees journal November 1996
Toward a resolution of the introns early/late debate: Only phase zero introns are correlated with the structure of ancient proteins journal April 1998
Phylogenetic Mapping of Intron Positions: A Case Study of Translation Initiation Factor eIF2γ journal September 2004
Splicing signals in Drosophila : intron size, information content, and consensus sequences journal January 1992
FlyBase: a Drosophila database journal January 1997
Comparative genome sequencing of Drosophila pseudoobscura: Chromosomal, gene, and cis-element evolution journal January 2005
Retroposed New Genes Out of the X in Drosophila journal November 2002
The Sequence of the Human Genome journal February 2001
The Draft Genome of Ciona intestinalis: Insights into Chordate and Vertebrate Origins journal December 2002
Determining Divergence Times of the Major Kingdoms of Living Organisms with a Protein Clock journal January 1996
Molecular Evidence for Deep Precambrian Divergences Among Metazoan Phyla journal October 1996
Genome-scale evidence of the nematode-arthropod clade journal January 2005

Cited By (34)

Comparative genomics approach to detecting split-coding regions in a low-coverage genome: lessons from the chimaera Callorhinchus milii (Holocephali, Chondrichthyes) journal June 2011
ASGARD: an open-access database of annotated transcriptomes for emerging model arthropod species journal January 2012
Comparative Analysis of Teleost Genome Sequences Reveals an Ancient Intron Size Expansion in the Zebrafish Lineage journal January 2011
Splice Sites Seldom Slide: Intron Evolution in Oomycetes journal July 2016
Quantitative measures for the management and comparison of annotated genomes journal January 2009
The nuclear OXPHOS genes in insecta: a common evolutionary origin, a common cis-regulatory motif, a common destiny for gene duplicates journal November 2007
Accumulation of CTCF-binding sites drives expression divergence between tandemly duplicated genes in humans journal January 2014
Huntingtin gene evolution in Chordata and its peculiar features in the ascidian Ciona genus journal November 2006
A cricket Gene Index: a genomic resource for studying neurobiology, speciation, and molecular evolution journal January 2007
Genome sequence of the necrotrophic plant pathogen Pythium ultimum reveals original pathogenicity mechanisms and effector repertoire journal January 2010
A standard variation file format for human genome sequences journal January 2010
Genome-Wide Analysis of Human Disease Alleles Reveals That Their Locations Are Correlated in Paralogous Proteins journal November 2008
The Agassiz’s desert tortoise genome provides a resource for the conservation of a threatened species journal May 2017
A Novel Model for Development, Organization, and Function of Gonadotropes in Fish Pituitary journal October 2014
The Fibrillar Collagen Family journal January 2010
Additional file 1 of Protein length distribution is remarkably uniform across the tree of life dataset January 2023
Additional file 3 of Protein length distribution is remarkably uniform across the tree of life dataset January 2023
Additional file 4 of Protein length distribution is remarkably uniform across the tree of life dataset January 2023
Additional file 5 of Protein length distribution is remarkably uniform across the tree of life dataset January 2023
Additional file 6 of Protein length distribution is remarkably uniform across the tree of life dataset January 2023
Additional file 7 of Protein length distribution is remarkably uniform across the tree of life dataset January 2023
Diversity of animal small RNA pathways and their biological utility: Diversity of animal small RNA pathways journal November 2011
Accelerated Evolutionary Rate of Housekeeping Genes in Tunicates journal August 2010
Phylogenetic analysis and evolution of aromatic amino acid hydroxylase journal November 2010
Intronic microRNA precursors that bypass Drosha processing journal June 2007
Gene length as a biological timer to establish temporal transcriptional regulation journal January 2017
A Chado case study: an ontology-based modular schema for representing genome-associated biological information journal July 2007
Endogenous Mechanisms for the Origins of Spliceosomal Introns journal July 2009
Large-Scale Parsimony Analysis of Metazoan Indels in Protein-Coding Genes journal October 2009
Evolution of the Nitric Oxide Synthase Family in Metazoans journal July 2010
The Chlamydomonas Genome Reveals the Evolution of Key Animal and Plant Functions journal October 2007
In with the Old, in with the New: The Promiscuity of the Duplication Process Engenders Diverse Pathways for Novel Gene Creation journal January 2012
COGNATE: comparative gene annotation characterizer journal July 2017
A General Definition and Nomenclature for Alternative Splicing Events journal August 2008

Similar Records

Intra-species sequence comparisons for annotating genomes
Journal Article · Thu Jul 15 00:00:00 EDT 2004 · Genome Research · OSTI ID:838983

Unlocking the secrets of the genome
Journal Article · Tue Jun 16 20:00:00 EDT 2009 · Nature (London) · OSTI ID:1815440

A second-generation assembly of the Drosophila simulans genome provides new insights into patterns of lineage-specific divergence
Journal Article · Wed Aug 29 20:00:00 EDT 2012 · Genome Research · OSTI ID:1625619