Skip to main content
U.S. Department of Energy
Office of Scientific and Technical Information

Challenges in Whole-Genome Annotation of Pyrosequenced Eukaryotic Genomes

Conference ·
OSTI ID:957404

Pyrosequencing technologies such as 454/Roche and Solexa/Illumina vastly lower the cost of nucleotide sequencing compared to the traditional Sanger method, and thus promise to greatly expand the number of sequenced eukaryotic genomes. However, the new technologies also bring new challenges such as shorter reads and new kinds and higher rates of sequencing errors, which complicate genome assembly and gene prediction. At JGI we are deploying 454 technology for the sequencing and assembly of ever-larger eukaryotic genomes. Here we describe our first whole-genome annotation of a purely 454-sequenced fungal genome that is larger than a yeast (>30 Mbp). The pezizomycotine (filamentous ascomycote) Aspergillus carbonarius belongs to the Aspergillus section Nigri species complex, members of which are significant as platforms for bioenergy and bioindustrial technology, as members of soil microbial communities and players in the global carbon cycle, and as agricultural toxigens. Application of a modified version of the standard JGI Annotation Pipeline has so far predicted ~;;10k genes. ~;;12percent of these preliminary annotations suffer a potential frameshift error, which is somewhat higher than the ~;;9percent rate in the Sanger-sequenced and conventionally assembled and annotated genome of fellow Aspergillus section Nigri member A. niger. Also,>90percent of A. niger genes have potential homologs in the A. carbonarius preliminary annotation. Weconclude, and with further annotation and comparative analysis expect to confirm, that 454 sequencing strategies provide a promising substrate for annotation of modestly sized eukaryotic genomes. We will also present results of annotation of a number of other pyrosequenced fungal genomes of bioenergy interest.

Research Organization:
Ernest Orlando Lawrence Berkeley National Laboratory, Berkeley, CA (US)
Sponsoring Organization:
Genomics Division
DOE Contract Number:
AC02-05CH11231
OSTI ID:
957404
Report Number(s):
LBNL-1895E
Country of Publication:
United States
Language:
English

Similar Records

Sequencing the Black Aspergilli species complex
Technical Report · Thu Mar 10 23:00:00 EST 2011 · OSTI ID:1012480

De novo Assembly of a 40 Mb Eukaryotic Genome from Short Sequence Reads: Sordaria macrospora, a Model Organism for Fungal Morphogenesis
Journal Article · Thu Apr 08 00:00:00 EDT 2010 · PLoS Genetics · OSTI ID:1627282

Investigation of inter- and intraspecies variation through genome sequencing of Aspergillus section Nigri
Journal Article · Mon Oct 22 00:00:00 EDT 2018 · Nature Genetics · OSTI ID:1492689