Final progress report, Construction of a genome-wide highly characterized clone resource for genome sequencing
At TIGR, the human Bacterial Artificial Chromosome (BAC) end sequencing and trimming were with an overall sequencing success rate of 65%. CalTech human BAC libraries A, B, C and D as well as Roswell Park Cancer Institute's library RPCI-11 were used. To date, we have generated >300,000 end sequences from >186,000 human BAC clones with an average read length {approx}460 bp for a total of 141 Mb covering {approx}4.7% of the genome. Over sixty percent of the clones have BAC end sequences (BESs) from both ends representing over five-fold coverage of the genome by the paired-end clones. The average phred Q20 length is {approx}400 bp. This high accuracy makes our BESs match the human finished sequences with an average identity of 99% and a match length of 450 bp, and a frequency of one match per 12.8 kb contig sequence. Our sample tracking has ensured a clone tracking accuracy of >90%, which gives researchers a high confidence in (1) retrieving the right clone from the BA C libraries based on the sequence matches; and (2) building a minimum tiling path of sequence-ready clones across the genome and genome assembly scaffolds.
- Research Organization:
- The Institute for Genomic Research (US)
- Sponsoring Organization:
- USDOE Office of Energy Research (ER) (US)
- DOE Contract Number:
- FC02-97ER62500
- OSTI ID:
- 807742
- Resource Relation:
- Other Information: PBD: 14 Feb 2000
- Country of Publication:
- United States
- Language:
- English
Similar Records
A BAC-based physical map of the Drosophila buzzatii genome
Final Report: Complete Sequencing of the 2.3Mbp Genome of the Hyperthermophilic Archaeon Pyrbaculum Aerophilum, January 1, 1998 - December 31, 1998