U.S. Department of Energy
Office of Scientific and Technical Information

An experimentally derived data set constructed for testing large-scale DNA sequence assembly algorithms

Journal Article · Genomics; (United States)
  1. California Inst. of Technology, Pasadena (United States)
A data set of DNA sequences from a large-scale shotgun DNA cloning and sequencing project has been collected and posted for public release. The purpose is to propose a standard genomic DNA sequencing data set against which various algorithms and implementations can be tested. The data set is divided into two subsets: one containing raw DNA sequence data (1023 clones) and the other containing the corresponding partially refined or edited DNA sequence data (820 clones). Suggested criteria or guidelines for this data refinement are presented so that algorithms for preprocessing and screening raw sequences may be developed. Development of such preprocessing, screening, aligning, and assembling algorithms will expedite large-scale DNA sequencing projects, so that complete, unambiguous consensus DNA sequences can be made available to the general research community more quickly. Smaller-scale routine DNA sequencing projects will also be greatly aided by such computational efforts. 8 refs., 2 tabs.
DOE Contract Number:
FG03-91ER61182
OSTI ID:
6886127
Journal Information:
Journal Name: Genomics; (United States); Vol. 15:3; ISSN GNMCEP; ISSN 0888-7543
Country of Publication:
United States
Language:
English

Similar Records

Artificially generated data sets for testing DNA sequence assembly algorithms
Journal Article · Wed Mar 31 23:00:00 EST 1993 · Genomics; (United States) · OSTI ID:6179640

Assembly of shotgun sequencing data
Conference · Mon Dec 30 23:00:00 EST 1996 · OSTI ID:495284

Manifold sequencing: Efficient processing of large sets of sequencing reactions
Journal Article · Mon Mar 14 23:00:00 EST 1994 · Proceedings of the National Academy of Sciences of the United States of America · OSTI ID:86519