An experimentally derived data set constructed for testing large-scale DNA sequence assembly algorithms
Journal Article
·
· Genomics; (United States)
- California Inst. of Technology, Pasadena (United States)
A data set consisting of DNA sequences from a large-scale shotgun DNA cloning and sequencing project has been collected and posted for public release. The purpose is to propose a standard genomic DNA sequencing data set by which various algorithms and implementations can be tested. This set of data is divided into two subsets, one containing raw DNA sequence data (1023 clones) and the other consisting of the corresponding partially refined or edited DNA sequence data (820 clones). Suggested criteria or guidelines for this data refinement are presented so that algorithms for preprocessing and screening raw sequences may be developed. Development of such preprocessing, screening, aligning, and assembling algorithms will expedite large-scale DNA sequencing projects so that the complete unambiguous consensus DNA sequences will be made available to the general research community in a quicker manner. Smaller scale routine DNA sequencing projects will also be greatly aided by such computational efforts. 8 refs., 2 tabs.
- DOE Contract Number:
- FG03-91ER61182
- OSTI ID:
- 6886127
- Journal Information:
- Genomics; (United States), Journal Name: Genomics; (United States) Vol. 15:3; ISSN GNMCEP; ISSN 0888-7543
- Country of Publication:
- United States
- Language:
- English
Similar Records
Artificially generated data sets for testing DNA sequence assembly algorithms
Assembly of shotgun sequencing data
Manifold sequencing: Efficient processing of large sets of sequencing reactions
Journal Article
·
Wed Mar 31 23:00:00 EST 1993
· Genomics; (United States)
·
OSTI ID:6179640
Assembly of shotgun sequencing data
Conference
·
Mon Dec 30 23:00:00 EST 1996
·
OSTI ID:495284
Manifold sequencing: Efficient processing of large sets of sequencing reactions
Journal Article
·
Mon Mar 14 23:00:00 EST 1994
· Proceedings of the National Academy of Sciences of the United States of America
·
OSTI ID:86519