DNA sequence confidence estimation

Lipshutz, R J; Taverner, F; Hennessy, K; Hartzell, G; Davis, R

doi:10.1006/geno.1994.1089

Journal Article · 1 February 1994 · Genomics; (United States)

DOI: https://doi.org/10.1006/geno.1994.1089 · OSTI ID: 6482756

Lipshutz, R J [1]; Taverner, F [2]; Hennessy, K [3]; Hartzell, G [4]; Davis, R [5]

[1] Affymetrix, Santa Clara, CA (United States)
[2] Daniel H. Wagner Associates, Sunnyvale, CA (United States)
[3] Applied Biosystems, Inc., Foster City, CA (United States)
[4] Univ. of California, Berkeley, CA (United States)
[5] Stanford Univ., CA (United States)

A significant bottleneck in the current DNA sequencing process is the manual editing of trace data generated by automated DNA sequencers. This step is used to correct base calls and to assign a confidence level to each base call. The confidence levels are used in the assembly process to determine overlaps and to resolve discrepancies when determining the consensus sequence. This single step may cost as much as 4 to 8 cents per finished base. The authors report an approach to automated trace editing that uses classification trees to detect and exploit context-based patterns in trace peak heights. Local base composition and nearby peak heights account for 80% of the variation in peak heights. Classification algorithms were developed that identify 37% of the automated base calls that differ from the consensus sequence. With these algorithms, 12% of the base calls had confidence levels below 90%. 16 refs., 7 figs., 3 tabs.

OSTI ID: 6482756

Journal Information: Genomics; (United States), Vol. 19:3; ISSN 0888-7543

Country of Publication: United States

Language: English

Similar Records

DNA BP CALLING SW. Automated DNA Base Pair Calling Algorithm

Technical Report · 12 June 1999 · OSTI ID:6482756

Yeung, E S; Hall, D R

Automated DNA Base Pair Calling Algorithm

Software · 7 July 1999 · OSTI ID:6482756

Yeung, Edward S.; Hall, David R.; Hazen, Kevin H.

Improved predictions of transcription factor binding sites using physicochemical features of DNA

Journal Article · 24 August 2012 · Nucleic Acids Research · OSTI ID:6482756

Maienschein-Cline, Mark; Dinner, Aaron R.; Hlavacek, William S.; +1 more

Related Subjects

59 BASIC BIOLOGICAL SCIENCES
99 GENERAL AND MISCELLANEOUS//MATHEMATICS, COMPUTING, AND INFORMATION SCIENCE
AUTOMATION
ALGORITHMS
DNA SEQUENCING
MATHEMATICAL LOGIC
STRUCTURAL CHEMICAL ANALYSIS
550400* - Genetics
550200 - Biochemistry
990200 - Mathematics & Computers
