U.S. Department of Energy
Office of Scientific and Technical Information

DNA sequence assembly and genetic algorithms: new results and puzzling insights

Technical Report · OSTI ID:401855
;  [1]
  1. Univ. of Central Florida, Orlando, FL (United States)

Applying genetic algorithms to DNA sequence assembly is not a straightforward process. Non-standard and even counter-intuitive parameter settings have yielded significant improvements in performance, in solution quality, and in the range of problem sizes the algorithm can handle. Specifically, the solution time for a 10 kb data set was reduced by an order of magnitude, and a 20 kb data set that the genetic algorithm had previously failed to solve was solved in a time that represents only a linear increase over the 10 kb data set. Additionally, significant progress has been made on a 35 kb data set representing real biological data: a single-contig solution was found for a 752-fragment subset of the data set, and a 15-contig solution was found for the full data set. This paper discusses the new results, the modifications to the genetic algorithm used in the previous study, the experimental design process by which the new results were obtained, the questions raised by these results, and some preliminary attempts to explain them.
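
For readers unfamiliar with the approach, the following is a minimal illustrative sketch of how a genetic algorithm can be applied to fragment ordering. It is not the algorithm, the operators, or the parameter settings used in this report, none of which are given in the abstract. It assumes a permutation encoding of the fragment order, a fitness equal to the summed exact suffix-prefix overlap between adjacent fragments, order crossover, swap mutation, and simple truncation selection. All function names and default values (`overlap`, `fitness`, `order_crossover`, `swap_mutation`, `evolve`, `merge`, `pop_size`, `mutation_rate`) are hypothetical choices made for illustration.

```python
import random

def overlap(a, b):
    """Length of the longest suffix of a that is also a prefix of b (exact match)."""
    for k in range(min(len(a), len(b)), 0, -1):
        if a.endswith(b[:k]):
            return k
    return 0

def fitness(order, frags):
    """Sum of overlaps between fragments that are adjacent in the ordering."""
    return sum(overlap(frags[i], frags[j]) for i, j in zip(order, order[1:]))

def order_crossover(p1, p2, rng):
    """Copy a random slice from p1 in place; fill the rest in p2's relative order."""
    n = len(p1)
    i, j = sorted(rng.sample(range(n + 1), 2))
    middle = p1[i:j]
    kept = set(middle)
    rest = [g for g in p2 if g not in kept]
    return rest[:i] + middle + rest[i:]

def swap_mutation(order, rng):
    """Return a copy of the ordering with two positions exchanged."""
    child = list(order)
    i, j = rng.sample(range(len(child)), 2)
    child[i], child[j] = child[j], child[i]
    return child

def evolve(frags, pop_size=30, generations=200, mutation_rate=0.3, seed=0):
    """Toy GA: keep the better half each generation, refill with offspring."""
    rng = random.Random(seed)
    n = len(frags)
    pop = [rng.sample(range(n), n) for _ in range(pop_size)]
    for _ in range(generations):
        pop.sort(key=lambda o: fitness(o, frags), reverse=True)
        elite = pop[: pop_size // 2]
        children = []
        while len(elite) + len(children) < pop_size:
            p1, p2 = rng.sample(elite, 2)
            child = order_crossover(p1, p2, rng)
            if rng.random() < mutation_rate:
                child = swap_mutation(child, rng)
            children.append(child)
        pop = elite + children
    return max(pop, key=lambda o: fitness(o, frags))

def merge(order, frags):
    """Concatenate fragments in the given order, collapsing adjacent overlaps."""
    seq = frags[order[0]]
    for i, j in zip(order, order[1:]):
        seq += frags[j][overlap(frags[i], frags[j]):]
    return seq

if __name__ == "__main__":
    fragments = ["ACGTAC", "TACGGA", "GGATTC"]  # made-up toy data
    best = evolve(fragments)
    print(best, fitness(best, fragments), merge(best, fragments))
```

A realistic assembler would also have to handle sequencing errors, reverse-complemented fragments, and approximate overlaps, all of which this sketch ignores.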

Research Organization:
Stanford Univ., CA (United States)
OSTI ID:
401855
Report Number(s):
CONF-9507246--
Country of Publication:
United States
Language:
English

Similar Records

Genetic algorithms for DNA sequence assembly
Conference · Apr 13, 1993 · OSTI ID:10155117

Genetic algorithms for DNA sequence assembly
Conference · Apr 13, 1993 · OSTI ID:6774676

Characterizing Large Text Corpora Using a Maximum Variation Sampling Genetic Algorithm
Conference · Dec 31, 2005 · OSTI ID:931452