DNA sequence assembly and genetic algorithms new results and puzzling insights
- Univ. of Central Florida, Orlando, FL (United States)
Applying genetic algorithms to DNA sequence assembly is not a straightforward process. Significantly improved results in terms of performance, quality of results, and the scaling of applicability have been realized through non-standard and even counter-intuitive parameter settings. Specifically, the solution time for a 10kb data set was reduced by an order of magnitude, and a 20kb data set that was previously unsolved by the genetic algorithm was solved in a time that represents only a linear increase from the 10kb data set. Additionally, significant progress has been made on a 35kb data set representing real biological data. A single contig solution was found for a 752 fragment subset of the data set, and a 15 contig solution was found for the full data set. This paper discusses the new results, the modifications to the previous genetic algorithm used in this study, the experimental design process by which the new results were obtained, the questions raised by these results, and some preliminary attempts to explain these results.
- Research Organization:
- Stanford Univ., CA (United States)
- OSTI ID:
- 401855
- Report Number(s):
- CONF-9507246--
- Country of Publication:
- United States
- Language:
- English
Similar Records
Genetic algorithms for DNA sequence assembly
Characterizing Large Text Corpora Using a Maximum Variation Sampling Genetic Algorithm