DOE PAGES title logo U.S. Department of Energy
Office of Scientific and Technical Information

Title: Accurate Allele Frequencies from Ultra-low Coverage Pool-Seq Samples in Evolve-and-Resequence Experiments

Journal Article · · G3

Evolve-and-resequence (E+R) experiments leverage next-generation sequencing technology to track the allele frequency dynamics of populations as they evolve. While previous work has shown that adaptive alleles can be detected by comparing frequency trajectories from many replicate populations, this power comes at the expense of high-coverage (>100x) sequencing of many pooled samples, which can be cost-prohibitive. Here, we show that accurate estimates of allele frequencies can be achieved with very shallow sequencing depths (<5x) via inference of known founder haplotypes in small genomic windows. This technique can be used to efficiently estimate frequencies for any number of bi-allelic SNPs in populations of any model organism founded with sequenced homozygous strains. Using both experimentally-pooled and simulated samples of Drosophila melanogaster, we show that haplotype inference can improve allele frequency accuracy by orders of magnitude for up to 50 generations of recombination, and is robust to moderate levels of missing data, as well as different selection regimes. Finally, we show that a simple linear model generated from these simulations can predict the accuracy of haplotype-derived allele frequencies in other model organisms and experimental designs. To make these results broadly accessible for use in E+R experiments, we introduce HAF-pipe, an open-source software tool for calculating haplotype-derived allele frequencies from raw sequencing data. Ultimately, by reducing sequencing costs without sacrificing accuracy, our method facilitates E+R designs with higher replication and resolution, and thereby, increased power to detect adaptive alleles.

Research Organization:
Lawrence Berkeley National Laboratory (LBNL), Berkeley, CA (United States)
Sponsoring Organization:
USDOE Office of Science (SC)
Grant/Contract Number:
AC02-05CH11231
OSTI ID:
1591845
Journal Information:
G3, Journal Name: G3 Journal Issue: 12 Vol. 9; ISSN 2160-1836
Publisher:
Genetics Society of AmericaCopyright Statement
Country of Publication:
United States
Language:
English

References (68)

Genome evolution and adaptation in a long-term experiment with Escherichia coli journal October 2009
Experimental evolution reveals natural selection on standing genetic variation journal January 2009
Estimating population haplotype frequencies from pooled SNP data using incomplete database information journal October 2009
forqs: forward-in-time simulation of recombination, quantitative traits and selection journal December 2013
Accurate estimation of haplotype frequency from pooled sequencing data and cost-effective identification of rare haplotype carriers by overlapping pool sequencing journal October 2014
Population-Genetic Inference from Pooled-Sequencing Data journal April 2014
Quantifying Selection Acting on a Complex Trait Using Allele Frequency Time Series Data journal November 2011
Maximum Likelihood Estimation of Frequencies of Known Haplotypes from Pooled Sequence Data journal January 2013
The Power to Detect Quantitative Trait Loci Using Resequenced, Experimentally Evolved Populations of Diploid, Sexual Organisms journal January 2014
Whole-Genome Resequencing of Experimental Populations Reveals Polygenic Basis of Egg-Size Variation in Drosophila melanogaster journal June 2015
Reconstruction of Haplotype-Blocks Selected during Experimental Evolution journal October 2016
Genomics of Parallel Experimental Evolution in Drosophila journal January 2017
Different Trajectories of Parallel Evolution During Viral Adaptation journal July 1999
Maximum-parsimony haplotype frequencies inference based on a joint constrained sparse representation of pooled DNA journal January 2013
Illumina error profiles: resolving fine-scale variation in metagenomic sequencing data journal March 2016
Empirical Validation of Pooled Whole Genome Population Re-Sequencing in Drosophila melanogaster journal July 2012
LDx: Estimation of Linkage Disequilibrium from High-Throughput Pooled Resequencing Data journal November 2012
Validation of Pooled Whole-Genome Re-Sequencing in Arabidopsis lyrata journal October 2015
Parallel Genome-Wide Fixation of Ancestral Alleles in Partially Outcrossing Experimental Populations of Caenorhabditis elegans journal July 2014
Investigating Natural Variation in Drosophila Courtship Song by the Evolve and Resequence Approach journal March 2012
Power Analysis of Artificial Selection Experiments Using Efficient Whole Genome Simulation of Quantitative Traits journal February 2015
Ancestral population reconstitution from isofemale lines as a tool for experimental evolution journal August 2016
Identification and sequencing of 59 highly polymorphic microhaplotypes for analysis of DNA mixtures journal January 2021
Combining experimental evolution with next-generation sequencing: a powerful tool to study adaptation from standing genetic variation journal October 2014
Genome evolution and adaptation in a long-term experiment with Escherichia coli journal October 2009
Genome-wide analysis of a long-term evolution experiment with Drosophila journal September 2010
Evidence of widespread selection on standing variation in Europe at height-associated SNPs journal August 2012
Experimental evolution reveals natural selection on standing genetic variation journal January 2009
Elucidating the molecular architecture of adaptation via evolve and resequence experiments journal September 2015
Estimating population haplotype frequencies from pooled SNP data using incomplete database information journal October 2009
forqs: forward-in-time simulation of recombination, quantitative traits and selection journal December 2013
Accurate estimation of haplotype frequency from pooled sequencing data and cost-effective identification of rare haplotype carriers by overlapping pool sequencing journal October 2014
Population-Genetic Inference from Pooled-Sequencing Data journal April 2014
Quantifying Selection Acting on a Complex Trait Using Allele Frequency Time Series Data journal November 2011
Maximum Likelihood Estimation of Frequencies of Known Haplotypes from Pooled Sequence Data journal January 2013
A Guide for the Design of Evolve and Resequencing Studies journal November 2013
The Power to Detect Quantitative Trait Loci Using Resequenced, Experimentally Evolved Populations of Diploid, Sexual Organisms journal January 2014
Standing Genetic Variation Drives Repeatable Experimental Evolution in Outcrossing Populations of Saccharomyces cerevisiae journal August 2014
Whole-Genome Resequencing of Experimental Populations Reveals Polygenic Basis of Egg-Size Variation in Drosophila melanogaster journal June 2015
A Thousand Fly Genomes: An Expanded Drosophila Genome Nexus journal September 2016
Reconstruction of Haplotype-Blocks Selected during Experimental Evolution journal October 2016
CeNDR, the Caenorhabditis elegans natural diversity resource journal October 2016
How does adaptation sweep through the genome? Insights from long-term selection experiments journal October 2012
Rapid seasonal evolution in innate immunity of wild Drosophila melanogaster journal January 2018
Natural variation in genome architecture among 205 Drosophila melanogaster Genetic Reference Panel lines journal April 2014
Adaptation of Drosophila to a novel laboratory environment reveals temporally heterogeneous trajectories of selected alleles: GENOMIC SIGNATURES OF ADAPTATION TO NEW ENVIRONMENT journal June 2012
Rapid Construction of Empirical RNA Fitness Landscapes journal October 2010
Different Trajectories of Parallel Evolution During Viral Adaptation journal July 1999
Maximum-parsimony haplotype frequencies inference based on a joint constrained sparse representation of pooled DNA journal January 2013
Illumina error profiles: resolving fine-scale variation in metagenomic sequencing data journal March 2016
Standing genetic variation as a major contributor to adaptation in the Virginia chicken lines selection experiment journal October 2015
The Many Landscapes of Recombination in Drosophila melanogaster journal October 2012
PoolHap: Inferring Haplotype Frequencies from Pooled Samples by Next Generation Sequencing journal January 2011
Empirical Validation of Pooled Whole Genome Population Re-Sequencing in Drosophila melanogaster journal July 2012
LDx: Estimation of Linkage Disequilibrium from High-Throughput Pooled Resequencing Data journal November 2012
Inexpensive Multiplexed Library Preparation for Megabase-Sized Genomes journal May 2015
Validation of Pooled Whole-Genome Re-Sequencing in Arabidopsis lyrata journal October 2015
Parallel Genome-Wide Fixation of Ancestral Alleles in Partially Outcrossing Experimental Populations of Caenorhabditis elegans journal July 2014
Drosophila simulans : A Species with Improved Resolution in Evolve and Resequence Studies journal May 2017
Genomic Differentiation Between Temperate and Tropical Australian Populations of Drosophila melanogaster journal November 2010
Investigating Natural Variation in Drosophila Courtship Song by the Evolve and Resequence Approach journal March 2012
Power Analysis of Artificial Selection Experiments Using Efficient Whole Genome Simulation of Quantitative Traits journal February 2015
Maximum Likelihood Estimation of Frequencies of Known Haplotypes from Pooled Sequence Data text January 2012
LDx: estimation of linkage disequilibrium from high-throughput pooled resequencing data text January 2012
forqs: Forward-in-time Simulation of Recombination, Quantitative Traits, and Selection preprint January 2013
Standing genetic variation as a major contributor to adaptation in the Virginia chicken lines selection experiment collection January 2015
Illumina error profiles: resolving fine-scale variation in metagenomic sequencing data collection January 2016
Maximum-parsimony haplotype frequencies inference based on a joint constrained sparse representation of pooled DNA text January 2013