Strainer: Software for analysis of population variation in community genomic datasets
- Univ. of California, Berkeley, CA (United States). Dept. of Bioengineering; Massachusetts Inst. of Technology (MIT), Cambridge, MA (United States). Dept. of Civil and Environmental Engineering; DOE/OSTI
- Univ. of California, Berkeley, CA (United States). Dept. of Environmental Science, Policy and Management; Massachusetts Inst. of Technology (MIT), Cambridge, MA (United States). Dept. of Civil and Environmental Engineering
- Univ. of California, Berkeley, CA (United States). Dept. of Environmental Science, Policy and Management
Background: Metagenomic analyses of microbial communities that are comprehensive enough to provide multiple samples of most loci in the genomes of the dominant organism types will also reveal patterns of genetic variation within natural populations. New bioinformatic tools will enable visualization and comprehensive analysis of this sequence variation and inference of recent evolutionary and ecological processes. Results: We have developed a software package for analysis and visualization of genetic variation in populations and reconstruction of strain variants from otherwise co-assembled sequences. Sequencing reads can be clustered by matching patterns of single nucleotide polymorphisms to generate predicted gene and protein variant sequences, identify conserved intergenic regulatory sequences, and determine the quantity and distribution of recombination events. Conclusion: The Strainer software, a first generation metagenomic bioinformatics tool, facilitates comprehension and analysis of heterogeneity intrinsic in natural communities. The program reveals the degree of clustering among closely related sequence variants and provides a rapid means to generate gene and protein sequences for functional, ecological, and evolutionary analyses.
- Research Organization:
- Univ. of California, Berkeley, CA (United States)
- Sponsoring Organization:
- National Science Foundation (NSF); USDOE Office of Science (SC), Biological and Environmental Research (BER). Biological Systems Science Division
- Grant/Contract Number:
- FG02-05ER64134
- OSTI ID:
- 1626345
- Journal Information:
- BMC Bioinformatics, Journal Name: BMC Bioinformatics Journal Issue: 1 Vol. 8; ISSN 1471-2105
- Publisher:
- BioMed CentralCopyright Statement
- Country of Publication:
- United States
- Language:
- English
Similar Records
Population Genomic Analysis of Strain Variation in Leptospirillum Group II Bacteria Involved in Acid Mine Drainage Formation
Tetranucleotide frequencies differentiate genomic boundaries and metabolic strategies across environmental microbiomes
Community-wide analysis of microbial genome sequence signatures
Journal Article
·
Mon Jul 21 20:00:00 EDT 2008
· PLoS Biology (Online)
·
OSTI ID:1627157
Tetranucleotide frequencies differentiate genomic boundaries and metabolic strategies across environmental microbiomes
Journal Article
·
Mon Aug 18 20:00:00 EDT 2025
· mSystems
·
OSTI ID:2583292
Community-wide analysis of microbial genome sequence signatures
Journal Article
·
Wed Dec 31 19:00:00 EST 2008
· GenomeBiology.com
·
OSTI ID:1626734