Defining window-boundaries for genomic analyses using smoothing spline techniques
- Univ. of California, Davis, CA (United States). Dept. Plant Sciences.
- Univ. of Wisconsin, Madison, WI (United States). Dept. of Animal Sciences and Dept. of Biostatistics and Medical Information.
- Univ. of Wisconsin, Madison, WI (United States). Dept. of Agronomy and Dept. of Energy Great Lakes Bioenergy Research Center.
- Univ. of Wisconsin, Madison, WI (United States). Dept. of Animal Sciences, Dept. of Biostatistics and Medical Information and Dept. of Dairy Science.
High-density genomic data is often analyzed by combining information over windows of adjacent markers. Interpretation of data grouped in windows versus at individual locations may increase statistical power, simplify computation, reduce sampling noise, and reduce the total number of tests performed. However, use of adjacent marker information can result in over- or under-smoothing, undesirable window boundary specifications, or highly correlated test statistics. We introduce a method for defining windows based on statistically guided breakpoints in the data, as a foundation for the analysis of multiple adjacent data points. This method involves first fitting a cubic smoothing spline to the data and then identifying the inflection points of the fitted spline, which serve as the boundaries of adjacent windows. This technique does not require prior knowledge of linkage disequilibrium, and therefore can be applied to data collected from individual or pooled sequencing experiments. Moreover, in contrast to existing methods, an arbitrary choice of window size is not necessary, since these are determined empirically and allowed to vary along the genome.
- Research Organization:
- Univ. of Wisconsin, Madison, WI (United States)
- Sponsoring Organization:
- USDOE Office of Science (SC), Biological and Environmental Research (BER)
- Grant/Contract Number:
- FC02-07ER64494
- OSTI ID:
- 1184786
- Journal Information:
- Genetics Selection Evolution (Online), Vol. 47, Issue 1; ISSN 1297-9686
- Publisher:
- BioMed CentralCopyright Statement
- Country of Publication:
- United States
- Language:
- English
Web of Science
Similar Records
Mathematical modelling for the drying method and smoothing drying rate using cubic spline for seaweed Kappaphycus Striatum variety Durian in a solar dryer
Growth curve analysis for plasma profiles using smoothing splines