Training Population Optimization for Genomic Selection in Miscanthus
Abstract
Miscanthus is a perennial grass with potential for lignocellulosic ethanol production. To ensure its utility for this purpose, breeding efforts should focus on increasing genetic diversity of the nothospecies Miscanthus × giganteus (M×g) beyond the single clone used in many programs. Germplasm from the corresponding parental species M. sinensis (Msi) and M. sacchariflorus (Msa) could theoretically be used as training sets for genomic prediction of M×g clones with optimal genomic estimated breeding values for biofuel traits. To this end, we first showed that subpopulation structure makes a substantial contribution to the genomic selection (GS) prediction accuracies within a 538-member diversity panel of predominately Msi individuals and a 598-member diversity panels of Msa individuals. We then assessed the ability of these two diversity panels to train GS models that predict breeding values in an interspecific diploid 216-member M×g F2 panel. Low and negative prediction accuracies were observed when various subsets of the two diversity panels were used to train these GS models. To overcome the drawback of having only one interspecific M×g F2 panel available, we also evaluated prediction accuracies for traits simulated in 50 simulated interspecific M×g F2 panels derived from different sets of Msi and diploid Msa parents. Themore »
- Authors:
-
more »
- Univ. of Illinois at Urbana-Champaign, IL (United States). Dept. of Crop Sciences
- Univ. of Georgia, Athens, GA (United States). Plant Genome Mapping Lab.
- Hokkaido Univ. (Japan). Research Faculty of Agriculture. Applied Plant Genome Lab.
- Hokkaido Univ. (Japan). Field Science Center for Northern Biosphere
- Colorado State Univ., Fort Collins, CO (United States). Dept. of Soil and Crop Sciences
- Konkuk Univ., Seoul (Korea, Republic of). Dept. of Applied Bioscience
- Vavilov All-Russian Inst. of Plant Genetic Resources, St. Petersburg (Russian Federation)
- Vavilov All-Russian Inst. of Plant Genetic Resources, St. Petersburg (Russian Federation)
- Univ. of Nebraska, Lincoln, NE (United States). Dept. of Biochemistry
- Konkuk Univ., Seoul (Korea, Republic of). Dept. of Applied Plant Science
- Zhejiang Univ., Hangzhou (China). Dept. of Agronomy
- China National Seed Group Co. Ltd, Wuhan (China)
- Kangwon National University, Chuncheon (Korea, Republic of). Dept. of Applied Plant Sciences
- Huazhong Agricultural Univ., Wuhan (China). College of Plant Science and Technology
- Publication Date:
- Research Org.:
- Univ. of Illinois at Urbana-Champaign, IL (United States)
- Sponsoring Org.:
- USDOE Office of Science (SC), Biological and Environmental Research (BER)
- OSTI Identifier:
- 1802902
- Grant/Contract Number:
- SC0016264
- Resource Type:
- Accepted Manuscript
- Journal Name:
- G3
- Additional Journal Information:
- Journal Volume: 10; Journal Issue: 7; Journal ID: ISSN 2160-1836
- Publisher:
- Genetics Society of America
- Country of Publication:
- United States
- Language:
- English
- Subject:
- 59 BASIC BIOLOGICAL SCIENCES; Genetics & Heredity; Miscanthus; Prediction Accuracy; Genomic selection; Population Structure; GenPred; Shared data resources
Citation Formats
Olatoye, Marcus O., Clark, Lindsay V., Labonte, Nicholas R., Dong, Hongxu, Dwiyanti, Maria S., Anzoua, Kossonou G., Brummer, Joe E., Ghimire, Bimal K., Dzyubenko, Elena, Dzyubenko, Nikolay, Bagmet, Larisa, Sabitov, Andrey, Chebukin, Pavel, Głowacka, Katarzyna, Heo, Kweon, Jin, Xiaoli, Nagano, Hironori, Peng, Junhua, Yu, Chang Y., Yoo, Ji H., Zhao, Hua, Long, Stephen P., Yamada, Toshihiko, Sacks, Erik J., and Lipka, Alexander E. Training Population Optimization for Genomic Selection in Miscanthus. United States: N. p., 2020.
Web. doi:10.1534/g3.120.401402.
Olatoye, Marcus O., Clark, Lindsay V., Labonte, Nicholas R., Dong, Hongxu, Dwiyanti, Maria S., Anzoua, Kossonou G., Brummer, Joe E., Ghimire, Bimal K., Dzyubenko, Elena, Dzyubenko, Nikolay, Bagmet, Larisa, Sabitov, Andrey, Chebukin, Pavel, Głowacka, Katarzyna, Heo, Kweon, Jin, Xiaoli, Nagano, Hironori, Peng, Junhua, Yu, Chang Y., Yoo, Ji H., Zhao, Hua, Long, Stephen P., Yamada, Toshihiko, Sacks, Erik J., & Lipka, Alexander E. Training Population Optimization for Genomic Selection in Miscanthus. United States. https://doi.org/10.1534/g3.120.401402
Olatoye, Marcus O., Clark, Lindsay V., Labonte, Nicholas R., Dong, Hongxu, Dwiyanti, Maria S., Anzoua, Kossonou G., Brummer, Joe E., Ghimire, Bimal K., Dzyubenko, Elena, Dzyubenko, Nikolay, Bagmet, Larisa, Sabitov, Andrey, Chebukin, Pavel, Głowacka, Katarzyna, Heo, Kweon, Jin, Xiaoli, Nagano, Hironori, Peng, Junhua, Yu, Chang Y., Yoo, Ji H., Zhao, Hua, Long, Stephen P., Yamada, Toshihiko, Sacks, Erik J., and Lipka, Alexander E. Wed .
"Training Population Optimization for Genomic Selection in Miscanthus". United States. https://doi.org/10.1534/g3.120.401402. https://www.osti.gov/servlets/purl/1802902.
@article{osti_1802902,
title = {Training Population Optimization for Genomic Selection in Miscanthus},
author = {Olatoye, Marcus O. and Clark, Lindsay V. and Labonte, Nicholas R. and Dong, Hongxu and Dwiyanti, Maria S. and Anzoua, Kossonou G. and Brummer, Joe E. and Ghimire, Bimal K. and Dzyubenko, Elena and Dzyubenko, Nikolay and Bagmet, Larisa and Sabitov, Andrey and Chebukin, Pavel and Głowacka, Katarzyna and Heo, Kweon and Jin, Xiaoli and Nagano, Hironori and Peng, Junhua and Yu, Chang Y. and Yoo, Ji H. and Zhao, Hua and Long, Stephen P. and Yamada, Toshihiko and Sacks, Erik J. and Lipka, Alexander E.},
abstractNote = {Miscanthus is a perennial grass with potential for lignocellulosic ethanol production. To ensure its utility for this purpose, breeding efforts should focus on increasing genetic diversity of the nothospecies Miscanthus × giganteus (M×g) beyond the single clone used in many programs. Germplasm from the corresponding parental species M. sinensis (Msi) and M. sacchariflorus (Msa) could theoretically be used as training sets for genomic prediction of M×g clones with optimal genomic estimated breeding values for biofuel traits. To this end, we first showed that subpopulation structure makes a substantial contribution to the genomic selection (GS) prediction accuracies within a 538-member diversity panel of predominately Msi individuals and a 598-member diversity panels of Msa individuals. We then assessed the ability of these two diversity panels to train GS models that predict breeding values in an interspecific diploid 216-member M×g F2 panel. Low and negative prediction accuracies were observed when various subsets of the two diversity panels were used to train these GS models. To overcome the drawback of having only one interspecific M×g F2 panel available, we also evaluated prediction accuracies for traits simulated in 50 simulated interspecific M×g F2 panels derived from different sets of Msi and diploid Msa parents. The results revealed that genetic architectures with common causal mutations across Msi and Msa yielded the highest prediction accuracies. Ultimately, these results suggest that the ideal training set should contain the same causal mutations segregating within interspecific M×g populations, and thus efforts should be undertaken to ensure that individuals in the training and validation sets are as closely related as possible.},
doi = {10.1534/g3.120.401402},
journal = {G3},
number = 7,
volume = 10,
place = {United States},
year = {Wed Jul 01 00:00:00 EDT 2020},
month = {Wed Jul 01 00:00:00 EDT 2020}
}
Works referenced in this record:
Adding Genetically Distant Individuals to Training Populations Reduces Genomic Prediction Accuracy in Barley
journal, November 2015
- Lorenz, Aaron J.; Smith, Kevin P.
- Crop Science, Vol. 55, Issue 6
Increasing Genomic‐Enabled Prediction Accuracy by Modeling Genotype × Environment Interactions in Kansas Wheat
journal, July 2017
- Jarquín, Diego; Lemes da Silva, Cristiano; Gaynor, R. Chris
- The Plant Genome, Vol. 10, Issue 2
Predicting genomic selection efficiency to optimize calibration set and to assess prediction accuracy in highly structured populations
journal, August 2017
- Rincent, R.; Charcosset, A.; Moreau, L.
- Theoretical and Applied Genetics, Vol. 130, Issue 11
TASSEL-GBS: A High Capacity Genotyping by Sequencing Analysis Pipeline
journal, February 2014
- Glaubitz, Jeffrey C.; Casstevens, Terry M.; Lu, Fei
- PLoS ONE, Vol. 9, Issue 2
Genome‐wide association and genomic prediction for biomass yield in a genetically diverse Miscanthus sinensis germplasm panel phenotyped at five locations in Asia and North America
journal, May 2019
- Clark, Lindsay V.; Dwiyanti, Maria S.; Anzoua, Kossonou G.
- GCB Bioenergy
Winter hardiness of Miscanthus (II): Genetic mapping for overwintering ability and adaptation traits in three interconnected Miscanthus populations
journal, January 2019
- Dong, Hongxu; Liu, Siyao; Clark, Lindsay V.
- GCB Bioenergy, Vol. 11, Issue 5
Multibreed genomic evaluations using purebred Holsteins, Jerseys, and Brown Swiss
journal, September 2012
- Olson, K. M.; VanRaden, P. M.; Tooker, M. E.
- Journal of Dairy Science, Vol. 95, Issue 9
Empirical Comparison of Tropical Maize Hybrids Selected Through Genomic and Phenotypic Selections
journal, November 2019
- Beyene, Yoseph; Gowda, Manje; Olsen, Michael
- Frontiers in Plant Science, Vol. 10
Genomic Prediction of Gene Bank Wheat Landraces
journal, July 2016
- Crossa, José; Jarquín, Diego; Franco, Jorge
- G3 Genes|Genomes|Genetics, Vol. 6, Issue 7
Genomic Selection for Predicting Head Blight Resistance in a Wheat Breeding Program
journal, January 2015
- Arruda, Marcio P.; Brown, Patrick J.; Lipka, Alexander E.
- The Plant Genome, Vol. 8, Issue 3
A footprint of past climate change on the diversity and population structure of Miscanthus sinensis
journal, June 2014
- Clark, Lindsay V.; Brummer, Joe E.; Głowacka, Katarzyna
- Annals of Botany, Vol. 114, Issue 1
The impact of population structure on genomic prediction in stratified populations
journal, January 2014
- Guo, Zhigang; Tucker, Dominic M.; Basten, Christopher J.
- Theoretical and Applied Genetics, Vol. 127, Issue 3
Genomic prediction in early selection stages using multi-year data in a hybrid rye breeding program
journal, May 2017
- Bernal-Vasquez, Angela-Maria; Gordillo, Andres; Schmidt, Malthe
- BMC Genetics, Vol. 18, Issue 1
Principal components analysis corrects for stratification in genome-wide association studies
journal, July 2006
- Price, Alkes L.; Patterson, Nick J.; Plenge, Robert M.
- Nature Genetics, Vol. 38, Issue 8
polyRAD: Genotype Calling with Uncertainty from Sequencing Data in Polyploids and Diploids
journal, March 2019
- Clark, Lindsay V.; Lipka, Alexander E.; Sacks, Erik J.
- G3 Genes|Genomes|Genetics, Vol. 9, Issue 3
Ridge Regression: Biased Estimation for Nonorthogonal Problems
journal, February 1970
- Hoerl, Arthur E.; Kennard, Robert W.
- Technometrics, Vol. 12, Issue 1
Ridge Regression and Other Kernels for Genomic Selection with R Package rrBLUP
journal, January 2011
- Endelman, Jeffrey B.
- The Plant Genome Journal, Vol. 4, Issue 3
Meeting US biofuel goals with less land: the potential of Miscanthus
journal, September 2008
- Heaton, Emily A.; Dohleman, Frank G.; Long, Stephen P.
- Global Change Biology, Vol. 14, Issue 9
The land use–climate change–energy nexus
journal, May 2011
- Dale, Virginia H.; Efroymson, Rebecca A.; Kline, Keith L.
- Landscape Ecology, Vol. 26, Issue 6
Genetic mapping of biomass yield in three interconnected Miscanthus populations
journal, August 2017
- Dong, Hongxu; Liu, Siyao; Clark, Lindsay V.
- GCB Bioenergy, Vol. 10, Issue 3
Genome-wide association studies and prediction of 17 traits related to phenology, biomass and cell wall composition in the energy grass Miscanthus sinensis
journal, December 2013
- Slavov, Gancho T.; Nipper, Rick; Robson, Paul
- New Phytologist, Vol. 201, Issue 4
Marker-assisted selection using ridge regression
journal, April 2000
- Whittaker, John C.; Thompson, Robin; Denham, Mike C.
- Genetical Research, Vol. 75, Issue 2
The use of dna sequencing (ITS and trnL-F), AFLP, and fluorescent in situ hybridization to study allopolyploid Miscanthus (Poaceae)
journal, February 2002
- Hodkinson, T. R.; Chase, M. W.; Takahashi, C.
- American Journal of Botany, Vol. 89, Issue 2
Evaluation of RR-BLUP Genomic Selection Models that Incorporate Peak Genome-Wide Association Study Signals in Maize and Sorghum
journal, January 2019
- Rice, Brian; Lipka, Alexander E.
- The Plant Genome, Vol. 12, Issue 1
Genomic prediction with multiple biparental families
journal, October 2019
- Brauner, Pedro C.; Müller, Dominik; Molenaar, Willem S.
- Theoretical and Applied Genetics, Vol. 133, Issue 1
The Effect of Linkage Disequilibrium and Family Relationships on the Reliability of Genomic Prediction
journal, February 2013
- Wientjes, Yvonne C. J.; Veerkamp, Roel F.; Calus, Mario P. L.
- Genetics, Vol. 193, Issue 2
Precision and information in linear models of genetic evaluation
journal, January 1993
- Laloë, D.
- Genetics Selection Evolution, Vol. 25, Issue 6
Evaluation of genomic selection and marker-assisted selection in Miscanthus and energycane
journal, December 2019
- Olatoye, Marcus O.; Clark, Lindsay V.; Wang, Jianping
- Molecular Breeding, Vol. 39, Issue 12
Assessment of Genetic Heterogeneity in Structured Plant Populations Using Multivariate Whole-Genome Regression Models
journal, June 2015
- Lehermeier, Christina; Schön, Chris-Carolin; de los Campos, Gustavo
- Genetics, Vol. 201, Issue 1
Genetic relationships between spring emergence, canopy phenology, and biomass yield increase the accuracy of genomic prediction in Miscanthus
journal, October 2017
- Davey, Christopher L.; Robson, Paul; Hawkins, Sarah
- Journal of Experimental Botany, Vol. 68, Issue 18
Incorporating Genetic Heterogeneity in Whole-Genome Regressions Using Interactions
journal, November 2015
- de los Campos, Gustavo; Veturi, Yogasudha; Vazquez, Ana I.
- Journal of Agricultural, Biological, and Environmental Statistics, Vol. 20, Issue 4
The importance of information on relatives for the prediction of genomic breeding values and the implications for the makeup of reference data sets in livestock breeding schemes
journal, February 2012
- Clark, Samuel A.; Hickey, John M.; Daetwyler, Hans D.
- Genetics Selection Evolution, Vol. 44, Issue 1
Maximizing the Reliability of Genomic Selection by Optimizing the Calibration Set of Reference Individuals: Comparison of Methods in Two Diverse Groups of Maize Inbreds ( Zea mays L.)
journal, August 2012
- Rincent, R.; Laloë, D.; Nicolas, S.
- Genetics, Vol. 192, Issue 2
Breeding progress and preparedness for mass-scale deployment of perennial lignocellulosic biomass crops switchgrass, miscanthus, willow and poplar
journal, October 2018
- Clifton-Brown, John; Harfouche, Antoine; Casler, Michael D.
- GCB Bioenergy, Vol. 11, Issue 1
Training set optimization under population structure in genomic selection
journal, November 2014
- Isidro, Julio; Jannink, Jean-Luc; Akdemir, Deniz
- Theoretical and Applied Genetics, Vol. 128, Issue 1
Fast gapped-read alignment with Bowtie 2
journal, March 2012
- Langmead, Ben; Salzberg, Steven L.
- Nature Methods, Vol. 9, Issue 4
Biomass yield in a genetically diverse Miscanthus sinensis germplasm panel evaluated at five locations revealed individuals with exceptional potential
journal, March 2019
- Clark, Lindsay V.; Dwiyanti, Maria S.; Anzoua, Kossonou G.
- GCB Bioenergy, Vol. 11, Issue 10
Accuracy and Training Population Design for Genomic Selection on Quantitative Traits in Elite North American Oats
journal, July 2011
- Asoro, Franco G.; Newell, Mark A.; Beavis, William D.
- The Plant Genome, Vol. 4, Issue 2
Optimal Designs for Genomic Selection in Hybrid Crops
journal, March 2019
- Guo, Tingting; Yu, Xiaoqing; Li, Xianran
- Molecular Plant, Vol. 12, Issue 3
Population structure of Miscanthus sacchariflorus reveals two major polyploidization events, tetraploid-mediated unidirectional introgression from diploid M. sinensis, and diversity centred around the Yellow Sea
journal, September 2018
- Clark, Lindsay V.; Jin, Xiaoli; Petersen, Karen Koefoed
- Annals of Botany, Vol. 124, Issue 4
Utility of whole-genome sequence data for across-breed genomic prediction
journal, May 2018
- Raymond, Biaty; Bouwman, Aniek C.; Schrooten, Chris
- Genetics Selection Evolution, Vol. 50, Issue 1
The Gene Pool of Miscanthus Species and Its Improvement
book, July 2012
- Sacks, Erik J.; Juvik, John A.; Lin, Qi
- Genomics of the Saccharinae
TagDigger: user-friendly extraction of read counts from GBS and RAD-seq data
journal, July 2016
- Clark, Lindsay V.; Sacks, Erik J.
- Source Code for Biology and Medicine, Vol. 11, Issue 1
Accuracy of genotypic value predictions for marker-based selection in biparental plant populations
journal, October 2009
- Lorenzana, Robenzon E.; Bernardo, Rex
- Theoretical and Applied Genetics, Vol. 120, Issue 1
Cold-Tolerance of Seedlings and Effects of Spring and Autumn Frosts on Mature Clonally Replicated Cultivars
journal, January 2015
- Kaiser, Christopher M.; Sacks, Erik J.
- Crop Science, Vol. 55, Issue 5
Empirical and deterministic accuracies of across-population genomic prediction
journal, January 2015
- Wientjes, Yvonne; Veerkamp, Roel F.; Bijma, Piter
- Genetics Selection Evolution, Vol. 47, Issue 1
Prediction of Total Genetic Value Using Genome-Wide Dense Marker Maps
journal, April 2001
- Meuwissen, T. H. E.; Hayes, B. J.; Goddard, M. E.
- Genetics, Vol. 157, Issue 4