DOE PAGES title logo U.S. Department of Energy
Office of Scientific and Technical Information

Title: Training Population Optimization for Genomic Selection in Miscanthus

Abstract

Miscanthus is a perennial grass with potential for lignocellulosic ethanol production. To ensure its utility for this purpose, breeding efforts should focus on increasing genetic diversity of the nothospecies Miscanthus × giganteus (M×g) beyond the single clone used in many programs. Germplasm from the corresponding parental species M. sinensis (Msi) and M. sacchariflorus (Msa) could theoretically be used as training sets for genomic prediction of M×g clones with optimal genomic estimated breeding values for biofuel traits. To this end, we first showed that subpopulation structure makes a substantial contribution to the genomic selection (GS) prediction accuracies within a 538-member diversity panel of predominately Msi individuals and a 598-member diversity panels of Msa individuals. We then assessed the ability of these two diversity panels to train GS models that predict breeding values in an interspecific diploid 216-member M×g F2 panel. Low and negative prediction accuracies were observed when various subsets of the two diversity panels were used to train these GS models. To overcome the drawback of having only one interspecific M×g F2 panel available, we also evaluated prediction accuracies for traits simulated in 50 simulated interspecific M×g F2 panels derived from different sets of Msi and diploid Msa parents. Themore » results revealed that genetic architectures with common causal mutations across Msi and Msa yielded the highest prediction accuracies. Ultimately, these results suggest that the ideal training set should contain the same causal mutations segregating within interspecific M×g populations, and thus efforts should be undertaken to ensure that individuals in the training and validation sets are as closely related as possible.« less

Authors:
ORCiD logo [1]; ORCiD logo [1];  [1];  [2];  [3];  [4];  [5];  [6];  [7];  [8];  [8];  [8];  [8];  [9];  [10]; ORCiD logo [11];  [4];  [12];  [13]; ORCiD logo [13] more »;  [14];  [1];  [4];  [1];  [1] « less
  1. Univ. of Illinois at Urbana-Champaign, IL (United States). Dept. of Crop Sciences
  2. Univ. of Georgia, Athens, GA (United States). Plant Genome Mapping Lab.
  3. Hokkaido Univ. (Japan). Research Faculty of Agriculture. Applied Plant Genome Lab.
  4. Hokkaido Univ. (Japan). Field Science Center for Northern Biosphere
  5. Colorado State Univ., Fort Collins, CO (United States). Dept. of Soil and Crop Sciences
  6. Konkuk Univ., Seoul (Korea, Republic of). Dept. of Applied Bioscience
  7. Vavilov All-Russian Inst. of Plant Genetic Resources, St. Petersburg (Russian Federation)
  8. Vavilov All-Russian Inst. of Plant Genetic Resources, St. Petersburg (Russian Federation)
  9. Univ. of Nebraska, Lincoln, NE (United States). Dept. of Biochemistry
  10. Konkuk Univ., Seoul (Korea, Republic of). Dept. of Applied Plant Science
  11. Zhejiang Univ., Hangzhou (China). Dept. of Agronomy
  12. China National Seed Group Co. Ltd, Wuhan (China)
  13. Kangwon National University, Chuncheon (Korea, Republic of). Dept. of Applied Plant Sciences
  14. Huazhong Agricultural Univ., Wuhan (China). College of Plant Science and Technology
Publication Date:
Research Org.:
Univ. of Illinois at Urbana-Champaign, IL (United States)
Sponsoring Org.:
USDOE Office of Science (SC), Biological and Environmental Research (BER)
OSTI Identifier:
1802902
Grant/Contract Number:  
SC0016264
Resource Type:
Accepted Manuscript
Journal Name:
G3
Additional Journal Information:
Journal Volume: 10; Journal Issue: 7; Journal ID: ISSN 2160-1836
Publisher:
Genetics Society of America
Country of Publication:
United States
Language:
English
Subject:
59 BASIC BIOLOGICAL SCIENCES; Genetics & Heredity; Miscanthus; Prediction Accuracy; Genomic selection; Population Structure; GenPred; Shared data resources

Citation Formats

Olatoye, Marcus O., Clark, Lindsay V., Labonte, Nicholas R., Dong, Hongxu, Dwiyanti, Maria S., Anzoua, Kossonou G., Brummer, Joe E., Ghimire, Bimal K., Dzyubenko, Elena, Dzyubenko, Nikolay, Bagmet, Larisa, Sabitov, Andrey, Chebukin, Pavel, Głowacka, Katarzyna, Heo, Kweon, Jin, Xiaoli, Nagano, Hironori, Peng, Junhua, Yu, Chang Y., Yoo, Ji H., Zhao, Hua, Long, Stephen P., Yamada, Toshihiko, Sacks, Erik J., and Lipka, Alexander E. Training Population Optimization for Genomic Selection in Miscanthus. United States: N. p., 2020. Web. doi:10.1534/g3.120.401402.
Olatoye, Marcus O., Clark, Lindsay V., Labonte, Nicholas R., Dong, Hongxu, Dwiyanti, Maria S., Anzoua, Kossonou G., Brummer, Joe E., Ghimire, Bimal K., Dzyubenko, Elena, Dzyubenko, Nikolay, Bagmet, Larisa, Sabitov, Andrey, Chebukin, Pavel, Głowacka, Katarzyna, Heo, Kweon, Jin, Xiaoli, Nagano, Hironori, Peng, Junhua, Yu, Chang Y., Yoo, Ji H., Zhao, Hua, Long, Stephen P., Yamada, Toshihiko, Sacks, Erik J., & Lipka, Alexander E. Training Population Optimization for Genomic Selection in Miscanthus. United States. https://doi.org/10.1534/g3.120.401402
Olatoye, Marcus O., Clark, Lindsay V., Labonte, Nicholas R., Dong, Hongxu, Dwiyanti, Maria S., Anzoua, Kossonou G., Brummer, Joe E., Ghimire, Bimal K., Dzyubenko, Elena, Dzyubenko, Nikolay, Bagmet, Larisa, Sabitov, Andrey, Chebukin, Pavel, Głowacka, Katarzyna, Heo, Kweon, Jin, Xiaoli, Nagano, Hironori, Peng, Junhua, Yu, Chang Y., Yoo, Ji H., Zhao, Hua, Long, Stephen P., Yamada, Toshihiko, Sacks, Erik J., and Lipka, Alexander E. Wed . "Training Population Optimization for Genomic Selection in Miscanthus". United States. https://doi.org/10.1534/g3.120.401402. https://www.osti.gov/servlets/purl/1802902.
@article{osti_1802902,
title = {Training Population Optimization for Genomic Selection in Miscanthus},
author = {Olatoye, Marcus O. and Clark, Lindsay V. and Labonte, Nicholas R. and Dong, Hongxu and Dwiyanti, Maria S. and Anzoua, Kossonou G. and Brummer, Joe E. and Ghimire, Bimal K. and Dzyubenko, Elena and Dzyubenko, Nikolay and Bagmet, Larisa and Sabitov, Andrey and Chebukin, Pavel and Głowacka, Katarzyna and Heo, Kweon and Jin, Xiaoli and Nagano, Hironori and Peng, Junhua and Yu, Chang Y. and Yoo, Ji H. and Zhao, Hua and Long, Stephen P. and Yamada, Toshihiko and Sacks, Erik J. and Lipka, Alexander E.},
abstractNote = {Miscanthus is a perennial grass with potential for lignocellulosic ethanol production. To ensure its utility for this purpose, breeding efforts should focus on increasing genetic diversity of the nothospecies Miscanthus × giganteus (M×g) beyond the single clone used in many programs. Germplasm from the corresponding parental species M. sinensis (Msi) and M. sacchariflorus (Msa) could theoretically be used as training sets for genomic prediction of M×g clones with optimal genomic estimated breeding values for biofuel traits. To this end, we first showed that subpopulation structure makes a substantial contribution to the genomic selection (GS) prediction accuracies within a 538-member diversity panel of predominately Msi individuals and a 598-member diversity panels of Msa individuals. We then assessed the ability of these two diversity panels to train GS models that predict breeding values in an interspecific diploid 216-member M×g F2 panel. Low and negative prediction accuracies were observed when various subsets of the two diversity panels were used to train these GS models. To overcome the drawback of having only one interspecific M×g F2 panel available, we also evaluated prediction accuracies for traits simulated in 50 simulated interspecific M×g F2 panels derived from different sets of Msi and diploid Msa parents. The results revealed that genetic architectures with common causal mutations across Msi and Msa yielded the highest prediction accuracies. Ultimately, these results suggest that the ideal training set should contain the same causal mutations segregating within interspecific M×g populations, and thus efforts should be undertaken to ensure that individuals in the training and validation sets are as closely related as possible.},
doi = {10.1534/g3.120.401402},
journal = {G3},
number = 7,
volume = 10,
place = {United States},
year = {Wed Jul 01 00:00:00 EDT 2020},
month = {Wed Jul 01 00:00:00 EDT 2020}
}

Works referenced in this record:

Adding Genetically Distant Individuals to Training Populations Reduces Genomic Prediction Accuracy in Barley
journal, November 2015


Increasing Genomic‐Enabled Prediction Accuracy by Modeling Genotype × Environment Interactions in Kansas Wheat
journal, July 2017


Predicting genomic selection efficiency to optimize calibration set and to assess prediction accuracy in highly structured populations
journal, August 2017

  • Rincent, R.; Charcosset, A.; Moreau, L.
  • Theoretical and Applied Genetics, Vol. 130, Issue 11
  • DOI: 10.1007/s00122-017-2956-7

TASSEL-GBS: A High Capacity Genotyping by Sequencing Analysis Pipeline
journal, February 2014


Multibreed genomic evaluations using purebred Holsteins, Jerseys, and Brown Swiss
journal, September 2012

  • Olson, K. M.; VanRaden, P. M.; Tooker, M. E.
  • Journal of Dairy Science, Vol. 95, Issue 9
  • DOI: 10.3168/jds.2011-5006

Empirical Comparison of Tropical Maize Hybrids Selected Through Genomic and Phenotypic Selections
journal, November 2019


Genomic Prediction of Gene Bank Wheat Landraces
journal, July 2016

  • Crossa, José; Jarquín, Diego; Franco, Jorge
  • G3 Genes|Genomes|Genetics, Vol. 6, Issue 7
  • DOI: 10.1534/g3.116.029637

Genomic Selection for Predicting Head Blight Resistance in a Wheat Breeding Program
journal, January 2015


A footprint of past climate change on the diversity and population structure of Miscanthus sinensis
journal, June 2014

  • Clark, Lindsay V.; Brummer, Joe E.; Głowacka, Katarzyna
  • Annals of Botany, Vol. 114, Issue 1
  • DOI: 10.1093/aob/mcu084

The impact of population structure on genomic prediction in stratified populations
journal, January 2014

  • Guo, Zhigang; Tucker, Dominic M.; Basten, Christopher J.
  • Theoretical and Applied Genetics, Vol. 127, Issue 3
  • DOI: 10.1007/s00122-013-2255-x

Genomic prediction in early selection stages using multi-year data in a hybrid rye breeding program
journal, May 2017

  • Bernal-Vasquez, Angela-Maria; Gordillo, Andres; Schmidt, Malthe
  • BMC Genetics, Vol. 18, Issue 1
  • DOI: 10.1186/s12863-017-0512-8

Principal components analysis corrects for stratification in genome-wide association studies
journal, July 2006

  • Price, Alkes L.; Patterson, Nick J.; Plenge, Robert M.
  • Nature Genetics, Vol. 38, Issue 8
  • DOI: 10.1038/ng1847

polyRAD: Genotype Calling with Uncertainty from Sequencing Data in Polyploids and Diploids
journal, March 2019

  • Clark, Lindsay V.; Lipka, Alexander E.; Sacks, Erik J.
  • G3 Genes|Genomes|Genetics, Vol. 9, Issue 3
  • DOI: 10.1534/g3.118.200913

Ridge Regression: Biased Estimation for Nonorthogonal Problems
journal, February 1970


Ridge Regression and Other Kernels for Genomic Selection with R Package rrBLUP
journal, January 2011


Meeting US biofuel goals with less land: the potential of Miscanthus
journal, September 2008


The land use–climate change–energy nexus
journal, May 2011

  • Dale, Virginia H.; Efroymson, Rebecca A.; Kline, Keith L.
  • Landscape Ecology, Vol. 26, Issue 6
  • DOI: 10.1007/s10980-011-9606-2

Genetic mapping of biomass yield in three interconnected Miscanthus populations
journal, August 2017

  • Dong, Hongxu; Liu, Siyao; Clark, Lindsay V.
  • GCB Bioenergy, Vol. 10, Issue 3
  • DOI: 10.1111/gcbb.12472

Marker-assisted selection using ridge regression
journal, April 2000


The use of dna sequencing (ITS and trnL-F), AFLP, and fluorescent in situ hybridization to study allopolyploid Miscanthus (Poaceae)
journal, February 2002

  • Hodkinson, T. R.; Chase, M. W.; Takahashi, C.
  • American Journal of Botany, Vol. 89, Issue 2
  • DOI: 10.3732/ajb.89.2.279

Genomic prediction with multiple biparental families
journal, October 2019

  • Brauner, Pedro C.; Müller, Dominik; Molenaar, Willem S.
  • Theoretical and Applied Genetics, Vol. 133, Issue 1
  • DOI: 10.1007/s00122-019-03445-7

The Effect of Linkage Disequilibrium and Family Relationships on the Reliability of Genomic Prediction
journal, February 2013


Precision and information in linear models of genetic evaluation
journal, January 1993


Evaluation of genomic selection and marker-assisted selection in Miscanthus and energycane
journal, December 2019


Assessment of Genetic Heterogeneity in Structured Plant Populations Using Multivariate Whole-Genome Regression Models
journal, June 2015


Genetic relationships between spring emergence, canopy phenology, and biomass yield increase the accuracy of genomic prediction in Miscanthus
journal, October 2017

  • Davey, Christopher L.; Robson, Paul; Hawkins, Sarah
  • Journal of Experimental Botany, Vol. 68, Issue 18
  • DOI: 10.1093/jxb/erx339

Incorporating Genetic Heterogeneity in Whole-Genome Regressions Using Interactions
journal, November 2015

  • de los Campos, Gustavo; Veturi, Yogasudha; Vazquez, Ana I.
  • Journal of Agricultural, Biological, and Environmental Statistics, Vol. 20, Issue 4
  • DOI: 10.1007/s13253-015-0222-5

The importance of information on relatives for the prediction of genomic breeding values and the implications for the makeup of reference data sets in livestock breeding schemes
journal, February 2012

  • Clark, Samuel A.; Hickey, John M.; Daetwyler, Hans D.
  • Genetics Selection Evolution, Vol. 44, Issue 1
  • DOI: 10.1186/1297-9686-44-4

Breeding progress and preparedness for mass-scale deployment of perennial lignocellulosic biomass crops switchgrass, miscanthus, willow and poplar
journal, October 2018

  • Clifton-Brown, John; Harfouche, Antoine; Casler, Michael D.
  • GCB Bioenergy, Vol. 11, Issue 1
  • DOI: 10.1111/gcbb.12566

Training set optimization under population structure in genomic selection
journal, November 2014

  • Isidro, Julio; Jannink, Jean-Luc; Akdemir, Deniz
  • Theoretical and Applied Genetics, Vol. 128, Issue 1
  • DOI: 10.1007/s00122-014-2418-4

Fast gapped-read alignment with Bowtie 2
journal, March 2012

  • Langmead, Ben; Salzberg, Steven L.
  • Nature Methods, Vol. 9, Issue 4
  • DOI: 10.1038/nmeth.1923

Biomass yield in a genetically diverse Miscanthus sinensis germplasm panel evaluated at five locations revealed individuals with exceptional potential
journal, March 2019

  • Clark, Lindsay V.; Dwiyanti, Maria S.; Anzoua, Kossonou G.
  • GCB Bioenergy, Vol. 11, Issue 10
  • DOI: 10.1111/gcbb.12606

Accuracy and Training Population Design for Genomic Selection on Quantitative Traits in Elite North American Oats
journal, July 2011


Optimal Designs for Genomic Selection in Hybrid Crops
journal, March 2019


Utility of whole-genome sequence data for across-breed genomic prediction
journal, May 2018

  • Raymond, Biaty; Bouwman, Aniek C.; Schrooten, Chris
  • Genetics Selection Evolution, Vol. 50, Issue 1
  • DOI: 10.1186/s12711-018-0396-8

The Gene Pool of Miscanthus Species and Its Improvement
book, July 2012


TagDigger: user-friendly extraction of read counts from GBS and RAD-seq data
journal, July 2016


Accuracy of genotypic value predictions for marker-based selection in biparental plant populations
journal, October 2009


Cold-Tolerance of Seedlings and Effects of Spring and Autumn Frosts on Mature Clonally Replicated Cultivars
journal, January 2015


Empirical and deterministic accuracies of across-population genomic prediction
journal, January 2015

  • Wientjes, Yvonne; Veerkamp, Roel F.; Bijma, Piter
  • Genetics Selection Evolution, Vol. 47, Issue 1
  • DOI: 10.1186/s12711-014-0086-0

Prediction of Total Genetic Value Using Genome-Wide Dense Marker Maps
journal, April 2001