Data from: Multiple maize reference genomes impact the identification of variants by GWAS in a diverse inbred panel
Abstract
Use of a single reference genome for genome-wide association studies (GWAS) limits the gene space represented to that of a single accession. This limitation can complicate identification and characterization of genes located within presence/absence variations (PAVs). In this study, we present the draft de novo genome assembly of PHJ89, an Oh43-type inbred line. Using three separate reference genome assemblies (B73, PH207, and PHJ89) that represent the predominant germplasm groups of maize, we generated three separate whole-seedling gene expression profile and single nucleotide polymorphism (SNP) matrices from a panel of 942 diverse inbred lines. We identified 34,447 (B73), 39,672 (PH207), and 37,436 (PHJ89) transcripts that are not present in the respective reference genome assembly. GWAS was conducted in the 942 inbred panel using both the SNP and expression data values to map sugarcane mosaic virus (SCMV) resistance. Highlighting the impact of alternative reference genomes in gene discovery, GWAS results for SCMV resistance using expression values as a surrogate measure of PAV resulted in robust detection of the physical location of a known resistance gene when using the B73 reference that contains the gene, but not when using the PH207 reference. This study provides the valuable resource of the Oh43-type PHJ89 genome assembly as well as SNP and expression data for 942 individuals generated using three different reference genomes.
- Authors:
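- Gage, Joseph L.; Vaillancourt, Brieanne; Hamilton, John P.; Manrique-Carpintero, Norma C.; Gustafson, Timothy J.; Barry, Kerrie; Lipzen, Anna; Tracy, William F.; Mikel, Mark A.; Kaeppler, Shawn M.; Buell, C. Robin; de Leon, Natalia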
- Univ. of Wisconsin, Madison, WI (United States)
- Michigan State Univ., East Lansing, MI (United States)
- Monsanto Company, DeForest, WI (United States)
- USDOE Joint Genome Institute (JGI), Walnut Creek, CA (United States)
- Univ. of Wisconsin, Madison, WI (United States)
- Univ. of Illinois at Urbana-Champaign, IL (United States)
- Publication Date:
- 2019-02-13
- DOE Contract Number:
- FC02-07ER64494
- Research Org.:
- Great Lakes Bioenergy Research Center, Madison, WI (United States); Univ. of Wisconsin, Madison, WI (United States)
- Sponsoring Org.:
- USDOE Office of Science (SC), Biological and Environmental Research (BER)
- Subject:
- 59 BASIC BIOLOGICAL SCIENCES
- OSTI Identifier:
- 1874035
- DOI:
- https://doi.org/10.5061/dryad.dk22g4h
Citation Formats
Gage, Joseph L., Vaillancourt, Brieanne, Hamilton, John P., Manrique-Carpintero, Norma C., Gustafson, Timothy J., Barry, Kerrie, Lipzen, Anna, Tracy, William F., Mikel, Mark A., Kaeppler, Shawn M., Buell, C. Robin, and de Leon, Natalia. Data from: Multiple maize reference genomes impact the identification of variants by GWAS in a diverse inbred panel. United States: N. p., 2019. Web. doi:10.5061/dryad.dk22g4h.
Gage, Joseph L., Vaillancourt, Brieanne, Hamilton, John P., Manrique-Carpintero, Norma C., Gustafson, Timothy J., Barry, Kerrie, Lipzen, Anna, Tracy, William F., Mikel, Mark A., Kaeppler, Shawn M., Buell, C. Robin, & de Leon, Natalia. Data from: Multiple maize reference genomes impact the identification of variants by GWAS in a diverse inbred panel. United States. https://doi.org/10.5061/dryad.dk22g4h
Gage, Joseph L., Vaillancourt, Brieanne, Hamilton, John P., Manrique-Carpintero, Norma C., Gustafson, Timothy J., Barry, Kerrie, Lipzen, Anna, Tracy, William F., Mikel, Mark A., Kaeppler, Shawn M., Buell, C. Robin, and de Leon, Natalia. 2019. "Data from: Multiple maize reference genomes impact the identification of variants by GWAS in a diverse inbred panel." United States. https://doi.org/10.5061/dryad.dk22g4h. https://www.osti.gov/servlets/purl/1874035. Pub date: 2019-02-13.
@misc{osti_1874035,
title = {Data from: Multiple maize reference genomes impact the identification of variants by GWAS in a diverse inbred panel},
author = {Gage, Joseph L. and Vaillancourt, Brieanne and Hamilton, John P. and Manrique-Carpintero, Norma C. and Gustafson, Timothy J. and Barry, Kerrie and Lipzen, Anna and Tracy, William F. and Mikel, Mark A. and Kaeppler, Shawn M. and Buell, C. Robin and de Leon, Natalia},
abstractNote = {Use of a single reference genome for genome-wide association studies (GWAS) limits the gene space represented to that of a single accession. This limitation can complicate identification and characterization of genes located within presence/absence variations (PAVs). In this study, we present the draft de novo genome assembly of PHJ89, an Oh43-type inbred line. Using three separate reference genome assemblies (B73, PH207, and PHJ89) that represent the predominant germplasm groups of maize, we generated three separate whole-seedling gene expression profile and single nucleotide polymorphism (SNP) matrices from a panel of 942 diverse inbred lines. We identified 34,447 (B73), 39,672 (PH207), and 37,436 (PHJ89) transcripts that are not present in the respective reference genome assembly. GWAS was conducted in the 942 inbred panel using both the SNP and expression data values to map sugarcane mosaic virus (SCMV) resistance. Highlighting the impact of alternative reference genomes in gene discovery, GWAS results for SCMV resistance using expression values as a surrogate measure of PAV resulted in robust detection of the physical location of a known resistance gene when using the B73 reference that contains the gene, but not when using the PH207 reference. This study provides the valuable resource of the Oh43-type PHJ89 genome assembly as well as SNP and expression data for 942 individuals generated using three different reference genomes.},
doi = {10.5061/dryad.dk22g4h},
url = {https://www.osti.gov/servlets/purl/1874035},
place = {United States},
year = {2019},
month = feb
}
