DOE PAGES · U.S. Department of Energy
Office of Scientific and Technical Information

Title: Breaking the curse of dimensionality to identify causal variants in Breeding 4

Journal Article · Theoretical and Applied Genetics
  1. Cornell Univ., Ithaca, NY (United States). Inst. of Biotechnology, Inst. for Genomic Diversity
  2. Cornell Univ., Ithaca, NY (United States). Section of Plant Breeding and Genetics
  3. Cornell Univ., Ithaca, NY (United States). Inst. of Biotechnology, Inst. for Genomic Diversity and Section of Plant Breeding and Genetics; US Dept. of Agriculture (USDA), Ithaca, NY (United States). Agricultural Research Service (ARS)

In the past, plant breeding has undergone three major transformations and is currently transitioning to a new technological phase, Breeding 4. This phase is characterized by the development of methods for biological design of plant varieties, including transformation and gene editing techniques directed toward causal loci. Applying such technologies will require reliably estimating the effects of loci in plant genomes by avoiding the situation where the number of loci assayed ($$p$$) surpasses the number of plant genotypes ($$n$$). Here, we discuss approaches to avoid this curse of dimensionality ($$n\ll p$$), which will involve analyzing intermediate phenotypes such as molecular traits and component traits related to plant morphology or physiology. Because these approaches will rely on novel data types such as DNA sequences and high-throughput phenotyping images, Breeding 4 will call for analyses that are complementary to traditional quantitative genetic studies and based on machine learning techniques that make efficient use of sequence and image data. In this article, we present some of these techniques and their application to prioritizing causal loci and developing improved varieties in Breeding 4.

Research Organization:
UHV Technologies, Inc., Fort Worth, TX (United States); Cornell Univ., Ithaca, NY (United States)
Sponsoring Organization:
USDA; Agricultural Research Service (ARS); National Science Foundation (NSF); USDOE Advanced Research Projects Agency - Energy (ARPA-E); US Agency for International Development (USAID)
Grant/Contract Number:
AR0000422
OSTI ID:
1623594
Journal Information:
Theoretical and Applied Genetics, Vol. 132, Issue 3; ISSN 0040-5752
Publisher:
Springer Nature
Country of Publication:
United States
Language:
English

Cited By (4)

A sorghum practical haplotype graph facilitates genome‐wide imputation and cost‐effective genomic prediction journal March 2020
From QTLs to Adaptation Landscapes: Using Genotype-To-Phenotype Models to Characterize G×E Over Time journal December 2019
Biological reality and parsimony in crop models—why we need both in crop improvement! journal January 2019
QTG-Finder2: A Generalized Machine-Learning Algorithm for Prioritizing QTL Causal Genes in Plants journal May 2020
