skip to main content
OSTI.GOV title logo U.S. Department of Energy
Office of Scientific and Technical Information

Title: A History of Regression and Related Model-Fitting in the Earth Sciences (1636?-2000)

Abstract

The (statistical) modeling of the behavior of a dependent variate as a function of one or more predictors provides examples of model-fitting which span the development of the earth sciences from the 17th Century to the present. The historical development of these methods and their subsequent application is reviewed. Bond's predictions (c. 1636 and 1668) of change in the magnetic declination at London may be the earliest attempt to fit such models to geophysical data. Following publication of Newton's theory of gravitation in 1726, analysis of data on the length of a 1{sup o} meridian arc, and the length of a pendulum beating seconds, as a function of sin{sup 2}(latitude), was used to determine the ellipticity of the oblate spheroid defining the Figure of the Earth. The pioneering computational methods of Mayer in 1750, Boscovich in 1755, and Lambert in 1765, and the subsequent independent discoveries of the principle of least squares by Gauss in 1799, Legendre in 1805, and Adrain in 1808, and its later substantiation on the basis of probability theory by Gauss in 1809 were all applied to the analysis of such geodetic and geophysical data. Notable later applications include: the geomagnetic survey of Ireland by Lloyd,more » Sabine, and Ross in 1836, Gauss's model of the terrestrial magnetic field in 1838, and Airy's 1845 analysis of the residuals from a fit to pendulum lengths, from which he recognized the anomalous character of measurements of gravitational force which had been made on islands. In the early 20th Century applications to geological topics proliferated, but the computational burden effectively held back applications of multivariate analysis. Following World War II, the arrival of digital computers in universities in the 1950s facilitated computation, and fitting linear or polynomial models as a function of geographic coordinates, trend surface analysis, became popular during the 1950-60s. The inception of geostatistics in France at this time by Matheron had its roots in meeting the evident need for improved estimators in spatial interpolation. Technical advances in regression analysis during the 1970s embraced the development of regression diagnostics and consequent attention to outliers; the recognition of problems caused by correlated predictors, and the subsequent introduction of ridge regression to overcome them; and techniques for fitting errors-in-variables and mixture models. Improvements in computational power have enabled ever more computer-intensive methods to be applied. These include algorithms which are robust in the presence of outliers, for example Rousseeuw's 1984 Least Median Squares; nonparametric smoothing methods, such as kernel-functions, splines and Cleveland's 1979 LOcally WEighted Scatterplot Smoother (LOWESS); and the Classification and Regression Tree (CART) technique of Breiman and others in 1984. Despite a continuing improvement in the rate of technology-transfer from the statistical to the earth-science community, despite an abrupt drop to a time-lag of about 10 years following the introduction of digital computers, these more recent developments are only just beginning to penetrate beyond the research community of earth scientists. Examples of applications to problem-solving in the earth sciences are given.« less

Authors:
 [1]
  1. University College London, Department of Geological Sciences (United Kingdom), E-mail: r.howarth@ucl.ac.uk
Publication Date:
OSTI Identifier:
21064320
Resource Type:
Journal Article
Journal Name:
Natural Resources Research (New York, N.Y.)
Additional Journal Information:
Journal Volume: 10; Journal Issue: 4; Other Information: DOI: 10.1023/A:1013928826796; Copyright (c) 2001 International Association for Mathematical Geology; Country of input: International Atomic Energy Agency (IAEA); Journal ID: ISSN 1520-7439
Country of Publication:
United States
Language:
English
Subject:
99 GENERAL AND MISCELLANEOUS//MATHEMATICS, COMPUTING, AND INFORMATION SCIENCE; ALGORITHMS; CALCULATION METHODS; CLEVELAND; DATA ANALYSIS; DIGITAL COMPUTERS; FRANCE; INTERPOLATION; IRELAND; LEAST SQUARE FIT; MAGNETIC FIELDS; MATHEMATICAL MODELS; MULTIVARIATE ANALYSIS; POLYNOMIALS; PROBABILITY; REGRESSION ANALYSIS; TECHNOLOGY TRANSFER

Citation Formats

Howarth, Richard J. A History of Regression and Related Model-Fitting in the Earth Sciences (1636?-2000). United States: N. p., 2001. Web. doi:10.1023/A:1013928826796.
Howarth, Richard J. A History of Regression and Related Model-Fitting in the Earth Sciences (1636?-2000). United States. https://doi.org/10.1023/A:1013928826796
Howarth, Richard J. 2001. "A History of Regression and Related Model-Fitting in the Earth Sciences (1636?-2000)". United States. https://doi.org/10.1023/A:1013928826796.
@article{osti_21064320,
title = {A History of Regression and Related Model-Fitting in the Earth Sciences (1636?-2000)},
author = {Howarth, Richard J.},
abstractNote = {The (statistical) modeling of the behavior of a dependent variate as a function of one or more predictors provides examples of model-fitting which span the development of the earth sciences from the 17th Century to the present. The historical development of these methods and their subsequent application is reviewed. Bond's predictions (c. 1636 and 1668) of change in the magnetic declination at London may be the earliest attempt to fit such models to geophysical data. Following publication of Newton's theory of gravitation in 1726, analysis of data on the length of a 1{sup o} meridian arc, and the length of a pendulum beating seconds, as a function of sin{sup 2}(latitude), was used to determine the ellipticity of the oblate spheroid defining the Figure of the Earth. The pioneering computational methods of Mayer in 1750, Boscovich in 1755, and Lambert in 1765, and the subsequent independent discoveries of the principle of least squares by Gauss in 1799, Legendre in 1805, and Adrain in 1808, and its later substantiation on the basis of probability theory by Gauss in 1809 were all applied to the analysis of such geodetic and geophysical data. Notable later applications include: the geomagnetic survey of Ireland by Lloyd, Sabine, and Ross in 1836, Gauss's model of the terrestrial magnetic field in 1838, and Airy's 1845 analysis of the residuals from a fit to pendulum lengths, from which he recognized the anomalous character of measurements of gravitational force which had been made on islands. In the early 20th Century applications to geological topics proliferated, but the computational burden effectively held back applications of multivariate analysis. Following World War II, the arrival of digital computers in universities in the 1950s facilitated computation, and fitting linear or polynomial models as a function of geographic coordinates, trend surface analysis, became popular during the 1950-60s. The inception of geostatistics in France at this time by Matheron had its roots in meeting the evident need for improved estimators in spatial interpolation. Technical advances in regression analysis during the 1970s embraced the development of regression diagnostics and consequent attention to outliers; the recognition of problems caused by correlated predictors, and the subsequent introduction of ridge regression to overcome them; and techniques for fitting errors-in-variables and mixture models. Improvements in computational power have enabled ever more computer-intensive methods to be applied. These include algorithms which are robust in the presence of outliers, for example Rousseeuw's 1984 Least Median Squares; nonparametric smoothing methods, such as kernel-functions, splines and Cleveland's 1979 LOcally WEighted Scatterplot Smoother (LOWESS); and the Classification and Regression Tree (CART) technique of Breiman and others in 1984. Despite a continuing improvement in the rate of technology-transfer from the statistical to the earth-science community, despite an abrupt drop to a time-lag of about 10 years following the introduction of digital computers, these more recent developments are only just beginning to penetrate beyond the research community of earth scientists. Examples of applications to problem-solving in the earth sciences are given.},
doi = {10.1023/A:1013928826796},
url = {https://www.osti.gov/biblio/21064320}, journal = {Natural Resources Research (New York, N.Y.)},
issn = {1520-7439},
number = 4,
volume = 10,
place = {United States},
year = {Sat Dec 15 00:00:00 EST 2001},
month = {Sat Dec 15 00:00:00 EST 2001}
}