DOE PAGES title logo U.S. Department of Energy
Office of Scientific and Technical Information

Title: Stochastic average model methods

Journal Article · · Computational Optimization and Applications
ORCiD logo [1]; ORCiD logo [2]
  1. Argonne National Laboratory (ANL), Argonne, IL (United States)
  2. Argonne National Laboratory (ANL), Argonne, IL (United States); Lawrence Berkeley National Laboratory (LBNL), Berkeley, CA (United States)

We consider the solution of finite-sum minimization problems, such as those appearing in nonlinear least-squares or general empirical risk minimization problems. We are motivated by problems in which the summand functions are computationally expensive and evaluating all summands on every iteration of an optimization method may be undesirable. Here we present the idea of stochastic average model (SAM) methods, inspired by stochastic average gradient methods. SAM methods sample component functions on each iteration of a trust-region method according to a discrete probability distribution on component functions; the distribution is designed to minimize an upper bound on the variance of the resulting stochastic model. We present promising numerical results concerning an implemented variant extending the derivative-free model-based trust-region solver POUNDERS, which we name SAM-POUNDERS.

Research Organization:
Argonne National Laboratory (ANL), Argonne, IL (United States)
Sponsoring Organization:
USDOE Office of Science (SC), Advanced Scientific Computing Research (ASCR). Scientific Discovery through Advanced Computing (SciDAC)
Grant/Contract Number:
AC02-06CH11357
OSTI ID:
2476911
Journal Information:
Computational Optimization and Applications, Journal Name: Computational Optimization and Applications Journal Issue: 2 Vol. 88; ISSN 0926-6003
Publisher:
SpringerCopyright Statement
Country of Publication:
United States
Language:
English

References (19)

General Properties and Estimation of Conditional Bernoulli Models journal July 2000
Minimizing finite sums with the stochastic average gradient journal June 2016
Stochastic optimization using a trust-region method and random models journal April 2017
On the local convergence of a derivative-free algorithm for least-squares minimization journal October 2010
On optimal probabilities in stochastic coordinate descent methods journal July 2015
A derivative-free Gauss–Newton method journal May 2019
A Python surrogate modeling framework with derivatives journal September 2019
Optimization and supervised machine learning methods for fitting numerical physics models without derivatives journal January 2021
Weighted finite population sampling to maximize entropy journal January 1994
Benchmarking Derivative-Free Optimization Algorithms journal January 2009
A Derivative-Free Algorithm for Least-Squares Minimization journal January 2010
Introduction to Derivative-Free Optimization book January 2009
Trust Region Methods book January 2000
Chapter 40: POUNDERS in TAO: Solving Derivative-Free Nonlinear Least-Squares Problems with POUNDERS book April 2017
Stochastic First- and Zeroth-Order Methods for Nonconvex Stochastic Programming journal January 2013
Optimization Methods for Large-Scale Machine Learning journal January 2018
Stochastic Estimation of the Maximum of a Regression Function journal September 1952
A Stochastic Approximation Method journal September 1951
Convergence Rate Analysis of a Stochastic Trust-Region Method via Supermartingales journal April 2019