Exact and inexact subsampled Newton methods for optimization

Bollapragada, Raghu; Byrd, Richard H.; Nocedal, Jorge

doi:10.1093/imanum/dry009

Exact and inexact subsampled Newton methods for optimization

Journal Article · Tue Apr 03 00:00:00 EDT 2018 · IMA Journal of Numerical Analysis

DOI:https://doi.org/10.1093/imanum/dry009· OSTI ID:1610078

Bollapragada, Raghu ^[1]; Byrd, Richard H. ^[2]; Nocedal, Jorge ^[2]

Department of Industrial Engineering and Management Sciences, Northwestern University, Evanston, IL, USA; DOE/OSTI
Department of Computer Science, University of Colorado, Boulder, CO, USA

Abstract

The paper studies the solution of stochastic optimization problems in which approximations to the gradient and Hessian are obtained through subsampling. We first consider Newton-like methods that employ these approximations and discuss how to coordinate the accuracy in the gradient and Hessian to yield a superlinear rate of convergence in expectation. The second part of the paper analyzes an inexact Newton method that solves linear systems approximately using the conjugate gradient (CG) method, and that samples the Hessian and not the gradient (the gradient is assumed to be exact). We provide a complexity analysis for this method based on the properties of the CG iteration and the quality of the Hessian approximation, and compare it with a method that employs a stochastic gradient iteration instead of the CG method. We report preliminary numerical results that illustrate the performance of inexact subsampled Newton methods on machine learning applications based on logistic regression.

Research Organization:: Northwestern Univ., Evanston, IL (United States)

Sponsoring Organization:: USDOE Office of Science (SC)

DOE Contract Number:: FG02-87ER25047

OSTI ID:: 1610078

Journal Information:: IMA Journal of Numerical Analysis, Journal Name: IMA Journal of Numerical Analysis Journal Issue: 2 Vol. 39; ISSN 0272-4979

Publisher:: Oxford University Press/Institute of Mathematics and its Applications

Country of Publication:: United States

Language:: English

References (12)

Numerical Optimization Nocedal, Jorge; Wright, Stephen J. Springer Series in Operations Research and Financial Engineering https://doi.org/10.1007/b98874	book	January 1999
Comparative accuracies of artificial neural networks and discriminant analysis in predicting forest cover types from cartographic variables Blackard, Jock A.; Dean, Denis J. Computers and Electronics in Agriculture, Vol. 24, Issue 3 https://doi.org/10.1016/S0168-1699(99)00046-0	journal	December 1999
Computational Methods for Sparse Solution of Linear Inverse Problems Tropp, Joel A.; Wright, Stephen J. Proceedings of the IEEE, Vol. 98, Issue 6 https://doi.org/10.1109/JPROC.2010.2044010	journal	June 2010
Efficient least-squares imaging with sparsity promotion and compressive sensing: Compressive imaging Herrmann, Felix J.; Li, Xiang Geophysical Prospecting, Vol. 60, Issue 4 https://doi.org/10.1111/j.1365-2478.2011.01041.x	journal	January 2012
On the Use of Stochastic Hessian Information in Optimization Methods for Machine Learning Byrd, Richard H.; Chin, Gillian M.; Neveitt, Will SIAM Journal on Optimization, Vol. 21, Issue 3 https://doi.org/10.1137/10079923X	journal	July 2011
Sample size selection in optimization methods for machine learning Byrd, Richard H.; Chin, Gillian M.; Nocedal, Jorge Mathematical Programming, Vol. 134, Issue 1 https://doi.org/10.1007/s10107-012-0572-5	journal	June 2012
On-line learning for very large data sets Bottou, L�on; Le Cun, Yann Applied Stochastic Models in Business and Industry, Vol. 21, Issue 2 https://doi.org/10.1002/asmb.538	journal	January 2005
Inexact Newton Methods Dembo, Ron S.; Eisenstat, Stanley C.; Steihaug, Trond SIAM Journal on Numerical Analysis, Vol. 19, Issue 2 https://doi.org/10.1137/0719025	journal	April 1982
Parallel Boosting with Momentum Mukherjee, Indraneel; Canini, Kevin; Frongillo, Rafael Advanced Information Systems Engineering https://doi.org/10.1007/978-3-642-40994-3_2	book	January 2013
Simulation optimization: a review of algorithms and applications Amaran, Satyajith; Sahinidis, Nikolaos V.; Sharda, Bikram 4OR, Vol. 12, Issue 4 https://doi.org/10.1007/s10288-014-0275-2	journal	November 2014
Hybrid Deterministic-Stochastic Methods for Data Fitting Friedlander, Michael P.; Schmidt, Mark SIAM Journal on Scientific Computing, Vol. 34, Issue 3 https://doi.org/10.1137/110830629	journal	January 2012
Training Deep and Recurrent Networks with Hessian-Free Optimization Martens, James; Sutskever, Ilya Lecture Notes in Computer Science https://doi.org/10.1007/978-3-642-35289-8_27	book	January 2012

Similar Records

An investigation of Newton-Sketch and subsampled Newton methods

Journal Article · Tue Feb 11 23:00:00 EST 2020 · Optimization Methods and Software · OSTI ID:1657509

Inexact Newton-CG algorithms with complexity guarantees

Journal Article · Mon Aug 22 00:00:00 EDT 2022 · IMA Journal of Numerical Analysis · OSTI ID:2424937

An inexact regularized Newton framework with a worst-case iteration complexity of $ {\mathscr O}(\varepsilon^{-3/2}) $ for nonconvex optimization

Journal Article · Tue May 08 00:00:00 EDT 2018 · IMA Journal of Numerical Analysis · OSTI ID:1611471

Related Subjects

Mathematics

Exact and inexact subsampled Newton methods for optimization

Citation Formats

References (12)

Similar Records

Related Subjects