DOE PAGES: U.S. Department of Energy
Office of Scientific and Technical Information

Title: Contrasting advantages of learning with random weights and backpropagation in non-volatile memory neural networks

Abstract

Recently, a Cambrian explosion of novel non-volatile memory (NVM) devices known as memristive devices has inspired efforts to build hardware neural networks that learn like the brain. Early experimental prototypes built simple perceptrons from nanosynapses, and recently fully-connected multi-layer perceptron (MLP) learning systems have been realized. However, while backpropagating learning systems pair well with high-precision computer memories and achieve state-of-the-art performance, this typically comes with a massive energy budget. For future Internet of Things/peripheral use cases, system energy footprint will be a major constraint, and emerging NVM devices may fill the gap by sacrificing high bit precision for lower energy. In this work, we contrast the well-known MLP approach with the Extreme Learning Machine (ELM) or NoProp approach, which uses a large layer of random weights to improve the separability of high-dimensional tasks and is usually considered inferior in a software context. However, we find that when device non-linearity is taken into account, NoProp equals the hardware MLP system in terms of accuracy. Using a sign-based adaptation of the delta rule for energy savings, we also find that NoProp can learn effectively with four to six 'bits' of device analog capacity, while MLP requires eight-bit capacity with the same rule. This may allow the requirements for memristive devices to be relaxed in the context of online learning. By comparing the energy footprint of these systems for several candidate nanosynapses, as well as realistic peripherals, we confirm that memristive NoProp systems save energy compared to MLP systems. Lastly, we show that ELM/NoProp systems can achieve better generalization abilities than nanosynaptic MLP systems when paired with pre-processing layers (which do not require backpropagated error). Collectively, these advantages make such systems worthy of consideration in future accelerators or embedded hardware.
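As a rough illustration of the scheme the abstract describes, the following sketch pairs a fixed random projection layer with a quantized readout trained by a sign-based delta rule. This is not the authors' code: the layer sizes, the tanh hidden nonlinearity, the mapping from discrete conductance levels to signed weights, and the 4-bit capacity are illustrative assumptions.

# Minimal sketch (assumptions noted below) of ELM/NoProp learning with a
# sign-based delta rule. Assumed, not from the paper: layer sizes, the tanh
# nonlinearity, and the level-to-weight mapping.
import numpy as np

rng = np.random.default_rng(0)

N_IN, N_HIDDEN, N_OUT = 64, 512, 10  # illustrative layer sizes
BITS = 4                             # device analog capacity under test
LEVELS = 2 ** BITS                   # discrete conductance levels per device

W_rand = rng.standard_normal((N_IN, N_HIDDEN))      # fixed random layer, never trained
W_out = rng.integers(0, LEVELS, (N_HIDDEN, N_OUT))  # quantized readout weights

def forward(x):
    # Random projection with a nonlinearity, then a linear readout.
    h = np.tanh(x @ W_rand)
    y = h @ (W_out / (LEVELS - 1) - 0.5)  # map levels to a symmetric weight range
    return h, y

def train_step(x, target):
    # Sign-based delta rule: each synapse moves at most one conductance level
    # per update, the energy-saving simplification discussed in the abstract.
    global W_out
    h, y = forward(x)
    err = target - y
    step = np.sign(np.outer(h, err)).astype(int)
    W_out = np.clip(W_out + step, 0, LEVELS - 1)

# Example: one update on a random input toward a one-hot target.
x = rng.standard_normal(N_IN)
train_step(x, np.eye(N_OUT)[3])

Because only the readout layer is trained, no error needs to be propagated back through the random layer; this is what lets the hidden weights stay fixed and low-precision.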

Authors:
 Bennett, Christopher H. [1]; Parmar, Vivek [2]; Calvet, Laurie E. [3]; Klein, Jacques-Olivier [3]; Suri, Manan [2]; Marinella, Matthew J. [4]; Querlioz, Damien [3]
  1. Univ. Paris-Sud, Universite Paris-Saclay (France); Sandia National Lab. (SNL-NM), Albuquerque, NM (United States)
  2. Indian Institute of Technology Delhi, New Delhi (India)
  3. Univ. Paris-Sud, Universite Paris-Saclay (France)
  4. Sandia National Lab. (SNL-NM), Albuquerque, NM (United States)
Publication Date:
May 30, 2019
Research Org.:
Sandia National Lab. (SNL-NM), Albuquerque, NM (United States)
Sponsoring Org.:
USDOE National Nuclear Security Administration (NNSA)
OSTI Identifier:
1526218
Report Number(s):
SAND-2019-5935J
Journal ID: ISSN 2169-3536; 675857
Grant/Contract Number:  
AC04-94AL85000; NA0003525
Resource Type:
Accepted Manuscript
Journal Name:
IEEE Access
Additional Journal Information:
Journal Volume: 7; Journal ID: ISSN 2169-3536
Publisher:
IEEE
Country of Publication:
United States
Language:
English
Subject:
97 MATHEMATICS AND COMPUTING; hardware neural networks; memristive devices; online learning; edge computing

Citation Formats

Bennett, Christopher H., Parmar, Vivek, Calvet, Laurie E., Klein, Jacques-Olivier, Suri, Manan, Marinella, Matthew J., and Querlioz, Damien. Contrasting advantages of learning with random weights and backpropagation in non-volatile memory neural networks. United States: N. p., 2019. Web. doi:10.1109/ACCESS.2019.2920076.
Bennett, Christopher H., Parmar, Vivek, Calvet, Laurie E., Klein, Jacques-Olivier, Suri, Manan, Marinella, Matthew J., & Querlioz, Damien. Contrasting advantages of learning with random weights and backpropagation in non-volatile memory neural networks. United States. https://doi.org/10.1109/ACCESS.2019.2920076
Bennett, Christopher H., Parmar, Vivek, Calvet, Laurie E., Klein, Jacques-Olivier, Suri, Manan, Marinella, Matthew J., and Querlioz, Damien. Thu May 30, 2019. "Contrasting advantages of learning with random weights and backpropagation in non-volatile memory neural networks". United States. https://doi.org/10.1109/ACCESS.2019.2920076. https://www.osti.gov/servlets/purl/1526218.
@article{osti_1526218,
title = {Contrasting advantages of learning with random weights and backpropagation in non-volatile memory neural networks},
author = {Bennett, Christopher H. and Parmar, Vivek and Calvet, Laurie E. and Klein, Jacques-Olivier and Suri, Manan and Marinella, Matthew J. and Querlioz, Damien},
abstractNote = {Recently, a Cambrian explosion of novel, non-volatile memory (NVM) devices known as memristive devices have inspired effort in building hardware neural networks that learn like the brain. Early experimental prototypes built simple perceptrons from nanosynapses, and recently, fully-connected multi-layer perceptron (MLP) learning systems have been realized. However, while backpropagating learning systems pair well with high-precision computer memories and achieve state-of-the-art performances, this typically comes with a massive energy budget. For future Internet of Things/peripheral use cases, system energy footprint will be a major constraint, and emerging NVM devices may fill the gap by sacrificing high bit precision for lower energy. In this work, we contrast the well known MLP approach with the Extreme Learning Machine (ELM) or NoProp approach, which uses a large layer of random weights to improve the separability of high-dimensional tasks, and is usually considered inferior in a software context. However, we find that when taking device non-linearity into account, NoProp manages to equal hardware MLP system in terms of accuracy. While also using a sign-based adaptation of the delta rule for energy-savings, we find that NoProp can learn effectively with four to six ’bits’ of device analog capacity, while MLP requires eight bit capacity with the same rule. This may allow the requirements for memristive devices to be relaxed in the context of online learning. By comparing the energy footprint of these systems for several candidate nanosynapses, as well as realistic peripherals, we confirm that memristive NoProp systems save energy compared to MLP systems. Lastly, we show that ELM/NoProp systems can achieve better generalization abilities than nanosynaptic MLP systems when paired with pre-processing layers (which do not require backpropagated error). Collectively, these advantages make such systems worthy of consideration in future accelerators or embedded hardware.},
doi = {10.1109/ACCESS.2019.2920076},
journal = {IEEE Access},
volume = 7,
place = {United States},
year = {2019},
month = {5}
}

Journal Article:
Free Publicly Available Full Text
Publisher's Version of Record

Citation Metrics:
Cited by: 7 works
Citation information provided by
Web of Science

Figures / Tables:

FIGURE 1: (a) and (b) show jump tables for device evolution starting at $G_{on}$/$G_{max}$ and $G_{off}$/$G_{min}$ conductance, respectively, using the linear model (Eqn. 1); (c) and (d) depict the same for the non-linear case (Eqn. 2), where $\Delta G$ is modulated by the device's state relative to its extrema.
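A hedged sketch of the two device-update models the caption contrasts. Eqns. 1 and 2 are not reproduced in this record, so the non-linear form below is one common choice from the memristive-synapse literature (step size proportional to the distance from the nearest conductance extremum) and may differ in detail from the paper's; the bounds and step-size parameter are illustrative.

# Assumed, not from the paper: G_MIN, G_MAX, ALPHA, and the exact non-linear form.
G_MIN, G_MAX = 0.0, 1.0  # illustrative conductance bounds
ALPHA = 0.05             # illustrative step-size parameter

def update_linear(g, potentiate):
    # Linear model (cf. Eqn. 1): a fixed step, independent of device state.
    step = ALPHA if potentiate else -ALPHA
    return min(max(g + step, G_MIN), G_MAX)

def update_nonlinear(g, potentiate):
    # Non-linear model (cf. Eqn. 2): the step shrinks as the conductance
    # approaches its extremum, as in panels (c) and (d).
    step = ALPHA * (G_MAX - g) if potentiate else -ALPHA * (g - G_MIN)
    return min(max(g + step, G_MIN), G_MAX)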


Works referencing / citing this record:

Voltage control of domain walls in magnetic nanowires for energy-efficient neuromorphic devices
journal, January 2020

  • Azam, Md Ali; Bhattacharya, Dhritiman; Querlioz, Damien
  • Nanotechnology, Vol. 31, Issue 14
  • DOI: 10.1088/1361-6528/ab6234