Skip to main content
U.S. Department of Energy
Office of Scientific and Technical Information

Compute-in-Memory Technologies and Architectures for Deep Learning Workloads

Journal Article · · IEEE Transactions on Very Large Scale Integration (VLSI) Systems
Not provided.
Research Organization:
Argonne National Laboratory (ANL), Argonne, IL (United States)
Sponsoring Organization:
USDOE
DOE Contract Number:
AC02-06CH11357
OSTI ID:
2423196
Journal Information:
IEEE Transactions on Very Large Scale Integration (VLSI) Systems, Journal Name: IEEE Transactions on Very Large Scale Integration (VLSI) Systems Journal Issue: 11 Vol. 30; ISSN 1063-8210
Publisher:
IEEE
Country of Publication:
United States
Language:
English

References (88)

Aquabolt-XL: Samsung HBM2-PIM with in-memory processing for ML accelerators and beyond conference August 2021
Temporal Coding in Spiking Neural Networks with Alpha Synaptic Function conference May 2020
Enabling Spike-Based Backpropagation for Training Deep Neural Network Architectures journal February 2020
Resistive Crossbars as Approximate Hardware Building Blocks for Machine Learning: Opportunities and Challenges journal December 2020
NVM neuromorphic core with 64k-cell (256-by-256) phase change memory synaptic array with on-chip neuron circuits for continuous in-situ learning conference December 2015
Xcel-RAM: Accelerating Binary Neural Networks in High-Throughput SRAM Compute Arrays journal August 2019
Hybrid memory cube new DRAM architecture increases density and performance conference June 2012
Reduction and IR-drop compensations techniques for reliable neuromorphic computing systems conference November 2014
PRIME: a novel processing-in-memory architecture for neural network computation in ReRAM-based main memory journal October 2016
Towards ADC-Less Compute-In-Memory Accelerators for Energy Efficient Deep Learning conference March 2022
SpiNNaker: A 1-W 18-Core System-on-Chip for Massively-Parallel Neural Network Simulation journal August 2013
TraNNsformer: Neural network transformation for memristive crossbar based neuromorphic system design conference November 2017
Deep Residual Learning for Image Recognition conference June 2016
Neural Cache: Bit-Serial In-Cache Acceleration of Deep Neural Networks conference June 2018
Volta: Performance and Programmability journal March 2018
A Twin-8T SRAM Computation-in-Memory Unit-Macro for Multibit CNN-Based AI Edge Processors journal January 2020
Spin-Transfer Torque Memories: Devices, Circuits, and Systems journal July 2016
ImageNet classification with deep convolutional neural networks journal May 2017
Input-Splitting of Large Neural Networks for Power-Efficient Accelerator with Resistive Crossbar Memory Array conference July 2018
RMP-SNN: Residual Membrane Potential Neuron for Enabling Deeper High-Accuracy and Low-Latency Spiking Neural Network conference June 2020
15.5 A 28nm 64Kb 6T SRAM Computing-in-Memory Macro with 8b MAC Operation for AI Edge Chips conference February 2020
TensorDIMM conference October 2019
DRISA: a DRAM-based Reconfigurable In-Situ Accelerator conference January 2017
Design Tools for Resistive Crossbar based Machine Learning Accelerators conference June 2021
Phase Change Memory journal December 2010
CxDNN: Hardware-software Compensation Methods for Deep Neural Networks on Resistive Crossbar Systems journal January 2020
RecNMP: Accelerating Personalized Recommendation with Near-Memory Processing conference May 2020
ELP2IM: Efficient and Low Power Bitwise Operation Processing in DRAM conference February 2020
Deep In-Memory Architectures in SRAM: An Analog Approach to Approximate Computing journal December 2020
SOT-MRAM Based Analog in-Memory Computing for DNN Inference conference June 2020
In-memory computing with ferroelectrics journal May 2020
Mapping Binary ResNets on Computing-In-Memory Hardware with Low-bit ADCs conference February 2021
PipeLayer: A Pipelined ReRAM-Based Accelerator for Deep Learning
  • Song, Linghao; Qian, Xuehai; Li, Hai
  • 2017 IEEE International Symposium on High-Performance Computer Architecture (HPCA), 2017 IEEE International Symposium on High Performance Computer Architecture (HPCA) https://doi.org/10.1109/HPCA.2017.55
conference February 2017
ISAAC: a convolutional neural network accelerator with in-situ analog arithmetic in crossbars journal June 2016
Q-PIM: A Genetic Algorithm based Flexible DNN Quantization Method and Application to Processing-In-Memory Platform conference July 2020
Enabling Robust SOT-MTJ Crossbars for Machine Learning using Sparsity-Aware Device-Circuit Co-design conference July 2021
IMPULSE: A 65-nm Digital Compute-in-Memory Macro With Fused Weights and Membrane Potential for Spike-Based Sequential Learning Tasks journal January 2021
A review of emerging non-volatile memory (NVM) technologies and applications journal November 2016
SPARE: Spiking Neural Network Acceleration Using ROM-Embedded RAMs as In-Memory-Computation Primitives journal August 2019
14.3 A 65nm Computing-in-Memory-Based CNN Processor with 2.9-to-35.8TOPS/W System Energy Efficiency Using Dynamic-Sparsity Performance-Scaling Architecture and Energy-Efficient Inter/Intra-Macro Data Reuse conference February 2020
PIM-DRAM: Accelerating Machine Learning Workloads Using Processing in Commodity DRAM journal December 2021
In-Memory Low-Cost Bit-Serial Addition Using Commodity DRAM Technology journal January 2020
Fundamental limits on the precision of in-memory architectures conference November 2020
Softermax: Hardware/Software Co-Design of an Efficient Softmax for Transformers conference December 2021
A Programmable Heterogeneous Microprocessor Based on Bit-Scalable In-Memory Computing journal September 2020
Fast-classifying, high-accuracy spiking deep networks through weight and threshold balancing conference July 2015
NeuroSim: A Circuit-Level Macro Model for Benchmarking Neuro-Inspired Architectures in Online Learning journal December 2018
R-MRAM: A ROM-Embedded STT MRAM Cache journal October 2013
Design-Technology Co-Optimizations (DTCO) for General-Purpose Computing In-Memory Based on 55nm NOR Flash Technology conference December 2021
Ferroelectric FET analog synapse for acceleration of deep neural network training conference December 2017
In-Memory Computing: Advances and Prospects journal January 2019
Technology Aware Training in Memristive Neuromorphic Systems for Nonideal Synaptic Crossbars journal October 2018
U-Net: Convolutional Networks for Biomedical Image Segmentation
  • Ronneberger, Olaf; Fischer, Philipp; Brox, Thomas
  • Medical Image Computing and Computer-Assisted Intervention – MICCAI 2015: 18th International Conference, Munich, Germany, October 5-9, 2015, Proceedings, Part III https://doi.org/10.1007/978-3-319-24574-4_28
book November 2015
Analogue signal and image processing with large memristor crossbars journal December 2017
High-Performance Mixed-Signal Neurocomputing With Nanoscale Floating-Gate Memory Cell Arrays journal October 2018
Spiking Neural Networks with Improved Inherent Recurrence Dynamics for Sequential Learning journal June 2022
Basic principles of STT-MRAM cell operation in memory arrays journal February 2013
8T SRAM Cell as a Multibit Dot-Product Engine for Beyond Von Neumann Computing journal November 2019
Assessing merged DRAM/Logic technology journal July 1999
Trends in Analog and Digital Intensive Compute-in-SRAM Designs conference June 2021
S-FLASH: A NAND Flash-based Deep Neural Network Accelerator Exploiting Bit-level Sparsity journal January 2021
PUMA: A Programmable Ultra-efficient Memristor-based Accelerator for Machine Learning Inference
  • Ankit, Aayush; Hajj, Izzat El; Chalamalasetti, Sai Rahul
  • ASPLOS '19: Architectural Support for Programming Languages and Operating Systems, Proceedings of the Twenty-Fourth International Conference on Architectural Support for Programming Languages and Operating Systems https://doi.org/10.1145/3297858.3304049
conference April 2019
Accurate deep neural network inference using computational phase-change memory journal May 2020
In-Datacenter Performance Analysis of a Tensor Processing Unit conference January 2017
SIMDRAM: a framework for bit-serial SIMD processing using DRAM
  • Hajinazar, Nastaran; Oliveira, Geraldo F.; Gregorio, Sven
  • Proceedings of the 26th ACM International Conference on Architectural Support for Programming Languages and Operating Systems https://doi.org/10.1145/3445814.3446749
conference April 2021
NVIDIA A100 GPU: Performance & Innovation for GPU Computing conference August 2020
CIM-SECDED: A 40nm 64Kb Compute In-Memory RRAM Macro with ECC Enabling Reliable Operation conference November 2021
GENIEx: A Generalized Approach to Emulating Non-Ideality in Memristive Xbars using Neural Networks conference July 2020
RRAM for Compute-in-Memory: From Inference to Training journal July 2021
BLADE: An in-Cache Computing Architecture for Edge Devices journal September 2020
You Only Look Once: Unified, Real-Time Object Detection conference June 2016
Neurocube: A Programmable Digital Neuromorphic Architecture with High-Density 3D Memory conference June 2016
Newton: A DRAM-maker’s Accelerator-in-Memory (AiM) Architecture for Machine Learning conference October 2020
ReTransformer conference November 2020
Bi-Real Net: Enhancing the Performance of 1-Bit CNNs with Improved Representational Capability and Advanced Training Algorithm book January 2018
DrAcc: a DRAM based Accelerator for Accurate CNN Inference conference June 2018
An Embedded nand Flash-Based Compute-In-Memory Array Demonstrated in a Standard Logic Process journal February 2022
Metal–Oxide RRAM journal June 2012
RecSSD: near data processing for solid state drive based recommendation inference
  • Wilkening, Mark; Gupta, Udit; Hsia, Samuel
  • Proceedings of the 26th ACM International Conference on Architectural Support for Programming Languages and Operating Systems https://doi.org/10.1145/3445814.3446763
conference April 2021
Algorithm/Hardware Co-Design for In-Memory Neural Network Computing with Minimal Peripheral Circuit Overhead conference July 2020
X-SRAM: Enabling In-Memory Boolean Computations in CMOS Static Random Access Memories journal December 2018
PANTHER: A Programmable Architecture for Neural Network Training Harnessing Energy-Efficient ReRAM journal August 2020
Tutorial: Brain-inspired computing using phase-change memory devices journal September 2018
Eyeriss: A Spatial Architecture for Energy-Efficient Dataflow for Convolutional Neural Networks conference June 2016
A 35.5-127.2 TOPS/W Dynamic Sparsity-Aware Reconfigurable-Precision Compute-in-Memory SRAM Macro for Machine Learning journal January 2021
Area Efficient ROM-Embedded SRAM Cache journal September 2013
A Relaxed Quantization Training Method for Hardware Limitations of Resistive Random Access Memory (ReRAM)-Based Computing-in-Memory journal June 2020
Tetris journal April 2017

Similar Records

Ferroelectric capacitors and field-effect transistors as in-memory computing elements for machine learning workloads
Journal Article · 2024 · Scientific Reports · OSTI ID:2581900

Variation-Resilient FeFET-Based In-Memory Computing Leveraging Probabilistic Deep Learning
Journal Article · 2024 · IEEE Transactions on Electron Devices · OSTI ID:2341283

Deep learning for computational chemistry
Journal Article · 2017 · Journal of Computational Chemistry · OSTI ID:1406688

Related Subjects