Skip to main content

Showing 1–15 of 15 results for author: Lu, W D

.
  1. arXiv:2310.09385  [pdf, other

    cs.AR

    PIM-GPT: A Hybrid Process-in-Memory Accelerator for Autoregressive Transformers

    Authors: Yuting Wu, Ziyu Wang, Wei D. Lu

    Abstract: Decoder-only Transformer models such as GPT have demonstrated exceptional performance in text generation, by autoregressively predicting the next token. However, the efficacy of running GPT on current hardware systems is bounded by low compute-to-memory-ratio and high memory access. Process-in-memory (PIM) architectures can minimize off-chip data movement and utilize high internal bandwidth. They… ▽ More

    Submitted 13 April, 2024; v1 submitted 13 October, 2023; originally announced October 2023.

  2. arXiv:2305.14547  [pdf

    cs.AR cs.ET cs.LG

    Bulk-Switching Memristor-based Compute-In-Memory Module for Deep Neural Network Training

    Authors: Yuting Wu, Qiwen Wang, Ziyu Wang, Xinxin Wang, Buvna Ayyagari, Siddarth Krishnan, Michael Chudzik, Wei D. Lu

    Abstract: The need for deep neural network (DNN) models with higher performance and better functionality leads to the proliferation of very large models. Model training, however, requires intensive computation time and energy. Memristor-based compute-in-memory (CIM) modules can perform vector-matrix multiplication (VMM) in situ and in parallel, and have shown great promises in DNN inference applications. Ho… ▽ More

    Submitted 23 May, 2023; originally announced May 2023.

    Journal ref: Adv. Mater.35 (2023) 2305465

  3. arXiv:2304.11056  [pdf, other

    cs.CR cs.LG

    PowerGAN: A Machine Learning Approach for Power Side-Channel Attack on Compute-in-Memory Accelerators

    Authors: Ziyu Wang, Yuting Wu, Yongmo Park, Sangmin Yoo, Xinxin Wang, Jason K. Eshraghian, Wei D. Lu

    Abstract: Analog compute-in-memory (CIM) systems are promising for deep neural network (DNN) inference acceleration due to their energy efficiency and high throughput. However, as the use of DNNs expands, protecting user input privacy has become increasingly important. In this paper, we identify a potential security vulnerability wherein an adversary can reconstruct the user's private input data from a powe… ▽ More

    Submitted 27 May, 2023; v1 submitted 13 April, 2023; originally announced April 2023.

  4. arXiv:2303.10770  [pdf, other

    cs.CV cs.AI eess.IV

    RN-Net: Reservoir Nodes-Enabled Neuromorphic Vision Sensing Network

    Authors: Sangmin Yoo, Eric Yeu-Jer Lee, Ziyu Wang, Xinxin Wang, Wei D. Lu

    Abstract: Event-based cameras are inspired by the sparse and asynchronous spike representation of the biological visual system. However, processing the event data requires either using expensive feature descriptors to transform spikes into frames, or using spiking neural networks that are expensive to train. In this work, we propose a neural network architecture, Reservoir Nodes-enabled neuromorphic vision… ▽ More

    Submitted 24 May, 2024; v1 submitted 19 March, 2023; originally announced March 2023.

    Comments: 12 pages, 5 figures, 4 tables

  5. arXiv:2211.10725  [pdf, other

    cs.LG

    Intelligence Processing Units Accelerate Neuromorphic Learning

    Authors: Pao-Sheng Vincent Sun, Alexander Titterton, Anjlee Gopiani, Tim Santos, Arindam Basu, Wei D. Lu, Jason K. Eshraghian

    Abstract: Spiking neural networks (SNNs) have achieved orders of magnitude improvement in terms of energy consumption and latency when performing inference with deep learning workloads. Error backpropagation is presently regarded as the most effective method for training SNNs, but in a twist of irony, when training on modern graphics processing units (GPUs) this becomes more expensive than non-spiking netwo… ▽ More

    Submitted 19 November, 2022; originally announced November 2022.

    Comments: 10 pages, 9 figures, journal

  6. Side-channel attack analysis on in-memory computing architectures

    Authors: Ziyu Wang, Fan-hsuan Meng, Yongmo Park, Jason K. Eshraghian, Wei D. Lu

    Abstract: In-memory computing (IMC) systems have great potential for accelerating data-intensive tasks such as deep neural networks (DNNs). As DNN models are generally highly proprietary, the neural network architectures become valuable targets for attacks. In IMC systems, since the whole model is mapped on chip and weight memory read can be restricted, the pre-mapped DNN model acts as a ``black box'' for u… ▽ More

    Submitted 25 March, 2023; v1 submitted 6 September, 2022; originally announced September 2022.

    Journal ref: IEEE Transactions on Emerging Topics in Computing (2023)

  7. arXiv:2206.12992  [pdf, other

    cs.NE cs.AI cs.AR cs.ET

    Gradient-based Neuromorphic Learning on Dynamical RRAM Arrays

    Authors: Peng Zhou, Jason K. Eshraghian, Dong-Uk Choi, Wei D. Lu, Sung-Mo Kang

    Abstract: We present MEMprop, the adoption of gradient-based learning to train fully memristive spiking neural networks (MSNNs). Our approach harnesses intrinsic device dynamics to trigger naturally arising voltage spikes. These spikes emitted by memristive dynamics are analog in nature, and thus fully differentiable, which eliminates the need for surrogate gradient methods that are prevalent in the spiking… ▽ More

    Submitted 26 June, 2022; originally announced June 2022.

  8. arXiv:2202.07221  [pdf, other

    cs.LG cs.NE

    Navigating Local Minima in Quantized Spiking Neural Networks

    Authors: Jason K. Eshraghian, Corey Lammie, Mostafa Rahimi Azghadi, Wei D. Lu

    Abstract: Spiking and Quantized Neural Networks (NNs) are becoming exceedingly important for hyper-efficient implementations of Deep Learning (DL) algorithms. However, these networks face challenges when trained using error backpropagation, due to the absence of gradient signals when applying hard thresholds. The broadly accepted trick to overcoming this is through the use of biased gradient estimators: sur… ▽ More

    Submitted 15 February, 2022; originally announced February 2022.

  9. arXiv:2201.11915  [pdf, other

    cs.NE cs.LG q-bio.NC

    The fine line between dead neurons and sparsity in binarized spiking neural networks

    Authors: Jason K. Eshraghian, Wei D. Lu

    Abstract: Spiking neural networks can compensate for quantization error by encoding information either in the temporal domain, or by processing discretized quantities in hidden states of higher precision. In theory, a wide dynamic range state-space enables multiple binarized inputs to be accumulated together, thus improving the representational capacity of individual neurons. This may be achieved by increas… ▽ More

    Submitted 27 January, 2022; originally announced January 2022.

  10. arXiv:2201.06703  [pdf, other

    cs.ET cs.AI cs.AR

    Design Space Exploration of Dense and Sparse Mapping Schemes for RRAM Architectures

    Authors: Corey Lammie, Jason K. Eshraghian, Chenqi Li, Amirali Amirsoleimani, Roman Genov, Wei D. Lu, Mostafa Rahimi Azghadi

    Abstract: The impact of device and circuit-level effects in mixed-signal Resistive Random Access Memory (RRAM) accelerators typically manifest as performance degradation of Deep Learning (DL) algorithms, but the degree of impact varies based on algorithmic features. These include network architecture, capacity, weight distribution, and the type of inter-layer connections. Techniques are continuously emergin… ▽ More

    Submitted 24 January, 2022; v1 submitted 17 January, 2022; originally announced January 2022.

    Comments: Accepted at 2022 IEEE International Symposium on Circuits and Systems (ISCAS). [v2] Fixed incorrectly labeled author affiliations for Chenqi Li, Amirali Amirsoleimani, and Roman Genov

  11. arXiv:2109.12894  [pdf, other

    cs.NE cs.ET cs.LG

    Training Spiking Neural Networks Using Lessons From Deep Learning

    Authors: Jason K. Eshraghian, Max Ward, Emre Neftci, Xinxin Wang, Gregor Lenz, Girish Dwivedi, Mohammed Bennamoun, Doo Seok Jeong, Wei D. Lu

    Abstract: The brain is the perfect place to look for inspiration to develop more efficient neural networks. The inner workings of our synapses and neurons provide a glimpse at what the future of deep learning might look like. This paper serves as a tutorial and perspective showing how to apply the lessons learnt from several decades of research in deep learning, gradient descent, backpropagation and neurosc… ▽ More

    Submitted 13 August, 2023; v1 submitted 27 September, 2021; originally announced September 2021.

  12. arXiv:2105.06923  [pdf

    cs.ET cs.AI cs.LG

    Hierarchical Architectures in Reservoir Computing Systems

    Authors: John Moon, Wei D. Lu

    Abstract: Reservoir computing (RC) offers efficient temporal data processing with a low training cost by separating recurrent neural networks into a fixed network with recurrent connections and a trainable linear network. The quality of the fixed network, called reservoir, is the most important factor that determines the performance of the RC system. In this paper, we investigate the influence of the hierar… ▽ More

    Submitted 14 May, 2021; originally announced May 2021.

  13. arXiv:2103.06506  [pdf, other

    cs.ET cs.AI cs.AR cs.LG

    Memristive Stochastic Computing for Deep Learning Parameter Optimization

    Authors: Corey Lammie, Jason K. Eshraghian, Wei D. Lu, Mostafa Rahimi Azghadi

    Abstract: Stochastic Computing (SC) is a computing paradigm that allows for the low-cost and low-power computation of various arithmetic operations using stochastic bit streams and digital logic. In contrast to conventional representation schemes used within the binary domain, the sequence of bit streams in the stochastic domain is inconsequential, and computation is usually non-deterministic. In this brief… ▽ More

    Submitted 11 March, 2021; originally announced March 2021.

    Comments: Accepted by IEEE Transactions on Circuits and Systems Part II: Express Briefs

    Journal ref: IEEE Transactions on Circuits and Systems Part II: Express Briefs, 2021

  14. arXiv:2001.05430  [pdf, other

    q-bio.NC eess.IV q-bio.QM

    A Real-Time Retinomorphic Simulator Using a Conductance-Based Discrete Neuronal Network

    Authors: Jason K. Eshraghian, Seungbum Baek, Wesley Thio, Yulia Sandamirskaya, Herbert H. C. Iu, Wei D. Lu

    Abstract: We present an optimized conductance-based retina microcircuit simulator which transforms light stimuli into a series of graded and spiking action potentials through photo transduction. We use discrete retinal neuron blocks based on a collation of single-compartment models and morphologically realistic formulations, and successfully achieve a biologically real-time simulator. This is done by optimi… ▽ More

    Submitted 26 December, 2019; originally announced January 2020.

    Comments: 5 pages, 4 figures, accepted for 2020 IEEE AICAS

  15. arXiv:1612.02913  [pdf, other

    cs.ET cs.AR cs.NE

    Field-Programmable Crossbar Array (FPCA) for Reconfigurable Computing

    Authors: Mohammed A. Zidan, YeonJoo Jeong, Jong Hong Shin, Chao Du, Zhengya Zhang, Wei D. Lu

    Abstract: For decades, advances in electronics were directly driven by the scaling of CMOS transistors according to Moore's law. However, both the CMOS scaling and the classical computer architecture are approaching fundamental and practical limits, and new computing architectures based on emerging devices, such as resistive random-access memory (RRAM) devices, are expected to sustain the exponential growth… ▽ More

    Submitted 20 July, 2017; v1 submitted 8 December, 2016; originally announced December 2016.