Skip to main content

Showing 1–10 of 10 results for author: Rasch, M J

Searching in archive cs. Search in all archives.
.
  1. arXiv:2506.00004  [pdf, other

    cs.AR cs.AI cs.ET

    Rapid yet accurate Tile-circuit and device modeling for Analog In-Memory Computing

    Authors: J. Luquin, C. Mackin, S. Ambrogio, A. Chen, F. Baldi, G. Miralles, M. J. Rasch, J. Büchel, M. Lalwani, W. Ponghiran, P. Solomon, H. Tsai, G. W. Burr, P. Narayanan

    Abstract: Analog In-Memory Compute (AIMC) can improve the energy efficiency of Deep Learning by orders of magnitude. Yet analog-domain device and circuit non-idealities -- within the analog ``Tiles'' performing Matrix-Vector Multiply (MVM) operations -- can degrade neural-network task accuracy. We quantify the impact of low-level distortions and noise, and develop a mathematical model for Multiply-ACcumulat… ▽ More

    Submitted 5 May, 2025; originally announced June 2025.

  2. arXiv:2505.11067  [pdf, other

    cs.LG cs.AI cs.AR cs.CV cs.DC cs.NE

    Assessing the Performance of Analog Training for Transfer Learning

    Authors: Omobayode Fagbohungbe, Corey Lammie, Malte J. Rasch, Takashi Ando, Tayfun Gokmen, Vijay Narayanan

    Abstract: Analog in-memory computing is a next-generation computing paradigm that promises fast, parallel, and energy-efficient deep learning training and transfer learning (TL). However, achieving this promise has remained elusive due to a lack of suitable training algorithms. Analog memory devices exhibit asymmetric and non-linear switching behavior in addition to device-to-device variation, meaning that… ▽ More

    Submitted 16 May, 2025; originally announced May 2025.

  3. arXiv:2406.12774  [pdf, other

    cs.LG cs.AR math.OC

    Towards Exact Gradient-based Training on Analog In-memory Computing

    Authors: Zhaoxian Wu, Tayfun Gokmen, Malte J. Rasch, Tianyi Chen

    Abstract: Given the high economic and environmental costs of using large vision or language models, analog in-memory accelerators present a promising solution for energy-efficient AI. While inference on analog accelerators has been studied recently, the training perspective is underexplored. Recent studies have shown that the "workhorse" of digital AI training - stochastic gradient descent (SGD) algorithm c… ▽ More

    Submitted 18 June, 2024; originally announced June 2024.

    Comments: 10 pages, 5 figures,2 tables

  4. Using the IBM Analog In-Memory Hardware Acceleration Kit for Neural Network Training and Inference

    Authors: Manuel Le Gallo, Corey Lammie, Julian Buechel, Fabio Carta, Omobayode Fagbohungbe, Charles Mackin, Hsinyu Tsai, Vijay Narayanan, Abu Sebastian, Kaoutar El Maghraoui, Malte J. Rasch

    Abstract: Analog In-Memory Computing (AIMC) is a promising approach to reduce the latency and energy consumption of Deep Neural Network (DNN) inference and training. However, the noisy and non-linear device characteristics, and the non-ideal peripheral circuitry in AIMC chips, require adapting DNNs to be deployed on such hardware to achieve equivalent accuracy to digital computing. In this tutorial, we prov… ▽ More

    Submitted 26 January, 2024; v1 submitted 18 July, 2023; originally announced July 2023.

    Journal ref: APL Machine Learning (2023) 1 (4): 041102

  5. Fast offset corrected in-memory training

    Authors: Malte J. Rasch, Fabio Carta, Omebayode Fagbohungbe, Tayfun Gokmen

    Abstract: In-memory computing with resistive crossbar arrays has been suggested to accelerate deep-learning workloads in highly efficient manner. To unleash the full potential of in-memory computing, it is desirable to accelerate the training as well as inference for large deep neural networks (DNNs). In the past, specialized in-memory training algorithms have been proposed that not only accelerate the forw… ▽ More

    Submitted 8 March, 2023; originally announced March 2023.

    Comments: 14 pages, 10 figures

  6. arXiv:2302.08469  [pdf, ps, other

    cs.LG cs.ET

    Hardware-aware training for large-scale and diverse deep learning inference workloads using in-memory computing-based accelerators

    Authors: Malte J. Rasch, Charles Mackin, Manuel Le Gallo, An Chen, Andrea Fasoli, Frederic Odermatt, Ning Li, S. R. Nandakumar, Pritish Narayanan, Hsinyu Tsai, Geoffrey W. Burr, Abu Sebastian, Vijay Narayanan

    Abstract: Analog in-memory computing (AIMC) -- a promising approach for energy-efficient acceleration of deep learning workloads -- computes matrix-vector multiplications (MVMs) but only approximately, due to nonidealities that often are non-deterministic or nonlinear. This can adversely impact the achievable deep neural network (DNN) inference accuracy as compared to a conventional floating point (FP) impl… ▽ More

    Submitted 16 February, 2023; originally announced February 2023.

    Comments: 35 pages, 7 figures, 5 tables

  7. A flexible and fast PyTorch toolkit for simulating training and inference on analog crossbar arrays

    Authors: Malte J. Rasch, Diego Moreda, Tayfun Gokmen, Manuel Le Gallo, Fabio Carta, Cindy Goldberg, Kaoutar El Maghraoui, Abu Sebastian, Vijay Narayanan

    Abstract: We introduce the IBM Analog Hardware Acceleration Kit, a new and first of a kind open source toolkit to simulate analog crossbar arrays in a convenient fashion from within PyTorch (freely available at https://github.com/IBM/aihwkit). The toolkit is under active development and is centered around the concept of an "analog tile" which captures the computations performed on a crossbar array. Analog t… ▽ More

    Submitted 5 April, 2021; originally announced April 2021.

    Comments: Submitted to AICAS2021

  8. arXiv:1906.02698  [pdf, ps, other

    cs.NE cs.ET cs.LG

    Training large-scale ANNs on simulated resistive crossbar arrays

    Authors: Malte J. Rasch, Tayfun Gokmen, Wilfried Haensch

    Abstract: Accelerating training of artificial neural networks (ANN) with analog resistive crossbar arrays is a promising idea. While the concept has been verified on very small ANNs and toy data sets (such as MNIST), more realistically sized ANNs and datasets have not yet been tackled. However, it is to be expected that device materials and hardware design constraints, such as noisy computations, finite num… ▽ More

    Submitted 6 June, 2019; originally announced June 2019.

  9. arXiv:1807.01356  [pdf, ps, other

    cs.ET cs.LG stat.ML

    Efficient ConvNets for Analog Arrays

    Authors: Malte J. Rasch, Tayfun Gokmen, Mattia Rigotti, Wilfried Haensch

    Abstract: Analog arrays are a promising upcoming hardware technology with the potential to drastically speed up deep learning. Their main advantage is that they compute matrix-vector products in constant time, irrespective of the size of the matrix. However, early convolution layers in ConvNets map very unfavorably onto analog arrays, because kernel matrices are typically small and the constant time operati… ▽ More

    Submitted 3 July, 2018; originally announced July 2018.

  10. arXiv:0805.2368  [pdf, ps, other

    cs.LG cs.AI

    A Kernel Method for the Two-Sample Problem

    Authors: Arthur Gretton, Karsten Borgwardt, Malte J. Rasch, Bernhard Scholkopf, Alexander J. Smola

    Abstract: We propose a framework for analyzing and comparing distributions, allowing us to design statistical tests to determine if two samples are drawn from different distributions. Our test statistic is the largest difference in expectations over functions in the unit ball of a reproducing kernel Hilbert space (RKHS). We present two tests based on large deviation bounds for the test statistic, while a… ▽ More

    Submitted 15 May, 2008; originally announced May 2008.

    ACM Class: G.3; I.2.6