Showing 1–24 of 24 results for author: Karunaratne, G

Searching in archive cs.
  1. arXiv:2412.00354

    cs.LG cs.AI

    On the Role of Noise in Factorizers for Disentangling Distributed Representations

    Authors: Geethan Karunaratne, Michael Hersche, Abu Sebastian, Abbas Rahimi

    Abstract: To efficiently factorize high-dimensional distributed representations to the constituent atomic vectors, one can exploit the compute-in-superposition capabilities of vector-symbolic architectures (VSA). Such factorizers however suffer from the phenomenon of limit cycles. Applying noise during the iterative decoding is one mechanism to address this issue. In this paper, we explore ways to further…

    Submitted 29 November, 2024; originally announced December 2024.

    Comments: Published at Second Workshop on Machine Learning with New Compute Paradigms at 38th NeurIPS 2024 (MLNCP 2024)

  2. arXiv:2410.00004

    cs.IR cs.AI cs.CL

    Retro-li: Small-Scale Retrieval Augmented Generation Supporting Noisy Similarity Searches and Domain Shift Generalization

    Authors: Gentiana Rashiti, Geethan Karunaratne, Mrinmaya Sachan, Abu Sebastian, Abbas Rahimi

    Abstract: The retrieval augmented generation (RAG) system such as Retro has been shown to improve language modeling capabilities and reduce toxicity and hallucinations by retrieving from a database of non-parametric memory containing trillions of entries. We introduce Retro-li that shows retrieval can also help using a small-scale database, but it demands more accurate and better neighbors when searching in…

    Submitted 26 March, 2025; v1 submitted 12 September, 2024; originally announced October 2024.

    Journal ref: Published in: Proceedings of the 27th European Conference on Artificial Intelligence, IOS Press, 392, 2024, pp. 2974-2982

  3. arXiv:2401.16876

    cs.CV cs.LG

    Zero-shot Classification using Hyperdimensional Computing

    Authors: Samuele Ruffino, Geethan Karunaratne, Michael Hersche, Luca Benini, Abu Sebastian, Abbas Rahimi

    Abstract: Classification based on Zero-shot Learning (ZSL) is the ability of a model to classify inputs into novel classes on which the model has not previously seen any training examples. Providing an auxiliary descriptor in the form of a set of attributes describing the new classes involved in the ZSL-based classification is one of the favored approaches to solving this challenging task. In this work, ins…

    Submitted 30 January, 2024; originally announced January 2024.

    Comments: This is the extended version of a paper accepted in the Design, Automation, and Test in Europe Conference (DATE), 2024

  4. arXiv:2312.05605

    cs.LG cs.CV

    TCNCA: Temporal Convolution Network with Chunked Attention for Scalable Sequence Processing

    Authors: Aleksandar Terzic, Michael Hersche, Geethan Karunaratne, Luca Benini, Abu Sebastian, Abbas Rahimi

    Abstract: MEGA is a recent transformer-based architecture, which utilizes a linear recurrent operator whose parallel computation, based on the FFT, scales as $O(L \log L)$, with $L$ being the sequence length. We build upon their approach by replacing the linear recurrence with a special temporal convolutional network which permits larger receptive field size with shallower networks, and reduces the computation…

    Submitted 9 December, 2023; originally announced December 2023.

  5. arXiv:2312.02829

    cs.LG cs.AI stat.ML

    MIMONets: Multiple-Input-Multiple-Output Neural Networks Exploiting Computation in Superposition

    Authors: Nicolas Menet, Michael Hersche, Geethan Karunaratne, Luca Benini, Abu Sebastian, Abbas Rahimi

    Abstract: With the advent of deep learning, progressively larger neural networks have been designed to solve complex tasks. We take advantage of these capacity-rich models to lower the cost of inference by exploiting computation in superposition. To reduce the computational burden per input, we propose Multiple-Input-Multiple-Output Neural Networks (MIMONets) capable of handling many inputs at once. MIMONet…

    Submitted 5 December, 2023; originally announced December 2023.

    Comments: Accepted at NeurIPS 2023

  6. arXiv:2303.13957

    cs.CV cs.LG cs.NE

    Factorizers for Distributed Sparse Block Codes

    Authors: Michael Hersche, Aleksandar Terzic, Geethan Karunaratne, Jovin Langenegger, Angéline Pouget, Giovanni Cherubini, Luca Benini, Abu Sebastian, Abbas Rahimi

    Abstract: Distributed sparse block codes (SBCs) exhibit compact representations for encoding and manipulating symbolic data structures using fixed-width vectors. One major challenge however is to disentangle, or factorize, the distributed representation of data structures into their constituent elements without having to search through all possible combinations. This factorization becomes more challenging w…

    Submitted 28 May, 2024; v1 submitted 24 March, 2023; originally announced March 2023.

    Comments: Accepted at Neurosymbolic Artificial Intelligence

  7. WHYPE: A Scale-Out Architecture with Wireless Over-the-Air Majority for Scalable In-memory Hyperdimensional Computing

    Authors: Robert Guirado, Abbas Rahimi, Geethan Karunaratne, Eduard Alarcón, Abu Sebastian, Sergi Abadal

    Abstract: Hyperdimensional computing (HDC) is an emerging computing paradigm that represents, manipulates, and communicates data using long random vectors known as hypervectors. Among different hardware platforms capable of executing HDC algorithms, in-memory computing (IMC) has shown promise as it is very efficient in performing matrix-vector multiplications, which are common in the HDC algebra. Although H…

    Submitted 4 February, 2023; originally announced March 2023.

    Comments: Accepted at IEEE Journal on Emerging and Selected Topics in Circuits and Systems (JETCAS). arXiv admin note: text overlap with arXiv:2205.10889

  8. A 64-core mixed-signal in-memory compute chip based on phase-change memory for deep neural network inference

    Authors: Manuel Le Gallo, Riduan Khaddam-Aljameh, Milos Stanisavljevic, Athanasios Vasilopoulos, Benedikt Kersting, Martino Dazzi, Geethan Karunaratne, Matthias Braendli, Abhairaj Singh, Silvia M. Mueller, Julian Buechel, Xavier Timoneda, Vinay Joshi, Urs Egger, Angelo Garofalo, Anastasios Petropoulos, Theodore Antonakopoulos, Kevin Brew, Samuel Choi, Injo Ok, Timothy Philip, Victor Chan, Claire Silvestre, Ishtiaq Ahsan, Nicole Saulnier, et al. (4 additional authors not shown)

    Abstract: The need to repeatedly shuttle around synaptic weight values from memory to processing units has been a key source of energy inefficiency associated with hardware implementation of artificial neural networks. Analog in-memory computing (AIMC) with spatially instantiated synaptic weights holds high promise to overcome this challenge, by performing matrix-vector multiplications (MVMs) directly withi…

    Submitted 6 December, 2022; originally announced December 2022.

    Journal ref: Nature Electronics 6, 680-693 (2023)

  9. arXiv:2211.05052

    cs.ET cs.CV cs.LG cs.NE

    In-memory factorization of holographic perceptual representations

    Authors: Jovin Langenegger, Geethan Karunaratne, Michael Hersche, Luca Benini, Abu Sebastian, Abbas Rahimi

    Abstract: Disentanglement of constituent factors of a sensory signal is central to perception and cognition and hence is a critical task for future artificial intelligence systems. In this paper, we present a compute engine capable of efficiently factorizing holographic perceptual representations by exploiting the computation-in-superposition capability of brain-inspired hyperdimensional computing and the i…

    Submitted 16 February, 2023; v1 submitted 9 November, 2022; originally announced November 2022.

    Comments: 23 pages, 4 figures, 1 extended data figure, 3 supplementary notes, 2 supplementary figures and 3 supplementary tables

  10. arXiv:2207.06810

    cs.LG

    In-memory Realization of In-situ Few-shot Continual Learning with a Dynamically Evolving Explicit Memory

    Authors: Geethan Karunaratne, Michael Hersche, Jovin Langenegger, Giovanni Cherubini, Manuel Le Gallo-Bourdeau, Urs Egger, Kevin Brew, Sam Choi, Injo Ok, Mary Claire Silvestre, Ning Li, Nicole Saulnier, Victor Chan, Ishtiaq Ahsan, Vijay Narayanan, Luca Benini, Abu Sebastian, Abbas Rahimi

    Abstract: Continually learning new classes from a few training examples without forgetting previous old classes demands a flexible architecture with an inevitably growing portion of storage, in which new examples and classes can be incrementally stored and efficiently retrieved. One viable architectural solution is to tightly couple a stationary deep neural network to a dynamically evolving explicit memory…

    Submitted 14 July, 2022; originally announced July 2022.

    Comments: Accepted at the European Solid-state Devices and Circuits Conference (ESSDERC), September 2022

  11. arXiv:2206.04796

    cs.AR eess.SY

    Scale up your In-Memory Accelerator: Leveraging Wireless-on-Chip Communication for AIMC-based CNN Inference

    Authors: Nazareno Bruschi, Giuseppe Tagliavini, Francesco Conti, Sergi Abadal, Alberto Cabellos-Aparicio, Eduard Alarcón, Geethan Karunaratne, Irem Boybat, Luca Benini, Davide Rossi

    Abstract: Analog In-Memory Computing (AIMC) is emerging as a disruptive paradigm for heterogeneous computing, potentially delivering orders of magnitude better peak performance and efficiency over traditional digital signal processing architectures on Matrix-Vector multiplication. However, to sustain this throughput in real-world applications, AIMC tiles must be supplied with data at very high bandwidth and…

    Submitted 3 June, 2022; originally announced June 2022.

  12. arXiv:2205.10889

    cs.AR

    Wireless On-Chip Communications for Scalable In-memory Hyperdimensional Computing

    Authors: Robert Guirado, Abbas Rahimi, Geethan Karunaratne, Eduard Alarcón, Abu Sebastian, Sergi Abadal

    Abstract: Hyperdimensional computing (HDC) is an emerging computing paradigm that represents, manipulates, and communicates data using very long random vectors (aka hypervectors). Among different hardware platforms capable of executing HDC algorithms, in-memory computing (IMC) systems have been recently proved to be one of the most energy-efficient options, due to hypervector manipulations in the memory its…

    Submitted 22 May, 2022; originally announced May 2022.

    Comments: This paper has been accepted at 2022 IEEE International Joint Conference on Neural Networks (IJCNN)

  13. arXiv:2203.16588

    cs.CV cs.LG

    Constrained Few-shot Class-incremental Learning

    Authors: Michael Hersche, Geethan Karunaratne, Giovanni Cherubini, Luca Benini, Abu Sebastian, Abbas Rahimi

    Abstract: Continually learning new classes from fresh data without forgetting previous knowledge of old classes is a very challenging research problem. Moreover, it is imperative that such learning must respect certain memory and computational constraints such as (i) training samples are limited to only a few per class, (ii) the computational cost of learning a novel class remains constant, and (iii) the me…

    Submitted 30 March, 2022; originally announced March 2022.

    Comments: CVPR 2022 camera-ready version

  14. Generalized Key-Value Memory to Flexibly Adjust Redundancy in Memory-Augmented Networks

    Authors: Denis Kleyko, Geethan Karunaratne, Jan M. Rabaey, Abu Sebastian, Abbas Rahimi

    Abstract: Memory-augmented neural networks enhance a neural network with an external key-value memory whose complexity is typically dominated by the number of support vectors in the key memory. We propose a generalized key-value memory that decouples its dimension from the number of support vectors by introducing a free parameter that can arbitrarily add or remove redundancy to the key memory representation…

    Submitted 11 March, 2022; originally announced March 2022.

    Comments: 8 pages, 7 figures

    Journal ref: IEEE Transactions on Neural Networks and Learning Systems, 2022

  15. arXiv:2201.01089

    cs.AR cs.DC cs.LG cs.NE

    A Heterogeneous In-Memory Computing Cluster For Flexible End-to-End Inference of Real-World Deep Neural Networks

    Authors: Angelo Garofalo, Gianmarco Ottavi, Francesco Conti, Geethan Karunaratne, Irem Boybat, Luca Benini, Davide Rossi

    Abstract: Deployment of modern TinyML tasks on small battery-constrained IoT devices requires high computational energy efficiency. Analog In-Memory Computing (IMC) using non-volatile memory (NVM) promises major efficiency improvements in deep neural network (DNN) inference and serves as on-chip memory storage for DNN weights. However, IMC's functional flexibility limitations and their impact on performance…

    Submitted 4 January, 2022; originally announced January 2022.

    Comments: 14 pages (not including final biography page), 13 figures (excluded authors pictures)

  16. End-to-end 100-TOPS/W Inference With Analog In-Memory Computing: Are We There Yet?

    Authors: Gianmarco Ottavi, Geethan Karunaratne, Francesco Conti, Irem Boybat, Luca Benini, Davide Rossi

    Abstract: In-Memory Acceleration (IMA) promises major efficiency improvements in deep neural network (DNN) inference, but challenges remain in the integration of IMA within a digital system. We propose a heterogeneous architecture coupling 8 RISC-V cores with an IMA in a shared-memory cluster, analyzing the benefits and trade-offs of in-memory computing on the realistic use case of a MobileNetV2 bottleneck…

    Submitted 3 September, 2021; originally announced September 2021.

    Comments: 4 pages, 6 figures, conference

    Journal ref: 2021 IEEE 3rd International Conference on Artificial Intelligence Circuits and Systems (AICAS)

  17. Energy Efficient In-memory Hyperdimensional Encoding for Spatio-temporal Signal Processing

    Authors: Geethan Karunaratne, Manuel Le Gallo, Michael Hersche, Giovanni Cherubini, Luca Benini, Abu Sebastian, Abbas Rahimi

    Abstract: The emerging brain-inspired computing paradigm known as hyperdimensional computing (HDC) has been proven to provide a lightweight learning framework for various cognitive tasks compared to the widely used deep learning-based approaches. Spatio-temporal (ST) signal processing, which encompasses biosignals such as electromyography (EMG) and electroencephalography (EEG), is one family of applications…

    Submitted 22 June, 2021; originally announced June 2021.

    Journal ref: IEEE Transactions on Circuits and Systems II: Express Briefs, vol. 68, no. 5, pp. 1725-1729, May 2021

  18. Graphene-based Wireless Agile Interconnects for Massive Heterogeneous Multi-chip Processors

    Authors: Sergi Abadal, Robert Guirado, Hamidreza Taghvaee, Akshay Jain, Elana Pereira de Santana, Peter Haring Bolívar, Mohamed Saeed, Renato Negra, Zhenxing Wang, Kun-Ta Wang, Max C. Lemme, Joshua Klein, Marina Zapater, Alexandre Levisse, David Atienza, Davide Rossi, Francesco Conti, Martino Dazzi, Geethan Karunaratne, Irem Boybat, Abu Sebastian

    Abstract: The main design principles in computer architecture have recently shifted from a monolithic scaling-driven approach to the development of heterogeneous architectures that tightly co-integrate multiple specialized processor and memory chiplets. In such data-hungry multi-chip architectures, current Networks-in-Package (NiPs) may not be enough to cater to their heterogeneous and fast-changing communi…

    Submitted 21 September, 2023; v1 submitted 8 November, 2020; originally announced November 2020.

    Comments: 8 pages, 4 figures, 1 table

    Journal ref: IEEE Wireless Communications Magazine, vol. 30, no. 4, pp. 162-169, 2023

  19. Robust High-dimensional Memory-augmented Neural Networks

    Authors: Geethan Karunaratne, Manuel Schmuck, Manuel Le Gallo, Giovanni Cherubini, Luca Benini, Abu Sebastian, Abbas Rahimi

    Abstract: Traditional neural networks require enormous amounts of data to build their complex mappings during a slow training procedure that hinders their abilities for relearning and adapting to new data. Memory-augmented neural networks enhance neural networks with an explicit memory to overcome these issues. Access to this explicit memory, however, occurs via soft read and write operations involving ever…

    Submitted 19 March, 2021; v1 submitted 5 October, 2020; originally announced October 2020.

    Comments: This is a pre-print of an article accepted for publication in Nature Communications

    Journal ref: Nature Communications volume 12, Article number: 2468 (2021)

  20. arXiv:2005.07137

    eess.SP cs.AR

    ChewBaccaNN: A Flexible 223 TOPS/W BNN Accelerator

    Authors: Renzo Andri, Geethan Karunaratne, Lukas Cavigelli, Luca Benini

    Abstract: Binary Neural Networks enable smart IoT devices, as they significantly reduce the required memory footprint and computational complexity while retaining a high network performance and flexibility. This paper presents ChewBaccaNN, a 0.7 mm$^2$ sized binary convolutional neural network (CNN) accelerator designed in GlobalFoundries 22 nm technology. By exploiting efficient data re-use, data buffering…

    Submitted 26 February, 2021; v1 submitted 12 May, 2020; originally announced May 2020.

    Comments: Accepted at IEEE ISCAS 2021, Daegu, South Korea, 23-26 May 2021

  21. arXiv:2003.11256

    cs.LG cs.NE

    ESSOP: Efficient and Scalable Stochastic Outer Product Architecture for Deep Learning

    Authors: Vinay Joshi, Geethan Karunaratne, Manuel Le Gallo, Irem Boybat, Christophe Piveteau, Abu Sebastian, Bipin Rajendran, Evangelos Eleftheriou

    Abstract: Deep neural networks (DNNs) have surpassed human-level accuracy in a variety of cognitive tasks but at the cost of significant memory/time requirements in DNN training. This limits their deployment in energy and memory limited applications that require real-time learning. Matrix-vector multiplications (MVM) and vector-vector outer product (VVOP) are the two most expensive operations associated wit…

    Submitted 25 March, 2020; originally announced March 2020.

    Comments: 5 pages. 5 figures. Accepted at ISCAS 2020 for publication

  22. Mixed-precision deep learning based on computational memory

    Authors: S. R. Nandakumar, Manuel Le Gallo, Christophe Piveteau, Vinay Joshi, Giovanni Mariani, Irem Boybat, Geethan Karunaratne, Riduan Khaddam-Aljameh, Urs Egger, Anastasios Petropoulos, Theodore Antonakopoulos, Bipin Rajendran, Abu Sebastian, Evangelos Eleftheriou

    Abstract: Deep neural networks (DNNs) have revolutionized the field of artificial intelligence and have achieved unprecedented success in cognitive tasks such as image and speech recognition. Training of large DNNs, however, is computationally intensive and this has motivated the search for novel computing architectures targeting this application. A computational memory unit with nanoscale resistive memory…

    Submitted 31 January, 2020; originally announced January 2020.

    Journal ref: Frontiers in Neuroscience 14:406 (2020)

  23. arXiv:1906.01548

    cs.ET cs.AI physics.app-ph

    In-memory hyperdimensional computing

    Authors: Geethan Karunaratne, Manuel Le Gallo, Giovanni Cherubini, Luca Benini, Abbas Rahimi, Abu Sebastian

    Abstract: Hyperdimensional computing (HDC) is an emerging computational framework that takes inspiration from attributes of neuronal circuits such as hyperdimensionality, fully distributed holographic representation, and (pseudo)randomness. When employed for machine learning tasks such as learning and classification, HDC involves manipulation and comparison of large patterns within memory. Moreover, a key a…

    Submitted 9 April, 2020; v1 submitted 4 June, 2019; originally announced June 2019.

  24. arXiv:1803.05849

    cs.CV cs.AI cs.AR cs.NE eess.IV

    XNORBIN: A 95 TOp/s/W Hardware Accelerator for Binary Convolutional Neural Networks

    Authors: Andrawes Al Bahou, Geethan Karunaratne, Renzo Andri, Lukas Cavigelli, Luca Benini

    Abstract: Deploying state-of-the-art CNNs requires power-hungry processors and off-chip memory. This precludes the implementation of CNNs in low-power embedded systems. Recent research shows CNNs sustain extreme quantization, binarizing their weights and intermediate feature maps, thereby saving 8-32$\times$ memory and collapsing energy-intensive sum-of-products into XNOR-and-popcount operations. We present XNO…

    Submitted 5 March, 2018; originally announced March 2018.