
Showing 1–50 of 77 results for author: Sebastian, A

  1. arXiv:2506.20354

    cs.LG cs.AI

    A foundation model with multi-variate parallel attention to generate neuronal activity

    Authors: Francesco Carzaniga, Michael Hersche, Abu Sebastian, Kaspar Schindler, Abbas Rahimi

    Abstract: Learning from multi-variate time-series with heterogeneous channel configurations remains a fundamental challenge for deep neural networks (DNNs), particularly in clinical domains such as intracranial electroencephalography (iEEG), where channel setups vary widely across subjects. In this work, we introduce multi-variate parallel attention (MVPA), a novel self-attention mechanism that disentangles…

    Submitted 25 June, 2025; originally announced June 2025.

    Comments: The code is available at https://github.com/IBM/multi-variate-parallel-transformer. The SWEC iEEG dataset is available at https://mb-neuro.medical-blocks.ch/public_access/databases/ieeg/swec_ieeg

  2. arXiv:2506.00597

    q-bio.GN cs.AR

    Processing-in-memory for genomics workloads

    Authors: William Andrew Simon, Leonid Yavits, Konstantina Koliogeorgi, Yann Falevoz, Yoshihiro Shibuya, Dominique Lavenier, Irem Boybat, Klea Zambaku, Berkan Şahin, Mohammad Sadrosadati, Onur Mutlu, Abu Sebastian, Rayan Chikhi, The BioPIM Consortium, Can Alkan

    Abstract: Low-cost, high-throughput DNA and RNA sequencing (HTS) data is the main workforce for the life sciences. Genome sequencing is now becoming a part of Predictive, Preventive, Personalized, and Participatory (termed 'P4') medicine. All genomic data are currently processed in energy-hungry computer clusters and centers, necessitating data transfer, consuming substantial energy, and wasting valuable ti…

    Submitted 31 May, 2025; originally announced June 2025.

  3. arXiv:2505.09663

    cs.LG

    Analog Foundation Models

    Authors: Julian Büchel, Iason Chalas, Giovanni Acampa, An Chen, Omobayode Fagbohungbe, Sidney Tsai, Kaoutar El Maghraoui, Manuel Le Gallo, Abbas Rahimi, Abu Sebastian

    Abstract: Analog in-memory computing (AIMC) is a promising compute paradigm to improve speed and power efficiency of neural network inference beyond the limits of conventional von Neumann-based architectures. However, AIMC introduces fundamental challenges such as noisy computations and strict constraints on input and output quantization. Because of these constraints and imprecisions, off-the-shelf LLMs are…

    Submitted 16 May, 2025; v1 submitted 14 May, 2025; originally announced May 2025.

    Comments: 43 pages, 8 figures, under review

  4. CiMBA: Accelerating Genome Sequencing through On-Device Basecalling via Compute-in-Memory

    Authors: William Andrew Simon, Irem Boybat, Riselda Kodra, Elena Ferro, Gagandeep Singh, Mohammed Alser, Shubham Jain, Hsinyu Tsai, Geoffrey W. Burr, Onur Mutlu, Abu Sebastian

    Abstract: As genome sequencing is finding utility in a wide variety of domains beyond the confines of traditional medical settings, its computational pipeline faces two significant challenges. First, the creation of up to 0.5 GB of data per minute imposes substantial communication and storage overheads. Second, the sequencing pipeline is bottlenecked at the basecalling step, consuming >40% of genome analysi…

    Submitted 9 April, 2025; originally announced April 2025.

    Comments: Accepted to IEEE Transactions on Parallel and Distributed Systems

    Journal ref: IEEE Transactions on Parallel and Distributed Systems, pp. 1-15, 2025

  5. arXiv:2503.11207

    cs.AI cs.LG

    Can Large Reasoning Models do Analogical Reasoning under Perceptual Uncertainty?

    Authors: Giacomo Camposampiero, Michael Hersche, Roger Wattenhofer, Abu Sebastian, Abbas Rahimi

    Abstract: This work presents a first evaluation of two state-of-the-art Large Reasoning Models (LRMs), OpenAI's o3-mini and DeepSeek R1, on analogical reasoning, focusing on well-established nonverbal human IQ tests based on Raven's progressive matrices. We benchmark with the I-RAVEN dataset and its extension, I-RAVEN-X, which tests the ability to generalize to longer reasoning rules and ranges of the attri…

    Submitted 4 June, 2025; v1 submitted 14 March, 2025; originally announced March 2025.

    Comments: Accepted at the 19th International Conference on Neural-Symbolic Learning and Reasoning (NeSy) 2025

  6. arXiv:2503.09625

    physics.flu-dyn cs.LG physics.comp-ph

    Learning second-order TVD flux limiters using differentiable solvers

    Authors: Chenyang Huang, Amal S. Sebastian, Venkatasubramanian Viswanathan

    Abstract: This paper presents a data-driven framework for learning optimal second-order total variation diminishing (TVD) flux limiters via differentiable simulations. In our fully differentiable finite volume solvers, the limiter functions are replaced by neural networks. By representing the limiter as a pointwise convex linear combination of the Minmod and Superbee limiters, we enforce both second-order a…

    Submitted 10 March, 2025; originally announced March 2025.

  7. arXiv:2502.12373

    cs.RO cs.AI

    Soft Robotics for Search and Rescue: Advancements, Challenges, and Future Directions

    Authors: Abhishek Sebastian

    Abstract: Soft robotics has emerged as a transformative technology in Search and Rescue (SAR) operations, addressing challenges in navigating complex, hazardous environments that often limit traditional rigid robots. This paper critically examines advancements in soft robotic technologies tailored for SAR applications, focusing on their unique capabilities in adaptability, safety, and efficiency. By leverag…

    Submitted 17 February, 2025; originally announced February 2025.

  8. arXiv:2412.19350

    cs.LG cs.AI cs.CL

    On the Expressiveness and Length Generalization of Selective State-Space Models on Regular Languages

    Authors: Aleksandar Terzić, Michael Hersche, Giacomo Camposampiero, Thomas Hofmann, Abu Sebastian, Abbas Rahimi

    Abstract: Selective state-space models (SSMs) are an emerging alternative to the Transformer, offering the unique advantage of parallel training and sequential inference. Although these models have shown promising performance on a variety of tasks, their formal expressiveness and length generalization properties remain underexplored. In this work, we provide insight into the workings of selective SSMs by an…

    Submitted 4 July, 2025; v1 submitted 26 December, 2024; originally announced December 2024.

    Comments: 13 pages, 7 figures, Published in AAAI 2025

  9. arXiv:2412.05586

    cs.AI cs.LG cs.SC

    Towards Learning to Reason: Comparing LLMs with Neuro-Symbolic on Arithmetic Relations in Abstract Reasoning

    Authors: Michael Hersche, Giacomo Camposampiero, Roger Wattenhofer, Abu Sebastian, Abbas Rahimi

    Abstract: This work compares large language models (LLMs) and neuro-symbolic approaches in solving Raven's progressive matrices (RPM), a visual abstract reasoning test that involves the understanding of mathematical rules such as progression or arithmetic addition. Providing the visual attributes directly as textual prompts, which assumes an oracle visual perception module, allows us to measure the model's…

    Submitted 7 December, 2024; originally announced December 2024.

  10. arXiv:2412.00354

    cs.LG cs.AI

    On the Role of Noise in Factorizers for Disentangling Distributed Representations

    Authors: Geethan Karunaratne, Michael Hersche, Abu Sebastian, Abbas Rahimi

    Abstract: To efficiently factorize high-dimensional distributed representations to the constituent atomic vectors, one can exploit the compute-in-superposition capabilities of vector-symbolic architectures (VSA). Such factorizers however suffer from the phenomenon of limit cycles. Applying noise during the iterative decoding is one mechanism to address this issue. In this paper, we explore ways to further…

    Submitted 29 November, 2024; originally announced December 2024.

    Comments: Published at Second Workshop on Machine Learning with New Compute Paradigms at 38th NeurIPS 2024 (MLNCP 2024)

  11. The Inherent Adversarial Robustness of Analog In-Memory Computing

    Authors: Corey Lammie, Julian Büchel, Athanasios Vasilopoulos, Manuel Le Gallo, Abu Sebastian

    Abstract: A key challenge for Deep Neural Network (DNN) algorithms is their vulnerability to adversarial attacks. Inherently non-deterministic compute substrates, such as those based on Analog In-Memory Computing (AIMC), have been speculated to provide significant adversarial robustness when performing DNN inference. In this paper, we experimentally validate this conjecture for the first time on an AIMC chi…

    Submitted 11 November, 2024; originally announced November 2024.

  12. arXiv:2411.05836

    cs.CV eess.SP physics.optics

    Prion-ViT: Prions-Inspired Vision Transformers for Temperature prediction with Specklegrams

    Authors: Abhishek Sebastian, Pragna R, Sonaa Rajagopal, Muralikrishnan Mani

    Abstract: Fiber Specklegram Sensors (FSS) are vital for environmental monitoring due to their high temperature sensitivity, but their complex data poses challenges for predictive models. This study introduces Prion-ViT, a prion-inspired Vision Transformer model, inspired by biological prion memory mechanisms, to improve long-term dependency modeling and temperature prediction accuracy using FSS data. Prion-…

    Submitted 25 January, 2025; v1 submitted 6 November, 2024; originally announced November 2024.

  13. arXiv:2411.03375

    cs.LG cs.AR

    Kernel Approximation using Analog In-Memory Computing

    Authors: Julian Büchel, Giacomo Camposampiero, Athanasios Vasilopoulos, Corey Lammie, Manuel Le Gallo, Abbas Rahimi, Abu Sebastian

    Abstract: Kernel functions are vital ingredients of several machine learning algorithms, but often incur significant memory and computational costs. We introduce an approach to kernel approximation in machine learning algorithms suitable for mixed-signal Analog In-Memory Computing (AIMC) architectures. Analog In-Memory Kernel Approximation addresses the performance bottlenecks of conventional kernel-based m…

    Submitted 5 November, 2024; originally announced November 2024.

  14. arXiv:2410.10434

    eess.AS cs.SD

    In-Materia Speech Recognition

    Authors: Mohamadreza Zolfagharinejad, Julian Büchel, Lorenzo Cassola, Sachin Kinge, Ghazi Sarwat Syed, Abu Sebastian, Wilfred G. van der Wiel

    Abstract: With the rise of decentralized computing, as in the Internet of Things, autonomous driving, and personalized healthcare, it is increasingly important to process time-dependent signals at the edge efficiently: right at the place where the temporal data are collected, avoiding time-consuming, insecure, and costly communication with a centralized computing facility (or cloud). However, modern-day pro…

    Submitted 15 May, 2025; v1 submitted 14 October, 2024; originally announced October 2024.

  15. arXiv:2410.00004

    cs.IR cs.AI cs.CL

    Retro-li: Small-Scale Retrieval Augmented Generation Supporting Noisy Similarity Searches and Domain Shift Generalization

    Authors: Gentiana Rashiti, Geethan Karunaratne, Mrinmaya Sachan, Abu Sebastian, Abbas Rahimi

    Abstract: The retrieval augmented generation (RAG) system such as Retro has been shown to improve language modeling capabilities and reduce toxicity and hallucinations by retrieving from a database of non-parametric memory containing trillions of entries. We introduce Retro-li that shows retrieval can also help using a small-scale database, but it demands more accurate and better neighbors when searching in…

    Submitted 26 March, 2025; v1 submitted 12 September, 2024; originally announced October 2024.

    Journal ref: Published in: Proceedings of 27TH EUROPEAN CONFERENCE ON ARTIFICIAL INTELLIGENCE, IOS Press, 392, 2024, pp. 2974 - 2982

  16. arXiv:2407.06209

    cs.LG

    Self-supervised Pretraining for Partial Differential Equations

    Authors: Varun Madhavan, Amal S Sebastian, Bharath Ramsundar, Venkatasubramanian Viswanathan

    Abstract: In this work, we describe a novel approach to building a neural PDE solver leveraging recent advances in transformer based neural network architectures. Our model can provide solutions for different values of PDE parameters without any need for retraining the network. The training is carried out in a self-supervised manner, similar to pretraining approaches applied in language and vision tasks. We…

    Submitted 3 July, 2024; originally announced July 2024.

  17. Calorie Burn Estimation in Community Parks Through DLICP: A Mathematical Modelling Approach

    Authors: Abhishek Sebastian, Annis Fathima A, Pragna R, Madhan Kumar S, Jesher Joshua M

    Abstract: Community parks play a crucial role in promoting physical activity and overall well-being. This study introduces DLICP (Deep Learning Integrated Community Parks), an innovative approach that combines deep learning techniques, specifically face recognition technology, with a novel walking activity measurement algorithm to enhance user experience in community parks. The DLICP utilizes a camera with f…

    Submitted 6 July, 2024; originally announced July 2024.

    Comments: Accepted and to be presented at Intellisys 2024; also part of Indian Patent 202441050325

  18. arXiv:2407.02353

    eess.SP cs.AR eess.SY

    Roadmap to Neuromorphic Computing with Emerging Technologies

    Authors: Adnan Mehonic, Daniele Ielmini, Kaushik Roy, Onur Mutlu, Shahar Kvatinsky, Teresa Serrano-Gotarredona, Bernabe Linares-Barranco, Sabina Spiga, Sergey Savelev, Alexander G Balanov, Nitin Chawla, Giuseppe Desoli, Gerardo Malavena, Christian Monzio Compagnoni, Zhongrui Wang, J Joshua Yang, Ghazi Sarwat Syed, Abu Sebastian, Thomas Mikolajick, Beatriz Noheda, Stefan Slesazeck, Bernard Dieny, Tuo-Hung Hou, Akhil Varri, et al. (28 additional authors not shown)

    Abstract: The roadmap is organized into several thematic sections, outlining current computing challenges, discussing the neuromorphic computing approach, analyzing mature and currently utilized technologies, providing an overview of emerging technologies, addressing material challenges, exploring novel computing concepts, and finally examining the maturity level of emerging technologies while determining t…

    Submitted 5 July, 2024; v1 submitted 2 July, 2024; originally announced July 2024.

    Comments: 90 pages, 22 figures, roadmap, neuromorphic

  19. arXiv:2406.19121

    cs.LG cs.AI cs.SC

    Towards Learning Abductive Reasoning using VSA Distributed Representations

    Authors: Giacomo Camposampiero, Michael Hersche, Aleksandar Terzić, Roger Wattenhofer, Abu Sebastian, Abbas Rahimi

    Abstract: We introduce the Abductive Rule Learner with Context-awareness (ARLC), a model that solves abstract reasoning tasks based on Learn-VRF. ARLC features a novel and more broadly applicable training objective for abductive reasoning, resulting in better interpretability and higher accuracy when solving Raven's progressive matrices (RPM). ARLC allows both programming domain knowledge and learning the r…

    Submitted 30 August, 2024; v1 submitted 27 June, 2024; originally announced June 2024.

    Comments: Accepted at the 18th International Conference on Neural-Symbolic Learning and Reasoning (NeSy) 2024 [Spotlight]

  20. arXiv:2406.03372

    physics.app-ph cs.LG

    Training of Physical Neural Networks

    Authors: Ali Momeni, Babak Rahmani, Benjamin Scellier, Logan G. Wright, Peter L. McMahon, Clara C. Wanjura, Yuhang Li, Anas Skalli, Natalia G. Berloff, Tatsuhiro Onodera, Ilker Oguz, Francesco Morichetti, Philipp del Hougne, Manuel Le Gallo, Abu Sebastian, Azalia Mirhoseini, Cheng Zhang, Danijela Marković, Daniel Brunner, Christophe Moser, Sylvain Gigan, Florian Marquardt, Aydogan Ozcan, Julie Grollier, Andrea J. Liu, et al. (3 additional authors not shown)

    Abstract: Physical neural networks (PNNs) are a class of neural-like networks that leverage the properties of physical systems to perform computation. While PNNs are so far a niche research area with small-scale laboratory demonstrations, they are arguably one of the most underappreciated important opportunities in modern AI. Could we train AI models 1000x larger than current ones? Could we do this and also…

    Submitted 5 June, 2024; originally announced June 2024.

    Comments: 29 pages, 4 figures

  21. arXiv:2404.08872

    cond-mat.mtrl-sci cs.AI

    Enhanced Hydrogen Evolution Activity of MOS$_2$-rGO Composite Synthesized via Hydrothermal Technique

    Authors: Abhishek Sebastian, Pragna R

    Abstract: Hydrogen evolution reaction (HER) has emerged as a promising technique for the production of clean and sustainable energy. In recent years, researchers have been exploring various materials for efficient HER activity. In this study, we report the synthesis of two different materials, namely MoS$_2$ and MoS$_2$-rGO, through a hydrothermal technique. X-ray diffraction (XRD), Fourier-transform infrar…

    Submitted 12 April, 2024; originally announced April 2024.

    Comments: This research is an excerpt from the report of TARP (Technical Answers for Real-World Problems)

  22. ViTaL: An Advanced Framework for Automated Plant Disease Identification in Leaf Images Using Vision Transformers and Linear Projection For Feature Reduction

    Authors: Abhishek Sebastian, Annis Fathima A, Pragna R, Madhan Kumar S, Yaswanth Kannan G, Vinay Murali

    Abstract: Our paper introduces a robust framework for the automated identification of diseases in plant leaf images. The framework incorporates several key stages to enhance disease recognition accuracy. In the pre-processing phase, a thumbnail resizing technique is employed to resize images, minimizing the loss of critical image details while ensuring computational efficiency. Normalization procedures are…

    Submitted 27 February, 2024; v1 submitted 27 February, 2024; originally announced February 2024.

    Comments: Accepted and scheduled for presentation at CML 2024, this work will be published as a book chapter in Lecture Notes in Networks and Systems

  23. arXiv:2402.16442

    cs.LG cs.AI cs.CV cs.DC math.OC

    On Distributed Larger-Than-Memory Subset Selection With Pairwise Submodular Functions

    Authors: Maximilian Böther, Abraham Sebastian, Pranjal Awasthi, Ana Klimovic, Srikumar Ramalingam

    Abstract: Modern datasets span billions of samples, making training on all available data infeasible. Selecting a high quality subset helps in reducing training costs and enhancing model quality. Submodularity, a discrete analogue of convexity, is commonly used for solving such subset selection problems. However, existing algorithms for optimizing submodular functions are sequential, and the prior distribut…

    Submitted 3 April, 2025; v1 submitted 26 February, 2024; originally announced February 2024.

    Comments: accepted at MLSys 2025

  24. A Precision-Optimized Fixed-Point Near-Memory Digital Processing Unit for Analog In-Memory Computing

    Authors: Elena Ferro, Athanasios Vasilopoulos, Corey Lammie, Manuel Le Gallo, Luca Benini, Irem Boybat, Abu Sebastian

    Abstract: Analog In-Memory Computing (AIMC) is an emerging technology for fast and energy-efficient Deep Learning (DL) inference. However, a certain amount of digital post-processing is required to deal with circuit mismatches and non-idealities associated with the memory devices. Efficient near-memory digital logic is critical to retain the high area/energy efficiency and low latency of AIMC. Existing syst…

    Submitted 12 February, 2024; originally announced February 2024.

    Comments: Accepted at ISCAS2024

  25. arXiv:2401.16876

    cs.CV cs.LG

    Zero-shot Classification using Hyperdimensional Computing

    Authors: Samuele Ruffino, Geethan Karunaratne, Michael Hersche, Luca Benini, Abu Sebastian, Abbas Rahimi

    Abstract: Classification based on Zero-shot Learning (ZSL) is the ability of a model to classify inputs into novel classes on which the model has not previously seen any training examples. Providing an auxiliary descriptor in the form of a set of attributes describing the new classes involved in the ZSL-based classification is one of the favored approaches to solving this challenging task. In this work, ins…

    Submitted 30 January, 2024; originally announced January 2024.

    Comments: This is the extended version of a paper accepted in the Design, Automation, and Test in Europe Conference (DATE), 2024

  26. arXiv:2401.16024

    cs.LG cs.AI

    Probabilistic Abduction for Visual Abstract Reasoning via Learning Rules in Vector-symbolic Architectures

    Authors: Michael Hersche, Francesco di Stefano, Thomas Hofmann, Abu Sebastian, Abbas Rahimi

    Abstract: Abstract reasoning is a cornerstone of human intelligence, and replicating it with artificial intelligence (AI) presents an ongoing challenge. This study focuses on efficiently solving Raven's progressive matrices (RPM), a visual test for assessing abstract reasoning abilities, by using distributed computation and operators provided by vector-symbolic architectures (VSA). Instead of hard-coding th…

    Submitted 29 January, 2024; originally announced January 2024.

    Comments: Accepted in NeurIPS 2023 Workshop on MATH-AI

  27. Improving the Accuracy of Analog-Based In-Memory Computing Accelerators Post-Training

    Authors: Corey Lammie, Athanasios Vasilopoulos, Julian Büchel, Giacomo Camposampiero, Manuel Le Gallo, Malte Rasch, Abu Sebastian

    Abstract: Analog-Based In-Memory Computing (AIMC) inference accelerators can be used to efficiently execute Deep Neural Network (DNN) inference workloads. However, to mitigate accuracy losses, due to circuit and device non-idealities, Hardware-Aware (HWA) training methodologies must be employed. These typically require significant information about the underlying hardware. In this paper, we propose two Post…

    Submitted 18 January, 2024; originally announced January 2024.

    Comments: Accepted at 2024 IEEE International Symposium on Circuits and Systems (ISCAS)

    Journal ref: 2024 IEEE International Symposium on Circuits and Systems (ISCAS)

  28. LionHeart: A Layer-based Mapping Framework for Heterogeneous Systems with Analog In-Memory Computing Tiles

    Authors: Corey Lammie, Yuxuan Wang, Flavio Ponzina, Joshua Klein, Hadjer Benmeziane, Marina Zapater, Irem Boybat, Abu Sebastian, Giovanni Ansaloni, David Atienza

    Abstract: When arranged in a crossbar configuration, resistive memory devices can be used to execute Matrix-Vector Multiplications (MVMs), the most dominant operation of many Machine Learning (ML) algorithms, in constant time complexity. Nonetheless, when performing computations in the analog domain, novel challenges are introduced in terms of arithmetic precision and stochasticity, due to non-ideal circuit…

    Submitted 24 March, 2025; v1 submitted 17 January, 2024; originally announced January 2024.

    Comments: Accepted by IEEE Transactions on Emerging Topics in Computing

  29. arXiv:2312.05605

    cs.LG cs.CV

    TCNCA: Temporal Convolution Network with Chunked Attention for Scalable Sequence Processing

    Authors: Aleksandar Terzic, Michael Hersche, Geethan Karunaratne, Luca Benini, Abu Sebastian, Abbas Rahimi

    Abstract: MEGA is a recent transformer-based architecture, which utilizes a linear recurrent operator whose parallel computation, based on the FFT, scales as $O(L \log L)$, with $L$ being the sequence length. We build upon their approach by replacing the linear recurrence with a special temporal convolutional network which permits larger receptive field size with shallower networks, and reduces the computation…

    Submitted 9 December, 2023; originally announced December 2023.

  30. arXiv:2312.02829

    cs.LG cs.AI stat.ML

    MIMONets: Multiple-Input-Multiple-Output Neural Networks Exploiting Computation in Superposition

    Authors: Nicolas Menet, Michael Hersche, Geethan Karunaratne, Luca Benini, Abu Sebastian, Abbas Rahimi

    Abstract: With the advent of deep learning, progressively larger neural networks have been designed to solve complex tasks. We take advantage of these capacity-rich models to lower the cost of inference by exploiting computation in superposition. To reduce the computational burden per input, we propose Multiple-Input-Multiple-Output Neural Networks (MIMONets) capable of handling many inputs at once. MIMONet…

    Submitted 5 December, 2023; originally announced December 2023.

    Comments: accepted in NeurIPS 2023

  31. arXiv:2311.13800

    cs.CR cs.AI cs.LG

    Enhancing Intrusion Detection In Internet Of Vehicles Through Federated Learning

    Authors: Abhishek Sebastian, Pragna R, Sudhakaran G, Renjith P N, Leela Karthikeyan H

    Abstract: Federated learning is a technique of decentralized machine learning that allows multiple parties to collaborate and learn a shared model without sharing their raw data. Our paper proposes a federated learning framework for intrusion detection in Internet of Vehicles (IOVs) using the CIC-IDS 2017 dataset. The proposed framework employs SMOTE for handling class imbalance, outlier detection for iden…

    Submitted 22 November, 2023; originally announced November 2023.

  32. arXiv:2311.02087

    cs.SD cs.AI cs.LG eess.AS eess.SP

    Design Of Rubble Analyzer Probe Using ML For Earthquake

    Authors: Abhishek Sebastian, R Pragna, K Vishal Vythianathan, Dasaraju Sohan Sai, U Shiva Sri Hari Al, R Anirudh, Apurv Choudhary

    Abstract: The earthquake rubble analyzer uses machine learning to detect human presence via ambient sounds, achieving 97.45% accuracy. It also provides real-time environmental data, aiding in assessing survival prospects for trapped individuals, crucial for post-earthquake rescue efforts.

    Submitted 24 October, 2023; originally announced November 2023.

  33. arXiv:2310.15036

    cs.CV cs.AI

    A Technique for Classifying Static Gestures Using UWB Radar

    Authors: Abhishek Sebastian, Pragna R

    Abstract: Our paper presents a robust framework for UWB-based static gesture recognition, leveraging proprietary UWB radar sensor technology. Extensive data collection efforts were undertaken to compile datasets containing five commonly used gestures. Our approach involves a comprehensive data pre-processing pipeline that encompasses outlier handling, aspect ratio-preserving resizing, and false-color image…

    Submitted 11 April, 2024; v1 submitted 23 October, 2023; originally announced October 2023.

    Comments: This is not a technical research paper, but an excerpt of what was applied during a funded project for the promotion of Open Science

  34. Using the IBM Analog In-Memory Hardware Acceleration Kit for Neural Network Training and Inference

    Authors: Manuel Le Gallo, Corey Lammie, Julian Buechel, Fabio Carta, Omobayode Fagbohungbe, Charles Mackin, Hsinyu Tsai, Vijay Narayanan, Abu Sebastian, Kaoutar El Maghraoui, Malte J. Rasch

    Abstract: Analog In-Memory Computing (AIMC) is a promising approach to reduce the latency and energy consumption of Deep Neural Network (DNN) inference and training. However, the noisy and non-linear device characteristics, and the non-ideal peripheral circuitry in AIMC chips, require adapting DNNs to be deployed on such hardware to achieve equivalent accuracy to digital computing. In this tutorial, we prov…

    Submitted 26 January, 2024; v1 submitted 18 July, 2023; originally announced July 2023.

    Journal ref: APL Machine Learning (2023) 1 (4): 041102

  35. Gradient descent-based programming of analog in-memory computing cores

    Authors: Julian Büchel, Athanasios Vasilopoulos, Benedikt Kersting, Frederic Odermatt, Kevin Brew, Injo Ok, Sam Choi, Iqbal Saraf, Victor Chan, Timothy Philip, Nicole Saulnier, Vijay Narayanan, Manuel Le Gallo, Abu Sebastian

    Abstract: The precise programming of crossbar arrays of unit-cells is crucial for obtaining high matrix-vector-multiplication (MVM) accuracy in analog in-memory computing (AIMC) cores. We propose a radically different approach based on directly minimizing the MVM error using gradient descent with synthetic random input data. Our method significantly reduces the MVM error compared with conventional unit-cell…

    Submitted 26 May, 2023; originally announced May 2023.

    Journal ref: 2022 International Electron Devices Meeting (IEDM), San Francisco, CA, USA, 2022, pp. 33.1.1-33.1.4

  36. arXiv:2305.10459

    cs.AR cs.CV cs.LG

    AnalogNAS: A Neural Network Design Framework for Accurate Inference with Analog In-Memory Computing

    Authors: Hadjer Benmeziane, Corey Lammie, Irem Boybat, Malte Rasch, Manuel Le Gallo, Hsinyu Tsai, Ramachandran Muralidhar, Smail Niar, Ouarnoughi Hamza, Vijay Narayanan, Abu Sebastian, Kaoutar El Maghraoui

    Abstract: The advancement of Deep Learning (DL) is driven by efficient Deep Neural Network (DNN) design and new hardware accelerators. Current DNN design is primarily tailored for general-purpose use and deployment on commercially viable platforms. Inference at the edge requires low latency, compact and power-efficient models, and must be cost-effective. Digital processors based on typical von Neumann archi…

    Submitted 17 May, 2023; originally announced May 2023.

    Comments: Accepted to IEEE Edge

  37. arXiv:2303.13957

    cs.CV cs.LG cs.NE

    Factorizers for Distributed Sparse Block Codes

    Authors: Michael Hersche, Aleksandar Terzic, Geethan Karunaratne, Jovin Langenegger, Angéline Pouget, Giovanni Cherubini, Luca Benini, Abu Sebastian, Abbas Rahimi

    Abstract: Distributed sparse block codes (SBCs) exhibit compact representations for encoding and manipulating symbolic data structures using fixed-width vectors. One major challenge however is to disentangle, or factorize, the distributed representation of data structures into their constituent elements without having to search through all possible combinations. This factorization becomes more challenging w…

    Submitted 28 May, 2024; v1 submitted 24 March, 2023; originally announced March 2023.

    Comments: Accepted at Neurosymbolic Artificial Intelligence

  38. WHYPE: A Scale-Out Architecture with Wireless Over-the-Air Majority for Scalable In-memory Hyperdimensional Computing

    Authors: Robert Guirado, Abbas Rahimi, Geethan Karunaratne, Eduard Alarcón, Abu Sebastian, Sergi Abadal

    Abstract: Hyperdimensional computing (HDC) is an emerging computing paradigm that represents, manipulates, and communicates data using long random vectors known as hypervectors. Among different hardware platforms capable of executing HDC algorithms, in-memory computing (IMC) has shown promise as it is very efficient in performing matrix-vector multiplications, which are common in the HDC algebra. Although H…

    Submitted 4 February, 2023; originally announced March 2023.

    Comments: Accepted at IEEE Journal on Emerging and Selected Topics in Circuits and Systems (JETCAS). arXiv admin note: text overlap with arXiv:2205.10889

  39. arXiv:2302.08469

    cs.LG cs.ET

    Hardware-aware training for large-scale and diverse deep learning inference workloads using in-memory computing-based accelerators

    Authors: Malte J. Rasch, Charles Mackin, Manuel Le Gallo, An Chen, Andrea Fasoli, Frederic Odermatt, Ning Li, S. R. Nandakumar, Pritish Narayanan, Hsinyu Tsai, Geoffrey W. Burr, Abu Sebastian, Vijay Narayanan

    Abstract: Analog in-memory computing (AIMC) -- a promising approach for energy-efficient acceleration of deep learning workloads -- computes matrix-vector multiplications (MVMs) but only approximately, due to nonidealities that often are non-deterministic or nonlinear. This can adversely impact the achievable deep neural network (DNN) inference accuracy as compared to a conventional floating point (FP) impl…

    Submitted 16 February, 2023; originally announced February 2023.

    Comments: 35 pages, 7 figures, 5 tables

  40. A 64-core mixed-signal in-memory compute chip based on phase-change memory for deep neural network inference

    Authors: Manuel Le Gallo, Riduan Khaddam-Aljameh, Milos Stanisavljevic, Athanasios Vasilopoulos, Benedikt Kersting, Martino Dazzi, Geethan Karunaratne, Matthias Braendli, Abhairaj Singh, Silvia M. Mueller, Julian Buechel, Xavier Timoneda, Vinay Joshi, Urs Egger, Angelo Garofalo, Anastasios Petropoulos, Theodore Antonakopoulos, Kevin Brew, Samuel Choi, Injo Ok, Timothy Philip, Victor Chan, Claire Silvestre, Ishtiaq Ahsan, Nicole Saulnier, et al. (4 additional authors not shown)

    Abstract: The need to repeatedly shuttle around synaptic weight values from memory to processing units has been a key source of energy inefficiency associated with hardware implementation of artificial neural networks. Analog in-memory computing (AIMC) with spatially instantiated synaptic weights holds high promise to overcome this challenge, by performing matrix-vector multiplications (MVMs) directly withi…

    Submitted 6 December, 2022; originally announced December 2022.

    Journal ref: Nature Electronics 6, 680-693 (2023)

  41. arXiv:2211.05052  [pdf, other]

    cs.ET cs.CV cs.LG cs.NE

    In-memory factorization of holographic perceptual representations

    Authors: Jovin Langenegger, Geethan Karunaratne, Michael Hersche, Luca Benini, Abu Sebastian, Abbas Rahimi

    Abstract: Disentanglement of constituent factors of a sensory signal is central to perception and cognition and hence is a critical task for future artificial intelligence systems. In this paper, we present a compute engine capable of efficiently factorizing holographic perceptual representations by exploiting the computation-in-superposition capability of brain-inspired hyperdimensional computing and the i…

    Submitted 16 February, 2023; v1 submitted 9 November, 2022; originally announced November 2022.

    Comments: 23 pages, 4 figures, 1 extended data figure, 3 supplementary notes, 2 supplementary figures and 3 supplementary tables

  42. arXiv:2209.10481  [pdf, other]

    cs.ET cs.LG hep-ex

    Benchmarking energy consumption and latency for neuromorphic computing in condensed matter and particle physics

    Authors: Dominique J. Kösters, Bryan A. Kortman, Irem Boybat, Elena Ferro, Sagar Dolas, Roberto de Austri, Johan Kwisthout, Hans Hilgenkamp, Theo Rasing, Heike Riel, Abu Sebastian, Sascha Caron, Johan H. Mentink

    Abstract: The massive use of artificial neural networks (ANNs), increasingly popular in many areas of scientific computing, rapidly increases the energy consumption of modern high-performance computing systems. An appealing and possibly more sustainable alternative is provided by novel neuromorphic paradigms, which directly implement ANNs in hardware. However, little is known about the actual benefits of ru…

    Submitted 21 February, 2023; v1 submitted 21 September, 2022; originally announced September 2022.

    Comments: 7 pages, 4 figures, submitted to and accepted by APL Machine Learning

    Journal ref: APL Machine Learning 1, 016101 (2023)

  43. arXiv:2207.06810  [pdf, other]

    cs.LG

    In-memory Realization of In-situ Few-shot Continual Learning with a Dynamically Evolving Explicit Memory

    Authors: Geethan Karunaratne, Michael Hersche, Jovin Langenegger, Giovanni Cherubini, Manuel Le Gallo-Bourdeau, Urs Egger, Kevin Brew, Sam Choi, INJO OK, Mary Claire Silvestre, Ning Li, Nicole Saulnier, Victor Chan, Ishtiaq Ahsan, Vijay Narayanan, Luca Benini, Abu Sebastian, Abbas Rahimi

    Abstract: Continually learning new classes from a few training examples without forgetting previous old classes demands a flexible architecture with an inevitably growing portion of storage, in which new examples and classes can be incrementally stored and efficiently retrieved. One viable architectural solution is to tightly couple a stationary deep neural network to a dynamically evolving explicit memory…

    Submitted 14 July, 2022; originally announced July 2022.

    Comments: Accepted at the European Solid-state Devices and Circuits Conference (ESSDERC), September 2022

  44. arXiv:2205.10889  [pdf, other]

    cs.AR

    Wireless On-Chip Communications for Scalable In-memory Hyperdimensional Computing

    Authors: Robert Guirado, Abbas Rahimi, Geethan Karunaratne, Eduard Alarcón, Abu Sebastian, Sergi Abadal

    Abstract: Hyperdimensional computing (HDC) is an emerging computing paradigm that represents, manipulates, and communicates data using very long random vectors (aka hypervectors). Among different hardware platforms capable of executing HDC algorithms, in-memory computing (IMC) systems have been recently proved to be one of the most energy-efficient options, due to hypervector manipulations in the memory its…

    Submitted 22 May, 2022; originally announced May 2022.

    Comments: This paper has been accepted at 2022 IEEE International Joint Conference on Neural Networks (IJCNN)

  45. ALPINE: Analog In-Memory Acceleration with Tight Processor Integration for Deep Learning

    Authors: Joshua Klein, Irem Boybat, Yasir Qureshi, Martino Dazzi, Alexandre Levisse, Giovanni Ansaloni, Marina Zapater, Abu Sebastian, David Atienza

    Abstract: Analog in-memory computing (AIMC) cores offer significant performance and energy benefits for neural network inference with respect to digital logic (e.g., CPUs). AIMCs accelerate matrix-vector multiplications, which dominate these applications' run-time. However, AIMC-centric platforms lack the flexibility of general-purpose systems, as they often have hard-coded data flows and can only support…

    Submitted 13 December, 2022; v1 submitted 20 May, 2022; originally announced May 2022.

    Comments: Accepted by IEEE Transactions on Computers, December 2022

    ACM Class: C.4; I.6.0

  46. arXiv:2203.16588  [pdf, other]

    cs.CV cs.LG

    Constrained Few-shot Class-incremental Learning

    Authors: Michael Hersche, Geethan Karunaratne, Giovanni Cherubini, Luca Benini, Abu Sebastian, Abbas Rahimi

    Abstract: Continually learning new classes from fresh data without forgetting previous knowledge of old classes is a very challenging research problem. Moreover, it is imperative that such learning must respect certain memory and computational constraints such as (i) training samples are limited to only a few per class, (ii) the computational cost of learning a novel class remains constant, and (iii) the me…

    Submitted 30 March, 2022; originally announced March 2022.

    Comments: CVPR 2022 camera-ready version

  47. Generalized Key-Value Memory to Flexibly Adjust Redundancy in Memory-Augmented Networks

    Authors: Denis Kleyko, Geethan Karunaratne, Jan M. Rabaey, Abu Sebastian, Abbas Rahimi

    Abstract: Memory-augmented neural networks enhance a neural network with an external key-value memory whose complexity is typically dominated by the number of support vectors in the key memory. We propose a generalized key-value memory that decouples its dimension from the number of support vectors by introducing a free parameter that can arbitrarily add or remove redundancy to the key memory representation…

    Submitted 11 March, 2022; originally announced March 2022.

    Comments: 8 pages, 7 figures

    Journal ref: IEEE Transactions on Neural Networks and Learning Systems, 2022

  48. arXiv:2203.04571  [pdf, other]

    cs.LG cs.AI cs.CV

    A Neuro-vector-symbolic Architecture for Solving Raven's Progressive Matrices

    Authors: Michael Hersche, Mustafa Zeqiri, Luca Benini, Abu Sebastian, Abbas Rahimi

    Abstract: Neither deep neural networks nor symbolic AI alone has approached the kind of intelligence expressed in humans. This is mainly because neural networks are not able to decompose joint representations to obtain distinct objects (the so-called binding problem), while symbolic AI suffers from exhaustive rule searches, among other problems. These two problems are still pronounced in neuro-symbolic AI w…

    Submitted 3 March, 2023; v1 submitted 9 March, 2022; originally announced March 2022.

    Comments: Updated version with additional NVSA end-to-end training, generalization experiments, and PGM experiments

  49. arXiv:2111.06503  [pdf, other]

    cs.AR cs.ET cs.LG

    AnalogNets: ML-HW Co-Design of Noise-robust TinyML Models and Always-On Analog Compute-in-Memory Accelerator

    Authors: Chuteng Zhou, Fernando Garcia Redondo, Julian Büchel, Irem Boybat, Xavier Timoneda Comas, S. R. Nandakumar, Shidhartha Das, Abu Sebastian, Manuel Le Gallo, Paul N. Whatmough

    Abstract: Always-on TinyML perception tasks in IoT applications require very high energy efficiency. Analog compute-in-memory (CiM) using non-volatile memory (NVM) promises high efficiency and also provides self-contained on-chip model storage. However, analog CiM introduces new practical considerations, including conductance drift, read/write noise, fixed analog-to-digital (ADC) converter gain, etc. These…

    Submitted 10 November, 2021; originally announced November 2021.

  50. Energy Efficient In-memory Hyperdimensional Encoding for Spatio-temporal Signal Processing

    Authors: Geethan Karunaratne, Manuel Le Gallo, Michael Hersche, Giovanni Cherubini, Luca Benini, Abu Sebastian, Abbas Rahimi

    Abstract: The emerging brain-inspired computing paradigm known as hyperdimensional computing (HDC) has been proven to provide a lightweight learning framework for various cognitive tasks compared to the widely used deep learning-based approaches. Spatio-temporal (ST) signal processing, which encompasses biosignals such as electromyography (EMG) and electroencephalography (EEG), is one family of applications…

    Submitted 22 June, 2021; originally announced June 2021.

    Journal ref: IEEE Transactions on Circuits and Systems II: Express Briefs, vol. 68, no. 5, pp. 1725-1729, May 2021