Showing 1–6 of 6 results for author: Shivdikar, K

  1. arXiv:2404.15510  [pdf, other]

    cs.AR cs.DC cs.LG cs.NE

    NeuraChip: Accelerating GNN Computations with a Hash-based Decoupled Spatial Accelerator

    Authors: Kaustubh Shivdikar, Nicolas Bohm Agostini, Malith Jayaweera, Gilbert Jonatan, Jose L. Abellan, Ajay Joshi, John Kim, David Kaeli

    Abstract: Graph Neural Networks (GNNs) are emerging as a formidable tool for processing non-euclidean data across various domains, ranging from social network analysis to bioinformatics. Despite their effectiveness, their adoption has not been pervasive because of scalability challenges associated with large-scale graph datasets, particularly when leveraging message passing. To tackle these challenges, we…

    Submitted 26 April, 2024; v1 submitted 23 April, 2024; originally announced April 2024.

    Comments: Visit https://neurachip.us for WebGUI based simulations

  2. arXiv:2312.10561  [pdf, other]

    cs.AR cs.AI cs.DC cs.LG

    Enabling Accelerators for Graph Computing

    Authors: Kaustubh Shivdikar

    Abstract: The advent of Graph Neural Networks (GNNs) has revolutionized the field of machine learning, offering a novel paradigm for learning on graph-structured data. Unlike traditional neural networks, GNNs are capable of capturing complex relationships and dependencies inherent in graph data, making them particularly suited for a wide range of applications including social network analysis, molecular che…

    Submitted 24 June, 2024; v1 submitted 16 December, 2023; originally announced December 2023.

    Comments: Northeastern University Doctoral Dissertation

  3. arXiv:2312.08656  [pdf, other]

    cs.LG cs.AI cs.DC

    MaxK-GNN: Extremely Fast GPU Kernel Design for Accelerating Graph Neural Networks Training

    Authors: Hongwu Peng, Xi Xie, Kaustubh Shivdikar, MD Amit Hasan, Jiahui Zhao, Shaoyi Huang, Omer Khan, David Kaeli, Caiwen Ding

    Abstract: In the acceleration of deep neural network training, the GPU has become the mainstream platform. GPUs face substantial challenges on GNNs, such as workload imbalance and memory access irregularities, leading to underutilized hardware. Existing solutions such as PyG, DGL with cuSPARSE, and GNNAdvisor frameworks partially address these challenges but memory traffic is still significant. We argue t…

    Submitted 18 March, 2024; v1 submitted 14 December, 2023; originally announced December 2023.

    Comments: ASPLOS 2024 accepted publication

    ACM Class: I.2; C.5

  4. GME: GPU-based Microarchitectural Extensions to Accelerate Homomorphic Encryption

    Authors: Kaustubh Shivdikar, Yuhui Bao, Rashmi Agrawal, Michael Shen, Gilbert Jonatan, Evelio Mora, Alexander Ingare, Neal Livesay, José L. Abellán, John Kim, Ajay Joshi, David Kaeli

    Abstract: Fully Homomorphic Encryption (FHE) enables the processing of encrypted data without decrypting it. FHE has garnered significant attention over the past decade as it supports secure outsourcing of data processing to remote cloud services. Despite its promise of strong data privacy and security guarantees, FHE introduces a slowdown of up to five orders of magnitude as compared to the same computatio…

    Submitted 19 September, 2023; originally announced September 2023.

  5. arXiv:2209.01290  [pdf, other]

    cs.CR cs.AR cs.DC cs.PF

    Accelerating Polynomial Multiplication for Homomorphic Encryption on GPUs

    Authors: Kaustubh Shivdikar, Gilbert Jonatan, Evelio Mora, Neal Livesay, Rashmi Agrawal, Ajay Joshi, Jose Abellan, John Kim, David Kaeli

    Abstract: Homomorphic Encryption (HE) enables users to securely outsource both the storage and computation of sensitive data to untrusted servers. Not only does HE offer an attractive solution for security in cloud systems, but lattice-based HE systems are also believed to be resistant to attacks by quantum computers. However, current HE implementations suffer from prohibitively high latency. For lattice-ba…

    Submitted 2 September, 2022; originally announced September 2022.

    Comments: Accepted, to be published at SEED 2022 conference (IEEE International Symposium on Secure and Private Execution Environment Design)

  6. SMASH: Sparse Matrix Atomic Scratchpad Hashing

    Authors: Kaustubh Shivdikar

    Abstract: Sparse matrices, more specifically SpGEMM kernels, are commonly found in a wide range of applications, spanning graph-based path-finding to machine learning algorithms (e.g., neural networks). A particular challenge in implementing SpGEMM kernels has been the pressure placed on DRAM memory. One approach to tackle this problem is to use an inner product method for the SpGEMM kernel implementation.…

    Submitted 28 May, 2021; originally announced May 2021.