Skip to main content

Showing 1–4 of 4 results for author: Blanton, S

Searching in archive cs. Search in all archives.
.
  1. arXiv:2504.05180  [pdf, other

    cs.LG cs.AI

    BRIDGES: Bridging Graph Modality and Large Language Models within EDA Tasks

    Authors: Wei Li, Yang Zou, Christopher Ellis, Ruben Purdy, Shawn Blanton, José M. F. Moura

    Abstract: While many EDA tasks already involve graph-based data, existing LLMs in EDA primarily either represent graphs as sequential text, or simply ignore graph-structured data that might be beneficial like dataflow graphs of RTL code. Recent studies have found that LLM performance suffers when graphs are represented as sequential text, and using additional graph information significantly boosts performan… ▽ More

    Submitted 7 April, 2025; originally announced April 2025.

  2. arXiv:2412.19002  [pdf, other

    cs.AR cs.AI

    Tempus Core: Area-Power Efficient Temporal-Unary Convolution Core for Low-Precision Edge DLAs

    Authors: Prabhu Vellaisamy, Harideep Nair, Thomas Kang, Yichen Ni, Haoyang Fan, Bin Qi, Jeff Chen, Shawn Blanton, John Paul Shen

    Abstract: The increasing complexity of deep neural networks (DNNs) poses significant challenges for edge inference deployment due to resource and power constraints of edge devices. Recent works on unary-based matrix multiplication hardware aim to leverage data sparsity and low-precision values to enhance hardware efficiency. However, the adoption and integration of such unary hardware into commercial deep l… ▽ More

    Submitted 25 December, 2024; originally announced December 2024.

    Comments: Accepted in DATE 2025

  3. tubGEMM: Energy-Efficient and Sparsity-Effective Temporal-Unary-Binary Based Matrix Multiply Unit

    Authors: Prabhu Vellaisamy, Harideep Nair, Joseph Finn, Manav Trivedi, Albert Chen, Anna Li, Tsung-Han Lin, Perry Wang, Shawn Blanton, John Paul Shen

    Abstract: General Matrix Multiplication (GEMM) is a ubiquitous compute kernel in deep learning (DL). To support energy-efficient edge-native processing, new GEMM hardware units have been proposed that operate on unary encoded bitstreams using much simpler hardware. Most unary approaches thus far focus on rate-based unary encoding of values and perform stochastic approximate computation. This work presents t… ▽ More

    Submitted 23 December, 2024; originally announced December 2024.

    Comments: Published in 2023 IEEE Computer Society Annual Symposium on VLSI (ISVLSI)

  4. Commercial Evaluation of Zero-Skipping MAC Design for Bit Sparsity Exploitation in DL Inference

    Authors: Harideep Nair, Prabhu Vellaisamy, Tsung-Han Lin, Perry Wang, Shawn Blanton, John Paul Shen

    Abstract: General Matrix Multiply (GEMM) units, consisting of multiply-accumulate (MAC) arrays, perform bulk of the computation in deep learning (DL). Recent work has proposed a novel MAC design, Bit-Pragmatic (PRA), capable of dynamically exploiting bit sparsity. This work presents OzMAC (Omit-zero-MAC), a modified re-implementation of PRA, but extends beyond earlier works by performing rigorous post-synth… ▽ More

    Submitted 2 January, 2025; v1 submitted 29 February, 2024; originally announced February 2024.

    Comments: Pre-print version of the publication in VLSI-SoC 2024