Showing 1–1 of 1 results for author: Tanikanti, A

  1. arXiv:2411.00136 [pdf, other]

    cs.LG

    LLM-Inference-Bench: Inference Benchmarking of Large Language Models on AI Accelerators

    Authors: Krishna Teja Chitty-Venkata, Siddhisanket Raskar, Bharat Kale, Farah Ferdaus, Aditya Tanikanti, Ken Raffenetti, Valerie Taylor, Murali Emani, Venkatram Vishwanath

    Abstract: Large Language Models (LLMs) have propelled groundbreaking advancements across several domains and are commonly used for text generation applications. However, the computational demands of these complex models pose significant challenges, requiring efficient hardware acceleration. Benchmarking the performance of LLMs across diverse hardware platforms is crucial to understanding their scalability a…

    Submitted 31 October, 2024; originally announced November 2024.