Skip to main content

Showing 1–2 of 2 results for author: Ramani, P

.
  1. arXiv:2408.15879  [pdf, other

    cs.AI cs.CL

    Persuasion Games using Large Language Models

    Authors: Ganesh Prasath Ramani, Shirish Karande, Santhosh V, Yash Bhatia

    Abstract: Large Language Models (LLMs) have emerged as formidable instruments capable of comprehending and producing human-like text. This paper explores the potential of LLMs, to shape user perspectives and subsequently influence their decisions on particular tasks. This capability finds applications in diverse domains such as Investment, Credit cards and Insurance, wherein they assist users in selecting a… ▽ More

    Submitted 1 September, 2024; v1 submitted 28 August, 2024; originally announced August 2024.

  2. arXiv:2407.08608  [pdf, other

    cs.LG cs.AI

    FlashAttention-3: Fast and Accurate Attention with Asynchrony and Low-precision

    Authors: Jay Shah, Ganesh Bikshandi, Ying Zhang, Vijay Thakkar, Pradeep Ramani, Tri Dao

    Abstract: Attention, as a core layer of the ubiquitous Transformer architecture, is the bottleneck for large language models and long-context applications. FlashAttention elaborated an approach to speed up attention on GPUs through minimizing memory reads/writes. However, it has yet to take advantage of new capabilities present in recent hardware, with FlashAttention-2 achieving only 35% utilization on the… ▽ More

    Submitted 12 July, 2024; v1 submitted 11 July, 2024; originally announced July 2024.