
Showing 1–2 of 2 results for author: Ambilduke, K

Searching in archive cs.
  1. arXiv:2503.10620  [pdf, other]

    cs.CL

    From TOWER to SPIRE: Adding the Speech Modality to a Text-Only LLM

    Authors: Kshitij Ambilduke, Ben Peters, Sonal Sannigrahi, Anil Keshwani, Tsz Kin Lam, Bruno Martins, Marcely Zanon Boito, André F. T. Martins

    Abstract: Large language models (LLMs) have shown remarkable performance and generalization capabilities across multiple languages and tasks, making them very attractive targets for multi-modality integration (e.g., images or speech). In this work, we extend an existing LLM to the speech modality via speech discretization and continued pre-training. In particular, we are interested in multilingual LLMs, suc…

    Submitted 13 March, 2025; originally announced March 2025.

  2. arXiv:2401.03314  [pdf, other]

    cs.CL cs.AI cs.LG

    Enhancing Context Through Contrast

    Authors: Kshitij Ambilduke, Aneesh Shetye, Diksha Bagade, Rishika Bhagwatkar, Khurshed Fitter, Prasad Vagdargi, Shital Chiddarwar

    Abstract: Neural machine translation benefits from semantically rich representations. Considerable progress in learning such representations has been achieved by language modelling and mutual information maximization objectives using contrastive learning. The language-dependent nature of language modelling introduces a trade-off between the universality of the learned representations and the model's perform…

    Submitted 6 January, 2024; originally announced January 2024.