Showing 1–6 of 6 results for author: Radhakrishnan, K

  1. arXiv:2505.21870  [pdf, other]

    cs.CL cs.AI

    Evaluating the Retrieval Robustness of Large Language Models

    Authors: Shuyang Cao, Karthik Radhakrishnan, David Rosenberg, Steven Lu, Pengxiang Cheng, Lu Wang, Shiyue Zhang

    Abstract: Retrieval-augmented generation (RAG) generally enhances large language models' (LLMs) ability to solve knowledge-intensive tasks. But RAG may also lead to performance degradation due to imperfect retrieval and the model's limited ability to leverage retrieved content. In this work, we evaluate the robustness of LLMs in practical RAG setups (henceforth retrieval robustness). We focus on three resea…

    Submitted 27 May, 2025; originally announced May 2025.

    Comments: 19 pages

  2. arXiv:2505.15070  [pdf, ps, other]

    cs.IR cs.CL

    An Alternative to FLOPS Regularization to Effectively Productionize SPLADE-Doc

    Authors: Aldo Porco, Dhruv Mehra, Igor Malioutov, Karthik Radhakrishnan, Moniba Keymanesh, Daniel Preoţiuc-Pietro, Sean MacAvaney, Pengxiang Cheng

    Abstract: Learned Sparse Retrieval (LSR) models encode text as weighted term vectors, which need to be sparse to leverage inverted index structures during retrieval. SPLADE, the most popular LSR model, uses FLOPS regularization to encourage vector sparsity during training. However, FLOPS regularization does not ensure sparsity among terms - only within a given query or document. Terms with very high Documen…

    Submitted 20 May, 2025; originally announced May 2025.

    Comments: Accepted as a short paper at SIGIR 2025

  3. arXiv:2305.16252  [pdf, other]

    cs.CL

    Overcoming Catastrophic Forgetting in Massively Multilingual Continual Learning

    Authors: Genta Indra Winata, Lingjue Xie, Karthik Radhakrishnan, Shijie Wu, Xisen Jin, Pengxiang Cheng, Mayank Kulkarni, Daniel Preotiuc-Pietro

    Abstract: Real-life multilingual systems should be able to efficiently incorporate new languages as data distributions fed to the system evolve and shift over time. To do this, systems need to handle the issue of catastrophic forgetting, where the model performance drops for languages or tasks seen further in its past. In this paper, we study catastrophic forgetting, as well as methods to minimize this, in…

    Submitted 25 May, 2023; originally announced May 2023.

    Comments: ACL 2023 Findings

  4. arXiv:2110.04419  [pdf, other]

    cs.CL

    Detecting Community Sensitive Norm Violations in Online Conversations

    Authors: Chan Young Park, Julia Mendelsohn, Karthik Radhakrishnan, Kinjal Jain, Tushar Kanakagiri, David Jurgens, Yulia Tsvetkov

    Abstract: Online platforms and communities establish their own norms that govern what behavior is acceptable within the community. Substantial effort in NLP has focused on identifying unacceptable behaviors and, recently, on forecasting them before they occur. However, these efforts have largely focused on toxicity as the sole form of community norm violation. Such focus has overlooked the much larger set o…

    Submitted 8 October, 2021; originally announced October 2021.

    Comments: Findings of EMNLP 2021

  5. arXiv:2010.09927  [pdf, other]

    cs.CL cs.AI cs.DB cs.IR

    ColloQL: Robust Cross-Domain Text-to-SQL Over Search Queries

    Authors: Karthik Radhakrishnan, Arvind Srikantan, Xi Victoria Lin

    Abstract: Translating natural language utterances to executable queries is a helpful technique in making the vast amount of data stored in relational databases accessible to a wider range of non-tech-savvy end users. Prior work in this area has largely focused on textual input that is linguistically correct and semantically unambiguous. However, real-world user queries are often succinct, colloquial, and no…

    Submitted 19 October, 2020; originally announced October 2020.

    Comments: IntEx-SemPar Workshop at EMNLP 2020, 12 pages, 3 figures

  6. arXiv:1412.6149  [pdf, other]

    cs.NI cs.CV

    Design, Implementation and Simulation of a Cloud Computing System for Enhancing Real-time Video Services by using VANET and Onboard Navigation Systems

    Authors: Karim Hammoudi, Nabil Ajam, Mohamed Kasraoui, Fadi Dornaika, Karan Radhakrishnan, Karthik Bandi, Qing Cai, Sai Liu

    Abstract: In this paper, we propose a design for novel and experimental cloud computing systems. The proposed system aims at enhancing computational, communicational and analytical capabilities of road navigation services by merging several independent technologies, namely vision-based embedded navigation systems, prominent Cloud Computing Systems (CCSs) and Vehicular Ad-hoc NETwork (VANET). This work prese…

    Submitted 25 November, 2014; originally announced December 2014.

    Comments: paper accepted for publication in the proceedings of the "17ème Colloque Compression et Représentation des Signaux Audiovisuels" (CORESA), 5p., Reims, France, 2014. (preprint)