Showing 1–7 of 7 results for author: Tamber, M S

  1. arXiv:2505.19274  [pdf, ps, other]

    cs.IR

    Conventional Contrastive Learning Often Falls Short: Improving Dense Retrieval with Cross-Encoder Listwise Distillation and Synthetic Data

    Authors: Manveer Singh Tamber, Suleman Kazi, Vivek Sourabh, Jimmy Lin

    Abstract: We investigate improving the retrieval effectiveness of embedding models through the lens of corpus-specific fine-tuning. Prior work has shown that fine-tuning with queries generated using a dataset's retrieval corpus can boost retrieval effectiveness for the dataset. However, we find that surprisingly, fine-tuning using the conventional InfoNCE contrastive loss often reduces effectiveness in stat…

    Submitted 25 May, 2025; originally announced May 2025.

    Comments: Updated version of arXiv:2502.19712

  2. arXiv:2505.04847  [pdf, other]

    cs.CL cs.AI

    Benchmarking LLM Faithfulness in RAG with Evolving Leaderboards

    Authors: Manveer Singh Tamber, Forrest Sheng Bao, Chenyu Xu, Ge Luo, Suleman Kazi, Minseok Bae, Miaoran Li, Ofer Mendelevitch, Renyi Qu, Jimmy Lin

    Abstract: Hallucinations remain a persistent challenge for LLMs. RAG aims to reduce hallucinations by grounding responses in contexts. However, even when provided context, LLMs still frequently introduce unsupported information or contradictions. This paper presents our efforts to measure LLM hallucinations with a focus on summarization tasks, assessing how often various LLMs introduce hallucinations when s…

    Submitted 7 May, 2025; originally announced May 2025.

  3. arXiv:2502.19712  [pdf, other]

    cs.IR

    Teaching Dense Retrieval Models to Specialize with Listwise Distillation and LLM Data Augmentation

    Authors: Manveer Singh Tamber, Suleman Kazi, Vivek Sourabh, Jimmy Lin

    Abstract: While the current state-of-the-art dense retrieval models exhibit strong out-of-domain generalization, they might fail to capture nuanced domain-specific knowledge. In principle, fine-tuning these models for specialized retrieval tasks should yield higher effectiveness than relying on a one-size-fits-all model, but in practice, results can disappoint. We show that standard fine-tuning methods usin…

    Submitted 26 February, 2025; originally announced February 2025.

  4. arXiv:2501.18536  [pdf, other]

    cs.IR

    Illusions of Relevance: Using Content Injection Attacks to Deceive Retrievers, Rerankers, and LLM Judges

    Authors: Manveer Singh Tamber, Jimmy Lin

    Abstract: Consider a scenario in which a user searches for information, only to encounter texts flooded with misleading or non-relevant content. This scenario exemplifies a simple yet potent vulnerability in neural Information Retrieval (IR) pipelines: content injection attacks. We find that embedding models for retrieval, rerankers, and large language model (LLM) relevance judges are vulnerable to these at…

    Submitted 30 January, 2025; originally announced January 2025.

  5. arXiv:2410.13210  [pdf, other]

    cs.CL cs.AI

    FaithBench: A Diverse Hallucination Benchmark for Summarization by Modern LLMs

    Authors: Forrest Sheng Bao, Miaoran Li, Renyi Qu, Ge Luo, Erana Wan, Yujia Tang, Weisi Fan, Manveer Singh Tamber, Suleman Kazi, Vivek Sourabh, Mike Qi, Ruixuan Tu, Chenyu Xu, Matthew Gonzales, Ofer Mendelevitch, Amin Ahmad

    Abstract: Summarization is one of the most common tasks performed by large language models (LLMs), especially in applications like Retrieval-Augmented Generation (RAG). However, existing evaluations of hallucinations in LLM-generated summaries, and evaluations of hallucination detection models both suffer from a lack of diversity and recency in the LLM and LLM families considered. This paper introduces Fait…

    Submitted 17 October, 2024; originally announced October 2024.

  6. arXiv:2406.09355  [pdf, other]

    cs.IR

    Can't Hide Behind the API: Stealing Black-Box Commercial Embedding Models

    Authors: Manveer Singh Tamber, Jasper Xian, Jimmy Lin

    Abstract: Embedding models that generate dense vector representations of text are widely used and hold significant commercial value. Companies such as OpenAI and Cohere offer proprietary embedding models via paid APIs, but despite being "hidden" behind APIs, these models are not protected from theft. We present, to our knowledge, the first effort to "steal" these models for retrieval by training thief model…

    Submitted 3 May, 2025; v1 submitted 13 June, 2024; originally announced June 2024.

    Comments: NAACL Findings 2025

  7. arXiv:2312.16098  [pdf, other]

    cs.IR

    Scaling Down, LiTting Up: Efficient Zero-Shot Listwise Reranking with Seq2seq Encoder-Decoder Models

    Authors: Manveer Singh Tamber, Ronak Pradeep, Jimmy Lin

    Abstract: Recent work in zero-shot listwise reranking using LLMs has achieved state-of-the-art results. However, these methods are not without drawbacks. The proposed methods rely on large LLMs with billions of parameters and limited context sizes. This paper introduces LiT5-Distill and LiT5-Score, two methods for efficient zero-shot listwise reranking, leveraging T5 sequence-to-sequence encoder-decoder mod…

    Submitted 26 December, 2023; originally announced December 2023.