Skip to main content

Showing 1–20 of 20 results for author: Sarwar, S M

.
  1. arXiv:2505.20737  [pdf, ps, other

    cs.AI

    RRO: LLM Agent Optimization Through Rising Reward Trajectories

    Authors: Zilong Wang, Jingfeng Yang, Sreyashi Nag, Samarth Varshney, Xianfeng Tang, Haoming Jiang, Jingbo Shang, Sheikh Muhammad Sarwar

    Abstract: Large language models (LLMs) have exhibited extraordinary performance in a variety of tasks while it remains challenging for them to solve complex multi-step tasks as agents. In practice, agents sensitive to the outcome of certain key steps which makes them likely to fail the task because of a subtle mistake in the planning trajectory. Recent approaches resort to calibrating the reasoning process… ▽ More

    Submitted 27 May, 2025; originally announced May 2025.

    Comments: preprint

  2. arXiv:2505.15196  [pdf, ps, other

    cs.CL

    EcomScriptBench: A Multi-task Benchmark for E-commerce Script Planning via Step-wise Intention-Driven Product Association

    Authors: Weiqi Wang, Limeng Cui, Xin Liu, Sreyashi Nag, Wenju Xu, Chen Luo, Sheikh Muhammad Sarwar, Yang Li, Hansu Gu, Hui Liu, Changlong Yu, Jiaxin Bai, Yifan Gao, Haiyang Zhang, Qi He, Shuiwang Ji, Yangqiu Song

    Abstract: Goal-oriented script planning, or the ability to devise coherent sequences of actions toward specific goals, is commonly employed by humans to plan for typical activities. In e-commerce, customers increasingly seek LLM-based assistants to generate scripts and recommend products at each step, thereby facilitating convenient and efficient shopping experiences. However, this capability remains undere… ▽ More

    Submitted 21 May, 2025; originally announced May 2025.

    Comments: ACL2025

  3. arXiv:2503.05786  [pdf, other

    cs.CL cs.HC cs.LG

    FedMentalCare: Towards Privacy-Preserving Fine-Tuned LLMs to Analyze Mental Health Status Using Federated Learning Framework

    Authors: S M Sarwar

    Abstract: With the increasing prevalence of mental health conditions worldwide, AI-powered chatbots and conversational agents have emerged as accessible tools to support mental health. However, deploying Large Language Models (LLMs) in mental healthcare applications raises significant privacy concerns, especially regarding regulations like HIPAA and GDPR. In this work, we propose FedMentalCare, a privacy-pr… ▽ More

    Submitted 13 March, 2025; v1 submitted 27 February, 2025; originally announced March 2025.

    Comments: 9 pages, 3 figures, 2 tables and 2 algorithms

    ACM Class: I.2.6; I.5.1; J.3; C.2.4; D.4.6

  4. arXiv:2502.18536  [pdf, other

    cs.CV cs.CL cs.IR cs.LG

    FilterRAG: Zero-Shot Informed Retrieval-Augmented Generation to Mitigate Hallucinations in VQA

    Authors: S M Sarwar

    Abstract: Visual Question Answering requires models to generate accurate answers by integrating visual and textual understanding. However, VQA models still struggle with hallucinations, producing convincing but incorrect answers, particularly in knowledge-driven and Out-of-Distribution scenarios. We introduce FilterRAG, a retrieval-augmented framework that combines BLIP-VQA with Retrieval-Augmented Generati… ▽ More

    Submitted 25 February, 2025; originally announced February 2025.

    Comments: 12 pages, 6 figures and 2 tables

  5. arXiv:2407.18553  [pdf, other

    cs.IR

    REAPER: Reasoning based Retrieval Planning for Complex RAG Systems

    Authors: Ashutosh Joshi, Sheikh Muhammad Sarwar, Samarth Varshney, Sreyashi Nag, Shrivats Agrawal, Juhi Naik

    Abstract: Complex dialog systems often use retrieved evidence to facilitate factual responses. Such RAG (Retrieval Augmented Generation) systems retrieve from massive heterogeneous data stores that are usually architected as multiple indexes or APIs instead of a single monolithic source. For a given query, relevant evidence needs to be retrieved from one or a small subset of possible retrieval sources. Comp… ▽ More

    Submitted 30 July, 2024; v1 submitted 26 July, 2024; originally announced July 2024.

  6. arXiv:2403.19836  [pdf, other

    cs.CL

    Target Span Detection for Implicit Harmful Content

    Authors: Nazanin Jafari, James Allan, Sheikh Muhammad Sarwar

    Abstract: Identifying the targets of hate speech is a crucial step in grasping the nature of such speech and, ultimately, in improving the detection of offensive posts on online forums. Much harmful content on online platforms uses implicit language especially when targeting vulnerable and protected groups such as using stereotypical characteristics instead of explicit target names, making it harder to dete… ▽ More

    Submitted 27 June, 2024; v1 submitted 28 March, 2024; originally announced March 2024.

  7. arXiv:2311.09134  [pdf, other

    cs.IR

    Scalable and Effective Generative Information Retrieval

    Authors: Hansi Zeng, Chen Luo, Bowen Jin, Sheikh Muhammad Sarwar, Tianxin Wei, Hamed Zamani

    Abstract: Recent research has shown that transformer networks can be used as differentiable search indexes by representing each document as a sequences of document ID tokens. These generative retrieval models cast the retrieval problem to a document ID generation problem for each given query. Despite their elegant design, existing generative retrieval models only perform well on artificially-constructed and… ▽ More

    Submitted 15 November, 2023; originally announced November 2023.

  8. arXiv:2112.07061  [pdf, other

    cs.CR

    Differentially Private Data Publication with Multi-level Data Utility

    Authors: Honglu Jiang, S M Sarwar, Haotian Yu, Sheikh Ariful Islam

    Abstract: Conventional private data publication mechanisms aim to retain as much data utility as possible while ensuring sufficient privacy protection on sensitive data. Such data publication schemes implicitly assume that all data analysts and users have the same data access privilege levels. However, it is not applicable for the scenario that data users often have different levels of access to the same da… ▽ More

    Submitted 13 December, 2021; originally announced December 2021.

  9. arXiv:2112.01704  [pdf, other

    cs.CR

    Differential Privacy in Privacy-Preserving Big Data and Learning: Challenge and Opportunity

    Authors: Honglu Jiang, Yifeng Gao, S M Sarwar, Luis GarzaPerez, Mahmudul Robin

    Abstract: Differential privacy (DP) has become the de facto standard of privacy preservation due to its strong protection and sound mathematical foundation, which is widely adopted in different applications such as big data analysis, graph data process, machine learning, deep learning, and federated learning. Although DP has become an active and influential area, it is not the best remedy for all privacy pr… ▽ More

    Submitted 2 December, 2021; originally announced December 2021.

    Comments: 12 pages

  10. arXiv:2109.04726  [pdf, other

    cs.CL cs.IR

    AutoTriggER: Label-Efficient and Robust Named Entity Recognition with Auxiliary Trigger Extraction

    Authors: Dong-Ho Lee, Ravi Kiran Selvam, Sheikh Muhammad Sarwar, Bill Yuchen Lin, Fred Morstatter, Jay Pujara, Elizabeth Boschee, James Allan, Xiang Ren

    Abstract: Deep neural models for named entity recognition (NER) have shown impressive results in overcoming label scarcity and generalizing to unseen entities by leveraging distant supervision and auxiliary information such as explanations. However, the costs of acquiring such additional information are generally prohibitive. In this paper, we present a novel two-stage framework (AutoTriggER) to improve NER… ▽ More

    Submitted 18 May, 2023; v1 submitted 10 September, 2021; originally announced September 2021.

    Comments: 15 pages, 13 figures, EACL 2023

  11. Mixed Attention Transformer for Leveraging Word-Level Knowledge to Neural Cross-Lingual Information Retrieval

    Authors: Zhiqi Huang, Hamed Bonab, Sheikh Muhammad Sarwar, Razieh Rahimi, James Allan

    Abstract: Pretrained contextualized representations offer great success for many downstream tasks, including document ranking. The multilingual versions of such pretrained representations provide a possibility of jointly learning many languages with the same model. Although it is expected to gain big with such joint training, in the case of cross lingual information retrieval (CLIR), the models under a mult… ▽ More

    Submitted 14 September, 2021; v1 submitted 6 September, 2021; originally announced September 2021.

  12. arXiv:2107.12866  [pdf, ps, other

    cs.CL

    Unsupervised Domain Adaptation for Hate Speech Detection Using a Data Augmentation Approach

    Authors: Sheikh Muhammad Sarwar, Vanessa Murdock

    Abstract: Online harassment in the form of hate speech has been on the rise in recent years. Addressing the issue requires a combination of content moderation by people, aided by automatic detection methods. As content moderation is itself harmful to the people doing it, we desire to reduce the burden by improving the automatic detection of hate speech. Hate speech presents a challenge as it is directed at… ▽ More

    Submitted 30 July, 2021; v1 submitted 27 July, 2021; originally announced July 2021.

  13. arXiv:2105.12936  [pdf, other

    cs.CL

    Corpus-Level Evaluation for Event QA: The IndiaPoliceEvents Corpus Covering the 2002 Gujarat Violence

    Authors: Andrew Halterman, Katherine A. Keith, Sheikh Muhammad Sarwar, Brendan O'Connor

    Abstract: Automated event extraction in social science applications often requires corpus-level evaluations: for example, aggregating text predictions across metadata and unbiased estimates of recall. We combine corpus-level evaluation requirements with a real-world, social science setting and introduce the IndiaPoliceEvents corpus--all 21,391 sentences from 1,257 English-language Times of India articles ab… ▽ More

    Submitted 27 May, 2021; originally announced May 2021.

    Comments: To appear in Findings of ACL 2021

    Journal ref: Findings of ACL 2021

  14. arXiv:2103.17055  [pdf, other

    cs.CL stat.ML

    A Neighbourhood Framework for Resource-Lean Content Flagging

    Authors: Sheikh Muhammad Sarwar, Dimitrina Zlatkova, Momchil Hardalov, Yoan Dinkov, Isabelle Augenstein, Preslav Nakov

    Abstract: We propose a novel framework for cross-lingual content flagging with limited target-language data, which significantly outperforms prior work in terms of predictive performance. The framework is based on a nearest-neighbour architecture. It is a modern instantiation of the vanilla k-nearest neighbour model, as we use Transformer representations in all its components. Our framework can adapt to new… ▽ More

    Submitted 27 January, 2022; v1 submitted 31 March, 2021; originally announced March 2021.

    Comments: Accepted to appear in Transactions of the Association for Computational Linguistics (TACL) -- this is a pre-MIT Press publication version

  15. arXiv:2103.00153  [pdf, other

    cs.CL cs.SI

    Detecting Harmful Content On Online Platforms: What Platforms Need Vs. Where Research Efforts Go

    Authors: Arnav Arora, Preslav Nakov, Momchil Hardalov, Sheikh Muhammad Sarwar, Vibha Nayak, Yoan Dinkov, Dimitrina Zlatkova, Kyle Dent, Ameya Bhatawdekar, Guillaume Bouchard, Isabelle Augenstein

    Abstract: The proliferation of harmful content on online platforms is a major societal problem, which comes in many different forms including hate speech, offensive language, bullying and harassment, misinformation, spam, violence, graphic content, sexual abuse, self harm, and many other. Online platforms seek to moderate such content to limit societal harm, to comply with legislation, and to create a more… ▽ More

    Submitted 6 June, 2023; v1 submitted 27 February, 2021; originally announced March 2021.

    Comments: The paper has been accepted for publication to ACM Computing Surveys (CSUR)

  16. arXiv:1907.01457  [pdf, ps, other

    cs.IR

    Semantic Driven Fielded Entity Retrieval

    Authors: Shahrzad Naseri, Sheikh Muhammad Sarwar, James Allan

    Abstract: A common approach for knowledge-base entity search is to consider an entity as a document with multiple fields. Models that focus on matching query terms in different fields are popular choices for searching such entity representations. An instance of such a model is FSDM (Fielded Sequential Dependence Model). We propose to integrate field-level semantic features into FSDM. We use FSDM to retrieve… ▽ More

    Submitted 2 July, 2019; originally announced July 2019.

  17. arXiv:1906.06849  [pdf, other

    cs.IR cs.CL

    A Multi-Task Architecture on Relevance-based Neural Query Translation

    Authors: Sheikh Muhammad Sarwar, Hamed Bonab, James Allan

    Abstract: We describe a multi-task learning approach to train a Neural Machine Translation (NMT) model with a Relevance-based Auxiliary Task (RAT) for search query translation. The translation process for Cross-lingual Information Retrieval (CLIR) task is usually treated as a black box and it is performed as an independent step. However, an NMT model trained on sentence-level parallel data is not aware of t… ▽ More

    Submitted 17 June, 2019; originally announced June 2019.

    Comments: Accepted for publication at ACL 2019

  18. arXiv:1806.04411  [pdf, other

    cs.IR

    Named Entity Recognition with Extremely Limited Data

    Authors: John Foley, Sheikh Muhammad Sarwar, James Allan

    Abstract: Traditional information retrieval treats named entity recognition as a pre-indexing corpus annotation task, allowing entity tags to be indexed and used during search. Named entity taggers themselves are typically trained on thousands or tens of thousands of examples labeled by humans. However, there is a long tail of named entities classes, and for these cases, labeled data may be impossible to… ▽ More

    Submitted 13 June, 2018; v1 submitted 12 June, 2018; originally announced June 2018.

    Comments: 6 pages

    ACM Class: H.3.2; H.3.3

  19. Term Relevance Feedback for Contextual Named Entity Retrieval

    Authors: Sheikh Muhammad Sarwar, John Foley, James Allan

    Abstract: We address the role of a user in Contextual Named Entity Retrieval (CNER), showing (1) that user identification of important context-bearing terms is superior to automated approaches, and (2) that further gains are possible if the user indicates the relative importance of those terms. CNER is similar in spirit to List Question answering and Entity disambiguation. However, the main focus of CNER is… ▽ More

    Submitted 8 January, 2018; originally announced January 2018.

  20. arXiv:1508.03856  [pdf, ps, other

    cs.IR cs.LG

    Two-stage Cascaded Classifier for Purchase Prediction

    Authors: Sheikh Muhammad Sarwar, Mahamudul Hasan, Dmitry I. Ignatov

    Abstract: In this paper we describe our machine learning solution for the RecSys Challenge, 2015. We have proposed a time efficient two-stage cascaded classifier for the prediction of buy sessions and purchased items within such sessions. Based on the model, several interesting features found, and formation of our own test bed, we have achieved a reasonable score. Usage of Random Forests helps us to cope wi… ▽ More

    Submitted 16 August, 2015; originally announced August 2015.