Showing 1–24 of 24 results for author: Salemi, A

Searching in archive cs.

  1. arXiv:2506.10844  [pdf, ps, other]

    cs.CL cs.IR

    CIIR@LiveRAG 2025: Optimizing Multi-Agent Retrieval Augmented Generation through Self-Training

    Authors: Alireza Salemi, Mukta Maddipatla, Hamed Zamani

    Abstract: This paper presents mRAG, a multi-agent retrieval-augmented generation (RAG) framework composed of specialized agents for subtasks such as planning, searching, reasoning, and coordination. Our system uses a self-training paradigm with reward-guided trajectory sampling to optimize inter-agent collaboration and enhance response generation. Evaluated on DataMorgana-derived datasets during the SIGIR 2…

    Submitted 12 June, 2025; originally announced June 2025.

  2. arXiv:2506.00137  [pdf, ps, other]

    cs.CL cs.IR cs.LG

    LaMP-QA: A Benchmark for Personalized Long-form Question Answering

    Authors: Alireza Salemi, Hamed Zamani

    Abstract: Personalization is essential for question answering systems that are user-centric. Despite its importance, personalization in answer generation has been relatively underexplored. This is mainly due to lack of resources for training and evaluating personalized question answering systems. We address this gap by introducing LaMP-QA -- a benchmark designed for evaluating personalized long-form answer…

    Submitted 30 May, 2025; originally announced June 2025.

  3. arXiv:2504.07794  [pdf, other]

    cs.CL cs.IR

    Plan-and-Refine: Diverse and Comprehensive Retrieval-Augmented Generation

    Authors: Alireza Salemi, Chris Samarinas, Hamed Zamani

    Abstract: This paper studies the limitations of (retrieval-augmented) large language models (LLMs) in generating diverse and comprehensive responses, and introduces the Plan-and-Refine (P&R) framework based on a two phase system design. In the global exploration phase, P&R generates a diverse set of plans for the given input, where each plan consists of a list of diverse query aspects with corresponding add…

    Submitted 10 April, 2025; originally announced April 2025.

  4. arXiv:2503.02614  [pdf, ps, other]

    cs.IR

    Personalized Generation In Large Model Era: A Survey

    Authors: Yiyan Xu, Jinghao Zhang, Alireza Salemi, Xinting Hu, Wenjie Wang, Fuli Feng, Hamed Zamani, Xiangnan He, Tat-Seng Chua

    Abstract: In the era of large models, content generation is gradually shifting to Personalized Generation (PGen), tailoring content to individual preferences and needs. This paper presents the first comprehensive survey on PGen, investigating existing research in this rapidly growing field. We conceptualize PGen from a unified perspective, systematically formalizing its key components, core objectives, and…

    Submitted 30 May, 2025; v1 submitted 4 March, 2025; originally announced March 2025.

    Comments: ACL 2025

  5. arXiv:2501.14956  [pdf, ps, other]

    cs.CL cs.AI cs.IR

    ExPerT: Effective and Explainable Evaluation of Personalized Long-Form Text Generation

    Authors: Alireza Salemi, Julian Killingback, Hamed Zamani

    Abstract: Evaluating personalized text generated by large language models (LLMs) is challenging, as only the LLM user, i.e., prompt author, can reliably assess the output, but re-engaging the same individuals across studies is infeasible. This paper addresses the challenge of evaluating personalized text generation by introducing ExPerT, an explainable reference-based evaluation framework. ExPerT leverages…

    Submitted 30 May, 2025; v1 submitted 24 January, 2025; originally announced January 2025.

  6. arXiv:2501.04167  [pdf, other]

    cs.CL cs.AI cs.IR

    Reasoning-Enhanced Self-Training for Long-Form Personalized Text Generation

    Authors: Alireza Salemi, Cheng Li, Mingyang Zhang, Qiaozhu Mei, Weize Kong, Tao Chen, Zhuowan Li, Michael Bendersky, Hamed Zamani

    Abstract: Personalized text generation requires a unique ability of large language models (LLMs) to learn from context that they often do not encounter during their standard training. One way to encourage LLMs to better use personalized context for generating outputs that better align with the user's expectations is to instruct them to reason over the user's past preferences, background knowledge, or writin…

    Submitted 7 January, 2025; originally announced January 2025.

  7. arXiv:2501.03545  [pdf, ps, other]

    cs.CL

    Beyond Factual Accuracy: Evaluating Coverage of Diverse Factual Information in Long-form Text Generation

    Authors: Chris Samarinas, Alexander Krubner, Alireza Salemi, Youngwoo Kim, Hamed Zamani

    Abstract: This paper presents ICAT, an evaluation framework for measuring coverage of diverse factual information in long-form text generation. ICAT breaks down a long output text into a list of atomic claims and not only verifies each claim through retrieval from a (reliable) knowledge source, but also computes the alignment between the atomic factual claims and various aspects expected to be presented in…

    Submitted 30 May, 2025; v1 submitted 7 January, 2025; originally announced January 2025.

  8. arXiv:2412.15246  [pdf, other]

    cs.CL cs.AI cs.AR cs.DC cs.IR

    Accelerating Retrieval-Augmented Generation

    Authors: Derrick Quinn, Mohammad Nouri, Neel Patel, John Salihu, Alireza Salemi, Sukhan Lee, Hamed Zamani, Mohammad Alian

    Abstract: An evolving solution to address hallucination and enhance accuracy in large language models (LLMs) is Retrieval-Augmented Generation (RAG), which involves augmenting LLMs with information retrieved from an external knowledge source, such as the web. This paper profiles several RAG execution pipelines and demystifies the complex interplay between their retrieval and generation phases. We demonstrat…

    Submitted 14 December, 2024; originally announced December 2024.

  9. arXiv:2410.09942  [pdf, ps, other]

    cs.CL cs.IR

    Learning to Rank for Multiple Retrieval-Augmented Models through Iterative Utility Maximization

    Authors: Alireza Salemi, Hamed Zamani

    Abstract: This paper investigates the design of a unified search engine to serve multiple retrieval-augmented generation (RAG) agents, each with a distinct task, backbone large language model (LLM), and RAG strategy. We introduce an iterative approach where the search engine generates retrieval results for the RAG agents and gathers feedback on the quality of the retrieved documents during an offline phase.…

    Submitted 25 June, 2025; v1 submitted 13 October, 2024; originally announced October 2024.

  10. arXiv:2409.09510  [pdf, ps, other]

    cs.CL

    Comparing Retrieval-Augmentation and Parameter-Efficient Fine-Tuning for Privacy-Preserving Personalization of Large Language Models

    Authors: Alireza Salemi, Hamed Zamani

    Abstract: Despite its substantial impact on various search, recommendation, and question answering tasks, privacy-preserving methods for personalizing large language models (LLMs) have received relatively limited exploration. There is one primary approach in this area through retrieval-augmented generation (RAG), which generates personalized outputs by enriching the input prompt with information retrieved f…

    Submitted 25 June, 2025; v1 submitted 14 September, 2024; originally announced September 2024.

  11. arXiv:2407.12982  [pdf, other]

    cs.LG cs.CL cs.IR

    Retrieval-Enhanced Machine Learning: Synthesis and Opportunities

    Authors: To Eun Kim, Alireza Salemi, Andrew Drozdov, Fernando Diaz, Hamed Zamani

    Abstract: In the field of language modeling, models augmented with retrieval components have emerged as a promising solution to address several challenges faced in the natural language processing (NLP) field, including knowledge grounding, interpretability, and scalability. Despite the primary focus on NLP, we posit that the paradigm of retrieval-enhancement can be extended to a broader spectrum of machine…

    Submitted 18 October, 2024; v1 submitted 17 July, 2024; originally announced July 2024.

  12. arXiv:2407.11016  [pdf, other]

    cs.CL cs.LG

    LongLaMP: A Benchmark for Personalized Long-form Text Generation

    Authors: Ishita Kumar, Snigdha Viswanathan, Sushrita Yerra, Alireza Salemi, Ryan A. Rossi, Franck Dernoncourt, Hanieh Deilamsalehy, Xiang Chen, Ruiyi Zhang, Shubham Agarwal, Nedim Lipka, Chien Van Nguyen, Thien Huu Nguyen, Hamed Zamani

    Abstract: Long-text generation is seemingly ubiquitous in real-world applications of large language models such as generating an email or writing a review. Despite the fundamental importance and prevalence of long-text generation in many practical applications, existing work on personalized generation has focused on the generation of very short text. To overcome these limitations, we study the problem of pe…

    Submitted 14 October, 2024; v1 submitted 26 June, 2024; originally announced July 2024.

  13. arXiv:2405.00175  [pdf, other]

    cs.CL cs.IR

    Towards a Search Engine for Machines: Unified Ranking for Multiple Retrieval-Augmented Large Language Models

    Authors: Alireza Salemi, Hamed Zamani

    Abstract: This paper introduces uRAG--a framework with a unified retrieval engine that serves multiple downstream retrieval-augmented generation (RAG) systems. Each RAG system consumes the retrieval results for a unique purpose, such as open-domain question answering, fact verification, entity linking, and relation extraction. We introduce a generic training guideline that standardizes the communication bet…

    Submitted 30 April, 2024; originally announced May 2024.

  14. arXiv:2404.13781  [pdf, other]

    cs.CL cs.IR

    Evaluating Retrieval Quality in Retrieval-Augmented Generation

    Authors: Alireza Salemi, Hamed Zamani

    Abstract: Evaluating retrieval-augmented generation (RAG) presents challenges, particularly for retrieval models within these systems. Traditional end-to-end evaluation methods are computationally expensive. Furthermore, evaluation of the retrieval model's performance based on query-document relevance labels shows a small correlation with the RAG system's downstream performance. We propose a novel evaluatio…

    Submitted 21 April, 2024; originally announced April 2024.

  15. arXiv:2404.05970  [pdf, other]

    cs.CL cs.IR

    Optimization Methods for Personalizing Large Language Models through Retrieval Augmentation

    Authors: Alireza Salemi, Surya Kallumadi, Hamed Zamani

    Abstract: This paper studies retrieval-augmented approaches for personalizing large language models (LLMs), which potentially have a substantial impact on various applications and domains. We propose the first attempt to optimize the retrieval models that deliver a limited number of personal documents to large language models for the purpose of personalized generation. We develop two optimization algorithms…

    Submitted 8 April, 2024; originally announced April 2024.

  16. arXiv:2401.06466  [pdf, other]

    cs.CL cs.AI

    PersianMind: A Cross-Lingual Persian-English Large Language Model

    Authors: Pedram Rostami, Ali Salemi, Mohammad Javad Dousti

    Abstract: Large language models demonstrate remarkable proficiency in various linguistic tasks and have extensive knowledge across various domains. Although they perform best in English, their ability in other languages is notable too. In contrast, open-source models, such as LLaMa, are primarily trained on English datasets, resulting in poor performance in non-English languages. In this paper, we introduce…

    Submitted 12 January, 2024; originally announced January 2024.

  17. arXiv:2306.16478  [pdf, other]

    cs.IR cs.CL cs.CV

    Pre-Training Multi-Modal Dense Retrievers for Outside-Knowledge Visual Question Answering

    Authors: Alireza Salemi, Mahta Rafiee, Hamed Zamani

    Abstract: This paper studies a category of visual question answering tasks, in which accessing external knowledge is necessary for answering the questions. This category is called outside-knowledge visual question answering (OK-VQA). A major step in developing OK-VQA systems is to retrieve relevant documents for the given multi-modal query. Current state-of-the-art asymmetric dense retrieval model for this…

    Submitted 28 June, 2023; originally announced June 2023.

  18. arXiv:2304.13649  [pdf, other]

    cs.CV cs.CL cs.IR

    A Symmetric Dual Encoding Dense Retrieval Framework for Knowledge-Intensive Visual Question Answering

    Authors: Alireza Salemi, Juan Altmayer Pizzorno, Hamed Zamani

    Abstract: Knowledge-Intensive Visual Question Answering (KI-VQA) refers to answering a question about an image whose answer does not lie in the image. This paper presents a new pipeline for KI-VQA tasks, consisting of a retriever and a reader. First, we introduce DEDR, a symmetric dual encoding dense retrieval framework in which documents and queries are encoded into a shared embedding space using uni-modal…

    Submitted 26 April, 2023; originally announced April 2023.

  19. arXiv:2304.11406  [pdf, other]

    cs.CL

    LaMP: When Large Language Models Meet Personalization

    Authors: Alireza Salemi, Sheshera Mysore, Michael Bendersky, Hamed Zamani

    Abstract: This paper highlights the importance of personalization in large language models and introduces the LaMP benchmark -- a novel benchmark for training and evaluating language models for producing personalized outputs. LaMP offers a comprehensive evaluation framework with diverse language tasks and multiple entries for each user profile. It consists of seven personalized tasks, spanning three text cl…

    Submitted 4 June, 2024; v1 submitted 22 April, 2023; originally announced April 2023.

  20. arXiv:2304.01282  [pdf, other]

    cs.CL

    PEACH: Pre-Training Sequence-to-Sequence Multilingual Models for Translation with Semi-Supervised Pseudo-Parallel Document Generation

    Authors: Alireza Salemi, Amirhossein Abaskohi, Sara Tavakoli, Yadollah Yaghoobzadeh, Azadeh Shakery

    Abstract: Multilingual pre-training significantly improves many multilingual NLP tasks, including machine translation. Most existing methods are based on some variants of masked language modeling and text-denoising objectives on monolingual data. Multilingual pre-training on monolingual data ignores the availability of parallel data in many language pairs. Also, some other works integrate the available huma…

    Submitted 14 April, 2023; v1 submitted 3 April, 2023; originally announced April 2023.

    Comments: 15 pages, 5 figures, 16 tables, 1 algorithm, LoResMT@EACL 2023

    Journal ref: https://aclanthology.org/2023.loresmt-1.3

  21. arXiv:2112.13430  [pdf, other]

    cs.CR

    IoT Analytics and Blockchain

    Authors: Abbas Saleminezhadl, Manuel Remmele, Ravikumar Chaudhari, Rasha Kashef

    Abstract: The Internet of Things (IoT) is revolutionizing human life with the idea of interconnecting everyday used devices (Things) and making them smart. By establishing a communication network between devices, the IoT system aids in automating tasks and making them efficient and powerful. The sensors and the physical world, connected over a network, involve a massive amount of data. The data collection a…

    Submitted 26 December, 2021; originally announced December 2021.

  22. arXiv:2109.04098  [pdf, other]

    cs.CL

    ARMAN: Pre-training with Semantically Selecting and Reordering of Sentences for Persian Abstractive Summarization

    Authors: Alireza Salemi, Emad Kebriaei, Ghazal Neisi Minaei, Azadeh Shakery

    Abstract: Abstractive text summarization is one of the areas influenced by the emergence of pre-trained language models. Current pre-training works in abstractive summarization give more points to the summaries with more words in common with the main text and pay less attention to the semantic similarity between generated sentences and the original document. We propose ARMAN, a Transformer-based encoder-dec…

    Submitted 9 September, 2021; originally announced September 2021.

  23. arXiv:2104.04770  [pdf, other]

    cs.CL

    UTNLP at SemEval-2021 Task 5: A Comparative Analysis of Toxic Span Detection using Attention-based, Named Entity Recognition, and Ensemble Models

    Authors: Alireza Salemi, Nazanin Sabri, Emad Kebriaei, Behnam Bahrak, Azadeh Shakery

    Abstract: Detecting which parts of a sentence contribute to that sentence's toxicity -- rather than providing a sentence-level verdict of hatefulness -- would increase the interpretability of models and allow human moderators to better understand the outputs of the system. This paper presents our team's, UTNLP, methodology and results in the SemEval-2021 shared task 5 on toxic spans detection. We test multi…

    Submitted 10 April, 2021; originally announced April 2021.

  24. DISCERN: Diversity-based Selection of Centroids for k-Estimation and Rapid Non-stochastic Clustering

    Authors: Ali Hassani, Amir Iranmanesh, Mahdi Eftekhari, Abbas Salemi

    Abstract: One of the applications of center-based clustering algorithms such as K-Means is partitioning data points into K clusters. In some examples, the feature space relates to the underlying problem we are trying to solve, and sometimes we can obtain a suitable feature space. Nevertheless, while K-Means is one of the most efficient offline clustering algorithms, it is not equipped to estimate the number…

    Submitted 22 September, 2020; v1 submitted 14 October, 2019; originally announced October 2019.

    Comments: Int. J. Mach. Learn. & Cyber. (2020)