Showing 1–18 of 18 results for author: Madhavan, R

  1. arXiv:2506.16507

    cs.LG

    Robust Reward Modeling via Causal Rubrics

    Authors: Pragya Srivastava, Harman Singh, Rahul Madhavan, Gandharv Patil, Sravanti Addepalli, Arun Suggala, Rengarajan Aravamudhan, Soumya Sharma, Anirban Laha, Aravindan Raghuveer, Karthikeyan Shanmugam, Doina Precup

    Abstract: Reward models (RMs) are fundamental to aligning Large Language Models (LLMs) via human feedback, yet they often suffer from reward hacking. They tend to latch on to superficial or spurious attributes, such as response length or formatting, mistaking these cues learned from correlations in training data for the true causal drivers of quality (e.g., factuality, relevance). This occurs because standa…

    Submitted 19 June, 2025; originally announced June 2025.

  2. arXiv:2504.09022

    cs.MA

    Game-Theoretic Coordination For Time-Critical Missions of UAV Systems

    Authors: Mikayel Aramyan, Anna Manucharyan, Lusine Poghosyan, Rohith Madhavan, Tigran Bakaryan, Naira Hovakimyan

    Abstract: Cooperative missions involving Unmanned Aerial Vehicles (UAVs) in dynamic environments pose significant challenges in ensuring both coordination and agility. In this paper, we introduce a novel game-theoretic approach for time-critical missions, where each UAV optimizes a cost function that incorporates temporal and mission-specific constraints. The optimization is performed within a one-dimension…

    Submitted 11 April, 2025; originally announced April 2025.

  3. arXiv:2502.18293

    cs.LG cs.AI cs.CL

    AMPO: Active Multi-Preference Optimization for Self-play Preference Selection

    Authors: Taneesh Gupta, Rahul Madhavan, Xuchao Zhang, Chetan Bansal, Saravan Rajmohan

    Abstract: Multi-preference optimization enriches language-model alignment beyond pairwise preferences by contrasting entire sets of helpful and undesired responses, thereby enabling richer training signals for large language models. During self-play alignment, these models often produce numerous candidate answers per query, rendering it computationally infeasible to include all responses in the training obj…

    Submitted 8 June, 2025; v1 submitted 25 February, 2025; originally announced February 2025.

    Comments: Accepted at ICML 2025

  4. arXiv:2412.16378

    cs.LG cs.AI cs.CL

    REFA: Reference Free Alignment for multi-preference optimization

    Authors: Taneesh Gupta, Rahul Madhavan, Xuchao Zhang, Chetan Bansal, Saravan Rajmohan

    Abstract: We introduce REFA, a family of reference-free alignment methods that optimize over multiple user preferences while enforcing fine-grained length control. Our approach integrates deviation-based weighting to emphasize high-quality responses, length normalization to prevent trivial short-response solutions, and an EOS-probability regularizer to mitigate dataset-induced brevity biases. The…

    Submitted 24 February, 2025; v1 submitted 20 December, 2024; originally announced December 2024.

  5. arXiv:2412.04628

    cs.LG cs.AI cs.CL

    Multi-Preference Optimization: Generalizing DPO via Set-Level Contrasts

    Authors: Taneesh Gupta, Rahul Madhavan, Xuchao Zhang, Nagarajan Natarajan, Chetan Bansal, Saravan Rajmohan

    Abstract: Direct Preference Optimization (DPO) has become a popular approach for aligning language models using pairwise preferences. However, in practical post-training pipelines, on-policy generation typically yields multiple candidate responses per prompt, which are scored by a reward model to guide learning. In this setting, we propose Multi-Preference Optimization (MPO), a generalization of…

    Submitted 19 June, 2025; v1 submitted 5 December, 2024; originally announced December 2024.

  6. arXiv:2412.02626

    cs.CL cs.AI

    Time-Reversal Provides Unsupervised Feedback to LLMs

    Authors: Yerram Varun, Rahul Madhavan, Sravanti Addepalli, Arun Suggala, Karthikeyan Shanmugam, Prateek Jain

    Abstract: Large Language Models (LLMs) are typically trained to predict in the forward direction of time. However, recent works have shown that prompting these models to look back and critique their own generations can produce useful feedback. Motivated by this, we explore the question of whether LLMs can be empowered to think (predict and score) backwards to provide unsupervised feedback that complements f…

    Submitted 2 February, 2025; v1 submitted 3 December, 2024; originally announced December 2024.

    Comments: Accepted as a spotlight in NeurIPS 2024

    Journal ref: The Thirty-Eighth Annual Conference on Neural Information Processing Systems (NeurIPS), 2024

  7. arXiv:2410.21545

    cs.CL

    CARMO: Dynamic Criteria Generation for Context-Aware Reward Modelling

    Authors: Taneesh Gupta, Shivam Shandilya, Xuchao Zhang, Rahul Madhavan, Supriyo Ghosh, Chetan Bansal, Huaxiu Yao, Saravan Rajmohan

    Abstract: Reward modeling in large language models is susceptible to reward hacking, causing models to latch onto superficial features such as the tendency to generate lists or unnecessarily long responses. In reinforcement learning from human feedback (RLHF) and, more generally, during post-training, flawed reward signals often lead to outputs that optimize for these spurious correlates instead of genuine qua…

    Submitted 17 February, 2025; v1 submitted 28 October, 2024; originally announced October 2024.

  8. arXiv:2405.18626

    cs.LG cs.AI

    Causal Contextual Bandits with Adaptive Context

    Authors: Rahul Madhavan, Aurghya Maiti, Gaurav Sinha, Siddharth Barman

    Abstract: We study a variant of causal contextual bandits where the context is chosen based on an initial intervention chosen by the learner. At the beginning of each round, the learner selects an initial action, depending on which a stochastic context is revealed by the environment. The learner then selects a final action and receives a reward. Given $T$ rounds of interactions with the envi…

    Submitted 2 June, 2024; v1 submitted 28 May, 2024; originally announced May 2024.

    Comments: Reinforcement Learning Conference (RLC) 2024, 10 pages (31 pages including appendix), 8 plots. arXiv admin note: text overlap with arXiv:2111.00886

  9. arXiv:2401.15229

    cs.CY

    Evolving AI Risk Management: A Maturity Model based on the NIST AI Risk Management Framework

    Authors: Ravit Dotan, Borhane Blili-Hamelin, Ravi Madhavan, Jeanna Matthews, Joshua Scarpino

    Abstract: Researchers, government bodies, and organizations have been repeatedly calling for a shift in the responsible AI community from general principles to tangible and operationalizable practices in mitigating the potential sociotechnical harms of AI. Frameworks like the NIST AI RMF embody an emerging consensus on recommended practices in operationalizing sociotechnical harm mitigation. However, privat…

    Submitted 13 February, 2024; v1 submitted 26 January, 2024; originally announced January 2024.

  10. arXiv:2311.11229

    cs.CL

    Causal ATE Mitigates Unintended Bias in Controlled Text Generation

    Authors: Rahul Madhavan, Kahini Wadhawan

    Abstract: We study attribute control in language models through the method of Causal Average Treatment Effect (Causal ATE). Existing methods for the attribute control task in Language Models (LMs) check for the co-occurrence of words in a sentence with the attribute of interest, and control for them. However, spurious correlation of the words with the attribute in the training dataset can cause models to h…

    Submitted 16 February, 2024; v1 submitted 19 November, 2023; originally announced November 2023.

    Comments: 12 pages, 5 figures

  11. CFL: Causally Fair Language Models Through Token-level Attribute Controlled Generation

    Authors: Rahul Madhavan, Rishabh Garg, Kahini Wadhawan, Sameep Mehta

    Abstract: We propose a method to control the attributes of Language Models (LMs) for the text generation task using Causal Average Treatment Effect (ATE) scores and counterfactual augmentation. We explore this method in the context of LM detoxification and propose the Causally Fair Language (CFL) architecture for detoxifying pre-trained LMs in a plug-and-play manner. Our architecture is based on a Structu…

    Submitted 1 June, 2023; originally announced June 2023.

    Comments: 19 pages, 10 figures. Findings of ACL 2023

    Journal ref: Findings of the Association for Computational Linguistics: ACL 2023

  12. arXiv:2305.04638

    cs.LG cs.AI

    Learning Good Interventions in Causal Graphs via Covering

    Authors: Ayush Sawarni, Rahul Madhavan, Gaurav Sinha, Siddharth Barman

    Abstract: We study the causal bandit problem that entails identifying a near-optimal intervention from a specified set $A$ of (possibly non-atomic) interventions over a given causal graph. Here, an optimal intervention in $A$ is one that maximizes the expected value for a designated reward variable in the graph, and we use the standard notion of simple regret to quantify near optimality. Considering Berno…

    Submitted 8 May, 2023; originally announced May 2023.

    Comments: 26 pages

  13. arXiv:2203.03541

    cs.CL cs.AI

    Fairness for Text Classification Tasks with Identity Information Data Augmentation Methods

    Authors: Mohit Wadhwa, Mohan Bhambhani, Ashvini Jindal, Uma Sawant, Ramanujam Madhavan

    Abstract: Counterfactual fairness methods address the question: How would the prediction change if the sensitive identity attributes referenced in the text instance were different? These methods are entirely based on generating counterfactuals for the given training and test set instances. Counterfactual instances are commonly prepared by replacing sensitive identity terms, i.e., the identity terms present…

    Submitted 4 February, 2022; originally announced March 2022.

  14. arXiv:2111.00886

    cs.LG cs.AI

    Intervention Efficient Algorithm for Two-Stage Causal MDPs

    Authors: Rahul Madhavan, Aurghya Maiti, Gaurav Sinha, Siddharth Barman

    Abstract: We study Markov Decision Processes (MDP) wherein states correspond to causal graphs that stochastically generate rewards. In this setup, the learner's goal is to identify atomic interventions that lead to high rewards by intervening on variables at each state. Generalizing the recent causal-bandit framework, the current work develops (simple) regret minimization guarantees for two-stage causal MDP…

    Submitted 1 November, 2021; originally announced November 2021.

    Comments: 29 pages

  15. arXiv:2106.13849

    cs.CV eess.IV

    A CNN Segmentation-Based Approach to Object Detection and Tracking in Ultrasound Scans with Application to the Vagus Nerve Detection

    Authors: Abdullah F. Al-Battal, Yan Gong, Lu Xu, Timothy Morton, Chen Du, Yifeng Bu, Imanuel R Lerman, Radhika Madhavan, Truong Q. Nguyen

    Abstract: Ultrasound scanning is essential in several medical diagnostic and therapeutic applications. It is used to visualize and analyze anatomical features and structures that influence treatment plans. However, it is labor intensive, and its effectiveness is operator dependent. Real-time accurate and robust automatic detection and tracking of anatomical structures while scanning would significantly…

    Submitted 25 June, 2021; originally announced June 2021.

    Comments: 7 pages, 4 figures, submitted to the IEEE EMBC 2021 conference

  16. arXiv:2104.07361

    cs.LG

    Scale Invariant Monte Carlo under Linear Function Approximation with Curvature based step-size

    Authors: Rahul Madhavan, Hemanta Makwana

    Abstract: We study the feature-scaled version of the Monte Carlo algorithm with linear function approximation. This algorithm converges to a scale-invariant solution, which is not unduly affected by states having feature vectors with large norms. The usual versions of the MCMC algorithm, obtained by minimizing the least-squares criterion, do not produce solutions that give equal importance to all states irr…

    Submitted 29 May, 2022; v1 submitted 15 April, 2021; originally announced April 2021.

    Comments: 42 pages, 9 figures (9 pages main body with 5 figures)

  17. arXiv:2010.10737

    cs.LG stat.ML

    Directed Graph Representation through Vector Cross Product

    Authors: Ramanujam Madhavan, Mohit Wadhwa

    Abstract: Graph embedding methods embed the nodes of a graph in a low-dimensional vector space while preserving graph topology to carry out downstream tasks such as link prediction, node recommendation and clustering. These tasks depend on a similarity measure such as cosine similarity and Euclidean distance between a pair of embeddings that are symmetric in nature and hence do not hold good for directed…

    Submitted 20 October, 2020; originally announced October 2020.

  18. arXiv:2002.12143

    cs.LG stat.ML

    Fairness-Aware Learning with Prejudice Free Representations

    Authors: Ramanujam Madhavan, Mohit Wadhwa

    Abstract: Machine learning models are extensively being used to make decisions that have a significant impact on human life. These models are trained over historical data that may contain information about sensitive attributes such as race, sex, religion, etc. The presence of such sensitive attributes can impact certain population subgroups unfairly. It is straightforward to remove sensitive features from t…

    Submitted 26 February, 2020; originally announced February 2020.