Showing 1–11 of 11 results for author: Honavar, V G

  1. arXiv:2503.05079  [pdf, other]

    cs.LG

    On a Connection Between Imitation Learning and RLHF

    Authors: Teng Xiao, Yige Yuan, Mingxiao Li, Zhengyu Chen, Vasant G Honavar

    Abstract: This work studies the alignment of large language models with preference data from an imitation learning perspective. We establish a close theoretical connection between reinforcement learning from human feedback (RLHF) and imitation learning (IL), revealing that RLHF implicitly performs imitation learning on the preference data distribution. Building on this connection, we propose DIL, a principled…

    Submitted 6 March, 2025; originally announced March 2025.

    Comments: ICLR 2025

  2. arXiv:2502.00883  [pdf, other]

    cs.LG cs.CL

    SimPER: A Minimalist Approach to Preference Alignment without Hyperparameters

    Authors: Teng Xiao, Yige Yuan, Zhengyu Chen, Mingxiao Li, Shangsong Liang, Zhaochun Ren, Vasant G Honavar

    Abstract: Existing preference optimization objectives for language model alignment require additional hyperparameters that must be extensively tuned to achieve optimal performance, increasing both the complexity and time required for fine-tuning large language models. In this paper, we propose a simple yet effective hyperparameter-free preference optimization algorithm for alignment. We observe that promisi…

    Submitted 20 February, 2025; v1 submitted 2 February, 2025; originally announced February 2025.

    Comments: ICLR 2025

  3. arXiv:2412.14516  [pdf, other]

    cs.LG cs.CL

    Cal-DPO: Calibrated Direct Preference Optimization for Language Model Alignment

    Authors: Teng Xiao, Yige Yuan, Huaisheng Zhu, Mingxiao Li, Vasant G Honavar

    Abstract: We study the problem of aligning large language models (LLMs) with human preference data. Contrastive preference optimization has shown promising results in aligning LLMs with available preference data by optimizing the implicit reward associated with the policy. However, the contrastive objective focuses mainly on the relative values of implicit rewards associated with two responses while ignorin…

    Submitted 18 December, 2024; originally announced December 2024.

    Comments: Accepted by NeurIPS 2024 Main

  4. arXiv:2411.10821  [pdf, other]

    cs.LG q-bio.BM

    GeomCLIP: Contrastive Geometry-Text Pre-training for Molecules

    Authors: Teng Xiao, Chao Cui, Huaisheng Zhu, Vasant G. Honavar

    Abstract: Pretraining molecular representations is crucial for drug and material discovery. Recent methods focus on learning representations from geometric structures, effectively capturing 3D position information. Yet, they overlook the rich information in biomedical texts, which detail molecules' properties and substructures. With this in mind, we set up a data collection effort for 200K pairs of ground-s…

    Submitted 16 November, 2024; originally announced November 2024.

    Comments: BIBM 2024

  5. arXiv:2410.10093  [pdf, other]

    cs.CL cs.LG

    How to Leverage Demonstration Data in Alignment for Large Language Model? A Self-Imitation Learning Perspective

    Authors: Teng Xiao, Mingxiao Li, Yige Yuan, Huaisheng Zhu, Chao Cui, Vasant G Honavar

    Abstract: This paper introduces a novel generalized self-imitation learning ($\textbf{GSIL}$) framework, which effectively and efficiently aligns large language models with offline demonstration data. We develop $\textbf{GSIL}$ by deriving a surrogate objective of imitation learning with density ratio estimates, facilitating the use of self-generated data and optimizing the imitation learning objective with…

    Submitted 13 October, 2024; originally announced October 2024.

    Comments: EMNLP 2024 Main

  6. arXiv:2403.08167  [pdf, other]

    cs.LG cs.CL q-bio.QM

    MolBind: Multimodal Alignment of Language, Molecules, and Proteins

    Authors: Teng Xiao, Chao Cui, Huaisheng Zhu, Vasant G. Honavar

    Abstract: Recent advancements in biology and chemistry have leveraged multi-modal learning, integrating molecules and their natural language descriptions to enhance drug discovery. However, current pre-training frameworks are limited to two modalities, and designing a unified network to process different modalities (e.g., natural language, 2D molecular graphs, 3D molecular conformations, and 3D proteins) re…

    Submitted 2 April, 2024; v1 submitted 12 March, 2024; originally announced March 2024.

    Report number: 2403.08167

  7. arXiv:2403.07179  [pdf, other]

    cs.LG cs.CL q-bio.BM

    3M-Diffusion: Latent Multi-Modal Diffusion for Language-Guided Molecular Structure Generation

    Authors: Huaisheng Zhu, Teng Xiao, Vasant G Honavar

    Abstract: Generating molecular structures with desired properties is a critical task with broad applications in drug discovery and materials design. We propose 3M-Diffusion, a novel multi-modal molecular graph generation method, to generate diverse, ideally novel molecular structures with desired properties. 3M-Diffusion encodes molecular graphs into a graph latent space which it then aligns with the text s…

    Submitted 2 October, 2024; v1 submitted 11 March, 2024; originally announced March 2024.

  8. arXiv:2401.05667  [pdf, other]

    cs.LG cs.AI

    EsaCL: Efficient Continual Learning of Sparse Models

    Authors: Weijieying Ren, Vasant G Honavar

    Abstract: A key challenge in the continual learning setting is to efficiently learn a sequence of tasks without forgetting how to perform previously learned tasks. Many existing approaches to this problem work by either retraining the model on previous tasks or by expanding the model to accommodate new tasks. However, these approaches typically suffer from increased storage and computational requirements, a…

    Submitted 10 January, 2024; originally announced January 2024.

    Comments: SDM 2024: SIAM International Conference on Data Mining

  9. arXiv:2010.01101  [pdf]

    stat.AP cs.CY cs.LG q-bio.PE

    Commuting Network Spillovers and COVID-19 Deaths Across US Counties

    Authors: Christopher Seto, Aria Khademi, Corina Graif, Vasant G. Honavar

    Abstract: This study explored how population mobility flows form commuting networks across US counties and influence the spread of COVID-19. We utilized 3-level mixed effects negative binomial regression models to estimate the impact of network COVID-19 exposure on county confirmed cases and deaths over time. We also conducted weighting-based analyses to estimate the causal effect of network exposure. Resul…

    Submitted 10 February, 2021; v1 submitted 2 October, 2020; originally announced October 2020.

    Comments: Accepted for Presentation at The Population Association of America 2021

  10. arXiv:1707.00599  [pdf]

    cs.CY

    Advanced Cyberinfrastructure for Science, Engineering, and Public Policy

    Authors: Vasant G. Honavar, Katherine Yelick, Klara Nahrstedt, Holly Rushmeier, Jennifer Rexford, Mark D. Hill, Elizabeth Bradley, Elizabeth Mynatt

    Abstract: Progress in many domains increasingly benefits from our ability to view the systems through a computational lens, i.e., using computational abstractions of the domains; and our ability to acquire, share, integrate, and analyze disparate types of data. These advances would not be possible without the advanced data and computational cyberinfrastructure and tools for data capture, integration, analys…

    Submitted 30 June, 2017; originally announced July 2017.

    Comments: A Computing Community Consortium (CCC) white paper, 9 pages. arXiv admin note: text overlap with arXiv:1604.02006

  11. arXiv:1604.02006  [pdf]

    cs.CY cs.AI cs.DC cs.HC

    Accelerating Science: A Computing Research Agenda

    Authors: Vasant G. Honavar, Mark D. Hill, Katherine Yelick

    Abstract: The emergence of "big data" offers unprecedented opportunities for not only accelerating scientific advances but also enabling new modes of discovery. Scientific progress in many disciplines is increasingly enabled by our ability to examine natural phenomena through the computational lens, i.e., using algorithmic or information processing abstractions of the underlying processes; and our ability t…

    Submitted 6 April, 2016; originally announced April 2016.

    Comments: Computing Community Consortium (CCC) white paper, 17 pages