Showing 1–8 of 8 results for author: Balazadeh, V

  1. arXiv:2506.07918  [pdf, ps, other]

    cs.LG stat.ML

    CausalPFN: Amortized Causal Effect Estimation via In-Context Learning

    Authors: Vahid Balazadeh, Hamidreza Kamkari, Valentin Thomas, Benson Li, Junwei Ma, Jesse C. Cresswell, Rahul G. Krishnan

    Abstract: Causal effect estimation from observational data is fundamental across various applications. However, selecting an appropriate estimator from dozens of specialized methods demands substantial manual effort and domain expertise. We present CausalPFN, a single transformer that amortizes this workflow: trained once on a large library of simulated data-generating processes that satisfy ignorability, i…

    Submitted 9 June, 2025; originally announced June 2025.

  2. arXiv:2505.00467  [pdf, ps, other]

    cs.CL cs.AI

    Red Teaming Large Language Models for Healthcare

    Authors: Vahid Balazadeh, Michael Cooper, David Pellow, Atousa Assadi, Jennifer Bell, Mark Coastworth, Kaivalya Deshpande, Jim Fackler, Gabriel Funingana, Spencer Gable-Cook, Anirudh Gangadhar, Abhishek Jaiswal, Sumanth Kaja, Christopher Khoury, Amrit Krishnan, Randy Lin, Kaden McKeen, Sara Naimimohasses, Khashayar Namdar, Aviraj Newatia, Allan Pang, Anshul Pattoo, Sameer Peesapati, Diana Prepelita, Bogdana Rakova, et al. (10 additional authors not shown)

    Abstract: We present the design process and findings of the pre-conference workshop at the Machine Learning for Healthcare Conference (2024) entitled Red Teaming Large Language Models for Healthcare, which took place on August 15, 2024. Conference participants, comprising a mix of computational and clinical expertise, attempted to discover vulnerabilities -- realistic clinical prompts for which a large lang…

    Submitted 1 May, 2025; originally announced May 2025.

  3. arXiv:2412.08619  [pdf, other]

    cs.CV cs.AI

    Physics Context Builders: A Modular Framework for Physical Reasoning in Vision-Language Models

    Authors: Vahid Balazadeh, Mohammadmehdi Ataei, Hyunmin Cheong, Amir Hosein Khasahmadi, Rahul G. Krishnan

    Abstract: Physical reasoning, which involves interpreting object behaviors within dynamic environments, remains a significant challenge for Vision-Language Models (VLMs). The limitations in physical reasoning arise from an inability to translate learned knowledge into predictions about physical behavior. We perform a careful study to show how continual fine-tuning can mitigate this issue. However, fine-tuni…

    Submitted 10 March, 2025; v1 submitted 11 December, 2024; originally announced December 2024.

  4. arXiv:2410.14001  [pdf, other]

    cs.LG cs.CL

    Personalized Adaptation via In-Context Preference Learning

    Authors: Allison Lau, Younwoo Choi, Vahid Balazadeh, Keertana Chidambaram, Vasilis Syrgkanis, Rahul G. Krishnan

    Abstract: Reinforcement Learning from Human Feedback (RLHF) is widely used to align Language Models (LMs) with human preferences. However, existing approaches often neglect individual user preferences, leading to suboptimal personalization. We present the Preference Pretrained Transformer (PPT), a novel approach for adaptive personalization using online user feedback. PPT leverages the in-context learning c…

    Submitted 17 October, 2024; originally announced October 2024.

  5. arXiv:2404.07266  [pdf, ps, other]

    cs.LG

    Sequential Decision Making with Expert Demonstrations under Unobserved Heterogeneity

    Authors: Vahid Balazadeh, Keertana Chidambaram, Viet Nguyen, Rahul G. Krishnan, Vasilis Syrgkanis

    Abstract: We study the problem of online sequential decision-making given auxiliary demonstrations from experts who made their decisions based on unobserved contextual information. These demonstrations can be viewed as solving related but slightly different problems than what the learner faces. This setting arises in many application domains, such as self-driving cars, healthcare, and finance, where expert…

    Submitted 15 June, 2025; v1 submitted 10 April, 2024; originally announced April 2024.

  6. arXiv:2308.07480  [pdf, other]

    cs.LG stat.ME

    Order-based Structure Learning with Normalizing Flows

    Authors: Hamidreza Kamkari, Vahid Balazadeh, Vahid Zehtab, Rahul G. Krishnan

    Abstract: Estimating the causal structure of observational data is a challenging combinatorial search problem that scales super-exponentially with graph size. Existing methods use continuous relaxations to make this problem computationally tractable but often restrict the data-generating process to additive noise models (ANMs) through explicit or implicit assumptions. We present Order-based Structure Learni…

    Submitted 17 February, 2024; v1 submitted 14 August, 2023; originally announced August 2023.

  7. arXiv:2210.08139  [pdf, other]

    cs.LG

    Partial Identification of Treatment Effects with Implicit Generative Models

    Authors: Vahid Balazadeh, Vasilis Syrgkanis, Rahul G. Krishnan

    Abstract: We consider the problem of partial identification, the estimation of bounds on the treatment effects from observational data. Although studied using discrete treatment variables or in specific causal graphs (e.g., instrumental variables), partial identification has been recently explored using tools from deep generative modeling. We propose a new method for partial identification of average treatm…

    Submitted 14 October, 2022; originally announced October 2022.

  8. arXiv:2002.04258  [pdf, other]

    cs.LG cs.CY cs.HC eess.SY stat.ML

    Learning to Switch Among Agents in a Team via 2-Layer Markov Decision Processes

    Authors: Vahid Balazadeh, Abir De, Adish Singla, Manuel Gomez-Rodriguez

    Abstract: Reinforcement learning agents have been mostly developed and evaluated under the assumption that they will operate in a fully autonomous manner -- they will take all actions. In this work, our goal is to develop algorithms that, by learning to switch control between agents, allow existing reinforcement learning agents to operate under different automation levels. To this end, we first formally def…

    Submitted 30 June, 2023; v1 submitted 11 February, 2020; originally announced February 2020.

    Comments: Published in Transactions on Machine Learning Research