Showing 1–9 of 9 results for author: Vo, T V

  1. arXiv:2506.05445  [pdf, ps, other]

    cs.LG cs.AI

    Causal Policy Learning in Reinforcement Learning: Backdoor-Adjusted Soft Actor-Critic

    Authors: Thanh Vinh Vo, Young Lee, Haozhe Ma, Chien Lu, Tze-Yun Leong

    Abstract: Hidden confounders that influence both states and actions can bias policy learning in reinforcement learning (RL), leading to suboptimal or non-generalizable behavior. Most RL algorithms ignore this issue, learning policies from observational trajectories based solely on statistical associations rather than causal effects. We propose DoSAC (Do-Calculus Soft Actor-Critic with Backdoor Adjustment),…

    Submitted 5 June, 2025; originally announced June 2025.

    Comments: Preprint

  2. arXiv:2408.10858  [pdf, other]

    cs.LG cs.AI

    Centralized Reward Agent for Knowledge Sharing and Transfer in Multi-Task Reinforcement Learning

    Authors: Haozhe Ma, Zhengding Luo, Thanh Vinh Vo, Kuankuan Sima, Tze-Yun Leong

    Abstract: Reward shaping is effective in addressing the sparse-reward challenge in reinforcement learning by providing immediate feedback through auxiliary informative rewards. Based on the reward shaping strategy, we propose a novel multi-task reinforcement learning framework that integrates a centralized reward agent (CRA) and multiple distributed policy agents. The CRA functions as a knowledge pool, whic…

    Submitted 17 May, 2025; v1 submitted 20 August, 2024; originally announced August 2024.

  3. arXiv:2408.03029  [pdf, other]

    cs.LG cs.AI

    Highly Efficient Self-Adaptive Reward Shaping for Reinforcement Learning

    Authors: Haozhe Ma, Zhengding Luo, Thanh Vinh Vo, Kuankuan Sima, Tze-Yun Leong

    Abstract: Reward shaping is a technique in reinforcement learning that addresses the sparse-reward problem by providing more frequent and informative rewards. We introduce a self-adaptive and highly efficient reward shaping mechanism that incorporates success rates derived from historical experiences as shaped rewards. The success rates are sampled from Beta distributions, which dynamically evolve from unce…

    Submitted 28 February, 2025; v1 submitted 6 August, 2024; originally announced August 2024.

  4. arXiv:2407.14811  [pdf, other]

    cs.CV cs.AI

    Decoupled Prompt-Adapter Tuning for Continual Activity Recognition

    Authors: Di Fu, Thanh Vinh Vo, Haozhe Ma, Tze-Yun Leong

    Abstract: Action recognition technology plays a vital role in enhancing security through surveillance systems, enabling better patient monitoring in healthcare, providing in-depth performance analysis in sports, and facilitating seamless human-AI collaboration in domains such as manufacturing and assistive technologies. The dynamic nature of data in these areas underscores the need for models that can conti…

    Submitted 20 July, 2024; originally announced July 2024.

  5. arXiv:2308.13047  [pdf, other]

    cs.LG cs.AI stat.ME

    Federated Causal Inference from Observational Data

    Authors: Thanh Vinh Vo, Young Lee, Tze-Yun Leong

    Abstract: Decentralized data sources are prevalent in real-world applications, posing a formidable challenge for causal inference. These sources cannot be consolidated into a single entity owing to privacy constraints. The presence of dissimilar data distributions and missing values within them can potentially introduce bias to the causal estimands. In this article, we propose a framework to estimate causal…

    Submitted 30 May, 2024; v1 submitted 24 August, 2023; originally announced August 2023.

    Comments: Preprint. arXiv admin note: substantial text overlap with arXiv:2301.00346

  6. arXiv:2301.00346  [pdf, other]

    cs.LG cs.AI stat.ME stat.ML

    An Adaptive Kernel Approach to Federated Learning of Heterogeneous Causal Effects

    Authors: Thanh Vinh Vo, Arnab Bhattacharyya, Young Lee, Tze-Yun Leong

    Abstract: We propose a new causal inference framework to learn causal effects from multiple, decentralized data sources in a federated setting. We introduce an adaptive transfer algorithm that learns the similarities among the data sources by utilizing Random Fourier Features to disentangle the loss function into multiple components, each of which is associated with a data source. The data sources may have…

    Submitted 31 December, 2022; originally announced January 2023.

    Comments: NeurIPS 2022

  7. arXiv:2106.00456  [pdf, other]

    stat.ME cs.AI cs.CR cs.LG

    Federated Estimation of Causal Effects from Observational Data

    Authors: Thanh Vinh Vo, Trong Nghia Hoang, Young Lee, Tze-Yun Leong

    Abstract: Many modern applications collect data that comes in federated spirit, with data kept locally and undisclosed. Till date, most insight into the causal inference requires data to be stored in a central repository. We present a novel framework for causal inference with federated data sources. We assess and integrate local causal effects from different private data sources without centralizing them. T…

    Submitted 31 May, 2021; originally announced June 2021.

    Comments: Preprint

  8. arXiv:2105.14877  [pdf, other]

    cs.LG cs.AI stat.ME

    Adaptive Multi-Source Causal Inference

    Authors: Thanh Vinh Vo, Pengfei Wei, Trong Nghia Hoang, Tze-Yun Leong

    Abstract: Data scarcity is a tremendous challenge in causal effect estimation. In this paper, we propose to exploit additional data sources to facilitate estimating causal effects in the target population. Specifically, we leverage additional source datasets which share similar causal mechanisms with the target observations to help infer causal effects of the target population. We propose three levels of kn…

    Submitted 31 May, 2021; originally announced May 2021.

    Comments: Preprint

  9. arXiv:2004.11497  [pdf, other]

    stat.ML cs.LG

    Causal Modeling with Stochastic Confounders

    Authors: Thanh Vinh Vo, Pengfei Wei, Wicher Bergsma, Tze-Yun Leong

    Abstract: This work extends causal inference with stochastic confounders. We propose a new approach to variational estimation for causal inference based on a representer theorem with a random input space. We estimate causal effects involving latent confounders that may be interdependent and time-varying from sequential, repeated measurements in an observational study. Our approach extends current work that…

    Submitted 25 January, 2021; v1 submitted 23 April, 2020; originally announced April 2020.

    Comments: AISTATS 2021