Skip to main content

Showing 1–4 of 4 results for author: Ho, D W C

Searching in archive cs. Search in all archives.
.
  1. arXiv:2410.06847  [pdf, other

    cs.AI cs.LG cs.RO

    A Safety Modulator Actor-Critic Method in Model-Free Safe Reinforcement Learning and Application in UAV Hovering

    Authors: Qihan Qi, Xinsong Yang, Gang Xia, Daniel W. C. Ho, Pengyang Tang

    Abstract: This paper proposes a safety modulator actor-critic (SMAC) method to address safety constraint and overestimation mitigation in model-free safe reinforcement learning (RL). A safety modulator is developed to satisfy safety constraints by modulating actions, allowing the policy to ignore safety constraint and focus on maximizing reward. Additionally, a distributional critic with a theoretical updat… ▽ More

    Submitted 9 October, 2024; originally announced October 2024.

  2. arXiv:2104.10637  [pdf, ps, other

    cs.LG math.FA stat.ML

    Robust Kernel-based Distribution Regression

    Authors: Zhan Yu, Daniel W. C. Ho, Ding-Xuan Zhou

    Abstract: Regularization schemes for regression have been widely studied in learning theory and inverse problems. In this paper, we study distribution regression (DR) which involves two stages of sampling, and aims at regressing from probability measures to real-valued responses over a reproducing kernel Hilbert space (RKHS). Recently, theoretical analysis on DR has been carried out via kernel ridge regress… ▽ More

    Submitted 21 April, 2021; originally announced April 2021.

    Comments: 29 pages

  3. arXiv:2006.09017  [pdf, ps, other

    cs.LG math.ST stat.ML

    Estimates on Learning Rates for Multi-Penalty Distribution Regression

    Authors: Zhan Yu, Daniel W. C. Ho

    Abstract: This paper is concerned with functional learning by utilizing two-stage sampled distribution regression. We study a multi-penalty regularization algorithm for distribution regression under the framework of learning theory. The algorithm aims at regressing to real valued outputs from probability measures. The theoretical analysis on distribution regression is far from maturity and quite challenging… ▽ More

    Submitted 28 November, 2023; v1 submitted 16 June, 2020; originally announced June 2020.

  4. arXiv:1004.3617  [pdf, ps, other

    cs.NI math.PR

    Consensus over a Random Network Generated by i.i.d. Stochastic Matrices

    Authors: Qingshuo Song, Guanrong Chen, Daniel W. C. Ho

    Abstract: Our goal is to find a necessary and sufficient condition on the consensus over a random network, generated by i.i.d. stochastic matrices. We show that the consensus problem in three different convergence modes (almost surely, in probability, and in L1) are equivalent, thus have the same necessary and sufficient condition. We obtain the necessary and sufficient condition through the stability in a… ▽ More

    Submitted 21 April, 2010; originally announced April 2010.