Showing 1–12 of 12 results for author: Yao, D

Searching in archive stat.
  1. arXiv:2503.04684  [pdf, other]

    stat.ML cs.LG math.NA

    Propagating Model Uncertainty through Filtering-based Probabilistic Numerical ODE Solvers

    Authors: Dingling Yao, Filip Tronarp, Nathanael Bosch

    Abstract: Filtering-based probabilistic numerical solvers for ordinary differential equations (ODEs), also known as ODE filters, have been established as efficient methods for quantifying numerical uncertainty in the solution of ODEs. In practical applications, however, the underlying dynamical system often contains uncertain parameters, requiring the propagation of this model uncertainty to the ODE solutio…

    Submitted 6 March, 2025; originally announced March 2025.

  2. Combining Incomplete Observational and Randomized Data for Heterogeneous Treatment Effects

    Authors: Dong Yao, Caizhi Tang, Qing Cui, Longfei Li

    Abstract: Data from observational studies (OSs) is widely available and readily obtainable yet frequently contains confounding biases. On the other hand, data derived from randomized controlled trials (RCTs) helps to reduce these biases; however, it is expensive to gather, resulting in a tiny size of randomized data. For this reason, effectively fusing observational data and randomized data to better estima…

    Submitted 28 October, 2024; originally announced October 2024.

    Comments: 10 pages, 4 figures, Accepted By CIKM2024

  3. arXiv:2409.02772  [pdf, other]

    cs.LG stat.ML

    Unifying Causal Representation Learning with the Invariance Principle

    Authors: Dingling Yao, Dario Rancati, Riccardo Cadei, Marco Fumero, Francesco Locatello

    Abstract: Causal representation learning (CRL) aims at recovering latent causal variables from high-dimensional observations to solve causal downstream tasks, such as predicting the effect of new interventions or more robust classification. A plethora of methods have been developed, each tackling carefully crafted problem settings that lead to different types of identifiability. These different settings are…

    Submitted 5 March, 2025; v1 submitted 4 September, 2024; originally announced September 2024.

    Comments: ICLR2025 Camera ready

  4. arXiv:2405.14953  [pdf, other]

    cs.LG cs.AI stat.ML

    MallowsPO: Fine-Tune Your LLM with Preference Dispersions

    Authors: Haoxian Chen, Hanyang Zhao, Henry Lam, David Yao, Wenpin Tang

    Abstract: Direct Preference Optimization (DPO) has recently emerged as a popular approach to improve reinforcement learning with human feedback (RLHF), leading to better techniques to fine-tune large language models (LLM). A weakness of DPO, however, lies in its lack of capability to characterize the diversity of human preferences. Inspired by Mallows' theory of preference ranking, we develop in this paper…

    Submitted 17 April, 2025; v1 submitted 23 May, 2024; originally announced May 2024.

  5. arXiv:2405.13888  [pdf, other]

    cs.LG stat.ML

    Marrying Causal Representation Learning with Dynamical Systems for Science

    Authors: Dingling Yao, Caroline Muller, Francesco Locatello

    Abstract: Causal representation learning promises to extend causal models to hidden causal variables from raw entangled measurements. However, most progress has focused on proving identifiability results in different settings, and we are not aware of any successful real-world application. At the same time, the field of dynamical systems benefited from deep learning and scaled to countless applications but d…

    Submitted 3 February, 2025; v1 submitted 22 May, 2024; originally announced May 2024.

    Comments: NeurIPS 2024 Camera Ready

  6. arXiv:2403.08335  [pdf, other]

    cs.LG cs.AI stat.ML

    A Sparsity Principle for Partially Observable Causal Representation Learning

    Authors: Danru Xu, Dingling Yao, Sébastien Lachapelle, Perouz Taslakian, Julius von Kügelgen, Francesco Locatello, Sara Magliacane

    Abstract: Causal representation learning aims at identifying high-level causal variables from perceptual data. Most methods assume that all latent causal variables are captured in the high-dimensional observations. We instead consider a partially observed setting, in which each measurement only provides information about a subset of the underlying causal state. Prior work has studied this setting with multi…

    Submitted 15 June, 2024; v1 submitted 13 March, 2024; originally announced March 2024.

    Comments: 45 pages, 32 figures, 16 tables

  7. arXiv:2303.10112  [pdf, other]

    cs.LG stat.ME

    Causal Discovery from Temporal Data: An Overview and New Perspectives

    Authors: Chang Gong, Di Yao, Chuzhe Zhang, Wenbin Li, Jingping Bi

    Abstract: Temporal data, representing chronological observations of complex systems, has always been a typical data structure that can be widely generated by many domains, such as industry, medicine and finance. Analyzing this type of data is extremely valuable for various applications. Thus, different temporal data analysis tasks, e.g., classification, clustering and prediction, have been proposed in the pas…

    Submitted 3 August, 2023; v1 submitted 17 March, 2023; originally announced March 2023.

    Comments: 54 pages, 7 figures

  8. arXiv:2108.00819  [pdf, other]

    cs.LG cs.AI stat.ML

    Active Learning in Gaussian Process State Space Model

    Authors: Hon Sum Alec Yu, Dingling Yao, Christoph Zimmer, Marc Toussaint, Duy Nguyen-Tuong

    Abstract: We investigate active learning in Gaussian Process state-space models (GPSSM). Our problem is to actively steer the system through latent states by determining its inputs such that the underlying dynamics can be optimally learned by a GPSSM. In order that the most informative inputs are selected, we employ mutual information as our active learning criterion. In particular, we present two approache…

    Submitted 30 July, 2021; originally announced August 2021.

    Comments: European Conference on Machine Learning and Principles and Practice of Knowledge Discovery in Databases (ECML PKDD) 2021

  9. arXiv:2106.15327  [pdf, other]

    cs.CV stat.ME

    Patch-Based Image Restoration using Expectation Propagation

    Authors: Dan Yao, Stephen McLaughlin, Yoann Altmann

    Abstract: This paper presents a new Expectation Propagation (EP) framework for image restoration using patch-based prior distributions. While Monte Carlo techniques are classically used to sample from intractable posterior distributions, they can suffer from scalability issues in high-dimensional inference problems such as image restoration. To address this issue, EP is used here to approximate the posterio…

    Submitted 10 November, 2021; v1 submitted 18 June, 2021; originally announced June 2021.

    Comments: 27 pages

  10. arXiv:2012.07915  [pdf, other]

    cs.DC stat.AP

    Prediction of High-Performance Computing Input/Output Variability and Its Application to Optimization for System Configurations

    Authors: Li Xu, Thomas Lux, Tyler Chang, Bo Li, Yili Hong, Layne Watson, Ali Butt, Danfeng Yao, Kirk Cameron

    Abstract: Performance variability is an important measure for a reliable high performance computing (HPC) system. Performance variability is affected by complicated interactions between numerous factors, such as CPU frequency, the number of input/output (IO) threads, and the IO scheduler. In this paper, we focus on HPC IO variability. The prediction of HPC variability is a challenging problem in the enginee…

    Submitted 14 December, 2020; originally announced December 2020.

    Comments: 29 pages, 8 figures

    Journal ref: Quality Engineering, 2021

  11. arXiv:1808.08068  [pdf, ps, other]

    cs.LG stat.ML

    Self-Paced Multi-Task Clustering

    Authors: Yazhou Ren, Xiaofan Que, Dezhong Yao, Zenglin Xu

    Abstract: Multi-task clustering (MTC) has attracted a lot of research attention in machine learning due to its ability to utilize the relationship among different tasks. Despite the success of traditional MTC models, they are either prone to getting stuck in local optima, or sensitive to outliers and noisy data. To alleviate these problems, we propose a novel self-paced multi-task clustering (SPMTC) paradigm. I…

    Submitted 24 August, 2018; originally announced August 2018.

  12. arXiv:1704.03106  [pdf, other]

    stat.ME

    3D mean Projective Shape Difference for Face Differentiation from Multiple Digital Camera Images

    Authors: K. D. Yao, V. Patrangenaru, D. Lester

    Abstract: We give a nonparametric methodology for hypothesis testing for equality of extrinsic mean objects on a manifold embedded in a numerical space. The results obtained in the general setting are detailed further in the case of 3D projective shapes represented in a space of symmetric matrices via the quadratic Veronese-Whitney (VW) embedding. Large sample and nonparametric bootstrap confidence regions…

    Submitted 27 April, 2017; v1 submitted 10 April, 2017; originally announced April 2017.