
Showing 1–29 of 29 results for author: Luo, D

Searching in archive stat.
  1. arXiv:2412.13667  [pdf, ps, other]

    cs.LG cs.AI stat.ME

    Exploring Multi-Modal Data with Tool-Augmented LLM Agents for Precise Causal Discovery

    Authors: ChengAo Shen, Zhengzhang Chen, Dongsheng Luo, Dongkuan Xu, Haifeng Chen, Jingchao Ni

    Abstract: Causal discovery is an imperative foundation for decision-making across domains, such as smart health, AI for drug discovery and AIOps. Traditional statistical causal discovery methods, while well-established, predominantly rely on observational data and often overlook the semantic cues inherent in cause-and-effect relationships. The advent of Large Language Models (LLMs) has ushered in an afforda…

    Submitted 31 May, 2025; v1 submitted 18 December, 2024; originally announced December 2024.

  2. arXiv:2408.06615  [pdf, other]

    math.NA stat.CO

    Gaussian mixture Taylor approximations of risk measures constrained by PDEs with Gaussian random field inputs

    Authors: Dingcheng Luo, Joshua Chen, Peng Chen, Omar Ghattas

    Abstract: This work considers the computation of risk measures for quantities of interest governed by PDEs with Gaussian random field parameters using Taylor approximations. While efficient, Taylor approximations are local to the point of expansion, and hence may degrade in accuracy when the variances of the input parameters are large. To address this challenge, we approximate the underlying Gaussian measur…

    Submitted 13 August, 2024; originally announced August 2024.

    Comments: 34 Pages, 13 Figures, 1 Table

    MSC Class: 65D32 (Primary) 35R60; 41A30; 65C20; 68U05 (Secondary)

  3. arXiv:2305.07089  [pdf, other]

    stat.ML cs.LG stat.ME

    Hierarchically Coherent Multivariate Mixture Networks

    Authors: Kin G. Olivares, David Luo, Cristian Challu, Stefania La Vattiata, Max Mergenthaler, Artur Dubrawski

    Abstract: Large collections of time series data are often organized into hierarchies with different levels of aggregation; examples include product and geographical groupings. Probabilistic coherent forecasting is tasked to produce forecasts consistent across levels of aggregation. In this study, we propose to augment neural forecasting architectures with a coherent multivariate mixture output. We optimize…

    Submitted 16 October, 2023; v1 submitted 11 May, 2023; originally announced May 2023.

  4. arXiv:2304.03928  [pdf]

    cs.LG stat.AP

    Interpretable machine learning-accelerated seed treatment by nanomaterials for environmental stress alleviation

    Authors: Hengjie Yu, Dan Luo, Sam F. Y. Li, Maozhen Qu, Da Liu, Yingchao He, Fang Cheng

    Abstract: Crops are constantly challenged by different environmental conditions. Seed treatment by nanomaterials is a cost-effective and environmentally-friendly solution for environmental stress mitigation in crop plants. Here, 56 seed nanopriming treatments are used to alleviate environmental stresses in maize. Seven selected nanopriming treatments significantly increase the stress resistance index (SRI)…

    Submitted 8 April, 2023; originally announced April 2023.

    Comments: 30 pages, 6 figures

  5. arXiv:2302.11837  [pdf, other]

    stat.ME

    Bounding the FDP in competition-based control of the FDR

    Authors: Arya Ebadi, Dong Luo, Jack Freestone, William Stafford Noble, Uri Keich

    Abstract: Competition-based approach to controlling the false discovery rate (FDR) recently rose to prominence when, generalizing it to sequential hypothesis testing, Barber and Candès used it as part of their knockoff-filter. Control of the FDR implies that the, arguably more important, false discovery proportion is only controlled in an average sense. We present TDC-SB and TDC-UB that provide upper predic…

    Submitted 23 February, 2023; originally announced February 2023.

    Comments: The original version of this paper appeared as arxiv:2011.11939v1. That version was split into two: one branch continuing as v2 & v3 of that original submission, and the other branch is now added here as a new submission

  6. arXiv:2207.03517  [pdf, ps, other]

    stat.ML cs.AI cs.LG

    HierarchicalForecast: A Reference Framework for Hierarchical Forecasting in Python

    Authors: Kin G. Olivares, Azul Garza, David Luo, Cristian Challú, Max Mergenthaler, Souhaib Ben Taieb, Shanika L. Wickramasuriya, Artur Dubrawski

    Abstract: Large collections of time series data are commonly organized into structures with different levels of aggregation; examples include product and geographical groupings. It is often important to ensure that the forecasts are coherent so that the predicted values at disaggregate levels add up to the aggregate forecast. The growing interest of the Machine Learning community in hierarchical forecasting…

    Submitted 10 October, 2024; v1 submitted 7 July, 2022; originally announced July 2022.

  7. arXiv:2201.12594  [pdf, other]

    cs.LG cs.AI stat.ML

    Robust Imitation Learning from Corrupted Demonstrations

    Authors: Liu Liu, Ziyang Tang, Lanqing Li, Dijun Luo

    Abstract: We consider offline Imitation Learning from corrupted demonstrations where a constant fraction of data can be noise or even arbitrary outliers. Classical approaches such as Behavior Cloning assume that demonstrations are collected by a presumably optimal expert, hence may fail drastically when learning from corrupted demonstrations. We propose a novel robust algorithm by minimizing a Median-of-M…

    Submitted 29 January, 2022; originally announced January 2022.

  8. arXiv:2105.14244  [pdf, other]

    cs.LG cs.SI stat.ML

    Learning Graphon Autoencoders for Generative Graph Modeling

    Authors: Hongteng Xu, Peilin Zhao, Junzhou Huang, Dixin Luo

    Abstract: Graphon is a nonparametric model that generates graphs with arbitrary sizes and can be induced from graphs easily. Based on this model, we propose a novel algorithmic framework called graphon autoencoder to build an interpretable and scalable graph generative model. This framework treats observed graphs as induced graphons in functional space and derives their latent representations by an…

    Submitted 29 May, 2021; originally announced May 2021.

  9. arXiv:2102.02741  [pdf, other]

    cs.LG stat.ML

    Hawkes Processes on Graphons

    Authors: Hongteng Xu, Dixin Luo, Hongyuan Zha

    Abstract: We propose a novel framework for modeling multiple multivariate point processes, each with heterogeneous event types that share an underlying space and obey the same generative mechanism. Focusing on Hawkes processes and their variants that are associated with Granger causality graphs, our model leverages an uncountable event type space and samples the graphs with different sizes from a nonparamet…

    Submitted 4 February, 2021; originally announced February 2021.

  10. arXiv:2012.05644  [pdf, other]

    cs.LG cs.SI stat.ML

    Learning Graphons via Structured Gromov-Wasserstein Barycenters

    Authors: Hongteng Xu, Dixin Luo, Lawrence Carin, Hongyuan Zha

    Abstract: We propose a novel and principled method to learn a nonparametric graph model called graphon, which is defined in an infinite-dimensional space and represents arbitrary-size graphs. Based on the weak regularity lemma from the theory of graphons, we leverage a step function to approximate a graphon. We show that the cut distance of graphons can be relaxed to the Gromov-Wasserstein distance of their…

    Submitted 17 December, 2020; v1 submitted 10 December, 2020; originally announced December 2020.

    Journal ref: AAAI 2021

  11. arXiv:2011.11939  [pdf, other]

    stat.ME stat.AP

    Competition-based control of the false discovery proportion

    Authors: Dong Luo, Arya Ebadi, Yilun He, Kristen Emery, William Stafford Noble, Uri Keich

    Abstract: Recently, Barber and Candès laid the theoretical foundation for a general framework for false discovery rate (FDR) control based on the notion of "knockoffs." A closely related FDR control methodology has long been employed in the analysis of mass spectrometry data, referred to there as "target-decoy competition" (TDC). However, any approach that aims to control the FDR, which is defined as the ex…

    Submitted 14 March, 2022; v1 submitted 24 November, 2020; originally announced November 2020.

    Comments: This revision focuses only on FDP-SD described in the original submission. A later submission will further develop the procedures for simultaneous bounds on the FDP

  12. arXiv:2010.05749  [pdf, ps, other]

    stat.ME

    Detecting the skewness of data from the five-number summary and its application in meta-analysis

    Authors: Jiandong Shi, Dehui Luo, Xiang Wan, Yue Liu, Jiming Liu, Zhaoxiang Bian, Tiejun Tong

    Abstract: For clinical studies with continuous outcomes, when the data are potentially skewed, researchers may choose to report the whole or part of the five-number summary (the sample median, the first and third quartiles, and the minimum and maximum values) rather than the sample mean and standard deviation. In the recent literature, it is often suggested to transform the five-number summary back to the s…

    Submitted 5 May, 2023; v1 submitted 12 October, 2020; originally announced October 2020.

    Comments: 40 pages, 10 figures, 7 tables

  13. arXiv:2010.01112  [pdf, other]

    cs.LG stat.ML

    FOCAL: Efficient Fully-Offline Meta-Reinforcement Learning via Distance Metric Learning and Behavior Regularization

    Authors: Lanqing Li, Rui Yang, Dijun Luo

    Abstract: We study the offline meta-reinforcement learning (OMRL) problem, a paradigm which enables reinforcement learning (RL) algorithms to quickly adapt to unseen tasks without any interactions with the environments, making RL truly practical in many real-world applications. This problem is still not fully understood, for which two major challenges need to be addressed. First, offline RL usually suffers…

    Submitted 6 May, 2021; v1 submitted 2 October, 2020; originally announced October 2020.

    Comments: 23 pages, 11 figures

  14. arXiv:2008.00747  [pdf, ps, other]

    econ.EM stat.ME

    Testing error distribution by kernelized Stein discrepancy in multivariate time series models

    Authors: Donghang Luo, Ke Zhu, Huan Gong, Dong Li

    Abstract: Knowing the error distribution is important in many multivariate time series applications. To alleviate the risk of error distribution mis-specification, testing methodologies are needed to detect whether the chosen error distribution is correct. However, the majority of the existing tests only deal with the multivariate normal distribution for some special multivariate time series models, and the…

    Submitted 3 August, 2020; originally announced August 2020.

  15. arXiv:2006.03160  [pdf, other]

    cs.LG stat.ML

    Hierarchical Optimal Transport for Robust Multi-View Learning

    Authors: Dixin Luo, Hongteng Xu, Lawrence Carin

    Abstract: Traditional multi-view learning methods often rely on two assumptions: (i) the samples in different views are well-aligned, and (ii) their representations in latent space obey the same distribution. Unfortunately, these two assumptions may be questionable in practice, which limits the application of multi-view learning. In this work, we propose a hierarchical optimal transport (HOT) method to…

    Submitted 8 June, 2020; v1 submitted 4 June, 2020; originally announced June 2020.

  16. arXiv:2003.02130  [pdf, ps, other]

    stat.ME

    Optimally estimating the sample standard deviation from the five-number summary

    Authors: Jiandong Shi, Dehui Luo, Hong Weng, Xian-Tao Zeng, Lu Lin, Haitao Chu, Tiejun Tong

    Abstract: When reporting the results of clinical studies, some researchers may choose the five-number summary (including the sample median, the first and third quartiles, and the minimum and maximum values) rather than the sample mean and standard deviation, particularly for skewed data. For these studies, when included in a meta-analysis, it is often desired to convert the five-number summary back to the s…

    Submitted 17 June, 2020; v1 submitted 3 March, 2020; originally announced March 2020.

    Comments: 30 pages and 4 figures and 2 tables. arXiv admin note: substantial text overlap with arXiv:1801.01267

    Journal ref: Research Synthesis Methods, 2020

  17. arXiv:2002.02913  [pdf, other]

    cs.LG stat.ML

    Learning Autoencoders with Relational Regularization

    Authors: Hongteng Xu, Dixin Luo, Ricardo Henao, Svati Shah, Lawrence Carin

    Abstract: A new algorithmic framework is proposed for learning autoencoders of data distributions. We minimize the discrepancy between the model and target distributions, with a relational regularization on the learnable latent prior. This regularization penalizes the fused Gromov-Wasserstein (FGW) distance between the latent prior and its corresponding posterior, allowing one to flexibly learn a str…

    Submitted 25 June, 2020; v1 submitted 7 February, 2020; originally announced February 2020.

    Journal ref: International conference on machine learning 2020

  18. arXiv:1910.02096  [pdf, other]

    cs.LG stat.ML

    Fused Gromov-Wasserstein Alignment for Hawkes Processes

    Authors: Dixin Luo, Hongteng Xu, Lawrence Carin

    Abstract: We propose a novel fused Gromov-Wasserstein alignment method to jointly learn the Hawkes processes in different event spaces, and align their event types. Given two Hawkes processes, we use fused Gromov-Wasserstein discrepancy to measure their dissimilarity, which considers both the Wasserstein discrepancy based on their base intensities and the Gromov-Wasserstein discrepancy based on their infect…

    Submitted 4 October, 2019; originally announced October 2019.

    Comments: The workshop on learning with temporal point processes in NeurIPS 2019 (WTPP19)

  19. arXiv:1906.08397  [pdf, other]

    stat.ML cs.LG

    Adversarial Self-Paced Learning for Mixture Models of Hawkes Processes

    Authors: Dixin Luo, Hongteng Xu, Lawrence Carin

    Abstract: We propose a novel adversarial learning strategy for mixture models of Hawkes processes, leveraging data augmentation techniques of Hawkes process in the framework of self-paced learning. Instead of learning a mixture model directly from a set of event sequences drawn from different Hawkes processes, the proposed method learns the target model iteratively, which generates "easy" sequences and uses…

    Submitted 19 June, 2019; originally announced June 2019.

  20. arXiv:1906.05492  [pdf, other]

    stat.AP cs.LG

    Interpretable ICD Code Embeddings with Self- and Mutual-Attention Mechanisms

    Authors: Dixin Luo, Hongteng Xu, Lawrence Carin

    Abstract: We propose a novel and interpretable embedding method to represent the international statistical classification codes of diseases and related health problems (i.e., ICD codes). This method considers a self-attention mechanism within the disease domain and a mutual-attention mechanism jointly between diseases and procedures. This framework captures the clinical relationships between the disease cod…

    Submitted 13 June, 2019; originally announced June 2019.

  21. arXiv:1905.07645  [pdf, other]

    cs.LG cs.SI stat.ML

    Scalable Gromov-Wasserstein Learning for Graph Partitioning and Matching

    Authors: Hongteng Xu, Dixin Luo, Lawrence Carin

    Abstract: We propose a scalable Gromov-Wasserstein learning (S-GWL) method and establish a novel and theoretically-supported paradigm for large-scale graph analysis. The proposed method is based on the fact that Gromov-Wasserstein discrepancy is a pseudometric on graphs. Given two graphs, the optimal transport associated with their Gromov-Wasserstein discrepancy provides the correspondence between their nod…

    Submitted 9 October, 2019; v1 submitted 18 May, 2019; originally announced May 2019.

    Comments: 33rd Conference on Neural Information Processing Systems (NeurIPS 2019), Vancouver, Canada

  22. arXiv:1903.07821  [pdf]

    cs.LG stat.ML

    POP-CNN: Predicting Odor's Pleasantness with Convolutional Neural Network

    Authors: Danli Wu, Yu Cheng, Dehan Luo, Kin-Yeung Wong, Kevin Hung, Zhijing Yang

    Abstract: Predicting odor's pleasantness simplifies the evaluation of odors and has the potential to be applied in the perfume and environmental monitoring industries. Classical algorithms for predicting odor's pleasantness generally use a manual feature extractor and an independent classifier. Manually designing a good feature extractor depends on expert knowledge and experience and is the key to the accuracy of the a…

    Submitted 19 March, 2019; originally announced March 2019.

  23. arXiv:1901.06003  [pdf, other]

    cs.LG cs.SI stat.ML

    Gromov-Wasserstein Learning for Graph Matching and Node Embedding

    Authors: Hongteng Xu, Dixin Luo, Hongyuan Zha, Lawrence Carin

    Abstract: A novel Gromov-Wasserstein learning framework is proposed to jointly match (align) graphs and learn embedding vectors for the associated graph nodes. Using Gromov-Wasserstein discrepancy, we measure the dissimilarity between two graphs and find their correspondence, according to the learned optimal transport. The node embeddings associated with the two graphs are learned under the guidance of the…

    Submitted 7 May, 2019; v1 submitted 17 January, 2019; originally announced January 2019.

    Journal ref: Thirty-sixth International Conference on Machine Learning 2019

  24. arXiv:1801.09456  [pdf, ps, other]

    stat.ME

    Testing normality using the summary statistics with application to meta-analysis

    Authors: Dehui Luo, Xiang Wan, Jiming Liu, Tiejun Tong

    Abstract: As the most important tool to provide high-level evidence-based medicine, researchers can statistically summarize and combine data from multiple studies by conducting meta-analysis. In meta-analysis, mean differences are frequently used effect size measurements to deal with continuous data, such as the Cohen's d statistic and Hedges' g statistic values. To calculate the mean difference based effec…

    Submitted 29 January, 2018; originally announced January 2018.

    Comments: 48 pages, 11 figures and 10 tables

  25. arXiv:1801.01267  [pdf, ps, other]

    stat.ME

    How to estimate the sample mean and standard deviation from the five number summary?

    Authors: Jiandong Shi, Dehui Luo, Hong Weng, Xian-Tao Zeng, Lu Lin, Tiejun Tong

    Abstract: In some clinical studies, researchers may report the five number summary (including the sample median, the first and third quartiles, and the minimum and maximum values) rather than the sample mean and standard deviation. To conduct meta-analysis for pooling studies, one needs to first estimate the sample mean and standard deviation from the five number summary. A number of studies have been propo…

    Submitted 4 January, 2018; originally announced January 2018.

    Comments: 18 pages, 5 figures, 2 tables

  26. arXiv:1710.05115  [pdf, other]

    stat.ML

    Benefits from Superposed Hawkes Processes

    Authors: Hongteng Xu, Dixin Luo, Xu Chen, Lawrence Carin

    Abstract: The superposition of temporal point processes has been studied for many years, although the usefulness of such models for practical applications has not been fully developed. We investigate superposed Hawkes processes as an important class of such models, with properties studied in the framework of least squares estimation. The superposition of Hawkes processes is demonstrated to be beneficial for tig…

    Submitted 14 February, 2018; v1 submitted 13 October, 2017; originally announced October 2017.

    Journal ref: AISTATS 2018

  27. arXiv:1702.07013  [pdf, other]

    cs.LG math.PR stat.ML

    Learning Hawkes Processes from Short Doubly-Censored Event Sequences

    Authors: Hongteng Xu, Dixin Luo, Hongyuan Zha

    Abstract: Many real-world applications require robust algorithms to learn point processes based on a type of incomplete data, the so-called short doubly-censored (SDC) event sequences. We study this critical problem of quantitative asynchronous event sequence analysis under the framework of Hawkes processes by leveraging the idea of data synthesis. Given SDC event sequences observed in a variety of time…

    Submitted 7 June, 2017; v1 submitted 22 February, 2017; originally announced February 2017.

  28. arXiv:1505.05687  [pdf]

    stat.ME

    Optimally estimating the sample mean from the sample size, median, mid-range and/or mid-quartile range

    Authors: Dehui Luo, Xiang Wan, Jiming Liu, Tiejun Tong

    Abstract: The era of big data is coming, and evidence-based medicine is attracting increasing attention to improve decision making in medical practice via integrating evidence from well designed and conducted clinical research. Meta-analysis is a statistical technique widely used in evidence-based medicine for analytically combining the findings from independent clinical trials to provide an overall estimat…

    Submitted 4 October, 2016; v1 submitted 21 May, 2015; originally announced May 2015.

    Comments: 21 pages, 7 figures and 7 tables

  29. arXiv:1308.1624  [pdf]

    stat.AP

    Poisson-type Multivariate Transfer Function Model Reveals Short-term Effects of Ambient Air Pollutants on Hospital Emergency room Visits for Cerebro-cardiovascular Diseases

    Authors: Menghui Li, Dasheng Luo, Chenghua Cao, Xiaochuan Pan, Qixin Wang

    Abstract: Laboratory experiments have shown that cardiovascular diseases are positively correlated to the concentration of ambient air pollutants, such as SO2, NO2, PM10, etc. It has also been repeatedly reported in many countries that increased concentration of ambient air pollutants leads to a rise in hospital emergency room visits for these diseases. These studies mainly adopt either regression analysis o…

    Submitted 7 August, 2013; originally announced August 2013.