Skip to main content

Showing 1–50 of 62 results for author: Hué, S

Searching in archive stat. Search in all archives.
.
  1. arXiv:2506.07308  [pdf, ps, other

    cs.LG stat.ML

    PASS: Private Attributes Protection with Stochastic Data Substitution

    Authors: Yizhuo Chen, Chun-Fu, Chen, Hsiang Hsu, Shaohan Hu, Tarek Abdelzaher

    Abstract: The growing Machine Learning (ML) services require extensive collections of user data, which may inadvertently include people's private information irrelevant to the services. Various studies have been proposed to protect private attributes by removing them from the data while maintaining the utilities of the data for downstream tasks. Nevertheless, as we theoretically and empirically show in the… ▽ More

    Submitted 9 July, 2025; v1 submitted 8 June, 2025; originally announced June 2025.

  2. arXiv:2504.18377  [pdf, other

    stat.CO

    Statistical Disaggregation -- a Monte Carlo Approach for Imputation under Constraints

    Authors: Shenggang Hu, Hongsheng Dai, Fanlin Meng, Louis Aslett, Murray Pollock, Gareth O. Roberts

    Abstract: Equality-constrained models naturally arise in problems in which measurements are taken at different levels of resolution. The challenge in this setting is that the models usually induce a joint distribution which is intractable. Resorting to instead sampling from the joint distribution by means of a Monte Carlo approach is also challenging. For example, a naive rejection sampling does not work wh… ▽ More

    Submitted 25 April, 2025; originally announced April 2025.

    Comments: 42 pages, 11 figures, to be published in Scandinavian Journal of Statistics

    MSC Class: 62-08 (Primary) 62D10 (Secondary)

  3. arXiv:2503.12811  [pdf, other

    cs.LG cs.AI cs.CL stat.ML

    A Multi-Power Law for Loss Curve Prediction Across Learning Rate Schedules

    Authors: Kairong Luo, Haodong Wen, Shengding Hu, Zhenbo Sun, Zhiyuan Liu, Maosong Sun, Kaifeng Lyu, Wenguang Chen

    Abstract: Training large models is both resource-intensive and time-consuming, making it crucial to understand the quantitative relationship between model performance and hyperparameters. In this paper, we present an empirical law that describes how the pretraining loss of large language models evolves under different learning rate schedules, such as constant, cosine, and step decay schedules. Our proposed… ▽ More

    Submitted 17 March, 2025; originally announced March 2025.

  4. arXiv:2412.15496  [pdf, other

    cs.LG stat.ML

    Graph Attention is Not Always Beneficial: A Theoretical Analysis of Graph Attention Mechanisms via Contextual Stochastic Block Models

    Authors: Zhongtian Ma, Qiaosheng Zhang, Bocheng Zhou, Yexin Zhang, Shuyue Hu, Zhen Wang

    Abstract: Despite the growing popularity of graph attention mechanisms, their theoretical understanding remains limited. This paper aims to explore the conditions under which these mechanisms are effective in node classification tasks through the lens of Contextual Stochastic Block Models (CSBMs). Our theoretical analysis reveals that incorporating graph attention mechanisms is \emph{not universally benefic… ▽ More

    Submitted 13 May, 2025; v1 submitted 19 December, 2024; originally announced December 2024.

    Comments: Accepted by ICML 2025

  5. arXiv:2411.14656  [pdf, other

    eess.SP cs.ET stat.AP

    mmWave Radar for Sit-to-Stand Analysis: A Comparative Study with Wearables and Kinect

    Authors: Shuting Hu, Peggy Ackun, Xiang Zhang, Siyang Cao, Jennifer Barton, Melvin G. Hector, Mindy J. Fain, Nima Toosizadeh

    Abstract: This study explores a novel approach for analyzing Sit-to-Stand (STS) movements using millimeter-wave (mmWave) radar technology. The goal is to develop a non-contact sensing, privacy-preserving, and all-day operational method for healthcare applications, including fall risk assessment. We used a 60GHz mmWave radar system to collect radar point cloud data, capturing STS motions from 45 participants… ▽ More

    Submitted 28 November, 2024; v1 submitted 21 November, 2024; originally announced November 2024.

  6. arXiv:2409.10538  [pdf, other

    stat.ML cs.LG

    Fairness in Survival Analysis with Distributionally Robust Optimization

    Authors: Shu Hu, George H. Chen

    Abstract: We propose a general approach for encouraging fairness in survival analysis models based on minimizing a worst-case error across all subpopulations that occur with at least a user-specified probability. This approach can be used to convert many existing survival analysis models into ones that simultaneously encourage fairness, without requiring the user to specify which attributes or features to t… ▽ More

    Submitted 31 August, 2024; originally announced September 2024.

    Comments: Accepted at the Journal of Machine Learning Research; this paper is a journal paper extension of our earlier Machine Learning for Health 2022 paper (arXiv:2211.10508)

  7. arXiv:2405.03734  [pdf, other

    cs.HC cs.AI stat.AP

    FOKE: A Personalized and Explainable Education Framework Integrating Foundation Models, Knowledge Graphs, and Prompt Engineering

    Authors: Silan Hu, Xiaoning Wang

    Abstract: Integrating large language models (LLMs) and knowledge graphs (KGs) holds great promise for revolutionizing intelligent education, but challenges remain in achieving personalization, interactivity, and explainability. We propose FOKE, a Forest Of Knowledge and Education framework that synergizes foundation models, knowledge graphs, and prompt engineering to address these challenges. FOKE introduce… ▽ More

    Submitted 6 May, 2024; originally announced May 2024.

  8. arXiv:2404.19292  [pdf, other

    cs.IT cs.LG cs.MA stat.ML

    Provably Efficient Information-Directed Sampling Algorithms for Multi-Agent Reinforcement Learning

    Authors: Qiaosheng Zhang, Chenjia Bai, Shuyue Hu, Zhen Wang, Xuelong Li

    Abstract: This work designs and analyzes a novel set of algorithms for multi-agent reinforcement learning (MARL) based on the principle of information-directed sampling (IDS). These algorithms draw inspiration from foundational concepts in information theory, and are proven to be sample efficient in MARL settings such as two-player zero-sum Markov games (MGs) and multi-player general-sum MGs. For episodic t… ▽ More

    Submitted 30 April, 2024; originally announced April 2024.

  9. arXiv:2404.09729  [pdf

    eess.SP cs.IT cs.LG stat.ME

    Amplitude-Phase Fusion for Enhanced Electrocardiogram Morphological Analysis

    Authors: Shuaicong Hu, Yanan Wang, Jian Liu, Jingyu Lin, Shengmei Qin, Zhenning Nie, Zhifeng Yao, Wenjie Cai, Cuiwei Yang

    Abstract: Considering the variability of amplitude and phase patterns in electrocardiogram (ECG) signals due to cardiac activity and individual differences, existing entropy-based studies have not fully utilized these two patterns and lack integration. To address this gap, this paper proposes a novel fusion entropy metric, morphological ECG entropy (MEE) for the first time, specifically designed for ECG mor… ▽ More

    Submitted 15 April, 2024; originally announced April 2024.

    Comments: 16 pages, 12 figures

    ACM Class: I.5.2

  10. arXiv:2404.03701  [pdf, other

    cs.LG stat.ML

    Predictive Analytics of Varieties of Potatoes

    Authors: Fabiana Ferracina, Bala Krishnamoorthy, Mahantesh Halappanavar, Shengwei Hu, Vidyasagar Sathuvalli

    Abstract: We explore the application of machine learning algorithms specifically to enhance the selection process of Russet potato clones in breeding trials by predicting their suitability for advancement. This study addresses the challenge of efficiently identifying high-yield, disease-resistant, and climate-resilient potato varieties that meet processing industry standards. Leveraging manually collected d… ▽ More

    Submitted 7 November, 2024; v1 submitted 3 April, 2024; originally announced April 2024.

    Comments: Minor revision; to appear in Crop Sciences

  11. arXiv:2403.05425  [pdf, ps, other

    stat.ML stat.ME

    An Adaptive Dimension Reduction Estimation Method for High-dimensional Bayesian Optimization

    Authors: Shouri Hu, Jiawei Li, Zhibo Cai

    Abstract: Bayesian optimization (BO) has shown impressive results in a variety of applications within low-to-moderate dimensional Euclidean spaces. However, extending BO to high-dimensional settings remains a significant challenge. We address this challenge by proposing a two-step optimization framework. Initially, we identify the effective dimension reduction (EDR) subspace for the objective function using… ▽ More

    Submitted 8 March, 2024; originally announced March 2024.

    Comments: First draft

  12. arXiv:2402.00728  [pdf, other

    cs.LG stat.ML

    Dropout-Based Rashomon Set Exploration for Efficient Predictive Multiplicity Estimation

    Authors: Hsiang Hsu, Guihong Li, Shaohan Hu, Chun-Fu, Chen

    Abstract: Predictive multiplicity refers to the phenomenon in which classification tasks may admit multiple competing models that achieve almost-equally-optimal performance, yet generate conflicting outputs for individual samples. This presents significant concerns, as it can potentially result in systemic exclusion, inexplicable discrimination, and unfairness in practical applications. Measuring and mitiga… ▽ More

    Submitted 1 February, 2024; originally announced February 2024.

    Comments: ICLR 2024

  13. arXiv:2309.13557  [pdf, other

    stat.CO math.NA

    Bayesian Parameter Inference for Partially Observed Diffusions using Multilevel Stochastic Runge-Kutta Methods

    Authors: Pierre Del Moral, Shulan Hu, Ajay Jasra, Hamza Ruzayqat, Xinyu Wang

    Abstract: We consider the problem of Bayesian estimation of static parameters associated to a partially and discretely observed diffusion process. We assume that the exact transition dynamics of the diffusion process are unavailable, even up-to an unbiased estimator and that one must time-discretize the diffusion process. In such scenarios it has been shown how one can introduce the multilevel Monte Carlo m… ▽ More

    Submitted 24 September, 2023; originally announced September 2023.

  14. arXiv:2309.05145  [pdf, other

    cs.LG cs.AI stat.ML

    Outlier Robust Adversarial Training

    Authors: Shu Hu, Zhenhuan Yang, Xin Wang, Yiming Ying, Siwei Lyu

    Abstract: Supervised learning models are challenged by the intrinsic complexities of training data such as outliers and minority subpopulations and intentional attacks at inference time with adversarial samples. While traditional robust learning methods and the recent adversarial training approaches are designed to handle each of the two challenges, to date, no work has been done to develop models that are… ▽ More

    Submitted 10 September, 2023; originally announced September 2023.

    Comments: Accepted by The 15th Asian Conference on Machine Learning (ACML 2023)

  15. arXiv:2308.04158  [pdf, other

    stat.ME

    A Dual Cox Model Theory And Its Applications In Oncology

    Authors: Powei Chen, Siying Hu, Haojin Zhou

    Abstract: Given the prominence of targeted therapy and immunotherapy in cancer treatment, it becomes imperative to consider heterogeneity in patients' responses to treatments, which contributes greatly to the widely used proportional hazard assumption invalidated as in several clinical trials. To address the challenge, we develop a Dual Cox model theory including a Dual Cox model and a fitting algorithm.… ▽ More

    Submitted 8 August, 2023; originally announced August 2023.

  16. arXiv:2211.10508  [pdf, other

    stat.ML cs.LG

    Distributionally Robust Survival Analysis: A Novel Fairness Loss Without Demographics

    Authors: Shu Hu, George H. Chen

    Abstract: We propose a general approach for training survival analysis models that minimizes a worst-case error across all subpopulations that are large enough (occurring with at least a user-specified minimum probability). This approach uses a training loss function that does not know any demographic information to treat as sensitive. Despite this, we demonstrate that our proposed approach often scores bet… ▽ More

    Submitted 18 November, 2022; originally announced November 2022.

    Comments: Machine Learning for Health (ML4H 2022)

  17. arXiv:2209.00383  [pdf, other

    cs.CV stat.ML

    TokenCut: Segmenting Objects in Images and Videos with Self-supervised Transformer and Normalized Cut

    Authors: Yangtao Wang, Xi Shen, Yuan Yuan, Yuming Du, Maomao Li, Shell Xu Hu, James L Crowley, Dominique Vaufreydaz

    Abstract: In this paper, we describe a graph-based algorithm that uses the features obtained by a self-supervised transformer to detect and segment salient objects in images and videos. With this approach, the image patches that compose an image or video are organised into a fully connected graph, where the edge between each pair of patches is labeled with a similarity score between patches using features l… ▽ More

    Submitted 5 December, 2023; v1 submitted 1 September, 2022; originally announced September 2022.

    Comments: arXiv admin note: text overlap with arXiv:2202.11539

  18. arXiv:2208.02627  [pdf, ps, other

    stat.ME math.ST

    Modelling multivariate extreme value distributions via Markov trees

    Authors: Shuang Hu, Zuoxiang Peng, Johan Segers

    Abstract: Multivariate extreme value distributions are a common choice for modelling multivariate extremes. In high dimensions, however, the construction of flexible and parsimonious models is challenging. We propose to combine bivariate max-stable distributions into a Markov random field with respect to a tree. Although in general not max-stable itself, this Markov tree is attracted by a multivariate max-s… ▽ More

    Submitted 24 December, 2024; v1 submitted 29 July, 2022; originally announced August 2022.

    Comments: 49 pages, 6 figures, 3 tables

    MSC Class: 62G32; 62H22

    Journal ref: Scandinavian Journal of Statistics (2024), volume 51, pages 760-800

  19. arXiv:2207.07624  [pdf, other

    cs.LG stat.ML

    Feed-Forward Latent Domain Adaptation

    Authors: Ondrej Bohdal, Da Li, Shell Xu Hu, Timothy Hospedales

    Abstract: We study a new highly-practical problem setting that enables resource-constrained edge devices to adapt a pre-trained model to their local data distributions. Recognizing that device's data are likely to come from multiple latent domains that include a mixture of unlabelled domain-relevant and domain-irrelevant examples, we focus on the comparatively under-studied problem of latent domain adaptati… ▽ More

    Submitted 31 January, 2024; v1 submitted 15 July, 2022; originally announced July 2022.

    Comments: Accepted at WACV 2024. Project page: https://ondrejbohdal.github.io/cxda

  20. arXiv:2206.13140  [pdf, other

    cs.LG stat.ML

    Compressing Features for Learning with Noisy Labels

    Authors: Yingyi Chen, Shell Xu Hu, Xi Shen, Chunrong Ai, Johan A. K. Suykens

    Abstract: Supervised learning can be viewed as distilling relevant information from input data into feature representations. This process becomes difficult when supervision is noisy as the distilled information might not be relevant. In fact, recent research shows that networks can easily overfit all labels including those that are corrupted, and hence can hardly generalize to clean datasets. In this paper,… ▽ More

    Submitted 27 June, 2022; originally announced June 2022.

    Comments: Accepted to TNNLS 2022. Project page: https://yingyichen-cyy.github.io/CompressFeatNoisyLabels/

  21. arXiv:2206.08531  [pdf, ps, other

    stat.ML cs.LG

    Reframed GES with a Neural Conditional Dependence Measure

    Authors: Xinwei Shen, Shengyu Zhu, Jiji Zhang, Shoubo Hu, Zhitang Chen

    Abstract: In a nonparametric setting, the causal structure is often identifiable only up to Markov equivalence, and for the purpose of causal inference, it is useful to learn a graphical representation of the Markov equivalence class (MEC). In this paper, we revisit the Greedy Equivalence Search (GES) algorithm, which is widely cited as a score-based algorithm for learning the MEC of the underlying causal s… ▽ More

    Submitted 16 June, 2022; originally announced June 2022.

    Comments: Accepted to UAI 2022

  22. arXiv:2206.07902  [pdf, other

    cs.LG cs.CR stat.ML

    On Privacy and Personalization in Cross-Silo Federated Learning

    Authors: Ziyu Liu, Shengyuan Hu, Zhiwei Steven Wu, Virginia Smith

    Abstract: While the application of differential privacy (DP) has been well-studied in cross-device federated learning (FL), there is a lack of work considering DP and its implications for cross-silo FL, a setting characterized by a limited number of clients each containing many data subjects. In cross-silo FL, usual notions of client-level DP are less suitable as real-world privacy regulations typically con… ▽ More

    Submitted 17 October, 2022; v1 submitted 15 June, 2022; originally announced June 2022.

    Comments: NeurIPS 2022, 37 pages

  23. arXiv:2203.11691  [pdf, other

    stat.ML cs.LG econ.EM

    GAM(L)A: An econometric model for interpretable Machine Learning

    Authors: Emmanuel Flachaire, Gilles Hacheme, Sullivan Hué, Sébastien Laurent

    Abstract: Despite their high predictive performance, random forest and gradient boosting are often considered as black boxes or uninterpretable models which has raised concerns from practitioners and regulators. As an alternative, we propose in this paper to use partial linear models that are inherently interpretable. Specifically, this article introduces GAM-lasso (GAMLA) and GAM-autometrics (GAMA), denote… ▽ More

    Submitted 17 March, 2022; originally announced March 2022.

    Comments: 47 pages, 12 tables and 7 figures

  24. arXiv:2202.11539  [pdf, other

    cs.CV stat.ML

    Self-Supervised Transformers for Unsupervised Object Discovery using Normalized Cut

    Authors: Yangtao Wang, Xi Shen, Shell Hu, Yuan Yuan, James Crowley, Dominique Vaufreydaz

    Abstract: Transformers trained with self-supervised learning using self-distillation loss (DINO) have been shown to produce attention maps that highlight salient foreground objects. In this paper, we demonstrate a graph-based approach that uses the self-supervised transformer features to discover an object from an image. Visual tokens are viewed as nodes in a weighted graph with edges representing a connect… ▽ More

    Submitted 24 March, 2022; v1 submitted 23 February, 2022; originally announced February 2022.

    Journal ref: CVPR 2022 - Conference on Computer Vision and Pattern Recognition, Jun 2022, New Orleans, United States

  25. arXiv:2106.03300  [pdf, other

    cs.LG stat.ML

    Sum of Ranked Range Loss for Supervised Learning

    Authors: Shu Hu, Yiming Ying, Xin Wang, Siwei Lyu

    Abstract: In forming learning objectives, one oftentimes needs to aggregate a set of individual values to a single output. Such cases occur in the aggregate loss, which combines individual losses of a learning model over each training sample, and in the individual loss for multi-label learning, which combines prediction scores over all class labels. In this work, we introduce the sum of ranked range (SoRR)… ▽ More

    Submitted 3 April, 2022; v1 submitted 6 June, 2021; originally announced June 2021.

    Comments: Accepted by Journal of Machine Learning Research (JMLR). arXiv admin note: text overlap with arXiv:2010.01741

  26. arXiv:2106.00925  [pdf, other

    cs.LG stat.ML

    Contrastive ACE: Domain Generalization Through Alignment of Causal Mechanisms

    Authors: Yunqi Wang, Furui Liu, Zhitang Chen, Qing Lian, Shoubo Hu, Jianye Hao, Yik-Chung Wu

    Abstract: Domain generalization aims to learn knowledge invariant across different distributions while semantically meaningful for downstream tasks from multiple source domains, to improve the model's generalization ability on unseen target domains. The fundamental objective is to understand the underlying "invariance" behind these observational distributions and such invariance has been shown to have a clo… ▽ More

    Submitted 2 June, 2021; originally announced June 2021.

  27. arXiv:2103.10912  [pdf, other

    stat.AP

    Copula Averaging for Tail Dependence in Insurance Claims Data

    Authors: Sen Hu, Adrian O'Hagan

    Abstract: Analysing dependent risks is an important task for insurance companies. A dependency is reflected in the fact that information about one random variable provides information about the likely distribution of values of another random variable. Insurance companies in particular must investigate such dependencies between different lines of business and the effects that an extreme loss event, such as a… ▽ More

    Submitted 19 March, 2021; originally announced March 2021.

  28. arXiv:2012.04221  [pdf, other

    cs.LG stat.ML

    Ditto: Fair and Robust Federated Learning Through Personalization

    Authors: Tian Li, Shengyuan Hu, Ahmad Beirami, Virginia Smith

    Abstract: Fairness and robustness are two important concerns for federated learning systems. In this work, we identify that robustness to data and model poisoning attacks and fairness, measured as the uniformity of performance across devices, are competing constraints in statistically heterogeneous networks. To address these constraints, we propose employing a simple, general framework for personalized fede… ▽ More

    Submitted 15 June, 2021; v1 submitted 8 December, 2020; originally announced December 2020.

    Comments: Accepted by ICML 2021

  29. arXiv:2010.01741  [pdf, other

    cs.LG stat.ML

    Learning by Minimizing the Sum of Ranked Range

    Authors: Shu Hu, Yiming Ying, Xin Wang, Siwei Lyu

    Abstract: In forming learning objectives, one oftentimes needs to aggregate a set of individual values to a single output. Such cases occur in the aggregate loss, which combines individual losses of a learning model over each training sample, and in the individual loss for multi-label learning, which combines prediction scores over all class labels. In this work, we introduce the sum of ranked range (SoRR)… ▽ More

    Submitted 4 October, 2020; originally announced October 2020.

    Comments: Accepted by Thirty-fourth Conference on Neural Information Processing Systems (NeurIPS 2020)

  30. arXiv:2009.04197   

    cs.LG cs.MA stat.ML

    QR-MIX: Distributional Value Function Factorisation for Cooperative Multi-Agent Reinforcement Learning

    Authors: Jian Hu, Seth Austin Harding, Haibin Wu, Siyue Hu, Shih-wei Liao

    Abstract: In Cooperative Multi-Agent Reinforcement Learning (MARL) and under the setting of Centralized Training with Decentralized Execution (CTDE), agents observe and interact with their environment locally and independently. With local observation and random sampling, the randomness in rewards and observations leads to randomness in long-term returns. Existing methods such as Value Decomposition Network… ▽ More

    Submitted 23 February, 2021; v1 submitted 9 September, 2020; originally announced September 2020.

    Comments: There are some experimental errors and experimental unfairness in this paper that will seriously affect the later studies

  31. arXiv:2009.01272  [pdf, other

    cs.LG stat.ML

    Understanding the wiring evolution in differentiable neural architecture search

    Authors: Sirui Xie, Shoukang Hu, Xinjiang Wang, Chunxiao Liu, Jianping Shi, Xunying Liu, Dahua Lin

    Abstract: Controversy exists on whether differentiable neural architecture search methods discover wiring topology effectively. To understand how wiring topology evolves, we study the underlying mechanism of several existing differentiable NAS frameworks. Our investigation is motivated by three observed searching patterns of differentiable NAS: 1) they search by growing instead of pruning; 2) wider networks… ▽ More

    Submitted 25 February, 2021; v1 submitted 2 September, 2020; originally announced September 2020.

    Comments: AISTATS 2021

  32. arXiv:2006.13681  [pdf, other

    cs.CV cs.LG stat.ML

    Multi-view Drone-based Geo-localization via Style and Spatial Alignment

    Authors: Siyi Hu, Xiaojun Chang

    Abstract: In this paper, we focus on the task of multi-view multi-source geo-localization, which serves as an important auxiliary method of GPS positioning by matching drone-view image and satellite-view image with pre-annotated GPS tag. To solve this problem, most existing methods adopt metric loss with an weighted classification block to force the generation of common feature space shared by different vie… ▽ More

    Submitted 8 July, 2020; v1 submitted 23 June, 2020; originally announced June 2020.

    Comments: 9 pages 9 figures. arXiv admin note: text overlap with arXiv:2002.12186 by other authors

    ACM Class: I.4.7; I.2.10

  33. arXiv:2006.13463  [pdf, other

    cs.LG cs.AI stat.ML

    Graph Policy Network for Transferable Active Learning on Graphs

    Authors: Shengding Hu, Zheng Xiong, Meng Qu, Xingdi Yuan, Marc-Alexandre Côté, Zhiyuan Liu, Jian Tang

    Abstract: Graph neural networks (GNNs) have been attracting increasing popularity due to their simplicity and effectiveness in a variety of fields. However, a large number of labeled data is generally required to train these networks, which could be very expensive to obtain in some domains. In this paper, we study active learning for GNNs, i.e., how to efficiently label the nodes on a graph to reduce the an… ▽ More

    Submitted 23 October, 2020; v1 submitted 24 June, 2020; originally announced June 2020.

    ACM Class: I.2

  34. arXiv:2006.07856  [pdf, other

    cs.LG stat.ML

    The OARF Benchmark Suite: Characterization and Implications for Federated Learning Systems

    Authors: Sixu Hu, Yuan Li, Xu Liu, Qinbin Li, Zhaomin Wu, Bingsheng He

    Abstract: This paper presents and characterizes an Open Application Repository for Federated Learning (OARF), a benchmark suite for federated machine learning systems. Previously available benchmarks for federated learning have focused mainly on synthetic datasets and use a limited number of applications. OARF mimics more realistic application scenarios with publicly available data sets as different data si… ▽ More

    Submitted 2 March, 2022; v1 submitted 14 June, 2020; originally announced June 2020.

    Comments: ACM Transactions on Intelligent Systems and Technology, Vol. 13, No. 4, Article 63

  35. arXiv:2006.04877  [pdf, other

    stat.ME cs.LG stat.CO

    A Causal Direction Test for Heterogeneous Populations

    Authors: Vahid Partovi Nia, Xinlin Li, Masoud Asgharian, Shoubo Hu, Zhitang Chen, Yanhui Geng

    Abstract: A probabilistic expert system emulates the decision-making ability of a human expert through a directional graphical model. The first step in building such systems is to understand data generation mechanism. To this end, one may try to decompose a multivariate distribution into product of several conditionals, and evolving a blackbox machine learning predictive models towards transparent cause-and… ▽ More

    Submitted 27 September, 2021; v1 submitted 8 June, 2020; originally announced June 2020.

    MSC Class: 62D20; 62H30

  36. arXiv:2005.00667  [pdf

    stat.AP

    Data-Driven Modeling Reveals the Impact of Stay-at-Home Orders on Human Mobility during the COVID-19 Pandemic in the U.S

    Authors: Chenfeng Xiong, Songhua Hu, Mofeng Yang, Hannah N Younes, Weiyu Luo, Sepehr Ghader, Lei Zhang

    Abstract: One approach to delay the spread of the novel coronavirus (COVID-19) is to reduce human travel by imposing travel restriction policies. It is yet unclear how effective those policies are on suppressing the mobility trend due to the lack of ground truth and large-scale dataset describing human mobility during the pandemic. This study uses real-world location-based service data collected from anonym… ▽ More

    Submitted 4 May, 2020; v1 submitted 1 May, 2020; originally announced May 2020.

  37. arXiv:2004.12696  [pdf, other

    cs.LG stat.ML

    Empirical Bayes Transductive Meta-Learning with Synthetic Gradients

    Authors: Shell Xu Hu, Pablo G. Moreno, Yang Xiao, Xi Shen, Guillaume Obozinski, Neil D. Lawrence, Andreas Damianou

    Abstract: We propose a meta-learning approach that learns from multiple tasks in a transductive setting, by leveraging the unlabeled query set in addition to the support set to generate a more powerful model for each task. To develop our framework, we revisit the empirical Bayes formulation for multi-task learning. The evidence lower bound of the marginal log-likelihood of empirical Bayes decomposes as a su… ▽ More

    Submitted 27 April, 2020; originally announced April 2020.

    Comments: ICLR 2020

  38. arXiv:2002.09128  [pdf, other

    cs.LG stat.ML

    DSNAS: Direct Neural Architecture Search without Parameter Retraining

    Authors: Shoukang Hu, Sirui Xie, Hehui Zheng, Chunxiao Liu, Jianping Shi, Xunying Liu, Dahua Lin

    Abstract: If NAS methods are solutions, what is the problem? Most existing NAS methods require two-stage parameter optimization. However, performance of the same architecture in the two stages correlates poorly. In this work, we propose a new problem definition for NAS, task-specific end-to-end, based on this observation. We argue that given a computer vision task for which a NAS method is expected, this de… ▽ More

    Submitted 31 March, 2020; v1 submitted 20 February, 2020; originally announced February 2020.

    Comments: To appear in CVPR 2020

  39. arXiv:2002.05582  [pdf, other

    cs.LG stat.ML

    Learning to Predict Error for MRI Reconstruction

    Authors: Shi Hu, Nicola Pezzotti, Max Welling

    Abstract: In healthcare applications, predictive uncertainty has been used to assess predictive accuracy. In this paper, we demonstrate that predictive uncertainty estimated by the current methods does not highly correlate with prediction error by decomposing the latter into random and systematic errors, and showing that the former is equivalent to the variance of the random error. In addition, we observe t… ▽ More

    Submitted 6 July, 2021; v1 submitted 13 February, 2020; originally announced February 2020.

    Comments: Accepted to MICCAI 2021

  40. arXiv:1910.07629  [pdf, other

    cs.LG cs.CR stat.ML

    A New Defense Against Adversarial Images: Turning a Weakness into a Strength

    Authors: Tao Yu, Shengyuan Hu, Chuan Guo, Wei-Lun Chao, Kilian Q. Weinberger

    Abstract: Natural images are virtually surrounded by low-density misclassified regions that can be efficiently discovered by gradient-guided search --- enabling the generation of adversarial images. While many techniques for detecting these attacks have been proposed, they are easily bypassed when the adversary has full knowledge of the detection mechanism and adapts the attack strategy accordingly. In this… ▽ More

    Submitted 3 December, 2019; v1 submitted 16 October, 2019; originally announced October 2019.

    Comments: NeurIPS 2019, 14 pages

  41. arXiv:1907.11216  [pdf, other

    stat.ML cs.LG

    Domain Generalization via Multidomain Discriminant Analysis

    Authors: Shoubo Hu, Kun Zhang, Zhitang Chen, Laiwan Chan

    Abstract: Domain generalization (DG) aims to incorporate knowledge from multiple source domains into a single model that could generalize well on unseen target domains. This problem is ubiquitous in practice since the distributions of the target data may rarely be identical to those of the source data. In this paper, we propose Multidomain Discriminant Analysis (MDA) to address DG of classification tasks in… ▽ More

    Submitted 25 July, 2019; originally announced July 2019.

    Comments: UAI 2019

  42. arXiv:1907.09693  [pdf, other

    cs.LG cs.CR cs.DB stat.ML

    A Survey on Federated Learning Systems: Vision, Hype and Reality for Data Privacy and Protection

    Authors: Qinbin Li, Zeyi Wen, Zhaomin Wu, Sixu Hu, Naibo Wang, Yuan Li, Xu Liu, Bingsheng He

    Abstract: Federated learning has been a hot research topic in enabling the collaborative training of machine learning models among different organizations under the privacy restrictions. As researchers try to support more machine learning models with different privacy-preserving approaches, there is a requirement in developing systems and infrastructures to ease the development of various federated learning… ▽ More

    Submitted 4 December, 2021; v1 submitted 23 July, 2019; originally announced July 2019.

    Comments: Accepted to IEEE Transactions on Knowledge and Data Engineering (TKDE)

  43. arXiv:1907.01949  [pdf, other

    cs.LG cs.CV stat.ML

    Supervised Uncertainty Quantification for Segmentation with Multiple Annotations

    Authors: Shi Hu, Daniel Worrall, Stefan Knegt, Bas Veeling, Henkjan Huisman, Max Welling

    Abstract: The accurate estimation of predictive uncertainty carries importance in medical scenarios such as lung node segmentation. Unfortunately, most existing works on predictive uncertainty do not return calibrated uncertainty estimates, which could be used in practice. In this work we exploit multi-grader annotation variability as a source of 'groundtruth' aleatoric uncertainty, which can be treated as… ▽ More

    Submitted 27 May, 2022; v1 submitted 3 July, 2019; originally announced July 2019.

    Comments: MICCAI 2019. Fixed a few typos

  44. Topological Techniques in Model Selection

    Authors: Shaoxiong Hu, Hugo Maruri-Aguliar, Zixiang Ma

    Abstract: The LASSO is an attractive regularisation method for linear regression that combines variable selection with an efficient computation procedure. This paper is concerned with enhancing the performance of LASSO for square-free hierarchical polynomial models when combining validation error with a measure of model complexity. The measure of the complexity is the sum of Betti numbers of the model which… ▽ More

    Submitted 29 May, 2019; originally announced May 2019.

    Journal ref: Alg. Stat. 13 (2022) 41-56

  45. arXiv:1904.04699  [pdf, other

    stat.AP

    Bivariate Gamma Mixture of Experts Models for Joint Insurance Claims Modeling

    Authors: Sen Hu, T Brendan Murphy, Adrian O'Hagan

    Abstract: In general insurance, risks from different categories are often modeled independently and their sum is regarded as the total risk the insurer takes on in exchange for a premium. The dependence from multiple risks is generally neglected even when correlation could exist, for example a single car accident may result in claims from multiple risk categories. It is desirable to take the covariance of d… ▽ More

    Submitted 9 April, 2019; originally announced April 2019.

  46. STFNets: Learning Sensing Signals from the Time-Frequency Perspective with Short-Time Fourier Neural Networks

    Authors: Shuochao Yao, Ailing Piao, Wenjun Jiang, Yiran Zhao, Huajie Shao, Shengzhong Liu, Dongxin Liu, Jinyang Li, Tianshi Wang, Shaohan Hu, Lu Su, Jiawei Han, Tarek Abdelzaher

    Abstract: Recent advances in deep learning motivate the use of deep neural networks in Internet-of-Things (IoT) applications. These networks are modelled after signal processing in the human brain, thereby leading to significant advantages at perceptual tasks such as vision and speech recognition. IoT applications, however, often measure physical phenomena, where the underlying physics (such as inertia, wir… ▽ More

    Submitted 20 February, 2019; originally announced February 2019.

  47. arXiv:1812.11027  [pdf, other

    cs.LG stat.ML

    Exploring Weight Symmetry in Deep Neural Networks

    Authors: Xu Shell Hu, Sergey Zagoruyko, Nikos Komodakis

    Abstract: We propose to impose symmetry in neural network parameters to improve parameter usage and make use of dedicated convolution and matrix multiplication routines. Due to significant reduction in the number of parameters as a result of the symmetry constraints, one would expect a dramatic drop in accuracy. Surprisingly, we show that this is not the case, and, depending on network size, symmetry can ha… ▽ More

    Submitted 10 January, 2019; v1 submitted 28 December, 2018; originally announced December 2018.

  48. arXiv:1812.08434  [pdf

    cs.LG cs.AI stat.ML

    Graph Neural Networks: A Review of Methods and Applications

    Authors: Jie Zhou, Ganqu Cui, Shengding Hu, Zhengyan Zhang, Cheng Yang, Zhiyuan Liu, Lifeng Wang, Changcheng Li, Maosong Sun

    Abstract: Lots of learning tasks require dealing with graph data which contains rich relation information among elements. Modeling physics systems, learning molecular fingerprints, predicting protein interface, and classifying diseases demand a model to learn from graph inputs. In other domains such as learning from non-structural data like texts and images, reasoning on extracted structures (like the depen… ▽ More

    Submitted 6 October, 2021; v1 submitted 20 December, 2018; originally announced December 2018.

    Comments: Published at AI Open 2021

  49. arXiv:1809.08568  [pdf, other

    stat.ML cs.AI cs.LG

    Causal Inference and Mechanism Clustering of A Mixture of Additive Noise Models

    Authors: Shoubo Hu, Zhitang Chen, Vahid Partovi Nia, Laiwan Chan, Yanhui Geng

    Abstract: The inference of the causal relationship between a pair of observed variables is a fundamental problem in science, and most existing approaches are based on one single causal model. In practice, however, observations are often collected from multiple sources with heterogeneous causal models due to certain uncontrollable factors, which renders causal analysis results obtained by a single model skep… ▽ More

    Submitted 11 November, 2018; v1 submitted 23 September, 2018; originally announced September 2018.

    Comments: Published at NIPS 2018

  50. A Kernel Embedding-based Approach for Nonstationary Causal Model Inference

    Authors: Shoubo Hu, Zhitang Chen, Laiwan Chan

    Abstract: Although nonstationary data are more common in the real world, most existing causal discovery methods do not take nonstationarity into consideration. In this letter, we propose a kernel embedding-based approach, ENCI, for nonstationary causal model inference where data are collected from multiple domains with varying distributions. In ENCI, we transform the complicated relation of a cause-effect p… ▽ More

    Submitted 23 September, 2018; originally announced September 2018.

    Comments: Published at Neural Computation

    Journal ref: Neural computation, 30(5), 1394-1425, 2018