Skip to main content

Showing 1–30 of 30 results for author: Hsu, H

Searching in archive stat. Search in all archives.
.
  1. arXiv:2506.07308  [pdf, ps, other

    cs.LG stat.ML

    PASS: Private Attributes Protection with Stochastic Data Substitution

    Authors: Yizhuo Chen, Chun-Fu, Chen, Hsiang Hsu, Shaohan Hu, Tarek Abdelzaher

    Abstract: The growing Machine Learning (ML) services require extensive collections of user data, which may inadvertently include people's private information irrelevant to the services. Various studies have been proposed to protect private attributes by removing them from the data while maintaining the utilities of the data for downstream tasks. Nevertheless, as we theoretically and empirically show in the… ▽ More

    Submitted 8 June, 2025; originally announced June 2025.

  2. arXiv:2404.10728  [pdf, other

    cs.LG stat.ML

    Randomized Exploration in Cooperative Multi-Agent Reinforcement Learning

    Authors: Hao-Lun Hsu, Weixin Wang, Miroslav Pajic, Pan Xu

    Abstract: We present the first study on provably efficient randomized exploration in cooperative multi-agent reinforcement learning (MARL). We propose a unified algorithm framework for randomized exploration in parallel Markov Decision Processes (MDPs), and two Thompson Sampling (TS)-type algorithms, CoopTS-PHE and CoopTS-LMC, incorporating the perturbed-history exploration (PHE) strategy and the Langevin M… ▽ More

    Submitted 3 March, 2025; v1 submitted 16 April, 2024; originally announced April 2024.

    Comments: 66 pages, 14 figures, 6 table. Hao-Lun Hsu and Weixin Wang contributed equally to this work. Published in Proc. of the 38th Conference on Advances in Neural Information Processing Systems (NeurIPS 2024)

  3. arXiv:2402.00728  [pdf, other

    cs.LG stat.ML

    Dropout-Based Rashomon Set Exploration for Efficient Predictive Multiplicity Estimation

    Authors: Hsiang Hsu, Guihong Li, Shaohan Hu, Chun-Fu, Chen

    Abstract: Predictive multiplicity refers to the phenomenon in which classification tasks may admit multiple competing models that achieve almost-equally-optimal performance, yet generate conflicting outputs for individual samples. This presents significant concerns, as it can potentially result in systemic exclusion, inexplicable discrimination, and unfairness in practical applications. Measuring and mitiga… ▽ More

    Submitted 1 February, 2024; originally announced February 2024.

    Comments: ICLR 2024

  4. arXiv:2312.15549  [pdf, other

    cs.LG cs.MA math.ST stat.ML

    Finite-Time Frequentist Regret Bounds of Multi-Agent Thompson Sampling on Sparse Hypergraphs

    Authors: Tianyuan Jin, Hao-Lun Hsu, William Chang, Pan Xu

    Abstract: We study the multi-agent multi-armed bandit (MAMAB) problem, where $m$ agents are factored into $ρ$ overlapping groups. Each group represents a hyperedge, forming a hypergraph over the agents. At each round of interaction, the learner pulls a joint arm (composed of individual arms for each agent) and receives a reward according to the hypergraph structure. Specifically, we assume there is a local… ▽ More

    Submitted 24 December, 2023; originally announced December 2023.

    Comments: 22 pages, 7 figures, 2 tables. To appear in the proceedings of the 38th Annual AAAI Conference on Artificial Intelligence (AAAI'2024)

  5. arXiv:2302.14517  [pdf, other

    cs.LG cs.CR cs.CY stat.ML

    Arbitrary Decisions are a Hidden Cost of Differentially Private Training

    Authors: Bogdan Kulynych, Hsiang Hsu, Carmela Troncoso, Flavio P. Calmon

    Abstract: Mechanisms used in privacy-preserving machine learning often aim to guarantee differential privacy (DP) during model training. Practical DP-ensuring training methods use randomization when fitting model parameters to privacy-sensitive data (e.g., adding Gaussian noise to clipped gradients). We demonstrate that such randomization incurs predictive multiplicity: for a given input example, the output… ▽ More

    Submitted 15 May, 2023; v1 submitted 28 February, 2023; originally announced February 2023.

    Comments: To appear in ACM FAccT 2023

  6. arXiv:2210.15575  [pdf, other

    cs.LG cs.AI stat.ML

    A Graph Is More Than Its Nodes: Towards Structured Uncertainty-Aware Learning on Graphs

    Authors: Hans Hao-Hsun Hsu, Yuesong Shen, Daniel Cremers

    Abstract: Current graph neural networks (GNNs) that tackle node classification on graphs tend to only focus on nodewise scores and are solely evaluated by nodewise metrics. This limits uncertainty estimation on graphs since nodewise marginals do not fully characterize the joint distribution given the graph structure. In this work, we propose novel edgewise metrics, namely the edgewise expected calibration e… ▽ More

    Submitted 27 October, 2022; originally announced October 2022.

    Comments: Presented at NeurIPS 2022 New Frontiers in Graph Learning Workshop (NeurIPS GLFrontiers 2022)

  7. arXiv:2206.01295  [pdf, other

    cs.LG cs.IT stat.ML

    Rashomon Capacity: A Metric for Predictive Multiplicity in Classification

    Authors: Hsiang Hsu, Flavio du Pin Calmon

    Abstract: Predictive multiplicity occurs when classification models with statistically indistinguishable performances assign conflicting predictions to individual samples. When used for decision-making in applications of consequence (e.g., lending, education, criminal justice), models developed without regard for predictive multiplicity may result in unjustified and arbitrary decisions for specific individu… ▽ More

    Submitted 19 October, 2022; v1 submitted 2 June, 2022; originally announced June 2022.

    Comments: NeurIPS 2022 camera-ready version (34 pages, 23 figures, 2 tables)

  8. arXiv:2202.03881  [pdf, other

    cs.LG stat.ML

    Robust Hybrid Learning With Expert Augmentation

    Authors: Antoine Wehenkel, Jens Behrmann, Hsiang Hsu, Guillermo Sapiro, Gilles Louppe, Jörn-Henrik Jacobsen

    Abstract: Hybrid modelling reduces the misspecification of expert models by combining them with machine learning (ML) components learned from data. Similarly to many ML algorithms, hybrid model performance guarantees are limited to the training distribution. Leveraging the insight that the expert model is usually valid even outside the training domain, we overcome this limitation by introducing a hybrid dat… ▽ More

    Submitted 11 April, 2023; v1 submitted 8 February, 2022; originally announced February 2022.

    Journal ref: Transaction on Machine Learning Research, 2023

  9. arXiv:2201.08302  [pdf, other

    stat.CO stat.AP

    The R Package HCV for Hierarchical Clustering from Vertex-links

    Authors: ShengLi Tzeng, Hao-Yun Hsu

    Abstract: The HCV package implements the hierarchical clustering for spatial data. It requires clustering results not only homogeneous in non-geographical features among samples but also geographically close to each other within a cluster. We modified typically used hierarchical agglomerative clustering algorithms to introduce the spatial homogeneity, by considering geographical locations as vertices and co… ▽ More

    Submitted 13 January, 2022; originally announced January 2022.

    Comments: 12 pages, 7 figures

  10. arXiv:2006.15229  [pdf, other

    cs.LG stat.ML

    CheXpert++: Approximating the CheXpert labeler for Speed,Differentiability, and Probabilistic Output

    Authors: Matthew B. A. McDermott, Tzu Ming Harry Hsu, Wei-Hung Weng, Marzyeh Ghassemi, Peter Szolovits

    Abstract: It is often infeasible or impossible to obtain ground truth labels for medical data. To circumvent this, one may build rule-based or other expert-knowledge driven labelers to ingest data and yield silver labels absent any ground-truth training data. One popular such labeler is CheXpert, a labeler that produces diagnostic labels for chest X-ray radiology reports. CheXpert is very useful, but is rel… ▽ More

    Submitted 26 June, 2020; originally announced June 2020.

    Comments: To appear at MLHC 2020

  11. arXiv:2006.07326  [pdf, other

    cs.LG cs.CV stat.ML

    CPR: Classifier-Projection Regularization for Continual Learning

    Authors: Sungmin Cha, Hsiang Hsu, Taebaek Hwang, Flavio P. Calmon, Taesup Moon

    Abstract: We propose a general, yet simple patch that can be applied to existing regularization-based continual learning methods called classifier-projection regularization (CPR). Inspired by both recent results on neural networks with wide local minima and information theory, CPR adds an additional regularization term that maximizes the entropy of a classifier's output probability. We demonstrate that this… ▽ More

    Submitted 19 April, 2021; v1 submitted 12 June, 2020; originally announced June 2020.

    Comments: ICLR 2021 camera ready version

  12. arXiv:2005.09218  [pdf, other

    cs.LG stat.ML

    Large Margin Mechanism and Pseudo Query Set on Cross-Domain Few-Shot Learning

    Authors: Jia-Fong Yeh, Hsin-Ying Lee, Bing-Chen Tsai, Yi-Rong Chen, Ping-Chia Huang, Winston H. Hsu

    Abstract: In recent years, few-shot learning problems have received a lot of attention. While methods in most previous works were trained and tested on datasets in one single domain, cross-domain few-shot learning is a brand-new branch of few-shot learning problems, where models handle datasets in different domains between training and testing phases. In this paper, to solve the problem that the model is pr… ▽ More

    Submitted 6 February, 2024; v1 submitted 19 May, 2020; originally announced May 2020.

    Comments: Full version of the CDFSL competition report (in CVPRW'20), archived

  13. arXiv:2005.02123  [pdf, other

    cs.CV cs.LG stat.ML

    Expanding Sparse Guidance for Stereo Matching

    Authors: Yu-Kai Huang, Yueh-Cheng Liu, Tsung-Han Wu, Hung-Ting Su, Winston H. Hsu

    Abstract: The performance of image based stereo estimation suffers from lighting variations, repetitive patterns and homogeneous appearance. Moreover, to achieve good performance, stereo supervision requires sufficient densely-labeled data, which are hard to obtain. In this work, we leverage small amount of data with very sparse but accurate disparity cues from LiDAR to bridge the gap. We propose a novel sp… ▽ More

    Submitted 24 April, 2020; originally announced May 2020.

  14. arXiv:2003.08082  [pdf, other

    cs.LG cs.CV stat.ML

    Federated Visual Classification with Real-World Data Distribution

    Authors: Tzu-Ming Harry Hsu, Hang Qi, Matthew Brown

    Abstract: Federated Learning enables visual models to be trained on-device, bringing advantages for user privacy (data need never leave the device), but challenges in terms of data diversity and quality. Whilst typical models in the datacenter are trained using data that are independent and identically distributed (IID), data at source are typically far from IID. Furthermore, differing quantities of data ar… ▽ More

    Submitted 17 July, 2020; v1 submitted 18 March, 2020; originally announced March 2020.

  15. arXiv:2002.04788  [pdf, other

    cs.LG cs.CY cs.IT stat.ML

    To Split or Not to Split: The Impact of Disparate Treatment in Classification

    Authors: Hao Wang, Hsiang Hsu, Mario Diaz, Flavio P. Calmon

    Abstract: Disparate treatment occurs when a machine learning model yields different decisions for individuals based on a sensitive attribute (e.g., age, sex). In domains where prediction accuracy is paramount, it could potentially be acceptable to fit a model which exhibits disparate treatment. To evaluate the effect of disparate treatment, we compare the performance of split classifiers (i.e., classifiers… ▽ More

    Submitted 13 April, 2022; v1 submitted 11 February, 2020; originally announced February 2020.

  16. arXiv:1910.08109  [pdf, other

    cs.IT cs.LG stat.ML

    Obfuscation via Information Density Estimation

    Authors: Hsiang Hsu, Shahab Asoodeh, Flavio du Pin Calmon

    Abstract: Identifying features that leak information about sensitive attributes is a key challenge in the design of information obfuscation mechanisms. In this paper, we propose a framework to identify information-leaking features via information density estimation. Here, features whose information densities exceed a pre-defined threshold are deemed information-leaking features. Once these features are iden… ▽ More

    Submitted 17 October, 2019; originally announced October 2019.

    Comments: 24 pages, 3 figures

  17. arXiv:1909.06335  [pdf, other

    cs.LG cs.CV stat.ML

    Measuring the Effects of Non-Identical Data Distribution for Federated Visual Classification

    Authors: Tzu-Ming Harry Hsu, Hang Qi, Matthew Brown

    Abstract: Federated Learning enables visual models to be trained in a privacy-preserving way using real-world data from mobile devices. Given their distributed nature, the statistics of the data across these devices is likely to differ significantly. In this work, we look at the effect such non-identical data distributions has on visual classification via Federated Learning. We propose a way to synthesize d… ▽ More

    Submitted 13 September, 2019; originally announced September 2019.

  18. arXiv:1908.08990  [pdf, other

    cs.CV cs.LG stat.ML

    Deep Multi-Kernel Convolutional LSTM Networks and an Attention-Based Mechanism for Videos

    Authors: Sebastian Agethen, Winston H. Hsu

    Abstract: Action recognition greatly benefits motion understanding in video analysis. Recurrent networks such as long short-term memory (LSTM) networks are a popular choice for motion-aware sequence learning tasks. Recently, a convolutional extension of LSTM was proposed, in which input-to-hidden and hidden-to-hidden transitions are modeled through convolution with a single kernel. This implies an unavoidab… ▽ More

    Submitted 30 July, 2019; originally announced August 2019.

  19. arXiv:1907.07768  [pdf, other

    cs.IR cs.CR cs.LG cs.SI stat.ML

    A Novel Approach for Detection and Ranking of Trendy and Emerging Cyber Threat Events in Twitter Streams

    Authors: Avishek Bose, Vahid Behzadan, Carlos Aguirre, William H. Hsu

    Abstract: We present a new machine learning and text information extraction approach to detection of cyber threat events in Twitter that are novel (previously non-extant) and developing (marked by significance with respect to similarity with a previously detected event). While some existing approaches to event detection measure novelty and trendiness, typically as independent criteria and occasionally as a… ▽ More

    Submitted 12 July, 2019; originally announced July 2019.

    Comments: 9 pages, 3 figures, and 5 tables

  20. arXiv:1907.02907  [pdf, other

    stat.ML cs.LG

    Hybridized Threshold Clustering for Massive Data

    Authors: Jianmei Luo, ChandraVyas Annakula, Aruna Sai Kannamareddy, Jasjeet S. Sekhon, William Henry Hsu, Michael Higgins

    Abstract: As the size $n$ of datasets become massive, many commonly-used clustering algorithms (for example, $k$-means or hierarchical agglomerative clustering (HAC) require prohibitive computational cost and memory. In this paper, we propose a solution to these clustering problems by extending threshold clustering (TC) to problems of instance selection. TC is a recently developed clustering algorithm desig… ▽ More

    Submitted 5 July, 2019; originally announced July 2019.

  21. arXiv:1904.12417  [pdf, other

    stat.AP stat.ME

    Kernel Machine and Distributed Lag Models for Assessing Windows of Susceptibility to Environmental Mixtures in Children's Health Studies

    Authors: Ander Wilson, Hsiao-Hsien Leon Hsu, Yueh-Hsiu Mathilda Chiu, Robert O. Wright, Rosalind J. Wright, Brent A. Coull

    Abstract: Exposures to environmental chemicals during gestation can alter health status later in life. Most studies of maternal exposure to chemicals during pregnancy have focused on a single chemical exposure observed at high temporal resolution. Recent research has turned to focus on exposure to mixtures of multiple chemicals, generally observed at a single time point. We consider statistical methods for… ▽ More

    Submitted 21 September, 2021; v1 submitted 28 April, 2019; originally announced April 2019.

    Journal ref: Ann. Appl. Stat. 16(2): 1090-1110 (June 2022)

  22. arXiv:1902.07828  [pdf, other

    stat.ML cs.IT cs.LG

    Correspondence Analysis Using Neural Networks

    Authors: Hsiang Hsu, Salman Salamatian, Flavio P. Calmon

    Abstract: Correspondence analysis (CA) is a multivariate statistical tool used to visualize and interpret data dependencies. CA has found applications in fields ranging from epidemiology to social sciences. However, current methods used to perform CA do not scale to large, high-dimensional datasets. By re-interpreting the objective in CA using an information-theoretic tool called the principal inertia compo… ▽ More

    Submitted 20 February, 2019; originally announced February 2019.

    Comments: Accepted to AISTATS 2019. Overlaps with arXiv:1806.08449

  23. arXiv:1812.01105  [pdf, other

    cs.CY cs.LG stat.ML

    Correspondence Analysis of Government Expenditure Patterns

    Authors: Hsiang Hsu, Flavio P. Calmon, José Cândido Silveira Santos Filho, Andre P. Calmon, Salman Salamatian

    Abstract: We analyze expenditure patterns of discretionary funds by Brazilian congress members. This analysis is based on a large dataset containing over $7$ million expenses made publicly available by the Brazilian government. This dataset has, up to now, remained widely untouched by machine learning methods. Our main contributions are two-fold: (i) we provide a novel dataset benchmark for machine learning… ▽ More

    Submitted 29 November, 2018; originally announced December 2018.

    Comments: Presented at NIPS 2018 Workshop on Machine Learning for the Developing World

  24. arXiv:1806.08449  [pdf, other

    cs.LG cs.IT stat.ML

    Generalizing Correspondence Analysis for Applications in Machine Learning

    Authors: Hsiang Hsu, Salman Salamatian, Flavio P. Calmon

    Abstract: Correspondence analysis (CA) is a multivariate statistical tool used to visualize and interpret data dependencies by finding maximally correlated embeddings of pairs of random variables. CA has found applications in fields ranging from epidemiology to social sciences; however, current methods do not scale to large, high-dimensional datasets. In this paper, we provide a novel interpretation of CA i… ▽ More

    Submitted 27 June, 2020; v1 submitted 21 June, 2018; originally announced June 2018.

    Comments: 30 pages, 7 figures, 6 tables. arXiv admin note: text overlap with arXiv:1902.07828

  25. arXiv:1802.00243  [pdf, other

    stat.ML

    Greedy Active Learning Algorithm for Logistic Regression Models

    Authors: Hsiang-Ling Hsu, Yuan-Chin Ivan Chang, Ray-Bing Chen

    Abstract: We study a logistic model-based active learning procedure for binary classification problems, in which we adopt a batch subject selection strategy with a modified sequential experimental design method. Moreover, accompanying the proposed subject selection scheme, we simultaneously conduct a greedy variable selection procedure such that we can update the classification model with all labeled traini… ▽ More

    Submitted 1 February, 2018; originally announced February 2018.

  26. arXiv:1710.05751  [pdf, other

    stat.ML

    Time Series Prediction : Predicting Stock Price

    Authors: Aaron Elliot, Cheng Hua Hsu

    Abstract: Time series forecasting is widely used in a multitude of domains. In this paper, we present four models to predict the stock price using the SPX index as input time series data. The martingale and ordinary linear models require the strongest assumption in stationarity which we use as baseline models. The generalized linear model requires lesser assumptions but is unable to outperform the martingal… ▽ More

    Submitted 19 October, 2017; v1 submitted 16 October, 2017; originally announced October 2017.

    Comments: Under advisement of Dr. Sang Kim, for his class CS542. Additional author unnamed

    MSC Class: 62-07

  27. arXiv:1709.06489  [pdf, ps, other

    q-bio.GN cs.LG q-bio.QM stat.ML

    Accurate Genomic Prediction Of Human Height

    Authors: Louis Lello, Steven G. Avery, Laurent Tellier, Ana Vazquez, Gustavo de los Campos, Stephen D. H. Hsu

    Abstract: We construct genomic predictors for heritable and extremely complex human quantitative traits (height, heel bone density, and educational attainment) using modern methods in high dimensional statistics (i.e., machine learning). Replication tests show that these predictors capture, respectively, $\sim$40, 20, and 9 percent of total variance for the three traits. For example, predicted heights corre… ▽ More

    Submitted 19 September, 2017; originally announced September 2017.

    Comments: 17 pages, 10 figures

  28. Bayesian Distributed Lag Interaction Models to Identify Perinatal Windows of Vulnerability in Children's Health

    Authors: Ander Wilson, Yueh-Hsiu Mathilda Chiu, Hsiao-Hsien Leon Hsu, Robert O. Wright, Rosalind J. Wright, Brent A. Coull

    Abstract: Epidemiological research supports an association between maternal exposure to air pollution during pregnancy and adverse children's health outcomes. Advances in exposure assessment and statistics allow for estimation of both critical windows of vulnerability and exposure effect heterogeneity. Simultaneous estimation of windows of vulnerability and effect heterogeneity can be accomplished by fittin… ▽ More

    Submitted 17 December, 2016; originally announced December 2016.

    Journal ref: Biostatistics 2007

  29. arXiv:1408.6583  [pdf, other

    q-bio.GN stat.AP

    Determination of Nonlinear Genetic Architecture using Compressed Sensing

    Authors: Chiu Man Ho, Stephen D. H. Hsu

    Abstract: We introduce a statistical method that can reconstruct nonlinear genetic models (i.e., including epistasis, or gene-gene interactions) from phenotype-genotype (GWAS) data. The computational and data resource requirements are similar to those necessary for reconstruction of linear genetic models (or identification of gene-trait associations), assuming a condition of generalized sparsity, which limi… ▽ More

    Submitted 19 July, 2015; v1 submitted 27 August, 2014; originally announced August 2014.

    Comments: 20 pages, 8 figures. arXiv admin note: text overlap with arXiv:1408.3421

    Journal ref: GigaScience 4: 44 (2015)

  30. arXiv:1310.2264  [pdf, other

    q-bio.GN stat.AP

    Application of compressed sensing to genome wide association studies and genomic selection

    Authors: Shashaank Vattikuti, James J. Lee, Christopher C. Chang, Stephen D. H. Hsu, Carson C. Chow

    Abstract: We show that the signal-processing paradigm known as compressed sensing (CS) is applicable to genome-wide association studies (GWAS) and genomic selection (GS). The aim of GWAS is to isolate trait-associated loci, whereas GS attempts to predict the phenotypic values of new individuals on the basis of training data. CS addresses a problem common to both endeavors, namely that the number of genotype… ▽ More

    Submitted 11 May, 2014; v1 submitted 8 October, 2013; originally announced October 2013.

    Comments: 30 pages, 11 figures. Version to appear in journal GigaScience