Showing 1–50 of 138 results for author: Huang, S

  1. arXiv:2506.19142  [pdf, ps, other]

    cs.SI stat.ML

    Inferring Diffusion Structures of Heterogeneous Network Cascade

    Authors: Yubai Yuan, Siyu Huang, Abdul Basit Adeel

    Abstract: Network cascade refers to diffusion processes in which outcome changes within part of an interconnected population trigger a sequence of changes across the entire network. These cascades are governed by underlying diffusion networks, which are often latent. Inferring such networks is critical for understanding cascade pathways, uncovering Granger causality of interaction mechanisms among individua…

    Submitted 23 June, 2025; originally announced June 2025.

  2. arXiv:2506.05590  [pdf, ps, other]

    stat.ML cs.LG

    Nonlinear Causal Discovery through a Sequential Edge Orientation Approach

    Authors: Stella Huang, Qing Zhou

    Abstract: Recent advances have established the identifiability of a directed acyclic graph (DAG) under additive noise models (ANMs), spurring the development of various causal discovery methods. However, most existing methods make restrictive model assumptions, rely heavily on general independence tests, or require substantial computational time. To address these limitations, we propose a sequential procedu…

    Submitted 5 June, 2025; originally announced June 2025.

    Comments: 42 Pages, 13 figures, 3 tables

  3. arXiv:2505.19923  [pdf, ps, other]

    cs.LG stat.ML

    Learning to Trust Bellman Updates: Selective State-Adaptive Regularization for Offline RL

    Authors: Qin-Wen Luo, Ming-Kun Xie, Ye-Wen Wang, Sheng-Jun Huang

    Abstract: Offline reinforcement learning (RL) aims to learn an effective policy from a static dataset. To alleviate extrapolation errors, existing studies often uniformly regularize the value function or policy updates across all states. However, due to substantial variations in data quality, the fixed regularization strength often leads to a dilemma: Weak regularization strength fails to address extrapolat…

    Submitted 26 May, 2025; originally announced May 2025.

    Comments: Accepted to ICML 2025

  4. arXiv:2505.19024  [pdf, ps, other]

    cs.LG stat.ML

    Learn Beneficial Noise as Graph Augmentation

    Authors: Siqi Huang, Yanchen Xu, Hongyuan Zhang, Xuelong Li

    Abstract: Although graph contrastive learning (GCL) has been widely investigated, it is still a challenge to generate effective and stable graph augmentations. Existing methods often apply heuristic augmentation like random edge dropping, which may disrupt important graph structures and result in unstable GCL performance. In this paper, we propose Positive-incentive Noise driven Graph Data Augmentation (PiN…

    Submitted 25 May, 2025; originally announced May 2025.

  5. arXiv:2505.04884  [pdf, ps, other]

    stat.ME math.ST

    Model Selection for Unit-root Time Series with Many Predictors

    Authors: Shuo-Chieh Huang, Ching-Kang Ing, Ruey S. Tsay

    Abstract: This paper studies model selection for general unit-root time series, including the case with many exogenous predictors. We propose FHTD, a new model selection algorithm that leverages forward stepwise regression (FSR), a high-dimensional information criterion (HDIC), a backward elimination method based on HDIC, and a data-driven thresholding (DDT) approach. Under some mild assumptions that allow…

    Submitted 7 May, 2025; originally announced May 2025.

  6. arXiv:2502.18253  [pdf, other]

    econ.GN stat.AP

    Enhancing External Validity of Experiments with Ongoing Sampling

    Authors: Chen Wang, Shichao Han, Shan Huang

    Abstract: Participants in online experiments often enroll over time, which can compromise sample representativeness due to temporal shifts in covariates. This issue is particularly critical in A/B tests, online controlled experiments extensively used to evaluate product updates, since these tests are cost-sensitive and typically short in duration. We propose a novel framework that dynamically assesses sampl…

    Submitted 25 February, 2025; originally announced February 2025.

  7. arXiv:2502.14047  [pdf, ps, other]

    cs.LG cs.AI stat.ML

    Towards a Learning Theory of Representation Alignment

    Authors: Francesco Insulla, Shuo Huang, Lorenzo Rosasco

    Abstract: It has recently been argued that AI models' representations are becoming aligned as their scale and performance increase. Empirical analyses have been designed to support this idea and conjecture the possible alignment of different representations toward a shared statistical model of reality. In this paper, we propose a learning-theoretic perspective to representation alignment. First, we review a…

    Submitted 19 February, 2025; originally announced February 2025.

  8. arXiv:2501.02454  [pdf, other]

    stat.ME stat.AP

    Finite-Sample Valid Randomization Tests for Monotone Spillover Effects

    Authors: Shunzhuang Huang, Xinran Li, Panos Toulis

    Abstract: Randomization tests have gained popularity for causal inference under network interference because they are finite-sample valid with minimal assumptions. However, existing procedures are limited as they primarily focus on the existence of spillovers through sharp null hypotheses on potential outcomes. In this paper, we expand the scope of randomization procedures in network settings by developing…

    Submitted 27 February, 2025; v1 submitted 5 January, 2025; originally announced January 2025.

  9. arXiv:2412.09265  [pdf, other]

    cs.RO cs.LG stat.ML

    Score and Distribution Matching Policy: Advanced Accelerated Visuomotor Policies via Matched Distillation

    Authors: Bofang Jia, Pengxiang Ding, Can Cui, Mingyang Sun, Pengfang Qian, Siteng Huang, Zhaoxin Fan, Donglin Wang

    Abstract: Visual-motor policy learning has advanced with architectures like diffusion-based policies, known for modeling complex robotic trajectories. However, their prolonged inference times hinder high-frequency control tasks requiring real-time feedback. While consistency distillation (CD) accelerates inference, it introduces errors that compromise action quality. To address these limitations, we propose…

    Submitted 19 December, 2024; v1 submitted 12 December, 2024; originally announced December 2024.

  10. arXiv:2411.16192  [pdf, other]

    stat.ME

    Modeling large dimensional matrix time series with partially known and latent factors

    Authors: Yongchang Hui, Yuteng Zhang, Siting Huang

    Abstract: This article considers to model large-dimensional matrix time series by introducing a regression term to the matrix factor model. This is an extension of classic matrix factor model to incorporate the information of known factors or useful covariates. We establish the convergence rates of coefficient matrix, loading matrices and the signal part. The theoretical results coincide with the rates in W…

    Submitted 25 November, 2024; originally announced November 2024.

    Comments: 20 pages, 4 figures

  11. arXiv:2411.02811  [pdf, other]

    stat.ME

    Temporal Wasserstein Imputation: Versatile Missing Data Imputation for Time Series

    Authors: Shuo-Chieh Huang, Tengyuan Liang, Ruey S. Tsay

    Abstract: Missing data can significantly hamper standard time series analysis, yet in practice they are frequently encountered. In this paper, we introduce temporal Wasserstein imputation, a novel method for imputing missing data in time series. Unlike existing techniques, our approach is fully nonparametric, circumventing the need for model specification prior to imputation, making it suitable for potentia…

    Submitted 27 February, 2025; v1 submitted 5 November, 2024; originally announced November 2024.

  12. arXiv:2411.01773  [pdf, other]

    stat.AP

    Detection of LUAD-Associated Genes Using Wasserstein Distance in Multi-Omics Feature Selection

    Authors: Shaofei Zhao, Siming Huang, Kexuan Li, Weiyu Zhou, Lingli Yang, Shige Wang

    Abstract: Lung adenocarcinoma (LUAD) is characterized by substantial genetic heterogeneity, posing challenges in identifying reliable biomarkers for improved diagnosis and treatment. Tumor Mutational Burden (TMB) has traditionally been regarded as a predictive biomarker, given its association with immune response and treatment efficacy. In this study, we treated TMB as a response variable to identify genes…

    Submitted 6 November, 2024; v1 submitted 3 November, 2024; originally announced November 2024.

  13. arXiv:2410.12300  [pdf, other]

    stat.ME stat.AP

    Sparse Causal Effect Estimation using Two-Sample Summary Statistics in the Presence of Unmeasured Confounding

    Authors: Shimeng Huang, Niklas Pfister, Jack Bowden

    Abstract: Observational genome-wide association studies are now widely used for causal inference in genetic epidemiology. To maintain privacy, such data is often only publicly available as summary statistics, and often studies for the endogenous covariates and the outcome are available separately. This has necessitated methods tailored to two-sample summary statistics. Current state-of-the-art methods modif…

    Submitted 23 November, 2024; v1 submitted 16 October, 2024; originally announced October 2024.

  14. arXiv:2410.01047  [pdf, ps, other]

    cs.LG math.FA stat.ML

    Spherical Analysis of Learning Nonlinear Functionals

    Authors: Zhenyu Yang, Shuo Huang, Han Feng, Ding-Xuan Zhou

    Abstract: In recent years, there has been growing interest in the field of functional neural networks. They have been proposed and studied with the aim of approximating continuous functionals defined on sets of functions on Euclidean domains. In this paper, we consider functionals defined on sets of functions on spheres. The approximation ability of deep ReLU neural networks is investigated by novel spheric…

    Submitted 1 October, 2024; originally announced October 2024.

  15. arXiv:2410.00397  [pdf, ps, other]

    stat.ML cs.LG

    A Generalized Mean Approach for Distributed-PCA

    Authors: Zhi-Yu Jou, Su-Yun Huang, Hung Hung, Shinto Eguchi

    Abstract: Principal component analysis (PCA) is a widely used technique for dimension reduction. As datasets continue to grow in size, distributed-PCA (DPCA) has become an active research area. A key challenge in DPCA lies in efficiently aggregating results across multiple machines or computing nodes due to computational overhead. Fan et al. (2019) introduced a pioneering DPCA method to estimate the leading…

    Submitted 1 October, 2024; originally announced October 2024.

    Comments: 17 pages, 1 table, 1 figure

  16. arXiv:2409.01369  [pdf, other]

    cs.LG cs.AI cs.CL stat.ML

    Imitating Language via Scalable Inverse Reinforcement Learning

    Authors: Markus Wulfmeier, Michael Bloesch, Nino Vieillard, Arun Ahuja, Jorg Bornschein, Sandy Huang, Artem Sokolov, Matt Barnes, Guillaume Desjardins, Alex Bewley, Sarah Maria Elisabeth Bechtle, Jost Tobias Springenberg, Nikola Momchev, Olivier Bachem, Matthieu Geist, Martin Riedmiller

    Abstract: The majority of language model training builds on imitation learning. It covers pretraining, supervised fine-tuning, and affects the starting conditions for reinforcement learning from human feedback (RLHF). The simplicity and scalability of maximum likelihood estimation (MLE) for next token prediction led to its role as predominant paradigm. However, the broader field of imitation learning can mo…

    Submitted 9 December, 2024; v1 submitted 2 September, 2024; originally announced September 2024.

    Comments: Published at NeurIPS 2024

  17. arXiv:2407.03172  [pdf, other]

    cs.CV cs.AI stat.AP

    IMC 2024 Methods & Solutions Review

    Authors: Shyam Gupta, Dhanisha Sharma, Songling Huang

    Abstract: For the past three years, Kaggle has been hosting the Image Matching Challenge, which focuses on solving a 3D image reconstruction problem using a collection of 2D images. Each year, this competition fosters the development of innovative and effective methodologies by its participants. In this paper, we introduce an advanced ensemble technique that we developed, achieving a score of 0.153449 on th…

    Submitted 3 July, 2024; originally announced July 2024.

    Comments: 8 Pages, 9 figures

  18. arXiv:2406.09625  [pdf, other]

    stat.ME

    Time Series Forecasting with Many Predictors

    Authors: Shuo-Chieh Huang, Ruey S. Tsay

    Abstract: We propose a novel approach for time series forecasting with many predictors, referred to as the GO-sdPCA, in this paper. The approach employs a variable selection method known as the group orthogonal greedy algorithm and the high-dimensional Akaike information criterion to mitigate the impact of irrelevant predictors. Moreover, a novel technique, called peeling, is used to boost the variable sele…

    Submitted 13 June, 2024; originally announced June 2024.

  19. arXiv:2406.02362  [pdf, other]

    cs.LG cs.AI cs.SI stat.ML

    Temporal Graph Rewiring with Expander Graphs

    Authors: Katarina Petrović, Shenyang Huang, Farimah Poursafaei, Petar Veličković

    Abstract: Evolving relations in real-world networks are often modelled by temporal graphs. Temporal Graph Neural Networks (TGNNs) emerged to model evolutionary behaviour of such graphs by leveraging the message passing primitive at the core of Graph Neural Networks (GNNs). It is well-known that GNNs are vulnerable to several issues directly related to the input graph topology, such as under-reaching and ove…

    Submitted 22 October, 2024; v1 submitted 4 June, 2024; originally announced June 2024.

    Comments: 14 pages, 2 figures

  20. arXiv:2405.14104  [pdf, ps, other]

    econ.EM stat.ME

    On the Identifying Power of Monotonicity for Average Treatment Effects

    Authors: Yuehao Bai, Shunzhuang Huang, Sarah Moon, Azeem M. Shaikh, Edward J. Vytlacil

    Abstract: In the context of a binary outcome, treatment, and instrument, Balke and Pearl (1993, 1997) establish that the monotonicity condition of Imbens and Angrist (1994) has no identifying power beyond instrument exogeneity for average potential outcomes and average treatment effects in the sense that adding it to instrument exogeneity does not decrease the identified sets for those parameters whenever t…

    Submitted 27 June, 2025; v1 submitted 22 May, 2024; originally announced May 2024.

  21. arXiv:2403.12677  [pdf, other]

    stat.ME

    Causal Change Point Detection and Localization

    Authors: Shimeng Huang, Jonas Peters, Niklas Pfister

    Abstract: Detecting and localizing change points in sequential data is of interest in many areas of application. Various notions of change points have been proposed, such as changes in mean, variance, or the linear regression coefficient. In this work, we consider settings in which a response variable $Y$ and a set of covariates $X=(X^1,\ldots,X^{d+1})$ are observed over time and aim to find changes in the…

    Submitted 19 March, 2024; originally announced March 2024.

  22. arXiv:2402.09436  [pdf, other]

    math.PR stat.AP

    On the expected number of facets for the convex hull of samples

    Authors: Feng Zhao, Xinyi Tong, Shao-Lun Huang

    Abstract: This paper studies the convex hull of $d$-dimensional samples i.i.d. generated from spherically symmetric distributions. Specifically, we derive a complete integration formula for the expected facet number of the convex hull. This formula is with respect to the CDF of the radial distribution. As the number of samples approaches infinity, the integration formula enables us to obtain the asymptotic…

    Submitted 26 January, 2024; originally announced February 2024.

  23. arXiv:2401.04933  [pdf, other]

    cs.LG stat.ML

    Rethinking Test-time Likelihood: The Likelihood Path Principle and Its Application to OOD Detection

    Authors: Sicong Huang, Jiawei He, Kry Yik Chau Lui

    Abstract: While likelihood is attractive in theory, its estimates by deep generative models (DGMs) are often broken in practice, and perform poorly for out of distribution (OOD) Detection. Various recent works started to consider alternative scores and achieved better performances. However, such recipes do not come with provable guarantees, nor is it clear that their choices extract sufficient information.…

    Submitted 10 January, 2024; originally announced January 2024.

  24. arXiv:2311.06517  [pdf, other]

    cs.AI cs.DB cs.LG stat.AP

    BClean: A Bayesian Data Cleaning System

    Authors: Jianbin Qin, Sifan Huang, Yaoshu Wang, Jing Zhu, Yifan Zhang, Yukai Miao, Rui Mao, Makoto Onizuka, Chuan Xiao

    Abstract: There is a considerable body of work on data cleaning which employs various principles to rectify erroneous data and transform a dirty dataset into a cleaner one. One of prevalent approaches is probabilistic methods, including Bayesian methods. However, existing probabilistic methods often assume a simplistic distribution (e.g., Gaussian distribution), which is frequently underfitted in practice,…

    Submitted 11 November, 2023; originally announced November 2023.

    Comments: Our source code is available at https://github.com/yyssl88/BClean

  25. arXiv:2308.08152  [pdf, other]

    econ.EM stat.ME

    Estimating Effects of Long-Term Treatments

    Authors: Shan Huang, Chen Wang, Yuan Yuan, Jinglong Zhao, Brocco, Zhang

    Abstract: Estimating the effects of long-term treatments through A/B testing is challenging. Treatments, such as updates to product functionalities, user interface designs, and recommendation algorithms, are intended to persist within the system for a long duration of time after their initial launches. However, due to the constraints of conducting long-term experiments, practitioners often rely on short-ter…

    Submitted 6 December, 2024; v1 submitted 16 August, 2023; originally announced August 2023.

  26. arXiv:2307.03410  [pdf, other]

    stat.ML cs.DC cs.LG

    Scalable High-Dimensional Multivariate Linear Regression for Feature-Distributed Data

    Authors: Shuo-Chieh Huang, Ruey S. Tsay

    Abstract: Feature-distributed data, referred to data partitioned by features and stored across multiple computing nodes, are increasingly common in applications with a large number of features. This paper proposes a two-stage relaxed greedy algorithm (TSRGA) for applying multivariate linear regression to such data. The main advantage of TSRGA is that its communication complexity does not depend on the featu…

    Submitted 10 March, 2024; v1 submitted 7 July, 2023; originally announced July 2023.

  27. arXiv:2305.19640  [pdf, other]

    stat.ML cs.LG

    Fine-grained analysis of non-parametric estimation for pairwise learning

    Authors: Junyu Zhou, Shuo Huang, Han Feng, Puyu Wang, Ding-Xuan Zhou

    Abstract: In this paper, we are concerned with the generalization performance of non-parametric estimation for pairwise learning. Most of the existing work requires the hypothesis space to be convex or a VC-class, and the loss to be convex. However, these restrictive assumptions limit the applicability of the results in studying many popular methods, especially kernel methods and neural networks. We signifi…

    Submitted 21 June, 2024; v1 submitted 31 May, 2023; originally announced May 2023.

    Comments: 30 pages, 1 figure

  28. arXiv:2303.06992  [pdf, other]

    cs.LG stat.ML

    Improving Mutual Information Estimation with Annealed and Energy-Based Bounds

    Authors: Rob Brekelmans, Sicong Huang, Marzyeh Ghassemi, Greg Ver Steeg, Roger Grosse, Alireza Makhzani

    Abstract: Mutual information (MI) is a fundamental quantity in information theory and machine learning. However, direct estimation of MI is intractable, even if the true joint probability density for the variables of interest is known, as it involves estimating a potentially high-dimensional log partition function. In this work, we present a unifying view of existing MI bounds from the perspective of import…

    Submitted 13 March, 2023; originally announced March 2023.

    Comments: A shorter version appeared in the International Conference on Learning Representations (ICLR) 2022

    Journal ref: ICLR 2022 https://openreview.net/forum?id=T0B9AoM_bFg

  29. arXiv:2302.11124  [pdf, ps, other]

    stat.ME

    On the efficiency-loss free ordering-robustness of product-PCA

    Authors: Hung Hung, Su-Yun Huang

    Abstract: This article studies the robustness of the eigenvalue ordering, an important issue when estimating the leading eigen-subspace by principal component analysis (PCA). In Yata and Aoshima (2010), cross-data-matrix PCA (CDM-PCA) was proposed and shown to have smaller bias than PCA in estimating eigenvalues. While CDM-PCA has the potential to achieve better estimation of the leading eigen-subspace than…

    Submitted 20 March, 2025; v1 submitted 21 February, 2023; originally announced February 2023.

    Comments: 3 figures

  30. arXiv:2302.03519  [pdf, other]

    cs.LG cs.AI stat.ML

    Efficient Parametric Approximations of Neural Network Function Space Distance

    Authors: Nikita Dhawan, Sicong Huang, Juhan Bae, Roger Grosse

    Abstract: It is often useful to compactly summarize important properties of model parameters and training data so that they can be used later without storing and/or iterating over the entire dataset. As a specific case, we consider estimating the Function Space Distance (FSD) over a training set, i.e. the average discrepancy between the outputs of two neural networks. We propose a Linearized Activation Func…

    Submitted 28 May, 2023; v1 submitted 7 February, 2023; originally announced February 2023.

    Comments: 18 pages, 5 figures, ICML 2023

  31. Modeling MRSA decolonization: Interactions between body sites and the impact of site-specific clearance

    Authors: Onur Poyraz, Mohamad R. A. Sater, Loren G. Miller, James A. Mckinnell, Susan S. Huang, Yonatan H. Grad, Pekka Marttinen

    Abstract: MRSA colonization is a critical public health concern. Decolonization protocols have been designed for the clearance of MRSA. Successful decolonization protocols reduce disease incidence; however, multiple protocols exist, comprising diverse therapies targeting multiple body sites, and the optimal protocol is unclear. Here, we formulate a machine learning model using data from a randomized control…

    Submitted 14 November, 2022; originally announced November 2022.

    Comments: Extended Abstract presented at Machine Learning for Health (ML4H) symposium 2022, November 28th, 2022, New Orleans, United States & Virtual, http://www.ml4h.cc, 12 pages

    Journal ref: Journal of the Royal Society Interface 19 (2022) 191, 20210916

  32. arXiv:2211.01249  [pdf, other]

    stat.AP physics.soc-ph

    How Democracies Polarize: A Multilevel Perspective

    Authors: Sihao Huang, Alexander F. Siegenfeld, Andrew Gelman

    Abstract: Democracies employ elections at various scales to select officials at the corresponding levels of administration. The geographical distribution of political opinion, the policy issues delegated to each level, and the multilevel interactions between elections can all greatly impact the makeup of these representative bodies. This perspective is not new: the adoption of federal systems has been motiv…

    Submitted 2 November, 2022; originally announced November 2022.

    Comments: 20 pages, 6 figures

  33. arXiv:2207.02985  [pdf, other]

    math.OC cs.CV eess.IV q-bio.BM stat.AP

    Orthogonal Matrix Retrieval with Spatial Consensus for 3D Unknown-View Tomography

    Authors: Shuai Huang, Mona Zehni, Ivan Dokmanić, Zhizhen Zhao

    Abstract: Unknown-view tomography (UVT) reconstructs a 3D density map from its 2D projections at unknown, random orientations. A line of work starting with Kam (1980) employs the method of moments (MoM) with rotation-invariant Fourier features to solve UVT in the frequency domain, assuming that the orientations are uniformly distributed. This line of work includes the recent orthogonal matrix retrieval (OMR…

    Submitted 10 June, 2023; v1 submitted 6 July, 2022; originally announced July 2022.

    Comments: Keywords: unknown view tomography, single-particle cryo-electron microscopy, method of moments, autocorrelation, spherical harmonics

    MSC Class: 92C55; 68U10; 33C55; 78M05

  34. arXiv:2206.08353  [pdf, other]

    cs.LG stat.ML

    Towards Understanding How Machines Can Learn Causal Overhypotheses

    Authors: Eliza Kosoy, David M. Chan, Adrian Liu, Jasmine Collins, Bryanna Kaufmann, Sandy Han Huang, Jessica B. Hamrick, John Canny, Nan Rosemary Ke, Alison Gopnik

    Abstract: Recent work in machine learning and cognitive science has suggested that understanding causal information is essential to the development of intelligence. The extensive literature in cognitive science using the ``blicket detector'' environment shows that children are adept at many kinds of causal inference and learning. We propose to adapt that environment for machine learning agents. One of the k…

    Submitted 16 June, 2022; originally announced June 2022.

  35. Robust self-tuning semiparametric PCA for contaminated elliptical distribution

    Authors: Hung Hung, Su-Yun Huang, Shinto Eguchi

    Abstract: Principal component analysis (PCA) is one of the most popular dimension reduction methods. The usual PCA is known to be sensitive to the presence of outliers, and thus many robust PCA methods have been developed. Among them, the Tyler's M-estimator is shown to be the most robust scatter estimator under the elliptical distribution. However, when the underlying distribution is contaminated and devia…

    Submitted 7 June, 2022; originally announced June 2022.

  36. arXiv:2205.07271  [pdf, other]

    stat.ML cs.LG stat.AP

    Supervised Learning and Model Analysis with Compositional Data

    Authors: Shimeng Huang, Elisabeth Ailer, Niki Kilbertus, Niklas Pfister

    Abstract: The compositionality and sparsity of high-throughput sequencing data poses a challenge for regression and classification. However, in microbiome research in particular, conditional modeling is an essential tool to investigate relationships between phenotypes and the microbiome. Existing techniques are often inadequate: they either rely on extensions of the linear log-contrast model (which adjusts…

    Submitted 11 November, 2022; v1 submitted 15 May, 2022; originally announced May 2022.

  37. arXiv:2204.00130  [pdf, other]

    cs.LG stat.ML

    VFDS: Variational Foresight Dynamic Selection in Bayesian Neural Networks for Efficient Human Activity Recognition

    Authors: Randy Ardywibowo, Shahin Boluki, Zhangyang Wang, Bobak Mortazavi, Shuai Huang, Xiaoning Qian

    Abstract: In many machine learning tasks, input features with varying degrees of predictive capability are acquired at varying costs. In order to optimize the performance-cost trade-off, one would select features to observe a priori. However, given the changing context with previous observations, the subset of predictive features to select may change dynamically. Therefore, we face the challenging new probl…

    Submitted 31 March, 2022; originally announced April 2022.

  38. arXiv:2112.15327  [pdf, other]

    cs.IT cs.LG eess.SP math.ST stat.ML

    Sufficient-Statistic Memory AMP

    Authors: Lei Liu, Shunqi Huang, YuZhi Yang, Zhaoyang Zhang, Brian M. Kurkoski

    Abstract: Approximate message passing (AMP) type algorithms have been widely used in the signal reconstruction of certain large random linear systems. A key feature of the AMP-type algorithms is that their dynamics can be correctly described by state evolution. While state evolution is a useful analytic tool, its convergence is not guaranteed. To solve the convergence problem of the state evolution of AMP-t…

    Submitted 29 June, 2023; v1 submitted 31 December, 2021; originally announced December 2021.

    Comments: Double-column, 21 pages, submitted to IEEE Transactions on Information Theory

  39. arXiv:2112.02048  [pdf, other]

    physics.ins-det cs.AR cs.LG hep-ex stat.ML

    Graph Neural Networks for Charged Particle Tracking on FPGAs

    Authors: Abdelrahman Elabd, Vesal Razavimaleki, Shi-Yu Huang, Javier Duarte, Markus Atkinson, Gage DeZoort, Peter Elmer, Scott Hauck, Jin-Xuan Hu, Shih-Chieh Hsu, Bo-Cheng Lai, Mark Neubauer, Isobel Ojalvo, Savannah Thais, Matthew Trahms

    Abstract: The determination of charged particle trajectories in collisions at the CERN Large Hadron Collider (LHC) is an important but challenging problem, especially in the high interaction density conditions expected during the future high-luminosity phase of the LHC (HL-LHC). Graph neural networks (GNNs) are a type of geometric deep learning algorithm that has successfully been applied to this task by em…

    Submitted 23 March, 2022; v1 submitted 3 December, 2021; originally announced December 2021.

    Comments: 28 pages, 17 figures, 1 table, published version

    Journal ref: Front. Big Data 5 (2022) 828666

  40. arXiv:2110.14446  [pdf, other]

    cs.LG cs.SI stat.ML

    Large Scale Learning on Non-Homophilous Graphs: New Benchmarks and Strong Simple Methods

    Authors: Derek Lim, Felix Hohne, Xiuyu Li, Sijia Linda Huang, Vaishnavi Gupta, Omkar Bhalerao, Ser-Nam Lim

    Abstract: Many widely used datasets for graph machine learning tasks have generally been homophilous, where nodes with similar labels connect to each other. Recently, new Graph Neural Networks (GNNs) have been developed that move beyond the homophily regime; however, their evaluation has often been conducted on small graphs with limited application domains. We collect and introduce diverse non-homophilous d…

    Submitted 27 October, 2021; originally announced October 2021.

    Comments: Published at NeurIPS 2021

  41. arXiv:2109.06388  [pdf, other]

    cs.IT stat.ML

    On Distributed Learning with Constant Communication Bits

    Authors: Xiangxiang Xu, Shao-Lun Huang

    Abstract: In this paper, we study a distributed learning problem constrained by constant communication bits. Specifically, we consider the distributed hypothesis testing (DHT) problem where two distributed nodes are constrained to transmit a constant number of bits to a central decoder. In such cases, we show that in order to achieve the optimal error exponents, it suffices to consider the empirical distrib…

    Submitted 22 January, 2022; v1 submitted 13 September, 2021; originally announced September 2021.

    Comments: Submitted to JSAIT

  42. arXiv:2106.14384  [pdf]

    stat.AP cs.LG stat.ML

    Towards Model-informed Precision Dosing with Expert-in-the-loop Machine Learning

    Authors: Yihuang Kang, Yi-Wen Chiu, Ming-Yen Lin, Fang-yi Su, Sheng-Tai Huang

    Abstract: Machine Learning (ML) and its applications have been transforming our lives but it is also creating issues related to the development of fair, accountable, transparent, and ethical Artificial Intelligence. As the ML models are not fully comprehensible yet, it is obvious that we still need humans to be part of algorithmic decision-making processes. In this paper, we consider a ML framework that may…

    Submitted 28 June, 2021; v1 submitted 27 June, 2021; originally announced June 2021.

  43. arXiv:2105.14052  [pdf, other]

    cs.LG stat.CO

    Targeted Deep Learning: Framework, Methods, and Applications

    Authors: Shih-Ting Huang, Johannes Lederer

    Abstract: Deep learning systems are typically designed to perform for a wide range of test inputs. For example, deep learning systems in autonomous cars are supposed to deal with traffic situations for which they were not specifically trained. In general, the ability to cope with a broad spectrum of unseen test inputs is called generalization. Generalization is definitely important in applications where the…

    Submitted 28 May, 2021; originally announced May 2021.

  44. arXiv:2105.14035  [pdf, other]

    stat.ML cs.LG stat.CO

    DeepMoM: Robust Deep Learning With Median-of-Means

    Authors: Shih-Ting Huang, Johannes Lederer

    Abstract: Data used in deep learning is notoriously problematic. For example, data are usually combined from diverse sources, rarely cleaned and vetted thoroughly, and sometimes corrupted on purpose. Intentional corruption that targets the weak spots of algorithms has been studied extensively under the label of "adversarial attacks." In contrast, the arguably much more common case of corruption that reflect…

    Submitted 8 November, 2021; v1 submitted 28 May, 2021; originally announced May 2021.

  45. arXiv:2105.07338  [pdf, ps, other]

    cs.LG stat.ML

    CCMN: A General Framework for Learning with Class-Conditional Multi-Label Noise

    Authors: Ming-Kun Xie, Sheng-Jun Huang

    Abstract: Class-conditional noise commonly exists in machine learning tasks, where the class label is corrupted with a probability depending on its ground-truth. Many research efforts have been made to improve the model robustness against the class-conditional noise. However, they typically focus on the single label case by assuming that only one label is corrupted. In real applications, an instance is usua…

    Submitted 15 May, 2021; originally announced May 2021.

    Comments: 18 pages

  46. arXiv:2012.04646  [pdf, other]

    stat.ML cs.LG cs.SI math.ST stat.CO

    Spectral clustering via adaptive layer aggregation for multi-layer networks

    Authors: Sihan Huang, Haolei Weng, Yang Feng

    Abstract: One of the fundamental problems in network analysis is detecting community structure in multi-layer networks, of which each layer represents one type of edge information among the nodes. We propose integrative spectral clustering approaches based on effective convex layer aggregations. Our aggregation methods are strongly motivated by a delicate asymptotic analysis of the spectral embedding of wei…

    Submitted 6 October, 2022; v1 submitted 7 December, 2020; originally announced December 2020.

    Comments: 74 pages

  47. arXiv:2012.04533  [pdf, other]

    physics.ins-det hep-ex hep-ph physics.data-an stat.ML

    Beyond 4D Tracking: Using Cluster Shapes for Track Seeding

    Authors: Patrick J. Fox, Shangqing Huang, Joshua Isaacson, Xiangyang Ju, Benjamin Nachman

    Abstract: Tracking is one of the most time consuming aspects of event reconstruction at the Large Hadron Collider (LHC) and its high-luminosity upgrade (HL-LHC). Innovative detector technologies extend tracking to four-dimensions by including timing in the pattern recognition and parameter estimation. However, present and future hardware already have additional information that is largely unused by existing…

    Submitted 10 November, 2021; v1 submitted 8 December, 2020; originally announced December 2020.

    Comments: 19 pages, 14 figures; v2: journal version

    Report number: FERMILAB-PUB-20-650-T

    Journal ref: JINST 16 (2021) P05001

  48. arXiv:2010.13269  [pdf, other]

    cs.LG stat.ML

    Revisiting convolutional neural network on graphs with polynomial approximations of Laplace-Beltrami spectral filtering

    Authors: Shih-Gu Huang, Moo K. Chung, Anqi Qiu, Alzheimer's Disease Neuroimaging Initiative

    Abstract: This paper revisits spectral graph convolutional neural networks (graph-CNNs) given in Defferrard (2016) and develops the Laplace-Beltrami CNN (LB-CNN) by replacing the graph Laplacian with the LB operator. We then define spectral filters via the LB operator on a graph. We explore the feasibility of Chebyshev, Laguerre, and Hermite polynomials to approximate LB-based spectral filters and define an…

    Submitted 25 October, 2020; originally announced October 2020.

  49. arXiv:2010.03956  [pdf, other]

    cs.LG stat.ML

    Action Guidance: Getting the Best of Sparse Rewards and Shaped Rewards for Real-time Strategy Games

    Authors: Shengyi Huang, Santiago Ontañón

    Abstract: Training agents using Reinforcement Learning in games with sparse rewards is a challenging problem, since large amounts of exploration are required to retrieve even the first reward. To tackle this problem, a common approach is to use reward shaping to help exploration. However, an important drawback of reward shaping is that agents sometimes learn to optimize the shaped reward instead of the true…

    Submitted 4 October, 2020; originally announced October 2020.

    Comments: Preprint

  50. arXiv:2010.03710  [pdf]

    stat.ML cs.IR cs.LG

    Topic Diffusion Discovery Based on Deep Non-negative Autoencoder

    Authors: Sheng-Tai Huang, Yihuang Kang, Shao-Min Hung, Bowen Kuo, I-Ling Cheng

    Abstract: Researchers have been overwhelmed by the explosion of research articles published by various research communities. Many research scholarly websites, search engines, and digital libraries have been created to help researchers identify potential research topics and keep up with recent progress on research of interests. However, it is still difficult for researchers to keep track of the research topi…

    Submitted 7 October, 2020; originally announced October 2020.