Showing 1–28 of 28 results for author: Zhu, F

  1. arXiv:2501.07642  [pdf, other]

    stat.CO

    fastrerandomize: Fast Rerandomization Using Accelerated Computing

    Authors: Rebecca Goldstein, Connor T. Jerzak, Aniket Kamat, Fucheng Warren Zhu

    Abstract: We introduce fastrerandomize, an R package that implements novel algorithmic approaches to rerandomization in experimental design. Rerandomization improves precision by discarding treatment assignments until covariate balance meets predefined thresholds, but existing implementations often struggle with computational demands in large-scale settings. fastrerandomize addresses these limitations throu…

    Submitted 14 April, 2025; v1 submitted 13 January, 2025; originally announced January 2025.

    Comments: 38 pages, 10 figures

    MSC Class: 62K10; 65C60

    ACM Class: G.3; G.4

  2. arXiv:2411.02134  [pdf, other]

    stat.ML cs.LG

    Optimizing Multi-Scale Representations to Detect Effect Heterogeneity Using Earth Observation and Computer Vision: Applications to Two Anti-Poverty RCTs

    Authors: Fucheng Warren Zhu, Connor T. Jerzak, Adel Daoud

    Abstract: Earth Observation (EO) data are increasingly used in policy analysis by enabling granular estimation of conditional average treatment effects (CATE). However, a challenge in EO-based causal inference is determining the scale of the input satellite imagery -- balancing the trade-off between capturing fine-grained individual heterogeneity in smaller images and broader contextual information in large…

    Submitted 15 March, 2025; v1 submitted 4 November, 2024; originally announced November 2024.

    Comments: To appear in: Conference on Causal Learning and Reasoning, 2025

    ACM Class: I.4.7; I.4.9

  3. arXiv:2403.00224  [pdf, other]

    stat.ME

    Tobit models for count time series

    Authors: Christian H. Weiß, Fukang Zhu

    Abstract: Several models for count time series have been developed during the last decades, often inspired by traditional autoregressive moving average (ARMA) models for real-valued time series, including integer-valued ARMA (INARMA) and integer-valued generalized autoregressive conditional heteroscedasticity (INGARCH) models. Both INARMA and INGARCH models exhibit an ARMA-like autocorrelation function (ACF…

    Submitted 29 February, 2024; originally announced March 2024.

  4. arXiv:2402.15772  [pdf, other]

    stat.ME

    Mean-preserving rounding integer-valued ARMA models

    Authors: Christian H. Weiß, Fukang Zhu

    Abstract: In the past four decades, research on count time series has made significant progress, but research on $\mathbb{Z}$-valued time series is relatively rare. Existing $\mathbb{Z}$-valued models are mainly of autoregressive structure, where the use of the rounding operator is very natural. Because of the discontinuity of the rounding operator, the formulation of the corresponding model identifiability…

    Submitted 24 February, 2024; originally announced February 2024.

  5. arXiv:2402.11425  [pdf, ps, other]

    stat.ME cs.LG math.OC math.PR

    Online Resource Allocation with Average Budget Constraints

    Authors: Ruicheng Ao, Hongyu Chen, David Simchi-Levi, Feng Zhu

    Abstract: We consider the problem of online resource allocation with average budget constraints. At each time point, the decision maker makes an irrevocable decision of whether to accept or reject a request before the next request arrives, with the goal of maximizing the cumulative rewards. In contrast to existing literature requiring that the total resource consumption be below a certain level, we require the avera…

    Submitted 25 September, 2025; v1 submitted 17 February, 2024; originally announced February 2024.

  6. arXiv:2304.04341  [pdf, ps, other]

    stat.ML cs.LG math.ST stat.ME

    Regret Distribution in Stochastic Bandits: Optimal Trade-off between Expectation and Tail Risk

    Authors: David Simchi-Levi, Zeyu Zheng, Feng Zhu

    Abstract: We study the trade-off between expectation and tail risk for regret distribution in the stochastic multi-armed bandit problem. We fully characterize the interplay among three desired properties for policy design: worst-case optimality, instance-dependent consistency, and light-tailed risk. We show how the order of expected regret exactly affects the decaying rate of the regret tail probability for…

    Submitted 9 April, 2023; originally announced April 2023.

    Comments: arXiv admin note: text overlap with arXiv:2206.02969

  7. arXiv:2301.06658  [pdf, other]

    econ.EM stat.ME

    Statistical inference for the logarithmic spatial heteroskedasticity model with exogenous variables

    Authors: Bing Su, Fukang Zhu, Ke Zhu

    Abstract: The spatial dependence in mean has been well studied by plenty of models in a large strand of literature; however, the investigation of spatial dependence in variance is lagging significantly behind. The existing models for the spatial dependence in variance are scarce, with neither probabilistic structure nor statistical inference procedure being explored. To circumvent this deficiency, this pape…

    Submitted 16 January, 2023; originally announced January 2023.

  8. Conditional-mean Multiplicative Operator Models for Count Time Series

    Authors: Christian H. Weiß, Fukang Zhu

    Abstract: Multiplicative error models (MEMs) are commonly used for real-valued time series, but they cannot be applied to discrete-valued count time series as the involved multiplication would not preserve the integer nature of the data. Thus, the concept of a multiplicative operator for counts is proposed (as well as several specific instances thereof), which are then used to develop a kind of MEMs for cou…

    Submitted 27 November, 2023; v1 submitted 12 December, 2022; originally announced December 2022.

    Comments: 45 pages

    Journal ref: Computational Statistics & Data Analysis, 2024, 191, 107885

  9. arXiv:2206.02969  [pdf, other]

    stat.ML cs.LG math.ST

    A Simple and Optimal Policy Design with Safety against Heavy-Tailed Risk for Stochastic Bandits

    Authors: David Simchi-Levi, Zeyu Zheng, Feng Zhu

    Abstract: We study the stochastic multi-armed bandit problem and design new policies that enjoy both worst-case optimality for expected regret and light-tailed risk for regret distribution. Specifically, our policy design (i) enjoys the worst-case optimality for the expected regret at order $O(\sqrt{KT\ln T})$ and (ii) has the worst-case tail probability of incurring a regret larger than any $x>0$ being upp…

    Submitted 22 July, 2024; v1 submitted 6 June, 2022; originally announced June 2022.

    Comments: Preliminary version appeared in NeurIPS 2022

  10. arXiv:2109.11929  [pdf, other]

    stat.ML cs.AI cs.LG

    Deep Bayesian Estimation for Dynamic Treatment Regimes with a Long Follow-up Time

    Authors: Adi Lin, Jie Lu, Junyu Xuan, Fujin Zhu, Guangquan Zhang

    Abstract: Causal effect estimation for dynamic treatment regimes (DTRs) contributes to sequential decision making. However, censoring and time-dependent confounding under DTRs are challenging as the amount of observational data declines over time due to a reducing sample size but the feature dimension increases over time. Long-term follow-up compounds these challenges. Another challenge is the highly comple…

    Submitted 20 September, 2021; originally announced September 2021.

  11. arXiv:2106.14813  [pdf, other]

    stat.ML cs.DM cs.LG math.OC

    Offline Planning and Online Learning under Recovering Rewards

    Authors: David Simchi-Levi, Zeyu Zheng, Feng Zhu

    Abstract: Motivated by emerging applications such as live-streaming e-commerce, promotions and recommendations, we introduce and solve a general class of non-stationary multi-armed bandit problems that have the following two features: (i) the decision maker can pull and collect rewards from up to $K\,(\ge 1)$ out of $N$ different arms in each time period; (ii) the expected reward of an arm immediately drops…

    Submitted 21 December, 2021; v1 submitted 28 June, 2021; originally announced June 2021.

    Comments: v1 accepted by ICML 2021

  12. arXiv:2009.13333  [pdf, other]

    cs.LG cs.CV stat.ML

    Group Whitening: Balancing Learning Efficiency and Representational Capacity

    Authors: Lei Huang, Yi Zhou, Li Liu, Fan Zhu, Ling Shao

    Abstract: Batch normalization (BN) is an important technique commonly incorporated into deep learning models to perform standardization within mini-batches. The merits of BN in improving a model's learning efficiency can be further amplified by applying whitening, while its drawbacks in estimating population statistics for inference can be avoided through group normalization (GN). This paper proposes group…

    Submitted 6 April, 2021; v1 submitted 28 September, 2020; originally announced September 2020.

    Comments: V4: camera version of CVPR 2021. Code available at: https://github.com/huangleiBuaa/GroupWhitening

  13. arXiv:2009.12836  [pdf, other]

    cs.LG cs.CV stat.ML

    Normalization Techniques in Training DNNs: Methodology, Analysis and Application

    Authors: Lei Huang, Jie Qin, Yi Zhou, Fan Zhu, Li Liu, Ling Shao

    Abstract: Normalization techniques are essential for accelerating the training and improving the generalization of deep neural networks (DNNs), and have successfully been used in various applications. This paper reviews and comments on the past, present and future of normalization methods in the context of DNN training. We provide a unified picture of the main motivation behind different approaches from the…

    Submitted 27 September, 2020; originally announced September 2020.

    Comments: 20 pages

  14. Understanding the effect of hyperparameter optimization on machine learning models for structure design problems

    Authors: Xianping Du, Hongyi Xu, Feng Zhu

    Abstract: To relieve the computational cost of design evaluations using expensive finite element simulations, surrogate models have been widely applied in computer-aided engineering design. Machine learning algorithms (MLAs) have been implemented as surrogate models due to their capability of learning the complex interrelations between the design variables and the response from big datasets. Typically, an M…

    Submitted 15 March, 2021; v1 submitted 4 July, 2020; originally announced July 2020.

    Comments: 43 pages, 15 figures, 8 tables. Accepted by Computer-Aided Design

    Journal ref: Computer-Aided Design (2021): 103013

  15. arXiv:2006.05554  [pdf, other]

    cs.LG stat.ML

    Causal Discovery from Incomplete Data using An Encoder and Reinforcement Learning

    Authors: Xiaoshui Huang, Fujin Zhu, Lois Holloway, Ali Haidar

    Abstract: Discovering causal structure among a set of variables is a fundamental problem in many domains. However, state-of-the-art methods seldom consider the possibility that the observational data have missing values (incomplete data), which are ubiquitous in many real-world situations. Missing values significantly impair the performance of causal discovery algorithms and can even make them fail. In this p…

    Submitted 9 June, 2020; originally announced June 2020.

  16. arXiv:2006.00978  [pdf, ps, other]

    cs.LG stat.ML

    On the Number of Linear Regions of Convolutional Neural Networks

    Authors: H. Xiong, L. Huang, M. Yu, L. Liu, F. Zhu, L. Shao

    Abstract: One fundamental problem in deep learning is understanding the outstanding performance of deep Neural Networks (NNs) in practice. One explanation for the superiority of NNs is that they can realize a large class of complicated functions, i.e., they have powerful expressivity. The expressivity of a ReLU NN can be quantified by the maximal number of linear regions it can separate its input space into…

    Submitted 27 June, 2020; v1 submitted 1 June, 2020; originally announced June 2020.

    Comments: International Conference on Machine Learning (ICML) 2020

  17. arXiv:2004.09161  [pdf, ps, other]

    econ.EM stat.ME

    Multi-frequency-band tests for white noise under heteroskedasticity

    Authors: Mengya Liu, Fukan Zhu, Ke Zhu

    Abstract: This paper proposes a new family of multi-frequency-band (MFB) tests for the white noise hypothesis by using the maximum overlap discrete wavelet packet transform (MODWPT). The MODWPT allows the variance of a process to be decomposed into the variance of its components on different equal-length frequency sub-bands, and the MFB tests then measure the distance between the MODWPT-based variance ratio…

    Submitted 20 April, 2020; originally announced April 2020.

  18. arXiv:1906.09205  [pdf, other]

    cs.LG cs.AI cs.RO stat.ML

    Continual Reinforcement Learning with Diversity Exploration and Adversarial Self-Correction

    Authors: Fengda Zhu, Xiaojun Chang, Runhao Zeng, Mingkui Tan

    Abstract: Deep reinforcement learning has made significant progress in the field of continuous control, such as physical control and autonomous driving. However, it is challenging for a reinforcement model to learn a policy for each task sequentially due to catastrophic forgetting. Specifically, the model would forget knowledge it learned in the past when trained on a new task. We consider this challenge fr…

    Submitted 21 June, 2019; originally announced June 2019.

  19. arXiv:1903.06258  [pdf, ps, other]

    cs.CV cs.LG stat.ML

    Hyperspectral Image Classification with Deep Metric Learning and Conditional Random Field

    Authors: Yi Liang, Xin Zhao, Alan J. X. Guo, Fei Zhu

    Abstract: To improve the classification performance in the context of hyperspectral image processing, many works have been developed based on two common strategies, namely the spatial-spectral information integration and the utilization of neural networks. However, both strategies typically require more training data than the classical algorithms, aggravating the shortage of labeled samples. In this letter,…

    Submitted 15 July, 2019; v1 submitted 4 March, 2019; originally announced March 2019.

  20. arXiv:1801.03226  [pdf, other]

    cs.LG stat.ML

    Adaptive Graph Convolutional Neural Networks

    Authors: Ruoyu Li, Sheng Wang, Feiyun Zhu, Junzhou Huang

    Abstract: Graph Convolutional Neural Networks (Graph CNNs) are generalizations of classical CNNs to handle graph data such as molecular data, point clouds and social networks. Current filters in graph CNNs are built for a fixed and shared graph structure. However, for most real data, the graph structures vary in both size and connectivity. The paper proposes a generalized and flexible graph CNN taking data o…

    Submitted 9 January, 2018; originally announced January 2018.

    Comments: The Thirty-Second AAAI Conference on Artificial Intelligence (AAAI-18), 8 pages

  21. arXiv:1708.05446  [pdf, other]

    cs.LG stat.ML

    Robust Contextual Bandit via the Capped-$\ell_{2}$ norm

    Authors: Feiyun Zhu, Xinliang Zhu, Sheng Wang, Jiawen Yao, Junzhou Huang

    Abstract: This paper considers the actor-critic contextual bandit for the mobile health (mHealth) intervention. The state-of-the-art decision-making methods in mHealth generally assume that the noise in the dynamic system follows the Gaussian distribution. Those methods use the least-square-based algorithm to estimate the expected reward, which is sensitive to outliers. To deal with the issue o…

    Submitted 17 August, 2017; originally announced August 2017.

  22. arXiv:1602.01729  [pdf, ps, other]

    stat.ML cs.CV cs.NE

    Correntropy Maximization via ADMM - Application to Robust Hyperspectral Unmixing

    Authors: Fei Zhu, Abderrahim Halimi, Paul Honeine, Badong Chen, Nanning Zheng

    Abstract: In hyperspectral images, some spectral bands suffer from low signal-to-noise ratio due to noisy acquisition and atmospheric effects, thus requiring robust techniques for the unmixing problem. This paper presents a robust supervised spectral unmixing approach for hyperspectral images. The robustness is achieved by writing the unmixing problem as the maximization of the correntropy criterion subject…

    Submitted 4 February, 2016; originally announced February 2016.

    Comments: 23 pages

  23. arXiv:1601.03124  [pdf, other]

    cs.LG stat.ML

    Online Prediction of Dyadic Data with Heterogeneous Matrix Factorization

    Authors: Guangyong Chen, Fengyuan Zhu, Pheng Ann Heng

    Abstract: Dyadic Data Prediction (DDP) is an important problem in many research areas. This paper develops a novel fully Bayesian nonparametric framework which integrates two popular and complementary approaches, discrete mixed membership modeling and continuous latent factor modeling into a unified Heterogeneous Matrix Factorization (HeMF) model, which can predict the unobserved dyadics accurately. The HeM…

    Submitted 12 January, 2016; originally announced January 2016.

    Comments: 26 pages, 10 figures

  24. arXiv:1601.03117  [pdf, other]

    cs.CV stat.ML

    Blind Image Denoising via Dependent Dirichlet Process Tree

    Authors: Fengyuan Zhu, Guangyong Chen, Jianye Hao, Pheng-Ann Heng

    Abstract: Most existing image denoising approaches assume the noise to be homogeneous white Gaussian distributed with known intensity. However, in real noisy images, the noise models are usually unknown beforehand and can be much more complex. This paper addresses this problem and proposes a novel blind image denoising algorithm to recover the clean image from a noisy one with an unknown noise model. To mod…

    Submitted 12 January, 2016; originally announced January 2016.

    Comments: 25 pages, 11 figures

  25. arXiv:1501.05684  [pdf, ps, other]

    stat.ML cs.CV cs.LG math.OC

    Bi-Objective Nonnegative Matrix Factorization: Linear Versus Kernel-Based Models

    Authors: Paul Honeine, Fei Zhu

    Abstract: Nonnegative matrix factorization (NMF) is a powerful class of feature extraction techniques that has been successfully applied in many fields, namely in signal and image processing. Current NMF techniques have been limited to a single-objective problem in either its linear or nonlinear kernel-based formulation. In this paper, we propose to revisit the NMF as a multi-objective problem, in particula…

    Submitted 22 January, 2015; originally announced January 2015.

  26. arXiv:1409.3660  [pdf, other]

    cs.LG cs.CV stat.ML

    10,000+ Times Accelerated Robust Subset Selection (ARSS)

    Authors: Feiyun Zhu, Bin Fan, Xinliang Zhu, Ying Wang, Shiming Xiang, Chunhong Pan

    Abstract: Subset selection from massive data with noisy information is increasingly popular for various applications. This problem is still highly challenging as current methods are generally slow and sensitive to outliers. To address the above two issues, we propose an accelerated robust subset selection (ARSS) method. Specifically in the subset selection area, this is the first attempt to employ…

    Submitted 17 November, 2014; v1 submitted 12 September, 2014; originally announced September 2014.

  27. arXiv:1407.4420  [pdf, ps, other]

    cs.CV cs.IT cs.LG cs.NE stat.ML

    Kernel Nonnegative Matrix Factorization Without the Curse of the Pre-image - Application to Unmixing Hyperspectral Images

    Authors: Fei Zhu, Paul Honeine, Maya Kallas

    Abstract: The nonnegative matrix factorization (NMF) is widely used in signal and image processing, including bio-informatics, blind source separation and hyperspectral image analysis in remote sensing. A great challenge arises when dealing with a nonlinear formulation of the NMF. Within the framework of kernel machines, the models suggested in the literature do not allow the representation of the factoriza…

    Submitted 27 March, 2016; v1 submitted 16 July, 2014; originally announced July 2014.

    Comments: 13 pages, 12 figures

  28. arXiv:1404.7642  [pdf, ps, other]

    stat.AP q-fin.ST

    Predictive regressions for macroeconomic data

    Authors: Fukang Zhu, Zongwu Cai, Liang Peng

    Abstract: Researchers have constantly asked whether stock returns can be predicted by some macroeconomic data. However, it is known that macroeconomic data may exhibit nonstationarity and/or heavy tails, which complicates existing testing procedures for predictability. In this paper we propose novel empirical likelihood methods based on some weighted score equations to test whether the monthly CRSP value-we…

    Submitted 30 April, 2014; originally announced April 2014.

    Comments: Published at http://dx.doi.org/10.1214/13-AOAS708 in the Annals of Applied Statistics (http://www.imstat.org/aoas/) by the Institute of Mathematical Statistics (http://www.imstat.org)

    Report number: IMS-AOAS-AOAS708

    Journal ref: Annals of Applied Statistics 2014, Vol. 8, No. 1, 577-594