Skip to main content

Showing 1–50 of 52 results for author: Xie, M

Searching in archive stat. Search in all archives.
.
  1. arXiv:2505.19923  [pdf, ps, other

    cs.LG stat.ML

    Learning to Trust Bellman Updates: Selective State-Adaptive Regularization for Offline RL

    Authors: Qin-Wen Luo, Ming-Kun Xie, Ye-Wen Wang, Sheng-Jun Huang

    Abstract: Offline reinforcement learning (RL) aims to learn an effective policy from a static dataset. To alleviate extrapolation errors, existing studies often uniformly regularize the value function or policy updates across all states. However, due to substantial variations in data quality, the fixed regularization strength often leads to a dilemma: Weak regularization strength fails to address extrapolat… ▽ More

    Submitted 26 May, 2025; originally announced May 2025.

    Comments: Accepted to ICML 2025

  2. arXiv:2404.09353  [pdf, other

    stat.ME stat.AP stat.ML

    A Unified Combination Framework for Dependent Tests with Applications to Microbiome Association Studies

    Authors: Xiufan Yu, Linjun Zhang, Arun Srinivasan, Min-ge Xie, Lingzhou Xue

    Abstract: We introduce a novel meta-analysis framework to combine dependent tests under a general setting, and utilize it to synthesize various microbiome association tests that are calculated from the same dataset. Our development builds upon the classical meta-analysis methods of aggregating $p$-values and also a more recent general method of combining confidence distributions, but makes generalizations t… ▽ More

    Submitted 14 April, 2024; originally announced April 2024.

  3. arXiv:2403.09984  [pdf, ps, other

    stat.ME

    Repro Samples Method for High-dimensional Logistic Model

    Authors: Xiaotian Hou, Linjun Zhang, Peng Wang, Min-ge Xie

    Abstract: This paper presents a novel method to make statistical inferences for both the model support and regression coefficients in a high-dimensional logistic regression model. Our method is based on the repro samples framework, in which we conduct statistical inference by generating artificial samples mimicking the actual data-generating process. The proposed method has two major advantages. Firstly, fo… ▽ More

    Submitted 14 March, 2024; originally announced March 2024.

  4. arXiv:2402.15004  [pdf, other

    stat.ME math.ST

    Repro Samples Method for a Performance Guaranteed Inference in General and Irregular Inference Problems

    Authors: Minge Xie, Peng Wang

    Abstract: Rapid advancements in data science require us to have fundamentally new frameworks to tackle prevalent but highly non-trivial "irregular" inference problems, to which the large sample central limit theorem does not apply. Typical examples are those involving discrete or non-numerical parameters and those involving non-numerical data, etc. In this article, we present an innovative, wide-reaching, a… ▽ More

    Submitted 22 February, 2024; originally announced February 2024.

  5. arXiv:2310.13178  [pdf, other

    stat.ME

    Exact Inference for Common Odds Ratio in Meta-Analysis with Zero-Total-Event Studies

    Authors: Xiaolin Chen, Jerry Q Cheng, Lu Tian, Minge Xie

    Abstract: Stemming from the high profile publication of Nissen and Wolski (2007) and subsequent discussions with divergent views on how to handle observed zero-total-event studies, defined to be studies which observe zero events in both treatment and control arms, the research topic concerning the common odds ratio model with zero-total-event studies remains to be an unresolved problem in meta-analysis. In… ▽ More

    Submitted 19 October, 2023; originally announced October 2023.

  6. arXiv:2301.12674  [pdf, other

    stat.AP stat.ME

    A Simulation Study of the Performance of Statistical Models for Count Outcomes with Excessive Zeros

    Authors: Zhengyang Zhou, Dateng Li, David Huh, Minge Xie, Eun-Young Mun

    Abstract: Background: Outcome measures that are count variables with excessive zeros are common in health behaviors research. There is a lack of empirical data about the relative performance of prevailing statistical models when outcomes are zero-inflated, particularly compared with recently developed approaches. Methods: The current simulation study examined five commonly used analytical approaches for c… ▽ More

    Submitted 15 August, 2023; v1 submitted 30 January, 2023; originally announced January 2023.

  7. arXiv:2301.00477  [pdf, other

    math.OC stat.ML

    A Sequential Quadratic Programming Method with High Probability Complexity Bounds for Nonlinear Equality Constrained Stochastic Optimization

    Authors: Albert S. Berahas, Miaolan Xie, Baoyu Zhou

    Abstract: A step-search sequential quadratic programming method is proposed for solving nonlinear equality constrained stochastic optimization problems. It is assumed that constraint function values and derivatives are available, but only stochastic approximations of the objective function and its associated derivatives can be computed via inexact probabilistic zeroth- and first-order oracles. Under reasona… ▽ More

    Submitted 5 October, 2024; v1 submitted 1 January, 2023; originally announced January 2023.

    Comments: 29 pages, 2 figures

  8. arXiv:2209.09299  [pdf, other

    stat.ME math.ST stat.CO stat.OT

    Finite- and Large- Sample Inference for Model and Coefficients in High-dimensional Linear Regression with Repro Samples

    Authors: Peng Wang, Min-Ge Xie, Linjun Zhang

    Abstract: In this paper, we present a new and effective simulation-based approach to conduct both finite- and large-sample inference for high-dimensional linear regression models. This approach is developed under the so-called repro samples framework, in which we conduct statistical inference by creating and studying the behavior of artificial samples that are obtained by mimicking the sampling mechanism of… ▽ More

    Submitted 9 December, 2022; v1 submitted 19 September, 2022; originally announced September 2022.

  9. arXiv:2208.02521  [pdf

    stat.ME

    A class of Šidák-type tests based on maximal precedence and exceedance statistic

    Authors: Niladri Chakrabortya, Di Cui, Min Xie

    Abstract: A class of nonparametric two-sample tests has been proposed in this article. As a generalization of the original Šidáks' test, the proposed test statistic is developed as the sum of the maximal precedence and maximal exceedance statistics. Unlike the Šidák-type precedence-exceedance test and the maximal precedence test, the proposed test is suitable for a two-sided alternative while being free fro… ▽ More

    Submitted 4 August, 2022; originally announced August 2022.

  10. arXiv:2207.07988  [pdf, ps, other

    math.ST stat.ME

    Inference of high quantiles of a heavy-tailed distribution from block data

    Authors: Yongcheng Qi, Mengzi Xie, Jingping Yang

    Abstract: In this paper we consider the estimation problem for high quantiles of a heavy-tailed distribution from block data when only a few largest values are observed within blocks. We propose estimators for high quantiles and prove that these estimators are asymptotically normal. Furthermore, we employ empirical likelihood method and adjusted empirical likelihood method to constructing the confidence int… ▽ More

    Submitted 24 June, 2023; v1 submitted 16 July, 2022; originally announced July 2022.

    Comments: 28 pages

  11. arXiv:2207.03935  [pdf, other

    stat.ML cs.LG

    ControlBurn: Nonlinear Feature Selection with Sparse Tree Ensembles

    Authors: Brian Liu, Miaolan Xie, Haoyue Yang, Madeleine Udell

    Abstract: ControlBurn is a Python package to construct feature-sparse tree ensembles that support nonlinear feature selection and interpretable machine learning. The algorithms in this package first build large tree ensembles that prioritize basis functions with few features and then select a feature-sparse subset of these basis functions using a weighted lasso optimization criterion. The package includes v… ▽ More

    Submitted 8 July, 2022; originally announced July 2022.

    Comments: 22 pages

  12. arXiv:2206.06421  [pdf, other

    stat.ME math.ST

    Repro Samples Method for Finite- and Large-Sample Inferences

    Authors: Min-ge Xie, Peng Wang

    Abstract: This article presents a novel, general, and effective simulation-inspired approach, called {\it repro samples method}, to conduct statistical inference. The approach studies the performance of artificial samples, referred to as {\it repro samples}, obtained by mimicking the true observed sample to achieve uncertainty quantification and construct confidence sets for parameters of interest with guar… ▽ More

    Submitted 13 June, 2022; originally announced June 2022.

    MSC Class: 62A99; 62F99; 62G99

  13. arXiv:2206.01707  [pdf, other

    stat.ME math.ST

    Approximate confidence distribution computing

    Authors: Suzanne Thornton, Wentao Li, Minge Xie

    Abstract: Approximate confidence distribution computing (ACDC) offers a new take on the rapidly developing field of likelihood-free inference from within a frequentist framework. The appeal of this computational method for statistical inference hinges upon the concept of a confidence distribution, a special type of estimator which is defined with respect to the repeated sampling principle. An ACDC method pr… ▽ More

    Submitted 12 October, 2022; v1 submitted 3 June, 2022; originally announced June 2022.

    Comments: Supplementary material available upon request

  14. arXiv:2204.12724  [pdf, other

    stat.ME

    Semiparametric transformation Model with measurement error in Covariates: An Instrumental variable approach

    Authors: Sudheesh K. K., Deemat C. Mathew, Litty Mathew, Min Xie

    Abstract: Linear transformation model provides a general framework for analyzing censored survival data with covariates. The proportional hazards and proportional odds models are special cases of the linear transformation model. In biomedical studies, covariates with measurement error may occur in survival data. In this work, we propose a method to obtain estimators of the regression coefficients in the lin… ▽ More

    Submitted 9 May, 2022; v1 submitted 27 April, 2022; originally announced April 2022.

    Comments: We proposed an instrumental variable approach to analyze the linear transformation model when covariates are measured with error

  15. arXiv:2203.16330  [pdf

    q-bio.PE stat.AP

    Cointegration of SARS-CoV-2 Transmission with Weather Conditions and Mobility during the First Year of the COVID-19 Pandemic in the United States

    Authors: Hong Qin, Syed Tareq, William Torres, Megan Doman, Cleo Falvey, Jamaree Moore, Meng Hsiu Tsai, Yingfeng Wang, Azad Hossain, Mengjun Xie, Li Yang

    Abstract: Correlation between weather and the transmission of SARS-CoV-2 may suggest its seasonality. Cointegration analysis can avoid spurious correlation among time series data. We examined the cointegration of virus transmission with daily temperature, dewpoint, and confounding factors of mobility measurements during the first year of the pandemic in the United States. We examined the cointegration of th… ▽ More

    Submitted 30 March, 2022; v1 submitted 28 March, 2022; originally announced March 2022.

    Comments: 6 pages, 3 figures

  16. arXiv:2112.05090  [pdf, other

    cs.LG cs.AI cs.CV stat.ML

    Extending the WILDS Benchmark for Unsupervised Adaptation

    Authors: Shiori Sagawa, Pang Wei Koh, Tony Lee, Irena Gao, Sang Michael Xie, Kendrick Shen, Ananya Kumar, Weihua Hu, Michihiro Yasunaga, Henrik Marklund, Sara Beery, Etienne David, Ian Stavness, Wei Guo, Jure Leskovec, Kate Saenko, Tatsunori Hashimoto, Sergey Levine, Chelsea Finn, Percy Liang

    Abstract: Machine learning systems deployed in the wild are often trained on a source distribution but deployed on a different target distribution. Unlabeled data can be a powerful point of leverage for mitigating these distribution shifts, as it is frequently much more available than labeled data and can often be obtained from distributions beyond the source distribution as well. However, existing distribu… ▽ More

    Submitted 23 April, 2022; v1 submitted 9 December, 2021; originally announced December 2021.

  17. arXiv:2111.13302  [pdf, ps, other

    cond-mat.dis-nn cond-mat.stat-mech stat.ML

    Equivalence between algorithmic instability and transition to replica symmetry breaking in perceptron learning systems

    Authors: Yang Zhao, Junbin Qiu, Mingshan Xie, Haiping Huang

    Abstract: Binary perceptron is a fundamental model of supervised learning for the non-convex optimization, which is a root of the popular deep learning. Binary perceptron is able to achieve a classification of random high-dimensional data by computing the marginal probabilities of binary synapses. The relationship between the algorithmic instability and the equilibrium analysis of the model remains elusive.… ▽ More

    Submitted 7 March, 2022; v1 submitted 25 November, 2021; originally announced November 2021.

    Comments: 24 pages, 2 figures, revision to journal

    Journal ref: Phys. Rev. Research 4, 023023 (2022)

  18. arXiv:2109.01898  [pdf, other

    stat.ME

    Confidence Distribution and Distribution Estimation for Modern Statistical Inference

    Authors: Yifan Cui, Min-ge Xie

    Abstract: This paper introduces to readers the new concept and methodology of confidence distribution and the modern-day distributional inference in statistics. This discussion should be of interest to people who would like to go into the depth of the statistical inference methodology and to utilize distribution estimators in practice. We also include in the discussion the topic of generalized fiducial infe… ▽ More

    Submitted 4 September, 2021; originally announced September 2021.

    Comments: To appear as a chapter in Springer Handbook of Engineering Statistics, 2nd ed

  19. ControlBurn: Feature Selection by Sparse Forests

    Authors: Brian Liu, Miaolan Xie, Madeleine Udell

    Abstract: Tree ensembles distribute feature importance evenly amongst groups of correlated features. The average feature ranking of the correlated group is suppressed, which reduces interpretability and complicates feature selection. In this paper we present ControlBurn, a feature selection algorithm that uses a weighted LASSO-based feature selection method to prune unnecessary features from tree ensembles,… ▽ More

    Submitted 1 July, 2021; originally announced July 2021.

    Comments: 15 pages

  20. arXiv:2106.09226  [pdf, other

    cs.LG stat.ML

    Why Do Pretrained Language Models Help in Downstream Tasks? An Analysis of Head and Prompt Tuning

    Authors: Colin Wei, Sang Michael Xie, Tengyu Ma

    Abstract: Pretrained language models have achieved state-of-the-art performance when adapted to a downstream NLP task. However, theoretical analysis of these models is scarce and challenging since the pretraining and downstream tasks can be very different. We propose an analysis framework that links the pretraining and downstream tasks with an underlying latent variable generative model of text -- the downs… ▽ More

    Submitted 20 April, 2022; v1 submitted 16 June, 2021; originally announced June 2021.

  21. arXiv:2105.07338  [pdf, ps, other

    cs.LG stat.ML

    CCMN: A General Framework for Learning with Class-Conditional Multi-Label Noise

    Authors: Ming-Kun Xie, Sheng-Jun Huang

    Abstract: Class-conditional noise commonly exists in machine learning tasks, where the class label is corrupted with a probability depending on its ground-truth. Many research efforts have been made to improve the model robustness against the class-conditional noise. However, they typically focus on the single label case by assuming that only one label is corrupted. In real applications, an instance is usua… ▽ More

    Submitted 15 May, 2021; originally announced May 2021.

    Comments: 18 pages

  22. arXiv:2102.10771  [pdf, ps, other

    stat.ML cs.LG

    Divide-and-conquer methods for big data analysis

    Authors: Xueying Chen, Jerry Q. Cheng, Min-ge Xie

    Abstract: In the context of big data analysis, the divide-and-conquer methodology refers to a multiple-step process: first splitting a data set into several smaller ones; then analyzing each set separately; finally combining results from each analysis together. This approach is effective in handling large data sets that are unsuitable to be analyzed entirely by a single computer due to limits either from me… ▽ More

    Submitted 21 February, 2021; originally announced February 2021.

  23. arXiv:2012.04550  [pdf, other

    cs.LG stat.ML

    In-N-Out: Pre-Training and Self-Training using Auxiliary Information for Out-of-Distribution Robustness

    Authors: Sang Michael Xie, Ananya Kumar, Robbie Jones, Fereshte Khani, Tengyu Ma, Percy Liang

    Abstract: Consider a prediction setting with few in-distribution labeled examples and many unlabeled examples both in- and out-of-distribution (OOD). The goal is to learn a model which performs well both in-distribution and OOD. In these settings, auxiliary information is often cheaply available for every input. How should we best leverage this auxiliary information for the prediction task? Empirically acro… ▽ More

    Submitted 7 April, 2021; v1 submitted 8 December, 2020; originally announced December 2020.

    Comments: ICLR 2021

  24. arXiv:2012.04464  [pdf, other

    stat.ME math.ST

    Bridging Bayesian, frequentist and fiducial (BFF) inferences using confidence distribution

    Authors: Suzanne Thornton, Minge Xie

    Abstract: Bayesian, frequentist and fiducial (BFF) inferences are much more congruous than they have been perceived historically in the scientific community (cf., Reid and Cox 2015; Kass 2011; Efron 1998). Most practitioners are probably more familiar with the two dominant statistical inferential paradigms, Bayesian inference and frequentist inference. The third, lesser known fiducial inference paradigm was… ▽ More

    Submitted 15 June, 2022; v1 submitted 8 December, 2020; originally announced December 2020.

    Comments: 30 pages, 5 figures, Handbook on Bayesian Fiducial and Frequentist (BFF) Inferences

    MSC Class: 62-00 (Primary) 62A01 (Secondary)

  25. arXiv:2011.07047  [pdf, other

    stat.ME

    Nonparametric fusion learning: synthesize inferences from diverse sources using depth confidence distribution

    Authors: Dungang Liu, Regina Y. Liu, Minge Xie

    Abstract: Fusion learning refers to synthesizing inferences from multiple sources or studies to provide more effective inference and prediction than from any individual source or study alone. Most existing methods for synthesizing inferences rely on parametric model assumptions, such as normality, which often do not hold in practice. In this paper, we propose a general nonparametric fusion learning framewor… ▽ More

    Submitted 13 November, 2020; originally announced November 2020.

    Comments: 47 pages, 10 figures

  26. arXiv:2009.10265  [pdf, other

    stat.AP stat.ME

    A Bias Correction Method in Meta-analysis of Randomized Clinical Trials with no Adjustments for Zero-inflated Outcomes

    Authors: Zhengyang Zhou, Minge Xie, David Huh, Eun-Young Mun

    Abstract: Many clinical endpoint measures, such as the number of standard drinks consumed per week or the number of days that patients stayed in the hospital, are count data with excessive zeros. However, the zero-inflated nature of such outcomes is sometimes ignored in analyses of clinical trials. This leads to biased estimates of study-level intervention effect and, consequently, a biased estimate of the… ▽ More

    Submitted 25 June, 2021; v1 submitted 21 September, 2020; originally announced September 2020.

  27. arXiv:2008.09148  [pdf, other

    cs.LG stat.ML

    Towards adversarial robustness with 01 loss neural networks

    Authors: Yunzhe Xue, Meiyan Xie, Usman Roshan

    Abstract: Motivated by the general robustness properties of the 01 loss we propose a single hidden layer 01 loss neural network trained with stochastic coordinate descent as a defense against adversarial attacks in machine learning. One measure of a model's robustness is the minimum distortion required to make the input adversarial. This can be approximated with the Boundary Attack (Brendel et. al. 2018) an… ▽ More

    Submitted 20 August, 2020; originally announced August 2020.

    Comments: arXiv admin note: text overlap with arXiv:2006.07800

  28. arXiv:2006.16312  [pdf, other

    cs.LG cs.DS cs.IR eess.SY stat.ML

    Dynamic Knapsack Optimization Towards Efficient Multi-Channel Sequential Advertising

    Authors: Xiaotian Hao, Zhaoqing Peng, Yi Ma, Guan Wang, Junqi Jin, Jianye Hao, Shan Chen, Rongquan Bai, Mingzhou Xie, Miao Xu, Zhenzhe Zheng, Chuan Yu, Han Li, Jian Xu, Kun Gai

    Abstract: In E-commerce, advertising is essential for merchants to reach their target users. The typical objective is to maximize the advertiser's cumulative revenue over a period of time under a budget constraint. In real applications, an advertisement (ad) usually needs to be exposed to the same user multiple times until the user finally contributes revenue (e.g., places an order). However, existing adver… ▽ More

    Submitted 29 June, 2020; originally announced June 2020.

    Comments: accepted by ICML 2020

  29. arXiv:2006.16205  [pdf, other

    cs.LG stat.ML

    Composed Fine-Tuning: Freezing Pre-Trained Denoising Autoencoders for Improved Generalization

    Authors: Sang Michael Xie, Tengyu Ma, Percy Liang

    Abstract: We focus on prediction problems with structured outputs that are subject to output validity constraints, e.g. pseudocode-to-code translation where the code must compile. While labeled input-output pairs are expensive to obtain, "unlabeled" outputs, i.e. outputs without corresponding inputs, are freely available (e.g. code on GitHub) and provide information about output validity. We can capture the… ▽ More

    Submitted 24 October, 2023; v1 submitted 29 June, 2020; originally announced June 2020.

    Comments: ICML 2021 Long talk

  30. arXiv:2006.07800  [pdf, other

    cs.LG stat.ML

    On the transferability of adversarial examples between convex and 01 loss models

    Authors: Yunzhe Xue, Meiyan Xie, Usman Roshan

    Abstract: The 01 loss gives different and more accurate boundaries than convex loss models in the presence of outliers. Could the difference of boundaries translate to adversarial examples that are non-transferable between 01 loss and convex models? We explore this empirically in this paper by studying transferability of adversarial examples between linear 01 loss and convex (hinge) loss models, and between… ▽ More

    Submitted 29 July, 2020; v1 submitted 14 June, 2020; originally announced June 2020.

  31. arXiv:2005.01026  [pdf, other

    cs.LG cs.DC stat.ML

    Multi-Center Federated Learning: Clients Clustering for Better Personalization

    Authors: Guodong Long, Ming Xie, Tao Shen, Tianyi Zhou, Xianzhi Wang, Jing Jiang, Chengqi Zhang

    Abstract: Federated learning has received great attention for its capability to train a large-scale model in a decentralized manner without needing to access user data directly. It helps protect the users' private data from centralized collecting. Unlike distributed machine learning, federated learning aims to tackle non-IID data from heterogeneous sources in various real-world applications, such as those o… ▽ More

    Submitted 5 February, 2023; v1 submitted 3 May, 2020; originally announced May 2020.

    Comments: This paper has two duplicated versions: 2005.01026 and 2108.08647. The first one 2005.01026 is the right one, and the second one 2108.08647 should be deleted because it always causes misoperating

    Journal ref: World Wide Web,26,(2003),481-500

  32. arXiv:2004.08472  [pdf, other

    stat.ME math.ST

    Leveraging the Fisher randomization test using confidence distributions: inference, combination and fusion learning

    Authors: Xiaokang Luo, Tirthankar Dasgupta, Minge Xie, Regina Liu

    Abstract: The flexibility and wide applicability of the Fisher randomization test (FRT) makes it an attractive tool for assessment of causal effects of interventions from modern-day randomized experiments that are increasing in size and complexity. This paper provides a theoretical inferential framework for FRT by establishing its connection with confidence distributions Such a connection leads to developme… ▽ More

    Submitted 17 April, 2020; originally announced April 2020.

  33. arXiv:2002.10716  [pdf, other

    cs.LG stat.ML

    Understanding and Mitigating the Tradeoff Between Robustness and Accuracy

    Authors: Aditi Raghunathan, Sang Michael Xie, Fanny Yang, John Duchi, Percy Liang

    Abstract: Adversarial training augments the training set with perturbations to improve the robust error (over worst-case perturbations), but it often leads to an increase in the standard error (on unperturbed test inputs). Previous explanations for this tradeoff rely on the assumption that no predictor in the hypothesis class has low standard and robust error. In this work, we precisely characterize the eff… ▽ More

    Submitted 6 July, 2020; v1 submitted 25 February, 2020; originally announced February 2020.

    Comments: Appearing at International Conference on Machine Learning (ICML) 2020

  34. arXiv:2002.03444  [pdf, other

    cs.LG stat.ML

    Robust binary classification with the 01 loss

    Authors: Yunzhe Xue, Meiyan Xie, Usman Roshan

    Abstract: The 01 loss is robust to outliers and tolerant to noisy data compared to convex loss functions. We conjecture that the 01 loss may also be more robust to adversarial attacks. To study this empirically we have developed a stochastic coordinate descent algorithm for a linear 01 loss classifier and a single hidden layer 01 loss neural network. Due to the absence of the gradient we iteratively update… ▽ More

    Submitted 9 February, 2020; originally announced February 2020.

  35. arXiv:2001.11945  [pdf, other

    stat.ME

    p-Value as the Strength of Evidence Measured by Confidence Distribution

    Authors: Sifan Liu, Regina Liu, Min-ge Xie

    Abstract: The notion of p-value is a fundamental concept in statistical inference and has been widely used for reporting outcomes of hypothesis tests. However, p-value is often misinterpreted, misused or miscommunicated in practice. Part of the issue is that existing definitions of p-value are often derived from constructions under specific settings, and a general definition that directly reflects the evide… ▽ More

    Submitted 31 January, 2020; originally announced January 2020.

    Comments: 30 pages, 8 figures

    MSC Class: 62F03 (Primary) 62H15 (Secondary)

  36. arXiv:2001.09595  [pdf, other

    cs.LG cs.IR stat.ML

    Developing Multi-Task Recommendations with Long-Term Rewards via Policy Distilled Reinforcement Learning

    Authors: Xi Liu, Li Li, Ping-Chun Hsieh, Muhe Xie, Yong Ge, Rui Chen

    Abstract: With the explosive growth of online products and content, recommendation techniques have been considered as an effective tool to overcome information overload, improve user experience, and boost business revenue. In recent years, we have observed a new desideratum of considering long-term rewards of multiple related recommendation tasks simultaneously. The consideration of long-term rewards is str… ▽ More

    Submitted 27 January, 2020; originally announced January 2020.

  37. arXiv:2001.08336  [pdf, other

    math.ST stat.ME

    Geometric Conditions for the Discrepant Posterior Phenomenon and Connections to Simpson's Paradox

    Authors: Yang Chen, Ruobin Gong, Min-ge Xie

    Abstract: The discrepant posterior phenomenon (DPP) is a counter-intuitive phenomenon that can frequently occur in a Bayesian analysis of multivariate parameters. It refers to the phenomenon that a parameter estimate based on a posterior is more extreme than both of those inferred based on either the prior or the likelihood alone. Inferential claims that exhibit DPP defy the common intuition that the poster… ▽ More

    Submitted 12 January, 2022; v1 submitted 22 January, 2020; originally announced January 2020.

  38. arXiv:1906.06032  [pdf, other

    cs.LG stat.ML

    Adversarial Training Can Hurt Generalization

    Authors: Aditi Raghunathan, Sang Michael Xie, Fanny Yang, John C. Duchi, Percy Liang

    Abstract: While adversarial training can improve robust accuracy (against an adversary), it sometimes hurts standard accuracy (when there is no adversary). Previous work has studied this tradeoff between standard and robust accuracy, but only in the setting where no predictor performs well on both objectives in the infinite data limit. In this paper, we show that even when the optimal predictor with infinit… ▽ More

    Submitted 26 August, 2019; v1 submitted 14 June, 2019; originally announced June 2019.

  39. arXiv:1906.05533  [pdf, other

    stat.ME

    Individualized Group Learning

    Authors: Chencheng Cai, Rong Chen, Min-ge Xie

    Abstract: Many massive data are assembled through collections of information of a large number of individuals in a population. The analysis of such data, especially in the aspect of individualized inferences and solutions, has the potential to create significant value for practical applications. Traditionally, inference for an individual in the data set is either solely relying on the information of the ind… ▽ More

    Submitted 16 September, 2019; v1 submitted 13 June, 2019; originally announced June 2019.

  40. arXiv:1901.10517  [pdf, other

    cs.LG stat.ML

    Reparameterizable Subset Sampling via Continuous Relaxations

    Authors: Sang Michael Xie, Stefano Ermon

    Abstract: Many machine learning tasks require sampling a subset of items from a collection based on a parameterized distribution. The Gumbel-softmax trick can be used to sample a single item, and allows for low-variance reparameterized gradients with respect to the parameters of the underlying distribution. However, stochastic optimization involving subset sampling is typically not reparameterizable. To ove… ▽ More

    Submitted 26 February, 2021; v1 submitted 29 January, 2019; originally announced January 2019.

    Comments: IJCAI 2019

  41. arXiv:1901.06247  [pdf, other

    cs.LG stat.ML

    Micro- and Macro-Level Churn Analysis of Large-Scale Mobile Games

    Authors: Xi Liu, Muhe Xie, Xidao Wen, Rui Chen, Yong Ge, Nick Duffield, Na Wang

    Abstract: As mobile devices become more and more popular, mobile gaming has emerged as a promising market with billion-dollar revenues. A variety of mobile game platforms and services have been developed around the world. A critical challenge for these platforms and services is to understand the churn behavior in mobile games, which usually involves churn at micro level (between an app and a specific user)… ▽ More

    Submitted 14 January, 2019; originally announced January 2019.

    Comments: arXiv admin note: substantial text overlap with arXiv:1808.06573

  42. arXiv:1811.05932  [pdf, ps, other

    cs.LG cs.SI stat.ML

    Streaming Network Embedding through Local Actions

    Authors: Xi Liu, Ping-Chun Hsieh, Nick Duffield, Rui Chen, Muhe Xie, Xidao Wen

    Abstract: Recently, considerable research attention has been paid to network embedding, a popular approach to construct feature vectors of vertices. Due to the curse of dimensionality and sparsity in graphical datasets, this approach has become indispensable for machine learning tasks over large networks. The majority of existing literature has considered this technique under the assumption that the network… ▽ More

    Submitted 14 November, 2018; originally announced November 2018.

  43. arXiv:1808.06573  [pdf, other

    cs.LG stat.ML

    A Semi-Supervised and Inductive Embedding Model for Churn Prediction of Large-Scale Mobile Games

    Authors: Xi Liu, Muhe Xie, Xidao Wen, Rui Chen, Yong Ge, Nick Duffield, Na Wang

    Abstract: Mobile gaming has emerged as a promising market with billion-dollar revenues. A variety of mobile game platforms and services have been developed around the world. One critical challenge for these platforms and services is to understand user churn behavior in mobile games. Accurate churn prediction will benefit many stakeholders such as game developers, advertisers, and platform operators. In this… ▽ More

    Submitted 10 October, 2018; v1 submitted 20 August, 2018; originally announced August 2018.

    Comments: to appear in ICDM 2018

  44. arXiv:1805.10407  [pdf, other

    cs.LG cs.AI stat.ML

    Semi-supervised Deep Kernel Learning: Regression with Unlabeled Data by Minimizing Predictive Variance

    Authors: Neal Jean, Sang Michael Xie, Stefano Ermon

    Abstract: Large amounts of labeled data are typically required to train deep learning models. For many real-world problems, however, acquiring additional data can be expensive or even impossible. We present semi-supervised deep kernel learning (SSDKL), a semi-supervised regression model based on minimizing predictive variance in the posterior regularization framework. SSDKL combines the hierarchical represe… ▽ More

    Submitted 4 March, 2019; v1 submitted 25 May, 2018; originally announced May 2018.

    Comments: In Proceedings of Neural Information Processing Systems (NeurIPS) 2018

  45. arXiv:1805.09757  [pdf, other

    cs.LG stat.ML

    Geographical Hidden Markov Tree for Flood Extent Mapping (With Proof Appendix)

    Authors: Miao Xie, Zhe Jiang, Arpan Man Sainju

    Abstract: Flood extent mapping plays a crucial role in disaster management and national water forecasting. Unfortunately, traditional classification methods are often hampered by the existence of noise, obstacles and heterogeneity in spectral features as well as implicit anisotropic spatial dependency across class labels. In this paper, we propose geographical hidden Markov tree, a probabilistic graphical m… ▽ More

    Submitted 24 May, 2018; originally announced May 2018.

  46. arXiv:1802.05380  [pdf, other

    cs.LG stat.ML

    Active Feature Acquisition with Supervised Matrix Completion

    Authors: Sheng-Jun Huang, Miao Xu, Ming-Kun Xie, Masashi Sugiyama, Gang Niu, Songcan Chen

    Abstract: Feature missing is a serious problem in many applications, which may lead to low quality of training data and further significantly degrade the learning performance. While feature acquisition usually involves special devices or complex process, it is expensive to acquire all feature values for the whole dataset. On the other hand, features may be correlated with each other, and some values may be… ▽ More

    Submitted 4 June, 2018; v1 submitted 14 February, 2018; originally announced February 2018.

    Comments: 9 pages, 8 figures

  47. arXiv:1802.03511  [pdf, other

    stat.ME

    A General Framework For Frequentist Model Averaging

    Authors: Priyam Mitra, Heng Lian, Ritwik Mitra, Hua Liang, Min-ge Xie

    Abstract: Model selection strategies have been routinely employed to determine a model for data analysis in statistics, and further study and inference then often proceed as though the selected model were the true model that were known a priori. This practice does not account for the uncertainty introduced by the selection process and the fact that the selected model can possibly be a wrong one. Model avera… ▽ More

    Submitted 9 February, 2018; originally announced February 2018.

  48. arXiv:1705.10347  [pdf, ps, other

    stat.CO math.ST

    An effective likelihood-free approximate computing method with statistical inferential guarantees

    Authors: Suzanne Thornton, Wentao Li, Min-ge Xie

    Abstract: Approximate Bayesian computing is a powerful likelihood-free method that has grown increasingly popular since early applications in population genetics. However, complications arise in the theoretical justification for Bayesian inference conducted from this method with a non-sufficient summary statistic. In this paper, we seek to re-frame approximate Bayesian computing within a frequentist context… ▽ More

    Submitted 30 November, 2018; v1 submitted 29 May, 2017; originally announced May 2017.

  49. Dynamic dependence networks: Financial time series forecasting and portfolio decisions (with discussion)

    Authors: Zoey Yi Zhao, Meng Xie, Mike West

    Abstract: We discuss Bayesian forecasting of increasingly high-dimensional time series, a key area of application of stochastic dynamic models in the financial industry and allied areas of business. Novel state-space models characterizing sparse patterns of dependence among multiple time series extend existing multivariate volatility models to enable scaling to higher numbers of individual time series. The… ▽ More

    Submitted 27 June, 2016; originally announced June 2016.

    Comments: 31 pages, 9 figures, 3 tables

    MSC Class: 62M10; 62F15; 62P20

    Journal ref: Applied Stochastic Models in Business and Industry, 32, 311-339, 2016

  50. arXiv:1505.05184  [pdf

    math.OC stat.AP

    Multi-Objective Optimization of a Port-of-Entry Inspection Policy

    Authors: Christina M. Young, Mingyu Li, Yada Zhu, Minge Xie, Elsayed A. Elsayed, Tsvetan Asamov

    Abstract: At the port-of-entry containers are inspected through a specific sequence of sensor stations to detect the presence of nuclear materials, biological and chemical agents, and other illegal cargo. The inspection policy, which includes the sequence in which sensors are applied and the threshold levels used at the inspection stations, affects the probability of misclassifying a container as well as th… ▽ More

    Submitted 19 May, 2015; originally announced May 2015.