Skip to main content

Showing 1–50 of 61 results for author: Yuan, X

Searching in archive stat. Search in all archives.
.
  1. arXiv:2508.20803  [pdf, ps, other

    stat.CO stat.ME

    Optional subsampling for generalized estimating equations in growing-dimensional longitudinal Data

    Authors: Chunjing Li, Jiahui Zhang, Xiaohui Yuan

    Abstract: As a powerful tool for longitudinal data analysis, the generalized estimating equations have been widely studied in the academic community. However, in large-scale settings, this approach faces pronounced computational and storage challenges. In this paper, we propose an optimal Poisson subsampling algorithm for generalized estimating equations in large-scale longitudinal data with diverging covar… ▽ More

    Submitted 28 August, 2025; originally announced August 2025.

    Comments: 34 pages, 5 figures

  2. arXiv:2508.13359  [pdf

    stat.AP cs.CE math.PR

    Unified Modelling of Infrastructure Asset Performance Deterioration -- a bounded gamma process approach

    Authors: Wang Chen, Arnold X. -X. Yuan

    Abstract: Infrastructure asset management systems require a flexible deterioration model that can handle various degradation patterns in a unified way. Owing to its appealing monotonic sample paths, independent increments and mathematical tractability, gamma process has been widely employed as an infrastructure performance deterioration model. This model was recently enhanced by introducing an upper bound t… ▽ More

    Submitted 18 August, 2025; originally announced August 2025.

  3. arXiv:2506.13955  [pdf, ps, other

    stat.ML cs.CR cs.LG stat.AP

    Bridging Unsupervised and Semi-Supervised Anomaly Detection: A Theoretically-Grounded and Practical Framework with Synthetic Anomalies

    Authors: Matthew Lau, Tian-Yi Zhou, Xiangchi Yuan, Jizhou Chen, Wenke Lee, Xiaoming Huo

    Abstract: Anomaly detection (AD) is a critical task across domains such as cybersecurity and healthcare. In the unsupervised setting, an effective and theoretically-grounded principle is to train classifiers to distinguish normal data from (synthetic) anomalies. We extend this principle to semi-supervised AD, where training data also include a limited labeled subset of anomalies possibly present in test tim… ▽ More

    Submitted 16 June, 2025; originally announced June 2025.

  4. arXiv:2501.12314  [pdf, other

    stat.ML cs.LG

    Uncertainty Quantification With Noise Injection in Neural Networks: A Bayesian Perspective

    Authors: Xueqiong Yuan, Jipeng Li, Ercan Engin Kuruoglu

    Abstract: Model uncertainty quantification involves measuring and evaluating the uncertainty linked to a model's predictions, helping assess their reliability and confidence. Noise injection is a technique used to enhance the robustness of neural networks by introducing randomness. In this paper, we establish a connection between noise injection and uncertainty quantification from a Bayesian standpoint. We… ▽ More

    Submitted 23 April, 2025; v1 submitted 21 January, 2025; originally announced January 2025.

  5. arXiv:2406.19691  [pdf, other

    stat.ME

    Optimal subsampling for functional composite quantile regression in massive data

    Authors: Jingxiang Pan, Xiaohui Yuan, Xiaohui Yuan

    Abstract: As computer resources become increasingly limited, traditional statistical methods face challenges in analyzing massive data, especially in functional data analysis. To address this issue, subsampling offers a viable solution by significantly reducing computational requirements. This paper introduces a subsampling technique for composite quantile regression, designed for efficient application with… ▽ More

    Submitted 28 June, 2024; originally announced June 2024.

  6. arXiv:2406.17567  [pdf, other

    stat.ME stat.OT

    Transfer Learning for High Dimensional Robust Regression

    Authors: Xiaohui Yuan, Shujie Ren

    Abstract: Transfer learning has become an essential technique for utilizing information from source datasets to improve the performance of the target task. However, in the context of high-dimensional data, heterogeneity arises due to heteroscedastic variance or inhomogeneous covariate effects. To solve this problem, this paper proposes a robust transfer learning based on the Huber regression, specifically d… ▽ More

    Submitted 25 June, 2024; originally announced June 2024.

  7. arXiv:2406.04690  [pdf, other

    cs.LG stat.ML

    Higher-order Structure Based Anomaly Detection on Attributed Networks

    Authors: Xu Yuan, Na Zhou, Shuo Yu, Huafei Huang, Zhikui Chen, Feng Xia

    Abstract: Anomaly detection (such as telecom fraud detection and medical image detection) has attracted the increasing attention of people. The complex interaction between multiple entities widely exists in the network, which can reflect specific human behavior patterns. Such patterns can be modeled by higher-order network structures, thus benefiting anomaly detection on attributed networks. However, due to… ▽ More

    Submitted 7 June, 2024; originally announced June 2024.

  8. arXiv:2405.00917  [pdf, other

    stat.ME

    Semiparametric mean and variance joint models with clipped-Laplace link functions for bounded integer-valued time series

    Authors: Tianqing Liu, Xiaohui Yuan

    Abstract: We present a novel approach for modeling bounded count time series data, by deriving accurate upper and lower bounds for the variance of a bounded count random variable while maintaining a fixed mean. Leveraging these bounds, we propose semiparametric mean and variance joint (MVJ) models utilizing a clipped-Laplace link function. These models offer a flexible and feasible structure for both mean a… ▽ More

    Submitted 1 May, 2024; originally announced May 2024.

    Comments: arXiv admin note: text overlap with arXiv:2404.18421

  9. arXiv:2404.18421  [pdf, other

    stat.ME math.ST

    Semiparametric mean and variance joint models with Laplace link functions for count time series

    Authors: Tianqing Liu, Xiaohui Yuan

    Abstract: Count time series data are frequently analyzed by modeling their conditional means and the conditional variance is often considered to be a deterministic function of the corresponding conditional mean and is not typically modeled independently. We propose a semiparametric mean and variance joint model, called random rounded count-valued generalized autoregressive conditional heteroskedastic (RRC-G… ▽ More

    Submitted 29 April, 2024; originally announced April 2024.

  10. arXiv:2401.05394  [pdf, other

    eess.SP cs.LG math.OC stat.ML

    Iterative Regularization with k-support Norm: An Important Complement to Sparse Recovery

    Authors: William de Vazelhes, Bhaskar Mukhoty, Xiao-Tong Yuan, Bin Gu

    Abstract: Sparse recovery is ubiquitous in machine learning and signal processing. Due to the NP-hard nature of sparse recovery, existing methods are known to suffer either from restrictive (or even unknown) applicability conditions, or high computational cost. Recently, iterative regularization methods have emerged as a promising fast approach because they can achieve sparse recovery in one pass through ea… ▽ More

    Submitted 19 March, 2024; v1 submitted 19 December, 2023; originally announced January 2024.

    Comments: Accepted at AAAI 2024. Code at https://github.com/wdevazelhes/IRKSN_AAAI2024

  11. arXiv:2301.03125  [pdf, ps, other

    stat.ML cs.LG math.OC

    Sharper Analysis for Minibatch Stochastic Proximal Point Methods: Stability, Smoothness, and Deviation

    Authors: Xiao-Tong Yuan, Ping Li

    Abstract: The stochastic proximal point (SPP) methods have gained recent attention for stochastic optimization, with strong convergence guarantees and superior robustness to the classic stochastic gradient descent (SGD) methods showcased at little to no cost of computational overhead added. In this article, we study a minibatch variant of SPP, namely M-SPP, for solving convex composite risk minimization pro… ▽ More

    Submitted 8 January, 2023; originally announced January 2023.

  12. arXiv:2301.02448  [pdf, other

    stat.CO

    Optimal subsampling algorithm for composite quantile regression with distributed data

    Authors: Xiaohui Yuan, Shiting Zhou, Yue Wang

    Abstract: For massive data stored at multiple machines, we propose a distributed subsampling procedure for the composite quantile regression. By establishing the consistency and asymptotic normality of the composite quantile regression estimator from a general subsampling algorithm, we derive the optimal subsampling probabilities and the optimal allocation sizes under the L-optimality criteria. A two-step a… ▽ More

    Submitted 6 January, 2023; originally announced January 2023.

    Comments: 29 pages, 8 figures, 7 tables

  13. arXiv:2206.05187  [pdf, other

    stat.ML cs.LG

    On Convergence of FedProx: Local Dissimilarity Invariant Bounds, Non-smoothness and Beyond

    Authors: Xiao-Tong Yuan, Ping Li

    Abstract: The FedProx algorithm is a simple yet powerful distributed proximal point optimization method widely used for federated learning (FL) over heterogeneous data. Despite its popularity and remarkable success witnessed in practice, the theoretical understanding of FedProx is largely underinvestigated: the appealing convergence behavior of FedProx is so far characterized under certain non-standard and… ▽ More

    Submitted 10 June, 2022; originally announced June 2022.

  14. arXiv:2206.03834  [pdf, ps, other

    stat.ML cs.LG

    Boosting the Confidence of Generalization for $L_2$-Stable Randomized Learning Algorithms

    Authors: Xiao-Tong Yuan, Ping Li

    Abstract: Exponential generalization bounds with near-tight rates have recently been established for uniformly stable learning algorithms. The notion of uniform stability, however, is stringent in the sense that it is invariant to the data-generating distribution. Under the weaker and distribution dependent notions of stability such as hypothesis stability and $L_2$-stability, the literature suggests that o… ▽ More

    Submitted 8 June, 2022; originally announced June 2022.

  15. arXiv:2203.09413  [pdf, ps, other

    stat.ML cs.LG eess.SP

    Stability and Risk Bounds of Iterative Hard Thresholding

    Authors: Xiao-Tong Yuan, Ping Li

    Abstract: In this paper, we analyze the generalization performance of the Iterative Hard Thresholding (IHT) algorithm widely used for sparse recovery problems. The parameter estimation and sparsity recovery consistency of IHT has long been known in compressed sensing. From the perspective of statistical learning, another fundamental question is how well the IHT estimation would predict on unseen data. This… ▽ More

    Submitted 17 March, 2022; originally announced March 2022.

  16. arXiv:2202.03335  [pdf, other

    cs.CR cs.LG stat.ML

    Membership Inference Attacks and Defenses in Neural Network Pruning

    Authors: Xiaoyong Yuan, Lan Zhang

    Abstract: Neural network pruning has been an essential technique to reduce the computation and memory requirements for using deep neural networks for resource-constrained devices. Most existing research focuses primarily on balancing the sparsity and accuracy of a pruned neural network by strategically removing insignificant parameters and retraining the pruned model. Such efforts on reusing training sample… ▽ More

    Submitted 3 August, 2022; v1 submitted 7 February, 2022; originally announced February 2022.

    Comments: This paper has been accepted to USENIX Security Symposium 2022. This is an extended version with more experimental results

  17. arXiv:2109.03775  [pdf, other

    cs.LG stat.ML

    FedZKT: Zero-Shot Knowledge Transfer towards Resource-Constrained Federated Learning with Heterogeneous On-Device Models

    Authors: Lan Zhang, Dapeng Wu, Xiaoyong Yuan

    Abstract: Federated learning enables multiple distributed devices to collaboratively learn a shared prediction model without centralizing their on-device data. Most of the current algorithms require comparable individual efforts for local training with the same structure and size of on-device models, which, however, impedes participation from resource-constrained devices. Given the widespread yet heterogene… ▽ More

    Submitted 5 April, 2022; v1 submitted 8 September, 2021; originally announced September 2021.

    Comments: This paper has been accepted to ICDCS 2022

  18. arXiv:2106.10056  [pdf

    cs.LG stat.ML

    A Vertical Federated Learning Framework for Horizontally Partitioned Labels

    Authors: Wensheng Xia, Ying Li, Lan Zhang, Zhonghai Wu, Xiaoyong Yuan

    Abstract: Vertical federated learning is a collaborative machine learning framework to train deep leaning models on vertically partitioned data with privacy-preservation. It attracts much attention both from academia and industry. Unfortunately, applying most existing vertical federated learning methods in real-world applications still faces two daunting challenges. First, most existing vertical federated l… ▽ More

    Submitted 18 June, 2021; originally announced June 2021.

    Comments: 10 pages, 6 figures

  19. arXiv:2102.10749  [pdf, other

    cs.IT cs.LG cs.NI eess.SP stat.ML

    CSIT-Free Model Aggregation for Federated Edge Learning via Reconfigurable Intelligent Surface

    Authors: Hang Liu, Xiaojun Yuan, Ying-Jun Angela Zhang

    Abstract: We study over-the-air model aggregation in federated edge learning (FEEL) systems, where channel state information at the transmitters (CSIT) is assumed to be unavailable. We leverage the reconfigurable intelligent surface (RIS) technology to align the cascaded channel coefficients for CSIT-free model aggregation. To this end, we jointly optimize the RIS and the receiver by minimizing the aggregat… ▽ More

    Submitted 25 July, 2021; v1 submitted 21 February, 2021; originally announced February 2021.

    Comments: This work has been submitted to the IEEE for possible publication

  20. arXiv:2012.09265  [pdf, other

    quant-ph cs.LG stat.ML

    Variational Quantum Algorithms

    Authors: M. Cerezo, Andrew Arrasmith, Ryan Babbush, Simon C. Benjamin, Suguru Endo, Keisuke Fujii, Jarrod R. McClean, Kosuke Mitarai, Xiao Yuan, Lukasz Cincio, Patrick J. Coles

    Abstract: Applications such as simulating complicated quantum systems or solving large-scale linear algebra problems are very challenging for classical computers due to the extremely high computational cost. Quantum computers promise a solution, although fault-tolerant quantum computers will likely not be available in the near future. Current quantum devices have serious constraints, including limited numbe… ▽ More

    Submitted 4 October, 2021; v1 submitted 16 December, 2020; originally announced December 2020.

    Comments: Review Article. 33 pages, 7 figures. Updated to published version

    Report number: LA-UR-20-30142

    Journal ref: Nature Reviews Physics 3, 625-644 (2021)

  21. arXiv:2009.09835  [pdf, other

    cs.LG math.NA math.OC stat.ML

    Hybrid Stochastic-Deterministic Minibatch Proximal Gradient: Less-Than-Single-Pass Optimization with Nearly Optimal Generalization

    Authors: Pan Zhou, Xiaotong Yuan

    Abstract: Stochastic variance-reduced gradient (SVRG) algorithms have been shown to work favorably in solving large-scale learning problems. Despite the remarkable success, the stochastic gradient complexity of SVRG-type algorithms usually scales linearly with data size and thus could still be expensive for huge data. To address this deficiency, we propose a hybrid stochastic-deterministic minibatch proxima… ▽ More

    Submitted 17 September, 2020; originally announced September 2020.

  22. arXiv:2008.02320  [pdf, other

    eess.IV cs.LG stat.ML

    Machine learning for faster and smarter fluorescence lifetime imaging microscopy

    Authors: Varun Mannam, Yide Zhang, Xiaotong Yuan, Cara Ravasio, Scott S. Howard

    Abstract: Fluorescence lifetime imaging microscopy (FLIM) is a powerful technique in biomedical research that uses the fluorophore decay rate to provide additional contrast in fluorescence microscopy. However, at present, the calculation, analysis, and interpretation of FLIM is a complex, slow, and computationally expensive process. Machine learning (ML) techniques are well suited to extract and interpret m… ▽ More

    Submitted 5 August, 2020; originally announced August 2020.

    Report number: 042005

  23. arXiv:2007.15353  [pdf, other

    cs.LG stat.ML

    Growing Efficient Deep Networks by Structured Continuous Sparsification

    Authors: Xin Yuan, Pedro Savarese, Michael Maire

    Abstract: We develop an approach to growing deep network architectures over the course of training, driven by a principled combination of accuracy and sparsity objectives. Unlike existing pruning or architecture search techniques that operate on full-sized models or supernet architectures, our method can start from a small, simple seed architecture and dynamically grow and prune both layers and filters. By… ▽ More

    Submitted 5 June, 2023; v1 submitted 30 July, 2020; originally announced July 2020.

    Comments: Published as a conference paper at ICLR 2021

  24. arXiv:2007.03219  [pdf, other

    cs.LG stat.ML

    Meta-Learning with Network Pruning

    Authors: Hongduan Tian, Bo Liu, Xiao-Tong Yuan, Qingshan Liu

    Abstract: Meta-learning is a powerful paradigm for few-shot learning. Although with remarkable success witnessed in many applications, the existing optimization based meta-learning models with over-parameterized neural networks have been evidenced to ovetfit on training tasks. To remedy this deficiency, we propose a network pruning based meta-learning approach for overfitting reduction via explicitly contro… ▽ More

    Submitted 22 July, 2020; v1 submitted 7 July, 2020; originally announced July 2020.

  25. arXiv:2006.15419  [pdf, ps, other

    stat.ML cs.LG cs.MS

    The flare Package for High Dimensional Linear Regression and Precision Matrix Estimation in R

    Authors: Xingguo Li, Tuo Zhao, Xiaoming Yuan, Han Liu

    Abstract: This paper describes an R package named flare, which implements a family of new high dimensional regression methods (LAD Lasso, SQRT Lasso, $\ell_q$ Lasso, and Dantzig selector) and their extensions to sparse precision matrix estimation (TIGER and CLIME). These methods exploit different nonsmooth loss functions to gain modeling flexibility, estimation robustness, and tuning insensitiveness. The de… ▽ More

    Submitted 27 June, 2020; originally announced June 2020.

    Journal ref: Journal of Machine Learning Research 16 (2015) 553-557

  26. arXiv:2006.13463  [pdf, other

    cs.LG cs.AI stat.ML

    Graph Policy Network for Transferable Active Learning on Graphs

    Authors: Shengding Hu, Zheng Xiong, Meng Qu, Xingdi Yuan, Marc-Alexandre Côté, Zhiyuan Liu, Jian Tang

    Abstract: Graph neural networks (GNNs) have been attracting increasing popularity due to their simplicity and effectiveness in a variety of fields. However, a large number of labeled data is generally required to train these networks, which could be very expensive to obtain in some domains. In this paper, we study active learning for GNNs, i.e., how to efficiently label the nodes on a graph to reduce the an… ▽ More

    Submitted 23 October, 2020; v1 submitted 24 June, 2020; originally announced June 2020.

    ACM Class: I.2

  27. arXiv:2006.04045  [pdf, other

    cs.LG cs.CV math.DS math.OC stat.ML

    A Generic First-Order Algorithmic Framework for Bi-Level Programming Beyond Lower-Level Singleton

    Authors: Risheng Liu, Pan Mu, Xiaoming Yuan, Shangzhi Zeng, Jin Zhang

    Abstract: In recent years, a variety of gradient-based first-order methods have been developed to solve bi-level optimization problems for learning applications. However, theoretical guarantees of these existing approaches heavily rely on the simplification that for each fixed upper-level variable, the lower-level solution must be a singleton (a.k.a., Lower-Level Singleton, LLS). In this work, we first desi… ▽ More

    Submitted 2 July, 2020; v1 submitted 7 June, 2020; originally announced June 2020.

    Comments: Accepted at ICML 2020

  28. arXiv:2004.08861  [pdf, other

    cs.LG cs.NE stat.ML

    Role-Wise Data Augmentation for Knowledge Distillation

    Authors: Jie Fu, Xue Geng, Zhijian Duan, Bohan Zhuang, Xingdi Yuan, Adam Trischler, Jie Lin, Chris Pal, Hao Dong

    Abstract: Knowledge Distillation (KD) is a common method for transferring the ``knowledge'' learned by one machine learning model (the \textit{teacher}) into another model (the \textit{student}), where typically, the teacher has a greater capacity (e.g., more parameters or higher bit-widths). To our knowledge, existing methods overlook the fact that although the student absorbs extra knowledge from the teac… ▽ More

    Submitted 19 April, 2020; originally announced April 2020.

  29. arXiv:2004.02278  [pdf

    q-bio.PE physics.soc-ph stat.CO

    The Framework for the Prediction of the Critical Turning Period for Outbreak of COVID-19 Spread in China based on the iSEIR Model

    Authors: George Xianzhi Yuan, Lan Di, Yudi Gu, Guoqi Qian, Xiaosong Qian

    Abstract: The goal of this study is to establish a general framework for predicting the so-called critical Turning Period in an infectious disease epidemic such as the COVID-19 outbreak in China early this year. This framework enabled a timely prediction of the turning period when applied to Wuhan COVID-19 epidemic and informed the relevant authority for taking appropriate and timely actions to control the… ▽ More

    Submitted 5 April, 2020; originally announced April 2020.

    Comments: 24 paages, 9 figures, 10 tables

    MSC Class: 34F05; 35r30; 62j02; 62F10; 65N21; 92D30; 92B05; 92d25; 93c30

    Journal ref: Journal of Systems Science and Information, Vol.10, No.4: 309 - 337, (2022)

  30. arXiv:1908.10449  [pdf, other

    cs.CL cs.LG stat.ML

    Interactive Machine Comprehension with Information Seeking Agents

    Authors: Xingdi Yuan, Jie Fu, Marc-Alexandre Cote, Yi Tay, Christopher Pal, Adam Trischler

    Abstract: Existing machine reading comprehension (MRC) models do not scale effectively to real-world applications like web-level information retrieval and question answering (QA). We argue that this stems from the nature of MRC datasets: most of these are static environments wherein the supporting documents and all necessary information are fully observed. In this paper, we propose a simple method that refr… ▽ More

    Submitted 16 April, 2020; v1 submitted 27 August, 2019; originally announced August 2019.

    Comments: ACL2020

  31. arXiv:1908.02246  [pdf, ps, other

    stat.ML cs.LG stat.CO

    On Convergence of Distributed Approximate Newton Methods: Globalization, Sharper Bounds and Beyond

    Authors: Xiao-Tong Yuan, Ping Li

    Abstract: The DANE algorithm is an approximate Newton method popularly used for communication-efficient distributed machine learning. Reasons for the interest in DANE include scalability and versatility. Convergence of DANE, however, can be tricky; its appealing convergence rate is only rigorous for quadratic objective, and for more general convex functions the known results are no stronger than those of th… ▽ More

    Submitted 6 August, 2019; originally announced August 2019.

  32. arXiv:1904.11316  [pdf, ps, other

    cs.LG math.ST stat.ML

    Stability and Optimization Error of Stochastic Gradient Descent for Pairwise Learning

    Authors: Wei Shen, Zhenhuan Yang, Yiming Ying, Xiaoming Yuan

    Abstract: In this paper we study the stability and its trade-off with optimization error for stochastic gradient descent (SGD) algorithms in the pairwise learning setting. Pairwise learning refers to a learning task which involves a loss function depending on pairs of instances among which notable examples are bipartite ranking, metric learning, area under ROC (AUC) maximization and minimum error entropy (M… ▽ More

    Submitted 26 April, 2019; v1 submitted 25 April, 2019; originally announced April 2019.

    Comments: 35 pages

  33. arXiv:1904.05100  [pdf, other

    cs.LG stat.ML

    Knowledge Squeezed Adversarial Network Compression

    Authors: Shu Changyong, Li Peng, Xie Yuan, Qu Yanyun, Dai Longquan, Ma Lizhuang

    Abstract: Deep network compression has been achieved notable progress via knowledge distillation, where a teacher-student learning manner is adopted by using predetermined loss. Recently, more focuses have been transferred to employ the adversarial training to minimize the discrepancy between distributions of output from two networks. However, they always emphasize on result-oriented learning while neglecti… ▽ More

    Submitted 25 April, 2019; v1 submitted 10 April, 2019; originally announced April 2019.

  34. arXiv:1812.03271  [pdf, other

    cs.LG stat.ML

    Generalized Batch Normalization: Towards Accelerating Deep Neural Networks

    Authors: Xiaoyong Yuan, Zheng Feng, Matthew Norton, Xiaolin Li

    Abstract: Utilizing recently introduced concepts from statistics and quantitative risk management, we present a general variant of Batch Normalization (BN) that offers accelerated convergence of Neural Network training compared to conventional BN. In general, we show that mean and standard deviation are not always the most appropriate choice for the centering and scaling procedure within the BN transformati… ▽ More

    Submitted 8 December, 2018; originally announced December 2018.

    Comments: accepted at AAAI-19

  35. arXiv:1812.00855  [pdf, other

    cs.LG cs.CL stat.ML

    Towards Solving Text-based Games by Producing Adaptive Action Spaces

    Authors: Ruo Yu Tao, Marc-Alexandre Côté, Xingdi Yuan, Layla El Asri

    Abstract: To solve a text-based game, an agent needs to formulate valid text commands for a given context and find the ones that lead to success. Recent attempts at solving text-based games with deep reinforcement learning have focused on the latter, i.e., learning to act optimally when valid actions are known in advance. In this work, we propose to tackle the first task and train a model that generates the… ▽ More

    Submitted 3 December, 2018; originally announced December 2018.

  36. arXiv:1806.11532  [pdf, other

    cs.LG cs.CL stat.ML

    TextWorld: A Learning Environment for Text-based Games

    Authors: Marc-Alexandre Côté, Ákos Kádár, Xingdi Yuan, Ben Kybartas, Tavian Barnes, Emery Fine, James Moore, Ruo Yu Tao, Matthew Hausknecht, Layla El Asri, Mahmoud Adada, Wendy Tay, Adam Trischler

    Abstract: We introduce TextWorld, a sandbox learning environment for the training and evaluation of RL agents on text-based games. TextWorld is a Python library that handles interactive play-through of text games, as well as backend functions like state tracking and reward assignment. It comes with a curated list of games whose features and challenges we have analyzed. More significantly, it enables users t… ▽ More

    Submitted 8 November, 2019; v1 submitted 29 June, 2018; originally announced June 2018.

    Comments: Presented at the Computer Games Workshop at IJCAI 2018, Stockholm

  37. arXiv:1803.06795  [pdf, other

    cs.CV stat.ML

    Nonlocal Low-Rank Tensor Factor Analysis for Image Restoration

    Authors: Xinyuan Zhang, Xin Yuan, Lawrence Carin

    Abstract: Low-rank signal modeling has been widely leveraged to capture non-local correlation in image processing applications. We propose a new method that employs low-rank tensor factor analysis for tensors generated by grouped image patches. The low-rank tensors are fed into the alternative direction multiplier method (ADMM) to further improve image reconstruction. The motivating application is compressi… ▽ More

    Submitted 18 March, 2018; originally announced March 2018.

  38. arXiv:1712.09926  [pdf, other

    cs.LG cs.NE stat.ML

    Rapid Adaptation with Conditionally Shifted Neurons

    Authors: Tsendsuren Munkhdalai, Xingdi Yuan, Soroush Mehri, Adam Trischler

    Abstract: We describe a mechanism by which artificial neural networks can learn rapid adaptation - the ability to adapt on the fly, with little data, to new tasks - that we call conditionally shifted neurons. We apply this mechanism in the framework of metalearning, where the aim is to replicate some of the flexibility of human learning in machines. Conditionally shifted neurons modify their activation valu… ▽ More

    Submitted 3 July, 2018; v1 submitted 28 December, 2017; originally announced December 2017.

    Comments: ICML 2018; Added: additional ablation and speed comparison with MetaNet

  39. arXiv:1712.07107  [pdf, other

    cs.LG cs.CR cs.CV stat.ML

    Adversarial Examples: Attacks and Defenses for Deep Learning

    Authors: Xiaoyong Yuan, Pan He, Qile Zhu, Xiaolin Li

    Abstract: With rapid progress and significant successes in a wide spectrum of applications, deep learning is being applied in many safety-critical environments. However, deep neural networks have been recently found vulnerable to well-designed input samples, called adversarial examples. Adversarial examples are imperceptible to human but can easily fool deep neural networks in the testing/deploying stage. T… ▽ More

    Submitted 6 July, 2018; v1 submitted 19 December, 2017; originally announced December 2017.

    Comments: Github: https://github.com/chbrian/awesome-adversarial-examples-dl

  40. arXiv:1712.01145  [pdf, other

    cs.CR cs.LG stat.ML

    Learning Fast and Slow: PROPEDEUTICA for Real-time Malware Detection

    Authors: Ruimin Sun, Xiaoyong Yuan, Pan He, Qile Zhu, Aokun Chen, Andre Gregio, Daniela Oliveira, Xiaolin Li

    Abstract: Existing malware detectors on safety-critical devices have difficulties in runtime detection due to the performance overhead. In this paper, we introduce PROPEDEUTICA, a framework for efficient and effective real-time malware detection, leveraging the best of conventional machine learning (ML) and deep learning (DL) techniques. In PROPEDEUTICA, all software start execution are considered as benign… ▽ More

    Submitted 17 October, 2021; v1 submitted 4 December, 2017; originally announced December 2017.

    Comments: 12 pages, 4 figures. This paper has been accepted to IEEE Transactions on Neural Networks and Learning Systems (TNNLS)

  41. arXiv:1703.01866  [pdf, other

    stat.ME

    Weighted empirical likelihood for quantile regression with nonignorable missing covariates

    Authors: Xiaohui Yuan, Xiaogang Dong

    Abstract: In this paper, we propose an empirical likelihood-based weighted estimator of regression parameter in quantile regression model with nonignorable missing covariates. The proposed estimator is computationally simple and achieves semiparametric efficiency if the probability of missingness on the fully observed variables is correctly specified. The efficiency gain of the proposed estimator over the c… ▽ More

    Submitted 7 October, 2017; v1 submitted 6 March, 2017; originally announced March 2017.

    Comments: 20 pages,2 figures

  42. arXiv:1703.00119  [pdf, other

    cs.LG stat.ML

    Dual Iterative Hard Thresholding: From Non-convex Sparse Minimization to Non-smooth Concave Maximization

    Authors: Bo Liu, Xiao-Tong Yuan, Lezi Wang, Qingshan Liu, Dimitris N. Metaxas

    Abstract: Iterative Hard Thresholding (IHT) is a class of projected gradient descent methods for optimizing sparsity-constrained minimization models, with the best known efficiency and scalability in practice. As far as we know, the existing IHT-style methods are designed for sparse minimization in primal form. It remains open to explore duality theory and algorithms in such a non-convex and NP-hard problem… ▽ More

    Submitted 20 June, 2017; v1 submitted 28 February, 2017; originally announced March 2017.

  43. arXiv:1701.03006  [pdf, other

    stat.ML cs.LG

    Compressive Sensing via Convolutional Factor Analysis

    Authors: Xin Yuan, Yunchen Pu, Lawrence Carin

    Abstract: We solve the compressive sensing problem via convolutional factor analysis, where the convolutional dictionaries are learned {\em in situ} from the compressed measurements. An alternating direction method of multipliers (ADMM) paradigm for compressive sensing inversion based on convolutional factor analysis is developed. The proposed algorithm provides reconstructed images as well as features, whi… ▽ More

    Submitted 11 January, 2017; originally announced January 2017.

    Comments: 17 pages, 6 figures

  44. arXiv:1612.00922  [pdf, ps, other

    stat.ME

    An efficient and doubly robust empirical likelihood approach for estimating equations with missing data

    Authors: Tianqing Liu, Xiaohui Yuan, Zhaohai Li, Aiyi Liu

    Abstract: This paper considers an empirical likelihood inference for parameters defined by general estimating equations, when data are missing at random. The efficiency of existing estimators depends critically on correctly specifying the conditional expectation of the estimating function given the observed components of the random observations. When the conditional expectation is not correctly specified, t… ▽ More

    Submitted 2 December, 2016; originally announced December 2016.

    Comments: 31 pages,0 figures,7 tables

  45. arXiv:1609.08976  [pdf, other

    stat.ML cs.LG

    Variational Autoencoder for Deep Learning of Images, Labels and Captions

    Authors: Yunchen Pu, Zhe Gan, Ricardo Henao, Xin Yuan, Chunyuan Li, Andrew Stevens, Lawrence Carin

    Abstract: A novel variational autoencoder is developed to model images, as well as associated labels or captions. The Deep Generative Deconvolutional Network (DGDN) is used as a decoder of the latent image features, and a deep Convolutional Neural Network (CNN) is used as an image encoder; the CNN is used to approximate a distribution for the latent DGDN features/code. The latent code is also linked to gene… ▽ More

    Submitted 28 September, 2016; originally announced September 2016.

    Comments: NIPS 2016 (To appear)

  46. arXiv:1512.07344  [pdf, other

    cs.CV cs.LG stat.ML

    A Deep Generative Deconvolutional Image Model

    Authors: Yunchen Pu, Xin Yuan, Andrew Stevens, Chunyuan Li, Lawrence Carin

    Abstract: A deep generative model is developed for representation and analysis of images, based on a hierarchical convolutional dictionary-learning framework. Stochastic {\em unpooling} is employed to link consecutive layers in the model, yielding top-down image generation. A Bayesian support vector machine is linked to the top-layer features, yielding max-margin discrimination. Deep deconvolutional inferen… ▽ More

    Submitted 22 December, 2015; originally announced December 2015.

    Comments: 10 pages, 7 figures. Appearing in Proceedings of the 19th International Conference on Artificial Intelligence and Statistics (AISTATS) 2016, Cadiz, Spain. JMLR: W&CP volume 41

  47. arXiv:1509.06253  [pdf, other

    cs.IT stat.AP

    Convergence of the Generalized Alternating Projection Algorithm for Compressive Sensing

    Authors: Xin Yuan, Hong Jiang, Paul Wilford

    Abstract: The convergence of the generalized alternating projection (GAP) algorithm is studied in this paper to solve the compressive sensing problem $\yv = \Amat \xv + \epsilonv$. By assuming that $\Amat\Amat\ts$ is invertible, we prove that GAP converges linearly within a certain range of step-size when the sensing matrix $\Amat$ satisfies restricted isometry property (RIP) condition of $δ_{2K}$, where… ▽ More

    Submitted 4 September, 2015; originally announced September 2015.

    Comments: 12 pages, 11 figures

  48. arXiv:1508.06901  [pdf, other

    stat.ML cs.LG stat.AP

    Compressive Sensing via Low-Rank Gaussian Mixture Models

    Authors: Xin Yuan, Hong Jiang, Gang Huang, Paul A. Wilford

    Abstract: We develop a new compressive sensing (CS) inversion algorithm by utilizing the Gaussian mixture model (GMM). While the compressive sensing is performed globally on the entire image as implemented in our lensless camera, a low-rank GMM is imposed on the local image patches. This low-rank GMM is derived via eigenvalue thresholding of the GMM trained on the projection of the measurement data, thus le… ▽ More

    Submitted 27 August, 2015; originally announced August 2015.

    Comments: 12 pages, 8 figures

  49. arXiv:1508.03498  [pdf, other

    cs.CV stat.AP stat.ME

    Lensless Compressive Imaging

    Authors: Xin Yuan, Hong Jiang, Gang Huang, Paul Wilford

    Abstract: We develop a lensless compressive imaging architecture, which consists of an aperture assembly and a single sensor, without using any lens. An anytime algorithm is proposed to reconstruct images from the compressive measurements; the algorithm produces a sequence of solutions that monotonically converge to the true signal (thus, anytime). The algorithm is developed based on the sparsity of local o… ▽ More

    Submitted 14 August, 2015; originally announced August 2015.

    Comments: 37 pages, 10 figures. Submitted to SIAM Journal on Imaging Science

  50. arXiv:1504.07468  [pdf, other

    stat.ML

    Non-Gaussian Discriminative Factor Models via the Max-Margin Rank-Likelihood

    Authors: Xin Yuan, Ricardo Henao, Ephraim L. Tsalik, Raymond J. Langley, Lawrence Carin

    Abstract: We consider the problem of discriminative factor analysis for data that are in general non-Gaussian. A Bayesian model based on the ranks of the data is proposed. We first introduce a new {\em max-margin} version of the rank-likelihood. A discriminative factor model is then developed, integrating the max-margin rank-likelihood and (linear) Bayesian support vector machines, which are also built on t… ▽ More

    Submitted 19 May, 2015; v1 submitted 28 April, 2015; originally announced April 2015.

    Comments: 14 pages, 7 figures, ICML 2015