Showing 1–50 of 56 results for author: Zheng, X

  1. arXiv:2506.18994  [pdf]

    stat.ME stat.ML

    Causal Decomposition Analysis with Synergistic Interventions: A Triply-Robust Machine Learning Approach to Addressing Multiple Dimensions of Social Disparities

    Authors: Soojin Park, Su Yeon Kim, Xinyao Zheng, Chioun Lee

    Abstract: Educational disparities are rooted in and perpetuate social inequalities across multiple dimensions such as race, socioeconomic status, and geography. To reduce disparities, most intervention strategies focus on a single domain and frequently evaluate their effectiveness by using causal decomposition analysis. However, a growing body of research suggests that single-domain interventions may be ins…

    Submitted 23 June, 2025; originally announced June 2025.

    Comments: 41 pages

  2. arXiv:2505.14725  [pdf, ps, other]

    q-bio.GN cs.LG stat.AP

    HR-VILAGE-3K3M: A Human Respiratory Viral Immunization Longitudinal Gene Expression Dataset for Systems Immunity

    Authors: Xuejun Sun, Yiran Song, Xiaochen Zhou, Ruilie Cai, Yu Zhang, Xinyi Li, Rui Peng, Jialiu Xie, Yuanyuan Yan, Muyao Tang, Prem Lakshmanane, Baiming Zou, James S. Hagood, Raymond J. Pickles, Didong Li, Fei Zou, Xiaojing Zheng

    Abstract: Respiratory viral infections pose a global health burden, yet the cellular immune responses driving protection or pathology remain unclear. Natural infection cohorts often lack pre-exposure baseline data and structured temporal sampling. In contrast, inoculation and vaccination trials generate insightful longitudinal transcriptomic data. However, the scattering of these datasets across platforms,…

    Submitted 19 May, 2025; originally announced May 2025.

  3. arXiv:2504.09783  [pdf, other]

    stat.ME stat.CO

    BLAST: Bayesian online change-point detection with structured image data

    Authors: Xiaojun Zheng, Simon Mak

    Abstract: The prompt online detection of abrupt changes in image data is essential for timely decision-making in broad applications, from video surveillance to manufacturing quality control. Existing methods, however, face three key challenges. First, the high-dimensional nature of image data introduces computational bottlenecks for efficient real-time monitoring. Second, changes often involve structural im…

    Submitted 13 April, 2025; originally announced April 2025.

  4. arXiv:2504.05250  [pdf, ps, other]

    cs.LG stat.ML

    PEAKS: Selecting Key Training Examples Incrementally via Prediction Error Anchored by Kernel Similarity

    Authors: Mustafa Burak Gurbuz, Xingyu Zheng, Constantine Dovrolis

    Abstract: As deep learning continues to be driven by ever-larger datasets, understanding which examples are most important for generalization has become a critical question. While progress in data selection continues, emerging applications require studying this problem in dynamic contexts. To bridge this gap, we pose the Incremental Data Selection (IDS) problem, where examples arrive as a continuous stream,…

    Submitted 30 June, 2025; v1 submitted 7 April, 2025; originally announced April 2025.

  5. arXiv:2411.18008  [pdf, other]

    cs.LG cs.AI stat.ME stat.ML

    Causal and Local Correlations Based Network for Multivariate Time Series Classification

    Authors: Mingsen Du, Yanxuan Wei, Xiangwei Zheng, Cun Ji

    Abstract: Recently, time series classification has attracted the attention of a large number of researchers, and hundreds of methods have been proposed. However, these methods often ignore the spatial correlations among dimensions and the local correlations among features. To address this issue, the causal and local correlations based network (CaLoNet) is proposed in this study for multivariate time series…

    Submitted 26 November, 2024; originally announced November 2024.

    Comments: Submitted on April 03, 2023; major revisions on March 25, 2024; minor revisions on July 9, 2024

  6. arXiv:2411.06881  [pdf, other]

    cs.LG stat.ML

    WassFFed: Wasserstein Fair Federated Learning

    Authors: Zhongxuan Han, Li Zhang, Chaochao Chen, Xiaolin Zheng, Fei Zheng, Yuyuan Li, Jianwei Yin

    Abstract: Federated Learning (FL) employs a training approach to address scenarios where users' data cannot be shared across clients. Achieving fairness in FL is imperative since training data in FL is inherently geographically distributed among diverse user groups. Existing research on fairness predominantly assumes access to the entire training data, making direct transfer to FL challenging. However, the…

    Submitted 11 November, 2024; originally announced November 2024.

    Comments: Submitted to TKDE

  7. arXiv:2410.00068  [pdf]

    eess.IV cs.LG stat.AP

    Denoising VAE as an Explainable Feature Reduction and Diagnostic Pipeline for Autism Based on Resting state fMRI

    Authors: Xinyuan Zheng, Orren Ravid, Robert A. J. Barry, Yoojean Kim, Qian Wang, Young-geun Kim, Xi Zhu, Xiaofu He

    Abstract: Autism spectrum disorders (ASDs) are developmental conditions characterized by restricted interests and difficulties in communication. The complexity of ASD has resulted in a deficiency of objective diagnostic biomarkers. Deep learning methods have gained recognition for addressing these challenges in neuroimaging analysis, but finding and interpreting such diagnostic biomarkers are still challeng…

    Submitted 27 March, 2025; v1 submitted 30 September, 2024; originally announced October 2024.

    ACM Class: J.3; I.4.9; I.4.10

  8. arXiv:2408.09941  [pdf, ps, other]

    stat.ML math.PR math.ST

    Predicting path-dependent processes by deep learning

    Authors: Xudong Zheng, Yuecai Han

    Abstract: In this paper, we investigate a deep learning method for predicting path-dependent processes based on discretely observed historical information. This method is implemented by considering the prediction as a nonparametric regression and obtaining the regression function through simulated samples and deep neural networks. When applying this method to fractional Brownian motion and the solutions of…

    Submitted 19 August, 2024; originally announced August 2024.

  9. arXiv:2407.03774  [pdf, other]

    stat.ME

    Mixture Modeling for Temporal Point Processes with Memory

    Authors: Xiaotian Zheng, Athanasios Kottas, Bruno Sansó

    Abstract: We propose a constructive approach to building temporal point processes that incorporate dependence on their history. The dependence is modeled through the conditional density of the duration, i.e., the interval between successive event times, using a mixture of first-order conditional densities for each one of a specific number of lagged durations. Such a formulation for the conditional duration…

    Submitted 4 July, 2024; originally announced July 2024.

  10. arXiv:2404.19242  [pdf, other]

    cs.CV eess.IV stat.ME

    A Minimal Set of Parameters Based Depth-Dependent Distortion Model and Its Calibration Method for Stereo Vision Systems

    Authors: Xin Ma, Puchen Zhu, Xiao Li, Xiaoyin Zheng, Jianshu Zhou, Xuchen Wang, Kwok Wai Samuel Au

    Abstract: Depth position highly affects lens distortion, especially in close-range photography, which limits the measurement accuracy of existing stereo vision systems. Moreover, traditional depth-dependent distortion models and their calibration methods have remained complicated. In this work, we propose a minimal set of parameters based depth-dependent distortion model (MDM), which considers the radial an…

    Submitted 1 May, 2024; v1 submitted 29 April, 2024; originally announced April 2024.

    Comments: This paper has been accepted for publication in IEEE Transactions on Instrumentation and Measurement

  11. arXiv:2404.01466  [pdf, other]

    cs.LG stat.ME

    TS-CausalNN: Learning Temporal Causal Relations from Non-linear Non-stationary Time Series Data

    Authors: Omar Faruque, Sahara Ali, Xue Zheng, Jianwu Wang

    Abstract: The growing availability and importance of time series data across various domains, including environmental science, epidemiology, and economics, has led to an increasing need for time-series causal discovery methods that can identify the intricate relationships in the non-stationary, non-linear, and often noisy real world data. However, the majority of current time series causal discovery methods…

    Submitted 1 April, 2024; originally announced April 2024.

  12. arXiv:2310.19787  [pdf]

    stat.ME stat.AP stat.ML

    $e^{\text{RPCA}}$: Robust Principal Component Analysis for Exponential Family Distributions

    Authors: Xiaojun Zheng, Simon Mak, Liyan Xie, Yao Xie

    Abstract: Robust Principal Component Analysis (RPCA) is a widely used method for recovering low-rank structure from data matrices corrupted by significant and sparse outliers. These corruptions may arise from occlusions, malicious tampering, or other causes for anomalies, and the joint identification of such corruptions with low-rank background is critical for process monitoring and diagnosis. However, exis…

    Submitted 30 October, 2023; originally announced October 2023.

  13. arXiv:2310.07187  [pdf, other]

    stat.ML cs.LG

    Kernel Cox partially linear regression: building predictive models for cancer patients' survival

    Authors: Yaohua Rong, Sihai Dave Zhao, Xia Zheng, Yi Li

    Abstract: Wide heterogeneity exists in cancer patients' survival, ranging from a few months to several decades. To accurately predict clinical outcomes, it is vital to build an accurate predictive model that relates patients' molecular profiles with patients' survival. With complex relationships between survival and high-dimensional molecular predictors, it is challenging to conduct non-parametric modeling…

    Submitted 11 October, 2023; originally announced October 2023.

  14. arXiv:2207.12867  [pdf, other]

    stat.ME stat.AP

    A New Causal Decomposition Paradigm towards Health Equity

    Authors: Xinwei Sun, Xiangyu Zheng, Jim Weinstein

    Abstract: Causal decomposition has provided a powerful tool to analyze health disparity problems, by assessing the proportion of disparity caused by each mediator. However, most of these methods lack policy implications, as they fail to account for all sources of disparities caused by the mediator. Besides, their estimations pre-specified some covariates set (a.k.a., admissible set) for…

    Submitted 20 February, 2023; v1 submitted 24 July, 2022; originally announced July 2022.

  15. arXiv:2203.04246  [pdf, other]

    stat.ME math.AT stat.AP stat.ML

    PERCEPT: a new online change-point detection method using topological data analysis

    Authors: Xiaojun Zheng, Simon Mak, Liyan Xie, Yao Xie

    Abstract: Topological data analysis (TDA) provides a set of data analysis tools for extracting embedded topological structures from complex high-dimensional datasets. In recent years, TDA has been a rapidly growing field which has found success in a wide range of applications, including signal processing, neuroscience and network analysis. In these applications, the online detection of changes is of crucial…

    Submitted 8 March, 2022; originally announced March 2022.

  16. arXiv:2111.01840  [pdf, ps, other]

    stat.ME

    Bayesian Geostatistical Modeling for Discrete-Valued Processes

    Authors: Xiaotian Zheng, Athanasios Kottas, Bruno Sansó

    Abstract: We introduce a flexible and scalable class of Bayesian geostatistical models for discrete data, based on the class of nearest neighbor mixture transition distribution processes (NNMP), referred to as discrete NNMP. The proposed class characterizes spatial variability by a weighted combination of first-order conditional probability mass functions (pmfs) for each one of a given number of neighbors.…

    Submitted 2 March, 2022; v1 submitted 2 November, 2021; originally announced November 2021.

  17. arXiv:2107.07736  [pdf, other]

    stat.ME

    Nearest-Neighbor Mixture Models for Non-Gaussian Spatial Processes

    Authors: Xiaotian Zheng, Athanasios Kottas, Bruno Sansó

    Abstract: We develop a class of nearest-neighbor mixture models that provide direct, computationally efficient, probabilistic modeling for non-Gaussian geospatial data. The class is defined over a directed acyclic graph, which implies conditional independence in representing a multivariate distribution through factorization into a product of univariate conditionals, and is extended to a full spatial process…

    Submitted 27 June, 2022; v1 submitted 16 July, 2021; originally announced July 2021.

  18. arXiv:2107.01876  [pdf, other]

    stat.ML cs.LG

    Which Invariance Should We Transfer? A Causal Minimax Learning Approach

    Authors: Mingzhou Liu, Xiangyu Zheng, Xinwei Sun, Fang Fang, Yizhou Wang

    Abstract: A major barrier to deploying current machine learning models lies in their non-reliability to dataset shifts. To resolve this problem, most existing studies attempted to transfer stable information to unseen environments. Particularly, independent causal mechanisms-based methods proposed to remove mutable causal mechanisms via the do-operator. Compared to previous methods, the obtained stable pred…

    Submitted 30 May, 2023; v1 submitted 5 July, 2021; originally announced July 2021.

    Comments: Accepted version of ICML-23

  19. arXiv:2103.00117  [pdf, other]

    stat.ME math.AT stat.OT

    Online High-Dimensional Change-Point Detection using Topological Data Analysis

    Authors: Xiaojun Zheng, Simon Mak, Yao Xie

    Abstract: Topological Data Analysis (TDA) is a rapidly growing field, which studies methods for learning underlying topological structures present in complex data representations. TDA methods have found recent success in extracting useful geometric structures for a wide range of applications, including protein classification, neuroscience, and time-series analysis. However, in many such applications, one is…

    Submitted 7 March, 2021; v1 submitted 26 February, 2021; originally announced March 2021.

  20. arXiv:2011.02203  [pdf, other]

    cs.LG stat.ML

    Latent Causal Invariant Model

    Authors: Xinwei Sun, Botong Wu, Xiangyu Zheng, Chang Liu, Wei Chen, Tao Qin, Tie-yan Liu

    Abstract: Current supervised learning can learn spurious correlation during the data-fitting process, imposing issues regarding interpretability, out-of-distribution (OOD) generalization, and robustness. To avoid spurious correlation, we propose a Latent Causal Invariance Model (LaCIM) which pursues causal prediction. Specifically, we introduce latent variables that are separated into (a) output-causative f…

    Submitted 27 April, 2021; v1 submitted 4 November, 2020; originally announced November 2020.

  21. On Construction and Estimation of Stationary Mixture Transition Distribution Models

    Authors: Xiaotian Zheng, Athanasios Kottas, Bruno Sansó

    Abstract: Mixture transition distribution time series models build high-order dependence through a weighted combination of first-order transition densities for each one of a specified number of lags. We present a framework to construct stationary transition mixture distribution models that extend beyond linear, Gaussian dynamics. We study conditions for first-order strict stationarity which allow for differ…

    Submitted 16 June, 2021; v1 submitted 23 October, 2020; originally announced October 2020.

    Journal ref: Journal of Computational and Graphical Statistics, 31, 283-293 (2022)

  22. arXiv:2009.13266  [pdf, other]

    cs.LG cs.NE stat.ML

    Disentangled Neural Architecture Search

    Authors: Xinyue Zheng, Peng Wang, Qigang Wang, Zhongchao Shi

    Abstract: Neural architecture search has shown its great potential in various areas recently. However, existing methods rely heavily on a black-box controller to search architectures, which suffers from the serious problem of lacking interpretability. In this paper, we propose disentangled neural architecture search (DNAS) which disentangles the hidden representation of the controller into semantically mean…

    Submitted 23 September, 2020; originally announced September 2020.

  23. arXiv:2008.01830  [pdf]

    stat.AP

    Tree Inference: Response Time in a Binary Multinomial Processing Tree, Representation and Uniqueness of Parameters

    Authors: Richard Schweickert, Xiaofang Zheng

    Abstract: A Multinomial Processing Tree (MPT) is a directed tree with a probability associated with each arc. Here we consider an additional parameter associated with each arc, a measure such as the time required to select the arc. MPTs are often used as models of tasks. Each vertex represents a process and an arc descending from a vertex represents selection of an outcome of the process. A source vertex re…

    Submitted 4 August, 2020; originally announced August 2020.

    Comments: 28 pages, 3 figures

  24. arXiv:2007.11202  [pdf, other]

    cs.LG stat.ML

    MathNet: Haar-Like Wavelet Multiresolution-Analysis for Graph Representation and Learning

    Authors: Xuebin Zheng, Bingxin Zhou, Ming Li, Yu Guang Wang, Junbin Gao

    Abstract: Graph Neural Networks (GNNs) have recently caught great attention and achieved significant progress in graph-level applications. In this paper, we propose a framework for graph neural networks with multiresolution Haar-like wavelets, or MathNet, with interrelated convolution and pooling strategies. The underlying method takes graphs in different structures as input and assembles consistent graph r…

    Submitted 24 January, 2021; v1 submitted 22 July, 2020; originally announced July 2020.

    Comments: 32 pages, 6 figures, 6 tables

    MSC Class: 68T07; 05C85; 42C40 ACM Class: I.2.4; I.2.6

  25. arXiv:2005.11903  [pdf, other]

    cs.LG cs.CR stat.ML

    Vertically Federated Graph Neural Network for Privacy-Preserving Node Classification

    Authors: Chaochao Chen, Jun Zhou, Longfei Zheng, Huiwen Wu, Lingjuan Lyu, Jia Wu, Bingzhe Wu, Ziqi Liu, Li Wang, Xiaolin Zheng

    Abstract: Recently, Graph Neural Network (GNN) has achieved remarkable progresses in various real-world tasks on graph data, consisting of node features and the adjacent information between different nodes. High-performance GNN models always depend on both rich features and complete edge information in graph. However, such information could possibly be isolated by different data holders in practice, which i…

    Submitted 24 April, 2022; v1 submitted 24 May, 2020; originally announced May 2020.

    Comments: Accepted by IJCAI'22

  26. arXiv:2005.03825  [pdf, other]

    eess.IV cs.LG stat.ML

    Learned Multi-layer Residual Sparsifying Transform Model for Low-dose CT Reconstruction

    Authors: Xikai Yang, Xuehang Zheng, Yong Long, Saiprasad Ravishankar

    Abstract: Signal models based on sparse representation have received considerable attention in recent years. Compared to synthesis dictionary learning, sparsifying transform learning involves highly efficient sparse coding and operator update steps. In this work, we propose a Multi-layer Residual Sparsifying Transform (MRST) learning model wherein the transform domain residuals are jointly sparsified over l…

    Submitted 7 May, 2020; originally announced May 2020.

  27. arXiv:2003.02834  [pdf, other]

    cs.CR cs.LG stat.ML

    Practical Privacy Preserving POI Recommendation

    Authors: Chaochao Chen, Jun Zhou, Bingzhe Wu, Wenjin Fang, Li Wang, Yuan Qi, Xiaolin Zheng

    Abstract: Point-of-Interest (POI) recommendation has been extensively studied and successfully applied in industry recently. However, most existing approaches build centralized models on the basis of collecting users' data. Both private data and models are held by the recommender, which causes serious privacy concerns. In this paper, we propose a novel Privacy preserving POI Recommendation (PriRec) framewor…

    Submitted 27 April, 2020; v1 submitted 5 March, 2020; originally announced March 2020.

    Comments: Accepted by ACM TIST

  28. arXiv:2003.02452  [pdf, other]

    cs.LG stat.ML

    Semi-supervised Learning Meets Factorization: Learning to Recommend with Chain Graph Model

    Authors: Chaochao Chen, Kevin C. Chang, Qibing Li, Xiaolin Zheng

    Abstract: Recently latent factor model (LFM) has been drawing much attention in recommender systems due to its good performance and scalability. However, existing LFMs predict missing values in a user-item rating matrix only based on the known ones, and thus the sparsity of the rating matrix always limits their performance. Meanwhile, semi-supervised learning (SSL) provides an effective way to alleviate the…

    Submitted 5 March, 2020; originally announced March 2020.

    Comments: Accepted by TKDD

  29. On the Trend-corrected Variant of Adaptive Stochastic Optimization Methods

    Authors: Bingxin Zhou, Xuebin Zheng, Junbin Gao

    Abstract: Adam-type optimizers, as a class of adaptive moment estimation methods with the exponential moving average scheme, have been successfully used in many applications of deep learning. Such methods are appealing due to the capability on large-scale sparse datasets with high computational efficiency. In this paper, we present a new framework for Adam-type methods with the trend information when updati…

    Submitted 15 December, 2020; v1 submitted 16 January, 2020; originally announced January 2020.

    Comments: 8 pages, 4 figures, 2 tables, IJCNN2020

    MSC Class: 68T07 ACM Class: I.2.0; I.2.6

  30. arXiv:1911.05309  [pdf, other]

    cs.LG q-fin.PM stat.ML

    Adaptive Portfolio by Solving Multi-armed Bandit via Thompson Sampling

    Authors: Mengying Zhu, Xiaolin Zheng, Yan Wang, Yuyuan Li, Qianqiao Liang

    Abstract: As the cornerstone of modern portfolio theory, Markowitz's mean-variance optimization is considered a major model adopted in portfolio management. However, due to the difficulty of estimating its parameters, it cannot be applied to all periods. In some cases, naive strategies such as Equally-weighted and Value-weighted portfolios can even get better performance. Under these circumstances, we can u…

    Submitted 14 November, 2019; v1 submitted 13 November, 2019; originally announced November 2019.

    Comments: conference

  31. arXiv:1909.13189  [pdf, other]

    stat.ML cs.LG stat.ME

    Learning Sparse Nonparametric DAGs

    Authors: Xun Zheng, Chen Dan, Bryon Aragam, Pradeep Ravikumar, Eric P. Xing

    Abstract: We develop a framework for learning sparse nonparametric directed acyclic graphs (DAGs) from data. Our approach is based on a recent algebraic characterization of DAGs that led to a fully continuous program for score-based learning of DAG models parametrized by a linear structural equation model (SEM). We extend this algebraic characterization to nonparametric SEM by leveraging nonparametric spars…

    Submitted 23 March, 2020; v1 submitted 28 September, 2019; originally announced September 2019.

    Comments: To appear in AISTATS 2020

  32. arXiv:1908.01287  [pdf, other]

    eess.IV cs.CV cs.LG stat.ML

    BCD-Net for Low-dose CT Reconstruction: Acceleration, Convergence, and Generalization

    Authors: Il Yong Chun, Xuehang Zheng, Yong Long, Jeffrey A. Fessler

    Abstract: Obtaining accurate and reliable images from low-dose computed tomography (CT) is challenging. Regression convolutional neural network (CNN) models that are learned from training data are increasingly gaining attention in low-dose CT reconstruction. This paper modifies the architecture of an iterative regression CNN, BCD-Net, for fast, stable, and accurate low-dose CT reconstruction, and presents t…

    Submitted 4 August, 2019; originally announced August 2019.

    Comments: Accepted to MICCAI 2019, and the authors indicated by asterisks (*) equally contributed to this work

  33. arXiv:1906.00165  [pdf, other]

    eess.IV cs.LG stat.ML

    Two-layer Residual Sparsifying Transform Learning for Image Reconstruction

    Authors: Xuehang Zheng, Saiprasad Ravishankar, Yong Long, Marc Louis Klasky, Brendt Wohlberg

    Abstract: Signal models based on sparsity, low-rank and other properties have been exploited for image reconstruction from limited and corrupted data in medical imaging and other computational imaging applications. In particular, sparsifying transform models have shown promise in various applications, and offer numerous advantages such as efficiencies in sparse coding and learning. This work investigates pr…

    Submitted 7 January, 2020; v1 submitted 1 June, 2019; originally announced June 2019.

    Comments: Accepted to IEEE ISBI 2020

  34. arXiv:1904.07404  [pdf, other]

    cs.LG cs.PL stat.ML

    swTVM: Towards Optimized Tensor Code Generation for Deep Learning on Sunway Many-Core Processor

    Authors: Mingzhen Li, Changxi Liu, Jianjin Liao, Xuegui Zheng, Hailong Yang, Rujun Sun, Jun Xu, Lin Gan, Guangwen Yang, Zhongzhi Luan, Depei Qian

    Abstract: The flourish of deep learning frameworks and hardware platforms has been demanding an efficient compiler that can shield the diversity in both software and hardware in order to provide application portability. Among the existing deep learning compilers, TVM is well known for its efficiency in code generation and optimization across diverse hardware devices. In the meanwhile, the Sunway many-core p…

    Submitted 11 July, 2022; v1 submitted 15 April, 2019; originally announced April 2019.

  35. arXiv:1901.04065  [pdf, other]

    cs.LG stat.ML

    Gradient Regularized Budgeted Boosting

    Authors: Zhixiang Eddie Xu, Matt J. Kusner, Kilian Q. Weinberger, Alice X. Zheng

    Abstract: As machine learning transitions increasingly towards real world applications controlling the test-time cost of algorithms becomes more and more crucial. Recent work, such as the Greedy Miser and Speedboost, incorporate test-time budget constraints into the training procedure and learn classifiers that provably stay within budget (in expectation). However, so far, these algorithms are limited to th…

    Submitted 26 January, 2019; v1 submitted 13 January, 2019; originally announced January 2019.

  36. arXiv:1901.04055  [pdf, other]

    cs.LG stat.ML

    Gradient Boosted Feature Selection

    Authors: Zhixiang Eddie Xu, Gao Huang, Kilian Q. Weinberger, Alice X. Zheng

    Abstract: A feature selection algorithm should ideally satisfy four conditions: reliably extract relevant features; be able to identify non-linear feature interactions; scale linearly with the number of features and dimensions; allow the incorporation of known sparsity structure. In this work we propose a novel feature selection algorithm, Gradient Boosted Feature Selection (GBFS), which satisfies all four…

    Submitted 13 January, 2019; originally announced January 2019.

  37. arXiv:1812.10644  [pdf, other]

    stat.ME

    Quantile Treatment Effects and Bootstrap Inference under Covariate-Adaptive Randomization

    Authors: Yichong Zhang, Xin Zheng

    Abstract: In this paper, we study the estimation and inference of the quantile treatment effect under covariate-adaptive randomization. We propose two estimation methods: (1) the simple quantile regression and (2) the inverse propensity score weighted quantile regression. For the two estimators, we derive their asymptotic distributions uniformly over a compact set of quantile indexes, and show that, when th…

    Submitted 24 February, 2020; v1 submitted 27 December, 2018; originally announced December 2018.

    Comments: 121 pages

  38. arXiv:1810.09078  [pdf]

    cs.SD cs.LG eess.AS stat.ML

    Our Practice Of Using Machine Learning To Recognize Species By Voice

    Authors: Siddhardha Balemarthy, Atul Sajjanhar, James Xi Zheng

    Abstract: As the technology is advancing, audio recognition in machine learning is improved as well. Research in audio recognition has traditionally focused on speech. Living creatures (especially the small ones) are part of the whole ecosystem, monitoring as well as maintaining them are important tasks. Species such as animals and birds are tending to change their activities as well as their habitats due t…

    Submitted 22 October, 2018; originally announced October 2018.

    Comments: 16 pages

  39. arXiv:1810.04754  [pdf, other]

    cs.LG stat.ML

    Efficient Tensor Decomposition with Boolean Factors

    Authors: Sung-En Chang, Xun Zheng, Ian E. H. Yen, Pradeep Ravikumar, Rose Yu

    Abstract: Tensor decomposition has been extensively used as a tool for exploratory analysis. Motivated by neuroscience applications, we study tensor decomposition with Boolean factors. The resulting optimization problem is challenging due to the non-convex objective and the combinatorial constraints. We propose Binary Matching Pursuit (BMP), a novel generalization of the matching pursuit strategy to decompo…

    Submitted 11 November, 2020; v1 submitted 10 October, 2018; originally announced October 2018.

    Comments: 14 pages, 3 figures

  40. arXiv:1807.00366  [pdf, other]

    cs.LG cs.AI stat.ML

    Beyond Winning and Losing: Modeling Human Motivations and Behaviors Using Inverse Reinforcement Learning

    Authors: Baoxiang Wang, Tongfang Sun, Xianjun Sam Zheng

    Abstract: In recent years, reinforcement learning (RL) methods have been applied to model gameplay with great success, achieving super-human performance in various environments, such as Atari, Go, and Poker. However, those studies mostly focus on winning the game and have largely ignored the rich and complex human motivations, which are essential for understanding different players' diverse behaviors. In th…

    Submitted 5 July, 2018; v1 submitted 1 July, 2018; originally announced July 2018.

  41. arXiv:1803.01422  [pdf, other]

    stat.ML cs.AI cs.LG stat.ME

    DAGs with NO TEARS: Continuous Optimization for Structure Learning

    Authors: Xun Zheng, Bryon Aragam, Pradeep Ravikumar, Eric P. Xing

    Abstract: Estimating the structure of directed acyclic graphs (DAGs, also known as Bayesian networks) is a challenging problem since the search space of DAGs is combinatorial and scales superexponentially with the number of nodes. Existing approaches rely on various local heuristics for enforcing the acyclicity constraint. In this paper, we introduce a fundamentally different strategy: We formulate the stru…

    Submitted 2 November, 2018; v1 submitted 4 March, 2018; originally announced March 2018.

    Comments: 22 pages, 8 figures, accepted to NIPS 2018

  42. Beyond Keywords and Relevance: A Personalized Ad Retrieval Framework in E-Commerce Sponsored Search

    Authors: Su Yan, Wei Lin, Tianshu Wu, Daorui Xiao, Xu Zheng, Bo Wu, Kaipeng Liu

    Abstract: On most sponsored search platforms, advertisers bid on some keywords for their advertisements (ads). Given a search request, ad retrieval module rewrites the query into bidding keywords, and uses these keywords as keys to select Top N ads through inverted indexes. In this way, an ad will not be retrieved even if queries are related when the advertiser does not bid on corresponding keywords. Moreov…

    Submitted 23 April, 2018; v1 submitted 28 December, 2017; originally announced December 2017.

    Journal ref: Proceedings of the 2018 World Wide Web Conference Pages 1919-1928

  43. arXiv:1712.09043  [pdf, other]

    cs.LG cs.IR stat.ML

    Neural Collaborative Autoencoder

    Authors: Qibing Li, Xiaolin Zheng, Xinyue Wu

    Abstract: In recent years, deep neural networks have yielded state-of-the-art performance on several tasks. Although some recent works have focused on combining deep learning with recommendation, we highlight three issues of existing models. First, these models cannot work on both explicit and implicit feedback, since the network structures are specially designed for one particular case. Second, due to the…

    Submitted 19 December, 2018; v1 submitted 25 December, 2017; originally announced December 2017.

  44. arXiv:1711.11179  [pdf, other]

    cs.LG stat.ML

    State Space LSTM Models with Particle MCMC Inference

    Authors: Xun Zheng, Manzil Zaheer, Amr Ahmed, Yuan Wang, Eric P Xing, Alexander J Smola

    Abstract: Long Short-Term Memory (LSTM) is one of the most powerful sequence models. Despite the strong performance, however, it lacks the nice interpretability as in state space models. In this paper, we present a way to combine the best of both worlds by introducing State Space LSTM (SSL) models that generalize the earlier work \cite{zaheer2017latent} of combining topic models with LSTM. However, unlike…

    Submitted 29 November, 2017; originally announced November 2017.

  45. arXiv:1711.00905  [pdf, other]

    stat.ML cs.LG physics.med-ph

    Sparse-View X-Ray CT Reconstruction Using $\ell_1$ Prior with Learned Transform

    Authors: Xuehang Zheng, Il Yong Chun, Zhipeng Li, Yong Long, Jeffrey A. Fessler

    Abstract: A major challenge in X-ray computed tomography (CT) is reducing radiation dose while maintaining high quality of reconstructed images. To reduce the radiation dose, one can reduce the number of projection views (sparse-view CT); however, it becomes difficult to achieve high-quality image reconstruction as the number of projection views decreases. Researchers have applied the concept of learning sp…

    Submitted 15 September, 2019; v1 submitted 2 November, 2017; originally announced November 2017.

    Comments: The first two authors contributed equally to this work

  46. Low Dose CT Image Reconstruction With Learned Sparsifying Transform

    Authors: Xuehang Zheng, Zening Lu, Saiprasad Ravishankar, Yong Long, Jeffrey A. Fessler

    Abstract: A major challenge in computed tomography (CT) is to reduce X-ray dose to a low or even ultra-low level while maintaining the high quality of reconstructed images. We propose a new method for CT reconstruction that combines penalized weighted-least squares reconstruction (PWLS) with regularization based on a sparsifying transform (PWLS-ST) learned from a dataset of numerous CT images. We adopt an a…

    Submitted 10 July, 2017; originally announced July 2017.

    Comments: This is a revised and corrected version of the IEEE IVMSP Workshop paper DOI: 10.1109/IVMSPW.2016.7528219

  47. PWLS-ULTRA: An Efficient Clustering and Learning-Based Approach for Low-Dose 3D CT Image Reconstruction

    Authors: Xuehang Zheng, Saiprasad Ravishankar, Yong Long, Jeffrey A. Fessler

    Abstract: The development of computed tomography (CT) image reconstruction methods that significantly reduce patient radiation exposure while maintaining high image quality is an important area of research in low-dose CT (LDCT) imaging. We propose a new penalized weighted least squares (PWLS) reconstruction method that exploits regularization based on an efficient Union of Learned TRAnsforms (PWLS-ULTRA). T…

    Submitted 1 June, 2018; v1 submitted 27 March, 2017; originally announced March 2017.

    Comments: Accepted to IEEE Transactions on Medical Imaging

    Journal ref: IEEE Transactions on Medical Imaging 37(6):1498-510 Jun 2018

  48. arXiv:1412.1576  [pdf, other]

    stat.ML cs.DC cs.IR cs.LG

    LightLDA: Big Topic Models on Modest Compute Clusters

    Authors: Jinhui Yuan, Fei Gao, Qirong Ho, Wei Dai, Jinliang Wei, Xun Zheng, Eric P. Xing, Tie-Yan Liu, Wei-Ying Ma

    Abstract: When building large-scale machine learning (ML) programs, such as big topic models or deep neural nets, one usually assumes such tasks can only be attempted with industrial-sized clusters with thousands of nodes, which are out of reach for most practitioners or academic researchers. We consider this challenge in the context of topic modeling on web-scale corpora, and show that with a modest cluste…

    Submitted 4 December, 2014; originally announced December 2014.

  49. arXiv:1411.2305  [pdf, other]

    cs.DC cs.LG stat.ML

    Model-Parallel Inference for Big Topic Models

    Authors: Xun Zheng, Jin Kyu Kim, Qirong Ho, Eric P. Xing

    Abstract: In real world industrial applications of topic modeling, the ability to capture gigantic conceptual space by learning an ultra-high dimensional topical representation, i.e., the so-called "big model", is becoming the next desideratum after enthusiasms on "big data", especially for fine-grained downstream tasks such as online advertising, where good performances are usually achieved by regression-b…

    Submitted 9 November, 2014; originally announced November 2014.

  50. arXiv:1406.4580  [pdf, other]

    stat.ML cs.DC cs.LG

    Primitives for Dynamic Big Model Parallelism

    Authors: Seunghak Lee, Jin Kyu Kim, Xun Zheng, Qirong Ho, Garth A. Gibson, Eric P. Xing

    Abstract: When training large machine learning models with many variables or parameters, a single machine is often inadequate since the model may be too large to fit in memory, while training can take a long time even with stochastic updates. A natural recourse is to turn to distributed cluster computing, in order to harness additional memory and processors. However, naive, unstructured parallelization of M…

    Submitted 17 June, 2014; originally announced June 2014.