Skip to main content

Showing 1–50 of 74 results for author: Shen, J

Searching in archive stat. Search in all archives.
.
  1. arXiv:2503.02071  [pdf, other

    stat.ME

    Integrating Misclassified EHR Outcomes with Validated Outcomes from a Non-probability Sample

    Authors: Jenny Shen, Dane Isenberg, Kristin A. Linn, Rebecca A. Hubbard

    Abstract: Although increasingly used for research, electronic health records (EHR) often lack gold-standard assessment of key data elements. Linking EHRs to other data sources with higher-quality measurements can improve statistical inference, but such analyses must account for selection bias if the linked data source arises from a non-probability sample. We propose a set of novel estimators targeting the a… ▽ More

    Submitted 3 March, 2025; originally announced March 2025.

  2. arXiv:2503.00299  [pdf, ps, other

    cs.LG cs.AI math.OC stat.ML

    Hidden Convexity of Fair PCA and Fast Solver via Eigenvalue Optimization

    Authors: Junhui Shen, Aaron J. Davis, Ding Lu, Zhaojun Bai

    Abstract: Principal Component Analysis (PCA) is a foundational technique in machine learning for dimensionality reduction of high-dimensional datasets. However, PCA could lead to biased outcomes that disadvantage certain subgroups of the underlying datasets. To address the bias issue, a Fair PCA (FPCA) model was introduced by Samadi et al. (2018) for equalizing the reconstruction loss between subgroups. The… ▽ More

    Submitted 28 February, 2025; originally announced March 2025.

  3. arXiv:2502.20285  [pdf, other

    cs.LG stat.ML

    Conformal Tail Risk Control for Large Language Model Alignment

    Authors: Catherine Yu-Chi Chen, Jingyan Shen, Zhun Deng, Lihua Lei

    Abstract: Recent developments in large language models (LLMs) have led to their widespread usage for various tasks. The prevalence of LLMs in society implores the assurance on the reliability of their performance. In particular, risk-sensitive applications demand meticulous attention to unexpectedly poor outcomes, i.e., tail events, for instance, toxic answers, humiliating language, and offensive outputs. D… ▽ More

    Submitted 27 February, 2025; originally announced February 2025.

  4. arXiv:2502.15962  [pdf, ps, other

    cs.LG cs.DS stat.ML

    Towards Efficient Contrastive PAC Learning

    Authors: Jie Shen

    Abstract: We study contrastive learning under the PAC learning framework. While a series of recent works have shown statistical results for learning under contrastive loss, based either on the VC-dimension or Rademacher complexity, their algorithms are inherently inefficient or not implying PAC guarantees. In this paper, we consider contrastive learning of the fundamental concept of linear representations.… ▽ More

    Submitted 4 July, 2025; v1 submitted 21 February, 2025; originally announced February 2025.

    Comments: accepted to Transactions on Machine Learning Research

  5. arXiv:2502.02861  [pdf, ps, other

    stat.ML cs.DS cs.LG

    Algorithms with Calibrated Machine Learning Predictions

    Authors: Judy Hanwen Shen, Ellen Vitercik, Anders Wikum

    Abstract: The field of algorithms with predictions incorporates machine learning advice in the design of online algorithms to improve real-world performance. A central consideration is the extent to which predictions can be trusted -- while existing approaches often require users to specify an aggregate trust level, modern machine learning models can provide estimates of prediction-level uncertainty. In thi… ▽ More

    Submitted 14 June, 2025; v1 submitted 4 February, 2025; originally announced February 2025.

    Comments: v2 matches the camera-ready version accepted at ICML 2025

  6. arXiv:2412.04663  [pdf, other

    stat.ME stat.AP

    Learning Fair Decisions with Factor Models: Applications to Annuity Pricing

    Authors: Fei Huang, Junhao Shen, Yanrong Yang, Ran Zhao

    Abstract: Fairness-aware statistical learning is essential for mitigating discrimination against protected attributes such as gender, race, and ethnicity in data-driven decision-making. This is particularly critical in high-stakes applications like insurance underwriting and annuity pricing, where biased business decisions can have significant financial and social consequences. Factor models are commonly us… ▽ More

    Submitted 13 April, 2025; v1 submitted 5 December, 2024; originally announced December 2024.

  7. arXiv:2411.01382  [pdf, other

    stat.ME math.ST

    On MCMC mixing under unidentified nonparametric models with an application to survival predictions under transformation models

    Authors: Chong Zhong, Jin Yang, Junshan Shen, Catherine C. Liu, Zhaohai Li

    Abstract: The multi-modal posterior under unidentified nonparametric models yields poor mixing of Markov Chain Monte Carlo (MCMC), which is a stumbling block to Bayesian predictions. In this article, we conceptualize a prior informativeness threshold that is essentially the variance of posterior modes and expressed by the uncertainty hyperparameters of nonparametric priors. The threshold plays the role of a… ▽ More

    Submitted 2 November, 2024; originally announced November 2024.

  8. arXiv:2410.01186  [pdf, ps, other

    cs.LG cs.DS stat.ML

    Efficient PAC Learning of Halfspaces with Constant Malicious Noise Rate

    Authors: Jie Shen

    Abstract: Understanding noise tolerance of machine learning algorithms is a central quest in learning theory. In this work, we study the problem of computationally efficient PAC learning of halfspaces in the presence of malicious noise, where an adversary can corrupt both instances and labels of training samples. The best-known noise tolerance either depends on a target error rate under distributional assum… ▽ More

    Submitted 14 February, 2025; v1 submitted 1 October, 2024; originally announced October 2024.

    Comments: ALT 2025 (V4 fixed some typos in V3 and a missing factor 'd' in Prop 14)

  9. arXiv:2409.02392  [pdf, other

    cs.LG stat.ML

    Building Math Agents with Multi-Turn Iterative Preference Learning

    Authors: Wei Xiong, Chengshuai Shi, Jiaming Shen, Aviv Rosenberg, Zhen Qin, Daniele Calandriello, Misha Khalman, Rishabh Joshi, Bilal Piot, Mohammad Saleh, Chi Jin, Tong Zhang, Tianqi Liu

    Abstract: Recent studies have shown that large language models' (LLMs) mathematical problem-solving capabilities can be enhanced by integrating external tools, such as code interpreters, and employing multi-turn Chain-of-Thought (CoT) reasoning. While current methods focus on synthetic data generation and Supervised Fine-Tuning (SFT), this paper studies the complementary direct preference learning approach… ▽ More

    Submitted 27 February, 2025; v1 submitted 3 September, 2024; originally announced September 2024.

    Comments: A multi-turn direct preference learning framework for tool-integrated reasoning tasks

  10. arXiv:2408.04154  [pdf, other

    cs.LG cs.AI stat.ML

    The Data Addition Dilemma

    Authors: Judy Hanwen Shen, Inioluwa Deborah Raji, Irene Y. Chen

    Abstract: In many machine learning for healthcare tasks, standard datasets are constructed by amassing data across many, often fundamentally dissimilar, sources. But when does adding more data help, and when does it hinder progress on desired model outcomes in real-world settings? We identify this situation as the \textit{Data Addition Dilemma}, demonstrating that adding training data in this multi-source s… ▽ More

    Submitted 7 August, 2024; originally announced August 2024.

    Comments: Machine Learning For Health Care 2024 (MLHC)

  11. arXiv:2407.15247  [pdf, ps, other

    cs.LG stat.ML

    TimeInf: Time Series Data Contribution via Influence Functions

    Authors: Yizi Zhang, Jingyan Shen, Xiaoxue Xiong, Yongchan Kwon

    Abstract: Evaluating the contribution of individual data points to a model's prediction is critical for interpreting model predictions and improving model performance. Existing data contribution methods have been applied to various data types, including tabular data, images, and text; however, their primary focus has been on i.i.d. settings. Despite the pressing need for principled approaches tailored to ti… ▽ More

    Submitted 14 June, 2025; v1 submitted 21 July, 2024; originally announced July 2024.

  12. arXiv:2406.03056  [pdf, other

    stat.ME stat.AP

    Sparse two-stage Bayesian meta-analysis for individualized treatments

    Authors: Junwei Shen, Erica E. M. Moodie, Shirin Golchi

    Abstract: Individualized treatment rules tailor treatments to patients based on clinical, demographic, and other characteristics. Estimation of individualized treatment rules requires the identification of individuals who benefit most from the particular treatments and thus the detection of variability in treatment effects. To develop an effective individualized treatment rule, data from multisite studies m… ▽ More

    Submitted 5 June, 2024; originally announced June 2024.

  13. arXiv:2404.17554  [pdf

    cs.HC eess.SP eess.SY stat.AP

    A Novel Context driven Critical Integrative Levels (CIL) Approach: Advancing Human-Centric and Integrative Lighting Asset Management in Public Libraries with Practical Thresholds

    Authors: Jing Lin, Nina Mylly, Per Olof Hedekvist, Jingchun Shen

    Abstract: This paper proposes the context driven Critical Integrative Levels (CIL), a novel approach to lighting asset management in public libraries that aligns with the transformative vision of human-centric and integrative lighting. This approach encompasses not only the visual aspects of lighting performance but also prioritizes the physiological and psychological well-being of library users. Incorporat… ▽ More

    Submitted 26 April, 2024; originally announced April 2024.

  14. arXiv:2404.03764  [pdf, other

    cs.LG stat.ME stat.ML

    Covariate-Elaborated Robust Partial Information Transfer with Conditional Spike-and-Slab Prior

    Authors: Ruqian Zhang, Yijiao Zhang, Annie Qu, Zhongyi Zhu, Juan Shen

    Abstract: The popularity of transfer learning stems from the fact that it can borrow information from useful auxiliary datasets. Existing statistical transfer learning methods usually adopt a global similarity measure between the source data and the target data, which may lead to inefficiency when only partial information is shared. In this paper, we propose a novel Bayesian transfer learning method named `… ▽ More

    Submitted 21 August, 2024; v1 submitted 30 March, 2024; originally announced April 2024.

    Comments: 35 pages, 4 figures

  15. Human-Centric and Integrative Lighting Asset Management in Public Libraries: Qualitative Insights and Challenges from a Swedish Field Study

    Authors: Jing Lin, Per Olof Hedekvist, Nina Mylly, Math Bollen, Jingchun Shen, Jiawei Xiong, Christofer Silfvenius

    Abstract: Traditional lighting source reliability evaluations, often covering just half of a lamp's volume, can misrepresent real-world performance. To overcome these limitations,adopting advanced asset management strategies for a more holistic evaluation is crucial. This paper investigates human-centric and integrative lighting asset management in Swedish public libraries. Through field observations, inter… ▽ More

    Submitted 5 April, 2024; v1 submitted 19 January, 2024; originally announced January 2024.

    Comments: 22 pages

    Journal ref: 2024;published

  16. arXiv:2309.04013  [pdf, other

    math.OC cs.LG stat.ML

    An Element-wise RSAV Algorithm for Unconstrained Optimization Problems

    Authors: Shiheng Zhang, Jiahao Zhang, Jie Shen, Guang Lin

    Abstract: We present a novel optimization algorithm, element-wise relaxed scalar auxiliary variable (E-RSAV), that satisfies an unconditional energy dissipation law and exhibits improved alignment between the modified and the original energy. Our algorithm features rigorous proofs of linear convergence in the convex setting. Furthermore, we present a simple accelerated algorithm that improves the linear con… ▽ More

    Submitted 7 September, 2023; originally announced September 2023.

    Comments: 25 pages, 7 figures

    MSC Class: 90C26; 68T99; 68W40

  17. arXiv:2306.06281  [pdf, other

    stat.ML cs.LG

    Energy-Dissipative Evolutionary Deep Operator Neural Networks

    Authors: Jiahao Zhang, Shiheng Zhang, Jie Shen, Guang Lin

    Abstract: Energy-Dissipative Evolutionary Deep Operator Neural Network is an operator learning neural network. It is designed to seed numerical solutions for a class of partial differential equations instead of a single partial differential equation, such as partial differential equations with different parameters or different initial conditions. The network consists of two sub-networks, the Branch net and… ▽ More

    Submitted 9 June, 2023; originally announced June 2023.

    Comments: 18 pages

  18. arXiv:2306.00673  [pdf, ps, other

    cs.DS cs.LG stat.ML

    Attribute-Efficient PAC Learning of Low-Degree Polynomial Threshold Functions with Nasty Noise

    Authors: Shiwei Zeng, Jie Shen

    Abstract: The concept class of low-degree polynomial threshold functions (PTFs) plays a fundamental role in machine learning. In this paper, we study PAC learning of $K$-sparse degree-$d$ PTFs on $\mathbb{R}^n$, where any such concept depends only on $K$ out of $n$ attributes of the input. Our main contribution is a new algorithm that runs in time $({nd}/ε)^{O(d)}$ and under the Gaussian marginal distributi… ▽ More

    Submitted 19 March, 2024; v1 submitted 1 June, 2023; originally announced June 2023.

    Comments: ICML 2023. V2 fixed typos

  19. arXiv:2305.01726  [pdf, ps, other

    stat.ML cs.LG stat.CO stat.ME

    Slow Kill for Big Data Learning

    Authors: Yiyuan She, Jianhui Shen, Adrian Barbu

    Abstract: Big-data applications often involve a vast number of observations and features, creating new challenges for variable selection and parameter estimation. This paper presents a novel technique called ``slow kill,'' which utilizes nonconvex constrained optimization, adaptive $\ell_2$-shrinkage, and increasing learning rates. The fact that the problem size can decrease during the slow kill iterations… ▽ More

    Submitted 2 May, 2023; originally announced May 2023.

  20. arXiv:2301.10958  [pdf, ps, other

    stat.ML cs.CV cs.LG

    Learning Large Scale Sparse Models

    Authors: Atul Dhingra, Jie Shen, Nicholas Kleene

    Abstract: In this work, we consider learning sparse models in large scale settings, where the number of samples and the feature dimension can grow as large as millions or billions. Two immediate issues occur under such challenging scenario: (i) computational cost; (ii) memory overhead. In particular, the memory issue precludes a large volume of prior algorithms that are based on batch optimization technique… ▽ More

    Submitted 26 January, 2023; originally announced January 2023.

  21. arXiv:2301.02243  [pdf, other

    cs.LG eess.SP stat.AP

    Machine Fault Classification using Hamiltonian Neural Networks

    Authors: Jeremy Shen, Jawad Chowdhury, Sourav Banerjee, Gabriel Terejanu

    Abstract: A new approach is introduced to classify faults in rotating machinery based on the total energy signature estimated from sensor measurements. The overall goal is to go beyond using black-box models and incorporate additional physical constraints that govern the behavior of mechanical systems. Observational data is used to train Hamiltonian neural networks that describe the conserved energy of the… ▽ More

    Submitted 4 January, 2023; originally announced January 2023.

    Comments: ICPRAM 2023

  22. arXiv:2205.14504  [pdf, other

    stat.ME

    Bayesian prediction via nonparametric transformation models

    Authors: Chong Zhong, Jin Yang, Junshan Shen, Catherine Liu, Zhaohai Li

    Abstract: This article tackles the old problem of prediction via a nonparametric transformation model (NTM) in a new Bayesian way. Estimation of NTMs is known challenging due to model unidentifiability though appealing because of its robust prediction capability in survival analysis. Inspired by the uniqueness of the posterior predictive distribution, we achieve efficient prediction via the NTM aforemention… ▽ More

    Submitted 7 February, 2023; v1 submitted 28 May, 2022; originally announced May 2022.

    Comments: The corresponding R package BuLTM is available on GitHub https://github.com/LazyLaker

  23. arXiv:2204.10971  [pdf, other

    stat.ME econ.GN stat.ML

    An Efficient Approach for Optimizing the Cost-effective Individualized Treatment Rule Using Conditional Random Forest

    Authors: Yizhe Xu, Tom H. Greene, Adam P. Bress, Brandon K. Bellows, Yue Zhang, Zugui Zhang, Paul Kolm, William S. Weintraub, Andrew S. Moran, Jincheng Shen

    Abstract: Evidence from observational studies has become increasingly important for supporting healthcare policy making via cost-effectiveness (CE) analyses. Similar as in comparative effectiveness studies, health economic evaluations that consider subject-level heterogeneity produce individualized treatment rules (ITRs) that are often more cost-effective than one-size-fits-all treatment. Thus, it is of gre… ▽ More

    Submitted 22 April, 2022; originally announced April 2022.

    Comments: Submitted to Statistical Methods in Medical Research

  24. arXiv:2201.02301  [pdf, other

    stat.ME

    New designs for Bayesian adaptive cluster-randomized trials

    Authors: Junwei Shen, Shirin Golchi, Erica E. M. Moodie, David Benrimoh

    Abstract: Adaptive approaches, allowing for more flexible trial design, have been proposed for individually randomized trials to save time or reduce sample size. However, adaptive designs for cluster-randomized trials in which groups of participants rather than individuals are randomized to treatment arms are less common. Motivated by a cluster-randomized trial designed to assess the effectiveness of a mach… ▽ More

    Submitted 6 January, 2022; originally announced January 2022.

  25. arXiv:2112.09746  [pdf, other

    stat.ML cs.LG math.ST stat.ME

    Supervised Multivariate Learning with Simultaneous Feature Auto-grouping and Dimension Reduction

    Authors: Yiyuan She, Jiahui Shen, Chao Zhang

    Abstract: Modern high-dimensional methods often adopt the "bet on sparsity" principle, while in supervised multivariate learning statisticians may face "dense" problems with a large number of nonzero coefficients. This paper proposes a novel clustered reduced-rank learning (CRL) framework that imposes two joint matrix regularizations to automatically group the features in constructing predictive factors. CR… ▽ More

    Submitted 9 February, 2022; v1 submitted 17 December, 2021; originally announced December 2021.

  26. arXiv:2112.08471  [pdf, other

    stat.ME math.ST stat.ML

    Gaining Outlier Resistance with Progressive Quantiles: Fast Algorithms and Theoretical Studies

    Authors: Yiyuan She, Zhifeng Wang, Jiahui Shen

    Abstract: Outliers widely occur in big-data applications and may severely affect statistical estimation and inference. In this paper, a framework of outlier-resistant estimation is introduced to robustify an arbitrarily given loss function. It has a close connection to the method of trimming and includes explicit outlyingness parameters for all samples, which in turn facilitates computation, theory, and par… ▽ More

    Submitted 18 April, 2023; v1 submitted 15 December, 2021; originally announced December 2021.

  27. arXiv:2111.10476  [pdf, other

    cs.LG cs.AI cs.CY stat.ML

    Towards Return Parity in Markov Decision Processes

    Authors: Jianfeng Chi, Jian Shen, Xinyi Dai, Weinan Zhang, Yuan Tian, Han Zhao

    Abstract: Algorithmic decisions made by machine learning models in high-stakes domains may have lasting impacts over time. However, naive applications of standard fairness criterion in static settings over temporal domains may lead to delayed and adverse effects. To understand the dynamics of performance disparity, we study a fairness problem in Markov decision processes (MDPs). Specifically, we propose ret… ▽ More

    Submitted 25 February, 2022; v1 submitted 19 November, 2021; originally announced November 2021.

    Comments: AISTATS 2022. Code is released at https://github.com/JFChi/Return-Parity-MDP

  28. arXiv:2111.08550  [pdf, other

    cs.LG cs.AI stat.ML

    On Effective Scheduling of Model-based Reinforcement Learning

    Authors: Hang Lai, Jian Shen, Weinan Zhang, Yimin Huang, Xing Zhang, Ruiming Tang, Yong Yu, Zhenguo Li

    Abstract: Model-based reinforcement learning has attracted wide attention due to its superior sample efficiency. Despite its impressive success so far, it is still unclear how to appropriately schedule the important hyperparameters to achieve adequate performance, such as the real data ratio for policy optimization in Dyna-style model-based algorithms. In this paper, we first theoretically analyze the role… ▽ More

    Submitted 5 July, 2022; v1 submitted 16 November, 2021; originally announced November 2021.

    Comments: Accepted at NeurIPS2021

  29. arXiv:2110.02440  [pdf, ps, other

    stat.AP stat.ME

    Inverse Probability Weighting-based Mediation Analysis for Microbiome Data

    Authors: Yuexia Zhang, Jian Wang, Jiayi Shen, Jessica Galloway-Pena, Samuel Shelburne, Linbo Wang, Jianhua Hu

    Abstract: Mediation analysis is an important tool for studying causal associations in biomedical and other scientific areas and has recently gained attention in microbiome studies. Using a microbiome study of acute myeloid leukemia (AML) patients, we investigate whether the effect of induction chemotherapy intensity levels on infection status is mediated by microbial taxa abundance. The unique characteristi… ▽ More

    Submitted 18 May, 2025; v1 submitted 5 October, 2021; originally announced October 2021.

    Comments: 39 pages, 2 figures

  30. arXiv:2109.03713  [pdf, other

    stat.ME stat.CO

    Dependent Dirichlet Processes for Analysis of a Generalized Shared Frailty Model

    Authors: Chong Zhong, Zhihua Ma, Junshan Shen, Catherine Liu

    Abstract: Bayesian paradigm takes advantage of well fitting complicated survival models and feasible computing in survival analysis owing to the superiority in tackling the complex censoring scheme, compared with the frequentist paradigm. In this chapter, we aim to display the latest tendency in Bayesian computing, in the sense of automating the posterior sampling, through Bayesian analysis of survival mode… ▽ More

    Submitted 9 September, 2021; v1 submitted 8 September, 2021; originally announced September 2021.

  31. arXiv:2108.00605  [pdf, other

    cs.LG cs.AI math.OC stat.ML

    Bucketed PCA Neural Networks with Neurons Mirroring Signals

    Authors: Jackie Shen

    Abstract: The bucketed PCA neural network (PCA-NN) with transforms is developed here in an effort to benchmark deep neural networks (DNN's), for problems on supervised classification. Most classical PCA models apply PCA to the entire training data set to establish a reductive representation and then employ non-network tools such as high-order polynomial classifiers. In contrast, the bucketed PCA-NN applies… ▽ More

    Submitted 1 August, 2021; originally announced August 2021.

    ACM Class: I.2.10; I.2.6

  32. arXiv:2108.00473  [pdf, other

    math.OC cs.LG stat.ML

    Derivative-free Alternating Projection Algorithms for General Nonconvex-Concave Minimax Problems

    Authors: Zi Xu, Ziqi Wang, Jingjing Shen, Yuhong Dai

    Abstract: In this paper, we study zeroth-order algorithms for nonconvex-concave minimax problems, which have attracted widely attention in machine learning, signal processing and many other fields in recent years. We propose a zeroth-order alternating randomized gradient projection (ZO-AGP) algorithm for smooth nonconvex-concave minimax problems, and its iteration complexity to obtain an $\varepsilon$-stati… ▽ More

    Submitted 25 January, 2024; v1 submitted 1 August, 2021; originally announced August 2021.

  33. arXiv:2102.06247  [pdf, ps, other

    cs.LG cs.DS stat.ML

    Sample-Optimal PAC Learning of Halfspaces with Malicious Noise

    Authors: Jie Shen

    Abstract: We study efficient PAC learning of homogeneous halfspaces in $\mathbb{R}^d$ in the presence of malicious noise of Valiant (1985). This is a challenging noise model and only until recently has near-optimal noise tolerance bound been established under the mild condition that the unlabeled data distribution is isotropic log-concave. However, it remains unsettled how to obtain the optimal sample compl… ▽ More

    Submitted 4 October, 2021; v1 submitted 11 February, 2021; originally announced February 2021.

    Comments: Accepted to ICML 2021. V2 and V3 polished writing

  34. arXiv:2101.00059  [pdf, other

    stat.ME q-bio.GN stat.AP

    CauchyCP: a powerful test under non-proportional hazards using Cauchy combination of change-point Cox regressions

    Authors: Hong Zhang, Qing Li, Devan V. Mehrotra, Judong Shen

    Abstract: Non-proportional hazards data are routinely encountered in randomized clinical trials. In such cases, classic Cox proportional hazards model can suffer from severe power loss, with difficulty in interpretation of the estimated hazard ratio since the treatment effect varies over time. We propose CauchyCP, an omnibus test of change-point Cox regression models, to overcome both challenges while detec… ▽ More

    Submitted 31 December, 2020; originally announced January 2021.

    Journal ref: Statistical Methods in Medical Research. 2021;30(11):2447-2458

  35. arXiv:2012.14878  [pdf, other

    cs.LG stat.ML

    Growing Deep Forests Efficiently with Soft Routing and Learned Connectivity

    Authors: Jianghao Shen, Sicheng Wang, Zhangyang Wang

    Abstract: Despite the latest prevailing success of deep neural networks (DNNs), several concerns have been raised against their usage, including the lack of intepretability the gap between DNNs and other well-established machine learning models, and the growingly expensive computational costs. A number of recent works [1], [2], [3] explored the alternative to sequentially stacking decision tree/random fores… ▽ More

    Submitted 29 December, 2020; originally announced December 2020.

    Comments: ICDM workshop 2018

    Journal ref: ICDM Workshops 2018: 399-402

  36. arXiv:2012.10793  [pdf, ps, other

    cs.LG cs.DS stat.ML

    On the Power of Localized Perceptron for Label-Optimal Learning of Halfspaces with Adversarial Noise

    Authors: Jie Shen

    Abstract: We study {\em online} active learning of homogeneous halfspaces in $\mathbb{R}^d$ with adversarial noise where the overall probability of a noisy label is constrained to be at most $ν$. Our main contribution is a Perceptron-like online active learning algorithm that runs in polynomial time, and under the conditions that the marginal distribution is isotropic log-concave and $ν= Ω(ε)$, where… ▽ More

    Submitted 22 June, 2021; v1 submitted 19 December, 2020; originally announced December 2020.

    Comments: V2 and V3 polished writing; accepted to ICML 2021

  37. arXiv:2010.09546  [pdf, other

    cs.LG cs.AI stat.ML

    Model-based Policy Optimization with Unsupervised Model Adaptation

    Authors: Jian Shen, Han Zhao, Weinan Zhang, Yong Yu

    Abstract: Model-based reinforcement learning methods learn a dynamics model with real data sampled from the environment and leverage it to generate simulated data to derive an agent. However, due to the potential distribution mismatch between simulated data and real data, this could lead to degraded performance. Despite much effort being devoted to reducing this distribution mismatch, existing methods fail… ▽ More

    Submitted 28 October, 2020; v1 submitted 19 October, 2020; originally announced October 2020.

    Comments: Thirty-fourth Conference on Neural Information Processing Systems (NeurIPS 2020)

  38. arXiv:2007.05700  [pdf, other

    cs.LG cs.SI stat.ML

    M-Evolve: Structural-Mapping-Based Data Augmentation for Graph Classification

    Authors: Jiajun Zhou, Jie Shen, Shanqing Yu, Guanrong Chen, Qi Xuan

    Abstract: Graph classification, which aims to identify the category labels of graphs, plays a significant role in drug classification, toxicity detection, protein analysis etc. However, the limitation of scale in the benchmark datasets makes it easy for graph classification models to fall into over-fitting and undergeneralization. To improve this, we introduce data augmentation on graphs (i.e. graph augment… ▽ More

    Submitted 3 April, 2021; v1 submitted 11 July, 2020; originally announced July 2020.

    Comments: 11 pages, 9 figures. arXiv admin note: text overlap with arXiv:2009.09863

  39. arXiv:2007.03641  [pdf, ps, other

    cs.LG math.NA stat.ML

    One-Bit Compressed Sensing via One-Shot Hard Thresholding

    Authors: Jie Shen

    Abstract: This paper concerns the problem of 1-bit compressed sensing, where the goal is to estimate a sparse signal from a few of its binary measurements. We study a non-convex sparsity-constrained program and present a novel and concise analysis that moves away from the widely used notion of Gaussian width. We show that with high probability a simple algorithm is guaranteed to produce an accurate approxim… ▽ More

    Submitted 9 July, 2020; v1 submitted 7 July, 2020; originally announced July 2020.

    Comments: Accepted to The Conference on Uncertainty in Artificial Intelligence (UAI) 2020

  40. arXiv:2007.01995  [pdf, other

    cs.LG cs.AI stat.ML

    Bidirectional Model-based Policy Optimization

    Authors: Hang Lai, Jian Shen, Weinan Zhang, Yong Yu

    Abstract: Model-based reinforcement learning approaches leverage a forward dynamics model to support planning and decision making, which, however, may fail catastrophically if the model is inaccurate. Although there are several existing methods dedicated to combating the model error, the potential of the single forward model is still limited. In this paper, we propose to additionally construct a backward dy… ▽ More

    Submitted 29 September, 2020; v1 submitted 3 July, 2020; originally announced July 2020.

    Comments: Accepted at ICML2020

  41. arXiv:2006.03781  [pdf, ps, other

    stat.ML cs.DS cs.LG

    Attribute-Efficient Learning of Halfspaces with Malicious Noise: Near-Optimal Label Complexity and Noise Tolerance

    Authors: Jie Shen, Chicheng Zhang

    Abstract: This paper is concerned with computationally efficient learning of homogeneous sparse halfspaces in $\mathbb{R}^d$ under noise. Though recent works have established attribute-efficient learning algorithms under various types of label noise (e.g. bounded noise), it remains an open question when and how $s$-sparse halfspaces can be efficiently learned under the challenging malicious noise model, whe… ▽ More

    Submitted 2 March, 2021; v1 submitted 6 June, 2020; originally announced June 2020.

    Comments: V1/V2 had a problematic argument on polynomial-time solvability of a form of sparse principal component analysis. V3 fixed it by using a new approach based on semidefinite programming. V4/V5 polishes the writing and is accepted to ALT 2021

  42. arXiv:2005.00905  [pdf, other

    stat.ME q-bio.QM stat.CO

    An efficient and accurate approximation to the distribution of quadratic forms of Gaussian variables

    Authors: Hong Zhang, Judong Shen, Zheyang Wu

    Abstract: In computational and applied statistics, it is of great interest to get fast and accurate calculation for the distributions of the quadratic forms of Gaussian random variables. This paper presents a novel approximation strategy that contains two developments. First, we propose a faster numerical procedure in computing the moments of the quadratic forms. Second, we establish a general moment-matchi… ▽ More

    Submitted 23 September, 2020; v1 submitted 2 May, 2020; originally announced May 2020.

    Journal ref: Journal of Computational and Graphical Statistics. 2022, 31:1, 304-311

  43. arXiv:2004.13797  [pdf, ps, other

    q-fin.TR math.OC q-fin.CP q-fin.MF stat.AP

    A Stochastic LQR Model for Child Order Placement in Algorithmic Trading

    Authors: Jackie Jianhong Shen

    Abstract: Modern Algorithmic Trading ("Algo") allows institutional investors and traders to liquidate or establish big security positions in a fully automated or low-touch manner. Most existing academic or industrial Algos focus on how to "slice" a big parent order into smaller child orders over a given time horizon. Few models rigorously tackle the actual placement of these child orders. Instead, placement… ▽ More

    Submitted 28 April, 2020; originally announced April 2020.

    MSC Class: 91G80

  44. arXiv:2004.05318  [pdf, other

    cs.LG stat.ML

    Multi-task Learning via Adaptation to Similar Tasks for Mortality Prediction of Diverse Rare Diseases

    Authors: Luchen Liu, Zequn Liu, Haoxian Wu, Zichang Wang, Jianhao Shen, Yiping Song, Ming Zhang

    Abstract: Mortality prediction of diverse rare diseases using electronic health record (EHR) data is a crucial task for intelligent healthcare. However, data insufficiency and the clinical diversity of rare diseases make it hard for directly training deep learning models on individual disease data or all the data from different diseases. Mortality prediction for these patients with different diseases can be… ▽ More

    Submitted 11 May, 2020; v1 submitted 11 April, 2020; originally announced April 2020.

    Comments: 10 pages, 3 Figures, submitted to AMIA Annual Symposium

  45. arXiv:2003.00874  [pdf, other

    cs.CV cs.LG stat.ML

    Weakly-supervised Object Localization for Few-shot Learning and Fine-grained Few-shot Learning

    Authors: Xiaojian He, Jinfu Lin, Junming Shen

    Abstract: Few-shot learning (FSL) aims to learn novel visual categories from very few samples, which is a challenging problem in real-world applications. Many methods of few-shot classification work well on general images to learn global representation. However, they can not deal with fine-grained categories well at the same time due to a lack of subtle and local information. We argue that localization is a… ▽ More

    Submitted 11 December, 2020; v1 submitted 2 March, 2020; originally announced March 2020.

    Comments: 8 pages, 6 figures, 6 tables

    MSC Class: 68T30; 68T10 (Primary) ACM Class: I.2.4; I.5.3

  46. arXiv:2002.12168  [pdf, other

    cs.LG cs.CV stat.ML

    Infinitely Wide Graph Convolutional Networks: Semi-supervised Learning via Gaussian Processes

    Authors: Jilin Hu, Jianbing Shen, Bin Yang, Ling Shao

    Abstract: Graph convolutional neural networks~(GCNs) have recently demonstrated promising results on graph-based semi-supervised classification, but little work has been done to explore their theoretical properties. Recently, several deep neural networks, e.g., fully connected and convolutional neural networks, with infinite hidden units have been proved to be equivalent to Gaussian processes~(GPs). To expl… ▽ More

    Submitted 26 February, 2020; originally announced February 2020.

  47. arXiv:2002.09745  [pdf, other

    cs.CR cs.DS cs.LG stat.ML

    Differentially Private Set Union

    Authors: Sivakanth Gopi, Pankaj Gulhane, Janardhan Kulkarni, Judy Hanwen Shen, Milad Shokouhi, Sergey Yekhanin

    Abstract: We study the basic operation of set union in the global model of differential privacy. In this problem, we are given a universe $U$ of items, possibly of infinite size, and a database $D$ of users. Each user $i$ contributes a subset $W_i \subseteq U$ of items. We want an ($ε$,$δ$)-differentially private algorithm which outputs a subset $S \subset \cup_i W_i$ such that the size of $S$ is as large a… ▽ More

    Submitted 6 April, 2022; v1 submitted 22 February, 2020; originally announced February 2020.

    Comments: 23 pages, 7 figures

  48. arXiv:2002.04840  [pdf, other

    cs.LG stat.ML

    Efficient active learning of sparse halfspaces with arbitrary bounded noise

    Authors: Chicheng Zhang, Jie Shen, Pranjal Awasthi

    Abstract: We study active learning of homogeneous $s$-sparse halfspaces in $\mathbb{R}^d$ under the setting where the unlabeled data distribution is isotropic log-concave and each label is flipped with probability at most $η$ for a parameter $η\in \big[0, \frac12\big)$, known as the bounded noise. Even in the presence of mild label noise, i.e. $η$ is a small constant, this is a challenging problem and only… ▽ More

    Submitted 13 August, 2021; v1 submitted 12 February, 2020; originally announced February 2020.

    Comments: 33 pages, 2 figures; NeurIPS 2020

  49. arXiv:2001.00705  [pdf, other

    cs.LG stat.ML

    Fractional Skipping: Towards Finer-Grained Dynamic CNN Inference

    Authors: Jianghao Shen, Yonggan Fu, Yue Wang, Pengfei Xu, Zhangyang Wang, Yingyan Lin

    Abstract: While increasingly deep networks are still in general desired for achieving state-of-the-art performance, for many specific inputs a simpler network might already suffice. Existing works exploited this observation by learning to skip convolutional layers in an input-dependent manner. However, we argue their binary decision scheme, i.e., either fully executing or completely bypassing one layer for… ▽ More

    Submitted 2 January, 2020; originally announced January 2020.

  50. arXiv:1911.09310  [pdf, other

    cs.LG stat.ML

    Improving Unsupervised Domain Adaptation with Variational Information Bottleneck

    Authors: Yuxuan Song, Lantao Yu, Zhangjie Cao, Zhiming Zhou, Jian Shen, Shuo Shao, Weinan Zhang, Yong Yu

    Abstract: Domain adaptation aims to leverage the supervision signal of source domain to obtain an accurate model for target domain, where the labels are not available. To leverage and adapt the label information from source domain, most existing methods employ a feature extracting function and match the marginal distributions of source and target domains in a shared feature space. In this paper, from the pe… ▽ More

    Submitted 21 November, 2019; originally announced November 2019.