Skip to main content

Showing 1–19 of 19 results for author: Qian, C

Searching in archive stat. Search in all archives.
.
  1. arXiv:2507.04237  [pdf, ps, other

    stat.ME stat.ML

    Structural Classification of Locally Stationary Time Series Based on Second-order Characteristics

    Authors: Chen Qian, Xiucai Ding, Lexin Li

    Abstract: Time series classification is crucial for numerous scientific and engineering applications. In this article, we present a numerically efficient, practically competitive, and theoretically rigorous classification method for distinguishing between two classes of locally stationary time series based on their time-domain, second-order characteristics. Our approach builds on the autoregressive approxim… ▽ More

    Submitted 9 July, 2025; v1 submitted 6 July, 2025; originally announced July 2025.

    Comments: 41 Pages, 4 Figures

  2. arXiv:2505.05269  [pdf, other

    stat.ML cs.LG

    A Two-Sample Test of Text Generation Similarity

    Authors: Jingbin Xu, Chen Qian, Meimei Liu, Feng Guo

    Abstract: The surge in digitized text data requires reliable inferential methods on observed textual patterns. This article proposes a novel two-sample text test for comparing similarity between two groups of documents. The hypothesis is whether the probabilistic mapping generating the textual data is identical across two groups of documents. The proposed test aims to assess text similarity by comparing the… ▽ More

    Submitted 8 May, 2025; originally announced May 2025.

  3. arXiv:2504.13018  [pdf, ps, other

    stat.ME

    High Dimensional Sparse Canonical Correlation Analysis for Elliptical Symmetric Distributions

    Authors: Chengde Qian, Yanhong Liu, Long Feng

    Abstract: This paper proposes a robust high-dimensional sparse canonical correlation analysis (CCA) method for investigating linear relationships between two high-dimensional random vectors, focusing on elliptical symmetric distributions. Traditional CCA methods, based on sample covariance matrices, struggle in high-dimensional settings, particularly when data exhibit heavy-tailed distributions. To address… ▽ More

    Submitted 17 April, 2025; originally announced April 2025.

  4. arXiv:2503.02506  [pdf, other

    stat.ME stat.ML

    Robust Multi-Source Domain Adaptation under Label Shift

    Authors: Congbin Xu, Chengde Qian, Zhaojun Wang, Changliang Zou

    Abstract: As the volume of data continues to expand, it becomes increasingly common for data to be aggregated from multiple sources. Leveraging multiple sources for model training typically achieves better predictive performance on test datasets. Unsupervised multi-source domain adaptation aims to predict labels of unlabeled samples in the target domain by using labeled samples from source domains. This wor… ▽ More

    Submitted 4 March, 2025; originally announced March 2025.

    Comments: 53 pages, 3 figures

  5. arXiv:2411.07874  [pdf, other

    stat.ME math.ST

    Changepoint Detection in Complex Models: Cross-Fitting Is Needed

    Authors: Chengde Qian, Guanghui Wang, Zhaojun Wang, Changliang Zou

    Abstract: Changepoint detection is commonly approached by minimizing the sum of in-sample losses to quantify the model's overall fit across distinct data segments. However, we observe that flexible modeling techniques, particularly those involving hyperparameter tuning or model selection, often lead to inaccurate changepoint estimation due to biases that distort the target of in-sample loss minimization. To… ▽ More

    Submitted 12 November, 2024; originally announced November 2024.

  6. arXiv:2310.10603  [pdf, other

    cs.LG cs.AI cs.NE math.OC stat.ML

    Exploring the Power of Graph Neural Networks in Solving Linear Optimization Problems

    Authors: Chendi Qian, Didier Chételat, Christopher Morris

    Abstract: Recently, machine learning, particularly message-passing graph neural networks (MPNNs), has gained traction in enhancing exact optimization algorithms. For example, MPNNs speed up solving mixed-integer optimization problems by imitating computational intensive heuristics like strong branching, which entails solving multiple linear optimization problems (LPs). Despite the empirical success, the rea… ▽ More

    Submitted 16 October, 2023; originally announced October 2023.

  7. arXiv:2309.08570  [pdf

    stat.ML cs.LG physics.optics

    Neural Network Driven, Interactive Design for Nonlinear Optical Molecules Based on Group Contribution Method

    Authors: Jinming Fan, Chao Qian, Shaodong Zhou

    Abstract: A Lewis-mode group contribution method (LGC) -- multi-stage Bayesian neural network (msBNN) -- evolutionary algorithm (EA) framework is reported for rational design of D-Pi-A type organic small-molecule nonlinear optical materials is presented. Upon combination of msBNN and corrected Lewis-mode group contribution method (cLGC), different optical properties of molecules are afforded accurately and… ▽ More

    Submitted 15 September, 2023; originally announced September 2023.

  8. arXiv:2307.01150  [pdf, ps, other

    stat.ME math.ST

    Reliever: Relieving the Burden of Costly Model Fits for Changepoint Detection

    Authors: Chengde Qian, Guanghui Wang, Changliang Zou

    Abstract: We propose a general methodology Reliever for fast and reliable changepoint detection when the model fitting is costly. Instead of fitting a sequence of models for each potential search interval, Reliever employs a substantially reduced number of proxy/relief models that are trained on a predetermined set of intervals. This approach can be seamlessly integrated with state-of-the-art changepoint se… ▽ More

    Submitted 3 July, 2023; originally announced July 2023.

  9. arXiv:2305.01770  [pdf, other

    cs.LG q-bio.QM stat.ML

    DeCom: Deep Coupled-Factorization Machine for Post COVID-19 Respiratory Syncytial Virus Prediction with Nonpharmaceutical Interventions Awareness

    Authors: Xinyan Li, Cheng Qian, Lucas Glass

    Abstract: Respiratory syncytial virus (RSV) is one of the most dangerous respiratory diseases for infants and young children. Due to the nonpharmaceutical intervention (NPI) imposed in the COVID-19 outbreak, the seasonal transmission pattern of RSV has been discontinued in 2020 and then shifted months ahead in 2021 in the northern hemisphere. It is critical to understand how COVID-19 impacts RSV and build p… ▽ More

    Submitted 2 May, 2023; originally announced May 2023.

  10. arXiv:2212.09083  [pdf, other

    cs.LG cs.AI cs.SI stat.ML

    Influence-Based Mini-Batching for Graph Neural Networks

    Authors: Johannes Gasteiger, Chendi Qian, Stephan Günnemann

    Abstract: Using graph neural networks for large graphs is challenging since there is no clear way of constructing mini-batches. To solve this, previous methods have relied on sampling or graph clustering. While these approaches often lead to good training convergence, they introduce significant overhead due to expensive random data accesses and perform poorly during inference. In this work we instead focus… ▽ More

    Submitted 18 December, 2022; originally announced December 2022.

    Comments: Published as a proceedings paper at LoG 2022

  11. arXiv:2206.11168  [pdf, other

    cs.LG cs.AI cs.DS cs.NE stat.ML

    Ordered Subgraph Aggregation Networks

    Authors: Chendi Qian, Gaurav Rattan, Floris Geerts, Christopher Morris, Mathias Niepert

    Abstract: Numerous subgraph-enhanced graph neural networks (GNNs) have emerged recently, provably boosting the expressive power of standard (message-passing) GNNs. However, there is a limited understanding of how these approaches relate to each other and to the Weisfeiler-Leman hierarchy. Moreover, current approaches either use all subgraphs of a given size, sample them uniformly at random, or use hand-craf… ▽ More

    Submitted 15 October, 2022; v1 submitted 22 June, 2022; originally announced June 2022.

    Comments: Accepted at NeurIPS 2022. Fixed link to code repository

  12. arXiv:2202.00071  [pdf, other

    cs.LG cs.IR stat.ML

    JULIA: Joint Multi-linear and Nonlinear Identification for Tensor Completion

    Authors: Cheng Qian, Kejun Huang, Lucas Glass, Rakshith S. Srinivasa, Jimeng Sun

    Abstract: Tensor completion aims at imputing missing entries from a partially observed tensor. Existing tensor completion methods often assume either multi-linear or nonlinear relationships between latent components. However, real-world tensors have much more complex patterns where both multi-linear and nonlinear relationships may coexist. In such cases, the existing methods are insufficient to describe t… ▽ More

    Submitted 31 January, 2022; originally announced February 2022.

  13. arXiv:1910.05484  [pdf, other

    stat.ML cs.LG

    Bayesian Optimization using Pseudo-Points

    Authors: Chao Qian, Hang Xiong, Ke Xue

    Abstract: Bayesian optimization (BO) is a popular approach for expensive black-box optimization, with applications including parameter tuning, experimental design, robotics. BO usually models the objective function by a Gaussian process (GP), and iteratively samples the next data point by maximizing an acquisition function. In this paper, we propose a new general framework for BO by generating pseudo-points… ▽ More

    Submitted 21 January, 2020; v1 submitted 12 October, 2019; originally announced October 2019.

  14. arXiv:1907.13100  [pdf, other

    cs.NE cs.CC stat.ML

    On the Robustness of Median Sampling in Noisy Evolutionary Optimization

    Authors: Chao Bian, Chao Qian, Yang Yu, Ke Tang

    Abstract: Evolutionary algorithms (EAs) are a sort of nature-inspired metaheuristics, which have wide applications in various practical optimization problems. In these problems, objective evaluations are usually inaccurate, because noise is almost inevitable in real world, and it is a crucial issue to weaken the negative effect caused by noise. Sampling is a popular strategy, which evaluates the objective a… ▽ More

    Submitted 28 November, 2022; v1 submitted 28 July, 2019; originally announced July 2019.

    Comments: 19 pages. arXiv admin note: text overlap with arXiv:1810.05045, arXiv:1711.00956

  15. arXiv:1907.11911  [pdf, other

    cs.LG stat.ML

    REP: Predicting the Time-Course of Drug Sensitivity

    Authors: Cheng Qian, Amin Emad, Nicholas D. Sidiropoulos

    Abstract: The biological processes involved in a drug's mechanisms of action are oftentimes dynamic, complex and difficult to discern. Time-course gene expression data is a rich source of information that can be used to unravel these complex processes, identify biomarkers of drug sensitivity and predict the response to a drug. However, the majority of previous work has not fully utilized this temporal dimen… ▽ More

    Submitted 27 July, 2019; originally announced July 2019.

  16. arXiv:1907.02443  [pdf, other

    stat.ML cs.LG stat.ME

    High-dimensional Gaussian graphical model for network-linked data

    Authors: Tianxi Li, Cheng Qian, Elizaveta Levina, Ji Zhu

    Abstract: Graphical models are commonly used to represent conditional dependence relationships between variables. There are multiple methods available for exploring them from high-dimensional data, but almost all of them rely on the assumption that the observations are independent and identically distributed. At the same time, observations connected by a network are becoming increasingly common, and tend to… ▽ More

    Submitted 21 April, 2020; v1 submitted 4 July, 2019; originally announced July 2019.

  17. arXiv:1810.12758  [pdf, ps, other

    q-bio.QM cs.LG stat.ML

    From Gene Expression to Drug Response: A Collaborative Filtering Approach

    Authors: Cheng Qian, Nicholas D. Sidiropoulos, Magda Amiridi, Amin Emad

    Abstract: Predicting the response of cancer cells to drugs is an important problem in pharmacogenomics. Recent efforts in generation of large scale datasets profiling gene expression and drug sensitivity in cell lines have provided a unique opportunity to study this problem. However, one major challenge is the small number of samples (cell lines) compared to the number of features (genes) even in these larg… ▽ More

    Submitted 30 October, 2018; v1 submitted 29 October, 2018; originally announced October 2018.

  18. arXiv:1810.06833  [pdf, other

    cs.LG stat.ML

    Maximizing Monotone DR-submodular Continuous Functions by Derivative-free Optimization

    Authors: Yibo Zhang, Chao Qian, Ke Tang

    Abstract: In this paper, we study the problem of monotone (weakly) DR-submodular continuous maximization. While previous methods require the gradient information of the objective function, we propose a derivative-free algorithm LDGM for the first time. We define $β$ and $α$ to characterize how close a function is to continuous DR-submodulr and submodular, respectively. Under a convex polytope constraint, we… ▽ More

    Submitted 23 February, 2019; v1 submitted 16 October, 2018; originally announced October 2018.

  19. ZOOpt: Toolbox for Derivative-Free Optimization

    Authors: Yu-Ren Liu, Yi-Qi Hu, Hong Qian, Chao Qian, Yang Yu

    Abstract: Recent advances in derivative-free optimization allow efficient approximation of the global-optimal solutions of sophisticated functions, such as functions with many local optima, non-differentiable and non-continuous functions. This article describes the ZOOpt (Zeroth Order Optimization) toolbox that provides efficient derivative-free solvers and is designed easy to use. ZOOpt provides single-mac… ▽ More

    Submitted 1 June, 2022; v1 submitted 31 December, 2017; originally announced January 2018.

    Comments: SCIENCE CHINA Information Sciences, 2022. Codes: https://github.com/polixir/ZOOpt

    Journal ref: SCIENCE CHINA Information Sciences, 65: 207101, 2022