Skip to main content

Showing 1–20 of 20 results for author: Chu, L

Searching in archive stat. Search in all archives.
.
  1. arXiv:2506.07946  [pdf, ps, other

    stat.ME

    Graph-theoretic Inference for Random Effects in High-dimensional Studies

    Authors: Lynna Chu, Yichuan Bai

    Abstract: We study the problem of testing for the presence of random effects in mixed models with high-dimensional fixed effects. To this end, we propose a rank-based graph-theoretic approach to test whether a collection of random effects is zero. Our approach is non-parametric and model-free in the sense that we not require correct specification of the mixed model nor estimation of unknown parameters. Inst… ▽ More

    Submitted 9 June, 2025; originally announced June 2025.

  2. arXiv:2505.21814  [pdf, ps, other

    stat.ME stat.AP

    Adaptive Block-Based Change-Point Detection for Sparse Spatially Clustered Data with Applications in Remote Sensing Imaging

    Authors: Alan Moore, Lynna Chu, Zhengyuan Zhu

    Abstract: We present a non-parametric change-point detection approach to detect potentially sparse changes in a time series of high-dimensional observations or non-Euclidean data objects. We target a change in distribution that occurs in a small, unknown subset of dimensions, where these dimensions may be correlated. Our work is motivated by a remote sensing application, where changes occur in small, spatia… ▽ More

    Submitted 27 May, 2025; originally announced May 2025.

  3. arXiv:2402.15600  [pdf, ps, other

    stat.ME

    A Graph-based Approach to Estimating the Number of Clusters in High-dimensional Settings

    Authors: Yichuan Bai, Lynna Chu

    Abstract: We consider the problem of estimating the number of clusters (k) in a dataset. We propose a non-parametric approach to the problem that utilizes similarity graphs to construct a robust statistic that effectively captures similarity information among observations. This graph-based statistic is applicable to datasets of any dimension, is computationally efficient to obtain, and can be paired with an… ▽ More

    Submitted 11 June, 2025; v1 submitted 23 February, 2024; originally announced February 2024.

  4. arXiv:2307.12325  [pdf, ps, other

    stat.ME

    A Robust Framework for Graph-based Two-Sample Tests Using Weights

    Authors: Yichuan Bai, Lynna Chu

    Abstract: Graph-based tests are a class of non-parametric two-sample tests useful for analyzing high-dimensional data. The test statistics are constructed from similarity graphs (such as K-minimum spanning tree), and consequently, their performance is sensitive to the structure of the graph. When the graph has problematic structures (for example, hubs), as is common for high-dimensional data, this can resul… ▽ More

    Submitted 19 June, 2025; v1 submitted 23 July, 2023; originally announced July 2023.

  5. arXiv:2301.04856  [pdf, other

    cs.CL cs.LG stat.ML

    Multimodal Deep Learning

    Authors: Cem Akkus, Luyang Chu, Vladana Djakovic, Steffen Jauch-Walser, Philipp Koch, Giacomo Loss, Christopher Marquardt, Marco Moldovan, Nadja Sauter, Maximilian Schneider, Rickmer Schulte, Karol Urbanczyk, Jann Goschenhofer, Christian Heumann, Rasmus Hvingelby, Daniel Schalk, Matthias Aßenmacher

    Abstract: This book is the result of a seminar in which we reviewed multimodal approaches and attempted to create a solid overview of the field, starting with the current state-of-the-art approaches in the two subfields of Deep Learning individually. Further, modeling frameworks are discussed where one modality is transformed into the other, as well as models in which one modality is utilized to enhance rep… ▽ More

    Submitted 12 January, 2023; originally announced January 2023.

  6. arXiv:2008.03655  [pdf, other

    quant-ph cs.LG stat.ML

    Global Optimum Search in Quantum Deep Learning

    Authors: Lanston Hau Man Chu, Tejas Bhojraj, Rui Huang

    Abstract: This paper aims to solve machine learning optimization problem by using quantum circuit. Two approaches, namely the average approach and the Partial Swap Test Cut-off method (PSTC) was proposed to search for the global minimum/maximum of two different objective functions. The current cost is $O(\sqrt{|Θ|} N)$, but there is potential to improve PSTC further to $O(\sqrt{|Θ|} \cdot sublinear \ N)$ by… ▽ More

    Submitted 9 August, 2020; originally announced August 2020.

    Comments: 17 pages

  7. arXiv:2007.03797  [pdf, other

    cs.LG cs.DC stat.ML

    Personalized Cross-Silo Federated Learning on Non-IID Data

    Authors: Yutao Huang, Lingyang Chu, Zirui Zhou, Lanjun Wang, Jiangchuan Liu, Jian Pei, Yong Zhang

    Abstract: Non-IID data present a tough challenge for federated learning. In this paper, we explore a novel idea of facilitating pairwise collaborations between clients with similar data. We propose FedAMP, a new method employing federated attentive message passing to facilitate similar clients to collaborate more. We establish the convergence of FedAMP for both convex and non-convex models, and propose a he… ▽ More

    Submitted 13 December, 2021; v1 submitted 7 July, 2020; originally announced July 2020.

    Comments: Accepted by AAAI 2021. The API of this work is available at Huawei Cloud (https://developer.huaweicloud.com/develop/aigallery/notebook/detail?id=6d4a9521-6a4d-4b6d-b84d-943d7c7b1cbd), free registration is required before use

  8. arXiv:1906.06857  [pdf, other

    cs.LG stat.ML

    Exact and Consistent Interpretation of Piecewise Linear Models Hidden behind APIs: A Closed Form Solution

    Authors: Zicun Cong, Lingyang Chu, Lanjun Wang, Xia Hu, Jian Pei

    Abstract: More and more AI services are provided through APIs on cloud where predictive models are hidden behind APIs. To build trust with users and reduce potential application risk, it is important to interpret how such predictive models hidden behind APIs make their decisions. The biggest challenge of interpreting such predictions is that no access to model parameters or training data is available. Exist… ▽ More

    Submitted 19 April, 2020; v1 submitted 17 June, 2019; originally announced June 2019.

  9. arXiv:1905.06329  [pdf, ps, other

    eess.SP cs.LG stat.ML

    LEMO: Learn to Equalize for MIMO-OFDM Systems with Low-Resolution ADCs

    Authors: Lei Chu, Ling Pei, Husheng Li, Robert Caiming Qiu

    Abstract: This paper develops a new deep neural network optimized equalization framework for massive multiple input multiple output orthogonal frequency division multiplexing (MIMOOFDM) systems that employ low-resolution analog-to-digital converters (ADCs) at the base station (BS). The use of lowresolution ADCs could largely reduce hardware complexity and circuit power consumption, however, it makes the cha… ▽ More

    Submitted 25 May, 2020; v1 submitted 14 May, 2019; originally announced May 2019.

  10. arXiv:1903.00711  [pdf, other

    cs.LG stat.ML

    neuralRank: Searching and ranking ANN-based model repositories

    Authors: Nirmit Desai, Linsong Chu, Raghu K. Ganti, Sebastian Stein, Mudhakar Srivatsa

    Abstract: Widespread applications of deep learning have led to a plethora of pre-trained neural network models for common tasks. Such models are often adapted from other models via transfer learning. The models may have varying training sets, training algorithms, network architectures, and hyper-parameters. For a given application, what isthe most suitable model in a model repository? This is a critical que… ▽ More

    Submitted 2 March, 2019; originally announced March 2019.

  11. Sequential Change-point Detection for High-dimensional and non-Euclidean Data

    Authors: Lynna Chu, Hao Chen

    Abstract: In many applications, it is often of practical and scientific interest to detect anomaly events in a streaming sequence of high-dimensional or non-Euclidean observations. We study a non-parametric framework that utilizes nearest neighbor information among the observations to detect changes in an online setting. It can be applied to data in arbitrary dimension and non-Euclidean data as long as a si… ▽ More

    Submitted 21 October, 2022; v1 submitted 14 October, 2018; originally announced October 2018.

    Journal ref: in IEEE Transactions on Signal Processing, vol. 70, pp. 4498-4511, 2022

  12. arXiv:1808.05403  [pdf, other

    cs.IT cs.LG eess.SP stat.ML

    A Survey on Nonconvex Regularization Based Sparse and Low-Rank Recovery in Signal Processing, Statistics, and Machine Learning

    Authors: Fei Wen, Lei Chu, Peilin Liu, Robert C. Qiu

    Abstract: In the past decade, sparse and low-rank recovery have drawn much attention in many areas such as signal/image processing, statistics, bioinformatics and machine learning. To achieve sparsity and/or low-rankness inducing, the $\ell_1$ norm and nuclear norm are of the most popular regularization penalties due to their convexity. While the $\ell_1$ and nuclear norm are convenient as the related conve… ▽ More

    Submitted 6 June, 2019; v1 submitted 16 August, 2018; originally announced August 2018.

    Comments: 22 pages

    Journal ref: Published in IEEE Access 2018: https://ieeexplore.ieee.org/abstract/document/8531588

  13. arXiv:1802.03503  [pdf, other

    stat.AP eess.SP

    A New Approach of Exploiting Self-Adjoint Matrix Polynomials of Large Random Matrices for Anomaly Detection and Fault Location

    Authors: Zenan Ling, Robert C. Qiu, Xing He, Lei Chu

    Abstract: Synchronized measurements of a large power grid enable an unprecedented opportunity to study the spatialtemporal correlations. Statistical analytics for those massive datasets start with high-dimensional data matrices. Uncertainty is ubiquitous in a future's power grid. These data matrices are recognized as random matrices. This new point of view is fundamental in our theoretical analysis since tr… ▽ More

    Submitted 9 February, 2018; originally announced February 2018.

    Comments: 12 pages, 13 figures, submitted to IEEE Trans on Big Data

  14. arXiv:1801.01669  [pdf, other

    stat.AP stat.ME

    Early Anomaly Detection and Location in Distribution Network: A Data-Driven Approach

    Authors: Xin Shi, Robert Qiu, Xing He, Zenan Ling, Haosen Yang, Lei Chu

    Abstract: The measurement data collected from the supervisory control and data acquisition (SCADA) system installed in distribution network can reflect the operational state of the network effectively. In this paper, a random matrix theory (RMT) based approach is developed for early anomaly detection and localization by using the data. For every feeder in the distribution network, a corresponding data matri… ▽ More

    Submitted 11 March, 2020; v1 submitted 5 January, 2018; originally announced January 2018.

    Comments: 10 pages, submitted to IET Generation, Transmission and Distribution

  15. Invisible Units Detection and Estimation Based on Random Matrix Theory

    Authors: Xing He, Lei Chu, Robert C. Qiu, Qian Ai, Zenan Ling, Jian Zhang

    Abstract: Invisible units mainly refer to small-scale units that are not monitored by, and thus are not visible to utilities. Integration of these invisible units into power systems does significantly affect the way in which a distribution grid is planned and operated. This paper, based on random matrix theory (RMT), proposes a statistical, data-driven framework to handle the massive grid data, in contrast… ▽ More

    Submitted 9 December, 2023; v1 submitted 29 October, 2017; originally announced October 2017.

    Comments: 10 pages

    Journal ref: IEEE Transactions on Power Systems, 2019, 35(3): 1846-1855

  16. arXiv:1708.04935  [pdf, other

    stat.AP

    Spatio-Temporal Big Data Analysis for Smart Grids Based on Random Matrix Theory: A Comprehensive Study

    Authors: Robert Qiu, Lei Chu, Xing He, Zenan Ling, Haichun Liu

    Abstract: A cornerstone of the smart grid is the advanced monitorability on its assets and operations. Increasingly pervasive installation of the phasor measurement units (PMUs) allows the so-called synchrophasor measurements to be taken roughly 100 times faster than the legacy supervisory control and data acquisition (SCADA) measurements, time-stamped using the global positioning system (GPS) signals to ca… ▽ More

    Submitted 15 August, 2017; originally announced August 2017.

    Comments: Book chapter#23 for the book "Transportation and Power Grid in Smart Cities: Communication Networks and Services". arXiv admin note: text overlap with arXiv:1302.0885 by other authors

  17. arXiv:1707.00167  [pdf, other

    stat.ME

    Asymptotic Distribution-Free Change-Point Detection for Multivariate and non-Euclidean Data

    Authors: Lynna Chu, Hao Chen

    Abstract: We consider the testing and estimation of change-points, locations where the distribution abruptly changes, in a sequence of multivariate or non-Euclidean observations. We study a nonparametric framework that utilizes similarity information among observations, which can be applied to various data types as long as an informative similarity measure on the sample space can be defined. The existing ap… ▽ More

    Submitted 22 February, 2018; v1 submitted 1 July, 2017; originally announced July 2017.

  18. arXiv:1610.05076  [pdf, other

    stat.ME

    A Novel Data-Driven Situation Awareness Approach for Future Grids--Using Large Random Matrices for Big Data Modeling

    Authors: Xing He, Lei Chu, Robert C. Qiu, Qian Ai, Zenan Ling

    Abstract: Data-driven approaches, when tasked with situation awareness, are suitable for complex grids with massive datasets. It is a challenge, however, to efficiently turn these massive datasets into useful big data analytics. To address such a challenge, this paper, based on random matrix theory (RMT), proposes a datadriven approach. The approach models massive datasets as large random matrices; it is mo… ▽ More

    Submitted 16 January, 2018; v1 submitted 17 October, 2016; originally announced October 2016.

    Comments: 10 pages, 14 figures, 2 tables, Submit to IEEE Access. Personal use of this material is permitted. Permission from IEEE must be obtained for all other uses

  19. arXiv:1609.03301  [pdf, other

    stat.AP

    Massive Streaming PMU Data Modeling and Analytics in Smart Grid State Evaluation Based on Multiple High-Dimensional Covariance Tests

    Authors: Lei Chu, Robert Qiu, Xing He, Zenan Ling, Yadong Liu

    Abstract: The analogous deployment of phase measurement units (PMUs), the increase of data quantum and the deregulation of energy market, all call for the robust state evaluation in large scale power systems. Implementing model based estimators is impractical because of the complexity scale of solving the high dimension power flow equations. In this paper, we first represent massive streaming PMU data as bi… ▽ More

    Submitted 22 June, 2017; v1 submitted 12 September, 2016; originally announced September 2016.

    Comments: IEEE, transations on Big Data, 2017

  20. Designing for Situation Awareness of Future Power Grids: An Indicator System Based on Linear Eigenvalue Statistics of Large Random Matrices

    Authors: Xing He, Robert C. Qiu, Qian Ai, Lei Chu, Xinyi Xu, Zenan Ling

    Abstract: Future power grids are fundamentally different from current ones, both in size and in complexity; this trend imposes challenges for situation awareness (SA) based on classical indicators, which are usually model-based and deterministic. As an alternative, this paper proposes a statistical indicator system based on linear eigenvalue statistics (LESs) of large random matrices: 1) from a data modelin… ▽ More

    Submitted 6 July, 2016; v1 submitted 22 December, 2015; originally announced December 2015.

    Comments: 8 pages, 8 figures, 3 tables

    Journal ref: IEEE Access , vol.4, pp.3557-3568, 2016