Skip to main content

Showing 1–26 of 26 results for author: Ling, Z

Searching in archive stat. Search in all archives.
.
  1. arXiv:2506.18656  [pdf, ps, other

    stat.ML cs.LG math.ST

    A Random Matrix Analysis of In-context Memorization for Nonlinear Attention

    Authors: Zhenyu Liao, Jiaqing Liu, TianQi Hou, Difan Zou, Zenan Ling

    Abstract: Attention mechanisms have revolutionized machine learning (ML) by enabling efficient modeling of global dependencies across inputs. Their inherently parallelizable structures allow for efficient scaling with the exponentially increasing size of both pretrained data and model parameters. Yet, despite their central role as the computational backbone of modern large language models (LLMs), the theore… ▽ More

    Submitted 23 June, 2025; originally announced June 2025.

    Comments: 40 pages, 7 pages

  2. arXiv:2505.05364  [pdf

    stat.AP

    Machine learning bridging battery field data and laboratory data

    Authors: Yanbin Zhao, Hao Liu, Zhihua Deng, Tong Li, Haoyi Jiang, Zhenfei Ling, Xingkai Wang, Lei Zhang, Xiaoping Ouyang

    Abstract: Aiming at the dilemma that most laboratory data-driven diagnostic and prognostic methods cannot be applied to field batteries in passenger cars and energy storage systems, this paper proposes a method to bridge field data and laboratory data using machine learning. Only two field real impedances corresponding to a medium frequency and a high frequency are needed to predict laboratory real impedanc… ▽ More

    Submitted 13 May, 2025; v1 submitted 8 May, 2025; originally announced May 2025.

    Comments: 73 pages, 21 figures

  3. arXiv:2504.18835  [pdf

    stat.AP

    Machine learning accelerates fuel cell life testing

    Authors: Yanbin Zhao, Hao Liu, Zhihua Deng, Haoyi Jiang, Zhenfei Ling, Zhiyang Liu, Xingkai Wang, Tong Li, Xiaoping Ouyang

    Abstract: Accelerated life testing (ALT) can significantly reduce the economic, time, and labor costs of life testing in the process of equipment, device, and material research and development (R&D), and improve R&D efficiency. This paper proposes a performance characterization data prediction (PCDP) method and a life prediction-driven ALT (LP-ALT) method to accelerate the life test of polymer electrolyte m… ▽ More

    Submitted 7 May, 2025; v1 submitted 26 April, 2025; originally announced April 2025.

    Comments: 39 pages, 25 figures

  4. arXiv:2504.09052  [pdf, other

    stat.ME math.ST

    Bayesian shrinkage priors subject to linear constraints

    Authors: Zhi Ling, Shozen Dan

    Abstract: In Bayesian regression models with categorical predictors, constraints are needed to ensure identifiability when using all $K$ levels of a factor. The sum-to-zero constraint is particularly useful as it allows coefficients to represent deviations from the population average. However, implementing such constraints in Bayesian settings is challenging, especially when assigning appropriate priors tha… ▽ More

    Submitted 11 April, 2025; originally announced April 2025.

  5. arXiv:2502.13583  [pdf, ps, other

    math.NA math.OC stat.ML

    Fundamental Bias in Inverting Random Sampling Matrices with Application to Sub-sampled Newton

    Authors: Chengmei Niu, Zhenyu Liao, Zenan Ling, Michael W. Mahoney

    Abstract: A substantial body of work in machine learning (ML) and randomized numerical linear algebra (RandNLA) has exploited various sorts of random sketching methodologies, including random sampling and random projection, with much of the analysis using Johnson--Lindenstrauss and subspace embedding techniques. Recent studies have identified the issue of inversion bias -- the phenomenon that inverses of ra… ▽ More

    Submitted 29 May, 2025; v1 submitted 19 February, 2025; originally announced February 2025.

    Comments: 52 pages, 4 figures. This version has been accepted at ICML 2025 and includes minor revisions for the camera-ready submission

  6. arXiv:2411.03774  [pdf, other

    stat.AP

    Towards pandemic preparedness: ability to estimate high-resolution social contact patterns from longitudinal surveys

    Authors: Shozen Dan, Joshua Tegegne, Yu Chen, Zhi Ling, Veronika K. Jaeger, André Karch, Swapnil Mishra, Oliver Ratmann

    Abstract: Social contact surveys are an important tool to assess infection risks within populations, and the effect of non-pharmaceutical interventions on social behaviour during disease outbreaks, epidemics, and pandemics. Numerous longitudinal social contact surveys were conducted during the COVID-19 era, however data analysis is plagued by reporting fatigue, a phenomenon whereby the average number of soc… ▽ More

    Submitted 6 November, 2024; originally announced November 2024.

  7. arXiv:2410.03581  [pdf, other

    stat.ML cs.LG

    Nonstationary Sparse Spectral Permanental Process

    Authors: Zicheng Sun, Yixuan Zhang, Zenan Ling, Xuhui Fan, Feng Zhou

    Abstract: Existing permanental processes often impose constraints on kernel types or stationarity, limiting the model's expressiveness. To overcome these limitations, we propose a novel approach utilizing the sparse spectral representation of nonstationary kernels. This technique relaxes the constraints on kernel types and stationarity, allowing for more flexible modeling while reducing computational comple… ▽ More

    Submitted 18 December, 2024; v1 submitted 4 October, 2024; originally announced October 2024.

  8. arXiv:2402.02697  [pdf, ps, other

    cs.LG stat.ML

    Deep Equilibrium Models are Almost Equivalent to Not-so-deep Explicit Models for High-dimensional Gaussian Mixtures

    Authors: Zenan Ling, Longbo Li, Zhanbo Feng, Yixuan Zhang, Feng Zhou, Robert C. Qiu, Zhenyu Liao

    Abstract: Deep equilibrium models (DEQs), as a typical implicit neural network, have demonstrated remarkable success on various tasks. There is, however, a lack of theoretical understanding of the connections and differences between implicit DEQs and explicit neural network models. In this paper, leveraging recent advances in random matrix theory (RMT), we perform an in-depth analysis on the eigenspectra of… ▽ More

    Submitted 19 May, 2024; v1 submitted 4 February, 2024; originally announced February 2024.

    Comments: Accepted by ICML 2024

  9. arXiv:2310.10379  [pdf, other

    cs.LG stat.ML

    Revisiting Logistic-softmax Likelihood in Bayesian Meta-Learning for Few-Shot Classification

    Authors: Tianjun Ke, Haoqun Cao, Zenan Ling, Feng Zhou

    Abstract: Meta-learning has demonstrated promising results in few-shot classification (FSC) by learning to solve new problems using prior knowledge. Bayesian methods are effective at characterizing uncertainty in FSC, which is crucial in high-risk fields. In this context, the logistic-softmax likelihood is often employed as an alternative to the softmax likelihood in multi-class Gaussian process classificat… ▽ More

    Submitted 10 October, 2024; v1 submitted 16 October, 2023; originally announced October 2023.

  10. arXiv:2308.16425  [pdf, other

    cs.LG stat.ML

    On the Equivalence between Implicit and Explicit Neural Networks: A High-dimensional Viewpoint

    Authors: Zenan Ling, Zhenyu Liao, Robert C. Qiu

    Abstract: Implicit neural networks have demonstrated remarkable success in various tasks. However, there is a lack of theoretical analysis of the connections and differences between implicit and explicit networks. In this paper, we study high-dimensional implicit neural networks and provide the high dimensional equivalents for the corresponding conjugate kernels and neural tangent kernels. Built upon this,… ▽ More

    Submitted 30 August, 2023; originally announced August 2023.

    Comments: Accepted by Workshop on High-dimensional Learning Dynamics, ICML 2023, Honolulu, Hawaii

  11. arXiv:2205.13814  [pdf, ps, other

    cs.LG stat.ML

    Global Convergence of Over-parameterized Deep Equilibrium Models

    Authors: Zenan Ling, Xingyu Xie, Qiuhao Wang, Zongpeng Zhang, Zhouchen Lin

    Abstract: A deep equilibrium model (DEQ) is implicitly defined through an equilibrium point of an infinite-depth weight-tied model with an input-injection. Instead of infinite computations, it solves an equilibrium point directly with root-finding and computes gradients with implicit differentiation. The training dynamics of over-parameterized DEQs are investigated in this study. By supposing a condition on… ▽ More

    Submitted 28 March, 2023; v1 submitted 27 May, 2022; originally announced May 2022.

    Comments: Accepted by AISTATS 2023

  12. arXiv:2103.13810  [pdf, other

    cs.LG cs.AI stat.ML

    Any Part of Bayesian Network Structure Learning

    Authors: Zhaolong Ling, Kui Yu, Hao Wang, Lin Liu, Jiuyong Li

    Abstract: We study an interesting and challenging problem, learning any part of a Bayesian network (BN) structure. In this challenge, it will be computationally inefficient using existing global BN structure learning algorithms to find an entire BN structure to achieve the part of a BN structure in which we are interested. And local BN structure learning algorithms encounter the false edge orientation probl… ▽ More

    Submitted 23 March, 2021; originally announced March 2021.

  13. arXiv:1911.10947  [pdf, other

    cs.LG stat.ML

    State Alignment-based Imitation Learning

    Authors: Fangchen Liu, Zhan Ling, Tongzhou Mu, Hao Su

    Abstract: Consider an imitation learning problem that the imitator and the expert have different dynamics models. Most of the current imitation learning methods fail because they focus on imitating actions. We propose a novel state alignment-based imitation learning method to train the imitator to follow the state sequences in expert demonstrations as much as possible. The state alignment comes from both lo… ▽ More

    Submitted 21 November, 2019; originally announced November 2019.

  14. arXiv:1911.07147  [pdf, ps, other

    cs.LG cs.AI stat.ML

    Causality-based Feature Selection: Methods and Evaluations

    Authors: Kui Yu, Xianjie Guo, Lin Liu, Jiuyong Li, Hao Wang, Zhaolong Ling, Xindong Wu

    Abstract: Feature selection is a crucial preprocessing step in data analytics and machine learning. Classical feature selection algorithms select features based on the correlations between predictive features and the class variable and do not attempt to capture causal relationships between them. It has been shown that the knowledge about the causal relationships between features and the class variable has p… ▽ More

    Submitted 16 November, 2019; originally announced November 2019.

  15. arXiv:1807.11694  [pdf, other

    cs.LG stat.ML

    Spectrum concentration in deep residual learning: a free probability approach

    Authors: Zenan Ling, Xing He, Robert C. Qiu

    Abstract: We revisit the initialization of deep residual networks (ResNets) by introducing a novel analytical tool in free probability to the community of deep learning. This tool deals with non-Hermitian random matrices, rather than their conventional Hermitian counterparts in the literature. As a consequence, this new tool enables us to evaluate the singular value spectrum of the input-output Jacobian of… ▽ More

    Submitted 24 February, 2019; v1 submitted 31 July, 2018; originally announced July 2018.

  16. arXiv:1804.08438  [pdf, other

    eess.AS cs.CL cs.SD stat.ML

    A Spoofing Benchmark for the 2018 Voice Conversion Challenge: Leveraging from Spoofing Countermeasures for Speech Artifact Assessment

    Authors: Tomi Kinnunen, Jaime Lorenzo-Trueba, Junichi Yamagishi, Tomoki Toda, Daisuke Saito, Fernando Villavicencio, Zhenhua Ling

    Abstract: Voice conversion (VC) aims at conversion of speaker characteristic without altering content. Due to training data limitations and modeling imperfections, it is difficult to achieve believable speaker mimicry without introducing processing artifacts; performance assessment of VC, therefore, usually involves both speaker similarity and quality evaluation by a human panel. As a time-consuming, expens… ▽ More

    Submitted 4 September, 2018; v1 submitted 23 April, 2018; originally announced April 2018.

    Comments: Correction (bug fix) of a published ODYSSEY 2018 publication with the same title and author list; more details in footnote in page 1

  17. arXiv:1804.04262  [pdf, other

    eess.AS cs.CL cs.SD stat.ML

    The Voice Conversion Challenge 2018: Promoting Development of Parallel and Nonparallel Methods

    Authors: Jaime Lorenzo-Trueba, Junichi Yamagishi, Tomoki Toda, Daisuke Saito, Fernando Villavicencio, Tomi Kinnunen, Zhenhua Ling

    Abstract: We present the Voice Conversion Challenge 2018, designed as a follow up to the 2016 edition with the aim of providing a common framework for evaluating and comparing different state-of-the-art voice conversion (VC) systems. The objective of the challenge was to perform speaker conversion (i.e. transform the vocal identity) of a source speaker to a target speaker while maintaining linguistic inform… ▽ More

    Submitted 11 April, 2018; originally announced April 2018.

    Comments: Accepted for Speaker Odyssey 2018

  18. arXiv:1802.03503  [pdf, other

    stat.AP eess.SP

    A New Approach of Exploiting Self-Adjoint Matrix Polynomials of Large Random Matrices for Anomaly Detection and Fault Location

    Authors: Zenan Ling, Robert C. Qiu, Xing He, Lei Chu

    Abstract: Synchronized measurements of a large power grid enable an unprecedented opportunity to study the spatialtemporal correlations. Statistical analytics for those massive datasets start with high-dimensional data matrices. Uncertainty is ubiquitous in a future's power grid. These data matrices are recognized as random matrices. This new point of view is fundamental in our theoretical analysis since tr… ▽ More

    Submitted 9 February, 2018; originally announced February 2018.

    Comments: 12 pages, 13 figures, submitted to IEEE Trans on Big Data

  19. arXiv:1801.01669  [pdf, other

    stat.AP stat.ME

    Early Anomaly Detection and Location in Distribution Network: A Data-Driven Approach

    Authors: Xin Shi, Robert Qiu, Xing He, Zenan Ling, Haosen Yang, Lei Chu

    Abstract: The measurement data collected from the supervisory control and data acquisition (SCADA) system installed in distribution network can reflect the operational state of the network effectively. In this paper, a random matrix theory (RMT) based approach is developed for early anomaly detection and localization by using the data. For every feeder in the distribution network, a corresponding data matri… ▽ More

    Submitted 11 March, 2020; v1 submitted 5 January, 2018; originally announced January 2018.

    Comments: 10 pages, submitted to IET Generation, Transmission and Distribution

  20. arXiv:1712.08871  [pdf, other

    stat.AP

    A Data-driven Approach to Multi-event Analytics in Large-scale Power Systems Using Factor Model

    Authors: Fan Yang, Xing He, Robert Caiming Qiu, Zenan Ling

    Abstract: Multi-event detection and recognition in real time is of challenge for a modern grid as its feature is usually non-identifiable. Based on factor model, this paper porposes a data-driven method as an alternative solution under the framework of random matrix theory. This method maps the raw data into a high-dimensional space with two parts: 1) the principal components (factors, mapping event signals… ▽ More

    Submitted 23 December, 2017; originally announced December 2017.

    Comments: 7 pages, 2 figures

  21. Invisible Units Detection and Estimation Based on Random Matrix Theory

    Authors: Xing He, Lei Chu, Robert C. Qiu, Qian Ai, Zenan Ling, Jian Zhang

    Abstract: Invisible units mainly refer to small-scale units that are not monitored by, and thus are not visible to utilities. Integration of these invisible units into power systems does significantly affect the way in which a distribution grid is planned and operated. This paper, based on random matrix theory (RMT), proposes a statistical, data-driven framework to handle the massive grid data, in contrast… ▽ More

    Submitted 9 December, 2023; v1 submitted 29 October, 2017; originally announced October 2017.

    Comments: 10 pages

    Journal ref: IEEE Transactions on Power Systems, 2019, 35(3): 1846-1855

  22. arXiv:1708.04935  [pdf, other

    stat.AP

    Spatio-Temporal Big Data Analysis for Smart Grids Based on Random Matrix Theory: A Comprehensive Study

    Authors: Robert Qiu, Lei Chu, Xing He, Zenan Ling, Haichun Liu

    Abstract: A cornerstone of the smart grid is the advanced monitorability on its assets and operations. Increasingly pervasive installation of the phasor measurement units (PMUs) allows the so-called synchrophasor measurements to be taken roughly 100 times faster than the legacy supervisory control and data acquisition (SCADA) measurements, time-stamped using the global positioning system (GPS) signals to ca… ▽ More

    Submitted 15 August, 2017; originally announced August 2017.

    Comments: Book chapter#23 for the book "Transportation and Power Grid in Smart Cities: Communication Networks and Services". arXiv admin note: text overlap with arXiv:1302.0885 by other authors

  23. arXiv:1612.01089  [pdf, other

    stat.AP

    A Novel Approach for Big Data Analytics in Future Grids Based on Free Probability

    Authors: Zenan Ling, Robert C. Qiu, Xing He, Chu Lei

    Abstract: Based on the random matrix model, we can build statistical models using massive datasets across the power grid, and employ hypothesis testing for anomaly detection. First, the aim of this paper is to make the first attempt to apply the recent free probability result in extracting big data analytics, in particular data fusion. The nature of this work is basic in that new algorithms and analytics to… ▽ More

    Submitted 4 December, 2016; originally announced December 2016.

    Comments: 8 pages, 5 figures

  24. arXiv:1610.05076  [pdf, other

    stat.ME

    A Novel Data-Driven Situation Awareness Approach for Future Grids--Using Large Random Matrices for Big Data Modeling

    Authors: Xing He, Lei Chu, Robert C. Qiu, Qian Ai, Zenan Ling

    Abstract: Data-driven approaches, when tasked with situation awareness, are suitable for complex grids with massive datasets. It is a challenge, however, to efficiently turn these massive datasets into useful big data analytics. To address such a challenge, this paper, based on random matrix theory (RMT), proposes a datadriven approach. The approach models massive datasets as large random matrices; it is mo… ▽ More

    Submitted 16 January, 2018; v1 submitted 17 October, 2016; originally announced October 2016.

    Comments: 10 pages, 14 figures, 2 tables, Submit to IEEE Access. Personal use of this material is permitted. Permission from IEEE must be obtained for all other uses

  25. arXiv:1609.03301  [pdf, other

    stat.AP

    Massive Streaming PMU Data Modeling and Analytics in Smart Grid State Evaluation Based on Multiple High-Dimensional Covariance Tests

    Authors: Lei Chu, Robert Qiu, Xing He, Zenan Ling, Yadong Liu

    Abstract: The analogous deployment of phase measurement units (PMUs), the increase of data quantum and the deregulation of energy market, all call for the robust state evaluation in large scale power systems. Implementing model based estimators is impractical because of the complexity scale of solving the high dimension power flow equations. In this paper, we first represent massive streaming PMU data as bi… ▽ More

    Submitted 22 June, 2017; v1 submitted 12 September, 2016; originally announced September 2016.

    Comments: IEEE, transations on Big Data, 2017

  26. Designing for Situation Awareness of Future Power Grids: An Indicator System Based on Linear Eigenvalue Statistics of Large Random Matrices

    Authors: Xing He, Robert C. Qiu, Qian Ai, Lei Chu, Xinyi Xu, Zenan Ling

    Abstract: Future power grids are fundamentally different from current ones, both in size and in complexity; this trend imposes challenges for situation awareness (SA) based on classical indicators, which are usually model-based and deterministic. As an alternative, this paper proposes a statistical indicator system based on linear eigenvalue statistics (LESs) of large random matrices: 1) from a data modelin… ▽ More

    Submitted 6 July, 2016; v1 submitted 22 December, 2015; originally announced December 2015.

    Comments: 8 pages, 8 figures, 3 tables

    Journal ref: IEEE Access , vol.4, pp.3557-3568, 2016