Skip to main content

Showing 1–14 of 14 results for author: Xiao, P

Searching in archive stat. Search in all archives.
.
  1. arXiv:2504.16535  [pdf, other

    stat.CO

    Decentralized Quantile Regression for Feature-Distributed Massive Datasets with Privacy Guarantees

    Authors: Peiwen Xiao, Xiaohui Liu, Guangming Pan, Wei Long

    Abstract: In this paper, we introduce a novel decentralized surrogate gradient-based algorithm for quantile regression in a feature-distributed setting, where global features are dispersed across multiple machines within a decentralized network. The proposed algorithm, \texttt{DSG-cqr}, utilizes a convolution-type smoothing approach to address the non-smooth nature of the quantile loss function. \texttt{DSG… ▽ More

    Submitted 23 April, 2025; originally announced April 2025.

  2. arXiv:2501.05564  [pdf, other

    cs.LG cs.AR stat.ML

    Analog Bayesian neural networks are insensitive to the shape of the weight distribution

    Authors: Ravi G. Patel, T. Patrick Xiao, Sapan Agarwal, Christopher Bennett

    Abstract: Recent work has demonstrated that Bayesian neural networks (BNN's) trained with mean field variational inference (MFVI) can be implemented in analog hardware, promising orders of magnitude energy savings compared to the standard digital implementations. However, while Gaussians are typically used as the variational distribution in MFVI, it is difficult to precisely control the shape of the noise d… ▽ More

    Submitted 9 January, 2025; originally announced January 2025.

    Comments: Presented at the NeurIPS 2024 Workshop on Machine Learning with New Compute Paradigms, https://openreview.net/forum?id=soS5qgU7Yb

  3. arXiv:2411.19305  [pdf, other

    stat.ML cs.LG math.DS

    LD-EnSF: Synergizing Latent Dynamics with Ensemble Score Filters for Fast Data Assimilation with Sparse Observations

    Authors: Pengpeng Xiao, Phillip Si, Peng Chen

    Abstract: Data assimilation techniques are crucial for correcting the trajectory when modeling complex physical systems. A recently developed data assimilation method, Latent Ensemble Score Filter (Latent-EnSF), has shown great promise in addressing the key limitation of EnSF for highly sparse observations in high-dimensional and nonlinear data assimilation problems. It performs data assimilation in a laten… ▽ More

    Submitted 28 November, 2024; originally announced November 2024.

  4. arXiv:2411.05877  [pdf, other

    cs.LG cs.AI cs.CL stat.ML

    Generative Adapter: Contextualizing Language Models in Parameters with A Single Forward Pass

    Authors: Tong Chen, Hao Fang, Patrick Xia, Xiaodong Liu, Benjamin Van Durme, Luke Zettlemoyer, Jianfeng Gao, Hao Cheng

    Abstract: Large language models (LMs) are typically adapted to improve performance on new contexts (\eg text prompts that define new tasks or domains) through fine-tuning or prompting. However, there is an accuracy compute tradeoff -- fine-tuning incurs significant training cost and prompting increases inference overhead. We introduce $GenerativeAdapter$, an effective and efficient adaptation method that di… ▽ More

    Submitted 7 November, 2024; originally announced November 2024.

  5. arXiv:2405.19440  [pdf, other

    cs.LG math.OC stat.ML

    MGDA Converges under Generalized Smoothness, Provably

    Authors: Qi Zhang, Peiyao Xiao, Shaofeng Zou, Kaiyi Ji

    Abstract: Multi-objective optimization (MOO) is receiving more attention in various fields such as multi-task learning. Recent works provide some effective algorithms with theoretical analysis but they are limited by the standard $L$-smooth or bounded-gradient assumptions, which typically do not hold for neural networks, such as Long short-term memory (LSTM) models and Transformers. In this paper, we study… ▽ More

    Submitted 8 March, 2025; v1 submitted 29 May, 2024; originally announced May 2024.

  6. arXiv:2312.03807  [pdf, other

    math.OC cs.LG stat.ML

    Achieving ${O}(ε^{-1.5})$ Complexity in Hessian/Jacobian-free Stochastic Bilevel Optimization

    Authors: Yifan Yang, Peiyao Xiao, Kaiyi Ji

    Abstract: In this paper, we revisit the bilevel optimization problem, in which the upper-level objective function is generally nonconvex and the lower-level objective function is strongly convex. Although this type of problem has been studied extensively, it still remains an open question how to achieve an ${O}(ε^{-1.5})$ sample complexity in Hessian/Jacobian-free stochastic bilevel optimization without any… ▽ More

    Submitted 6 April, 2025; v1 submitted 6 December, 2023; originally announced December 2023.

  7. arXiv:2305.19442  [pdf, other

    cs.LG cs.DC math.OC stat.ML

    SimFBO: Towards Simple, Flexible and Communication-efficient Federated Bilevel Learning

    Authors: Yifan Yang, Peiyao Xiao, Kaiyi Ji

    Abstract: Federated bilevel optimization (FBO) has shown great potential recently in machine learning and edge computing due to the emerging nested optimization structure in meta-learning, fine-tuning, hyperparameter tuning, etc. However, existing FBO algorithms often involve complicated computations and require multiple sub-loops per iteration, each of which contains a number of communication rounds. In th… ▽ More

    Submitted 27 December, 2023; v1 submitted 30 May, 2023; originally announced May 2023.

  8. arXiv:2305.18409  [pdf, other

    cs.LG math.OC stat.ML

    Direction-oriented Multi-objective Learning: Simple and Provable Stochastic Algorithms

    Authors: Peiyao Xiao, Hao Ban, Kaiyi Ji

    Abstract: Multi-objective optimization (MOO) has become an influential framework in many machine learning problems with multiple objectives such as learning with multiple criteria and multi-task learning (MTL). In this paper, we propose a new direction-oriented multi-objective problem by regularizing the common descent direction within a neighborhood of a direction that optimizes a linear combination of obj… ▽ More

    Submitted 28 November, 2023; v1 submitted 28 May, 2023; originally announced May 2023.

  9. arXiv:2302.04969  [pdf, other

    cs.LG math.OC stat.ML

    Communication-Efficient Federated Hypergradient Computation via Aggregated Iterative Differentiation

    Authors: Peiyao Xiao, Kaiyi Ji

    Abstract: Federated bilevel optimization has attracted increasing attention due to emerging machine learning and communication applications. The biggest challenge lies in computing the gradient of the upper-level objective function (i.e., hypergradient) in the federated setting due to the nonlinear and distributed construction of a series of global Hessian matrices. In this paper, we propose a novel communi… ▽ More

    Submitted 15 June, 2023; v1 submitted 9 February, 2023; originally announced February 2023.

    Comments: 38 pages, 15 figures

  10. arXiv:2008.12997  [pdf, other

    cs.LG cs.AI stat.ML

    Improving Resistance to Adversarial Deformations by Regularizing Gradients

    Authors: Pengfei Xia, Bin Li

    Abstract: Improving the resistance of deep neural networks against adversarial attacks is important for deploying models to realistic applications. However, most defense methods are designed to defend against intensity perturbations and ignore location perturbations, which should be equally important for deep model security. In this paper, we focus on adversarial deformations, a typical class of location pe… ▽ More

    Submitted 6 October, 2020; v1 submitted 29 August, 2020; originally announced August 2020.

  11. arXiv:2003.10396  [pdf, other

    cs.NE cs.LG stat.ML

    Evaluating complexity and resilience trade-offs in emerging memory inference machines

    Authors: Christopher H. Bennett, Ryan Dellana, T. Patrick Xiao, Ben Feinberg, Sapan Agarwal, Suma Cardwell, Matthew J. Marinella, William Severa, Brad Aimone

    Abstract: Neuromorphic-style inference only works well if limited hardware resources are maximized properly, e.g. accuracy continues to scale with parameters and complexity in the face of potential disturbance. In this work, we use realistic crossbar simulations to highlight that compact implementations of deep neural networks are unexpectedly susceptible to collapse from multiple system disturbances. Our w… ▽ More

    Submitted 25 February, 2020; originally announced March 2020.

  12. arXiv:1912.00838  [pdf, other

    cs.LG eess.SP stat.ML

    DeepFPC: Deep Unfolding of a Fixed-Point Continuation Algorithm for Sparse Signal Recovery from Quantized Measurements

    Authors: Peng Xiao, Bin Liao, Nikos Deligiannis

    Abstract: We present DeepFPC, a novel deep neural network designed by unfolding the iterations of the fixed-point continuation algorithm with one-sided l1-norm (FPC-l1), which has been proposed for solving the 1-bit compressed sensing problem. The network architecture resembles that of deep residual learning and incorporates prior knowledge about the signal structure (i.e., sparsity), thereby offering inter… ▽ More

    Submitted 4 December, 2019; v1 submitted 2 December, 2019; originally announced December 2019.

  13. arXiv:1902.02675  [pdf, other

    cs.LG stat.ML

    Neural Network for NILM Based on Operational State Change Classification

    Authors: Peng Xiao, Samuel Cheng

    Abstract: Energy disaggregation in a non-intrusive way estimates appliance level electricity consumption from a single meter that measures the whole house electricity demand. Recently, with the ongoing increment of energy data, there are many data-driven deep learning architectures being applied to solve the non-intrusive energy disaggregation problem. However, most proposed methods try to estimate the on-o… ▽ More

    Submitted 19 March, 2019; v1 submitted 4 February, 2019; originally announced February 2019.

    Comments: 5 pages, 4 figures

  14. arXiv:1808.06206  [pdf, other

    cs.LG cs.AI stat.ML

    TLR: Transfer Latent Representation for Unsupervised Domain Adaptation

    Authors: Pan Xiao, Bo Du, Jia Wu, Lefei Zhang, Ruimin Hu, Xuelong Li

    Abstract: Domain adaptation refers to the process of learning prediction models in a target domain by making use of data from a source domain. Many classic methods solve the domain adaptation problem by establishing a common latent space, which may cause the loss of many important properties across both domains. In this manuscript, we develop a novel method, transfer latent representation (TLR), to learn a… ▽ More

    Submitted 19 August, 2018; originally announced August 2018.