Skip to main content

Showing 1–2 of 2 results for author: Shou, X

Searching in archive stat. Search in all archives.
.
  1. arXiv:2505.20221  [pdf, ps, other

    cs.LG stat.ML

    Gradient Flow Matching for Learning Update Dynamics in Neural Network Training

    Authors: Xiao Shou, Yanna Ding, Jianxi Gao

    Abstract: Training deep neural networks remains computationally intensive due to the itera2 tive nature of gradient-based optimization. We propose Gradient Flow Matching (GFM), a continuous-time modeling framework that treats neural network training as a dynamical system governed by learned optimizer-aware vector fields. By leveraging conditional flow matching, GFM captures the underlying update rules of op… ▽ More

    Submitted 26 May, 2025; originally announced May 2025.

  2. arXiv:2412.15554  [pdf, other

    cs.LG cs.AI stat.ML

    Architecture-Aware Learning Curve Extrapolation via Graph Ordinary Differential Equation

    Authors: Yanna Ding, Zijie Huang, Xiao Shou, Yihang Guo, Yizhou Sun, Jianxi Gao

    Abstract: Learning curve extrapolation predicts neural network performance from early training epochs and has been applied to accelerate AutoML, facilitating hyperparameter tuning and neural architecture search. However, existing methods typically model the evolution of learning curves in isolation, neglecting the impact of neural network (NN) architectures, which influence the loss landscape and learning t… ▽ More

    Submitted 18 January, 2025; v1 submitted 19 December, 2024; originally announced December 2024.

    Comments: Accepted to AAAI'25