Skip to main content

Showing 1–17 of 17 results for author: Ding, T

Searching in archive math. Search in all archives.
.
  1. arXiv:2503.19859  [pdf, other

    cs.LG eess.SP math.OC stat.CO stat.ML

    An Overview of Low-Rank Structures in the Training and Adaptation of Large Models

    Authors: Laura Balzano, Tianjiao Ding, Benjamin D. Haeffele, Soo Min Kwon, Qing Qu, Peng Wang, Zhangyang Wang, Can Yaras

    Abstract: The rise of deep learning has revolutionized data processing and prediction in signal processing and machine learning, yet the substantial computational demands of training and deploying modern large-scale deep models present significant challenges, including high computational costs and energy consumption. Recent research has uncovered a widespread phenomenon in deep networks: the emergence of lo… ▽ More

    Submitted 25 March, 2025; originally announced March 2025.

    Comments: Authors are listed alphabetically; 27 pages, 10 figures

  2. arXiv:2412.01051  [pdf, other

    math.OC cs.LG

    An Efficient Unsupervised Framework for Convex Quadratic Programs via Deep Unrolling

    Authors: Linxin Yang, Bingheng Li, Tian Ding, Jianghua Wu, Akang Wang, Yuyi Wang, Jiliang Tang, Ruoyu Sun, Xiaodong Luo

    Abstract: Quadratic programs (QPs) arise in various domains such as machine learning, finance, and control. Recently, learning-enhanced primal-dual hybrid gradient (PDHG) methods have shown great potential in addressing large-scale linear programs; however, this approach has not been extended to QPs. In this work, we focus on unrolling "PDQP", a PDHG algorithm specialized for convex QPs. Specifically, we pr… ▽ More

    Submitted 1 December, 2024; originally announced December 2024.

  3. arXiv:2411.13805  [pdf, other

    math.OC

    On Representing Convex Quadratically Constrained Quadratic Programs via Graph Neural Networks

    Authors: Chenyang Wu, Qian Chen, Akang Wang, Tian Ding, Ruoyu Sun, Wenguo Yang, Qingjiang Shi

    Abstract: Convex quadratically constrained quadratic programs (QCQPs) involve finding a solution within a convex feasible region defined by quadratic constraints while minimizing a convex quadratic objective function. These problems arise in various industrial applications, including power systems and signal processing. Traditional methods for solving convex QCQPs primarily rely on matrix factorization, whi… ▽ More

    Submitted 6 January, 2025; v1 submitted 20 November, 2024; originally announced November 2024.

  4. arXiv:2406.01908  [pdf, other

    cs.LG math.OC

    PDHG-Unrolled Learning-to-Optimize Method for Large-Scale Linear Programming

    Authors: Bingheng Li, Linxin Yang, Yupeng Chen, Senmiao Wang, Qian Chen, Haitao Mao, Yao Ma, Akang Wang, Tian Ding, Jiliang Tang, Ruoyu Sun

    Abstract: Solving large-scale linear programming (LP) problems is an important task in various areas such as communication networks, power systems, finance and logistics. Recently, two distinct approaches have emerged to expedite LP solving: (i) First-order methods (FOMs); (ii) Learning to optimize (L2O). In this work, we propose an FOM-unrolled neural network (NN) called PDHG-Net, and propose a two-stage L… ▽ More

    Submitted 6 June, 2024; v1 submitted 3 June, 2024; originally announced June 2024.

    Comments: Accepted by ICML 2024

  5. arXiv:2404.05064  [pdf, other

    cs.LG math.NA

    A Structure-Guided Gauss-Newton Method for Shallow ReLU Neural Network

    Authors: Zhiqiang Cai, Tong Ding, Min Liu, Xinyu Liu, Jianlin Xia

    Abstract: In this paper, we propose a structure-guided Gauss-Newton (SgGN) method for solving least squares problems using a shallow ReLU neural network. The method effectively takes advantage of both the least squares structure and the neural network structure of the objective function. By categorizing the weights and biases of the hidden and output layers of the network as nonlinear and linear parameters,… ▽ More

    Submitted 7 April, 2024; originally announced April 2024.

    MSC Class: 65D15; 65K10

  6. arXiv:2203.01238  [pdf, other

    cs.LG cs.AI cs.IT math.OC stat.ML

    On the Optimization Landscape of Neural Collapse under MSE Loss: Global Optimality with Unconstrained Features

    Authors: Jinxin Zhou, Xiao Li, Tianyu Ding, Chong You, Qing Qu, Zhihui Zhu

    Abstract: When training deep neural networks for classification tasks, an intriguing empirical phenomenon has been widely observed in the last-layer classifiers and features, where (i) the class means and the last-layer classifiers all collapse to the vertices of a Simplex Equiangular Tight Frame (ETF) up to scaling, and (ii) cross-example within-class variability of last-layer activations collapses to zero… ▽ More

    Submitted 12 March, 2022; v1 submitted 2 March, 2022; originally announced March 2022.

  7. arXiv:2105.02375  [pdf, other

    cs.LG cs.AI cs.IT math.OC stat.ML

    A Geometric Analysis of Neural Collapse with Unconstrained Features

    Authors: Zhihui Zhu, Tianyu Ding, Jinxin Zhou, Xiao Li, Chong You, Jeremias Sulam, Qing Qu

    Abstract: We provide the first global optimization landscape analysis of $Neural\;Collapse$ -- an intriguing empirical phenomenon that arises in the last-layer classifiers and features of neural networks during the terminal phase of training. As recently reported by Papyan et al., this phenomenon implies that ($i$) the class means and the last-layer classifiers all collapse to the vertices of a Simplex Equi… ▽ More

    Submitted 5 May, 2021; originally announced May 2021.

    Comments: 42 pages, 8 figures, 1 table; the first two authors contributed to this work equally

  8. arXiv:2104.10396  [pdf, other

    math.OC

    Disturbance-resilient Distributed Resource Allocation over Stochastic Networks using Uncoordinated Stepsizes

    Authors: Tie Ding, Shanying Zhu, Cailian Chen, Xinping Guan

    Abstract: This paper studies distributed resource allocation problem in multi-agent systems, where all the agents cooperatively minimize the sum of their cost functions with global resource constraints over stochastic communication networks. This problem arises from many practical domains such as economic dispatch in smart grid, task assignment, and power allocation in robotic control. Most of existing work… ▽ More

    Submitted 24 April, 2021; v1 submitted 21 April, 2021; originally announced April 2021.

  9. arXiv:2011.04868  [pdf, other

    cs.LG math.OC stat.ML

    Neural Network Compression Via Sparse Optimization

    Authors: Tianyi Chen, Bo Ji, Yixin Shi, Tianyu Ding, Biyi Fang, Sheng Yi, Xiao Tu

    Abstract: The compression of deep neural networks (DNNs) to reduce inference cost becomes increasingly important to meet realistic deployment requirements of various applications. There have been a significant amount of work regarding network compression, while most of them are heuristic rule-based or typically not friendly to be incorporated into varying scenarios. On the other hand, sparse optimization yi… ▽ More

    Submitted 11 November, 2020; v1 submitted 9 November, 2020; originally announced November 2020.

  10. arXiv:2009.12078  [pdf, other

    math.OC

    Half-Space Proximal Stochastic Gradient Method for Group-Sparsity Regularized Problem

    Authors: Tianyi Chen, Guanyi Wang, Tianyu Ding, Bo Ji, Sheng Yi, Zhihui Zhu

    Abstract: Optimizing with group sparsity is significant in enhancing model interpretability in machining learning applications, e.g., feature selection, compressed sensing and model compression. However, for large-scale stochastic training problems, effective group sparsity exploration are typically hard to achieve. Particularly, the state-of-the-art stochastic optimization algorithms usually generate merel… ▽ More

    Submitted 12 February, 2021; v1 submitted 25 September, 2020; originally announced September 2020.

    Comments: In the future, we might split the work into two separate papers for theoretical analysis and numerical applications on deep learning respectively

  11. arXiv:2007.01429  [pdf, other

    cs.LG math.OC stat.ML

    The Global Landscape of Neural Networks: An Overview

    Authors: Ruoyu Sun, Dawei Li, Shiyu Liang, Tian Ding, R Srikant

    Abstract: One of the major concerns for neural network training is that the non-convexity of the associated loss functions may cause bad landscape. The recent success of neural networks suggests that their loss landscape is not too bad, but what specific results do we know about the landscape? In this article, we review recent findings and results on the global landscape of neural networks. First, we point… ▽ More

    Submitted 2 July, 2020; originally announced July 2020.

    Comments: 16 pages. 8 figures

  12. arXiv:2004.03639  [pdf, other

    math.OC cs.LG stat.ML

    Orthant Based Proximal Stochastic Gradient Method for $\ell_1$-Regularized Optimization

    Authors: Tianyi Chen, Tianyu Ding, Bo Ji, Guanyi Wang, Jing Tian, Yixin Shi, Sheng Yi, Xiao Tu, Zhihui Zhu

    Abstract: Sparsity-inducing regularization problems are ubiquitous in machine learning applications, ranging from feature selection to model compression. In this paper, we present a novel stochastic method -- Orthant Based Proximal Stochastic Gradient Method (OBProx-SG) -- to solve perhaps the most popular instance, i.e., the l1-regularized problem. The OBProx-SG method contains two steps: (i) a proximal st… ▽ More

    Submitted 23 July, 2020; v1 submitted 7 April, 2020; originally announced April 2020.

    Comments: Accepted by ECML 2020

  13. arXiv:1911.01413  [pdf, ps, other

    cs.LG math.OC stat.ML

    Sub-Optimal Local Minima Exist for Neural Networks with Almost All Non-Linear Activations

    Authors: Tian Ding, Dawei Li, Ruoyu Sun

    Abstract: Does over-parameterization eliminate sub-optimal local minima for neural networks? An affirmative answer was given by a classical result in [59] for 1-hidden-layer wide neural networks. A few recent works have extended the setting to multi-layer neural networks, but none of them has proved every local minimum is global. Why is this result never extended to deep networks? In this paper, we show t… ▽ More

    Submitted 14 November, 2020; v1 submitted 4 November, 2019; originally announced November 2019.

    Comments: 58 pages. The main theorem is strengthened. An early version was submitted to Optimization Online on October 4, 2019

  14. arXiv:1812.11039  [pdf, other

    cs.LG math.OC stat.ML

    On the Benefit of Width for Neural Networks: Disappearance of Bad Basins

    Authors: Dawei Li, Tian Ding, Ruoyu Sun

    Abstract: Wide networks are often believed to have a nice optimization landscape, but what rigorous results can we prove? To understand the benefit of width, it is important to identify the difference between wide and narrow networks. In this work, we prove that from narrow to wide networks, there is a phase transition from having sub-optimal basins to no sub-optimal basins. Specifically, we prove two resul… ▽ More

    Submitted 2 September, 2021; v1 submitted 28 December, 2018; originally announced December 2018.

    Comments: ver1: Nov 22, 2018; ver2: Jan 20, 2020; ver3: July 26, 2020; ver4: Jan 19, 2021; ver5: Sept 2, 2021

  15. arXiv:1507.02776  [pdf, other

    math.AP math.NA math.OC

    A one-step reconstruction algorithm for quantitative photoacoustic imaging

    Authors: Tian Ding, Kui Ren, Sarah Vallelian

    Abstract: Quantitative photoacoustic tomography (QPAT) is a recent hybrid imaging modality that couples optical tomography with ultrasound imaging to achieve high resolution imaging of optical properties of scattering media. Image reconstruction in QPAT is usually a two-step process. In the first step, the initial pressure field inside the medium, generated by the photoacoustic effect, is reconstructed usin… ▽ More

    Submitted 6 August, 2015; v1 submitted 10 July, 2015; originally announced July 2015.

    Journal ref: Inverse Problems, 31, 2015

  16. arXiv:1505.03247  [pdf

    math.OC

    A Modification of Sufficient Conditions to Ensure the Exact Conic Relaxation

    Authors: Tao Ding, Bo Zeng, Rui Bo

    Abstract: To solve the AC optimal power flow problem, it is proposed in [1,2] that a convex conic approximation to branch flow model (BFM) can be obtained if we first eliminate phase angles of voltages and currents and then relax a set of equality constraints to second order conic ones. In particular, provided a set of sufficient conditions are satisfied, the conic relaxation is exact. We note, however, tha… ▽ More

    Submitted 13 May, 2015; originally announced May 2015.

    Comments: Letters

  17. Inverse transport calculations in optical imaging with subspace optimization algorithms

    Authors: Tian Ding, Kui Ren

    Abstract: Inverse boundary value problems for the radiative transport equation play important roles in optics-based medical imaging techniques such as diffuse optical tomography (DOT) and fluorescence optical tomography (FOT). Despite the rapid progress in the mathematical theory and numerical computation of these inverse problems in recent years, developing robust and efficient reconstruction algorithms re… ▽ More

    Submitted 1 May, 2014; v1 submitted 19 April, 2014; originally announced April 2014.