Skip to main content

Showing 1–31 of 31 results for author: Fattahi, S

.
  1. arXiv:2505.17304  [pdf, ps, other

    cs.LG math.OC

    Implicit Regularization of Infinitesimally-perturbed Gradient Descent Toward Low-dimensional Solutions

    Authors: Jianhao Ma, Geyu Liang, Salar Fattahi

    Abstract: Implicit regularization refers to the phenomenon where local search algorithms converge to low-dimensional solutions, even when such structures are neither explicitly specified nor encoded in the optimization problem. While widely observed, this phenomenon remains theoretically underexplored, particularly in modern over-parameterized problems. In this paper, we study the conditions that enable imp… ▽ More

    Submitted 22 May, 2025; originally announced May 2025.

  2. arXiv:2504.09708  [pdf, ps, other

    math.OC cs.LG stat.ML

    Preconditioned Gradient Descent for Over-Parameterized Nonconvex Matrix Factorization

    Authors: Gavin Zhang, Salar Fattahi, Richard Y. Zhang

    Abstract: In practical instances of nonconvex matrix factorization, the rank of the true solution $r^{\star}$ is often unknown, so the rank $r$ of the model can be overspecified as $r>r^{\star}$. This over-parameterized regime of matrix factorization significantly slows down the convergence of local search algorithms, from a linear rate with $r=r^{\star}$ to a sublinear rate when $r>r^{\star}$. We propose a… ▽ More

    Submitted 13 April, 2025; originally announced April 2025.

    Comments: NeurIPS 2021. See also https://proceedings.neurips.cc/paper/2021/hash/2f2cd5c753d3cee48e47dbb5bbaed331-Abstract.html

  3. arXiv:2504.09648  [pdf, other

    cs.LG cs.CV stat.CO

    RANSAC Revisited: An Improved Algorithm for Robust Subspace Recovery under Adversarial and Noisy Corruptions

    Authors: Guixian Chen, Jianhao Ma, Salar Fattahi

    Abstract: In this paper, we study the problem of robust subspace recovery (RSR) in the presence of both strong adversarial corruptions and Gaussian noise. Specifically, given a limited number of noisy samples -- some of which are tampered by an adaptive and strong adversary -- we aim to recover a low-dimensional subspace that approximately contains a significant fraction of the uncorrupted samples, up to an… ▽ More

    Submitted 13 April, 2025; originally announced April 2025.

  4. arXiv:2502.06775  [pdf, other

    cs.LG

    Enhancing Performance of Explainable AI Models with Constrained Concept Refinement

    Authors: Geyu Liang, Senne Michielssen, Salar Fattahi

    Abstract: The trade-off between accuracy and interpretability has long been a challenge in machine learning (ML). This tension is particularly significant for emerging interpretable-by-design methods, which aim to redesign ML algorithms for trustworthy interpretability but often sacrifice accuracy in the process. In this paper, we address this gap by investigating the impact of deviations in concept represe… ▽ More

    Submitted 27 May, 2025; v1 submitted 10 February, 2025; originally announced February 2025.

  5. arXiv:2404.08178  [pdf, other

    math.OC

    A Parametric Approach for Solving Convex Quadratic Optimization with Indicators Over Trees

    Authors: Aaresh Bhathena, Salar Fattahi, Andrés Gómez, Simge Küçükyavuz

    Abstract: This paper investigates convex quadratic optimization problems involving $n$ indicator variables, each associated with a continuous variable, particularly focusing on scenarios where the matrix $Q$ defining the quadratic term is positive definite and its sparsity pattern corresponds to the adjacency matrix of a tree graph. We introduce a graph-based dynamic programming algorithm that solves this p… ▽ More

    Submitted 11 April, 2024; originally announced April 2024.

  6. arXiv:2404.07955  [pdf, other

    cs.LG math.ST

    Triple Component Matrix Factorization: Untangling Global, Local, and Noisy Components

    Authors: Naichen Shi, Salar Fattahi, Raed Al Kontar

    Abstract: In this work, we study the problem of common and unique feature extraction from noisy data. When we have N observation matrices from N different and associated sources corrupted by sparse and potentially gross noise, can we recover the common and unique components from these noisy observations? This is a challenging task as the number of parameters to estimate is approximately thrice the number of… ▽ More

    Submitted 8 November, 2024; v1 submitted 21 March, 2024; originally announced April 2024.

    Journal ref: Journal of Machine Learning Research, 2024

  7. arXiv:2402.06756  [pdf, other

    cs.LG math.OC stat.ML

    Convergence of Gradient Descent with Small Initialization for Unregularized Matrix Completion

    Authors: Jianhao Ma, Salar Fattahi

    Abstract: We study the problem of symmetric matrix completion, where the goal is to reconstruct a positive semidefinite matrix $\rm{X}^\star \in \mathbb{R}^{d\times d}$ of rank-$r$, parameterized by $\rm{U}\rm{U}^{\top}$, from only a subset of its observed entries. We show that the vanilla gradient descent (GD) with small initialization provably converges to the ground truth $\rm{X}^\star$ without requiring… ▽ More

    Submitted 9 February, 2024; originally announced February 2024.

  8. arXiv:2307.13750  [pdf, other

    math.OC cs.LG

    Solution Path of Time-varying Markov Random Fields with Discrete Regularization

    Authors: Salar Fattahi, Andres Gomez

    Abstract: We study the problem of inferring sparse time-varying Markov random fields (MRFs) with different discrete and temporal regularizations on the parameters. Due to the intractability of discrete regularization, most approaches for solving this problem rely on the so-called maximum-likelihood estimation (MLE) with relaxed regularization, which neither results in ideal statistical properties nor scale… ▽ More

    Submitted 25 July, 2023; originally announced July 2023.

  9. arXiv:2305.17744  [pdf, other

    stat.ME

    Heterogeneous Matrix Factorization: When Features Differ by Datasets

    Authors: Naichen Shi, Raed Al Kontar, Salar Fattahi

    Abstract: In myriad statistical applications, data are collected from related but heterogeneous sources. These sources share some commonalities while containing idiosyncratic characteristics. One of the most fundamental challenges in such scenarios is to recover the shared and source-specific factors. Despite the existence of a few heuristic approaches, a generic algorithm with theoretical guarantees has ye… ▽ More

    Submitted 27 March, 2024; v1 submitted 28 May, 2023; originally announced May 2023.

  10. arXiv:2305.15311  [pdf, other

    cs.LG cs.CV

    Personalized Dictionary Learning for Heterogeneous Datasets

    Authors: Geyu Liang, Naichen Shi, Raed Al Kontar, Salar Fattahi

    Abstract: We introduce a relevant yet challenging problem named Personalized Dictionary Learning (PerDL), where the goal is to learn sparse linear representations from heterogeneous datasets that share some commonality. In PerDL, we model each dataset's shared and unique features as global and local dictionaries. Challenges for PerDL not only are inherited from classical dictionary learning (DL), but also a… ▽ More

    Submitted 24 May, 2023; originally announced May 2023.

  11. arXiv:2305.15276  [pdf, other

    cs.LG stat.ML

    Robust Sparse Mean Estimation via Incremental Learning

    Authors: Jianhao Ma, Rui Ray Chen, Yinghui He, Salar Fattahi, Wei Hu

    Abstract: In this paper, we study the problem of robust sparse mean estimation, where the goal is to estimate a $k$-sparse mean from a collection of partially corrupted samples drawn from a heavy-tailed distribution. Existing estimators face two critical challenges in this setting. First, they are limited by a conjectured computational-statistical tradeoff, implying that any computationally efficient algori… ▽ More

    Submitted 24 May, 2023; originally announced May 2023.

  12. arXiv:2302.10963  [pdf, other

    cs.LG math.OC

    Can Learning Be Explained By Local Optimality In Robust Low-rank Matrix Recovery?

    Authors: Jianhao Ma, Salar Fattahi

    Abstract: We explore the local landscape of low-rank matrix recovery, focusing on reconstructing a $d_1\times d_2$ matrix $X^\star$ with rank $r$ from $m$ linear measurements, some potentially noisy. When the noise is distributed according to an outlier model, minimizing a nonsmooth $\ell_1$-loss with a simple sub-gradient method can often perfectly recover the ground truth matrix $X^\star$. Given this, a n… ▽ More

    Submitted 4 April, 2025; v1 submitted 21 February, 2023; originally announced February 2023.

  13. arXiv:2210.12816  [pdf, other

    cs.LG eess.SP math.OC

    Simple Alternating Minimization Provably Solves Complete Dictionary Learning

    Authors: Geyu Liang, Gavin Zhang, Salar Fattahi, Richard Y. Zhang

    Abstract: This paper focuses on the noiseless complete dictionary learning problem, where the goal is to represent a set of given signals as linear combinations of a small number of atoms from a learned dictionary. There are two main challenges faced by theoretical and practical studies of dictionary learning: the lack of theoretical guarantees for practically-used heuristic algorithms and their poor scalab… ▽ More

    Submitted 4 March, 2025; v1 submitted 23 October, 2022; originally announced October 2022.

  14. arXiv:2210.00346  [pdf, other

    cs.LG

    Behind the Scenes of Gradient Descent: A Trajectory Analysis via Basis Function Decomposition

    Authors: Jianhao Ma, Lingjun Guo, Salar Fattahi

    Abstract: This work analyzes the solution trajectory of gradient-based algorithms via a novel basis function decomposition. We show that, although solution trajectories of gradient-based algorithms may vary depending on the learning task, they behave almost monotonically when projected onto an appropriate orthonormal function basis. Such projection gives rise to a basis function decomposition of the solutio… ▽ More

    Submitted 3 October, 2022; v1 submitted 1 October, 2022; originally announced October 2022.

  15. arXiv:2207.07612  [pdf, other

    cs.LG stat.ML

    Blessing of Nonconvexity in Deep Linear Models: Depth Flattens the Optimization Landscape Around the True Solution

    Authors: Jianhao Ma, Salar Fattahi

    Abstract: This work characterizes the effect of depth on the optimization landscape of linear regression, showing that, despite their nonconvexity, deeper models have more desirable optimization landscape. We consider a robust and over-parameterized setting, where a subset of measurements are grossly corrupted with noise and the true linear model is captured via an $N$-layer linear neural network. On the ne… ▽ More

    Submitted 15 July, 2022; originally announced July 2022.

  16. arXiv:2206.10174  [pdf, other

    stat.AP stat.ME stat.ML

    Efficient Inference of Spatially-varying Gaussian Markov Random Fields with Applications in Gene Regulatory Networks

    Authors: Visweswaran Ravikumar, Tong Xu, Wajd N. Al-Holou, Salar Fattahi, Arvind Rao

    Abstract: In this paper, we study the problem of inferring spatially-varying Gaussian Markov random fields (SV-GMRF) where the goal is to learn a network of sparse, context-specific GMRFs representing network relationships between genes. An important application of SV-GMRFs is in inference of gene regulatory networks from spatially-resolved transcriptomics datasets. The current work on inference of SV-GMRFs… ▽ More

    Submitted 21 June, 2022; originally announced June 2022.

  17. arXiv:2206.03345  [pdf, other

    math.OC cs.LG stat.ML

    Preconditioned Gradient Descent for Overparameterized Nonconvex Burer--Monteiro Factorization with Global Optimality Certification

    Authors: Gavin Zhang, Salar Fattahi, Richard Y. Zhang

    Abstract: We consider using gradient descent to minimize the nonconvex function $f(X)=φ(XX^{T})$ over an $n\times r$ factor matrix $X$, in which $φ$ is an underlying smooth convex cost function defined over $n\times n$ matrices. While only a second-order stationary point $X$ can be provably found in reasonable time, if $X$ is additionally rank deficient, then its rank deficiency certifies it as being global… ▽ More

    Submitted 21 April, 2025; v1 submitted 7 June, 2022; originally announced June 2022.

    Comments: v2: accepted at JMLR. v3: minor correction in proof of Lemma 27

  18. arXiv:2202.08788  [pdf, other

    cs.LG math.OC stat.ML

    Global Convergence of Sub-gradient Method for Robust Matrix Recovery: Small Initialization, Noisy Measurements, and Over-parameterization

    Authors: Jianhao Ma, Salar Fattahi

    Abstract: In this work, we study the performance of sub-gradient method (SubGM) on a natural nonconvex and nonsmooth formulation of low-rank matrix recovery with $\ell_1$-loss, where the goal is to recover a low-rank matrix from a limited number of measurements, a subset of which may be grossly corrupted with noise. We study a scenario where the rank of the true solution is unknown and over-estimated instea… ▽ More

    Submitted 17 February, 2022; originally announced February 2022.

  19. arXiv:2110.12547  [pdf, other

    math.OC

    A Graph-based Decomposition Method for Convex Quadratic Optimization with Indicators

    Authors: Peijing Liu, Salar Fattahi, Andrés Gómez, Simge Küçükyavuz

    Abstract: In this paper, we consider convex quadratic optimization problems with indicator variables when the matrix $Q$ defining the quadratic term in the objective is sparse. We use a graphical representation of the support of $Q$, and show that if this graph is a path, then we can solve the associated problem in polynomial time. This enables us to construct a compact extended formulation for the closure… ▽ More

    Submitted 24 October, 2021; originally announced October 2021.

  20. arXiv:2102.03585  [pdf, other

    cs.LG stat.CO stat.ML

    Scalable Inference of Sparsely-changing Markov Random Fields with Strong Statistical Guarantees

    Authors: Salar Fattahi, Andres Gomez

    Abstract: In this paper, we study the problem of inferring time-varying Markov random fields (MRF), where the underlying graphical model is both sparse and changes sparsely over time. Most of the existing methods for the inference of time-varying MRFs rely on the regularized maximum likelihood estimation (MLE), that typically suffer from weak statistical guarantees and high computational time. Instead, we i… ▽ More

    Submitted 6 February, 2021; originally announced February 2021.

  21. arXiv:2102.02969  [pdf, other

    cs.LG stat.ML

    Sign-RIP: A Robust Restricted Isometry Property for Low-rank Matrix Recovery

    Authors: Jianhao Ma, Salar Fattahi

    Abstract: Restricted isometry property (RIP), essentially stating that the linear measurements are approximately norm-preserving, plays a crucial role in studying low-rank matrix recovery problem. However, RIP fails in the robust setting, when a subset of the measurements are grossly corrupted with noise. In this work, we propose a robust restricted isometry property, called Sign-RIP, and show its broad app… ▽ More

    Submitted 28 September, 2021; v1 submitted 4 February, 2021; originally announced February 2021.

  22. arXiv:2010.04015  [pdf, other

    cs.LG eess.SY math.OC stat.ML

    Learning Partially Observed Linear Dynamical Systems from Logarithmic Number of Samples

    Authors: Salar Fattahi

    Abstract: In this work, we study the problem of learning partially observed linear dynamical systems from a single sample trajectory. A major practical challenge in the existing system identification methods is the undesirable dependency of their required sample size on the system dimension: roughly speaking, they presume and rely on sample sizes that scale linearly with respect to the system dimension. Evi… ▽ More

    Submitted 8 October, 2020; originally announced October 2020.

  23. arXiv:1909.09895  [pdf, other

    math.OC cs.LG stat.ML

    Efficient Learning of Distributed Linear-Quadratic Controllers

    Authors: Salar Fattahi, Nikolai Matni, Somayeh Sojoudi

    Abstract: In this work, we propose a robust approach to design distributed controllers for unknown-but-sparse linear and time-invariant systems. By leveraging modern techniques in distributed controller synthesis and structured linear inverse problems as applied to system identification, we show that near-optimal distributed controllers can be learned with sub-linear sample complexity and computed with near… ▽ More

    Submitted 10 October, 2019; v1 submitted 21 September, 2019; originally announced September 2019.

  24. arXiv:1905.09937  [pdf, other

    math.OC

    On the Absence of Spurious Local Trajectories in Time-varying Nonconvex Optimization

    Authors: S. Fattahi, C. Josz, Y. Ding, R. Mohammadi, J. Lavaei, S. Sojoudi

    Abstract: In this paper, we study the landscape of an online nonconvex optimization problem, for which the input data vary over time and the solution is a trajectory rather than a single point. To understand the complexity of finding a global solution of this problem, we introduce the notion of \textit{spurious (i.e., non-global) local trajectory} as a generalization to the notion of spurious local solution… ▽ More

    Submitted 30 October, 2020; v1 submitted 23 May, 2019; originally announced May 2019.

  25. arXiv:1904.09396  [pdf, ps, other

    eess.SY math.OC stat.ML

    Learning Sparse Dynamical Systems from a Single Sample Trajectory

    Authors: Salar Fattahi, Nikolai Matni, Somayeh Sojoudi

    Abstract: This paper addresses the problem of identifying sparse linear time-invariant (LTI) systems from a single sample trajectory generated by the system dynamics. We introduce a Lasso-like estimator for the parameters of the system, taking into account their sparse nature. Assuming that the system is stable, or that it is equipped with an initial stabilizing controller, we provide sharp finite-time guar… ▽ More

    Submitted 19 April, 2019; originally announced April 2019.

  26. arXiv:1812.11466  [pdf, other

    cs.LG stat.ML

    Exact Guarantees on the Absence of Spurious Local Minima for Non-negative Rank-1 Robust Principal Component Analysis

    Authors: Salar Fattahi, Somayeh Sojoudi

    Abstract: This work is concerned with the non-negative rank-1 robust principal component analysis (RPCA), where the goal is to recover the dominant non-negative principal components of a data matrix precisely, where a number of measurements could be grossly corrupted with sparse and arbitrary large noise. Most of the known techniques for solving the RPCA rely on convex relaxation methods by lifting the prob… ▽ More

    Submitted 3 September, 2019; v1 submitted 29 December, 2018; originally announced December 2018.

  27. arXiv:1803.07753  [pdf, ps, other

    eess.SY stat.ML

    Sample Complexity of Sparse System Identification Problem

    Authors: Salar Fattahi, Somayeh Sojoudi

    Abstract: In this paper, we study the system identification problem for sparse linear time-invariant systems. We propose a sparsity promoting block-regularized estimator to identify the dynamics of the system with only a limited number of input-state data samples. We characterize the properties of this estimator under high-dimensional scaling, where the growth rate of the system dimension is comparable to o… ▽ More

    Submitted 26 August, 2018; v1 submitted 21 March, 2018; originally announced March 2018.

  28. arXiv:1802.04911  [pdf, ps, other

    stat.ML cs.LG math.OC stat.CO

    Large-Scale Sparse Inverse Covariance Estimation via Thresholding and Max-Det Matrix Completion

    Authors: Richard Y. Zhang, Salar Fattahi, Somayeh Sojoudi

    Abstract: The sparse inverse covariance estimation problem is commonly solved using an $\ell_{1}$-regularized Gaussian maximum likelihood estimator known as "graphical lasso", but its computational cost becomes prohibitive for large data sets. A recent line of results showed--under mild assumptions--that the graphical lasso estimator can be retrieved by soft-thresholding the sample covariance matrix and sol… ▽ More

    Submitted 6 June, 2018; v1 submitted 13 February, 2018; originally announced February 2018.

    Comments: 35-th International Conference on Machine Learning (ICML 2018)

  29. arXiv:1711.10428  [pdf, other

    math.OC

    A Bound Strengthening Method for Optimal Transmission Switching in Power Systems

    Authors: Salar Fattahi, Javad Lavaei, Alper Atamturk

    Abstract: This paper studies the optimal transmission switching (OTS) problem for power systems, where certain lines are fixed (uncontrollable) and the remaining ones are controllable via on/off switches. The goal is to identify a topology of the power grid that minimizes the cost of the system operation while satisfying the physical and operational constraints. Most of the existing methods for the problem… ▽ More

    Submitted 28 November, 2017; originally announced November 2017.

    Report number: BCOL Research Report 17.06, IEOR, University of California-Berkeley

  30. arXiv:1711.09131  [pdf, ps, other

    stat.ML stat.CO

    Sparse Inverse Covariance Estimation for Chordal Structures

    Authors: Salar Fattahi, Richard Y. Zhang, Somayeh Sojoudi

    Abstract: In this paper, we consider the Graphical Lasso (GL), a popular optimization problem for learning the sparse representations of high-dimensional datasets, which is well-known to be computationally expensive for large-scale problems. Recently, we have shown that the sparsity pattern of the optimal solution of GL is equivalent to the one obtained from simply thresholding the sample covariance matrix,… ▽ More

    Submitted 24 November, 2017; originally announced November 2017.

  31. arXiv:1708.09479  [pdf, ps, other

    stat.ML

    Graphical Lasso and Thresholding: Equivalence and Closed-form Solutions

    Authors: Salar Fattahi, Somayeh Sojoudi

    Abstract: Graphical Lasso (GL) is a popular method for learning the structure of an undirected graphical model, which is based on an $l_1$ regularization technique. The objective of this paper is to compare the computationally-heavy GL technique with a numerically-cheap heuristic method that is based on simply thresholding the sample covariance matrix. To this end, two notions of sign-consistent and inverse… ▽ More

    Submitted 28 June, 2019; v1 submitted 30 August, 2017; originally announced August 2017.