Skip to main content

Showing 1–30 of 30 results for author: Lerman, G

Searching in archive stat. Search in all archives.
.
  1. arXiv:2502.03587  [pdf, other

    cs.LG stat.ML

    Stein Discrepancy for Unsupervised Domain Adaptation

    Authors: Anneke von Seeger, Dongmian Zou, Gilad Lerman

    Abstract: Unsupervised domain adaptation (UDA) leverages information from a labeled source dataset to improve accuracy on a related but unlabeled target dataset. A common approach to UDA is aligning representations from the source and target domains by minimizing the distance between their data distributions. Previous methods have employed distances such as Wasserstein distance and maximum mean discrepancy.… ▽ More

    Submitted 21 February, 2025; v1 submitted 5 February, 2025; originally announced February 2025.

    Comments: 24 pages, 9 figures

  2. arXiv:2403.18658  [pdf, ps, other

    math.ST stat.ML

    Theoretical Guarantees for the Subspace-Constrained Tyler's Estimator

    Authors: Gilad Lerman, Feng Yu, Teng Zhang

    Abstract: This work analyzes the subspace-constrained Tyler's estimator (STE) designed for recovering a low-dimensional subspace within a dataset that may be highly corrupted with outliers. It assumes a weak inlier-outlier model and allows the fraction of inliers to be smaller than a fraction that leads to computational hardness of the robust subspace recovery problem. It shows that in this setting, if the… ▽ More

    Submitted 12 April, 2024; v1 submitted 27 March, 2024; originally announced March 2024.

  3. arXiv:2311.02490  [pdf, other

    math.NA math.OC stat.ML

    Improved Convergence Rates of Windowed Anderson Acceleration for Symmetric Fixed-Point Iterations

    Authors: Casey Garner, Gilad Lerman, Teng Zhang

    Abstract: This paper studies the commonly utilized windowed Anderson acceleration (AA) algorithm for fixed-point methods, $x^{(k+1)}=q(x^{(k)})$. It provides the first proof that when the operator $q$ is linear and symmetric the windowed AA, which uses a sliding window of prior iterates, improves the root-linear convergence factor over the fixed-point iterations. When $q$ is nonlinear, yet has a symmetric J… ▽ More

    Submitted 8 March, 2024; v1 submitted 4 November, 2023; originally announced November 2023.

    Comments: 32 pages, 14 figures

    MSC Class: 65F10; 65H10; 68W40

  4. arXiv:2206.08994  [pdf, other

    stat.ML cs.CV cs.LG math.NA

    Robust Group Synchronization via Quadratic Programming

    Authors: Yunpeng Shi, Cole Wyeth, Gilad Lerman

    Abstract: We propose a novel quadratic programming formulation for estimating the corruption levels in group synchronization, and use these estimates to solve this problem. Our objective function exploits the cycle consistency of the group and we thus refer to our method as detection and estimation of structural consistency (DESC). This general framework can be extended to other algebraic and geometric stru… ▽ More

    Submitted 17 June, 2022; originally announced June 2022.

    Comments: Accepted to ICML 2022

    MSC Class: 90C26; 90C17; 68Q87; 65C20; 90-08; 60-08 ACM Class: G.1.6; I.4.0

  5. arXiv:2206.01874  [pdf, other

    cs.LG stat.ML

    An Unpooling Layer for Graph Generation

    Authors: Yinglong Guo, Dongmian Zou, Gilad Lerman

    Abstract: We propose a novel and trainable graph unpooling layer for effective graph generation. Given a graph with features, the unpooling layer enlarges this graph and learns its desired new structure and features. Since this unpooling layer is trainable, it can be applied to graph generation either in the decoder of a variational autoencoder or in the generator of a generative adversarial network (GAN).… ▽ More

    Submitted 5 March, 2023; v1 submitted 3 June, 2022; originally announced June 2022.

  6. arXiv:2203.16505  [pdf, other

    cs.CV math.NA stat.ML

    Fast, Accurate and Memory-Efficient Partial Permutation Synchronization

    Authors: Shaohan Li, Yunpeng Shi, Gilad Lerman

    Abstract: Previous partial permutation synchronization (PPS) algorithms, which are commonly used for multi-object matching, often involve computation-intensive and memory-demanding matrix operations. These operations become intractable for large scale structure-from-motion datasets. For pure permutation synchronization, the recent Cycle-Edge Message Passing (CEMP) framework suggests a memory-efficient and f… ▽ More

    Submitted 31 March, 2022; v1 submitted 30 March, 2022; originally announced March 2022.

    Comments: Accepted to CVPR 2022

    MSC Class: 90C26; 90C10; 90C17; 68Q87; 65C20

  7. Ensemble Riemannian Data Assimilation over the Wasserstein Space

    Authors: Sagar K. Tamang, Ardeshir Ebtehaj, Peter J. Van Leeuwen, Dongmian Zou, Gilad Lerman

    Abstract: In this paper, we present an ensemble data assimilation paradigm over a Riemannian manifold equipped with the Wasserstein metric. Unlike the Eulerian penalization of error in the Euclidean space, the Wasserstein metric can capture translation and difference between the shapes of square-integrable probability distributions of the background state and observations -- enabling to formally penalize ge… ▽ More

    Submitted 24 March, 2021; v1 submitted 7 September, 2020; originally announced September 2020.

    Journal ref: Nonlinear Processes in Geophysics, 28, 295-309 (2021)

  8. arXiv:2007.13638  [pdf, other

    cs.CV cs.IT stat.ML

    Message Passing Least Squares Framework and its Application to Rotation Synchronization

    Authors: Yunpeng Shi, Gilad Lerman

    Abstract: We propose an efficient algorithm for solving group synchronization under high levels of corruption and noise, while we focus on rotation synchronization. We first describe our recent theoretically guaranteed message passing algorithm that estimates the corruption levels of the measured group ratios. We then propose a novel reweighted least squares method to estimate the group elements, where the… ▽ More

    Submitted 14 August, 2020; v1 submitted 27 July, 2020; originally announced July 2020.

    Comments: To Appear in ICML 2020 Proceedings

    MSC Class: 90C26; 90C17; 68Q87; 65C20; 90-08; 60-08 ACM Class: G.1.6; I.4.0

    Journal ref: International Conference on Machine Learning, 8796-8806 (2020)

  9. arXiv:2006.06658  [pdf, other

    cs.CV math.NA math.PR stat.ML

    Robust Multi-object Matching via Iterative Reweighting of the Graph Connection Laplacian

    Authors: Yunpeng Shi, Shaohan Li, Gilad Lerman

    Abstract: We propose an efficient and robust iterative solution to the multi-object matching problem. We first clarify serious limitations of current methods as well as the inappropriateness of the standard iteratively reweighted least squares procedure. In view of these limitations, we suggest a novel and more reliable iterative reweighting strategy that incorporates information from higher-order neighborh… ▽ More

    Submitted 24 October, 2020; v1 submitted 11 June, 2020; originally announced June 2020.

    MSC Class: 90C26; 90C10; 90C17; 68Q87; 65C20 ACM Class: G.1.6; I.4.0

    Journal ref: Advances in Neural Information Processing Systems (NeurIPS) 33, 15243--15253 (2020)

  10. arXiv:2006.05534  [pdf, other

    cs.LG stat.ML

    Novelty Detection via Robust Variational Autoencoding

    Authors: Chieh-Hsin Lai, Dongmian Zou, Gilad Lerman

    Abstract: We propose a new method for novelty detection that can tolerate high corruption of the training points, whereas previous works assumed either no or very low corruption. Our method trains a robust variational autoencoder (VAE), which aims to generate a model for the uncorrupted training points. To gain robustness to high corruption, we incorporate the following four changes to the common VAE: 1. Ex… ▽ More

    Submitted 1 March, 2023; v1 submitted 9 June, 2020; originally announced June 2020.

  11. arXiv:2003.02421  [pdf, other

    stat.ME math.DS

    Regularized Variational Data Assimilation for Bias Treatment using the Wasserstein Metric

    Authors: Sagar K. Tamang, Ardeshir Ebtehaj, Dongmian Zou, Gilad Lerman

    Abstract: This paper presents a new variational data assimilation (VDA) approach for the formal treatment of bias in both model outputs and observations. This approach relies on the Wasserstein metric stemming from the theory of optimal mass transport to penalize the distance between the probability histograms of the analysis state and an a priori reference dataset, which is likely to be more uncertain but… ▽ More

    Submitted 4 March, 2020; originally announced March 2020.

    Comments: 7 figures

    Journal ref: Quarterly Journal of the Royal Meteorological Society, Volume 146, Issue 730, pages 2332-2346, July 2020

  12. arXiv:1912.11347  [pdf, other

    stat.ML cs.IT math.OC math.PR

    Robust Group Synchronization via Cycle-Edge Message Passing

    Authors: Gilad Lerman, Yunpeng Shi

    Abstract: We propose a general framework for solving the group synchronization problem, where we focus on the setting of adversarial or uniform corruption and sufficiently small noise. Specifically, we apply a novel message passing procedure that uses cycle consistency information in order to estimate the corruption levels of group ratios and consequently solve the synchronization problem in our setting. We… ▽ More

    Submitted 27 July, 2021; v1 submitted 24 December, 2019; originally announced December 2019.

    MSC Class: 90-08; 62G35; 68Q25; 68W40; 68Q87; 93E10

  13. arXiv:1904.03275  [pdf, ps, other

    cs.LG math.OC stat.ML

    Robust Subspace Recovery with Adversarial Outliers

    Authors: Tyler Maunu, Gilad Lerman

    Abstract: We study the problem of robust subspace recovery (RSR) in the presence of adversarial outliers. That is, we seek a subspace that contains a large portion of a dataset when some fraction of the data points are arbitrarily corrupted. We first examine a theoretical estimator that is intractable to calculate and use it to derive information-theoretic bounds of exact recovery. We then propose two tract… ▽ More

    Submitted 5 April, 2019; originally announced April 2019.

    Comments: 21 pages, 1 table

  14. arXiv:1904.00152  [pdf, other

    cs.LG cs.CV stat.ML

    Robust Subspace Recovery Layer for Unsupervised Anomaly Detection

    Authors: Chieh-Hsin Lai, Dongmian Zou, Gilad Lerman

    Abstract: We propose a neural network for unsupervised anomaly detection with a novel robust subspace recovery layer (RSR layer). This layer seeks to extract the underlying subspace from a latent representation of the given data and removes outliers that lie away from this subspace. It is used within an autoencoder. The encoder maps the data into a latent space, from which the RSR layer extracts the subspac… ▽ More

    Submitted 24 December, 2019; v1 submitted 30 March, 2019; originally announced April 2019.

    Comments: This work is on the ICLR 2020 conference

    Journal ref: Eighth International Conference on Learning Representations (ICLR), 2020, https://openreview.net/pdf?id=rylb3eBtwr

  15. Encoding Robust Representation for Graph Generation

    Authors: Dongmian Zou, Gilad Lerman

    Abstract: Generative networks have made it possible to generate meaningful signals such as images and texts from simple noise. Recently, generative methods based on GAN and VAE were developed for graphs and graph signals. However, the mathematical properties of these methods are unclear, and training good generative models is difficult. This work proposes a graph generation model that uses a recent adaptati… ▽ More

    Submitted 15 January, 2019; v1 submitted 28 September, 2018; originally announced September 2018.

    Comments: 9 pages, 7 figures, 6 tables

    Journal ref: 2019 International Joint Conference on Neural Networks (IJCNN), Budapest, Hungary, 2019, pp. 1-9

  16. An Overview of Robust Subspace Recovery

    Authors: Gilad Lerman, Tyler Maunu

    Abstract: This paper will serve as an introduction to the body of work on robust subspace recovery. Robust subspace recovery involves finding an underlying low-dimensional subspace in a dataset that is possibly corrupted with outliers. While this problem is easy to state, it has been difficult to develop optimal algorithms due to its underlying nonconvexity. This work emphasizes advantages and disadvantages… ▽ More

    Submitted 5 July, 2018; v1 submitted 2 March, 2018; originally announced March 2018.

    Comments: 31 pages, 5 figures, 3 tables

    Journal ref: Proceedings of the IEEE 106 (2018) 1380-1410

  17. arXiv:1706.03896  [pdf, other

    cs.LG math.OC stat.ML

    A Well-Tempered Landscape for Non-convex Robust Subspace Recovery

    Authors: Tyler Maunu, Teng Zhang, Gilad Lerman

    Abstract: We present a mathematical analysis of a non-convex energy landscape for robust subspace recovery. We prove that an underlying subspace is the only stationary point and local minimizer in a specified neighborhood under a deterministic condition on a dataset. If the deterministic condition is satisfied, we further show that a geodesic gradient descent method over the Grassmannian manifold can exactl… ▽ More

    Submitted 28 February, 2019; v1 submitted 12 June, 2017; originally announced June 2017.

    Comments: 58 pages, 6 figures, 1 table

    Journal ref: Journal of Machine Learning Research, 20(37):1-59, 2019

  18. arXiv:1510.08406  [pdf, ps, other

    stat.ML

    Fast Landmark Subspace Clustering

    Authors: Xu Wang, Gilad Lerman

    Abstract: Kernel methods obtain superb performance in terms of accuracy for various machine learning tasks since they can effectively extract nonlinear relations. However, their time complexity can be rather large especially for clustering tasks. In this paper we define a general class of kernels that can be easily approximated by randomization. These kernels appear in various applications, in particular, t… ▽ More

    Submitted 28 October, 2015; originally announced October 2015.

  19. arXiv:1507.06710  [pdf, other

    math.ST stat.AP

    Nonparametric Bayesian Regression on Manifolds via Brownian Motion

    Authors: Xu Wang, Gilad Lerman

    Abstract: This paper proposes a novel framework for manifold-valued regression and establishes its consistency as well as its contraction rate. It assumes a predictor with values in the interval $[0,1]$ and response with values in a compact Riemannian manifold $M$. This setting is useful for applications such as modeling dynamic scenes or shape deformations, where the visual scene or the deformed objects ca… ▽ More

    Submitted 23 July, 2015; originally announced July 2015.

  20. arXiv:1410.0095  [pdf, ps, other

    stat.ML cs.CV cs.LG

    Riemannian Multi-Manifold Modeling

    Authors: Xu Wang, Konstantinos Slavakis, Gilad Lerman

    Abstract: This paper advocates a novel framework for segmenting a dataset in a Riemannian manifold $M$ into clusters lying around low-dimensional submanifolds of $M$. Important examples of $M$, for which the proposed clustering algorithm is computationally efficient, are the sphere, the set of positive definite matrices, and the Grassmannian. The clustering problem with these examples of $M$ is already usef… ▽ More

    Submitted 30 September, 2014; originally announced October 2014.

  21. arXiv:1406.6145  [pdf, other

    cs.LG cs.CV stat.AP stat.ML

    Fast, Robust and Non-convex Subspace Recovery

    Authors: Gilad Lerman, Tyler Maunu

    Abstract: This work presents a fast and non-convex algorithm for robust subspace recovery. The data sets considered include inliers drawn around a low-dimensional subspace of a higher dimensional ambient space, and a possibly large portion of outliers that do not lie nearby this subspace. The proposed algorithm, which we refer to as Fast Median Subspace (FMS), is designed to robustly determine the underlyin… ▽ More

    Submitted 9 June, 2016; v1 submitted 24 June, 2014; originally announced June 2014.

    Journal ref: Information and Inference: A Journal of the IMA 7 (2018) 277-336

  22. arXiv:1301.2007  [pdf, other

    stat.ML

    Spectral Clustering Based on Local PCA

    Authors: Ery Arias-Castro, Gilad Lerman, Teng Zhang

    Abstract: We propose a spectral clustering method based on local principal components analysis (PCA). After performing local PCA in selected neighborhoods, the algorithm builds a nearest neighbor graph weighted according to a discrepancy between the principal subspaces in the neighborhoods, and then applies spectral clustering. As opposed to standard spectral methods based solely on pairwise distances betwe… ▽ More

    Submitted 9 January, 2013; originally announced January 2013.

    Journal ref: Journal of Machine Learning Research, 18(9):1-57, 2017

  23. arXiv:1202.4044  [pdf, other

    cs.IT stat.CO stat.ML

    Robust computation of linear models by convex relaxation

    Authors: Gilad Lerman, Michael McCoy, Joel A. Tropp, Teng Zhang

    Abstract: Consider a dataset of vector-valued observations that consists of noisy inliers, which are explained well by a low-dimensional subspace, along with some number of outliers. This work describes a convex optimization problem, called REAPER, that can reliably fit a low-dimensional model to this type of data. This approach parameterizes linear subspaces using orthogonal projectors, and it uses a relax… ▽ More

    Submitted 11 August, 2014; v1 submitted 17 February, 2012; originally announced February 2012.

    Comments: Formerly titled "Robust computation of linear models, or How to find a needle in a haystack"

    MSC Class: 62H25; 65K05; 90C22

    Journal ref: Foundations of Computational Mathematics, April 2015, Volume 15, Issue 2, pp 363-410

  24. arXiv:1112.4863  [pdf, ps, other

    stat.ML math.OC

    A Novel M-Estimator for Robust PCA

    Authors: Teng Zhang, Gilad Lerman

    Abstract: We study the basic problem of robust subspace recovery. That is, we assume a data set that some of its points are sampled around a fixed subspace and the rest of them are spread in the whole ambient space, and we aim to recover the fixed underlying subspace. We first estimate "robust inverse sample covariance" by solving a convex minimization procedure; we then recover the subspace by the bottom e… ▽ More

    Submitted 23 June, 2014; v1 submitted 20 December, 2011; originally announced December 2011.

    Journal ref: Journal of Machine Learning Research 15 (2014) 749-808

  25. arXiv:1104.3770  [pdf, ps, other

    stat.ML math.ST

    Robust recovery of multiple subspaces by geometric l_p minimization

    Authors: Gilad Lerman, Teng Zhang

    Abstract: We assume i.i.d. data sampled from a mixture distribution with K components along fixed d-dimensional linear subspaces and an additional outlier component. For p>0, we study the simultaneous recovery of the K fixed subspaces by minimizing the l_p-averaged distances of the sampled data points from any K subspaces. Under some conditions, we show that if $0<p\leq1$, then all underlying subspaces can… ▽ More

    Submitted 1 February, 2012; v1 submitted 19 April, 2011; originally announced April 2011.

    Comments: Published in at http://dx.doi.org/10.1214/11-AOS914 the Annals of Statistics (http://www.imstat.org/aos/) by the Institute of Mathematical Statistics (http://www.imstat.org)

    Report number: IMS-AOS-AOS914

    Journal ref: Annals of Statistics 2011, Vol. 39, No. 5, 2686-2715

  26. arXiv:1012.4116  [pdf, ps, other

    stat.ML cs.CV math.FA

    lp-Recovery of the Most Significant Subspace among Multiple Subspaces with Outliers

    Authors: Gilad Lerman, Teng Zhang

    Abstract: We assume data sampled from a mixture of d-dimensional linear subspaces with spherically symmetric distributions within each subspace and an additional outlier component with spherically symmetric distribution within the ambient space (for simplicity we may assume that all distributions are uniform on their corresponding unit spheres). We also assume mixture weights for the different components. W… ▽ More

    Submitted 13 January, 2014; v1 submitted 18 December, 2010; originally announced December 2010.

    Comments: This is a revised version of the part of 1002.1994 that deals with single subspace recovery. V3: Improved estimates (in particular for Lemma 3.1 and for estimates relying on it), asymptotic dependence of probabilities and constants on D and d and further clarifications; for simplicity it assumes uniform distributions on spheres. V4: minor revision for the published version

    Journal ref: Constructive Approximation, December 2014, Volume 40, Issue 3, pp 329-385

  27. Hybrid Linear Modeling via Local Best-fit Flats

    Authors: Teng Zhang, Arthur Szlam, Yi Wang, Gilad Lerman

    Abstract: We present a simple and fast geometric method for modeling data by a union of affine subspaces. The method begins by forming a collection of local best-fit affine subspaces, i.e., subspaces approximating the data in local neighborhoods. The correct sizes of the local neighborhoods are determined automatically by the Jones' $β_2$ numbers (we prove under certain geometric conditions that our method… ▽ More

    Submitted 1 May, 2012; v1 submitted 17 October, 2010; originally announced October 2010.

    Comments: This version adds some clarifications and numerical experiments as well as strengthens the previous theorem. For face experiments, we use here the Extended Yale Face Database B (cropped faces unlike previous version). This database points to a failure mode of our algorithms, but we suggest and successfully test a workaround

    Journal ref: International Journal of Computer Vision Volume 100, Issue 3 (2012), Page 217-240

  28. arXiv:1002.1994   

    stat.ML

    Probabilistic Recovery of Multiple Subspaces in Point Clouds by Geometric lp Minimization

    Authors: Gilad Lerman, Teng Zhang

    Abstract: We assume data independently sampled from a mixture distribution on the unit ball of the D-dimensional Euclidean space with K+1 components: the first component is a uniform distribution on that ball representing outliers and the other K components are uniform distributions along K d-dimensional linear subspaces restricted to that ball. We study both the simultaneous recovery of all K underlying su… ▽ More

    Submitted 19 April, 2012; v1 submitted 9 February, 2010; originally announced February 2010.

    Comments: This paper was split into two different papers: 1. https://arxiv.boxedpaper.com/abs/1012.4116 2. https://arxiv.boxedpaper.com/abs/1104.3770

  29. arXiv:1001.1323  [pdf, other

    stat.ML math.ST

    Spectral clustering based on local linear approximations

    Authors: Ery Arias-Castro, Guangliang Chen, Gilad Lerman

    Abstract: In the context of clustering, we assume a generative model where each cluster is the result of sampling points in the neighborhood of an embedded smooth surface; the sample may be contaminated with outliers, which are modeled as points sampled in space away from the clusters. We consider a prototype for a higher-order spectral clustering method based on the residual from a local linear approximati… ▽ More

    Submitted 28 November, 2011; v1 submitted 8 January, 2010; originally announced January 2010.

    MSC Class: 62H30; 62G20; 68T10

    Journal ref: Electronic Journal of Statistics, Vol. 5 (2011), pages 1537-1587

  30. Foundations of a Multi-way Spectral Clustering Framework for Hybrid Linear Modeling

    Authors: Guangliang Chen, Gilad Lerman

    Abstract: The problem of Hybrid Linear Modeling (HLM) is to model and segment data using a mixture of affine subspaces. Different strategies have been proposed to solve this problem, however, rigorous analysis justifying their performance is missing. This paper suggests the Theoretical Spectral Curvature Clustering (TSCC) algorithm for solving the HLM problem, and provides careful analysis to justify it.… ▽ More

    Submitted 14 January, 2009; v1 submitted 20 October, 2008; originally announced October 2008.

    Comments: 40 pages. Minor changes to the previous version (mainly revised Sections 2.2 & 2.3, and added references). Accepted to the Journal of Foundations of Computational Mathematics

    Report number: arXiv:0810.3724v2

    Journal ref: Found Comput Math (2009) 9(5): 517-558