Skip to main content

Showing 1–15 of 15 results for author: Hashemi, A

Searching in archive stat. Search in all archives.
.
  1. arXiv:2409.08917  [pdf, other

    cs.LG cs.AI stat.ML

    Latent Space Score-based Diffusion Model for Probabilistic Multivariate Time Series Imputation

    Authors: Guojun Liang, Najmeh Abiri, Atiye Sadat Hashemi, Jens Lundström, Stefan Byttner, Prayag Tiwari

    Abstract: Accurate imputation is essential for the reliability and success of downstream tasks. Recently, diffusion models have attracted great attention in this field. However, these models neglect the latent distribution in a lower-dimensional space derived from the observed data, which limits the generative capacity of the diffusion model. Additionally, dealing with the original missing data without labe… ▽ More

    Submitted 13 September, 2024; originally announced September 2024.

    Comments: 5 pages, conference

  2. arXiv:2405.18237  [pdf, other

    cs.LG math.ST stat.ML

    Unveiling the Cycloid Trajectory of EM Iterations in Mixed Linear Regression

    Authors: Zhankun Luo, Abolfazl Hashemi

    Abstract: We study the trajectory of iterations and the convergence rates of the Expectation-Maximization (EM) algorithm for two-component Mixed Linear Regression (2MLR). The fundamental goal of MLR is to learn the regression models from unlabeled observations. The EM algorithm finds extensive applications in solving the mixture of linear regressions. Recent results have established the super-linear converg… ▽ More

    Submitted 3 June, 2024; v1 submitted 28 May, 2024; originally announced May 2024.

    Comments: This paper was accepted by the 41st International Conference on Machine Learning (ICML 2024). The code for numerical experiments is available at https://github.com/dassein/cycloid_em_mlr

  3. arXiv:2405.02188  [pdf, other

    stat.ML cs.AI cs.LG

    Optimistic Regret Bounds for Online Learning in Adversarial Markov Decision Processes

    Authors: Sang Bin Moon, Abolfazl Hashemi

    Abstract: The Adversarial Markov Decision Process (AMDP) is a learning framework that deals with unknown and varying tasks in decision-making applications like robotics and recommendation systems. A major limitation of the AMDP formalism, however, is pessimistic regret analysis results in the sense that although the cost function can change from one episode to the next, the evolution in many settings is not… ▽ More

    Submitted 3 May, 2024; originally announced May 2024.

  4. arXiv:2111.01692  [pdf, other

    stat.ML cs.AI cs.LG eess.SP stat.AP

    Efficient Hierarchical Bayesian Inference for Spatio-temporal Regression Models in Neuroimaging

    Authors: Ali Hashemi, Yijing Gao, Chang Cai, Sanjay Ghosh, Klaus-Robert Müller, Srikantan S. Nagarajan, Stefan Haufe

    Abstract: Several problems in neuroimaging and beyond require inference on the parameters of multi-task sparse hierarchical regression models. Examples include M/EEG inverse problems, neural encoding models for task-based fMRI analyses, and climate science. In these domains, both the model parameters to be inferred and the measurement noise may exhibit a complex spatio-temporal structure. Existing work eith… ▽ More

    Submitted 23 November, 2021; v1 submitted 2 November, 2021; originally announced November 2021.

    Comments: Accepted to the 35th Conference on Neural Information Processing Systems (NeurIPS 2021)

  5. arXiv:2106.08882  [pdf, other

    cs.LG cs.DC math.OC stat.ML

    Robust Training in High Dimensions via Block Coordinate Geometric Median Descent

    Authors: Anish Acharya, Abolfazl Hashemi, Prateek Jain, Sujay Sanghavi, Inderjit S. Dhillon, Ufuk Topcu

    Abstract: Geometric median (\textsc{Gm}) is a classical method in statistics for achieving a robust estimation of the uncorrupted data; under gross corruption, it achieves the optimal breakdown point of 0.5. However, its computational complexity makes it infeasible for robustifying stochastic gradient descent (SGD) for high-dimensional optimization problems. In this paper, we show that by applying \textsc{G… ▽ More

    Submitted 16 June, 2021; originally announced June 2021.

  6. arXiv:2106.07094  [pdf, other

    cs.LG cs.DC eess.SP math.OC stat.ML

    On the Convergence of Differentially Private Federated Learning on Non-Lipschitz Objectives, and with Normalized Client Updates

    Authors: Rudrajit Das, Abolfazl Hashemi, Sujay Sanghavi, Inderjit S. Dhillon

    Abstract: There is a dearth of convergence results for differentially private federated learning (FL) with non-Lipschitz objective functions (i.e., when gradient norms are not bounded). The primary reason for this is that the clipping operation (i.e., projection onto an $\ell_2$ ball of a fixed radius called the clipping threshold) for bounding the sensitivity of the average update to each client's update i… ▽ More

    Submitted 15 April, 2022; v1 submitted 13 June, 2021; originally announced June 2021.

  7. arXiv:2103.03191  [pdf, other

    stat.ML cs.LG math.NA math.OC math.PR

    Generalization Bounds for Sparse Random Feature Expansions

    Authors: Abolfazl Hashemi, Hayden Schaeffer, Robert Shi, Ufuk Topcu, Giang Tran, Rachel Ward

    Abstract: Random feature methods have been successful in various machine learning tasks, are easy to compute, and come with theoretical accuracy bounds. They serve as an alternative approach to standard neural networks since they can represent similar function spaces without a costly training phase. However, for accuracy, random feature methods require more measurements than trainable parameters, limiting t… ▽ More

    Submitted 20 August, 2021; v1 submitted 4 March, 2021; originally announced March 2021.

  8. arXiv:2012.04061  [pdf, other

    stat.ML cs.DC cs.LG math.OC

    Faster Non-Convex Federated Learning via Global and Local Momentum

    Authors: Rudrajit Das, Anish Acharya, Abolfazl Hashemi, Sujay Sanghavi, Inderjit S. Dhillon, Ufuk Topcu

    Abstract: We propose \texttt{FedGLOMO}, a novel federated learning (FL) algorithm with an iteration complexity of $\mathcal{O}(ε^{-1.5})$ to converge to an $ε$-stationary point (i.e., $\mathbb{E}[\|\nabla f(\bm{x})\|^2] \leq ε$) for smooth non-convex functions -- under arbitrary client heterogeneity and compressed communication -- compared to the $\mathcal{O}(ε^{-2})$ complexity of most prior works. Our key… ▽ More

    Submitted 24 October, 2021; v1 submitted 7 December, 2020; originally announced December 2020.

  9. arXiv:2011.10643  [pdf, other

    cs.LG cs.DC math.OC stat.ML

    On the Benefits of Multiple Gossip Steps in Communication-Constrained Decentralized Optimization

    Authors: Abolfazl Hashemi, Anish Acharya, Rudrajit Das, Haris Vikalo, Sujay Sanghavi, Inderjit Dhillon

    Abstract: In decentralized optimization, it is common algorithmic practice to have nodes interleave (local) gradient descent iterations with gossip (i.e. averaging over the network) steps. Motivated by the training of large-scale machine learning models, it is also increasingly common to require that messages be {\em lossy compressed} versions of the local parameters. In this paper, we show that, in such co… ▽ More

    Submitted 20 November, 2020; originally announced November 2020.

  10. arXiv:1909.12898  [pdf, other

    cs.LG eess.SY stat.ML

    Identifying Sparse Low-Dimensional Structures in Markov Chains: A Nonnegative Matrix Factorization Approach

    Authors: Mahsa Ghasemi, Abolfazl Hashemi, Haris Vikalo, Ufuk Topcu

    Abstract: We consider the problem of learning low-dimensional representations for large-scale Markov chains. We formulate the task of representation learning as that of mapping the state space of the model to a low-dimensional state space, called the kernel space. The kernel space contains a set of meta states which are desired to be representative of only a small subset of original states. To promote this… ▽ More

    Submitted 7 April, 2020; v1 submitted 27 September, 2019; originally announced September 2019.

    Comments: Accepted for publication in American Control Conference (ACC) Proceedings, 2020

  11. arXiv:1711.00142  [pdf, ps, other

    stat.ML cs.LG

    Sampling and Reconstruction of Graph Signals via Weak Submodularity and Semidefinite Relaxation

    Authors: Abolfazl Hashemi, Rasoul Shafipour, Haris Vikalo, Gonzalo Mateos

    Abstract: We study the problem of sampling a bandlimited graph signal in the presence of noise, where the objective is to select a node subset of prescribed cardinality that minimizes the signal reconstruction mean squared error (MSE). To that end, we formulate the task at hand as the minimization of MSE subject to binary constraints, and approximate the resulting NP-hard problem via semidefinite programmin… ▽ More

    Submitted 31 October, 2017; originally announced November 2017.

  12. arXiv:1711.00126  [pdf, ps, other

    cs.LG cs.CV stat.ML

    Accelerated Sparse Subspace Clustering

    Authors: Abolfazl Hashemi, Haris Vikalo

    Abstract: State-of-the-art algorithms for sparse subspace clustering perform spectral clustering on a similarity matrix typically obtained by representing each data point as a sparse combination of other points using either basis pursuit (BP) or orthogonal matching pursuit (OMP). BP-based methods are often prohibitive in practice while the performance of OMP-based schemes are unsatisfactory, especially in s… ▽ More

    Submitted 31 October, 2017; originally announced November 2017.

  13. arXiv:1608.02554  [pdf, ps, other

    stat.ML cs.IT

    Sparse recovery via Orthogonal Least-Squares under presence of Noise

    Authors: Abolfazl Hashemi, Haris Vikalo

    Abstract: We consider the Orthogonal Least-Squares (OLS) algorithm for the recovery of a $m$-dimensional $k$-sparse signal from a low number of noisy linear measurements. The Exact Recovery Condition (ERC) in bounded noisy scenario is established for OLS under certain condition on nonzero elements of the signal. The new result also improves the existing guarantees for Orthogonal Matching Pursuit (OMP) algor… ▽ More

    Submitted 8 August, 2016; originally announced August 2016.

  14. arXiv:1608.02549  [pdf, ps, other

    stat.ML cs.IT

    Sampling Requirements and Accelerated Schemes for Sparse Linear Regression with Orthogonal Least-Squares

    Authors: Abolfazl Hashemi, Haris Vikalo

    Abstract: We study the problem of inferring a sparse vector from random linear combinations of its components. We propose the Accelerated Orthogonal Least-Squares (AOLS) algorithm that improves performance of the well-known Orthogonal Least-Squares (OLS) algorithm while requiring significantly lower computational costs. While OLS greedily selects columns of the coefficient matrix that correspond to non-zero… ▽ More

    Submitted 13 April, 2018; v1 submitted 8 August, 2016; originally announced August 2016.

  15. arXiv:1602.06916  [pdf, ps, other

    stat.ML cs.IT cs.LG

    Sparse Linear Regression via Generalized Orthogonal Least-Squares

    Authors: Abolfazl Hashemi, Haris Vikalo

    Abstract: Sparse linear regression, which entails finding a sparse solution to an underdetermined system of linear equations, can formally be expressed as an $l_0$-constrained least-squares problem. The Orthogonal Least-Squares (OLS) algorithm sequentially selects the features (i.e., columns of the coefficient matrix) to greedily find an approximate sparse solution. In this paper, a generalization of Orthog… ▽ More

    Submitted 28 July, 2016; v1 submitted 22 February, 2016; originally announced February 2016.