Showing 1–12 of 12 results for author: Motahari, A

Searching in archive stat.
  1. arXiv:2506.10101  [pdf, ps, other]

    stat.ML cs.LG

    Fundamental Limits of Learning High-dimensional Simplices in Noisy Regimes

    Authors: Seyed Amir Hossein Saberi, Amir Najafi, Abolfazl Motahari, Babak H. Khalaj

    Abstract: In this paper, we establish sample complexity bounds for learning high-dimensional simplices in $\mathbb{R}^K$ from noisy data. Specifically, we consider $n$ i.i.d. samples uniformly drawn from an unknown simplex in $\mathbb{R}^K$, each corrupted by additive Gaussian noise of unknown variance. We prove an algorithm exists that, with high probability, outputs a simplex within $\ell_2$ or total vari…

    Submitted 11 June, 2025; originally announced June 2025.

    Comments: Extension of our ICML 2023 paper, 44 pages

  2. arXiv:2410.14061  [pdf, other]

    stat.ML cs.LG

    Gradual Domain Adaptation via Manifold-Constrained Distributionally Robust Optimization

    Authors: Amir Hossein Saberi, Amir Najafi, Ala Emrani, Amin Behjati, Yasaman Zolfimoselo, Mahdi Shadrooy, Abolfazl Motahari, Babak H. Khalaj

    Abstract: The aim of this paper is to address the challenge of gradual domain adaptation within a class of manifold-constrained data distributions. In particular, we consider a sequence of $T\ge2$ data distributions $P_1,\ldots,P_T$ undergoing a gradual shift, where each pair of consecutive measures $P_i,P_{i+1}$ are close to each other in Wasserstein distance. We have a supervised dataset of size $n$ sampl…

    Submitted 17 October, 2024; originally announced October 2024.

    Comments: Published at Proceedings of Neural Information Processing Systems (NeurIPS) 2024

  3. arXiv:2310.00027  [pdf, ps, other]

    stat.ML cs.LG

    Out-Of-Domain Unlabeled Data Improves Generalization

    Authors: Amir Hossein Saberi, Amir Najafi, Alireza Heidari, Mohammad Hosein Movasaghinia, Abolfazl Motahari, Babak H. Khalaj

    Abstract: We propose a novel framework for incorporating unlabeled data into semi-supervised classification problems, where scenarios involving the minimization of either i) adversarially robust or ii) non-robust loss functions have been considered. Notably, we allow the unlabeled samples to deviate slightly (in total variation sense) from the in-domain distribution. The core idea behind our framework is to…

    Submitted 15 February, 2024; v1 submitted 28 September, 2023; originally announced October 2023.

    Comments: Published at ICLR 2024 (Spotlight), 29 pages, no figures

  4. arXiv:2209.05953  [pdf, ps, other]

    stat.ML cs.LG

    Sample Complexity Bounds for Learning High-dimensional Simplices in Noisy Regimes

    Authors: Amir Hossein Saberi, Amir Najafi, Seyed Abolfazl Motahari, Babak H. Khalaj

    Abstract: In this paper, we find a sample complexity bound for learning a simplex from noisy samples. Assume a dataset of size $n$ is given which includes i.i.d. samples drawn from a uniform distribution over an unknown simplex in $\mathbb{R}^K$, where samples are assumed to be corrupted by a multi-variate additive Gaussian noise of an arbitrary magnitude. We prove the existence of an algorithm that with hi…

    Submitted 28 April, 2023; v1 submitted 9 September, 2022; originally announced September 2022.

    Comments: Accepted for ICML 2023; 27 pages

  5. arXiv:2111.02802  [pdf, other]

    stat.ML cs.LG

    Distributed Sparse Feature Selection in Communication-Restricted Networks

    Authors: Hanie Barghi, Amir Najafi, Seyed Abolfazl Motahari

    Abstract: This paper aims to propose and theoretically analyze a new distributed scheme for sparse linear regression and feature selection. The primary goal is to learn the few causal features of a high-dimensional dataset based on noisy observations from an unknown sparse linear model. However, the presumed training set which includes $n$ data samples in $\mathbb{R}^p$ is already distributed over a large n…

    Submitted 2 November, 2021; originally announced November 2021.

    Comments: Submitted to IEEE Transactions on Signal Processing, 14 pages

  6. arXiv:2012.07527  [pdf, other]

    cs.CL cs.LG stat.ML

    Regularizing Recurrent Neural Networks via Sequence Mixup

    Authors: Armin Karamzade, Amir Najafi, Seyed Abolfazl Motahari

    Abstract: In this paper, we extend a class of celebrated regularization techniques originally proposed for feed-forward neural networks, namely Input Mixup (Zhang et al., 2017) and Manifold Mixup (Verma et al., 2018), to the realm of Recurrent Neural Networks (RNN). Our proposed methods are easy to implement and have a low computational complexity, while leverage the performance of simple neural architectur…

    Submitted 27 November, 2020; originally announced December 2020.

    Comments: 17 pages

  7. arXiv:1812.10437  [pdf, other]

    cs.LG cs.DC cs.IT stat.ML

    Structure Learning of Sparse GGMs over Multiple Access Networks

    Authors: Mostafa Tavassolipour, Armin Karamzade, Reza Mirzaeifard, Seyed Abolfazl Motahari, Mohammad-Taghi Manzuri Shalmani

    Abstract: A central machine is interested in estimating the underlying structure of a sparse Gaussian Graphical Model (GGM) from datasets distributed across multiple local machines. The local machines can communicate with the central machine through a wireless multiple access channel. In this paper, we are interested in designing effective strategies where reliable learning is feasible under power and bandw…

    Submitted 26 December, 2018; originally announced December 2018.

  8. arXiv:1810.07845  [pdf, other]

    cs.LG stat.ML

    On Statistical Learning of Simplices: Unmixing Problem Revisited

    Authors: Amir Najafi, Saeed Ilchi, Amir H. Saberi, Seyed Abolfazl Motahari, Babak H. Khalaj, Hamid R. Rabiee

    Abstract: We study the sample complexity of learning a high-dimensional simplex from a set of points uniformly sampled from its interior. Learning of simplices is a long studied problem in computer science and has applications in computational biology and remote sensing, mostly under the name of `spectral unmixing'. We theoretically show that a sufficient sample complexity for reliable learning of a $K$-dim…

    Submitted 12 August, 2020; v1 submitted 17 October, 2018; originally announced October 2018.

    Comments: 32 pages

  9. Learning of Tree-Structured Gaussian Graphical Models on Distributed Data under Communication Constraints

    Authors: Mostafa Tavassolipour, Seyed Abolfazl Motahari, Mohammad-Taghi Manzuri Shalmani

    Abstract: In this paper, learning of tree-structured Gaussian graphical models from distributed data is addressed. In our model, samples are stored in a set of distributed machines where each machine has access to only a subset of features. A central machine is then responsible for learning the structure based on received messages from the other nodes. We present a set of communication efficient strategies,…

    Submitted 21 September, 2018; originally announced September 2018.

  10. arXiv:1806.04863  [pdf, other]

    q-bio.GN cs.LG stat.ML

    Cell Identity Codes: Understanding Cell Identity from Gene Expression Profiles using Deep Neural Networks

    Authors: Farzad Abdolhosseini, Behrooz Azarkhalili, Abbas Maazallahi, Aryan Kamal, Seyed Abolfazl Motahari, Ali Sharifi-Zarchi, Hamidreza Chitsaz

    Abstract: Understanding cell identity is an important task in many biomedical areas. Expression patterns of specific marker genes have been used to characterize some limited cell types, but exclusive markers are not available for many cell types. A second approach is to use machine learning to discriminate cell types based on the whole gene expression profiles (GEPs). The accuracies of simple classification…

    Submitted 13 June, 2018; originally announced June 2018.

  11. arXiv:1710.02101  [pdf, ps, other]

    cs.LG cs.IT stat.ML

    Reliable Clustering of Bernoulli Mixture Models

    Authors: Amir Najafi, Abolfazl Motahari, Hamid R. Rabiee

    Abstract: A Bernoulli Mixture Model (BMM) is a finite mixture of random binary vectors with independent dimensions. The problem of clustering BMM data arises in a variety of real-world applications, ranging from population genetics to activity analysis in social networks. In this paper, we analyze the clusterability of BMMs from a theoretical perspective, when the number of clusters is unknown. In particula…

    Submitted 16 June, 2019; v1 submitted 5 October, 2017; originally announced October 2017.

    Comments: 22 pages

  12. arXiv:1705.02627  [pdf, other]

    stat.ML cs.IT cs.LG

    Learning of Gaussian Processes in Distributed and Communication Limited Systems

    Authors: Mostafa Tavassolipour, Seyed Abolfazl Motahari, Mohammad-Taghi Manzuri Shalmani

    Abstract: It is of fundamental importance to find algorithms obtaining optimal performance for learning of statistical models in distributed and communication limited systems. Aiming at characterizing the optimal strategies, we consider learning of Gaussian Processes (GPs) in distributed systems as a pivotal example. We first address a very basic problem: how many bits are required to estimate the inner-pro…

    Submitted 7 May, 2017; originally announced May 2017.