Skip to main content

Showing 1–9 of 9 results for author: Mazumdar, A

Searching in archive math. Search in all archives.
.
  1. arXiv:2502.18393  [pdf, other

    math.ST cs.DS cs.IT cs.LG stat.ML

    Learning sparse generalized linear models with binary outcomes via iterative hard thresholding

    Authors: Namiko Matsumoto, Arya Mazumdar

    Abstract: In statistics, generalized linear models (GLMs) are widely used for modeling data and can expressively capture potential nonlinear dependence of the model's outcomes on its covariates. Within the broad family of GLMs, those with binary outcomes, which include logistic and probit regressions, are motivated by common tasks such as binary classification with (possibly) non-separable data. In addition… ▽ More

    Submitted 25 February, 2025; originally announced February 2025.

  2. arXiv:2412.20301  [pdf, other

    math.NA cs.DC cs.IT eess.SP

    Distributed Hybrid Sketching for $\ell_2$-Embeddings

    Authors: Neophytos Charalambides, Arya Mazumdar

    Abstract: Linear algebraic operations are ubiquitous in engineering applications, and arise often in a variety of fields including statistical signal processing and machine learning. With contemporary large datasets, to perform linear algebraic methods and regression tasks, it is necessary to resort to both distributed computations as well as data compression. In this paper, we study \textit{distributed}… ▽ More

    Submitted 28 December, 2024; originally announced December 2024.

    Comments: 21 pages, 10 figures, 1 table

    MSC Class: 65F10; 65F20; 65F55; 65B99; 65Z05; 68P20; 68P27; 68P30; 68U01; 68W10; 68W15; 68W20; 68W25; 94D99 ACM Class: G.1.2; G.1.3; E.4

  3. arXiv:2403.15928  [pdf, other

    cs.LG math.OC

    Safe Reinforcement Learning for Constrained Markov Decision Processes with Stochastic Stopping Time

    Authors: Abhijit Mazumdar, Rafal Wisniewski, Manuela L. Bujorianu

    Abstract: In this paper, we present an online reinforcement learning algorithm for constrained Markov decision processes with a safety constraint. Despite the necessary attention of the scientific community, considering stochastic stopping time, the problem of learning optimal policy without violating safety constraints during the learning phase is yet to be addressed. To this end, we propose an algorithm b… ▽ More

    Submitted 23 March, 2024; originally announced March 2024.

  4. arXiv:2307.04191  [pdf, ps, other

    math.ST cs.IT cs.LG stat.ML

    On the sample complexity of parameter estimation in logistic regression with normal design

    Authors: Daniel Hsu, Arya Mazumdar

    Abstract: The logistic regression model is one of the most popular data generation model in noisy binary classification problems. In this work, we study the sample complexity of estimating the parameters of the logistic regression model up to a given $\ell_2$ error, in terms of the dimension and the inverse temperature, with standard normal covariates. The inverse temperature controls the signal-to-noise ra… ▽ More

    Submitted 23 May, 2024; v1 submitted 9 July, 2023; originally announced July 2023.

  5. arXiv:2110.00744  [pdf, ps, other

    cs.DS cs.IT cs.LG math.ST

    Random Subgraph Detection Using Queries

    Authors: Wasim Huleihel, Arya Mazumdar, Soumyabrata Pal

    Abstract: The planted densest subgraph detection problem refers to the task of testing whether in a given (random) graph there is a subgraph that is unusually dense. Specifically, we observe an undirected and unweighted graph on $n$ vertices. Under the null hypothesis, the graph is a realization of an Erdős-Rényi graph with edge probability (or, density) $q$. Under the alternative, there is a subgraph on… ▽ More

    Submitted 3 May, 2024; v1 submitted 2 October, 2021; originally announced October 2021.

    Comments: 27 pages

  6. arXiv:2109.01064  [pdf, other

    math.PR cs.IT cs.LG math.ST

    Lower Bounds on the Total Variation Distance Between Mixtures of Two Gaussians

    Authors: Sami Davies, Arya Mazumdar, Soumyabrata Pal, Cyrus Rashtchian

    Abstract: Mixtures of high dimensional Gaussian distributions have been studied extensively in statistics and learning theory. While the total variation distance appears naturally in the sample complexity of distribution learning, it is analytically difficult to obtain tight lower bounds for mixtures. Exploiting a connection between total variation distance and the characteristic function of the mixture, we… ▽ More

    Submitted 9 March, 2022; v1 submitted 2 September, 2021; originally announced September 2021.

    Comments: 22 pages, 1 figure; Accepted to ALT 2022

  7. arXiv:2103.09424  [pdf, other

    cs.DC cs.LG math.OC stat.ML

    Escaping Saddle Points in Distributed Newton's Method with Communication Efficiency and Byzantine Resilience

    Authors: Avishek Ghosh, Raj Kumar Maity, Arya Mazumdar, Kannan Ramchandran

    Abstract: The problem of saddle-point avoidance for non-convex optimization is quite challenging in large scale distributed learning frameworks, such as Federated Learning, especially in the presence of Byzantine workers. The celebrated cubic-regularized Newton method of \cite{nest} is one of the most elegant ways to avoid saddle-points in the standard centralized (non-distributed) setup. In this paper, we… ▽ More

    Submitted 25 December, 2021; v1 submitted 16 March, 2021; originally announced March 2021.

  8. arXiv:2006.08737  [pdf, other

    cs.LG cs.DC math.OC stat.ML

    Distributed Newton Can Communicate Less and Resist Byzantine Workers

    Authors: Avishek Ghosh, Raj Kumar Maity, Arya Mazumdar

    Abstract: We develop a distributed second order optimization algorithm that is communication-efficient as well as robust against Byzantine failures of the worker machines. We propose COMRADE (COMunication-efficient and Robust Approximate Distributed nEwton), an iterative second order algorithm, where the worker machines communicate only once per iteration with the center machine. This is in sharp contrast w… ▽ More

    Submitted 15 June, 2020; originally announced June 2020.

  9. arXiv:1512.09156  [pdf, other

    cs.IT cs.LG math.NA

    Low rank approximation and decomposition of large matrices using error correcting codes

    Authors: Shashanka Ubaru, Arya Mazumdar, Yousef Saad

    Abstract: Low rank approximation is an important tool used in many applications of signal processing and machine learning. Recently, randomized sketching algorithms were proposed to effectively construct low rank approximations and obtain approximate singular value decompositions of large matrices. Similar ideas were used to solve least squares regression problems. In this paper, we show how matrices from e… ▽ More

    Submitted 15 June, 2017; v1 submitted 30 December, 2015; originally announced December 2015.

    Journal ref: IEEE Transactions on Information Theory ( Volume: 63, Issue: 9, Sept. 2017 ) Page(s): 5544 - 5558