Skip to main content

Showing 1–10 of 10 results for author: Khodadadian, S

.
  1. arXiv:2505.21796  [pdf, ps, other

    stat.ML cs.LG math.PR

    A General-Purpose Theorem for High-Probability Bounds of Stochastic Approximation with Polyak Averaging

    Authors: Sajad Khodadadian, Martin Zubeldia

    Abstract: Polyak-Ruppert averaging is a widely used technique to achieve the optimal asymptotic variance of stochastic approximation (SA) algorithms, yet its high-probability performance guarantees remain underexplored in general settings. In this paper, we present a general framework for establishing non-asymptotic concentration bounds for the error of averaged SA iterates. Our approach assumes access to i… ▽ More

    Submitted 27 May, 2025; originally announced May 2025.

    Comments: 37 pages

  2. arXiv:2401.00364  [pdf, other

    cs.LG eess.SY math.OC

    Tight Finite Time Bounds of Two-Time-Scale Linear Stochastic Approximation with Markovian Noise

    Authors: Shaan Ul Haque, Sajad Khodadadian, Siva Theja Maguluri

    Abstract: Stochastic approximation (SA) is an iterative algorithm for finding the fixed point of an operator using noisy samples and widely used in optimization and Reinforcement Learning (RL). The noise in RL exhibits a Markovian structure, and in some cases, such as gradient temporal difference (GTD) methods, SA is employed in a two-time-scale framework. This combination introduces significant theoretical… ▽ More

    Submitted 11 May, 2025; v1 submitted 30 December, 2023; originally announced January 2024.

    Comments: 83 pages, 6 figures

  3. arXiv:2206.10185  [pdf, ps, other

    cs.LG

    Federated Stochastic Approximation under Markov Noise and Heterogeneity: Applications in Reinforcement Learning

    Authors: Sajad Khodadadian, Pranay Sharma, Gauri Joshi, Siva Theja Maguluri

    Abstract: Since reinforcement learning algorithms are notoriously data-intensive, the task of sampling observations from the environment is usually split across multiple agents. However, transferring these observations from the agents to a central location can be prohibitively expensive in terms of communication cost, and it can also compromise the privacy of each agent's local behavior policy. Federated re… ▽ More

    Submitted 21 October, 2024; v1 submitted 21 June, 2022; originally announced June 2022.

    Comments: 80 pages, 0 figure, accepted to ICML 2022 for long presentation

  4. arXiv:2106.00772  [pdf, other

    cs.LG cs.CY cs.IT

    Information Theoretic Measures for Fairness-aware Feature Selection

    Authors: Sajad Khodadadian, Mohamed Nafea, AmirEmad Ghassami, Negar Kiyavash

    Abstract: Machine learning algorithms are increasingly used for consequential decision making regarding individuals based on their relevant features. Features that are relevant for accurate decisions may however lead to either explicit or implicit forms of discrimination against unprivileged groups, such as those of certain race or gender. This happens due to existing biases in the training data, which are… ▽ More

    Submitted 8 June, 2021; v1 submitted 1 June, 2021; originally announced June 2021.

    Comments: 15 pages, 6 figures

  5. arXiv:2105.12540  [pdf, ps, other

    cs.LG math.OC

    Finite-Sample Analysis of Off-Policy Natural Actor-Critic with Linear Function Approximation

    Authors: Zaiwei Chen, Sajad Khodadadian, Siva Theja Maguluri

    Abstract: In this paper, we develop a novel variant of off-policy natural actor-critic algorithm with linear function approximation and we establish a sample complexity of $\mathcal{O}(ε^{-3})$, outperforming all the previously known convergence bounds of such algorithms. In order to overcome the divergence due to deadly triad in off-policy policy evaluation under function approximation, we develop a critic… ▽ More

    Submitted 12 April, 2022; v1 submitted 26 May, 2021; originally announced May 2021.

  6. arXiv:2105.01424  [pdf, other

    cs.LG

    On the Linear convergence of Natural Policy Gradient Algorithm

    Authors: Sajad Khodadadian, Prakirt Raj Jhunjhunwala, Sushil Mahavir Varma, Siva Theja Maguluri

    Abstract: Markov Decision Processes are classically solved using Value Iteration and Policy Iteration algorithms. Recent interest in Reinforcement Learning has motivated the study of methods inspired by optimization, such as gradient ascent. Among these, a popular algorithm is the Natural Policy Gradient, which is a mirror descent variant for MDPs. This algorithm forms the basis of several popular Reinforce… ▽ More

    Submitted 4 May, 2021; originally announced May 2021.

    Comments: 19 pages, 1 figure, A version of this paper was first submitted to a conference in Mar 2021

  7. arXiv:2102.09318  [pdf, other

    cs.LG math.OC stat.ML

    Finite-Sample Analysis of Off-Policy Natural Actor-Critic Algorithm

    Authors: Sajad Khodadadian, Zaiwei Chen, Siva Theja Maguluri

    Abstract: In this paper, we provide finite-sample convergence guarantees for an off-policy variant of the natural actor-critic (NAC) algorithm based on Importance Sampling. In particular, we show that the algorithm converges to a global optimal policy with a sample complexity of $\mathcal{O}(ε^{-3}\log^2(1/ε))$ under an appropriate choice of stepsizes. In order to overcome the issue of large variance due to… ▽ More

    Submitted 10 June, 2021; v1 submitted 18 February, 2021; originally announced February 2021.

  8. arXiv:2102.01867  [pdf, other

    cs.LG cs.AI cs.IT

    Impact of Data Processing on Fairness in Supervised Learning

    Authors: Sajad Khodadadian, AmirEmad Ghassami, Negar Kiyavash

    Abstract: We study the impact of pre and post processing for reducing discrimination in data-driven decision makers. We first analyze the fundamental trade-off between fairness and accuracy in a pre-processing approach, and propose a design for a pre-processing module based on a convex optimization program, which can be added before the original classifier. This leads to a fundamental lower bound on attaina… ▽ More

    Submitted 2 February, 2021; originally announced February 2021.

    Comments: 18 pages, 4 figures

  9. arXiv:2101.10506  [pdf, other

    cs.LG

    Finite Sample Analysis of Two-Time-Scale Natural Actor-Critic Algorithm

    Authors: Sajad Khodadadian, Thinh T. Doan, Justin Romberg, Siva Theja Maguluri

    Abstract: Actor-critic style two-time-scale algorithms are one of the most popular methods in reinforcement learning, and have seen great empirical success. However, their performance is not completely understood theoretically. In this paper, we characterize the \emph{global} convergence of an online natural actor-critic algorithm in the tabular setting using a single trajectory of samples. Our analysis app… ▽ More

    Submitted 20 February, 2022; v1 submitted 25 January, 2021; originally announced January 2021.

    Comments: 28 pages, 2 figures

  10. arXiv:1801.04378  [pdf, other

    cs.LG cs.AI cs.IT stat.ML

    Fairness in Supervised Learning: An Information Theoretic Approach

    Authors: AmirEmad Ghassami, Sajad Khodadadian, Negar Kiyavash

    Abstract: Automated decision making systems are increasingly being used in real-world applications. In these systems for the most part, the decision rules are derived by minimizing the training error on the available historical data. Therefore, if there is a bias related to a sensitive attribute such as gender, race, religion, etc. in the data, say, due to cultural/historical discriminatory practices agains… ▽ More

    Submitted 29 July, 2018; v1 submitted 12 January, 2018; originally announced January 2018.