Showing 1–4 of 4 results for author: Haque, S u

  1. arXiv:2503.18391  [pdf, ps, other]

    cs.LG eess.SY math.OC stat.ML

    Finite-Time Bounds for Two-Time-Scale Stochastic Approximation with Arbitrary Norm Contractions and Markovian Noise

    Authors: Siddharth Chandak, Shaan Ul Haque, Nicholas Bambos

    Abstract: Two-time-scale Stochastic Approximation (SA) is an iterative algorithm with applications in reinforcement learning and optimization. Prior finite-time analysis of such algorithms has focused on fixed-point iterations with mappings contractive under the Euclidean norm. Motivated by applications in reinforcement learning, we give the first mean square bound on nonlinear two-time-scale SA where the iter…

    Submitted 24 March, 2025; originally announced March 2025.

    Comments: Submitted to IEEE Conference on Decision and Control (CDC) 2025

  2. arXiv:2502.14208  [pdf, ps, other]

    cs.LG math.OC stat.ML

    A Non-Asymptotic Theory of Seminorm Lyapunov Stability: From Deterministic to Stochastic Iterative Algorithms

    Authors: Zaiwei Chen, Sheng Zhang, Zhe Zhang, Shaan Ul Haque, Siva Theja Maguluri

    Abstract: We study the problem of solving fixed-point equations for seminorm-contractive operators and establish foundational results on the non-asymptotic behavior of iterative algorithms in both deterministic and stochastic settings. Specifically, in the deterministic setting, we prove a fixed-point theorem for seminorm-contractive operators, showing that iterates converge geometrically to the kernel of t…

    Submitted 19 February, 2025; originally announced February 2025.

  3. arXiv:2206.03328  [pdf, other]

    cs.LG math.OC stat.ML

    Concentration bounds for SSP Q-learning for average cost MDPs

    Authors: Shaan Ul Haque, Vivek Borkar

    Abstract: We derive a concentration bound for a Q-learning algorithm for average cost Markov decision processes based on an equivalent shortest path problem, and compare it numerically with the alternative scheme based on relative value iteration.

    Submitted 12 June, 2022; v1 submitted 7 June, 2022; originally announced June 2022.

    Comments: 6 pages, 2 figures

  4. arXiv:2203.01667  [pdf, other]

    cs.LG stat.ML

    Joint Probability Estimation Using Tensor Decomposition and Dictionaries

    Authors: Shaan ul Haque, Ajit Rajwade, Karthik S. Gurumoorthy

    Abstract: In this work, we study non-parametric estimation of joint probabilities of a given set of discrete and continuous random variables from their (empirically estimated) 2D marginals, under the assumption that the joint probability could be decomposed and approximated by a mixture of product densities/mass functions. The problem of estimating the joint probability density function (PDF) using semi-par…

    Submitted 3 March, 2022; originally announced March 2022.