Showing 1–5 of 5 results for author: Reddy, K S

Searching in archive stat.
  1. arXiv:2502.07199  [pdf, ps, other]

    cs.LG cs.IT math.ST stat.ML

    Fixed-Confidence Best Arm Identification with Decreasing Variance

    Authors: Tamojeet Roychowdhury, Kota Srinivas Reddy, Krishna P Jagannathan, Sharayu Moharir

    Abstract: We focus on the problem of best-arm identification in a stochastic multi-arm bandit with temporally decreasing variances for the arms' rewards. We model arm rewards as Gaussian random variables with fixed means and variances that decrease with time. The cost incurred by the learner is modeled as a weighted sum of the time needed by the learner to identify the best arm, and the number of samples of…

    Submitted 10 February, 2025; originally announced February 2025.

    Comments: 6 pages, 2 figures, accepted at the National Conference on Communications 2025

  2. arXiv:2305.06082  [pdf, ps, other]

    cs.LG cs.AI cs.IT math.ST stat.ML

    Best Arm Identification in Bandits with Limited Precision Sampling

    Authors: Kota Srinivas Reddy, P. N. Karthik, Nikhil Karamchandani, Jayakrishnan Nair

    Abstract: We study best arm identification in a variant of the multi-armed bandit problem where the learner has limited precision in arm selection. The learner can only sample arms via certain exploration bundles, which we refer to as boxes. In particular, at each sampling epoch, the learner selects a box, which in turn causes an arm to get pulled as per a box-specific probability distribution. The pulled a…

    Submitted 10 May, 2023; originally announced May 2023.

    Comments: ISIT 2023

  3. arXiv:2208.09215  [pdf, other]

    cs.LG cs.IT math.ST stat.ML

    Almost Cost-Free Communication in Federated Best Arm Identification

    Authors: Kota Srinivas Reddy, P. N. Karthik, Vincent Y. F. Tan

    Abstract: We study the problem of best arm identification in a federated learning multi-armed bandit setup with a central server and multiple clients. Each client is associated with a multi-armed bandit in which each arm yields i.i.d. rewards following a Gaussian distribution with an unknown mean and known variance. The set of arms is assumed to be the same at all the clients. We define two notions o…

    Submitted 19 December, 2022; v1 submitted 19 August, 2022; originally announced August 2022.

    Comments: Accepted to AAAI 2023

  4. arXiv:2203.15236  [pdf, ps, other]

    stat.ML cs.IT cs.LG

    Best Arm Identification in Restless Markov Multi-Armed Bandits

    Authors: P. N. Karthik, Kota Srinivas Reddy, Vincent Y. F. Tan

    Abstract: We study the problem of identifying the best arm in a multi-armed bandit environment when each arm is a time-homogeneous and ergodic discrete-time Markov process on a common, finite state space. The state evolution on each arm is governed by the arm's transition probability matrix (TPM). A decision entity that knows the set of arm TPMs but not the exact mapping of the TPMs to the arms, wishes to f…

    Submitted 29 March, 2022; originally announced March 2022.

    Comments: 41 pages

  5. arXiv:2005.14425  [pdf, other]

    cs.IT cs.DS cs.LG stat.ML

    Query complexity of heavy hitter estimation

    Authors: Sahasrajit Sarmasarkar, Kota Srinivas Reddy, Nikhil Karamchandani

    Abstract: We consider the problem of identifying the subset $\mathcal{S}^{\gamma}_{\mathcal{P}}$ of elements in the support of an underlying distribution $\mathcal{P}$ whose probability value is larger than a given threshold $\gamma$, by actively querying an oracle to gain information about a sequence $X_1, X_2, \ldots$ of i.i.d. samples drawn from $\mathcal{P}$. We consider two query models: (a) each query is an i…

    Submitted 10 February, 2021; v1 submitted 29 May, 2020; originally announced May 2020.