Skip to main content

Showing 1–2 of 2 results for author: Pati, D

Searching in archive eess. Search in all archives.
.
  1. arXiv:2309.06349  [pdf, other

    stat.ML cs.LG eess.SY math.OC math.ST

    Generalized Regret Analysis of Thompson Sampling using Fractional Posteriors

    Authors: Prateek Jaiswal, Debdeep Pati, Anirban Bhattacharya, Bani K. Mallick

    Abstract: Thompson sampling (TS) is one of the most popular and earliest algorithms to solve stochastic multi-armed bandit problems. We consider a variant of TS, named $α$-TS, where we use a fractional or $α$-posterior ($α\in(0,1)$) instead of the standard posterior distribution. To compute an $α$-posterior, the likelihood in the definition of the standard posterior is tempered with a factor $α$. For $α$-TS… ▽ More

    Submitted 12 September, 2023; originally announced September 2023.

  2. arXiv:1910.02052  [pdf, other

    eess.SP cs.AI cs.LG

    AI Assisted Annotator using Reinforcement Learning

    Authors: V. Ratna Saripalli, Gopal Avinash, Dibyajyoti Pati, Michael Potter, Charles W. Anderson

    Abstract: Healthcare data suffers from both noise and lack of ground truth. The cost of data increases as it is cleaned and annotated in healthcare. Unlike other data sets, medical data annotation, which is critical to accurate ground truth, requires medical domain expertise for a better patient outcome. In this work, we report on the use of reinforcement learning to mimic the decision making process of ann… ▽ More

    Submitted 11 June, 2020; v1 submitted 2 October, 2019; originally announced October 2019.

    Comments: 10 pages