Showing 1–1 of 1 results for author: Kondamudi, S S

Search v0.5.6 released 2020-02-24

arXiv:1906.11245 [pdf, other]

cs.LG cs.AI

doi 10.1109/ICoIAS.2019.00018

A Tractable Algorithm For Finite-Horizon Continuous Reinforcement Learning

Authors: Phanideep Gampa, Sairam Satwik Kondamudi, Lakshmanan Kailasam

Abstract: We consider the finite horizon continuous reinforcement learning problem. Our contribution is three-fold. First,we give a tractable algorithm based on optimistic value iteration for the problem. Next,we give a lower bound on regret of order $Ω(T^{2/3})$ for any algorithm discretizes the state space, improving the previous regret bound of $Ω(T^{1/2})$ of Ortner and Ryabko \cite{contrl} for the same… ▽ More We consider the finite horizon continuous reinforcement learning problem. Our contribution is three-fold. First,we give a tractable algorithm based on optimistic value iteration for the problem. Next,we give a lower bound on regret of order $Ω(T^{2/3})$ for any algorithm discretizes the state space, improving the previous regret bound of $Ω(T^{1/2})$ of Ortner and Ryabko \cite{contrl} for the same problem. Next,under the assumption that the rewards and transitions are Hölder Continuous we show that the upper bound on the discretization error is $const.Ln^{-α}T$. Finally,we give some simple experiments to validate our propositions. △ Less

Submitted 26 June, 2019; originally announced June 2019.

Comments: InProceedings of International Conference on Intelligent Autonomous System, ICOIAS 2019

Search v0.5.6 released 2020-02-24