Skip to main content

Showing 1–6 of 6 results for author: Grover, D

Searching in archive cs. Search in all archives.
.
  1. arXiv:2302.10831  [pdf, other

    cs.LG stat.ML

    Minimax-Bayes Reinforcement Learning

    Authors: Thomas Kleine Buening, Christos Dimitrakakis, Hannes Eriksson, Divya Grover, Emilio Jorge

    Abstract: While the Bayesian decision-theoretic framework offers an elegant solution to the problem of decision making under uncertainty, one question is how to appropriately select the prior distribution. One idea is to employ a worst-case prior. However, this is not as easy to specify in sequential decision making as in simple statistical estimation problems. This paper studies (sometimes approximate) min… ▽ More

    Submitted 21 February, 2023; originally announced February 2023.

  2. arXiv:2104.07276  [pdf, ps, other

    cs.AI

    Adaptive Belief Discretization for POMDP Planning

    Authors: Divya Grover, Christos Dimitrakakis

    Abstract: Partially Observable Markov Decision Processes (POMDP) is a widely used model to represent the interaction of an environment and an agent, under state uncertainty. Since the agent does not observe the environment state, its uncertainty is typically represented through a probabilistic belief. While the set of possible beliefs is infinite, making exact planning intractable, the belief space's comple… ▽ More

    Submitted 15 April, 2021; originally announced April 2021.

  3. arXiv:2005.13781  [pdf, other

    eess.SP cs.RO

    A Maneuver-based Urban Driving Dataset and Model for Cooperative Vehicle Applications

    Authors: Behrad Toghi, Divas Grover, Mahdi Razzaghpour, Rajat Jain, Rodolfo Valiente, Mahdi Zaman, Ghayoor Shah, Yaser P. Fallah

    Abstract: Short-term future of automated driving can be imagined as a hybrid scenario in which both automated and human-driven vehicles co-exist in the same environment. In order to address the needs of such road configuration, many technology solutions such as vehicular communication and predictive control for automated vehicles have been introduced in the literature. Both aforementioned solutions rely on… ▽ More

    Submitted 21 August, 2020; v1 submitted 28 May, 2020; originally announced May 2020.

    Comments: Accepted to IEEE Connected and Automated Vehicle Symposium (IEEE CAVS 2020)

  4. arXiv:2002.03098  [pdf, other

    cs.LG stat.ML

    Inferential Induction: A Novel Framework for Bayesian Reinforcement Learning

    Authors: Hannes Eriksson, Emilio Jorge, Christos Dimitrakakis, Debabrota Basu, Divya Grover

    Abstract: Bayesian reinforcement learning (BRL) offers a decision-theoretic solution for reinforcement learning. While "model-based" BRL algorithms have focused either on maintaining a posterior distribution on models or value functions and combining this with approximate dynamic programming or tree search, previous Bayesian "model-free" value function distribution approaches implicitly make strong assumpti… ▽ More

    Submitted 1 July, 2020; v1 submitted 8 February, 2020; originally announced February 2020.

    Comments: 28 pages, 12 figures

  5. arXiv:1902.02661  [pdf, other

    cs.LG cs.AI stat.ML

    Bayesian Reinforcement Learning via Deep, Sparse Sampling

    Authors: Divya Grover, Debabrota Basu, Christos Dimitrakakis

    Abstract: We address the problem of Bayesian reinforcement learning using efficient model-based online planning. We propose an optimism-free Bayes-adaptive algorithm to induce deeper and sparser exploration with a theoretical bound on its performance relative to the Bayes optimal policy, with a lower computational complexity. The main novelty is the use of a candidate policy generator, to generate long-term… ▽ More

    Submitted 27 June, 2020; v1 submitted 7 February, 2019; originally announced February 2019.

    Comments: Published in AISTATS 2020

  6. arXiv:1809.06846  [pdf, other

    cs.CV

    MNIST Dataset Classification Utilizing k-NN Classifier with Modified Sliding-window Metric

    Authors: Divas Grover, Behrad Toghi

    Abstract: The MNIST dataset of the handwritten digits is known as one of the commonly used datasets for machine learning and computer vision research. We aim to study a widely applicable classification problem and apply a simple yet efficient K-nearest neighbor classifier with an enhanced heuristic. We evaluate the performance of the K-nearest neighbor classification algorithm on the MNIST dataset where the… ▽ More

    Submitted 12 March, 2019; v1 submitted 18 September, 2018; originally announced September 2018.

    Comments: Accepted to the Computer Vision Conference (CVC2019), Las Vegas, NV