Skip to main content

Showing 1–7 of 7 results for author: Di Castro, D

Searching in archive stat. Search in all archives.
.
  1. arXiv:2303.15827  [pdf, other

    cs.LG math.NA stat.ML

    CONFIDE: Contextual Finite Differences Modelling of PDEs

    Authors: Ori Linial, Orly Avner, Dotan Di Castro

    Abstract: We introduce a method for inferring an explicit PDE from a data sample generated by previously unseen dynamics, based on a learned context. The training phase integrates knowledge of the form of the equation with a differential scheme, while the inference phase yields a PDE that fits the data sample and enables both signal prediction and data explanation. We include results of extensive experiment… ▽ More

    Submitted 7 June, 2024; v1 submitted 28 March, 2023; originally announced March 2023.

  2. arXiv:2110.00445  [pdf, ps, other

    stat.ML cs.LG

    Sim and Real: Better Together

    Authors: Shirli Di Castro Shashua, Dotan Di Castro, Shie Mannor

    Abstract: Simulation is used extensively in autonomous systems, particularly in robotic manipulation. By far, the most common approach is to train a controller in simulation, and then use it as an initial starting point for the real system. We demonstrate how to learn simultaneously from both simulation and interaction with the real environment. We propose an algorithm for balancing the large number of samp… ▽ More

    Submitted 5 October, 2021; v1 submitted 1 October, 2021; originally announced October 2021.

  3. arXiv:1908.08379  [pdf, other

    cs.LG stat.ML

    Practical Risk Measures in Reinforcement Learning

    Authors: Dotan Di Castro, Joel Oren, Shie Mannor

    Abstract: Practical application of Reinforcement Learning (RL) often involves risk considerations. We study a generalized approximation scheme for risk measures, based on Monte-Carlo simulations, where the risk measures need not necessarily be \emph{coherent}. We demonstrate that, even in simple problems, measures such as the variance of the reward-to-go do not capture the risk in a satisfactory manner. In… ▽ More

    Submitted 22 August, 2019; originally announced August 2019.

  4. arXiv:1607.01381  [pdf, other

    stat.ML cs.AI cs.IR

    One-Shot Session Recommendation Systems with Combinatorial Items

    Authors: Yahel David, Dotan Di Castro, Zohar Karnin

    Abstract: In recent years, content recommendation systems in large websites (or \emph{content providers}) capture an increased focus. While the type of content varies, e.g.\ movies, articles, music, advertisements, etc., the high level problem remains the same. Based on knowledge obtained so far on the user, recommend the most desired content. In this paper we present a method to handle the well known user-… ▽ More

    Submitted 5 July, 2016; originally announced July 2016.

  5. arXiv:1502.02259  [pdf, other

    stat.ML cs.LG

    Contextual Markov Decision Processes

    Authors: Assaf Hallak, Dotan Di Castro, Shie Mannor

    Abstract: We consider a planning problem where the dynamics and rewards of the environment depend on a hidden static parameter referred to as the context. The objective is to learn a strategy that maximizes the accumulated reward across all contexts. The new model, called Contextual Markov Decision Process (CMDP), can model a customer's behavior when interacting with a website (the learner). The customer's… ▽ More

    Submitted 8 February, 2015; originally announced February 2015.

  6. arXiv:1301.0104  [pdf, other

    cs.LG stat.ML

    Policy Evaluation with Variance Related Risk Criteria in Markov Decision Processes

    Authors: Aviv Tamar, Dotan Di Castro, Shie Mannor

    Abstract: In this paper we extend temporal difference policy evaluation algorithms to performance criteria that include the variance of the cumulative reward. Such criteria are useful for risk management, and are important in domains such as finance and process control. We propose both TD(0) and LSTD(lambda) variants with linear function approximation, prove their convergence, and demonstrate their utility… ▽ More

    Submitted 1 January, 2013; originally announced January 2013.

    Journal ref: JMLR Workshop and Conference Proceedings 28 (3): 495-503, 2013

  7. arXiv:1206.6404  [pdf

    cs.LG cs.CY math.OC stat.ML

    Policy Gradients with Variance Related Risk Criteria

    Authors: Dotan Di Castro, Aviv Tamar, Shie Mannor

    Abstract: Managing risk in dynamic decision problems is of cardinal importance in many fields such as finance and process control. The most common approach to defining risk is through various variance related criteria such as the Sharpe Ratio or the standard deviation adjusted reward. It is known that optimizing many of the variance related risk criteria is NP-hard. In this paper we devise a framework for l… ▽ More

    Submitted 27 June, 2012; originally announced June 2012.

    Comments: Appears in Proceedings of the 29th International Conference on Machine Learning (ICML 2012)