Skip to main content

Showing 1–1 of 1 results for author: Osborne, K

Searching in archive cs. Search in all archives.
.
  1. arXiv:1905.02219  [pdf, other

    cs.LG stat.ML

    Lessons from Contextual Bandit Learning in a Customer Support Bot

    Authors: Nikos Karampatziakis, Sebastian Kochman, Jade Huang, Paul Mineiro, Kathy Osborne, Weizhu Chen

    Abstract: In this work, we describe practical lessons we have learned from successfully using contextual bandits (CBs) to improve key business metrics of the Microsoft Virtual Agent for customer support. While our current use cases focus on single step einforcement learning (RL) and mostly in the domain of natural language processing and information retrieval we believe many of our findings are generally ap… ▽ More

    Submitted 18 June, 2019; v1 submitted 6 May, 2019; originally announced May 2019.

    Comments: Reinforcement Learning for Real Life Workshop