Skip to main content

Showing 1–2 of 2 results for author: Tatlı, M

Searching in archive stat. Search in all archives.
.
  1. arXiv:2504.20877  [pdf, other

    stat.ML cs.LG

    Preference-centric Bandits: Optimality of Mixtures and Regret-efficient Algorithms

    Authors: Meltem Tatlı, Arpan Mukherjee, Prashanth L. A., Karthikeyan Shanmugam, Ali Tajer

    Abstract: The objective of canonical multi-armed bandits is to identify and repeatedly select an arm with the largest reward, often in the form of the expected value of the arm's probability distribution. Such a utilitarian perspective and focus on the probability models' first moments, however, is agnostic to the distributions' tail behavior and their implications for variability and risks in decision-maki… ▽ More

    Submitted 30 April, 2025; v1 submitted 29 April, 2025; originally announced April 2025.

    Comments: An earlier version of this manuscript, which focused on risk-sensitive bandits, has appeared in the Proceedings of the 2025 International Conference on Artificial Intelligence and Statistics (AISTATS)

  2. arXiv:2503.08896  [pdf, other

    stat.ML cs.LG

    Risk-sensitive Bandits: Arm Mixture Optimality and Regret-efficient Algorithms

    Authors: Meltem Tatlı, Arpan Mukherjee, Prashanth L. A., Karthikeyan Shanmugam, Ali Tajer

    Abstract: This paper introduces a general framework for risk-sensitive bandits that integrates the notions of risk-sensitive objectives by adopting a rich class of distortion riskmetrics. The introduced framework subsumes the various existing risk-sensitive models. An important and hitherto unknown observation is that for a wide range of riskmetrics, the optimal bandit policy involves selecting a mixture of… ▽ More

    Submitted 11 March, 2025; originally announced March 2025.

    Comments: AISTATS 2025