Skip to main content

Showing 1–2 of 2 results for author: Shoji, H

Searching in archive stat. Search in all archives.
.
  1. arXiv:2506.00446  [pdf, ps, other

    stat.ML cs.LG

    Off-Policy Evaluation of Ranking Policies via Embedding-Space User Behavior Modeling

    Authors: Tatsuki Takahashi, Chihiro Maru, Hiroko Shoji

    Abstract: Off-policy evaluation (OPE) in ranking settings with large ranking action spaces, which stems from an increase in both the number of unique actions and length of the ranking, is essential for assessing new recommender policies using only logged bandit data from previous versions. To address the high variance issues associated with existing estimators, we introduce two new assumptions: no direct ef… ▽ More

    Submitted 31 May, 2025; originally announced June 2025.

  2. arXiv:2502.08993  [pdf, other

    stat.ML cs.LG

    Off-Policy Evaluation for Recommendations with Missing-Not-At-Random Rewards

    Authors: Tatsuki Takahashi, Chihiro Maru, Hiroko Shoji

    Abstract: Unbiased recommender learning (URL) and off-policy evaluation/learning (OPE/L) techniques are effective in addressing the data bias caused by display position and logging policies, thereby consistently improving the performance of recommendations. However, when both bias exits in the logged data, these estimators may suffer from significant bias. In this study, we first analyze the position bias o… ▽ More

    Submitted 13 February, 2025; originally announced February 2025.

    Comments: 4pages