Skip to main content

Showing 1–5 of 5 results for author: Lichtenberg, J M

Searching in archive cs. Search in all archives.
.
  1. arXiv:2408.09168  [pdf, other

    cs.IR cs.AI cs.LG

    Ranking Across Different Content Types: The Robust Beauty of Multinomial Blending

    Authors: Jan Malte Lichtenberg, Giuseppe Di Benedetto, Matteo Ruffini

    Abstract: An increasing number of media streaming services have expanded their offerings to include entities of multiple content types. For instance, audio streaming services that started by offering music only, now also offer podcasts, merchandise items, and videos. Ranking items across different content types into a single slate poses a significant challenge for traditional learning-to-rank (LTR) algorith… ▽ More

    Submitted 17 August, 2024; originally announced August 2024.

    Comments: To appear in 18th ACM Conference on Recommender Systems (RecSys24), Bari, Italy. ACM, New York, NY, USA, 3 pages

  2. arXiv:2406.01285  [pdf, other

    cs.IR cs.AI cs.LG

    Large Language Models as Recommender Systems: A Study of Popularity Bias

    Authors: Jan Malte Lichtenberg, Alexander Buchholz, Pola Schwöbel

    Abstract: The issue of popularity bias -- where popular items are disproportionately recommended, overshadowing less popular but potentially relevant items -- remains a significant challenge in recommender systems. Recent advancements have seen the integration of general-purpose Large Language Models (LLMs) into the architecture of such systems. This integration raises concerns that it might exacerbate popu… ▽ More

    Submitted 3 June, 2024; originally announced June 2024.

    Comments: Accepted at Gen-IR@SIGIR24 workshop

  3. arXiv:2309.01120  [pdf, other

    cs.LG

    Double Clipping: Less-Biased Variance Reduction in Off-Policy Evaluation

    Authors: Jan Malte Lichtenberg, Alexander Buchholz, Giuseppe Di Benedetto, Matteo Ruffini, Ben London

    Abstract: "Clipping" (a.k.a. importance weight truncation) is a widely used variance-reduction technique for counterfactual off-policy estimators. Like other variance-reduction techniques, clipping reduces variance at the cost of increased bias. However, unlike other techniques, the bias introduced by clipping is always a downward bias (assuming non-negative rewards), yielding a lower bound on the true expe… ▽ More

    Submitted 3 September, 2023; originally announced September 2023.

    Comments: Presented at CONSEQUENCES '23 workshop at RecSys 2023 conference in Singapore

  4. arXiv:2205.06024  [pdf, other

    stat.ML cs.IR cs.LG stat.CO

    Low-variance estimation in the Plackett-Luce model via quasi-Monte Carlo sampling

    Authors: Alexander Buchholz, Jan Malte Lichtenberg, Giuseppe Di Benedetto, Yannik Stein, Vito Bellini, Matteo Ruffini

    Abstract: The Plackett-Luce (PL) model is ubiquitous in learning-to-rank (LTR) because it provides a useful and intuitive probabilistic model for sampling ranked lists. Counterfactual offline evaluation and optimization of ranking metrics are pivotal for using LTR methods in production. When adopting the PL model as a ranking policy, both tasks require the computation of expectations with respect to the mod… ▽ More

    Submitted 12 May, 2022; originally announced May 2022.

  5. arXiv:1912.02532  [pdf, other

    cs.LG cs.AI stat.ML

    Iterative Policy-Space Expansion in Reinforcement Learning

    Authors: Jan Malte Lichtenberg, Özgür Şimşek

    Abstract: Humans and animals solve a difficult problem much more easily when they are presented with a sequence of problems that starts simple and slowly increases in difficulty. We explore this idea in the context of reinforcement learning. Rather than providing the agent with an externally provided curriculum of progressively more difficult tasks, the agent solves a single task utilizing a decreasingly co… ▽ More

    Submitted 5 December, 2019; originally announced December 2019.

    Comments: Workshop on Biological and Artificial Reinforcement Learning at the 33rd Conference on Neural Information Processing Systems (NeurIPS 2019), Vancouver, Canada