Showing 1–1 of 1 results for author: Borec, L

  1. arXiv:2408.16345  [pdf, other]

    cs.CL

    The Unreasonable Ineffectiveness of Nucleus Sampling on Mitigating Text Memorization

    Authors: Luka Borec, Philipp Sadler, David Schlangen

    Abstract: This work analyses the text memorization behavior of large language models (LLMs) when subjected to nucleus sampling. Stochastic decoding methods like nucleus sampling are typically applied to overcome issues such as monotonous and repetitive text generation, which are often observed with maximization-based decoding techniques. We hypothesize that nucleus sampling might also reduce the occurrence…

    Submitted 29 August, 2024; originally announced August 2024.

    Comments: 9 pages, Accepted at INLG 2024 (International Natural Language Generation Conference)