Showing 1–1 of 1 results for author: Jurkiewicz, K

  1. arXiv:2409.03046  [pdf, other]

    cs.CL

    Oddballness: universal anomaly detection with language models

    Authors: Filip Graliński, Ryszard Staruch, Krzysztof Jurkiewicz

    Abstract: We present a new method to detect anomalies in texts (in general: in sequences of any data), using language models, in a totally unsupervised manner. The method considers probabilities (likelihoods) generated by a language model, but instead of focusing on low-likelihood tokens, it considers a new metric introduced in this paper: oddballness. Oddballness measures how "strange" a given token is a…

    Submitted 4 September, 2024; originally announced September 2024.