Skip to main content

Showing 1–3 of 3 results for author: Szyndler, K

Searching in archive cs. Search in all archives.
.
  1. arXiv:2505.13496  [pdf, other

    cs.AI cs.CL cs.LG

    ADALog: Adaptive Unsupervised Anomaly detection in Logs with Self-attention Masked Language Model

    Authors: Przemek Pospieszny, Wojciech Mormul, Karolina Szyndler, Sanjeev Kumar

    Abstract: Modern software systems generate extensive heterogeneous log data with dynamic formats, fragmented event sequences, and varying temporal patterns, making anomaly detection both crucial and challenging. To address these complexities, we propose ADALog, an adaptive, unsupervised anomaly detection framework designed for practical applicability across diverse real-world environments. Unlike traditiona… ▽ More

    Submitted 15 May, 2025; originally announced May 2025.

    Comments: Conference paper accepted at ICMLT 2025; to appear in the IEEE Conference Proceedings

    ACM Class: I.2.6; I.2.7; I.5.1; C.2.4

  2. arXiv:2408.04632  [pdf, other

    cs.CL cs.CV

    Arctic-TILT. Business Document Understanding at Sub-Billion Scale

    Authors: Łukasz Borchmann, Michał Pietruszka, Wojciech Jaśkowski, Dawid Jurkiewicz, Piotr Halama, Paweł Józiak, Łukasz Garncarek, Paweł Liskowski, Karolina Szyndler, Andrzej Gretkowski, Julita Ołtusek, Gabriela Nowakowska, Artur Zawłocki, Łukasz Duhr, Paweł Dyda, Michał Turski

    Abstract: The vast portion of workloads employing LLMs involves answering questions grounded on PDF or scan content. We introduce the Arctic-TILT achieving accuracy on par with models 1000$\times$ its size on these use cases. It can be fine-tuned and deployed on a single 24GB GPU, lowering operational costs while processing Visually Rich Documents with up to 400k tokens. The model establishes state-of-the-a… ▽ More

    Submitted 8 August, 2024; originally announced August 2024.

  3. arXiv:2206.04045  [pdf, other

    cs.CL cs.LG

    STable: Table Generation Framework for Encoder-Decoder Models

    Authors: Michał Pietruszka, Michał Turski, Łukasz Borchmann, Tomasz Dwojak, Gabriela Pałka, Karolina Szyndler, Dawid Jurkiewicz, Łukasz Garncarek

    Abstract: The output structure of database-like tables, consisting of values structured in horizontal rows and vertical columns identifiable by name, can cover a wide range of NLP tasks. Following this constatation, we propose a framework for text-to-table neural models applicable to problems such as extraction of line items, joint entity and relation extraction, or knowledge base population. The permutatio… ▽ More

    Submitted 12 October, 2022; v1 submitted 8 June, 2022; originally announced June 2022.