Skip to main content

Showing 1–22 of 22 results for author: Stefanik, M

Searching in archive cs. Search in all archives.
.
  1. arXiv:2506.15903  [pdf, other

    cs.LG

    VectorEdits: A Dataset and Benchmark for Instruction-Based Editing of Vector Graphics

    Authors: Josef Kuchař, Marek Kadlčík, Michal Spiegel, Michal Štefánik

    Abstract: We introduce a large-scale dataset for instruction-guided vector image editing, consisting of over 270,000 pairs of SVG images paired with natural language edit instructions. Our dataset enables training and evaluation of models that modify vector graphics based on textual commands. We describe the data collection process, including image pairing via CLIP similarity and instruction generation with… ▽ More

    Submitted 18 June, 2025; originally announced June 2025.

  2. arXiv:2506.08966  [pdf, other

    cs.CL cs.LG cs.NE

    Pre-trained Language Models Learn Remarkably Accurate Representations of Numbers

    Authors: Marek Kadlčík, Michal Štefánik, Timothee Mickus, Michal Spiegel, Josef Kuchař

    Abstract: Pretrained language models (LMs) are prone to arithmetic errors. Existing work showed limited success in probing numeric values from models' representations, indicating that these errors can be attributed to the inherent unreliability of distributionally learned embeddings in representing exact quantities. However, we observe that previous probing methods are inadequate for the emergent structure… ▽ More

    Submitted 10 June, 2025; originally announced June 2025.

  3. arXiv:2503.22395  [pdf, ps, other

    cs.CL

    Negation: A Pink Elephant in the Large Language Models' Room?

    Authors: Tereza Vrabcová, Marek Kadlčík, Petr Sojka, Michal Štefánik, Michal Spiegel

    Abstract: Negations are key to determining sentence meaning, making them essential for logical reasoning. Despite their importance, negations pose a substantial challenge for large language models (LLMs) and remain underexplored. We constructed and published two new textual entailment datasets NoFEVER-ML and NoSNLI-ML in four languages (English, Czech, German, and Ukrainian) with examples differing in n… ▽ More

    Submitted 3 June, 2025; v1 submitted 28 March, 2025; originally announced March 2025.

  4. arXiv:2503.01909  [pdf, other

    cs.LG cs.AI

    Attend or Perish: Benchmarking Attention in Algorithmic Reasoning

    Authors: Michal Spiegel, Michal Štefánik, Marek Kadlčík, Josef Kuchař

    Abstract: Can transformers learn to perform algorithmic tasks reliably across previously unseen input/output domains? While pre-trained language models show solid accuracy on benchmarks incorporating algorithmic reasoning, assessing the reliability of these results necessitates an ability to cleanse models' functional capabilities from memorization. In this paper, we propose an algorithmic benchmark compris… ▽ More

    Submitted 28 February, 2025; originally announced March 2025.

  5. arXiv:2412.17933  [pdf, other

    cs.CL cs.AI

    BenCzechMark : A Czech-centric Multitask and Multimetric Benchmark for Large Language Models with Duel Scoring Mechanism

    Authors: Martin Fajcik, Martin Docekal, Jan Dolezal, Karel Ondrej, Karel Beneš, Jan Kapsa, Pavel Smrz, Alexander Polok, Michal Hradis, Zuzana Neverilova, Ales Horak, Radoslav Sabol, Michal Stefanik, Adam Jirkovsky, David Adamczyk, Petr Hyner, Jan Hula, Hynek Kydlicek

    Abstract: We present BenCzechMark (BCM), the first comprehensive Czech language benchmark designed for large language models, offering diverse tasks, multiple task formats, and multiple evaluation metrics. Its duel scoring system is grounded in statistical significance theory and uses aggregation across tasks inspired by social preference theory. Our benchmark encompasses 50 challenging tasks, with correspo… ▽ More

    Submitted 22 May, 2025; v1 submitted 23 December, 2024; originally announced December 2024.

    Comments: Accepted to TACL

  6. arXiv:2411.12921  [pdf, other

    cs.IR cs.AI

    A Comparative Study of Text Retrieval Models on DaReCzech

    Authors: Jakub Stetina, Martin Fajcik, Michal Stefanik, Michal Hradis

    Abstract: This article presents a comprehensive evaluation of 7 off-the-shelf document retrieval models: Splade, Plaid, Plaid-X, SimCSE, Contriever, OpenAI ADA and Gemma2 chosen to determine their performance on the Czech retrieval dataset DaReCzech. The primary objective of our experiments is to estimate the quality of modern retrieval approaches in the Czech language. Our analyses include retrieval qualit… ▽ More

    Submitted 20 December, 2024; v1 submitted 19 November, 2024; originally announced November 2024.

  7. arXiv:2407.08400  [pdf, other

    cs.CL cs.AI

    Self-training Language Models for Arithmetic Reasoning

    Authors: Marek Kadlčík, Michal Štefánik

    Abstract: Recent language models achieve impressive results in tasks involving complex multistep reasoning, but scaling these capabilities further traditionally requires expensive collection of more annotated data. In this work, we explore the potential of improving models' reasoning capabilities without new data, merely using automated feedback to the validity of their predictions in arithmetic reasoning (… ▽ More

    Submitted 23 October, 2024; v1 submitted 11 July, 2024; originally announced July 2024.

    Comments: To appear in Findings of EMNLP 2024. Reproducible implementations and references to resulting models can be found on https://github.com/prompteus/calc-x

  8. arXiv:2403.09703  [pdf, other

    cs.CL cs.AI

    Concept-aware Data Construction Improves In-context Learning of Language Models

    Authors: Michal Štefánik, Marek Kadlčík, Petr Sojka

    Abstract: Many recent language models (LMs) are capable of in-context learning (ICL), manifested in the LMs' ability to perform a new task solely from natural-language instruction. Previous work curating in-context learners assumes that ICL emerges from a vast over-parametrization or the scale of multi-task training. However, recent theoretical work attributes the ICL ability to concept-dependent training d… ▽ More

    Submitted 28 June, 2024; v1 submitted 8 March, 2024; originally announced March 2024.

    Comments: Long paper to appear in Findings of ACL 2024

  9. arXiv:2305.16718  [pdf, other

    cs.CL

    People and Places of Historical Europe: Bootstrapping Annotation Pipeline and a New Corpus of Named Entities in Late Medieval Texts

    Authors: Vít Novotný, Kristýna Luger, Michal Štefánik, Tereza Vrabcová, Aleš Horák

    Abstract: Although pre-trained named entity recognition (NER) models are highly accurate on modern corpora, they underperform on historical texts due to differences in language OCR errors. In this work, we develop a new NER corpus of 3.6M sentences from late medieval charters written mainly in Czech, Latin, and German. We show that we can start with a list of known historical figures and locations and an… ▽ More

    Submitted 6 June, 2023; v1 submitted 26 May, 2023; originally announced May 2023.

    Comments: To appear in the Findings of the Association for Computational Linguistics: ACL 2023

    MSC Class: 68T50 ACM Class: I.2.7; I.7.0

  10. arXiv:2305.15017  [pdf, other

    cs.LG cs.AI cs.CL

    Calc-X and Calcformers: Empowering Arithmetical Chain-of-Thought through Interaction with Symbolic Systems

    Authors: Marek Kadlčík, Michal Štefánik, Ondřej Sotolář, Vlastimil Martinek

    Abstract: Despite outstanding performance in many tasks, language models are notoriously inclined to make factual errors in tasks requiring arithmetic computation. We address this deficiency by creating Calc-X, a collection of datasets that demonstrates the appropriate use of a calculator in reasoning chains. Calc-X is suitable for teaching language models to offload computations to a symbolic system. We su… ▽ More

    Submitted 23 October, 2023; v1 submitted 24 May, 2023; originally announced May 2023.

    Comments: Published in EMNLP 2023: Main track

  11. arXiv:2305.13775  [pdf, other

    cs.CL cs.AI

    Concept-aware Training Improves In-context Learning Ability of Language Models

    Authors: Michal Štefánik, Marek Kadlčík

    Abstract: Many recent language models (LMs) of Transformers family exhibit so-called in-context learning (ICL) ability, manifested in the LMs' ability to modulate their function by a task described in a natural language input. Previous work curating these models assumes that ICL emerges from vast over-parametrization or the scale of multi-task training. However, a complementary branch of recent theoretical… ▽ More

    Submitted 23 May, 2023; originally announced May 2023.

    Comments: Work in progress

  12. arXiv:2305.06841  [pdf, other

    cs.CL cs.AI

    Think Twice: Measuring the Efficiency of Eliminating Prediction Shortcuts of Question Answering Models

    Authors: Lukáš Mikula, Michal Štefánik, Marek Petrovič, Petr Sojka

    Abstract: While the Large Language Models (LLMs) dominate a majority of language understanding tasks, previous work shows that some of these results are supported by modelling spurious correlations of training datasets. Authors commonly assess model robustness by evaluating their models on out-of-distribution (OOD) datasets of the same task, but these datasets might share the bias of the training dataset.… ▽ More

    Submitted 6 February, 2024; v1 submitted 11 May, 2023; originally announced May 2023.

    Comments: Long paper in Proceedings of EACL 2024: Main track

  13. arXiv:2304.01922  [pdf, other

    cs.CL

    Resources and Few-shot Learners for In-context Learning in Slavic Languages

    Authors: Michal Štefánik, Marek Kadlčík, Piotr Gramacki, Petr Sojka

    Abstract: Despite the rapid recent progress in creating accurate and compact in-context learners, most recent work focuses on in-context learning (ICL) for tasks in English. However, the ability to interact with users of languages outside English presents a great potential for broadening the applicability of language technologies to non-English speakers. In this work, we collect the infrastructure necessa… ▽ More

    Submitted 4 April, 2023; originally announced April 2023.

    Comments: EACL 2023 SlavicNLP Long Paper. New instructional templates and models are available on https://github.com/fewshot-goes-multilingual/slavic-incontext-learning

  14. arXiv:2212.01692  [pdf, other

    cs.CL cs.AI cs.LG

    Can In-context Learners Learn a Reasoning Concept from Demonstrations?

    Authors: Michal Štefánik, Marek Kadlčík

    Abstract: Language models exhibit an emergent ability to learn a new task from a small number of input-output demonstrations. However, recent work shows that in-context learners largely rely on their pre-trained knowledge, such as the sentiment of the labels, instead of learning new associations from the input. We argue that the commonly-used few-shot evaluation using a random selection of in-context demons… ▽ More

    Submitted 19 July, 2023; v1 submitted 3 December, 2022; originally announced December 2022.

    Comments: Awarded Best Paper at ACL 2023 Natural Language Reasoning and Structured Explanations (NLRSE) workshop

  15. arXiv:2211.16550  [pdf, other

    cs.CL cs.AI cs.NE

    Soft Alignment Objectives for Robust Adaptation of Language Generation

    Authors: Michal Štefánik, Marek Kadlčík, Petr Sojka

    Abstract: Domain adaptation allows generative language models to address specific flaws caused by the domain shift of their application. However, the traditional adaptation by further training on in-domain data rapidly weakens the model's ability to generalize to other domains, making the open-ended deployments of the adapted models prone to errors. This work introduces novel training objectives built upon… ▽ More

    Submitted 26 May, 2023; v1 submitted 29 November, 2022; originally announced November 2022.

    Comments: Annual Meeting of The ACL 2023: Main conference long paper

  16. arXiv:2206.08446  [pdf, other

    cs.CL cs.AI

    Methods for Estimating and Improving Robustness of Language Models

    Authors: Michal Štefánik

    Abstract: Despite their outstanding performance, large language models (LLMs) suffer notorious flaws related to their preference for simple, surface-level textual relations over full semantic complexity of the problem. This proposal investigates a common denominator of this problem in their weak ability to generalise outside of the training domain. We survey diverse research directions providing estimations… ▽ More

    Submitted 16 June, 2022; originally announced June 2022.

    Comments: Thesis proposal, accepted & to appear in NAACL SRW 2022

  17. arXiv:2203.03989  [pdf, other

    cs.CL cs.AI cs.LG

    Adaptor: Objective-Centric Adaptation Framework for Language Models

    Authors: Michal Štefánik, Vít Novotný, Nikola Groverová, Petr Sojka

    Abstract: Progress in natural language processing research is catalyzed by the possibilities given by the widespread software frameworks. This paper introduces Adaptor library that transposes the traditional model-centric approach composed of pre-training + fine-tuning steps to objective-centric approach, composing the training process by applications of selected objectives. We survey research directions th… ▽ More

    Submitted 20 May, 2022; v1 submitted 8 March, 2022; originally announced March 2022.

    Comments: 60th Annual Meeting of the ACL (ACL 2022): System Demonstrations paper

  18. arXiv:2109.07242  [pdf, other

    cs.CL

    Regressive Ensemble for Machine Translation Quality Evaluation

    Authors: Michal Štefánik, Vít Novotný, Petr Sojka

    Abstract: This work introduces a simple regressive ensemble for evaluating machine translation quality based on a set of novel and established metrics. We evaluate the ensemble using a correlation to expert-based MQM scores of the WMT 2021 Metrics workshop. In both monolingual and zero-shot cross-lingual settings, we show a significant performance improvement over single metrics. In the cross-lingual settin… ▽ More

    Submitted 15 September, 2021; originally announced September 2021.

    Comments: 8 pages incl. references, Proceedings of EMNLP 2021 Sixth Conference on Machine Translation (WMT 21)

  19. arXiv:2106.00411  [pdf, other

    cs.DL cs.IR

    WebMIaS on Docker: Deploying Math-Aware Search in a Single Line of Code

    Authors: Dávid Lupták, Vít Novotný, Michal Štefánik, Petr Sojka

    Abstract: Math informational retrieval (MIR) search engines are absent in the wide-spread production use, even though documents in the STEM fields contain many mathematical formulae, which are sometimes more important than text for understanding. We have developed and open-sourced the WebMIaS MIR search engine that has been successfully deployed in the European Digital Mathematics Library (EuDML). However,… ▽ More

    Submitted 14 July, 2021; v1 submitted 1 June, 2021; originally announced June 2021.

    Comments: Accepted to be published in: Intelligent Computer Mathematics 14th International Conference, CICM 2021, Timisoara, Romania, July 26--31, 2021, Proceedings, Fairouz Kamareddine and Claudio Sacerdotti-Coen (eds.), Lecture Notes in Artificial Intelligence, Springer, Cham, 2021

    MSC Class: 68V35 (Primary); 68V30 (Secondary) ACM Class: H.3.3; H.3.4; H.3.5; H.3.6; H.3.7

  20. When FastText Pays Attention: Efficient Estimation of Word Representations using Constrained Positional Weighting

    Authors: Vít Novotný, Michal Štefánik, Eniafe Festus Ayetiran, Petr Sojka, Radim Řehůřek

    Abstract: In 2018, Mikolov et al. introduced the positional language model, which has characteristics of attention-based neural machine translation models and which achieved state-of-the-art performance on the intrinsic word analogy task. However, the positional model is not practically fast and it has never been evaluated on qualitative criteria or extrinsic tasks. We propose a constrained positional model… ▽ More

    Submitted 28 February, 2022; v1 submitted 19 April, 2021; originally announced April 2021.

    MSC Class: 68T50 ACM Class: I.2.7

    Journal ref: J. Univers. Comput. Sci. 28:2 (2022) 181-201

  21. One Size Does Not Fit All: Finding the Optimal Subword Sizes for FastText Models across Languages

    Authors: Vít Novotný, Eniafe Festus Ayetiran, Dalibor Bačovský, Dávid Lupták, Michal Štefánik, Petr Sojka

    Abstract: Unsupervised representation learning of words from large multilingual corpora is useful for downstream tasks such as word sense disambiguation, semantic text similarity, and information retrieval. The representation precision of log-bilinear fastText models is mostly due to their use of subword information. In previous work, the optimization of fastText's subword sizes has not been fully explored,… ▽ More

    Submitted 20 September, 2021; v1 submitted 4 February, 2021; originally announced February 2021.

    MSC Class: 68T50 ACM Class: I.2.7

    Journal ref: RANLP (2021) 1072-1078

  22. arXiv:2003.05019  [pdf, other

    cs.IR cs.CL cs.LG

    Text classification with word embedding regularization and soft similarity measure

    Authors: Vít Novotný, Eniafe Festus Ayetiran, Michal Štefánik, Petr Sojka

    Abstract: Since the seminal work of Mikolov et al., word embeddings have become the preferred word representations for many natural language processing tasks. Document similarity measures extracted from word embeddings, such as the soft cosine measure (SCM) and the Word Mover's Distance (WMD), were reported to achieve state-of-the-art performance on semantic text similarity and text classification. Despit… ▽ More

    Submitted 10 March, 2020; originally announced March 2020.

    MSC Class: 68P20 ACM Class: F.2.1; G.1.3; H.3.3; I.2.7