Skip to main content

Showing 1–6 of 6 results for author: Ondrej, K

.
  1. arXiv:2412.17933  [pdf, other

    cs.CL cs.AI

    BenCzechMark : A Czech-centric Multitask and Multimetric Benchmark for Large Language Models with Duel Scoring Mechanism

    Authors: Martin Fajcik, Martin Docekal, Jan Dolezal, Karel Ondrej, Karel Beneš, Jan Kapsa, Pavel Smrz, Alexander Polok, Michal Hradis, Zuzana Neverilova, Ales Horak, Radoslav Sabol, Michal Stefanik, Adam Jirkovsky, David Adamczyk, Petr Hyner, Jan Hula, Hynek Kydlicek

    Abstract: We present BenCzechMark (BCM), the first comprehensive Czech language benchmark designed for large language models, offering diverse tasks, multiple task formats, and multiple evaluation metrics. Its duel scoring system is grounded in statistical significance theory and uses aggregation across tasks inspired by social preference theory. Our benchmark encompasses 50 challenging tasks, with correspo… ▽ More

    Submitted 22 May, 2025; v1 submitted 23 December, 2024; originally announced December 2024.

    Comments: Accepted to TACL

  2. arXiv:2110.05781  [pdf, other

    eess.AS cs.CL cs.LG

    BERTraffic: BERT-based Joint Speaker Role and Speaker Change Detection for Air Traffic Control Communications

    Authors: Juan Zuluaga-Gomez, Seyyed Saeed Sarfjoo, Amrutha Prasad, Iuliia Nigmatulina, Petr Motlicek, Karel Ondrej, Oliver Ohneiser, Hartmut Helmke

    Abstract: Automatic speech recognition (ASR) allows transcribing the communications between air traffic controllers (ATCOs) and aircraft pilots. The transcriptions are used later to extract ATC named entities, e.g., aircraft callsigns. One common challenge is speech activity detection (SAD) and speaker diarization (SD). In the failure condition, two or more segments remain in the same recording, jeopardizin… ▽ More

    Submitted 14 October, 2022; v1 submitted 12 October, 2021; originally announced October 2021.

    Comments: To be published in the 2022 IEEE Spoken Language Technology Workshop (SLT) (SLT 2022)

  3. arXiv:2109.03502  [pdf, other

    cs.CL cs.IR cs.LG

    R2-D2: A Modular Baseline for Open-Domain Question Answering

    Authors: Martin Fajcik, Martin Docekal, Karel Ondrej, Pavel Smrz

    Abstract: This work presents a novel four-stage open-domain QA pipeline R2-D2 (Rank twice, reaD twice). The pipeline is composed of a retriever, passage reranker, extractive reader, generative reader and a mechanism that aggregates the final prediction from all system's components. We demonstrate its strength across three open-domain QA datasets: NaturalQuestions, TriviaQA and EfficientQA, surpassing state-… ▽ More

    Submitted 8 September, 2021; originally announced September 2021.

    Comments: Accepted to Findings of EMNLP'21. arXiv admin note: substantial text overlap with arXiv:2102.10697

  4. arXiv:2102.10697  [pdf, other

    cs.CL cs.AI cs.LG

    Pruning the Index Contents for Memory Efficient Open-Domain QA

    Authors: Martin Fajcik, Martin Docekal, Karel Ondrej, Pavel Smrz

    Abstract: This work presents a novel pipeline that demonstrates what is achievable with a combined effort of state-of-the-art approaches. Specifically, it proposes the novel R2-D2 (Rank twice, reaD twice) pipeline composed of retriever, passage reranker, extractive reader, generative reader and a simple way to combine them. Furthermore, previous work often comes with a massive index of external documents th… ▽ More

    Submitted 9 April, 2021; v1 submitted 21 February, 2021; originally announced February 2021.

    Comments: v2 - added connection between pruner and DPR, results on TriviaQA, new reranker, results with HN-DPR checkpoint and additional analyses

  5. arXiv:2101.00133  [pdf, other

    cs.CL cs.AI

    NeurIPS 2020 EfficientQA Competition: Systems, Analyses and Lessons Learned

    Authors: Sewon Min, Jordan Boyd-Graber, Chris Alberti, Danqi Chen, Eunsol Choi, Michael Collins, Kelvin Guu, Hannaneh Hajishirzi, Kenton Lee, Jennimaria Palomaki, Colin Raffel, Adam Roberts, Tom Kwiatkowski, Patrick Lewis, Yuxiang Wu, Heinrich Küttler, Linqing Liu, Pasquale Minervini, Pontus Stenetorp, Sebastian Riedel, Sohee Yang, Minjoon Seo, Gautier Izacard, Fabio Petroni, Lucas Hosseini , et al. (28 additional authors not shown)

    Abstract: We review the EfficientQA competition from NeurIPS 2020. The competition focused on open-domain question answering (QA), where systems take natural language questions as input and return natural language answers. The aim of the competition was to build systems that can predict correct answers while also satisfying strict on-disk memory budgets. These memory budgets were designed to encourage conte… ▽ More

    Submitted 19 September, 2021; v1 submitted 31 December, 2020; originally announced January 2021.

    Comments: 26 pages; Published in Proceedings of Machine Learning Research (PMLR), NeurIPS 2020 Competition and Demonstration Track

  6. arXiv:2001.08603  [pdf, other

    cs.AI cs.LG cs.LO

    Learning Distributional Programs for Relational Autocompletion

    Authors: Kumar Nitesh, Kuzelka Ondrej, De Raedt Luc

    Abstract: Relational autocompletion is the problem of automatically filling out some missing values in multi-relational data. We tackle this problem within the probabilistic logic programming framework of Distributional Clauses (DC), which supports both discrete and continuous probability distributions. Within this framework, we introduce DiceML { an approach to learn both the structure and the parameters o… ▽ More

    Submitted 5 July, 2021; v1 submitted 23 January, 2020; originally announced January 2020.