Showing 1–18 of 18 results for author: Borchmann, Ł

  1. arXiv:2505.20315  [pdf, ps, other]

    cs.CL cs.AI

    Arctic-Text2SQL-R1: Simple Rewards, Strong Reasoning in Text-to-SQL

    Authors: Zhewei Yao, Guoheng Sun, Lukasz Borchmann, Zheyu Shen, Minghang Deng, Bohan Zhai, Hao Zhang, Ang Li, Yuxiong He

    Abstract: Translating natural language into SQL (Text2SQL) is a longstanding challenge at the intersection of natural language understanding and structured data access. While large language models (LLMs) have significantly improved fluency in SQL generation, producing correct and executable SQL--particularly for complex queries--remains a bottleneck. We present Arctic-Text2SQL-R1, a reinforcement learning (…

    Submitted 22 May, 2025; originally announced May 2025.

    Comments: 22 pages, 2 figures

  2. arXiv:2504.10419  [pdf, other]

    cs.CL

    Unchecked and Overlooked: Addressing the Checkbox Blind Spot in Large Language Models with CheckboxQA

    Authors: Michał Turski, Mateusz Chiliński, Łukasz Borchmann

    Abstract: Checkboxes are critical in real-world document processing where the presence or absence of ticks directly informs data extraction and decision-making processes. Yet, despite the strong performance of Large Vision and Language Models across a wide range of tasks, they struggle with interpreting checkable content. This challenge becomes particularly pressing in industries where a single overlooked c…

    Submitted 15 April, 2025; v1 submitted 14 April, 2025; originally announced April 2025.

  3. arXiv:2503.24364  [pdf, other]

    cs.CL

    Query and Conquer: Execution-Guided SQL Generation

    Authors: Łukasz Borchmann, Marek Wydmuch

    Abstract: We propose a novel approach for generating complex outputs that significantly improves accuracy in text-to-SQL tasks. Our method leverages execution results to select the most semantically consistent query from multiple candidates, enabling smaller, cost-effective models to surpass computationally intensive reasoning methods such as o1, o3-mini, and DeepSeek R1 while reducing inference cost by as…

    Submitted 31 March, 2025; originally announced March 2025.

  4. arXiv:2412.17758  [pdf, other]

    cs.CL cs.AI

    In Case You Missed It: ARC 'Challenge' Is Not That Challenging

    Authors: Łukasz Borchmann

    Abstract: ARC Challenge appears more difficult than ARC Easy for modern LLMs primarily due to an evaluation setup that prevents direct comparison of answer choices rather than inherent complexity. Although some researchers have quietly shifted to a more appropriate scheme over the last year, the implications of this change have yet to be widely acknowledged. We highlight this overlooked shift, show how simi…

    Submitted 23 December, 2024; originally announced December 2024.

  5. arXiv:2411.11829  [pdf, other]

    cs.LG cs.CL cs.DB

    Tackling prediction tasks in relational databases with LLMs

    Authors: Marek Wydmuch, Łukasz Borchmann, Filip Graliński

    Abstract: Though large language models (LLMs) have demonstrated exceptional performance across numerous problems, their application to predictive tasks in relational databases remains largely unexplored. In this work, we address the notion that LLMs cannot yield satisfactory results on relational databases due to their interconnected tables, complex relationships, and heterogeneous data types. Using the rec…

    Submitted 18 November, 2024; originally announced November 2024.

  6. arXiv:2410.23331  [pdf, other]

    cs.CL

    Can Models Help Us Create Better Models? Evaluating LLMs as Data Scientists

    Authors: Michał Pietruszka, Łukasz Borchmann, Aleksander Jędrosz, Paweł Morawiecki

    Abstract: We present a benchmark for large language models designed to tackle one of the most knowledge-intensive tasks in data science: writing feature engineering code, which requires domain knowledge in addition to a deep understanding of the underlying problem and data structure. The model is provided with a dataset description in a prompt and asked to generate code transforming it. The evaluation score…

    Submitted 30 October, 2024; originally announced October 2024.

  7. arXiv:2408.04632  [pdf, other]

    cs.CL cs.CV

    Arctic-TILT. Business Document Understanding at Sub-Billion Scale

    Authors: Łukasz Borchmann, Michał Pietruszka, Wojciech Jaśkowski, Dawid Jurkiewicz, Piotr Halama, Paweł Józiak, Łukasz Garncarek, Paweł Liskowski, Karolina Szyndler, Andrzej Gretkowski, Julita Ołtusek, Gabriela Nowakowska, Artur Zawłocki, Łukasz Duhr, Paweł Dyda, Michał Turski

    Abstract: The vast portion of workloads employing LLMs involves answering questions grounded on PDF or scan content. We introduce the Arctic-TILT achieving accuracy on par with models 1000$\times$ its size on these use cases. It can be fine-tuned and deployed on a single 24GB GPU, lowering operational costs while processing Visually Rich Documents with up to 400k tokens. The model establishes state-of-the-a…

    Submitted 8 August, 2024; originally announced August 2024.

  8. arXiv:2405.18433  [pdf, other]

    cs.CL

    Notes on Applicability of GPT-4 to Document Understanding

    Authors: Łukasz Borchmann

    Abstract: We perform a missing, reproducible evaluation of all publicly available GPT-4 family models concerning the Document Understanding field, where it is frequently required to comprehend text spatial arrangement and visual clues in addition to textual semantics. Benchmark results indicate that though it is hard to achieve satisfactory results with text-only models, GPT-4 Vision Turbo performs well whe…

    Submitted 28 May, 2024; originally announced May 2024.

  9. arXiv:2305.08455  [pdf, other]

    cs.CV cs.CL cs.LG

    Document Understanding Dataset and Evaluation (DUDE)

    Authors: Jordy Van Landeghem, Rubén Tito, Łukasz Borchmann, Michał Pietruszka, Paweł Józiak, Rafał Powalski, Dawid Jurkiewicz, Mickaël Coustaty, Bertrand Ackaert, Ernest Valveny, Matthew Blaschko, Sien Moens, Tomasz Stanisławek

    Abstract: We call on the Document AI (DocAI) community to reevaluate current methodologies and embrace the challenge of creating more practically-oriented benchmarks. Document Understanding Dataset and Evaluation (DUDE) seeks to remediate the halted research progress in understanding visually-rich documents (VRDs). We present a new dataset with novelties related to types of questions, answers, and document…

    Submitted 11 September, 2023; v1 submitted 15 May, 2023; originally announced May 2023.

    Comments: Accepted at ICCV 2023

  10. arXiv:2206.04045  [pdf, other]

    cs.CL cs.LG

    STable: Table Generation Framework for Encoder-Decoder Models

    Authors: Michał Pietruszka, Michał Turski, Łukasz Borchmann, Tomasz Dwojak, Gabriela Pałka, Karolina Szyndler, Dawid Jurkiewicz, Łukasz Garncarek

    Abstract: The output structure of database-like tables, consisting of values structured in horizontal rows and vertical columns identifiable by name, can cover a wide range of NLP tasks. Following this constatation, we propose a framework for text-to-table neural models applicable to problems such as extraction of line items, joint entity and relation extraction, or knowledge base population. The permutatio…

    Submitted 12 October, 2022; v1 submitted 8 June, 2022; originally announced June 2022.

  11. arXiv:2102.09550  [pdf, other]

    cs.CL cs.LG

    Going Full-TILT Boogie on Document Understanding with Text-Image-Layout Transformer

    Authors: Rafał Powalski, Łukasz Borchmann, Dawid Jurkiewicz, Tomasz Dwojak, Michał Pietruszka, Gabriela Pałka

    Abstract: We address the challenging problem of Natural Language Comprehension beyond plain-text documents by introducing the TILT neural network architecture which simultaneously learns layout information, visual features, and textual semantics. Contrary to previous approaches, we rely on a decoder capable of unifying a variety of problems involving natural language. The layout is represented as an attenti…

    Submitted 12 July, 2021; v1 submitted 18 February, 2021; originally announced February 2021.

    Comments: Accepted at ICDAR 2021

  12. arXiv:2011.03228  [pdf, other]

    cs.CL cs.IR

    From Dataset Recycling to Multi-Property Extraction and Beyond

    Authors: Tomasz Dwojak, Michał Pietruszka, Łukasz Borchmann, Jakub Chłędowski, Filip Graliński

    Abstract: This paper investigates various Transformer architectures on the WikiReading Information Extraction and Machine Reading Comprehension dataset. The proposed dual-source model outperforms the current state-of-the-art by a large margin. Next, we introduce WikiReading Recycled - a newly developed public dataset and the task of multiple property extraction. It uses the same data as WikiReading but does n…

    Submitted 6 November, 2020; originally announced November 2020.

    Comments: Accepted at CoNLL 2020; this article supersedes arXiv:2006.08281

  13. arXiv:2010.15552  [pdf, other]

    cs.LG

    Successive Halving Top-k Operator

    Authors: Michał Pietruszka, Łukasz Borchmann, Filip Graliński

    Abstract: We propose a differentiable successive halving method of relaxing the top-k operator, rendering gradient-based optimization possible. The need to perform softmax iteratively on the entire vector of scores is avoided by using a tournament-style selection. As a result, a much better approximation of top-k with lower computational cost is achieved compared to the previous approach.

    Submitted 8 October, 2020; originally announced October 2020.

    Comments: Work in progress

  14. arXiv:2010.14464  [pdf, other]

    cs.DS cs.CL cs.IR

    Dynamic Boundary Time Warping for Sub-sequence Matching with Few Examples

    Authors: Łukasz Borchmann, Dawid Jurkiewicz, Filip Graliński, Tomasz Górecki

    Abstract: The paper presents a novel method of finding a fragment in a long temporal sequence similar to the set of shorter sequences. We are the first to propose an algorithm for such a search that does not rely on computing the average sequence from query examples. Instead, we use query examples as is, utilizing all of them simultaneously. The introduced method based on the Dynamic Time Warping (DTW) tech…

    Submitted 1 September, 2024; v1 submitted 27 October, 2020; originally announced October 2020.

  15. arXiv:2009.05169  [pdf, other]

    cs.CL cs.LG

    Sparsifying Transformer Models with Trainable Representation Pooling

    Authors: Michał Pietruszka, Łukasz Borchmann, Łukasz Garncarek

    Abstract: We propose a novel method to sparsify attention in the Transformer model by learning to select the most-informative token representations during the training process, thus focusing on the task-specific parts of an input. A reduction of quadratic time and memory complexity to sublinear was achieved due to a robust trainable top-$k$ operator. Our experiments on a challenging long document summarizat…

    Submitted 7 March, 2022; v1 submitted 10 September, 2020; originally announced September 2020.

    Comments: Accepted at ACL 2022

  16. arXiv:2006.08281  [pdf, other]

    cs.CL cs.IR

    On the Multi-Property Extraction and Beyond

    Authors: Tomasz Dwojak, Michał Pietruszka, Łukasz Borchmann, Filip Graliński, Jakub Chłędowski

    Abstract: In this paper, we investigate the Dual-source Transformer architecture on the WikiReading information extraction and machine reading comprehension dataset. The proposed model outperforms the current state-of-the-art by a large margin. Next, we introduce WikiReading Recycled - a newly developed public dataset, supporting the task of multiple property extraction. It keeps the spirit of the original…

    Submitted 15 June, 2020; originally announced June 2020.

    Comments: 5 pages

  17. arXiv:2005.07934  [pdf, other]

    cs.CL

    ApplicaAI at SemEval-2020 Task 11: On RoBERTa-CRF, Span CLS and Whether Self-Training Helps Them

    Authors: Dawid Jurkiewicz, Łukasz Borchmann, Izabela Kosmala, Filip Graliński

    Abstract: This paper presents the winning system for the propaganda Technique Classification (TC) task and the second-placed system for the propaganda Span Identification (SI) task. The purpose of the TC task was to identify the applied propaganda technique given a propaganda text fragment. The goal of the SI task was to find specific text fragments which contain at least one propaganda technique. Both of the develope…

    Submitted 5 September, 2020; v1 submitted 16 May, 2020; originally announced May 2020.

  18. arXiv:1911.03911  [pdf, other]

    cs.CL

    Contract Discovery: Dataset and a Few-Shot Semantic Retrieval Challenge with Competitive Baselines

    Authors: Łukasz Borchmann, Dawid Wiśniewski, Andrzej Gretkowski, Izabela Kosmala, Dawid Jurkiewicz, Łukasz Szałkiewicz, Gabriela Pałka, Karol Kaczmarek, Agnieszka Kaliska, Filip Graliński

    Abstract: We propose a new shared task of semantic retrieval from legal texts, in which a so-called contract discovery is to be performed, where legal clauses are extracted from documents, given a few examples of similar clauses from other legal acts. The task differs substantially from conventional NLI and shared tasks on legal information extraction (e.g., one has to identify text span instead of a single…

    Submitted 8 October, 2020; v1 submitted 10 November, 2019; originally announced November 2019.

    Comments: Submitted to Findings of EMNLP