Showing 1–8 of 8 results for author: Saparina, I

Searching in archive cs.

  1. arXiv:2509.21028

    cs.AI

    Who Gets Cited Most? Benchmarking Long-Context Language Models on Scientific Articles

    Authors: Miao Li, Alexander Gurung, Irina Saparina, Mirella Lapata

    Abstract: This paper introduces SciTrek, a novel question-answering benchmark designed to evaluate the long-context reasoning capabilities of large language models (LLMs) using scientific articles. Current long-context benchmarks often rely on non-scientific texts, focus on simple information retrieval tasks, or employ artificial contexts. SciTrek addresses these limitations by proposing complex questions t…

    Submitted 25 September, 2025; originally announced September 2025.

    Comments: 31 pages

  2. arXiv:2509.17570

    cs.CL

    Asking a Language Model for Diverse Responses

    Authors: Sergey Troshin, Irina Saparina, Antske Fokkens, Vlad Niculae

    Abstract: Large language models increasingly rely on explicit reasoning chains and can produce multiple plausible responses for a given context. We study the candidate sampler that produces the set of plausible responses contrasting the ancestral (parallel) sampling against two alternatives: enumeration, which asks the model to produce $n$ candidates in one pass, and iterative sampling, which proposes candi…

    Submitted 22 September, 2025; originally announced September 2025.

    Comments: UncertaiNLP workshop, 2025

  3. arXiv:2502.18448

    cs.CL cs.AI

    Disambiguate First, Parse Later: Generating Interpretations for Ambiguity Resolution in Semantic Parsing

    Authors: Irina Saparina, Mirella Lapata

    Abstract: Handling ambiguity and underspecification is an important challenge in natural language interfaces, particularly for tasks like text-to-SQL semantic parsing. We propose a modular approach that resolves ambiguity using natural language interpretations before mapping these to logical forms (e.g., SQL queries). Although LLMs excel at parsing unambiguous utterances, they show strong biases for ambiguo…

    Submitted 12 July, 2025; v1 submitted 25 February, 2025; originally announced February 2025.

    Comments: Findings of ACL 2025

  4. arXiv:2406.19073

    cs.CL

    AMBROSIA: A Benchmark for Parsing Ambiguous Questions into Database Queries

    Authors: Irina Saparina, Mirella Lapata

    Abstract: Practical semantic parsers are expected to understand user utterances and map them to executable programs, even when these are ambiguous. We introduce a new benchmark, AMBROSIA, which we hope will inform and inspire the development of text-to-SQL parsers capable of recognizing and interpreting ambiguous requests. Our dataset contains questions showcasing three different types of ambiguity (scope a…

    Submitted 31 October, 2024; v1 submitted 27 June, 2024; originally announced June 2024.

    Comments: NeurIPS 2024 D&B Track Spotlight

  5. arXiv:2402.08666

    cs.CL

    Improving Generalization in Semantic Parsing by Increasing Natural Language Variation

    Authors: Irina Saparina, Mirella Lapata

    Abstract: Text-to-SQL semantic parsing has made significant progress in recent years, with various models demonstrating impressive performance on the challenging Spider benchmark. However, it has also been shown that these models often struggle to generalize even when faced with small perturbations of previously (accurately) parsed expressions. This is mainly due to the linguistic form of questions in Spide…

    Submitted 13 February, 2024; originally announced February 2024.

    Comments: EACL 2024

  6. arXiv:2210.07201

    cs.CL

    Searching for Better Database Queries in the Outputs of Semantic Parsers

    Authors: Anton Osokin, Irina Saparina, Ramil Yarullin

    Abstract: The task of generating a database query from a question in natural language suffers from ambiguity and insufficiently precise description of the goal. The problem is amplified when the system needs to generalize to databases unseen at training. In this paper, we consider the case when, at the test time, the system has access to an external criterion that evaluates the generated queries. The criter…

    Submitted 13 October, 2022; originally announced October 2022.

  7. arXiv:2109.06162

    cs.CL

    SPARQLing Database Queries from Intermediate Question Decompositions

    Authors: Irina Saparina, Anton Osokin

    Abstract: To translate natural language questions into executable database queries, most approaches rely on a fully annotated training set. Annotating a large dataset with queries is difficult as it requires query-language expertise. We reduce this burden using grounded in databases intermediate question representations. These representations are simpler to collect and were originally crowdsourced within th…

    Submitted 31 May, 2022; v1 submitted 13 September, 2021; originally announced September 2021.

  8. arXiv:1912.03771

    cs.LG stat.ML

    Cost-Sensitive Training for Autoregressive Models

    Authors: Irina Saparina, Anton Osokin

    Abstract: Training autoregressive models to better predict under the test metric, instead of maximizing the likelihood, has been reported to be beneficial in several use cases but brings additional complications, which prevent wider adoption. In this paper, we follow the learning-to-search approach (Daumé III et al., 2009; Leblond et al., 2018) and investigate its several components. First, we propose a way…

    Submitted 8 December, 2019; originally announced December 2019.