Skip to main content

Showing 1–14 of 14 results for author: Margatina, K

Searching in archive cs. Search in all archives.
.
  1. arXiv:2506.01859  [pdf, other

    cs.CL

    CONFETTI: Conversational Function-Calling Evaluation Through Turn-Level Interactions

    Authors: Tamer Alkhouli, Katerina Margatina, James Gung, Raphael Shu, Claudia Zaghi, Monica Sunkara, Yi Zhang

    Abstract: We introduce Conversational Function-Calling Evaluation Through Turn-Level Interactions (CONFETTI), a conversational benchmark1 designed to evaluate the function-calling capabilities and response quality of large language models (LLMs). Current benchmarks lack comprehensive assessment of LLMs in complex conversational scenarios. CONFETTI addresses this gap through 109 human-simulated conversations… ▽ More

    Submitted 2 June, 2025; originally announced June 2025.

    Comments: ACL 2025 (main conference)

  2. arXiv:2502.12094  [pdf, other

    cs.AI cs.CL

    A Study on Leveraging Search and Self-Feedback for Agent Reasoning

    Authors: Karthikeyan K, Michelle Yuan, Elman Mansimov, Katerina Margatina, Anurag Pratik, Daniele Bonadiman, Monica Sunkara, Yi Zhang, Yassine Benajiba

    Abstract: Recent works have demonstrated that incorporating search during inference can significantly improve reasoning capabilities of language agents. Some approaches may make use of the ground truth or rely on model's own generated feedback. The search algorithm uses this feedback to then produce values that will update its criterion for exploring and exploiting various reasoning paths. In this study, we… ▽ More

    Submitted 17 February, 2025; originally announced February 2025.

    Comments: Under review

  3. arXiv:2404.16019  [pdf, other

    cs.CL

    The PRISM Alignment Dataset: What Participatory, Representative and Individualised Human Feedback Reveals About the Subjective and Multicultural Alignment of Large Language Models

    Authors: Hannah Rose Kirk, Alexander Whitefield, Paul Röttger, Andrew Bean, Katerina Margatina, Juan Ciro, Rafael Mosquera, Max Bartolo, Adina Williams, He He, Bertie Vidgen, Scott A. Hale

    Abstract: Human feedback is central to the alignment of Large Language Models (LLMs). However, open questions remain about methods (how), domains (where), people (who) and objectives (to what end) of feedback processes. To navigate these questions, we introduce PRISM, a dataset that maps the sociodemographics and stated preferences of 1,500 diverse participants from 75 countries, to their contextual prefere… ▽ More

    Submitted 3 December, 2024; v1 submitted 24 April, 2024; originally announced April 2024.

    Journal ref: The Thirty-eight Conference on Neural Information Processing Systems Datasets and Benchmarks Track (2024)

  4. arXiv:2310.17271  [pdf, other

    cs.CL

    Understanding the Role of Input Token Characters in Language Models: How Does Information Loss Affect Performance?

    Authors: Ahmed Alajrami, Katerina Margatina, Nikolaos Aletras

    Abstract: Understanding how and what pre-trained language models (PLMs) learn about language is an open challenge in natural language processing. Previous work has focused on identifying whether they capture semantic and syntactic information, and how the data or the pre-training objective affects their performance. However, to the best of our knowledge, no previous work has specifically examined how inform… ▽ More

    Submitted 26 October, 2023; originally announced October 2023.

    Comments: To appear at EMNLP 2023

  5. arXiv:2305.14264  [pdf, other

    cs.CL cs.AI

    Active Learning Principles for In-Context Learning with Large Language Models

    Authors: Katerina Margatina, Timo Schick, Nikolaos Aletras, Jane Dwivedi-Yu

    Abstract: The remarkable advancements in large language models (LLMs) have significantly enhanced the performance in few-shot learning settings. By using only a small number of labeled examples, referred to as demonstrations, LLMs can effectively grasp the task at hand through in-context learning. However, the process of selecting appropriate demonstrations has received limited attention in prior work. This… ▽ More

    Submitted 22 November, 2023; v1 submitted 23 May, 2023; originally announced May 2023.

    Comments: To appear at Findings of EMNLP (Camera Ready version)

  6. arXiv:2305.13342  [pdf, other

    cs.LG cs.CL

    On the Limitations of Simulating Active Learning

    Authors: Katerina Margatina, Nikolaos Aletras

    Abstract: Active learning (AL) is a human-and-model-in-the-loop paradigm that iteratively selects informative unlabeled data for human annotation, aiming to improve over random sampling. However, performing AL experiments with human annotations on-the-fly is a laborious and expensive process, thus unrealistic for academic research. An easy fix to this impediment is to simulate AL, by treating an already lab… ▽ More

    Submitted 21 May, 2023; originally announced May 2023.

    Comments: To appear at Findings of ACL 2023

  7. arXiv:2302.12297  [pdf, other

    cs.CL

    Dynamic Benchmarking of Masked Language Models on Temporal Concept Drift with Multiple Views

    Authors: Katerina Margatina, Shuai Wang, Yogarshi Vyas, Neha Anna John, Yassine Benajiba, Miguel Ballesteros

    Abstract: Temporal concept drift refers to the problem of data changing over time. In NLP, that would entail that language (e.g. new expressions, meaning shifts) and factual knowledge (e.g. new concepts, updated facts) evolve over time. Focusing on the latter, we benchmark $11$ pretrained masked language models (MLMs) on a series of tests designed to evaluate the effect of temporal concept drift, as it is c… ▽ More

    Submitted 23 February, 2023; originally announced February 2023.

    Comments: To appear at EACL 2023. Our code will be available at https://github.com/amazon-science/temporal-robustness

  8. arXiv:2302.06976  [pdf, other

    cs.CL

    Investigating Multi-source Active Learning for Natural Language Inference

    Authors: Ard Snijders, Douwe Kiela, Katerina Margatina

    Abstract: In recent years, active learning has been successfully applied to an array of NLP tasks. However, prior work often assumes that training and test data are drawn from the same distribution. This is problematic, as in real-life settings data may stem from several sources of varying relevance and quality. We show that four popular active learning schemes fail to outperform random selection when appli… ▽ More

    Submitted 14 February, 2023; originally announced February 2023.

    Comments: 23 pages. Accepted for publication at the European Chapter of the Association of Computational Linguistics (EACL) 2023

  9. arXiv:2203.10020  [pdf, other

    cs.CL

    Challenges and Strategies in Cross-Cultural NLP

    Authors: Daniel Hershcovich, Stella Frank, Heather Lent, Miryam de Lhoneux, Mostafa Abdou, Stephanie Brandl, Emanuele Bugliarello, Laura Cabello Piqueras, Ilias Chalkidis, Ruixiang Cui, Constanza Fierro, Katerina Margatina, Phillip Rust, Anders Søgaard

    Abstract: Various efforts in the Natural Language Processing (NLP) community have been made to accommodate linguistic diversity and serve speakers of many different languages. However, it is important to acknowledge that speakers and the content they produce and require, vary not just by language, but also by culture. Although language and culture are tightly linked, there are important differences. Analogo… ▽ More

    Submitted 18 March, 2022; originally announced March 2022.

    Comments: ACL 2022 - Theme track

  10. arXiv:2109.03764  [pdf, other

    cs.CL cs.AI cs.LG

    Active Learning by Acquiring Contrastive Examples

    Authors: Katerina Margatina, Giorgos Vernikos, Loïc Barrault, Nikolaos Aletras

    Abstract: Common acquisition functions for active learning use either uncertainty or diversity sampling, aiming to select difficult and diverse data points from the pool of unlabeled data, respectively. In this work, leveraging the best of both worlds, we propose an acquisition function that opts for selecting \textit{contrastive examples}, i.e. data points that are similar in the model feature space and ye… ▽ More

    Submitted 8 September, 2021; originally announced September 2021.

    Comments: Accepted at EMNLP 2021

  11. arXiv:2109.01819  [pdf, other

    cs.CL cs.AI cs.LG

    Frustratingly Simple Pretraining Alternatives to Masked Language Modeling

    Authors: Atsuki Yamaguchi, George Chrysostomou, Katerina Margatina, Nikolaos Aletras

    Abstract: Masked language modeling (MLM), a self-supervised pretraining objective, is widely used in natural language processing for learning text representations. MLM trains a model to predict a random sample of input tokens that have been replaced by a [MASK] placeholder in a multi-class setting over the entire vocabulary. When pretraining, it is common to use alongside MLM other auxiliary objectives on t… ▽ More

    Submitted 4 September, 2021; originally announced September 2021.

    Comments: Accepted at EMNLP 2021

  12. arXiv:2104.08320  [pdf, other

    cs.CL

    On the Importance of Effectively Adapting Pretrained Language Models for Active Learning

    Authors: Katerina Margatina, Loïc Barrault, Nikolaos Aletras

    Abstract: Recent Active Learning (AL) approaches in Natural Language Processing (NLP) proposed using off-the-shelf pretrained language models (LMs). In this paper, we argue that these LMs are not adapted effectively to the downstream task during AL and we explore ways to address this issue. We suggest to first adapt the pretrained LM to the target task by continuing training with all the available unlabeled… ▽ More

    Submitted 2 March, 2022; v1 submitted 16 April, 2021; originally announced April 2021.

    Comments: To appear at ACL 2022

  13. arXiv:2009.13366  [pdf, other

    cs.LG stat.ML

    Domain Adversarial Fine-Tuning as an Effective Regularizer

    Authors: Giorgos Vernikos, Katerina Margatina, Alexandra Chronopoulou, Ion Androutsopoulos

    Abstract: In Natural Language Processing (NLP), pretrained language models (LMs) that are transferred to downstream tasks have been recently shown to achieve state-of-the-art results. However, standard fine-tuning can degrade the general-domain representations captured during pretraining. To address this issue, we introduce a new regularization technique, AFTER; domain Adversarial Fine-Tuning as an Effectiv… ▽ More

    Submitted 5 October, 2020; v1 submitted 28 September, 2020; originally announced September 2020.

    Comments: EMNLP 2020, Findings of EMNLP

  14. arXiv:1906.03674  [pdf, other

    cs.LG cs.CL stat.ML

    Attention-based Conditioning Methods for External Knowledge Integration

    Authors: Katerina Margatina, Christos Baziotis, Alexandros Potamianos

    Abstract: In this paper, we present a novel approach for incorporating external knowledge in Recurrent Neural Networks (RNNs). We propose the integration of lexicon features into the self-attention mechanism of RNN-based architectures. This form of conditioning on the attention distribution, enforces the contribution of the most salient words for the task at hand. We introduce three methods, namely attentio… ▽ More

    Submitted 9 June, 2019; originally announced June 2019.

    Comments: ACL 2019