Showing 1–11 of 11 results for author: Chiruzzo, L

  1. arXiv:2506.11243  [pdf, ps, other]

    cs.CL cs.AI

    RETUYT-INCO at BEA 2025 Shared Task: How Far Can Lightweight Models Go in AI-powered Tutor Evaluation?

    Authors: Santiago Góngora, Ignacio Sastre, Santiago Robaina, Ignacio Remersaro, Luis Chiruzzo, Aiala Rosá

    Abstract: In this paper, we present the RETUYT-INCO participation at the BEA 2025 shared task. Our participation was characterized by the decision of using relatively small models, with fewer than 1B parameters. This self-imposed restriction tries to represent the conditions in which many research labs or institutions are in the Global South, where computational power is not easily accessible due to its pro…

    Submitted 12 June, 2025; originally announced June 2025.

    Comments: This paper will be presented at the 20th BEA Workshop (Innovative Use of NLP for Building Educational Applications) at ACL 2025

  2. arXiv:2504.20251  [pdf, other]

    cs.CL cs.AI cs.CY

    A Platform for Generating Educational Activities to Teach English as a Second Language

    Authors: Aiala Rosá, Santiago Góngora, Juan Pablo Filevich, Ignacio Sastre, Laura Musto, Brian Carpenter, Luis Chiruzzo

    Abstract: We present a platform for the generation of educational activities oriented to teaching English as a foreign language. The different activities -- games and language practice exercises -- are strongly based on Natural Language Processing techniques. The platform offers the possibility of playing out-of-the-box games, generated from resources created semi-automatically and then manually curated. It…

    Submitted 28 April, 2025; originally announced April 2025.

    Comments: Unpublished report written in 2023

  3. arXiv:2504.07304  [pdf, other]

    cs.CL cs.AI

    PAYADOR: A Minimalist Approach to Grounding Language Models on Structured Data for Interactive Storytelling and Role-playing Games

    Authors: Santiago Góngora, Luis Chiruzzo, Gonzalo Méndez, Pablo Gervás

    Abstract: Every time an Interactive Storytelling (IS) system gets a player input, it is facing the world-update problem. Classical approaches to this problem consist in mapping that input to known preprogrammed actions, what can severely constrain the free will of the player. When the expected experience has a strong focus on improvisation, like in Role-playing Games (RPGs), this problem is critical. In thi…

    Submitted 9 April, 2025; originally announced April 2025.

    Comments: Presented at the 15th International Conference on Computational Creativity (ICCC'24)

    Journal ref: Proceedings of the Fifteenth International Conference on Computational Creativity (2024) 101-106

  4. arXiv:2410.16315  [pdf, other]

    cs.CY

    Why AI Is WEIRD and Should Not Be This Way: Towards AI For Everyone, With Everyone, By Everyone

    Authors: Rada Mihalcea, Oana Ignat, Longju Bai, Angana Borah, Luis Chiruzzo, Zhijing Jin, Claude Kwizera, Joan Nwatu, Soujanya Poria, Thamar Solorio

    Abstract: This paper presents a vision for creating AI systems that are inclusive at every stage of development, from data collection to model design and evaluation. We address key limitations in the current AI pipeline and its WEIRD representation, such as lack of data diversity, biases in model performance, and narrow evaluation metrics. We also focus on the need for diverse representation among the devel…

    Submitted 9 October, 2024; originally announced October 2024.

  5. arXiv:2309.13702  [pdf, ps, other]

    cs.CL cs.AI

    Skill Check: Some Considerations on the Evaluation of Gamemastering Models for Role-playing Games

    Authors: Santiago Góngora, Luis Chiruzzo, Gonzalo Méndez, Pablo Gervás

    Abstract: In role-playing games a Game Master (GM) is the player in charge of the game, who must design the challenges the players face and narrate the outcomes of their actions. In this work we discuss some challenges to model GMs from an Interactive Storytelling and Natural Language Processing perspective. Following those challenges we propose three test categories to evaluate such dialogue systems, and w…

    Submitted 30 September, 2023; v1 submitted 24 September, 2023; originally announced September 2023.

    Comments: 11 pages. Accepted at GALA 2023 (Games and Learning Alliance 12th International Conference)

  6. arXiv:2309.06163  [pdf, ps, other]

    cs.CL

    Overview of GUA-SPA at IberLEF 2023: Guarani-Spanish Code Switching Analysis

    Authors: Luis Chiruzzo, Marvin Agüero-Torales, Gustavo Giménez-Lugo, Aldo Alvarez, Yliana Rodríguez, Santiago Góngora, Thamar Solorio

    Abstract: We present the first shared task for detecting and analyzing code-switching in Guarani and Spanish, GUA-SPA at IberLEF 2023. The challenge consisted of three tasks: identifying the language of a token, NER, and a novel task of classifying the way a Spanish span is used in the code-switched context. We annotated a corpus of 1500 texts extracted from news articles and tweets, around 25 thousand toke…

    Submitted 12 September, 2023; originally announced September 2023.

    Journal ref: Procesamiento del Lenguaje Natural, Revista no. 71, septiembre de 2023, pp. 321-328

  7. arXiv:2302.07912  [pdf, other]

    cs.CL

    Meeting the Needs of Low-Resource Languages: The Value of Automatic Alignments via Pretrained Models

    Authors: Abteen Ebrahimi, Arya D. McCarthy, Arturo Oncevay, Luis Chiruzzo, John E. Ortega, Gustavo A. Giménez-Lugo, Rolando Coto-Solano, Katharina Kann

    Abstract: Large multilingual models have inspired a new class of word alignment methods, which work well for the model's pretraining languages. However, the languages most in need of automatic alignment are low-resource and, thus, not typically included in the pretraining data. In this work, we ask: How do modern aligners perform on unseen languages, and are they better than traditional methods? We contribu…

    Submitted 15 February, 2023; originally announced February 2023.

    Comments: EACL 2023

  8. arXiv:2208.10898  [pdf, other]

    cs.CL cs.SI

    Don't Take it Personally: Analyzing Gender and Age Differences in Ratings of Online Humor

    Authors: J. A. Meaney, Steven R. Wilson, Luis Chiruzzo, Walid Magdy

    Abstract: Computational humor detection systems rarely model the subjectivity of humor responses, or consider alternative reactions to humor - namely offense. We analyzed a large dataset of humor and offense ratings by male and female annotators of different age groups. We find that women link these two concepts more strongly than men, and they tend to give lower humor ratings and higher offense scores. We…

    Submitted 23 August, 2022; originally announced August 2022.

  9. arXiv:2104.08726  [pdf, other]

    cs.CL

    AmericasNLI: Evaluating Zero-shot Natural Language Understanding of Pretrained Multilingual Models in Truly Low-resource Languages

    Authors: Abteen Ebrahimi, Manuel Mager, Arturo Oncevay, Vishrav Chaudhary, Luis Chiruzzo, Angela Fan, John Ortega, Ricardo Ramos, Annette Rios, Ivan Meza-Ruiz, Gustavo A. Giménez-Lugo, Elisabeth Mager, Graham Neubig, Alexis Palmer, Rolando Coto-Solano, Ngoc Thang Vu, Katharina Kann

    Abstract: Pretrained multilingual models are able to perform cross-lingual transfer in a zero-shot setting, even for languages unseen during pretraining. However, prior work evaluating performance on unseen languages has largely been limited to low-level, syntactic tasks, and it remains unclear if zero-shot learning of high-level, semantic tasks is possible for unseen languages. To explore this question, we…

    Submitted 16 March, 2022; v1 submitted 18 April, 2021; originally announced April 2021.

    Comments: Accepted to ACL 2022

  10. arXiv:1710.06393  [pdf, other]

    cs.CL

    RETUYT in TASS 2017: Sentiment Analysis for Spanish Tweets using SVM and CNN

    Authors: Aiala Rosá, Luis Chiruzzo, Mathias Etcheverry, Santiago Castro

    Abstract: This article presents classifiers based on SVM and Convolutional Neural Networks (CNN) for the TASS 2017 challenge on tweets sentiment analysis. The classifier with the best performance in general uses a combination of SVM and CNN. The use of word embeddings was particularly useful for improving the classifiers performance.

    Submitted 17 October, 2017; originally announced October 2017.

    Comments: In Spanish. Published at http://ceur-ws.org/Vol-1896/p9_retuyt_tass2017.pdf

    Journal ref: ISSN 1613-0073, TASS 2017: Workshop on Semantic Analysis at SEPLN, Sep 2017, pages 77-83

  11. arXiv:1710.00477  [pdf, other]

    cs.CL

    A Crowd-Annotated Spanish Corpus for Humor Analysis

    Authors: Santiago Castro, Luis Chiruzzo, Aiala Rosá, Diego Garat, Guillermo Moncecchi

    Abstract: Computational Humor involves several tasks, such as humor recognition, humor generation, and humor scoring, for which it is useful to have human-curated data. In this work we present a corpus of 27,000 tweets written in Spanish and crowd-annotated by their humor value and funniness score, with about four annotations per tweet, tagged by 1,300 people over the Internet. It is equally divided between…

    Submitted 19 July, 2018; v1 submitted 2 October, 2017; originally announced October 2017.

    Comments: Camera-ready version of the paper submitted to SocialNLP 2018, with a fixed typo