Skip to main content

Showing 1–9 of 9 results for author: Sorodoc, I

.
  1. arXiv:2506.12103  [pdf, other

    cs.AI cs.CY cs.LG

    The Amazon Nova Family of Models: Technical Report and Model Card

    Authors: Amazon AGI, Aaron Langford, Aayush Shah, Abhanshu Gupta, Abhimanyu Bhatter, Abhinav Goyal, Abhinav Mathur, Abhinav Mohanty, Abhishek Kumar, Abhishek Sethi, Abi Komma, Abner Pena, Achin Jain, Adam Kunysz, Adam Opyrchal, Adarsh Singh, Aditya Rawal, Adok Achar Budihal Prasad, Adrià de Gispert, Agnika Kumar, Aishwarya Aryamane, Ajay Nair, Akilan M, Akshaya Iyengar, Akshaya Vishnu Kudlu Shanbhogue , et al. (761 additional authors not shown)

    Abstract: We present Amazon Nova, a new generation of state-of-the-art foundation models that deliver frontier intelligence and industry-leading price performance. Amazon Nova Pro is a highly-capable multimodal model with the best combination of accuracy, speed, and cost for a wide range of tasks. Amazon Nova Lite is a low-cost multimodal model that is lightning fast for processing images, video, documents… ▽ More

    Submitted 17 March, 2025; originally announced June 2025.

    Comments: 48 pages, 10 figures

    Report number: 20250317

  2. arXiv:2506.07671  [pdf, ps, other

    cs.CL cs.AI

    GaRAGe: A Benchmark with Grounding Annotations for RAG Evaluation

    Authors: Ionut-Teodor Sorodoc, Leonardo F. R. Ribeiro, Rexhina Blloshmi, Christopher Davis, Adrià de Gispert

    Abstract: We present GaRAGe, a large RAG benchmark with human-curated long-form answers and annotations of each grounding passage, allowing a fine-grained evaluation of whether LLMs can identify relevant grounding when generating RAG answers. Our benchmark contains 2366 questions of diverse complexity, dynamism, and topics, and includes over 35K annotated passages retrieved from both private document sets a… ▽ More

    Submitted 9 June, 2025; originally announced June 2025.

    Comments: ACL 2025 (Findings)

  3. arXiv:2410.08623  [pdf, other

    cs.CL cs.IR

    Retrieving Contextual Information for Long-Form Question Answering using Weak Supervision

    Authors: Philipp Christmann, Svitlana Vakulenko, Ionut Teodor Sorodoc, Bill Byrne, Adrià de Gispert

    Abstract: Long-form question answering (LFQA) aims at generating in-depth answers to end-user questions, providing relevant information beyond the direct answer. However, existing retrievers are typically optimized towards information that directly targets the question, missing out on such contextual information. Furthermore, there is a lack of training data for relevant context. To this end, we propose and… ▽ More

    Submitted 11 October, 2024; originally announced October 2024.

    Comments: Accepted at EMNLP 2024 (Findings)

  4. arXiv:2004.03340  [pdf, other

    cs.CL cs.AI cs.LG

    Evaluating Online Continual Learning with CALM

    Authors: Germán Kruszewski, Ionut-Teodor Sorodoc, Tomas Mikolov

    Abstract: Online Continual Learning (OCL) studies learning over a continuous data stream without observing any single example more than once, a setting that is closer to the experience of humans and systems that must learn "on-the-wild". Yet, commonly available benchmarks are far from these real-world conditions, because they explicitly signal different tasks, lack latent similarity structure or assume temp… ▽ More

    Submitted 1 February, 2021; v1 submitted 7 April, 2020; originally announced April 2020.

  5. arXiv:1911.02103  [pdf, other

    cs.CV cs.CL cs.MM

    Recurrent Instance Segmentation using Sequences of Referring Expressions

    Authors: Alba Herrera-Palacio, Carles Ventura, Carina Silberer, Ionut-Teodor Sorodoc, Gemma Boleda, Xavier Giro-i-Nieto

    Abstract: The goal of this work is to segment the objects in an image that are referred to by a sequence of linguistic descriptions (referring expressions). We propose a deep neural network with recurrent layers that output a sequence of binary masks, one for each referring expression provided by the user. The recurrent layers in the architecture allow the model to condition each predicted mask on the previ… ▽ More

    Submitted 5 November, 2019; originally announced November 2019.

    Comments: 3rd NeurIPS Workshop on Visually Grounded Interaction and Language (ViGIL, 2019)

  6. arXiv:1905.06649  [pdf, other

    cs.CL

    What do Entity-Centric Models Learn? Insights from Entity Linking in Multi-Party Dialogue

    Authors: Laura Aina, Carina Silberer, Matthijs Westera, Ionut-Teodor Sorodoc, Gemma Boleda

    Abstract: Humans use language to refer to entities in the external world. Motivated by this, in recent years several models that incorporate a bias towards learning entity representations have been proposed. Such entity-centric models have shown empirical success, but we still know little about why. In this paper we analyze the behavior of two recently proposed entity-centric models in a referential task, E… ▽ More

    Submitted 16 May, 2019; originally announced May 2019.

    Comments: To appear in Proceedings of NAACL 2019 Annual Conference of the North American Chapter of the Association for Computational Linguistics

  7. arXiv:1805.05370  [pdf, other

    cs.CL

    AMORE-UPF at SemEval-2018 Task 4: BiLSTM with Entity Library

    Authors: Laura Aina, Carina Silberer, Ionut-Teodor Sorodoc, Matthijs Westera, Gemma Boleda

    Abstract: This paper describes our winning contribution to SemEval 2018 Task 4: Character Identification on Multiparty Dialogues. It is a simple, standard model with one key innovation, an entity library. Our results show that this innovation greatly facilitates the identification of infrequent characters. Because of the generic nature of our model, this finding is potentially relevant to any task that requ… ▽ More

    Submitted 14 May, 2018; originally announced May 2018.

  8. arXiv:1804.05018  [pdf, other

    cs.CV cs.LG stat.ML

    Comparatives, Quantifiers, Proportions: A Multi-Task Model for the Learning of Quantities from Vision

    Authors: Sandro Pezzelle, Ionut-Teodor Sorodoc, Raffaella Bernardi

    Abstract: The present work investigates whether different quantification mechanisms (set comparison, vague quantification, and proportional estimation) can be jointly learned from visual scenes by a multi-task computational model. The motivation is that, in humans, these processes underlie the same cognitive, non-symbolic ability, which allows an automatic estimation and comparison of set magnitudes. We sho… ▽ More

    Submitted 13 April, 2018; originally announced April 2018.

    Comments: 12 pages (references included). To appear in the Proceedings of NAACL-HLT 2018

    MSC Class: 68T45

    Journal ref: Proceedings of NAACL-HLT 2018

  9. arXiv:1704.02923  [pdf, other

    cs.CL cs.AI cs.CV

    Pay Attention to Those Sets! Learning Quantification from Images

    Authors: Ionut Sorodoc, Sandro Pezzelle, Aurélie Herbelot, Mariella Dimiccoli, Raffaella Bernardi

    Abstract: Major advances have recently been made in merging language and vision representations. But most tasks considered so far have confined themselves to the processing of objects and lexicalised relations amongst objects (content words). We know, however, that humans (even pre-school children) can abstract over raw data to perform certain types of higher-level reasoning, expressed in natural language b… ▽ More

    Submitted 10 April, 2017; originally announced April 2017.

    Comments: Submitted to Journal Paper, 28 pages, 12 figures, 5 tables