Showing 1–4 of 4 results for author: Ramos, M M

  1. arXiv:2506.06275  [pdf, ps, other]

    cs.CV cs.CL cs.LG

    Movie Facts and Fibs (MF$^2$): A Benchmark for Long Movie Understanding

    Authors: Emmanouil Zaranis, António Farinhas, Saul Santos, Beatriz Canaverde, Miguel Moura Ramos, Aditya K Surikuchi, André Viveiros, Baohao Liao, Elena Bueno-Benito, Nithin Sivakumaran, Pavlo Vasylenko, Shoubin Yu, Sonal Sannigrahi, Wafaa Mohammed, Ben Peters, Danae Sánchez Villegas, Elias Stengel-Eskin, Giuseppe Attanasio, Jaehong Yoon, Stella Frank, Alessandro Suglia, Chrysoula Zerva, Desmond Elliott, Mariella Dimiccoli, Mohit Bansal, et al. (6 additional authors not shown)

    Abstract: Despite recent progress in vision-language models (VLMs), holistic understanding of long-form video content remains a significant challenge, partly due to limitations in current benchmarks. Many focus on peripheral, "needle-in-a-haystack" details, encouraging context-insensitive retrieval over deep comprehension. Others rely on large-scale, semi-automatically generated questions (often produced…

    Submitted 6 June, 2025; originally announced June 2025.

    Comments: Under Review

  2. arXiv:2504.12140  [pdf, other]

    cs.CL

    Multilingual Contextualization of Large Language Models for Document-Level Machine Translation

    Authors: Miguel Moura Ramos, Patrick Fernandes, Sweta Agrawal, André F. T. Martins

    Abstract: Large language models (LLMs) have demonstrated strong performance in sentence-level machine translation, but scaling to document-level translation remains challenging, particularly in modeling long-range dependencies and discourse phenomena across sentences and paragraphs. In this work, we propose a method to improve LLM-based long-document translation through targeted fine-tuning on high-quality…

    Submitted 16 April, 2025; originally announced April 2025.

    Comments: 9 pages, work-in-progress

  3. arXiv:2411.05986  [pdf, other]

    cs.CL

    Fine-Grained Reward Optimization for Machine Translation using Error Severity Mappings

    Authors: Miguel Moura Ramos, Tomás Almeida, Daniel Vareta, Filipe Azevedo, Sweta Agrawal, Patrick Fernandes, André F. T. Martins

    Abstract: Reinforcement learning (RL) has been proven to be an effective and robust method for training neural machine translation systems, especially when paired with powerful reward models that accurately assess translation quality. However, most research has focused on RL methods that use sentence-level feedback, leading to inefficient learning signals due to the reward sparsity problem -- the model rece…

    Submitted 16 April, 2025; v1 submitted 8 November, 2024; originally announced November 2024.

    Comments: 12 pages, work-in-progress

  4. arXiv:2311.09132  [pdf, other]

    cs.CL

    Aligning Neural Machine Translation Models: Human Feedback in Training and Inference

    Authors: Miguel Moura Ramos, Patrick Fernandes, António Farinhas, André F. T. Martins

    Abstract: Reinforcement learning from human feedback (RLHF) is a recent technique to improve the quality of the text generated by a language model, making it closer to what humans would generate. A core ingredient in RLHF's success in aligning and improving large language models (LLMs) is its reward model, trained using human feedback on model outputs. In machine translation (MT), where metrics trained from…

    Submitted 4 July, 2024; v1 submitted 15 November, 2023; originally announced November 2023.

    Comments: EAMT 2024