Showing 1–9 of 9 results for author: Rakotonirina, N C

  1. arXiv:2502.13791

    cs.CL

    From Tools to Teammates: Evaluating LLMs in Multi-Session Coding Interactions

    Authors: Nathanaël Carraz Rakotonirina, Mohammed Hamdy, Jon Ander Campos, Lucas Weber, Alberto Testoni, Marzieh Fadaee, Sandro Pezzelle, Marco Del Tredici

    Abstract: Large Language Models (LLMs) are increasingly used in working environments for a wide range of tasks, excelling at solving individual problems in isolation. However, are they also able to effectively collaborate over long-term interactions? To investigate this, we introduce MemoryCode, a synthetic multi-session dataset designed to test LLMs' ability to track and execute simple coding instructions…

    Submitted 6 June, 2025; v1 submitted 19 February, 2025; originally announced February 2025.

    Comments: Published as conference paper at ACL 2025

  2. arXiv:2412.08127

    cs.CL cs.AI cs.LG

    Evil twins are not that evil: Qualitative insights into machine-generated prompts

    Authors: Nathanaël Carraz Rakotonirina, Corentin Kervadec, Francesca Franzon, Marco Baroni

    Abstract: It has been widely observed that language models (LMs) respond in predictable ways to algorithmically generated prompts that are seemingly unintelligible. This is both a sign that we lack a full understanding of how LMs work, and a practical challenge, because opaqueness can be exploited for harmful uses of LMs, such as jailbreaking. We present the first thorough analysis of opaque machine-generat…

    Submitted 31 March, 2025; v1 submitted 11 December, 2024; originally announced December 2024.

  3. arXiv:2402.15268

    cs.CL cs.AI cs.LG

    MemoryPrompt: A Light Wrapper to Improve Context Tracking in Pre-trained Language Models

    Authors: Nathanaël Carraz Rakotonirina, Marco Baroni

    Abstract: Transformer-based language models (LMs) track contextual information through large, hard-coded input windows. We introduce MemoryPrompt, a leaner approach in which the LM is complemented by a small auxiliary recurrent network that passes information to the LM by prefixing its regular input with a sequence of vectors, akin to soft prompts, without requiring LM finetuning. Tested on a task designed…

    Submitted 23 February, 2024; originally announced February 2024.

    Comments: Published as conference paper at LREC-COLING 2024

  4. arXiv:2306.00410

    cs.CL cs.SD eess.AS

    Towards hate speech detection in low-resource languages: Comparing ASR to acoustic word embeddings on Wolof and Swahili

    Authors: Christiaan Jacobs, Nathanaël Carraz Rakotonirina, Everlyn Asiko Chimoto, Bruce A. Bassett, Herman Kamper

    Abstract: We consider hate speech detection through keyword spotting on radio broadcasts. One approach is to build an automatic speech recognition (ASR) system for the target low-resource language. We compare this to using acoustic word embedding (AWE) models that map speech segments to a space where matching words have similar vectors. We specifically use a multilingual AWE model trained on labelled data f…

    Submitted 1 June, 2023; originally announced June 2023.

    Comments: Accepted to Interspeech 2023

  5. arXiv:2304.01662

    cs.CV cs.AI cs.CL

    Cross-Domain Image Captioning with Discriminative Finetuning

    Authors: Roberto Dessì, Michele Bevilacqua, Eleonora Gualdoni, Nathanael Carraz Rakotonirina, Francesca Franzon, Marco Baroni

    Abstract: Neural captioners are typically trained to mimic human-generated references without optimizing for any specific communication goal, leading to problems such as the generation of vague captions. In this paper, we show that fine-tuning an out-of-the-box neural captioner with a self-supervised discriminative communication objective helps to recover a plain, visually descriptive language that is more…

    Submitted 4 April, 2023; originally announced April 2023.

    Comments: CVPR 2023

  6. arXiv:2302.09865

    cs.CL cs.AI cs.LG

    Can discrete information extraction prompts generalize across language models?

    Authors: Nathanaël Carraz Rakotonirina, Roberto Dessì, Fabio Petroni, Sebastian Riedel, Marco Baroni

    Abstract: We study whether automatically-induced prompts that effectively extract information from a language model can also be used, out-of-the-box, to probe other language models for the same information. After confirming that discrete prompts induced with the AutoPrompt algorithm outperform manual and semi-manual prompts on the slot-filling task, we demonstrate a drop in performance for AutoPrompt prompt…

    Submitted 7 March, 2023; v1 submitted 20 February, 2023; originally announced February 2023.

    Comments: Published as conference paper at ICLR 2023

  7. arXiv:2108.11637

    cs.SD cs.LG eess.AS

    Self-Attention for Audio Super-Resolution

    Authors: Nathanaël Carraz Rakotonirina

    Abstract: Convolutions operate only locally, thus failing to model global interactions. Self-attention is, however, able to learn representations that capture long-range dependencies in sequences. We propose a network architecture for audio super-resolution that combines convolution and self-attention. Attention-based Feature-Wise Linear Modulation (AFiLM) uses self-attention mechanism instead of recurrent…

    Submitted 26 August, 2021; originally announced August 2021.

    Comments: MLSP 2021

  8. arXiv:2009.12177

    cs.CV cs.LG eess.IV

    Tarsier: Evolving Noise Injection in Super-Resolution GANs

    Authors: Baptiste Roziere, Nathanaël Carraz Rakotonirina, Vlad Hosu, Andry Rasoanaivo, Hanhe Lin, Camille Couprie, Olivier Teytaud

    Abstract: Super-resolution aims at increasing the resolution and level of detail within an image. The current state of the art in general single-image super-resolution is held by NESRGAN+, which injects a Gaussian noise after each residual layer at training time. In this paper, we harness evolutionary methods to improve NESRGAN+ by optimizing the noise injection at inference time. More precisely, we use Dia…

    Submitted 25 September, 2020; originally announced September 2020.

  9. ESRGAN+ : Further Improving Enhanced Super-Resolution Generative Adversarial Network

    Authors: Nathanaël Carraz Rakotonirina, Andry Rasoanaivo

    Abstract: Enhanced Super-Resolution Generative Adversarial Network (ESRGAN) is a perceptual-driven approach for single image super resolution that is able to produce photorealistic images. Despite the visual quality of these generated images, there is still room for improvement. In this fashion, the model is extended to further improve the perceptual quality of the images. We have designed a novel block to…

    Submitted 15 July, 2020; v1 submitted 21 January, 2020; originally announced January 2020.

    Comments: ICASSP 2020