Skip to main content

Showing 1–4 of 4 results for author: Poppi, T

Searching in archive cs. Search in all archives.
.
  1. arXiv:2505.15323  [pdf, other

    cs.CL

    Improving LLM First-Token Predictions in Multiple-Choice Question Answering via Prefilling Attack

    Authors: Silvia Cappelletti, Tobia Poppi, Samuele Poppi, Zheng-Xin Yong, Diego Garcia-Olano, Marcella Cornia, Lorenzo Baraldi, Rita Cucchiara

    Abstract: Large Language Models (LLMs) are increasingly evaluated on multiple-choice question answering (MCQA) tasks using *first-token probability* (FTP), which selects the answer option whose initial token has the highest likelihood. While efficient, FTP can be fragile: models may assign high probability to unrelated tokens (*misalignment*) or use a valid token merely as part of a generic preamble rather… ▽ More

    Submitted 21 May, 2025; originally announced May 2025.

    Comments: 13 pages, 5 figures, 7 tables

  2. arXiv:2503.12127  [pdf, other

    cs.CV cs.AI cs.CL cs.MM

    Hyperbolic Safety-Aware Vision-Language Models

    Authors: Tobia Poppi, Tejaswi Kasarla, Pascal Mettes, Lorenzo Baraldi, Rita Cucchiara

    Abstract: Addressing the retrieval of unsafe content from vision-language models such as CLIP is an important step towards real-world integration. Current efforts have relied on unlearning techniques that try to erase the model's knowledge of unsafe concepts. While effective in reducing unwanted outputs, unlearning limits the model's capacity to discern between safe and unsafe content. In this work, we intr… ▽ More

    Submitted 15 March, 2025; originally announced March 2025.

    Comments: CVPR 2025

  3. arXiv:2311.16254  [pdf, other

    cs.CV cs.AI cs.CL cs.MM

    Safe-CLIP: Removing NSFW Concepts from Vision-and-Language Models

    Authors: Samuele Poppi, Tobia Poppi, Federico Cocchi, Marcella Cornia, Lorenzo Baraldi, Rita Cucchiara

    Abstract: Large-scale vision-and-language models, such as CLIP, are typically trained on web-scale data, which can introduce inappropriate content and lead to the development of unsafe and biased behavior. This, in turn, hampers their applicability in sensitive and trustworthy contexts and could raise significant concerns in their adoption. Our research introduces a novel approach to enhancing the safety of… ▽ More

    Submitted 23 July, 2024; v1 submitted 27 November, 2023; originally announced November 2023.

    Comments: ECCV 2024

  4. arXiv:2304.08230  [pdf, other

    cs.CV cs.AI

    Uncovering the Background-Induced bias in RGB based 6-DoF Object Pose Estimation

    Authors: Elena Govi, Davide Sapienza, Carmelo Scribano, Tobia Poppi, Giorgia Franchini, Paola Ardòn, Micaela Verucchi, Marko Bertogna

    Abstract: In recent years, there has been a growing trend of using data-driven methods in industrial settings. These kinds of methods often process video images or parts, therefore the integrity of such images is crucial. Sometimes datasets, e.g. consisting of images, can be sophisticated for various reasons. It becomes critical to understand how the manipulation of video and images can impact the effective… ▽ More

    Submitted 17 April, 2023; originally announced April 2023.

    Comments: 17 pages, 10 figures, submitted to EURASIP Journal on Image and Video Processing

    ACM Class: I.2.10; I.4.0