Skip to main content

Showing 1–30 of 30 results for author: Volpi, R

Searching in archive cs. Search in all archives.
.
  1. arXiv:2506.00333  [pdf, ps, other

    cs.CV

    Test-time Vocabulary Adaptation for Language-driven Object Detection

    Authors: Mingxuan Liu, Tyler L. Hayes, Massimiliano Mancini, Elisa Ricci, Riccardo Volpi, Gabriela Csurka

    Abstract: Open-vocabulary object detection models allow users to freely specify a class vocabulary in natural language at test time, guiding the detection of desired objects. However, vocabularies can be overly broad or even mis-specified, hampering the overall performance of the detector. In this work, we propose a plug-and-play Vocabulary Adapter (VocAda) to refine the user-defined vocabulary, automatical… ▽ More

    Submitted 30 May, 2025; originally announced June 2025.

    Comments: Accepted as a conference paper at ICIP 2025

  2. arXiv:2408.04471  [pdf, other

    cs.CV

    What could go wrong? Discovering and describing failure modes in computer vision

    Authors: Gabriela Csurka, Tyler L. Hayes, Diane Larlus, Riccardo Volpi

    Abstract: Deep learning models are effective, yet brittle. Even carefully trained, their behavior tends to be hard to predict when confronted with out-of-distribution samples. In this work, our goal is to propose a simple yet effective solution to predict and describe via natural language potential failure modes of computer vision models. Given a pretrained model and a set of samples, our aim is to find sen… ▽ More

    Submitted 24 September, 2024; v1 submitted 8 August, 2024; originally announced August 2024.

    Comments: Preprint of the eXCV Workshop paper (ECCV'24)

  3. arXiv:2405.10053  [pdf, other

    cs.CV

    SHiNe: Semantic Hierarchy Nexus for Open-vocabulary Object Detection

    Authors: Mingxuan Liu, Tyler L. Hayes, Elisa Ricci, Gabriela Csurka, Riccardo Volpi

    Abstract: Open-vocabulary object detection (OvOD) has transformed detection into a language-guided task, empowering users to freely define their class vocabularies of interest during inference. However, our initial investigation indicates that existing OvOD detectors exhibit significant variability when dealing with vocabularies across various semantic granularities, posing a concern for real-world deployme… ▽ More

    Submitted 16 May, 2024; originally announced May 2024.

    Comments: Accepted as a conference paper (highlight) at CVPR 2024

  4. arXiv:2402.17420  [pdf, other

    cs.CV cs.AI

    PANDAS: Prototype-based Novel Class Discovery and Detection

    Authors: Tyler L. Hayes, César R. de Souza, Namil Kim, Jiwon Kim, Riccardo Volpi, Diane Larlus

    Abstract: Object detectors are typically trained once and for all on a fixed set of classes. However, this closed-world assumption is unrealistic in practice, as new classes will inevitably emerge after the detector is deployed in the wild. In this work, we look at ways to extend a detector trained for a set of base classes so it can i) spot the presence of novel classes, and ii) automatically enrich its re… ▽ More

    Submitted 30 April, 2024; v1 submitted 27 February, 2024; originally announced February 2024.

    Comments: Accepted to the Conference on Lifelong Learning Agents (CoLLAs 2024)

  5. arXiv:2402.16392  [pdf, other

    cs.CV

    Placing Objects in Context via Inpainting for Out-of-distribution Segmentation

    Authors: Pau de Jorge, Riccardo Volpi, Puneet K. Dokania, Philip H. S. Torr, Gregory Rogez

    Abstract: When deploying a semantic segmentation model into the real world, it will inevitably encounter semantic classes that were not seen during training. To ensure a safe deployment of such systems, it is crucial to accurately evaluate and improve their anomaly segmentation capabilities. However, acquiring and labelling semantic segmentation data is expensive and unanticipated conditions are long-tail a… ▽ More

    Submitted 12 July, 2024; v1 submitted 26 February, 2024; originally announced February 2024.

    Comments: Accepted to ECCV 2024

  6. arXiv:2305.19879  [pdf, other

    cs.CV

    RaSP: Relation-aware Semantic Prior for Weakly Supervised Incremental Segmentation

    Authors: Subhankar Roy, Riccardo Volpi, Gabriela Csurka, Diane Larlus

    Abstract: Class-incremental semantic image segmentation assumes multiple model updates, each enriching the model to segment new categories. This is typically carried out by providing expensive pixel-level annotations to the training algorithm for all new objects, limiting the adoption of such methods in practical applications. Approaches that solely require image-level labels offer an attractive alternative… ▽ More

    Submitted 31 May, 2023; originally announced May 2023.

    Comments: Accepted to CoLLAs 2023

  7. arXiv:2303.11298  [pdf, other

    cs.CV

    Reliability in Semantic Segmentation: Are We on the Right Track?

    Authors: Pau de Jorge, Riccardo Volpi, Philip Torr, Gregory Rogez

    Abstract: Motivated by the increasing popularity of transformers in computer vision, in recent times there has been a rapid development of novel architectures. While in-domain performance follows a constant, upward trend, properties like robustness or uncertainty estimation are less explored -leaving doubts about advances in model reliability. Studies along these axes exist, but they are mainly limited to c… ▽ More

    Submitted 20 March, 2023; originally announced March 2023.

    Comments: Accepted to CVPR 2023

  8. arXiv:2302.06378  [pdf, other

    cs.CV

    Semantic Image Segmentation: Two Decades of Research

    Authors: Gabriela Csurka, Riccardo Volpi, Boris Chidlovskii

    Abstract: Semantic image segmentation (SiS) plays a fundamental role in a broad variety of computer vision applications, providing key information for the global understanding of an image. This survey is an effort to summarize two decades of research in the field of SiS, where we propose a literature review of solutions starting from early historical methods followed by an overview of more recent deep learn… ▽ More

    Submitted 13 February, 2023; originally announced February 2023.

    Comments: Pre-print of the book: G. Csurka, R. Volpi and B. Chidlovski: Semantic Image Segmentation: Two Decades of Research, FTCGV (14): No. 1-2, http://dx.doi.org/10.1561/0600000095. The authors retained the copyright and are allowed to post it on arXiv. Research only use, commercial use or systematic downloading (by robots or other automatic processes) is prohibited

  9. arXiv:2203.16195  [pdf, other

    cs.CV

    On the Road to Online Adaptation for Semantic Image Segmentation

    Authors: Riccardo Volpi, Pau de Jorge, Diane Larlus, Gabriela Csurka

    Abstract: We propose a new problem formulation and a corresponding evaluation framework to advance research on unsupervised domain adaptation for semantic image segmentation. The overall goal is fostering the development of adaptive learning systems that will continuously learn, without supervision, in ever-changing environments. Typical protocols that study adaptation algorithms for segmentation models are… ▽ More

    Submitted 30 March, 2022; originally announced March 2022.

    Comments: Accepted to CVPR 2022 (camera ready)

  10. arXiv:2202.01181  [pdf, other

    cs.LG cs.CV

    Make Some Noise: Reliable and Efficient Single-Step Adversarial Training

    Authors: Pau de Jorge, Adel Bibi, Riccardo Volpi, Amartya Sanyal, Philip H. S. Torr, Grégory Rogez, Puneet K. Dokania

    Abstract: Recently, Wong et al. showed that adversarial training with single-step FGSM leads to a characteristic failure mode named Catastrophic Overfitting (CO), in which a model becomes suddenly vulnerable to multi-step attacks. Experimentally they showed that simply adding a random perturbation prior to FGSM (RS-FGSM) could prevent CO. However, Andriushchenko and Flammarion observed that RS-FGSM still le… ▽ More

    Submitted 17 October, 2022; v1 submitted 2 February, 2022; originally announced February 2022.

    Comments: Published in NeurIPS 2022

  11. arXiv:2112.03241  [pdf, other

    cs.CV cs.AI

    Unsupervised Domain Adaptation for Semantic Image Segmentation: a Comprehensive Survey

    Authors: Gabriela Csurka, Riccardo Volpi, Boris Chidlovskii

    Abstract: Semantic segmentation plays a fundamental role in a broad variety of computer vision applications, providing key information for the global understanding of an image. Yet, the state-of-the-art models rely on large amount of annotated samples, which are more expensive to obtain than in tasks such as image classification. Since unlabelled data is instead significantly cheaper to obtain, it is not su… ▽ More

    Submitted 6 December, 2021; originally announced December 2021.

    Comments: 33 pages

    ACM Class: I.4.6; I.2

  12. arXiv:2102.12289  [pdf, other

    cs.SD cs.LG eess.AS

    Automatic Feature Extraction for Heartbeat Anomaly Detection

    Authors: Robert-George Colt, Csongor-Huba Várady, Riccardo Volpi, Luigi Malagò

    Abstract: We focus on automatic feature extraction for raw audio heartbeat sounds, aimed at anomaly detection applications in healthcare. We learn features with the help of an autoencoder composed by a 1D non-causal convolutional encoder and a WaveNet decoder trained with a modified objective based on variational inference, employing the Maximum Mean Discrepancy (MMD). Moreover we model the latent distribut… ▽ More

    Submitted 24 February, 2021; originally announced February 2021.

    Comments: 7 pages, 2 figures, Presented at PharML 2020 Workshop - European Conference on Machine Learning and Principles and Practice of Knowledge Discovery in Databases (ECML-PKDD), see https://sites.google.com/view/pharml2020/accepted-submissions, source-code: https://github.com/rist-ro/argo

    MSC Class: 68T07

  13. arXiv:2012.04324  [pdf, other

    cs.CV cs.AI cs.LG

    Continual Adaptation of Visual Representations via Domain Randomization and Meta-learning

    Authors: Riccardo Volpi, Diane Larlus, Grégory Rogez

    Abstract: Most standard learning approaches lead to fragile models which are prone to drift when sequentially trained on samples of a different nature - the well-known "catastrophic forgetting" issue. In particular, when a model consecutively learns from different visual domains, it tends to forget the past domains in favor of the most recent ones. In this context, we show that one way to learn models that… ▽ More

    Submitted 8 April, 2021; v1 submitted 8 December, 2020; originally announced December 2020.

    Comments: Accepted to CVPR 2021

  14. arXiv:2011.14276  [pdf, other

    astro-ph.CO cs.AI cs.LG

    Accelerating MCMC algorithms through Bayesian Deep Networks

    Authors: Hector J. Hortua, Riccardo Volpi, Dimitri Marinelli, Luigi Malago

    Abstract: Markov Chain Monte Carlo (MCMC) algorithms are commonly used for their versatility in sampling from complicated probability distributions. However, as the dimension of the distribution gets larger, the computational costs for a satisfactory exploration of the sampling space become challenging. Adaptive MCMC methods employing a choice of proposal distribution can address this issue speeding up the… ▽ More

    Submitted 28 November, 2020; originally announced November 2020.

    Comments: Accepted in the Third Workshop on Machine Learning and the Physical Sciences, NeurIPS 2020, Vancouver, Canada. Text overlap with arXiv:1911.08508v3

  15. arXiv:2008.06729  [pdf, other

    cs.LG stat.ML

    Reliable Uncertainties for Bayesian Neural Networks using Alpha-divergences

    Authors: Hector J. Hortua, Luigi Malago, Riccardo Volpi

    Abstract: Bayesian Neural Networks (BNNs) often result uncalibrated after training, usually tending towards overconfidence. Devising effective calibration methods with low impact in terms of computational complexity is thus of central interest. In this paper we present calibration methods for BNNs based on the alpha divergences from Information Geometry. We compare the use of alpha divergence in training an… ▽ More

    Submitted 15 August, 2020; originally announced August 2020.

    Comments: Accepted at the ICML 2020: Workshop on Uncertainty and Robustness in Deep Learning

  16. arXiv:2008.06687  [pdf, other

    cs.LG stat.ML

    Natural Reweighted Wake-Sleep

    Authors: Csongor Várady, Riccardo Volpi, Luigi Malagò, Nihat Ay

    Abstract: Helmholtz Machines (HMs) are a class of generative models composed of two Sigmoid Belief Networks (SBNs), acting respectively as an encoder and a decoder. These models are commonly trained using a two-step optimization algorithm called Wake-Sleep (WS) and more recently by improved versions, such as Reweighted Wake-Sleep (RWS) and Bidirectional Helmholtz Machines (BiHM). The locality of the connect… ▽ More

    Submitted 14 September, 2022; v1 submitted 15 August, 2020; originally announced August 2020.

    Comments: 41 pages, 18 figures, to be published in Neural Networks Journal

    MSC Class: 68T07

  17. arXiv:2005.07694  [pdf, other

    astro-ph.CO cs.LG

    Constraining the Reionization History using Bayesian Normalizing Flows

    Authors: Héctor J. Hortúa, Luigi Malago, Riccardo Volpi

    Abstract: The next generation 21 cm surveys open a new window onto the early stages of cosmic structure formation and provide new insights about the Epoch of Reionization (EoR). However, the non-Gaussian nature of the 21 cm signal along with the huge amount of data generated from these surveys will require more advanced techniques capable to efficiently extract the necessary information to constrain the Rei… ▽ More

    Submitted 14 May, 2020; originally announced May 2020.

    Comments: 17 pages, 5 figures. arXiv admin note: substantial text overlap with arXiv:2005.02299

    Journal ref: Mach. Learn.: Sci. Technol. 1 035014, 2020

  18. arXiv:2005.02299  [pdf, other

    astro-ph.CO cs.LG eess.SP

    Parameters Estimation from the 21 cm signal using Variational Inference

    Authors: Héctor J. Hortúa, Riccardo Volpi, Luigi Malagò

    Abstract: Upcoming experiments such as Hydrogen Epoch of Reionization Array (HERA) and Square Kilometre Array (SKA) are intended to measure the 21cm signal over a wide range of redshifts, representing an incredible opportunity in advancing our understanding about the nature of cosmic Reionization. At the same time these kind of experiments will present new challenges in processing the extensive amount of da… ▽ More

    Submitted 4 May, 2020; originally announced May 2020.

    Comments: Presented at ICLR 2020 Workshop on Fundamental Science in the era of AI

  19. arXiv:2003.06498  [pdf, other

    cs.CV

    Explainable Deep Classification Models for Domain Generalization

    Authors: Andrea Zunino, Sarah Adel Bargal, Riccardo Volpi, Mehrnoosh Sameki, Jianming Zhang, Stan Sclaroff, Vittorio Murino, Kate Saenko

    Abstract: Conventionally, AI models are thought to trade off explainability for lower accuracy. We develop a training strategy that not only leads to a more explainable AI system for object classification, but as a consequence, suffers no perceptible accuracy degradation. Explanations are defined as regions of visual evidence upon which a deep classification network makes a decision. This is represented in… ▽ More

    Submitted 13 March, 2020; originally announced March 2020.

  20. arXiv:2003.06430  [pdf, other

    cs.CV

    Learning Unbiased Representations via Mutual Information Backpropagation

    Authors: Ruggero Ragonesi, Riccardo Volpi, Jacopo Cavazza, Vittorio Murino

    Abstract: We are interested in learning data-driven representations that can generalize well, even when trained on inherently biased data. In particular, we face the case where some attributes (bias) of the data, if learned by the model, can severely compromise its generalization properties. We tackle this problem through the lens of information theory, leveraging recent findings for a differentiable estima… ▽ More

    Submitted 13 March, 2020; originally announced March 2020.

    Comments: Code publicly available at https://github.com/rugrag/learn-unbiased

  21. arXiv:2001.02950  [pdf, other

    cs.CV

    Generative Pseudo-label Refinement for Unsupervised Domain Adaptation

    Authors: Pietro Morerio, Riccardo Volpi, Ruggero Ragonesi, Vittorio Murino

    Abstract: We investigate and characterize the inherent resilience of conditional Generative Adversarial Networks (cGANs) against noise in their conditioning labels, and exploit this fact in the context of Unsupervised Domain Adaptation (UDA). In UDA, a classifier trained on the labelled source set can be used to infer pseudo-labels on the unlabelled target set. However, this will result in a significant amo… ▽ More

    Submitted 9 January, 2020; originally announced January 2020.

  22. arXiv:1912.02280  [pdf, other

    cs.LG stat.ML

    Natural Alpha Embeddings

    Authors: Riccardo Volpi, Luigi Malagò

    Abstract: Learning an embedding for a large collection of items is a popular approach to overcome the computational limitations associated to one-hot encodings. The aim of item embedding is to learn a low dimensional space for the representations, able to capture with its geometry relevant features or relationships for the data at hand. This can be achieved for example by exploiting adjacencies among items… ▽ More

    Submitted 24 December, 2019; v1 submitted 4 December, 2019; originally announced December 2019.

  23. Parameters Estimation for the Cosmic Microwave Background with Bayesian Neural Networks

    Authors: Hector J. Hortua, Riccardo Volpi, Dimitri Marinelli, Luigi Malagò

    Abstract: In this paper, we present the first study that compares different models of Bayesian Neural Networks (BNNs) to predict the posterior distribution of the cosmological parameters directly from the Cosmic Microwave Background temperature and polarization maps. We focus our analysis on four different methods to sample the weights of the network during training: Dropout, DropConnect, Reparameterization… ▽ More

    Submitted 30 October, 2020; v1 submitted 19 November, 2019; originally announced November 2019.

    Comments: 24 pages with 19 figures

    Journal ref: Phys. Rev. D 102, 103509 (2020)

  24. arXiv:1903.11900  [pdf, other

    cs.LG cs.CV stat.ML

    Addressing Model Vulnerability to Distributional Shifts over Image Transformation Sets

    Authors: Riccardo Volpi, Vittorio Murino

    Abstract: We are concerned with the vulnerability of computer vision models to distributional shifts. We formulate a combinatorial optimization problem that allows evaluating the regions in the image space where a given model is more vulnerable, in terms of image transformations applied to the input, and face it with standard search algorithms. We further embed this idea in a training procedure, where we de… ▽ More

    Submitted 20 August, 2019; v1 submitted 28 March, 2019; originally announced March 2019.

    Comments: ICCV 2019 (camera ready)

  25. arXiv:1807.01889  [pdf, other

    cs.LG stat.ML

    Learning in Variational Autoencoders with Kullback-Leibler and Renyi Integral Bounds

    Authors: Septimia Sârbu, Riccardo Volpi, Alexandra Peşte, Luigi Malagò

    Abstract: In this paper we propose two novel bounds for the log-likelihood based on Kullback-Leibler and the Rényi divergences, which can be used for variational inference and in particular for the training of Variational AutoEncoders. Our proposal is motivated by the difficulties encountered in training VAEs on continuous datasets with high contrast images, such as those with handwritten digits and charact… ▽ More

    Submitted 5 July, 2018; originally announced July 2018.

    Comments: accepted at the ICML 2018 workshop on Theoretical Foundations and Applications of Deep Generative Models, Stockholm, Sweden, 2018

  26. arXiv:1805.12018  [pdf, other

    cs.CV

    Generalizing to Unseen Domains via Adversarial Data Augmentation

    Authors: Riccardo Volpi, Hongseok Namkoong, Ozan Sener, John Duchi, Vittorio Murino, Silvio Savarese

    Abstract: We are concerned with learning models that generalize well to different \emph{unseen} domains. We consider a worst-case formulation over data distributions that are near the source domain in the feature space. Only using training data from a single source distribution, we propose an iterative procedure that augments the dataset with examples from a fictitious target domain that is "hard" under the… ▽ More

    Submitted 6 November, 2018; v1 submitted 30 May, 2018; originally announced May 2018.

    Comments: Accepted to NIPS 2018 (camera ready)

  27. arXiv:1711.08561  [pdf, other

    cs.CV

    Adversarial Feature Augmentation for Unsupervised Domain Adaptation

    Authors: Riccardo Volpi, Pietro Morerio, Silvio Savarese, Vittorio Murino

    Abstract: Recent works showed that Generative Adversarial Networks (GANs) can be successfully applied in unsupervised domain adaptation, where, given a labeled source dataset and an unlabeled target dataset, the goal is to train powerful classifiers for the target samples. In particular, it was shown that a GAN objective function can be used to learn target features indistinguishable from the source ones. I… ▽ More

    Submitted 4 May, 2018; v1 submitted 22 November, 2017; originally announced November 2017.

    Comments: Accepted to CVPR 2018

  28. arXiv:1710.10571  [pdf, ps, other

    stat.ML cs.LG

    Certifying Some Distributional Robustness with Principled Adversarial Training

    Authors: Aman Sinha, Hongseok Namkoong, Riccardo Volpi, John Duchi

    Abstract: Neural networks are vulnerable to adversarial examples and researchers have proposed many heuristic attack and defense mechanisms. We address this problem through the principled lens of distributionally robust optimization, which guarantees performance under adversarial input perturbations. By considering a Lagrangian penalty formulation of perturbing the underlying data distribution in a Wasserst… ▽ More

    Submitted 1 May, 2020; v1 submitted 29 October, 2017; originally announced October 2017.

    Comments: ICLR 2018: https://openreview.net/forum?id=Hk6kPgZA-

  29. arXiv:1703.06229  [pdf, other

    cs.NE cs.LG stat.ML

    Curriculum Dropout

    Authors: Pietro Morerio, Jacopo Cavazza, Riccardo Volpi, Rene Vidal, Vittorio Murino

    Abstract: Dropout is a very effective way of regularizing neural networks. Stochastically "dropping out" units with a certain probability discourages over-specific co-adaptations of feature detectors, preventing overfitting and improving network generalization. Besides, Dropout can be interpreted as an approximate model aggregation technique, where an exponential number of smaller networks are averaged in o… ▽ More

    Submitted 3 August, 2017; v1 submitted 17 March, 2017; originally announced March 2017.

    Comments: Accepted at ICCV (International Conference on Computer Vision) 2017

  30. arXiv:1701.02898  [pdf, other

    cs.CV q-bio.NC

    Modeling Retinal Ganglion Cell Population Activity with Restricted Boltzmann Machines

    Authors: Matteo Zanotto, Riccardo Volpi, Alessandro Maccione, Luca Berdondini, Diego Sona, Vittorio Murino

    Abstract: The retina is a complex nervous system which encodes visual stimuli before higher order processing occurs in the visual cortex. In this study we evaluated whether information about the stimuli received by the retina can be retrieved from the firing rate distribution of Retinal Ganglion Cells (RGCs), exploiting High-Density 64x64 MEA technology. To this end, we modeled the RGC population activity u… ▽ More

    Submitted 17 January, 2017; v1 submitted 11 January, 2017; originally announced January 2017.