Skip to main content

Showing 1–18 of 18 results for author: Zanella, M

Searching in archive cs. Search in all archives.
.
  1. arXiv:2506.04005  [pdf, ps, other

    cs.CV

    Vocabulary-free few-shot learning for Vision-Language Models

    Authors: Maxime Zanella, Clément Fuchs, Ismail Ben Ayed, Christophe De Vleeschouwer

    Abstract: Recent advances in few-shot adaptation for Vision-Language Models (VLMs) have greatly expanded their ability to generalize across tasks using only a few labeled examples. However, existing approaches primarily build upon the strong zero-shot priors of these models by leveraging carefully designed, task-specific prompts. This dependence on predefined class names can restrict their applicability, es… ▽ More

    Submitted 4 June, 2025; originally announced June 2025.

    Comments: Accepted at CVPR Workshops 2025

  2. arXiv:2501.04352  [pdf, other

    cs.CV

    Online Gaussian Test-Time Adaptation of Vision-Language Models

    Authors: Clément Fuchs, Maxime Zanella, Christophe De Vleeschouwer

    Abstract: Online test-time adaptation (OTTA) of vision-language models (VLMs) has recently garnered increased attention to take advantage of data observed along a stream to improve future predictions. Unfortunately, existing methods rely on dataset-specific hyperparameters, significantly limiting their adaptability to unseen tasks. In response, we propose Online Gaussian Adaptation (OGA), a novel method tha… ▽ More

    Submitted 8 January, 2025; originally announced January 2025.

  3. arXiv:2501.03729  [pdf, other

    cs.CV

    Realistic Test-Time Adaptation of Vision-Language Models

    Authors: Maxime Zanella, Clément Fuchs, Christophe De Vleeschouwer, Ismail Ben Ayed

    Abstract: The zero-shot capabilities of Vision-Language Models (VLMs) have been widely leveraged to improve predictive performance. However, previous works on transductive or test-time adaptation (TTA) often make strong assumptions about the data distribution, such as the presence of all classes. Our work challenges these favorable deployment scenarios, and introduces a more realistic evaluation framework,… ▽ More

    Submitted 7 January, 2025; originally announced January 2025.

  4. arXiv:2411.14975  [pdf, other

    eess.IV cs.AI cs.CV q-bio.QM

    Exploring Foundation Models Fine-Tuning for Cytology Classification

    Authors: Manon Dausort, Tiffanie Godelaine, Maxime Zanella, Karim El Khoury, Isabelle Salmon, Benoît Macq

    Abstract: Cytology slides are essential tools in diagnosing and staging cancer, but their analysis is time-consuming and costly. Foundation models have shown great potential to assist in these tasks. In this paper, we explore how existing foundation models can be applied to cytological classification. More particularly, we focus on low-rank adaptation, a parameter-efficient fine-tuning method suited to few-… ▽ More

    Submitted 22 November, 2024; originally announced November 2024.

    Comments: 5 pages, 2 figures

  5. arXiv:2411.14827  [pdf, other

    cs.CV cs.AI cs.LG eess.IV

    Physically Interpretable Probabilistic Domain Characterization

    Authors: Anaïs Halin, Sébastien Piérard, Renaud Vandeghen, Benoît Gérin, Maxime Zanella, Martin Colot, Jan Held, Anthony Cioppa, Emmanuel Jean, Gianluca Bontempi, Saïd Mahmoudi, Benoît Macq, Marc Van Droogenbroeck

    Abstract: Characterizing domains is essential for models analyzing dynamic environments, as it allows them to adapt to evolving conditions or to hand the task over to backup systems when facing conditions outside their operational domain. Existing solutions typically characterize a domain by solving a regression or classification problem, which limits their applicability as they only provide a limited summa… ▽ More

    Submitted 22 November, 2024; originally announced November 2024.

  6. arXiv:2409.01883  [pdf, other

    cs.CV

    Boosting Vision-Language Models for Histopathology Classification: Predict all at once

    Authors: Maxime Zanella, Fereshteh Shakeri, Yunshi Huang, Houda Bahig, Ismail Ben Ayed

    Abstract: The development of vision-language models (VLMs) for histo-pathology has shown promising new usages and zero-shot performances. However, current approaches, which decompose large slides into smaller patches, focus solely on inductive classification, i.e., prediction for each patch is made independently of the other patches in the target test data. We extend the capability of these large models by… ▽ More

    Submitted 3 September, 2024; originally announced September 2024.

  7. arXiv:2409.00698  [pdf, other

    cs.CV

    Enhancing Remote Sensing Vision-Language Models for Zero-Shot Scene Classification

    Authors: Karim El Khoury, Maxime Zanella, Benoît Gérin, Tiffanie Godelaine, Benoît Macq, Saïd Mahmoudi, Christophe De Vleeschouwer, Ismail Ben Ayed

    Abstract: Vision-Language Models for remote sensing have shown promising uses thanks to their extensive pretraining. However, their conventional usage in zero-shot scene classification methods still involves dividing large images into patches and making independent predictions, i.e., inductive inference, thereby limiting their effectiveness by ignoring valuable contextual information. Our approach tackles t… ▽ More

    Submitted 7 January, 2025; v1 submitted 1 September, 2024; originally announced September 2024.

    Comments: Accepted at ICASSP 2025

  8. arXiv:2406.01837  [pdf, other

    cs.CV

    Boosting Vision-Language Models with Transduction

    Authors: Maxime Zanella, Benoît Gérin, Ismail Ben Ayed

    Abstract: Transduction is a powerful paradigm that leverages the structure of unlabeled data to boost predictive accuracy. We present TransCLIP, a novel and computationally efficient transductive approach designed for Vision-Language Models (VLMs). TransCLIP is applicable as a plug-and-play module on top of popular inductive zero- and few-shot models, consistently improving their performances. Our new objec… ▽ More

    Submitted 3 June, 2024; originally announced June 2024.

  9. arXiv:2405.18541  [pdf, other

    cs.CV

    Low-Rank Few-Shot Adaptation of Vision-Language Models

    Authors: Maxime Zanella, Ismail Ben Ayed

    Abstract: Recent progress in the few-shot adaptation of Vision-Language Models (VLMs) has further pushed their generalization capabilities, at the expense of just a few labeled samples within the target downstream task. However, this promising, already quite abundant few-shot literature has focused principally on prompt learning and, to a lesser extent, on adapters, overlooking the recent advances in Parame… ▽ More

    Submitted 1 June, 2024; v1 submitted 28 May, 2024; originally announced May 2024.

  10. arXiv:2405.02266  [pdf, other

    cs.CV

    On the test-time zero-shot generalization of vision-language models: Do we really need prompt learning?

    Authors: Maxime Zanella, Ismail Ben Ayed

    Abstract: The development of large vision-language models, notably CLIP, has catalyzed research into effective adaptation techniques, with a particular focus on soft prompt tuning. Conjointly, test-time augmentation, which utilizes multiple augmented views of a single image to enhance zero-shot generalization, is emerging as a significant area of interest. This has predominantly directed research efforts to… ▽ More

    Submitted 3 May, 2024; originally announced May 2024.

  11. arXiv:2211.10119  [pdf, other

    cs.CV

    Mixture Domain Adaptation to Improve Semantic Segmentation in Real-World Surveillance

    Authors: Sébastien Piérard, Anthony Cioppa, Anaïs Halin, Renaud Vandeghen, Maxime Zanella, Benoît Macq, Saïd Mahmoudi, Marc Van Droogenbroeck

    Abstract: Various tasks encountered in real-world surveillance can be addressed by determining posteriors (e.g. by Bayesian inference or machine learning), based on which critical decisions must be taken. However, the surveillance domain (acquisition device, operating conditions, etc.) is often unknown, which prevents any possibility of scene-specific optimization. In this paper, we define a probabilistic f… ▽ More

    Submitted 18 November, 2022; originally announced November 2022.

  12. arXiv:2211.05226  [pdf, other

    eess.IV cs.CV

    A kinetic approach to consensus-based segmentation of biomedical images

    Authors: Raffaella Fiamma Cabini, Anna Pichiecchio, Alessandro Lascialfari, Silvia Figini, Mattia Zanella

    Abstract: In this work, we apply a kinetic version of a bounded confidence consensus model to biomedical segmentation problems. In the presented approach, time-dependent information on the microscopic state of each particle/pixel includes its space position and a feature representing a static characteristic of the system, i.e. the gray level of each pixel. From the introduced microscopic model we derive a k… ▽ More

    Submitted 14 June, 2024; v1 submitted 8 November, 2022; originally announced November 2022.

    Comments: 29 pages, 13 figures

  13. arXiv:2210.12456  [pdf, other

    cs.LG

    Abstract Interpretation-Based Feature Importance for SVMs

    Authors: Abhinandan Pal, Francesco Ranzato, Caterina Urban, Marco Zanella

    Abstract: We propose a symbolic representation for support vector machines (SVMs) by means of abstract interpretation, a well-known and successful technique for designing and implementing static program analyses. We leverage this abstraction in two ways: (1) to enhance the interpretability of SVMs by deriving a novel feature importance measure, called abstract feature importance (AFI), that does not depend… ▽ More

    Submitted 22 October, 2022; originally announced October 2022.

  14. arXiv:2101.00909  [pdf, other

    cs.LG cs.CY cs.PL

    Fair Training of Decision Tree Classifiers

    Authors: Francesco Ranzato, Caterina Urban, Marco Zanella

    Abstract: We study the problem of formally verifying individual fairness of decision tree ensembles, as well as training tree models which maximize both accuracy and individual fairness. In our approach, fairness verification and fairness-aware training both rely on a notion of stability of a classification model, which is a variant of standard robustness under input perturbations used in adversarial machin… ▽ More

    Submitted 4 January, 2021; originally announced January 2021.

  15. arXiv:2012.11352  [pdf, other

    cs.LG

    Genetic Adversarial Training of Decision Trees

    Authors: Francesco Ranzato, Marco Zanella

    Abstract: We put forward a novel learning methodology for ensembles of decision trees based on a genetic algorithm which is able to train a decision tree for maximizing both its accuracy and its robustness to adversarial perturbations. This learning algorithm internally leverages a complete formal verification technique for robustness properties of decision trees based on abstract interpretation, a well kno… ▽ More

    Submitted 21 December, 2020; originally announced December 2020.

  16. arXiv:1904.11803  [pdf, other

    cs.LG stat.ML

    Robustness Verification of Support Vector Machines

    Authors: Francesco Ranzato, Marco Zanella

    Abstract: We study the problem of formally verifying the robustness to adversarial examples of support vector machines (SVMs), a major machine learning model for classification and regression tasks. Following a recent stream of works on formal robustness verification of (deep) neural networks, our approach relies on a sound abstract version of a given SVM classifier to be used for checking its robustness. T… ▽ More

    Submitted 26 April, 2019; originally announced April 2019.

  17. arXiv:1805.01892  [pdf, other

    physics.soc-ph cs.SI nlin.AO

    Opinion modeling on social media and marketing aspects

    Authors: G. Toscani, A. Tosin, M. Zanella

    Abstract: We introduce and discuss kinetic models of opinion formation on social networks in which the distribution function depends on both the opinion and the connectivity of the agents. The opinion formation model is subsequently coupled with a kinetic model describing the spreading of popularity of a product on the web through a social network. Numerical experiments on the underlying kinetic models show… ▽ More

    Submitted 4 May, 2018; originally announced May 2018.

    MSC Class: 35Q20; 35Q84; 35Q91; 82B21; 91D30

    Journal ref: Phys. Rev. E 98, 022315 (2018)

  18. arXiv:1604.00421  [pdf, other

    math.NA cs.SI nlin.AO

    Opinion dynamics over complex networks: kinetic modeling and numerical methods

    Authors: Giacomo Albi, Lorenzo Pareschi, Mattia Zanella

    Abstract: In this paper we consider the modeling of opinion dynamics over time dependent large scale networks. A kinetic description of the agents' distribution over the evolving network is considered which combines an opinion update based on binary interactions between agents with a dynamic creation and removal process of new connections. The number of connections of each agent influences the spreading of… ▽ More

    Submitted 1 April, 2016; originally announced April 2016.