Showing 1–5 of 5 results for author: Rodolà, E

  1. arXiv:2412.00081

    cs.LG stat.ML

    Task Singular Vectors: Reducing Task Interference in Model Merging

    Authors: Antonio Andrea Gargiulo, Donato Crisostomi, Maria Sofia Bucarelli, Simone Scardapane, Fabrizio Silvestri, Emanuele Rodolà

    Abstract: Task Arithmetic has emerged as a simple yet effective method to merge models without additional training. However, by treating entire networks as flat parameter vectors, it overlooks key structural information and is susceptible to task interference. In this paper, we study task vectors at the layer level, focusing on task layer matrices and their singular value decomposition. In particular, we co…

    Submitted 4 April, 2025; v1 submitted 26 November, 2024; originally announced December 2024.

    Comments: In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition 2025 (CVPR)

    ACM Class: I.5.1; I.4.2; I.2.10

  2. arXiv:2307.01037

    stat.ME cs.LG

    Vector Quantile Regression on Manifolds

    Authors: Marco Pegoraro, Sanketh Vedula, Aviv A. Rosenberg, Irene Tallini, Emanuele Rodolà, Alex M. Bronstein

    Abstract: Quantile regression (QR) is a statistical tool for distribution-free estimation of conditional quantiles of a target variable given explanatory features. QR is limited by the assumption that the target distribution is univariate and defined on a Euclidean domain. Although the notion of quantiles was recently extended to multi-variate distributions, QR for multi-variate distributions on manifolds…

    Submitted 7 February, 2024; v1 submitted 3 July, 2023; originally announced July 2023.

  3. arXiv:2206.04615

    cs.CL cs.AI cs.CY cs.LG stat.ML

    Beyond the Imitation Game: Quantifying and extrapolating the capabilities of language models

    Authors: Aarohi Srivastava, Abhinav Rastogi, Abhishek Rao, Abu Awal Md Shoeb, Abubakar Abid, Adam Fisch, Adam R. Brown, Adam Santoro, Aditya Gupta, Adrià Garriga-Alonso, Agnieszka Kluska, Aitor Lewkowycz, Akshat Agarwal, Alethea Power, Alex Ray, Alex Warstadt, Alexander W. Kocurek, Ali Safaya, Ali Tazarv, Alice Xiang, Alicia Parrish, Allen Nie, Aman Hussain, Amanda Askell, Amanda Dsouza, et al. (426 additional authors not shown)

    Abstract: Language models demonstrate both quantitative improvement and new qualitative capabilities with increasing scale. Despite their potentially transformative impact, these new capabilities are as yet poorly characterized. In order to inform future research, prepare for disruptive new model capabilities, and ameliorate socially harmful effects, it is vital that we understand the present and near-futur…

    Submitted 12 June, 2023; v1 submitted 9 June, 2022; originally announced June 2022.

    Comments: 27 pages, 17 figures + references and appendices, repo: https://github.com/google/BIG-bench

    Journal ref: Transactions on Machine Learning Research, May/2022, https://openreview.net/forum?id=uyTL5Bvosj

  4. arXiv:2005.14117

    eess.IV cs.LG stat.ML

    Multimodal Feature Fusion and Knowledge-Driven Learning via Experts Consult for Thyroid Nodule Classification

    Authors: Danilo Avola, Luigi Cinque, Alessio Fagioli, Sebastiano Filetti, Giorgio Grani, Emanuele Rodolà

    Abstract: Computer-aided diagnosis (CAD) is becoming a prominent approach to assist clinicians spanning across multiple fields. These automated systems take advantage of various computer vision (CV) procedures, as well as artificial intelligence (AI) techniques, to formulate a diagnosis of a given image, e.g., computed tomography and ultrasound. Advances in both areas (CV and AI) are enabling ever increasin…

    Submitted 25 October, 2021; v1 submitted 28 May, 2020; originally announced May 2020.

  5. arXiv:2003.12283

    cs.LG cs.CG cs.GR stat.ML

    LIMP: Learning Latent Shape Representations with Metric Preservation Priors

    Authors: Luca Cosmo, Antonio Norelli, Oshri Halimi, Ron Kimmel, Emanuele Rodolà

    Abstract: In this paper, we advocate the adoption of metric preservation as a powerful prior for learning latent representations of deformable 3D shapes. Key to our construction is the introduction of a geometric distortion criterion, defined directly on the decoded shapes, translating the preservation of the metric on the decoding to the formation of linear paths in the underlying latent space. Our rationa…

    Submitted 2 September, 2020; v1 submitted 27 March, 2020; originally announced March 2020.

    Comments: 24 pages (main article 14 + main bibliography 3 + supplementary 6 + supplementary bibliography 1)