Skip to main content

Showing 1–18 of 18 results for author: Moschella, L

Searching in archive cs. Search in all archives.
.
  1. arXiv:2505.22785  [pdf, ps, other

    cs.LG

    Navigating the Latent Space Dynamics of Neural Models

    Authors: Marco Fumero, Luca Moschella, Emanuele Rodolà, Francesco Locatello

    Abstract: Neural networks transform high-dimensional data into compact, structured representations, often modeled as elements of a lower dimensional latent space. In this paper, we present an alternative interpretation of neural models as dynamical systems acting on the latent manifold. Specifically, we show that autoencoder models implicitly define a latent vector field on the manifold, derived by iterativ… ▽ More

    Submitted 9 June, 2025; v1 submitted 28 May, 2025; originally announced May 2025.

  2. arXiv:2503.05283  [pdf, ps, other

    cs.CV

    Escaping Plato's Cave: Towards the Alignment of 3D and Text Latent Spaces

    Authors: Souhail Hadgi, Luca Moschella, Andrea Santilli, Diego Gomez, Qixing Huang, Emanuele Rodolà, Simone Melzi, Maks Ovsjanikov

    Abstract: Recent works have shown that, when trained at scale, uni-modal 2D vision and text encoders converge to learned features that share remarkable structural properties, despite arising from different representations. However, the role of 3D encoders with respect to other modalities remains unexplored. Furthermore, existing 3D foundation models that leverage large datasets are typically trained with ex… ▽ More

    Submitted 4 June, 2025; v1 submitted 7 March, 2025; originally announced March 2025.

    Comments: CVPR 2025

  3. arXiv:2503.01881  [pdf, other

    cs.LG cs.AI

    Mapping representations in Reinforcement Learning via Semantic Alignment for Zero-Shot Stitching

    Authors: Antonio Pio Ricciardi, Valentino Maiorca, Luca Moschella, Riccardo Marin, Emanuele Rodolà

    Abstract: Deep Reinforcement Learning (RL) models often fail to generalize when even small changes occur in the environment's observations or task requirements. Addressing these shifts typically requires costly retraining, limiting the reusability of learned policies. In this paper, we build on recent work in semantic alignment to propose a zero-shot method for mapping between latent spaces across different… ▽ More

    Submitted 26 February, 2025; originally announced March 2025.

    Comments: 11 pages, 3 figures, 2 tables

    MSC Class: 68T07 ACM Class: I.2.6

  4. arXiv:2406.15057  [pdf, other

    cs.LG

    Latent Space Translation via Inverse Relative Projection

    Authors: Valentino Maiorca, Luca Moschella, Marco Fumero, Francesco Locatello, Emanuele Rodolà

    Abstract: The emergence of similar representations between independently trained neural models has sparked significant interest in the representation learning community, leading to the development of various methods to obtain communication between latent spaces. "Latent space communication" can be achieved in two ways: i) by independently mapping the original spaces to a shared or relative one; ii) by direc… ▽ More

    Submitted 21 June, 2024; originally announced June 2024.

    Comments: arXiv admin note: text overlap with arXiv:2311.00664, arXiv:2406.11014

  5. arXiv:2406.11014  [pdf, other

    cs.LG cs.AI

    Latent Communication in Artificial Neural Networks

    Authors: Luca Moschella

    Abstract: As NNs permeate various scientific and industrial domains, understanding the universality and reusability of their representations becomes crucial. At their core, these networks create intermediate neural representations, indicated as latent spaces, of the input data and subsequently leverage them to perform specific downstream tasks. This dissertation focuses on the universality and reusability o… ▽ More

    Submitted 16 June, 2024; originally announced June 2024.

    Comments: Doctoral Thesis: https://iris.uniroma1.it/handle/11573/1711827

  6. arXiv:2404.12917  [pdf, other

    cs.LG cs.AI cs.CV

    R3L: Relative Representations for Reinforcement Learning

    Authors: Antonio Pio Ricciardi, Valentino Maiorca, Luca Moschella, Riccardo Marin, Emanuele Rodolà

    Abstract: Visual Reinforcement Learning is a popular and powerful framework that takes full advantage of the Deep Learning breakthrough. It is known that variations in input domains (e.g., different panorama colors due to seasonal changes) or task domains (e.g., altering the target speed of a car) can disrupt agent performance, necessitating new training for each variation. Recent advancements in the field… ▽ More

    Submitted 18 February, 2025; v1 submitted 19 April, 2024; originally announced April 2024.

    Comments: 12 pages, 5 figures, 7 tables

    MSC Class: 68T07 ACM Class: I.2.6

  7. arXiv:2311.06547  [pdf, other

    cs.LG

    From Charts to Atlas: Merging Latent Spaces into One

    Authors: Donato Crisostomi, Irene Cannistraci, Luca Moschella, Pietro Barbiero, Marco Ciccone, Pietro Liò, Emanuele Rodolà

    Abstract: Models trained on semantically related datasets and tasks exhibit comparable inter-sample relations within their latent spaces. We investigate in this study the aggregation of such latent spaces to create a unified space encompassing the combined information. To this end, we introduce Relative Latent Space Aggregation, a two-step approach that first renders the spaces comparable using relative rep… ▽ More

    Submitted 11 November, 2023; originally announced November 2023.

    Comments: To appear in the NeurReps workshop @ NeurIPS 2023

  8. arXiv:2311.00664  [pdf, other

    cs.LG

    Latent Space Translation via Semantic Alignment

    Authors: Valentino Maiorca, Luca Moschella, Antonio Norelli, Marco Fumero, Francesco Locatello, Emanuele Rodolà

    Abstract: While different neural models often exhibit latent spaces that are alike when exposed to semantically related data, this intrinsic similarity is not always immediately discernible. Towards a better understanding of this phenomenon, our work shows how representations learned from these neural modules can be translated between different pre-trained networks via simpler transformations than previousl… ▽ More

    Submitted 11 February, 2024; v1 submitted 1 November, 2023; originally announced November 2023.

    Comments: Accepted at NeurIPS 2023. 21 pages, 13 figures, 8 tables

  9. arXiv:2310.01211  [pdf, other

    cs.LG

    From Bricks to Bridges: Product of Invariances to Enhance Latent Space Communication

    Authors: Irene Cannistraci, Luca Moschella, Marco Fumero, Valentino Maiorca, Emanuele Rodolà

    Abstract: It has been observed that representations learned by distinct neural networks conceal structural similarities when the models are trained under similar inductive biases. From a geometric perspective, identifying the classes of transformations and the related invariances that connect these representations is fundamental to unlocking applications, such as merging, stitching, and reusing different ne… ▽ More

    Submitted 20 March, 2024; v1 submitted 2 October, 2023; originally announced October 2023.

    Comments: 41 pages, 14 figures and 31 tables

  10. arXiv:2303.00721  [pdf, other

    cs.LG cs.AI

    Bootstrapping Parallel Anchors for Relative Representations

    Authors: Irene Cannistraci, Luca Moschella, Valentino Maiorca, Marco Fumero, Antonio Norelli, Emanuele Rodolà

    Abstract: The use of relative representations for latent embeddings has shown potential in enabling latent space communication and zero-shot model stitching across a wide range of applications. Nevertheless, relative representations rely on a certain amount of parallel anchors to be given as input, which can be impractical to obtain in certain scenarios. To overcome this limitation, we propose an optimizati… ▽ More

    Submitted 1 June, 2023; v1 submitted 1 March, 2023; originally announced March 2023.

    Comments: 9 pages, 7 tables

    MSC Class: 68T07 ACM Class: I.2.6

  11. Latent Spectral Regularization for Continual Learning

    Authors: Emanuele Frascaroli, Riccardo Benaglia, Matteo Boschini, Luca Moschella, Cosimo Fiorini, Emanuele Rodolà, Simone Calderara

    Abstract: While biological intelligence grows organically as new knowledge is gathered throughout life, Artificial Neural Networks forget catastrophically whenever they face a changing training data distribution. Rehearsal-based Continual Learning (CL) approaches have been established as a versatile and reliable solution to overcome this limitation; however, sudden input disruptions and memory constraints a… ▽ More

    Submitted 16 July, 2024; v1 submitted 9 January, 2023; originally announced January 2023.

    Comments: 14 pages, 4 figures, , to appear in Pattern Recognition Letters, Volume 184, August 2024, Pages 119-125

    Journal ref: Pattern Recognition Letters, Volume 184, August 2024, Pages 119-125, ISSN 0167-8655

  12. arXiv:2210.01738  [pdf, other

    cs.LG cs.AI cs.CV

    ASIF: Coupled Data Turns Unimodal Models to Multimodal Without Training

    Authors: Antonio Norelli, Marco Fumero, Valentino Maiorca, Luca Moschella, Emanuele Rodolà, Francesco Locatello

    Abstract: CLIP proved that aligning visual and language spaces is key to solving many vision tasks without explicit training, but required to train image and text encoders from scratch on a huge dataset. LiT improved this by only training the text encoder and using a pre-trained vision network. In this paper, we show that a common space can be created without any training at all, using single-domain encoder… ▽ More

    Submitted 10 November, 2023; v1 submitted 4 October, 2022; originally announced October 2022.

    Comments: 17 pages

  13. arXiv:2209.15430  [pdf, other

    cs.LG cs.AI

    Relative representations enable zero-shot latent space communication

    Authors: Luca Moschella, Valentino Maiorca, Marco Fumero, Antonio Norelli, Francesco Locatello, Emanuele Rodolà

    Abstract: Neural networks embed the geometric structure of a data manifold lying in a high-dimensional space into latent representations. Ideally, the distribution of the data points in the latent space should depend only on the task, the data, the loss, and other architecture-specific constraints. However, factors such as the random weights initialization, training hyperparameters, or other sources of rand… ▽ More

    Submitted 7 March, 2023; v1 submitted 30 September, 2022; originally announced September 2022.

    Comments: ICLR 2023 notable top 5%, 26 pages, 11 figures, 18 tables

    MSC Class: 68T07 ACM Class: I.2.6

  14. arXiv:2206.04615  [pdf, other

    cs.CL cs.AI cs.CY cs.LG stat.ML

    Beyond the Imitation Game: Quantifying and extrapolating the capabilities of language models

    Authors: Aarohi Srivastava, Abhinav Rastogi, Abhishek Rao, Abu Awal Md Shoeb, Abubakar Abid, Adam Fisch, Adam R. Brown, Adam Santoro, Aditya Gupta, Adrià Garriga-Alonso, Agnieszka Kluska, Aitor Lewkowycz, Akshat Agarwal, Alethea Power, Alex Ray, Alex Warstadt, Alexander W. Kocurek, Ali Safaya, Ali Tazarv, Alice Xiang, Alicia Parrish, Allen Nie, Aman Hussain, Amanda Askell, Amanda Dsouza , et al. (426 additional authors not shown)

    Abstract: Language models demonstrate both quantitative improvement and new qualitative capabilities with increasing scale. Despite their potentially transformative impact, these new capabilities are as yet poorly characterized. In order to inform future research, prepare for disruptive new model capabilities, and ameliorate socially harmful effects, it is vital that we understand the present and near-futur… ▽ More

    Submitted 12 June, 2023; v1 submitted 9 June, 2022; originally announced June 2022.

    Comments: 27 pages, 17 figures + references and appendices, repo: https://github.com/google/BIG-bench

    Journal ref: Transactions on Machine Learning Research, May/2022, https://openreview.net/forum?id=uyTL5Bvosj

  15. arXiv:2206.03695  [pdf, other

    cs.LG cs.AI

    Metric Based Few-Shot Graph Classification

    Authors: Donato Crisostomi, Simone Antonelli, Valentino Maiorca, Luca Moschella, Riccardo Marin, Emanuele Rodolà

    Abstract: Many modern deep-learning techniques do not work without enormous datasets. At the same time, several fields demand methods working in scarcity of data. This problem is even more complex when the samples have varying structures, as in the case of graphs. Graph representation learning techniques have recently proven successful in a variety of domains. Nevertheless, the employed architectures perfor… ▽ More

    Submitted 30 October, 2024; v1 submitted 8 June, 2022; originally announced June 2022.

    Comments: In Proceedings of the First Learning on Graphs Conference (LoG 2022)

  16. arXiv:2201.10222  [pdf, other

    cs.LG cs.AI cs.CL physics.hist-ph

    Explanatory Learning: Beyond Empiricism in Neural Networks

    Authors: Antonio Norelli, Giorgio Mariani, Luca Moschella, Andrea Santilli, Giambattista Parascandolo, Simone Melzi, Emanuele Rodolà

    Abstract: We introduce Explanatory Learning (EL), a framework to let machines use existing knowledge buried in symbolic sequences -- e.g. explanations written in hieroglyphic -- by autonomously learning to interpret them. In EL, the burden of interpreting symbols is not left to humans or rigid human-coded compilers, as done in Program Synthesis. Rather, EL calls for a learned interpreter, built upon a limit… ▽ More

    Submitted 25 January, 2022; originally announced January 2022.

    Comments: Main paper: 10 pages, References: 3 pages, Appendix: 7 pages

  17. arXiv:2106.13679  [pdf, other

    cs.CV cs.GR cs.LG

    Shape registration in the time of transformers

    Authors: Giovanni Trappolini, Luca Cosmo, Luca Moschella, Riccardo Marin, Simone Melzi, Emanuele Rodolà

    Abstract: In this paper, we propose a transformer-based procedure for the efficient registration of non-rigid 3D point clouds. The proposed approach is data-driven and adopts for the first time the transformer architecture in the registration task. Our method is general and applies to different settings. Given a fixed template with some desired properties (e.g. skinning weights or other animation cues), we… ▽ More

    Submitted 28 June, 2021; v1 submitted 25 June, 2021; originally announced June 2021.

  18. arXiv:2104.00514  [pdf, other

    cs.GR cs.CG cs.LG

    Learning Spectral Unions of Partial Deformable 3D Shapes

    Authors: Luca Moschella, Simone Melzi, Luca Cosmo, Filippo Maggioli, Or Litany, Maks Ovsjanikov, Leonidas Guibas, Emanuele Rodolà

    Abstract: Spectral geometric methods have brought revolutionary changes to the field of geometry processing. Of particular interest is the study of the Laplacian spectrum as a compact, isometry and permutation-invariant representation of a shape. Some recent works show how the intrinsic geometry of a full shape can be recovered from its spectrum, but there are approaches that consider the more challenging p… ▽ More

    Submitted 21 December, 2022; v1 submitted 31 March, 2021; originally announced April 2021.

    Comments: 18 pages, 20 figures