Showing 1–11 of 11 results for author: Marsocci, V

  1. arXiv:2504.11172  [pdf, other]

    cs.CV

    TerraMesh: A Planetary Mosaic of Multimodal Earth Observation Data

    Authors: Benedikt Blumenstiel, Paolo Fraccaro, Valerio Marsocci, Johannes Jakubik, Stefano Maurogiovanni, Mikolaj Czerkawski, Rocco Sedona, Gabriele Cavallaro, Thomas Brunschwiler, Juan Bernabe-Moreno, Nicolas Longépé

    Abstract: Large-scale foundation models in Earth Observation can learn versatile, label-efficient representations by leveraging massive amounts of unlabeled data. However, existing public datasets are often limited in scale, geographic coverage, or sensor variety. We introduce TerraMesh, a new globally diverse, multimodal dataset combining optical, synthetic aperture radar, elevation, and land-cover modalit…

    Submitted 15 April, 2025; originally announced April 2025.

  2. arXiv:2504.11171  [pdf, ps, other]

    cs.CV cs.AI

    TerraMind: Large-Scale Generative Multimodality for Earth Observation

    Authors: Johannes Jakubik, Felix Yang, Benedikt Blumenstiel, Erik Scheurer, Rocco Sedona, Stefano Maurogiovanni, Jente Bosmans, Nikolaos Dionelis, Valerio Marsocci, Niklas Kopp, Rahul Ramachandran, Paolo Fraccaro, Thomas Brunschwiler, Gabriele Cavallaro, Juan Bernabe-Moreno, Nicolas Longépé

    Abstract: We present TerraMind, the first any-to-any generative, multimodal foundation model for Earth observation (EO). Unlike other multimodal models, TerraMind is pretrained on dual-scale representations combining both token-level and pixel-level data across modalities. On a token level, TerraMind encodes high-level contextual information to learn cross-modal relationships, while on a pixel level, TerraM…

    Submitted 11 June, 2025; v1 submitted 15 April, 2025; originally announced April 2025.

  3. arXiv:2504.08548  [pdf, other]

    cs.GR cs.CV

    COP-GEN-Beta: Unified Generative Modelling of COPernicus Imagery Thumbnails

    Authors: Miguel Espinosa, Valerio Marsocci, Yuru Jia, Elliot J. Crowley, Mikolaj Czerkawski

    Abstract: In remote sensing, multi-modal data from various sensors capturing the same scene offers rich opportunities, but learning a unified representation across these modalities remains a significant challenge. Traditional methods have often been limited to single or dual-modality approaches. In this paper, we introduce COP-GEN-Beta, a generative diffusion model trained on optical, radar, and elevation d…

    Submitted 14 April, 2025; v1 submitted 11 April, 2025; originally announced April 2025.

    Comments: Accepted at CVPR 2025 Workshop MORSE

  4. arXiv:2503.09493  [pdf, other]

    cs.CV

    Parameter-Efficient Adaptation of Geospatial Foundation Models through Embedding Deflection

    Authors: Romain Thoreau, Valerio Marsocci, Dawa Derksen

    Abstract: As large-scale heterogeneous data sets become increasingly available, adapting foundation models at low cost has become a key issue. Seminal works in natural language processing, e.g. Low-Rank Adaptation (LoRA), leverage the low "intrinsic rank" of parameter updates during adaptation. In this paper, we argue that incorporating stronger inductive biases in both data and models can enhance the adapt…

    Submitted 12 March, 2025; originally announced March 2025.

  5. arXiv:2503.07890  [pdf, other]

    cs.CV

    Can Generative Geospatial Diffusion Models Excel as Discriminative Geospatial Foundation Models?

    Authors: Yuru Jia, Valerio Marsocci, Ziyang Gong, Xue Yang, Maarten Vergauwen, Andrea Nascetti

    Abstract: Self-supervised learning (SSL) has revolutionized representation learning in Remote Sensing (RS), advancing Geospatial Foundation Models (GFMs) to leverage vast unlabeled satellite imagery for diverse downstream tasks. Currently, GFMs primarily focus on discriminative objectives, such as contrastive learning or masked image modeling, owing to their proven success in learning transferable represent…

    Submitted 10 March, 2025; originally announced March 2025.

  6. arXiv:2412.04204  [pdf, other]

    cs.CV

    PANGAEA: A Global and Inclusive Benchmark for Geospatial Foundation Models

    Authors: Valerio Marsocci, Yuru Jia, Georges Le Bellier, David Kerekes, Liang Zeng, Sebastian Hafner, Sebastian Gerard, Eric Brune, Ritu Yadav, Ali Shibli, Heng Fang, Yifang Ban, Maarten Vergauwen, Nicolas Audebert, Andrea Nascetti

    Abstract: Geospatial Foundation Models (GFMs) have emerged as powerful tools for extracting representations from Earth observation data, but their evaluation remains inconsistent and narrow. Existing works often evaluate on suboptimal downstream datasets and tasks, that are often too easy or too narrow, limiting the usefulness of the evaluations to assess the real-world applicability of GFMs. Additionally,…

    Submitted 30 April, 2025; v1 submitted 5 December, 2024; originally announced December 2024.

  7. arXiv:2405.09922  [pdf, other]

    cs.CV

    Cross-sensor self-supervised training and alignment for remote sensing

    Authors: Valerio Marsocci, Nicolas Audebert

    Abstract: Large-scale "foundation models" have gained traction as a way to leverage the vast amounts of unlabeled remote sensing data collected every day. However, due to the multiplicity of Earth Observation satellites, these models should learn "sensor agnostic" representations, that generalize across sensor characteristics with minimal fine-tuning. This is complicated by data availability, as low-res…

    Submitted 24 June, 2025; v1 submitted 16 May, 2024; originally announced May 2024.

    Journal ref: IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing, 2025, 18, pp.12278-12289

  8. Conditional computation in neural networks: principles and research trends

    Authors: Simone Scardapane, Alessandro Baiocchi, Alessio Devoto, Valerio Marsocci, Pasquale Minervini, Jary Pomponi

    Abstract: This article summarizes principles and ideas from the emerging area of applying conditional computation methods to the design of neural networks. In particular, we focus on neural networks that can dynamically activate or de-activate parts of their computational graph conditionally on their input. Examples include the dynamic selection of, e.g., input tokens, layers (or sets of layers), a…

    Submitted 8 July, 2024; v1 submitted 12 March, 2024; originally announced March 2024.

    Journal ref: Intelligenza Artificiale, vol. Pre-press, pp. 1-16, 2024

  9. arXiv:2304.07750  [pdf, other]

    cs.CV

    GeoMultiTaskNet: remote sensing unsupervised domain adaptation using geographical coordinates

    Authors: Valerio Marsocci, Nicolas Gonthier, Anatol Garioud, Simone Scardapane, Clément Mallet

    Abstract: Land cover maps are a pivotal element in a wide range of Earth Observation (EO) applications. However, annotating large datasets to develop supervised systems for remote sensing (RS) semantic segmentation is costly and time-consuming. Unsupervised Domain Adaption (UDA) could tackle these issues by adapting a model trained on a source domain, where labels are available, to a target domain, without…

    Submitted 16 April, 2023; originally announced April 2023.

  10. arXiv:2205.15903  [pdf, other]

    eess.IV cs.CV

    Inferring 3D change detection from bitemporal optical images

    Authors: Valerio Marsocci, Virginia Coletta, Roberta Ravanelli, Simone Scardapane, Mattia Crespi

    Abstract: Change detection is one of the most active research areas in Remote Sensing (RS). Most of the recently developed change detection methods are based on deep learning (DL) algorithms. This kind of algorithms is generally focused on generating two-dimensional (2D) change maps, thus only identifying planimetric changes in land use/land cover (LULC) and not considering nor returning any information on…

    Submitted 16 January, 2023; v1 submitted 31 May, 2022; originally announced May 2022.

    Comments: https://doi.org/10.1016/j.isprsjprs.2022.12.009

    Journal ref: ISPRS Journal of Photogrammetry and Remote Sensing 196 (2023) 325-339

  11. arXiv:2205.11319  [pdf, other]

    cs.CV

    Continual Barlow Twins: continual self-supervised learning for remote sensing semantic segmentation

    Authors: Valerio Marsocci, Simone Scardapane

    Abstract: In the field of Earth Observation (EO), Continual Learning (CL) algorithms have been proposed to deal with large datasets by decomposing them into several subsets and processing them incrementally. The majority of these algorithms assume that data is (a) coming from a single source, and (b) fully labeled. Real-world EO datasets are instead characterized by a large heterogeneity (e.g., coming from…

    Submitted 9 January, 2023; v1 submitted 23 May, 2022; originally announced May 2022.