Skip to main content

Showing 1–15 of 15 results for author: Dionelis, N

.
  1. arXiv:2504.11171  [pdf, other

    cs.CV cs.AI

    TerraMind: Large-Scale Generative Multimodality for Earth Observation

    Authors: Johannes Jakubik, Felix Yang, Benedikt Blumenstiel, Erik Scheurer, Rocco Sedona, Stefano Maurogiovanni, Jente Bosmans, Nikolaos Dionelis, Valerio Marsocci, Niklas Kopp, Rahul Ramachandran, Paolo Fraccaro, Thomas Brunschwiler, Gabriele Cavallaro, Juan Bernabe-Moreno, Nicolas Longépé

    Abstract: We present TerraMind, the first any-to-any generative, multimodal foundation model for Earth observation (EO). Unlike other multimodal models, TerraMind is pretrained on dual-scale representations combining both token-level and pixel-level data across modalities. On a token level, TerraMind encodes high-level contextual information to learn cross-modal relationships, while on a pixel level, TerraM… ▽ More

    Submitted 15 April, 2025; originally announced April 2025.

  2. arXiv:2502.13818  [pdf, other

    cs.CV cs.LG

    Building Age Estimation: A New Multi-Modal Benchmark Dataset and Community Challenge

    Authors: Nikolaos Dionelis, Nicolas Longépé, Alessandra Feliciotti, Mattia Marconcini, Devis Peressutti, Nika Oman Kadunc, JaeWan Park, Hagai Raja Sinulingga, Steve Andreas Immanuel, Ba Tran, Caroline Arnold

    Abstract: Estimating the construction year of buildings is of great importance for sustainability. Sustainable buildings minimize energy consumption and are a key part of responsible and sustainable urban planning and development to effectively combat climate change. By using Artificial Intelligence (AI) and recently proposed powerful Transformer models, we are able to estimate the construction epoch of bui… ▽ More

    Submitted 12 May, 2025; v1 submitted 19 February, 2025; originally announced February 2025.

    Comments: 13 pages, 22 figures, Submitted

  3. arXiv:2502.13734  [pdf, other

    cs.CV cs.LG

    CARE: Confidence-Aware Regression Estimation of building density fine-tuning EO Foundation Models

    Authors: Nikolaos Dionelis, Jente Bosmans, Nicolas Longépé

    Abstract: Performing accurate confidence quantification and assessment in pixel-wise regression tasks, which are downstream applications of AI Foundation Models for Earth Observation (EO), is important for deep neural networks to predict their failures, improve their performance and enhance their capabilities in real-world applications, for their practical deployment. For pixel-wise regression tasks, specif… ▽ More

    Submitted 3 April, 2025; v1 submitted 19 February, 2025; originally announced February 2025.

    Comments: 7 pages, 4 figures, Submitted

  4. arXiv:2406.18295  [pdf, other

    cs.CV cs.LG

    Evaluating and Benchmarking Foundation Models for Earth Observation and Geospatial AI

    Authors: Nikolaos Dionelis, Casper Fibaek, Luke Camilleri, Andreas Luyts, Jente Bosmans, Bertrand Le Saux

    Abstract: When we are primarily interested in solving several problems jointly with a given prescribed high performance accuracy for each target application, then Foundation Models should for most cases be used rather than problem-specific models. We focus on the specific Computer Vision application of Foundation Models for Earth Observation (EO) and geospatial AI. These models can solve important problems… ▽ More

    Submitted 26 June, 2024; originally announced June 2024.

    Comments: 5 pages, 2 figures, Submitted

  5. arXiv:2406.18279  [pdf, other

    cs.CV cs.LG

    Improving EO Foundation Models with Confidence Assessment for enhanced Semantic segmentation

    Authors: Nikolaos Dionelis, Nicolas Longepe

    Abstract: Confidence assessments of semantic segmentation algorithms are important. Ideally, deep learning models should have the ability to predict in advance whether their output is likely to be incorrect. Assessing the confidence levels of model predictions in Earth Observation (EO) classification is essential, as it can enhance semantic segmentation performance and help prevent further exploitation of t… ▽ More

    Submitted 22 November, 2024; v1 submitted 26 June, 2024; originally announced June 2024.

    Comments: 5 pages, 7 figures, 4 tables, Accepted

  6. arXiv:2404.11302  [pdf, other

    cs.CV cs.LG

    A Semantic Segmentation-guided Approach for Ground-to-Aerial Image Matching

    Authors: Francesco Pro, Nikolaos Dionelis, Luca Maiano, Bertrand Le Saux, Irene Amerini

    Abstract: Nowadays the accurate geo-localization of ground-view images has an important role across domains as diverse as journalism, forensics analysis, transports, and Earth Observation. This work addresses the problem of matching a query ground-view image with the corresponding satellite image without GPS data. This is done by comparing the features from a ground-view image and a satellite one, innovativ… ▽ More

    Submitted 23 May, 2024; v1 submitted 17 April, 2024; originally announced April 2024.

    Comments: 6 pages, 2 figures, 2 tables, Submitted to IGARSS 2024

  7. arXiv:2404.11299  [pdf, other

    cs.CV cs.LG

    Learning from Unlabelled Data with Transformers: Domain Adaptation for Semantic Segmentation of High Resolution Aerial Images

    Authors: Nikolaos Dionelis, Francesco Pro, Luca Maiano, Irene Amerini, Bertrand Le Saux

    Abstract: Data from satellites or aerial vehicles are most of the times unlabelled. Annotating such data accurately is difficult, requires expertise, and is costly in terms of time. Even if Earth Observation (EO) data were correctly labelled, labels might change over time. Learning from unlabelled data within a semi-supervised learning framework for segmentation of aerial images is challenging. In this pape… ▽ More

    Submitted 17 April, 2024; originally announced April 2024.

    Comments: 6 pages, 7 figures, Submitted to IGARSS 2024

  8. arXiv:2401.04464  [pdf, other

    cs.CV cs.LG

    PhilEO Bench: Evaluating Geo-Spatial Foundation Models

    Authors: Casper Fibaek, Luke Camilleri, Andreas Luyts, Nikolaos Dionelis, Bertrand Le Saux

    Abstract: Massive amounts of unlabelled data are captured by Earth Observation (EO) satellites, with the Sentinel-2 constellation generating 1.6 TB of data daily. This makes Remote Sensing a data-rich domain well suited to Machine Learning (ML) solutions. However, a bottleneck in applying ML models to EO is the lack of annotated data as annotation is a labour-intensive and costly process. As a result, resea… ▽ More

    Submitted 15 January, 2024; v1 submitted 9 January, 2024; originally announced January 2024.

    Comments: 6 pages, 5 figures, Submitted to IGARSS 2024

  9. arXiv:2111.15487  [pdf, other

    cs.LG stat.ML

    FROB: Few-shot ROBust Model for Classification and Out-of-Distribution Detection

    Authors: Nikolaos Dionelis, Mehrdad Yaghoobi, Sotirios A. Tsaftaris

    Abstract: Nowadays, classification and Out-of-Distribution (OoD) detection in the few-shot setting remain challenging aims due to rarity and the limited samples in the few-shot setting, and because of adversarial attacks. Accomplishing these aims is important for critical systems in safety, security, and defence. In parallel, OoD detection is challenging since deep neural network classifiers set high confid… ▽ More

    Submitted 2 February, 2022; v1 submitted 30 November, 2021; originally announced November 2021.

    Comments: Paper, 22 pages, Figures, Tables

  10. arXiv:2110.15273  [pdf, other

    cs.LG stat.ML

    OMASGAN: Out-of-Distribution Minimum Anomaly Score GAN for Sample Generation on the Boundary

    Authors: Nikolaos Dionelis, Mehrdad Yaghoobi, Sotirios A. Tsaftaris

    Abstract: Generative models trained in an unsupervised manner may set high likelihood and low reconstruction loss to Out-of-Distribution (OoD) samples. This increases Type II errors and leads to missed anomalies, overall decreasing Anomaly Detection (AD) performance. In addition, AD models underperform due to the rarity of anomalies. To address these limitations, we propose the OoD Minimum Anomaly Score GAN… ▽ More

    Submitted 2 February, 2022; v1 submitted 28 October, 2021; originally announced October 2021.

    Comments: Research work paper, 9 pages, 10 figures, Appendix

  11. Tail of Distribution GAN (TailGAN): Generative-Adversarial-Network-Based Boundary Formation

    Authors: Nikolaos Dionelis, Mehrdad Yaghoobi, Sotirios A. Tsaftaris

    Abstract: Generative Adversarial Networks (GAN) are a powerful methodology and can be used for unsupervised anomaly detection, where current techniques have limitations such as the accurate detection of anomalies near the tail of a distribution. GANs generally do not guarantee the existence of a probability density and are susceptible to mode collapse, while few GANs use likelihood to reduce mode collapse.… ▽ More

    Submitted 2 February, 2022; v1 submitted 24 July, 2021; originally announced July 2021.

    Comments: 5 pages, 2020 Sensor Signal Processing for Defence Conference (SSPD)

    Journal ref: 2020 Sensor Signal Processing for Defence Conference (SSPD)

  12. Boundary of Distribution Support Generator (BDSG): Sample Generation on the Boundary

    Authors: Nikolaos Dionelis, Mehrdad Yaghoobi, Sotirios A. Tsaftaris

    Abstract: Generative models, such as Generative Adversarial Networks (GANs), have been used for unsupervised anomaly detection. While performance keeps improving, several limitations exist particularly attributed to difficulties at capturing multimodal supports and to the ability to approximate the underlying distribution closer to the tails, i.e. the boundary of the distribution's support. This paper propo… ▽ More

    Submitted 2 February, 2022; v1 submitted 21 July, 2021; originally announced July 2021.

    Comments: 5 pages, 2020 IEEE International Conference on Image Processing (ICIP)

    Journal ref: 2020 IEEE International Conference on Image Processing (ICIP)

  13. arXiv:1811.00078  [pdf, other

    cs.SD eess.AS

    On Single-Channel Speech Enhancement and On Non-Linear Modulation-Domain Kalman Filtering

    Authors: Nikolaos Dionelis

    Abstract: This report focuses on algorithms that perform single-channel speech enhancement. The author of this report uses modulation-domain Kalman filtering algorithms for speech enhancement, i.e. noise suppression and dereverberation, in [1], [2], [3], [4] and [5]. Modulation-domain Kalman filtering can be applied for both noise and late reverberation suppression and in [2], [1], [3] and [4], various mode… ▽ More

    Submitted 31 October, 2018; originally announced November 2018.

    Comments: 13 pages

  14. arXiv:1807.10236  [pdf, other

    cs.SD eess.AS

    Modulation-Domain Kalman Filtering for Monaural Blind Speech Denoising and Dereverberation

    Authors: Nikolaos Dionelis, Mike Brookes

    Abstract: We describe a monaural speech enhancement algorithm based on modulation-domain Kalman filtering to blindly track the time-frequency log-magnitude spectra of speech and reverberation. We propose an adaptive algorithm that performs blind joint denoising and dereverberation, while accounting for the inter-frame speech dynamics, by estimating the posterior distribution of the speech log-magnitude spec… ▽ More

    Submitted 26 July, 2018; originally announced July 2018.

    Comments: 13 pages, 13 figures, Submitted to IEEE Transactions on Audio, Speech and Language Processing

  15. arXiv:1708.02171  [pdf, other

    cs.SD

    Phase-Aware Single-Channel Speech Enhancement with Modulation-Domain Kalman Filtering

    Authors: Nikolaos Dionelis, Mike Brookes

    Abstract: We present a single-channel phase-sensitive speech enhancement algorithm that is based on modulation-domain Kalman filtering and on tracking the speech phase using circular statistics. With Kalman filtering, using that speech and noise are additive in the complex STFT domain, the algorithm tracks the speech log-spectrum, the noise log-spectrum and the speech phase. Joint amplitude and phase estima… ▽ More

    Submitted 7 August, 2017; originally announced August 2017.

    Comments: 13 pages, 17 figures, Submitted to IEEE/ACM Transactions on Audio, Speech and Language Processing