Showing 1–50 of 66 results for author: Borth, D

  1. arXiv:2504.18072  [pdf, other]

    cs.LG

    A Model Zoo on Phase Transitions in Neural Networks

    Authors: Konstantin Schürholt, Léo Meynent, Yefan Zhou, Haiquan Lu, Yaoqing Yang, Damian Borth

    Abstract: Using the weights of trained Neural Network (NN) models as data modality has recently gained traction as a research field - dubbed Weight Space Learning (WSL). Multiple recent works propose WSL methods to analyze models, evaluate methods, or synthesize weights. Weight space learning methods require populations of trained models as datasets for development and evaluation. However, existing collecti…

    Submitted 25 April, 2025; originally announced April 2025.

  2. arXiv:2504.17039  [pdf, other]

    cs.CV

    Dense Air Pollution Estimation from Sparse in-situ Measurements and Satellite Data

    Authors: Ruben Gonzalez Avilés, Linus Scheibenreif, Damian Borth

    Abstract: This paper addresses the critical environmental challenge of estimating ambient Nitrogen Dioxide (NO$_2$) concentrations, a key issue in public health and environmental policy. Existing methods for satellite-based air pollution estimation model the relationship between satellite and in-situ measurements at select point locations. While these approaches have advanced our ability to provide air qual…

    Submitted 23 April, 2025; originally announced April 2025.

  3. arXiv:2504.16851  [pdf, other]

    cs.CV

    Hyperspectral Vision Transformers for Greenhouse Gas Estimations from Space

    Authors: Ruben Gonzalez Avilés, Linus Scheibenreif, Nassim Ait Ali Braham, Benedikt Blumenstiel, Thomas Brunschwiler, Ranjini Guruprasad, Damian Borth, Conrad Albrecht, Paolo Fraccaro, Devyani Lambhate, Johannes Jakubik

    Abstract: Hyperspectral imaging provides detailed spectral information and holds significant potential for monitoring of greenhouse gases (GHGs). However, its application is constrained by limited spatial coverage and infrequent revisit times. In contrast, multispectral imaging offers broader spatial and temporal coverage but often lacks the spectral detail that can enhance GHG detection. To address these c…

    Submitted 23 April, 2025; originally announced April 2025.

  4. arXiv:2504.11154  [pdf, other]

    cs.CV eess.IV

    SAR-to-RGB Translation with Latent Diffusion for Earth Observation

    Authors: Kaan Aydin, Joelle Hanna, Damian Borth

    Abstract: Earth observation satellites like Sentinel-1 (S1) and Sentinel-2 (S2) provide complementary remote sensing (RS) data, but S2 images are often unavailable due to cloud cover or data gaps. To address this, we propose a diffusion model (DM)-based approach for SAR-to-RGB translation, generating synthetic optical images from SAR inputs. We explore three different setups: two using Standard Diffusion, w…

    Submitted 15 April, 2025; originally announced April 2025.

    Comments: 10 pages, 3 figures

  5. arXiv:2504.10231  [pdf, other]

    cs.LG

    A Model Zoo of Vision Transformers

    Authors: Damian Falk, Léo Meynent, Florence Pfammatter, Konstantin Schürholt, Damian Borth

    Abstract: The availability of large, structured populations of neural networks - called 'model zoos' - has led to the development of a multitude of downstream tasks ranging from model analysis, to representation learning on model weights or generative modeling of neural network parameters. However, existing model zoos are limited in size and architecture and neglect the transformer, which is among the curre…

    Submitted 14 April, 2025; originally announced April 2025.

    Comments: Accepted at the ICLR Workshop on Neural Network Weights as a New Data Modality 2025

  6. arXiv:2504.10141  [pdf, other]

    cs.LG

    The Impact of Model Zoo Size and Composition on Weight Space Learning

    Authors: Damian Falk, Konstantin Schürholt, Damian Borth

    Abstract: Re-using trained neural network models is a common strategy to reduce training cost and transfer knowledge. Weight space learning - using the weights of trained models as data modality - is a promising new field to re-use populations of pre-trained models for future tasks. Approaches in this field have demonstrated high performance both on model analysis and weight generation tasks. However, until…

    Submitted 14 April, 2025; originally announced April 2025.

    Comments: Accepted at the ICLR Workshop on Neural Network Weights as a New Data Modality 2025

  7. arXiv:2503.17138  [pdf, other]

    cs.LG

    Structure Is Not Enough: Leveraging Behavior for Neural Network Weight Reconstruction

    Authors: Léo Meynent, Ivan Melev, Konstantin Schürholt, Göran Kauermann, Damian Borth

    Abstract: The weights of neural networks (NNs) have recently gained prominence as a new data modality in machine learning, with applications ranging from accuracy and hyperparameter prediction to representation learning or weight generation. One approach to leverage NN weights involves training autoencoders (AEs), using contrastive and reconstruction losses. This allows such models to be applied to a wide v…

    Submitted 21 March, 2025; originally announced March 2025.

    Comments: Accepted at the ICLR Workshop on Neural Network Weights as a New Data Modality 2025

  8. arXiv:2412.16083  [pdf, other]

    cs.LG q-fin.ST

    Differentially Private Federated Learning of Diffusion Models for Synthetic Tabular Data Generation

    Authors: Timur Sattarov, Marco Schreyer, Damian Borth

    Abstract: The increasing demand for privacy-preserving data analytics in finance necessitates solutions for synthetic data generation that rigorously uphold privacy standards. We introduce DP-Fed-FinDiff framework, a novel integration of Differential Privacy, Federated Learning and Denoising Diffusion Probabilistic Models designed to generate high-fidelity synthetic tabular data. This framework ensures comp…

    Submitted 20 December, 2024; originally announced December 2024.

    Comments: 9 pages, 9 figures, preprint version, currently under review

  9. arXiv:2407.12440  [pdf, other]

    cs.LG

    GraphGuard: Contrastive Self-Supervised Learning for Credit-Card Fraud Detection in Multi-Relational Dynamic Graphs

    Authors: Kristófer Reynisson, Marco Schreyer, Damian Borth

    Abstract: Credit card fraud has significant implications at both an individual and societal level, making effective prevention essential. Current methods rely heavily on feature engineering and labeled information, both of which have significant limitations. In this work, we present GraphGuard, a novel contrastive self-supervised graph-based framework for detecting fraudulent credit card transactions. We co…

    Submitted 17 July, 2024; originally announced July 2024.

    Comments: 8 pages, 1 figure, 2 tables, preprint version, presented at AAAI 2024 Workshop on AI in Finance for Social Impact

  10. arXiv:2406.09997  [pdf, other]

    cs.LG

    Towards Scalable and Versatile Weight Space Learning

    Authors: Konstantin Schürholt, Michael W. Mahoney, Damian Borth

    Abstract: Learning representations of well-trained neural network models holds the promise to provide an understanding of the inner workings of those models. However, previous work has either faced limitations when processing larger networks or was task-specific to either discriminative or generative tasks. This paper introduces the SANE approach to weight-space learning. SANE overcomes previous limitations…

    Submitted 14 June, 2024; originally announced June 2024.

    Comments: Accepted at ICML 2024

  11. arXiv:2403.15356  [pdf, other]

    cs.CV

    Neural Plasticity-Inspired Multimodal Foundation Model for Earth Observation

    Authors: Zhitong Xiong, Yi Wang, Fahong Zhang, Adam J. Stewart, Joëlle Hanna, Damian Borth, Ioannis Papoutsis, Bertrand Le Saux, Gustau Camps-Valls, Xiao Xiang Zhu

    Abstract: The development of foundation models has revolutionized our ability to interpret the Earth's surface using satellite observational data. Traditional models have been siloed, tailored to specific sensors or data types like optical, radar, and hyperspectral, each with its own unique characteristics. This specialization hinders the potential for a holistic analysis that could benefit from the combine…

    Submitted 7 June, 2024; v1 submitted 22 March, 2024; originally announced March 2024.

    Comments: 36 pages, 7 figures

  12. arXiv:2401.15973  [pdf, other]

    cs.LG

    Sample Weight Estimation Using Meta-Updates for Online Continual Learning

    Authors: Hamed Hemati, Damian Borth

    Abstract: The loss function plays an important role in optimizing the performance of a learning system. A crucial aspect of the loss function is the assignment of sample weights within a mini-batch during loss computation. In the context of continual learning (CL), most existing strategies uniformly treat samples when calculating the loss value, thereby assigning equal weights to each sample. While this app…

    Submitted 29 January, 2024; originally announced January 2024.

  13. arXiv:2401.06263  [pdf, other]

    cs.LG

    FedTabDiff: Federated Learning of Diffusion Probabilistic Models for Synthetic Mixed-Type Tabular Data Generation

    Authors: Timur Sattarov, Marco Schreyer, Damian Borth

    Abstract: Realistic synthetic tabular data generation encounters significant challenges in preserving privacy, especially when dealing with sensitive information in domains like finance and healthcare. In this paper, we introduce Federated Tabular Diffusion (FedTabDiff) for generating high-fidelity mixed-type tabular data without centralized access to the original tabular datasets. Leveraging the s…

    Submitted 11 January, 2024; originally announced January 2024.

    Comments: 9 pages, 2 figures, 2 tables, preprint version, currently under review

  14. arXiv:2310.12766  [pdf, other]

    cs.CL cs.LG

    Transformer-based Entity Legal Form Classification

    Authors: Alexander Arimond, Mauro Molteni, Dominik Jany, Zornitsa Manolova, Damian Borth, Andreas G. F. Hoepner

    Abstract: We propose the application of Transformer-based language models for classifying entity legal forms from raw legal entity names. Specifically, we employ various BERT variants and compare their performance against multiple traditional baselines. Our evaluation encompasses a substantial subset of freely available Legal Entity Identifier (LEI) data, comprising over 1.1 million legal entities from 30 d…

    Submitted 19 October, 2023; originally announced October 2023.

  15. arXiv:2309.01472  [pdf, other]

    cs.LG q-fin.ST

    FinDiff: Diffusion Models for Financial Tabular Data Generation

    Authors: Timur Sattarov, Marco Schreyer, Damian Borth

    Abstract: The sharing of microdata, such as fund holdings and derivative instruments, by regulatory institutions presents a unique challenge due to strict data confidentiality and privacy regulations. These challenges often hinder the ability of both academics and practitioners to conduct collaborative research effectively. The emergence of generative models, particularly diffusion models, capable of synthe…

    Submitted 4 September, 2023; originally announced September 2023.

    Comments: 9 pages, 5 figures, 3 tables, preprint version, currently under review

  16. arXiv:2307.01741  [pdf, other]

    cs.CV

    Ben-ge: Extending BigEarthNet with Geographical and Environmental Data

    Authors: Michael Mommert, Nicolas Kesseli, Joëlle Hanna, Linus Scheibenreif, Damian Borth, Begüm Demir

    Abstract: Deep learning methods have proven to be a powerful tool in the analysis of large amounts of complex Earth observation data. However, while Earth observation data are multi-modal in most cases, only single or few modalities are typically considered. In this work, we present the ben-ge dataset, which supplements the BigEarthNet-MM dataset by compiling freely and globally available geographical and e…

    Submitted 4 July, 2023; originally announced July 2023.

    Comments: Accepted for presentation at the IEEE International Geoscience and Remote Sensing Symposium 2023

  17. arXiv:2306.10724  [pdf, other]

    cs.LG

    Partial Hypernetworks for Continual Learning

    Authors: Hamed Hemati, Vincenzo Lomonaco, Davide Bacciu, Damian Borth

    Abstract: Hypernetworks mitigate forgetting in continual learning (CL) by generating task-dependent weights and penalizing weight changes at a meta-model level. Unfortunately, generating all weights is not only computationally expensive for larger architectures, but also, it is not well understood whether generating all model weights is necessary. Inspired by latent replay methods in CL, we propose partial…

    Submitted 19 June, 2023; originally announced June 2023.

    Comments: Accepted to the 2nd Conference on Lifelong Learning Agents (CoLLAs), 2023

  18. arXiv:2306.05709  [pdf, other]

    eess.AS cs.CL cs.SD

    Learning Emotional Representations from Imbalanced Speech Data for Speech Emotion Recognition and Emotional Text-to-Speech

    Authors: Shijun Wang, Jón Guðnason, Damian Borth

    Abstract: Effective speech emotional representations play a key role in Speech Emotion Recognition (SER) and Emotional Text-To-Speech (TTS) tasks. However, emotional speech samples are more difficult and expensive to acquire compared with Neutral style speech, which causes one issue that most related works unfortunately neglect: imbalanced datasets. Models might overfit to the majority Neutral class and fai…

    Submitted 9 June, 2023; originally announced June 2023.

    Comments: Accepted by INTERSPEECH2023

  19. arXiv:2304.13718  [pdf, other]

    cs.LG

    Sparsified Model Zoo Twins: Investigating Populations of Sparsified Neural Network Models

    Authors: Dominik Honegger, Konstantin Schürholt, Damian Borth

    Abstract: With growing size of Neural Networks (NNs), model sparsification to reduce the computational cost and memory demand for model inference has become of vital interest for both research and production. While many sparsification methods have been proposed and successfully applied on individual models, to the best of our knowledge their behavior and robustness has not yet been studied on large populati…

    Submitted 26 April, 2023; originally announced April 2023.

    Comments: Accepted at ICLR 2023 Workshop on Sparsity in Neural Networks

  20. arXiv:2303.01508  [pdf, other]

    cs.SD cs.AI eess.AS

    Fine-grained Emotional Control of Text-To-Speech: Learning To Rank Inter- And Intra-Class Emotion Intensities

    Authors: Shijun Wang, Jón Guðnason, Damian Borth

    Abstract: State-of-the-art Text-To-Speech (TTS) models are capable of producing high-quality speech. The generated speech, however, is usually neutral in emotional expression, whereas very often one would want fine-grained emotional control of words or phonemes. Although still challenging, the first TTS models have been recently proposed that are able to control voice by manually assigning emotion intensity…

    Submitted 11 March, 2023; v1 submitted 2 March, 2023; originally announced March 2023.

    Comments: Accepted by ICASSP2023

  21. arXiv:2301.11396  [pdf, other]

    cs.LG

    Class-Incremental Learning with Repetition

    Authors: Hamed Hemati, Andrea Cossu, Antonio Carta, Julio Hurtado, Lorenzo Pellegrini, Davide Bacciu, Vincenzo Lomonaco, Damian Borth

    Abstract: Real-world data streams naturally include the repetition of previous concepts. From a Continual Learning (CL) perspective, repetition is a property of the environment and, unlike replay, cannot be controlled by the agent. Nowadays, the Class-Incremental (CI) scenario represents the leading test-bed for assessing and comparing CL strategies. This scenario type is very easy to use, but it never allo…

    Submitted 19 June, 2023; v1 submitted 26 January, 2023; originally announced January 2023.

    Comments: Accepted to the 2nd Conference on Lifelong Learning Agents (CoLLAs), 2023. 19 pages

  22. arXiv:2210.15051  [pdf, other]

    cs.LG

    Federated Continual Learning to Detect Accounting Anomalies in Financial Auditing

    Authors: Marco Schreyer, Hamed Hemati, Damian Borth, Miklos A. Vasarhelyi

    Abstract: The International Standards on Auditing require auditors to collect reasonable assurance that financial statements are free of material misstatement. At the same time, a central objective of Continuous Assurance is the real-time assessment of digital accounting journal entries. Recently, driven by the advances in artificial intelligence, Deep Learning techniques have emerged in financial auditing…

    Submitted 26 October, 2022; originally announced October 2022.

    Comments: 6 pages (excl. appendix), 5 figures, 1 table, preprint version, currently under review

  23. arXiv:2209.14764  [pdf, other]

    cs.LG

    Model Zoos: A Dataset of Diverse Populations of Neural Network Models

    Authors: Konstantin Schürholt, Diyar Taskiran, Boris Knyazev, Xavier Giró-i-Nieto, Damian Borth

    Abstract: In the last years, neural networks (NN) have evolved from laboratory environments to the state-of-the-art for many real-world problems. It was shown that NN models (i.e., their weights and biases) evolve on unique trajectories in weight space during training. Following, a population of such neural network models (referred to as model zoo) would form structures in weight space. We think that the ge…

    Submitted 29 September, 2022; originally announced September 2022.

    Comments: 36th Conference on Neural Information Processing Systems (NeurIPS 2022) Track on Datasets and Benchmarks

  24. arXiv:2209.14733  [pdf, other]

    cs.LG cs.CV

    Hyper-Representations as Generative Models: Sampling Unseen Neural Network Weights

    Authors: Konstantin Schürholt, Boris Knyazev, Xavier Giró-i-Nieto, Damian Borth

    Abstract: Learning representations of neural network weights given a model zoo is an emerging and challenging area with many potential applications from model inspection, to neural architecture search or knowledge distillation. Recently, an autoencoder trained on a model zoo was able to learn a hyper-representation, which captures intrinsic and extrinsic properties of the models in the zoo. In this work, we…

    Submitted 29 September, 2022; originally announced September 2022.

    Comments: 36th Conference on Neural Information Processing Systems (NeurIPS 2022). arXiv admin note: text overlap with arXiv:2207.10951

  25. arXiv:2209.09157  [pdf, other]

    cs.LG cs.CE q-fin.ST

    RESHAPE: Explaining Accounting Anomalies in Financial Statement Audits by enhancing SHapley Additive exPlanations

    Authors: Ricardo Müller, Marco Schreyer, Timur Sattarov, Damian Borth

    Abstract: Detecting accounting anomalies is a recurrent challenge in financial statement audits. Recently, novel methods derived from Deep-Learning (DL) have been proposed to audit the large volumes of a statement's underlying accounting records. However, due to their vast number of parameters, such models exhibit the drawback of being inherently opaque. At the same time, the concealing of a model's inner w…

    Submitted 19 September, 2022; originally announced September 2022.

    Comments: 9 pages, 4 figures, 5 tables, preprint version, currently under review

  26. arXiv:2208.12708  [pdf, other]

    cs.LG cs.CE cs.CR

    Federated and Privacy-Preserving Learning of Accounting Data in Financial Statement Audits

    Authors: Marco Schreyer, Timur Sattarov, Damian Borth

    Abstract: The ongoing 'digital transformation' fundamentally changes audit evidence's nature, recording, and volume. Nowadays, the International Standards on Auditing (ISA) requires auditors to examine vast volumes of a financial statement's underlying digital accounting records. As a result, audit firms also 'digitize' their analytical capabilities and invest in Deep Learning (DL), a successful sub-discipl…

    Submitted 26 August, 2022; originally announced August 2022.

    Comments: 8 pages, 5 figures, 3 tables, preprint version, currently under review

  27. arXiv:2208.04994  [pdf, other]

    cs.SD eess.AS

    Generative Data Augmentation Guided by Triplet Loss for Speech Emotion Recognition

    Authors: Shijun Wang, Hamed Hemati, Jón Guðnason, Damian Borth

    Abstract: Speech Emotion Recognition (SER) is crucial for human-computer interaction but still remains a challenging problem because of two major obstacles: data scarcity and imbalance. Many datasets for SER are substantially imbalanced, where data utterances of one class (most often Neutral) are much more frequent than those of other classes. Furthermore, only a few data resources are available for many ex…

    Submitted 9 August, 2022; originally announced August 2022.

    Comments: Published in INTERSPEECH 2022

  28. arXiv:2207.10951  [pdf, other]

    cs.LG

    Hyper-Representations for Pre-Training and Transfer Learning

    Authors: Konstantin Schürholt, Boris Knyazev, Xavier Giró-i-Nieto, Damian Borth

    Abstract: Learning representations of neural network weights given a model zoo is an emerging and challenging area with many potential applications from model inspection, to neural architecture search or knowledge distillation. Recently, an autoencoder trained on a model zoo was able to learn a hyper-representation, which captures intrinsic and extrinsic properties of the models in the zoo. In this work, we…

    Submitted 22 July, 2022; originally announced July 2022.

    Journal ref: First Workshop of Pre-training: Perspectives, Pitfalls, and Paths Forward at ICML 2022, Baltimore, Maryland, USA, PMLR 162, 2022

  29. arXiv:2112.13215  [pdf, other]

    cs.LG

    Continual Learning for Unsupervised Anomaly Detection in Continuous Auditing of Financial Accounting Data

    Authors: Hamed Hemati, Marco Schreyer, Damian Borth

    Abstract: International audit standards require the direct assessment of a financial statement's underlying accounting journal entries. Driven by advances in artificial intelligence, deep-learning inspired audit techniques emerged to examine vast quantities of journal entry data. However, in regular audits, most of the proposed methods are applied to learn from a comparably stationary journal entry populati…

    Submitted 31 March, 2022; v1 submitted 25 December, 2021; originally announced December 2021.

    Comments: AAAI 2022 Workshop on AI in Financial Services: Adaptiveness, Resilience & Governance

  30. arXiv:2112.03615  [pdf, other]

    cs.CV cs.AI

    Saliency Diversified Deep Ensemble for Robustness to Adversaries

    Authors: Alex Bogun, Dimche Kostadinov, Damian Borth

    Abstract: Deep learning models have shown incredible performance on numerous image recognition, classification, and reconstruction tasks. Although very appealing and valuable due to their predictive capabilities, one common threat remains challenging to resolve. A specifically trained attacker can introduce malicious input perturbations to fool the network, thus causing potentially harmful mispredictions. M…

    Submitted 7 December, 2021; originally announced December 2021.

    Comments: Accepted to AAAI Workshop on Adversarial Machine Learning and Beyond 2022

    ACM Class: I.2.0

  31. arXiv:2110.15288  [pdf, other]

    cs.LG

    Hyper-Representations: Self-Supervised Representation Learning on Neural Network Weights for Model Characteristic Prediction

    Authors: Konstantin Schürholt, Dimche Kostadinov, Damian Borth

    Abstract: Self-Supervised Learning (SSL) has been shown to learn useful and information-preserving representations. Neural Networks (NNs) are widely applied, yet their weight space is still not fully understood. Therefore, we propose to use SSL to learn hyper-representations of the weights of populations of NNs. To that end, we introduce domain specific data augmentations and an adapted attention architectu…

    Submitted 14 December, 2022; v1 submitted 28 October, 2021; originally announced October 2021.

    Comments: Published at 35th Conference on Neural Information Processing Systems (NeurIPS 2021), Sydney, Australia. 31 Pages, 14 figures

  32. arXiv:2110.14422  [pdf]

    cs.SD cs.AI eess.AS

    Zero-shot Voice Conversion via Self-supervised Prosody Representation Learning

    Authors: Shijun Wang, Dimche Kostadinov, Damian Borth

    Abstract: Voice Conversion (VC) for unseen speakers, also known as zero-shot VC, is an attractive research topic as it enables a range of applications like voice customizing, animation production, and others. Recent work in this area made progress with disentanglement methods that separate utterance content and speaker characteristics from speech audio recordings. However, many of these methods are subject…

    Submitted 31 May, 2022; v1 submitted 27 October, 2021; originally announced October 2021.

    Comments: Published in: 2022 International Joint Conference on Neural Networks (IJCNN)

  33. arXiv:2109.11201  [pdf, other]

    cs.LG cs.CE

    Multi-view Contrastive Self-Supervised Learning of Accounting Data Representations for Downstream Audit Tasks

    Authors: Marco Schreyer, Timur Sattarov, Damian Borth

    Abstract: International audit standards require the direct assessment of a financial statement's underlying accounting transactions, referred to as journal entries. Recently, driven by the advances in artificial intelligence, deep learning inspired audit techniques have emerged in the field of auditing vast quantities of journal entry data. Nowadays, the majority of such methods rely on a set of specialized…

    Submitted 23 September, 2021; originally announced September 2021.

    Comments: 8 pages (excl. appendix), 4 Figures, 3 Tables

  34. arXiv:2109.10085  [pdf, other]

    cs.AI

    Heterogeneous Ensemble for ESG Ratings Prediction

    Authors: Tim Krappel, Alex Bogun, Damian Borth

    Abstract: Over the past years, topics ranging from climate change to human rights have seen increasing importance for investment decisions. Hence, investors (asset managers and asset owners) who wanted to incorporate these issues started to assess companies based on how they handle such topics. For this assessment, investors rely on specialized rating agencies that issue ratings along the environmental, soc…

    Submitted 21 September, 2021; originally announced September 2021.

    Comments: Accepted to KDD Workshop on Machine Learning in Finance 2021

    ACM Class: J.4

  35. Learning Interpretable Concept Groups in CNNs

    Authors: Saurabh Varshneya, Antoine Ledent, Robert A. Vandermeulen, Yunwen Lei, Matthias Enders, Damian Borth, Marius Kloft

    Abstract: We propose a novel training methodology -- Concept Group Learning (CGL) -- that encourages training of interpretable CNN filters by partitioning filters in each layer into concept groups, each of which is trained to learn a single visual concept. We achieve this through a novel regularization strategy that forces filters in the same group to be active in similar image regions for a given layer. We…

    Submitted 21 September, 2021; originally announced September 2021.

  36. arXiv:2108.13902  [pdf, other]

    cs.LG cs.CV

    Estimation of Air Pollution with Remote Sensing Data: Revealing Greenhouse Gas Emissions from Space

    Authors: Linus Scheibenreif, Michael Mommert, Damian Borth

    Abstract: Air pollution is a major driver of climate change. Anthropogenic emissions from the burning of fossil fuels for transportation and power generation emit large amounts of problematic air pollutants, including Greenhouse Gases (GHGs). Despite the importance of limiting GHG emissions to mitigate climate change, detailed information about the spatial and temporal distribution of GHG and other air poll…

    Submitted 31 August, 2021; originally announced August 2021.

    Comments: for associated codebase, see https://www.github.com/HSG-AIML/RemoteSensingNO2Estimation

    ACM Class: I.4

  37. arXiv:2107.10894  [pdf, other]

    cs.CV eess.IV

    Power Plant Classification from Remote Imaging with Deep Learning

    Authors: Michael Mommert, Linus Scheibenreif, Joëlle Hanna, Damian Borth

    Abstract: Satellite remote imaging enables the detailed study of land use patterns on a global scale. We investigate the possibility to improve the information content of traditional land use classification by identifying the nature of industrial sites from medium-resolution remote sensing images. In this work, we focus on classifying different types of power plants from Sentinel-2 imaging data. Using a Res…

    Submitted 22 July, 2021; originally announced July 2021.

    Comments: Presented at the 2021 IEEE International Geoscience and Remote Sensing Symposium (IGARSS)

  38. arXiv:2104.06074  [pdf, other]

    cs.SD cs.LG eess.AS

    NoiseVC: Towards High Quality Zero-Shot Voice Conversion

    Authors: Shijun Wang, Damian Borth

    Abstract: Voice conversion (VC) is a task that transforms voice from target audio to source without losing linguistic contents, it is challenging especially when source and target speakers are unseen during training (zero-shot VC). Previous approaches require a pre-trained model or linguistic data to do the zero-shot conversion. Meanwhile, VC models with Vector Quantization (VQ) or Instance Normalization (I…

    Submitted 13 April, 2021; originally announced April 2021.

  39. arXiv:2103.14512  [pdf, other]

    cs.CL cs.LG cs.SD eess.AS

    Continual Speaker Adaptation for Text-to-Speech Synthesis

    Authors: Hamed Hemati, Damian Borth

    Abstract: Training a multi-speaker Text-to-Speech (TTS) model from scratch is computationally expensive and adding new speakers to the dataset requires the model to be re-trained. The naive solution of sequential fine-tuning of a model for new speakers can lead to poor performance of older speakers. This phenomenon is known as catastrophic forgetting. In this paper, we look at TTS modeling from a continual…

    Submitted 31 March, 2022; v1 submitted 26 March, 2021; originally announced March 2021.

    Comments: Preprint

  40. arXiv:2012.07110  [pdf, other]

    cs.LG cs.CR cs.CV

    Leaking Sensitive Financial Accounting Data in Plain Sight using Deep Autoencoder Neural Networks

    Authors: Marco Schreyer, Christian Schulze, Damian Borth

    Abstract: Nowadays, organizations collect vast quantities of sensitive information in 'Enterprise Resource Planning' (ERP) systems, such as accounting relevant transactions, customer master data, or strategic sales price information. The leakage of such information poses a severe threat for companies as the number of incidents and the reputational damage to those experiencing them continue to increase. At t…

    Submitted 13 December, 2020; originally announced December 2020.

    Comments: 8 pages (excl. appendix), 4 Figures, 2 Tables, AAAI-21 Workshop on Knowledge Discovery from Unstructured Data in Financial Services, this paper is the initial accepted version

  41. arXiv:2011.11344  [pdf, other]

    cs.CV cs.AI cs.LG

    Characterization of Industrial Smoke Plumes from Remote Sensing Data

    Authors: Michael Mommert, Mario Sigel, Marcel Neuhausler, Linus Scheibenreif, Damian Borth

    Abstract: The major driver of global warming has been identified as the anthropogenic release of greenhouse gas (GHG) emissions from industrial activities. The quantitative monitoring of these emissions is mandatory to fully understand their effect on the Earth's climate and to enforce emission regulations on a large scale. In this work, we investigate the possibility to detect and quantify industrial smoke…

    Submitted 23 November, 2020; originally announced November 2020.

    Comments: To be presented at the "Tackling Climate Change with Machine Learning" workshop at NeurIPS 2020

  42. arXiv:2011.06392  [pdf, other]

    cs.SD cs.LG eess.AS

    Using IPA-Based Tacotron for Data Efficient Cross-Lingual Speaker Adaptation and Pronunciation Enhancement

    Authors: Hamed Hemati, Damian Borth

    Abstract: Recent neural Text-to-Speech (TTS) models have been shown to perform very well when enough data is available. However, fine-tuning them for new speakers or languages is not straightforward in a low-resource setup. In this paper, we show that by applying minor modifications to a Tacotron model, one can transfer an existing TTS model for new speakers from the same or a different language using only…

    Submitted 31 March, 2022; v1 submitted 12 November, 2020; originally announced November 2020.

    Comments: Preprint

  43. arXiv:2008.10769  [pdf, other]

    cs.LG stat.ML

    Variable selection for Gaussian process regression through a sparse projection

    Authors: Chiwoo Park, David J. Borth, Nicholas S. Wilson, Chad N. Hunter

    Abstract: This paper presents a new variable selection approach integrated with Gaussian process (GP) regression. We consider a sparse projection of input variables and a general stationary covariance model that depends on the Euclidean distance between the projected features. The sparse projection matrix is considered as an unknown parameter. We propose a forward stagewise approach with embedded gradient d…

    Submitted 24 August, 2020; originally announced August 2020.

  44. arXiv:2008.07275  [pdf, ps, other]

    cs.CY cs.CV cs.LG stat.ML

    Facial Recognition: A cross-national Survey on Public Acceptance, Privacy, and Discrimination

    Authors: Léa Steinacker, Miriam Meckel, Genia Kostka, Damian Borth

    Abstract: With rapid advances in machine learning (ML), more of this technology is being deployed into the real world interacting with us and our environment. One of the most widely applied applications of ML is facial recognition as it is running on millions of devices. While being useful for some people, others perceive it as a threat when used by public authorities. This discrepancy and the lack of policy…

    Submitted 15 July, 2020; originally announced August 2020.

    Comments: ICML 2020 - Law and Machine Learning Workshop, Vienna, Austria

  45. arXiv:2008.02528  [pdf, other]

    cs.LG stat.ML

    Learning Sampling in Financial Statement Audits using Vector Quantised Autoencoder Neural Networks

    Authors: Marco Schreyer, Timur Sattarov, Anita Gierbl, Bernd Reimer, Damian Borth

    Abstract: The audit of financial statements is designed to collect reasonable assurance that an issued statement is free from material misstatement 'true and fair presentation'. International audit standards require the assessment of a statement's underlying accounting relevant transactions referred to as 'journal entries' to detect potential misstatements. To efficiently audit the increasing quantities of…

    Submitted 6 August, 2020; originally announced August 2020.

    Comments: 8 pages, 5 figures, 3 tables, to appear in Proceedings of the ACM's International Conference on AI in Finance (ICAIF'20), this paper is the initial accepted version

  46. arXiv:2006.10424  [pdf, other]

    cs.LG cs.AI stat.ML

    An Investigation of the Weight Space to Monitor the Training Progress of Neural Networks

    Authors: Konstantin Schürholt, Damian Borth

    Abstract: Safe use of Deep Neural Networks (DNNs) requires careful testing. However, deployed models are often trained further to improve in performance. As rigorous testing and evaluation is expensive, triggers are in need to determine the degree of change of a model. In this paper we investigate the weight space of DNN models for structure that can be exploited to that end. Our results show that DNN model…

    Submitted 17 March, 2021; v1 submitted 18 June, 2020; originally announced June 2020.

    Comments: 8 pages, 9 figures

  47. arXiv:2005.01686  [pdf]

    q-fin.RM cs.LG econ.EM

    Neural Networks and Value at Risk

    Authors: Alexander Arimond, Damian Borth, Andreas Hoepner, Michael Klawunn, Stefan Weisheit

    Abstract: Utilizing a generative regime switching framework, we perform Monte-Carlo simulations of asset returns for Value at Risk threshold estimation. Using equity markets and long term bonds as test assets in the global, US, Euro area and UK setting over an up to 1,250 weeks sample horizon ending in August 2018, we investigate neural networks along three design steps relating (i) to the initialization of…

    Submitted 6 May, 2020; v1 submitted 4 May, 2020; originally announced May 2020.

    Comments: 2019 Financial Data Science Association Paper, San Francisco

  48. arXiv:2001.04639  [pdf, other]

    cs.LG stat.ME stat.ML

    Robust Gaussian Process Regression with a Bias Model

    Authors: Chiwoo Park, David J. Borth, Nicholas S. Wilson, Chad N. Hunter, Fritz J. Friedersdorf

    Abstract: This paper presents a new approach to a robust Gaussian process (GP) regression. Most existing approaches replace an outlier-prone Gaussian likelihood with a non-Gaussian likelihood induced from a heavy tail distribution, such as the Laplace distribution and Student-t distribution. However, the use of a non-Gaussian likelihood would incur the need for a computationally expensive Bayesian approxima…

    Submitted 14 January, 2020; originally announced January 2020.

    MSC Class: 62G08

  49. arXiv:1910.03810  [pdf, other]

    cs.LG stat.ML

    Adversarial Learning of Deepfakes in Accounting

    Authors: Marco Schreyer, Timur Sattarov, Bernd Reimer, Damian Borth

    Abstract: Nowadays, organizations collect vast quantities of accounting relevant transactions, referred to as 'journal entries', in 'Enterprise Resource Planning' (ERP) systems. The aggregation of those entries ultimately defines an organization's financial statement. To detect potential misstatements and fraud, international audit standards demand auditors to directly assess journal entries using 'Computer…

    Submitted 9 October, 2019; originally announced October 2019.

    Comments: 17 pages, 10 figures, and 5 tables

  50. arXiv:1908.00734  [pdf, other]

    cs.LG q-fin.ST stat.ML

    Detection of Accounting Anomalies in the Latent Space using Adversarial Autoencoder Neural Networks

    Authors: Marco Schreyer, Timur Sattarov, Christian Schulze, Bernd Reimer, Damian Borth

    Abstract: The detection of fraud in accounting data is a long-standing challenge in financial statement audits. Nowadays, the majority of applied techniques refer to handcrafted rules derived from known fraud scenarios. While fairly successful, these rules exhibit the drawback that they often fail to generalize beyond known fraud scenarios and fraudsters gradually find ways to circumvent them. In contrast,…

    Submitted 2 August, 2019; originally announced August 2019.

    Comments: 11 pages, 9 figures, 2nd KDD Workshop on Anomaly Detection in Finance, August 05, 2019, Anchorage, Alaska