Showing 1–50 of 58 results for author: Rottmann, M

Searching in archive cs.
  1. arXiv:2506.23751  [pdf, ps, other]

    cs.CV

    Can We Challenge Open-Vocabulary Object Detectors with Generated Content in Street Scenes?

    Authors: Annika Mütze, Sadia Ilyas, Christian Dörpelkus, Matthias Rottmann

    Abstract: Open-vocabulary object detectors such as Grounding DINO are trained on vast and diverse data, achieving remarkable performance on challenging datasets. Due to that, it is unclear where to find their limitations, which is of major concern when using them in safety-critical applications. Real-world data does not provide sufficient control, required for a rigorous evaluation of model generalization. In co…

    Submitted 30 June, 2025; originally announced June 2025.

  2. arXiv:2506.05617  [pdf, ps, other]

    cs.LG cs.AI

    LFA applied to CNNs: Efficient Singular Value Decomposition of Convolutional Mappings by Local Fourier Analysis

    Authors: Antonia van Betteray, Matthias Rottmann, Karsten Kahl

    Abstract: The singular values of convolutional mappings encode interesting spectral properties, which can be used, e.g., to improve generalization and robustness of convolutional neural networks as well as to facilitate model compression. However, the computation of singular values is typically very resource-intensive. The naive approach involves unrolling the convolutional mapping along the input and chann…

    Submitted 5 June, 2025; originally announced June 2025.

  3. arXiv:2505.16594  [pdf, ps, other]

    cs.CV cs.LG

    Temporal Object Captioning for Street Scene Videos from LiDAR Tracks

    Authors: Vignesh Gopinathan, Urs Zimmermann, Michael Arnold, Matthias Rottmann

    Abstract: Video captioning models have seen notable advancements in recent years, especially with regard to their ability to capture temporal information. While many research efforts have focused on architectural advancements, such as temporal attention mechanisms, there remains a notable gap in understanding how models capture and utilize temporal semantics for effective temporal feature extraction, especi…

    Submitted 22 May, 2025; originally announced May 2025.

  4. arXiv:2505.12753  [pdf, ps, other]

    cs.CV

    LiDAR MOT-DETR: A LiDAR-based Two-Stage Transformer for 3D Multiple Object Tracking

    Authors: Martha Teiko Teye, Ori Maoz, Matthias Rottmann

    Abstract: Multi-object tracking from LiDAR point clouds presents unique challenges due to the sparse and irregular nature of the data, compounded by the need for temporal coherence across frames. Traditional tracking systems often rely on hand-crafted features and motion models, which can struggle to maintain consistent object identities in crowded or fast-moving scenes. We present a lidar-based two-staged…

    Submitted 22 May, 2025; v1 submitted 19 May, 2025; originally announced May 2025.

    Comments: Template change

  5. arXiv:2504.08602  [pdf, other]

    cs.CV cs.AI cs.LG

    On Background Bias of Post-Hoc Concept Embeddings in Computer Vision DNNs

    Authors: Gesina Schwalbe, Georgii Mikriukov, Edgar Heinert, Stavros Gerolymatos, Mert Keser, Alois Knoll, Matthias Rottmann, Annika Mütze

    Abstract: The thriving research field of concept-based explainable artificial intelligence (C-XAI) investigates how human-interpretable semantic concepts embed in the latent spaces of deep neural networks (DNNs). Post-hoc approaches therein use a set of examples to specify a concept, and determine its embeddings in DNN latent space using data driven techniques. This proved useful to uncover biases between d…

    Submitted 11 April, 2025; originally announced April 2025.

    Comments: camera-ready version for 3rd World Conference on eXplainable Artificial Intelligence; 5 figures, 6 tables; code available at: https://github.com/gesina/bg_randomized_loce

  6. arXiv:2503.12453  [pdf, other]

    cs.CV

    Shape Bias and Robustness Evaluation via Cue Decomposition for Image Classification and Segmentation

    Authors: Edgar Heinert, Thomas Gottwald, Annika Mütze, Matthias Rottmann

    Abstract: Previous works studied how deep neural networks (DNNs) perceive image content in terms of their biases towards different image cues, such as texture and shape. Previous methods to measure shape and texture biases are typically style-transfer-based and limited to DNNs for image classification. In this work, we provide a new evaluation procedure consisting of 1) a cue-decomposition method that compr…

    Submitted 16 March, 2025; originally announced March 2025.

  7. Poly-MgNet: Polynomial Building Blocks in Multigrid-Inspired ResNets

    Authors: Antonia van Betteray, Matthias Rottmann, Karsten Kahl

    Abstract: The structural analogies of ResNets and Multigrid (MG) methods such as common building blocks like convolutions and poolings were already pointed out by He et al. in 2016. Multigrid methods are used in the context of scientific computing for solving large sparse linear systems arising from partial differential equations. MG methods particularly rely on two main concepts: smoothing and residual r…

    Submitted 13 March, 2025; originally announced March 2025.

    Journal ref: Proceedings of ICPRAM 2025, ISBN: 978-989-758-730-6

  8. arXiv:2503.10210  [pdf, other]

    cs.CV

    TARS: Traffic-Aware Radar Scene Flow Estimation

    Authors: Jialong Wu, Marco Braun, Dominic Spata, Matthias Rottmann

    Abstract: Scene flow provides crucial motion information for autonomous driving. Recent LiDAR scene flow models utilize the rigid-motion assumption at the instance level, assuming objects are rigid bodies. However, these instance-level methods are not suitable for sparse radar point clouds. In this work, we present a novel Traffic-Aware Radar Scene flow estimation…

    Submitted 13 March, 2025; originally announced March 2025.

  9. arXiv:2502.11864  [pdf, other]

    cs.CV cs.RO

    Does Knowledge About Perceptual Uncertainty Help an Agent in Automated Driving?

    Authors: Natalie Grabowsky, Annika Mütze, Joshua Wendland, Nils Jansen, Matthias Rottmann

    Abstract: Agents in real-world scenarios like automated driving deal with uncertainty in their environment, in particular due to perceptual uncertainty. Although reinforcement learning is dedicated to autonomous decision-making under uncertainty, these algorithms are typically not informed about the uncertainty currently contained in their environment. On the other hand, uncertainty estimation for perceptio…

    Submitted 17 February, 2025; originally announced February 2025.

    Comments: 8 pages, 9 figures

  10. arXiv:2501.08083  [pdf, other]

    cs.CV

    Benchmarking Vision Foundation Models for Input Monitoring in Autonomous Driving

    Authors: Mert Keser, Halil Ibrahim Orhan, Niki Amini-Naieni, Gesina Schwalbe, Alois Knoll, Matthias Rottmann

    Abstract: Deep neural networks (DNNs) remain challenged by distribution shifts in complex open-world domains like automated driving (AD): Robustness against yet unknown novel objects (semantic shift) or styles like lighting conditions (covariate shift) cannot be guaranteed. Hence, reliable operation-time monitors for identification of out-of-training-data-distribution (OOD) scenarios are imperative. Current…

    Submitted 4 April, 2025; v1 submitted 14 January, 2025; originally announced January 2025.

  11. arXiv:2410.14878  [pdf, other]

    cs.CV

    On the Influence of Shape, Texture and Color for Learning Semantic Segmentation

    Authors: Annika Mütze, Natalie Grabowsky, Edgar Heinert, Matthias Rottmann, Hanno Gottschalk

    Abstract: In recent years, a body of works has emerged, studying shape and texture biases of off-the-shelf pre-trained deep neural networks (DNN) for image classification. These works study how much a trained DNN relies on image cues, predominantly shape and texture. In this work, we switch the perspective, posing the following questions: What can a DNN learn from each of the image cues, i.e., shape, textur…

    Submitted 18 October, 2024; originally announced October 2024.

  12. arXiv:2409.17109  [pdf, other]

    cs.CV cs.AI

    Unveiling Ontological Commitment in Multi-Modal Foundation Models

    Authors: Mert Keser, Gesina Schwalbe, Niki Amini-Naieni, Matthias Rottmann, Alois Knoll

    Abstract: Ontological commitment, i.e., used concepts, relations, and assumptions, is a cornerstone of qualitative reasoning (QR) models. The state-of-the-art for processing raw inputs, though, are deep neural networks (DNNs), nowadays often based on multimodal foundation models. These automatically learn rich representations of concepts and respective reasoning. Unfortunately, the learned qualitati…

    Submitted 25 September, 2024; originally announced September 2024.

    Comments: Qualitative Reasoning Workshop 2024 (QR2024) colocated with ECAI2024, camera-ready submission; first two authors contributed equally; 10 pages, 4 figures, 3 tables

  13. arXiv:2409.11373  [pdf, other]

    cs.CV

    Uncertainty and Prediction Quality Estimation for Semantic Segmentation via Graph Neural Networks

    Authors: Edgar Heinert, Stephan Tilgner, Timo Palm, Matthias Rottmann

    Abstract: When employing deep neural networks (DNNs) for semantic segmentation in safety-critical applications like automotive perception or medical imaging, it is important to estimate their performance at runtime, e.g. via uncertainty estimates or prediction quality estimates. Previous works mostly performed uncertainty estimation on pixel-level. In a line of research, a connected-component-wise (segment-…

    Submitted 17 September, 2024; originally announced September 2024.

    Comments: 11 pages, 3 figures, submitted to BMVC "Workshop on Robust Recognition in the Open World" (https://rrow2024.github.io/call-for-papers)

    MSC Class: 68T07

  14. arXiv:2408.11221  [pdf, other]

    cs.CV

    On the Potential of Open-Vocabulary Models for Object Detection in Unusual Street Scenes

    Authors: Sadia Ilyas, Ido Freeman, Matthias Rottmann

    Abstract: Out-of-distribution (OOD) object detection is a critical task focused on detecting objects that originate from a data distribution different from that of the training data. In this study, we investigate to what extent state-of-the-art open-vocabulary object detectors can detect unusual objects in street scenes, which are considered as OOD or rare scenarios with respect to common street scene datas…

    Submitted 20 August, 2024; originally announced August 2024.

  15. arXiv:2406.11835  [pdf, other]

    cs.CV

    OoDIS: Anomaly Instance Segmentation and Detection Benchmark

    Authors: Alexey Nekrasov, Rui Zhou, Miriam Ackermann, Alexander Hermans, Bastian Leibe, Matthias Rottmann

    Abstract: Safe navigation of self-driving cars and robots requires a precise understanding of their environment. Training data for perception systems cannot cover the wide variety of objects that may appear during deployment. Thus, reliable identification of unknown objects, such as wild animals and untypical obstacles, is critical due to their potential to cause serious accidents. Significant progress in s…

    Submitted 10 April, 2025; v1 submitted 17 June, 2024; originally announced June 2024.

    Comments: Accepted for publication at ICRA 2025. Project page: https://vision.rwth-aachen.de/oodis

  16. arXiv:2406.10600  [pdf, other]

    cs.CV

    SparseRadNet: Sparse Perception Neural Network on Subsampled Radar Data

    Authors: Jialong Wu, Mirko Meuter, Markus Schoeler, Matthias Rottmann

    Abstract: Radar-based perception has gained increasing attention in autonomous driving, yet the inherent sparsity of radars poses challenges. Radar raw data often contains excessive noise, whereas radar point clouds retain only limited information. In this work, we holistically treat the sparse nature of radar data by introducing an adaptive subsampling method together with a tailored network architecture t…

    Submitted 16 July, 2024; v1 submitted 15 June, 2024; originally announced June 2024.

    Comments: 18 pages, 4 figures, 5 tables, with supplement

  17. arXiv:2402.09530  [pdf, other]

    cs.CV

    Reducing Texture Bias of Deep Neural Networks via Edge Enhancing Diffusion

    Authors: Edgar Heinert, Matthias Rottmann, Kira Maag, Karsten Kahl

    Abstract: Convolutional neural networks (CNNs) for image processing tend to focus on localized texture patterns, commonly referred to as texture bias. While most of the previous works in the literature focus on the task of image classification, we go beyond this and study the texture bias of CNNs in semantic segmentation. In this work, we propose to train CNNs on pre-processed images with less texture to re…

    Submitted 22 July, 2024; v1 submitted 14 February, 2024; originally announced February 2024.

  18. arXiv:2310.03393  [pdf, ps, other]

    math.NA cs.LG

    Uncertainty quantification for deep learning-based schemes for solving high-dimensional backward stochastic differential equations

    Authors: Lorenc Kapllani, Long Teng, Matthias Rottmann

    Abstract: Deep learning-based numerical schemes for solving high-dimensional backward stochastic differential equations (BSDEs) have recently raised plenty of scientific interest. While they enable numerical methods to approximate very high-dimensional BSDEs, their reliability has not been studied and is thus not understood. In this work, we study uncertainty quantification (UQ) for a class of deep learning…

    Submitted 5 October, 2023; originally announced October 2023.

    Comments: 41 pages, 23 figures and 15 tables

    MSC Class: 68T37; 65C30; 60H35

  19. arXiv:2310.00372  [pdf, other]

    cs.CV cs.LG

    Deep Active Learning with Noisy Oracle in Object Detection

    Authors: Marius Schubert, Tobias Riedlinger, Karsten Kahl, Matthias Rottmann

    Abstract: Obtaining annotations for complex computer vision tasks such as object detection is an expensive and time-intense endeavor involving a large number of human workers or expert opinions. Reducing the amount of annotations required while maintaining algorithm performance is, therefore, desirable for machine learning practitioners and has been successfully achieved by active learning algorithms. Howev…

    Submitted 30 September, 2023; originally announced October 2023.

  20. arXiv:2308.08504  [pdf, other]

    cs.LG cs.CV

    ResBuilder: Automated Learning of Depth with Residual Structures

    Authors: Julian Burghoff, Matthias Rottmann, Jill von Conta, Sebastian Schoenen, Andreas Witte, Hanno Gottschalk

    Abstract: In this work, we develop a neural architecture search algorithm, termed ResBuilder, that develops ResNet architectures from scratch that achieve high accuracy at moderate computational cost. It can also be used to modify existing architectures and has the capability to remove and insert ResNet blocks, in this way searching for suitable architectures in the space of ResNet architectures. In our exp…

    Submitted 16 August, 2023; originally announced August 2023.

  21. arXiv:2306.07835  [pdf, other]

    cs.CV

    LMD: Light-weight Prediction Quality Estimation for Object Detection in Lidar Point Clouds

    Authors: Tobias Riedlinger, Marius Schubert, Sarina Penquitt, Jan-Marcel Kezmann, Pascal Colling, Karsten Kahl, Lutz Roese-Koerner, Michael Arnold, Urs Zimmermann, Matthias Rottmann

    Abstract: Object detection on Lidar point cloud data is a promising technology for autonomous driving and robotics which has seen a significant rise in performance and accuracy during recent years. Particularly uncertainty estimation is a crucial component for down-stream tasks and deep neural networks remain error-prone even for predictions with high confidence. Previously proposed methods for quantifying…

    Submitted 15 June, 2023; v1 submitted 13 June, 2023; originally announced June 2023.

    Comments: 19 pages, 11 figures, 11 tables

  22. arXiv:2303.06999  [pdf, other]

    cs.CV cs.LG

    Identifying Label Errors in Object Detection Datasets by Loss Inspection

    Authors: Marius Schubert, Tobias Riedlinger, Karsten Kahl, Daniel Kröll, Sebastian Schoenen, Siniša Šegvić, Matthias Rottmann

    Abstract: Labeling datasets for supervised object detection is a dull and time-consuming task. Errors can be easily introduced during annotation and overlooked during review, yielding inaccurate benchmarks and performance degradation of deep neural networks trained on noisy labels. In this work, we for the first time introduce a benchmark for label error detection methods on object detection datasets as wel…

    Submitted 19 December, 2023; v1 submitted 13 March, 2023; originally announced March 2023.

  23. arXiv:2212.14397  [pdf, other]

    cs.CV

    AttEntropy: On the Generalization Ability of Supervised Semantic Segmentation Transformers to New Objects in New Domains

    Authors: Krzysztof Lis, Matthias Rottmann, Annika Mütze, Sina Honari, Pascal Fua, Mathieu Salzmann

    Abstract: In addition to impressive performance, vision transformers have demonstrated remarkable abilities to encode information they were not trained to extract. For example, this information can be used to perform segmentation or single-view depth estimation even though the networks were only trained for image recognition. We show that a similar phenomenon occurs when explicitly training transformers for…

    Submitted 29 December, 2024; v1 submitted 29 December, 2022; originally announced December 2022.

    ACM Class: I.4.6; I.4.8; I.5.4

    Journal ref: 35th British Machine Vision Conference 2024, BMVC 2024, Glasgow, UK, November 25-28, 2024

  24. arXiv:2212.10836  [pdf, other]

    cs.CV cs.LG

    Towards Rapid Prototyping and Comparability in Active Learning for Deep Object Detection

    Authors: Tobias Riedlinger, Marius Schubert, Karsten Kahl, Hanno Gottschalk, Matthias Rottmann

    Abstract: Active learning as a paradigm in deep learning is especially important in applications involving intricate perception tasks such as object detection where labels are difficult and expensive to acquire. Development of active learning methods in such fields is highly computationally expensive and time consuming which obstructs the progression of research and leads to a lack of comparability between…

    Submitted 21 December, 2022; originally announced December 2022.

    Comments: 17 pages, 12 figures, 9 tables

  25. arXiv:2211.05525  [pdf, other]

    cs.CV cs.AI cs.LG

    MGiaD: Multigrid in all dimensions. Efficiency and robustness by coarsening in resolution and channel dimensions

    Authors: Antonia van Betteray, Matthias Rottmann, Karsten Kahl

    Abstract: Current state-of-the-art deep neural networks for image classification are made up of 10 - 100 million learnable weights and are therefore inherently prone to overfitting. The complexity of the weight count can be seen as a function of the number of channels, the spatial extent of the input and the number of layers of the network. Due to the use of convolutional layers the scaling of weight comple…

    Submitted 10 November, 2022; originally announced November 2022.

  26. arXiv:2208.08815  [pdf, other]

    cs.CV

    Semi-supervised domain adaptation with CycleGAN guided by a downstream task loss

    Authors: Annika Mütze, Matthias Rottmann, Hanno Gottschalk

    Abstract: Domain adaptation is of huge interest as labeling is an expensive and error-prone task, especially when labels are needed on pixel-level like in semantic segmentation. Therefore, one would like to be able to train neural networks on synthetic domains, where data is abundant and labels are precise. However, these models often perform poorly on out-of-domain images. To mitigate the shift in the inpu…

    Submitted 18 August, 2022; originally announced August 2022.

    Comments: 11 pages, 11 figures

  27. arXiv:2207.06104  [pdf, other]

    cs.CV cs.LG

    Automated Detection of Label Errors in Semantic Segmentation Datasets via Deep Learning and Uncertainty Quantification

    Authors: Matthias Rottmann, Marco Reese

    Abstract: In this work, we for the first time present a method for detecting label errors in image datasets with semantic segmentation, i.e., pixel-wise class labels. Annotation acquisition for semantic segmentation datasets is time-consuming and requires plenty of human labor. In particular, review processes are time consuming and label errors can easily be overlooked by humans. The consequences are biased…

    Submitted 23 August, 2024; v1 submitted 13 July, 2022; originally announced July 2022.

    MSC Class: 68T45; 62-07

    ACM Class: I.2; I.4; I.5

  28. arXiv:2207.03513  [pdf, other]

    cs.CV

    False Negative Reduction in Semantic Segmentation under Domain Shift using Depth Estimation

    Authors: Kira Maag, Matthias Rottmann

    Abstract: State-of-the-art deep neural networks demonstrate outstanding performance in semantic segmentation. However, their performance is tied to the domain represented by the training data. Open world scenarios cause inaccurate predictions which is hazardous in safety relevant applications like automated driving. In this work, we enhance semantic segmentation predictions using monocular depth estimation…

    Submitted 7 December, 2022; v1 submitted 7 July, 2022; originally announced July 2022.

  29. arXiv:2206.04776  [pdf, other]

    cs.LG cs.AI cs.CV cs.HC

    What should AI see? Using the Public's Opinion to Determine the Perception of an AI

    Authors: Robin Chan, Radin Dardashti, Meike Osinski, Matthias Rottmann, Dominik Brüggemann, Cilia Rücker, Peter Schlicht, Fabian Hüger, Nikol Rummel, Hanno Gottschalk

    Abstract: Deep neural networks (DNN) have made impressive progress in the interpretation of image data, so that it is conceivable and to some degree realistic to use them in safety critical applications like automated driving. From an ethical standpoint, the AI algorithm should take into account the vulnerability of objects or subjects on the street that ranges from "not at all", e.g. the road itself, to "h…

    Submitted 9 June, 2022; originally announced June 2022.

    Comments: 26 pages, 12 figures

    Journal ref: AI and Ethics (2023)

  30. arXiv:2205.14917  [pdf, other]

    cs.CV cs.LG

    Uncertainty Quantification and Resource-Demanding Computer Vision Applications of Deep Learning

    Authors: Julian Burghoff, Robin Chan, Hanno Gottschalk, Annika Muetze, Tobias Riedlinger, Matthias Rottmann, Marius Schubert

    Abstract: Bringing deep neural networks (DNNs) into safety critical applications such as automated driving, medical imaging and finance, requires a thorough treatment of the model's uncertainties. Training deep neural networks is already resource demanding and so is also their uncertainty quantification. In this overview article, we survey methods that we developed to teach DNNs to be uncertain when they en…

    Submitted 30 May, 2022; originally announced May 2022.

    MSC Class: 68T45; 62-07

  31. arXiv:2205.09104  [pdf, other]

    math.NA cs.DC hep-lat

    Coarsest-level improvements in multigrid for lattice QCD on large-scale computers

    Authors: Jesus Espinoza-Valverde, Andreas Frommer, Gustavo Ramirez-Hidalgo, Matthias Rottmann

    Abstract: Numerical simulations of quantum chromodynamics (QCD) on a lattice require the frequent solution of linear systems of equations with large, sparse and typically ill-conditioned matrices. Algebraic multigrid methods are meanwhile the standard for these difficult solves. Although the linear systems at the coarsest level of the multigrid hierarchy are much smaller than the ones at the finest level, t…

    Submitted 27 March, 2023; v1 submitted 18 May, 2022; originally announced May 2022.

    MSC Class: 65F10 (Primary); 65Y05 (Secondary)

  32. arXiv:2202.08700  [pdf, other]

    cs.CV cs.LG

    Detecting and Learning the Unknown in Semantic Segmentation

    Authors: Robin Chan, Svenja Uhlemeyer, Matthias Rottmann, Hanno Gottschalk

    Abstract: Semantic segmentation is a crucial component for perception in automated driving. Deep neural networks (DNNs) are commonly used for this task and they are usually trained on a closed set of object classes appearing in a closed operational domain. However, this is in contrast to the open world assumption in automated driving that DNNs are deployed to. Therefore, DNNs necessarily face data that they…

    Submitted 17 February, 2022; originally announced February 2022.

    Comments: 37 pages, 7 figures, chapter in Deep Neural Networks and Data for Automated Driving

  33. arXiv:2201.13279  [pdf, other]

    cs.CV

    UQGAN: A Unified Model for Uncertainty Quantification of Deep Classifiers trained via Conditional GANs

    Authors: Philipp Oberdiek, Gernot A. Fink, Matthias Rottmann

    Abstract: We present an approach to quantifying both aleatoric and epistemic uncertainty for deep neural networks in image classification, based on generative adversarial networks (GANs). While most works in the literature that use GANs to generate out-of-distribution (OoD) examples only focus on the evaluation of OoD detection, we present a GAN based approach to learn a classifier that produces proper unce…

    Submitted 9 January, 2023; v1 submitted 31 January, 2022; originally announced January 2022.

  34. arXiv:2201.01073  [pdf, other]

    cs.CV

    Towards Unsupervised Open World Semantic Segmentation

    Authors: Svenja Uhlemeyer, Matthias Rottmann, Hanno Gottschalk

    Abstract: For the semantic segmentation of images, state-of-the-art deep neural networks (DNNs) achieve high segmentation accuracy if that task is restricted to a closed set of classes. However, as of now DNNs have limited ability to operate in an open world, where they are tasked to identify pixels belonging to unknown objects and eventually to learn novel classes, incrementally. Humans have the capability…

    Submitted 12 September, 2022; v1 submitted 4 January, 2022; originally announced January 2022.

    Comments: UAI 2022, published in PMLR, Proceedings of the Thirty-Eighth Conference on Uncertainty in Artificial Intelligence

  35. arXiv:2112.04758  [pdf, other]

    cs.LG cs.CV stat.ML

    Does Redundancy in AI Perception Systems Help to Test for Super-Human Automated Driving Performance?

    Authors: Hanno Gottschalk, Matthias Rottmann, Maida Saltagic

    Abstract: While automated driving is often advertised with better-than-human driving performance, this work reviews that it is nearly impossible to provide direct statistical evidence on the system level that this is actually the case. The amount of labeled data needed would exceed dimensions of present day technical and economical capabilities. A commonly used strategy therefore is the use of redundancy al…

    Submitted 9 December, 2021; originally announced December 2021.

    MSC Class: 68T07; 68T45

  36. arXiv:2110.15681  [pdf, other]

    cs.CV cs.AI

    False Positive Detection and Prediction Quality Estimation for LiDAR Point Cloud Segmentation

    Authors: Pascal Colling, Matthias Rottmann, Lutz Roese-Koerner, Hanno Gottschalk

    Abstract: We present a novel post-processing tool for semantic segmentation of LiDAR point cloud data, called LidarMetaSeg, which estimates the prediction quality segmentwise. For this purpose we compute dispersion measures based on network probability outputs as well as feature measures based on point cloud input features and aggregate them on segment level. These aggregated measures are used to train a me…

    Submitted 29 October, 2021; originally announced October 2021.

  37. arXiv:2109.09410  [pdf, other]

    cs.CV cs.LG

    Background-Foreground Segmentation for Interior Sensing in Automotive Industry

    Authors: Claudia Drygala, Matthias Rottmann, Hanno Gottschalk, Klaus Friedrichs, Thomas Kurbiel

    Abstract: To ensure safety in automated driving, the correct perception of the situation inside the car is as important as its environment. Thus, seat occupancy detection and classification of detected instances play an important role in interior sensing. By the knowledge of the seat occupancy status, it is possible to, e.g., automate the airbag deployment control. Furthermore, the presence of a driver, whi…

    Submitted 20 September, 2021; originally announced September 2021.

  38. arXiv:2107.04517  [pdf, other]

    cs.CV

    Gradient-Based Quantification of Epistemic Uncertainty for Deep Object Detectors

    Authors: Tobias Riedlinger, Matthias Rottmann, Marius Schubert, Hanno Gottschalk

    Abstract: The vast majority of uncertainty quantification methods for deep object detectors such as variational inference are based on the network output. Here, we study gradient-based epistemic uncertainty metrics for deep object detectors to obtain reliable confidence estimates. We show that they contain predictive information and that they capture information orthogonal to that of common, output-based un…

    Submitted 17 March, 2022; v1 submitted 9 July, 2021; originally announced July 2021.

    Comments: 26 pages, 11 figures, 14 tables

  39. arXiv:2106.05549  [pdf, other]

    cs.CV cs.LG

    Validation of Simulation-Based Testing: Bypassing Domain Shift with Label-to-Image Synthesis

    Authors: Julia Rosenzweig, Eduardo Brito, Hans-Ulrich Kobialka, Maram Akila, Nico M. Schmidt, Peter Schlicht, Jan David Schneider, Fabian Hüger, Matthias Rottmann, Sebastian Houben, Tim Wirtz

    Abstract: Many machine learning applications can benefit from simulated data for systematic validation - in particular if real-life data is difficult to obtain or annotate. However, since simulations are prone to domain shift w.r.t. real-life data, it is crucial to verify the transferability of the obtained results. We propose a novel framework consisting of a generative label-to-image synthesis model toget…

    Submitted 10 June, 2021; originally announced June 2021.

    Comments: The first two authors contributed equally. Accepted at the 4th Workshop on "Ensuring and Validating Safety for Automated Vehicles" (WS13), IV2021. Under IEEE Copyright

  40. arXiv:2104.14812  [pdf, other]

    cs.CV

    SegmentMeIfYouCan: A Benchmark for Anomaly Segmentation

    Authors: Robin Chan, Krzysztof Lis, Svenja Uhlemeyer, Hermann Blum, Sina Honari, Roland Siegwart, Pascal Fua, Mathieu Salzmann, Matthias Rottmann

    Abstract: State-of-the-art semantic or instance segmentation deep neural networks (DNNs) are usually trained on a closed set of semantic classes. As such, they are ill-equipped to handle previously-unseen objects. However, detecting and localizing such objects is crucial for safety-critical applications such as perception for automated driving, especially if they appear on the road ahead. While some methods…

    Submitted 9 November, 2021; v1 submitted 30 April, 2021; originally announced April 2021.

    Comments: 35 pages, 18 figures, 16 tables, website https://segmentmeifyoucan.com/, NeurIPS 2021 Track on Datasets and Benchmarks

    MSC Class: 68T45; 62-07

    ACM Class: I.4.6; I.4.9

  41. Inspect, Understand, Overcome: A Survey of Practical Methods for AI Safety

    Authors: Sebastian Houben, Stephanie Abrecht, Maram Akila, Andreas Bär, Felix Brockherde, Patrick Feifel, Tim Fingscheidt, Sujan Sai Gannamaneni, Seyed Eghbal Ghobadi, Ahmed Hammam, Anselm Haselhoff, Felix Hauser, Christian Heinzemann, Marco Hoffmann, Nikhil Kapoor, Falk Kappel, Marvin Klingner, Jan Kronenberger, Fabian Küppers, Jonas Löhdefink, Michael Mlynarski, Michael Mock, Firas Mualla, Svetlana Pavlitskaya, Maximilian Poretschkin, et al. (16 additional authors not shown)

    Abstract: The use of deep neural networks (DNNs) in safety-critical applications like mobile health and autonomous driving is challenging due to numerous model-inherent shortcomings. These shortcomings are diverse and range from a lack of generalization over insufficient interpretability to problems with malicious inputs. Cyber-physical systems employing DNNs are therefore likely to suffer from safety conce…

    Submitted 29 April, 2021; originally announced April 2021.

    Comments: 94 pages

    Journal ref: Fingscheidt, T., Gottschalk, H., Houben, S. (eds) Deep Neural Networks and Data for Automated Driving, Springer, Cham (2022)

  42. arXiv:2012.07504  [pdf, other]

    cs.CV cs.LG

    Improving Video Instance Segmentation by Light-weight Temporal Uncertainty Estimates

    Authors: Kira Maag, Matthias Rottmann, Serin Varghese, Fabian Hueger, Peter Schlicht, Hanno Gottschalk

    Abstract: Instance segmentation with neural networks is an essential task in environment perception. In many works, it has been observed that neural networks can predict false positive instances with high confidence values and true positives with low ones. Thus, it is important to accurately model the uncertainties of neural networks in order to prevent safety issues and foster interpretability. In applicat…

    Submitted 13 April, 2021; v1 submitted 14 December, 2020; originally announced December 2020.

  43. arXiv:2012.06575  [pdf, other]

    cs.CV cs.LG

    Entropy Maximization and Meta Classification for Out-Of-Distribution Detection in Semantic Segmentation

    Authors: Robin Chan, Matthias Rottmann, Hanno Gottschalk

    Abstract: Deep neural networks (DNNs) for the semantic segmentation of images are usually trained to operate on a predefined closed set of object classes. This is in contrast to the "open world" setting in which DNNs are envisioned to be deployed. From a functional safety point of view, the ability to detect so-called "out-of-distribution" (OoD) samples, i.e., objects outside of a DNN's semantic space, is c…

    Submitted 27 July, 2021; v1 submitted 9 December, 2020; originally announced December 2020.

    Comments: 18 pages, 20 figures, ICCV 2021

    MSC Class: 68T45; 62-07

  44. arXiv:2011.12087  [pdf, ps, other]

    cs.LG math.ST

    A Convenient Infinite Dimensional Framework for Generative Adversarial Learning

    Authors: Hayk Asatryan, Hanno Gottschalk, Marieke Lippert, Matthias Rottmann

    Abstract: In recent years, generative adversarial networks (GANs) have demonstrated impressive experimental results while there are only a few works that foster statistical learning theory for GANs. In this work, we propose an infinite dimensional theoretical framework for generative adversarial learning. We assume that the probability density functions of the underlying measure are uniformly bounded, $k$-t…

    Submitted 19 January, 2023; v1 submitted 24 November, 2020; originally announced November 2020.

    MSC Class: 62G20; 68T05

  45. arXiv:2010.03320  [pdf, other]

    cs.CV cs.LG

    YOdar: Uncertainty-based Sensor Fusion for Vehicle Detection with Camera and Radar Sensors

    Authors: Kamil Kowol, Matthias Rottmann, Stefan Bracke, Hanno Gottschalk

    Abstract: In this work, we present an uncertainty-based method for sensor fusion with camera and radar data. The outputs of two neural networks, one processing camera and the other one radar data, are combined in an uncertainty aware manner. To this end, we gather the outputs and corresponding meta information for both networks. For each predicted object, the gathered information is post-processed by a grad…

    Submitted 23 November, 2020; v1 submitted 7 October, 2020; originally announced October 2020.

  46. arXiv:2010.01884  [pdf, other]

    cs.CV

    MetaBox+: A new Region Based Active Learning Method for Semantic Segmentation using Priority Maps

    Authors: Pascal Colling, Lutz Roese-Koerner, Hanno Gottschalk, Matthias Rottmann

    Abstract: We present a novel region based active learning method for semantic image segmentation, called MetaBox+. For acquisition, we train a meta regression model to estimate the segment-wise Intersection over Union (IoU) of each predicted segment of unlabeled images. This can be understood as an estimation of segment-wise prediction quality. Queried regions are supposed to minimize two competing targets,…

    Submitted 5 October, 2020; originally announced October 2020.

  47. arXiv:2010.01695  [pdf, other]

    cs.CV cs.LG

    MetaDetect: Uncertainty Quantification and Prediction Quality Estimates for Object Detection

    Authors: Marius Schubert, Karsten Kahl, Matthias Rottmann

    Abstract: In object detection with deep neural networks, the box-wise objectness score tends to be overconfident, sometimes even indicating high confidence in the presence of inaccurate predictions. Hence, the reliability of the prediction and therefore reliable uncertainties are of highest interest. In this work, we present a post processing method that for any given neural network provides predictive uncertai…

    Submitted 6 October, 2020; v1 submitted 4 October, 2020; originally announced October 2020.

    Comments: 11 pages, 5 figures, 5 tables

  48. arXiv:2009.11397  [pdf, other]

    cs.LG cs.CR cs.CV stat.ML

    Detection of Iterative Adversarial Attacks via Counter Attack

    Authors: Matthias Rottmann, Kira Maag, Mathis Peyron, Natasa Krejic, Hanno Gottschalk

    Abstract: Deep neural networks (DNNs) have proven to be powerful tools for processing unstructured data. However, for high-dimensional data, like images, they are inherently vulnerable to adversarial attacks. Small, almost invisible perturbations added to the input can be used to fool DNNs. Various attacks, hardening methods and detection methods have been introduced in recent years. Notoriously, Carlini-Wagn…

    Submitted 23 March, 2021; v1 submitted 23 September, 2020; originally announced September 2020.

    MSC Class: 68T45; 62-07

  49. arXiv:2005.06831  [pdf, other]

    cs.CV

    Detection and Retrieval of Out-of-Distribution Objects in Semantic Segmentation

    Authors: Philipp Oberdiek, Matthias Rottmann, Gernot A. Fink

    Abstract: When deploying deep learning technology in self-driving cars, deep neural networks are constantly exposed to domain shifts. These include, e.g., changes in weather conditions, time of day, and long-term temporal shift. In this work we utilize a deep neural network trained on the Cityscapes dataset containing urban street scenes and infer images from a different dataset, the A2D2 dataset, containin…

    Submitted 14 May, 2020; originally announced May 2020.

  50. arXiv:1912.07420  [pdf, other]

    cs.CV

    MetaFusion: Controlled False-Negative Reduction of Minority Classes in Semantic Segmentation

    Authors: Robin Chan, Matthias Rottmann, Fabian Hüger, Peter Schlicht, Hanno Gottschalk

    Abstract: In semantic segmentation datasets, classes of high importance are oftentimes underrepresented, e.g., humans in street scenes. Neural networks are usually trained to reduce the overall number of errors, attaching identical loss to errors of all kinds. However, this is not necessarily aligned with human intuition. For instance, an overlooked pedestrian seems more severe than an incorrectly detected…

    Submitted 16 December, 2019; originally announced December 2019.