-
The Gaussian Discriminant Variational Autoencoder (GdVAE): A Self-Explainable Model with Counterfactual Explanations
Authors:
Anselm Haselhoff,
Kevin Trelenberg,
Fabian Küppers,
Jonas Schneider
Abstract:
Visual counterfactual explanation (CF) methods modify image concepts, e.g, shape, to change a prediction to a predefined outcome while closely resembling the original query image. Unlike self-explainable models (SEMs) and heatmap techniques, they grant users the ability to examine hypothetical "what-if" scenarios. Previous CF methods either entail post-hoc training, limiting the balance between tr…
▽ More
Visual counterfactual explanation (CF) methods modify image concepts, e.g, shape, to change a prediction to a predefined outcome while closely resembling the original query image. Unlike self-explainable models (SEMs) and heatmap techniques, they grant users the ability to examine hypothetical "what-if" scenarios. Previous CF methods either entail post-hoc training, limiting the balance between transparency and CF quality, or demand optimization during inference. To bridge the gap between transparent SEMs and CF methods, we introduce the GdVAE, a self-explainable model based on a conditional variational autoencoder (CVAE), featuring a Gaussian discriminant analysis (GDA) classifier and integrated CF explanations. Full transparency is achieved through a generative classifier that leverages class-specific prototypes for the downstream task and a closed-form solution for CFs in the latent space. The consistency of CFs is improved by regularizing the latent space with the explainer function. Extensive comparisons with existing approaches affirm the effectiveness of our method in producing high-quality CF explanations while preserving transparency. Code and models are public.
△ Less
Submitted 19 September, 2024;
originally announced September 2024.
-
Uncertainty Calibration and its Application to Object Detection
Authors:
Fabian Küppers
Abstract:
Image-based environment perception is an important component especially for driver assistance systems or autonomous driving. In this scope, modern neuronal networks are used to identify multiple objects as well as the according position and size information within a single frame. The performance of such an object detection model is important for the overall performance of the whole system. However…
▽ More
Image-based environment perception is an important component especially for driver assistance systems or autonomous driving. In this scope, modern neuronal networks are used to identify multiple objects as well as the according position and size information within a single frame. The performance of such an object detection model is important for the overall performance of the whole system. However, a detection model might also predict these objects under a certain degree of uncertainty. [...]
In this work, we examine the semantic uncertainty (which object type?) as well as the spatial uncertainty (where is the object and how large is it?). We evaluate if the predicted uncertainties of an object detection model match with the observed error that is achieved on real-world data. In the first part of this work, we introduce the definition for confidence calibration of the semantic uncertainty in the context of object detection, instance segmentation, and semantic segmentation. We integrate additional position information in our examinations to evaluate the effect of the object's position on the semantic calibration properties. Besides measuring calibration, it is also possible to perform a post-hoc recalibration of semantic uncertainty that might have turned out to be miscalibrated. [...]
The second part of this work deals with the spatial uncertainty obtained by a probabilistic detection model. [...] We review and extend common calibration methods so that it is possible to obtain parametric uncertainty distributions for the position information in a more flexible way.
In the last part, we demonstrate a possible use-case for our derived calibration methods in the context of object tracking. [...] We integrate our previously proposed calibration techniques and demonstrate the usefulness of semantic and spatial uncertainty calibration in a subsequent process. [...]
△ Less
Submitted 6 February, 2023;
originally announced February 2023.
-
Parametric and Multivariate Uncertainty Calibration for Regression and Object Detection
Authors:
Fabian Küppers,
Jonas Schneider,
Anselm Haselhoff
Abstract:
Reliable spatial uncertainty evaluation of object detection models is of special interest and has been subject of recent work. In this work, we review the existing definitions for uncertainty calibration of probabilistic regression tasks. We inspect the calibration properties of common detection networks and extend state-of-the-art recalibration methods. Our methods use a Gaussian process (GP) rec…
▽ More
Reliable spatial uncertainty evaluation of object detection models is of special interest and has been subject of recent work. In this work, we review the existing definitions for uncertainty calibration of probabilistic regression tasks. We inspect the calibration properties of common detection networks and extend state-of-the-art recalibration methods. Our methods use a Gaussian process (GP) recalibration scheme that yields parametric distributions as output (e.g. Gaussian or Cauchy). The usage of GP recalibration allows for a local (conditional) uncertainty calibration by capturing dependencies between neighboring samples. The use of parametric distributions such as as Gaussian allows for a simplified adaption of calibration in subsequent processes, e.g., for Kalman filtering in the scope of object tracking.
In addition, we use the GP recalibration scheme to perform covariance estimation which allows for post-hoc introduction of local correlations between the output quantities, e.g., position, width, or height in object detection. To measure the joint calibration of multivariate and possibly correlated data, we introduce the quantile calibration error which is based on the Mahalanobis distance between the predicted distribution and the ground truth to determine whether the ground truth is within a predicted quantile.
Our experiments show that common detection models overestimate the spatial uncertainty in comparison to the observed error. We show that the simple Isotonic Regression recalibration method is sufficient to achieve a good uncertainty quantification in terms of calibrated quantiles. In contrast, if normal distributions are required for subsequent processes, our GP-Normal recalibration method yields the best results. Finally, we show that our covariance estimation method is able to achieve best calibration results for joint multivariate calibration.
△ Less
Submitted 19 August, 2022; v1 submitted 4 July, 2022;
originally announced July 2022.
-
Confidence Calibration for Object Detection and Segmentation
Authors:
Fabian Küppers,
Anselm Haselhoff,
Jan Kronenberger,
Jonas Schneider
Abstract:
Calibrated confidence estimates obtained from neural networks are crucial, particularly for safety-critical applications such as autonomous driving or medical image diagnosis. However, although the task of confidence calibration has been investigated on classification problems, thorough investigations on object detection and segmentation problems are still missing. Therefore, we focus on the inves…
▽ More
Calibrated confidence estimates obtained from neural networks are crucial, particularly for safety-critical applications such as autonomous driving or medical image diagnosis. However, although the task of confidence calibration has been investigated on classification problems, thorough investigations on object detection and segmentation problems are still missing. Therefore, we focus on the investigation of confidence calibration for object detection and segmentation models in this chapter. We introduce the concept of multivariate confidence calibration that is an extension of well-known calibration methods to the task of object detection and segmentation. This allows for an extended confidence calibration that is also aware of additional features such as bounding box/pixel position, shape information, etc. Furthermore, we extend the expected calibration error (ECE) to measure miscalibration of object detection and segmentation models. We examine several network architectures on MS COCO as well as on Cityscapes and show that especially object detection as well as instance segmentation models are intrinsically miscalibrated given the introduced definition of calibration. Using our proposed calibration methods, we have been able to improve calibration so that it also has a positive impact on the quality of segmentation masks as well.
△ Less
Submitted 20 June, 2022; v1 submitted 25 February, 2022;
originally announced February 2022.
-
Bayesian Confidence Calibration for Epistemic Uncertainty Modelling
Authors:
Fabian Küppers,
Jan Kronenberger,
Jonas Schneider,
Anselm Haselhoff
Abstract:
Modern neural networks have found to be miscalibrated in terms of confidence calibration, i.e., their predicted confidence scores do not reflect the observed accuracy or precision. Recent work has introduced methods for post-hoc confidence calibration for classification as well as for object detection to address this issue. Especially in safety critical applications, it is crucial to obtain a reli…
▽ More
Modern neural networks have found to be miscalibrated in terms of confidence calibration, i.e., their predicted confidence scores do not reflect the observed accuracy or precision. Recent work has introduced methods for post-hoc confidence calibration for classification as well as for object detection to address this issue. Especially in safety critical applications, it is crucial to obtain a reliable self-assessment of a model. But what if the calibration method itself is uncertain, e.g., due to an insufficient knowledge base?
We introduce Bayesian confidence calibration - a framework to obtain calibrated confidence estimates in conjunction with an uncertainty of the calibration method. Commonly, Bayesian neural networks (BNN) are used to indicate a network's uncertainty about a certain prediction. BNNs are interpreted as neural networks that use distributions instead of weights for inference. We transfer this idea of using distributions to confidence calibration. For this purpose, we use stochastic variational inference to build a calibration mapping that outputs a probability distribution rather than a single calibrated estimate. Using this approach, we achieve state-of-the-art calibration performance for object detection calibration. Finally, we show that this additional type of uncertainty can be used as a sufficient criterion for covariate shift detection. All code is open source and available at https://github.com/EFS-OpenSource/calibration-framework.
△ Less
Submitted 21 September, 2021;
originally announced September 2021.
-
Inspect, Understand, Overcome: A Survey of Practical Methods for AI Safety
Authors:
Sebastian Houben,
Stephanie Abrecht,
Maram Akila,
Andreas Bär,
Felix Brockherde,
Patrick Feifel,
Tim Fingscheidt,
Sujan Sai Gannamaneni,
Seyed Eghbal Ghobadi,
Ahmed Hammam,
Anselm Haselhoff,
Felix Hauser,
Christian Heinzemann,
Marco Hoffmann,
Nikhil Kapoor,
Falk Kappel,
Marvin Klingner,
Jan Kronenberger,
Fabian Küppers,
Jonas Löhdefink,
Michael Mlynarski,
Michael Mock,
Firas Mualla,
Svetlana Pavlitskaya,
Maximilian Poretschkin
, et al. (16 additional authors not shown)
Abstract:
The use of deep neural networks (DNNs) in safety-critical applications like mobile health and autonomous driving is challenging due to numerous model-inherent shortcomings. These shortcomings are diverse and range from a lack of generalization over insufficient interpretability to problems with malicious inputs. Cyber-physical systems employing DNNs are therefore likely to suffer from safety conce…
▽ More
The use of deep neural networks (DNNs) in safety-critical applications like mobile health and autonomous driving is challenging due to numerous model-inherent shortcomings. These shortcomings are diverse and range from a lack of generalization over insufficient interpretability to problems with malicious inputs. Cyber-physical systems employing DNNs are therefore likely to suffer from safety concerns. In recent years, a zoo of state-of-the-art techniques aiming to address these safety concerns has emerged. This work provides a structured and broad overview of them. We first identify categories of insufficiencies to then describe research activities aiming at their detection, quantification, or mitigation. Our paper addresses both machine learning experts and safety engineers: The former ones might profit from the broad range of machine learning topics covered and discussions on limitations of recent methods. The latter ones might gain insights into the specifics of modern ML methods. We moreover hope that our contribution fuels discussions on desiderata for ML systems and strategies on how to propel existing approaches accordingly.
△ Less
Submitted 29 April, 2021;
originally announced April 2021.
-
Analysis of a dynamical system modeling lasers and applications for optical neural networks
Authors:
Lauri Ylinen,
Tuomo von Lerber,
Franko Küppers,
Matti Lassas
Abstract:
An analytical study of dynamical properties of a semiconductor laser with optical injection of arbitrary polarization is presented. It is shown that if the injected field is sufficiently weak, then the laser has nine equilibrium points, however, only one of them is stable. Even if the injected field is linearly polarized, six of the equilibrium points have a state of polarization that is elliptica…
▽ More
An analytical study of dynamical properties of a semiconductor laser with optical injection of arbitrary polarization is presented. It is shown that if the injected field is sufficiently weak, then the laser has nine equilibrium points, however, only one of them is stable. Even if the injected field is linearly polarized, six of the equilibrium points have a state of polarization that is elliptical. Dependence of the equilibrium points on the injected field is described, and it is shown that as the intensity of the injected field increases, the number of equilibrium points decreases, with only a single equilibrium point remaining for strong enough injected fields. As an application, a complex-valued optical neural network with working principle based on injection locking is proposed.
△ Less
Submitted 17 November, 2021; v1 submitted 3 March, 2021;
originally announced March 2021.
-
From Black-box to White-box: Examining Confidence Calibration under different Conditions
Authors:
Franziska Schwaiger,
Maximilian Henne,
Fabian Küppers,
Felippe Schmoeller Roza,
Karsten Roscher,
Anselm Haselhoff
Abstract:
Confidence calibration is a major concern when applying artificial neural networks in safety-critical applications. Since most research in this area has focused on classification in the past, confidence calibration in the scope of object detection has gained more attention only recently. Based on previous work, we study the miscalibration of object detection models with respect to image location a…
▽ More
Confidence calibration is a major concern when applying artificial neural networks in safety-critical applications. Since most research in this area has focused on classification in the past, confidence calibration in the scope of object detection has gained more attention only recently. Based on previous work, we study the miscalibration of object detection models with respect to image location and box scale. Our main contribution is to additionally consider the impact of box selection methods like non-maximum suppression to calibration. We investigate the default intrinsic calibration of object detection models and how it is affected by these post-processing techniques. For this purpose, we distinguish between black-box calibration with non-maximum suppression and white-box calibration with raw network outputs. Our experiments reveal that post-processing highly affects confidence calibration. We show that non-maximum suppression has the potential to degrade initially well-calibrated predictions, leading to overconfident and thus miscalibrated models.
△ Less
Submitted 8 January, 2021;
originally announced January 2021.
-
Multivariate Confidence Calibration for Object Detection
Authors:
Fabian Küppers,
Jan Kronenberger,
Amirhossein Shantia,
Anselm Haselhoff
Abstract:
Unbiased confidence estimates of neural networks are crucial especially for safety-critical applications. Many methods have been developed to calibrate biased confidence estimates. Though there is a variety of methods for classification, the field of object detection has not been addressed yet. Therefore, we present a novel framework to measure and calibrate biased (or miscalibrated) confidence es…
▽ More
Unbiased confidence estimates of neural networks are crucial especially for safety-critical applications. Many methods have been developed to calibrate biased confidence estimates. Though there is a variety of methods for classification, the field of object detection has not been addressed yet. Therefore, we present a novel framework to measure and calibrate biased (or miscalibrated) confidence estimates of object detection methods. The main difference to related work in the field of classifier calibration is that we also use additional information of the regression output of an object detector for calibration. Our approach allows, for the first time, to obtain calibrated confidence estimates with respect to image location and box scale. In addition, we propose a new measure to evaluate miscalibration of object detectors. Finally, we show that our developed methods outperform state-of-the-art calibration models for the task of object detection and provides reliable confidence estimates across different locations and scales.
△ Less
Submitted 28 April, 2020;
originally announced April 2020.
-
Optical computing by injection-locked lasers
Authors:
Tuomo von Lerber,
Matti Lassas,
Quang Trung Le,
Vladimir Lyubopytov,
Arkadi Chipouline,
Klaus Hofmann,
Franko Kueppers
Abstract:
A programmable optical computer has remained an elusive concept. To construct a practical computing primitive equivalent to an electronic Boolean logic, one should find a nonlinear phenomenon that overcomes weaknesses present in many optical processing schemes. Ideally, the nonlinearity should provide a functionally complete set of logic operations, enable ultrafast all-optical programmability, an…
▽ More
A programmable optical computer has remained an elusive concept. To construct a practical computing primitive equivalent to an electronic Boolean logic, one should find a nonlinear phenomenon that overcomes weaknesses present in many optical processing schemes. Ideally, the nonlinearity should provide a functionally complete set of logic operations, enable ultrafast all-optical programmability, and allow cascaded operations without a change in the operating wavelength or in the signal encoding format. Here we demonstrate a programmable logic gate using an injection-locked Vertical-Cavity Surface-Emitting Laser (VCSEL). The gate program is switched between the AND and the OR operations at the rate of 1 GHz with Bit Error Ratio (BER) of 10e-6 without changes in the wavelength or in the signal encoding format. The scheme is based on nonlinearity of normalization operations, which can be used to construct any continuous complex function or operation, Boolean or otherwise.
△ Less
Submitted 22 September, 2017;
originally announced September 2017.
-
The quest for companions to post-common envelope binaries: I. Searching a sample of stars from the CSS and SDSS
Authors:
U. Backhaus,
S. Bauer,
K. Beuermann,
J. Diese,
S. Dreizler,
F. V. Hessman,
T. -O. Husser,
K. -H. Klapdohr,
J. Moellmanns,
R. Schuenecke,
J. Dette,
J. Dubbert,
T. Miosga,
A. L. Rochus Vogel,
S. Simons,
S. Biriuk,
M. Debrah,
M. Griemens,
A. Hahn,
T. Moeller,
M. Pawlowski,
M. Schweizer,
A. -L. Speck,
C. Zapros,
T. Bollmann
, et al. (17 additional authors not shown)
Abstract:
As part of an ongoing collaboration between student groups at high schools and professional astronomers, we have searched for the presence of circum-binary planets in a bona-fide unbiased sample of twelve post-common envelope binaries (PCEBs) from the Catalina Sky Survey (CSS) and the Sloan Digital Sky Survey (SDSS). Although the present ephemerides are significantly more accurate than previous on…
▽ More
As part of an ongoing collaboration between student groups at high schools and professional astronomers, we have searched for the presence of circum-binary planets in a bona-fide unbiased sample of twelve post-common envelope binaries (PCEBs) from the Catalina Sky Survey (CSS) and the Sloan Digital Sky Survey (SDSS). Although the present ephemerides are significantly more accurate than previous ones, we find no clear evidence for orbital period variations between 2005 and 2011 or during the 2011 observing season. The sparse long-term coverage still permits O-C variations with a period of years and an amplitude of tens of seconds, as found in other systems. Our observations provide the basis for future inferences about the frequency with which planet-sized or brown-dwarf companions have either formed in these evolved systems or survived the common envelope (CE) phase.
△ Less
Submitted 10 January, 2012;
originally announced January 2012.