-
Tailored Uncertainty Estimation for Deep Learning Systems
Authors:
Joachim Sicking,
Maram Akila,
Jan David Schneider,
Fabian Hüger,
Peter Schlicht,
Tim Wirtz,
Stefan Wrobel
Abstract:
Uncertainty estimation bears the potential to make deep learning (DL) systems more reliable. Standard techniques for uncertainty estimation, however, come with specific combinations of strengths and weaknesses, e.g., with respect to estimation quality, generalization abilities and computational complexity. To actually harness the potential of uncertainty quantification, estimators are required whose properties closely match the requirements of a given use case. In this work, we propose a framework that, firstly, structures and shapes these requirements, secondly, guides the selection of a suitable uncertainty estimation method and, thirdly, provides strategies to validate this choice and to uncover structural weaknesses. By contributing tailored uncertainty estimation in this sense, our framework helps to foster trustworthy DL systems. Moreover, it anticipates prospective machine learning regulations that require, e.g., in the EU, evidence of the technical appropriateness of machine learning systems. Our framework provides such evidence for system components modeling uncertainty.
Submitted 29 April, 2022;
originally announced April 2022.
-
Validation of Simulation-Based Testing: Bypassing Domain Shift with Label-to-Image Synthesis
Authors:
Julia Rosenzweig,
Eduardo Brito,
Hans-Ulrich Kobialka,
Maram Akila,
Nico M. Schmidt,
Peter Schlicht,
Jan David Schneider,
Fabian Hüger,
Matthias Rottmann,
Sebastian Houben,
Tim Wirtz
Abstract:
Many machine learning applications can benefit from simulated data for systematic validation - in particular if real-life data is difficult to obtain or annotate. However, since simulations are prone to domain shift w.r.t. real-life data, it is crucial to verify the transferability of the obtained results. We propose a novel framework consisting of a generative label-to-image synthesis model together with different transferability measures to inspect to what extent we can transfer testing results of semantic segmentation models from synthetic data to equivalent real-life data. With slight modifications, our approach is extendable to, e.g., general multi-class classification tasks. Grounded on the transferability analysis, our approach additionally allows for extensive testing by incorporating controlled simulations. We validate our approach empirically on a semantic segmentation task on driving scenes. Transferability is tested using correlation analysis of IoU and a learned discriminator. Although the latter can distinguish between real-life and synthetic tests, in the former we observe surprisingly strong correlations of 0.7 for both cars and pedestrians.
Submitted 10 June, 2021;
originally announced June 2021.
-
Inspect, Understand, Overcome: A Survey of Practical Methods for AI Safety
Authors:
Sebastian Houben,
Stephanie Abrecht,
Maram Akila,
Andreas Bär,
Felix Brockherde,
Patrick Feifel,
Tim Fingscheidt,
Sujan Sai Gannamaneni,
Seyed Eghbal Ghobadi,
Ahmed Hammam,
Anselm Haselhoff,
Felix Hauser,
Christian Heinzemann,
Marco Hoffmann,
Nikhil Kapoor,
Falk Kappel,
Marvin Klingner,
Jan Kronenberger,
Fabian Küppers,
Jonas Löhdefink,
Michael Mlynarski,
Michael Mock,
Firas Mualla,
Svetlana Pavlitskaya,
Maximilian Poretschkin
, et al. (16 additional authors not shown)
Abstract:
The use of deep neural networks (DNNs) in safety-critical applications like mobile health and autonomous driving is challenging due to numerous model-inherent shortcomings. These shortcomings are diverse and range from a lack of generalization, through insufficient interpretability, to problems with malicious inputs. Cyber-physical systems employing DNNs are therefore likely to suffer from safety concerns. In recent years, a zoo of state-of-the-art techniques aiming to address these safety concerns has emerged. This work provides a structured and broad overview of them. We first identify categories of insufficiencies and then describe research activities aiming at their detection, quantification, or mitigation. Our paper addresses both machine learning experts and safety engineers: The former might profit from the broad range of machine learning topics covered and discussions on limitations of recent methods. The latter might gain insights into the specifics of modern ML methods. We moreover hope that our contribution fuels discussions on desiderata for ML systems and strategies on how to propel existing approaches accordingly.
Submitted 29 April, 2021;
originally announced April 2021.
-
Plants Don't Walk on the Street: Common-Sense Reasoning for Reliable Semantic Segmentation
Authors:
Linara Adilova,
Elena Schulz,
Maram Akila,
Sebastian Houben,
Jan David Schneider,
Fabian Hueger,
Tim Wirtz
Abstract:
Data-driven sensor interpretation in autonomous driving can lead to highly implausible predictions, as can usually be verified with common-sense knowledge. However, learning common knowledge only from data is hard, and approaches for knowledge integration are an active research area. We propose to use a partly human-designed, partly learned set of rules to describe relations between objects of a traffic scene on a high level of abstraction. In doing so, we improve and robustify existing deep neural networks consuming low-level sensor information. We present an initial study adapting the well-established Probabilistic Soft Logic (PSL) framework to validate and improve on the problem of semantic segmentation. We describe in detail how we integrate common knowledge into the segmentation pipeline using PSL and verify our approach in a set of experiments demonstrating the increase in robustness against several severe image distortions applied to the A2D2 autonomous driving data set.
Submitted 19 April, 2021;
originally announced April 2021.
-
Street-Map Based Validation of Semantic Segmentation in Autonomous Driving
Authors:
Laura von Rueden,
Tim Wirtz,
Fabian Hueger,
Jan David Schneider,
Nico Piatkowski,
Christian Bauckhage
Abstract:
Artificial intelligence for autonomous driving must meet strict requirements on safety and robustness, which motivates the thorough validation of learned models. However, current validation approaches mostly require ground truth data and are thus both cost-intensive and limited in their applicability. We propose to overcome these limitations by a model-agnostic validation using a-priori knowledge from street maps. In particular, we show how to validate semantic segmentation masks and demonstrate the potential of our approach using OpenStreetMap. We introduce validation metrics that indicate false positive or false negative road segments. Besides the validation approach, we present a method to correct the vehicle's GPS position so that a more accurate localization can be used for the street-map based validation. Lastly, we present quantitative results on the Cityscapes dataset indicating that our validation approach can indeed uncover errors in semantic segmentation masks.
Submitted 15 April, 2021;
originally announced April 2021.
-
Effective strain manipulation of the antiferromagnetic state of polycrystalline NiO
Authors:
A. Barra,
A. Ross,
O. Gomonay,
L. Baldrati,
A. Chavez,
R. Lebrun,
J. D. Schneider,
P. Shirazi,
Q. Wang,
J. Sinova,
G. P. Carman,
M. Kläui
Abstract:
As a candidate material for applications such as magnetic memory, polycrystalline antiferromagnets offer the same robustness to external magnetic fields, THz spin dynamics, and lack of stray field as their single crystalline counterparts, but without the limitation of epitaxial growth and lattice matched substrates. Here, we first report the detection of the average Néel vector orientation in polycrystalline NiO via spin Hall magnetoresistance (SMR). Secondly, by applying strain through a piezo-electric substrate, we reduce the critical magnetic field required to reach a saturation of the SMR signal, indicating a change of the anisotropy. Our results are consistent with polycrystalline NiO exhibiting a positive sign of the in-plane magnetostriction. This method of anisotropy-tuning offers an energy efficient, on-chip alternative to manipulate a polycrystalline antiferromagnet's magnetic state.
Submitted 24 March, 2021;
originally announced March 2021.
-
From a Fourier-Domain Perspective on Adversarial Examples to a Wiener Filter Defense for Semantic Segmentation
Authors:
Nikhil Kapoor,
Andreas Bär,
Serin Varghese,
Jan David Schneider,
Fabian Hüger,
Peter Schlicht,
Tim Fingscheidt
Abstract:
Despite recent advancements, deep neural networks are not robust against adversarial perturbations. Many of the proposed adversarial defense approaches use computationally expensive training mechanisms that do not scale to complex real-world tasks such as semantic segmentation, and offer only marginal improvements. In addition, fundamental questions on the nature of adversarial perturbations and their relation to the network architecture are largely understudied. In this work, we study the adversarial problem from a frequency domain perspective. More specifically, we analyze discrete Fourier transform (DFT) spectra of several adversarial images and report two major findings: First, there exists a strong connection between a model architecture and the nature of adversarial perturbations that can be observed and addressed in the frequency domain. Second, the observed frequency patterns are largely image- and attack-type independent, which is important for the practical impact of any defense making use of such patterns. Motivated by these findings, we additionally propose an adversarial defense method based on the well-known Wiener filters that captures and suppresses adversarial frequencies in a data-driven manner. Our proposed method not only generalizes across unseen attacks but also beats five existing state-of-the-art methods across two models in a variety of attack settings.
Submitted 21 April, 2021; v1 submitted 2 December, 2020;
originally announced December 2020.
-
Towards Map-Based Validation of Semantic Segmentation Masks
Authors:
Laura von Rueden,
Tim Wirtz,
Fabian Hueger,
Jan David Schneider,
Christian Bauckhage
Abstract:
Artificial intelligence for autonomous driving must meet strict requirements on safety and robustness. We propose to validate machine learning models for self-driving vehicles not only with given ground truth labels, but also with additional a-priori knowledge. In particular, we suggest validating the drivable area in semantic segmentation masks using given street map data. We present first results, which indicate that prediction errors can be uncovered by map-based validation.
Submitted 26 November, 2020; v1 submitted 3 November, 2020;
originally announced November 2020.
-
Determining Phase-Space Properties of the LEDA RFQ Output Beam
Authors:
W. P. Lysenko,
J. D. Gilpatrick,
L. J. Rybarcyk,
J. D. Schneider,
H. V. Smith Jr,
L. M. Young,
M. E. Schulze
Abstract:
Quadrupole scans were used to characterize the LEDA RFQ beam. Experimental data were fit to computer simulation models for the rms beam size. The codes were found to be inadequate in accurately reproducing details of the wire scanner data. When this discrepancy is resolved, we plan to fit using all the data in wire scanner profiles, not just the rms values, using a 3-D nonlinear code.
Submitted 19 August, 2000;
originally announced August 2000.
-
Status Report on the Low-Energy Demonstration Accelerator (LEDA)
Authors:
H. Vernon Smith, Jr.,
J. David Schneider
Abstract:
The 75-keV injector and 6.7-MeV RFQ that comprise the first portion of the cw, 100-mA proton linac for the accelerator production of tritium (APT) project have been built and operated. The LEDA RFQ has been extensively tested for pulsed and cw output-beam currents ≤100 mA. Up to 2.2 MW of cw rf power from the 350-MHz rf system is coupled into the RFQ, including 670 kW for the cw proton beam. The emittance for a 93-mA pulsed RFQ output beam, as determined from quadrupole-magnet-scan measurements, is $\varepsilon_x \times \varepsilon_y = 0.25 \times 0.31\ (\pi\,\mathrm{mm\,mrad})^2$ [rms normalized]. A follow-on experiment, to intentionally introduce and measure beam halo on the RFQ output beam, is now being installed.
Submitted 18 August, 2000;
originally announced August 2000.
-
High Power Operations of LEDA
Authors:
L. M. Young,
L. J. Rybarcyk,
J. D. Schneider,
M. E. Schulze,
H. V. Smith
Abstract:
The LEDA RFQ, a 350-MHz continuous wave (CW) radio-frequency quadrupole (RFQ), successfully accelerated a 100-mA CW proton beam from 75 keV to 6.7 MeV. We have accumulated 111 hr of beam-on time with at least 90 mA of CW output beam current. The 8-m-long RFQ accelerates a dc, 75-keV, ~106-mA H+ beam from the LEDA injector with ~94% transmission. When operating the RFQ at the RF power level for which it was designed, the peak electrical field on the vane tips is 33 MV/m. However, to maintain the high transmission quoted above with the CW beam, it was necessary to operate the RFQ with field levels ~10% higher than design. The RFQ dissipates 1.5 MW of RF power when operating with this field. Three klystrons provide the 2.2 MW of RF power required by the RFQ to accelerate the 100-mA beam. The beam power is 670 kW. Some of the challenges that were met in accelerating a 100-mA CW proton beam to 6.7 MeV will be discussed.
Submitted 18 August, 2000;
originally announced August 2000.
-
LEDA Beam Operations Milestone and Observed Beam Transmission Characteristics
Authors:
L. J. Rybarcyk,
J. D. Schneider,
H. V. Smith,
L. M. Young,
M. E. Schulze
Abstract:
Recently, the Low-Energy Demonstration Accelerator (LEDA) portion of the Accelerator Production of Tritium (APT) project reached its 100-mA, 8-hr CW beam operation milestone. LEDA consists of a 75-keV proton injector, 6.7-MeV, 350-MHz CW radio-frequency quadrupole (RFQ) with associated high-power and low-level rf systems, a short high-energy beam transport (HEBT) and high-power (670-kW CW) beam dump. During the commissioning phase it was discovered that the RFQ field level must be approximately 5-10% higher than design in order to accelerate the full 100-mA beam with low losses. Measurements of a low-duty-factor, 100-mA beam show the beam transmission is unexpectedly low for RFQ field levels between ~90 and 105% of design. This paper will describe some aspects of LEDA operations critical to achieving the above milestone. Measurement and simulation results for reduced RFQ beam transmission near design operating conditions are also presented.
Submitted 16 August, 2000;
originally announced August 2000.
-
Beam Emittance Measurements for the Low-Energy Demonstration Accelerator Radio-Frequency Quadrupole
Authors:
M. E. Schulze,
J. D. Gilpatrick,
W. P. Lysenko,
L. J. Rybarcyk,
J. D. Schneider,
H. V. Smith, Jr.,
L. M. You
Abstract:
The Low-Energy Demonstration Accelerator (LEDA) radio-frequency quadrupole (RFQ) is a 100% duty factor (CW) linac that delivers >100 mA of H+ beam at 6.7 MeV. The 8-m-long, 350-MHz RFQ structure accelerates a dc, 75-keV, 110-mA H+ beam from the LEDA injector with >90% transmission. LEDA [1,2] consists of a 75-keV proton injector, 6.7-MeV, 350-MHz CW RFQ with associated high-power and low-level rf systems, a short high-energy beam transport (HEBT) and high-power (670-kW CW) beam stop. The beam emittance is inferred from wire scanner measurements of the beam profile at a single location in the HEBT. The beam profile is measured as a function of the magnetic field gradient in one of the HEBT quadrupoles. As the gradient is changed the spot size passes through a transverse waist. Measurements are presented for peak currents between 25 and 100 mA.
Submitted 17 August, 2000; v1 submitted 15 August, 2000;
originally announced August 2000.