-
The Influence of Faulty Labels in Data Sets on Human Pose Estimation
Authors:
Arnold Schwarz,
Levente Hernadi,
Felix Bießmann,
Kristian Hildebrand
Abstract:
In this study we provide empirical evidence demonstrating that the quality of training data impacts model performance in Human Pose Estimation (HPE). Inaccurate labels in widely used data sets, ranging from minor errors to severe mislabeling, can negatively influence learning and distort performance metrics. We perform an in-depth analysis of popular HPE data sets to show the extent and nature of…
▽ More
In this study we provide empirical evidence demonstrating that the quality of training data impacts model performance in Human Pose Estimation (HPE). Inaccurate labels in widely used data sets, ranging from minor errors to severe mislabeling, can negatively influence learning and distort performance metrics. We perform an in-depth analysis of popular HPE data sets to show the extent and nature of label inaccuracies. Our findings suggest that accounting for the impact of faulty labels will facilitate the development of more robust and accurate HPE models for a variety of real-world applications. We show improved performance with cleansed data.
△ Less
Submitted 9 September, 2024; v1 submitted 5 September, 2024;
originally announced September 2024.
-
Generating Synthetic Satellite Imagery for Rare Objects: An Empirical Comparison of Models and Metrics
Authors:
Tuong Vy Nguyen,
Johannes Hoster,
Alexander Glaser,
Kristian Hildebrand,
Felix Biessmann
Abstract:
Generative deep learning architectures can produce realistic, high-resolution fake imagery -- with potentially drastic societal implications. A key question in this context is: How easy is it to generate realistic imagery, in particular for niche domains. The iterative process required to achieve specific image content is difficult to automate and control. Especially for rare classes, it remains d…
▽ More
Generative deep learning architectures can produce realistic, high-resolution fake imagery -- with potentially drastic societal implications. A key question in this context is: How easy is it to generate realistic imagery, in particular for niche domains. The iterative process required to achieve specific image content is difficult to automate and control. Especially for rare classes, it remains difficult to assess fidelity, meaning whether generative approaches produce realistic imagery and alignment, meaning how (well) the generation can be guided by human input. In this work, we present a large-scale empirical evaluation of generative architectures which we fine-tuned to generate synthetic satellite imagery. We focus on nuclear power plants as an example of a rare object category - as there are only around 400 facilities worldwide, this restriction is exemplary for many other scenarios in which training and test data is limited by the restricted number of occurrences of real-world examples. We generate synthetic imagery by conditioning on two kinds of modalities, textual input and image input obtained from a game engine that allows for detailed specification of the building layout. The generated images are assessed by commonly used metrics for automatic evaluation and then compared with human judgement from our conducted user studies to assess their trustworthiness. Our results demonstrate that even for rare objects, generation of authentic synthetic satellite imagery with textual or detailed building layouts is feasible. In line with previous work, we find that automated metrics are often not aligned with human perception -- in fact, we find strong negative correlations between commonly used image quality metrics and human ratings.
△ Less
Submitted 2 September, 2024;
originally announced September 2024.
-
Using Game Engines and Machine Learning to Create Synthetic Satellite Imagery for a Tabletop Verification Exercise
Authors:
Johannes Hoster,
Sara Al-Sayed,
Felix Biessmann,
Alexander Glaser,
Kristian Hildebrand,
Igor Moric,
Tuong Vy Nguyen
Abstract:
Satellite imagery is regarded as a great opportunity for citizen-based monitoring of activities of interest. Relevant imagery may however not be available at sufficiently high resolution, quality, or cadence -- let alone be uniformly accessible to open-source analysts. This limits an assessment of the true long-term potential of citizen-based monitoring of nuclear activities using publicly availab…
▽ More
Satellite imagery is regarded as a great opportunity for citizen-based monitoring of activities of interest. Relevant imagery may however not be available at sufficiently high resolution, quality, or cadence -- let alone be uniformly accessible to open-source analysts. This limits an assessment of the true long-term potential of citizen-based monitoring of nuclear activities using publicly available satellite imagery. In this article, we demonstrate how modern game engines combined with advanced machine-learning techniques can be used to generate synthetic imagery of sites of interest with the ability to choose relevant parameters upon request; these include time of day, cloud cover, season, or level of activity onsite. At the same time, resolution and off-nadir angle can be adjusted to simulate different characteristics of the satellite. While there are several possible use-cases for synthetic imagery, here we focus on its usefulness to support tabletop exercises in which simple monitoring scenarios can be examined to better understand verification capabilities enabled by new satellite constellations and very short revisit times.
△ Less
Submitted 23 June, 2024; v1 submitted 17 April, 2024;
originally announced April 2024.
-
CAD Models to Real-World Images: A Practical Approach to Unsupervised Domain Adaptation in Industrial Object Classification
Authors:
Dennis Ritter,
Mike Hemberger,
Marc Hönig,
Volker Stopp,
Erik Rodner,
Kristian Hildebrand
Abstract:
In this paper, we systematically analyze unsupervised domain adaptation pipelines for object classification in a challenging industrial setting. In contrast to standard natural object benchmarks existing in the field, our results highlight the most important design choices when only category-labeled CAD models are available but classification needs to be done with real-world images. Our domain ada…
▽ More
In this paper, we systematically analyze unsupervised domain adaptation pipelines for object classification in a challenging industrial setting. In contrast to standard natural object benchmarks existing in the field, our results highlight the most important design choices when only category-labeled CAD models are available but classification needs to be done with real-world images. Our domain adaptation pipeline achieves SoTA performance on the VisDA benchmark, but more importantly, drastically improves recognition performance on our new open industrial dataset comprised of 102 mechanical parts. We conclude with a set of guidelines that are relevant for practitioners needing to apply state-of-the-art unsupervised domain adaptation in practice. Our code is available at https://github.com/dritter-bht/synthnet-transfer-learning.
△ Less
Submitted 7 October, 2023;
originally announced October 2023.
-
Coronal Heating as Determined by the Solar Flare Frequency Distribution Obtained by Aggregating Case Studies
Authors:
James Paul Mason,
Alexandra Werth,
Colin G. West,
Allison A. Youngblood,
Donald L. Woodraska,
Courtney Peck,
Kevin Lacjak,
Florian G. Frick,
Moutamen Gabir,
Reema A. Alsinan,
Thomas Jacobsen,
Mohammad Alrubaie,
Kayla M. Chizmar,
Benjamin P. Lau,
Lizbeth Montoya Dominguez,
David Price,
Dylan R. Butler,
Connor J. Biron,
Nikita Feoktistov,
Kai Dewey,
N. E. Loomis,
Michal Bodzianowski,
Connor Kuybus,
Henry Dietrick,
Aubrey M. Wolfe
, et al. (977 additional authors not shown)
Abstract:
Flare frequency distributions represent a key approach to addressing one of the largest problems in solar and stellar physics: determining the mechanism that counter-intuitively heats coronae to temperatures that are orders of magnitude hotter than the corresponding photospheres. It is widely accepted that the magnetic field is responsible for the heating, but there are two competing mechanisms th…
▽ More
Flare frequency distributions represent a key approach to addressing one of the largest problems in solar and stellar physics: determining the mechanism that counter-intuitively heats coronae to temperatures that are orders of magnitude hotter than the corresponding photospheres. It is widely accepted that the magnetic field is responsible for the heating, but there are two competing mechanisms that could explain it: nanoflares or Alfvén waves. To date, neither can be directly observed. Nanoflares are, by definition, extremely small, but their aggregate energy release could represent a substantial heating mechanism, presuming they are sufficiently abundant. One way to test this presumption is via the flare frequency distribution, which describes how often flares of various energies occur. If the slope of the power law fitting the flare frequency distribution is above a critical threshold, $α=2$ as established in prior literature, then there should be a sufficient abundance of nanoflares to explain coronal heating. We performed $>$600 case studies of solar flares, made possible by an unprecedented number of data analysts via three semesters of an undergraduate physics laboratory course. This allowed us to include two crucial, but nontrivial, analysis methods: pre-flare baseline subtraction and computation of the flare energy, which requires determining flare start and stop times. We aggregated the results of these analyses into a statistical study to determine that $α= 1.63 \pm 0.03$. This is below the critical threshold, suggesting that Alfvén waves are an important driver of coronal heating.
△ Less
Submitted 9 May, 2023;
originally announced May 2023.
-
Pose-Guided Sign Language Video GAN with Dynamic Lambda
Authors:
Christopher Kissel,
Christopher Kümmel,
Dennis Ritter,
Kristian Hildebrand
Abstract:
We propose a novel approach for the synthesis of sign language videos using GANs. We extend the previous work of Stoll et al. by using the human semantic parser of the Soft-Gated Warping-GAN from to produce photorealistic videos guided by region-level spatial layouts. Synthesizing target poses improves performance on independent and contrasting signers. Therefore, we have evaluated our system with…
▽ More
We propose a novel approach for the synthesis of sign language videos using GANs. We extend the previous work of Stoll et al. by using the human semantic parser of the Soft-Gated Warping-GAN from to produce photorealistic videos guided by region-level spatial layouts. Synthesizing target poses improves performance on independent and contrasting signers. Therefore, we have evaluated our system with the highly heterogeneous MS-ASL dataset with over 200 signers resulting in a SSIM of 0.893. Furthermore, we introduce a periodic weighting approach to the generator that reactivates the training and leads to quantitatively better results.
△ Less
Submitted 6 May, 2021;
originally announced May 2021.
-
Study of energy response and resolution of the ATLAS Tile Calorimeter to hadrons of energies from 16 to 30 GeV
Authors:
Jalal Abdallah,
Stylianos Angelidakis,
Giorgi Arabidze,
Nikolay Atanov,
Johannes Bernhard,
Romeo Bonnefoy,
Jonathan Bossio,
Ryan Bouabid,
Fernando Carrio,
Tomas Davidek,
Michal Dubovsky,
Luca Fiorini,
Francisco Brandan Garcia Aparisi,
Tancredi Carli,
Alexander Gerbershagen,
Hazal Goksu,
Haleh Hadavand,
Siarhei Harkusha,
Dingane Hlaluku,
Michael James Hibbard,
Kevin Hildebrand,
Juansher Jejelava,
Andrey Kamenshchikov,
Stergios Kazakos,
Tomas Kello
, et al. (46 additional authors not shown)
Abstract:
Three spare modules of the ATLAS Tile Calorimeter were exposed to test beams from the Super Proton Synchrotron accelerator at CERN in 2017. The measurements of the energy response and resolution of the detector to positive pions and kaons and protons with energy in the range 16 to 30 GeV are reported. The results have uncertainties of few percent. They were compared to the predictions of the Geant…
▽ More
Three spare modules of the ATLAS Tile Calorimeter were exposed to test beams from the Super Proton Synchrotron accelerator at CERN in 2017. The measurements of the energy response and resolution of the detector to positive pions and kaons and protons with energy in the range 16 to 30 GeV are reported. The results have uncertainties of few percent. They were compared to the predictions of the Geant4-based simulation program used in ATLAS to estimate the response of the detector to proton-proton events at Large Hadron Collider. The determinations obtained using experimental and simulated data agree within the uncertainties.
△ Less
Submitted 8 February, 2021;
originally announced February 2021.
-
Anonymization of labeled TOF-MRA images for brain vessel segmentation using generative adversarial networks
Authors:
Tabea Kossen,
Pooja Subramaniam,
Vince I. Madai,
Anja Hennemuth,
Kristian Hildebrand,
Adam Hilbert,
Jan Sobesky,
Michelle Livne,
Ivana Galinovic,
Ahmed A. Khalil,
Jochen B. Fiebach,
Dietmar Frey
Abstract:
Anonymization and data sharing are crucial for privacy protection and acquisition of large datasets for medical image analysis. This is a big challenge, especially for neuroimaging. Here, the brain's unique structure allows for re-identification and thus requires non-conventional anonymization. Generative adversarial networks (GANs) have the potential to provide anonymous images while preserving p…
▽ More
Anonymization and data sharing are crucial for privacy protection and acquisition of large datasets for medical image analysis. This is a big challenge, especially for neuroimaging. Here, the brain's unique structure allows for re-identification and thus requires non-conventional anonymization. Generative adversarial networks (GANs) have the potential to provide anonymous images while preserving predictive properties. Analyzing brain vessel segmentation, we trained 3 GANs on time-of-flight (TOF) magnetic resonance angiography (MRA) patches for image-label generation: 1) Deep convolutional GAN, 2) Wasserstein-GAN with gradient penalty (WGAN-GP) and 3) WGAN-GP with spectral normalization (WGAN-GP-SN). The generated image-labels from each GAN were used to train a U-net for segmentation and tested on real data. Moreover, we applied our synthetic patches using transfer learning on a second dataset. For an increasing number of up to 15 patients we evaluated the model performance on real data with and without pre-training. The performance for all models was assessed by the Dice Similarity Coefficient (DSC) and the 95th percentile of the Hausdorff Distance (95HD). Comparing the 3 GANs, the U-net trained on synthetic data generated by the WGAN-GP-SN showed the highest performance to predict vessels (DSC/95HD 0.82/28.97) benchmarked by the U-net trained on real data (0.89/26.61). The transfer learning approach showed superior performance for the same GAN compared to no pre-training, especially for one patient only (0.91/25.68 vs. 0.85/27.36). In this work, synthetic image-label pairs retained generalizable information and showed good performance for vessel segmentation. Besides, we showed that synthetic patches can be used in a transfer learning approach with independent data. This paves the way to overcome the challenges of scarce data and anonymization in medical imaging.
△ Less
Submitted 16 November, 2020; v1 submitted 9 September, 2020;
originally announced September 2020.
-
Transverse confinement of ultrasound through the Anderson transition in 3D mesoglasses
Authors:
L. A. Cobus,
W. K. Hildebrand,
S. E. Skipetrov,
B. A. van Tiggelen,
J. H. Page
Abstract:
We report an in-depth investigation of the Anderson localization transition for classical waves in three dimensions (3D). Experimentally, we observe clear signatures of Anderson localization by measuring the transverse confinement of transmitted ultrasound through slab-shaped mesoglass samples. We compare our experimental data with predictions of the self-consistent theory of Anderson localization…
▽ More
We report an in-depth investigation of the Anderson localization transition for classical waves in three dimensions (3D). Experimentally, we observe clear signatures of Anderson localization by measuring the transverse confinement of transmitted ultrasound through slab-shaped mesoglass samples. We compare our experimental data with predictions of the self-consistent theory of Anderson localization for an open medium with the same geometry as our samples. This model describes the transverse confinement of classical waves as a function of the localization (correlation) length, $ξ$ ($ζ$), and is fitted to our experimental data to quantify the transverse spreading/confinement of ultrasound all of the way through the transition between diffusion and localization. Hence we are able to precisely identify the location of the mobility edges at which the Anderson transitions occur.
△ Less
Submitted 16 October, 2018;
originally announced October 2018.
-
Observation of infinite-range intensity correlations above, at and below the 3D Anderson localization transition
Authors:
W. K. Hildebrand,
A. Strybulevych,
S. E. Skipetrov,
B. A. van Tiggelen,
J. H. Page
Abstract:
We investigate long-range intensity correlations on both sides of the Anderson transition of classical waves in a three-dimensional (3D) disordered material. Our ultrasonic experiments are designed to unambiguously detect a recently predicted infinite-range C0 contribution, due to local density of states fluctuations near the source. We find that these C0 correlations, in addition to C2 and C3 con…
▽ More
We investigate long-range intensity correlations on both sides of the Anderson transition of classical waves in a three-dimensional (3D) disordered material. Our ultrasonic experiments are designed to unambiguously detect a recently predicted infinite-range C0 contribution, due to local density of states fluctuations near the source. We find that these C0 correlations, in addition to C2 and C3 contributions, are significantly enhanced near mobility edges. Separate measurements of the inverse participation ratio reveal a link between C0 and the anomalous dimension Δ_2, implying that C0 may also be used to explore the critical regime of the Anderson transition.
△ Less
Submitted 22 November, 2013; v1 submitted 28 March, 2013;
originally announced March 2013.
-
Mesoscopic phase statistics of diffuse ultrasound in dynamic matter
Authors:
M. L. Cowan,
D. Anache-Ménier,
W. K. Hildebrand,
J. H. Page,
B. A. van Tiggelen
Abstract:
Temporal fluctuations in the phase of waves transmitted through a dynamic, strongly scattering, mesoscopic sample are investigated using ultrasonic waves, and compared with theoretical predictions based on circular Gaussian statistics. The fundamental role of phase in Diffusing Acoustic Wave Spectroscopy is revealed, and phase statistics are also shown to provide a sensitive and accurate way to…
▽ More
Temporal fluctuations in the phase of waves transmitted through a dynamic, strongly scattering, mesoscopic sample are investigated using ultrasonic waves, and compared with theoretical predictions based on circular Gaussian statistics. The fundamental role of phase in Diffusing Acoustic Wave Spectroscopy is revealed, and phase statistics are also shown to provide a sensitive and accurate way to probe scatterer motions at both short and long time scales.
△ Less
Submitted 1 June, 2007; v1 submitted 25 January, 2007;
originally announced January 2007.