
Showing 1–50 of 81 results for author: Bauer, M

Searching in archive cs.
  1. arXiv:2504.07004  [pdf, other]

    cs.PL

    Task-Based Tensor Computations on Modern GPUs

    Authors: Rohan Yadav, Michael Garland, Alex Aiken, Michael Bauer

    Abstract: Domain-specific, fixed-function units are becoming increasingly common in modern processors. As the computational demands of applications evolve, the capabilities and programming interfaces of these fixed-function units continue to change. NVIDIA's Hopper GPU architecture contains multiple fixed-function units per compute unit, including an asynchronous data movement unit (TMA) and an asynchronous…

    Submitted 9 April, 2025; originally announced April 2025.

  2. arXiv:2503.15288  [pdf, other]

    physics.space-ph cs.CV cs.LG

    Beacon2Science: Enhancing STEREO/HI beacon data with machine learning for efficient CME tracking

    Authors: Justin Le Louëdec, Maike Bauer, Tanja Amerstorfer, Jackie A. Davies

    Abstract: Observing and forecasting coronal mass ejections (CME) in real-time is crucial due to the strong geomagnetic storms they can generate that can have a potentially damaging effect, for example, on satellites and electrical devices. With its near-real-time availability, STEREO/HI beacon data is the perfect candidate for early forecasting of CMEs. However, previous work concluded that CME arrival pred…

    Submitted 19 March, 2025; originally announced March 2025.

    Comments: 24 pages, 11 figures, 1 table, submitted to AGU Space Weather on 14th March 2025

  3. arXiv:2502.03298  [pdf, other]

    cs.CL cs.AI cs.LG

    MeDiSumQA: Patient-Oriented Question-Answer Generation from Discharge Letters

    Authors: Amin Dada, Osman Alperen Koras, Marie Bauer, Amanda Butler, Kaleb E. Smith, Jens Kleesiek, Julian Friedrich

    Abstract: While increasing patients' access to medical documents improves medical care, this benefit is limited by varying health literacy levels and complex medical terminology. Large language models (LLMs) offer solutions by simplifying medical information. However, evaluating LLMs for safe and patient-friendly text generation is difficult due to the lack of standardized evaluation resources. To fill this…

    Submitted 5 February, 2025; originally announced February 2025.

  4. arXiv:2412.00505  [pdf, other]

    cs.CV eess.IV

    Good, Cheap, and Fast: Overfitted Image Compression with Wasserstein Distortion

    Authors: Jona Ballé, Luca Versari, Emilien Dupont, Hyunjik Kim, Matthias Bauer

    Abstract: Inspired by the success of generative image models, recent work on learned image compression increasingly focuses on better probabilistic models of the natural image distribution, leading to excellent image quality. This, however, comes at the expense of a computational complexity that is several orders of magnitude higher than today's commercial codecs, and thus prohibitive for most practical app…

    Submitted 23 March, 2025; v1 submitted 30 November, 2024; originally announced December 2024.

    Comments: 16 pages, 12 figures. Accepted for presentation at CVPR 2025

  5. arXiv:2411.03475  [pdf, other]

    cs.CV

    Self Supervised Networks for Learning Latent Space Representations of Human Body Scans and Motions

    Authors: Emmanuel Hartman, Nicolas Charon, Martin Bauer

    Abstract: This paper introduces self-supervised neural network models to tackle several fundamental problems in the field of 3D human body analysis and processing. First, we propose VariShaPE (Varifold Shape Parameter Estimator), a novel architecture for the retrieval of latent space representations of body shapes and poses. This network offers a fast and robust method to estimate the embedding of arbitrary…

    Submitted 5 November, 2024; originally announced November 2024.

    Comments: 23 pages, 11 figures, 6 tables

  6. arXiv:2410.15012  [pdf]

    eess.IV cs.AI cs.CV

    Pathologist-like explainable AI for interpretable Gleason grading in prostate cancer

    Authors: Gesa Mittmann, Sara Laiouar-Pedari, Hendrik A. Mehrtens, Sarah Haggenmüller, Tabea-Clara Bucher, Tirtha Chanda, Nadine T. Gaisa, Mathias Wagner, Gilbert Georg Klamminger, Tilman T. Rau, Christina Neppl, Eva Maria Compérat, Andreas Gocht, Monika Hämmerle, Niels J. Rupp, Jula Westhoff, Irene Krücken, Maximillian Seidl, Christian M. Schürch, Marcus Bauer, Wiebke Solass, Yu Chun Tam, Florian Weber, Rainer Grobholz, Jaroslaw Augustyniak, et al. (41 additional authors not shown)

    Abstract: The aggressiveness of prostate cancer, the most common cancer in men worldwide, is primarily assessed based on histopathological data using the Gleason scoring system. While artificial intelligence (AI) has shown promise in accurately predicting Gleason scores, these predictions often lack inherent explainability, potentially leading to distrust in human-machine interactions. To address this issue…

    Submitted 19 October, 2024; originally announced October 2024.

    Comments: 58 pages, 15 figures (incl. supplementary)

  7. arXiv:2408.08233  [pdf, other]

    math.MG cs.LG

    The Z-Gromov-Wasserstein Distance

    Authors: Martin Bauer, Facundo Mémoli, Tom Needham, Mao Nishino

    Abstract: The Gromov-Wasserstein (GW) distance is a powerful tool for comparing metric measure spaces which has found broad applications in data science and machine learning. Driven by the need to analyze datasets whose objects have increasingly complex structure (such as node and edge-attributed graphs), several variants of GW distance have been introduced in the recent literature. With a view toward estab…

    Submitted 6 January, 2025; v1 submitted 15 August, 2024; originally announced August 2024.

    Comments: V3: Improved exposition. V2: Added a new result on contractibility and fixed small errors

  8. arXiv:2407.07726  [pdf, other]

    cs.CV cs.AI cs.CL cs.LG

    PaliGemma: A versatile 3B VLM for transfer

    Authors: Lucas Beyer, Andreas Steiner, André Susano Pinto, Alexander Kolesnikov, Xiao Wang, Daniel Salz, Maxim Neumann, Ibrahim Alabdulmohsin, Michael Tschannen, Emanuele Bugliarello, Thomas Unterthiner, Daniel Keysers, Skanda Koppula, Fangyu Liu, Adam Grycner, Alexey Gritsenko, Neil Houlsby, Manoj Kumar, Keran Rong, Julian Eisenschlos, Rishabh Kabra, Matthias Bauer, Matko Bošnjak, Xi Chen, Matthias Minderer, et al. (10 additional authors not shown)

    Abstract: PaliGemma is an open Vision-Language Model (VLM) that is based on the SigLIP-So400m vision encoder and the Gemma-2B language model. It is trained to be a versatile and broadly knowledgeable base model that is effective to transfer. It achieves strong performance on a wide variety of open-world tasks. We evaluate PaliGemma on almost 40 diverse tasks including standard VLM benchmarks, but also more…

    Submitted 10 October, 2024; v1 submitted 10 July, 2024; originally announced July 2024.

    Comments: v2 adds Appendix H and I and a few citations

  9. arXiv:2406.18111  [pdf, other]

    cs.DC

    Automatic Tracing in Task-Based Runtime Systems

    Authors: Rohan Yadav, Michael Bauer, David Broman, Michael Garland, Alex Aiken, Fredrik Kjolstad

    Abstract: Implicitly parallel task-based runtime systems often perform dynamic analysis to discover dependencies in and extract parallelism from sequential programs. Dependence analysis becomes expensive as task granularity drops below a threshold. Tracing techniques have been developed where programmers annotate repeated program fragments (traces) issued by the application, and the runtime system memoizes…

    Submitted 16 December, 2024; v1 submitted 26 June, 2024; originally announced June 2024.

  10. arXiv:2406.18109  [pdf, other]

    cs.DC

    Composing Distributed Computations Through Task and Kernel Fusion

    Authors: Rohan Yadav, Shiv Sundram, Wonchan Lee, Michael Garland, Michael Bauer, Alex Aiken, Fredrik Kjolstad

    Abstract: We introduce Diffuse, a system that dynamically performs task and kernel fusion in distributed, task-based runtime systems. The key component of Diffuse is an intermediate representation of distributed computation that enables the necessary analyses for the fusion of distributed tasks to be performed in a scalable manner. We pair task fusion with a JIT compiler to fuse together the kernels within…

    Submitted 16 December, 2024; v1 submitted 26 June, 2024; originally announced June 2024.

  11. arXiv:2406.14774  [pdf, other]

    cs.LG cs.CL cs.CV

    Evaluating Numerical Reasoning in Text-to-Image Models

    Authors: Ivana Kajić, Olivia Wiles, Isabela Albuquerque, Matthias Bauer, Su Wang, Jordi Pont-Tuset, Aida Nematzadeh

    Abstract: Text-to-image generative models are capable of producing high-quality images that often faithfully depict concepts described using natural language. In this work, we comprehensively evaluate a range of text-to-image models on numerical reasoning tasks of varying difficulty, and show that even the most advanced models have only rudimentary numerical skills. Specifically, their ability to correctly…

    Submitted 6 February, 2025; v1 submitted 20 June, 2024; originally announced June 2024.

  12. arXiv:2404.05694  [pdf, other]

    cs.CL cs.AI cs.LG

    Comprehensive Study on German Language Models for Clinical and Biomedical Text Understanding

    Authors: Ahmad Idrissi-Yaghir, Amin Dada, Henning Schäfer, Kamyar Arzideh, Giulia Baldini, Jan Trienes, Max Hasin, Jeanette Bewersdorff, Cynthia S. Schmidt, Marie Bauer, Kaleb E. Smith, Jiang Bian, Yonghui Wu, Jörg Schlötterer, Torsten Zesch, Peter A. Horn, Christin Seifert, Felix Nensa, Jens Kleesiek, Christoph M. Friedrich

    Abstract: Recent advances in natural language processing (NLP) can be largely attributed to the advent of pre-trained language models such as BERT and RoBERTa. While these models demonstrate remarkable performance on general datasets, they can struggle in specialized domains such as medicine, where unique domain-specific terminologies, domain-specific abbreviations, and varying document structures are commo…

    Submitted 8 May, 2024; v1 submitted 8 April, 2024; originally announced April 2024.

    Comments: Accepted at LREC-COLING 2024

  13. arXiv:2404.04067  [pdf, other]

    cs.CL cs.AI cs.LG

    Does Biomedical Training Lead to Better Medical Performance?

    Authors: Amin Dada, Marie Bauer, Amanda Butler Contreras, Osman Alperen Koraş, Constantin Marc Seibold, Kaleb E Smith, Jens Kleesiek

    Abstract: Large Language Models (LLMs) are expected to significantly contribute to patient care, diagnostics, and administrative processes. Emerging biomedical LLMs aim to address healthcare-specific challenges, including privacy demands and computational constraints. Assessing the models' suitability for this sensitive application area is of the utmost importance. However, biomedical training has not been…

    Submitted 17 September, 2024; v1 submitted 5 April, 2024; originally announced April 2024.

  14. arXiv:2403.05530  [pdf, other]

    cs.CL cs.AI

    Gemini 1.5: Unlocking multimodal understanding across millions of tokens of context

    Authors: Gemini Team, Petko Georgiev, Ving Ian Lei, Ryan Burnell, Libin Bai, Anmol Gulati, Garrett Tanzer, Damien Vincent, Zhufeng Pan, Shibo Wang, Soroosh Mariooryad, Yifan Ding, Xinyang Geng, Fred Alcober, Roy Frostig, Mark Omernick, Lexi Walker, Cosmin Paduraru, Christina Sorokin, Andrea Tacchetti, Colin Gaffney, Samira Daruki, Olcan Sercinoglu, Zach Gleicher, Juliette Love, et al. (1112 additional authors not shown)

    Abstract: In this report, we introduce the Gemini 1.5 family of models, representing the next generation of highly compute-efficient multimodal models capable of recalling and reasoning over fine-grained information from millions of tokens of context, including multiple long documents and hours of video and audio. The family includes two new models: (1) an updated Gemini 1.5 Pro, which exceeds the February…

    Submitted 16 December, 2024; v1 submitted 8 March, 2024; originally announced March 2024.

  15. arXiv:2401.09865  [pdf, other]

    cs.CV cs.AI cs.LG

    Improving fine-grained understanding in image-text pre-training

    Authors: Ioana Bica, Anastasija Ilić, Matthias Bauer, Goker Erdogan, Matko Bošnjak, Christos Kaplanis, Alexey A. Gritsenko, Matthias Minderer, Charles Blundell, Razvan Pascanu, Jovana Mitrović

    Abstract: We introduce SPARse Fine-grained Contrastive Alignment (SPARC), a simple method for pretraining more fine-grained multimodal representations from image-text pairs. Given that multiple image patches often correspond to single words, we propose to learn a grouping of image patches for every token in the caption. To achieve this, we use a sparse similarity metric between image patches and language to…

    Submitted 18 January, 2024; originally announced January 2024.

    Comments: 26 pages

  16. arXiv:2401.06893  [pdf, other]

    eess.IV cs.CV

    Local Gamma Augmentation for Ischemic Stroke Lesion Segmentation on MRI

    Authors: Jon Middleton, Marko Bauer, Kaining Sheng, Jacob Johansen, Mathias Perslev, Silvia Ingala, Mads Nielsen, Akshay Pai

    Abstract: The identification and localisation of pathological tissues in medical images continues to command much attention among deep learning practitioners. When trained on abundant datasets, deep neural networks can match or exceed human performance. However, the scarcity of annotated data complicates the training of these models. Data augmentation techniques can compensate for a lack of training samples…

    Submitted 12 January, 2024; originally announced January 2024.

    Comments: Camera-ready version for Northern Lights Deep Learning Conference 2024, 7 pages, 2 figures

  17. Extension of the Dip-test Repertoire -- Efficient and Differentiable p-value Calculation for Clustering

    Authors: Lena G. M. Bauer, Collin Leiber, Christian Böhm, Claudia Plant

    Abstract: Over the last decade, the Dip-test of unimodality has gained increasing interest in the data mining community as it is a parameter-free statistical test that reliably rates the modality in one-dimensional samples. It returns a so-called Dip-value and a corresponding probability for the sample's unimodality (Dip-p-value). These two values share a sigmoidal relationship. However, the specific transf…

    Submitted 2 April, 2025; v1 submitted 19 December, 2023; originally announced December 2023.

    Journal ref: Proceedings of the 2023 SIAM International Conference on Data Mining (SDM) (pp. 109-117). Society for Industrial and Applied Mathematics

  18. arXiv:2312.02753  [pdf, other]

    eess.IV cs.CV cs.LG stat.ML

    C3: High-performance and low-complexity neural compression from a single image or video

    Authors: Hyunjik Kim, Matthias Bauer, Lucas Theis, Jonathan Richard Schwarz, Emilien Dupont

    Abstract: Most neural compression models are trained on large datasets of images or videos in order to generalize to unseen data. Such generalization typically requires large and expressive architectures with a high decoding complexity. Here we introduce C3, a neural compression method with strong rate-distortion (RD) performance that instead overfits a small model to each image or video separately. The res…

    Submitted 5 December, 2023; originally announced December 2023.

  19. arXiv:2311.04382  [pdf, other]

    cs.CV math.DG

    Basis restricted elastic shape analysis on the space of unregistered surfaces

    Authors: Emmanuel Hartman, Emery Pierson, Martin Bauer, Mohamed Daoudi, Nicolas Charon

    Abstract: This paper introduces a new mathematical and numerical framework for surface analysis derived from the general setting of elastic Riemannian metrics on shape spaces. Traditionally, those metrics are defined over the infinite dimensional manifold of immersed surfaces and satisfy specific invariance properties enabling the comparison of surfaces modulo shape preserving transformations such as repara…

    Submitted 7 November, 2023; originally announced November 2023.

    Comments: 18 pages, 10 figures, 8 tables

    MSC Class: I.4.0; I.5.1; I.4.9

  20. arXiv:2307.02988  [pdf, other]

    cs.NI eess.SP

    UAV Swarms for Joint Data Ferrying and Dynamic Cell Coverage via Optimal Transport Descent and Quadratic Assignment

    Authors: Kai Cui, Lars Baumgärtner, Burak Yilmaz, Mengguang Li, Christian Fabian, Benjamin Becker, Lin Xiang, Maximilian Bauer, Heinz Koeppl

    Abstract: Both data ferrying with disruption-tolerant networking (DTN) and mobile cellular base stations constitute important techniques for UAV-aided communication in situations of crises where standard communication infrastructure is unavailable. For optimal use of a limited number of UAVs, we propose providing both DTN and a cellular base station on each UAV. Here, DTN is used for large amounts of low-pr…

    Submitted 6 July, 2023; originally announced July 2023.

    Comments: Accepted to IEEE LCN 2023 as full paper, pre-final version

  21. arXiv:2302.03130  [pdf, other]

    cs.LG cs.CV

    Spatial Functa: Scaling Functa to ImageNet Classification and Generation

    Authors: Matthias Bauer, Emilien Dupont, Andy Brock, Dan Rosenbaum, Jonathan Richard Schwarz, Hyunjik Kim

    Abstract: Neural fields, also known as implicit neural representations, have emerged as a powerful means to represent complex signals of various modalities. Based on this, Dupont et al. (2022) introduce a framework that views neural fields as data, termed *functa*, and propose to do deep learning directly on this dataset of neural fields. In this work, we show that the proposed framework faces limitations w…

    Submitted 9 February, 2023; v1 submitted 6 February, 2023; originally announced February 2023.

  22. arXiv:2211.13185  [pdf, other]

    cs.CV

    BaRe-ESA: A Riemannian Framework for Unregistered Human Body Shapes

    Authors: Emmanuel Hartman, Emery Pierson, Martin Bauer, Nicolas Charon, Mohamed Daoudi

    Abstract: We present Basis Restricted Elastic Shape Analysis (BaRe-ESA), a novel Riemannian framework for human body scan representation, interpolation and extrapolation. BaRe-ESA operates directly on unregistered meshes, i.e., without the need to establish prior point to point correspondences or to assume a consistent mesh structure. Our method relies on a latent space representation, which is equipped wit…

    Submitted 21 August, 2023; v1 submitted 23 November, 2022; originally announced November 2022.

    Comments: 13 pages, 7 figures, 3 tables

    MSC Class: I.4.0; I.5.1; I.4.9

  23. arXiv:2208.02670  [pdf]

    stat.ML cs.LG

    Development and Validation of ML-DQA -- a Machine Learning Data Quality Assurance Framework for Healthcare

    Authors: Mark Sendak, Gaurav Sirdeshmukh, Timothy Ochoa, Hayley Premo, Linda Tang, Kira Niederhoffer, Sarah Reed, Kaivalya Deshpande, Emily Sterrett, Melissa Bauer, Laurie Snyder, Afreen Shariff, David Whellan, Jeffrey Riggio, David Gaieski, Kristin Corey, Megan Richards, Michael Gao, Marshall Nichols, Bradley Heintze, William Knechtle, William Ratliff, Suresh Balu

    Abstract: The approaches by which the machine learning and clinical research communities utilize real world data (RWD), including data captured in the electronic health record (EHR), vary dramatically. While clinical researchers cautiously use RWD for clinical investigations, ML for healthcare teams consume public datasets with minimal scrutiny to develop new algorithms. This study bridges this gap by devel…

    Submitted 4 August, 2022; originally announced August 2022.

    Comments: Presented at 2022 Machine Learning in Health Care Conference

  24. Elastic shape analysis of surfaces with second-order Sobolev metrics: a comprehensive numerical framework

    Authors: Emmanuel Hartman, Yashil Sukurdeep, Eric Klassen, Nicolas Charon, Martin Bauer

    Abstract: This paper introduces a set of numerical methods for Riemannian shape analysis of 3D surfaces within the setting of invariant (elastic) second-order Sobolev metrics. More specifically, we address the computation of geodesics and geodesic distances between parametrized or unparametrized immersed surfaces represented as 3D meshes. Building on this, we develop tools for the statistical shape analysis…

    Submitted 5 December, 2022; v1 submitted 8 April, 2022; originally announced April 2022.

    Comments: 28 pages, 16 figures, 2 tables

    MSC Class: 68U05; 49Q10; 58D10

  25. arXiv:2203.06122  [pdf, other]

    q-bio.NC cs.CV eess.IV

    Modeling the Shape of the Brain Connectome via Deep Neural Networks

    Authors: Haocheng Dai, Martin Bauer, P. Thomas Fletcher, Sarang Joshi

    Abstract: The goal of diffusion-weighted magnetic resonance imaging (DWI) is to infer the structural connectivity of an individual subject's brain in vivo. To statistically study the variability and differences between normal and abnormal brain connectomes, a mathematical model of the neural connections is required. In this paper, we represent the brain connectome as a Riemannian manifold, which allows us t…

    Submitted 3 March, 2023; v1 submitted 6 March, 2022; originally announced March 2022.

    Comments: 12 pages, 5 figures

  26. arXiv:2203.03304  [pdf, other]

    cs.LG stat.ML

    Regularising for invariance to data augmentation improves supervised learning

    Authors: Aleksander Botev, Matthias Bauer, Soham De

    Abstract: Data augmentation is used in machine learning to make the classifier invariant to label-preserving transformations. Usually this invariance is only encouraged implicitly by including a single augmented input during training. However, several works have recently shown that using multiple augmentations per input can improve generalisation or can be used to incorporate invariances more explicitly. In…

    Submitted 7 March, 2022; originally announced March 2022.

  27. arXiv:2109.09808  [pdf, other]

    cs.CV

    Integrated Construction of Multimodal Atlases with Structural Connectomes in the Space of Riemannian Metrics

    Authors: Kristen M. Campbell, Haocheng Dai, Zhe Su, Martin Bauer, P. Thomas Fletcher, Sarang C. Joshi

    Abstract: The structural network of the brain, or structural connectome, can be represented by fiber bundles generated by a variety of tractography methods. While such methods give qualitative insights into brain structure, there is controversy over whether they can provide quantitative information, especially at the population level. In order to enable population-level statistical analysis of the structura…

    Submitted 13 June, 2022; v1 submitted 20 September, 2021; originally announced September 2021.

    Comments: Accepted for publication at the Journal of Machine Learning for Biomedical Imaging (MELBA) https://www.melba-journal.org/papers/2022:016.html. arXiv admin note: substantial text overlap with arXiv:2103.05730

  28. arXiv:2106.14806  [pdf, other]

    cs.LG stat.ML

    Laplace Redux -- Effortless Bayesian Deep Learning

    Authors: Erik Daxberger, Agustinus Kristiadi, Alexander Immer, Runa Eschenhagen, Matthias Bauer, Philipp Hennig

    Abstract: Bayesian formulations of deep learning have been shown to have compelling theoretical properties and offer practical functional benefits, such as improved predictive uncertainty quantification and model selection. The Laplace approximation (LA) is a classic, and arguably the simplest family of approximations for the intractable posteriors of deep neural networks. Yet, despite its simplicity, the L…

    Submitted 14 March, 2022; v1 submitted 28 June, 2021; originally announced June 2021.

    Comments: NeurIPS 2021 camera-ready version; source code: https://github.com/AlexImmer/Laplace

  29. arXiv:2106.08372  [pdf, other]

    cs.RO cs.CV cs.LG eess.SP

    A Multi-Layered Approach for Measuring the Simulation-to-Reality Gap of Radar Perception for Autonomous Driving

    Authors: Anthony Ngo, Max Paul Bauer, Michael Resch

    Abstract: With the increasing safety validation requirements for the release of a self-driving car, alternative approaches, such as simulation-based testing, are emerging in addition to conventional real-world testing. In order to rely on virtual tests the employed sensor models have to be validated. For this reason, it is necessary to quantify the discrepancy between simulation and reality in order to dete…

    Submitted 20 June, 2021; v1 submitted 15 June, 2021; originally announced June 2021.

    Comments: Accepted at the 24th IEEE International Conference on Intelligent Transportation Systems (ITSC 2021)

  30. IoT Virtualization with ML-based Information Extraction

    Authors: Martin Bauer

    Abstract: For IoT to reach its full potential, the sharing and reuse of information in different applications and across verticals is of paramount importance. However, there are a plethora of IoT platforms using different representations, protocols and interaction patterns. To address this issue, the Fed4IoT project has developed an IoT virtualization platform that, on the one hand, integrates information f…

    Submitted 15 November, 2021; v1 submitted 10 June, 2021; originally announced June 2021.

    Journal ref: IEEE 7th World Forum on Internet of Things (WF-IoT), 2021, pp. 915-920

  31. Deep Evaluation Metric: Learning to Evaluate Simulated Radar Point Clouds for Virtual Testing of Autonomous Driving

    Authors: Anthony Ngo, Max Paul Bauer, Michael Resch

    Abstract: The usage of environment sensor models for virtual testing is a promising approach to reduce the testing effort of autonomous driving. However, in order to deduce any statements regarding the performance of an autonomous driving function based on simulation, the sensor model has to be validated to determine the discrepancy between the synthetic and real sensor data. Since a certain degree of diver…

    Submitted 21 June, 2021; v1 submitted 14 April, 2021; originally announced April 2021.

    Comments: 2021 IEEE Radar Conference (IEEE RadarConf 2021)

  32. arXiv:2104.04975  [pdf, other]

    stat.ML cs.LG

    Scalable Marginal Likelihood Estimation for Model Selection in Deep Learning

    Authors: Alexander Immer, Matthias Bauer, Vincent Fortuin, Gunnar Rätsch, Mohammad Emtiyaz Khan

    Abstract: Marginal-likelihood based model-selection, even though promising, is rarely used in deep learning due to estimation difficulties. Instead, most approaches rely on validation data, which may not be readily available. In this work, we present a scalable marginal-likelihood estimation method to select both hyperparameters and network architectures, based on the training data alone. Some hyperparamete…

    Submitted 15 June, 2021; v1 submitted 11 April, 2021; originally announced April 2021.

    Comments: ICML 2021

  33. arXiv:2104.01194  [pdf, other]

    cs.LG

    Physics Informed Convex Artificial Neural Networks (PICANNs) for Optimal Transport based Density Estimation

    Authors: Amanpreet Singh, Martin Bauer, Sarang Joshi

    Abstract: Optimal Mass Transport (OMT) is a well studied problem with a variety of applications in a diverse set of fields ranging from Physics to Computer Vision and in particular Statistics and Data Science. Since the original formulation of Monge in 1781 significant theoretical progress has been made on the existence, uniqueness and properties of the optimal transport maps. The actual numerical computation o…

    Submitted 22 October, 2021; v1 submitted 2 April, 2021; originally announced April 2021.

    Comments: 14 pages, 6 figures, 1 table

  34. arXiv:2103.05730  [pdf, other]

    cs.CV

    Structural Connectome Atlas Construction in the Space of Riemannian Metrics

    Authors: Kristen M. Campbell, Haocheng Dai, Zhe Su, Martin Bauer, P. Thomas Fletcher, Sarang C. Joshi

    Abstract: The structural connectome is often represented by fiber bundles generated from various types of tractography. We propose a method of analyzing connectomes by representing them as a Riemannian metric, thereby viewing them as points in an infinite-dimensional manifold. After equipping this space with a natural metric structure, the Ebin metric, we apply object-oriented statistical analysis to define…

    Submitted 9 March, 2021; originally announced March 2021.

    Comments: 12 pages, 3 figures

  35. arXiv:2101.11046  [pdf, other]

    stat.ML cs.LG

    Generalized Doubly Reparameterized Gradient Estimators

    Authors: Matthias Bauer, Andriy Mnih

    Abstract: Efficient low-variance gradient estimation enabled by the reparameterization trick (RT) has been essential to the success of variational autoencoders. Doubly-reparameterized gradients (DReGs) improve on the RT for multi-sample variational bounds by applying reparameterization a second time for an additional reduction in variance. Here, we develop two generalizations of the DReGs estimator and show…

    Submitted 13 July, 2021; v1 submitted 26 January, 2021; originally announced January 2021.

    Journal ref: 38th International Conference on Machine Learning (ICML 2021)

  36. Supervised deep learning of elastic SRV distances on the shape space of curves

    Authors: Emmanuel Hartman, Yashil Sukurdeep, Nicolas Charon, Eric Klassen, Martin Bauer

    Abstract: Motivated by applications from computer vision to bioinformatics, the field of shape analysis deals with problems where one wants to analyze geometric objects, such as curves, while ignoring actions that preserve their shape, such as translations, rotations, or reparametrizations. Mathematical tools have been developed to define notions of distances, averages, and optimal deformations for geometri…

    Submitted 18 April, 2021; v1 submitted 13 January, 2021; originally announced January 2021.

    Comments: 8 pages, 7 figures, 3 tables. Accepted to DiffCVML

    MSC Class: 68T10

    ACM Class: I.5.1

  37. arXiv:2012.06144  [pdf, other]

    physics.flu-dyn cs.MS cs.PF

    Highly Efficient Lattice-Boltzmann Multiphase Simulations of Immiscible Fluids at High-Density Ratios on CPUs and GPUs through Code Generation

    Authors: Markus Holzer, Martin Bauer, Ulrich Rüde

    Abstract: A high-performance implementation of a multiphase lattice Boltzmann method based on the conservative Allen-Cahn model supporting high-density ratios and high Reynolds numbers is presented. Metaprogramming techniques are used to generate optimized code for CPUs and GPUs automatically. The coupled model is specified in a high-level symbolic description and optimized through automatic transformations…

    Submitted 11 December, 2020; originally announced December 2020.

    Comments: 17 pages, 9 figures

  38. arXiv:2008.08400  [pdf, other]

    stat.ML cs.LG

    Improving predictions of Bayesian neural nets via local linearization

    Authors: Alexander Immer, Maciej Korzepa, Matthias Bauer

    Abstract: The generalized Gauss-Newton (GGN) approximation is often used to make practical Bayesian deep learning approaches scalable by replacing a second order derivative with a product of first order derivatives. In this paper we argue that the GGN approximation should be understood as a local linearization of the underlying Bayesian neural network (BNN), which turns the BNN into a generalized linear mod…

    Submitted 25 February, 2021; v1 submitted 19 August, 2020; originally announced August 2020.

    Comments: AISTATS 2021

  39. arXiv:2008.02725  [pdf, other]

    eess.SP cs.CV cs.RO

    A Sensitivity Analysis Approach for Evaluating a Radar Simulation for Virtual Testing of Autonomous Driving Functions

    Authors: Anthony Ngo, Max Paul Bauer, Michael Resch

    Abstract: Simulation-based testing is a promising approach to significantly reduce the validation effort of automated driving functions. Realistic models of environment perception sensors such as camera, radar and lidar play a key role in this testing strategy. A generally accepted method to validate these sensor models does not yet exist. Particularly radar has traditionally been one of the most difficult…

    Submitted 12 October, 2020; v1 submitted 6 August, 2020; originally announced August 2020.

    Comments: IEEE 2020 Asia-Pacific Conference on Intelligent Robot Systems (ACIRS 2020)

  40. A numerical framework for elastic surface matching, comparison, and interpolation

    Authors: Martin Bauer, Nicolas Charon, Philipp Harms, Hsi-Wei Hsieh

    Abstract: Surface comparison and matching is a challenging problem in computer vision. While reparametrization-invariant Sobolev metrics provide meaningful elastic distances and point correspondences via the geodesic boundary value problem, solving this problem numerically tends to be difficult. Square root normal fields (SRNF) considerably simplify the computation of certain elastic distances between param…

    Submitted 10 June, 2021; v1 submitted 20 June, 2020; originally announced June 2020.

    Comments: 21 pages, 11 figures, 1 table, 3 algorithms. Forthcoming in the International Journal of Computer Vision

    MSC Class: 68U05; 49Q10; 58D10

  41. A Standard-based Open Source IoT Platform: FIWARE

    Authors: Flavio Cirillo, Gürkan Solmaz, Everton Luís Berz, Martin Bauer, Bin Cheng, Ernoe Kovacs

    Abstract: The ever-increasing acceleration of technology evolution in all fields is rapidly changing the architectures of data-driven systems towards the Internet-of-Things concept. Many general and specific-purpose IoT platforms are already available. This article introduces the capabilities of the FIWARE framework that is transitioning from a research to a commercial level. We base our exposition on the a…

    Submitted 6 May, 2020; originally announced May 2020.

    Comments: 8 pages, 4 figures, 2 tables. Published as an IEEE IoT Magazine article

    Journal ref: IEEE Internet of Things Magazine, vol. 2, no. 3, pp. 12-18, September 2019

  42. arXiv:2001.11806  [pdf, other]

    cs.MS cs.CE cs.DC

    lbmpy: Automatic code generation for efficient parallel lattice Boltzmann methods

    Authors: Martin Bauer, Harald Köstler, Ulrich Rüde

    Abstract: Lattice Boltzmann methods are a popular mesoscopic alternative to macroscopic computational fluid dynamics solvers. Many variants have been developed that vary in complexity, accuracy, and computational cost. Extensions are available to simulate multi-phase, multi-component, turbulent, or non-Newtonian flows. In this work we present lbmpy, a code generation package that supports a wide variety of…

    Submitted 11 April, 2020; v1 submitted 31 January, 2020; originally announced January 2020.

  43. An inexact matching approach for the comparison of plane curves with general elastic metrics

    Authors: Yashil Sukurdeep, Martin Bauer, Nicolas Charon

    Abstract: This paper introduces a new mathematical formulation and numerical approach for the computation of distances and geodesics between immersed planar curves. Our approach combines the general simplifying transform for first-order elastic metrics that was recently introduced by Kurtek and Needham, together with a relaxation of the matching constraint using parametrization-invariant fidelity metrics. T…

    Submitted 9 January, 2020; originally announced January 2020.

    Comments: 5 pages, 5 figures

  44. arXiv:1909.13772  [pdf, other]

    cs.DC cs.CE physics.comp-ph

    waLBerla: A block-structured high-performance framework for multiphysics simulations

    Authors: Martin Bauer, Sebastian Eibl, Christian Godenschwager, Nils Kohl, Michael Kuron, Christoph Rettinger, Florian Schornbaum, Christoph Schwarzmeier, Dominik Thönnes, Harald Köstler, Ulrich Rüde

    Abstract: Programming current supercomputers efficiently is a challenging task. Multiple levels of parallelism on the core, on the compute node, and between nodes need to be exploited to make full use of the system. Heterogeneous hardware architectures with accelerators further complicate the development process. waLBerla addresses these challenges by providing the user with highly efficient building blocks…

    Submitted 30 September, 2019; originally announced September 2019.

  45. arXiv:1906.02004  [pdf, other]

    cs.LG stat.ML

    Interpretable and Differentially Private Predictions

    Authors: Frederik Harder, Matthias Bauer, Mijung Park

    Abstract: Interpretable predictions, where it is clear why a machine learning model has made a particular decision, can compromise privacy by revealing the characteristics of individual data points. This raises the central question addressed in this paper: Can models be interpretable without compromising privacy? For complex big data fit by correspondingly rich models, balancing privacy and explainability i…

    Submitted 5 April, 2020; v1 submitted 5 June, 2019; originally announced June 2019.

  46. arXiv:1810.11428  [pdf, other]

    stat.ML cs.LG

    Resampled Priors for Variational Autoencoders

    Authors: Matthias Bauer, Andriy Mnih

    Abstract: We propose Learned Accept/Reject Sampling (LARS), a method for constructing richer priors using rejection sampling with a learned acceptance function. This work is motivated by recent analyses of the VAE objective, which pointed out that commonly used simple priors can lead to underfitting. As the distribution induced by LARS involves an intractable normalizing constant, we show how to estimate it…

    Submitted 26 April, 2019; v1 submitted 26 October, 2018; originally announced October 2018.

    Journal ref: Proceedings of the 22nd International Conference on Artificial Intelligence and Statistics (AISTATS) 2019

  47. arXiv:1808.05563  [pdf, other]

    cs.LG stat.ML

    Learning Invariances using the Marginal Likelihood

    Authors: Mark van der Wilk, Matthias Bauer, ST John, James Hensman

    Abstract: Generalising well in supervised learning tasks relies on correctly extrapolating the training data to a large region of the input space. One way to achieve this is to constrain the predictions to be invariant to transformations on the input that are known to be irrelevant (e.g. translation). Commonly, this is done through data augmentation, where the training set is enlarged by applying hand-craft…

    Submitted 16 August, 2018; originally announced August 2018.

  48. Standards-Based Worldwide Semantic Interoperability for IoT

    Authors: Erno Kovacs, Martin Bauer, Jaeho Kim, Jaeseok Yun, Franck Le Gall, Mengxuan Zhao

    Abstract: Global IoT services (GIoTS) are combining locally available IoT resources with Cloud-based services. They are targeting world-wide services. GIoTS require interoperability between the locally installed heterogeneous IoT systems. Semantic processing is an important technology to enable data mediation as well as knowledge-based processing. This paper explains a system architecture for achieving worl…

    Submitted 1 August, 2018; originally announced August 2018.

    Comments: European Union (EU), FP7 FI-CORE, grant agreement No 632893, Horizon 2020 FIESTA grant agreement No 643943, EU-South Korea, Horizon 2020 Wise-IoT grant agreement 723156, South Korea, IITP, Korea government MSIP No.B0184-15-1003

    Journal ref: E. Kovacs, M. Bauer, J. Kim, J. Yun, F. Le Gall and M. Zhao, "Standards-Based Worldwide Semantic Interoperability for IoT," in IEEE Communications Magazine, vol. 54, no. 12, pp. 40-46, December 2016

  49. arXiv:1805.09921  [pdf, other]

    stat.ML cs.LG

    Meta-Learning Probabilistic Inference For Prediction

    Authors: Jonathan Gordon, John Bronskill, Matthias Bauer, Sebastian Nowozin, Richard E. Turner

    Abstract: This paper introduces a new framework for data efficient and versatile learning. Specifically: 1) We develop ML-PIP, a general framework for Meta-Learning approximate Probabilistic Inference for Prediction. ML-PIP extends existing probabilistic interpretations of meta-learning to cover a broad class of methods. 2) We introduce VERSA, an instance of the framework employing a flexible and versatile…

    Submitted 6 August, 2019; v1 submitted 24 May, 2018; originally announced May 2018.

    Comments: International Conference on Learning Representations (ICLR) 2019

    Journal ref: International Conference on Learning Representations (2019)

  50. arXiv:1805.05508  [pdf, other]

    cs.SE

    Task Interruption in Software Development Projects: What Makes some Interruptions More Disruptive than Others?

    Authors: Zahra Shakeri Hossein Abad, Oliver Karras, Kurt Schneider, Ken Barker, Mike Bauer

    Abstract: Multitasking has always been an inherent part of software development and is known as the primary source of interruptions due to task switching in software development teams. Developing software involves a mix of analytical and creative work, and requires a significant load on brain functions, such as working memory and decision making. Thus, task switching in the context of software development i…

    Submitted 14 May, 2018; originally announced May 2018.