Showing 1–50 of 58 results for author: Rudolph, M

Searching in archive cs.
  1. arXiv:2507.15673  [pdf, ps, other]

    cs.MM cs.NI

    Point Cloud Streaming with Latency-Driven Implicit Adaptation using MoQ

    Authors: Andrew Freeman, Michael Rudolph, Amr Rizk

    Abstract: Point clouds are a promising video representation for next-generation multimedia experiences in virtual and augmented reality. Point clouds are notoriously high-bitrate, however, which limits the feasibility of live streaming systems. Prior methods have adopted traditional HTTP-based protocols for point cloud streaming, but they rely on explicit client-side adaptation to maintain low latency under…

    Submitted 21 July, 2025; originally announced July 2025.

  2. arXiv:2506.20729  [pdf, ps, other]

    cs.LG astro-ph.CO cs.AI hep-ph hep-th

    Test-time Scaling Techniques in Theoretical Physics -- A Comparison of Methods on the TPBench Dataset

    Authors: Zhiqi Gao, Tianyi Li, Yurii Kvasiuk, Sai Chaitanya Tadepalli, Maja Rudolph, Daniel J. H. Chung, Frederic Sala, Moritz Münchmeyer

    Abstract: Large language models (LLMs) have shown strong capabilities in complex reasoning, and test-time scaling techniques can enhance their performance with comparably low cost. Many of these methods have been developed and evaluated on mathematical reasoning benchmarks such as AIME. This paper investigates whether the lessons learned from these benchmarks generalize to the domain of advanced theoretical…

    Submitted 25 June, 2025; originally announced June 2025.

    Comments: 23 pages, 6 figures

  3. arXiv:2502.15815  [pdf, other]

    cs.LG astro-ph.CO cs.AI hep-ph hep-th

    Theoretical Physics Benchmark (TPBench) -- a Dataset and Study of AI Reasoning Capabilities in Theoretical Physics

    Authors: Daniel J. H. Chung, Zhiqi Gao, Yurii Kvasiuk, Tianyi Li, Moritz Münchmeyer, Maja Rudolph, Frederic Sala, Sai Chaitanya Tadepalli

    Abstract: We introduce a benchmark to evaluate the capability of AI to solve problems in theoretical physics, focusing on high-energy theory and cosmology. The first iteration of our benchmark consists of 57 problems of varying difficulty, from undergraduate to research level. These problems are novel in the sense that they do not come from public problem collections. We evaluate our data set on various ope…

    Submitted 19 February, 2025; originally announced February 2025.

    Comments: 48 pages, 4 figures

  4. arXiv:2502.08938  [pdf, ps, other]

    cs.LG

    Reevaluating Policy Gradient Methods for Imperfect-Information Games

    Authors: Max Rudolph, Nathan Lichtle, Sobhan Mohammadpour, Alexandre Bayen, J. Zico Kolter, Amy Zhang, Gabriele Farina, Eugene Vinitsky, Samuel Sokota

    Abstract: In the past decade, motivated by the putative failure of naive self-play deep reinforcement learning (DRL) in adversarial imperfect-information games, researchers have developed numerous DRL algorithms based on fictitious play (FP), double oracle (DO), and counterfactual regret minimization (CFR). In light of recent results of the magnetic mirror descent algorithm, we hypothesize that simpler gene…

    Submitted 19 July, 2025; v1 submitted 12 February, 2025; originally announced February 2025.

  5. arXiv:2502.07889  [pdf, other]

    quant-ph cs.LG stat.ML

    A unifying account of warm start guarantees for patches of quantum landscapes

    Authors: Hela Mhiri, Ricard Puig, Sacha Lerch, Manuel S. Rudolph, Thiparat Chotibut, Supanut Thanasilp, Zoë Holmes

    Abstract: Barren plateaus are fundamentally a statement about quantum loss landscapes on average but there can, and generally will, exist patches of barren plateau landscapes with substantial gradients. Previous work has studied certain classes of parameterized quantum circuits and found example regions where gradients vanish at worst polynomially in system size. Here we present a general bound that unifies…

    Submitted 11 February, 2025; originally announced February 2025.

  6. arXiv:2412.05718  [pdf, other]

    cs.AI cs.GR cs.LG cs.RO

    RLZero: Direct Policy Inference from Language Without In-Domain Supervision

    Authors: Harshit Sikchi, Siddhant Agarwal, Pranaya Jajoo, Samyak Parajuli, Caleb Chuck, Max Rudolph, Peter Stone, Amy Zhang, Scott Niekum

    Abstract: The reward hypothesis states that all goals and purposes can be understood as the maximization of a received scalar reward signal. However, in practice, defining such a reward signal is notoriously difficult, as humans are often unable to predict the optimal behavior corresponding to a reward function. Natural language offers an intuitive alternative for instructing reinforcement learning (RL) age…

    Submitted 1 June, 2025; v1 submitted 7 December, 2024; originally announced December 2024.

    Comments: 26 pages

  7. arXiv:2411.19896  [pdf, other]

    quant-ph cs.LG stat.ML

    Efficient quantum-enhanced classical simulation for patches of quantum landscapes

    Authors: Sacha Lerch, Ricard Puig, Manuel S. Rudolph, Armando Angrisani, Tyson Jones, M. Cerezo, Supanut Thanasilp, Zoë Holmes

    Abstract: Understanding the capabilities of classical simulation methods is key to identifying where quantum computers are advantageous. Not only does this ensure that quantum computers are used only where necessary, but also one can potentially identify subroutines that can be offloaded onto a classical device. In this work, we show that it is always possible to generate a classical surrogate of a sub-regi…

    Submitted 29 November, 2024; originally announced November 2024.

    Comments: 10 + 47 pages, 4 figures

    Report number: LA-UR: LA-UR-24-3269

  8. arXiv:2411.16289  [pdf, other]

    cs.CV

    Utilizing Uncertainty in 2D Pose Detectors for Probabilistic 3D Human Mesh Recovery

    Authors: Tom Wehrbein, Marco Rudolph, Bodo Rosenhahn, Bastian Wandt

    Abstract: Monocular 3D human pose and shape estimation is an inherently ill-posed problem due to depth ambiguities, occlusions, and truncations. Recent probabilistic approaches learn a distribution over plausible 3D human meshes by maximizing the likelihood of the ground-truth pose given an image. We show that this objective function alone is not sufficient to best capture the full distributions. Instead, w…

    Submitted 25 November, 2024; originally announced November 2024.

    Comments: WACV 2025

  9. arXiv:2409.01706  [pdf, ps, other]

    quant-ph cs.CC math-ph

    Classically estimating observables of noiseless quantum circuits

    Authors: Armando Angrisani, Alexander Schmidhuber, Manuel S. Rudolph, M. Cerezo, Zoë Holmes, Hsin-Yuan Huang

    Abstract: We present a classical algorithm based on Pauli propagation for estimating expectation values of arbitrary observables on random unstructured quantum circuits across all circuit architectures and depths, including those with all-to-all connectivity. We prove that for any architecture where each circuit layer is randomly sampled from a distribution invariant under single-qubit rotations, our algori…

    Submitted 12 August, 2025; v1 submitted 3 September, 2024; originally announced September 2024.

    Comments: Main text: 9 pages, 2 figures. Appendices: 29 pages, 3 figures. Revised version with improved presentation and additional numerical experiments

    Report number: LA-UR-24-29028

  10. arXiv:2408.12739  [pdf, other]

    quant-ph cs.LG stat.ML

    Quantum Convolutional Neural Networks are (Effectively) Classically Simulable

    Authors: Pablo Bermejo, Paolo Braccia, Manuel S. Rudolph, Zoë Holmes, Lukasz Cincio, M. Cerezo

    Abstract: Quantum Convolutional Neural Networks (QCNNs) are widely regarded as a promising model for Quantum Machine Learning (QML). In this work we tie their heuristic success to two facts. First, that when randomly initialized, they can only operate on the information encoded in low-bodyness measurements of their input states. And second, that they are commonly benchmarked on "locally-easy" datasets whos…

    Submitted 22 August, 2024; originally announced August 2024.

    Comments: 11 + 13 pages, 6 + 3 figures, 1 table

    Report number: LA-UR-24-29027

  11. arXiv:2408.00599  [pdf, other]

    cs.CV cs.MM eess.IV

    Learned Compression of Point Cloud Geometry and Attributes in a Single Model through Multimodal Rate-Control

    Authors: Michael Rudolph, Aron Riemenschneider, Amr Rizk

    Abstract: Point cloud compression is essential to experience volumetric multimedia as it drastically reduces the required streaming data rates. Point attributes, specifically colors, extend the challenge of lossy compression beyond geometric representation to achieving joint reconstruction of texture and geometry. State-of-the-art methods separate geometry and attributes to compress them individually. This…

    Submitted 1 August, 2024; originally announced August 2024.

    Comments: 20 pages, 13 figures

  12. arXiv:2406.16308  [pdf, other]

    cs.LG cs.AI cs.CL

    Anomaly Detection of Tabular Data Using LLMs

    Authors: Aodong Li, Yunhan Zhao, Chen Qiu, Marius Kloft, Padhraic Smyth, Maja Rudolph, Stephan Mandt

    Abstract: Large language models (LLMs) have shown their potential in long-context understanding and mathematical reasoning. In this paper, we study the problem of using LLMs to detect tabular anomalies and show that pre-trained LLMs are zero-shot batch-level anomaly detectors. That is, without extra distribution-specific model fitting, they can discover hidden outliers in a batch of data, demonstrating thei…

    Submitted 24 June, 2024; originally announced June 2024.

    Comments: accepted at the Anomaly Detection with Foundation Models workshop

  13. arXiv:2405.13699  [pdf, other]

    cs.LG cs.AI

    Uncertainty-aware Evaluation of Auxiliary Anomalies with the Expected Anomaly Posterior

    Authors: Lorenzo Perini, Maja Rudolph, Sabrina Schmedding, Chen Qiu

    Abstract: Anomaly detection is the task of identifying examples that do not behave as expected. Because anomalies are rare and unexpected events, collecting real anomalous examples is often challenging in several applications. In addition, learning an anomaly detector with limited (or no) anomalies often yields poor prediction performance. One option is to employ auxiliary synthetic anomalies to improve the…

    Submitted 22 May, 2024; originally announced May 2024.

  14. arXiv:2405.03113  [pdf, other]

    cs.RO cs.AI

    Robot Air Hockey: A Manipulation Testbed for Robot Learning with Reinforcement Learning

    Authors: Caleb Chuck, Carl Qi, Michael J. Munje, Shuozhe Li, Max Rudolph, Chang Shi, Siddhant Agarwal, Harshit Sikchi, Abhinav Peri, Sarthak Dayal, Evan Kuo, Kavan Mehta, Anthony Wang, Peter Stone, Amy Zhang, Scott Niekum

    Abstract: Reinforcement Learning is a promising tool for learning complex policies even in fast-moving and object-interactive domains where human teleoperation or hard-coded policies might fail. To effectively reflect this challenging category of tasks, we introduce a dynamic, interactive RL testbed based on robot air hockey. By augmenting air hockey with a large family of tasks ranging from easy tasks like…

    Submitted 5 May, 2024; originally announced May 2024.

  15. arXiv:2404.06832  [pdf, other]

    cs.CV cs.LG

    SplatPose & Detect: Pose-Agnostic 3D Anomaly Detection

    Authors: Mathis Kruse, Marco Rudolph, Dominik Woiwode, Bodo Rosenhahn

    Abstract: Detecting anomalies in images has become a well-explored problem in both academia and industry. State-of-the-art algorithms are able to detect defects in increasingly difficult settings and data modalities. However, most current methods are not suited to address 3D objects captured from differing poses. While solutions using Neural Radiance Fields (NeRFs) have been proposed, they suffer from exces…

    Submitted 10 April, 2024; originally announced April 2024.

    Comments: Visual Anomaly and Novelty Detection 2.0 Workshop at CVPR 2024

  16. arXiv:2403.16369  [pdf, other]

    cs.LG cs.AI stat.ML

    Learning Action-based Representations Using Invariance

    Authors: Max Rudolph, Caleb Chuck, Kevin Black, Misha Lvovsky, Scott Niekum, Amy Zhang

    Abstract: Robust reinforcement learning agents using high-dimensional observations must be able to identify relevant state features amidst many exogenous distractors. A representation that captures controllability identifies these state elements by determining what affects agent control. While methods such as inverse dynamics and mutual information capture controllability for a limited number of timesteps,…

    Submitted 24 June, 2024; v1 submitted 24 March, 2024; originally announced March 2024.

    Comments: Published at the Reinforcement Learning Conference 2024

  17. arXiv:2403.00025  [pdf, ps, other]

    cs.LG cs.AI

    On the Challenges and Opportunities in Generative AI

    Authors: Laura Manduchi, Clara Meister, Kushagra Pandey, Robert Bamler, Ryan Cotterell, Sina Däubener, Sophie Fellenz, Asja Fischer, Thomas Gärtner, Matthias Kirchler, Marius Kloft, Yingzhen Li, Christoph Lippert, Gerard de Melo, Eric Nalisnick, Björn Ommer, Rajesh Ranganath, Maja Rudolph, Karen Ullrich, Guy Van den Broeck, Julia E Vogt, Yixin Wang, Florian Wenzel, Frank Wood, Stephan Mandt, et al. (1 additional author not shown)

    Abstract: The field of deep generative modeling has grown rapidly in the last few years. With the availability of massive amounts of training data coupled with advances in scalable unsupervised learning paradigms, recent large-scale generative models show tremendous promise in synthesizing high-resolution images and text, as well as structured data such as videos and molecules. However, we argue that curren…

    Submitted 22 August, 2025; v1 submitted 28 February, 2024; originally announced March 2024.

  18. arXiv:2402.07211  [pdf, other]

    cs.LG stat.ML

    Towards Fast Stochastic Sampling in Diffusion Generative Models

    Authors: Kushagra Pandey, Maja Rudolph, Stephan Mandt

    Abstract: Diffusion models suffer from slow sample generation at inference time. Despite recent efforts, improving the sampling efficiency of stochastic samplers for diffusion models remains a promising direction. We propose Splitting Integrators for fast stochastic sampling in pre-trained diffusion models in augmented spaces. Commonly used in molecular dynamics, splitting-based integrators attempt to impro…

    Submitted 13 February, 2024; v1 submitted 11 February, 2024; originally announced February 2024.

    Comments: Accepted in the NeurIPS'23 Workshop on Diffusion Models. Full version of this work can be found at arXiv:2310.07894

  19. arXiv:2401.13127  [pdf, other]

    cs.RO cs.MA

    Generalization of Heterogeneous Multi-Robot Policies via Awareness and Communication of Capabilities

    Authors: Pierce Howell, Max Rudolph, Reza Torbati, Kevin Fu, Harish Ravichandar

    Abstract: Recent advances in multi-agent reinforcement learning (MARL) are enabling impressive coordination in heterogeneous multi-robot teams. However, existing approaches often overlook the challenge of generalizing learned policies to teams of new compositions, sizes, and robots. While such generalization might not be important in teams of virtual agents that can retrain policies on-demand, it is pivotal…

    Submitted 23 January, 2024; originally announced January 2024.

    Comments: Presented at the 7th Conference on Robot Learning (CoRL 2023), Atlanta, USA

  20. arXiv:2401.00033  [pdf, other]

    cs.AI cs.LG

    Hybrid Modeling Design Patterns

    Authors: Maja Rudolph, Stefan Kurz, Barbara Rakitsch

    Abstract: Design patterns provide a systematic way to convey solutions to recurring modeling challenges. This paper introduces design patterns for hybrid modeling, an approach that combines modeling based on first principles with data-driven modeling techniques. While both approaches have complementary advantages there are often multiple ways to combine them into a hybrid model, and the appropriate solution…

    Submitted 29 December, 2023; originally announced January 2024.

  21. arXiv:2312.13839  [pdf, other]

    cs.CV cs.LG

    Q-SENN: Quantized Self-Explaining Neural Networks

    Authors: Thomas Norrenbrock, Marco Rudolph, Bodo Rosenhahn

    Abstract: Explanations in Computer Vision are often desired, but most Deep Neural Networks can only provide saliency maps with questionable faithfulness. Self-Explaining Neural Networks (SENN) extract interpretable concepts with fidelity, diversity, and grounding to combine them linearly for decision-making. While they can explain what was recognized, initial realizations lack accuracy and general applicabi…

    Submitted 16 February, 2024; v1 submitted 21 December, 2023; originally announced December 2023.

    Comments: Accepted to AAAI 2024, SRRAI

  22. arXiv:2312.09121  [pdf, ps, other]

    quant-ph cs.LG stat.ML

    Does provable absence of barren plateaus imply classical simulability?

    Authors: M. Cerezo, Martin Larocca, Diego García-Martín, N. L. Diaz, Paolo Braccia, Enrico Fontana, Manuel S. Rudolph, Pablo Bermejo, Aroosa Ijaz, Supanut Thanasilp, Eric R. Anschuetz, Zoë Holmes

    Abstract: A large amount of effort has recently been put into understanding the barren plateau phenomenon. In this perspective article, we face the increasingly loud elephant in the room and ask a question that has been hinted at by many but not explicitly addressed: Can the structure that allows one to avoid barren plateaus also be leveraged to efficiently simulate the loss classically? We collect evidence…

    Submitted 25 August, 2025; v1 submitted 14 December, 2023; originally announced December 2023.

    Comments: 15+22 pages, 5+2 figures, 2 tables, updated to published version

    Report number: LA-UR-23-33705

    Journal ref: Nature Communications 16, 7907 (2025)

  23. arXiv:2311.04765  [pdf, other]

    cs.RO cs.AI cs.LG

    The voraus-AD Dataset for Anomaly Detection in Robot Applications

    Authors: Jan Thieß Brockmann, Marco Rudolph, Bodo Rosenhahn, Bastian Wandt

    Abstract: During the operation of industrial robots, unusual events may endanger the safety of humans and the quality of production. When collecting data to detect such cases, it is not ensured that data from all potentially occurring errors is included as unforeseeable events may happen over time. Therefore, anomaly detection (AD) delivers a practical solution, using only normal data to learn to detect unu…

    Submitted 8 November, 2023; originally announced November 2023.

    Comments: 14 pages, 14 figures, accepted to Transactions on Robotics

  24. arXiv:2310.10461  [pdf, other]

    cs.LG cs.CV

    Model Selection of Anomaly Detectors in the Absence of Labeled Validation Data

    Authors: Clement Fung, Chen Qiu, Aodong Li, Maja Rudolph

    Abstract: Anomaly detection is the task of identifying abnormal samples in large unlabeled datasets. While the advent of foundation models has produced powerful zero-shot anomaly detection methods, their deployment in practice is often hindered by the absence of labeled validation data -- without it, their detection performance cannot be evaluated reliably. In this work, we propose SWSA (Selection With Synt…

    Submitted 16 September, 2024; v1 submitted 16 October, 2023; originally announced October 2023.

    Comments: 14 pages

  25. arXiv:2310.07894  [pdf, other]

    cs.LG cs.AI cs.CV stat.ML

    Efficient Integrators for Diffusion Generative Models

    Authors: Kushagra Pandey, Maja Rudolph, Stephan Mandt

    Abstract: Diffusion models suffer from slow sample generation at inference time. Therefore, developing a principled framework for fast deterministic/stochastic sampling for a broader class of diffusion models is a promising direction. We propose two complementary frameworks for accelerating sample generation in pre-trained models: Conjugate Integrators and Splitting Integrators. Conjugate integrators genera…

    Submitted 11 October, 2023; originally announced October 2023.

  26. arXiv:2310.00035  [pdf, other]

    cs.LG cs.AI

    LoRA ensembles for large language model fine-tuning

    Authors: Xi Wang, Laurence Aitchison, Maja Rudolph

    Abstract: Finetuned LLMs often exhibit poor uncertainty quantification, manifesting as overconfidence, poor calibration, and unreliable prediction results on test data or out-of-distribution samples. One approach commonly used in vision for alleviating this issue is a deep ensemble, which constructs an ensemble by training the same model multiple times using different random initializations. However, there…

    Submitted 4 October, 2023; v1 submitted 29 September, 2023; originally announced October 2023.

    Comments: Update the title in the PDF file

  27. Range Limited Coverage Control using Air-Ground Multi-Robot Teams

    Authors: Max Rudolph, Sean Wilson, Magnus Egerstedt

    Abstract: In this paper, we investigate how heterogeneous multi-robot systems with different sensing capabilities can observe a domain with an a priori unknown density function. Common coverage control techniques are targeted towards homogeneous teams of robots and do not consider what happens when the sensing capabilities of the robots are vastly different. This work proposes an extension to Lloyd's algorit…

    Submitted 12 June, 2023; originally announced June 2023.

    Comments: Published at 2021 IEEE International Conference on Robotics and Automation (ICRA)

  28. arXiv:2305.02881  [pdf, other]

    quant-ph cs.LG hep-ex stat.ML

    Trainability barriers and opportunities in quantum generative modeling

    Authors: Manuel S. Rudolph, Sacha Lerch, Supanut Thanasilp, Oriel Kiss, Sofia Vallecorsa, Michele Grossi, Zoë Holmes

    Abstract: Quantum generative models, in providing inherently efficient sampling strategies, show promise for achieving a near-term advantage on quantum hardware. Nonetheless, important questions remain regarding their scalability. In this work, we investigate the barriers to the trainability of quantum generative models posed by barren plateaus and exponential loss concentration. We explore the interplay be…

    Submitted 4 May, 2023; originally announced May 2023.

    Comments: 20+32 pages, 9+2 figures

  29. arXiv:2303.13166  [pdf, other]

    cs.CV cs.LG

    Take 5: Interpretable Image Classification with a Handful of Features

    Authors: Thomas Norrenbrock, Marco Rudolph, Bodo Rosenhahn

    Abstract: Deep Neural Networks use thousands of mostly incomprehensible features to identify a single class, a decision no human can follow. We propose an interpretable sparse and low dimensional final decision layer in a deep neural network with measurable aspects of interpretability and demonstrate it on fine-grained image classification. We argue that a human can only understand the decision of a machine…

    Submitted 5 August, 2023; v1 submitted 23 March, 2023; originally announced March 2023.

    ACM Class: I.2.4

    Journal ref: Progress and Challenges in Building Trustworthy Embodied AI @NeurIPS, December 2022

  30. arXiv:2303.12834  [pdf, other]

    quant-ph cs.AI cs.LG stat.ML

    The power and limitations of learning quantum dynamics incoherently

    Authors: Sofiene Jerbi, Joe Gibbs, Manuel S. Rudolph, Matthias C. Caro, Patrick J. Coles, Hsin-Yuan Huang, Zoë Holmes

    Abstract: Quantum process learning is emerging as an important tool to study quantum systems. While studied extensively in coherent frameworks, where the target and model system can share quantum information, less attention has been paid to whether the dynamics of quantum systems can be learned without the system and target directly interacting. Such incoherent frameworks are practically appealing since the…

    Submitted 22 March, 2023; originally announced March 2023.

    Comments: 6+9 pages, 7 figures

    Report number: LA-UR-23-22871

  31. arXiv:2303.05904  [pdf, ps, other]

    cs.LG

    Deep Anomaly Detection on Tennessee Eastman Process Data

    Authors: Fabian Hartung, Billy Joe Franks, Tobias Michels, Dennis Wagner, Philipp Liznerski, Steffen Reithermann, Sophie Fellenz, Fabian Jirasek, Maja Rudolph, Daniel Neider, Heike Leitte, Chen Song, Benjamin Kloepper, Stephan Mandt, Michael Bortz, Jakob Burger, Hans Hasse, Marius Kloft

    Abstract: This paper provides the first comprehensive evaluation and analysis of modern (deep-learning) unsupervised anomaly detection methods for chemical process data. We focus on the Tennessee Eastman process dataset, which has been a standard litmus test to benchmark anomaly detection methods for nearly three decades. Our extensive study will facilitate choosing appropriate anomaly detection methods in…

    Submitted 10 March, 2023; originally announced March 2023.

  32. arXiv:2302.07849  [pdf, other]

    cs.LG cs.AI stat.ML

    Zero-Shot Anomaly Detection via Batch Normalization

    Authors: Aodong Li, Chen Qiu, Marius Kloft, Padhraic Smyth, Maja Rudolph, Stephan Mandt

    Abstract: Anomaly detection (AD) plays a crucial role in many safety-critical application domains. The challenge of adapting an anomaly detector to drift in the normal data distribution, especially when no training data is available for the "new normal," has led to the development of zero-shot AD techniques. In this paper, we propose a simple yet effective method called Adaptive Centered Representations (AC…

    Submitted 7 November, 2023; v1 submitted 15 February, 2023; originally announced February 2023.

    Comments: accepted at NeurIPS 2023

  33. arXiv:2302.07832  [pdf, other]

    cs.LG cs.AI

    Deep Anomaly Detection under Labeling Budget Constraints

    Authors: Aodong Li, Chen Qiu, Marius Kloft, Padhraic Smyth, Stephan Mandt, Maja Rudolph

    Abstract: Selecting informative data points for expert feedback can significantly improve the performance of anomaly detection (AD) in various contexts, such as medical diagnostics or fraud detection. In this paper, we determine a set of theoretical conditions under which anomaly scores generalize from labeled queries to unlabeled data. Motivated by these results, we propose a data labeling strategy with op…

    Submitted 4 July, 2023; v1 submitted 15 February, 2023; originally announced February 2023.

    Comments: ICML 2023

  34. arXiv:2212.02085  [pdf, other]

    cs.RO

    RGB-L: Enhancing Indirect Visual SLAM using LiDAR-based Dense Depth Maps

    Authors: Florian Sauerbeck, Benjamin Obermeier, Martin Rudolph, Johannes Betz

    Abstract: In this paper, we present a novel method for integrating 3D LiDAR depth measurements into the existing ORB-SLAM3 by building upon the RGB-D mode. We propose and compare two methods of depth map generation: conventional computer vision methods, namely an inverse dilation operation, and a supervised deep learning-based approach. We integrate the former directly into the ORB-SLAM3 framework by adding…

    Submitted 6 December, 2022; v1 submitted 5 December, 2022; originally announced December 2022.

    Comments: Accepted at ICCCR 2023

  35. arXiv:2211.11607  [pdf]

    cs.CV cs.LG

    Semantic Segmentation for Fully Automated Macrofouling Analysis on Coatings after Field Exposure

    Authors: Lutz M. K. Krause, Emily Manderfeld, Patricia Gnutt, Louisa Vogler, Ann Wassick, Kailey Richard, Marco Rudolph, Kelli Z. Hunsucker, Geoffrey W. Swain, Bodo Rosenhahn, Axel Rosenhahn

    Abstract: Biofouling is a major challenge for sustainable shipping, filter membranes, heat exchangers, and medical devices. The development of fouling-resistant coatings requires the evaluation of their effectiveness. Such an evaluation is usually based on the assessment of fouling progression after different exposure times to the target medium (e.g., salt water). The manual assessment of macrofouling requi…

    Submitted 21 November, 2022; originally announced November 2022.

    Comments: 33 pages, 10 figures

  36. arXiv:2210.07829  [pdf, other]

    cs.LG cs.AI cs.CV

    Asymmetric Student-Teacher Networks for Industrial Anomaly Detection

    Authors: Marco Rudolph, Tom Wehrbein, Bodo Rosenhahn, Bastian Wandt

    Abstract: Industrial defect detection is commonly addressed with anomaly detection (AD) methods where no or only incomplete data of potentially occurring defects is available. This work discovers previously unknown problems of student-teacher approaches for AD and proposes a solution, where two neural networks are trained to produce the same output for the defect-free training examples. The core assumption…

    Submitted 18 October, 2022; v1 submitted 14 October, 2022; originally announced October 2022.

    Comments: accepted to WACV 2023

  37. arXiv:2207.10821  [pdf, other]

    cs.RO

    Rethinking Sim2Real: Lower Fidelity Simulation Leads to Higher Sim2Real Transfer in Navigation

    Authors: Joanne Truong, Max Rudolph, Naoki Yokoyama, Sonia Chernova, Dhruv Batra, Akshara Rai

    Abstract: If we want to train robots in simulation before deploying them in reality, it seems natural and almost self-evident to presume that reducing the sim2real gap involves creating simulators of increasing fidelity (since reality is what it is). We challenge this assumption and present a contrary hypothesis -- sim2real transfer of robots may be improved with lower (not higher) fidelity simulation. We c…

    Submitted 25 November, 2022; v1 submitted 21 July, 2022; originally announced July 2022.

  38. arXiv:2205.13845  [pdf, other]

    cs.LG cs.AI

    Raising the Bar in Graph-level Anomaly Detection

    Authors: Chen Qiu, Marius Kloft, Stephan Mandt, Maja Rudolph

    Abstract: Graph-level anomaly detection has become a critical topic in diverse areas, such as financial fraud detection and detecting anomalous activities in social networks. While most research has focused on anomaly detection for visual data such as images, where high detection accuracies have been obtained, existing deep learning approaches for graphs currently show considerably worse performance. This p…

    Submitted 27 May, 2022; originally announced May 2022.

    Comments: To appear in IJCAI-ECAI 2022

    Journal ref: Proceedings of the Thirty-First International Joint Conference on Artificial Intelligence (IJCAI-22), 2022

  39. arXiv:2204.02075  [pdf, other]

    cs.LG cs.AI cs.CV

    Complex-Valued Autoencoders for Object Discovery

    Authors: Sindy Löwe, Phillip Lippe, Maja Rudolph, Max Welling

    Abstract: Object-centric representations form the basis of human perception, and enable us to reason about the world and to systematically generalize to new settings. Currently, most works on unsupervised object discovery focus on slot-based approaches, which explicitly separate the latent representations of individual objects. While the result is easily interpretable, it usually requires the design of invo…

    Submitted 18 November, 2022; v1 submitted 5 April, 2022; originally announced April 2022.

    Comments: Published in Transactions on Machine Learning Research (TMLR)

  40. arXiv:2203.04206  [pdf, other]

    cs.CV cs.RO

    Lightweight Monocular Depth Estimation through Guided Decoding

    Authors: Michael Rudolph, Youssef Dawoud, Ronja Güldenring, Lazaros Nalpantidis, Vasileios Belagiannis

    Abstract: We present a lightweight encoder-decoder architecture for monocular depth estimation, specifically designed for embedded platforms. Our main contribution is the Guided Upsampling Block (GUB) for building the decoder of our model. Motivated by the concept of guided image filtering, GUB relies on the image to guide the decoder on upsampling the feature representation and the depth map reconstruction…

    Submitted 8 March, 2022; originally announced March 2022.

    Comments: Accepted to ICRA 2022

  41. arXiv:2202.08088  [pdf, other]

    cs.LG cs.AI

    Latent Outlier Exposure for Anomaly Detection with Contaminated Data

    Authors: Chen Qiu, Aodong Li, Marius Kloft, Maja Rudolph, Stephan Mandt

    Abstract: Anomaly detection aims at identifying data points that show systematic deviations from the majority of data in an unlabeled dataset. A common assumption is that clean training data (free of anomalies) is available, which is often violated in practice. We propose a strategy for training an anomaly detector in the presence of unlabeled anomalies that is compatible with a broad class of models. The i…

    Submitted 26 June, 2022; v1 submitted 16 February, 2022; originally announced February 2022.

    Comments: To appear in ICML 2022

    Journal ref: Proceedings of the 39th International Conference on Machine Learning, 2022, volume:162, pages:18153--18167

  42. arXiv:2202.03944  [pdf, other]

    cs.LG cs.AI

    Detecting Anomalies within Time Series using Local Neural Transformations

    Authors: Tim Schneider, Chen Qiu, Marius Kloft, Decky Aspandi Latif, Steffen Staab, Stephan Mandt, Maja Rudolph

    Abstract: We develop a new method to detect anomalies within time series, which is essential in many application domains, reaching from self-driving cars, finance, and marketing to medical diagnosis and epidemiology. The method is based on self-supervised deep learning that has played a key role in facilitating deep anomaly detection on images, where powerful image transformations are available. However, su…

    Submitted 20 February, 2022; v1 submitted 8 February, 2022; originally announced February 2022.

  43. arXiv:2111.11344  [pdf, other]

    cs.LG stat.ML

    Modeling Irregular Time Series with Continuous Recurrent Units

    Authors: Mona Schirmer, Mazin Eltayeb, Stefan Lessmann, Maja Rudolph

    Abstract: Recurrent neural networks (RNNs) are a popular choice for modeling sequential data. Modern RNN architectures assume constant time-intervals between observations. However, in many datasets (e.g. medical records) observation times are irregular and can carry important information. To address this challenge, we propose continuous recurrent units (CRUs) -- a neural architecture that can naturally hand…

    Submitted 26 July, 2022; v1 submitted 22 November, 2021; originally announced November 2021.

    Comments: Accepted at ICML 2022, Baltimore, Maryland

  44. arXiv:2111.08291  [pdf, other]

    cs.LG cs.AI eess.SP

    Switching Recurrent Kalman Networks

    Authors: Giao Nguyen-Quynh, Philipp Becker, Chen Qiu, Maja Rudolph, Gerhard Neumann

    Abstract: Forecasting driving behavior or other sensor measurements is an essential component of autonomous driving systems. Often real-world multivariate time series data is hard to model because the underlying dynamics are nonlinear and the observations are noisy. In addition, driving data can often be multimodal in distribution, meaning that there are distinct predictions that are likely, but averaging c…

    Submitted 16 November, 2021; originally announced November 2021.

    Comments: Machine Learning for Autonomous Driving Workshop at the 35th Conference on Neural Information Processing Systems (NeurIPS 2021), Sydney, Australia

  45. arXiv:2110.02855  [pdf, other]

    cs.CV

    Fully Convolutional Cross-Scale-Flows for Image-based Defect Detection

    Authors: Marco Rudolph, Tom Wehrbein, Bodo Rosenhahn, Bastian Wandt

    Abstract: In industrial manufacturing processes, errors frequently occur at unpredictable times and in unknown manifestations. We tackle the problem of automatic defect detection without requiring any image samples of defective parts. Recent works model the distribution of defect-free image data, using either strong statistical priors or overly simplified data representations. In contrast, our approach hand…

    Submitted 6 October, 2021; originally announced October 2021.

  46. arXiv:2108.00346  [pdf, other]

    cs.RO cs.MA

    Desperate Times Call for Desperate Measures: Towards Risk-Adaptive Task Allocation

    Authors: Max Rudolph, Sonia Chernova, Harish Ravichandar

    Abstract: Multi-robot task allocation (MRTA) problems involve optimizing the allocation of robots to tasks. MRTA problems are known to be challenging when tasks require multiple robots and the team is composed of heterogeneous robots. These challenges are further exacerbated when we need to account for uncertainties encountered in the real-world. In this work, we address coalition formation in heterogeneous…

    Submitted 7 August, 2021; v1 submitted 31 July, 2021; originally announced August 2021.

    Comments: Accepted for IROS 2021

  47. arXiv:2107.13788  [pdf, other]

    cs.CV

    Probabilistic Monocular 3D Human Pose Estimation with Normalizing Flows

    Authors: Tom Wehrbein, Marco Rudolph, Bodo Rosenhahn, Bastian Wandt

    Abstract: 3D human pose estimation from monocular images is a highly ill-posed problem due to depth ambiguities and occlusions. Nonetheless, most existing works ignore these ambiguities and only estimate a single solution. In contrast, we generate a diverse set of hypotheses that represents the full posterior distribution of feasible 3D poses. To this end, we propose a normalizing flow based method that exp…

    Submitted 2 August, 2021; v1 submitted 29 July, 2021; originally announced July 2021.

    Comments: Accepted to ICCV 2021

  48. arXiv:2103.16440  [pdf, other]

    cs.LG cs.AI

    Neural Transformation Learning for Deep Anomaly Detection Beyond Images

    Authors: Chen Qiu, Timo Pfrommer, Marius Kloft, Stephan Mandt, Maja Rudolph

    Abstract: Data transformations (e.g. rotations, reflections, and cropping) play an important role in self-supervised learning. Typically, images are transformed into different views, and neural networks trained on tasks involving these views produce useful feature representations for downstream tasks, including anomaly detection. However, for anomaly detection beyond image data, it is often unclear which tr…

    Submitted 3 February, 2022; v1 submitted 30 March, 2021; originally announced March 2021.

    Journal ref: Proceedings of the 38th International Conference on Machine Learning, 2021, volume:139, pages:8703--8714

  49. arXiv:2011.14679  [pdf, other]

    cs.CV

    CanonPose: Self-Supervised Monocular 3D Human Pose Estimation in the Wild

    Authors: Bastian Wandt, Marco Rudolph, Petrissa Zell, Helge Rhodin, Bodo Rosenhahn

    Abstract: Human pose estimation from single images is a challenging problem in computer vision that requires large amounts of labeled training data to be solved accurately. Unfortunately, for many human activities (e.g. outdoor sports) such training data does not exist and is hard or even impossible to acquire with traditional motion capture systems. We propose a self-supervised approach that learns a single…

    Submitted 30 November, 2020; originally announced November 2020.

  50. arXiv:2010.10403  [pdf, other]

    cs.LG

    Variational Dynamic Mixtures

    Authors: Chen Qiu, Stephan Mandt, Maja Rudolph

    Abstract: Deep probabilistic time series forecasting models have become an integral part of machine learning. While several powerful generative models have been proposed, we provide evidence that their associated inference models are oftentimes too limited and cause the generative model to predict mode-averaged dynamics. Mode-averaging is problematic since many real-world sequences are highly multi-modal, an…

    Submitted 4 December, 2020; v1 submitted 20 October, 2020; originally announced October 2020.