
Showing 1–32 of 32 results for author: Steinmetz, J

  1. arXiv:2506.01566  [pdf, ps, other]

    cs.PF cs.AI cs.AR cs.LG

    FlexiSAGA: A Flexible Systolic Array GEMM Accelerator for Sparse and Dense Processing

    Authors: Mika Markus Müller, Konstantin Lübeck, Alexander Louis-Ferdinand Jung, Jannik Steinmetz, Oliver Bringmann

    Abstract: Artificial Intelligence (AI) algorithms, such as Deep Neural Networks (DNNs), have become an important tool for a wide range of applications, from computer vision to natural language processing. However, the computational complexity of DNN inference poses a significant challenge, particularly for processing on resource-constrained edge devices. One promising approach to address this challenge is t…

    Submitted 2 June, 2025; originally announced June 2025.

    Comments: Accepted Version for: SAMOS XXV

  2. arXiv:2502.14405  [pdf, other]

    cs.SD eess.AS

    Differentiable Black-box and Gray-box Modeling of Nonlinear Audio Effects

    Authors: Marco Comunità, Christian J. Steinmetz, Joshua D. Reiss

    Abstract: Audio effects are extensively used at every stage of audio and music content creation. The majority of differentiable audio effects modeling approaches fall into the black-box or gray-box paradigms; and most models have been proposed and applied to nonlinear effects like guitar amplifiers, overdrive, distortion, fuzz and compressor. Although a plethora of architectures have been introduced for the…

    Submitted 20 February, 2025; originally announced February 2025.

  3. arXiv:2502.11668  [pdf, other]

    cs.SD

    NablAFx: A Framework for Differentiable Black-box and Gray-box Modeling of Audio Effects

    Authors: Marco Comunità, Christian J. Steinmetz, Joshua D. Reiss

    Abstract: We present NablAFx, an open-source framework developed to support research in differentiable black-box and gray-box modeling of audio effects. Built in PyTorch, NablAFx offers a versatile ecosystem to configure, train, evaluate, and compare various architectural approaches. It includes classes to manage model architectures, datasets, and training, along with features to compute and log losses, met…

    Submitted 25 February, 2025; v1 submitted 17 February, 2025; originally announced February 2025.

  4. arXiv:2412.13330  [pdf, other]

    quant-ph physics.comp-ph

    Simulating imperfect quantum optical circuits using unsymmetrized bases

    Authors: John Steinmetz, Maike Ostmann, Alex Neville, Brendan Pankovich, Adel Sohbi

    Abstract: Fault-tolerant photonic quantum computing requires the generation of large entangled resource states. The required size of these states makes it challenging to simulate the effects of errors such as loss and partial distinguishability. For an interferometer with $N$ partially distinguishable input photons and $M$ spatial modes, the Fock basis can have up to ${N+NM-1\choose N}$ elements. We show th…

    Submitted 17 December, 2024; originally announced December 2024.

    Comments: 16+7 pages, 9 figures

  5. arXiv:2410.21233  [pdf, other]

    cs.SD eess.AS

    ST-ITO: Controlling Audio Effects for Style Transfer with Inference-Time Optimization

    Authors: Christian J. Steinmetz, Shubhr Singh, Marco Comunità, Ilias Ibnyahya, Shanxin Yuan, Emmanouil Benetos, Joshua D. Reiss

    Abstract: Audio production style transfer is the task of processing an input to impart stylistic elements from a reference recording. Existing approaches often train a neural network to estimate control parameters for a set of audio effects. However, these approaches are limited in that they can only control a fixed set of effects, where the effects must be differentiable or otherwise employ specialized tra…

    Submitted 28 October, 2024; originally announced October 2024.

    Comments: Accepted to ISMIR 2024. Code available https://github.com/csteinmetz1/st-ito

  6. arXiv:2409.08595  [pdf, ps, other]

    cs.PF cs.AI cs.AR cs.LG

    Automatic Generation of Fast and Accurate Performance Models for Deep Neural Network Accelerators

    Authors: Konstantin Lübeck, Alexander Louis-Ferdinand Jung, Felix Wedlich, Mika Markus Müller, Federico Nicolás Peccia, Felix Thömmes, Jannik Steinmetz, Valentin Biermaier, Adrian Frischknecht, Paul Palomero Bernardo, Oliver Bringmann

    Abstract: Implementing Deep Neural Networks (DNNs) on resource-constrained edge devices is a challenging task that requires tailored hardware accelerator architectures and a clear understanding of their performance characteristics when executing the intended AI workload. To facilitate this, we present an automated generation approach for fast performance models to accurately estimate the latency of a DNN ma…

    Submitted 13 September, 2024; originally announced September 2024.

    Comments: Accepted version for: ACM Transactions on Embedded Computing Systems

    Journal ref: Volume 24, Year 2025, Issue 2, Pages 1 - 32

  7. arXiv:2406.08330  [pdf, ps, other]

    cs.PF cs.AI cs.AR cs.LG

    It's all about PR -- Smart Benchmarking AI Accelerators using Performance Representatives

    Authors: Alexander Louis-Ferdinand Jung, Jannik Steinmetz, Jonathan Gietz, Konstantin Lübeck, Oliver Bringmann

    Abstract: Statistical models are widely used to estimate the performance of commercial off-the-shelf (COTS) AI hardware accelerators. However, training of statistical performance models often requires vast amounts of data, leading to a significant time investment and can be difficult in case of limited hardware availability. To alleviate this problem, we propose a novel performance modeling methodology that…

    Submitted 12 June, 2024; originally announced June 2024.

    Comments: Accepted version for: SAMOS'24

    Journal ref: Embedded Computer Systems: Architectures, Modeling, and Simulation, LNCS, Volume 15226, Year 2024, Pages 59-75

  8. arXiv:2403.16331  [pdf, other]

    cs.SD cs.LG eess.AS

    Modeling Analog Dynamic Range Compressors using Deep Learning and State-space Models

    Authors: Hanzhi Yin, Gang Cheng, Christian J. Steinmetz, Ruibin Yuan, Richard M. Stern, Roger B. Dannenberg

    Abstract: We describe a novel approach for developing realistic digital models of dynamic range compressors for digital audio production by analyzing their analog prototypes. While realistic digital dynamic compressors are potentially useful for many applications, the design process is challenging because the compressors operate nonlinearly over long time scales. Our approach is based on the structured stat…

    Submitted 24 March, 2024; originally announced March 2024.

  9. arXiv:2311.01526  [pdf, other]

    cs.SD cs.LG eess.AS

    ATGNN: Audio Tagging Graph Neural Network

    Authors: Shubhr Singh, Christian J. Steinmetz, Emmanouil Benetos, Huy Phan, Dan Stowell

    Abstract: Deep learning models such as CNNs and Transformers have achieved impressive performance for end-to-end audio tagging. Recent works have shown that despite stacking multiple layers, the receptive field of CNNs remains severely limited. Transformers on the other hand are able to map global context through self-attention, but treat the spectrogram as a sequence of patches which is not flexible enough…

    Submitted 2 November, 2023; originally announced November 2023.

  10. arXiv:2310.11364  [pdf, other]

    cs.SD eess.AS

    High-Fidelity Noise Reduction with Differentiable Signal Processing

    Authors: Christian J. Steinmetz, Thomas Walther, Joshua D. Reiss

    Abstract: Noise reduction techniques based on deep learning have demonstrated impressive performance in enhancing the overall quality of recorded speech. While these approaches are highly performant, their application in audio engineering can be limited due to a number of factors. These include operation only on speech without support for music, lack of real-time capability, lack of interpretable control pa…

    Submitted 17 October, 2023; originally announced October 2023.

    Comments: Accepted for publication at the 155th Convention of the Audio Engineering Society

  11. arXiv:2308.16177  [pdf, other]

    cs.SD eess.AS

    General Purpose Audio Effect Removal

    Authors: Matthew Rice, Christian J. Steinmetz, George Fazekas, Joshua D. Reiss

    Abstract: Although the design and application of audio effects is well understood, the inverse problem of removing these effects is significantly more challenging and far less studied. Recently, deep learning has been applied to audio effect removal; however, existing approaches have focused on narrow formulations considering only one effect or source type at a time. In realistic scenarios, multiple effects…

    Submitted 30 August, 2023; originally announced August 2023.

    Comments: Preprint. Accepted to IEEE Workshop on Applications of Signal Processing to Audio and Acoustics (WASPAA) 2023

  12. arXiv:2305.13262  [pdf, other]

    cs.SD cs.LG eess.AS

    Modulation Extraction for LFO-driven Audio Effects

    Authors: Christopher Mitcheltree, Christian J. Steinmetz, Marco Comunità, Joshua D. Reiss

    Abstract: Low frequency oscillator (LFO) driven audio effects such as phaser, flanger, and chorus, modify an input signal using time-varying filters and delays, resulting in characteristic sweeping or widening effects. It has been shown that these effects can be modeled using neural networks when conditioned with the ground truth LFO signal. However, in most cases, the LFO signal is not accessible and measu…

    Submitted 22 May, 2023; originally announced May 2023.

    Comments: Accepted to DAFx 2023. Listening samples and plugins can be found at https://christhetree.github.io/mod_extraction/

  13. arXiv:2304.04394  [pdf, other]

    eess.AS cs.SD

    Leveraging Neural Representations for Audio Manipulation

    Authors: Scott H. Hawley, Christian J. Steinmetz

    Abstract: We investigate applying audio manipulations using pretrained neural network-based autoencoders as an alternative to traditional signal processing methods, since the former may provide greater semantic or perceptual organization. To establish the potential of this approach, we first establish if representations from these models encode information about manipulations. We carry out experiments and p…

    Submitted 10 April, 2023; originally announced April 2023.

    Comments: Accepted as Express Paper for AES Europe 2023, https://aeseurope.com/

  14. arXiv:2303.05913  [pdf, other]

    math.ST

    Bootstrap Consistency for the Mack Bootstrap

    Authors: Julia Steinmetz, Carsten Jentsch

    Abstract: Mack's distribution-free chain ladder reserving model belongs to the most popular approaches in non-life insurance mathematics. Proposed to determine the first two moments of the reserve, it does not allow to identify the whole distribution of the reserve. For this purpose, Mack's model is usually equipped with a tailor-made bootstrap procedure. Although widely used in practice to estimate the res…

    Submitted 10 March, 2023; originally announced March 2023.

  15. arXiv:2211.07718  [pdf, other]

    quant-ph

    Time-Dependent Hamiltonian Reconstruction using Continuous Weak Measurements

    Authors: Karthik Siva, Gerwin Koolstra, John Steinmetz, William P. Livingston, Debmalya Das, Larry Chen, John Mark Kreikebaum, Noah Stevenson, Christian Jünger, David I. Santiago, Irfan Siddiqi, Andrew N. Jordan

    Abstract: Reconstructing the Hamiltonian of a quantum system is an essential task for characterizing and certifying quantum processors and simulators. Existing techniques either rely on projective measurements of the system before and after coherent time evolution and do not explicitly reconstruct the full time-dependent Hamiltonian or interrupt evolution for tomography. Here, we experimentally demonstrate…

    Submitted 14 November, 2022; originally announced November 2022.

    Comments: Main text: 12 pages, 4 figures. Appendix: 10 pages, 4 figures

  16. arXiv:2211.00497  [pdf, other]

    cs.SD cs.AI cs.LG eess.AS

    Modelling black-box audio effects with time-varying feature modulation

    Authors: Marco Comunità, Christian J. Steinmetz, Huy Phan, Joshua D. Reiss

    Abstract: Deep learning approaches for black-box modelling of audio effects have shown promise, however, the majority of existing work focuses on nonlinear effects with behaviour on relatively short time-scales, such as guitar amplifiers and distortion. While recurrent and convolutional architectures can theoretically be extended to capture behaviour at longer time scales, we show that simply scaling the wi…

    Submitted 9 May, 2023; v1 submitted 1 November, 2022; originally announced November 2022.

  17. arXiv:2207.08759  [pdf, other]

    cs.SD eess.AS

    Style Transfer of Audio Effects with Differentiable Signal Processing

    Authors: Christian J. Steinmetz, Nicholas J. Bryan, Joshua D. Reiss

    Abstract: We present a framework that can impose the audio effects and production style from one recording to another by example with the goal of simplifying the audio production process. We train a deep neural network to analyze an input recording and a style reference recording, and predict the control parameters of audio effects used to render the output. In contrast to past work, we integrate audio effe…

    Submitted 18 July, 2022; originally announced July 2022.

    Comments: Preprint. To appear in the Journal of the Audio Engineering Society

  18. Quantum Telescopy Clock Games

    Authors: Robert Czupryniak, Eric Chitambar, John Steinmetz, Andrew N. Jordan

    Abstract: We consider the clock game, a task formulated in the framework of quantum information theory, that can be used to improve the existing schemes of quantum-enhanced telescopy. The problem of learning when a stellar photon reaches a telescope is translated into an abstract game, which we call the clock game. A winning strategy is provided that involves performing a quantum non-demolition measurement th…

    Submitted 23 November, 2022; v1 submitted 11 March, 2022; originally announced March 2022.

    Comments: 18 pages, 7 figures

  19. arXiv:2203.03022  [pdf, ps, other]

    cs.SD cs.AI cs.LG eess.AS stat.ML

    HEAR: Holistic Evaluation of Audio Representations

    Authors: Joseph Turian, Jordie Shier, Humair Raj Khan, Bhiksha Raj, Björn W. Schuller, Christian J. Steinmetz, Colin Malloy, George Tzanetakis, Gissel Velarde, Kirk McNally, Max Henry, Nicolas Pinto, Camille Noufi, Christian Clough, Dorien Herremans, Eduardo Fonseca, Jesse Engel, Justin Salamon, Philippe Esling, Pranay Manocha, Shinji Watanabe, Zeyu Jin, Yonatan Bisk

    Abstract: What audio embedding approach generalizes best to a wide range of downstream tasks across a variety of everyday domains without fine-tuning? The aim of the HEAR benchmark is to develop a general-purpose audio representation that provides a strong basis for learning in a wide variety of tasks and scenarios. HEAR evaluates audio representations using a benchmark suite across a variety of domains, in…

    Submitted 29 May, 2022; v1 submitted 6 March, 2022; originally announced March 2022.

    Comments: to appear in Proceedings of Machine Learning Research (PMLR): NeurIPS 2021 Competition Track

  20. arXiv:2112.02926  [pdf, other]

    eess.AS cs.SD

    Steerable discovery of neural audio effects

    Authors: Christian J. Steinmetz, Joshua D. Reiss

    Abstract: Applications of deep learning for audio effects often focus on modeling analog effects or learning to control effects to emulate a trained audio engineer. However, deep learning approaches also have the potential to expand creativity through neural audio effects that enable new sound transformations. While recent work demonstrated that neural networks with random weights produce compelling audio e…

    Submitted 6 December, 2021; originally announced December 2021.

    Comments: Accepted to NeurIPS 2021 Workshop on Machine Learning for Creativity and Design

  21. arXiv:2110.03691  [pdf, other]

    eess.SP cs.LG cs.SD eess.AS

    Direct design of biquad filter cascades with deep learning by sampling random polynomials

    Authors: Joseph T. Colonel, Christian J. Steinmetz, Marcus Michelen, Joshua D. Reiss

    Abstract: Designing infinite impulse response filters to match an arbitrary magnitude response requires specialized techniques. Methods like modified Yule-Walker are relatively efficient, but may not be sufficiently accurate in matching high order responses. On the other hand, iterative optimization techniques often enable superior performance, but come at the cost of longer run-times and are sensitive to i…

    Submitted 16 February, 2022; v1 submitted 7 October, 2021; originally announced October 2021.

    Comments: Accepted to ICASSP 2022

  22. arXiv:2110.01436  [pdf, other]

    eess.AS cs.SD

    WaveBeat: End-to-end beat and downbeat tracking in the time domain

    Authors: Christian J. Steinmetz, Joshua D. Reiss

    Abstract: Deep learning approaches for beat and downbeat tracking have brought advancements. However, these approaches continue to rely on hand-crafted, subsampled spectral features as input, restricting the information available to the model. In this work, we propose WaveBeat, an end-to-end approach for joint beat and downbeat tracking operating directly on waveforms. This method forgoes engineered spectra…

    Submitted 4 October, 2021; originally announced October 2021.

    Comments: To appear at the 151st AES Convention

  23. arXiv:2108.01170  [pdf, other]

    quant-ph physics.optics

    Optimal qubit circuits for quantum-enhanced telescopes

    Authors: Robert Czupryniak, John Steinmetz, Paul G. Kwiat, Andrew N. Jordan

    Abstract: We propose two optimal phase-estimation schemes that can be used for quantum-enhanced long-baseline interferometry. By using distributed entanglement, it is possible to eliminate the loss of stellar photons during transmission over the baselines. The first protocol is a sequence of gates using nonlinear optical elements, optimized over all possible measurement schemes to saturate the Cramér-Rao bo…

    Submitted 15 November, 2023; v1 submitted 2 August, 2021; originally announced August 2021.

    Comments: 14 pages, 9 figures

    Journal ref: Phys. Rev. A 108, 052408 (2023)

  24. arXiv:2107.07503  [pdf, other]

    eess.AS cs.SD

    Filtered Noise Shaping for Time Domain Room Impulse Response Estimation From Reverberant Speech

    Authors: Christian J. Steinmetz, Vamsi Krishna Ithapu, Paul Calamia

    Abstract: Deep learning approaches have emerged that aim to transform an audio signal so that it sounds as if it was recorded in the same room as a reference recording, with applications both in audio post-production and augmented reality. In this work, we propose FiNS, a Filtered Noise Shaping network that directly estimates the time domain room impulse response (RIR) from reverberant speech. Our domain-in…

    Submitted 15 July, 2021; originally announced July 2021.

    Comments: Accepted to WASPAA 2021. See details at https://facebookresearch.github.io/FiNS/

  25. Continuous measurement of a qudit using dispersively coupled radiation

    Authors: John Steinmetz, Debmalya Das, Irfan Siddiqi, Andrew N. Jordan

    Abstract: We analyze the continuous monitoring of a qudit coupled to a cavity using both phase-preserving and phase-sensitive amplification. The quantum trajectories of the system are described by a stochastic master equation, for which we derive the appropriate Lindblad operators. The measurement back-action causes spiraling in the state coordinates during collapse, which increases as the system levels bec…

    Submitted 4 June, 2022; v1 submitted 3 May, 2021; originally announced May 2021.

    Comments: 13 pages, 4 figures

    Journal ref: Phys. Rev. A 105, 052229 (2022)

  26. arXiv:2103.15752  [pdf, other]

    quant-ph physics.optics

    Enhanced on-chip frequency measurement using weak value amplification

    Authors: John Steinmetz, Kevin Lyons, Meiting Song, Jaime Cardenas, Andrew N. Jordan

    Abstract: We present an integrated design to precisely measure optical frequency using weak value amplification with a multi-mode interferometer. The technique involves introducing a weak perturbation to the system and then post-selecting the data in such a way that the signal is amplified without amplifying the technical noise, as has previously been demonstrated in a free-space setup. We demonstrate the a…

    Submitted 29 March, 2021; originally announced March 2021.

    Comments: 13 pages, 10 figures

  27. arXiv:2102.06200  [pdf, other]

    eess.AS cs.SD

    Efficient neural networks for real-time modeling of analog dynamic range compression

    Authors: Christian J. Steinmetz, Joshua D. Reiss

    Abstract: Deep learning approaches have demonstrated success in modeling analog audio effects. Nevertheless, challenges remain in modeling more complex effects that involve time-varying nonlinear elements, such as dynamic range compressors. Existing neural network approaches for modeling compression either ignore the device parameters, do not attain sufficient accuracy, or otherwise require large noncausal…

    Submitted 15 April, 2022; v1 submitted 11 February, 2021; originally announced February 2021.

    Comments: Updated and will appear at 152nd AES Convention (note title change)

  28. arXiv:2010.10291  [pdf, other]

    eess.AS cs.SD

    Automatic multitrack mixing with a differentiable mixing console of neural audio effects

    Authors: Christian J. Steinmetz, Jordi Pons, Santiago Pascual, Joan Serrà

    Abstract: Applications of deep learning to automatic multitrack mixing are largely unexplored. This is partly due to the limited available data, coupled with the fact that such data is relatively unstructured and variable. To address these challenges, we propose a domain-inspired model with a strong inductive bias for the mixing task. We achieve this with the application of pre-trained sub-networks and weig…

    Submitted 20 October, 2020; originally announced October 2020.

  29. arXiv:2010.04237  [pdf, other]

    eess.AS cs.SD

    Randomized Overdrive Neural Networks

    Authors: Christian J. Steinmetz, Joshua D. Reiss

    Abstract: By processing audio signals in the time-domain with randomly weighted temporal convolutional networks (TCNs), we uncover a wide range of novel, yet controllable overdrive effects. We discover that architectural aspects, such as the depth of the network, the kernel size, the number of channels, the activation function, as well as the weight initialization, all have a clear impact on the sonic chara…

    Submitted 4 August, 2021; v1 submitted 8 October, 2020; originally announced October 2020.

    Comments: Updating project URL. Now https://csteinmetz1.github.io/ronn

  30. Chaos in Continuously Monitored Quantum Systems: An Optimal Path Approach

    Authors: Philippe Lewalle, John Steinmetz, Andrew N. Jordan

    Abstract: We predict that continuously monitored quantum dynamics can be chaotic. The optimal paths between past and future boundary conditions can diverge exponentially in time when there is time-dependent evolution and continuous weak monitoring. Optimal paths are defined by extremizing the global probability density to move between two boundary conditions. We investigate the onset of chaos in pure-state…

    Submitted 3 August, 2018; v1 submitted 20 March, 2018; originally announced March 2018.

    Comments: 12+11 pages, 11 figures. Supplemental Animations can be found at << https://drive.google.com/file/d/1cx__Aggt40s3r8ueTe8LlqZyAWLb5NV1/view?usp=sharing >> (all animations in .pdf format), or << https://drive.google.com/drive/folders/12LgI0dCiSjRYoHO9oiWDaXm7S0PzM7Y1?usp=sharing >> (as individual .mp4 files)

    Journal ref: Phys. Rev. A 98, 012141 (2018)

  31. arXiv:1512.03667  [pdf]

    math.LO cs.LO

    An Intuitively Complete Analysis of Godel's Incompleteness

    Authors: Jason W. Steinmetz

    Abstract: A detailed and rigorous analysis of Gödel's proof of his first incompleteness theorem is presented. The purpose of this analysis is two-fold. The first is to reveal what Gödel actually proved to provide a clear and solid foundation upon which to base future research. The second is to construct a coherent explication of Gödel's proof that is not only approachable by the non-specialist, but also bri…

    Submitted 28 April, 2020; v1 submitted 8 December, 2015; originally announced December 2015.

    Comments: 31 pages plus 2 appendices. In v2 multiple minor clarifications were made, two errors were fixed, and PDF bookmarks were added

    MSC Class: 03F40 (Primary) 03F50; 03A99 (Secondary) ACM Class: F.4.1; I.2.3

  32. arXiv:1110.1658

    cs.CC

    Algorithm that Solves 3-SAT in Polynomial Time

    Authors: Jason W. Steinmetz

    Abstract: The question of whether the complexity class P is equal to the complexity class NP has been a seemingly intractable problem for over 4 decades. It has been clear that if an algorithm existed that would solve the problems in the NP class in polynomial time then P would equal NP. However, no one has yet been able to create that algorithm or to successfully prove that such an algorithm cannot exist.…

    Submitted 2 June, 2015; v1 submitted 5 October, 2011; originally announced October 2011.

    Comments: This paper has been withdrawn by the author because the integer operations within the algorithm cannot be proven to have a polynomial run time

    ACM Class: F.1.3; I.1.2