Showing 1–16 of 16 results for author: Parker, D

Searching in archive eess.

  1. arXiv:2411.19842  [pdf, other]

    eess.AS cs.AI cs.LG cs.SD eess.SP

    Scaling Transformers for Low-Bitrate High-Quality Speech Coding

    Authors: Julian D Parker, Anton Smirnov, Jordi Pons, CJ Carr, Zack Zukowski, Zach Evans, Xubo Liu

    Abstract: The tokenization of speech with neural audio codec models is a vital part of modern AI pipelines for the generation or understanding of speech, alone or in a multimodal context. Traditionally such tokenization models have concentrated on low parameter-count architectures using only components with strong inductive biases. In this work we show that by scaling a transformer architecture with large p…

    Submitted 29 November, 2024; originally announced November 2024.

  2. arXiv:2408.03093  [pdf, other]

    cs.LG cs.AI eess.SY

    Certifiably Robust Policies for Uncertain Parametric Environments

    Authors: Yannik Schnitzer, Alessandro Abate, David Parker

    Abstract: We present a data-driven approach for producing policies that are provably robust across unknown stochastic environments. Existing approaches can learn models of a single environment as an interval Markov decision process (IMDP) and produce a robust policy with a probably approximately correct (PAC) guarantee on its performance. However, these are unable to reason about the impact of environmenta…

    Submitted 23 March, 2025; v1 submitted 6 August, 2024; originally announced August 2024.

  3. arXiv:2407.14358  [pdf, other]

    cs.SD cs.AI eess.AS

    Stable Audio Open

    Authors: Zach Evans, Julian D. Parker, CJ Carr, Zack Zukowski, Josiah Taylor, Jordi Pons

    Abstract: Open generative models are vitally important for the community, allowing for fine-tunes and serving as baselines when presenting new models. However, most current text-to-audio models are private and not accessible for artists and researchers to build upon. Here we describe the architecture and training process of a new open-weights text-to-audio model trained with Creative Commons data. Our evalu…

    Submitted 31 July, 2024; v1 submitted 19 July, 2024; originally announced July 2024.

    Comments: Demo: https://stability-ai.github.io/stable-audio-open-demo/ Weights: https://huggingface.co/stabilityai/stable-audio-open-1.0 Code: https://github.com/Stability-AI/stable-audio-tools. arXiv admin note: text overlap with arXiv:2404.10301

  4. arXiv:2404.10301  [pdf, other]

    cs.SD cs.LG eess.AS

    Long-form music generation with latent diffusion

    Authors: Zach Evans, Julian D. Parker, CJ Carr, Zack Zukowski, Josiah Taylor, Jordi Pons

    Abstract: Audio-based generative models for music have seen great strides recently, but so far have not managed to produce full-length music tracks with coherent musical structure from text prompts. We show that by training a generative model on long temporal contexts it is possible to produce long-form music of up to 4m45s. Our model consists of a diffusion-transformer operating on a highly downsampled con…

    Submitted 29 July, 2024; v1 submitted 16 April, 2024; originally announced April 2024.

  5. arXiv:2403.09184  [pdf, other]

    eess.SY cs.AI cs.LO

    Learning Algorithms for Verification of Markov Decision Processes

    Authors: Tomáš Brázdil, Krishnendu Chatterjee, Martin Chmelik, Vojtěch Forejt, Jan Křetínský, Marta Kwiatkowska, Tobias Meggendorfer, David Parker, Mateusz Ujma

    Abstract: We present a general framework for applying learning algorithms and heuristical guidance to the verification of Markov decision processes (MDPs). The primary goal of our techniques is to improve performance by avoiding an exhaustive exploration of the state space, instead focussing on particularly relevant areas of the system, guided by heuristics. Our work builds on the previous results of Bráz…

    Submitted 31 March, 2025; v1 submitted 14 March, 2024; originally announced March 2024.

    Comments: 82 pages. This is the TheoretiCS journal version

    Journal ref: TheoretiCS, Volume 4 (April 1, 2025) theoretics:13268

  6. arXiv:2312.08723  [pdf, other]

    cs.SD cs.LG eess.AS

    StemGen: A music generation model that listens

    Authors: Julian D. Parker, Janne Spijkervet, Katerina Kosta, Furkan Yesiler, Boris Kuznetsov, Ju-Chiang Wang, Matt Avent, Jitong Chen, Duc Le

    Abstract: End-to-end generation of musical audio using deep learning techniques has seen an explosion of activity recently. However, most models concentrate on generating fully mixed music in response to abstract conditioning information. In this work, we present an alternative paradigm for producing music generation models that can listen and respond to musical context. We describe how such a model can be…

    Submitted 16 January, 2024; v1 submitted 14 December, 2023; originally announced December 2023.

    Comments: Accepted for publication at ICASSP 2024

  7. arXiv:2306.17639  [pdf, other]

    eess.SY cs.AI

    Point-Based Value Iteration for POMDPs with Neural Perception Mechanisms

    Authors: Rui Yan, Gabriel Santos, Gethin Norman, David Parker, Marta Kwiatkowska

    Abstract: The increasing trend to integrate neural networks and conventional software components in safety-critical settings calls for methodologies for their formal modelling, verification and correct-by-construction policy synthesis. We introduce neuro-symbolic partially observable Markov decision processes (NS-POMDPs), a variant of continuous-state POMDPs with discrete observations and actions, in which…

    Submitted 7 August, 2024; v1 submitted 30 June, 2023; originally announced June 2023.

    Comments: 65 pages, 14 figures

  8. arXiv:2306.00860  [pdf, other]

    cs.SD eess.AS

    Differentiable Allpass Filters for Phase Response Estimation and Automatic Signal Alignment

    Authors: Anders R. Bargum, Stefania Serafin, Cumhur Erkut, Julian D. Parker

    Abstract: Virtual analog (VA) audio effects are increasingly based on neural networks and deep learning frameworks. Due to the underlying black-box methodology, a successful model will learn to approximate the data it is presented, including potential errors such as latency and audio dropouts as well as non-linear characteristics and frequency-dependent phase shifts produced by the hardware. The latter is o…

    Submitted 2 June, 2023; v1 submitted 1 June, 2023; originally announced June 2023.

    Comments: Collaboration done while interning/employed at Native Instruments. Accepted for publication in Proc. DAFX'23, Copenhagen, Denmark, September 2023. Sound examples at https://abargum.github.io v2: 10 pages, LaTeX; figures resized, pdf optimized

  9. Robust Control for Dynamical Systems With Non-Gaussian Noise via Formal Abstractions

    Authors: Thom Badings, Licio Romao, Alessandro Abate, David Parker, Hasan A. Poonawala, Marielle Stoelinga, Nils Jansen

    Abstract: Controllers for dynamical systems that operate in safety-critical settings must account for stochastic disturbances. Such disturbances are often modeled as process noise in a dynamical system, and common assumptions are that the underlying distributions are known and/or Gaussian. In practice, however, these assumptions may be unrealistic and can lead to poor approximations of the true noise distri…

    Submitted 4 January, 2023; originally announced January 2023.

    Comments: To appear in the Journal of Artificial Intelligence Research (JAIR). arXiv admin note: text overlap with arXiv:2110.12662

    Journal ref: Journal of Artificial Intelligence Research (JAIR) 76 (2023) 341-391

  10. arXiv:2208.14445  [pdf]

    q-bio.QM cs.CV eess.IV

    Artificial intelligence-based locoregional markers of brain peritumoral microenvironment

    Authors: Zahra Riahi Samani, Drew Parker, Hamed Akbari, Spyridon Bakas, Ronald L. Wolf, Steven Brem, Ragini Verma

    Abstract: In malignant primary brain tumors, cancer cells infiltrate into the peritumoral brain structures which results in inevitable recurrence. Quantitative assessment of infiltrative heterogeneity in the peritumoral region, the area where biopsy or resection can be hazardous, is important for clinical decision making. Previous work on characterizing the infiltrative heterogeneity in the peritumoral regi…

    Submitted 29 August, 2022; originally announced August 2022.

  11. arXiv:2204.10125  [pdf, other]

    cs.SD cs.LG eess.AS physics.comp-ph

    Physical Modeling using Recurrent Neural Networks with Fast Convolutional Layers

    Authors: Julian D. Parker, Sebastian J. Schlecht, Rudolf Rabenstein, Maximilian Schäfer

    Abstract: Discrete-time modeling of acoustic, mechanical and electrical systems is a prominent topic in the musical signal processing literature. Such models are mostly derived by discretizing a mathematical model, given in terms of ordinary or partial differential equations, using established techniques. Recent work has applied the techniques of machine-learning to construct such models automatically from…

    Submitted 1 June, 2022; v1 submitted 21 April, 2022; originally announced April 2022.

    Comments: Accepted to DAFx2022

  12. arXiv:2110.12662  [pdf, other]

    eess.SY cs.AI cs.RO

    Sampling-Based Robust Control of Autonomous Systems with Non-Gaussian Noise

    Authors: Thom S. Badings, Alessandro Abate, Nils Jansen, David Parker, Hasan A. Poonawala, Marielle Stoelinga

    Abstract: Controllers for autonomous systems that operate in safety-critical settings must account for stochastic disturbances. Such disturbances are often modelled as process noise, and common assumptions are that the underlying distributions are known and/or Gaussian. In practice, however, these assumptions may be unrealistic and can lead to poor approximations of the true noise distribution. We present a…

    Submitted 13 December, 2021; v1 submitted 25 October, 2021; originally announced October 2021.

    Journal ref: AAAI 2022 (distinguished paper)

  13. arXiv:2103.05285  [pdf]

    eess.IV cs.CV

    3D-QCNet -- A Pipeline for Automated Artifact Detection in Diffusion MRI images

    Authors: Adnan Ahmad, Drew Parker, Zahra Riahi Samani, Ragini Verma

    Abstract: Artifacts are a common occurrence in Diffusion MRI (dMRI) scans. Identifying and removing them is essential to ensure the accuracy and viability of any post processing carried out on these scans. This makes QC (quality control) a crucial first step prior to any analysis of dMRI data. Several QC methods for artifact detection exist; however, they suffer from problems like requiring manual interventi…

    Submitted 9 March, 2021; originally announced March 2021.

  14. arXiv:1911.06816  [pdf]

    eess.IV cs.CV cs.LG

    QC-Automator: Deep Learning-based Automated Quality Control for Diffusion MR Images

    Authors: Zahra Riahi Samani, Jacob Antony Alappatt, Drew Parker, Abdol Aziz Ould Ismail, Ragini Verma

    Abstract: Quality assessment of diffusion MRI (dMRI) data is essential prior to any analysis, so that appropriate pre-processing can be used to improve data quality and ensure that the presence of MRI artifacts does not affect the results of subsequent image analysis. Manual quality assessment of the data is subjective, possibly error-prone, and infeasible, especially considering the growing number of consort…

    Submitted 15 November, 2019; originally announced November 2019.

  15. arXiv:1506.06419  [pdf, other]

    cs.LO eess.SY

    Verification and Control of Partially Observable Probabilistic Real-Time Systems

    Authors: Gethin Norman, David Parker, Xueyi Zou

    Abstract: We propose automated techniques for the verification and control of probabilistic real-time systems that are only partially observable. To formally model such systems, we define an extension of probabilistic timed automata in which local states are partially visible to an observer or controller. We give a probabilistic temporal logic that can express a range of quantitative properties of these mod…

    Submitted 22 June, 2015; v1 submitted 21 June, 2015; originally announced June 2015.

  16. arXiv:1504.04662  [pdf, other]

    cs.LO eess.SY math.OC

    Permissive Controller Synthesis for Probabilistic Systems

    Authors: Klaus Drager, Vojtech Forejt, Marta Kwiatkowska, David Parker, Mateusz Ujma

    Abstract: We propose novel controller synthesis techniques for probabilistic systems modelled using stochastic two-player games: one player acts as a controller, the second represents its environment, and probability is used to capture uncertainty arising due to, for example, unreliable sensors or faulty system components. Our aim is to generate robust controllers that are resilient to unexpected system ch…

    Submitted 29 June, 2015; v1 submitted 17 April, 2015; originally announced April 2015.

    Journal ref: Logical Methods in Computer Science, Volume 11, Issue 2 (June 30, 2015) lmcs:1576