Showing 1–6 of 6 results for author: Schmidhuber, J

Searching in archive eess.
  1. arXiv:2502.05672  [pdf, other]

    stat.ML cs.AI cs.LG cs.NE eess.SY

    On the Convergence and Stability of Upside-Down Reinforcement Learning, Goal-Conditioned Supervised Learning, and Online Decision Transformers

    Authors: Miroslav Štrupl, Oleg Szehr, Francesco Faccio, Dylan R. Ashley, Rupesh Kumar Srivastava, Jürgen Schmidhuber

    Abstract: This article provides a rigorous analysis of convergence and stability of Episodic Upside-Down Reinforcement Learning, Goal-Conditioned Supervised Learning and Online Decision Transformers. These algorithms performed competitively across various benchmarks, from games to robotic tasks, but their theoretical understanding is limited to specific environmental conditions. This work initiates a theore…

    Submitted 8 February, 2025; originally announced February 2025.

    Comments: 85 pages in main text + 4 pages of references + 26 pages of appendices, 12 figures in main text + 2 figures in appendices; source code available at https://github.com/struplm/eUDRL-GCSL-ODT-Convergence-public

    MSC Class: 68T07

    ACM Class: I.2.6; I.5.1

  2. arXiv:2411.07772  [pdf, other]

    cs.LG cs.AI cs.CL cs.MM cs.SD eess.AS

    Automatic Album Sequencing

    Authors: Vincent Herrmann, Dylan R. Ashley, Jürgen Schmidhuber

    Abstract: Album sequencing is a critical part of the album production process. Recently, a data-driven approach was proposed that sequences general collections of independent media by extracting the narrative essence of the items in the collections. While this approach implies an album sequencing technique, it is not widely accessible to a less technical audience, requiring advanced knowledge of machine lea…

    Submitted 26 November, 2024; v1 submitted 12 November, 2024; originally announced November 2024.

    Comments: presented as a late breaking demo in the 25th International Society for Music Information Retrieval Conference; 3 pages in main text + 1 page of references, 3 figures in main text; source code available at https://github.com/dylanashley/automatic-album-sequencing

    MSC Class: 68T07

    ACM Class: H.5.5; I.2.6; I.5.1; J.5

  3. arXiv:2402.03141  [pdf, other]

    cs.LG cs.AI eess.SY

    Boosting Reinforcement Learning with Strongly Delayed Feedback Through Auxiliary Short Delays

    Authors: Qingyuan Wu, Simon Sinong Zhan, Yixuan Wang, Yuhui Wang, Chung-Wei Lin, Chen Lv, Qi Zhu, Jürgen Schmidhuber, Chao Huang

    Abstract: Reinforcement learning (RL) is challenging in the common case of delays between events and their sensory perceptions. State-of-the-art (SOTA) state augmentation techniques either suffer from state space explosion or performance degeneration in stochastic environments. To address these challenges, we present a novel Auxiliary-Delayed Reinforcement Learning (AD-RL) method that leverages auxiliary ta…

    Submitted 5 June, 2024; v1 submitted 5 February, 2024; originally announced February 2024.

    Comments: ICML 2024

  4. arXiv:2311.07534  [pdf, other]

    cs.SD cs.LG eess.AS

    Unsupervised Musical Object Discovery from Audio

    Authors: Joonsu Gha, Vincent Herrmann, Benjamin Grewe, Jürgen Schmidhuber, Anand Gopalakrishnan

    Abstract: Current object-centric learning models such as the popular SlotAttention architecture allow for unsupervised visual scene decomposition. Our novel MusicSlots method adapts SlotAttention to the audio domain, to achieve unsupervised music decomposition. Since concepts of opacity and occlusion in vision have no auditory analogues, the softmax normalization of alpha masks in the decoders of visual obj…

    Submitted 14 November, 2023; v1 submitted 13 November, 2023; originally announced November 2023.

    Comments: Accepted to Machine Learning for Audio Workshop, NeurIPS 2023

  5. arXiv:2211.12423  [pdf, other]

    cs.CL cs.AI cs.LG cs.MM cs.NE cs.SD eess.AS

    On Narrative Information and the Distillation of Stories

    Authors: Dylan R. Ashley, Vincent Herrmann, Zachary Friggstad, Jürgen Schmidhuber

    Abstract: The act of telling stories is a fundamental part of what it means to be human. This work introduces the concept of narrative information, which we define to be the overlap in information space between a story and the items that compose the story. Using contrastive learning methods, we show how modern artificial neural networks can be leveraged to distill stories and extract a representation of the…

    Submitted 13 February, 2023; v1 submitted 22 November, 2022; originally announced November 2022.

    Comments: presented in the Information-Theoretic Principles in Cognitive Systems Workshop at the 36th Conference on Neural Information Processing Systems; 4 pages in main text + 2 pages of references + 8 pages of appendices, 2 figures in main text + 3 in appendices, 1 table in main text, 2 algorithms in appendices; source code available at https://github.com/dylanashley/story-distiller

    MSC Class: 68T07 (Primary) 68P30; 68W50; 94A15 (Secondary)

    ACM Class: H.1.1; H.5.5; I.2.6; I.5.1; J.5

  6. arXiv:2111.02216  [pdf, other]

    cs.CL cs.LG cs.MM cs.SD eess.AS

    Automatic Embedding of Stories Into Collections of Independent Media

    Authors: Dylan R. Ashley, Vincent Herrmann, Zachary Friggstad, Kory W. Mathewson, Jürgen Schmidhuber

    Abstract: We look at how machine learning techniques that derive properties of items in a collection of independent media can be used to automatically embed stories into such collections. To do so, we use models that extract the tempo of songs to make a music playlist follow a narrative arc. Our work specifies an open-source tool that uses pre-trained neural network models to extract the global tempo of a s…

    Submitted 3 November, 2021; originally announced November 2021.

    Comments: 2 pages in main text + 1 page of references + 6 pages of appendices, 2 figures in main text + 3 figures in appendices, 1 algorithm in appendices; source code available at https://gist.github.com/dylanashley/1387a99deb85bfc0bce11286810cd98b

    ACM Class: H.5.5; I.2.6; J.5