Skip to main content

Showing 1–5 of 5 results for author: Namgyal, T

Searching in archive cs. Search in all archives.
.
  1. arXiv:2409.17069  [pdf, other

    cs.SD cs.AI cs.CV cs.LG eess.AS

    The Effect of Perceptual Metrics on Music Representation Learning for Genre Classification

    Authors: Tashi Namgyal, Alexander Hepburn, Raul Santos-Rodriguez, Valero Laparra, Jesus Malo

    Abstract: The subjective quality of natural signals can be approximated with objective perceptual metrics. Designed to approximate the perceptual behaviour of human observers, perceptual metrics often reflect structures found in natural signals and neurological pathways. Models trained with perceptual metrics as loss functions can capture perceptually meaningful features from the structures held within thes… ▽ More

    Submitted 25 September, 2024; originally announced September 2024.

    Comments: arXiv admin note: text overlap with arXiv:2312.03455

  2. arXiv:2312.03479  [pdf, other

    cs.SD cs.AI cs.HC eess.AS

    JAMMIN-GPT: Text-based Improvisation using LLMs in Ableton Live

    Authors: Sven Hollowell, Tashi Namgyal, Paul Marshall

    Abstract: We introduce a system that allows users of Ableton Live to create MIDI-clips by naming them with musical descriptions. Users can compose by typing the desired musical content directly in Ableton's clip view, which is then inserted by our integrated system. This allows users to stay in the flow of their creative process while quickly generating musical ideas. The system works by prompting ChatGPT t… ▽ More

    Submitted 6 December, 2023; originally announced December 2023.

    Comments: Conference: 24th International Society for Music Information Retrieval. Late Breaking Demo. 2023

  3. arXiv:2312.03455  [pdf, other

    cs.SD cs.AI cs.CV cs.LG eess.AS eess.IV

    Data is Overrated: Perceptual Metrics Can Lead Learning in the Absence of Training Data

    Authors: Tashi Namgyal, Alexander Hepburn, Raul Santos-Rodriguez, Valero Laparra, Jesus Malo

    Abstract: Perceptual metrics are traditionally used to evaluate the quality of natural signals, such as images and audio. They are designed to mimic the perceptual behaviour of human observers and usually reflect structures found in natural signals. This motivates their use as loss functions for training generative models such that models will learn to capture the structure held in the metric. We take this… ▽ More

    Submitted 6 December, 2023; originally announced December 2023.

    Comments: Machine Learning for Audio Workshop, NeurIPS 2023

  4. arXiv:2305.11605  [pdf, other

    cs.SD cs.AI cs.LG eess.AS

    MIDI-Draw: Sketching to Control Melody Generation

    Authors: Tashi Namgyal, Peter Flach, Raul Santos-Rodriguez

    Abstract: We describe a proof-of-principle implementation of a system for drawing melodies that abstracts away from a note-level input representation via melodic contours. The aim is to allow users to express their musical intentions without requiring prior knowledge of how notes fit together melodiously. Current approaches to controllable melody generation often require users to choose parameters that are… ▽ More

    Submitted 19 May, 2023; originally announced May 2023.

    Comments: Late-Breaking / Demo Session Extended Abstract, ISMIR 2022 Conference

  5. arXiv:2305.11582  [pdf, other

    cs.SD cs.CV cs.LG eess.AS

    What You Hear Is What You See: Audio Quality Metrics From Image Quality Metrics

    Authors: Tashi Namgyal, Alexander Hepburn, Raul Santos-Rodriguez, Valero Laparra, Jesus Malo

    Abstract: In this study, we investigate the feasibility of utilizing state-of-the-art image perceptual metrics for evaluating audio signals by representing them as spectrograms. The encouraging outcome of the proposed approach is based on the similarity between the neural mechanisms in the auditory and visual pathways. Furthermore, we customise one of the metrics which has a psychoacoustically plausible arc… ▽ More

    Submitted 30 August, 2023; v1 submitted 19 May, 2023; originally announced May 2023.