Showing 1–9 of 9 results for author: Pikrakis, A

  1. arXiv:2407.02156  [pdf, other]

    cs.SD cs.AI cs.IR cs.LG eess.AS

    Towards Training Music Taggers on Synthetic Data

    Authors: Nadine Kroher, Steven Manangu, Aggelos Pikrakis

    Abstract: Most contemporary music tagging systems rely on large volumes of annotated data. As an alternative, we investigate the extent to which synthetically generated music excerpts can improve tagging systems when only small annotated collections are available. To this end, we release GTZAN-synth, a synthetic dataset that follows the taxonomy of the well-known GTZAN dataset while being ten times larger i…

    Submitted 2 July, 2024; originally announced July 2024.

    Comments: 6 pages, 3 figures, accepted to 21st International Conference on Content-based Multimedia Indexing (CBMI) 2024, code available https://github.com/NadineKroher/music-tagging-synthetic-data-cbmi-2024

    ACM Class: I.2

  2. arXiv:2312.07594  [pdf]

    cs.CR

    On the Prediction of Hardware Security Properties of HLS Designs Using Graph Neural Networks

    Authors: Amalia Artemis Koufopoulou, Athanasios Papadimitriou, Aggelos Pikrakis, Mihalis Psarakis, David Hely

    Abstract: High-level synthesis (HLS) tools have provided significant productivity enhancements to the design flow of digital systems in recent years, resulting in highly-optimized circuits, in terms of area and latency. Given the evolution of hardware attacks, which can render them vulnerable, it is essential to consider security as a significant aspect of the HLS design flow. Yet the need to evaluate a hug…

    Submitted 11 December, 2023; originally announced December 2023.

    Comments: 6 pages, 2 figures, 3 tables, submitted to 2023 IEEE International Symposium on Defect and Fault Tolerance in VLSI and Nanotechnology Systems (DFT)

  3. arXiv:2311.09094  [pdf, other]

    cs.SD cs.AI eess.AS

    Can MusicGen Create Training Data for MIR Tasks?

    Authors: Nadine Kroher, Helena Cuesta, Aggelos Pikrakis

    Abstract: We are investigating the broader concept of using AI-based generative music systems to generate training data for Music Information Retrieval (MIR) tasks. To kick off this line of work, we ran an initial experiment in which we trained a genre classifier on a fully artificial music dataset created with MusicGen. We constructed over 50 000 genre-conditioned textual descriptions and generated a coll…

    Submitted 15 November, 2023; originally announced November 2023.

    Comments: This is an extended abstract presented at the Late-Breaking / Demo Session of the International Society for Music Information Retrieval Conference (ISMIR) 2023 (Milan, Italy)

  4. arXiv:2208.09201  [pdf, other]

    cs.SD cs.LG eess.AS

    Improving Post-Processing of Audio Event Detectors Using Reinforcement Learning

    Authors: Petros Giannakopoulos, Aggelos Pikrakis, Yannis Cotronis

    Abstract: We apply post-processing to the class probability distribution outputs of audio event classification models and employ reinforcement learning to jointly discover the optimal parameters for various stages of a post-processing stack, such as the classification thresholds and the kernel sizes of median filtering algorithms used to smooth out model predictions. To achieve this we define a reinforcemen…

    Submitted 19 August, 2022; originally announced August 2022.

    Comments: Published in the IEEE Access journal, Volume 10, 2022

  5. arXiv:2110.12778  [pdf, other]

    cs.SD cs.LG eess.AS

    A Deep Reinforcement Learning Approach for Audio-based Navigation and Audio Source Localization in Multi-speaker Environments

    Authors: Petros Giannakopoulos, Aggelos Pikrakis, Yannis Cotronis

    Abstract: In this work we apply deep reinforcement learning to the problems of navigating a three-dimensional environment and inferring the locations of human speaker audio sources within, in the case where the only available information is the raw sound from the environment, as a simulated human listener placed in the environment would hear it. For this purpose we create two virtual environments using the…

    Submitted 27 November, 2021; v1 submitted 25 October, 2021; originally announced October 2021.

    Comments: arXiv admin note: text overlap with arXiv:2105.04488

  6. A Deep Reinforcement Learning Approach to Audio-Based Navigation in a Multi-Speaker Environment

    Authors: Petros Giannakopoulos, Aggelos Pikrakis, Yannis Cotronis

    Abstract: In this work we use deep reinforcement learning to create an autonomous agent that can navigate in a two-dimensional space using only raw auditory sensory information from the environment, a problem that has received very little attention in the reinforcement learning literature. Our experiments show that the agent can successfully identify a particular target speaker among a set of $N$ predefined…

    Submitted 10 May, 2021; originally announced May 2021.

    Comments: To be published in ICASSP 2021

  7. arXiv:2102.08870  [pdf, other]

    cs.LG cs.DB

    Online Co-movement Pattern Prediction in Mobility Data

    Authors: Andreas Tritsarolis, Eva Chondrodima, Panagiotis Tampakis, Aggelos Pikrakis

    Abstract: Predictive analytics over mobility data are of great importance since they can assist an analyst to predict events, such as collisions, encounters, traffic jams, etc. A typical example of such analytics is future location prediction, where the goal is to predict the future location of a moving object, given a look-ahead time. What is even more challenging is being able to accurately predict collect…

    Submitted 17 February, 2021; originally announced February 2021.

  8. arXiv:1807.00069  [pdf, other]

    cs.SD eess.AS

    Exploratory Analysis of a Large Flamenco Corpus using an Ensemble of Convolutional Neural Networks as a Structural Annotation Backend

    Authors: Nadine Kroher, Aggelos Pikrakis

    Abstract: We present computational tools that we developed for the analysis of a large corpus of flamenco music recordings, along with the related exploratory findings. The proposed computational backend is based on a set of Convolutional Neural Networks that provide the structural annotation of each music recording with respect to the presence of vocals, guitar and hand-clapping ("palmas"). The resulting,…

    Submitted 29 June, 2018; originally announced July 2018.

  9. arXiv:1612.08391  [pdf, other]

    cs.IR

    Audio-based Distributional Semantic Models for Music Auto-tagging and Similarity Measurement

    Authors: Giannis Karamanolakis, Elias Iosif, Athanasia Zlatintsi, Aggelos Pikrakis, Alexandros Potamianos

    Abstract: The recent development of Audio-based Distributional Semantic Models (ADSMs) enables the computation of audio and lexical vector representations in a joint acoustic-semantic space. In this work, these joint representations are applied to the problem of automatic tag generation. The predicted tags together with their corresponding acoustic representation are exploited for the construction of acoust…

    Submitted 26 December, 2016; originally announced December 2016.