Skip to main content

Showing 1–5 of 5 results for author: Purwins, H

Searching in archive cs. Search in all archives.
.
  1. arXiv:2309.03748  [pdf, other

    cs.CL cs.AI cs.LG stat.ML

    Enhancing Pipeline-Based Conversational Agents with Large Language Models

    Authors: Mina Foosherian, Hendrik Purwins, Purna Rathnayake, Touhidul Alam, Rui Teimao, Klaus-Dieter Thoben

    Abstract: The latest advancements in AI and deep learning have led to a breakthrough in large language model (LLM)-based agents such as GPT-4. However, many commercial conversational agent development tools are pipeline-based and have limitations in holding a human-like conversation. This paper investigates the capabilities of LLMs to enhance pipeline-based conversational agents during two phases: 1) in the… ▽ More

    Submitted 7 September, 2023; originally announced September 2023.

    MSC Class: 68T50 ACM Class: I.2.7

  2. End-To-End Dilated Variational Autoencoder with Bottleneck Discriminative Loss for Sound Morphing -- A Preliminary Study

    Authors: Matteo Lionello, Hendrik Purwins

    Abstract: We present a preliminary study on an end-to-end variational autoencoder (VAE) for sound morphing. Two VAE variants are compared: VAE with dilation layers (DC-VAE) and VAE only with regular convolutional layers (CC-VAE). We combine the following loss functions: 1) the time-domain mean-squared error for reconstructing the input signal, 2) the Kullback-Leibler divergence to the standard normal distri… ▽ More

    Submitted 19 November, 2020; originally announced November 2020.

  3. arXiv:1905.00078  [pdf, other

    cs.SD eess.AS stat.ML

    Deep Learning for Audio Signal Processing

    Authors: Hendrik Purwins, Bo Li, Tuomas Virtanen, Jan Schlüter, Shuo-yiin Chang, Tara Sainath

    Abstract: Given the recent surge in developments of deep learning, this article provides a review of the state-of-the-art deep learning techniques for audio signal processing. Speech, music, and environmental sound processing are considered side-by-side, in order to point out similarities and differences between the domains, highlighting general methods, problems, key references, and potential for cross-fer… ▽ More

    Submitted 25 May, 2019; v1 submitted 30 April, 2019; originally announced May 2019.

    Comments: 15 pages, 2 pdf figures

    ACM Class: I.2.6; H.5.1

    Journal ref: Journal of Selected Topics of Signal Processing 14, No. 8 (2019)

  4. arXiv:1712.00254  [pdf, other

    cs.SD eess.AS stat.ML

    Utilizing Domain Knowledge in End-to-End Audio Processing

    Authors: Tycho Max Sylvester Tax, Jose Luis Diez Antich, Hendrik Purwins, Lars Maaløe

    Abstract: End-to-end neural network based approaches to audio modelling are generally outperformed by models trained on high-level data representations. In this paper we present preliminary work that shows the feasibility of training the first layers of a deep convolutional neural network (CNN) model to learn the commonly-used log-scaled mel-spectrogram transformation. Secondly, we demonstrate that upon ini… ▽ More

    Submitted 1 December, 2017; originally announced December 2017.

    Comments: Accepted at the ML4Audio workshop at the NIPS 2017

  5. arXiv:1502.00524  [pdf, other

    cs.SD cs.IR cs.LG stat.ML

    Unsupervised Incremental Learning and Prediction of Music Signals

    Authors: Ricard Marxer, Hendrik Purwins

    Abstract: A system is presented that segments, clusters and predicts musical audio in an unsupervised manner, adjusting the number of (timbre) clusters instantaneously to the audio input. A sequence learning algorithm adapts its structure to a dynamically changing clustering tree. The flow of the system is as follows: 1) segmentation by onset detection, 2) timbre representation of each segment by Mel freque… ▽ More

    Submitted 23 October, 2015; v1 submitted 2 February, 2015; originally announced February 2015.

    Comments: 13 pages, 10 figures

    MSC Class: 68T05 ACM Class: I.2.6; H.5.5