Skip to main content

Showing 1–2 of 2 results for author: Pomirski, A

.
  1. arXiv:2505.07701  [pdf, ps, other

    cs.SD cs.AI eess.AS

    Lightweight End-to-end Text-to-speech Synthesis for low resource on-device applications

    Authors: Biel Tura Vecino, Adam Gabryś, Daniel Mątwicki, Andrzej Pomirski, Tom Iddon, Marius Cotescu, Jaime Lorenzo-Trueba

    Abstract: Recent works have shown that modelling raw waveform directly from text in an end-to-end (E2E) fashion produces more natural-sounding speech than traditional neural text-to-speech (TTS) systems based on a cascade or two-stage approach. However, current E2E state-of-the-art models are computationally complex and memory-consuming, making them unsuitable for real-time offline on-device applications in… ▽ More

    Submitted 12 May, 2025; originally announced May 2025.

    Comments: Published as a conference paper at SSW 2023

    Journal ref: 12th ISCA Speech Synthesis Workshop, 2023

  2. Exploring the Use of Contrastive Language-Image Pre-Training for Human Posture Classification: Insights from Yoga Pose Analysis

    Authors: Andrzej D. Dobrzycki, Ana M. Bernardos, Luca Bergesio, Andrzej Pomirski, Daniel Sáez-Trigueros

    Abstract: Accurate human posture classification in images and videos is crucial for automated applications across various fields, including work safety, physical rehabilitation, sports training, or daily assisted living. Recently, multimodal learning methods, such as Contrastive Language-Image Pretraining (CLIP), have advanced significantly in jointly understanding images and text. This study aims to assess… ▽ More

    Submitted 13 January, 2025; originally announced January 2025.

    Journal ref: Mathematics 2024, 12(1), 76