Skip to main content

Showing 1–2 of 2 results for author: Tětková, L

Searching in archive eess. Search in all archives.
.
  1. arXiv:2409.16302  [pdf, other

    eess.AS cs.CL cs.LG cs.SD

    How Redundant Is the Transformer Stack in Speech Representation Models?

    Authors: Teresa Dorszewski, Albert Kjøller Jacobsen, Lenka Tětková, Lars Kai Hansen

    Abstract: Self-supervised speech representation models, particularly those leveraging transformer architectures, have demonstrated remarkable performance across various tasks such as speech recognition, speaker identification, and emotion detection. Recent studies on transformer models revealed a high redundancy between layers and the potential for significant pruning, which we will investigate here for tra… ▽ More

    Submitted 17 January, 2025; v1 submitted 10 September, 2024; originally announced September 2024.

    Comments: To appear at ICASSP 2025 (excluding appendix)

  2. Convexity-based Pruning of Speech Representation Models

    Authors: Teresa Dorszewski, Lenka Tětková, Lars Kai Hansen

    Abstract: Speech representation models based on the transformer architecture and trained by self-supervised learning have shown great promise for solving tasks such as speech and speaker recognition, keyword spotting, emotion detection, and more. Typically, it is found that larger models lead to better performance. However, the significant computational effort involved in such large transformer systems is a… ▽ More

    Submitted 16 August, 2024; originally announced August 2024.

    Journal ref: 2024 IEEE 34th International Workshop on Machine Learning for Signal Processing (MLSP), London, United Kingdom, 2024, pp. 1-6,