Skip to main content

Showing 1–6 of 6 results for author: Hansen, L K

Searching in archive eess. Search in all archives.
.
  1. arXiv:2409.16302  [pdf, other

    eess.AS cs.CL cs.LG cs.SD

    How Redundant Is the Transformer Stack in Speech Representation Models?

    Authors: Teresa Dorszewski, Albert Kjøller Jacobsen, Lenka Tětková, Lars Kai Hansen

    Abstract: Self-supervised speech representation models, particularly those leveraging transformer architectures, have demonstrated remarkable performance across various tasks such as speech recognition, speaker identification, and emotion detection. Recent studies on transformer models revealed a high redundancy between layers and the potential for significant pruning, which we will investigate here for tra… ▽ More

    Submitted 17 January, 2025; v1 submitted 10 September, 2024; originally announced September 2024.

    Comments: To appear at ICASSP 2025 (excluding appendix)

  2. Convexity-based Pruning of Speech Representation Models

    Authors: Teresa Dorszewski, Lenka Tětková, Lars Kai Hansen

    Abstract: Speech representation models based on the transformer architecture and trained by self-supervised learning have shown great promise for solving tasks such as speech and speaker recognition, keyword spotting, emotion detection, and more. Typically, it is found that larger models lead to better performance. However, the significant computational effort involved in such large transformer systems is a… ▽ More

    Submitted 16 August, 2024; originally announced August 2024.

    Journal ref: 2024 IEEE 34th International Workshop on Machine Learning for Signal Processing (MLSP), London, United Kingdom, 2024, pp. 1-6,

  3. SPEED: Scalable Preprocessing of EEG Data for Self-Supervised Learning

    Authors: Anders Gjølbye, Lina Skerath, William Lehn-Schiøler, Nicolas Langer, Lars Kai Hansen

    Abstract: Electroencephalography (EEG) research typically focuses on tasks with narrowly defined objectives, but recent studies are expanding into the use of unlabeled data within larger models, aiming for a broader range of applications. This addresses a critical challenge in EEG research. For example, Kostas et al. (2021) show that self-supervised learning (SSL) outperforms traditional supervised methods.… ▽ More

    Submitted 23 September, 2024; v1 submitted 15 August, 2024; originally announced August 2024.

    Comments: To appear in proceedings of 2024 IEEE International workshop on Machine Learning for Signal Processing

  4. arXiv:2307.12745  [pdf, ps, other

    cs.LG eess.SP stat.ML

    Concept-based explainability for an EEG transformer model

    Authors: Anders Gjølbye, William Lehn-Schiøler, Áshildur Jónsdóttir, Bergdís Arnardóttir, Lars Kai Hansen

    Abstract: Deep learning models are complex due to their size, structure, and inherent randomness in training procedures. Additional complexity arises from the selection of datasets and inductive biases. Addressing these challenges for explainability, Kim et al. (2018) introduced Concept Activation Vectors (CAVs), which aim to understand deep models' internal states in terms of human-aligned concepts. These… ▽ More

    Submitted 22 August, 2024; v1 submitted 24 July, 2023; originally announced July 2023.

    Comments: To appear in proceedings of 2023 IEEE International workshop on Machine Learning for Signal Processing

  5. arXiv:2306.00561  [pdf, other

    cs.SD cs.AI eess.AS

    Masked Autoencoders with Multi-Window Local-Global Attention Are Better Audio Learners

    Authors: Sarthak Yadav, Sergios Theodoridis, Lars Kai Hansen, Zheng-Hua Tan

    Abstract: In this work, we propose a Multi-Window Masked Autoencoder (MW-MAE) fitted with a novel Multi-Window Multi-Head Attention (MW-MHA) module that facilitates the modelling of local-global interactions in every decoder transformer block through attention heads of several distinct local and global windows. Empirical results on ten downstream audio tasks show that MW-MAEs consistently outperform standar… ▽ More

    Submitted 1 October, 2023; v1 submitted 1 June, 2023; originally announced June 2023.

  6. arXiv:2010.15718  [pdf, other

    cs.CR cs.DC eess.IV

    Minimal Model Structure Analysis for Input Reconstruction in Federated Learning

    Authors: Jia Qian, Hiba Nassar, Lars Kai Hansen

    Abstract: \ac{fl} proposed a distributed \ac{ml} framework where every distributed worker owns a complete copy of global model and their own data. The training is occurred locally, which assures no direct transmission of training data. However, the recent work \citep{zhu2019deep} demonstrated that input data from a neural network may be reconstructed only using knowledge of gradients of that network, which… ▽ More

    Submitted 5 November, 2021; v1 submitted 29 October, 2020; originally announced October 2020.