Showing 1–18 of 18 results for author: Mogren, O

  1. arXiv:2503.02422

    cs.SD cs.LG eess.AS

    Aggregation Strategies for Efficient Annotation of Bioacoustic Sound Events Using Active Learning

    Authors: Richard Lindholm, Oscar Marklund, Olof Mogren, John Martinsson

    Abstract: The vast amounts of audio data collected in Sound Event Detection (SED) applications require efficient annotation strategies to enable supervised learning. Manual labeling is expensive and time-consuming, making Active Learning (AL) a promising approach for reducing annotation effort. We introduce Top K Entropy, a novel uncertainty aggregation strategy for AL that prioritizes the most uncertain se…

    Submitted 4 March, 2025; originally announced March 2025.

  2. arXiv:2502.09363

    cs.LG

    The Accuracy Cost of Weakness: A Theoretical Analysis of Fixed-Segment Weak Labeling for Events in Time

    Authors: John Martinsson, Olof Mogren, Tuomas Virtanen, Maria Sandsten

    Abstract: Accurate labels are critical for deriving robust machine learning models. Labels are used to train supervised learning models and to evaluate most machine learning paradigms. In this paper, we model the accuracy and cost of a common weak labeling process where annotators assign presence or absence labels to fixed-length data segments for a given event class. The annotator labels a segment as "pres…

    Submitted 13 February, 2025; originally announced February 2025.

    Comments: Submitted to TMLR

  3. arXiv:2405.20287

    cs.LG cs.AI math.NA physics.flu-dyn

    Flexible SE(2) graph neural networks with applications to PDE surrogates

    Authors: Maria Bånkestad, Olof Mogren, Aleksis Pirinen

    Abstract: This paper presents a novel approach for constructing graph neural networks equivariant to 2D rotations and translations and leveraging them as PDE surrogates on non-gridded domains. We show that aligning the representations with the principal axis allows us to sidestep many constraints while preserving SE(2) equivariance. By applying our model as a surrogate for fluid flow simulations and conduct…

    Submitted 30 May, 2024; originally announced May 2024.

    Comments: 9 pages

  4. arXiv:2403.08525

    cs.SD cs.LG eess.AS

    From Weak to Strong Sound Event Labels using Adaptive Change-Point Detection and Active Learning

    Authors: John Martinsson, Olof Mogren, Maria Sandsten, Tuomas Virtanen

    Abstract: We propose an adaptive change point detection method (A-CPD) for machine guided weak label annotation of audio recording segments. The goal is to maximize the amount of information gained about the temporal activations of the target sounds. For each unlabeled audio recording, we use a prediction model to derive a probability curve used to guide annotation. The prediction model is initially pre-tra…

    Submitted 26 August, 2024; v1 submitted 13 March, 2024; originally announced March 2024.

    Comments: Accepted at EUSIPCO 2024 (nominated best student paper)

  5. arXiv:2403.04385

    cs.CV cs.LG

    Impacts of Color and Texture Distortions on Earth Observation Data in Deep Learning

    Authors: Martin Willbo, Aleksis Pirinen, John Martinsson, Edvin Listo Zec, Olof Mogren, Mikael Nilsson

    Abstract: Land cover classification and change detection are two important applications of remote sensing and Earth observation (EO) that have benefited greatly from the advances of deep learning. Convolutional and transformer-based U-net models are the state-of-the-art architectures for these tasks, and their performances have been boosted by an increased availability of large-scale annotated EO datasets.…

    Submitted 12 April, 2024; v1 submitted 7 March, 2024; originally announced March 2024.

  6. arXiv:2306.12768

    cs.LG

    Concept-aware clustering for decentralized deep learning under temporal shift

    Authors: Marcus Toftås, Emilie Klefbom, Edvin Listo Zec, Martin Willbo, Olof Mogren

    Abstract: Decentralized deep learning requires dealing with non-iid data across clients, which may also change over time due to temporal shifts. While non-iid data has been extensively studied in distributed settings, temporal shifts have received no attention. To the best of our knowledge, we are the first to tackle the novel and challenging problem of decentralized learning with non-iid and dynamic data.…

    Submitted 22 June, 2023; originally announced June 2023.

    Comments: 4 pages, 2 figures

  7. arXiv:2306.10869

    cs.CL cs.LG

    Grammatical gender in Swedish is predictable using recurrent neural networks

    Authors: Edvin Listo Zec, Olof Mogren

    Abstract: The grammatical gender of Swedish nouns is a mystery. While there are a few rules that can indicate the gender with some certainty, it does not in general depend on either the meaning or the structure of the word. In this paper we demonstrate the surprising fact that grammatical gender for Swedish nouns can be predicted with high accuracy using a recurrent neural network (RNN) working on the raw charact…

    Submitted 19 June, 2023; originally announced June 2023.

  8. arXiv:2304.01658

    cs.CV

    Fully Convolutional Networks for Dense Water Flow Intensity Prediction in Swedish Catchment Areas

    Authors: Aleksis Pirinen, Olof Mogren, Mårten Västerdal

    Abstract: Intensifying climate change will lead to more extreme weather events, including heavy rainfall and drought. Accurate stream flow prediction models which are adaptable and robust to new circumstances in a changing climate will be an important source of information for decisions on climate adaptation efforts, especially regarding mitigation of the risks of and damages associated with flooding. In th…

    Submitted 4 April, 2023; originally announced April 2023.

  9. arXiv:2301.12755

    cs.LG

    Efficient Node Selection in Private Personalized Decentralized Learning

    Authors: Edvin Listo Zec, Johan Östman, Olof Mogren, Daniel Gillblad

    Abstract: Personalized decentralized learning is a promising paradigm for distributed learning, enabling each node to train a local model on its own data and collaborate with other nodes to improve without sharing any data. However, this approach poses significant privacy risks, as nodes may inadvertently disclose sensitive information about their data or preferences through their collaboration choices. In…

    Submitted 15 January, 2024; v1 submitted 30 January, 2023; originally announced January 2023.

  10. arXiv:2206.11682

    cs.LG

    EFFGAN: Ensembles of fine-tuned federated GANs

    Authors: Ebba Ekblom, Edvin Listo Zec, Olof Mogren

    Abstract: Generative adversarial networks have proven to be a powerful tool for learning complex and high-dimensional data distributions, but issues such as mode collapse have been shown to make it difficult to train them. This is an even harder problem when the data is decentralized over several clients in a federated learning setup, as problems such as client drift and non-iid data make it hard for federa…

    Submitted 31 October, 2022; v1 submitted 23 June, 2022; originally announced June 2022.

  11. arXiv:2206.08839

    cs.LG

    Decentralized adaptive clustering of deep nets is beneficial for client collaboration

    Authors: Edvin Listo Zec, Ebba Ekblom, Martin Willbo, Olof Mogren, Sarunas Girdzijauskas

    Abstract: We study the problem of training personalized deep learning models in a decentralized peer-to-peer setting, focusing on the setting where data distributions differ between the clients and where different clients have different local learning tasks. We study both covariate and label shift, and our contribution is an algorithm which for each client finds beneficial collaborations based on a similari…

    Submitted 31 October, 2022; v1 submitted 17 June, 2022; originally announced June 2022.

  12. arXiv:2107.08517

    cs.LG

    Decentralized federated learning of deep neural networks on non-iid data

    Authors: Noa Onoszko, Gustav Karlsson, Olof Mogren, Edvin Listo Zec

    Abstract: We tackle the non-convex problem of learning a personalized deep learning model in a decentralized setting. More specifically, we study decentralized federated learning, a peer-to-peer setting where data is distributed among many clients and where there is no central server to orchestrate the training. In real world scenarios, the data distributions are often heterogeneous between clients. Therefo…

    Submitted 20 July, 2021; v1 submitted 18 July, 2021; originally announced July 2021.

    Comments: 7 pages, 2 figures

  13. arXiv:2102.00875

    cs.LG cs.CL cs.DC

    Scaling Federated Learning for Fine-tuning of Large Language Models

    Authors: Agrin Hilmkil, Sebastian Callh, Matteo Barbieri, Leon René Sütfeld, Edvin Listo Zec, Olof Mogren

    Abstract: Federated learning (FL) is a promising approach to distributed compute, as well as distributed data, and provides a level of privacy and compliance to legal frameworks. This makes FL attractive for both consumer and healthcare applications. While the area is actively being explored, few studies have examined FL in the context of larger language models and there is a lack of comprehensive reviews o…

    Submitted 1 February, 2021; originally announced February 2021.

  14. arXiv:2010.02056

    cs.LG

    Specialized federated learning using a mixture of experts

    Authors: Edvin Listo Zec, Olof Mogren, John Martinsson, Leon René Sütfeld, Daniel Gillblad

    Abstract: In federated learning, clients share a global model that has been trained on decentralized local client data. Although federated learning shows significant promise as a key approach when data cannot be shared or centralized, current methods show limited privacy properties and have shortcomings when applied to common real-world scenarios, especially when client data is heterogeneous. In this paper,…

    Submitted 8 February, 2021; v1 submitted 5 October, 2020; originally announced October 2020.

    Comments: 8 pages, 6 figures

  15. arXiv:2006.09114

    eess.AS cs.LG cs.SD

    Adversarial representation learning for private speech generation

    Authors: David Ericsson, Adam Östberg, Edvin Listo Zec, John Martinsson, Olof Mogren

    Abstract: As more and more data is collected in various settings across organizations, companies, and countries, there has been an increase in the demand for user privacy. Developing privacy preserving methods for data analytics is thus an important area of research. In this work we present a model based on generative adversarial networks (GANs) that learns to obfuscate specific sensitive attributes in speec…

    Submitted 17 June, 2020; v1 submitted 16 June, 2020; originally announced June 2020.

    Comments: Submitted to ICML 2020 Workshop on Self-supervision in Audio and Speech (SAS)

  16. arXiv:2006.08039

    cs.LG cs.CR stat.ML

    Adversarial representation learning for synthetic replacement of private attributes

    Authors: John Martinsson, Edvin Listo Zec, Daniel Gillblad, Olof Mogren

    Abstract: Data privacy is an increasingly important aspect of many real-world Data sources that contain sensitive information may have immense potential which could be unlocked using the right privacy enhancing transformations, but current methods often fail to produce convincing output. Furthermore, finding the right balance between privacy and utility is often a tricky trade-off. In this work, we propose…

    Submitted 8 February, 2021; v1 submitted 14 June, 2020; originally announced June 2020.

  17. arXiv:1611.09904

    cs.AI cs.LG

    C-RNN-GAN: Continuous recurrent neural networks with adversarial training

    Authors: Olof Mogren

    Abstract: Generative adversarial networks have been proposed as a way of efficiently training deep generative neural networks. We propose a generative adversarial model that works on continuous sequential data, and apply it by training it on a collection of classical music. We conclude that it generates music that sounds better and better as the model is trained, report statistics on generated music, and le…

    Submitted 29 November, 2016; originally announced November 2016.

    Comments: Accepted to Constructive Machine Learning Workshop (CML) at NIPS 2016 in Barcelona, Spain, December 10

  18. arXiv:0804.1115

    cs.DS cs.DC

    Adaptive Dynamics of Realistic Small-World Networks

    Authors: Olof Mogren, Oskar Sandberg, Vilhelm Verendel, Devdatt Dubhashi

    Abstract: Continuing in the steps of Jon Kleinberg's and others' celebrated work on decentralized search in small-world networks, we conduct an experimental analysis of a dynamic algorithm that produces small-world networks. We find that the algorithm adapts robustly to a wide variety of situations in realistic geographic networks with synthetic test data and with real world data, even when vertices are un…

    Submitted 7 April, 2008; originally announced April 2008.