Skip to main content

Showing 1–17 of 17 results for author: Hellander, A

Searching in archive cs. Search in all archives.
.
  1. arXiv:2505.11029  [pdf, ps, other

    cs.LG

    Exploiting the Asymmetric Uncertainty Structure of Pre-trained VLMs on the Unit Hypersphere

    Authors: Li Ju, Max Andersson, Stina Fredriksson, Edward Glöckner, Andreas Hellander, Ekta Vats, Prashant Singh

    Abstract: Vision-language models (VLMs) as foundation models have significantly enhanced performance across a wide range of visual and textual tasks, without requiring large-scale training from scratch for downstream tasks. However, these deterministic VLMs fail to capture the inherent ambiguity and uncertainty in natural language and visual data. Recent probabilistic post-hoc adaptation methods address thi… ▽ More

    Submitted 16 May, 2025; originally announced May 2025.

  2. arXiv:2505.08403  [pdf, ps, other

    cs.LG cs.AI stat.ML

    ConDiSim: Conditional Diffusion Models for Simulation Based Inference

    Authors: Mayank Nautiyal, Andreas Hellander, Prashant Singh

    Abstract: We present a conditional diffusion model - ConDiSim, for simulation-based inference of complex systems with intractable likelihoods. ConDiSim leverages denoising diffusion probabilistic models to approximate posterior distributions, consisting of a forward process that adds Gaussian noise to parameters, and a reverse process learning to denoise, conditioned on observed data. This approach effectiv… ▽ More

    Submitted 13 May, 2025; originally announced May 2025.

  3. arXiv:2411.14511  [pdf, other

    cs.LG cs.AI

    Variational Autoencoders for Efficient Simulation-Based Inference

    Authors: Mayank Nautiyal, Andrey Shternshis, Andreas Hellander, Prashant Singh

    Abstract: We present a generative modeling approach based on the variational inference framework for likelihood-free simulation-based inference. The method leverages latent variables within variational autoencoders to efficiently estimate complex posterior distributions arising from stochastic simulations. We explore two variations of this approach distinguished by their treatment of the prior distribution.… ▽ More

    Submitted 21 November, 2024; originally announced November 2024.

  4. Toward efficient resource utilization at edge nodes in federated learning

    Authors: Sadi Alawadi, Addi Ait-Mlouk, Salman Toor, Andreas Hellander

    Abstract: Federated learning (FL) enables edge nodes to collaboratively contribute to constructing a global model without sharing their data. This is accomplished by devices computing local, private model updates that are then aggregated by a server. However, computational resource constraints and network communication can become a severe bottleneck for larger model sizes typical for deep learning applicati… ▽ More

    Submitted 11 June, 2024; v1 submitted 19 September, 2023; originally announced September 2023.

    Comments: 16 pages, 5 tables, 8 figures

    Journal ref: 10 June 2024

  5. arXiv:2304.03228  [pdf, other

    cs.CL cs.AI cs.CR cs.LG

    FedBot: Enhancing Privacy in Chatbots with Federated Learning

    Authors: Addi Ait-Mlouk, Sadi Alawadi, Salman Toor, Andreas Hellander

    Abstract: Chatbots are mainly data-driven and usually based on utterances that might be sensitive. However, training deep learning models on shared data can violate user privacy. Such issues have commonly existed in chatbots since their inception. In the literature, there have been many approaches to deal with privacy, such as differential privacy and secure multi-party computation, but most of them need to… ▽ More

    Submitted 4 April, 2023; originally announced April 2023.

  6. arXiv:2301.09357  [pdf, other

    cs.LG cs.DC

    Accelerating Fair Federated Learning: Adaptive Federated Adam

    Authors: Li Ju, Tianru Zhang, Salman Toor, Andreas Hellander

    Abstract: Federated learning is a distributed and privacy-preserving approach to train a statistical model collaboratively from decentralized data of different parties. However, when datasets of participants are not independent and identically distributed (non-IID), models trained by naive federated algorithms may be biased towards certain participants, and model performance across participants is non-unifo… ▽ More

    Submitted 23 January, 2023; originally announced January 2023.

  7. arXiv:2202.04742  [pdf, other

    cs.CL cs.AI cs.LG

    FedQAS: Privacy-aware machine reading comprehension with federated learning

    Authors: Addi Ait-Mlouk, Sadi Alawadi, Salman Toor, Andreas Hellander

    Abstract: Machine reading comprehension (MRC) of text data is one important task in Natural Language Understanding. It is a complex NLP problem with a lot of ongoing research fueled by the release of the Stanford Question Answering Dataset (SQuAD) and Conversational Question Answering (CoQA). It is considered to be an effort to teach computers how to "understand" a text, and then to be able to answer questi… ▽ More

    Submitted 9 February, 2022; originally announced February 2022.

  8. Efficient Hierarchical Storage Management Framework Empowered by Reinforcement Learning

    Authors: Tianru Zhang, Salman Toor, Andreas Hellander

    Abstract: With the rapid development of big data and cloud computing, data management has become increasingly challenging. Over the years, a number of frameworks for data management and storage with various characteristics and features have become available. Most of these are highly efficient, but ultimately create data silos. It becomes difficult to move and work coherently with data as new requirements em… ▽ More

    Submitted 12 January, 2022; originally announced January 2022.

    Comments: 20 pages, 13 figures

  9. arXiv:2103.00148  [pdf, other

    cs.LG cs.DC

    Scalable federated machine learning with FEDn

    Authors: Morgan Ekmefjord, Addi Ait-Mlouk, Sadi Alawadi, Mattias Åkesson, Prashant Singh, Ola Spjuth, Salman Toor, Andreas Hellander

    Abstract: Federated machine learning has great promise to overcome the input privacy challenge in machine learning. The appearance of several projects capable of simulating federated learning has led to a corresponding rapid progress on algorithmic aspects of the problem. However, there is still a lack of federated machine learning frameworks that focus on fundamental aspects such as scalability, robustness… ▽ More

    Submitted 4 April, 2022; v1 submitted 27 February, 2021; originally announced March 2021.

    MSC Class: 68-04; 68T07

  10. arXiv:2102.06521  [pdf, other

    stat.ML cs.LG q-bio.QM

    Robust and integrative Bayesian neural networks for likelihood-free parameter inference

    Authors: Fredrik Wrede, Robin Eriksson, Richard Jiang, Linda Petzold, Stefan Engblom, Andreas Hellander, Prashant Singh

    Abstract: State-of-the-art neural network-based methods for learning summary statistics have delivered promising results for simulation-based likelihood-free parameter inference. Existing approaches require density estimation as a post-processing step building upon deterministic neural networks, and do not take network prediction uncertainty into account. This work proposes a robust integrated approach that… ▽ More

    Submitted 7 May, 2021; v1 submitted 12 February, 2021; originally announced February 2021.

  11. arXiv:2001.11760  [pdf, other

    stat.ML cs.LG

    Convolutional Neural Networks as Summary Statistics for Approximate Bayesian Computation

    Authors: Mattias Åkesson, Prashant Singh, Fredrik Wrede, Andreas Hellander

    Abstract: Approximate Bayesian Computation is widely used in systems biology for inferring parameters in stochastic gene regulatory network models. Its performance hinges critically on the ability to summarize high-dimensional system responses such as time series into a few informative, low-dimensional summary statistics. The quality of those statistics acutely impacts the accuracy of the inference task. Ex… ▽ More

    Submitted 12 April, 2021; v1 submitted 31 January, 2020; originally announced January 2020.

  12. arXiv:2001.10865  [pdf, other

    cs.DC

    Smart Resource Management for Data Streaming using an Online Bin-packing Strategy

    Authors: Oliver Stein, Ben Blamey, Johan Karlsson, Alan Sabirsh, Ola Spjuth, Andreas Hellander, Salman Toor

    Abstract: Data stream processing frameworks provide reliable and efficient mechanisms for executing complex workflows over large datasets. A common challenge for the majority of currently available streaming frameworks is efficient utilization of resources. Most frameworks use static or semi-static settings for resource utilization that work well for established use cases but lead to marginal improvements f… ▽ More

    Submitted 29 January, 2020; originally announced January 2020.

  13. arXiv:1912.09088  [pdf, other

    cs.DC

    Resource- and Message Size-Aware Scheduling of Stream Processing at the Edge with application to Realtime Microscopy

    Authors: Ben Blamey, Ida-Maria Sintorn, Andreas Hellander, Salman Toor

    Abstract: Whilst computational resources at the cloud edge can be leveraged to improve latency and reduce the costs of cloud services for a wide variety mobile, web, and IoT applications; such resources are naturally constrained. For distributed stream processing applications, there are clear advantages to offloading some processing work to the cloud edge. Many state of the art stream processing application… ▽ More

    Submitted 19 December, 2019; originally announced December 2019.

  14. arXiv:1901.07335  [pdf, other

    cs.DC

    Adapting The Secretary Hiring Problem for Optimal Hot-Cold Tier Placement under Top-$K$ Workloads

    Authors: Ben Blamey, Fredrik Wrede, Johan Karlsson, Andreas Hellander, Salman Toor

    Abstract: Top-K queries are an established heuristic in information retrieval. This paper presents an approach for optimal tiered storage allocation under stream processing workloads using this heuristic: those requiring the analysis of only the top-$K$ ranked most relevant, or most interesting, documents from a fixed-length stream, stream window, or batch job. In this workflow, documents are analyzed relev… ▽ More

    Submitted 12 March, 2019; v1 submitted 22 January, 2019; originally announced January 2019.

  15. arXiv:1807.07724  [pdf, other

    cs.DC

    Apache Spark Streaming, Kafka and HarmonicIO: A Performance Benchmark and Architecture Comparison for Enterprise and Scientific Computing

    Authors: Ben Blamey, Andreas Hellander, Salman Toor

    Abstract: This paper presents a benchmark of stream processing throughput comparing Apache Spark Streaming (under file-, TCP socket- and Kafka-based stream integration), with a prototype P2P stream processing framework, HarmonicIO. Maximum throughput for a spectrum of stream processing loads are measured, specifically, those with large message sizes (up to 10MB), and heavy CPU loads -- more typical of scien… ▽ More

    Submitted 19 December, 2019; v1 submitted 20 July, 2018; originally announced July 2018.

  16. arXiv:1805.08647  [pdf, ps, other

    stat.ML cs.LG

    Multi-Statistic Approximate Bayesian Computation with Multi-Armed Bandits

    Authors: Prashant Singh, Andreas Hellander

    Abstract: Approximate Bayesian computation is an established and popular method for likelihood-free inference with applications in many disciplines. The effectiveness of the method depends critically on the availability of well performing summary statistics. Summary statistic selection relies heavily on domain knowledge and carefully engineered features, and can be a laborious time consuming process. Since… ▽ More

    Submitted 22 May, 2018; originally announced May 2018.

  17. arXiv:1508.03604  [pdf, other

    cs.CE

    MOLNs: A cloud platform for interactive, reproducible and scalable spatial stochastic computational experiments in systems biology using PyURDME

    Authors: Brian Drawert, Michael Trogdon, Salman Toor, Linda Petzold, Andreas Hellander

    Abstract: Computational experiments using spatial stochastic simulations have led to important new biological insights, but they require specialized tools, a complex software stack, as well as large and scalable compute and data analysis resources due to the large computational cost associated with Monte Carlo computational workflows. The complexity of setting up and managing a large-scale distributed compu… ▽ More

    Submitted 14 August, 2015; originally announced August 2015.