Skip to main content

Showing 1–50 of 68 results for author: De Vos, M

Searching in archive cs. Search in all archives.
.
  1. arXiv:2509.13854  [pdf, ps, other

    cs.CY cs.AI

    Understanding the Process of Human-AI Value Alignment

    Authors: Jack McKinlay, Marina De Vos, Janina A. Hoffmann, Andreas Theodorou

    Abstract: Background: Value alignment in computer science research is often used to refer to the process of aligning artificial intelligence with humans, but the way the phrase is used often lacks precision. Objectives: In this paper, we conduct a systematic literature review to advance the understanding of value alignment in artificial intelligence by characterising the topic in the context of its research… ▽ More

    Submitted 17 September, 2025; originally announced September 2025.

    Comments: 39 pages, 7 figures

  2. arXiv:2509.01253  [pdf, ps, other

    cs.CR cs.LG

    Practical and Private Hybrid ML Inference with Fully Homomorphic Encryption

    Authors: Sayan Biswas, Philippe Chartier, Akash Dhasade, Tom Jurien, David Kerriou, Anne-Marie Kerrmarec, Mohammed Lemou, Franklin Tranie, Martijn de Vos, Milos Vujasinovic

    Abstract: In contemporary cloud-based services, protecting users' sensitive data and ensuring the confidentiality of the server's model are critical. Fully homomorphic encryption (FHE) enables inference directly on encrypted inputs, but its practicality is hindered by expensive bootstrapping and inefficient approximations of non-linear activations. We introduce Safhire, a hybrid inference framework that exe… ▽ More

    Submitted 1 September, 2025; originally announced September 2025.

  3. arXiv:2509.00102  [pdf, ps, other

    cs.LG cs.AI

    Exploiting a Mixture-of-Layers in an Electrocardiography Foundation Model

    Authors: Phu X. Nguyen, Huy Phan, Hieu Pham, Christos Chatzichristos, Bert Vandenberk, Maarten De Vos

    Abstract: Transformer-based foundation models for Electrocardiograms (ECGs) have recently achieved impressive performance in many downstream applications. However, the internal representations of such models across layers have not been fully understood and exploited. An important question arises: Does the final layer of the pre-trained Transformer model, the \emph{de facto} representational layer, provide o… ▽ More

    Submitted 27 August, 2025; originally announced September 2025.

  4. arXiv:2506.12417  [pdf, ps, other

    cs.DC

    HarMoEny: Efficient Multi-GPU Inference of MoE Models

    Authors: Zachary Doucet, Rishi Sharma, Martijn de Vos, Rafael Pires, Anne-Marie Kermarrec, Oana Balmau

    Abstract: Mixture-of-Experts (MoE) models offer computational efficiency during inference by activating only a subset of specialized experts for a given input. This enables efficient model scaling on multi-GPU systems that use expert parallelism without compromising performance. However, load imbalance among experts and GPUs introduces waiting times, which can significantly increase inference latency. To ad… ▽ More

    Submitted 17 June, 2025; v1 submitted 14 June, 2025; originally announced June 2025.

  5. arXiv:2505.21550  [pdf, ps, other

    cs.NI cs.AI cs.MA

    Collaborative Agentic AI Needs Interoperability Across Ecosystems

    Authors: Rishi Sharma, Martijn de Vos, Pradyumna Chari, Ramesh Raskar, Anne-Marie Kermarrec

    Abstract: Collaborative agentic AI is projected to transform entire industries by enabling AI-powered agents to autonomously perceive, plan, and act within digital environments. Yet, current solutions in this field are all built in isolation, and we are rapidly heading toward a landscape of fragmented, incompatible ecosystems. In this position paper, we argue that interoperability, achieved by the adoption… ▽ More

    Submitted 25 May, 2025; originally announced May 2025.

  6. arXiv:2505.04796  [pdf, ps, other

    cs.LG

    Robust ML Auditing using Prior Knowledge

    Authors: Jade Garcia Bourrée, Augustin Godinot, Martijn De Vos, Milos Vujasinovic, Sayan Biswas, Gilles Tredan, Erwan Le Merrer, Anne-Marie Kermarrec

    Abstract: Among the many technical challenges to enforcing AI regulations, one crucial yet underexplored problem is the risk of audit manipulation. This manipulation occurs when a platform deliberately alters its answers to a regulator to pass an audit without modifying its answers to other users. In this paper, we introduce a novel approach to manipulation-proof auditing by taking into account the auditor'… ▽ More

    Submitted 22 May, 2025; v1 submitted 7 May, 2025; originally announced May 2025.

    Comments: Accepted to the 42nd International Conference on Machine Learning ICML25

  7. Generative AI for Research Data Processing: Lessons Learnt From Three Use Cases

    Authors: Modhurita Mitra, Martine G. de Vos, Nicola Cortinovis, Dawa Ometto

    Abstract: There has been enormous interest in generative AI since ChatGPT was launched in 2022. However, there are concerns about the accuracy and consistency of the outputs of generative AI. We have carried out an exploratory study on the application of this new technology in research data processing. We identified tasks for which rule-based or traditional machine learning approaches were difficult to appl… ▽ More

    Submitted 22 April, 2025; originally announced April 2025.

    Comments: 10 pages, 4 figures, 6 tables. Published in Proceedings of the 2024 IEEE 20th International Conference on e-Science (e-Science), Osaka, Japan

    MSC Class: 68T50 ACM Class: I.2.7

  8. arXiv:2503.08467  [pdf, other

    cs.LG cs.AI cs.DC

    Accelerating MoE Model Inference with Expert Sharding

    Authors: Oana Balmau, Anne-Marie Kermarrec, Rafael Pires, André Loureiro Espírito Santo, Martijn de Vos, Milos Vujasinovic

    Abstract: Mixture of experts (MoE) models achieve state-of-the-art results in language modeling but suffer from inefficient hardware utilization due to imbalanced token routing and communication overhead. While prior work has focused on optimizing MoE training and decoder architectures, inference for encoder-based MoE models in a multi-GPU with expert parallelism setting remains underexplored. We introduce… ▽ More

    Submitted 11 March, 2025; originally announced March 2025.

    Comments: To appear in the proceedings of the 5th Workshop on Machine Learning and Systems (EuroMLSys 25)

  9. arXiv:2503.05530  [pdf, ps, other

    cs.DB cs.LG cs.PF

    Leveraging Approximate Caching for Faster Retrieval-Augmented Generation

    Authors: Shai Bergman, Zhang Ji, Anne-Marie Kermarrec, Diana Petrescu, Rafael Pires, Mathis Randl, Martijn de Vos

    Abstract: Retrieval-augmented generation (RAG) improves the reliability of large language model (LLM) answers by integrating external knowledge. However, RAG increases the end-to-end inference time since looking for relevant documents from large vector databases is computationally expensive. To address this, we introduce Proximity, an approximate key-value cache that optimizes the RAG workflow by leveraging… ▽ More

    Submitted 2 September, 2025; v1 submitted 7 March, 2025; originally announced March 2025.

  10. Practical Federated Learning without a Server

    Authors: Akash Dhasade, Anne-Marie Kermarrec, Erick Lavoie, Johan Pouwelse, Rishi Sharma, Martijn de Vos

    Abstract: Federated Learning (FL) enables end-user devices to collaboratively train ML models without sharing raw data, thereby preserving data privacy. In FL, a central parameter server coordinates the learning process by iteratively aggregating the trained models received from clients. Yet, deploying a central server is not always feasible due to hardware unavailability, infrastructure constraints, or ope… ▽ More

    Submitted 7 March, 2025; originally announced March 2025.

    Comments: To appear in the proceedings of EuroMLSys'25

  11. arXiv:2502.19280  [pdf, other

    cs.LG cs.DC cs.IR

    Efficient Federated Search for Retrieval-Augmented Generation

    Authors: Rachid Guerraoui, Anne-Marie Kermarrec, Diana Petrescu, Rafael Pires, Mathis Randl, Martijn de Vos

    Abstract: Large language models (LLMs) have demonstrated remarkable capabilities across various domains but remain susceptible to hallucinations and inconsistencies, limiting their reliability. Retrieval-augmented generation (RAG) mitigates these issues by grounding model responses in external knowledge sources. Existing RAG workflows often leverage a single vector database, which is impractical in the comm… ▽ More

    Submitted 26 February, 2025; originally announced February 2025.

    Comments: To appear in the proceedings of EuroMLSys'25

  12. arXiv:2411.07335  [pdf, ps, other

    cs.LG cs.AI cs.CV cs.GT cs.MM

    Balancing Multimodal Training Through Game-Theoretic Regularization

    Authors: Konstantinos Kontras, Thomas Strypsteen, Christos Chatzichristos, Paul Pu Liang, Matthew Blaschko, Maarten De Vos

    Abstract: Multimodal learning holds promise for richer information extraction by capturing dependencies across data sources. Yet, current training methods often underperform due to modality competition, a phenomenon where modalities contend for training resources leaving some underoptimized. This raises a pivotal question: how can we address training imbalances, ensure adequate optimization across all modal… ▽ More

    Submitted 1 October, 2025; v1 submitted 11 November, 2024; originally announced November 2024.

    Comments: 23 pages, 7 figures, 6 tables, 1 algorithm

  13. Boosting Asynchronous Decentralized Learning with Model Fragmentation

    Authors: Sayan Biswas, Anne-Marie Kermarrec, Alexis Marouani, Rafael Pires, Rishi Sharma, Martijn de Vos

    Abstract: Decentralized learning (DL) is an emerging technique that allows nodes on the web to collaboratively train machine learning models without sharing raw data. Dealing with stragglers, i.e., nodes with slower compute or communication than others, is a key challenge in DL. We present DivShare, a novel asynchronous DL algorithm that achieves fast model convergence in the presence of communication strag… ▽ More

    Submitted 3 February, 2025; v1 submitted 16 October, 2024; originally announced October 2024.

    Comments: Accepted to appear in the Proceedings of the ACM Web Conference 2025 (WWW '25)

  14. Fair Decentralized Learning

    Authors: Sayan Biswas, Anne-Marie Kermarrec, Rishi Sharma, Thibaud Trinca, Martijn de Vos

    Abstract: Decentralized learning (DL) is an emerging approach that enables nodes to collaboratively train a machine learning model without sharing raw data. In many application domains, such as healthcare, this approach faces challenges due to the high level of heterogeneity in the training data's feature space. Such feature heterogeneity lowers model utility and negatively impacts fairness, particularly fo… ▽ More

    Submitted 24 January, 2025; v1 submitted 3 October, 2024; originally announced October 2024.

    Comments: To appear in the proceedings of "3rd IEEE Conference on Secure and Trustworthy Machine Learning" (SatML'25)

  15. arXiv:2409.04104  [pdf, other

    cs.LG cs.AI cs.CV cs.HC eess.SP

    MixNet: Joining Force of Classical and Modern Approaches Toward the Comprehensive Pipeline in Motor Imagery EEG Classification

    Authors: Phairot Autthasan, Rattanaphon Chaisaen, Huy Phan, Maarten De Vos, Theerawit Wilaiprasitporn

    Abstract: Recent advances in deep learning (DL) have significantly impacted motor imagery (MI)-based brain-computer interface (BCI) systems, enhancing the decoding of electroencephalography (EEG) signals. However, most studies struggle to identify discriminative patterns across subjects during MI tasks, limiting MI classification performance. In this article, we propose MixNet, a novel classification framew… ▽ More

    Submitted 6 September, 2024; originally announced September 2024.

    Comments: Supplementary materials and source codes are available on-line at https://github.com/Max-Phairot-A/MixNet

    Journal ref: IEEE Internet of Things Journal 2024

  16. PeerSwap: A Peer-Sampler with Randomness Guarantees

    Authors: Rachid Guerraoui, Anne-Marie Kermarrec, Anastasiia Kucherenko, Rafael Pinot, Martijn de Vos

    Abstract: The ability of a peer-to-peer (P2P) system to effectively host decentralized applications often relies on the availability of a peer-sampling service, which provides each participant with a random sample of other peers. Despite the practical effectiveness of existing peer samplers, their ability to produce random samples within a reasonable time frame remains poorly understood from a theoretical s… ▽ More

    Submitted 1 October, 2024; v1 submitted 7 August, 2024; originally announced August 2024.

    Journal ref: 2024 43rd International Symposium on Reliable Distributed Systems (SRDS), IEEE

  17. arXiv:2407.01283  [pdf, other

    cs.LG cs.DC

    Energy-Aware Decentralized Learning with Intermittent Model Training

    Authors: Akash Dhasade, Paolo Dini, Elia Guerra, Anne-Marie Kermarrec, Marco Miozzo, Rafael Pires, Rishi Sharma, Martijn de Vos

    Abstract: Decentralized learning (DL) offers a powerful framework where nodes collaboratively train models without sharing raw data and without the coordination of a central server. In the iterative rounds of DL, models are trained locally, shared with neighbors in the topology, and aggregated with other models received from neighbors. Sharing and merging models contribute to convergence towards a consensus… ▽ More

    Submitted 1 July, 2024; originally announced July 2024.

  18. arXiv:2405.15644  [pdf, other

    cs.LG cs.DC

    Harnessing Increased Client Participation with Cohort-Parallel Federated Learning

    Authors: Akash Dhasade, Anne-Marie Kermarrec, Tuan-Anh Nguyen, Rafael Pires, Martijn de Vos

    Abstract: Federated learning (FL) is a machine learning approach where nodes collaboratively train a global model. As more nodes participate in a round of FL, the effectiveness of individual model updates by nodes also diminishes. In this study, we increase the effectiveness of client updates by dividing the network into smaller partitions, or cohorts. We introduce Cohort-Parallel Federated Learning (CPFL):… ▽ More

    Submitted 10 March, 2025; v1 submitted 24 May, 2024; originally announced May 2024.

    Comments: To appear in the proceedings of EuroMLSys'25

  19. arXiv:2405.07930  [pdf, other

    cs.MM cs.CV cs.LG cs.SD eess.AS

    Improving Multimodal Learning with Multi-Loss Gradient Modulation

    Authors: Konstantinos Kontras, Christos Chatzichristos, Matthew Blaschko, Maarten De Vos

    Abstract: Learning from multiple modalities, such as audio and video, offers opportunities for leveraging complementary information, enhancing robustness, and improving contextual understanding and performance. However, combining such modalities presents challenges, especially when modalities differ in data structure, predictive contribution, and the complexity of their learning processes. It has been obser… ▽ More

    Submitted 14 October, 2024; v1 submitted 13 May, 2024; originally announced May 2024.

  20. arXiv:2404.09536  [pdf, other

    cs.DC cs.AI cs.CR cs.LG

    Noiseless Privacy-Preserving Decentralized Learning

    Authors: Sayan Biswas, Mathieu Even, Anne-Marie Kermarrec, Laurent Massoulie, Rafael Pires, Rishi Sharma, Martijn de Vos

    Abstract: Decentralized learning (DL) enables collaborative learning without a server and without training data leaving the users' devices. However, the models shared in DL can still be used to infer training data. Conventional defenses such as differential privacy and secure aggregation fall short in effectively safeguarding user privacy in DL, either sacrificing model utility or efficiency. We introduce S… ▽ More

    Submitted 12 September, 2024; v1 submitted 15 April, 2024; originally announced April 2024.

    Comments: Accepted at PETS 2025

  21. arXiv:2402.08522  [pdf, other

    cs.LG

    Fairness Auditing with Multi-Agent Collaboration

    Authors: Martijn de Vos, Akash Dhasade, Jade Garcia Bourrée, Anne-Marie Kermarrec, Erwan Le Merrer, Benoit Rottembourg, Gilles Tredan

    Abstract: Existing work in fairness auditing assumes that each audit is performed independently. In this paper, we consider multiple agents working together, each auditing the same platform for different tasks. Agents have two levers: their collaboration strategy, with or without coordination beforehand, and their strategy for sampling appropriate data points. We theoretically compare the interplay of these… ▽ More

    Submitted 16 August, 2024; v1 submitted 13 February, 2024; originally announced February 2024.

    Comments: 14 pages, 7 figures, ECAI

  22. arXiv:2311.15603  [pdf, other

    cs.LG cs.AI

    QuickDrop: Efficient Federated Unlearning by Integrated Dataset Distillation

    Authors: Akash Dhasade, Yaohong Ding, Song Guo, Anne-marie Kermarrec, Martijn De Vos, Leijie Wu

    Abstract: Federated Unlearning (FU) aims to delete specific training data from an ML model trained using Federated Learning (FL). We introduce QuickDrop, an efficient and original FU method that utilizes dataset distillation (DD) to accelerate unlearning and drastically reduces computational overhead compared to existing approaches. In QuickDrop, each client uses DD to generate a compact dataset representat… ▽ More

    Submitted 6 December, 2024; v1 submitted 27 November, 2023; originally announced November 2023.

    Comments: Accepted by Middleware 2024

  23. arXiv:2310.01972  [pdf, other

    cs.LG cs.DC

    Epidemic Learning: Boosting Decentralized Learning with Randomized Communication

    Authors: Martijn de Vos, Sadegh Farhadkhani, Rachid Guerraoui, Anne-Marie Kermarrec, Rafael Pires, Rishi Sharma

    Abstract: We present Epidemic Learning (EL), a simple yet powerful decentralized learning (DL) algorithm that leverages changing communication topologies to achieve faster model convergence compared to conventional DL approaches. At each round of EL, each node sends its model updates to a random sample of $s$ other nodes (in a system of $n$ nodes). We provide an extensive theoretical analysis of EL, demonst… ▽ More

    Submitted 27 October, 2023; v1 submitted 3 October, 2023; originally announced October 2023.

    Comments: Accepted paper at NeurIPS 2023

  24. arXiv:2306.16431  [pdf, other

    cs.LG cs.AI

    Increasing Performance And Sample Efficiency With Model-agnostic Interactive Feature Attributions

    Authors: Joran Michiels, Maarten De Vos, Johan Suykens

    Abstract: Model-agnostic feature attributions can provide local insights in complex ML models. If the explanation is correct, a domain expert can validate and trust the model's decision. However, if it contradicts the expert's knowledge, related work only corrects irrelevant features to improve the model. To allow for unlimited interaction, in this paper we provide model-agnostic implementations for two pop… ▽ More

    Submitted 28 June, 2023; originally announced June 2023.

  25. arXiv:2306.10880  [pdf, other

    cs.LG cs.AI

    Explaining the Model and Feature Dependencies by Decomposition of the Shapley Value

    Authors: Joran Michiels, Maarten De Vos, Johan Suykens

    Abstract: Shapley values have become one of the go-to methods to explain complex models to end-users. They provide a model agnostic post-hoc explanation with foundations in game theory: what is the worth of a player (in machine learning, a feature value) in the objective function (the output of the complex machine learning model). One downside is that they always require outputs of the model when some featu… ▽ More

    Submitted 19 June, 2023; originally announced June 2023.

  26. arXiv:2306.04663  [pdf, ps, other

    eess.SP cs.LG

    U-PASS: an Uncertainty-guided deep learning Pipeline for Automated Sleep Staging

    Authors: Elisabeth R. M. Heremans, Nabeel Seedat, Bertien Buyse, Dries Testelmans, Mihaela van der Schaar, Maarten De Vos

    Abstract: As machine learning becomes increasingly prevalent in critical fields such as healthcare, ensuring the safety and reliability of machine learning systems becomes paramount. A key component of reliability is the ability to estimate uncertainty, which enables the identification of areas of high and low confidence and helps to minimize the risk of error. In this study, we propose a machine learning p… ▽ More

    Submitted 7 June, 2023; originally announced June 2023.

  27. arXiv:2304.06678  [pdf

    physics.med-ph cs.CY physics.bio-ph

    From the digital twins in healthcare to the Virtual Human Twin: a moon-shot project for digital health research

    Authors: Marco Viceconti, Maarten De Vos, Sabato Mellone, Liesbet Geris

    Abstract: The idea of a systematic digital representation of the entire known human pathophysiology, which we could call the Virtual Human Twin, has been around for decades. To date, most research groups focused instead on developing highly specialised, highly focused patient-specific models able to predict specific quantities of clinical relevance. While it has facilitated harvesting the low-hanging fruits… ▽ More

    Submitted 12 August, 2023; v1 submitted 27 March, 2023; originally announced April 2023.

  28. arXiv:2304.06485  [pdf, ps, other

    eess.SP cs.AI cs.LG

    CoRe-Sleep: A Multimodal Fusion Framework for Time Series Robust to Imperfect Modalities

    Authors: Konstantinos Kontras, Christos Chatzichristos, Huy Phan, Johan Suykens, Maarten De Vos

    Abstract: Sleep abnormalities can have severe health consequences. Automated sleep staging, i.e. labelling the sequence of sleep stages from the patient's physiological recordings, could simplify the diagnostic process. Previous work on automated sleep staging has achieved great results, mainly relying on the EEG signal. However, often multiple sources of information are available beyond EEG. This can be pa… ▽ More

    Submitted 27 March, 2023; originally announced April 2023.

    Comments: 10 pages, 4 figures, 2 tables, journal

  29. arXiv:2302.13837  [pdf, other

    cs.DC

    Decentralized Learning Made Practical with Client Sampling

    Authors: Martijn de Vos, Akash Dhasade, Anne-Marie Kermarrec, Erick Lavoie, Johan Pouwelse, Rishi Sharma

    Abstract: Decentralized learning (DL) leverages edge devices for collaborative model training while avoiding coordination by a central server. Due to privacy concerns, DL has become an attractive alternative to centralized learning schemes since training data never leaves the device. In a round of DL, all nodes participate in model training and exchange their model with some other nodes. Performing DL in la… ▽ More

    Submitted 7 May, 2024; v1 submitted 27 February, 2023; originally announced February 2023.

  30. arXiv:2301.04508  [pdf, other

    cs.DC

    A Deployment-First Methodology to Mechanism Design and Refinement in Distributed Systems

    Authors: Martijn de Vos, Georgy Ishmaev, Johan Pouwelse, Stefanie Roos

    Abstract: Catalyzed by the popularity of blockchain technology, there has recently been a renewed interest in the design, implementation and evaluation of decentralized systems. Most of these systems are intended to be deployed at scale and in heterogeneous environments with real users and unpredictable workloads. Nevertheless, most research in this field evaluates such systems in controlled environments th… ▽ More

    Submitted 11 January, 2023; originally announced January 2023.

    Comments: Accepted for publication at the PerFail'23 workshop

  31. arXiv:2301.03441  [pdf, ps, other

    eess.SP cs.LG

    L-SeqSleepNet: Whole-cycle Long Sequence Modelling for Automatic Sleep Staging

    Authors: Huy Phan, Kristian P. Lorenzen, Elisabeth Heremans, Oliver Y. Chén, Minh C. Tran, Philipp Koch, Alfred Mertins, Mathias Baumert, Kaare Mikkelsen, Maarten De Vos

    Abstract: Human sleep is cyclical with a period of approximately 90 minutes, implying long temporal dependency in the sleep data. Yet, exploring this long-term dependency when developing sleep staging models has remained untouched. In this work, we show that while encoding the logic of a whole sleep cycle is crucial to improve sleep staging performance, the sequential modelling approach in existing state-of… ▽ More

    Submitted 4 August, 2023; v1 submitted 9 January, 2023; originally announced January 2023.

    Comments: This article has been published in IEEE Journal of Biomedical and Health Informatics (JBHI). Source code is available at http://github.com/pquochuy/l-seqsleepnet

  32. arXiv:2209.11007  [pdf, other

    eess.SP cs.LG

    Avoiding Post-Processing with Event-Based Detection in Biomedical Signals

    Authors: Nick Seeuws, Maarten De Vos, Alexander Bertrand

    Abstract: Objective: Finding events of interest is a common task in biomedical signal processing. The detection of epileptic seizures and signal artefacts are two key examples. Epoch-based classification is the typical machine learning framework to detect such signal events because of the straightforward application of classical machine learning techniques. Usually, post-processing is required to achieve go… ▽ More

    Submitted 7 July, 2023; v1 submitted 22 September, 2022; originally announced September 2022.

    Comments: This work has been submitted to the IEEE for possible publication

  33. arXiv:2208.11254  [pdf, other

    cs.DC cs.PF

    Gromit: Benchmarking the Performance and Scalability of Blockchain Systems

    Authors: Bulat Nasrulin, Martijn De Vos, Georgy Ishmaev, Johan Pouwelse

    Abstract: The growing number of implementations of blockchain systems stands in stark contrast with still limited research on a systematic comparison of performance characteristics of these solutions. Such research is crucial for evaluating fundamental trade-offs introduced by novel consensus protocols and their implementations. These performance limitations are commonly analyzed with ad-hoc benchmarking fr… ▽ More

    Submitted 23 August, 2022; originally announced August 2022.

  34. arXiv:2203.14996  [pdf, other

    cs.CL cs.LG

    Comparing in context: Improving cosine similarity measures with a metric tensor

    Authors: Isa M. Apallius de Vos, Ghislaine L. van den Boogerd, Mara D. Fennema, Adriana D. Correia

    Abstract: Cosine similarity is a widely used measure of the relatedness of pre-trained word embeddings, trained on a language modeling goal. Datasets such as WordSim-353 and SimLex-999 rate how similar words are according to human annotators, and as such are often used to evaluate the performance of language models. Thus, any improvement on the word similarity task requires an improved word representation.… ▽ More

    Submitted 28 March, 2022; originally announced March 2022.

    Comments: Presented at the 18th International Conference in Natural Language Processing (ICON `21). 11 pages, 3 figures, 6 tables

  35. arXiv:2201.00644  [pdf, ps, other

    eess.SP cs.LG

    Feature matching as improved transfer learning technique for wearable EEG

    Authors: Elisabeth R. M. Heremans, Huy Phan, Amir H. Ansari, Pascal Borzée, Bertien Buyse, Dries Testelmans, Maarten De Vos

    Abstract: Objective: With the rapid rise of wearable sleep monitoring devices with non-conventional electrode configurations, there is a need for automated algorithms that can perform sleep staging on configurations with small amounts of labeled data. Transfer learning has the ability to adapt neural network weights from a source modality (e.g. standard electrode configuration) to a new target modality (e.g… ▽ More

    Submitted 29 December, 2021; originally announced January 2022.

    Comments: 14 pages, 6 figues

  36. arXiv:2110.11006  [pdf, other

    cs.DC cs.LG

    Bristle: Decentralized Federated Learning in Byzantine, Non-i.i.d. Environments

    Authors: Joost Verbraeken, Martijn de Vos, Johan Pouwelse

    Abstract: Federated learning (FL) is a privacy-friendly type of machine learning where devices locally train a model on their private data and typically communicate model updates with a server. In decentralized FL (DFL), peers communicate model updates with each other instead. However, DFL is challenging since (1) the training data possessed by different peers is often non-i.i.d. (i.e., distributed differen… ▽ More

    Submitted 21 October, 2021; originally announced October 2021.

  37. SleepTransformer: Automatic Sleep Staging with Interpretability and Uncertainty Quantification

    Authors: Huy Phan, Kaare Mikkelsen, Oliver Y. Chén, Philipp Koch, Alfred Mertins, Maarten De Vos

    Abstract: Background: Black-box skepticism is one of the main hindrances impeding deep-learning-based automatic sleep scoring from being used in clinical environments. Methods: Towards interpretability, this work proposes a sequence-to-sequence sleep-staging model, namely SleepTransformer. It is based on the transformer backbone and offers interpretability of the model's decisions at both the epoch and sequ… ▽ More

    Submitted 26 January, 2022; v1 submitted 23 May, 2021; originally announced May 2021.

    Comments: This article has been published in IEEE Transactions on Biomedical Engineering

  38. arXiv:2105.02175  [pdf, other

    cs.CR

    Automatic de-identification of Data Download Packages

    Authors: Laura Boeschoten, Roos Voorvaart, Casper Kaandorp, Ruben van den Goorbergh, Martine de Vos

    Abstract: The General Data Protection Regulation (GDPR) grants all natural persons the right of access to their personal data if this is being processed by data controllers. The data controllers are obliged to share the data in an electronic format and often provide the data in a so called Data Download Package (DDP). These DDPs contain all data collected by public and private entities during the course of… ▽ More

    Submitted 4 May, 2021; originally announced May 2021.

  39. arXiv:2104.02612  [pdf, other

    cs.CR cs.SE

    ASTANA: Practical String Deobfuscation for Android Applications Using Program Slicing

    Authors: Martijn de Vos, Johan Pouwelse

    Abstract: Software obfuscation is widely used by Android developers to protect the source code of their applications against adversarial reverse-engineering efforts. A specific type of obfuscation, string obfuscation, transforms the content of all string literals in the source code to non-interpretable text and inserts logic to deobfuscate these string literals at runtime. In this work, we demonstrate that… ▽ More

    Submitted 6 April, 2021; originally announced April 2021.

  40. arXiv:2103.09171  [pdf, other

    cs.LG eess.SP

    Interpretable Deep Learning for the Remote Characterisation of Ambulation in Multiple Sclerosis using Smartphones

    Authors: Andrew P. Creagh, Florian Lipsmeier, Michael Lindemann, Maarten De Vos

    Abstract: The emergence of digital technologies such as smartphones in healthcare applications have demonstrated the possibility of developing rich, continuous, and objective measures of multiple sclerosis (MS) disability that can be administered remotely and out-of-clinic. In this work, deep convolutional neural networks (DCNN) applied to smartphone inertial sensor data were shown to better distinguish hea… ▽ More

    Submitted 22 June, 2021; v1 submitted 16 March, 2021; originally announced March 2021.

  41. arXiv:2102.12245  [pdf, other

    cs.LG eess.SP

    Estimation of Continuous Blood Pressure from PPG via a Federated Learning Approach

    Authors: Eoin Brophy, Maarten De Vos, Geraldine Boylan, Tomas Ward

    Abstract: Ischemic heart disease is the highest cause of mortality globally each year. This not only puts a massive strain on the lives of those affected but also on the public healthcare systems. To understand the dynamics of the healthy and unhealthy heart doctors commonly use electrocardiogram (ECG) and blood pressure (BP) readings. These methods are often quite invasive, in particular when continuous ar… ▽ More

    Submitted 24 February, 2021; originally announced February 2021.

  42. arXiv:2008.09524  [pdf, other

    cs.LG eess.SP stat.ML

    Change Point Detection in Time Series Data using Autoencoders with a Time-Invariant Representation

    Authors: Tim De Ryck, Maarten De Vos, Alexander Bertrand

    Abstract: Change point detection (CPD) aims to locate abrupt property changes in time series data. Recent CPD methods demonstrated the potential of using deep learning techniques, but often lack the ability to identify more subtle changes in the autocorrelation statistics of the signal and suffer from a high false alarm rate. To address these issues, we employ an autoencoder-based methodology with a novel l… ▽ More

    Submitted 10 February, 2021; v1 submitted 21 August, 2020; originally announced August 2020.

  43. XSleepNet: Multi-View Sequential Model for Automatic Sleep Staging

    Authors: Huy Phan, Oliver Y. Chén, Minh C. Tran, Philipp Koch, Alfred Mertins, Maarten De Vos

    Abstract: Automating sleep staging is vital to scale up sleep assessment and diagnosis to serve millions experiencing sleep deprivation and disorders and enable longitudinal sleep monitoring in home environments. Learning from raw polysomnography signals and their derived time-frequency image representations has been prevalent. However, learning from multi-view inputs (e.g., both the raw signals and the tim… ▽ More

    Submitted 31 March, 2021; v1 submitted 8 July, 2020; originally announced July 2020.

    Comments: This article has been published in IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI)

  44. arXiv:2007.02064  [pdf

    eess.SP cs.HC

    Monitoring Depression in Bipolar Disorder using Circadian Measures from Smartphone Accelerometers

    Authors: Oliver Carr, Fernando Andreotti, Kate E. A. Saunders, Niclas Palmius, Guy M. Goodwin, Maarten De Vos

    Abstract: Current management of bipolar disorder relies on self-reported questionnaires and interviews with clinicians. The development of objective measures of deteriorating mood may also allow for early interventions to take place to avoid transitions into depressive states. The objective of this study was to use acceleration data recorded from smartphones to predict levels of depression in a population o… ▽ More

    Submitted 4 July, 2020; originally announced July 2020.

    Comments: 8 pages, 3 figures

  45. Personalized Automatic Sleep Staging with Single-Night Data: a Pilot Study with KL-Divergence Regularization

    Authors: Huy Phan, Kaare Mikkelsen, Oliver Y. Chén, Philipp Koch, Alfred Mertins, Preben Kidmose, Maarten De Vos

    Abstract: Brain waves vary between people. An obvious way to improve automatic sleep staging for longitudinal sleep monitoring is personalization of algorithms based on individual characteristics extracted from the first night of data. As a single night is a very small amount of data to train a sleep staging model, we propose a Kullback-Leibler (KL) divergence regularized transfer learning approach to addre… ▽ More

    Submitted 11 May, 2020; v1 submitted 23 April, 2020; originally announced April 2020.

    Comments: This article has been published in Physiological Measurement

  46. arXiv:2004.05046  [pdf, other

    cs.DC

    XChange: A Blockchain-based Mechanism for Generic Asset Trading In Resource-constrained Environments

    Authors: Martijn de Vos, Can Umut Ileri, Johan Pouwelse

    Abstract: An increasing number of industries rely on Internet-of-Things devices to track physical resources. Blockchain technology provides primitives to represent these resources as digital assets on a secure distributed ledger. Due to the proliferation of blockchain-based assets, there is an increasing need for a generic mechanism to trade assets between isolated platforms. To date, there is no such mecha… ▽ More

    Submitted 10 April, 2020; originally announced April 2020.

  47. arXiv:2004.02575  [pdf, other

    cs.MA cs.AI

    A Norm Emergence Framework for Normative MAS -- Position Paper

    Authors: Andreasa Morris-Martin, Marina De Vos, Julian Padget

    Abstract: Norm emergence is typically studied in the context of multiagent systems (MAS) where norms are implicit, and participating agents use simplistic decision-making mechanisms. These implicit norms are usually unconsciously shared and adopted through agent interaction. A norm is deemed to have emerged when a threshold or predetermined percentage of agents follow the "norm". Conversely, in normative MA… ▽ More

    Submitted 6 April, 2020; originally announced April 2020.

    Comments: 16 pages, 2 figures, pre-print for International Workshop on Coordination, Organizations, Institutions, Norms and Ethics for Governance of Multi-Agent Systems (COINE), co-located with AAMAS 2020

  48. arXiv:2001.05532  [pdf, other

    cs.LG cs.SD eess.AS stat.ML

    Improving GANs for Speech Enhancement

    Authors: Huy Phan, Ian V. McLoughlin, Lam Pham, Oliver Y. Chén, Philipp Koch, Maarten De Vos, Alfred Mertins

    Abstract: Generative adversarial networks (GAN) have recently been shown to be efficient for speech enhancement. However, most, if not all, existing speech enhancement GANs (SEGAN) make use of a single generator to perform one-stage enhancement mapping. In this work, we propose to use multiple generators that are chained to perform multi-stage enhancement mapping, which gradually refines the noisy input sig… ▽ More

    Submitted 12 September, 2020; v1 submitted 15 January, 2020; originally announced January 2020.

    Comments: This letter has been accepted for publication in IEEE Signal Processing Letters

  49. arXiv:1909.07646   

    cs.LO cs.AI cs.PL

    Proceedings 35th International Conference on Logic Programming (Technical Communications)

    Authors: Bart Bogaerts, Esra Erdem, Paul Fodor, Andrea Formisano, Giovambattista Ianni, Daniela Inclezan, German Vidal, Alicia Villanueva, Marina De Vos, Fangkai Yang

    Abstract: Since the first conference held in Marseille in 1982, ICLP has been the premier international event for presenting research in logic programming. Contributions are sought in all areas of logic programming, including but not restricted to: Foundations: Semantics, Formalisms, Nonmonotonic reasoning, Knowledge representation. Languages: Concurrency, Objects, Coordination, Mobility, Higher Order,… ▽ More

    Submitted 17 September, 2019; originally announced September 2019.

    Journal ref: EPTCS 306, 2019

  50. arXiv:1907.13177  [pdf, ps, other

    cs.LG eess.SP stat.ML

    Towards More Accurate Automatic Sleep Staging via Deep Transfer Learning

    Authors: Huy Phan, Oliver Y. Chén, Philipp Koch, Zongqing Lu, Ian McLoughlin, Alfred Mertins, Maarten De Vos

    Abstract: Background: Despite recent significant progress in the development of automatic sleep staging methods, building a good model still remains a big challenge for sleep studies with a small cohort due to the data-variability and data-inefficiency issues. This work presents a deep transfer learning approach to overcome these issues and enable transferring knowledge from a large dataset to a small cohor… ▽ More

    Submitted 27 August, 2020; v1 submitted 30 July, 2019; originally announced July 2019.

    Comments: This article has been published in IEEE Transactions on Biomedical Engineering