Skip to main content

Showing 1–15 of 15 results for author: Mocanu, E

.
  1. arXiv:2506.00932  [pdf, other

    cs.LG

    Addressing the Collaboration Dilemma in Low-Data Federated Learning via Transient Sparsity

    Authors: Qiao Xiao, Boqian Wu, Andrey Poddubnyy, Elena Mocanu, Phuong H. Nguyen, Mykola Pechenizkiy, Decebal Constantin Mocanu

    Abstract: Federated learning (FL) enables collaborative model training across decentralized clients while preserving data privacy, leveraging aggregated updates to build robust global models. However, this training paradigm faces significant challenges due to data heterogeneity and limited local datasets, which often impede effective collaboration. In such scenarios, we identify the Layer-wise Inertia Pheno… ▽ More

    Submitted 1 June, 2025; originally announced June 2025.

  2. arXiv:2505.17909  [pdf, ps, other

    cs.LG cs.AI

    NeuroTrails: Training with Dynamic Sparse Heads as the Key to Effective Ensembling

    Authors: Bram Grooten, Farid Hasanov, Chenxiang Zhang, Qiao Xiao, Boqian Wu, Zahra Atashgahi, Ghada Sokar, Shiwei Liu, Lu Yin, Elena Mocanu, Mykola Pechenizkiy, Decebal Constantin Mocanu

    Abstract: Model ensembles have long been a cornerstone for improving generalization and robustness in deep learning. However, their effectiveness often comes at the cost of substantial computational overhead. To address this issue, state-of-the-art methods aim to replicate ensemble-class performance without requiring multiple independently trained networks. Unfortunately, these algorithms often still demand… ▽ More

    Submitted 23 May, 2025; originally announced May 2025.

    Comments: Our open-source code is available at https://github.com/bramgrooten/neurotrails

  3. arXiv:2410.03030  [pdf, other

    cs.CV cs.AI

    Dynamic Sparse Training versus Dense Training: The Unexpected Winner in Image Corruption Robustness

    Authors: Boqian Wu, Qiao Xiao, Shunxin Wang, Nicola Strisciuglio, Mykola Pechenizkiy, Maurice van Keulen, Decebal Constantin Mocanu, Elena Mocanu

    Abstract: It is generally perceived that Dynamic Sparse Training opens the door to a new era of scalability and efficiency for artificial neural networks at, perhaps, some costs in accuracy performance for the classification task. At the same time, Dense Training is widely accepted as being the "de facto" approach to train artificial neural networks if one would like to maximize their robustness against ima… ▽ More

    Submitted 4 March, 2025; v1 submitted 3 October, 2024; originally announced October 2024.

    Comments: Accepted at ICLR 2025

  4. arXiv:2312.04727  [pdf, other

    cs.CV

    E2ENet: Dynamic Sparse Feature Fusion for Accurate and Efficient 3D Medical Image Segmentation

    Authors: Boqian Wu, Qiao Xiao, Shiwei Liu, Lu Yin, Mykola Pechenizkiy, Decebal Constantin Mocanu, Maurice Van Keulen, Elena Mocanu

    Abstract: Deep neural networks have evolved as the leading approach in 3D medical image segmentation due to their outstanding performance. However, the ever-increasing model size and computation cost of deep neural networks have become the primary barrier to deploying them on real-world resource-limited hardware. In pursuit of improving performance and efficiency, we propose a 3D medical image segmentation… ▽ More

    Submitted 19 February, 2025; v1 submitted 7 December, 2023; originally announced December 2023.

    Comments: Accepted at NeurIPS 2024

  5. arXiv:2302.06548  [pdf, other

    cs.LG cs.AI

    Automatic Noise Filtering with Dynamic Sparse Training in Deep Reinforcement Learning

    Authors: Bram Grooten, Ghada Sokar, Shibhansh Dohare, Elena Mocanu, Matthew E. Taylor, Mykola Pechenizkiy, Decebal Constantin Mocanu

    Abstract: Tomorrow's robots will need to distinguish useful information from noise when performing different tasks. A household robot for instance may continuously receive a plethora of information about the home, but needs to focus on just a small subset to successfully execute its current chore. Filtering distracting inputs that contain irrelevant data has received little attention in the reinforcement le… ▽ More

    Submitted 13 February, 2023; originally announced February 2023.

    Comments: Accepted as full-paper at AAMAS 2023

  6. arXiv:2212.09840  [pdf, other

    cs.LG cs.AI

    Dynamic Sparse Network for Time Series Classification: Learning What to "see''

    Authors: Qiao Xiao, Boqian Wu, Yu Zhang, Shiwei Liu, Mykola Pechenizkiy, Elena Mocanu, Decebal Constantin Mocanu

    Abstract: The receptive field (RF), which determines the region of time series to be ``seen'' and used, is critical to improve the performance for time series classification (TSC). However, the variation of signal scales across and within time series data, makes it challenging to decide on proper RF sizes for TSC. In this paper, we propose a dynamic sparse network (DSN) with sparse connections for TSC, whic… ▽ More

    Submitted 19 December, 2022; originally announced December 2022.

    Comments: Accepted at Neural Information Processing Systems (NeurIPS 2022)

  7. arXiv:2106.14568  [pdf, other

    cs.LG cs.CV

    Deep Ensembling with No Overhead for either Training or Testing: The All-Round Blessings of Dynamic Sparsity

    Authors: Shiwei Liu, Tianlong Chen, Zahra Atashgahi, Xiaohan Chen, Ghada Sokar, Elena Mocanu, Mykola Pechenizkiy, Zhangyang Wang, Decebal Constantin Mocanu

    Abstract: The success of deep ensembles on improving predictive performance, uncertainty estimation, and out-of-distribution robustness has been extensively studied in the machine learning literature. Albeit the promising results, naively training multiple deep neural networks and combining their predictions at inference leads to prohibitive computational costs and memory requirements. Recently proposed eff… ▽ More

    Submitted 7 February, 2022; v1 submitted 28 June, 2021; originally announced June 2021.

    Comments: published in International Conference on Learning Representations (ICLR 2022)

    Journal ref: Proceedings of the International Conference on Machine Learning (ICLR 2022)

  8. arXiv:2106.04217  [pdf, other

    cs.LG cs.AI

    Dynamic Sparse Training for Deep Reinforcement Learning

    Authors: Ghada Sokar, Elena Mocanu, Decebal Constantin Mocanu, Mykola Pechenizkiy, Peter Stone

    Abstract: Deep reinforcement learning (DRL) agents are trained through trial-and-error interactions with the environment. This leads to a long training time for dense neural networks to achieve good performance. Hence, prohibitive computation and memory resources are consumed. Recently, learning efficient DRL agents has received increasing attention. Yet, current methods focus on accelerating inference time… ▽ More

    Submitted 5 May, 2022; v1 submitted 8 June, 2021; originally announced June 2021.

    Comments: Published in the Proceedings of the 31st International Joint Conference on Artificial Intelligence (IJCAI-22)

  9. arXiv:2103.01636  [pdf, other

    cs.AI cs.LG cs.MA cs.NE

    Sparse Training Theory for Scalable and Efficient Agents

    Authors: Decebal Constantin Mocanu, Elena Mocanu, Tiago Pinto, Selima Curci, Phuong H. Nguyen, Madeleine Gibescu, Damien Ernst, Zita A. Vale

    Abstract: A fundamental task for artificial intelligence is learning. Deep Neural Networks have proven to cope perfectly with all learning paradigms, i.e. supervised, unsupervised, and reinforcement learning. Nevertheless, traditional deep learning approaches make use of cloud computing facilities and do not scale well to autonomous agents with low computational resources. Even in the cloud, they suffer fro… ▽ More

    Submitted 2 March, 2021; originally announced March 2021.

    Journal ref: 20th International Conference on Autonomous Agents and Multiagent Systems (AAMAS 2021)

  10. arXiv:2012.00560  [pdf, other

    cs.LG stat.ML

    Quick and Robust Feature Selection: the Strength of Energy-efficient Sparse Training for Autoencoders

    Authors: Zahra Atashgahi, Ghada Sokar, Tim van der Lee, Elena Mocanu, Decebal Constantin Mocanu, Raymond Veldhuis, Mykola Pechenizkiy

    Abstract: Major complications arise from the recent increase in the amount of high-dimensional data, including high computational costs and memory requirements. Feature selection, which identifies the most relevant and informative attributes of a dataset, has been introduced as a solution to this problem. Most of the existing feature selection methods are computationally inefficient; inefficient algorithms… ▽ More

    Submitted 13 September, 2021; v1 submitted 1 December, 2020; originally announced December 2020.

    Comments: 29 pages

  11. arXiv:1804.07645  [pdf, other

    cs.CV cs.LG stat.ML

    One-Shot Learning using Mixture of Variational Autoencoders: a Generalization Learning approach

    Authors: Decebal Constantin Mocanu, Elena Mocanu

    Abstract: Deep learning, even if it is very successful nowadays, traditionally needs very large amounts of labeled data to perform excellent on the classification task. In an attempt to solve this problem, the one-shot learning paradigm, which makes use of just one labeled sample per class and prior knowledge, becomes increasingly important. In this paper, we propose a new one-shot learning method, dubbed M… ▽ More

    Submitted 18 April, 2018; originally announced April 2018.

    Journal ref: 17th International Conference on Autonomous Agents and Multiagent Systems (AAMAS 2018)

  12. arXiv:1707.05878  [pdf, other

    cs.LG cs.AI math.OC

    On-line Building Energy Optimization using Deep Reinforcement Learning

    Authors: Elena Mocanu, Decebal Constantin Mocanu, Phuong H. Nguyen, Antonio Liotta, Michael E. Webber, Madeleine Gibescu, J. G. Slootweg

    Abstract: Unprecedented high volumes of data are becoming available with the growth of the advanced metering infrastructure. These are expected to benefit planning and operation of the future power system, and to help the customers transition from a passive to an active role. In this paper, we explore for the first time in the smart grid context the benefits of using Deep Reinforcement Learning, a hybrid ty… ▽ More

    Submitted 18 July, 2017; originally announced July 2017.

  13. Scalable Training of Artificial Neural Networks with Adaptive Sparse Connectivity inspired by Network Science

    Authors: Decebal Constantin Mocanu, Elena Mocanu, Peter Stone, Phuong H. Nguyen, Madeleine Gibescu, Antonio Liotta

    Abstract: Through the success of deep learning in various domains, artificial neural networks are currently among the most used artificial intelligence methods. Taking inspiration from the network properties of biological neural networks (e.g. sparsity, scale-freeness), we argue that (contrary to general practice) artificial neural networks, too, should not have fully-connected layers. Here we propose spars… ▽ More

    Submitted 20 June, 2018; v1 submitted 15 July, 2017; originally announced July 2017.

    Comments: 18 pages

    Journal ref: Nature Communications, 2018

  14. arXiv:1605.01939  [pdf, other

    stat.ML cs.AI cs.LG

    Energy Disaggregation for Real-Time Building Flexibility Detection

    Authors: Elena Mocanu, Phuong H. Nguyen, Madeleine Gibescu

    Abstract: Energy is a limited resource which has to be managed wisely, taking into account both supply-demand matching and capacity constraints in the distribution grid. One aspect of the smart energy management at the building level is given by the problem of real-time detection of flexible demand available. In this paper we propose the use of energy disaggregation techniques to perform this task. Firstly,… ▽ More

    Submitted 6 May, 2016; originally announced May 2016.

    Comments: To appear in IEEE PES General Meeting, 2016, Boston, USA

  15. A topological insight into restricted Boltzmann machines

    Authors: Decebal Constantin Mocanu, Elena Mocanu, Phuong H. Nguyen, Madeleine Gibescu, Antonio Liotta

    Abstract: Restricted Boltzmann Machines (RBMs) and models derived from them have been successfully used as basic building blocks in deep artificial neural networks for automatic features extraction, unsupervised weights initialization, but also as density estimators. Thus, their generative and discriminative capabilities, but also their computational time are instrumental to a wide range of applications. Ou… ▽ More

    Submitted 18 July, 2016; v1 submitted 20 April, 2016; originally announced April 2016.

    Comments: http://link.springer.com/article/10.1007/s10994-016-5570-z, Machine Learning, issn=1573-0565, 2016