Showing 1–37 of 37 results for author: Dusparic, I

  1. arXiv:2410.18786

    cs.AI

    Applying Neural Monte Carlo Tree Search to Unsignalized Multi-intersection Scheduling for Autonomous Vehicles

    Authors: Yucheng Shi, Wenlong Wang, Xiaowen Tao, Ivana Dusparic, Vinny Cahill

    Abstract: Dynamic scheduling of access to shared resources by autonomous systems is a challenging problem, characterized as being NP-hard. The complexity of this task leads to a combinatorial explosion of possibilities in highly dynamic systems where arriving requests must be continuously scheduled subject to strong safety and time constraints. An example of such a system is an unsignalized intersection, wh…

    Submitted 24 October, 2024; originally announced October 2024.

  2. arXiv:2410.08893

    cs.LG cs.AI cs.RO

    Drama: Mamba-Enabled Model-Based Reinforcement Learning Is Sample and Parameter Efficient

    Authors: Wenlong Wang, Ivana Dusparic, Yucheng Shi, Ke Zhang, Vinny Cahill

    Abstract: Model-based reinforcement learning (RL) offers a solution to the data inefficiency that plagues most model-free RL algorithms. However, learning a robust world model often requires complex and deep architectures, which are computationally expensive and challenging to train. Within the world model, sequence models play a critical role in accurate predictions, and various architectures have been exp…

    Submitted 16 May, 2025; v1 submitted 11 October, 2024; originally announced October 2024.

    Comments: Published as a conference paper at ICLR 2025

  3. arXiv:2409.05435

    cs.AI

    Semifactual Explanations for Reinforcement Learning

    Authors: Jasmina Gajcin, Jovan Jeromela, Ivana Dusparic

    Abstract: Reinforcement Learning (RL) is a learning paradigm in which the agent learns from its environment through trial and error. Deep reinforcement learning (DRL) algorithms represent the agent's policies using neural networks, making their decisions difficult to interpret. Explaining the behaviour of DRL agents is necessary to advance user trust, increase engagement, and facilitate integration with rea…

    Submitted 9 September, 2024; originally announced September 2024.

    Comments: 9 pages, 2 figures, 4 tables

  4. arXiv:2408.01188

    cs.AI

    Multi-Objective Deep Reinforcement Learning for Optimisation in Autonomous Systems

    Authors: Juan C. Rosero, Ivana Dusparic, Nicolás Cardozo

    Abstract: Reinforcement Learning (RL) is used extensively in Autonomous Systems (AS) as it enables learning at runtime without the need for a model of the environment or predefined actions. However, most applications of RL in AS, such as those based on Q-learning, can only optimize one objective, making it necessary in multi-objective systems to combine multiple objectives in a single objective function wit…

    Submitted 30 September, 2024; v1 submitted 2 August, 2024; originally announced August 2024.

    Comments: Accepted to AI4AS 2024 workshop

  5. arXiv:2402.06503

    cs.AI cs.LG

    ACTER: Diverse and Actionable Counterfactual Sequences for Explaining and Diagnosing RL Policies

    Authors: Jasmina Gajcin, Ivana Dusparic

    Abstract: Understanding how failure occurs and how it can be prevented in reinforcement learning (RL) is necessary to enable debugging, maintain user trust, and develop personalized policies. Counterfactual reasoning has often been used to assign blame and understand failure by searching for the closest possible world in which the failure is avoided. However, current counterfactual state explanations in RL…

    Submitted 9 February, 2024; originally announced February 2024.

    Comments: 17 pages, 4 Figures

  6. arXiv:2401.12405

    cs.DC cs.SE

    Learning Recovery Strategies for Dynamic Self-healing in Reactive Systems

    Authors: Mateo Sanabria, Ivana Dusparic, Nicolas Cardozo

    Abstract: Self-healing systems depend on following a set of predefined instructions to recover from a known failure state. Failure states are generally detected based on domain specific specialized metrics. Failure fixes are applied at predefined application hooks that are not sufficiently expressive to manage different failure types. Self-healing is usually applied in the context of distributed systems, wh…

    Submitted 22 January, 2024; originally announced January 2024.

    Comments: Preprint accepted to 19th International Conference on Software Engineering for Adaptive and Self-Managing Systems (SEAMS24)

  7. arXiv:2308.15969

    cs.AI

    Iterative Reward Shaping using Human Feedback for Correcting Reward Misspecification

    Authors: Jasmina Gajcin, James McCarthy, Rahul Nair, Radu Marinescu, Elizabeth Daly, Ivana Dusparic

    Abstract: A well-defined reward function is crucial for successful training of a reinforcement learning (RL) agent. However, defining a suitable reward function is a notoriously challenging task, especially in complex, multi-objective environments. Developers often have to resort to starting with an initial, potentially misspecified reward function, and iteratively adjusting its parameters, based on observ…

    Submitted 30 August, 2023; originally announced August 2023.

    Comments: 7 pages, 2 figures

  8. arXiv:2306.08785

    cs.NI cs.DC cs.LG cs.MA

    Density-Aware Reinforcement Learning to Optimise Energy Efficiency in UAV-Assisted Networks

    Authors: Babatunji Omoniwa, Boris Galkin, Ivana Dusparic

    Abstract: Unmanned aerial vehicles (UAVs) serving as aerial base stations can be deployed to provide wireless connectivity to mobile users, such as vehicles. However, the density of vehicles on roads often varies spatially and temporally primarily due to mobility and traffic situations in a geographical area, making it difficult to provide ubiquitous service. Moreover, as energy-constrained UAVs hover in th…

    Submitted 14 June, 2023; originally announced June 2023.

    Comments: 7 pages, To appear in the conference proceedings of IEEE WiMob 2023, Montreal, Canada

  9. arXiv:2303.10236

    cs.SE

    Prevalence of Code Smells in Reinforcement Learning Projects

    Authors: Nicolás Cardozo, Ivana Dusparic, Christian Cabrera

    Abstract: Reinforcement Learning (RL) is being increasingly used to learn and adapt application behavior in many domains, including large-scale and safety-critical systems such as autonomous driving. With the advent of plug-n-play RL libraries, its applicability has further increased, enabling integration of RL algorithms by users. We note, however, that the majority of such code is not developed b…

    Submitted 3 August, 2023; v1 submitted 17 March, 2023; originally announced March 2023.

    Comments: Paper preprint for the 2nd International Conference on AI Engineering - Software Engineering for AI (CAIN 2023)

  10. arXiv:2303.08772

    eess.SY

    Reservation of Virtualized Resources with Optimistic Online Learning

    Authors: Jean-Baptiste Monteil, George Iosifidis, Ivana Dusparic

    Abstract: The virtualization of wireless networks enables new services to access network resources made available by the Network Operator (NO) through a Network Slicing market. The different service providers (SPs) have the opportunity to lease the network resources from the NO to constitute slices that address the demand of their specific network service. The goal of any SP is to maximize its service utili…

    Submitted 15 March, 2023; originally announced March 2023.

    Comments: 7 pages, 4 figures, ICC 2023 conference

  11. arXiv:2303.04475

    cs.AI cs.LG

    RACCER: Towards Reachable and Certain Counterfactual Explanations for Reinforcement Learning

    Authors: Jasmina Gajcin, Ivana Dusparic

    Abstract: While reinforcement learning (RL) algorithms have been successfully applied to numerous tasks, their reliance on neural networks makes their behavior difficult to understand and trust. Counterfactual explanations are human-friendly explanations that offer users actionable advice on how to alter the model inputs to achieve the desired output from a black-box system. However, current approaches to g…

    Submitted 10 October, 2023; v1 submitted 8 March, 2023; originally announced March 2023.

    Comments: 10 pages, 3 figures, 3 tables

  12. arXiv:2303.01170

    cs.LG cs.AI cs.MA

    Expert-Free Online Transfer Learning in Multi-Agent Reinforcement Learning

    Authors: Alberto Castagna, Ivana Dusparic

    Abstract: Transfer learning in Reinforcement Learning (RL) has been widely studied to overcome training issues of Deep-RL, i.e., exploration cost, data availability and convergence time, by introducing a way to enhance the training phase with external knowledge. Generally, knowledge is transferred from expert-agents to novices. While this fixes the issue for a novice agent, a good understanding of the task on e…

    Submitted 28 July, 2023; v1 submitted 2 March, 2023; originally announced March 2023.

  13. arXiv:2211.05551

    cs.LG cs.AI cs.RO

    Causal Counterfactuals for Improving the Robustness of Reinforcement Learning

    Authors: Tom He, Jasmina Gajcin, Ivana Dusparic

    Abstract: Reinforcement learning (RL) is used in various robotic applications. RL enables agents to learn tasks autonomously by interacting with the environment. The more critical the tasks are, the higher the demand for the robustness of the RL systems. Causal RL combines RL and causal inference to make RL more robust. Causal RL agents use a causal representation to capture the invariant causal mechanisms…

    Submitted 5 June, 2023; v1 submitted 2 November, 2022; originally announced November 2022.

    Comments: Accepted to ARMS-2023 (ARMS-2023: AAMAS 2023 Workshop on Autonomous Robots and Multirobot Systems)

  14. Deep W-Networks: Solving Multi-Objective Optimisation Problems With Deep Reinforcement Learning

    Authors: Jernej Hribar, Luke Hackett, Ivana Dusparic

    Abstract: In this paper, we build on advances introduced by the Deep Q-Networks (DQN) approach to extend the multi-objective tabular Reinforcement Learning (RL) algorithm W-learning to large state spaces. The W-learning algorithm can naturally solve the competition between multiple single policies in multi-objective environments. However, the tabular version does not scale well to environments with large state…

    Submitted 23 February, 2023; v1 submitted 9 November, 2022; originally announced November 2022.

  15. arXiv:2210.11846

    cs.AI

    Redefining Counterfactual Explanations for Reinforcement Learning: Overview, Challenges and Opportunities

    Authors: Jasmina Gajcin, Ivana Dusparic

    Abstract: While AI algorithms have shown remarkable success in various fields, their lack of transparency hinders their application to real-life tasks. Although explanations targeted at non-experts are necessary for user trust and human-AI collaboration, the majority of explanation methods for AI are focused on developers and expert users. Counterfactual explanations are local explanations that offer users…

    Submitted 9 February, 2024; v1 submitted 21 October, 2022; originally announced October 2022.

    Comments: 32 pages, 6 figures

  16. arXiv:2210.00041

    cs.MA

    Communication-Enabled Deep Reinforcement Learning to Optimise Energy-Efficiency in UAV-Assisted Networks

    Authors: Babatunji Omoniwa, Boris Galkin, Ivana Dusparic

    Abstract: Unmanned aerial vehicles (UAVs) are increasingly deployed to provide wireless connectivity to static and mobile ground users in situations of increased network demand or points of failure in existing terrestrial cellular infrastructure. However, UAVs are energy-constrained and experience the challenge of interference from nearby UAV cells sharing the same frequency spectrum, thereby impacting the…

    Submitted 27 June, 2023; v1 submitted 30 September, 2022; originally announced October 2022.

    Comments: 16 pages, 22 figures, under review. Extension of the work arXiv:2204.01597

  17. arXiv:2207.08651

    cs.LG cs.AI

    Boolean Decision Rules for Reinforcement Learning Policy Summarisation

    Authors: James McCarthy, Rahul Nair, Elizabeth Daly, Radu Marinescu, Ivana Dusparic

    Abstract: Explainability of Reinforcement Learning (RL) policies remains a challenging research problem, particularly when considering RL in a safety context. Understanding the decisions and intentions of an RL policy offers avenues to incorporate safety into the policy by limiting undesirable actions. We propose the use of a Boolean Decision Rules model to create a post-hoc rule-based summary of an agent's…

    Submitted 18 July, 2022; originally announced July 2022.

  18. arXiv:2206.12492

    cs.SE

    Guidelines for Artifacts to Support Industry-Relevant Research on Self-Adaptation

    Authors: Danny Weyns, Ilias Gerostathopoulos, Barbora Buhnova, Nicolas Cardozo, Emilia Cioroaica, Ivana Dusparic, Lars Grunske, Pooyan Jamshidi, Christine Julien, Judith Michael, Gabriel Moreno, Shiva Nejati, Patrizio Pelliccione, Federico Quin, Genaina Rodrigues, Bradley Schmerl, Marco Vieira, Thomas Vogel, Rebekka Wohlrab

    Abstract: Artifacts support evaluating new research results and help comparing them with the state of the art in a field of interest. Over the past years, several artifacts have been introduced to support research in the field of self-adaptive systems. While these artifacts have shown their value, it is not clear to what extent these artifacts support research on problems in self-adaptation that are relevan…

    Submitted 24 June, 2022; originally announced June 2022.

    Comments: 7 pages

  19. arXiv:2205.11519

    cs.CR cs.LG

    FedSA: Accelerating Intrusion Detection in Collaborative Environments with Federated Simulated Annealing

    Authors: Helio N. Cunha Neto, Ivana Dusparic, Diogo M. F. Mattos, Natalia C. Fernandes

    Abstract: Fast identification of new network attack patterns is crucial for improving network security. Nevertheless, identifying an ongoing attack in a heterogeneous network is a non-trivial task. Federated learning emerges as a solution to collaborative training for an Intrusion Detection System (IDS). The federated learning-based IDS trains a global model using local machine learning models provided by f…

    Submitted 23 May, 2022; originally announced May 2022.

  20. arXiv:2204.01597

    cs.NI cs.DC cs.LG

    Optimising Energy Efficiency in UAV-Assisted Networks using Deep Reinforcement Learning

    Authors: Babatunji Omoniwa, Boris Galkin, Ivana Dusparic

    Abstract: In this letter, we study the energy efficiency (EE) optimisation of unmanned aerial vehicles (UAVs) providing wireless coverage to static and mobile ground users. Recent multi-agent reinforcement learning approaches optimise the system's EE using a 2D trajectory design, neglecting interference from nearby UAV cells. We aim to maximise the system's EE by jointly optimising each UAV's 3D trajectory,…

    Submitted 4 April, 2022; originally announced April 2022.

    Comments: 5 pages, Submitted for publication in the IEEE Wireless Communication Letters

  21. arXiv:2203.11211

    cs.LG cs.AI

    ReCCoVER: Detecting Causal Confusion for Explainable Reinforcement Learning

    Authors: Jasmina Gajcin, Ivana Dusparic

    Abstract: Despite notable results in various fields over the recent years, deep reinforcement learning (DRL) algorithms lack transparency, affecting user trust and hindering their deployment to high-risk tasks. Causal confusion refers to a phenomenon where an agent learns spurious correlations between features which might not hold across the entire state space, preventing safe deployment to real tasks where…

    Submitted 21 March, 2022; originally announced March 2022.

    Comments: 18 pages, 4 tables, 4 figures

  22. Enabling Deep Reinforcement Learning on Energy Constrained Devices at the Edge of the Network

    Authors: Jernej Hribar, Ivana Dusparic

    Abstract: Deep Reinforcement Learning (DRL) solutions are becoming pervasive at the edge of the network as they enable autonomous decision-making in a dynamic environment. However, to be able to adapt to the ever-changing environment, the DRL solution implemented on an embedded device has to continue to occasionally take exploratory actions even after initial convergence. In other words, the device has to o…

    Submitted 18 January, 2022; originally announced January 2022.

  23. arXiv:2112.09462

    cs.AI

    Contrastive Explanations for Comparing Preferences of Reinforcement Learning Agents

    Authors: Jasmina Gajcin, Rahul Nair, Tejaswini Pedapati, Radu Marinescu, Elizabeth Daly, Ivana Dusparic

    Abstract: In complex tasks where the reward function is not straightforward and consists of a set of objectives, multiple reinforcement learning (RL) policies that perform the task adequately but employ different strategies can be trained by adjusting the impact of individual objectives on the reward function. Understanding the differences in strategies between policies is necessary to enable users to choose betwe…

    Submitted 17 December, 2021; originally announced December 2021.

    Comments: 7 pages, 3 figures

  24. arXiv:2112.00424

    cs.LG cs.AI cs.MA

    Multi-Agent Transfer Learning in Reinforcement Learning-Based Ride-Sharing Systems

    Authors: Alberto Castagna, Ivana Dusparic

    Abstract: Reinforcement learning (RL) has been used in a range of simulated real-world tasks, e.g., sensor coordination, traffic light control, and on-demand mobility services. However, real-world deployments are rare, as RL struggles with the dynamic nature of real-world environments, requiring time for learning a task and adapting to changes in the environment. Transfer Learning (TL) can help lower these adap…

    Submitted 1 December, 2021; originally announced December 2021.

  25. arXiv:2111.02258

    eess.SP cs.LG

    Multi-Agent Deep Reinforcement Learning For Optimising Energy Efficiency of Fixed-Wing UAV Cellular Access Points

    Authors: Boris Galkin, Babatunji Omoniwa, Ivana Dusparic

    Abstract: Unmanned Aerial Vehicles (UAVs) promise to become an intrinsic part of next generation communications, as they can be deployed to provide wireless connectivity to ground users to supplement existing terrestrial networks. The majority of the existing research into the use of UAV access points for cellular coverage considers rotary-wing UAV designs (i.e. quadcopters). However, we expect fixed-wing U…

    Submitted 3 November, 2021; originally announced November 2021.

  26. Analyse or Transmit: Utilising Correlation at the Edge with Deep Reinforcement Learning

    Authors: Jernej Hribar, Ryoichi Shinkuma, George Iosifidis, Ivana Dusparic

    Abstract: Millions of sensors, cameras, meters, and other edge devices are deployed in networks to collect and analyse data. In many cases, such devices are powered only by Energy Harvesting (EH) and have limited energy available to analyse acquired data. When edge infrastructure is available, a device has a choice: to perform analysis locally or offload the task to other resource-rich devices such as cloudl…

    Submitted 29 September, 2021; originally announced September 2021.

  27. arXiv:2106.00845

    cs.MA cs.LG cs.NI

    Energy-aware optimization of UAV base stations placement via decentralized multi-agent Q-learning

    Authors: Babatunji Omoniwa, Boris Galkin, Ivana Dusparic

    Abstract: Unmanned aerial vehicles serving as aerial base stations (UAV-BSs) can be deployed to provide wireless connectivity to ground devices in events of increased network demand, points-of-failure in existing infrastructure, or disasters. However, it is challenging to conserve the energy of UAVs during prolonged coverage tasks, considering their limited on-board battery capacity. Reinforcement learning-…

    Submitted 4 November, 2021; v1 submitted 1 June, 2021; originally announced June 2021.

    Comments: Accepted paper for presentation and publication in the Proceedings of IEEE CCNC 2022, Las Vegas, USA

  28. arXiv:2106.00654

    cs.LG cs.NI eess.SP

    A reinforcement learning approach to improve communication performance and energy utilization in fog-based IoT

    Authors: Babatunji Omoniwa, Maxime Gueriau, Ivana Dusparic

    Abstract: Recent research has shown the potential of using available mobile fog devices (such as smartphones, drones, domestic and industrial robots) as relays to minimize communication outages between sensors and destination devices, where localized Internet-of-Things services (e.g., manufacturing process control, health and security monitoring) are delivered. However, these mobile relays deplete energy wh…

    Submitted 1 June, 2021; originally announced June 2021.

    Comments: Submitted and published in IEEE proceedings

  29. arXiv:2103.06908

    cs.AI

    Adaptation to Unknown Situations as the Holy Grail of Learning-Based Self-Adaptive Systems: Research Directions

    Authors: Ivana Dusparic, Nicolas Cardozo

    Abstract: Self-adaptive systems continuously adapt to changes in their execution environment. Capturing all possible changes to define suitable behaviour beforehand is unfeasible, or even impossible in the case of unknown changes, hence human intervention may be required. We argue that adapting to unknown situations is the ultimate challenge for self-adaptive systems. Learning-based approaches are used to l…

    Submitted 11 March, 2021; originally announced March 2021.

  30. arXiv:2103.06757

    cs.PL cs.AI cs.SE

    Auto-COP: Adaptation Generation in Context-Oriented Programming using Reinforcement Learning Options

    Authors: Nicolás Cardozo, Ivana Dusparic

    Abstract: Self-adaptive software systems continuously adapt in response to internal and external changes in their execution environment, captured as contexts. The COP paradigm posits a technique for the development of self-adaptive systems, capturing their main characteristics with specialized programming language constructs. COP adaptations are specified as independent modules composed in and out of the ba…

    Submitted 3 August, 2023; v1 submitted 11 March, 2021; originally announced March 2021.

    Comments: Submitted to the journal Information and Software Technology. 22 pages

  31. arXiv:2102.12899

    cs.NI

    Mobility for Cellular-Connected UAVs: challenges for the network provider

    Authors: Erika Fonseca, Boris Galkin, Marvin Kelly, Luiz A. DaSilva, Ivana Dusparic

    Abstract: Unmanned Aerial Vehicle (UAV) technology is becoming more prevalent and more diverse in its application. 5G and beyond networks must enable UAV connectivity. This will require the network operator to consider this new type of user in the planning and operation of the network. This work presents the challenges an operator will encounter and should consider in the future as UAVs become users of the…

    Submitted 25 February, 2021; originally announced February 2021.

    Comments: 6 pages, 4 figures

  32. arXiv:2011.03236

    cs.NI

    Experimental Evaluation of a UAV User QoS from a Two-Tier 3.6GHz Spectrum Network

    Authors: Boris Galkin, Erika Fonseca, Gavin Lee, Conor Duff, Marvin Kelly, Edward Emmanuel, Ivana Dusparic

    Abstract: Unmanned Aerial Vehicle (UAV) technology is becoming increasingly used in a variety of applications such as video surveillance and deliveries. To enable safe and efficient use of UAVs, the devices will need to be connected into cellular networks. Existing research on UAV cellular connectivity shows that UAVs encounter significant issues with existing networks, such as strong interference and anten…

    Submitted 9 April, 2021; v1 submitted 6 November, 2020; originally announced November 2020.

  33. arXiv:2010.01126

    cs.IT eess.SP

    REQIBA: Regression and Deep Q-Learning for Intelligent UAV Cellular User to Base Station Association

    Authors: Boris Galkin, Erika Fonseca, Ramy Amer, Luiz A. DaSilva, Ivana Dusparic

    Abstract: Unmanned Aerial Vehicles (UAVs) are emerging as important users of next-generation cellular networks. By operating in the sky, UAV users experience very different radio conditions than terrestrial users, due to factors such as strong Line-of-Sight (LoS) channels (and interference) and Base Station (BS) antenna misalignment. As a consequence, the UAVs may experience significant degradation to their…

    Submitted 3 November, 2021; v1 submitted 2 October, 2020; originally announced October 2020.

    Comments: To appear in IEEE Transactions on Vehicular Technology (TVT)

  34. arXiv:2007.13695

    eess.SP cs.LG

    Adaptive Height Optimisation for Cellular-Connected UAVs using Reinforcement Learning

    Authors: Erika Fonseca, Boris Galkin, Ramy Amer, Luiz A. DaSilva, Ivana Dusparic

    Abstract: Providing reliable connectivity to cellular-connected UAVs can be very challenging; their performance highly depends on the nature of the surrounding environment, such as density and heights of the ground BSs. On the other hand, tall buildings might block undesired interference signals from ground BSs, thereby improving the connectivity between the UAVs and their serving BSs. To address the connect…

    Submitted 13 April, 2022; v1 submitted 27 July, 2020; originally announced July 2020.

  35. arXiv:1810.03679

    cs.LG cs.AI cs.MA stat.ML

    Multi-agent Deep Reinforcement Learning for Zero Energy Communities

    Authors: Amit Prasad, Ivana Dusparic

    Abstract: Advances in renewable energy generation and introduction of the government targets to improve energy efficiency gave rise to a concept of a Zero Energy Building (ZEB). A ZEB is a building whose net energy usage over a year is zero, i.e., its energy use is not larger than its overall renewables generation. A collection of ZEBs forms a Zero Energy Community (ZEC). This paper addresses the problem of…

    Submitted 27 June, 2019; v1 submitted 8 October, 2018; originally announced October 2018.

    Comments: Accepted at ISGT Europe 2019

    MSC Class: 97R40

    ACM Class: I.2.11; I.2.6

  36. arXiv:1805.09090

    cs.MA

    Volunteers in the Smart City: Comparison of Contribution Strategies on Human-Centered Measures

    Authors: Stefano Bennati, Ivana Dusparic, Rhythima Shinde, Catholijn M. Jonker

    Abstract: Several smart city services rely on users' contributions, e.g., data, which can be costly for the users in terms of privacy. High costs lead to reduced user participation, which undermines the success of smart city technologies. This work develops a scenario-independent design principle, based on public good theory, for resource management in smart city applications, where provision of a service depe…

    Submitted 23 May, 2018; originally announced May 2018.

  37. arXiv:1409.4561

    cs.MA

    Decentralised Multi-Agent Reinforcement Learning for Dynamic and Uncertain Environments

    Authors: Andrei Marinescu, Ivana Dusparic, Adam Taylor, Vinny Cahill, Siobhán Clarke

    Abstract: Multi-Agent Reinforcement Learning (MARL) is a widely used technique for optimization in decentralised control problems. However, most applications of MARL are in static environments, and are not suitable when agent behaviour and environment conditions are dynamic and uncertain. Addressing uncertainty in such environments remains a challenging problem for MARL-based systems. The dynamic nature of…

    Submitted 16 September, 2014; originally announced September 2014.

    Comments: 7 pages, 7 figures, 1 Table, 1 algorithm, conference