Skip to main content

Showing 1–50 of 78 results for author: Musolesi, M

Searching in archive cs. Search in all archives.
.
  1. arXiv:2504.12777  [pdf, other

    cs.MA cs.AI

    Multi-Agent Reinforcement Learning Simulation for Environmental Policy Synthesis

    Authors: James Rudd-Jones, Mirco Musolesi, María Pérez-Ortiz

    Abstract: Climate policy development faces significant challenges due to deep uncertainty, complex system dynamics, and competing stakeholder interests. Climate simulation methods, such as Earth System Models, have become valuable tools for policy exploration. However, their typical use is for evaluating potential polices, rather than directly synthesizing them. The problem can be inverted to optimize for p… ▽ More

    Submitted 14 May, 2025; v1 submitted 17 April, 2025; originally announced April 2025.

    Comments: Published in AAMAS'25 Blue Sky Ideas Track

  2. arXiv:2502.14037  [pdf, other

    cs.CL cs.AI cs.LG

    DiffSampling: Enhancing Diversity and Accuracy in Neural Text Generation

    Authors: Giorgio Franceschelli, Mirco Musolesi

    Abstract: Despite their increasing performance, large language models still tend to reproduce training data, generate several repetitions, and focus on the most common grammatical structures and words. A possible cause is the decoding strategy adopted: the most common ones either consider only the most probable tokens, reducing output diversity, or increase the likelihood of unlikely tokens at the cost of o… ▽ More

    Submitted 19 February, 2025; originally announced February 2025.

  3. arXiv:2502.13207  [pdf, other

    cs.CL cs.AI cs.CY cs.LG

    Thinking Outside the (Gray) Box: A Context-Based Score for Assessing Value and Originality in Neural Text Generation

    Authors: Giorgio Franceschelli, Mirco Musolesi

    Abstract: Despite the increasing use of large language models for creative tasks, their outputs often lack diversity. Common solutions, such as sampling at higher temperatures, can compromise the quality of the results. Drawing on information theory, we propose a context-based score to quantitatively evaluate value and originality. This score incentivizes accuracy and adherence to the request while fosterin… ▽ More

    Submitted 18 February, 2025; originally announced February 2025.

  4. arXiv:2411.11603  [pdf, other

    cs.LG cs.CR

    Feature Selection for Network Intrusion Detection

    Authors: Charles Westphal, Stephen Hailes, Mirco Musolesi

    Abstract: Network Intrusion Detection (NID) remains a key area of research within the information security community, while also being relevant to Machine Learning (ML) practitioners. The latter generally aim to detect attacks using network features, which have been extracted from raw network data typically using dimensionality reduction methods, such as principal component analysis (PCA). However, PCA is n… ▽ More

    Submitted 18 November, 2024; originally announced November 2024.

  5. arXiv:2411.00147  [pdf, other

    cs.LG

    Mutual Information Preserving Neural Network Pruning

    Authors: Charles Westphal, Stephen Hailes, Mirco Musolesi

    Abstract: Pruning has emerged as the primary approach used to limit the resource requirements of large neural networks (NNs). Since the proposal of the lottery ticket hypothesis, researchers have focused either on pruning at initialization or after training. However, recent theoretical findings have shown that the sample efficiency of robust pruned models is proportional to the mutual information (MI) betwe… ▽ More

    Submitted 3 February, 2025; v1 submitted 31 October, 2024; originally announced November 2024.

  6. arXiv:2410.01639  [pdf, other

    cs.LG cs.AI cs.CY

    Moral Alignment for LLM Agents

    Authors: Elizaveta Tennant, Stephen Hailes, Mirco Musolesi

    Abstract: Decision-making agents based on pre-trained Large Language Models (LLMs) are increasingly being deployed across various domains of human activity. While their applications are currently rather specialized, several research efforts are underway to develop more generalist agents. As LLM-based systems become more agentic, their influence on human activity will grow and their transparency will decreas… ▽ More

    Submitted 11 May, 2025; v1 submitted 2 October, 2024; originally announced October 2024.

    Comments: Published at the 13th International Conference on Learning Representations (ICLR'25), Singapore, Apr 2025. https://openreview.net/forum?id=MeGDmZjUXy

  7. arXiv:2409.07932  [pdf, other

    cs.LG cs.AI cs.MA cs.SI

    Reinforcement Learning Discovers Efficient Decentralized Graph Path Search Strategies

    Authors: Alexei Pisacane, Victor-Alexandru Darvariu, Mirco Musolesi

    Abstract: Graph path search is a classic computer science problem that has been recently approached with Reinforcement Learning (RL) due to its potential to outperform prior methods. Existing RL techniques typically assume a global view of the network, which is not suitable for large-scale, dynamic, and privacy-sensitive settings. An area of particular interest is search in social networks due to its numero… ▽ More

    Submitted 26 November, 2024; v1 submitted 12 September, 2024; originally announced September 2024.

    Journal ref: Proceedings of the Third Learning on Graphs Conference (LoG 2024), PMLR 269

  8. arXiv:2407.13493  [pdf, other

    cs.CY cs.AI cs.LG

    Training Foundation Models as Data Compression: On Information, Model Weights and Copyright Law

    Authors: Giorgio Franceschelli, Claudia Cevenini, Mirco Musolesi

    Abstract: The training process of foundation models as for other classes of deep learning systems is based on minimizing the reconstruction error over a training set. For this reason, they are susceptible to the memorization and subsequent reproduction of training samples. In this paper, we introduce a training-as-compressing perspective, wherein the model's weights embody a compressed representation of the… ▽ More

    Submitted 12 March, 2025; v1 submitted 18 July, 2024; originally announced July 2024.

    Comments: Spotlight presentation at GenLaw'24, see https://www.genlaw.org/2024-icml-papers#training-foundation-models-as-data-compression-on-information-model-weights-and-copyright-law

  9. arXiv:2405.19212  [pdf, other

    cs.LG cs.AI cs.IT

    Partial Information Decomposition for Data Interpretability and Feature Selection

    Authors: Charles Westphal, Stephen Hailes, Mirco Musolesi

    Abstract: In this paper, we introduce Partial Information Decomposition of Features (PIDF), a new paradigm for simultaneous data interpretability and feature selection. Contrary to traditional methods that assign a single importance value, our approach is based on three metrics per feature: the mutual information shared with the target variable, the feature's contribution to synergistic information, and the… ▽ More

    Submitted 18 November, 2024; v1 submitted 29 May, 2024; originally announced May 2024.

  10. arXiv:2405.13551  [pdf, other

    cs.LG cs.AI

    Large Language Models are Effective Priors for Causal Graph Discovery

    Authors: Victor-Alexandru Darvariu, Stephen Hailes, Mirco Musolesi

    Abstract: Causal structure discovery from observations can be improved by integrating background knowledge provided by an expert to reduce the hypothesis space. Recently, Large Language Models (LLMs) have begun to be considered as sources of prior information given the low cost of querying them relative to a human expert. In this work, firstly, we propose a set of metrics for assessing LLM judgments for cau… ▽ More

    Submitted 22 May, 2024; originally announced May 2024.

  11. arXiv:2405.00099  [pdf, other

    cs.AI cs.CL cs.HC cs.LG

    Creative Beam Search: LLM-as-a-Judge For Improving Response Generation

    Authors: Giorgio Franceschelli, Mirco Musolesi

    Abstract: Large language models are revolutionizing several areas, including artificial creativity. However, the process of generation in machines profoundly diverges from that observed in humans. In particular, machine generation is characterized by a lack of intentionality and an underlying creative process. We propose a method called Creative Beam Search that uses Diverse Beam Search and LLM-as-a-Judge t… ▽ More

    Submitted 7 October, 2024; v1 submitted 30 April, 2024; originally announced May 2024.

    Comments: Presented as a short paper at the 15th International Conference on Computational Creativity (ICCC'24)

  12. arXiv:2404.06492  [pdf, other

    cs.LG cs.AI

    Graph Reinforcement Learning for Combinatorial Optimization: A Survey and Unifying Perspective

    Authors: Victor-Alexandru Darvariu, Stephen Hailes, Mirco Musolesi

    Abstract: Graphs are a natural representation for systems based on relations between connected entities. Combinatorial optimization problems, which arise when considering an objective function related to a process of interest on discrete structures, are often challenging due to the rapid growth of the solution space. The trial-and-error paradigm of Reinforcement Learning has recently emerged as a promising… ▽ More

    Submitted 20 August, 2024; v1 submitted 9 April, 2024; originally announced April 2024.

    Comments: To appear in Transactions on Machine Learning Research (TMLR)

  13. arXiv:2403.07979  [pdf, other

    cs.LG cs.AI

    Do Agents Dream of Electric Sheep?: Improving Generalization in Reinforcement Learning through Generative Learning

    Authors: Giorgio Franceschelli, Mirco Musolesi

    Abstract: The Overfitted Brain hypothesis suggests dreams happen to allow generalization in the human brain. Here, we ask if the same is true for reinforcement learning agents as well. Given limited experience in a real environment, we use imagination-based reinforcement learning to train a policy on dream-like episodes, where non-imaginative, predicted trajectories are modified through generative augmentat… ▽ More

    Submitted 12 March, 2024; originally announced March 2024.

  14. arXiv:2403.04202  [pdf, other

    cs.MA cs.AI cs.CY cs.LG

    Dynamics of Moral Behavior in Heterogeneous Populations of Learning Agents

    Authors: Elizaveta Tennant, Stephen Hailes, Mirco Musolesi

    Abstract: Growing concerns about safety and alignment of AI systems highlight the importance of embedding moral capabilities in artificial agents: a promising solution is the use of learning from experience, i.e., Reinforcement Learning. In multi-agent (social) environments, complex population-level phenomena may emerge from interactions between individual learning agents. Many of the existing studies rely… ▽ More

    Submitted 16 January, 2025; v1 submitted 6 March, 2024; originally announced March 2024.

    Comments: Presented at AIES 2024 (7th AAAI/ACM Conference on AI, Ethics, and Society - San Jose, CA, USA) - see https://ojs.aaai.org/index.php/AIES/article/view/31736

    Journal ref: Proceedings of the 7th AAAI/ACM Conference on AI, Ethics, and Society (AIES), vol. 7, (2024), pp 1444-1454

  15. arXiv:2402.09193  [pdf, other

    cs.CL cs.AI cs.HC

    (Ir)rationality and Cognitive Biases in Large Language Models

    Authors: Olivia Macmillan-Scott, Mirco Musolesi

    Abstract: Do large language models (LLMs) display rational reasoning? LLMs have been shown to contain human biases due to the data they have been trained on; whether this is reflected in rational reasoning remains less clear. In this paper, we answer this question by evaluating seven language models using tasks from the cognitive psychology literature. We find that, like humans, LLMs display irrationality i… ▽ More

    Submitted 15 February, 2024; v1 submitted 14 February, 2024; originally announced February 2024.

  16. arXiv:2401.11512  [pdf, other

    cs.LG cs.AI cs.IT

    Information-Theoretic State Variable Selection for Reinforcement Learning

    Authors: Charles Westphal, Stephen Hailes, Mirco Musolesi

    Abstract: Identifying the most suitable variables to represent the state is a fundamental challenge in Reinforcement Learning (RL). These variables must efficiently capture the information necessary for making optimal decisions. In order to address this problem, in this paper, we introduce the Transfer Entropy Redundancy Criterion (TERC), an information-theoretic criterion, which determines if there is \tex… ▽ More

    Submitted 21 January, 2024; originally announced January 2024.

    Comments: 47 pages, 12 figures

  17. arXiv:2312.01818  [pdf, other

    cs.AI cs.CY cs.LG cs.MA

    Hybrid Approaches for Moral Value Alignment in AI Agents: a Manifesto

    Authors: Elizaveta Tennant, Stephen Hailes, Mirco Musolesi

    Abstract: Increasing interest in ensuring the safety of next-generation Artificial Intelligence (AI) systems calls for novel approaches to embedding morality into autonomous agents. This goal differs qualitatively from traditional task-specific AI methodologies. In this paper, we provide a systematization of existing approaches to the problem of introducing morality in machines - modelled as a continuum. Ou… ▽ More

    Submitted 16 January, 2025; v1 submitted 4 December, 2023; originally announced December 2023.

  18. arXiv:2311.17165  [pdf, ps, other

    cs.AI cs.CY cs.HC cs.LG cs.MA

    (Ir)rationality in AI: State of the Art, Research Challenges and Open Questions

    Authors: Olivia Macmillan-Scott, Mirco Musolesi

    Abstract: The concept of rationality is central to the field of artificial intelligence. Whether we are seeking to simulate human reasoning, or the goal is to achieve bounded optimality, we generally seek to make artificial agents as rational as possible. Despite the centrality of the concept within AI, there is no unified definition of what constitutes a rational agent. This article provides a survey of ra… ▽ More

    Submitted 11 February, 2025; v1 submitted 28 November, 2023; originally announced November 2023.

  19. arXiv:2311.10026  [pdf, other

    eess.SY cs.LG

    Guaranteeing Control Requirements via Reward Shaping in Reinforcement Learning

    Authors: Francesco De Lellis, Marco Coraggio, Giovanni Russo, Mirco Musolesi, Mario di Bernardo

    Abstract: In addressing control problems such as regulation and tracking through reinforcement learning, it is often required to guarantee that the acquired policy meets essential performance and stability criteria such as a desired settling time and steady-state error prior to deployment. Motivated by this necessity, we present a set of results and a systematic reward shaping procedure that (i) ensures the… ▽ More

    Submitted 20 March, 2024; v1 submitted 16 November, 2023; originally announced November 2023.

  20. arXiv:2310.13576  [pdf, other

    cs.LG cs.AI

    Tree Search in DAG Space with Model-based Reinforcement Learning for Causal Discovery

    Authors: Victor-Alexandru Darvariu, Stephen Hailes, Mirco Musolesi

    Abstract: Identifying causal structure is central to many fields ranging from strategic decision-making to biology and economics. In this work, we propose CD-UCT, a model-based reinforcement learning method for causal discovery based on tree search that builds directed acyclic graphs incrementally. We also formalize and prove the correctness of an efficient algorithm for excluding edges that would introduce… ▽ More

    Submitted 13 February, 2024; v1 submitted 20 October, 2023; originally announced October 2023.

  21. Steps Towards Satisficing Distributed Dynamic Team Trust

    Authors: Edmund R. Hunt, Chris Baber, Mehdi Sobhani, Sanja Milivojevic, Sagir Yusuf, Mirco Musolesi, Patrick Waterson, Sally Maynard

    Abstract: Defining and measuring trust in dynamic, multiagent teams is important in a range of contexts, particularly in defense and security domains. Team members should be trusted to work towards agreed goals and in accordance with shared values. In this paper, our concern is with the definition of goals and values such that it is possible to define 'trust' in a way that is interpretable, and hence usable… ▽ More

    Submitted 4 November, 2023; v1 submitted 11 September, 2023; originally announced September 2023.

  22. arXiv:2308.10721  [pdf, other

    cs.LG cs.AI cs.MA

    CoMIX: A Multi-agent Reinforcement Learning Training Architecture for Efficient Decentralized Coordination and Independent Decision-Making

    Authors: Giovanni Minelli, Mirco Musolesi

    Abstract: Robust coordination skills enable agents to operate cohesively in shared environments, together towards a common goal and, ideally, individually without hindering each other's progress. To this end, this paper presents Coordinated QMIX (CoMIX), a novel training framework for decentralized agents that enables emergent coordination through flexible policies, allowing at the same time independent dec… ▽ More

    Submitted 23 December, 2024; v1 submitted 21 August, 2023; originally announced August 2023.

  23. Reinforcement Learning for Generative AI: State of the Art, Opportunities and Open Research Challenges

    Authors: Giorgio Franceschelli, Mirco Musolesi

    Abstract: Generative Artificial Intelligence (AI) is one of the most exciting developments in Computer Science of the last decade. At the same time, Reinforcement Learning (RL) has emerged as a very successful paradigm for a variety of machine learning tasks. In this survey, we discuss the state of the art, opportunities and open research questions in applying RL to generative AI. In particular, we will dis… ▽ More

    Submitted 8 February, 2024; v1 submitted 31 July, 2023; originally announced August 2023.

    Comments: Published in JAIR at https://www.jair.org/index.php/jair/article/view/15278

    Journal ref: JAIR 79 (2024) 417-446

  24. arXiv:2306.01158  [pdf, other

    cs.LG cs.AI

    Heterogeneous Knowledge for Augmented Modular Reinforcement Learning

    Authors: Lorenz Wolf, Mirco Musolesi

    Abstract: Existing modular Reinforcement Learning (RL) architectures are generally based on reusable components, also allowing for "plug-and-play" integration. However, these modules are homogeneous in nature - in fact, they essentially provide policies obtained via RL through the maximization of individual reward functions. Consequently, such solutions still lack the ability to integrate and process multip… ▽ More

    Submitted 31 October, 2024; v1 submitted 1 June, 2023; originally announced June 2023.

    Comments: 23 pages, 11 figures

  25. On the Creativity of Large Language Models

    Authors: Giorgio Franceschelli, Mirco Musolesi

    Abstract: Large Language Models (LLMs) are revolutionizing several areas of Artificial Intelligence. One of the most remarkable applications is creative writing, e.g., poetry or storytelling: the generated outputs are often of astonishing quality. However, a natural question arises: can LLMs be really considered creative? In this article, we first analyze the development of LLMs under the lens of creativity… ▽ More

    Submitted 13 February, 2025; v1 submitted 27 March, 2023; originally announced April 2023.

    Comments: Published in AI & SOCIETY at https://link.springer.com/article/10.1007/s00146-024-02127-3

    Journal ref: AI & Soc (2024)

  26. arXiv:2301.08491  [pdf, other

    cs.MA cs.AI cs.LG

    Modeling Moral Choices in Social Dilemmas with Multi-Agent Reinforcement Learning

    Authors: Elizaveta Tennant, Stephen Hailes, Mirco Musolesi

    Abstract: Practical uses of Artificial Intelligence (AI) in the real world have demonstrated the importance of embedding moral choices into intelligent agents. They have also highlighted that defining top-down ethical constraints on AI according to any one type of morality is extremely challenging and can pose risks. A bottom-up learning approach may be more appropriate for studying and developing ethical b… ▽ More

    Submitted 30 August, 2023; v1 submitted 20 January, 2023; originally announced January 2023.

    Comments: Accepted at IJCAI 2023 (32nd International Joint Conference on Artificial Intelligence - Macao, S.A.R.)

  27. Investigating the Impact of Direct Punishment on the Emergence of Cooperation in Multi-Agent Reinforcement Learning Systems

    Authors: Nayana Dasgupta, Mirco Musolesi

    Abstract: Solving the problem of cooperation is fundamentally important for the creation and maintenance of functional societies. Problems of cooperation are omnipresent within human society, with examples ranging from navigating busy road junctions to negotiating treaties. As the use of AI becomes more pervasive throughout society, the need for socially intelligent agents capable of navigating these comple… ▽ More

    Submitted 17 June, 2024; v1 submitted 19 January, 2023; originally announced January 2023.

    Comments: 50 pages, 19 figures

    Journal ref: Auton Agent Multi-Agent Syst 39, 19 (2025)

  28. arXiv:2212.01343  [pdf, other

    cs.LG eess.SY math.OC

    CT-DQN: Control-Tutored Deep Reinforcement Learning

    Authors: Francesco De Lellis, Marco Coraggio, Giovanni Russo, Mirco Musolesi, Mario di Bernardo

    Abstract: One of the major challenges in Deep Reinforcement Learning for control is the need for extensive training to learn the policy. Motivated by this, we present the design of the Control-Tutored Deep Q-Networks (CT-DQN) algorithm, a Deep Reinforcement Learning algorithm that leverages a control tutor, i.e., an exogenous control law, to reduce learning time. The tutor can be designed using an approxima… ▽ More

    Submitted 2 December, 2022; originally announced December 2022.

  29. arXiv:2209.05208  [pdf, other

    cs.LG cs.AI cs.NI

    Graph Neural Modeling of Network Flows

    Authors: Victor-Alexandru Darvariu, Stephen Hailes, Mirco Musolesi

    Abstract: Network flow problems, which involve distributing traffic such that the underlying infrastructure is used effectively, are ubiquitous in transportation and logistics. Among them, the general Multi-Commodity Network Flow (MCNF) problem concerns the distribution of multiple flows of different sizes between several sources and sinks, while achieving effective utilization of the links. Due to the appe… ▽ More

    Submitted 18 March, 2024; v1 submitted 12 September, 2022; originally announced September 2022.

  30. arXiv:2205.13578  [pdf, other

    cs.LG cs.AI cs.CR physics.soc-ph

    Dynamic Network Reconfiguration for Entropy Maximization using Deep Reinforcement Learning

    Authors: Christoffel Doorman, Victor-Alexandru Darvariu, Stephen Hailes, Mirco Musolesi

    Abstract: A key problem in network theory is how to reconfigure a graph in order to optimize a quantifiable objective. Given the ubiquity of networked systems, such work has broad practical applications in a variety of situations, ranging from drug and material design to telecommunications. The large decision space of possible reconfigurations, however, makes this problem computationally intensive. In this… ▽ More

    Submitted 27 January, 2023; v1 submitted 26 May, 2022; originally announced May 2022.

    Comments: 10 pages, 6 figures, 1 appendix

    Journal ref: Proceedings of the First Learning on Graphs Conference (LoG 2022), PMLR 198:49:1-49:15

  31. arXiv:2205.12880  [pdf, other

    cs.MA cs.AI cs.LG

    Trust-based Consensus in Multi-Agent Reinforcement Learning Systems

    Authors: Ho Long Fung, Victor-Alexandru Darvariu, Stephen Hailes, Mirco Musolesi

    Abstract: An often neglected issue in multi-agent reinforcement learning (MARL) is the potential presence of unreliable agents in the environment whose deviations from expected behavior can prevent a system from accomplishing its intended tasks. In particular, consensus is a fundamental underpinning problem of cooperative distributed multi-agent systems. Consensus requires different agents, situated in a de… ▽ More

    Submitted 30 May, 2024; v1 submitted 25 May, 2022; originally announced May 2022.

    Comments: Accepted for publication in proceedings of the first Reinforcement Learning Conference (RLC 2024)

  32. arXiv:2201.06118  [pdf, other

    cs.LG cs.AI cs.CY

    DeepCreativity: Measuring Creativity with Deep Learning Techniques

    Authors: Giorgio Franceschelli, Mirco Musolesi

    Abstract: Measuring machine creativity is one of the most fascinating challenges in Artificial Intelligence. This paper explores the possibility of using generative learning techniques for automatic assessment of creativity. The proposed solution does not involve human judgement, it is modular and of general applicability. We introduce a new measure, namely DeepCreativity, based on Margaret Boden's definiti… ▽ More

    Submitted 16 January, 2022; originally announced January 2022.

    Comments: 12 pages, 2 figures

    Journal ref: Intelligenza Artificiale 16, 2 (2022), 151-163

  33. arXiv:2112.06018  [pdf, other

    cs.LG eess.SY math.OC

    Control-Tutored Reinforcement Learning: Towards the Integration of Data-Driven and Model-Based Control

    Authors: F. De Lellis, M. Coraggio, G. Russo, M. Musolesi, M. di Bernardo

    Abstract: We present an architecture where a feedback controller derived on an approximate model of the environment assists the learning process to enhance its data efficiency. This architecture, which we term as Control-Tutored Q-learning (CTQL), is presented in two alternative flavours. The former is based on defining the reward function so that a Boolean condition can be used to determine when the contro… ▽ More

    Submitted 11 December, 2021; originally announced December 2021.

  34. arXiv:2109.08236  [pdf, other

    cs.LG cs.AI

    Reinforcement Learning on Encrypted Data

    Authors: Alberto Jesu, Victor-Alexandru Darvariu, Alessandro Staffolani, Rebecca Montanari, Mirco Musolesi

    Abstract: The growing number of applications of Reinforcement Learning (RL) in real-world domains has led to the development of privacy-preserving techniques due to the inherently sensitive nature of data. Most existing works focus on differential privacy, in which information is revealed in the clear to an agent whose learned model should be robust against information leakage to malicious third parties. Mo… ▽ More

    Submitted 16 September, 2021; originally announced September 2021.

  35. arXiv:2106.06768  [pdf, other

    cs.AI

    Planning Spatial Networks with Monte Carlo Tree Search

    Authors: Victor-Alexandru Darvariu, Stephen Hailes, Mirco Musolesi

    Abstract: We tackle the problem of goal-directed graph construction: given a starting graph, a budget of modifications, and a global objective function, the aim is to find a set of edges whose addition to the graph achieves the maximum improvement in the objective (e.g., communication efficiency). This problem emerges in many networks of great importance for society such as transportation and critical infra… ▽ More

    Submitted 16 February, 2022; v1 submitted 12 June, 2021; originally announced June 2021.

  36. arXiv:2106.06762  [pdf, other

    cs.AI cs.GT cs.LG cs.MA

    Solving Graph-based Public Good Games with Tree Search and Imitation Learning

    Authors: Victor-Alexandru Darvariu, Stephen Hailes, Mirco Musolesi

    Abstract: Public goods games represent insightful settings for studying incentives for individual agents to make contributions that, while costly for each of them, benefit the wider society. In this work, we adopt the perspective of a central planner with a global view of a network of self-interested agents and the goal of maximizing some desired property in the context of a best-shot public goods game. Exi… ▽ More

    Submitted 26 October, 2021; v1 submitted 12 June, 2021; originally announced June 2021.

    Comments: To appear in Proceedings of 35th Conference on Neural Information Processing Systems (NeurIPS 2021)

  37. arXiv:2105.09266  [pdf, ps, other

    cs.CY cs.AI cs.LG

    Copyright in Generative Deep Learning

    Authors: Giorgio Franceschelli, Mirco Musolesi

    Abstract: Machine-generated artworks are now part of the contemporary art scene: they are attracting significant investments and they are presented in exhibitions together with those created by human artists. These artworks are mainly based on generative deep learning techniques, which have seen a formidable development and remarkable refinement in the very recent years. Given the inherent characteristics o… ▽ More

    Submitted 13 February, 2025; v1 submitted 19 May, 2021; originally announced May 2021.

    Comments: Published in Data & Policy at https://www.cambridge.org/core/journals/data-and-policy/article/copyright-in-generative-deep-learning/C401539FDF79A6AC6CEE8C5256508B5E

    Journal ref: Data & Policy. 2022;4:e17

  38. arXiv:2104.02726  [pdf, other

    cs.LG cs.AI cs.CY

    Creativity and Machine Learning: A Survey

    Authors: Giorgio Franceschelli, Mirco Musolesi

    Abstract: There is a growing interest in the area of machine learning and creativity. This survey presents an overview of the history and the state of the art of computational creativity theories, key machine learning techniques (including generative deep learning), and corresponding automatic evaluation methods. After presenting a critical discussion of the key contributions in this area, we outline the cu… ▽ More

    Submitted 13 February, 2025; v1 submitted 6 April, 2021; originally announced April 2021.

    Comments: Published in ACM Computing Surveys at https://dl.acm.org/doi/10.1145/3664595

    Journal ref: ACM Comput. Surv. 56, 11, Article 283 (November 2024), 41 pages

  39. arXiv:2102.07523  [pdf, other

    cs.MA cs.AI

    Cooperation and Reputation Dynamics with Reinforcement Learning

    Authors: Nicolas Anastassacos, Julian García, Stephen Hailes, Mirco Musolesi

    Abstract: Creating incentives for cooperation is a challenge in natural and artificial systems. One potential answer is reputation, whereby agents trade the immediate cost of cooperation for the future benefits of having a good reputation. Game theoretical models have shown that specific social norms can make cooperation stable, but how agents can independently learn to establish effective reputation mechan… ▽ More

    Submitted 15 February, 2021; originally announced February 2021.

    Comments: Published in AAMAS'21, 9 pages

  40. arXiv:2005.10125  [pdf, other

    stat.AP cs.CL stat.ME

    Modelling Grocery Retail Topic Distributions: Evaluation, Interpretability and Stability

    Authors: Mariflor Vega-Carrasco, Jason O'sullivan, Rosie Prior, Ioanna Manolopoulou, Mirco Musolesi

    Abstract: Understanding the shopping motivations behind market baskets has high commercial value in the grocery retail industry. Analyzing shopping transactions demands techniques that can cope with the volume and dimensionality of grocery transactional data while keeping interpretable outcomes. Latent Dirichlet Allocation (LDA) provides a suitable framework to process grocery transactions and to discover a… ▽ More

    Submitted 24 February, 2021; v1 submitted 4 May, 2020; originally announced May 2020.

    Comments: 20 pages, 9 figures

  41. arXiv:2001.11279  [pdf, other

    cs.LG cs.AI stat.ML

    Goal-directed graph construction using reinforcement learning

    Authors: Victor-Alexandru Darvariu, Stephen Hailes, Mirco Musolesi

    Abstract: Graphs can be used to represent and reason about systems and a variety of metrics have been devised to quantify their global characteristics. However, little is currently known about how to construct a graph or improve an existing one given a target objective. In this work, we formulate the construction of a graph as a decision-making process in which a central agent creates topologies by trial an… ▽ More

    Submitted 27 October, 2021; v1 submitted 30 January, 2020; originally announced January 2020.

    Journal ref: Proceedings of the Royal Society A (2021)

  42. arXiv:1912.07662  [pdf, other

    cs.LG cs.AI stat.ML

    Graph Input Representations for Machine Learning Applications in Urban Network Analysis

    Authors: Alessio Pagani, Abhinav Mehrotra, Mirco Musolesi

    Abstract: Understanding and learning the characteristics of network paths has been of particular interest for decades and has led to several successful applications. Such analysis becomes challenging for urban networks as their size and complexity are significantly higher compared to other networks. The state-of-the-art machine learning (ML) techniques allow us to detect hidden patterns and, thus, infer the… ▽ More

    Submitted 11 December, 2019; originally announced December 2019.

  43. arXiv:1902.03185  [pdf, other

    cs.MA cs.AI

    Partner Selection for the Emergence of Cooperation in Multi-Agent Systems Using Reinforcement Learning

    Authors: Nicolas Anastassacos, Stephen Hailes, Mirco Musolesi

    Abstract: Social dilemmas have been widely studied to explain how humans are able to cooperate in society. Considerable effort has been invested in designing artificial agents for social dilemmas that incorporate explicit agent motivations that are chosen to favor coordinated or cooperative responses. The prevalence of this general approach points towards the importance of achieving an understanding of both… ▽ More

    Submitted 28 November, 2019; v1 submitted 8 February, 2019; originally announced February 2019.

    Comments: 8

    Report number: Published in AAAI'20

  44. arXiv:1809.10007  [pdf, other

    cs.MA cs.AI cs.GT cs.LG

    Learning through Probing: a decentralized reinforcement learning architecture for social dilemmas

    Authors: Nicolas Anastassacos, Mirco Musolesi

    Abstract: Multi-agent reinforcement learning has received significant interest in recent years notably due to the advancements made in deep reinforcement learning which have allowed for the developments of new architectures and learning algorithms. Using social dilemmas as the training ground, we present a novel learning architecture, Learning through Probing (LTP), where agents utilize a probing mechanism… ▽ More

    Submitted 22 December, 2018; v1 submitted 26 September, 2018; originally announced September 2018.

    Comments: 9 pages, 4 figures

  45. Predicting the temporal activity patterns of new venues

    Authors: Krittika D'Silva, Anastasios Noulas, Mirco Musolesi, Cecilia Mascolo, Max Sklar

    Abstract: Estimating revenue and business demand of a newly opened venue is paramount as these early stages often involve critical decisions such as first rounds of staffing and resource allocation. Traditionally, this estimation has been performed through coarse-grained measures such as observing numbers in local venues or venues at similar places (e.g., coffee shops around another station in the same city… ▽ More

    Submitted 5 June, 2018; originally announced June 2018.

    Journal ref: EPJ Data Sci. 7 (1) 13 (2018)

  46. arXiv:1803.10133  [pdf, other

    cs.CR cs.AI cs.SI

    You are your Metadata: Identification and Obfuscation of Social Media Users using Metadata Information

    Authors: Beatrice Perez, Mirco Musolesi, Gianluca Stringhini

    Abstract: Metadata are associated to most of the information we produce in our daily interactions and communication in the digital world. Yet, surprisingly, metadata are often still catergorized as non-sensitive. Indeed, in the past, researchers and practitioners have mainly focused on the problem of the identification of a user from the content of a message. In this paper, we use Twitter as a case study… ▽ More

    Submitted 14 May, 2018; v1 submitted 27 March, 2018; originally announced March 2018.

    Comments: 11 pages, 13 figures. Published in the Proceedings of the 12th International AAAI Conference on Web and Social Media (ICWSM 2018). June 2018. Stanford, CA, USA

  47. arXiv:1711.10171  [pdf, other

    cs.HC

    Intelligent Notification Systems: A Survey of the State of the Art and Research Challenges

    Authors: Abhinav Mehrotra, Mirco Musolesi

    Abstract: Notifications provide a unique mechanism for increasing the effectiveness of real-time information delivery systems. However, notifications that demand users' attention at inopportune moments are more likely to have adverse effects and might become a cause of potential disruption rather than proving beneficial to users. In order to address these challenges a variety of intelligent notification mec… ▽ More

    Submitted 2 January, 2018; v1 submitted 28 November, 2017; originally announced November 2017.

  48. arXiv:1711.06350  [pdf, other

    cs.LG cs.AI stat.ML

    Towards Deep Learning Models for Psychological State Prediction using Smartphone Data: Challenges and Opportunities

    Authors: Gatis Mikelsons, Matthew Smith, Abhinav Mehrotra, Mirco Musolesi

    Abstract: There is an increasing interest in exploiting mobile sensing technologies and machine learning techniques for mental health monitoring and intervention. Researchers have effectively used contextual information, such as mobility, communication and mobile phone usage patterns for quantifying individuals' mood and wellbeing. In this paper, we investigate the effectiveness of neural network models for… ▽ More

    Submitted 16 November, 2017; originally announced November 2017.

    Comments: 6 pages, 2 figures, In Proceedings of the NIPS Workshop on Machine Learning for Healthcare 2017 (ML4H 2017). Colocated with NIPS 2017

  49. arXiv:1710.08464  [pdf

    stat.ML cs.CR cs.LG

    Interpretable Machine Learning for Privacy-Preserving Pervasive Systems

    Authors: Benjamin Baron, Mirco Musolesi

    Abstract: Our everyday interactions with pervasive systems generate traces that capture various aspects of human behavior and enable machine learning algorithms to extract latent information about users. In this paper, we propose a machine learning interpretability framework that enables users to understand how these generated traces violate their privacy.

    Submitted 4 June, 2019; v1 submitted 23 October, 2017; originally announced October 2017.

    Journal ref: IEEE Pervasive Computing, 2019

  50. arXiv:1709.06519  [pdf, other

    cs.SI

    Linking Twitter Events With Stock Market Jitters

    Authors: Fani Tsapeli, Nikolaos Bezirgiannidis, Peter Tino, Mirco Musolesi

    Abstract: Predicting investors reactions to financial and political news is important for the early detection of stock market jitters. Evidence from several recent studies suggests that online social media could improve prediction of stock market movements. However, utilizing such information to predict strong stock market fluctuations has not been explored so far. In this work, we propose a novel event det… ▽ More

    Submitted 19 June, 2017; originally announced September 2017.