-
Adaptive Network Intervention for Complex Systems: A Hierarchical Graph Reinforcement Learning Approach
Authors:
Qiliang Chen,
Babak Heydari
Abstract:
Effective governance and steering of behavior in complex multi-agent systems (MAS) are essential for managing system-wide outcomes, particularly in environments where interactions are structured by dynamic networks. In many applications, the goal is to promote pro-social behavior among agents, where network structure plays a pivotal role in shaping these interactions. This paper introduces a Hiera…
▽ More
Effective governance and steering of behavior in complex multi-agent systems (MAS) are essential for managing system-wide outcomes, particularly in environments where interactions are structured by dynamic networks. In many applications, the goal is to promote pro-social behavior among agents, where network structure plays a pivotal role in shaping these interactions. This paper introduces a Hierarchical Graph Reinforcement Learning (HGRL) framework that governs such systems through targeted interventions in the network structure. Operating within the constraints of limited managerial authority, the HGRL framework demonstrates superior performance across a range of environmental conditions, outperforming established baseline methods. Our findings highlight the critical influence of agent-to-agent learning (social learning) on system behavior: under low social learning, the HGRL manager preserves cooperation, forming robust core-periphery networks dominated by cooperators. In contrast, high social learning accelerates defection, leading to sparser, chain-like networks. Additionally, the study underscores the importance of the system manager's authority level in preventing system-wide failures, such as agent rebellion or collapse, positioning HGRL as a powerful tool for dynamic network-based governance.
△ Less
Submitted 30 October, 2024;
originally announced October 2024.
-
Resource Governance in Networked Systems via Integrated Variational Autoencoders and Reinforcement Learning
Authors:
Qiliang Chen,
Babak Heydari
Abstract:
We introduce a framework that integrates variational autoencoders (VAE) with reinforcement learning (RL) to balance system performance and resource usage in multi-agent systems by dynamically adjusting network structures over time. A key innovation of this method is its capability to handle the vast action space of the network structure. This is achieved by combining Variational Auto-Encoder and D…
▽ More
We introduce a framework that integrates variational autoencoders (VAE) with reinforcement learning (RL) to balance system performance and resource usage in multi-agent systems by dynamically adjusting network structures over time. A key innovation of this method is its capability to handle the vast action space of the network structure. This is achieved by combining Variational Auto-Encoder and Deep Reinforcement Learning to control the latent space encoded from the network structures. The proposed method, evaluated on the modified OpenAI particle environment under various scenarios, not only demonstrates superior performance compared to baselines but also reveals interesting strategies and insights through the learned behaviors.
△ Less
Submitted 30 October, 2024;
originally announced October 2024.
-
Instigating Cooperation among LLM Agents Using Adaptive Information Modulation
Authors:
Qiliang Chen,
Sepehr Ilami,
Nunzio Lore,
Babak Heydari
Abstract:
This paper introduces a novel framework combining LLM agents as proxies for human strategic behavior with reinforcement learning (RL) to engage these agents in evolving strategic interactions within team environments. Our approach extends traditional agent-based simulations by using strategic LLM agents (SLA) and introducing dynamic and adaptive governance through a pro-social promoting RL agent (…
▽ More
This paper introduces a novel framework combining LLM agents as proxies for human strategic behavior with reinforcement learning (RL) to engage these agents in evolving strategic interactions within team environments. Our approach extends traditional agent-based simulations by using strategic LLM agents (SLA) and introducing dynamic and adaptive governance through a pro-social promoting RL agent (PPA) that modulates information access across agents in a network, optimizing social welfare and promoting pro-social behavior. Through validation in iterative games, including the prisoner dilemma, we demonstrate that SLA agents exhibit nuanced strategic adaptations. The PPA agent effectively learns to adjust information transparency, resulting in enhanced cooperation rates. This framework offers significant insights into AI-mediated social dynamics, contributing to the deployment of AI in real-world team settings.
△ Less
Submitted 30 October, 2024; v1 submitted 16 September, 2024;
originally announced September 2024.
-
Large Model Strategic Thinking, Small Model Efficiency: Transferring Theory of Mind in Large Language Models
Authors:
Nunzio Lore,
Sepehr Ilami,
Babak Heydari
Abstract:
As the performance of larger, newer Large Language Models continues to improve for strategic Theory of Mind (ToM) tasks, the demand for these state-of-the-art models increases commensurately. However, their deployment is costly both in terms of processing power and time. In this paper, we investigate the feasibility of creating smaller, highly-performing specialized algorithms by way of fine-tunin…
▽ More
As the performance of larger, newer Large Language Models continues to improve for strategic Theory of Mind (ToM) tasks, the demand for these state-of-the-art models increases commensurately. However, their deployment is costly both in terms of processing power and time. In this paper, we investigate the feasibility of creating smaller, highly-performing specialized algorithms by way of fine-tuning. To do this, we first present a large pre-trained model with 20 unique scenarios that combine different social contexts with games of varying social dilemmas, record its answers, and use them for Q&A fine-tuning on a smaller model of the same family. Our focus is on in-context game-theoretic decision-making, the same domain within which human interaction occurs and that requires both a theory of mind (or a semblance thereof) and an understanding of social dynamics. The smaller model is therefore trained not just on the answers provided, but also on the motivations provided by the larger model, which should contain advice and guidelines to navigate both strategic dilemmas and social cues. We find that the fine-tuned smaller language model consistently bridged the gap in performance between the smaller pre-trained version of the model and its larger relative and that its improvements extended in areas and contexts beyond the ones provided in the training examples, including on out-of-sample scenarios that include completely different game structures. On average for all games, through fine-tuning, the smaller model showed a 46% improvement measured as alignment towards the behavior of the larger model, with 100% representing indistinguishable behavior. When presented with out-of-sample social contexts and games, the fine-tuned model still displays remarkable levels of alignment, reaching an improvement of 18% and 28% respectively.
△ Less
Submitted 30 October, 2024; v1 submitted 5 August, 2024;
originally announced August 2024.
-
Improved Robustness for Deep Learning-based Segmentation of Multi-Center Myocardial Perfusion MRI Datasets Using Data Adaptive Uncertainty-guided Space-time Analysis
Authors:
Dilek M. Yalcinkaya,
Khalid Youssef,
Bobak Heydari,
Janet Wei,
Noel Bairey Merz,
Robert Judd,
Rohan Dharmakumar,
Orlando P. Simonetti,
Jonathan W. Weinsaft,
Subha V. Raman,
Behzad Sharif
Abstract:
Background. Fully automatic analysis of myocardial perfusion MRI datasets enables rapid and objective reporting of stress/rest studies in patients with suspected ischemic heart disease. Developing deep learning techniques that can analyze multi-center datasets despite limited training data and variations in software and hardware is an ongoing challenge.
Methods. Datasets from 3 medical centers a…
▽ More
Background. Fully automatic analysis of myocardial perfusion MRI datasets enables rapid and objective reporting of stress/rest studies in patients with suspected ischemic heart disease. Developing deep learning techniques that can analyze multi-center datasets despite limited training data and variations in software and hardware is an ongoing challenge.
Methods. Datasets from 3 medical centers acquired at 3T (n = 150 subjects) were included: an internal dataset (inD; n = 95) and two external datasets (exDs; n = 55) used for evaluating the robustness of the trained deep neural network (DNN) models against differences in pulse sequence (exD-1) and scanner vendor (exD-2). A subset of inD (n = 85) was used for training/validation of a pool of DNNs for segmentation, all using the same spatiotemporal U-Net architecture and hyperparameters but with different parameter initializations. We employed a space-time sliding-patch analysis approach that automatically yields a pixel-wise "uncertainty map" as a byproduct of the segmentation process. In our approach, a given test case is segmented by all members of the DNN pool and the resulting uncertainty maps are leveraged to automatically select the "best" one among the pool of solutions.
Results. The proposed DAUGS analysis approach performed similarly to the established approach on the internal dataset (p = n.s.) whereas it significantly outperformed on the external datasets (p < 0.005 for exD-1 and exD-2). Moreover, the number of image series with "failed" segmentation was significantly lower for the proposed vs. the established approach (4.3% vs. 17.1%, p < 0.0005).
Conclusions. The proposed DAUGS analysis approach has the potential to improve the robustness of deep learning methods for segmentation of multi-center stress perfusion datasets with variations in the choice of pulse sequence, site location or scanner vendor.
△ Less
Submitted 8 August, 2024;
originally announced August 2024.
-
Platform-Driven Collaboration Patterns: Structural Evolution Over Time and Scale
Authors:
Negin Maddah,
Babak Heydari
Abstract:
Within an increasingly digitalized organizational landscape, this research delves into the dynamics of decentralized collaboration, contrasting it with traditional collaboration models. An effective capturing of high-level collaborations (beyond direct massages) is introduced as the network construction methodology including both temporal and content dimensions of user collaborations - an Alternat…
▽ More
Within an increasingly digitalized organizational landscape, this research delves into the dynamics of decentralized collaboration, contrasting it with traditional collaboration models. An effective capturing of high-level collaborations (beyond direct massages) is introduced as the network construction methodology including both temporal and content dimensions of user collaborations - an Alternating Timed Interaction (ATI) metric as the first aspect, and a quantitative strategy of thematic similarity as the second aspect. This study validates three hypotheses that collectively underscore the complexities of digital team dynamics within sociotechnical systems: Firstly, it establishes the significant influence of problem context on team structures in work environments, emphasizing the need to consider the specific nature of tasks in analyzing collaborative dynamics. Secondly, the study reveals specific evolving patterns of team structures on digital platforms concerning team size and artifact maturity. Lastly, it identifies substantial differences in team structure patterns between digital platforms and traditional organizational settings, underscoring the unexplored nature of digital collaboration dynamics. The findings of this study are instrumental for organizations navigating the digital era, offering insights into effective knowledge sharing in the decentralized leadership of digital teams. By mapping out network structures and collaborative patterns, this study, with a focus on Wikipedia as a representative digital platform, paves the way for strategic interventions to optimize digital team dynamics and align them with broader organizational goals.
△ Less
Submitted 23 February, 2024; v1 submitted 19 February, 2024;
originally announced February 2024.
-
Strategic Behavior of Large Language Models: Game Structure vs. Contextual Framing
Authors:
Nunzio Lorè,
Babak Heydari
Abstract:
This paper investigates the strategic decision-making capabilities of three Large Language Models (LLMs): GPT-3.5, GPT-4, and LLaMa-2, within the framework of game theory. Utilizing four canonical two-player games -- Prisoner's Dilemma, Stag Hunt, Snowdrift, and Prisoner's Delight -- we explore how these models navigate social dilemmas, situations where players can either cooperate for a collectiv…
▽ More
This paper investigates the strategic decision-making capabilities of three Large Language Models (LLMs): GPT-3.5, GPT-4, and LLaMa-2, within the framework of game theory. Utilizing four canonical two-player games -- Prisoner's Dilemma, Stag Hunt, Snowdrift, and Prisoner's Delight -- we explore how these models navigate social dilemmas, situations where players can either cooperate for a collective benefit or defect for individual gain. Crucially, we extend our analysis to examine the role of contextual framing, such as diplomatic relations or casual friendships, in shaping the models' decisions. Our findings reveal a complex landscape: while GPT-3.5 is highly sensitive to contextual framing, it shows limited ability to engage in abstract strategic reasoning. Both GPT-4 and LLaMa-2 adjust their strategies based on game structure and context, but LLaMa-2 exhibits a more nuanced understanding of the games' underlying mechanics. These results highlight the current limitations and varied proficiencies of LLMs in strategic decision-making, cautioning against their unqualified use in tasks requiring complex strategic reasoning.
△ Less
Submitted 11 September, 2023;
originally announced September 2023.
-
Temporal Uncertainty Localization to Enable Human-in-the-loop Analysis of Dynamic Contrast-enhanced Cardiac MRI Datasets
Authors:
Dilek M. Yalcinkaya,
Khalid Youssef,
Bobak Heydari,
Orlando Simonetti,
Rohan Dharmakumar,
Subha Raman,
Behzad Sharif
Abstract:
Dynamic contrast-enhanced (DCE) cardiac magnetic resonance imaging (CMRI) is a widely used modality for diagnosing myocardial blood flow (perfusion) abnormalities. During a typical free-breathing DCE-CMRI scan, close to 300 time-resolved images of myocardial perfusion are acquired at various contrast "wash in/out" phases. Manual segmentation of myocardial contours in each time-frame of a DCE image…
▽ More
Dynamic contrast-enhanced (DCE) cardiac magnetic resonance imaging (CMRI) is a widely used modality for diagnosing myocardial blood flow (perfusion) abnormalities. During a typical free-breathing DCE-CMRI scan, close to 300 time-resolved images of myocardial perfusion are acquired at various contrast "wash in/out" phases. Manual segmentation of myocardial contours in each time-frame of a DCE image series can be tedious and time-consuming, particularly when non-rigid motion correction has failed or is unavailable. While deep neural networks (DNNs) have shown promise for analyzing DCE-CMRI datasets, a "dynamic quality control" (dQC) technique for reliably detecting failed segmentations is lacking. Here we propose a new space-time uncertainty metric as a dQC tool for DNN-based segmentation of free-breathing DCE-CMRI datasets by validating the proposed metric on an external dataset and establishing a human-in-the-loop framework to improve the segmentation results. In the proposed approach, we referred the top 10% most uncertain segmentations as detected by our dQC tool to the human expert for refinement. This approach resulted in a significant increase in the Dice score (p<0.001) and a notable decrease in the number of images with failed segmentation (16.2% to 11.3%) whereas the alternative approach of randomly selecting the same number of segmentations for human referral did not achieve any significant improvement. Our results suggest that the proposed dQC framework has the potential to accurately identify poor-quality segmentations and may enable efficient DNN-based analysis of DCE-CMRI in a human-in-the-loop pipeline for clinical interpretation and reporting of dynamic CMRI datasets.
△ Less
Submitted 13 November, 2023; v1 submitted 25 August, 2023;
originally announced August 2023.
-
An Incentive-Compatible Scheme for Electricity Cooperatives: An Axiomatic Approach
Authors:
Abbas Ehsanfar,
Babak Heydari
Abstract:
This paper introduces a new scheme for autonomous electricity cooperatives, called predictive cooperative (PCP), which aggregates commercial and residential electricity consumers and participates in the electricity market on behalf of its members. An axiomatic approach is proposed to calculate the day-ahead bid and to disaggregate the collective cost among participating consumers. The resulting fo…
▽ More
This paper introduces a new scheme for autonomous electricity cooperatives, called predictive cooperative (PCP), which aggregates commercial and residential electricity consumers and participates in the electricity market on behalf of its members. An axiomatic approach is proposed to calculate the day-ahead bid and to disaggregate the collective cost among participating consumers. The resulting formulation is shown to keep the members incentivized to both participate in the cooperative and remain truthful in reporting their expected loads. The scheme is implemented using PJM (world's largest wholesale electricity market) real-time and day-ahead price data for 2015 and a collection of residential and commercial load profiles. The model performance of this framework is compared to that of real-time pricing (RTP) scheme, in which wholesale market prices are directly applied to individual consumers. The results show truthful load announcement by consumers, reduction in electricity price variation for all consumers, and comparative benefits for participants.
△ Less
Submitted 3 August, 2016;
originally announced August 2016.
-
Distributed or Monolithic? A Computational Architecture Decision Framework
Authors:
Mohsen Mosleh,
Kia Dalili,
Babak Heydari
Abstract:
Distributed architectures have become ubiquitous in many complex technical and socio-technical systems because of their role in improving uncertainty management, accommodating multiple stakeholders, and increasing scalability and evolvability. This departure from monolithic architectures provides a system with more flexibility and robustness in response to uncertainties that it may confront during…
▽ More
Distributed architectures have become ubiquitous in many complex technical and socio-technical systems because of their role in improving uncertainty management, accommodating multiple stakeholders, and increasing scalability and evolvability. This departure from monolithic architectures provides a system with more flexibility and robustness in response to uncertainties that it may confront during its lifetime. Distributed architecture does not provide benefits only, as it can increase cost and complexity of the system and result in potential instabilities. The mechanisms behind this trade-off, however, are analogous to those of the widely-studied transition from integrated to modular architectures. In this paper, we use a conceptual decision framework that unifies modularity and distributed architecture on a five-stage systems architecture spectrum. We add an extensive computational layer to the framework and explain how this can enhance decision making about the level of modularity of the architecture. We then apply it to a simplified demonstration of the Defense Advanced Research Projects Agency (DARPA) fractionated satellite program. Through simulation, we calculate the net value that is gained (or lost) by migrating from a monolithic architecture to a distributed architecture and show how this value changes as a function of uncertainties in the environment and various system parameters. Additionally, we use Value at Risk as a measure for the risk of losing the value of distributed architecture, given its inherent uncertainty.
△ Less
Submitted 2 August, 2016;
originally announced August 2016.
-
Distributed Resource Management in Systems of Systems: An Architecture Perspective
Authors:
Mohsen Mosleh,
Peter Ludlow,
Babak Heydari
Abstract:
This paper introduces a framework for studying the interactions of autonomous system components and the design of the connectivity structure in Systems of Systems (SoSs). This framework, which uses complex network models, is also used to study the connectivity structure's impact on resource management. We discuss resource sharing as a mechanism that adds a level of flexibility to distributed syste…
▽ More
This paper introduces a framework for studying the interactions of autonomous system components and the design of the connectivity structure in Systems of Systems (SoSs). This framework, which uses complex network models, is also used to study the connectivity structure's impact on resource management. We discuss resource sharing as a mechanism that adds a level of flexibility to distributed systems and describe the connectivity structures that enhance components' access to the resources available within the system. The framework introduced in this paper explicitly incorporates costs of connection and the benefits that are received by direct and indirect access to resources and provides measures of the optimality of connectivity structures. We discuss central and a distributed schemes that, respectively, represent systems in which a central planner determines the connectivity structure and systems in which distributed components are allowed to add and sever connections to improve their own resource access. Furthermore, we identify optimal connectivity structures for systems with various heterogeneity conditions.
△ Less
Submitted 4 August, 2016; v1 submitted 7 April, 2016;
originally announced April 2016.
-
Efficient Network Structures with Separable Heterogeneous Connection Costs
Authors:
Babak Heydari,
Mohsen Mosleh,
Kia Dalili
Abstract:
We introduce a heterogeneous connection model for network formation to capture the effect of cost heterogeneity on the structure of efficient networks. In the proposed model, connection costs are assumed to be separable, which means the total connection cost for each agent is uniquely proportional to its degree. For these sets of networks, we provide the analytical solution for the efficient netwo…
▽ More
We introduce a heterogeneous connection model for network formation to capture the effect of cost heterogeneity on the structure of efficient networks. In the proposed model, connection costs are assumed to be separable, which means the total connection cost for each agent is uniquely proportional to its degree. For these sets of networks, we provide the analytical solution for the efficient network and discuss stability impli- cations. We show that the efficient network exhibits a core-periphery structure, and for a given density, we find a lower bound for clustering coefficient of the efficient network.
△ Less
Submitted 11 December, 2015; v1 submitted 24 April, 2015;
originally announced April 2015.