-
Parameter Estimation using Reinforcement Learning Causal Curiosity: Limits and Challenges
Authors:
Miguel Arana-Catania,
Weisi Guo
Abstract:
Causal understanding is important in many disciplines of science and engineering, where we seek to understand how different factors in the system causally affect an experiment or situation and pave a pathway towards creating effective or optimising existing models. Examples of use cases are autonomous exploration and modelling of unknown environments or assessing key variables in optimising large…
▽ More
Causal understanding is important in many disciplines of science and engineering, where we seek to understand how different factors in the system causally affect an experiment or situation and pave a pathway towards creating effective or optimising existing models. Examples of use cases are autonomous exploration and modelling of unknown environments or assessing key variables in optimising large complex systems. In this paper, we analyse a Reinforcement Learning approach called Causal Curiosity, which aims to estimate as accurately and efficiently as possible, without directly measuring them, the value of factors that causally determine the dynamics of a system. Whilst the idea presents a pathway forward, measurement accuracy is the foundation of methodology effectiveness. Focusing on the current causal curiosity's robotic manipulator, we present for the first time a measurement accuracy analysis of the future potentials and current limitations of this technique and an analysis of its sensitivity and confounding factor disentanglement capability - crucial for causal analysis. As a result of our work, we promote proposals for an improved and efficient design of Causal Curiosity methods to be applied to real-world complex scenarios.
△ Less
Submitted 13 May, 2025;
originally announced May 2025.
-
Probing the co-evolution of SMBHs and their hosts from scaling relations pairwise residuals: dominance of stellar velocity dispersion and host halo mass
Authors:
Francesco Shankar,
Mariangela Bernardi,
Daniel Roberts,
Miguel Arana-Catania,
Tobias Grubenmann,
Melanie Habouzit,
Amy Smith,
Christopher Marsden,
Karthik Mahesh Varadarajan,
Alba Vega Alonso Tetilla,
Daniel Anglés-Alcázar,
Lumen Boco,
Duncan Farrah,
Hao Fu,
Henryk Haniewicz,
Andrea Lapi,
Christopher C. Lovell,
Nicola Menci,
Meredith Powell,
Federica Ricci
Abstract:
The correlations between Supermassive Black Holes (SMBHs) and their host galaxies still defy our understanding from both the observational and theoretical perspectives. Here we perform pairwise residual analysis on the latest sample of local inactive galaxies with a uniform calibration of their photometric properties and with dynamically measured masses of their central SMBHs. The residuals reveal…
▽ More
The correlations between Supermassive Black Holes (SMBHs) and their host galaxies still defy our understanding from both the observational and theoretical perspectives. Here we perform pairwise residual analysis on the latest sample of local inactive galaxies with a uniform calibration of their photometric properties and with dynamically measured masses of their central SMBHs. The residuals reveal that stellar velocity dispersion $σ$ and, possibly host dark matter halo mass $M_{\rm halo}$, appear as the galactic properties most correlated with SMBH mass, with a secondary (weaker) correlation with spheroidal (bulge) mass $M_{\rm sph}$, as also corroborated by additional Machine Learning tests. These findings may favour energetic/kinetic feedback from Active Galactic Nuclei (AGN) as the main driver in shaping SMBH scaling relations. Two state-of-the-art hydrodynamic simulations, inclusive of kinetic AGN feedback, are able to broadly capture the mean trends observed in the residuals, although they tend to either favour $M_{\rm sph}$ as the most fundamental property, or generate too flat residuals. Increasing AGN feedback kinetic output does not improve the comparison with the data. In the Appendix we also show that the galaxies with dynamically measured SMBHs are biased high in $σ$ at fixed luminosity with respect to the full sample of local galaxies, proving that this bias is not a byproduct of stellar mass discrepancies. Overall, our results suggest that probing the SMBH-galaxy scaling relations in terms of total stellar mass alone may induce biases, and that either current data sets are incomplete, and/or that more insightful modelling is required to fully reproduce observations.
△ Less
Submitted 5 May, 2025;
originally announced May 2025.
-
A causal learning approach to in-orbit inertial parameter estimation for multi-payload deployers
Authors:
Konstantinos Platanitis,
Miguel Arana-Catania,
Saurabh Upadhyay,
Leonard Felicetti
Abstract:
This paper discusses an approach to inertial parameter estimation for the case of cargo carrying spacecraft that is based on causal learning, i.e. learning from the responses of the spacecraft, under actuation. Different spacecraft configurations (inertial parameter sets) are simulated under different actuation profiles, in order to produce an optimised time-series clustering classifier that can b…
▽ More
This paper discusses an approach to inertial parameter estimation for the case of cargo carrying spacecraft that is based on causal learning, i.e. learning from the responses of the spacecraft, under actuation. Different spacecraft configurations (inertial parameter sets) are simulated under different actuation profiles, in order to produce an optimised time-series clustering classifier that can be used to distinguish between them. The actuation is comprised of finite sequences of constant inputs that are applied in order, based on typical actuators available. By learning from the system's responses across multiple input sequences, and then applying measures of time-series similarity and F1-score, an optimal actuation sequence can be chosen either for one specific system configuration or for the overall set of possible configurations. This allows for both estimation of the inertial parameter set without any prior knowledge of state, as well as validation of transitions between different configurations after a deployment event. The optimisation of the actuation sequence is handled by a reinforcement learning model that uses the proximal policy optimisation (PPO) algorithm, by repeatedly trying different sequences and evaluating the impact on classifier performance according to a multi-objective metric.
△ Less
Submitted 21 January, 2025;
originally announced January 2025.
-
Machine Learning Information Retrieval and Summarisation to Support Systematic Review on Outcomes Based Contracting
Authors:
Iman Munire Bilal,
Zheng Fang,
Miguel Arana-Catania,
Felix-Anselm van Lier,
Juliana Outes Velarde,
Harry Bregazzi,
Eleanor Carter,
Mara Airoldi,
Rob Procter
Abstract:
As academic literature proliferates, traditional review methods are increasingly challenged by the sheer volume and diversity of available research. This article presents a study that aims to address these challenges by enhancing the efficiency and scope of systematic reviews in the social sciences through advanced machine learning (ML) and natural language processing (NLP) tools. In particular, w…
▽ More
As academic literature proliferates, traditional review methods are increasingly challenged by the sheer volume and diversity of available research. This article presents a study that aims to address these challenges by enhancing the efficiency and scope of systematic reviews in the social sciences through advanced machine learning (ML) and natural language processing (NLP) tools. In particular, we focus on automating stages within the systematic reviewing process that are time-intensive and repetitive for human annotators and which lend themselves to immediate scalability through tools such as information retrieval and summarisation guided by expert advice. The article concludes with a summary of lessons learnt regarding the integrated approach towards systematic reviews and future directions for improvement, including explainability.
△ Less
Submitted 11 December, 2024;
originally announced December 2024.
-
Deep Autoencoders for Unsupervised Anomaly Detection in Wildfire Prediction
Authors:
İrem Üstek,
Miguel Arana-Catania,
Alexander Farr,
Ivan Petrunin
Abstract:
Wildfires pose a significantly increasing hazard to global ecosystems due to the climate crisis. Due to its complex nature, there is an urgent need for innovative approaches to wildfire prediction, such as machine learning. This research took a unique approach, differentiating from classical supervised learning, and addressed the gap in unsupervised wildfire prediction using autoencoders and clust…
▽ More
Wildfires pose a significantly increasing hazard to global ecosystems due to the climate crisis. Due to its complex nature, there is an urgent need for innovative approaches to wildfire prediction, such as machine learning. This research took a unique approach, differentiating from classical supervised learning, and addressed the gap in unsupervised wildfire prediction using autoencoders and clustering techniques for anomaly detection. Historical weather and normalised difference vegetation index datasets of Australia for 2005 - 2021 were utilised. Two main unsupervised approaches were analysed. The first used a deep autoencoder to obtain latent features, which were then fed into clustering models, isolation forest, local outlier factor and one-class SVM for anomaly detection. The second approach used a deep autoencoder to reconstruct the input data and use reconstruction errors to identify anomalies. Long Short-Term Memory (LSTM) autoencoders and fully connected (FC) autoencoders were employed in this part, both in an unsupervised way learning only from nominal data. The FC autoencoder outperformed its counterparts, achieving an accuracy of 0.71, an F1-score of 0.74, and an MCC of 0.42. These findings highlight the practicality of this method, as it effectively predicts wildfires in the absence of ground truth, utilising an unsupervised learning technique.
△ Less
Submitted 14 November, 2024;
originally announced November 2024.
-
Causal Reinforcement Learning for Optimisation of Robot Dynamics in Unknown Environments
Authors:
Julian Gerald Dcruz,
Sam Mahoney,
Jia Yun Chua,
Adoundeth Soukhabandith,
John Mugabe,
Weisi Guo,
Miguel Arana-Catania
Abstract:
Autonomous operations of robots in unknown environments are challenging due to the lack of knowledge of the dynamics of the interactions, such as the objects' movability. This work introduces a novel Causal Reinforcement Learning approach to enhancing robotics operations and applies it to an urban search and rescue (SAR) scenario. Our proposed machine learning architecture enables robots to learn…
▽ More
Autonomous operations of robots in unknown environments are challenging due to the lack of knowledge of the dynamics of the interactions, such as the objects' movability. This work introduces a novel Causal Reinforcement Learning approach to enhancing robotics operations and applies it to an urban search and rescue (SAR) scenario. Our proposed machine learning architecture enables robots to learn the causal relationships between the visual characteristics of the objects, such as texture and shape, and the objects' dynamics upon interaction, such as their movability, significantly improving their decision-making processes. We conducted causal discovery and RL experiments demonstrating the Causal RL's superior performance, showing a notable reduction in learning times by over 24.5% in complex situations, compared to non-causal models.
△ Less
Submitted 20 September, 2024;
originally announced September 2024.
-
Spacecraft inertial parameters estimation using time series clustering and reinforcement learning
Authors:
Konstantinos Platanitis,
Miguel Arana-Catania,
Leonardo Capicchiano,
Saurabh Upadhyay,
Leonard Felicetti
Abstract:
This paper presents a machine learning approach to estimate the inertial parameters of a spacecraft in cases when those change during operations, e.g. multiple deployments of payloads, unfolding of appendages and booms, propellant consumption as well as during in-orbit servicing and active debris removal operations. The machine learning approach uses time series clustering together with an optimis…
▽ More
This paper presents a machine learning approach to estimate the inertial parameters of a spacecraft in cases when those change during operations, e.g. multiple deployments of payloads, unfolding of appendages and booms, propellant consumption as well as during in-orbit servicing and active debris removal operations. The machine learning approach uses time series clustering together with an optimised actuation sequence generated by reinforcement learning to facilitate distinguishing among different inertial parameter sets. The performance of the proposed strategy is assessed against the case of a multi-satellite deployment system showing that the algorithm is resilient towards common disturbances in such kinds of operations.
△ Less
Submitted 6 August, 2024;
originally announced August 2024.
-
Wind Estimation in Unmanned Aerial Vehicles with Causal Machine Learning
Authors:
Abdulaziz Alwalan,
Miguel Arana-Catania
Abstract:
In this work we demonstrate the possibility of estimating the wind environment of a UAV without specialised sensors, using only the UAV's trajectory, applying a causal machine learning approach. We implement the causal curiosity method which combines machine learning times series classification and clustering with a causal framework. We analyse three distinct wind environments: constant wind, shea…
▽ More
In this work we demonstrate the possibility of estimating the wind environment of a UAV without specialised sensors, using only the UAV's trajectory, applying a causal machine learning approach. We implement the causal curiosity method which combines machine learning times series classification and clustering with a causal framework. We analyse three distinct wind environments: constant wind, shear wind, and turbulence, and explore different optimisation strategies for optimal UAV manoeuvres to estimate the wind conditions. The proposed approach can be used to design optimal trajectories in challenging weather conditions, and to avoid specialised sensors that add to the UAV's weight and compromise its functionality.
△ Less
Submitted 1 July, 2024;
originally announced July 2024.
-
SyROCCo: Enhancing Systematic Reviews using Machine Learning
Authors:
Zheng Fang,
Miguel Arana-Catania,
Felix-Anselm van Lier,
Juliana Outes Velarde,
Harry Bregazzi,
Mara Airoldi,
Eleanor Carter,
Rob Procter
Abstract:
The sheer number of research outputs published every year makes systematic reviewing increasingly time- and resource-intensive. This paper explores the use of machine learning techniques to help navigate the systematic review process. ML has previously been used to reliably 'screen' articles for review - that is, identify relevant articles based on reviewers' inclusion criteria. The application of…
▽ More
The sheer number of research outputs published every year makes systematic reviewing increasingly time- and resource-intensive. This paper explores the use of machine learning techniques to help navigate the systematic review process. ML has previously been used to reliably 'screen' articles for review - that is, identify relevant articles based on reviewers' inclusion criteria. The application of ML techniques to subsequent stages of a review, however, such as data extraction and evidence mapping, is in its infancy. We therefore set out to develop a series of tools that would assist in the profiling and analysis of 1,952 publications on the theme of 'outcomes-based contracting'. Tools were developed for the following tasks: assign publications into 'policy area' categories; identify and extract key information for evidence mapping, such as organisations, laws, and geographical information; connect the evidence base to an existing dataset on the same topic; and identify subgroups of articles that may share thematic content. An interactive tool using these techniques and a public dataset with their outputs have been released. Our results demonstrate the utility of ML techniques to enhance evidence accessibility and analysis within the systematic review processes. These efforts show promise in potentially yielding substantial efficiencies for future systematic reviewing and for broadening their analytical scope. Our work suggests that there may be implications for the ease with which policymakers and practitioners can access evidence. While ML techniques seem poised to play a significant role in bridging the gap between research and policy by offering innovative ways of gathering, accessing, and analysing data from systematic reviews, we also highlight their current limitations and the need to exercise caution in their application, particularly given the potential for errors and biases.
△ Less
Submitted 24 June, 2024;
originally announced June 2024.
-
Autonomous Robotic Arm Manipulation for Planetary Missions using Causal Machine Learning
Authors:
C. McDonnell,
M. Arana-Catania,
S. Upadhyay
Abstract:
Autonomous robotic arm manipulators have the potential to make planetary exploration and in-situ resource utilization missions more time efficient and productive, as the manipulator can handle the objects itself and perform goal-specific actions. We train a manipulator to autonomously study objects of which it has no prior knowledge, such as planetary rocks. This is achieved using causal machine l…
▽ More
Autonomous robotic arm manipulators have the potential to make planetary exploration and in-situ resource utilization missions more time efficient and productive, as the manipulator can handle the objects itself and perform goal-specific actions. We train a manipulator to autonomously study objects of which it has no prior knowledge, such as planetary rocks. This is achieved using causal machine learning in a simulated planetary environment. Here, the manipulator interacts with objects, and classifies them based on differing causal factors. These are parameters, such as mass or friction coefficient, that causally determine the outcomes of its interactions. Through reinforcement learning, the manipulator learns to interact in ways that reveal the underlying causal factors. We show that this method works even without any prior knowledge of the objects, or any previously-collected training data. We carry out the training in planetary exploration conditions, with realistic manipulator models.
△ Less
Submitted 1 March, 2024;
originally announced March 2024.
-
Causal Discovery to Understand Hot Corrosion
Authors:
A. Varghese,
M. Arana-Catania,
S. Mori,
A. Encinas-Oropesa,
J. Sumner
Abstract:
Gas turbine superalloys experience hot corrosion, driven by factors including corrosive deposit flux, temperature, gas composition, and component material. The full mechanism still needs clarification and research often focuses on laboratory work. As such, there is interest in causal discovery to confirm the significance of factors and identify potential missing causal relationships or co-dependen…
▽ More
Gas turbine superalloys experience hot corrosion, driven by factors including corrosive deposit flux, temperature, gas composition, and component material. The full mechanism still needs clarification and research often focuses on laboratory work. As such, there is interest in causal discovery to confirm the significance of factors and identify potential missing causal relationships or co-dependencies between these factors. The causal discovery algorithm Fast Causal Inference (FCI) has been trialled on a small set of laboratory data, with the outputs evaluated for their significance to corrosion propagation, and compared to existing mechanistic understanding. FCI identified the salt deposition flux as the most influential corrosion variable for this limited dataset. However, HCl was the second most influential for pitting regions, compared to temperature for more uniformly corroding regions. Thus FCI generated causal links aligned with literature from a randomised corrosion dataset, while also identifying the presence of two different degradation modes in operation.
△ Less
Submitted 12 February, 2024;
originally announced February 2024.
-
Some Observations on Fact-Checking Work with Implications for Computational Support
Authors:
Rob Procter,
Miguel Arana-Catania,
Yulan He,
Maria Liakata,
Arkaitz Zubiaga,
Elena Kochkina,
Runcong Zhao
Abstract:
Social media and user-generated content (UGC) have become increasingly important features of journalistic work in a number of different ways. However, the growth of misinformation means that news organisations have had devote more and more resources to determining its veracity and to publishing corrections if it is found to be misleading. In this work, we present the results of interviews with eig…
▽ More
Social media and user-generated content (UGC) have become increasingly important features of journalistic work in a number of different ways. However, the growth of misinformation means that news organisations have had devote more and more resources to determining its veracity and to publishing corrections if it is found to be misleading. In this work, we present the results of interviews with eight members of fact-checking teams from two organisations. Team members described their fact-checking processes and the challenges they currently face in completing a fact-check in a robust and timely way. The former reveals, inter alia, significant differences in fact-checking practices and the role played by collaboration between team members. We conclude with a discussion of the implications for the development and application of computational tools, including where computational tool support is currently lacking and the importance of being able to accommodate different fact-checking practices.
△ Less
Submitted 6 July, 2023; v1 submitted 3 May, 2023;
originally announced May 2023.
-
PANACEA: An Automated Misinformation Detection System on COVID-19
Authors:
Runcong Zhao,
Miguel Arana-Catania,
Lixing Zhu,
Elena Kochkina,
Lin Gui,
Arkaitz Zubiaga,
Rob Procter,
Maria Liakata,
Yulan He
Abstract:
In this demo, we introduce a web-based misinformation detection system PANACEA on COVID-19 related claims, which has two modules, fact-checking and rumour detection. Our fact-checking module, which is supported by novel natural language inference methods with a self-attention network, outperforms state-of-the-art approaches. It is also able to give automated veracity assessment and ranked supporti…
▽ More
In this demo, we introduce a web-based misinformation detection system PANACEA on COVID-19 related claims, which has two modules, fact-checking and rumour detection. Our fact-checking module, which is supported by novel natural language inference methods with a self-attention network, outperforms state-of-the-art approaches. It is also able to give automated veracity assessment and ranked supporting evidence with the stance towards the claim to be checked. In addition, PANACEA adapts the bi-directional graph convolutional networks model, which is able to detect rumours based on comment networks of related tweets, instead of relying on the knowledge base. This rumour detection module assists by warning the users in the early stages when a knowledge base may not be available.
△ Less
Submitted 28 February, 2023;
originally announced March 2023.
-
Embedding digital participatory budgeting within local government: motivations, strategies and barriers faced
Authors:
Jonathan Davies,
Miguel Arana-Catania,
Rob Procter
Abstract:
The challenging task of embedding innovative participatory processes and technologies within local government often falls upon local council officers. Using qualitative data collection and analysis, we investigate the ongoing work of Scottish local councils seeking to run the process of participatory budgeting (PB) within their institution, the use of digital platforms to support this and the chal…
▽ More
The challenging task of embedding innovative participatory processes and technologies within local government often falls upon local council officers. Using qualitative data collection and analysis, we investigate the ongoing work of Scottish local councils seeking to run the process of participatory budgeting (PB) within their institution, the use of digital platforms to support this and the challenges faced. In doing so this paper draws on empirical material to support the growing discussion on the dynamics or forces behind embedding. Our analysis shows that formal agreement alone does not make the process a certainty. Local council officers must work as mediators in the transitional space between representative structures and new, innovative ways of working, unsettling the entrenched power dynamics. To do so they must be well trained and well resourced, including the ability to use digital platforms effectively as part of the process. This provides the necessary, accessible, transparent and deliberative space for participation.
△ Less
Submitted 26 September, 2022;
originally announced September 2022.
-
PHEMEPlus: Enriching Social Media Rumour Verification with External Evidence
Authors:
John Dougrez-Lewis,
Elena Kochkina,
M. Arana-Catania,
Maria Liakata,
Yulan He
Abstract:
Work on social media rumour verification utilises signals from posts, their propagation and users involved. Other lines of work target identifying and fact-checking claims based on information from Wikipedia, or trustworthy news articles without considering social media context. However works combining the information from social media with external evidence from the wider web are lacking. To faci…
▽ More
Work on social media rumour verification utilises signals from posts, their propagation and users involved. Other lines of work target identifying and fact-checking claims based on information from Wikipedia, or trustworthy news articles without considering social media context. However works combining the information from social media with external evidence from the wider web are lacking. To facilitate research in this direction, we release a novel dataset, PHEMEPlus, an extension of the PHEME benchmark, which contains social media conversations as well as relevant external evidence for each rumour. We demonstrate the effectiveness of incorporating such evidence in improving rumour verification models. Additionally, as part of the evidence collection, we evaluate various ways of query formulation to identify the most effective method.
△ Less
Submitted 28 July, 2022;
originally announced July 2022.
-
Supporting peace negotiations in the Yemen war through machine learning
Authors:
M. Arana-Catania,
F. A. Van Lier,
Rob Procter
Abstract:
Today's conflicts are becoming increasingly complex, fluid and fragmented, often involving a host of national and international actors with multiple and often divergent interests. This development poses significant challenges for conflict mediation, as mediators struggle to make sense of conflict dynamics, such as the range of conflict parties and the evolution of their political positions, the di…
▽ More
Today's conflicts are becoming increasingly complex, fluid and fragmented, often involving a host of national and international actors with multiple and often divergent interests. This development poses significant challenges for conflict mediation, as mediators struggle to make sense of conflict dynamics, such as the range of conflict parties and the evolution of their political positions, the distinction between relevant and less relevant actors in peace-making, or the identification of key conflict issues and their interdependence. International peace efforts appear ill-equipped to successfully address these challenges. While technology is already being experimented with and used in a range of conflict related fields, such as conflict predicting or information gathering, less attention has been given to how technology can contribute to conflict mediation. This case study contributes to emerging research on the use of state-of-the-art machine learning technologies and techniques in conflict mediation processes. Using dialogue transcripts from peace negotiations in Yemen, this study shows how machine-learning can effectively support mediating teams by providing them with tools for knowledge management, extraction and conflict analysis. Apart from illustrating the potential of machine learning tools in conflict mediation, the paper also emphasises the importance of interdisciplinary and participatory, co-creation methodology for the development of context-sensitive and targeted tools and to ensure meaningful and responsible implementation.
△ Less
Submitted 23 July, 2022;
originally announced July 2022.
-
Natural Language Inference with Self-Attention for Veracity Assessment of Pandemic Claims
Authors:
M. Arana-Catania,
Elena Kochkina,
Arkaitz Zubiaga,
Maria Liakata,
Rob Procter,
Yulan He
Abstract:
We present a comprehensive work on automated veracity assessment from dataset creation to developing novel methods based on Natural Language Inference (NLI), focusing on misinformation related to the COVID-19 pandemic. We first describe the construction of the novel PANACEA dataset consisting of heterogeneous claims on COVID-19 and their respective information sources. The dataset construction inc…
▽ More
We present a comprehensive work on automated veracity assessment from dataset creation to developing novel methods based on Natural Language Inference (NLI), focusing on misinformation related to the COVID-19 pandemic. We first describe the construction of the novel PANACEA dataset consisting of heterogeneous claims on COVID-19 and their respective information sources. The dataset construction includes work on retrieval techniques and similarity measurements to ensure a unique set of claims. We then propose novel techniques for automated veracity assessment based on Natural Language Inference including graph convolutional networks and attention based approaches. We have carried out experiments on evidence retrieval and veracity assessment on the dataset using the proposed techniques and found them competitive with SOTA methods, and provided a detailed discussion.
△ Less
Submitted 5 May, 2022;
originally announced May 2022.
-
Evaluating the application of NLP tools in mainstream participatory budgeting processes in Scotland
Authors:
Jonathan Davies,
Miguel Arana-Catania,
Rob Procter,
Felix-Anselm van Lier,
Yulan He
Abstract:
In recent years participatory budgeting (PB) in Scotland has grown from a handful of community-led processes to a movement supported by local and national government. This is epitomized by an agreement between the Scottish Government and the Convention of Scottish Local Authorities (COSLA) that at least 1% of local authority budgets will be subject to PB. This ongoing research paper explores the c…
▽ More
In recent years participatory budgeting (PB) in Scotland has grown from a handful of community-led processes to a movement supported by local and national government. This is epitomized by an agreement between the Scottish Government and the Convention of Scottish Local Authorities (COSLA) that at least 1% of local authority budgets will be subject to PB. This ongoing research paper explores the challenges that emerge from this 'scaling up' or 'mainstreaming' across the 32 local authorities that make up Scotland. The main objective is to evaluate local authority use of the digital platform Consul, which applies Natural Language Processing (NLP) to address these challenges. This project adopts a qualitative longitudinal design with interviews, observations of PB processes, and analysis of the digital platform data. Thematic analysis is employed to capture the major issues and themes which emerge. Longitudinal analysis then explores how these evolve over time. The potential for 32 live study sites provides a unique opportunity to explore discrete political and social contexts which materialize and allow for a deeper dive into the challenges and issues that may exist, something a wider cross-sectional study would miss. Initial results show that issues and challenges which come from scaling up may be tackled using NLP technology which, in a previous controlled use case-based evaluation, has shown to improve the effectiveness of citizen participation.
△ Less
Submitted 23 November, 2021;
originally announced November 2021.
-
Evaluation of Abstractive Summarisation Models with Machine Translation in Deliberative Processes
Authors:
M. Arana-Catania,
Rob Procter,
Yulan He,
Maria Liakata
Abstract:
We present work on summarising deliberative processes for non-English languages. Unlike commonly studied datasets, such as news articles, this deliberation dataset reflects difficulties of combining multiple narratives, mostly of poor grammatical quality, in a single text. We report an extensive evaluation of a wide range of abstractive summarisation models in combination with an off-the-shelf mac…
▽ More
We present work on summarising deliberative processes for non-English languages. Unlike commonly studied datasets, such as news articles, this deliberation dataset reflects difficulties of combining multiple narratives, mostly of poor grammatical quality, in a single text. We report an extensive evaluation of a wide range of abstractive summarisation models in combination with an off-the-shelf machine translation model. Texts are translated into English, summarised, and translated back to the original language. We obtain promising results regarding the fluency, consistency and relevance of the summaries produced. Our approach is easy to implement for many languages for production purposes by simply changing the translation model.
△ Less
Submitted 12 October, 2021;
originally announced October 2021.
-
A mixed-methods ethnographic approach to participatory budgeting in Scotland
Authors:
Jonathan Davies,
M. Arana-Catania,
Rob Procter,
F. A. Van Lier,
Yulan He
Abstract:
Participatory budgeting (PB) is already well established in Scotland in the form of community led grant-making yet has recently transformed from a grass-roots activity to a mainstream process or embedded 'policy instrument'. An integral part of this turn is the use of the Consul digital platform as the primary means of citizen participation. Using a mixed method approach, this ongoing research pap…
▽ More
Participatory budgeting (PB) is already well established in Scotland in the form of community led grant-making yet has recently transformed from a grass-roots activity to a mainstream process or embedded 'policy instrument'. An integral part of this turn is the use of the Consul digital platform as the primary means of citizen participation. Using a mixed method approach, this ongoing research paper explores how each of the 32 local authorities that make up Scotland utilise the Consul platform to engage their citizens in the PB process and how they then make sense of citizens' contributions. In particular, we focus on whether natural language processing (NLP) tools can facilitate both citizen engagement, and the processes by which citizens' contributions are analysed and translated into policies.
△ Less
Submitted 20 September, 2021;
originally announced September 2021.
-
Machine Learning for Mediation in Armed Conflicts
Authors:
M. Arana-Catania,
F. A. Van Lier,
Rob Procter
Abstract:
Today's conflicts are becoming increasingly complex, fluid and fragmented, often involving a host of national and international actors with multiple and often divergent interests. This development poses significant challenges for conflict mediation, as mediators struggle to make sense of conflict dynamics, such as the range of conflict parties and the evolution of their political positions, the di…
▽ More
Today's conflicts are becoming increasingly complex, fluid and fragmented, often involving a host of national and international actors with multiple and often divergent interests. This development poses significant challenges for conflict mediation, as mediators struggle to make sense of conflict dynamics, such as the range of conflict parties and the evolution of their political positions, the distinction between relevant and less relevant actors in peace making, or the identification of key conflict issues and their interdependence. International peace efforts appear increasingly ill-equipped to successfully address these challenges. While technology is being increasingly used in a range of conflict related fields, such as conflict predicting or information gathering, less attention has been given to how technology can contribute to conflict mediation. This case study is the first to apply state-of-the-art machine learning technologies to data from an ongoing mediation process. Using dialogue transcripts from peace negotiations in Yemen, this study shows how machine-learning tools can effectively support international mediators by managing knowledge and offering additional conflict analysis tools to assess complex information. Apart from illustrating the potential of machine learning tools in conflict mediation, the paper also emphasises the importance of interdisciplinary and participatory research design for the development of context-sensitive and targeted tools and to ensure meaningful and responsible implementation.
△ Less
Submitted 26 August, 2021;
originally announced August 2021.
-
Citizen Participation and Machine Learning for a Better Democracy
Authors:
M. Arana-Catania,
F. A. Van Lier,
Rob Procter,
Nataliya Tkachenko,
Yulan He,
Arkaitz Zubiaga,
Maria Liakata
Abstract:
The development of democratic systems is a crucial task as confirmed by its selection as one of the Millennium Sustainable Development Goals by the United Nations. In this article, we report on the progress of a project that aims to address barriers, one of which is information overload, to achieving effective direct citizen participation in democratic decision-making processes. The main objective…
▽ More
The development of democratic systems is a crucial task as confirmed by its selection as one of the Millennium Sustainable Development Goals by the United Nations. In this article, we report on the progress of a project that aims to address barriers, one of which is information overload, to achieving effective direct citizen participation in democratic decision-making processes. The main objectives are to explore if the application of Natural Language Processing (NLP) and machine learning can improve citizens' experience of digital citizen participation platforms. Taking as a case study the "Decide Madrid" Consul platform, which enables citizens to post proposals for policies they would like to see adopted by the city council, we used NLP and machine learning to provide new ways to (a) suggest to citizens proposals they might wish to support; (b) group citizens by interests so that they can more easily interact with each other; (c) summarise comments posted in response to proposals; (d) assist citizens in aggregating and developing proposals. Evaluation of the results confirms that NLP and machine learning have a role to play in addressing some of the barriers users of platforms such as Consul currently experience.
△ Less
Submitted 28 February, 2021;
originally announced March 2021.
-
Updated Constraints on General Squark Flavor Mixing
Authors:
M. Arana-Catania,
S. Heinemeyer,
M. J. Herrero
Abstract:
We explore the phenomenological implications on non-minimal flavor violating (NMFV) processes from squark flavor mixing within the Minimal Supersymmetric Standard Model. We work under the model-independent hypothesis of general flavor mixing in the squark sector, being parametrized by a complete set of dimensionless delta^AB_ij (A,B = L, R; i,j = u, c, t or d, s, b) parameters. The present upper b…
▽ More
We explore the phenomenological implications on non-minimal flavor violating (NMFV) processes from squark flavor mixing within the Minimal Supersymmetric Standard Model. We work under the model-independent hypothesis of general flavor mixing in the squark sector, being parametrized by a complete set of dimensionless delta^AB_ij (A,B = L, R; i,j = u, c, t or d, s, b) parameters. The present upper bounds on the most relevant NMFV processes, together with the requirement of compatibility in the choice of the MSSM parameters with the recent LHC and g-2 data, lead to updated constraints on all squark flavor mixing parameters.
△ Less
Submitted 27 May, 2014;
originally announced May 2014.
-
The flavour of supersymmetry: Phenomenological implications of sfermion mixing
Authors:
M. Arana-Catania
Abstract:
We study the phenomenological implications of sfermion flavour mixing in supersymmetry in the context of Non-Minimal Flavour Violation (NMFV). We study the general flavour mixing hypothesis, parametrizing the squark and slepton mass matrices by a complete set of delta^XY_ij (X,Y=L,R; i,j= t,c,u or b,s,d for squarks/1,2,3 for sleptons). With respect to the squark sector, we study the behaviour of t…
▽ More
We study the phenomenological implications of sfermion flavour mixing in supersymmetry in the context of Non-Minimal Flavour Violation (NMFV). We study the general flavour mixing hypothesis, parametrizing the squark and slepton mass matrices by a complete set of delta^XY_ij (X,Y=L,R; i,j= t,c,u or b,s,d for squarks/1,2,3 for sleptons). With respect to the squark sector, we study the behaviour of the B-physics observables BR(B -> Xs gamma), BR(Bs -> mu+ mu-) and delta M_B_s and update the constraints to the delta parameters coming from them. We present one-loop corrections to the Higgs boson masses in the MSSM with NMFV in the squark sector, and taking into account the previous constraints we evaluate them, finding sizable corrections, exceeding sometimes tens of GeV for the light Higgs boson. These corrections might be used to set further constraints on the delta parameters from the Higgs boson mass measurement. With respect to the slepton sector, we explore the implications on charged lepton flavour violating (LFV) processes. The present upper bounds on the most relevant LFV processes and the recent LHC and (g-2)_mu data lead to updated constraints on all slepton flavour mixing parameters. We also study the LFV Higgs decays h,H, A -> tau mu considering the relevant types of slepton mixing (LL23, LR23, RL23, RR23) in the context of a heavy SUSY with a scale into the multi-TeV range. These observables present a non-decoupling behaviour with mSUSY, and are shown here to remain constant as mSUSY grows, for large mSUSY> 2 TeV values and for all the mixings considered. We show that all the three channels could be measurable at the LHC even in these heavy SUSY scenarios, being h -> tau mu the most promising one, with up to about hundred of events expected with the current LHC centre-of-mass energy and luminosity. The most promising predictions for the future LHC stage are also included.
△ Less
Submitted 17 December, 2013;
originally announced December 2013.
-
Non-decoupling SUSY in LFV Higgs decays: a window to new physics at the LHC
Authors:
M. Arana-Catania,
E. Arganda,
M. J. Herrero
Abstract:
The recent discovery of a SM-like Higgs boson at the LHC, with a mass around 125-126 GeV, together with the absence of results in the direct searches for supersymmetry, is pushing the SUSY scale ($m_\text{SUSY}$) into the multi-TeV range. This discouraging situation from a low-energy SUSY point of view has its counterpart in indirect SUSY observables which present a non-decoupling behavior with…
▽ More
The recent discovery of a SM-like Higgs boson at the LHC, with a mass around 125-126 GeV, together with the absence of results in the direct searches for supersymmetry, is pushing the SUSY scale ($m_\text{SUSY}$) into the multi-TeV range. This discouraging situation from a low-energy SUSY point of view has its counterpart in indirect SUSY observables which present a non-decoupling behavior with $m_\text{SUSY}$. This is the case of the one-loop lepton flavor violating Higgs decay rates induced by SUSY, which are shown here to remain constant as $m_\text{SUSY}$ grows, for large $m_\text{SUSY} >$ 2 TeV values and for all classes of intergenerational mixing in the slepton sector, $LL$, $LR$, $RL$ and $RR$. In this work we focus on the LFV decays of the three neutral MSSM Higgs bosons $h$, $H$, $A \to τμ$, considering the four types of slepton mixing ($δ_{23}^{LL}$, $δ_{23}^{LR}$, $δ_{23}^{RL}$, $δ_{23}^{RR}$), and show that all the three channels could be measurable at the LHC, being $h \to τμ$ the most promising one, with up to about hundred of events expected with the current LHC center-of-mass energy and luminosity. The most promising predictions for the future LHC stage are also included.
△ Less
Submitted 8 October, 2015; v1 submitted 11 April, 2013;
originally announced April 2013.
-
New Constraints on General Slepton Flavor Mixing
Authors:
M. Arana-Catania,
S. Heinemeyer,
M. J. Herrero
Abstract:
We explore the phenomenological implications on charged lepton flavor violating (LFV) processes from slepton flavor mixing within the Minimal Supersymmetric Standard Model. We work under the model-independent hypothesis of general flavor mixing in the slepton sector, being parametrized by a complete set of dimensionless delta^AB_ij (A,B = L,R; i,j = 1, 2, 3) parameters. The present upper bounds on…
▽ More
We explore the phenomenological implications on charged lepton flavor violating (LFV) processes from slepton flavor mixing within the Minimal Supersymmetric Standard Model. We work under the model-independent hypothesis of general flavor mixing in the slepton sector, being parametrized by a complete set of dimensionless delta^AB_ij (A,B = L,R; i,j = 1, 2, 3) parameters. The present upper bounds on the most relevant LFV processes, together with the requirement of compatibility in the choice of the MSSM parameters with the recent LHC and (g-2) data, lead to updated constraints on all slepton flavor mixing parameters. A comparative discussion of the most effective LFV processes to constrain the various generation mixings is included.
△ Less
Submitted 3 July, 2013; v1 submitted 9 April, 2013;
originally announced April 2013.
-
The Higgs sector of the NMFV MSSM at the ILC
Authors:
M. Arana-Catania,
S. Heinemeyer,
M. J. Herrero,
S. Penaranda
Abstract:
We calculate the one-loop corrections to the Higgs boson masses within the context of the MSSM with Non-Minimal Flavor Violation in the squark sector. We take into account all the relevant restrictions from BR(B -> X_s gamma), BR(B_s -> mu^+ mu^-) and ΔM_{B_s}. We find sizable corrections to the lightest Higgs boson mass that are considerably larger than the expected ILC precision for acceptable v…
▽ More
We calculate the one-loop corrections to the Higgs boson masses within the context of the MSSM with Non-Minimal Flavor Violation in the squark sector. We take into account all the relevant restrictions from BR(B -> X_s gamma), BR(B_s -> mu^+ mu^-) and ΔM_{B_s}. We find sizable corrections to the lightest Higgs boson mass that are considerably larger than the expected ILC precision for acceptable values of the mixing parameters deltas. We find delta^{LR}_{ct} and delta^{RL}_{ct} specially relevant, mainly at low tan beta.
△ Less
Submitted 30 January, 2012;
originally announced January 2012.
-
Higgs Boson masses and B-Physics Constraints in Non-Minimal Flavor Violating SUSY scenarios
Authors:
M. Arana-Catania,
S. Heinemeyer,
M. J. Herrero,
S. Penaranda
Abstract:
We present one-loop corrections to the Higgs boson masses in the MSSM with Non-Minimal Flavor Violation. The flavor violation is generated from the hypothesis of general flavor mixing in the squark mass matrices, and these are parameterized by a complete set of delta^XY_ij (X, Y = L,R; i; j = t, c, u or b, s, d). We calculate the corrections to the Higgs masses in terms of these delta^XY_ij taking…
▽ More
We present one-loop corrections to the Higgs boson masses in the MSSM with Non-Minimal Flavor Violation. The flavor violation is generated from the hypothesis of general flavor mixing in the squark mass matrices, and these are parameterized by a complete set of delta^XY_ij (X, Y = L,R; i; j = t, c, u or b, s, d). We calculate the corrections to the Higgs masses in terms of these delta^XY_ij taking into account all relevant restrictions from B-physics data. This includes constraints from BR(B -> Xs gamma), BR(Bs -> mu+ mu-) and delta M_B_s . After taking into account these constraints we find sizable corrections to the Higgs boson masses, in the case of the lightest MSSM Higgs boson mass exceeding tens of GeV. These corrections are found mainly for the low tan beta case. In the case of a Higgs boson mass measurement these corrections might be used to set further constraints on delta^XY_ij.
△ Less
Submitted 9 April, 2012; v1 submitted 26 September, 2011;
originally announced September 2011.