
Parameter Estimation using Reinforcement Learning Causal Curiosity: Limits and Challenges

Authors: Miguel Arana-Catania, Weisi Guo

Abstract: Causal understanding is important in many disciplines of science and engineering, where we seek to understand how different factors in the system causally affect an experiment or situation and pave a pathway towards creating effective or optimising existing models. Examples of use cases are autonomous exploration and modelling of unknown environments or assessing key variables in optimising large complex systems. In this paper, we analyse a Reinforcement Learning approach called Causal Curiosity, which aims to estimate as accurately and efficiently as possible, without directly measuring them, the value of factors that causally determine the dynamics of a system. Whilst the idea presents a pathway forward, measurement accuracy is the foundation of methodology effectiveness. Focusing on the current causal curiosity's robotic manipulator, we present for the first time a measurement accuracy analysis of the future potentials and current limitations of this technique and an analysis of its sensitivity and confounding factor disentanglement capability - crucial for causal analysis. As a result of our work, we promote proposals for an improved and efficient design of Causal Curiosity methods to be applied to real-world complex scenarios.

Submitted 13 May, 2025; originally announced May 2025.

Comments: 24 pages, 10 figures, 9 tables

arXiv:2505.02920 [pdf, ps, other]

Probing the co-evolution of SMBHs and their hosts from scaling relations pairwise residuals: dominance of stellar velocity dispersion and host halo mass

Authors: Francesco Shankar, Mariangela Bernardi, Daniel Roberts, Miguel Arana-Catania, Tobias Grubenmann, Melanie Habouzit, Amy Smith, Christopher Marsden, Karthik Mahesh Varadarajan, Alba Vega Alonso Tetilla, Daniel Anglés-Alcázar, Lumen Boco, Duncan Farrah, Hao Fu, Henryk Haniewicz, Andrea Lapi, Christopher C. Lovell, Nicola Menci, Meredith Powell, Federica Ricci

Abstract: The correlations between Supermassive Black Holes (SMBHs) and their host galaxies still defy our understanding from both the observational and theoretical perspectives. Here we perform pairwise residual analysis on the latest sample of local inactive galaxies with a uniform calibration of their photometric properties and with dynamically measured masses of their central SMBHs. The residuals reveal that stellar velocity dispersion $\sigma$ and, possibly, host dark matter halo mass $M_{\rm halo}$, appear as the galactic properties most correlated with SMBH mass, with a secondary (weaker) correlation with spheroidal (bulge) mass $M_{\rm sph}$, as also corroborated by additional Machine Learning tests. These findings may favour energetic/kinetic feedback from Active Galactic Nuclei (AGN) as the main driver in shaping SMBH scaling relations. Two state-of-the-art hydrodynamic simulations, inclusive of kinetic AGN feedback, are able to broadly capture the mean trends observed in the residuals, although they tend to either favour $M_{\rm sph}$ as the most fundamental property, or generate too flat residuals. Increasing AGN feedback kinetic output does not improve the comparison with the data. In the Appendix we also show that the galaxies with dynamically measured SMBHs are biased high in $\sigma$ at fixed luminosity with respect to the full sample of local galaxies, proving that this bias is not a byproduct of stellar mass discrepancies. Overall, our results suggest that probing the SMBH-galaxy scaling relations in terms of total stellar mass alone may induce biases, and that either current data sets are incomplete, and/or that more insightful modelling is required to fully reproduce observations.

Submitted 5 May, 2025; originally announced May 2025.

Comments: MNRAS, accepted, 25 pages, 13 Figures, 3 Appendices

arXiv:2501.14824 [pdf]

A causal learning approach to in-orbit inertial parameter estimation for multi-payload deployers

Authors: Konstantinos Platanitis, Miguel Arana-Catania, Saurabh Upadhyay, Leonard Felicetti

Abstract: This paper discusses an approach to inertial parameter estimation for the case of cargo carrying spacecraft that is based on causal learning, i.e. learning from the responses of the spacecraft, under actuation. Different spacecraft configurations (inertial parameter sets) are simulated under different actuation profiles, in order to produce an optimised time-series clustering classifier that can be used to distinguish between them. The actuation is comprised of finite sequences of constant inputs that are applied in order, based on typical actuators available. By learning from the system's responses across multiple input sequences, and then applying measures of time-series similarity and F1-score, an optimal actuation sequence can be chosen either for one specific system configuration or for the overall set of possible configurations. This allows for both estimation of the inertial parameter set without any prior knowledge of state, as well as validation of transitions between different configurations after a deployment event. The optimisation of the actuation sequence is handled by a reinforcement learning model that uses the proximal policy optimisation (PPO) algorithm, by repeatedly trying different sequences and evaluating the impact on classifier performance according to a multi-objective metric.

Submitted 21 January, 2025; originally announced January 2025.

Comments: 10 pages, 18 figures, 1 table. Presented in 75th International Astronautical Congress (IAC), Milan, Italy, 14-18 October 2024

arXiv:2412.08578 [pdf]

Machine Learning Information Retrieval and Summarisation to Support Systematic Review on Outcomes Based Contracting

Authors: Iman Munire Bilal, Zheng Fang, Miguel Arana-Catania, Felix-Anselm van Lier, Juliana Outes Velarde, Harry Bregazzi, Eleanor Carter, Mara Airoldi, Rob Procter

Abstract: As academic literature proliferates, traditional review methods are increasingly challenged by the sheer volume and diversity of available research. This article presents a study that aims to address these challenges by enhancing the efficiency and scope of systematic reviews in the social sciences through advanced machine learning (ML) and natural language processing (NLP) tools. In particular, we focus on automating stages within the systematic reviewing process that are time-intensive and repetitive for human annotators and which lend themselves to immediate scalability through tools such as information retrieval and summarisation guided by expert advice. The article concludes with a summary of lessons learnt regarding the integrated approach towards systematic reviews and future directions for improvement, including explainability.

Submitted 11 December, 2024; originally announced December 2024.

arXiv:2411.09844 [pdf, other]

Deep Autoencoders for Unsupervised Anomaly Detection in Wildfire Prediction

Authors: İrem Üstek, Miguel Arana-Catania, Alexander Farr, Ivan Petrunin

Abstract: Wildfires pose a significantly increasing hazard to global ecosystems due to the climate crisis. Due to their complex nature, there is an urgent need for innovative approaches to wildfire prediction, such as machine learning. This research took a unique approach, differentiating from classical supervised learning, and addressed the gap in unsupervised wildfire prediction using autoencoders and clustering techniques for anomaly detection. Historical weather and normalised difference vegetation index datasets of Australia for 2005 - 2021 were utilised. Two main unsupervised approaches were analysed. The first used a deep autoencoder to obtain latent features, which were then fed into clustering models, isolation forest, local outlier factor and one-class SVM for anomaly detection. The second approach used a deep autoencoder to reconstruct the input data and use reconstruction errors to identify anomalies. Long Short-Term Memory (LSTM) autoencoders and fully connected (FC) autoencoders were employed in this part, both in an unsupervised way learning only from nominal data. The FC autoencoder outperformed its counterparts, achieving an accuracy of 0.71, an F1-score of 0.74, and an MCC of 0.42. These findings highlight the practicality of this method, as it effectively predicts wildfires in the absence of ground truth, utilising an unsupervised learning technique.

Submitted 14 November, 2024; originally announced November 2024.

Comments: 33 pages, 18 figures, 16 tables. To appear in Earth and Space Science

arXiv:2409.13423 [pdf]

Causal Reinforcement Learning for Optimisation of Robot Dynamics in Unknown Environments

Authors: Julian Gerald Dcruz, Sam Mahoney, Jia Yun Chua, Adoundeth Soukhabandith, John Mugabe, Weisi Guo, Miguel Arana-Catania

Abstract: Autonomous operations of robots in unknown environments are challenging due to the lack of knowledge of the dynamics of the interactions, such as the objects' movability. This work introduces a novel Causal Reinforcement Learning approach to enhancing robotics operations and applies it to an urban search and rescue (SAR) scenario. Our proposed machine learning architecture enables robots to learn the causal relationships between the visual characteristics of the objects, such as texture and shape, and the objects' dynamics upon interaction, such as their movability, significantly improving their decision-making processes. We conducted causal discovery and RL experiments demonstrating the Causal RL's superior performance, showing a notable reduction in learning times by over 24.5% in complex situations, compared to non-causal models.

Submitted 20 September, 2024; originally announced September 2024.

Comments: 6 pages, 12 figures, 3 tables. To be presented in 10th IEEE International Smart Cities Conference (ISC2-2024)

arXiv:2408.03445 [pdf, other]

Spacecraft inertial parameters estimation using time series clustering and reinforcement learning

Authors: Konstantinos Platanitis, Miguel Arana-Catania, Leonardo Capicchiano, Saurabh Upadhyay, Leonard Felicetti

Abstract: This paper presents a machine learning approach to estimate the inertial parameters of a spacecraft in cases when those change during operations, e.g. multiple deployments of payloads, unfolding of appendages and booms, propellant consumption as well as during in-orbit servicing and active debris removal operations. The machine learning approach uses time series clustering together with an optimised actuation sequence generated by reinforcement learning to facilitate distinguishing among different inertial parameter sets. The performance of the proposed strategy is assessed against the case of a multi-satellite deployment system showing that the algorithm is resilient towards common disturbances in such kinds of operations.

Submitted 6 August, 2024; originally announced August 2024.

Comments: 6 pages, 3 figures, 1 table. To be presented in ESA - AI for Space (SPAICE)

arXiv:2407.01154 [pdf, other]

Wind Estimation in Unmanned Aerial Vehicles with Causal Machine Learning

Authors: Abdulaziz Alwalan, Miguel Arana-Catania

Abstract: In this work we demonstrate the possibility of estimating the wind environment of a UAV without specialised sensors, using only the UAV's trajectory, applying a causal machine learning approach. We implement the causal curiosity method which combines machine learning time series classification and clustering with a causal framework. We analyse three distinct wind environments: constant wind, shear wind, and turbulence, and explore different optimisation strategies for optimal UAV manoeuvres to estimate the wind conditions. The proposed approach can be used to design optimal trajectories in challenging weather conditions, and to avoid specialised sensors that add to the UAV's weight and compromise its functionality.

Submitted 1 July, 2024; originally announced July 2024.

Comments: 11 pages, 9 figures, 10 tables. To be presented in 15th International Conference on Mechanical and Aerospace Engineering (ICMAE)

arXiv:2406.16527 [pdf]

SyROCCo: Enhancing Systematic Reviews using Machine Learning

Authors: Zheng Fang, Miguel Arana-Catania, Felix-Anselm van Lier, Juliana Outes Velarde, Harry Bregazzi, Mara Airoldi, Eleanor Carter, Rob Procter

Abstract: The sheer number of research outputs published every year makes systematic reviewing increasingly time- and resource-intensive. This paper explores the use of machine learning techniques to help navigate the systematic review process. ML has previously been used to reliably 'screen' articles for review - that is, identify relevant articles based on reviewers' inclusion criteria. The application of ML techniques to subsequent stages of a review, however, such as data extraction and evidence mapping, is in its infancy. We therefore set out to develop a series of tools that would assist in the profiling and analysis of 1,952 publications on the theme of 'outcomes-based contracting'. Tools were developed for the following tasks: assign publications into 'policy area' categories; identify and extract key information for evidence mapping, such as organisations, laws, and geographical information; connect the evidence base to an existing dataset on the same topic; and identify subgroups of articles that may share thematic content. An interactive tool using these techniques and a public dataset with their outputs have been released. Our results demonstrate the utility of ML techniques to enhance evidence accessibility and analysis within the systematic review processes. These efforts show promise in potentially yielding substantial efficiencies for future systematic reviewing and for broadening their analytical scope. Our work suggests that there may be implications for the ease with which policymakers and practitioners can access evidence. While ML techniques seem poised to play a significant role in bridging the gap between research and policy by offering innovative ways of gathering, accessing, and analysing data from systematic reviews, we also highlight their current limitations and the need to exercise caution in their application, particularly given the potential for errors and biases.

Submitted 24 June, 2024; originally announced June 2024.

Comments: 28 pages, 5 figures. To appear in Data & Policy journal

arXiv:2403.00470 [pdf, other]

Autonomous Robotic Arm Manipulation for Planetary Missions using Causal Machine Learning

Authors: C. McDonnell, M. Arana-Catania, S. Upadhyay

Abstract: Autonomous robotic arm manipulators have the potential to make planetary exploration and in-situ resource utilization missions more time efficient and productive, as the manipulator can handle the objects itself and perform goal-specific actions. We train a manipulator to autonomously study objects of which it has no prior knowledge, such as planetary rocks. This is achieved using causal machine learning in a simulated planetary environment. Here, the manipulator interacts with objects, and classifies them based on differing causal factors. These are parameters, such as mass or friction coefficient, that causally determine the outcomes of its interactions. Through reinforcement learning, the manipulator learns to interact in ways that reveal the underlying causal factors. We show that this method works even without any prior knowledge of the objects, or any previously-collected training data. We carry out the training in planetary exploration conditions, with realistic manipulator models.

Submitted 1 March, 2024; originally announced March 2024.

Comments: 8 pages, ASTRA 2023: 17th Symposium on Advanced Space Technologies in Robotics and Automation, 18-20 October 2023, Leiden, The Netherlands

arXiv:2402.07804 [pdf, other]

doi 10.1002/maco.202314240

Causal Discovery to Understand Hot Corrosion

Authors: A. Varghese, M. Arana-Catania, S. Mori, A. Encinas-Oropesa, J. Sumner

Abstract: Gas turbine superalloys experience hot corrosion, driven by factors including corrosive deposit flux, temperature, gas composition, and component material. The full mechanism still needs clarification and research often focuses on laboratory work. As such, there is interest in causal discovery to confirm the significance of factors and identify potential missing causal relationships or co-dependencies between these factors. The causal discovery algorithm Fast Causal Inference (FCI) has been trialled on a small set of laboratory data, with the outputs evaluated for their significance to corrosion propagation, and compared to existing mechanistic understanding. FCI identified the salt deposition flux as the most influential corrosion variable for this limited dataset. However, HCl was the second most influential for pitting regions, compared to temperature for more uniformly corroding regions. Thus FCI generated causal links aligned with literature from a randomised corrosion dataset, while also identifying the presence of two different degradation modes in operation.

Submitted 12 February, 2024; originally announced February 2024.

Comments: 12 pages, published in Materials and Corrosion

arXiv:2305.02224 [pdf, ps, other]

Some Observations on Fact-Checking Work with Implications for Computational Support

Authors: Rob Procter, Miguel Arana-Catania, Yulan He, Maria Liakata, Arkaitz Zubiaga, Elena Kochkina, Runcong Zhao

Abstract: Social media and user-generated content (UGC) have become increasingly important features of journalistic work in a number of different ways. However, the growth of misinformation means that news organisations have had to devote more and more resources to determining its veracity and to publishing corrections if it is found to be misleading. In this work, we present the results of interviews with eight members of fact-checking teams from two organisations. Team members described their fact-checking processes and the challenges they currently face in completing a fact-check in a robust and timely way. The former reveals, inter alia, significant differences in fact-checking practices and the role played by collaboration between team members. We conclude with a discussion of the implications for the development and application of computational tools, including where computational tool support is currently lacking and the importance of being able to accommodate different fact-checking practices.

Submitted 6 July, 2023; v1 submitted 3 May, 2023; originally announced May 2023.

Comments: 11 pages. International AAAI Conference on Web and Social Media, Mediate 2023: News Media and Computational Journalism Workshop

ACM Class: H.1.2; H.5.2

arXiv:2303.01241 [pdf, other]

PANACEA: An Automated Misinformation Detection System on COVID-19

Authors: Runcong Zhao, Miguel Arana-Catania, Lixing Zhu, Elena Kochkina, Lin Gui, Arkaitz Zubiaga, Rob Procter, Maria Liakata, Yulan He

Abstract: In this demo, we introduce a web-based misinformation detection system PANACEA on COVID-19 related claims, which has two modules, fact-checking and rumour detection. Our fact-checking module, which is supported by novel natural language inference methods with a self-attention network, outperforms state-of-the-art approaches. It is also able to give automated veracity assessment and ranked supporting evidence with the stance towards the claim to be checked. In addition, PANACEA adapts the bi-directional graph convolutional networks model, which is able to detect rumours based on comment networks of related tweets, instead of relying on the knowledge base. This rumour detection module assists by warning the users in the early stages when a knowledge base may not be available.

Submitted 28 February, 2023; originally announced March 2023.

arXiv:2209.12598 [pdf]

Embedding digital participatory budgeting within local government: motivations, strategies and barriers faced

Authors: Jonathan Davies, Miguel Arana-Catania, Rob Procter

Abstract: The challenging task of embedding innovative participatory processes and technologies within local government often falls upon local council officers. Using qualitative data collection and analysis, we investigate the ongoing work of Scottish local councils seeking to run the process of participatory budgeting (PB) within their institution, the use of digital platforms to support this and the challenges faced. In doing so this paper draws on empirical material to support the growing discussion on the dynamics or forces behind embedding. Our analysis shows that formal agreement alone does not make the process a certainty. Local council officers must work as mediators in the transitional space between representative structures and new, innovative ways of working, unsettling the entrenched power dynamics. To do so they must be well trained and well resourced, including the ability to use digital platforms effectively as part of the process. This provides the necessary, accessible, transparent and deliberative space for participation.

Submitted 26 September, 2022; originally announced September 2022.

Comments: 12 pages, presented at the 15th International Conference on Theory and Practice of Electronic Governance 2022

arXiv:2207.13970 [pdf, other]

PHEMEPlus: Enriching Social Media Rumour Verification with External Evidence

Authors: John Dougrez-Lewis, Elena Kochkina, M. Arana-Catania, Maria Liakata, Yulan He

Abstract: Work on social media rumour verification utilises signals from posts, their propagation and users involved. Other lines of work target identifying and fact-checking claims based on information from Wikipedia, or trustworthy news articles without considering social media context. However, works combining the information from social media with external evidence from the wider web are lacking. To facilitate research in this direction, we release a novel dataset, PHEMEPlus, an extension of the PHEME benchmark, which contains social media conversations as well as relevant external evidence for each rumour. We demonstrate the effectiveness of incorporating such evidence in improving rumour verification models. Additionally, as part of the evidence collection, we evaluate various ways of query formulation to identify the most effective method.

Submitted 28 July, 2022; originally announced July 2022.

Comments: 10 pages, 1 figure, 5 tables, presented in the Fifth Fact Extraction and VERification Workshop (FEVER), 2022

arXiv:2207.11528 [pdf]

Supporting peace negotiations in the Yemen war through machine learning

Authors: M. Arana-Catania, F. A. Van Lier, Rob Procter

Abstract: Today's conflicts are becoming increasingly complex, fluid and fragmented, often involving a host of national and international actors with multiple and often divergent interests. This development poses significant challenges for conflict mediation, as mediators struggle to make sense of conflict dynamics, such as the range of conflict parties and the evolution of their political positions, the distinction between relevant and less relevant actors in peace-making, or the identification of key conflict issues and their interdependence. International peace efforts appear ill-equipped to successfully address these challenges. While technology is already being experimented with and used in a range of conflict related fields, such as conflict prediction or information gathering, less attention has been given to how technology can contribute to conflict mediation. This case study contributes to emerging research on the use of state-of-the-art machine learning technologies and techniques in conflict mediation processes. Using dialogue transcripts from peace negotiations in Yemen, this study shows how machine-learning can effectively support mediating teams by providing them with tools for knowledge management, extraction and conflict analysis. Apart from illustrating the potential of machine learning tools in conflict mediation, the paper also emphasises the importance of interdisciplinary and participatory, co-creation methodology for the development of context-sensitive and targeted tools and to ensure meaningful and responsible implementation.

Submitted 23 July, 2022; originally announced July 2022.

Comments: 28 pages, 16 figures, 2 tables. An earlier version of this paper was presented at the Data for Policy Conference, September, 2021. Current version to appear in Data & Policy journal

arXiv:2205.02596 [pdf, other]

Natural Language Inference with Self-Attention for Veracity Assessment of Pandemic Claims

Authors: M. Arana-Catania, Elena Kochkina, Arkaitz Zubiaga, Maria Liakata, Rob Procter, Yulan He

Abstract: We present a comprehensive work on automated veracity assessment from dataset creation to developing novel methods based on Natural Language Inference (NLI), focusing on misinformation related to the COVID-19 pandemic. We first describe the construction of the novel PANACEA dataset consisting of heterogeneous claims on COVID-19 and their respective information sources. The dataset construction includes work on retrieval techniques and similarity measurements to ensure a unique set of claims. We then propose novel techniques for automated veracity assessment based on Natural Language Inference including graph convolutional networks and attention based approaches. We have carried out experiments on evidence retrieval and veracity assessment on the dataset using the proposed techniques and found them competitive with SOTA methods, and provided a detailed discussion.

Submitted 5 May, 2022; originally announced May 2022.

Comments: 16 pages, 1 figure, 8 tables, presented in NAACL 2022

arXiv:2111.11766 [pdf]

Evaluating the application of NLP tools in mainstream participatory budgeting processes in Scotland

Authors: Jonathan Davies, Miguel Arana-Catania, Rob Procter, Felix-Anselm van Lier, Yulan He

Abstract: In recent years participatory budgeting (PB) in Scotland has grown from a handful of community-led processes to a movement supported by local and national government. This is epitomized by an agreement between the Scottish Government and the Convention of Scottish Local Authorities (COSLA) that at least 1% of local authority budgets will be subject to PB. This ongoing research paper explores the challenges that emerge from this 'scaling up' or 'mainstreaming' across the 32 local authorities that make up Scotland. The main objective is to evaluate local authority use of the digital platform Consul, which applies Natural Language Processing (NLP) to address these challenges. This project adopts a qualitative longitudinal design with interviews, observations of PB processes, and analysis of the digital platform data. Thematic analysis is employed to capture the major issues and themes which emerge. Longitudinal analysis then explores how these evolve over time. The potential for 32 live study sites provides a unique opportunity to explore discrete political and social contexts which materialize and allow for a deeper dive into the challenges and issues that may exist, something a wider cross-sectional study would miss. Initial results show that issues and challenges which come from scaling up may be tackled using NLP technology which, in a previous controlled use case-based evaluation, has been shown to improve the effectiveness of citizen participation.

Submitted 23 November, 2021; originally announced November 2021.

Comments: 7 pages, presented at the 14th International Conference on Theory and Practice of Electronic Governance 2021. arXiv admin note: text overlap with arXiv:2109.09517

arXiv:2110.05847 [pdf, other]

Evaluation of Abstractive Summarisation Models with Machine Translation in Deliberative Processes

Authors: M. Arana-Catania, Rob Procter, Yulan He, Maria Liakata

Abstract: We present work on summarising deliberative processes for non-English languages. Unlike commonly studied datasets, such as news articles, this deliberation dataset reflects difficulties of combining multiple narratives, mostly of poor grammatical quality, in a single text. We report an extensive evaluation of a wide range of abstractive summarisation models in combination with an off-the-shelf machine translation model. Texts are translated into English, summarised, and translated back to the original language. We obtain promising results regarding the fluency, consistency and relevance of the summaries produced. Our approach is easy to implement for many languages for production purposes by simply changing the translation model.

Submitted 12 October, 2021; originally announced October 2021.

Comments: 8 pages, presented in EMNLP 2021 - New Frontiers in Summarization Workshop

arXiv:2109.09517 [pdf]

doi 10.1145/3462203.3475891

A mixed-methods ethnographic approach to participatory budgeting in Scotland

Authors: Jonathan Davies, M. Arana-Catania, Rob Procter, F. A. Van Lier, Yulan He

Abstract: Participatory budgeting (PB) is already well established in Scotland in the form of community led grant-making yet has recently transformed from a grass-roots activity to a mainstream process or embedded 'policy instrument'. An integral part of this turn is the use of the Consul digital platform as the primary means of citizen participation. Using a mixed method approach, this ongoing research paper explores how each of the 32 local authorities that make up Scotland utilise the Consul platform to engage their citizens in the PB process and how they then make sense of citizens' contributions. In particular, we focus on whether natural language processing (NLP) tools can facilitate both citizen engagement, and the processes by which citizens' contributions are analysed and translated into policies.

Submitted 20 September, 2021; originally announced September 2021.

Comments: 6 pages, presented in GoodIT 2021 Conference

arXiv:2108.11942 [pdf]

Machine Learning for Mediation in Armed Conflicts

Authors: M. Arana-Catania, F. A. Van Lier, Rob Procter

Abstract: Today's conflicts are becoming increasingly complex, fluid and fragmented, often involving a host of national and international actors with multiple and often divergent interests. This development poses significant challenges for conflict mediation, as mediators struggle to make sense of conflict dynamics, such as the range of conflict parties and the evolution of their political positions, the distinction between relevant and less relevant actors in peace making, or the identification of key conflict issues and their interdependence. International peace efforts appear increasingly ill-equipped to successfully address these challenges. While technology is being increasingly used in a range of conflict related fields, such as conflict predicting or information gathering, less attention has been given to how technology can contribute to conflict mediation. This case study is the first to apply state-of-the-art machine learning technologies to data from an ongoing mediation process. Using dialogue transcripts from peace negotiations in Yemen, this study shows how machine-learning tools can effectively support international mediators by managing knowledge and offering additional conflict analysis tools to assess complex information. Apart from illustrating the potential of machine learning tools in conflict mediation, the paper also emphasises the importance of interdisciplinary and participatory research design for the development of context-sensitive and targeted tools and to ensure meaningful and responsible implementation.

Submitted 26 August, 2021; originally announced August 2021.

Comments: 24 pages, 16 figures, 2 tables, to be presented in Data for Policy conference

arXiv:2103.00508 [pdf]

doi 10.1145/3452118

Citizen Participation and Machine Learning for a Better Democracy

Authors: M. Arana-Catania, F. A. Van Lier, Rob Procter, Nataliya Tkachenko, Yulan He, Arkaitz Zubiaga, Maria Liakata

Abstract: The development of democratic systems is a crucial task as confirmed by its selection as one of the Millennium Sustainable Development Goals by the United Nations. In this article, we report on the progress of a project that aims to address barriers, one of which is information overload, to achieving effective direct citizen participation in democratic decision-making processes. The main objectives are to explore if the application of Natural Language Processing (NLP) and machine learning can improve citizens' experience of digital citizen participation platforms. Taking as a case study the "Decide Madrid" Consul platform, which enables citizens to post proposals for policies they would like to see adopted by the city council, we used NLP and machine learning to provide new ways to (a) suggest to citizens proposals they might wish to support; (b) group citizens by interests so that they can more easily interact with each other; (c) summarise comments posted in response to proposals; (d) assist citizens in aggregating and developing proposals. Evaluation of the results confirms that NLP and machine learning have a role to play in addressing some of the barriers users of platforms such as Consul currently experience.

Submitted 28 February, 2021; originally announced March 2021.

Comments: 19 pages, 5 figures, 4 tables, to appear in Digital Government: Research and Practice (DGOV)

arXiv:1405.6960 [pdf, ps, other]

doi 10.1103/PhysRevD.90.075003

Updated Constraints on General Squark Flavor Mixing

Authors: M. Arana-Catania, S. Heinemeyer, M. J. Herrero

Abstract: We explore the phenomenological implications on non-minimal flavor violating (NMFV) processes from squark flavor mixing within the Minimal Supersymmetric Standard Model. We work under the model-independent hypothesis of general flavor mixing in the squark sector, being parametrized by a complete set of dimensionless delta^AB_ij (A,B = L, R; i,j = u, c, t or d, s, b) parameters. The present upper bounds on the most relevant NMFV processes, together with the requirement of compatibility in the choice of the MSSM parameters with the recent LHC and g-2 data, lead to updated constraints on all squark flavor mixing parameters.

Submitted 27 May, 2014; originally announced May 2014.

Comments: 30 pages, 7 figures. arXiv admin note: text overlap with arXiv:1304.2783, arXiv:1109.6232

Report number: IFT-UAM/CSIC-14-045; FTUAM-14-18

Journal ref: Phys. Rev. D 90, 075003 (2014)

arXiv:1312.4888 [pdf, other]

The flavour of supersymmetry: Phenomenological implications of sfermion mixing

Authors: M. Arana-Catania

Abstract: We study the phenomenological implications of sfermion flavour mixing in supersymmetry in the context of Non-Minimal Flavour Violation (NMFV). We study the general flavour mixing hypothesis, parametrizing the squark and slepton mass matrices by a complete set of delta^XY_ij (X,Y=L,R; i,j= t,c,u or b,s,d for squarks/1,2,3 for sleptons). With respect to the squark sector, we study the behaviour of the B-physics observables BR(B -> Xs gamma), BR(Bs -> mu+ mu-) and delta M_B_s and update the constraints to the delta parameters coming from them. We present one-loop corrections to the Higgs boson masses in the MSSM with NMFV in the squark sector, and taking into account the previous constraints we evaluate them, finding sizable corrections, exceeding sometimes tens of GeV for the light Higgs boson. These corrections might be used to set further constraints on the delta parameters from the Higgs boson mass measurement. With respect to the slepton sector, we explore the implications on charged lepton flavour violating (LFV) processes. The present upper bounds on the most relevant LFV processes and the recent LHC and (g-2)_mu data lead to updated constraints on all slepton flavour mixing parameters. We also study the LFV Higgs decays h, H, A -> tau mu considering the relevant types of slepton mixing (LL23, LR23, RL23, RR23) in the context of a heavy SUSY with a scale into the multi-TeV range. These observables present a non-decoupling behaviour with mSUSY, and are shown here to remain constant as mSUSY grows, for large mSUSY > 2 TeV values and for all the mixings considered. We show that all the three channels could be measurable at the LHC even in these heavy SUSY scenarios, being h -> tau mu the most promising one, with up to about a hundred events expected with the current LHC centre-of-mass energy and luminosity. The most promising predictions for the future LHC stage are also included.

Submitted 17 December, 2013; originally announced December 2013.

Comments: PhD. Thesis. UAM (Madrid), December 2013. PhD Advisors: M.J. Herrero and S. Heinemeyer. 230 pages, 71 figures

arXiv:1304.3371 [pdf, other]

doi 10.1007/JHEP09(2013)160

Non-decoupling SUSY in LFV Higgs decays: a window to new physics at the LHC

Authors: M. Arana-Catania, E. Arganda, M. J. Herrero

Abstract: The recent discovery of a SM-like Higgs boson at the LHC, with a mass around 125-126 GeV, together with the absence of results in the direct searches for supersymmetry, is pushing the SUSY scale ($m_\text{SUSY}$) into the multi-TeV range. This discouraging situation from a low-energy SUSY point of view has its counterpart in indirect SUSY observables which present a non-decoupling behavior with $m_\text{SUSY}$. This is the case of the one-loop lepton flavor violating Higgs decay rates induced by SUSY, which are shown here to remain constant as $m_\text{SUSY}$ grows, for large $m_\text{SUSY} >$ 2 TeV values and for all classes of intergenerational mixing in the slepton sector, $LL$, $LR$, $RL$ and $RR$. In this work we focus on the LFV decays of the three neutral MSSM Higgs bosons $h$, $H$, $A \to τμ$, considering the four types of slepton mixing ($δ_{23}^{LL}$, $δ_{23}^{LR}$, $δ_{23}^{RL}$, $δ_{23}^{RR}$), and show that all the three channels could be measurable at the LHC, being $h \to τμ$ the most promising one, with up to about a hundred events expected with the current LHC center-of-mass energy and luminosity. The most promising predictions for the future LHC stage are also included.

Submitted 8 October, 2015; v1 submitted 11 April, 2013; originally announced April 2013.

Comments: 24 pages, 10 figures. v5: includes erratum 4 pages, 5 figures

Report number: IFT-UAM/CSIC-13-024

arXiv:1304.2783 [pdf, ps, other]

doi 10.1103/PhysRevD.88.015026

New Constraints on General Slepton Flavor Mixing

Authors: M. Arana-Catania, S. Heinemeyer, M. J. Herrero

Abstract: We explore the phenomenological implications on charged lepton flavor violating (LFV) processes from slepton flavor mixing within the Minimal Supersymmetric Standard Model. We work under the model-independent hypothesis of general flavor mixing in the slepton sector, being parametrized by a complete set of dimensionless delta^AB_ij (A,B = L,R; i,j = 1, 2, 3) parameters. The present upper bounds on the most relevant LFV processes, together with the requirement of compatibility in the choice of the MSSM parameters with the recent LHC and (g-2) data, lead to updated constraints on all slepton flavor mixing parameters. A comparative discussion of the most effective LFV processes to constrain the various generation mixings is included.

Submitted 3 July, 2013; v1 submitted 9 April, 2013; originally announced April 2013.

Comments: 42 pages, 19 figures. Minor changes, version to appear in PRD

Report number: IFT-UAM/CSIC-13-023

arXiv:1201.6345 [pdf, ps, other]

The Higgs sector of the NMFV MSSM at the ILC

Authors: M. Arana-Catania, S. Heinemeyer, M. J. Herrero, S. Penaranda

Abstract: We calculate the one-loop corrections to the Higgs boson masses within the context of the MSSM with Non-Minimal Flavor Violation in the squark sector. We take into account all the relevant restrictions from BR(B -> X_s gamma), BR(B_s -> mu^+ mu^-) and ΔM_{B_s}. We find sizable corrections to the lightest Higgs boson mass that are considerably larger than the expected ILC precision for acceptable values of the mixing parameters deltas. We find delta^{LR}_{ct} and delta^{RL}_{ct} especially relevant, mainly at low tan beta.

Submitted 30 January, 2012; originally announced January 2012.

Comments: LaTex, 7 pages, 3 figures. The 2011 International Workshop on Future Linear Colliders (LCWS11), Granada, Spain

Report number: IFT-UAM/CSIC-12-10

arXiv:1109.6232 [pdf, ps, other]

doi 10.1007/JHEP05(2012)015

Higgs Boson masses and B-Physics Constraints in Non-Minimal Flavor Violating SUSY scenarios

Authors: M. Arana-Catania, S. Heinemeyer, M. J. Herrero, S. Penaranda

Abstract: We present one-loop corrections to the Higgs boson masses in the MSSM with Non-Minimal Flavor Violation. The flavor violation is generated from the hypothesis of general flavor mixing in the squark mass matrices, and these are parameterized by a complete set of delta^XY_ij (X, Y = L, R; i, j = t, c, u or b, s, d). We calculate the corrections to the Higgs masses in terms of these delta^XY_ij taking into account all relevant restrictions from B-physics data. This includes constraints from BR(B -> Xs gamma), BR(Bs -> mu+ mu-) and delta M_B_s. After taking into account these constraints we find sizable corrections to the Higgs boson masses, in the case of the lightest MSSM Higgs boson mass exceeding tens of GeV. These corrections are found mainly for the low tan beta case. In the case of a Higgs boson mass measurement these corrections might be used to set further constraints on delta^XY_ij.

Submitted 9 April, 2012; v1 submitted 26 September, 2011; originally announced September 2011.

Comments: 58 pages, 15 figures, Minor modifications, version to appear in JHEP

Report number: IFT-UAM/CSIC-11-57

Showing 1–28 of 28 results for author: Arana-Catania, M