-
E-ABIN: an Explainable module for Anomaly detection in BIological Networks
Authors:
Ugo Lomoio,
Tommaso Mazza,
Pierangelo Veltri,
Pietro Hiram Guzzi
Abstract:
The increasing availability of large-scale omics data calls for robust analytical frameworks capable of handling complex gene expression datasets while offering interpretable results. Recent advances in artificial intelligence have enabled the identification of aberrant molecular patterns distinguishing disease states from healthy controls. Coupled with improvements in model interpretability, thes…
▽ More
The increasing availability of large-scale omics data calls for robust analytical frameworks capable of handling complex gene expression datasets while offering interpretable results. Recent advances in artificial intelligence have enabled the identification of aberrant molecular patterns distinguishing disease states from healthy controls. Coupled with improvements in model interpretability, these tools now support the identification of genes potentially driving disease phenotypes. However, current approaches to gene anomaly detection often remain limited to single datasets and lack accessible graphical interfaces. Here, we introduce E-ABIN, a general-purpose, explainable framework for Anomaly detection in Biological Networks. E-ABIN combines classical machine learning and graph-based deep learning techniques within a unified, user-friendly platform, enabling the detection and interpretation of anomalies from gene expression or methylation-derived networks. By integrating algorithms such as Support Vector Machines, Random Forests, Graph Autoencoders (GAEs), and Graph Adversarial Attributed Networks (GAANs), E-ABIN ensures a high predictive accuracy while maintaining interpretability. We demonstrate the utility of E-ABIN through case studies of bladder cancer and coeliac disease, where it effectively uncovers biologically relevant anomalies and offers insights into disease mechanisms.
△ Less
Submitted 25 June, 2025;
originally announced June 2025.
-
ExDiff: A Framework for Simulating Diffusion Processes on Complex Networks with Explainable AI Integration
Authors:
Annamaria Defilippo,
Ugo Lomoio,
Barbara Puccio,
Pierangelo Veltri,
Pietro Hiram Guzzi
Abstract:
Understanding and controlling diffusion processes in complex networks is critical across domains ranging from epidemiology to information science. Here, we present ExDiff, an interactive and modular computational framework that integrates network simulation, graph neural networks (GNNs), and explainable artificial intelligence (XAI) to model and interpret diffusion dynamics. ExDiff combines classi…
▽ More
Understanding and controlling diffusion processes in complex networks is critical across domains ranging from epidemiology to information science. Here, we present ExDiff, an interactive and modular computational framework that integrates network simulation, graph neural networks (GNNs), and explainable artificial intelligence (XAI) to model and interpret diffusion dynamics. ExDiff combines classical compartmental models with deep learning techniques to capture both the structural and temporal characteristics of diffusion across diverse network topologies. The framework features dedicated modules for network analysis, neural modeling, simulation, and interpretability, all accessible via an intuitive interface built on Google Colab. Through a case study of the Susceptible Infectious Recovered Vaccinated Dead (SIRVD) model, we demonstrate the capacity to simulate disease spread, evaluate intervention strategies, classify node states, and reveal the structural determinants of contagion through XAI techniques. By unifying simulation and interpretability, ExDiff provides a powerful, flexible, and accessible platform for studying diffusion phenomena in networked systems, enabling both methodological innovation and practical insight.
△ Less
Submitted 3 June, 2025;
originally announced June 2025.
-
Towards medical AI misalignment: a preliminary study
Authors:
Barbara Puccio,
Federico Castagna,
Allan Tucker,
Pierangelo Veltri
Abstract:
Despite their staggering capabilities as assistant tools, often exceeding human performances, Large Language Models (LLMs) are still prone to jailbreak attempts from malevolent users. Although red teaming practices have already identified and helped to address several such jailbreak techniques, one particular sturdy approach involving role-playing (which we named `Goofy Game') seems effective agai…
▽ More
Despite their staggering capabilities as assistant tools, often exceeding human performances, Large Language Models (LLMs) are still prone to jailbreak attempts from malevolent users. Although red teaming practices have already identified and helped to address several such jailbreak techniques, one particular sturdy approach involving role-playing (which we named `Goofy Game') seems effective against most of the current LLMs safeguards. This can result in the provision of unsafe content, which, although not harmful per se, might lead to dangerous consequences if delivered in a setting such as the medical domain. In this preliminary and exploratory study, we provide an initial analysis of how, even without technical knowledge of the internal architecture and parameters of generative AI models, a malicious user could construct a role-playing prompt capable of coercing an LLM into producing incorrect (and potentially harmful) clinical suggestions. We aim to illustrate a specific vulnerability scenario, providing insights that can support future advancements in the field.
△ Less
Submitted 22 May, 2025;
originally announced May 2025.
-
Decoding Rarity: Large Language Models in the Diagnosis of Rare Diseases
Authors:
Valentina Carbonari,
Pierangelo Veltri,
Pietro Hiram Guzzi
Abstract:
Recent advances in artificial intelligence, particularly large language models LLMs, have shown promising capabilities in transforming rare disease research. This survey paper explores the integration of LLMs in the analysis of rare diseases, highlighting significant strides and pivotal studies that leverage textual data to uncover insights and patterns critical for diagnosis, treatment, and patie…
▽ More
Recent advances in artificial intelligence, particularly large language models LLMs, have shown promising capabilities in transforming rare disease research. This survey paper explores the integration of LLMs in the analysis of rare diseases, highlighting significant strides and pivotal studies that leverage textual data to uncover insights and patterns critical for diagnosis, treatment, and patient care. While current research predominantly employs textual data, the potential for multimodal data integration combining genetic, imaging, and electronic health records stands as a promising frontier. We review foundational papers that demonstrate the application of LLMs in identifying and extracting relevant medical information, simulating intelligent conversational agents for patient interaction, and enabling the formulation of accurate and timely diagnoses. Furthermore, this paper discusses the challenges and ethical considerations inherent in deploying LLMs, including data privacy, model transparency, and the need for robust, inclusive data sets. As part of this exploration, we present a section on experimentation that utilizes multiple LLMs alongside structured questionnaires, specifically designed for diagnostic purposes in the context of different diseases. We conclude with future perspectives on the evolution of LLMs towards truly multimodal platforms, which would integrate diverse data types to provide a more comprehensive understanding of rare diseases, ultimately fostering better outcomes in clinical settings.
△ Less
Submitted 18 May, 2025;
originally announced May 2025.
-
DCAE-SR: Design of a Denoising Convolutional Autoencoder for reconstructing Electrocardiograms signals at Super Resolution
Authors:
Ugo Lomoio,
Pierangelo Veltri,
Pietro Hiram Guzzi,
Pietro Lio'
Abstract:
Electrocardiogram (ECG) signals play a pivotal role in cardiovascular diagnostics, providing essential information on the electrical activity of the heart. However, the inherent noise and limited resolution in ECG recordings can hinder accurate interpretation and diagnosis. In this paper, we propose a novel model for ECG super resolution (SR) that uses a DNAE to enhance temporal and frequency info…
▽ More
Electrocardiogram (ECG) signals play a pivotal role in cardiovascular diagnostics, providing essential information on the electrical activity of the heart. However, the inherent noise and limited resolution in ECG recordings can hinder accurate interpretation and diagnosis. In this paper, we propose a novel model for ECG super resolution (SR) that uses a DNAE to enhance temporal and frequency information inside ECG signals. Our approach addresses the limitations of traditional ECG signal processing techniques. Our model takes in input 5-second length ECG windows sampled at 50 Hz (very low resolution) and it is able to reconstruct a denoised super-resolution signal with an x10 upsampling rate (sampled at 500 Hz). We trained the proposed DCAE-SR on public available myocardial infraction ECG signals. Our method demonstrates superior performance in reconstructing high-resolution ECG signals from very low-resolution signals with a sampling rate of 50 Hz. We compared our results with the current deep-learning literature approaches for ECG super-resolution and some non-deep learning reproducible methods that can perform both super-resolution and denoising. We obtained current state-of-the-art performances in super-resolution of very low resolution ECG signals frequently corrupted by ECG artifacts. We were able to obtain a signal-to-noise ratio of 12.20 dB (outperforms previous 4.68 dB), mean squared error of 0.0044 (outperforms previous 0.0154) and root mean squared error of 4.86% (outperforms previous 12.40%). In conclusion, our DCAE-SR model offers a robust (to artefact presence), versatile and explainable solution to enhance the quality of ECG signals. This advancement holds promise in advancing the field of cardiovascular diagnostics, paving the way for improved patient care and high-quality clinical decisions
△ Less
Submitted 29 March, 2024;
originally announced April 2024.
-
Leveraging graph neural networks for supporting Automatic Triage of Patients
Authors:
Annamaria Defilippo,
Pierangelo Veltri,
Pietro Lio',
Pietro Hiram Guzzi
Abstract:
Patient triage plays a crucial role in emergency departments, ensuring timely and appropriate care based on correctly evaluating the emergency grade of patient conditions.
Triage methods are generally performed by human operator based on her own experience and information that are gathered from the patient management process.
Thus, it is a process that can generate errors in emergency level as…
▽ More
Patient triage plays a crucial role in emergency departments, ensuring timely and appropriate care based on correctly evaluating the emergency grade of patient conditions.
Triage methods are generally performed by human operator based on her own experience and information that are gathered from the patient management process.
Thus, it is a process that can generate errors in emergency level associations. Recently, Traditional triage methods heavily rely on human decisions, which can be subjective and prone to errors.
Recently, a growing interest has been focused on leveraging artificial intelligence (AI) to develop algorithms able to maximize information gathering and minimize errors in patient triage processing.
We define and implement an AI based module to manage patients emergency code assignments in emergency departments. It uses emergency department historical data to train the medical decision process. Data containing relevant patient information, such as vital signs, symptoms, and medical history, are used to accurately classify patients into triage categories. Experimental results demonstrate that the proposed algorithm achieved high accuracy outperforming traditional triage methods. By using the proposed method we claim that healthcare professionals can predict severity index to guide patient management processing and resource allocation.
△ Less
Submitted 11 March, 2024;
originally announced March 2024.
-
A novel Network Science Algorithm for Improving Triage of Patients
Authors:
Pietro Hiram Guzzi,
Annamaria De Filippo,
Pierangelo Veltri
Abstract:
Patient triage plays a crucial role in healthcare, ensuring timely and appropriate care based on the urgency of patient conditions. Traditional triage methods heavily rely on human judgment, which can be subjective and prone to errors. Recently, a growing interest has been in leveraging artificial intelligence (AI) to develop algorithms for triaging patients. This paper presents the development of…
▽ More
Patient triage plays a crucial role in healthcare, ensuring timely and appropriate care based on the urgency of patient conditions. Traditional triage methods heavily rely on human judgment, which can be subjective and prone to errors. Recently, a growing interest has been in leveraging artificial intelligence (AI) to develop algorithms for triaging patients. This paper presents the development of a novel algorithm for triaging patients. It is based on the analysis of patient data to produce decisions regarding their prioritization. The algorithm was trained on a comprehensive data set containing relevant patient information, such as vital signs, symptoms, and medical history. The algorithm was designed to accurately classify patients into triage categories through rigorous preprocessing and feature engineering. Experimental results demonstrate that our algorithm achieved high accuracy and performance, outperforming traditional triage methods. By incorporating computer science into the triage process, healthcare professionals can benefit from improved efficiency, accuracy, and consistency, prioritizing patients effectively and optimizing resource allocation. Although further research is needed to address challenges such as biases in training data and model interpretability, the development of AI-based algorithms for triaging patients shows great promise in enhancing healthcare delivery and patient outcomes.
△ Less
Submitted 9 October, 2023;
originally announced October 2023.
-
Design and Development of PCN-Miner: A tool for the Analysis of Protein Contact Networks
Authors:
Pietro Hiram Guzzi,
Luisa Di Paola,
Alessandro Giuliani,
Pierangelo Veltri
Abstract:
Protein Contact Network (PCN) is a powerful tool for analysing the structure and function of proteins. In particular, PCN has been used for disclosing the molecular features of allosteric regulation through PCN clustering. Such analysis is relevant in many applications, such as the recent study of SARS-CoV-2 Spike Protein. Despite its relevance, methods for the analysis of PCN are spread into a se…
▽ More
Protein Contact Network (PCN) is a powerful tool for analysing the structure and function of proteins. In particular, PCN has been used for disclosing the molecular features of allosteric regulation through PCN clustering. Such analysis is relevant in many applications, such as the recent study of SARS-CoV-2 Spike Protein. Despite its relevance, methods for the analysis of PCN are spread into a set of different libraries and tools. Therefore, the introduction of a tool that incorporates all the function may help researchers. We present PCN-Miner a software tool implemented in the Python programming language able to import protein in the Protein Data Bank format and generate the corresponding protein contact network. Then it offers a set of algorithms for the analysis of PCS that cover a large set of applications: from clustering to embedding and subsequent analysis.
Software is available at \url{https://github.com/hguzzi/ProteinContactNetworks}
△ Less
Submitted 12 January, 2022;
originally announced January 2022.
-
Using Dual-Network Analyser for extracting communities from Dual Networks
Authors:
Pietro Hiram Guzzi,
Giuseppe Tradigo,
Pierangelo Veltri
Abstract:
The representation of data and its relationships using networks is prevalent in many research fields such as computational biology, medical informatics and social networks. Recently, complex networks models have been introduced to better capture the insights of the modelled scenarios. Among others, dual networks -based models have been introduced, which consist in mapping information as pair of ne…
▽ More
The representation of data and its relationships using networks is prevalent in many research fields such as computational biology, medical informatics and social networks. Recently, complex networks models have been introduced to better capture the insights of the modelled scenarios. Among others, dual networks -based models have been introduced, which consist in mapping information as pair of networks containing the same nodes but different edges.
We focus on the use of a novel approach to visualise and analyse dual networks. The method uses two algorithms for community discovery, and it is provided as a Python-based tool with a graphical user interface. The tool is able to load dual networks and to extract both the densest connected subgraph as well as the common modular communities. The latter is obtained by using an adapted implementation of the Louvain algorithm.
The proposed algorithm and graphical tool have been tested by using social, biological, and co-authorship networks. Results demonstrate that the proposed approach is efficient and is able to extract meaningful information from dual networks. Finally, as contribution, the proposed graphical user interface can be considered a valuable innovation to the context.
△ Less
Submitted 5 March, 2021;
originally announced March 2021.
-
Extracting Dense and Connected Subgraphs in Dual Networks by Network Alignment
Authors:
Pietro Hiram Guzzi,
Emanuel Salerno,
Giuseppe Tradigo,
Pierangelo Veltri
Abstract:
The use of network based approaches to model and analyse large datasets is currently a growing research field. For instance in biology and medicine, networks are used to model interactions among biological molecules as well as relations among patients. Similarly, data coming from social networks can be trivially modelled by using graphs. More recently, the use of dual networks gained the attention…
▽ More
The use of network based approaches to model and analyse large datasets is currently a growing research field. For instance in biology and medicine, networks are used to model interactions among biological molecules as well as relations among patients. Similarly, data coming from social networks can be trivially modelled by using graphs. More recently, the use of dual networks gained the attention of researchers. A dual network model uses a pair of graphs to model a scenario in which one of the two graphs is usually unweighted (a network representing physical associations among nodes) while the other one is edge-weighted (a network representing conceptual associations among nodes). In this paper we focus on the problem of finding the Densest Connected sub-graph (DCS) having the largest density in the conceptual network which is also connected in the physical network. The problem is relevant but also computationally hard, therefore the need for introducing of novel algorithms arises. We formalise the problem and then we map DCS into a graph alignment problem. Then we propose a possible solution. A set of experiments is also presented to support our approach.
△ Less
Submitted 4 February, 2020;
originally announced February 2020.
-
Enhancing magic sets with an application to ontological reasoning
Authors:
Mario Alviano,
Nicola Leone,
Pierfrancesco Veltri,
Jessica Zangari
Abstract:
Magic sets are a Datalog to Datalog rewriting technique to optimize query answering. The rewritten program focuses on a portion of the stable model(s) of the input program which is sufficient to answer the given query. However, the rewriting may introduce new recursive definitions, which can involve even negation and aggregations, and may slow down program evaluation. This paper enhances the magic…
▽ More
Magic sets are a Datalog to Datalog rewriting technique to optimize query answering. The rewritten program focuses on a portion of the stable model(s) of the input program which is sufficient to answer the given query. However, the rewriting may introduce new recursive definitions, which can involve even negation and aggregations, and may slow down program evaluation. This paper enhances the magic set technique by preventing the creation of (new) recursive definitions in the rewritten program. It turns out that the new version of magic sets is closed for Datalog programs with stratified negation and aggregations, which is very convenient to obtain efficient computation of the stable model of the rewritten program. Moreover, the rewritten program is further optimized by the elimination of subsumed rules and by the efficient handling of the cases where binding propagation is lost. The research was stimulated by a challenge on the exploitation of Datalog/\textsc{dlv} for efficient reasoning on large ontologies. All proposed techniques have been hence implemented in the \textsc{dlv} system, and tested for ontological reasoning, confirming their effectiveness.
Under consideration for publication in Theory and Practice of Logic Programming.
△ Less
Submitted 19 July, 2019;
originally announced July 2019.
-
HetNetAligner: Design and Implementation of an algorithm for heterogeneous network alignment on Apache Spark
Authors:
Pietro H Guzzi,
Marianna Milano,
Pierangelo Veltri,
Mario Cannataro
Abstract:
The importance of the use of networks to model and analyse biological data and the interplay of bio-molecules is widely recognised. Consequently, many algorithms for the analysis and the comparison of networks (such as alignment algorithms) have been developed in the past. Recently, many different approaches tried to integrate into a single model the interplay of different molecules, such as genes…
▽ More
The importance of the use of networks to model and analyse biological data and the interplay of bio-molecules is widely recognised. Consequently, many algorithms for the analysis and the comparison of networks (such as alignment algorithms) have been developed in the past. Recently, many different approaches tried to integrate into a single model the interplay of different molecules, such as genes, transcription factors and microRNAs. A possible formalism to model such scenario comes from node coloured networks (or heterogeneous networks) implemented as node/ edge-coloured graphs. Consequently, the need for the introduction of alignment algorithms able to analyse heterogeneous networks arises. To the best of our knowledge, all the existing algorithms are not able to mine heterogeneous networks. We propose a two-step alignment strategy that receives as input two heterogeneous networks (node-coloured graphs) and a similarity function among nodes of two networks extending the previous formulations. We first build a single alignment graph. Then we mine this graph extracting relevant subgraphs. Despite this simple approach, the analysis of such networks relies on graph and subgraph isomorphism and the size of the data is still growing. Therefore the use of high-performance data analytics framework is needed. We here present HetNetAligner a framework built on top of Apache Spark. We also implemented our algorithm, and we tested it on some selected heterogeneous biological networks. Preliminary results confirm that our method may extract relevant knowledge from biological data reducing the computational time.
△ Less
Submitted 11 June, 2018;
originally announced June 2018.
-
A web-based tool to Analyze Semantic Similarity Networks
Authors:
Mario Cannataro,
Pietro Hiram Guzzi,
Marianna Milano,
Pierangelo Veltri
Abstract:
In computational biology, biological entities such as genes or proteins are usually annotated with terms extracted from Gene Ontology (GO). The functional similarity among terms of an ontology is evaluated by using Semantic Similarity Measures (SSM). More recently, the extensive application of SSMs yielded to the Semantic Similarity Networks (SSNs). SSNs are edge-weighted graphs where the nodes ar…
▽ More
In computational biology, biological entities such as genes or proteins are usually annotated with terms extracted from Gene Ontology (GO). The functional similarity among terms of an ontology is evaluated by using Semantic Similarity Measures (SSM). More recently, the extensive application of SSMs yielded to the Semantic Similarity Networks (SSNs). SSNs are edge-weighted graphs where the nodes are concepts (e.g. proteins) and each edge has an associated weight that represents the semantic similarity among related pairs of nodes. The analysis of SSNs may reveal biologically meaningful knowledge. For these aims, the need for the introduction of tool able to manage and analyze SSN arises. Consequently we developed SSN-Analyzer a web based tool able to build and preprocess SSN. As proof of concept we demonstrate that community detection algorithms applied to filtered (thresholded) networks, have better performances in terms of biological relevance of the results, with respect to the use of raw unfiltered networks.
△ Less
Submitted 21 December, 2014;
originally announced December 2014.