-
Explainable AI for Bioinformatics: Methods, Tools, and Applications
Authors:
Md. Rezaul Karim,
Tanhim Islam,
Oya Beyan,
Christoph Lange,
Michael Cochez,
Dietrich Rebholz-Schuhmann,
Stefan Decker
Abstract:
Artificial intelligence (AI) systems utilizing deep neural networks (DNNs) and machine learning (ML) algorithms are widely used for solving important problems in bioinformatics, biomedical informatics, and precision medicine. However, complex DNNs or ML models, which are often perceived as opaque and black-box, can make it difficult to understand the reasoning behind their decisions. This lack of transparency can be a challenge for both end-users and decision-makers, as well as AI developers. Additionally, in sensitive areas like healthcare, explainability and accountability are not only desirable but also legally required for AI systems that can have a significant impact on human lives. Fairness is another growing concern, as algorithmic decisions should not show bias or discrimination towards certain groups or individuals based on sensitive attributes. Explainable artificial intelligence (XAI) aims to overcome the opaqueness of black-box models and provide transparency in how AI systems make decisions. Interpretable ML models can explain how they make predictions and the factors that influence their outcomes. However, most state-of-the-art interpretable ML methods are domain-agnostic and evolved from fields like computer vision, automated reasoning, or statistics, making direct application to bioinformatics problems challenging without customization and domain-specific adaptation. In this paper, we discuss the importance of explainability in the context of bioinformatics, provide an overview of model-specific and model-agnostic interpretable ML methods and tools, and outline their potential caveats and drawbacks. We also discuss how to customize existing interpretable ML methods for bioinformatics problems. Finally, we demonstrate how XAI methods can improve transparency through case studies in bioimaging, cancer genomics, and text mining.
Submitted 23 February, 2023; v1 submitted 25 December, 2022;
originally announced December 2022.
-
Insights from a computational analysis of the SARS-CoV-2 Omicron variant: Host-pathogen interaction, pathogenicity and possible therapeutics
Authors:
Md Sorwer Alam Parvez,
Manash Kumar Saha,
Md. Ibrahim,
Yusha Araf,
Md. Taufiqul Islam,
Gen Ohtsuki,
Mohammad Jakir Hosen
Abstract:
Prominently accountable for the upsurge of COVID-19 cases as the world attempts to recover from the previous two waves, Omicron has further threatened the conventional therapeutic approaches. Omicron is the fifth variant of concern (VOC), which comprises more than 10 mutations in the receptor-binding domain (RBD) of the spike protein. However, the lack of extensive research regarding Omicron has raised the need to establish correlations to understand this variant by structural comparisons. Here, we evaluate, correlate, and compare its genomic sequences through an immunoinformatic approach with wild-type and mutant RBD forms of the spike protein to understand its epidemiological characteristics and responses towards existing drugs for better patient management. Our computational analyses provided insights into infectious and pathogenic trails of the Omicron variant. In addition, while the analysis showed South Africa's Omicron variant to be similar to the highly-infectious B.1.620 variant, mutations within the prominent proteins are hypothesized to alter its pathogenicity. Moreover, docking evaluations revealed significant differences in binding affinity with human receptors, ACE2 and NRP1. Owing to its characteristics of rendering existing treatments ineffective, we evaluated the drug efficacy against their target proteins encoded in Omicron through a molecular docking approach. Most of the tested drugs were proven to be effective. Nirmatrelvir (Paxlovid), MPro 13b, and Lopinavir displayed increased effectiveness and efficacy, while Ivermectin showed the best result against Omicron.
Submitted 20 January, 2022;
originally announced January 2022.
-
An early warning tool for predicting mortality risk of COVID-19 patients using machine learning
Authors:
Muhammad E. H. Chowdhury,
Tawsifur Rahman,
Amith Khandakar,
Somaya Al-Madeed,
Susu M. Zughaier,
Suhail A. R. Doi,
Hanadi Hassen,
Mohammad T. Islam
Abstract:
The COVID-19 pandemic has placed extreme pressure on global healthcare services. Fast, reliable and early clinical assessment of the severity of the disease can help in allocating and prioritizing resources to reduce mortality. In order to study the important blood biomarkers for predicting disease mortality, a retrospective study was conducted on 375 COVID-19 positive patients admitted to Tongji Hospital (China) from January 10 to February 18, 2020. Demographic and clinical characteristics, and patient outcomes were investigated using machine learning tools to identify key biomarkers to predict the mortality of individual patients. A nomogram was developed for predicting the mortality risk among COVID-19 patients. Lactate dehydrogenase, neutrophils (%), lymphocyte (%), high-sensitivity C-reactive protein, and age, all acquired at hospital admission, were identified as key predictors of death by a multi-tree XGBoost model. The area under the curve (AUC) of the nomogram for the derivation and validation cohorts was 0.961 and 0.991, respectively. An integrated score (LNLCA) was calculated with the corresponding death probability. COVID-19 patients were divided into three subgroups: low-, moderate- and high-risk groups using LNLCA cut-off values of 10.4 and 12.65 with the death probability less than 5%, 5% to 50%, and above 50%, respectively. The prognostic model, nomogram and LNLCA score can help in early detection of high mortality risk of COVID-19 patients, which will help doctors to improve the management of patient stratification.
Submitted 29 July, 2020;
originally announced July 2020.
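The three-way stratification described in this abstract can be illustrated with a minimal sketch. Only the two cut-off values (10.4 and 12.65) and the associated death-probability ranges come from the abstract. The function name `risk_group`, the assumption that a higher LNLCA score means higher risk, and the handling of scores that fall exactly on a cut-off are illustrative choices, not the authors' implementation. How the LNLCA score itself is computed is not given in the abstract and is not reproduced here.

```python
# LNLCA cut-off values as stated in the abstract.
LOW_CUTOFF = 10.4
HIGH_CUTOFF = 12.65


def risk_group(lnlca_score: float) -> str:
    """Map an LNLCA score to a risk group.

    Assumptions (not stated in the abstract): a higher score means a
    higher risk, and a score exactly on a cut-off is assigned to the
    moderate group.
    """
    if lnlca_score < LOW_CUTOFF:
        return "low"        # reported death probability below 5%
    if lnlca_score <= HIGH_CUTOFF:
        return "moderate"   # reported death probability 5% to 50%
    return "high"           # reported death probability above 50%


if __name__ == "__main__":
    for score in (9.0, 11.0, 13.0):
        print(score, risk_group(score))
```

The example scores in the `__main__` block are arbitrary inputs chosen to fall in each of the three ranges; they are not patient data.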
-
Non-invasive assessment of the spatial and temporal distributions of interstitial fluid pressure, fluid velocity and fluid flow in cancers in vivo
Authors:
Md Tauhidul Islam,
Ennio Tasciotti,
Raffaella Righetti
Abstract:
Interstitial fluid pressure (IFP), interstitial fluid velocity (IFV), interstitial permeability (IP) and vascular permeability (VP) are cancer mechanopathological parameters of great clinical significance. To date, there is a lack of non-invasive techniques that can be used to estimate these parameters in vivo. In this study, we designed and tested new ultrasound poroelastography methods capable of estimating the magnitude and spatial distribution of fluid pressure, fluid velocity and fluid flow inside tumors. We theoretically proved that fluid pressure, velocity and flow estimated using poroelastography from a tumor under creep compression are directly related to the underlying IFP, IFV and fluid flow, respectively, differing only in peak values. We also proved that, from the spatial distribution of the fluid pressure estimated using poroelastography, it is possible to derive: the parameter alpha, which quantifies the spatial distribution of the IFP; the ratio between VP and IP and the ratio between the peak IFP and effective vascular pressure in the tumor. Finally, we demonstrated that axial strain time constant (TC) elastograms are directly related to VP and IP in tumors. Our techniques were validated using finite element and ultrasound simulations, while experiments on a human breast cancer animal model were used to show the feasibility of these methods in vivo.
Submitted 10 September, 2018;
originally announced September 2018.