-
Machine Learning Models for the Identification of Cardiovascular Diseases Using UK Biobank Data
Authors:
Sheikh Mohammed Shariful Islam,
Moloud Abrar,
Teketo Tegegne,
Liliana Loranjo,
Chandan Karmakar,
Md Abdul Awal,
Md. Shahadat Hossain,
Muhammad Ashad Kabir,
Mufti Mahmud,
Abbas Khosravi,
George Siopis,
Jeban C Moses,
Ralph Maddison
Abstract:
Machine learning models have the potential to identify cardiovascular diseases (CVDs) early and accurately in primary healthcare settings, which is crucial for delivering timely treatment and management. Although population-based CVD risk models have been used traditionally, these models often do not consider variations in lifestyles, socioeconomic conditions, or genetic predispositions. Therefore…
▽ More
Machine learning models have the potential to identify cardiovascular diseases (CVDs) early and accurately in primary healthcare settings, which is crucial for delivering timely treatment and management. Although population-based CVD risk models have been used traditionally, these models often do not consider variations in lifestyles, socioeconomic conditions, or genetic predispositions. Therefore, we aimed to develop machine learning models for CVD detection using primary healthcare data, compare the performance of different models, and identify the best models. We used data from the UK Biobank study, which included over 500,000 middle-aged participants from different primary healthcare centers in the UK. Data collected at baseline (2006--2010) and during imaging visits after 2014 were used in this study. Baseline characteristics, including sex, age, and the Townsend Deprivation Index, were included. Participants were classified as having CVD if they reported at least one of the following conditions: heart attack, angina, stroke, or high blood pressure. Cardiac imaging data such as electrocardiogram and echocardiography data, including left ventricular size and function, cardiac output, and stroke volume, were also used. We used 9 machine learning models (LSVM, RBFSVM, GP, DT, RF, NN, AdaBoost, NB, and QDA), which are explainable and easily interpretable. We reported the accuracy, precision, recall, and F-1 scores; confusion matrices; and area under the curve (AUC) curves.
△ Less
Submitted 23 July, 2024;
originally announced July 2024.
-
DTI-SNNFRA: Drug-Target interaction prediction by shared nearest neighbors and fuzzy-rough approximation
Authors:
Sk Mazharul Islam,
Sk Md Mosaddek Hossain,
Sumanta Ray
Abstract:
In-silico prediction of repurposable drugs is an effective drug discovery strategy that supplements de-nevo drug discovery from scratch. Reduced development time, less cost and absence of severe side effects are significant advantages of using drug repositioning. Most recent and most advanced artificial intelligence (AI) approaches have boosted drug repurposing in terms of throughput and accuracy…
▽ More
In-silico prediction of repurposable drugs is an effective drug discovery strategy that supplements de-nevo drug discovery from scratch. Reduced development time, less cost and absence of severe side effects are significant advantages of using drug repositioning. Most recent and most advanced artificial intelligence (AI) approaches have boosted drug repurposing in terms of throughput and accuracy enormously. However, with the growing number of drugs, targets and their massive interactions produce imbalanced data which may not be suitable as input to the classification model directly. Here, we have proposed DTI-SNNFRA, a framework for predicting drug-target interaction (DTI), based on shared nearest neighbour (SNN) and fuzzy-rough approximation (FRA). It uses sampling techniques to collectively reduce the vast search space covering the available drugs, targets and millions of interactions between them. DTI-SNNFRA operates in two stages: first, it uses SNN followed by a partitioning clustering for sampling the search space. Next, it computes the degree of fuzzy-rough approximations and proper degree threshold selection for the negative samples' undersampling from all possible interaction pairs between drugs and targets obtained in the first stage. Finally, classification is performed using the positive and selected negative samples. We have evaluated the efficacy of DTI-SNNFRA using AUC (Area under ROC Curve), Geometric Mean, and F1 Score. The model performs exceptionally well with a high prediction score of 0.95 for ROC-AUC. The predicted drug-target interactions are validated through an existing drug-target database (Connectivity Map (Cmap)).
△ Less
Submitted 20 February, 2021; v1 submitted 22 September, 2020;
originally announced September 2020.
-
A Rule Based Expert System to Assess Coronary Artery Disease under Uncertainty
Authors:
Sohrab Hossain,
Dhiman Sarma,
Rana Joyti Chakma,
Wahidul Alam,
Mohammed Moshiul Hoque,
Iqbal H. Sarker
Abstract:
The coronary artery disease (CAD) involves narrowing and damaging the major blood vessels has become the most life threating disease in the world especially in south Asian reason. Although outstanding medical facilities are available in Singapore and India for CAD patients, early detection of CAD stages are necessary to minimize the patients' sufferings and expenses. It is really challenging for d…
▽ More
The coronary artery disease (CAD) involves narrowing and damaging the major blood vessels has become the most life threating disease in the world especially in south Asian reason. Although outstanding medical facilities are available in Singapore and India for CAD patients, early detection of CAD stages are necessary to minimize the patients' sufferings and expenses. It is really challenging for doctors to incorporate numerous factors for details analysis and CAD detections are expensive as it needs expensive medical facilities. Clinical Decision Support Systems (CDSS) may assist to analyze numerous factors for patients. In this paper, a Rule Based Expert System (RBES) is proposed which can predict five different stages of CAD. RBES contains five different Belief Rule Based (BRB) systems and the final output is produced by combining all BRBs using the Evidential Reasoning (ER). Success, Error, Failure, False Omission rates are calculated to measures the performance of the RBES. The Success Rate and False Omission Rate show better performance comparing to existing CDSS.
△ Less
Submitted 16 March, 2020;
originally announced March 2020.
-
Application and Computation of Probabilistic Neural Plasticity
Authors:
Soaad Hossain
Abstract:
The discovery of neural plasticity has proved that throughout the life of a human being, the brain reorganizes itself through forming new neural connections. The formation of new neural connections are achieved through the brain's effort to adapt to new environments or to changes in the existing environment. Despite the realization of neural plasticity, there is a lack of understanding the probabi…
▽ More
The discovery of neural plasticity has proved that throughout the life of a human being, the brain reorganizes itself through forming new neural connections. The formation of new neural connections are achieved through the brain's effort to adapt to new environments or to changes in the existing environment. Despite the realization of neural plasticity, there is a lack of understanding the probability of neural plasticity occurring given some event. Using ordinary differential equations, neural firing equations and spike-train statistics, we show how an additive short-term memory (STM) equation can be formulated to approach the computation of neural plasticity. We then show how the additive STM equation can be used for probabilistic inference in computable neural plasticity, and the computation of probabilistic neural plasticity. We will also provide a brief introduction to the theory of probabilistic neural plasticity and conclude with showing how it can be applied to multiple disciplines such as behavioural science, machine learning, artificial intelligence and psychiatry.
△ Less
Submitted 6 August, 2020; v1 submitted 25 May, 2019;
originally announced July 2019.