AutoScore-Imbalance: An interpretable machine learning tool for development of clinical scores with rare events data

Yuan, Han; Xie, Feng; Ong, Marcus Eng Hock; Ning, Yilin; Chee, Marcel Lucas; Saffari, Seyed Ehsan; Abdullah, Hairil Rizal; Goldstein, Benjamin Alan; Chakraborty, Bibhas; Liu, Nan

doi:10.1016/j.jbi.2022.104072

Abstract:Background: Medical decision-making impacts both individual and public health. Clinical scores are commonly used among a wide variety of decision-making models for determining the degree of disease deterioration at the bedside. AutoScore was proposed as a useful clinical score generator based on machine learning and a generalized linear model. Its current framework, however, still leaves room for improvement when addressing unbalanced data of rare events. Methods: Using machine intelligence approaches, we developed AutoScore-Imbalance, which comprises three components: training dataset optimization, sample weight optimization, and adjusted AutoScore. All scoring models were evaluated on the basis of their area under the curve (AUC) in the receiver operating characteristic analysis and balanced accuracy (i.e., mean value of sensitivity and specificity). By utilizing a publicly accessible dataset from Beth Israel Deaconess Medical Center, we assessed the proposed model and baseline approaches in the prediction of inpatient mortality. Results: AutoScore-Imbalance outperformed baselines in terms of AUC and balanced accuracy. The nine-variable AutoScore-Imbalance sub-model achieved the highest AUC of 0.786 (0.732-0.839) while the eleven-variable original AutoScore obtained an AUC of 0.723 (0.663-0.783), and the logistic regression with 21 variables obtained an AUC of 0.743 (0.685-0.800). The AutoScore-Imbalance sub-model (using down-sampling algorithm) yielded an AUC of 0. 0.771 (0.718-0.823) with only five variables, demonstrating a good balance between performance and variable sparsity. Conclusions: The AutoScore-Imbalance tool has the potential to be applied to highly unbalanced datasets to gain further insight into rare medical events and to facilitate real-world clinical decision-making.

Subjects:	Machine Learning (cs.LG)
Cite as:	arXiv:2107.06039 [cs.LG]
	(or arXiv:2107.06039v1 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.2107.06039
Related DOI:	https://doi.org/10.1016/j.jbi.2022.104072

Computer Science > Machine Learning

Title:AutoScore-Imbalance: An interpretable machine learning tool for development of clinical scores with rare events data

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators