-
Surrogate-assisted performance prediction for data-driven knowledge discovery algorithms: application to evolutionary modeling of clinical pathways
Authors:
Anastasia A. Funkner,
Aleksey N. Yakovlev,
Sergey V. Kovalchuk
Abstract:
The paper proposes and investigates an approach for surrogate-assisted performance prediction of data-driven knowledge discovery algorithms. The approach is based on the identification of surrogate models for prediction of the target algorithm's quality and performance. The proposed approach was implemented and investigated as applied to an evolutionary algorithm for discovering clusters of interpretable clinical pathways in electronic health records of patients with acute coronary syndrome. Several clustering metrics and execution time were used as the target quality and performance metrics respectively. An analytical software prototype based on the proposed approach for the prediction of algorithm characteristics and feature analysis was developed to provide a more interpretable prediction of the target algorithm's performance and quality that can be further used for parameter tuning.
Submitted 7 January, 2022; v1 submitted 2 April, 2020;
originally announced April 2020.
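The surrogate idea in the abstract above can be illustrated with a small sketch: run the target algorithm for a few parameter settings, record a performance metric, and fit a cheap model that predicts the metric for settings not yet tried. This is not the authors' implementation. The ordinary least squares model, the `population_size` parameter, and the timing values below are all hypothetical and chosen only for illustration.

```python
from statistics import mean


def fit_linear_surrogate(xs, ys):
    """Fit y ~ slope * x + intercept by ordinary least squares."""
    x_bar, y_bar = mean(xs), mean(ys)
    sxx = sum((x - x_bar) ** 2 for x in xs)
    if sxx == 0:
        raise ValueError("need at least two distinct x values")
    sxy = sum((x - x_bar) * (y - y_bar) for x, y in zip(xs, ys))
    slope = sxy / sxx
    intercept = y_bar - slope * x_bar
    return slope, intercept


def predict(model, x):
    """Evaluate the fitted surrogate at a new parameter value."""
    slope, intercept = model
    return slope * x + intercept


# Hypothetical observations: population size of an evolutionary
# algorithm versus measured execution time in seconds.
population_sizes = [10, 20, 30, 40]
run_times = [2.0, 4.0, 6.0, 8.0]

surrogate = fit_linear_surrogate(population_sizes, run_times)

# Estimate the run time for a setting that was never executed.
estimated_time = predict(surrogate, 50)
```

A surrogate of this kind is only useful if it is much cheaper to evaluate than the target algorithm, which is why a simple regression is used here. A clustering quality metric could be substituted for the execution time in the same way.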
-
The Atrial Fibrillation Risk Score for Hyperthyroidism Patients
Authors:
Ilya V. Derevitskii,
Daria A. Savitskaya,
Alina Y. Babenko,
Sergey V. Kovalchuk
Abstract:
Thyrotoxicosis (TT) is associated with an increase in both total and cardiovascular mortality. One of the main risks of thyrotoxicosis is atrial fibrillation (AF). Accurate AF prediction helps medical personnel prescribe the correct medication and the appropriate surgical or radioiodine therapy. The main goal of this study is to create a method for the practical treatment and diagnosis of AF. The study proposes a new method for assessing the risk of atrial fibrillation in patients with TT. The method considers both the features of the complication and the specifics of the chronic disease. The models are built from the case histories of patients with thyrotoxicosis. We used machine learning methods to create several models, each with advantages and disadvantages depending on the diagnostic and medical purpose. The resulting models score highly on several AF prediction metrics. The models are interpretable and simple to use. Therefore, they can be used by medical specialists as part of a decision support system (DSS) for the treatment and diagnosis of AF.
Submitted 28 February, 2020;
originally announced February 2020.
-
On Classification Issues within Ensemble-Based Complex System Simulation Tasks
Authors:
Sergey V. Kovalchuk,
Aleksey V. Krikunov,
Konstantin V. Knyazkov,
Sergey S. Kosukhin,
Alexander V. Boukhanovsky
Abstract:
Contemporary tasks of complex system simulation are often related to the issue of uncertainty management. Uncertainty comes from the lack of information or knowledge about the simulated system as well as from restrictions of the model set being used. One of the powerful tools for uncertainty management is ensemble-based simulation, which uses variation in input or output data, model parameters, or available versions of models to improve the simulation performance. Furthermore, the system of models for complex system simulation (especially when an ensemble-based approach is employed) can itself be considered a complex system. As a result, the identification of the complex model's structure and parameters provides additional sources of uncertainty to be managed. In the presented work we develop a conceptual and technological approach to managing ensemble-based simulation that takes into account the changing states of both the simulated system and the system of models within the ensemble-based approach. The states of these systems are treated as a subject of classification, with consequent inference of better strategies for ensemble evolution over the simulation time and for ensemble aggregation. Here the ensemble evolution enables the implementation of dynamic reactive solutions that can automatically conform to the changing states of both systems. The ensemble aggregation can be considered within the scope of an averaging approach (the regression way) or a selection approach (the classification way, which complements the classification mentioned earlier). The technological basis for such an approach includes ensemble-based simulation techniques using domain-specific software combined within a composite application; data science approaches for the analysis of available datasets (simulation data, observations, situation assessment, etc.); and machine learning algorithms for class identification, ensemble management and knowledge acquisition.
Submitted 18 March, 2016; v1 submitted 1 October, 2015;
originally announced October 2015.
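The two aggregation styles named in the abstract above, averaging and selection, can be contrasted in a small sketch. This is an illustration under stated assumptions, not the authors' method. The member names, forecast values, and error values are hypothetical, and picking the member with the lowest recent error is a simple stand-in for the state classification the abstract describes.

```python
from statistics import mean


def aggregate_by_averaging(forecasts):
    """Averaging (regression-style): combine all members equally."""
    return mean(forecasts.values())


def aggregate_by_selection(forecasts, recent_errors):
    """Selection (classification-style): use a single chosen member.

    Assumption: the member with the lowest recent error is chosen.
    """
    best_member = min(recent_errors, key=recent_errors.get)
    return forecasts[best_member]


# Hypothetical ensemble of three model versions forecasting one value.
forecasts = {"model_a": 1.0, "model_b": 2.0, "model_c": 6.0}
# Hypothetical errors of each member on recent observations.
recent_errors = {"model_a": 0.5, "model_b": 0.1, "model_c": 0.9}

averaged = aggregate_by_averaging(forecasts)
selected = aggregate_by_selection(forecasts, recent_errors)
```

Averaging smooths out the errors of individual members, while selection can follow a change of state more quickly when one member clearly suits the current conditions.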