-
AI-Driven Decision Support in Oncology: Evaluating Data Readiness for Skin Cancer Treatment
Authors:
Joscha Grüger,
Tobias Geyer,
Tobias Brix,
Michael Storck,
Sonja Leson,
Laura Bley,
Carsten Weishaupt,
Ralph Bergmann,
Stephan A. Braun
Abstract:
This research focuses on evaluating and enhancing data readiness for the development of an Artificial Intelligence (AI)-based Clinical Decision Support System (CDSS) in the context of skin cancer treatment. The study, conducted at the Skin Tumor Center of the University Hospital Münster, delves into the essential role of data quality, availability, and extractability in implementing effective AI applications in oncology. By employing a multifaceted methodology, including literature review, data readiness assessment, and expert workshops, the study addresses the challenges of integrating AI into clinical decision-making. The research identifies crucial data points for skin cancer treatment decisions, evaluates their presence and quality in various information systems, and highlights the difficulties in extracting information from unstructured data. The findings underline the significance of high-quality, accessible data for the success of AI-driven CDSS in medical settings, particularly in the complex field of oncology.
Submitted 12 March, 2025;
originally announced March 2025.
-
Leveraging Taxonomy Similarity for Next Activity Prediction in Patient Treatment
Authors:
Martin Kuhn,
Joscha Grüger,
Tobias Geyer,
Ralph Bergmann
Abstract:
The rapid progress in modern medicine presents physicians with complex challenges when planning patient treatment. Techniques from the field of Predictive Business Process Monitoring, such as next-activity prediction (NAP), can support physicians in treatment planning by proposing a possible next treatment step. Existing patient data, often in the form of electronic health records, can be analyzed to recommend the next suitable step in the treatment process. However, the use of patient data poses many challenges due to its knowledge-intensive character, its high variability, and the scarcity of medical data. To overcome these challenges, this article examines the use of the knowledge encoded in taxonomies to improve and explain the prediction of the next activity in the treatment process. This study proposes the TS4NAP approach, which uses medical taxonomies (ICD-10-CM and ICD-10-PCS) in combination with graph matching to assess the similarities of medical codes to predict the next treatment step. The effectiveness of the proposed approach is evaluated using event logs that are derived from the MIMIC-IV dataset. The results highlight the potential of using domain-specific knowledge held in taxonomies to improve the prediction of the next activity, and thus to improve treatment planning and decision-making by making the predictions more explainable.
Submitted 17 March, 2025; v1 submitted 5 March, 2025;
originally announced March 2025.
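The idea of using taxonomy structure to compare medical codes, as described in the abstract above, can be illustrated with a minimal sketch. This is not the TS4NAP method (which relies on graph matching); it swaps in a simple prefix-based similarity and a nearest-neighbour lookup. It assumes that every character of a code is one taxonomy level, which is a simplification, and the function names, the toy log, and the labels `next_A` and `next_B` are hypothetical.

```python
# Illustrative sketch only: prefix-based code similarity, NOT the graph
# matching used by TS4NAP. Assumption: each character of a code
# corresponds to one level of the taxonomy.

def common_prefix_len(a: str, b: str) -> int:
    """Number of leading characters shared by a and b."""
    n = 0
    for x, y in zip(a, b):
        if x != y:
            break
        n += 1
    return n


def taxonomy_similarity(code_a: str, code_b: str) -> float:
    """Wu-Palmer-style score: 2 * depth(common ancestor) / (depth(a) + depth(b))."""
    a = code_a.replace(".", "").upper()
    b = code_b.replace(".", "").upper()
    if not a or not b:
        return 0.0
    return 2 * common_prefix_len(a, b) / (len(a) + len(b))


def trace_similarity(trace_a: list, trace_b: list) -> float:
    """Compare two code sequences pairwise, aligned from their most recent step."""
    if not trace_a or not trace_b:
        return 0.0
    pairs = zip(reversed(trace_a), reversed(trace_b))
    total = sum(taxonomy_similarity(a, b) for a, b in pairs)
    return total / max(len(trace_a), len(trace_b))


def predict_next(prefix: list, log: list) -> str:
    """Return the recorded next step of the most similar historical prefix."""
    best = max(log, key=lambda case: trace_similarity(prefix, case[0]))
    return best[1]


# Hypothetical toy log: (observed prefix of codes, recorded next step).
toy_log = [
    (["I21.0"], "next_A"),
    (["J18.9"], "next_B"),
]
```

Because the score depends only on shared code prefixes, a prediction can be explained by pointing at the common ancestor of the compared codes, which is the kind of explainability the abstract attributes to taxonomy knowledge.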
-
Two Models for Surface Segmentation using the Total Variation of the Normal Vector
Authors:
Lukas Baumgärtner,
Ronny Bergmann,
Roland Herzog,
Stephan Schmidt,
Manuel Weiß
Abstract:
We consider the problem of surface segmentation, where the goal is to partition a surface represented by a triangular mesh. The segmentation is based on the similarity of the normal vector field to a given set of label vectors. We propose a variational approach and compare two different regularizers, both based on a total variation measure. The first regularizer penalizes the total variation of the assignment function directly, while the second regularizer penalizes the total variation in the label space. In order to solve the resulting optimization problems, we use variations of the split Bregman (ADMM) iteration adapted to the problem at hand. While computationally more expensive, the second regularizer yields better results in our experiments, in particular it removes noise more reliably in regions of constant curvature.
Submitted 30 November, 2024;
originally announced December 2024.
-
Modelling Fire Incidents Response Times in Ålesund
Authors:
J. Christmas,
R. Bergmann,
A. Zhakatayev,
J. Rebenda,
S. Singh
Abstract:
In the ESGI-156 project together with Ålesund Brannvesen we develop a model for response times to fire incidents on publicly available data for Ålesund. We investigate different scenarios and a first step towards an interactive software for illustrating the response times.
Submitted 16 August, 2024;
originally announced September 2024.
-
From Internet of Things Data to Business Processes: Challenges and a Framework
Authors:
Juergen Mangler,
Ronny Seiger,
Janik-Vasily Benzin,
Joscha Grüger,
Yusuf Kirikkayis,
Florian Gallik,
Lukas Malburg,
Matthias Ehrendorfer,
Yannis Bertrand,
Marco Franceschetti,
Barbara Weber,
Stefanie Rinderle-Ma,
Ralph Bergmann,
Estefanía Serral Asensio,
Manfred Reichert
Abstract:
The IoT and Business Process Management (BPM) communities co-exist in many shared application domains, such as manufacturing and healthcare. The IoT community has a strong focus on hardware, connectivity and data; the BPM community focuses mainly on finding, controlling, and enhancing the structured interactions among the IoT devices in processes. While the field of Process Mining deals with the extraction of process models and process analytics from process event logs, the data produced by IoT sensors often is at a lower granularity than these process-level events. The fundamental questions about extracting and abstracting process-related data from streams of IoT sensor values are: (1) Which sensor values can be clustered together as part of process events?, (2) Which sensor values signify the start and end of such events?, (3) Which sensor values are related but not essential? This work proposes a framework to semi-automatically perform a set of structured steps to convert low-level IoT sensor data into higher-level process events that are suitable for process mining. The framework is meant to provide a generic sequence of abstract steps to guide the event extraction, abstraction, and correlation, with variation points for plugging in specific analysis techniques and algorithms for each step. To assess the completeness of the framework, we present a set of challenges, how they can be tackled through the framework, and an example on how to instantiate the framework in a real-world demonstration from the field of smart manufacturing. Based on this framework, future research can be conducted in a structured manner through refining and improving individual steps.
Submitted 22 May, 2024; v1 submitted 14 May, 2024;
originally announced May 2024.
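Question (2) in the abstract above, which sensor values signify the start and end of an event, can be made concrete with a small sketch. It is not the framework from the paper; it assumes a single sensor, a fixed threshold, and readings given as `(timestamp, value)` pairs in time order. The function name `extract_events` and the activity label are hypothetical.

```python
# Illustrative sketch only: threshold-based event abstraction for one sensor.
# Assumptions: readings are (timestamp, value) pairs sorted by timestamp,
# and a value above the threshold means the activity is running.

def extract_events(readings, threshold, activity="machine_active"):
    """Turn a low-level sensor stream into process-level events with start/end."""
    events = []
    start = None  # timestamp at which the current event began, if any
    for timestamp, value in readings:
        if value > threshold and start is None:
            start = timestamp  # rising edge: the event starts
        elif value <= threshold and start is not None:
            # falling edge: close the running event at this reading
            events.append({"activity": activity, "start": start, "end": timestamp})
            start = None
    if start is not None:
        # the stream ended while the activity was still running
        events.append({"activity": activity, "start": start, "end": None})
    return events


sample_readings = [(0, 0.1), (1, 0.9), (2, 0.8), (3, 0.2), (4, 0.95)]
sample_events = extract_events(sample_readings, threshold=0.5)
```

In a real setting the threshold rule would be one of the pluggable analysis techniques the abstract mentions; clustering or change-point detection could take its place without changing the shape of the resulting event records.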
-
Likelihood-based Sensor Calibration using Affine Transformation
Authors:
Rüdiger Machhamer,
Lejla Begic Fazlic,
Eray Guven,
David Junk,
Gunes Karabulut Kurt,
Stefan Naumann,
Stephan Didas,
Klaus-Uwe Gollmer,
Ralph Bergmann,
Ingo J. Timm,
Guido Dartmann
Abstract:
An important task in the field of sensor technology is the efficient implementation of adaptation procedures of measurements from one sensor to another sensor of identical design. One idea is to use the estimation of an affine transformation between different systems, which can be improved by the knowledge of experts. This paper presents an improved solution from Glacier Research that was published back in 1973. The results demonstrate the adaptability of this solution for various applications, including software calibration of sensors, implementation of expert-based adaptation, and paving the way for future advancements such as distributed learning methods. One idea here is to use the knowledge of experts for estimating an affine transformation between different systems. We evaluate our research with simulations and also with real measured data of a multi-sensor board with 8 identical sensors. Both data set and evaluation script are provided for download. The results show an improvement for both the simulation and the experiments with real data.
Submitted 10 January, 2024; v1 submitted 20 September, 2023;
originally announced September 2023.
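As a rough illustration of estimating an affine transformation between two sensors, the sketch below fits `y ≈ a * x + b` by ordinary least squares in the one-dimensional case. It is not the method of the paper: it assumes paired scalar readings and independent Gaussian noise on the reference sensor only, in which case the least-squares fit coincides with the maximum-likelihood estimate. The names `fit_affine` and `calibrate` are hypothetical.

```python
# Illustrative sketch only: 1-D affine calibration y ≈ a * x + b.
# Assumption: i.i.d. Gaussian noise on y, so ordinary least squares
# equals the maximum-likelihood estimate of (a, b).

def fit_affine(x, y):
    """Estimate gain a and offset b mapping sensor x onto reference sensor y."""
    n = len(x)
    if n != len(y) or n < 2:
        raise ValueError("need at least two paired readings")
    mean_x = sum(x) / n
    mean_y = sum(y) / n
    sxx = sum((xi - mean_x) ** 2 for xi in x)
    if sxx == 0:
        raise ValueError("sensor x shows no variation; gain is not identifiable")
    sxy = sum((xi - mean_x) * (yi - mean_y) for xi, yi in zip(x, y))
    a = sxy / sxx
    b = mean_y - a * mean_x
    return a, b


def calibrate(x, a, b):
    """Apply the fitted affine transformation to raw readings."""
    return [a * xi + b for xi in x]


raw = [0.0, 1.0, 2.0, 3.0]        # readings of the sensor to calibrate
reference = [1.0, 3.0, 5.0, 7.0]  # simultaneous readings of the reference sensor
gain, offset = fit_affine(raw, reference)
```

Expert knowledge, which the abstract names as a way to improve the estimate, could enter such a scheme as a prior on the gain and offset; that extension is not shown here.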
-
Declarative Guideline Conformance Checking of Clinical Treatments: A Case Study
Authors:
Joscha Grüger,
Tobias Geyer,
Martin Kuhn,
Stefan Braun,
Ralph Bergmann
Abstract:
Conformance checking is a process mining technique that allows verifying the conformance of process instances to a given model. This technique is therefore well suited to the medical context, where it can be used to compare treatment cases with clinical guidelines. However, medical processes are highly variable, highly dynamic, and complex. This makes the use of imperative conformance checking approaches in the medical domain difficult. Studies show that declarative approaches can better address these characteristics. However, none of the approaches has yet gained practical acceptance. Alignments pose another challenge, as they usually do not add any value from a medical point of view. For this reason, we investigate in a case study the usability of the HL7 standard Arden Syntax for declarative, rule-based conformance checking and the use of manually modeled alignments. Using the approach, it was possible to check the conformance of treatment cases and create medically meaningful alignments for large parts of a medical guideline.
Submitted 20 September, 2022;
originally announced September 2022.
-
An IoT-Enriched Event Log for Process Mining in Smart Factories
Authors:
Lukas Malburg,
Joscha Grüger,
Ralph Bergmann
Abstract:
Modern technologies such as the Internet of Things (IoT) are becoming increasingly important in various domains, including Business Process Management (BPM) research. One main research area in BPM is process mining, which can be used to analyze event logs, e.g., for checking the conformance of running processes. However, there are only a few IoT-based event logs available for research purposes. Some of them are artificially generated and do not always fully reflect the actual physical properties of smart environments. In this paper, we present an IoT-enriched XES event log that is generated by a physical smart factory. For this purpose, we create the SensorStream XES extension for representing IoT data in event logs. Finally, we present a preliminary analysis and some properties of the log.
Submitted 6 September, 2022;
originally announced September 2022.
-
SensorStream: An XES Extension for Enriching Event Logs with IoT-Sensor Data
Authors:
Joscha Grüger,
Lukas Malburg,
Juergen Mangler,
Yannis Bertrand,
Stefanie Rinderle-Ma,
Ralph Bergmann,
Estefanía Serral Asensio
Abstract:
Process management and process orchestration/execution are currently hot topics; prevalent trends such as automation and Industry 4.0 require solutions which allow domain experts to easily model and execute processes in various domains, including manufacturing and healthcare. These domains, in turn, rely on a tight integration between hardware and software, i.e., via the Internet of Things (IoT). While process execution is about actuation, i.e., actively triggering actions and awaiting their completion, accompanying IoT sensors monitor humans and the environment. These sensors produce large amounts of procedural, discrete, and continuous data streams that hold the key to understanding the quality of process subjects (e.g., produced parts), outcome (e.g., quantity and quality), and error causes. Processes constantly evolve in conjunction with their IoT environment. Joint storage of the data generated by processes and the data generated by the IoT sensors is therefore needed. In this paper, we present an extension of the process log standard format XES, namely SensorStream. SensorStream makes it possible to connect IoT data to process events and provides a set of semantic annotations to describe the scenario and environment during data collection. This preserves the full context required for data analysis, so that logs can be analyzed even when scenarios or hardware artifacts are rapidly changing. Through additional semantic annotations, we envision the XES extension log format to be a solid basis for the creation of a (semi-)automatic analysis pipeline, which can support domain experts by automatically providing data visualization, or even process insights.
Submitted 22 June, 2022;
originally announced June 2022.
-
Informed Machine Learning for Improved Similarity Assessment in Process-Oriented Case-Based Reasoning
Authors:
Maximilian Hoffmann,
Ralph Bergmann
Abstract:
Currently, Deep Learning (DL) components within a Case-Based Reasoning (CBR) application often lack the comprehensive integration of available domain knowledge. The trend within machine learning towards so-called Informed machine learning can help to overcome this limitation. In this paper, we therefore investigate the potential of integrating domain knowledge into Graph Neural Networks (GNNs) that are used for similarity assessment between semantic graphs within process-oriented CBR applications. We integrate knowledge in two ways: First, a special data representation and processing method is used that encodes structural knowledge about the semantic annotations of each graph node and edge. Second, the message-passing component of the GNNs is constrained by knowledge on legal node mappings. The evaluation examines the quality and training time of the extended GNNs, compared to the stock models. The results show that both extensions are capable of providing better quality, shorter training times, or in some configurations both advantages at once.
Submitted 30 June, 2021;
originally announced June 2021.
-
Manifolds.jl: An Extensible Julia Framework for Data Analysis on Manifolds
Authors:
Seth D. Axen,
Mateusz Baran,
Ronny Bergmann,
Krzysztof Rzecki
Abstract:
We present the Julia package Manifolds.jl, providing a fast and easy-to-use library of Riemannian manifolds and Lie groups. This package enables working with data defined on a Riemannian manifold, such as the circle, the sphere, symmetric positive definite matrices, or one of the models for hyperbolic spaces. We introduce a common interface, available in ManifoldsBase.jl, with which new manifolds, applications, and algorithms can be implemented. We demonstrate the utility of Manifolds.jl using Bézier splines, an optimization task on manifolds, and principal component analysis on nonlinear data. In a benchmark, Manifolds.jl outperforms all comparable packages for low-dimensional manifolds in speed; over Python and Matlab packages, the improvement is often several orders of magnitude, while over C/C++ packages, the improvement is two-fold. For high-dimensional manifolds, it outperforms all packages except for Tensorflow-Riemopt, which is specifically tailored for high-dimensional manifolds.
Submitted 12 June, 2023; v1 submitted 16 June, 2021;
originally announced June 2021.
-
Mesh Denoising and Inpainting using the Total Variation of the Normal and a Shape Newton Approach
Authors:
Lukas Baumgärtner,
Ronny Bergmann,
Roland Herzog,
Stephan Schmidt,
José Vidal-Núñez,
Manuel Weiß
Abstract:
We present a novel approach to denoising and inpainting problems for surface meshes. The purpose of these problems is to remove noise or fill in missing parts while preserving important features such as sharp edges. A discrete variant of the total variation of the unit normal vector field serves as a regularizing functional to achieve these goals. In order to solve the resulting problem, we use a version of the split Bregman (ADMM) iteration adapted to the problem. A new formulation of the total variation regularizer, as well as the use of an inexact Newton method for the shape optimization step, bring significant speed-up compared to earlier methods. Numerical examples are included, demonstrating the performance of our algorithm with some complex 3D geometries.
Submitted 12 March, 2024; v1 submitted 21 December, 2020;
originally announced December 2020.
-
Using Semantic Web Services for AI-Based Research in Industry 4.0
Authors:
Lukas Malburg,
Patrick Klein,
Ralph Bergmann
Abstract:
The transition to Industry 4.0 requires smart manufacturing systems that are easily configurable and provide a high level of flexibility during manufacturing in order to achieve mass customization or to support cloud manufacturing. To realize this, Cyber-Physical Systems (CPSs) combined with Artificial Intelligence (AI) methods find their way into manufacturing shop floors. For using AI methods in the context of Industry 4.0, semantic web services are indispensable to provide a reasonable abstraction of the underlying manufacturing capabilities. In this paper, we present semantic web services for AI-based research in Industry 4.0. Therefore, we developed more than 300 semantic web services for a physical simulation factory based on Web Ontology Language for Web Services (OWL-S) and Web Service Modeling Ontology (WSMO) and linked them to an already existing domain ontology for intelligent manufacturing control. Suitable for the requirements of CPS environments, our pre- and postconditions are verified in near real-time by invoking other semantic web services in contrast to complex reasoning within the knowledge base. Finally, we evaluate our implementation by executing a cyber-physical workflow composed of semantic web services using a workflow management system.
Submitted 7 July, 2020;
originally announced July 2020.
-
Towards an Argument Mining Pipeline Transforming Texts to Argument Graphs
Authors:
Mirko Lenz,
Premtim Sahitaj,
Sean Kallenberg,
Christopher Coors,
Lorik Dumani,
Ralf Schenkel,
Ralph Bergmann
Abstract:
This paper targets the automated extraction of components of argumentative information and their relations from natural language text. Moreover, we address a current lack of systems to provide complete argumentative structure from arbitrary natural language text for general usage. We present an argument mining pipeline as a universally applicable approach for transforming German and English language texts to graph-based argument representations. We also introduce new methods for evaluating the results based on existing benchmark argument structures. Our results show that the generated argument graphs can be beneficial to detect new connections between different statements of an argumentative text. Our pipeline implementation is publicly available on GitHub.
Submitted 28 September, 2020; v1 submitted 8 June, 2020;
originally announced June 2020.
-
Same Side Stance Classification Task: Facilitating Argument Stance Classification by Fine-tuning a BERT Model
Authors:
Stefan Ollinger,
Lorik Dumani,
Premtim Sahitaj,
Ralph Bergmann,
Ralf Schenkel
Abstract:
Computational argumentation is currently being intensively investigated. The goal of this research community is to find the best pro and con arguments for a user-given topic, either to form an opinion for oneself or to persuade others to adopt a certain standpoint. While existing argument mining methods can find appropriate arguments for a topic, a correct classification into pro and con is not yet reliable. The same side stance classification task provides a dataset of argument pairs classified by whether or not both arguments share the same stance. It does not require distinguishing between topic-specific pro and con vocabulary; only the argument similarity within a stance needs to be assessed. Our contribution to the task builds on a setup based on the BERT architecture. We fine-tuned a pre-trained BERT model for three epochs and used the first 512 tokens of each argument to predict whether two arguments share the same stance.
Submitted 23 April, 2020;
originally announced April 2020.
-
Opening the Black Boxes in Data Flow Optimization
Authors:
Fabian Hueske,
Mathias Peters,
Matthias Sax,
Astrid Rheinländer,
Rico Bergmann,
Aljoscha Krettek,
Kostas Tzoumas
Abstract:
Many systems for big data analytics employ a data flow abstraction to define parallel data processing tasks. In this setting, custom operations expressed as user-defined functions are very common. We address the problem of performing data flow optimization at this level of abstraction, where the semantics of operators are not known. Traditionally, query optimization is applied to queries with known algebraic semantics. In this work, we find that a handful of properties, rather than a full algebraic specification, suffice to establish reordering conditions for data processing operators. We show that these properties can be accurately estimated for black box operators by statically analyzing the general-purpose code of their user-defined functions. We design and implement an optimizer for parallel data flows that does not assume knowledge of semantics or algebraic properties of operators. Our evaluation confirms that the optimizer can apply common rewritings such as selection reordering, bushy join-order enumeration, and limited forms of aggregation push-down, hence yielding similar rewriting power as modern relational DBMS optimizers. Moreover, it can optimize the operator order of non-relational data flows, a unique feature among today's systems.
Submitted 31 July, 2012;
originally announced August 2012.
-
Building and Refining Abstract Planning Cases by Change of Representation Language
Authors:
R. Bergmann,
W. Wilke
Abstract:
Abstraction is one of the most promising approaches to improve the performance of problem solvers. In several domains abstraction by dropping sentences of a domain description -- as used in most hierarchical planners -- has proven useful. In this paper we present examples which illustrate significant drawbacks of abstraction by dropping sentences. To overcome these drawbacks, we propose a more general view of abstraction involving the change of representation language. We have developed a new abstraction methodology and a related sound and complete learning algorithm that allows the complete change of representation language of planning cases from concrete to abstract. However, to achieve a powerful change of the representation language, the abstract language itself as well as rules which describe admissible ways of abstracting states must be provided in the domain model. This new abstraction approach is the core of Paris (Plan Abstraction and Refinement in an Integrated System), a system in which abstract planning cases are automatically learned from given concrete cases. An empirical study in the domain of process planning in mechanical engineering shows significant advantages of the proposed reasoning from abstract cases over classical hierarchical planning.
Submitted 30 June, 1995;
originally announced July 1995.