-
RadEx: A Framework for Structured Information Extraction from Radiology Reports based on Large Language Models
Authors:
Daniel Reichenpfader,
Jonas Knupp,
André Sander,
Kerstin Denecke
Abstract:
Annually and globally, over three billion radiography examinations and computer tomography scans result in mostly unstructured radiology reports containing free text. Despite the potential benefits of structured reporting, its adoption is limited by factors such as established processes, resource constraints and potential loss of information. However, structured information would be necessary for…
▽ More
Annually and globally, over three billion radiography examinations and computer tomography scans result in mostly unstructured radiology reports containing free text. Despite the potential benefits of structured reporting, its adoption is limited by factors such as established processes, resource constraints and potential loss of information. However, structured information would be necessary for various use cases, including automatic analysis, clinical trial matching, and prediction of health outcomes. This study introduces RadEx, an end-to-end framework comprising 15 software components and ten artifacts to develop systems that perform automated information extraction from radiology reports. It covers the complete process from annotating training data to extracting information by offering a consistent generic information model and setting boundaries for model development. Specifically, RadEx allows clinicians to define relevant information for clinical domains (e.g., mammography) and to create report templates. The framework supports both generative and encoder-only models and the decoupling of information extraction from template filling enables independent model improvements. Developing information extraction systems according to the RadEx framework facilitates implementation and maintenance as components are easily exchangeable, while standardized artifacts ensure interoperability between components.
△ Less
Submitted 14 June, 2024;
originally announced June 2024.
-
Epidemic Intelligence for the Crowd, by the Crowd (Full Version)
Authors:
Ernesto Diaz-Aviles,
Avaré Stewart,
Edward Velasco,
Kerstin Denecke,
Wolfgang Nejdl
Abstract:
Tracking Twitter for public health has shown great potential. However, most recent work has been focused on correlating Twitter messages to influenza rates, a disease that exhibits a marked seasonal pattern. In the presence of sudden outbreaks, how can social media streams be used to strengthen surveillance capacity? In May 2011, Germany reported an outbreak of Enterohemorrhagic Escherichia coli (…
▽ More
Tracking Twitter for public health has shown great potential. However, most recent work has been focused on correlating Twitter messages to influenza rates, a disease that exhibits a marked seasonal pattern. In the presence of sudden outbreaks, how can social media streams be used to strengthen surveillance capacity? In May 2011, Germany reported an outbreak of Enterohemorrhagic Escherichia coli (EHEC). It was one of the largest described outbreaks of EHEC/HUS worldwide and the largest in Germany. In this work, we study the crowd's behavior in Twitter during the outbreak. In particular, we report how tracking Twitter helped to detect key user messages that triggered signal detection alarms before MedISys and other well established early warning systems. We also introduce a personalized learning to rank approach that exploits the relationships discovered by: (i) latent semantic topics computed using Latent Dirichlet Allocation (LDA), and (ii) observing the social tagging behavior in Twitter, to rank tweets for epidemic intelligence. Our results provide the grounds for new public health research based on social media.
△ Less
Submitted 5 March, 2012;
originally announced March 2012.
-
Essential Variables and Separable Sets in Universal Algebra
Authors:
Slavcho Shtrakov,
Klaus Denecke
Abstract:
The study of essential and strongly essential variables in functions defined on finite sets is a part of $k$-valued logic. We extend the main definitions from functions to terms. This allows us to apply concepts and results of Universal Algebra. On the basis of the concept of a separable set of variables in a term we introduce a new notion of complexity of terms, algebras and varieties and give…
▽ More
The study of essential and strongly essential variables in functions defined on finite sets is a part of $k$-valued logic. We extend the main definitions from functions to terms. This allows us to apply concepts and results of Universal Algebra. On the basis of the concept of a separable set of variables in a term we introduce a new notion of complexity of terms, algebras and varieties and give examples.
△ Less
Submitted 10 December, 2008;
originally announced December 2008.
-
Multi-Hypersubstitutions and Colored Solid Varieties
Authors:
Klaus Denecke,
Jorg Koppitz,
Slavcho Shtrakov
Abstract:
Hypersubstitutions are mappings which map operation symbols to terms. Terms can be visualized by trees. Hypersubstitutions can be extended to mappings defined on sets of trees. The nodes of the trees, describing terms, are labelled by operation symbols and by colors, i.e. certain positive integers. We are interested in mappings which map differently colored operation symbols to different terms.…
▽ More
Hypersubstitutions are mappings which map operation symbols to terms. Terms can be visualized by trees. Hypersubstitutions can be extended to mappings defined on sets of trees. The nodes of the trees, describing terms, are labelled by operation symbols and by colors, i.e. certain positive integers. We are interested in mappings which map differently colored operation symbols to different terms. In this paper we extend the theory of hypersubstitutions and solid varieties to multi-hypersubstitutions and colored solid varieties. We develop the interconnections between such colored terms and multi-hypersubstitutions and the equational theory of Universal Algebra. The collection of all varieties of a given type forms a complete lattice which is very complex and difficult to study; multi-hypersubstitutions and colored solid varieties offer a new method to study complete sublattices of this lattice.
△ Less
Submitted 3 December, 2008; v1 submitted 28 November, 2008;
originally announced November 2008.
-
The Depth of a Hypersubstitution
Authors:
Klaus Denecke,
Jorg Koppitz,
Slavcho Shtrakov
Abstract:
For given depth of a we derive a formula for the depth of the image of that term under a given hypersubstitution.
For given depth of a we derive a formula for the depth of the image of that term under a given hypersubstitution.
△ Less
Submitted 28 November, 2008;
originally announced November 2008.