MIMICause: Representation and automatic extraction of causal relation types from clinical notes

Khetan, Vivek; Rizvi, Md Imbesat Hassan; Huber, Jessica; Bartusiak, Paige; Sacaleanu, Bogdan; Fano, Andrew

Computer Science > Computation and Language

arXiv:2110.07090 (cs)

[Submitted on 14 Oct 2021 (v1), last revised 14 Mar 2022 (this version, v2)]

Title:MIMICause: Representation and automatic extraction of causal relation types from clinical notes

Authors:Vivek Khetan, Md Imbesat Hassan Rizvi, Jessica Huber, Paige Bartusiak, Bogdan Sacaleanu, Andrew Fano

View PDF

Abstract:Understanding causal narratives communicated in clinical notes can help make strides towards personalized healthcare. Extracted causal information from clinical notes can be combined with structured EHR data such as patients' demographics, diagnoses, and medications. This will enhance healthcare providers' ability to identify aspects of a patient's story communicated in the clinical notes and help make more informed decisions.
In this work, we propose annotation guidelines, develop an annotated corpus and provide baseline scores to identify types and direction of causal relations between a pair of biomedical concepts in clinical notes; communicated implicitly or explicitly, identified either in a single sentence or across multiple sentences.
We annotate a total of 2714 de-identified examples sampled from the 2018 n2c2 shared task dataset and train four different language model based architectures. Annotation based on our guidelines achieved a high inter-annotator agreement i.e. Fleiss' kappa ($\kappa$) score of 0.72, and our model for identification of causal relations achieved a macro F1 score of 0.56 on the test data. The high inter-annotator agreement for clinical text shows the quality of our annotation guidelines while the provided baseline F1 score sets the direction for future research towards understanding narratives in clinical texts.

Comments:	Accepted at the Findings of ACL 2022
Subjects:	Computation and Language (cs.CL)
ACM classes:	I.2.7; I.5.4
Cite as:	arXiv:2110.07090 [cs.CL]
	(or arXiv:2110.07090v2 [cs.CL] for this version)
	https://doi.org/10.48550/arXiv.2110.07090

Submission history

From: Vivek Khetan [view email]
[v1] Thu, 14 Oct 2021 00:15:36 UTC (2,195 KB)
[v2] Mon, 14 Mar 2022 00:22:42 UTC (3,716 KB)

Computer Science > Computation and Language

Title:MIMICause: Representation and automatic extraction of causal relation types from clinical notes

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computation and Language

Title:MIMICause: Representation and automatic extraction of causal relation types from clinical notes

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators