Automated Identification of Eviction Status from Electronic Health Record Notes

Yao, Zonghai; Tsai, Jack; Liu, Weisong; Levy, David A.; Druhl, Emily; Reisman, Joel I; Yu, Hong

doi:10.1093/jamia/ocad081

arXiv:2212.02762 (cs)

[Submitted on 6 Dec 2022 (v1), last revised 20 May 2023 (this version, v3)]

Abstract: Objective: Evictions are important social and behavioral determinants of health. They are associated with a cascade of negative events that can lead to unemployment, housing insecurity/homelessness, long-term poverty, and mental health problems. In this study, we developed a natural language processing system to automatically detect eviction status from electronic health record (EHR) notes.
Materials and Methods: We first defined eviction status (eviction presence and eviction period) and then annotated eviction status in 5000 EHR notes from the Veterans Health Administration (VHA). We developed a novel model, KIRESH, that was shown to substantially outperform other state-of-the-art approaches, such as fine-tuning pre-trained language models like BioBERT and BioClinicalBERT. Moreover, we designed a novel prompt to further improve model performance by exploiting the intrinsic connection between the two sub-tasks of eviction presence and period prediction. Finally, we applied temperature scaling-based calibration to our KIRESH-Prompt method to avoid over-confidence issues arising from the imbalanced dataset.
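To illustrate the calibration step mentioned above, the following is a minimal, generic sketch of temperature scaling. It is not the authors' implementation: the function names, the log-spaced grid search (standing in for a gradient-based optimizer), and the toy validation logits are all assumptions made for illustration. The idea is to fit a single scalar T > 0 on held-out data so that softmax(logits / T) minimizes the negative log-likelihood.

```python
import math


def log_softmax(logits, temperature=1.0):
    """Numerically stable log-softmax of logits divided by a temperature."""
    scaled = [z / temperature for z in logits]
    peak = max(scaled)
    log_norm = peak + math.log(sum(math.exp(s - peak) for s in scaled))
    return [s - log_norm for s in scaled]


def mean_nll(logit_rows, labels, temperature):
    """Average negative log-likelihood of the true labels at a given temperature."""
    total = 0.0
    for logits, label in zip(logit_rows, labels):
        total -= log_softmax(logits, temperature)[label]
    return total / len(labels)


def fit_temperature(logit_rows, labels, low=0.05, high=20.0, steps=400):
    """Pick the temperature on a log-spaced grid that minimizes validation NLL.

    The grid search is an illustrative stand-in for the optimizer one would
    normally use; T = 1.0 (no calibration) is the starting candidate.
    """
    best_t = 1.0
    best_loss = mean_nll(logit_rows, labels, best_t)
    for i in range(steps + 1):
        t = low * (high / low) ** (i / steps)
        loss = mean_nll(logit_rows, labels, t)
        if loss < best_loss:
            best_t, best_loss = t, loss
    return best_t


# Hypothetical validation logits for a 3-class task. Two rows are confidently
# wrong, which is the over-confidence pattern calibration is meant to soften.
val_logits = [
    [6.0, 0.0, -1.0],
    [5.0, 1.0, 0.0],
    [0.0, 7.0, 1.0],
    [4.0, 0.5, 0.0],
    [0.0, 1.0, 6.0],
    [5.5, 0.0, 0.5],
]
val_labels = [0, 1, 1, 0, 2, 2]

temperature = fit_temperature(val_logits, val_labels)
calibrated = [math.exp(v) for v in log_softmax(val_logits[0], temperature)]
```

Dividing by a positive temperature never changes the predicted class, only how confident the probabilities are, so the classification metrics are unaffected while the probability estimates become less extreme.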
Results: KIRESH-Prompt substantially outperformed strong baselines, including a fine-tuned BioClinicalBERT model, achieving 0.74672 MCC, 0.71153 Macro-F1, and 0.83396 Micro-F1 in predicting eviction period and 0.66827 MCC, 0.62734 Macro-F1, and 0.7863 Micro-F1 in predicting eviction presence. We also conducted additional experiments on a benchmark social and behavioral determinants of health (SBDH) dataset to demonstrate the generalizability of our methods.
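For readers unfamiliar with the three reported metrics, the sketch below computes multiclass MCC, Macro-F1, and Micro-F1 from a list of true and predicted labels. It is an illustration only, not the authors' evaluation code: the function name and the integer labels in the toy example are hypothetical, and the MCC uses the standard multiclass generalization based on per-class counts.

```python
import math
from collections import Counter


def classification_scores(y_true, y_pred):
    """Return MCC, Macro-F1, and Micro-F1 for single-label multiclass data."""
    labels = sorted(set(y_true) | set(y_pred))
    true_counts = Counter(y_true)   # TP + FN per class
    pred_counts = Counter(y_pred)   # TP + FP per class
    correct = Counter(t for t, p in zip(y_true, y_pred) if t == p)  # TP per class
    n = len(y_true)
    n_correct = sum(correct.values())

    # Per-class F1 = 2TP / (2TP + FP + FN); Macro-F1 is the unweighted mean,
    # so rare classes count as much as frequent ones.
    f1_per_class = [
        2 * correct[k] / (true_counts[k] + pred_counts[k]) for k in labels
    ]
    macro_f1 = sum(f1_per_class) / len(f1_per_class)

    # With exactly one label per example, Micro-F1 equals accuracy.
    micro_f1 = n_correct / n

    # Multiclass MCC from the totals of the confusion matrix.
    covariance = n_correct * n - sum(pred_counts[k] * true_counts[k] for k in labels)
    spread = math.sqrt(
        (n * n - sum(pred_counts[k] ** 2 for k in labels))
        * (n * n - sum(true_counts[k] ** 2 for k in labels))
    )
    mcc = covariance / spread if spread else 0.0

    return {"mcc": mcc, "macro_f1": macro_f1, "micro_f1": micro_f1}


# Toy example with hypothetical integer class labels.
scores = classification_scores([0, 0, 1, 1, 2, 2], [0, 0, 1, 2, 2, 2])
```

On an imbalanced dataset, Micro-F1 is dominated by the frequent classes, while Macro-F1 and MCC are more sensitive to errors on rare classes, which is one reason all three are commonly reported together.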
Conclusion and Future Work: KIRESH-Prompt has substantially improved eviction status classification. We plan to deploy KIRESH-Prompt to the VHA EHRs as an eviction surveillance system to help address the US Veterans' housing insecurity.

Comments:	This article has been accepted for publication in the Journal of the American Medical Informatics Association, published by Oxford University Press.
Subjects:	Computation and Language (cs.CL)
Cite as:	arXiv:2212.02762 [cs.CL]
	(or arXiv:2212.02762v3 [cs.CL] for this version)
	https://doi.org/10.48550/arXiv.2212.02762
Journal reference:	Journal of the American Medical Informatics Association, ocad081, 2023
Related DOI:	https://doi.org/10.1093/jamia/ocad081

Submission history

From: Zonghai Yao
[v1] Tue, 6 Dec 2022 05:25:32 UTC (235 KB)
[v2] Fri, 7 Apr 2023 15:17:30 UTC (147 KB)
[v3] Sat, 20 May 2023 05:03:57 UTC (147 KB)
