Public Health Informatics: Proposing Causal Sequence of Death Using Neural Machine Translation

Zhu, Yuanda; Sha, Ying; Wu, Hang; Li, Mai; Hoffman, Ryan A.; Wang, May D.

Computer Science > Machine Learning

arXiv:2009.10318 (cs)

[Submitted on 22 Sep 2020 (v1), last revised 9 Mar 2021 (this version, v2)]

Title:Public Health Informatics: Proposing Causal Sequence of Death Using Neural Machine Translation

Authors:Yuanda Zhu, Ying Sha, Hang Wu, Mai Li, Ryan A. Hoffman, May D. Wang

View PDF

Abstract:Each year there are nearly 57 million deaths around the world, with over 2.7 million in the United States. Timely, accurate and complete death reporting is critical in public health, as institutions and government agencies rely on death reports to analyze vital statistics and to formulate responses to communicable diseases. Inaccurate death reporting may result in potential misdirection of public health policies. Determining the causes of death is, nevertheless, challenging even for experienced physicians. To facilitate physicians in accurately reporting causes of death, we present an advanced AI approach to determine a chronically ordered sequence of clinical conditions that lead to death, based on decedent's last hospital discharge record. The sequence of clinical codes on the death report is named as causal chain of death, coded in the tenth revision of International Statistical Classification of Diseases (ICD-10); in line with the ICD-9-CM Official Guidelines for Coding and Reporting, the priority-ordered clinical conditions on the discharge record are coded in ICD-9. We identify three challenges in proposing the causal chain of death: two versions of coding system in clinical codes, medical domain knowledge conflict, and data interoperability. To overcome the first challenge in this sequence-to-sequence problem, we apply neural machine translation models to generate target sequence. Along with three accuracy metrics, we evaluate the quality of generated sequences with the BLEU (BiLingual Evaluation Understudy) score and achieve 16.04 out of 100. To address the second challenge, we incorporate expert-verified medical domain knowledge as constraint in generating output sequence to exclude infeasible causal chains. Lastly, we demonstrate the usability of our work in a Fast Healthcare Interoperability Resources (FHIR) interface to address the third challenge.

Comments:	11 pages, 8 figures, 8 tables. Updates: (1) Added Section II: Recent Work (2) Added three accuracy evaluation criteria (3) Re-run experiments of OpenNMT and updated the results and discussion in Section VI: Results and Discussion (4) Finished FHIR mobile app and updated Section VII: FHIR Interface (5) Revised Section VIII: Conclusion accordingly
Subjects:	Machine Learning (cs.LG); Machine Learning (stat.ML)
Cite as:	arXiv:2009.10318 [cs.LG]
	(or arXiv:2009.10318v2 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.2009.10318

Submission history

From: Yuanda Zhu [view email]
[v1] Tue, 22 Sep 2020 04:56:23 UTC (1,283 KB)
[v2] Tue, 9 Mar 2021 19:43:37 UTC (1,049 KB)

Computer Science > Machine Learning

Title:Public Health Informatics: Proposing Causal Sequence of Death Using Neural Machine Translation

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Machine Learning

Title:Public Health Informatics: Proposing Causal Sequence of Death Using Neural Machine Translation

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators