The NLP Sandbox: an efficient model-to-data system to enable federated and unbiased evaluation of clinical NLP models

Yan, Yao; Yu, Thomas; Muenzen, Kathleen; Liu, Sijia; Boyle, Connor; Koslowski, George; Zheng, Jiaxin; Dobbins, Nicholas; Essien, Clement; Liu, Hongfang; Omberg, Larsson; Yestigen, Meliha; Taylor, Bradley; Eddy, James A; Guinney, Justin; Mooney, Sean; Schaffter, Thomas

arXiv:2206.14181 (cs)

[Submitted on 28 Jun 2022]

Abstract:

Objective: The evaluation of natural language processing (NLP) models for clinical text de-identification relies on the availability of clinical notes, which is often restricted due to privacy concerns. The NLP Sandbox alleviates the lack of data and evaluation frameworks for NLP models by adopting a federated, model-to-data approach. This enables unbiased federated model evaluation without the need to share sensitive data from multiple institutions.

Materials and Methods: We leveraged the Synapse collaborative framework, containerization software, and OpenAPI generator to build the NLP Sandbox (this http URL). We evaluated two state-of-the-art, de-identification-focused NLP annotation models, Philter and NeuroNER, using data from three institutions. We further validated model performance using data from an external validation site.

Results: We demonstrated the usefulness of the NLP Sandbox by evaluating clinical de-identification models. An external developer was able to incorporate their model into the NLP Sandbox template and provide user experience feedback.

Discussion: We demonstrated the feasibility of using the NLP Sandbox to conduct a multi-site evaluation of clinical text de-identification models without sharing data. Standardized model and data schemas enable smooth model transfer and implementation. To generalize the NLP Sandbox, data owners and model developers will need to develop suitable, standardized schemas and adapt their data or models to fit them.

Conclusions: The NLP Sandbox lowers the barrier to utilizing clinical data for NLP model evaluation and facilitates federated, multi-site, unbiased evaluation of NLP models.
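The model-to-data idea described in the abstract can be illustrated with a minimal Python sketch. It is not the NLP Sandbox implementation or its API: the `Site` class, the `(start, length, label)` span schema, the exact-match precision/recall/F1 scoring, and the toy date annotator are all assumptions made for illustration. The point it shows is only that the model travels to each site, runs against notes that stay local, and that aggregate scores alone are returned.

```python
import re
from dataclasses import dataclass
from typing import Callable, Dict, List, Set, Tuple

# Hypothetical annotation schema: (start offset, length, label).
Span = Tuple[int, int, str]
Annotator = Callable[[str], Set[Span]]


@dataclass
class Site:
    """A data-hosting site. Notes and gold annotations stay inside this object."""
    name: str
    notes: List[str]
    gold: List[Set[Span]]

    def evaluate(self, model: Annotator) -> Dict[str, float]:
        """Run the submitted model locally and return only aggregate scores."""
        tp = fp = fn = 0
        for text, gold_spans in zip(self.notes, self.gold):
            predicted = model(text)
            tp += len(predicted & gold_spans)   # exact span matches
            fp += len(predicted - gold_spans)   # spurious predictions
            fn += len(gold_spans - predicted)   # missed gold spans
        precision = tp / (tp + fp) if tp + fp else 0.0
        recall = tp / (tp + fn) if tp + fn else 0.0
        denom = precision + recall
        f1 = 2 * precision * recall / denom if denom else 0.0
        return {"precision": precision, "recall": recall, "f1": f1}


def toy_date_annotator(text: str) -> Set[Span]:
    """Toy stand-in for a de-identification model: flags dd/dd/dddd strings."""
    pattern = r"\b\d{2}/\d{2}/\d{4}\b"
    return {(m.start(), len(m.group()), "DATE") for m in re.finditer(pattern, text)}


def federated_evaluation(model: Annotator, sites: List[Site]) -> Dict[str, Dict[str, float]]:
    """Send the same model to every site and collect per-site scores."""
    return {site.name: site.evaluate(model) for site in sites}


# Synthetic example data; no real clinical text is involved.
site_a = Site(
    name="site_a",
    notes=["Seen on 01/02/2020 by Dr. Smith."],
    gold=[{(8, 10, "DATE"), (22, 9, "NAME")}],
)
site_b = Site(name="site_b", notes=["No identifiers here."], gold=[set()])

results = federated_evaluation(toy_date_annotator, [site_a, site_b])
```

In this sketch the caller of `federated_evaluation` only ever sees the per-site score dictionaries, which mirrors the abstract's claim that evaluation proceeds without sharing sensitive data between institutions.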

Subjects:	Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
Cite as:	arXiv:2206.14181 [cs.CL]
	(or arXiv:2206.14181v1 [cs.CL] for this version)
	https://doi.org/10.48550/arXiv.2206.14181

Submission history

From: Yao Yan
[v1] Tue, 28 Jun 2022 17:47:56 UTC (895 KB)
