Jointly Extracting Interventions, Outcomes, and Findings from RCT Reports with LLMs

Wadhwa, Somin; DeYoung, Jay; Nye, Benjamin; Amir, Silvio; Wallace, Byron C.

Computer Science > Computation and Language

arXiv:2305.03642 (cs)

[Submitted on 5 May 2023 (v1), last revised 18 Jul 2023 (this version, v3)]

Title:Jointly Extracting Interventions, Outcomes, and Findings from RCT Reports with LLMs

Authors:Somin Wadhwa, Jay DeYoung, Benjamin Nye, Silvio Amir, Byron C. Wallace

View PDF

Abstract:Results from Randomized Controlled Trials (RCTs) establish the comparative effectiveness of interventions, and are in turn critical inputs for evidence-based care. However, results from RCTs are presented in (often unstructured) natural language articles describing the design, execution, and outcomes of trials; clinicians must manually extract findings pertaining to interventions and outcomes of interest from such articles. This onerous manual process has motivated work on (semi-)automating extraction of structured evidence from trial reports. In this work we propose and evaluate a text-to-text model built on instruction-tuned Large Language Models (LLMs) to jointly extract Interventions, Outcomes, and Comparators (ICO elements) from clinical abstracts, and infer the associated results reported. Manual (expert) and automated evaluations indicate that framing evidence extraction as a conditional generation task and fine-tuning LLMs for this purpose realizes considerable ($\sim$20 point absolute F1 score) gains over the previous SOTA. We perform ablations and error analyses to assess aspects that contribute to model performance, and to highlight potential directions for further improvements. We apply our model to a collection of published RCTs through mid-2022, and release a searchable database of structured findings: this http URL

Comments:	Accepted to MLHC 2023
Subjects:	Computation and Language (cs.CL)
Cite as:	arXiv:2305.03642 [cs.CL]
	(or arXiv:2305.03642v3 [cs.CL] for this version)
	https://doi.org/10.48550/arXiv.2305.03642

Submission history

From: Somin Wadhwa [view email]
[v1] Fri, 5 May 2023 16:02:06 UTC (7,310 KB)
[v2] Fri, 14 Jul 2023 15:38:01 UTC (7,296 KB)
[v3] Tue, 18 Jul 2023 01:36:42 UTC (7,296 KB)

Computer Science > Computation and Language

Title:Jointly Extracting Interventions, Outcomes, and Findings from RCT Reports with LLMs

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computation and Language

Title:Jointly Extracting Interventions, Outcomes, and Findings from RCT Reports with LLMs

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators