Skip to main content

Showing 1–4 of 4 results for author: Wajsbürt, P

Searching in archive cs. Search in all archives.
.
  1. arXiv:2306.02042  [pdf

    cs.CL

    Impact of translation on biomedical information extraction from real-life clinical notes

    Authors: Christel Gérardin, Yuhan Xiong, Perceval Wajsbürt, Fabrice Carrat, Xavier Tannier

    Abstract: The objective of our study is to determine whether using English tools to extract and normalize French medical concepts on translations provides comparable performance to French models trained on a set of annotated French clinical notes. We compare two methods: a method involving French language models and a method involving English language models. For the native French method, the Named Entity R… ▽ More

    Submitted 3 June, 2023; originally announced June 2023.

    Comments: 26 pages, 2 figures, 5 tables

  2. arXiv:2305.13817  [pdf

    cs.CL

    Detecting automatically the layout of clinical documents to enhance the performances of downstream natural language processing

    Authors: Christel Gérardin, Perceval Wajsbürt, Basile Dura, Alice Calliger, Alexandre Moucher, Xavier Tannier, Romain Bey

    Abstract: Objective:Develop and validate an algorithm for analyzing the layout of PDF clinical documents to improve the performance of downstream natural language processing tasks. Materials and Methods: We designed an algorithm to process clinical PDF documents and extract only clinically relevant text. The algorithm consists of several steps: initial text extraction using a PDF parser, followed by classif… ▽ More

    Submitted 23 May, 2023; originally announced May 2023.

    Comments: 22 pages, 5 figures

  3. arXiv:2303.13451  [pdf

    cs.CL

    Development and validation of a natural language processing algorithm to pseudonymize documents in the context of a clinical data warehouse

    Authors: Xavier Tannier, Perceval Wajsbürt, Alice Calliger, Basile Dura, Alexandre Mouchet, Martin Hilka, Romain Bey

    Abstract: The objective of this study is to address the critical issue of de-identification of clinical reports in order to allow access to data for research purposes, while ensuring patient privacy. The study highlights the difficulties faced in sharing tools and resources in this domain and presents the experience of the Greater Paris University Hospitals (AP-HP) in implementing a systematic pseudonymizat… ▽ More

    Submitted 23 March, 2023; originally announced March 2023.

  4. arXiv:2104.01037  [pdf, other

    cs.CL cs.LG

    Effect of depth order on iterative nested named entity recognition models

    Authors: Perceval Wajsburt, Yoann Taillé, Xavier Tannier

    Abstract: This paper studies the effect of the order of depth of mention on nested named entity recognition (NER) models. NER is an essential task in the extraction of biomedical information, and nested entities are common since medical concepts can assemble to form larger entities. Conventional NER systems only predict disjointed entities. Thus, iterative models for nested NER use multiple predictions to e… ▽ More

    Submitted 2 April, 2021; originally announced April 2021.