
Showing 1–15 of 15 results for author: Mercer, R E

  1. arXiv:2506.03659 [pdf, ps, other]

    cs.CL

    Trustworthy Medical Question Answering: An Evaluation-Centric Survey

    Authors: Yinuo Wang, Robert E. Mercer, Frank Rudzicz, Sudipta Singha Roy, Pengjie Ren, Zhumin Chen, Xindi Wang

    Abstract: Trustworthiness in healthcare question-answering (QA) systems is important for ensuring patient safety, clinical effectiveness, and user confidence. As large language models (LLMs) become increasingly integrated into medical settings, the reliability of their responses directly influences clinical decision-making and patient outcomes. However, achieving comprehensive trustworthiness in medical QA…

    Submitted 4 June, 2025; originally announced June 2025.

  2. arXiv:2410.02930 [pdf, other]

    cs.CL

    Graph-tree Fusion Model with Bidirectional Information Propagation for Long Document Classification

    Authors: Sudipta Singha Roy, Xindi Wang, Robert E. Mercer, Frank Rudzicz

    Abstract: Long document classification presents challenges in capturing both local and global dependencies due to their extensive content and complex structure. Existing methods often struggle with token limits and fail to adequately model hierarchical relationships within documents. To address these constraints, we propose a novel model leveraging a graph-tree structure. Our approach integrates syntax tree…

    Submitted 3 October, 2024; originally announced October 2024.

    Comments: Accepted to EMNLP Findings 2024

  3. arXiv:2405.19093 [pdf, other]

    cs.CL cs.IR

    Multi-stage Retrieve and Re-rank Model for Automatic Medical Coding Recommendation

    Authors: Xindi Wang, Robert E. Mercer, Frank Rudzicz

    Abstract: The International Classification of Diseases (ICD) serves as a definitive medical classification system encompassing a wide range of diseases and conditions. The primary objective of ICD indexing is to allocate a subset of ICD codes to a medical record, which facilitates standardized documentation and management of various health conditions. Most existing approaches have suffered from selecting th…

    Submitted 29 May, 2024; originally announced May 2024.

    Comments: Accepted to NAACL 2024 -- camera-ready version

  4. arXiv:2405.19084 [pdf, other]

    cs.CL

    Auxiliary Knowledge-Induced Learning for Automatic Multi-Label Medical Document Classification

    Authors: Xindi Wang, Robert E. Mercer, Frank Rudzicz

    Abstract: The International Classification of Diseases (ICD) is an authoritative medical classification system of different diseases and conditions for clinical and management purposes. ICD indexing assigns a subset of ICD codes to a medical record. Since human coding is labour-intensive and error-prone, many studies employ machine learning to automate the coding process. ICD coding is a challenging task, a…

    Submitted 29 May, 2024; originally announced May 2024.

    Comments: Accepted to LREC-COLING 2024 -- camera-ready version

  5. arXiv:2307.15411 [pdf, other]

    cs.CL

    Investigating the Learning Behaviour of In-context Learning: A Comparison with Supervised Learning

    Authors: Xindi Wang, Yufei Wang, Can Xu, Xiubo Geng, Bowen Zhang, Chongyang Tao, Frank Rudzicz, Robert E. Mercer, Daxin Jiang

    Abstract: Large language models (LLMs) have shown remarkable capacity for in-context learning (ICL), where learning a new task from just a few training examples is done without being explicitly pre-trained. However, despite the success of LLMs, there has been little understanding of how ICL learns the knowledge from the given prompts. In this paper, to make progress toward understanding the learning behavio…

    Submitted 1 August, 2023; v1 submitted 28 July, 2023; originally announced July 2023.

    Comments: Accepted to ECAI 2023 (camera-ready)

  6. arXiv:2204.13604 [pdf, other]

    cs.CL cs.IR

    MeSHup: A Corpus for Full Text Biomedical Document Indexing

    Authors: Xindi Wang, Robert E. Mercer, Frank Rudzicz

    Abstract: Medical Subject Heading (MeSH) indexing refers to the problem of assigning a given biomedical document with the most relevant labels from an extremely large set of MeSH terms. Currently, the vast number of biomedical articles in the PubMed database are manually annotated by human curators, which is time consuming and costly; therefore, a computational system that can assist the indexing is highly…

    Submitted 28 April, 2022; originally announced April 2022.

    Comments: LREC 2022 main conference

  7. arXiv:2204.07571 [pdf, other]

    cs.CL cs.AI

    Evaluation Benchmarks for Spanish Sentence Representations

    Authors: Vladimir Araujo, Andrés Carvallo, Souvik Kundu, José Cañete, Marcelo Mendoza, Robert E. Mercer, Felipe Bravo-Marquez, Marie-Francine Moens, Alvaro Soto

    Abstract: Due to the success of pre-trained language models, versions of languages other than English have been released in recent years. This fact implies the need for resources to evaluate these models. In the case of Spanish, there are few ways to systematically assess the models' quality. In this paper, we narrow the gap by building two evaluation benchmarks. Inspired by previous work (Conneau and Kiela…

    Submitted 15 April, 2022; originally announced April 2022.

    Comments: Accepted paper at LREC2022

  8. arXiv:2203.06835 [pdf, other]

    cs.CL

    KenMeSH: Knowledge-enhanced End-to-end Biomedical Text Labelling

    Authors: Xindi Wang, Robert E. Mercer, Frank Rudzicz

    Abstract: Currently, Medical Subject Headings (MeSH) are manually assigned to every biomedical article published and subsequently recorded in the PubMed database to facilitate retrieving relevant information. With the rapid growth of the PubMed database, large-scale biomedical document indexing becomes increasingly important. MeSH indexing is a challenging task for machine learning, as it needs to assign mu…

    Submitted 13 March, 2022; originally announced March 2022.

    Comments: main conference at ACL 2022

  9. arXiv:1901.00066 [pdf, other]

    cs.CL cs.LG

    Improving Tree-LSTM with Tree Attention

    Authors: Mahtab Ahmed, Muhammad Rifayat Samee, Robert E. Mercer

    Abstract: In Natural Language Processing (NLP), we often need to extract information from tree topology. Sentence structure can be represented via a dependency tree or a constituency tree structure. For this reason, a variant of LSTMs, named Tree-LSTM, was proposed to work on tree topology. In this paper, we design a generalized attention framework for both dependency and constituency trees by encoding vari…

    Submitted 31 December, 2018; originally announced January 2019.

    Comments: 8 Pages, 3 figures, Accepted in The 13th IEEE International Conference on Semantic Computing (ICSC 2019)

  10. arXiv:1809.01074 [pdf, other]

    cs.CL cs.LG

    A Novel Neural Sequence Model with Multiple Attentions for Word Sense Disambiguation

    Authors: Mahtab Ahmed, Muhammad Rifayat Samee, Robert E. Mercer

    Abstract: Word sense disambiguation (WSD) is a well researched problem in computational linguistics. Different research works have approached this problem in different ways. Some state of the art results that have been achieved for this problem are by supervised models in terms of accuracy, but they often fall behind flexible knowledge-based solutions which use engineered features as well as human annotator…

    Submitted 4 September, 2018; originally announced September 2018.

    Comments: 9 pages, 3 Figures, Accepted as a conference paper in ICMLA 2018

  11. arXiv:1808.03227 [pdf, other]

    q-bio.QM cs.CL cs.IR cs.LG

    Identifying Protein-Protein Interaction using Tree LSTM and Structured Attention

    Authors: Mahtab Ahmed, Jumayel Islam, Muhammad Rifayat Samee, Robert E. Mercer

    Abstract: Identifying interactions between proteins is important to understand underlying biological processes. Extracting a protein-protein interaction (PPI) from the raw text is often very difficult. Previous supervised learning methods have used handcrafted features on human-annotated data sets. In this paper, we propose a novel tree recurrent neural network with structured attention architecture for doi…

    Submitted 27 July, 2018; originally announced August 2018.

    Comments: 9 Pages, 2 Figures, Under Review

  12. arXiv:1807.10805 [pdf, other]

    cs.CL cs.LG

    Improving Neural Sequence Labelling using Additional Linguistic Information

    Authors: Mahtab Ahmed, Muhammad Rifayat Samee, Robert E. Mercer

    Abstract: Sequence labelling is the task of assigning categorical labels to a data sequence. In Natural Language Processing, sequence labelling can be applied to various fundamental problems, such as Part of Speech (POS) tagging, Named Entity Recognition (NER), and Chunking. In this study, we propose a method to add various linguistic features to the neural sequence framework to improve sequence labelling.…

    Submitted 27 July, 2018; originally announced July 2018.

    Comments: 9 pages, 1 figure, Under review

  13. arXiv:1307.8057 [pdf]

    cs.CL cs.IR

    Extracting Connected Concepts from Biomedical Texts using Fog Index

    Authors: Rushdi Shams, Robert E. Mercer

    Abstract: In this paper, we establish Fog Index (FI) as a text filter to locate the sentences in texts that contain connected biomedical concepts of interest. To do so, we have used 24 random papers each containing four pairs of connected concepts. For each pair, we categorize sentences based on whether they contain both, any or none of the concepts. We then use FI to measure difficulty of the sentences of…

    Submitted 30 July, 2013; originally announced July 2013.

    Comments: 12th Conference of the Pacific Association for Computational Linguistics (PACLING 2011), Kuala Lumpur, Malaysia, July 19-21, 2011

  14. arXiv:1204.2231 [pdf, other]

    cs.DL cs.IR

    Investigating Keyphrase Indexing with Text Denoising

    Authors: Rushdi Shams, Robert E. Mercer

    Abstract: In this paper, we report on indexing performance by a state-of-the-art keyphrase indexer, Maui, when paired with a text extraction procedure called text denoising. Text denoising is a method that extracts the denoised text, comprising the content-rich sentences, from full texts. The performance of the keyphrase indexer is demonstrated on three standard corpora collected from three domains, namely…

    Submitted 10 April, 2012; originally announced April 2012.

    Comments: The full paper submitted to 12th ACM/ IEEE-CS Joint Conference on Digital Libraries (JCDL2012)

    ACM Class: H.3.1; H.3.3; H.3.4

  15. arXiv:cs/0003031 [pdf, ps, other]

    cs.AI

    Optimal Belief Revision

    Authors: Carmen Vodislav, Robert E. Mercer

    Abstract: We propose a new approach to belief revision that provides a way to change knowledge bases with a minimum of effort. We call this way of revising belief states optimal belief revision. Our revision method gives special attention to the fact that most belief revision processes are directed to a specific informational objective. This approach to belief change is founded on notions such as optimal…

    Submitted 8 March, 2000; originally announced March 2000.

    Comments: NMR'2000 Workshop 6 pages

    ACM Class: I.2.3