Skip to main content

Showing 1–4 of 4 results for author: Comeau, D C

Searching in archive cs. Search in all archives.
.
  1. arXiv:2307.00589  [pdf

    cs.IR cs.AI cs.CL q-bio.QM

    MedCPT: Contrastive Pre-trained Transformers with Large-scale PubMed Search Logs for Zero-shot Biomedical Information Retrieval

    Authors: Qiao Jin, Won Kim, Qingyu Chen, Donald C. Comeau, Lana Yeganova, W. John Wilbur, Zhiyong Lu

    Abstract: Information retrieval (IR) is essential in biomedical knowledge acquisition and clinical decision support. While recent progress has shown that language model encoders perform better semantic retrieval, training such models requires abundant query-article annotations that are difficult to obtain in biomedicine. As a result, most biomedical IR systems only conduct lexical matching. In response, we… ▽ More

    Submitted 3 October, 2023; v1 submitted 2 July, 2023; originally announced July 2023.

    Comments: The MedCPT code and API are available at https://github.com/ncbi/MedCPT

  2. arXiv:2306.10070  [pdf

    cs.CY cs.AI cs.CL q-bio.QM

    Opportunities and Challenges for ChatGPT and Large Language Models in Biomedicine and Health

    Authors: Shubo Tian, Qiao Jin, Lana Yeganova, Po-Ting Lai, Qingqing Zhu, Xiuying Chen, Yifan Yang, Qingyu Chen, Won Kim, Donald C. Comeau, Rezarta Islamaj, Aadit Kapoor, Xin Gao, Zhiyong Lu

    Abstract: ChatGPT has drawn considerable attention from both the general public and domain experts with its remarkable text generation capabilities. This has subsequently led to the emergence of diverse applications in the field of biomedicine and health. In this work, we examine the diverse applications of large language models (LLMs), such as ChatGPT, in biomedicine and health. Specifically we explore the… ▽ More

    Submitted 16 October, 2023; v1 submitted 15 June, 2023; originally announced June 2023.

  3. arXiv:2008.03397  [pdf

    cs.DL cs.DB cs.IR cs.LG

    Navigating the landscape of COVID-19 research through literature analysis: A bird's eye view

    Authors: Lana Yeganova, Rezarta Islamaj, Qingyu Chen, Robert Leaman, Alexis Allot, Chin-Hsuan Wei, Donald C. Comeau, Won Kim, Yifan Peng, W. John Wilbur, Zhiyong Lu

    Abstract: Timely access to accurate scientific literature in the battle with the ongoing COVID-19 pandemic is critical. This unprecedented public health risk has motivated research towards understanding the disease in general, identifying drugs to treat the disease, developing potential vaccines, etc. This has given rise to a rapidly growing body of literature that doubles in number of publications every 20… ▽ More

    Submitted 11 September, 2020; v1 submitted 7 August, 2020; originally announced August 2020.

    Comments: 10 pages, 8 Figures, Submitted to KDD 2020 Health Day

    Journal ref: KDD 2020 Health Day: AI for COVID, August 23-27, 2020, Virtual Conference, CA, US

  4. arXiv:1804.05957  [pdf

    cs.DL

    PMC text mining subset in BioC: 2.3 million full text articles and growing

    Authors: Donald C. Comeau, Chih-Hsuan Wei, Rezarta Islamaj Doğan, Zhiyong Lu

    Abstract: Interest in full text mining biomedical research articles is growing. NCBI provides the PMC Open Access and Author Manuscript sets of articles which are available for text mining. We have made all of these articles available in BioC, an XML and JSON format which is convenient for sharing text, annotations, and relations. These articles are available both via ftp for bulk download and via a Web API… ▽ More

    Submitted 16 April, 2018; originally announced April 2018.

    Comments: 8 pages, 6 figures, 1 table