Showing 1–4 of 4 results for author: Yeganova, L

  1. arXiv:2307.00589

    cs.IR cs.AI cs.CL q-bio.QM

    MedCPT: Contrastive Pre-trained Transformers with Large-scale PubMed Search Logs for Zero-shot Biomedical Information Retrieval

    Authors: Qiao Jin, Won Kim, Qingyu Chen, Donald C. Comeau, Lana Yeganova, W. John Wilbur, Zhiyong Lu

    Abstract: Information retrieval (IR) is essential in biomedical knowledge acquisition and clinical decision support. While recent progress has shown that language model encoders perform better semantic retrieval, training such models requires abundant query-article annotations that are difficult to obtain in biomedicine. As a result, most biomedical IR systems only conduct lexical matching. In response, we…

    Submitted 3 October, 2023; v1 submitted 2 July, 2023; originally announced July 2023.

    Comments: The MedCPT code and API are available at https://github.com/ncbi/MedCPT

  2. arXiv:2306.10070

    cs.CY cs.AI cs.CL q-bio.QM

    Opportunities and Challenges for ChatGPT and Large Language Models in Biomedicine and Health

    Authors: Shubo Tian, Qiao Jin, Lana Yeganova, Po-Ting Lai, Qingqing Zhu, Xiuying Chen, Yifan Yang, Qingyu Chen, Won Kim, Donald C. Comeau, Rezarta Islamaj, Aadit Kapoor, Xin Gao, Zhiyong Lu

    Abstract: ChatGPT has drawn considerable attention from both the general public and domain experts with its remarkable text generation capabilities. This has subsequently led to the emergence of diverse applications in the field of biomedicine and health. In this work, we examine the diverse applications of large language models (LLMs), such as ChatGPT, in biomedicine and health. Specifically we explore the…

    Submitted 16 October, 2023; v1 submitted 15 June, 2023; originally announced June 2023.

  3. arXiv:2008.03397

    cs.DL cs.DB cs.IR cs.LG

    Navigating the landscape of COVID-19 research through literature analysis: A bird's eye view

    Authors: Lana Yeganova, Rezarta Islamaj, Qingyu Chen, Robert Leaman, Alexis Allot, Chin-Hsuan Wei, Donald C. Comeau, Won Kim, Yifan Peng, W. John Wilbur, Zhiyong Lu

    Abstract: Timely access to accurate scientific literature in the battle with the ongoing COVID-19 pandemic is critical. This unprecedented public health risk has motivated research towards understanding the disease in general, identifying drugs to treat the disease, developing potential vaccines, etc. This has given rise to a rapidly growing body of literature that doubles in number of publications every 20…

    Submitted 11 September, 2020; v1 submitted 7 August, 2020; originally announced August 2020.

    Comments: 10 pages, 8 Figures, Submitted to KDD 2020 Health Day

    Journal ref: KDD 2020 Health Day: AI for COVID, August 23-27, 2020, Virtual Conference, CA, US

  4. arXiv:1912.02077

    cs.CL cs.IR

    PDC -- a probabilistic distributional clustering algorithm: a case study on suicide articles in PubMed

    Authors: Rezarta Islamaj, Lana Yeganova, Won Kim, Natalie Xie, W. John Wilbur, Zhiyong Lu

    Abstract: The need to organize a large collection in a manner that facilitates human comprehension is crucial given the ever-increasing volumes of information. In this work, we present PDC (probabilistic distributional clustering), a novel algorithm that, given a document collection, computes disjoint term sets representing topics in the collection. The algorithm relies on probabilities of word co-occurrenc…

    Submitted 4 December, 2019; originally announced December 2019.

    Comments: AMIA Informatics Summit 2020, 18 pages, Algorithm in the Appendix, 3 figures