Skip to main content

Showing 1–8 of 8 results for author: Rawat, B P S

Searching in archive cs. Search in all archives.
.
  1. arXiv:2210.03304  [pdf, other

    cs.CL

    Knowledge Injected Prompt Based Fine-tuning for Multi-label Few-shot ICD Coding

    Authors: Zhichao Yang, Shufan Wang, Bhanu Pratap Singh Rawat, Avijit Mitra, Hong Yu

    Abstract: Automatic International Classification of Diseases (ICD) coding aims to assign multiple ICD codes to a medical note with average length of 3,000+ tokens. This task is challenging due to a high-dimensional space of multi-label assignment (tens of thousands of ICD codes) and the long-tail challenge: only a few codes (common diseases) are frequently assigned while most codes (rare diseases) are infre… ▽ More

    Submitted 12 October, 2022; v1 submitted 6 October, 2022; originally announced October 2022.

    Comments: Accepted by Findings of EMNLP 2022, code is available at https://github.com/whaleloops/KEPT

  2. arXiv:2205.07872  [pdf, other

    cs.LG cs.AI cs.CL

    ScAN: Suicide Attempt and Ideation Events Dataset

    Authors: Bhanu Pratap Singh Rawat, Samuel Kovaly, Wilfred R. Pigeon, Hong Yu

    Abstract: Suicide is an important public health concern and one of the leading causes of death worldwide. Suicidal behaviors, including suicide attempts (SA) and suicide ideations (SI), are leading risk factors for death by suicide. Information related to patients' previous and current SA and SI are frequently documented in the electronic health record (EHR) notes. Accurate detection of such documentation m… ▽ More

    Submitted 12 May, 2022; originally announced May 2022.

    Comments: Paper accepted at NAACL 2022

  3. arXiv:2104.08305  [pdf, other

    cs.CL

    Membership Inference Attack Susceptibility of Clinical Language Models

    Authors: Abhyuday Jagannatha, Bhanu Pratap Singh Rawat, Hong Yu

    Abstract: Deep Neural Network (DNN) models have been shown to have high empirical privacy leakages. Clinical language models (CLMs) trained on clinical data have been used to improve performance in biomedical natural language processing tasks. In this work, we investigate the risks of training-data leakage through white-box or black-box access to CLMs. We design and employ membership inference attacks to es… ▽ More

    Submitted 16 April, 2021; originally announced April 2021.

  4. Conversational Machine Comprehension: a Literature Review

    Authors: Somil Gupta, Bhanu Pratap Singh Rawat, Hong Yu

    Abstract: Conversational Machine Comprehension (CMC), a research track in conversational AI, expects the machine to understand an open-domain natural language text and thereafter engage in a multi-turn conversation to answer questions related to the text. While most of the research in Machine Reading Comprehension (MRC) revolves around single-turn question answering (QA), multi-turn CMC has recently gained… ▽ More

    Submitted 5 November, 2020; v1 submitted 31 May, 2020; originally announced June 2020.

    Comments: Accepted to COLING 2020

  5. arXiv:2005.06587  [pdf, other

    cs.AI cs.CL cs.LG

    Entity-Enriched Neural Models for Clinical Question Answering

    Authors: Bhanu Pratap Singh Rawat, Wei-Hung Weng, So Yeon Min, Preethi Raghavan, Peter Szolovits

    Abstract: We explore state-of-the-art neural models for question answering on electronic medical records and improve their ability to generalize better on previously unseen (paraphrased) questions at test time. We enable this by learning to predict logical forms as an auxiliary task along with the main task of answer span detection. The predicted logical forms also serve as a rationale for the answer. Furth… ▽ More

    Submitted 19 February, 2021; v1 submitted 13 May, 2020; originally announced May 2020.

    Journal ref: BioNLP Workshop, ACL'2020

  6. arXiv:2004.02288  [pdf, other

    cs.CL cs.LG

    Continual Domain-Tuning for Pretrained Language Models

    Authors: Subendhu Rongali, Abhyuday Jagannatha, Bhanu Pratap Singh Rawat, Hong Yu

    Abstract: Pre-trained language models (LM) such as BERT, DistilBERT, and RoBERTa can be tuned for different domains (domain-tuning) by continuing the pre-training phase on a new target domain corpus. This simple domain tuning (SDT) technique has been widely used to create domain-tuned models such as BioBERT, SciBERT and ClinicalBERT. However, during the pretraining phase on the target domain, the LM models… ▽ More

    Submitted 19 March, 2021; v1 submitted 5 April, 2020; originally announced April 2020.

    Comments: Updated from a previous shorter version

  7. arXiv:1710.01216  [pdf, other

    cs.CV

    Group Affect Prediction Using Multimodal Distributions

    Authors: Saqib Shamsi, Bhanu Pratap Singh Rawat, Manya Wadhwa

    Abstract: We describe our approach towards building an efficient predictive model to detect emotions for a group of people in an image. We have proposed that training a Convolutional Neural Network (CNN) model on the emotion heatmaps extracted from the image, outperforms a CNN model trained entirely on the raw images. The comparison of the models have been done on a recently published dataset of Emotion Rec… ▽ More

    Submitted 12 March, 2018; v1 submitted 17 September, 2017; originally announced October 2017.

    Comments: This research paper has been accepted at Workshop on Computer Vision for Active and Assisted Living, WACV 2018

  8. arXiv:1610.00465  [pdf, ps, other

    cs.LG cs.AI

    Can Evolutionary Sampling Improve Bagged Ensembles?

    Authors: Harsh Nisar, Bhanu Pratap Singh Rawat

    Abstract: Perturb and Combine (P&C) group of methods generate multiple versions of the predictor by perturbing the training set or construction and then combining them into a single predictor (Breiman, 1996b). The motive is to improve the accuracy in unstable classification and regression methods. One of the most well known method in this group is Bagging. Arcing or Adaptive Resampling and Combining methods… ▽ More

    Submitted 3 October, 2016; originally announced October 2016.

    Comments: 3 pages, 1 table, Data Efficient Machine Learning Workshop (DEML'16), ICML