Skip to main content

Showing 1–15 of 15 results for author: Karn, S

Searching in archive cs. Search in all archives.
.
  1. arXiv:2502.15932  [pdf, other

    cs.CL cs.CR

    CVE-LLM : Ontology-Assisted Automatic Vulnerability Evaluation Using Large Language Models

    Authors: Rikhiya Ghosh, Hans-Martin von Stockhausen, Martin Schmitt, George Marica Vasile, Sanjeev Kumar Karn, Oladimeji Farri

    Abstract: The National Vulnerability Database (NVD) publishes over a thousand new vulnerabilities monthly, with a projected 25 percent increase in 2024, highlighting the crucial need for rapid vulnerability identification to mitigate cybersecurity attacks and save costs and resources. In this work, we propose using large language models (LLMs) to learn vulnerability evaluation from historical assessments of… ▽ More

    Submitted 21 February, 2025; originally announced February 2025.

    Comments: arXiv admin note: substantial text overlap with arXiv:2407.14640

  2. arXiv:2404.16192  [pdf, other

    cs.CL cs.CV

    Fusion of Domain-Adapted Vision and Language Models for Medical Visual Question Answering

    Authors: Cuong Nhat Ha, Shima Asaadi, Sanjeev Kumar Karn, Oladimeji Farri, Tobias Heimann, Thomas Runkler

    Abstract: Vision-language models, while effective in general domains and showing strong performance in diverse multi-modal applications like visual question-answering (VQA), struggle to maintain the same level of effectiveness in more specialized domains, e.g., medical. We propose a medical vision-language model that integrates large vision and language models adapted for the medical domain. This model goes… ▽ More

    Submitted 24 April, 2024; originally announced April 2024.

    Comments: Clinical NLP @ NAACL 2024

  3. arXiv:2402.01758  [pdf, other

    cs.CY cs.AI cs.CL

    Aalap: AI Assistant for Legal & Paralegal Functions in India

    Authors: Aman Tiwari, Prathamesh Kalamkar, Atreyo Banerjee, Saurabh Karn, Varun Hemachandran, Smita Gupta

    Abstract: Using proprietary Large Language Models on legal tasks poses challenges due to data privacy issues, domain data heterogeneity, domain knowledge sophistication, and domain objectives uniqueness. We created Aalalp, a fine-tuned Mistral 7B model on instructions data related to specific Indian legal tasks. The performance of Aalap is better than gpt-3.5-turbo in 31\% of our test data and obtains an eq… ▽ More

    Submitted 30 January, 2024; originally announced February 2024.

  4. arXiv:2311.17213  [pdf

    cs.CL eess.IV

    General-Purpose vs. Domain-Adapted Large Language Models for Extraction of Structured Data from Chest Radiology Reports

    Authors: Ali H. Dhanaliwala, Rikhiya Ghosh, Sanjeev Kumar Karn, Poikavila Ullaskrishnan, Oladimeji Farri, Dorin Comaniciu, Charles E. Kahn

    Abstract: Radiologists produce unstructured data that can be valuable for clinical care when consumed by information systems. However, variability in style limits usage. Study compares system using domain-adapted language model (RadLing) and general-purpose LLM (GPT-4) in extracting relevant features from chest radiology reports and standardizing them to common data elements (CDEs). Three radiologists annot… ▽ More

    Submitted 9 April, 2024; v1 submitted 28 November, 2023; originally announced November 2023.

  5. arXiv:2306.10448  [pdf, other

    cs.CV cs.CL

    Generation of Radiology Findings in Chest X-Ray by Leveraging Collaborative Knowledge

    Authors: Manuela Daniela Danu, George Marica, Sanjeev Kumar Karn, Bogdan Georgescu, Awais Mansoor, Florin Ghesu, Lucian Mihai Itu, Constantin Suciu, Sasa Grbic, Oladimeji Farri, Dorin Comaniciu

    Abstract: Among all the sub-sections in a typical radiology report, the Clinical Indications, Findings, and Impression often reflect important details about the health status of a patient. The information included in Impression is also often covered in Findings. While Findings and Impression can be deduced by inspecting the image, Clinical Indications often require additional context. The cognitive task of… ▽ More

    Submitted 17 June, 2023; originally announced June 2023.

    Comments: Information Technology and Quantitative Management (ITQM 2023)

    Journal ref: Information Technology and Quantitative Management (ITQM 2023

  6. arXiv:2306.03264  [pdf, other

    cs.CL

    shs-nlp at RadSum23: Domain-Adaptive Pre-training of Instruction-tuned LLMs for Radiology Report Impression Generation

    Authors: Sanjeev Kumar Karn, Rikhiya Ghosh, Kusuma P, Oladimeji Farri

    Abstract: Instruction-tuned generative Large language models (LLMs) like ChatGPT and Bloomz possess excellent generalization abilities, but they face limitations in understanding radiology reports, particularly in the task of generating the IMPRESSIONS section from the FINDINGS section. They tend to generate either verbose or incomplete IMPRESSIONS, mainly due to insufficient exposure to medical text data d… ▽ More

    Submitted 5 June, 2023; originally announced June 2023.

    Comments: 1st Place in Task 1B: Radiology Report Summarization at BioNLP 2023

    Journal ref: BioNLP 2023, Co-located with ACL 2023

  7. arXiv:2306.02492  [pdf, other

    cs.CL

    RadLing: Towards Efficient Radiology Report Understanding

    Authors: Rikhiya Ghosh, Sanjeev Kumar Karn, Manuela Daniela Danu, Larisa Micu, Ramya Vunikili, Oladimeji Farri

    Abstract: Most natural language tasks in the radiology domain use language models pre-trained on biomedical corpus. There are few pretrained language models trained specifically for radiology, and fewer still that have been trained in a low data setting and gone on to produce comparable results in fine-tuning tasks. We present RadLing, a continuously pretrained language model using Electra-small (Clark et a… ▽ More

    Submitted 4 June, 2023; originally announced June 2023.

    Comments: Association for Computational Linguistics (ACL), 2023

    Journal ref: 61st Annual Meeting of the Association for Computational Linguistics (ACL), July 9-14, 2023, Toronto, Canada

  8. arXiv:2304.09548  [pdf, other

    cs.CL cs.AI cs.LG

    SemEval 2023 Task 6: LegalEval - Understanding Legal Texts

    Authors: Ashutosh Modi, Prathamesh Kalamkar, Saurabh Karn, Aman Tiwari, Abhinav Joshi, Sai Kiran Tanikella, Shouvik Kumar Guha, Sachin Malhan, Vivek Raghavan

    Abstract: In populous countries, pending legal cases have been growing exponentially. There is a need for developing NLP-based techniques for processing and automatically understanding legal documents. To promote research in the area of Legal NLP we organized the shared task LegalEval - Understanding Legal Texts at SemEval 2023. LegalEval task has three sub-tasks: Task-A (Rhetorical Roles Labeling) is about… ▽ More

    Submitted 1 May, 2023; v1 submitted 19 April, 2023; originally announced April 2023.

    Comments: 13 Pages (9 Pages + References), Accepted at SemEval 2023 at ACL 2023

  9. arXiv:2211.03442  [pdf, other

    cs.CL cs.AI

    Named Entity Recognition in Indian court judgments

    Authors: Prathamesh Kalamkar, Astha Agarwal, Aman Tiwari, Smita Gupta, Saurabh Karn, Vivek Raghavan

    Abstract: Identification of named entities from legal texts is an essential building block for developing other legal Artificial Intelligence applications. Named Entities in legal texts are slightly different and more fine-grained than commonly used named entities like Person, Organization, Location etc. In this paper, we introduce a new corpus of 46545 annotated legal named entities mapped to 14 legal enti… ▽ More

    Submitted 7 November, 2022; originally announced November 2022.

    Comments: to be published in NLLP 2022 Workshop at EMNLP

  10. arXiv:2203.08257  [pdf, other

    cs.CL

    Differentiable Multi-Agent Actor-Critic for Multi-Step Radiology Report Summarization

    Authors: Sanjeev Kumar Karn, Ning Liu, Hinrich Schuetze, Oladimeji Farri

    Abstract: The IMPRESSIONS section of a radiology report about an imaging study is a summary of the radiologist's reasoning and conclusions, and it also aids the referring physician in confirming or excluding certain diagnoses. A cascade of tasks are required to automatically generate an abstractive summary of the typical information-rich radiology report. These tasks include acquisition of salient content f… ▽ More

    Submitted 29 April, 2022; v1 submitted 15 March, 2022; originally announced March 2022.

    Comments: Accepted at 60th Annual Meeting of the Association for Computational Linguistics 2022 Main Conference

    Journal ref: 60th Annual Meeting of the Association for Computational Linguistics, Dublin, Ireland, 2022

  11. arXiv:2201.13125  [pdf, other

    cs.CL cs.AI cs.LG

    Corpus for Automatic Structuring of Legal Documents

    Authors: Prathamesh Kalamkar, Aman Tiwari, Astha Agarwal, Saurabh Karn, Smita Gupta, Vivek Raghavan, Ashutosh Modi

    Abstract: In populous countries, pending legal cases have been growing exponentially. There is a need for developing techniques for processing and organizing legal documents. In this paper, we introduce a new corpus for structuring legal documents. In particular, we introduce a corpus of legal judgment documents in English that are segmented into topical and coherent parts. Each of these parts is annotated… ▽ More

    Submitted 19 September, 2022; v1 submitted 31 January, 2022; originally announced January 2022.

    Comments: Accepted at LREC 2022, 10 Pages (8 page main paper + 2 page references)

  12. arXiv:2103.05131  [pdf, other

    cs.CL

    Few-Shot Learning of an Interleaved Text Summarization Model by Pretraining with Synthetic Data

    Authors: Sanjeev Kumar Karn, Francine Chen, Yan-Ying Chen, Ulli Waltinger, Hinrich Schuetze

    Abstract: Interleaved texts, where posts belonging to different threads occur in a sequence, commonly occur in online chat posts, so that it can be time-consuming to quickly obtain an overview of the discussions. Existing systems first disentangle the posts by threads and then extract summaries from those threads. A major issue with such systems is error propagation from the disentanglement component. While… ▽ More

    Submitted 8 March, 2021; originally announced March 2021.

    Comments: Adapt-NLP: The Second Workshop on Domain Adaptation for NLP

  13. arXiv:1906.01973  [pdf, other

    cs.CL

    A Hierarchical Decoder with Three-level Hierarchical Attention to Generate Abstractive Summaries of Interleaved Texts

    Authors: Sanjeev Kumar Karn, Francine Chen, Yan-Ying Chen, Ulli Waltinger, Hinrich Schütze

    Abstract: Interleaved texts, where posts belonging to different threads occur in one sequence, are a common occurrence, e.g., online chat conversations. To quickly obtain an overview of such texts, existing systems first disentangle the posts by threads and then extract summaries from those threads. The major issues with such systems are error propagation and non-fluent summary. To address those, we propose… ▽ More

    Submitted 9 April, 2020; v1 submitted 5 June, 2019; originally announced June 2019.

  14. arXiv:1807.11535  [pdf, other

    cs.CL

    News Article Teaser Tweets and How to Generate Them

    Authors: Sanjeev Kumar Karn, Mark Buckley, Ulli Waltinger, Hinrich Schütze

    Abstract: In this work, we define the task of teaser generation and provide an evaluation benchmark and baseline systems for the process of generating teasers. A teaser is a short reading suggestion for an article that is illustrative and includes curiosity-arousing elements to entice potential readers to read particular news items. Teasers are one of the main vehicles for transmitting news to social media… ▽ More

    Submitted 18 April, 2019; v1 submitted 30 July, 2018; originally announced July 2018.

    Journal ref: 2019 Annual Conference of the North American Chapter of the Association for Computational Linguistics (NAACL 2019)

  15. Neural Architectures for Open-Type Relation Argument Extraction

    Authors: Benjamin Roth, Costanza Conforti, Nina Poerner, Sanjeev Karn, Hinrich Schütze

    Abstract: In this work, we introduce the task of Open-Type Relation Argument Extraction (ORAE): Given a corpus, a query entity Q and a knowledge base relation (e.g.,"Q authored notable work with title X"), the model has to extract an argument of non-standard entity type (entities that cannot be extracted by a standard named entity tagger, e.g. X: the title of a book or a work of art) from the corpus. A dist… ▽ More

    Submitted 30 September, 2018; v1 submitted 5 March, 2018; originally announced March 2018.

    Journal ref: Nat. Lang. Eng. 25 (2019) 219-238