Skip to main content

Showing 1–21 of 21 results for author: Visweswaran, S

.
  1. arXiv:2410.05180  [pdf, other

    cs.CL

    Mitigating the Risk of Health Inequity Exacerbated by Large Language Models

    Authors: Yuelyu Ji, Wenhe Ma, Sonish Sivarajkumar, Hang Zhang, Eugene Mathew Sadhu, Zhuochun Li, Xizhi Wu, Shyam Visweswaran, Yanshan Wang

    Abstract: Recent advancements in large language models have demonstrated their potential in numerous medical applications, particularly in automating clinical trial matching for translational research and enhancing medical question answering for clinical decision support. However, our study shows that incorporating non decisive sociodemographic factors such as race, sex, income level, LGBT+ status, homeless… ▽ More

    Submitted 14 October, 2024; v1 submitted 7 October, 2024; originally announced October 2024.

  2. arXiv:2408.07832  [pdf, ps, other

    cs.CL cs.CV

    LADDER: Language-Driven Slice Discovery and Error Rectification in Vision Classifiers

    Authors: Shantanu Ghosh, Rayan Syed, Chenyu Wang, Vaibhav Choudhary, Binxu Li, Clare B. Poynton, Shyam Visweswaran, Kayhan Batmanghelich

    Abstract: Error slice discovery is crucial to diagnose and mitigate model errors. Current clustering or discrete attribute-based slice discovery methods face key limitations: 1) clustering results in incoherent slices, while assigning discrete attributes to slices leads to incomplete coverage of error patterns due to missing or insufficient attributes; 2) these methods lack complex reasoning, preventing the… ▽ More

    Submitted 29 May, 2025; v1 submitted 31 July, 2024; originally announced August 2024.

    Comments: ACL 2025 (Findings). Code: https://github.com/batmanlab/Ladder

  3. arXiv:2405.12255  [pdf, other

    eess.IV cs.CV

    Mammo-CLIP: A Vision Language Foundation Model to Enhance Data Efficiency and Robustness in Mammography

    Authors: Shantanu Ghosh, Clare B. Poynton, Shyam Visweswaran, Kayhan Batmanghelich

    Abstract: The lack of large and diverse training data on Computer-Aided Diagnosis (CAD) in breast cancer detection has been one of the concerns that impedes the adoption of the system. Recently, pre-training with large-scale image text datasets via Vision-Language models (VLM) (\eg CLIP) partially addresses the issue of robustness and data efficiency in computer vision (CV). This paper proposes Mammo-CLIP,… ▽ More

    Submitted 22 May, 2024; v1 submitted 20 May, 2024; originally announced May 2024.

    Comments: MICCAI 2024, early accept, top 11%

  4. arXiv:2405.05993  [pdf

    cs.LG cs.AI

    Precision Rehabilitation for Patients Post-Stroke based on Electronic Health Records and Machine Learning

    Authors: Fengyi Gao, Xingyu Zhang, Sonish Sivarajkumar, Parker Denny, Bayan Aldhahwani, Shyam Visweswaran, Ryan Shi, William Hogan, Allyn Bove, Yanshan Wang

    Abstract: In this study, we utilized statistical analysis and machine learning methods to examine whether rehabilitation exercises can improve patients post-stroke functional abilities, as well as forecast the improvement in functional abilities. Our dataset is patients' rehabilitation exercises and demographic information recorded in the unstructured electronic health records (EHRs) data and free-text reha… ▽ More

    Submitted 9 May, 2024; originally announced May 2024.

  5. arXiv:2405.02559  [pdf

    cs.CL cs.AI

    A Framework for Human Evaluation of Large Language Models in Healthcare Derived from Literature Review

    Authors: Thomas Yu Chow Tam, Sonish Sivarajkumar, Sumit Kapoor, Alisa V Stolyar, Katelyn Polanska, Karleigh R McCarthy, Hunter Osterhoudt, Xizhi Wu, Shyam Visweswaran, Sunyang Fu, Piyush Mathur, Giovanni E. Cacciamani, Cong Sun, Yifan Peng, Yanshan Wang

    Abstract: With generative artificial intelligence (AI), particularly large language models (LLMs), continuing to make inroads in healthcare, it is critical to supplement traditional automated evaluations with human evaluations. Understanding and evaluating the output of LLMs is essential to assuring safety, reliability, and effectiveness. However, human evaluation's cumbersome, time-consuming, and non-stand… ▽ More

    Submitted 23 September, 2024; v1 submitted 4 May, 2024; originally announced May 2024.

  6. arXiv:2401.11120  [pdf, other

    cs.CL cs.AI

    Enhancing Large Language Models for Clinical Decision Support by Incorporating Clinical Practice Guidelines

    Authors: David Oniani, Xizhi Wu, Shyam Visweswaran, Sumit Kapoor, Shravan Kooragayalu, Katelyn Polanska, Yanshan Wang

    Abstract: Background Large Language Models (LLMs), enhanced with Clinical Practice Guidelines (CPGs), can significantly improve Clinical Decision Support (CDS). However, methods for incorporating CPGs into LLMs are not well studied. Methods We develop three distinct methods for incorporating CPGs into LLMs: Binary Decision Tree (BDT), Program-Aided Graph Construction (PAGC), and Chain-of-Thought-Few-Shot Pr… ▽ More

    Submitted 23 January, 2024; v1 submitted 20 January, 2024; originally announced January 2024.

  7. arXiv:2310.03559  [pdf, other

    eess.IV cs.CV

    MedSyn: Text-guided Anatomy-aware Synthesis of High-Fidelity 3D CT Images

    Authors: Yanwu Xu, Li Sun, Wei Peng, Shuyue Jia, Katelyn Morrison, Adam Perer, Afrooz Zandifar, Shyam Visweswaran, Motahhare Eslami, Kayhan Batmanghelich

    Abstract: This paper introduces an innovative methodology for producing high-quality 3D lung CT images guided by textual information. While diffusion-based generative models are increasingly used in medical imaging, current state-of-the-art approaches are limited to low-resolution outputs and underutilize radiology reports' abundant information. The radiology reports can enhance the generation process by pr… ▽ More

    Submitted 15 October, 2024; v1 submitted 5 October, 2023; originally announced October 2023.

  8. arXiv:2309.08008  [pdf

    cs.CL cs.AI

    An Empirical Evaluation of Prompting Strategies for Large Language Models in Zero-Shot Clinical Natural Language Processing

    Authors: Sonish Sivarajkumar, Mark Kelley, Alyssa Samolyk-Mazzanti, Shyam Visweswaran, Yanshan Wang

    Abstract: Large language models (LLMs) have shown remarkable capabilities in Natural Language Processing (NLP), especially in domains where labeled data is scarce or expensive, such as clinical domain. However, to unlock the clinical knowledge hidden in these LLMs, we need to design effective prompts that can guide them to perform specific clinical NLP tasks without any task-specific training data. This is… ▽ More

    Submitted 14 September, 2023; originally announced September 2023.

  9. arXiv:2303.13466  [pdf

    cs.CL cs.AI

    Mining Clinical Notes for Physical Rehabilitation Exercise Information: Natural Language Processing Algorithm Development and Validation Study

    Authors: Sonish Sivarajkumar, Fengyi Gao, Parker E. Denny, Bayan M. Aldhahwani, Shyam Visweswaran, Allyn Bove, Yanshan Wang

    Abstract: Post-stroke patient rehabilitation requires precise, personalized treatment plans. Natural Language Processing (NLP) offers potential to extract valuable exercise information from clinical notes, aiding in the development of more effective rehabilitation strategies. Objective: This study aims to develop and evaluate a variety of NLP algorithms to extract and categorize physical rehabilitation exer… ▽ More

    Submitted 15 March, 2024; v1 submitted 22 March, 2023; originally announced March 2023.

  10. arXiv:2208.06361  [pdf, other

    q-bio.BM cs.LG

    Hyperbolic Molecular Representation Learning for Drug Repositioning

    Authors: Ke Yu, Shyam Visweswaran, Kayhan Batmanghelich

    Abstract: Learning accurate drug representations is essential for task such as computational drug repositioning. A drug hierarchy is a valuable source that encodes knowledge of relations among drugs in a tree-like structure where drugs that act on the same organs, treat the same disease, or bind to the same biological target are grouped together. However, its utility in learning drug representations has not… ▽ More

    Submitted 6 July, 2022; originally announced August 2022.

    Comments: Accepted by NeurIPS workshop 2020. arXiv admin note: substantial text overlap with arXiv:2006.00986

  11. arXiv:2204.09601  [pdf

    cs.CL cs.AI

    Extraction of Sleep Information from Clinical Notes of Patients with Alzheimer's Disease Using Natural Language Processing

    Authors: Sonish Sivarajkumar, Thomas Yu CHow Tam, Haneef Ahamed Mohammad, Samual Viggiano, David Oniani, Shyam Visweswaran, Yanshan Wang

    Abstract: Alzheimer's Disease (AD) is the most common form of dementia in the United States. Sleep is one of the lifestyle-related factors that has been shown critical for optimal cognitive function in old age. However, there is a lack of research studying the association between sleep and AD incidence. A major bottleneck for conducting such research is that the traditional way to acquire sleep information… ▽ More

    Submitted 15 March, 2024; v1 submitted 8 March, 2022; originally announced April 2022.

  12. arXiv:2006.00986  [pdf, other

    cs.LG q-bio.MN q-bio.QM stat.ML

    Semi-Supervised Hierarchical Drug Embedding in Hyperbolic Space

    Authors: Ke Yu, Shyam Visweswaran, Kayhan Batmanghelich

    Abstract: Learning accurate drug representation is essential for tasks such as computational drug repositioning and prediction of drug side-effects. A drug hierarchy is a valuable source that encodes human knowledge of drug relations in a tree-like structure where drugs that act on the same organs, treat the same disease, or bind to the same biological target are grouped together. However, its utility in le… ▽ More

    Submitted 1 June, 2020; originally announced June 2020.

  13. arXiv:1905.10330  [pdf, other

    stat.ML cs.LG stat.ME

    Dirac Delta Regression: Conditional Density Estimation with Clinical Trials

    Authors: Eric V. Strobl, Shyam Visweswaran

    Abstract: Personalized medicine seeks to identify the causal effect of treatment for a particular patient as opposed to a clinical population at large. Most investigators estimate such personalized treatment effects by regressing the outcome of a randomized clinical trial (RCT) on patient covariates. The realized value of the outcome may however lie far from the conditional expectation. We therefore introdu… ▽ More

    Submitted 1 September, 2021; v1 submitted 24 May, 2019; originally announced May 2019.

  14. arXiv:1705.09031  [pdf, other

    stat.ME stat.ML

    Fast Causal Inference with Non-Random Missingness by Test-Wise Deletion

    Authors: Eric V. Strobl, Shyam Visweswaran, Peter L. Spirtes

    Abstract: Many real datasets contain values missing not at random (MNAR). In this scenario, investigators often perform list-wise deletion, or delete samples with any missing values, before applying causal discovery algorithms. List-wise deletion is a sound and general strategy when paired with algorithms such as FCI and RFCI, but the deletion procedure also eliminates otherwise good samples that contain on… ▽ More

    Submitted 24 May, 2017; originally announced May 2017.

  15. arXiv:1702.03877  [pdf, other

    stat.ME stat.ML

    Approximate Kernel-based Conditional Independence Tests for Fast Non-Parametric Causal Discovery

    Authors: Eric V. Strobl, Kun Zhang, Shyam Visweswaran

    Abstract: Constraint-based causal discovery (CCD) algorithms require fast and accurate conditional independence (CI) testing. The Kernel Conditional Independence Test (KCIT) is currently one of the most popular CI tests in the non-parametric setting, but many investigators cannot use KCIT with large datasets because the test scales cubicly with sample size. We therefore devise two relaxations called the Ran… ▽ More

    Submitted 12 April, 2017; v1 submitted 13 February, 2017; originally announced February 2017.

    Comments: R package: github.com/ericstrobl/RCIT

  16. arXiv:1607.03975  [pdf, other

    stat.ML stat.ME

    Estimating and Controlling the False Discovery Rate for the PC Algorithm Using Edge-Specific P-Values

    Authors: Eric V. Strobl, Peter L. Spirtes, Shyam Visweswaran

    Abstract: The PC algorithm allows investigators to estimate a complete partially directed acyclic graph (CPDAG) from a finite dataset, but few groups have investigated strategies for estimating and controlling the false discovery rate (FDR) of the edges in the CPDAG. In this paper, we introduce PC with p-values (PC-p), a fast algorithm which robustly computes edge-specific p-values and then estimates and co… ▽ More

    Submitted 9 May, 2017; v1 submitted 13 July, 2016; originally announced July 2016.

  17. arXiv:1509.03935  [pdf

    math.ST stat.ME stat.ML

    Markov Boundary Discovery with Ridge Regularized Linear Models

    Authors: Eric V. Strobl, Shyam Visweswaran

    Abstract: Ridge regularized linear models (RRLMs), such as ridge regression and the SVM, are a popular group of methods that are used in conjunction with coefficient hypothesis testing to discover explanatory variables with a significant multivariate association to a response. However, many investigators are reluctant to draw causal interpretations of the selected variables due to the incomplete knowledge o… ▽ More

    Submitted 13 September, 2015; originally announced September 2015.

    Comments: submitted to the Journal of Causal Inference

  18. arXiv:1407.7566  [pdf

    q-bio.QM cs.LG stat.ML

    Dependence versus Conditional Dependence in Local Causal Discovery from Gene Expression Data

    Authors: Eric V. Strobl, Shyam Visweswaran

    Abstract: Motivation: Algorithms that discover variables which are causally related to a target may inform the design of experiments. With observational gene expression data, many methods discover causal variables by measuring each variable's degree of statistical dependence with the target using dependence measures (DMs). However, other methods measure each variable's ability to explain the statistical dep… ▽ More

    Submitted 28 July, 2014; originally announced July 2014.

    Comments: 11 pages, 2 algorithms, 4 figures, 5 tables

  19. arXiv:1407.2483  [pdf

    stat.ML cs.AI cs.LG

    Counting Markov Blanket Structures

    Authors: Shyam Visweswaran, Gregory F. Cooper

    Abstract: Learning Markov blanket (MB) structures has proven useful in performing feature selection, learning Bayesian networks (BNs), and discovering causal relationships. We present a formula for efficiently determining the number of MB structures given a target variable and a set of other variables. As expected, the number of MB structures grows exponentially. However, we show quantitatively that there a… ▽ More

    Submitted 12 July, 2014; v1 submitted 9 July, 2014; originally announced July 2014.

    Comments: 5 pages, 2 figures, 1 table

  20. arXiv:1402.0108  [pdf

    stat.ML cs.LG

    Markov Blanket Ranking using Kernel-based Conditional Dependence Measures

    Authors: Eric V. Strobl, Shyam Visweswaran

    Abstract: Developing feature selection algorithms that move beyond a pure correlational to a more causal analysis of observational data is an important problem in the sciences. Several algorithms attempt to do so by discovering the Markov blanket of a target, but they all contain a forward selection step which variables must pass in order to be included in the conditioning set. As a result, these algorithms… ▽ More

    Submitted 2 May, 2014; v1 submitted 1 February, 2014; originally announced February 2014.

    Comments: 10 pages, 4 figures, 2 algorithms, NIPS 2013 Workshop on Causality, code: github.com/ericstrobl/

  21. arXiv:1310.3101  [pdf

    stat.ML cs.LG

    Deep Multiple Kernel Learning

    Authors: Eric Strobl, Shyam Visweswaran

    Abstract: Deep learning methods have predominantly been applied to large artificial neural networks. Despite their state-of-the-art performance, these large networks typically do not generalize well to datasets with limited sample sizes. In this paper, we take a different approach by learning multiple layers of kernels. We combine kernels at each layer and then optimize over an estimate of the support vecto… ▽ More

    Submitted 11 October, 2013; originally announced October 2013.

    Comments: 4 pages, 1 figure, 1 table, conference paper

    Journal ref: IEEE 12th International Conference on Machine Learning and Applications (ICMLA 2013)