Showing 1–7 of 7 results for author: Puri, I

Searching in archive cs.
  1. A System for Accurate Tracking and Video Recordings of Rodent Eye Movements using Convolutional Neural Networks for Biomedical Image Segmentation

    Authors: Isha Puri, David Cox

    Abstract: Research in neuroscience and vision science relies heavily on careful measurements of animal subjects' gaze direction. Rodents are the most widely studied animal subjects for such research because of their economic advantage and hardiness. Recently, video-based eye trackers that use image processing techniques have become a popular option for gaze tracking because they are easy to use and are comp…

    Submitted 9 June, 2025; originally announced June 2025.

    Journal ref: 2018 40th Annual International Conference of the IEEE Engineering in Medicine and Biology Society (EMBC), Honolulu, HI, USA, 2018, pp. 3590-3593

  2. arXiv:2506.05586  [pdf, ps, other]

    cs.LG cs.AI

    CoFrNets: Interpretable Neural Architecture Inspired by Continued Fractions

    Authors: Isha Puri, Amit Dhurandhar, Tejaswini Pedapati, Kartikeyan Shanmugam, Dennis Wei, Kush R. Varshney

    Abstract: In recent years there has been a considerable amount of research on local post hoc explanations for neural networks. However, work on building interpretable neural architectures has been relatively sparse. In this paper, we present a novel neural architecture, CoFrNet, inspired by the form of continued fractions, which are known to have many attractive properties in number theory, such as fast conv…

    Submitted 5 June, 2025; originally announced June 2025.

    Journal ref: Advances in Neural Information Processing Systems (NeurIPS) 2021, vol 34, pp 21668-21690

  3. arXiv:2505.24040  [pdf, ps, other]

    cs.CL cs.AI

    MedPAIR: Measuring Physicians and AI Relevance Alignment in Medical Question Answering

    Authors: Yuexing Hao, Kumail Alhamoud, Hyewon Jeong, Haoran Zhang, Isha Puri, Philip Torr, Mike Schaekermann, Ariel D. Stern, Marzyeh Ghassemi

    Abstract: Large Language Models (LLMs) have demonstrated remarkable performance on various medical question-answering (QA) benchmarks, including standardized medical exams. However, correct answers alone do not ensure correct logic, and models may reach accurate conclusions through flawed processes. In this study, we introduce the MedPAIR (Medical Dataset Comparing Physicians and AI Relevance Estimation and…

    Submitted 29 May, 2025; originally announced May 2025.

  4. arXiv:2502.01618  [pdf, ps, other]

    cs.LG cs.AI

    Rollout Roulette: A Probabilistic Inference Approach to Inference-Time Scaling of LLMs using Particle-Based Monte Carlo Methods

    Authors: Isha Puri, Shivchander Sudalairaj, Guangxuan Xu, Kai Xu, Akash Srivastava

    Abstract: Large language models (LLMs) have achieved significant performance gains via scaling up model sizes and/or data. However, recent evidence suggests diminishing returns from such approaches, motivating scaling the computation spent at inference time. Existing inference-time scaling methods, usually with reward models, cast the task as a search problem, which tends to be vulnerable to reward hacking…

    Submitted 5 June, 2025; v1 submitted 3 February, 2025; originally announced February 2025.

  5. arXiv:2405.12021  [pdf, other]

    cs.CL

    Can AI Relate: Testing Large Language Model Response for Mental Health Support

    Authors: Saadia Gabriel, Isha Puri, Xuhai Xu, Matteo Malgaroli, Marzyeh Ghassemi

    Abstract: Large language models (LLMs) are already being piloted for clinical use in hospital systems like NYU Langone, Dana-Farber and the NHS. A proposed deployment use case is psychotherapy, where an LLM-powered chatbot can treat a patient undergoing a mental health crisis. Deployment of LLMs for mental health response could hypothetically broaden access to psychotherapy and provide new possibilities for…

    Submitted 7 October, 2024; v1 submitted 20 May, 2024; originally announced May 2024.

    Comments: EMNLP 2024 Findings

  6. arXiv:2402.08225  [pdf, other]

    cs.LG

    Improving Black-box Robustness with In-Context Rewriting

    Authors: Kyle O'Brien, Nathan Ng, Isha Puri, Jorge Mendez, Hamid Palangi, Yoon Kim, Marzyeh Ghassemi, Thomas Hartvigsen

    Abstract: Machine learning models for text classification often excel on in-distribution (ID) data but struggle with unseen out-of-distribution (OOD) inputs. Most techniques for improving OOD robustness are not applicable to settings where the model is effectively a black box, such as when the weights are frozen, retraining is costly, or the model is leveraged via an API. Test-time augmentation (TTA) is a s…

    Submitted 4 August, 2024; v1 submitted 13 February, 2024; originally announced February 2024.

  7. arXiv:2206.11104  [pdf, other]

    cs.LG cs.AI

    OpenXAI: Towards a Transparent Evaluation of Model Explanations

    Authors: Chirag Agarwal, Dan Ley, Satyapriya Krishna, Eshika Saxena, Martin Pawelczyk, Nari Johnson, Isha Puri, Marinka Zitnik, Himabindu Lakkaraju

    Abstract: While several types of post hoc explanation methods have been proposed in recent literature, there is very little work on systematically benchmarking these methods. Here, we introduce OpenXAI, a comprehensive and extensible open-source framework for evaluating and benchmarking post hoc explanation methods. OpenXAI comprises the following key components: (i) a flexible synthetic data generator a…

    Submitted 13 March, 2024; v1 submitted 22 June, 2022; originally announced June 2022.

    Comments: Newer version with updated results and code