Showing 1–4 of 4 results for author: Bhatia, P

Searching in archive eess.
  1. arXiv:2505.00059  [pdf, other]

    cs.CL cs.SD eess.AS

    BERSting at the Screams: A Benchmark for Distanced, Emotional and Shouted Speech Recognition

    Authors: Paige Tuttösí, Mantaj Dhillon, Luna Sang, Shane Eastwood, Poorvi Bhatia, Quang Minh Dinh, Avni Kapoor, Yewon Jin, Angelica Lim

    Abstract: Some speech recognition tasks, such as automatic speech recognition (ASR), are approaching or have reached human performance in many reported metrics. Yet, they continue to struggle in complex, real-world, situations, such as with distanced speech. Previous challenges have released datasets to address the issue of distanced ASR, however, the focus remains primarily on distance, specifically relyin…

    Submitted 30 April, 2025; originally announced May 2025.

    Comments: Accepted to Computer Speech and Language, Special issue: Multi-Speaker, Multi-Microphone, and Multi-Modal Distant Speech Recognition (September 2025)

  2. arXiv:2504.12527  [pdf]

    q-bio.OT eess.IV

    Analysis of the MICCAI Brain Tumor Segmentation -- Metastases (BraTS-METS) 2025 Lighthouse Challenge: Brain Metastasis Segmentation on Pre- and Post-treatment MRI

    Authors: Nazanin Maleki, Raisa Amiruddin, Ahmed W. Moawad, Nikolay Yordanov, Athanasios Gkampenis, Pascal Fehringer, Fabian Umeh, Crystal Chukwurah, Fatima Memon, Bojan Petrovic, Justin Cramer, Mark Krycia, Elizabeth B. Shrickel, Ichiro Ikuta, Gerard Thompson, Lorenna Vidal, Vilma Kosovic, Adam E. Goldman-Yassen, Virginia Hill, Tiffany So, Sedra Mhana, Albara Alotaibi, Nathan Page, Prisha Bhatia, Melisa S. Guelen, et al. (219 additional authors not shown)

    Abstract: Despite continuous advancements in cancer treatment, brain metastatic disease remains a significant complication of primary cancer and is associated with an unfavorable prognosis. One approach for improving diagnosis, management, and outcomes is to implement algorithms based on artificial intelligence for the automated segmentation of both pre- and post-treatment MRI brain images. Such algorithms…

    Submitted 10 July, 2025; v1 submitted 16 April, 2025; originally announced April 2025.

    Comments: 28 pages, 4 figures, 2 tables

  3. arXiv:2410.12831  [pdf, ps, other]

    eess.IV cs.AI cs.CV

    Segment as You Wish -- Free-Form Language-Based Segmentation for Medical Images

    Authors: Longchao Da, Rui Wang, Xiaojian Xu, Parminder Bhatia, Taha Kass-Hout, Hua Wei, Cao Xiao

    Abstract: Medical imaging is crucial for diagnosing a patient's health condition, and accurate segmentation of these images is essential for isolating regions of interest to ensure precise diagnosis and treatment planning. Existing methods primarily rely on bounding boxes or point-based prompts, while few have explored text-related prompts, despite clinicians often describing their observations and instruct…

    Submitted 29 June, 2025; v1 submitted 2 October, 2024; originally announced October 2024.

    Comments: 19 pages, 9 as main content. The paper was accepted to KDD2025

    MSC Class: 68T45; 68U10; 92C55

    ACM Class: I.2.7; I.4.9; H.3.3; I.2.6

  4. arXiv:2310.16872  [pdf, other]

    eess.IV cs.CV

    SonoSAMTrack -- Segment and Track Anything on Ultrasound Images

    Authors: Hariharan Ravishankar, Rohan Patil, Vikram Melapudi, Harsh Suthar, Stephan Anzengruber, Parminder Bhatia, Kass-Hout Taha, Pavan Annangi

    Abstract: In this paper, we present SonoSAMTrack - that combines a promptable foundational model for segmenting objects of interest on ultrasound images called SonoSAM, with a state-of-the art contour tracking model to propagate segmentations on 2D+t and 3D ultrasound datasets. Fine-tuned and tested exclusively on a rich, diverse set of objects from $\approx200$k ultrasound image-mask pairs, SonoSAM demonst…

    Submitted 16 November, 2023; v1 submitted 25 October, 2023; originally announced October 2023.