Showing 1–3 of 3 results for author: Ahuja, K

Searching in archive eess.
  1. arXiv:2505.03054  [pdf, other]

    cs.AI cs.CL cs.SD eess.AS

    BLAB: Brutally Long Audio Bench

    Authors: Orevaoghene Ahia, Martijn Bartelds, Kabir Ahuja, Hila Gonen, Valentin Hofmann, Siddhant Arora, Shuyue Stella Li, Vishal Puttagunta, Mofetoluwa Adeyemi, Charishma Buchireddy, Ben Walls, Noah Bennett, Shinji Watanabe, Noah A. Smith, Yulia Tsvetkov, Sachin Kumar

    Abstract: Developing large audio language models (LMs) capable of understanding diverse spoken interactions is essential for accommodating the multimodal nature of human communication and can increase the accessibility of language technologies across different user populations. Recent work on audio LMs has primarily evaluated their performance on short audio segments, typically under 30 seconds, with limite…

    Submitted 12 May, 2025; v1 submitted 5 May, 2025; originally announced May 2025.

  2. arXiv:2501.01464  [pdf, other]

    eess.IV cs.CV cs.LG physics.med-ph

    Estimation of 3T MR images from 1.5T images regularized with Physics based Constraint

    Authors: Prabhjot Kaur, Atul Singh Minhas, Chirag Kamal Ahuja, Anil Kumar Sao

    Abstract: Limited accessibility to high field MRI scanners (such as 7T, 11T) has motivated the development of post-processing methods to improve low field images. Several existing post-processing methods have shown the feasibility to improve 3T images to produce 7T-like images [3,18]. It has been observed that improving lower field (LF, <=1.5T) images comes with additional challenges due to poor image quali…

    Submitted 31 December, 2024; originally announced January 2025.

    Comments: conference paper

    Journal ref: Medical Image Computing and Computer Assisted Intervention - MICCAI 2023. Lecture Notes in Computer Science, vol 14229. Springer, Cham

  3. arXiv:2405.01600  [pdf, ps, other]

    eess.IV cs.CV cs.LG

    Improved and Explainable Cervical Cancer Classification using Ensemble Pooling of Block Fused Descriptors

    Authors: Saurabh Saini, Kapil Ahuja, Akshat S. Chauhan

    Abstract: Cervical cancer is the second most common cancer in women and causes high death rates. Earlier models for detecting cervical cancer had limited success. In this work, we propose new models that substantially outperform previous models. Previous studies show that pretrained ResNets extract features from cervical cancer images well. Hence, our first model involves working with three ResNets (50, 1…

    Submitted 24 June, 2025; v1 submitted 1 May, 2024; originally announced May 2024.

    Comments: 26 Pages, 10 figures, and 8 tables

    ACM Class: I.2.1; I.5.2