
Showing 1–14 of 14 results for author: Bhandari, N

  1. arXiv:2504.08713  [pdf, other]

    cs.LG cs.AI

    ProtoECGNet: Case-Based Interpretable Deep Learning for Multi-Label ECG Classification with Contrastive Learning

    Authors: Sahil Sethi, David Chen, Thomas Statchen, Michael C. Burkhart, Nipun Bhandari, Bashar Ramadan, Brett Beaulieu-Jones

    Abstract: Deep learning-based electrocardiogram (ECG) classification has shown impressive performance but clinical adoption has been slowed by the lack of transparent and faithful explanations. Post hoc methods such as saliency maps may fail to reflect a model's true decision process. Prototype-based reasoning offers a more transparent alternative by grounding decisions in similarity to learned representati…

    Submitted 17 May, 2025; v1 submitted 11 April, 2025; originally announced April 2025.

  2. arXiv:2504.08231  [pdf, other]

    cs.CL

    Out of Style: RAG's Fragility to Linguistic Variation

    Authors: Tianyu Cao, Neel Bhandari, Akhila Yerukola, Akari Asai, Maarten Sap

    Abstract: Despite the impressive performance of Retrieval-augmented Generation (RAG) systems across various NLP benchmarks, their robustness in handling real-world user-LLM interaction queries remains largely underexplored. This presents a critical gap for practical deployment, where user queries exhibit greater linguistic variations and can trigger cascading errors across interdependent RAG components. In…

    Submitted 10 April, 2025; originally announced April 2025.

  3. arXiv:2412.07937  [pdf, other]

    cs.CL

    Style-agnostic evaluation of ASR using multiple reference transcripts

    Authors: Quinten McNamara, Miguel Ángel del Río Fernández, Nishchal Bhandari, Martin Ratajczak, Danny Chen, Corey Miller, Migüel Jetté

    Abstract: Word error rate (WER) as a metric has a variety of limitations that have plagued the field of speech recognition. Evaluation datasets suffer from varying style, formality, and inherent ambiguity of the transcription task. In this work, we attempt to mitigate some of these differences by performing style-agnostic evaluation of ASR systems using multiple references transcribed under opposing style p…

    Submitted 10 December, 2024; originally announced December 2024.

  4. arXiv:2410.03930  [pdf, ps, other]

    cs.CL cs.SD eess.AS

    Reverb: Open-Source ASR and Diarization from Rev

    Authors: Nishchal Bhandari, Danny Chen, Miguel Ángel del Río Fernández, Natalie Delworth, Jennifer Drexler Fox, Migüel Jetté, Quinten McNamara, Corey Miller, Ondřej Novotný, Ján Profant, Nan Qin, Martin Ratajczak, Jean-Philippe Robichaud

    Abstract: Today, we are open-sourcing our core speech recognition and diarization models for non-commercial use. We are releasing both a full production pipeline for developers as well as pared-down research models for experimentation. Rev hopes that these releases will spur research and innovation in the fast-moving domain of voice technology. The speech recognition models released today outperform all exi…

    Submitted 24 February, 2025; v1 submitted 4 October, 2024; originally announced October 2024.

  5. Quantification of stylistic differences in human- and ASR-produced transcripts of African American English

    Authors: Annika Heuser, Tyler Kendall, Miguel del Rio, Quinten McNamara, Nishchal Bhandari, Corey Miller, Migüel Jetté

    Abstract: Common measures of accuracy used to assess the performance of automatic speech recognition (ASR) systems, as well as human transcribers, conflate multiple sources of error. Stylistic differences, such as verbatim vs non-verbatim, can play a significant role in ASR performance evaluation when differences exist between training and test datasets. The problem is compounded for speech from underrepres…

    Submitted 4 September, 2024; originally announced September 2024.

    Comments: Published in Interspeech 2024 Proceedings, 5 pages excluding references, 5 figures

  6. arXiv:2402.07827  [pdf, other]

    cs.CL

    Aya Model: An Instruction Finetuned Open-Access Multilingual Language Model

    Authors: Ahmet Üstün, Viraat Aryabumi, Zheng-Xin Yong, Wei-Yin Ko, Daniel D'souza, Gbemileke Onilude, Neel Bhandari, Shivalika Singh, Hui-Lee Ooi, Amr Kayid, Freddie Vargus, Phil Blunsom, Shayne Longpre, Niklas Muennighoff, Marzieh Fadaee, Julia Kreutzer, Sara Hooker

    Abstract: Recent breakthroughs in large language models (LLMs) have centered around a handful of data-rich languages. What does it take to broaden access to breakthroughs beyond first-class citizen languages? Our work introduces Aya, a massively multilingual generative language model that follows instructions in 101 languages of which over 50% are considered as lower-resourced. Aya outperforms mT0 and BLOOM…

    Submitted 12 February, 2024; originally announced February 2024.

  7. arXiv:2401.04144  [pdf, other]

    cs.LG cs.AI

    Robust Calibration For Improved Weather Prediction Under Distributional Shift

    Authors: Sankalp Gilda, Neel Bhandari, Wendy Mak, Andrea Panizza

    Abstract: In this paper, we present results on improving out-of-domain weather prediction and uncertainty estimation as part of the Shifts Challenge on Robustness and Uncertainty under Real-World Distributional Shift. We find that by leveraging a mixture of experts in conjunction with an advanced data augmentation technique borrowed from the computer vision domain, in conjunction with rob…

    Submitted 7 January, 2024; originally announced January 2024.

    Comments: Presented at the Bayesian Deep Learning workshop at NeurIPS 2021

  8. Lost In Translation: Generating Adversarial Examples Robust to Round-Trip Translation

    Authors: Neel Bhandari, Pin-Yu Chen

    Abstract: Language Models today provide a high accuracy across a large number of downstream tasks. However, they remain susceptible to adversarial attacks, particularly against those where the adversarial examples maintain considerable similarity to the original text. Given the multilingual nature of text, the effectiveness of adversarial examples across translations and how machine translations can improve…

    Submitted 24 July, 2023; originally announced July 2023.

    Comments: Published at International Conference on Acoustics, Speech, and Signal Processing (ICASSP) 2023

  9. arXiv:2202.02958  [pdf]

    q-bio.GN cs.AI cs.LG

    A comprehensive survey on computational learning methods for analysis of gene expression data

    Authors: Nikita Bhandari, Rahee Walambe, Ketan Kotecha, Satyajeet Khare

    Abstract: Computational analysis methods including machine learning have a significant impact in the fields of genomics and medicine. High-throughput gene expression analysis methods such as microarray technology and RNA sequencing produce enormous amounts of data. Traditionally, statistical methods are used for comparative analysis of gene expression data. However, more complex analysis for classification…

    Submitted 27 September, 2022; v1 submitted 7 February, 2022; originally announced February 2022.

    Comments: 43 pages, 8 figures, 5 tables

  10. arXiv:2105.07659  [pdf]

    q-bio.GN cs.LG

    Comparison of machine learning and deep learning techniques in promoter prediction across diverse species

    Authors: Nikita Bhandari, Satyajeet Khare, Rahee Walambe, Ketan Kotecha

    Abstract: Gene promoters are the key DNA regulatory elements positioned around the transcription start sites and are responsible for regulating gene transcription process. Various alignment-based, signal-based and content-based approaches are reported for the prediction of promoters. However, since all promoter sequences do not show explicit features, the prediction performance of these techniques is poor.…

    Submitted 17 May, 2021; originally announced May 2021.

    Comments: 17 pages, 4 figures, 4 tables

    Journal ref: PeerJ Comput. Sci. 7:e365 (2021)

  11. Earnings-21: A Practical Benchmark for ASR in the Wild

    Authors: Miguel Del Rio, Natalie Delworth, Ryan Westerman, Michelle Huang, Nishchal Bhandari, Joseph Palakapilly, Quinten McNamara, Joshua Dong, Piotr Zelasko, Miguel Jette

    Abstract: Commonly used speech corpora inadequately challenge academic and commercial ASR systems. In particular, speech corpora lack metadata needed for detailed analysis and WER measurement. In response, we present Earnings-21, a 39-hour corpus of earnings calls containing entity-dense speech from nine different financial sectors. This corpus is intended to benchmark ASR systems in the wild with special a…

    Submitted 15 June, 2021; v1 submitted 22 April, 2021; originally announced April 2021.

    Comments: Accepted to INTERSPEECH 2021. June 15 2021: Addressing the comments of reviewers and updating the results of our internal ESPNet model. The results do not change our conclusions. April 28th, 2021: We found and resolved an issue in our experimental evaluation that scored the LibriSpeech model at ~20% worse relative WER than the actual WER. The updated results do not affect our conclusions

  12. arXiv:2104.10747  [pdf, ps, other]

    cs.CL cs.SD eess.AS

    Accented Speech Recognition: A Survey

    Authors: Arthur Hinsvark, Natalie Delworth, Miguel Del Rio, Quinten McNamara, Joshua Dong, Ryan Westerman, Michelle Huang, Joseph Palakapilly, Jennifer Drexler, Ilya Pirkin, Nishchal Bhandari, Miguel Jette

    Abstract: Automatic Speech Recognition (ASR) systems generalize poorly on accented speech. The phonetic and linguistic variability of accents presents hard challenges for ASR systems today in both data collection and modeling strategies. The resulting bias in ASR performance across accents comes at a cost to both users and providers of ASR. We present a survey of current promising approaches to accented sp…

    Submitted 2 June, 2021; v1 submitted 21 April, 2021; originally announced April 2021.

  13. arXiv:2007.08032  [pdf, other]

    cs.CV cs.LG

    When and how CNNs generalize to out-of-distribution category-viewpoint combinations

    Authors: Spandan Madan, Timothy Henry, Jamell Dozier, Helen Ho, Nishchal Bhandari, Tomotake Sasaki, Frédo Durand, Hanspeter Pfister, Xavier Boix

    Abstract: Object recognition and viewpoint estimation lie at the heart of visual understanding. Recent works suggest that convolutional neural networks (CNNs) fail to generalize to out-of-distribution (OOD) category-viewpoint combinations, i.e., combinations not seen during training. In this paper, we investigate when and how such OOD generalization may be possible by evaluating CNNs trained to classify both…

    Submitted 17 November, 2021; v1 submitted 15 July, 2020; originally announced July 2020.

  14. arXiv:1204.3874  [pdf]

    cs.NI

    Overview of MC CDMA PAPR Reduction Techniques

    Authors: B. Sarala, D. S. Venkateswarulu, B. N. Bhandari

    Abstract: High Peak to Average Power Ratio (PAPR) of the transmitted signal is a critical problem in multicarrier modulation systems (MCM) such as Orthogonal Frequency Division Multiplexing (OFDM), and Multi-Carrier Code Division Multiple Access (MC CDMA) systems, due to large number of subcarriers. High PAPR leads to reduced resolution, and battery life. It also deteriorates system performance. This paper…

    Submitted 10 April, 2012; originally announced April 2012.

    Comments: 14 pages, 7 figures, IJDPS March 2012