Skip to main content

Showing 1–7 of 7 results for author: Kharbanda, S

Searching in archive cs. Search in all archives.
.
  1. arXiv:2506.17298  [pdf, other

    cs.CL cs.AI cs.LG

    Mercury: Ultra-Fast Language Models Based on Diffusion

    Authors: Inception Labs, Samar Khanna, Siddhant Kharbanda, Shufan Li, Harshit Varma, Eric Wang, Sawyer Birnbaum, Ziyang Luo, Yanis Miraoui, Akash Palrecha, Stefano Ermon, Aditya Grover, Volodymyr Kuleshov

    Abstract: We present Mercury, a new generation of commercial-scale large language models (LLMs) based on diffusion. These models are parameterized via the Transformer architecture and trained to predict multiple tokens in parallel. In this report, we detail Mercury Coder, our first set of diffusion LLMs designed for coding applications. Currently, Mercury Coder comes in two sizes: Mini and Small. These mode… ▽ More

    Submitted 17 June, 2025; originally announced June 2025.

    Comments: 15 pages; equal core, cross-function, senior authors listed alphabetically

  2. arXiv:2405.04545  [pdf, other

    cs.LG cs.IR

    Learning label-label correlations in Extreme Multi-label Classification via Label Features

    Authors: Siddhant Kharbanda, Devaansh Gupta, Erik Schultheis, Atmadeep Banerjee, Cho-Jui Hsieh, Rohit Babbar

    Abstract: Extreme Multi-label Text Classification (XMC) involves learning a classifier that can assign an input with a subset of most relevant labels from millions of label choices. Recent works in this domain have increasingly focused on a symmetric problem setting where both input instances and label features are short-text in nature. Short-text XMC with label features has found numerous applications in a… ▽ More

    Submitted 3 May, 2024; originally announced May 2024.

  3. arXiv:2405.03714  [pdf, other

    cs.LG cs.AI

    UniDEC : Unified Dual Encoder and Classifier Training for Extreme Multi-Label Classification

    Authors: Siddhant Kharbanda, Devaansh Gupta, Gururaj K, Pankaj Malhotra, Amit Singh, Cho-Jui Hsieh, Rohit Babbar

    Abstract: Extreme Multi-label Classification (XMC) involves predicting a subset of relevant labels from an extremely large label space, given an input query and labels with textual features. Models developed for this problem have conventionally made use of dual encoder (DE) to embed the queries and label texts and one-vs-all (OvA) classifiers to rerank the shortlisted labels by the DE. While such methods ha… ▽ More

    Submitted 3 March, 2025; v1 submitted 4 May, 2024; originally announced May 2024.

    Journal ref: In Proceedings of the ACM Web Conference 2025 (WWW 2025)

  4. arXiv:2308.15226  [pdf, other

    cs.CV cs.AI cs.CL

    CLIPTrans: Transferring Visual Knowledge with Pre-trained Models for Multimodal Machine Translation

    Authors: Devaansh Gupta, Siddhant Kharbanda, Jiawei Zhou, Wanhua Li, Hanspeter Pfister, Donglai Wei

    Abstract: There has been a growing interest in developing multimodal machine translation (MMT) systems that enhance neural machine translation (NMT) with visual knowledge. This problem setup involves using images as auxiliary information during training, and more recently, eliminating their use during inference. Towards this end, previous works face a challenge in training powerful MMT models from scratch d… ▽ More

    Submitted 29 August, 2023; originally announced August 2023.

    Comments: 15 pages, 9 figures, to be published In Proceedings of International Conference of Computer Vision(ICCV), 2023

  5. arXiv:2211.00640  [pdf, ps, other

    cs.LG cs.CL stat.ML

    CascadeXML: Rethinking Transformers for End-to-end Multi-resolution Training in Extreme Multi-label Classification

    Authors: Siddhant Kharbanda, Atmadeep Banerjee, Erik Schultheis, Rohit Babbar

    Abstract: Extreme Multi-label Text Classification (XMC) involves learning a classifier that can assign an input with a subset of most relevant labels from millions of label choices. Recent approaches, such as XR-Transformer and LightXML, leverage a transformer instance to achieve state-of-the-art performance. However, in this process, these approaches need to make various trade-offs between performance and… ▽ More

    Submitted 29 October, 2022; originally announced November 2022.

  6. arXiv:2109.07319  [pdf, other

    cs.CL cs.AI cs.LG

    InceptionXML: A Lightweight Framework with Synchronized Negative Sampling for Short Text Extreme Classification

    Authors: Siddhant Kharbanda, Atmadeep Banerjee, Devaansh Gupta, Akash Palrecha, Rohit Babbar

    Abstract: Automatic annotation of short-text data to a large number of target labels, referred to as Short Text Extreme Classification, has found numerous applications including prediction of related searches and product recommendation. In this paper, we propose a convolutional architecture InceptionXML which is light-weight, yet powerful, and robust to the inherent lack of word-order in short-text queries… ▽ More

    Submitted 3 May, 2024; v1 submitted 13 September, 2021; originally announced September 2021.

  7. arXiv:2101.05478  [pdf, other

    cs.CL cs.SD eess.AS

    WER-BERT: Automatic WER Estimation with BERT in a Balanced Ordinal Classification Paradigm

    Authors: Akshay Krishna Sheshadri, Anvesh Rao Vijjini, Sukhdeep Kharbanda

    Abstract: Automatic Speech Recognition (ASR) systems are evaluated using Word Error Rate (WER), which is calculated by comparing the number of errors between the ground truth and the transcription of the ASR system. This calculation, however, requires manual transcription of the speech signal to obtain the ground truth. Since transcribing audio signals is a costly process, Automatic WER Evaluation (e-WER) m… ▽ More

    Submitted 13 February, 2021; v1 submitted 14 January, 2021; originally announced January 2021.

    Comments: Accepted Long Paper at EACL 2021