Skip to main content

Showing 1–9 of 9 results for author: Dedhia, B

Searching in archive cs. Search in all archives.
.
  1. arXiv:2503.17539  [pdf, other

    cs.CV

    Generating, Fast and Slow: Scalable Parallel Video Generation with Video Interface Networks

    Authors: Bhishma Dedhia, David Bourgin, Krishna Kumar Singh, Yuheng Li, Yan Kang, Zhan Xu, Niraj K. Jha, Yuchen Liu

    Abstract: Diffusion Transformers (DiTs) can generate short photorealistic videos, yet directly training and sampling longer videos with full attention across the video remains computationally challenging. Alternative methods break long videos down into sequential generation of short video segments, requiring multiple sampling chain iterations and specialized consistency modules. To overcome these challenges… ▽ More

    Submitted 21 March, 2025; originally announced March 2025.

  2. arXiv:2403.07887  [pdf, other

    cs.CV cs.AI

    Neural Slot Interpreters: Grounding Object Semantics in Emergent Slot Representations

    Authors: Bhishma Dedhia, Niraj K. Jha

    Abstract: Several accounts of human cognition posit that our intelligence is rooted in our ability to form abstract composable concepts, ground them in our environment, and reason over these grounded entities. This trifecta of human thought has remained elusive in modern intelligent machines. In this work, we investigate whether slot representations extracted from visual scenes serve as appropriate composit… ▽ More

    Submitted 8 May, 2025; v1 submitted 2 February, 2024; originally announced March 2024.

  3. arXiv:2305.17328  [pdf, other

    cs.CV cs.AI cs.LG eess.IV

    Zero-TPrune: Zero-Shot Token Pruning through Leveraging of the Attention Graph in Pre-Trained Transformers

    Authors: Hongjie Wang, Bhishma Dedhia, Niraj K. Jha

    Abstract: Deployment of Transformer models on edge devices is becoming increasingly challenging due to the exponentially growing inference cost that scales quadratically with the number of tokens in the input sequence. Token pruning is an emerging solution to address this challenge due to its ease of deployment on various Transformer backbones. However, most token pruning methods require computationally exp… ▽ More

    Submitted 7 April, 2024; v1 submitted 26 May, 2023; originally announced May 2023.

    Comments: IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) 2024

  4. arXiv:2305.17262  [pdf, other

    cs.CV cs.AI

    Im-Promptu: In-Context Composition from Image Prompts

    Authors: Bhishma Dedhia, Michael Chang, Jake C. Snell, Thomas L. Griffiths, Niraj K. Jha

    Abstract: Large language models are few-shot learners that can solve diverse tasks from a handful of demonstrations. This implicit understanding of tasks suggests that the attention mechanisms over word tokens may play a role in analogical reasoning. In this work, we investigate whether analogical reasoning can enable in-context composition over composable elements of visual stimuli. First, we introduce a s… ▽ More

    Submitted 22 October, 2023; v1 submitted 26 May, 2023; originally announced May 2023.

  5. arXiv:2207.04208  [pdf, other

    cs.AI cs.LG

    SCouT: Synthetic Counterfactuals via Spatiotemporal Transformers for Actionable Healthcare

    Authors: Bhishma Dedhia, Roshini Balasubramanian, Niraj K. Jha

    Abstract: The Synthetic Control method has pioneered a class of powerful data-driven techniques to estimate the counterfactual reality of a unit from donor units. At its core, the technique involves a linear model fitted on the pre-intervention period that combines donor outcomes to yield the counterfactual. However, linearly combining spatial information at each time instance using time-agnostic weights fa… ▽ More

    Submitted 23 November, 2022; v1 submitted 9 July, 2022; originally announced July 2022.

  6. arXiv:2205.11656  [pdf, other

    cs.LG cs.CL

    FlexiBERT: Are Current Transformer Architectures too Homogeneous and Rigid?

    Authors: Shikhar Tuli, Bhishma Dedhia, Shreshth Tuli, Niraj K. Jha

    Abstract: The existence of a plethora of language models makes the problem of selecting the best one for a custom task challenging. Most state-of-the-art methods leverage transformer-based models (e.g., BERT) or their variants. Training such models and exploring their hyperparameter space, however, is computationally expensive. Prior work proposes several neural architecture search (NAS) methods that employ… ▽ More

    Submitted 23 May, 2022; originally announced May 2022.

    Comments: Preprint. In review

  7. arXiv:2009.07842  [pdf, ps, other

    cs.LG math.OC stat.ML

    Lower Bounds for Policy Iteration on Multi-action MDPs

    Authors: Kumar Ashutosh, Sarthak Consul, Bhishma Dedhia, Parthasarathi Khirwadkar, Sahil Shah, Shivaram Kalyanakrishnan

    Abstract: Policy Iteration (PI) is a classical family of algorithms to compute an optimal policy for any given Markov Decision Problem (MDP). The basic idea in PI is to begin with some initial policy and to repeatedly update the policy to one from an improving set, until an optimal policy is reached. Different variants of PI result from the (switching) rule used for improvement. An important theoretical que… ▽ More

    Submitted 16 September, 2020; originally announced September 2020.

    Comments: 8 pages, 3 diagrams, 2 tables. Paper in IEEE CDC 2020

  8. arXiv:2005.04036  [pdf, other

    eess.SY cs.IT

    On Minimizing Channel-Aware Age of Information in a Multi-Sensor Setting

    Authors: Bhishma Dedhia, Sharayu Moharir

    Abstract: We propose a variant of the Age of Information (AoI) metric called Channel-Aware Age of Information (CA-AoI). Unlike AoI, CA-AoI takes into account the channel conditions between the source and the intended destination to compute the "age" of the recent most update received by the destination. This new metric ensures that the resource allocation is not heavily tilted towards the sources with poor… ▽ More

    Submitted 11 January, 2021; v1 submitted 8 May, 2020; originally announced May 2020.

  9. arXiv:1911.12842  [pdf, other

    cs.LG math.OC stat.ML

    Analysis of Lower Bounds for Simple Policy Iteration

    Authors: Sarthak Consul, Bhishma Dedhia, Kumar Ashutosh, Parthasarathi Khirwadkar

    Abstract: Policy iteration is a family of algorithms that are used to find an optimal policy for a given Markov Decision Problem (MDP). Simple Policy iteration (SPI) is a type of policy iteration where the strategy is to change the policy at exactly one improvable state at every step. Melekopoglou and Condon [1990] showed an exponential lower bound on the number of iterations taken by SPI for a 2 action MDP… ▽ More

    Submitted 28 November, 2019; originally announced November 2019.