Skip to main content

Showing 1–5 of 5 results for author: Dewan, S

Searching in archive cs. Search in all archives.
.
  1. arXiv:2410.05266  [pdf, other

    cs.CV q-bio.NC

    Brain Mapping with Dense Features: Grounding Cortical Semantic Selectivity in Natural Images With Vision Transformers

    Authors: Andrew F. Luo, Jacob Yeung, Rushikesh Zawar, Shaurya Dewan, Margaret M. Henderson, Leila Wehbe, Michael J. Tarr

    Abstract: Advances in large-scale artificial neural networks have facilitated novel insights into the functional topology of the brain. Here, we leverage this approach to study how semantic categories are organized in the human visual cortex. To overcome the challenge presented by the co-occurrence of multiple categories in natural images, we introduce BrainSAIL (Semantic Attribution and Image Localization)… ▽ More

    Submitted 7 October, 2024; originally announced October 2024.

  2. arXiv:2406.13735  [pdf, other

    cs.CV cs.LG

    StableSemantics: A Synthetic Language-Vision Dataset of Semantic Representations in Naturalistic Images

    Authors: Rushikesh Zawar, Shaurya Dewan, Andrew F. Luo, Margaret M. Henderson, Michael J. Tarr, Leila Wehbe

    Abstract: Understanding the semantics of visual scenes is a fundamental challenge in Computer Vision. A key aspect of this challenge is that objects sharing similar semantic meanings or functions can exhibit striking visual differences, making accurate identification and categorization difficult. Recent advancements in text-to-image frameworks have led to models that implicitly capture natural scene statist… ▽ More

    Submitted 19 June, 2024; originally announced June 2024.

    Comments: Dataset website: https://stablesemantics.github.io/StableSemantics

  3. arXiv:2406.05191  [pdf, other

    cs.CV

    DiffusionPID: Interpreting Diffusion via Partial Information Decomposition

    Authors: Rushikesh Zawar, Shaurya Dewan, Prakanshul Saxena, Yingshan Chang, Andrew Luo, Yonatan Bisk

    Abstract: Text-to-image diffusion models have made significant progress in generating naturalistic images from textual inputs, and demonstrate the capacity to learn and represent complex visual-semantic relationships. While these diffusion models have achieved remarkable success, the underlying mechanisms driving their performance are not yet fully accounted for, with many unanswered questions surrounding w… ▽ More

    Submitted 14 November, 2024; v1 submitted 7 June, 2024; originally announced June 2024.

    Journal ref: Thirty-Eighth Annual Conference on Neural Information Processing Systems (2024)

  4. arXiv:2401.04198  [pdf, other

    cs.LG cs.AI

    Curiosity & Entropy Driven Unsupervised RL in Multiple Environments

    Authors: Shaurya Dewan, Anisha Jain, Zoe LaLena, Lifan Yu

    Abstract: The authors of 'Unsupervised Reinforcement Learning in Multiple environments' propose a method, alpha-MEPOL, to tackle unsupervised RL across multiple environments. They pre-train a task-agnostic exploration policy using interactions from an entire environment class and then fine-tune this policy for various tasks using supervision. We expanded upon this work, with the goal of improving performanc… ▽ More

    Submitted 8 January, 2024; originally announced January 2024.

  5. arXiv:2212.02493  [pdf, other

    cs.CV

    Canonical Fields: Self-Supervised Learning of Pose-Canonicalized Neural Fields

    Authors: Rohith Agaram, Shaurya Dewan, Rahul Sajnani, Adrien Poulenard, Madhava Krishna, Srinath Sridhar

    Abstract: Coordinate-based implicit neural networks, or neural fields, have emerged as useful representations of shape and appearance in 3D computer vision. Despite advances, however, it remains challenging to build neural fields for categories of objects without datasets like ShapeNet that provide "canonicalized" object instances that are consistently aligned for their 3D position and orientation (pose). W… ▽ More

    Submitted 17 May, 2023; v1 submitted 5 December, 2022; originally announced December 2022.