Skip to main content

Showing 1–4 of 4 results for author: Govindarajan, H

.
  1. arXiv:2503.09878  [pdf, other

    cs.CV cs.AI cs.RO

    CleverDistiller: Simple and Spatially Consistent Cross-modal Distillation

    Authors: Hariprasath Govindarajan, Maciej K. Wozniak, Marvin Klingner, Camille Maurice, B Ravi Kiran, Senthil Yogamani

    Abstract: Vision foundation models (VFMs) such as DINO have led to a paradigm shift in 2D camera-based perception towards extracting generalized features to support many downstream tasks. Recent works introduce self-supervised cross-modal knowledge distillation (KD) as a way to transfer these powerful generalization capabilities into 3D LiDAR-based models. However, they either rely on highly complex distill… ▽ More

    Submitted 12 March, 2025; originally announced March 2025.

  2. arXiv:2410.23085  [pdf, other

    cs.CV cs.AI cs.RO

    S3PT: Scene Semantics and Structure Guided Clustering to Boost Self-Supervised Pre-Training for Autonomous Driving

    Authors: Maciej K. Wozniak, Hariprasath Govindarajan, Marvin Klingner, Camille Maurice, B Ravi Kiran, Senthil Yogamani

    Abstract: Recent self-supervised clustering-based pre-training techniques like DINO and Cribo have shown impressive results for downstream detection and segmentation tasks. However, real-world applications such as autonomous driving face challenges with imbalanced object class and size distributions and complex scene geometries. In this paper, we propose S3PT a novel scene semantics and structure guided clu… ▽ More

    Submitted 24 January, 2025; v1 submitted 30 October, 2024; originally announced October 2024.

    Comments: Accepted for WACV 2025 (Oral)

  3. arXiv:2410.14060  [pdf, other

    cs.LG cs.AI cs.CV

    On Partial Prototype Collapse in the DINO Family of Self-Supervised Methods

    Authors: Hariprasath Govindarajan, Per Sidén, Jacob Roll, Fredrik Lindsten

    Abstract: A prominent self-supervised learning paradigm is to model the representations as clusters, or more generally as a mixture model. Learning to map the data samples to compact representations and fitting the mixture model simultaneously leads to the representation collapse problem. Regularizing the distribution of data points over the clusters is the prevalent strategy to avoid this issue. While this… ▽ More

    Submitted 17 October, 2024; originally announced October 2024.

    Comments: First version of the paper appeared in OpenReview on 22 Sep 2023. Accepted to BMVC 2024

  4. arXiv:2405.10939  [pdf, other

    cs.LG cs.AI cs.CV

    DINO as a von Mises-Fisher mixture model

    Authors: Hariprasath Govindarajan, Per Sidén, Jacob Roll, Fredrik Lindsten

    Abstract: Self-distillation methods using Siamese networks are popular for self-supervised pre-training. DINO is one such method based on a cross-entropy loss between $K$-dimensional probability vectors, obtained by applying a softmax function to the dot product between representations and learnt prototypes. Given the fact that the learned representations are $L^2$-normalized, we show that DINO and its deri… ▽ More

    Submitted 17 May, 2024; originally announced May 2024.

    Comments: Accepted to ICLR 2023