Skip to main content

Showing 1–5 of 5 results for author: Mohan, D D

.
  1. arXiv:2501.08347  [pdf, other

    cs.CV cs.AI

    SCOT: Self-Supervised Contrastive Pretraining For Zero-Shot Compositional Retrieval

    Authors: Bhavin Jawade, Joao V. B. Soares, Kapil Thadani, Deen Dayal Mohan, Amir Erfan Eshratifar, Benjamin Culpepper, Paloma de Juan, Srirangaraj Setlur, Venu Govindaraju

    Abstract: Compositional image retrieval (CIR) is a multimodal learning task where a model combines a query image with a user-provided text modification to retrieve a target image. CIR finds applications in a variety of domains including product retrieval (e-commerce) and web search. Existing methods primarily focus on fully-supervised learning, wherein models are trained on datasets of labeled triplets such… ▽ More

    Submitted 12 January, 2025; originally announced January 2025.

    Comments: Paper accepted at WACV 2025 in round 1

  2. arXiv:2312.10046  [pdf, other

    cs.CV cs.AI cs.IR cs.LG

    Deep Metric Learning for Computer Vision: A Brief Overview

    Authors: Deen Dayal Mohan, Bhavin Jawade, Srirangaraj Setlur, Venu Govindaraj

    Abstract: Objective functions that optimize deep neural networks play a vital role in creating an enhanced feature representation of the input data. Although cross-entropy-based loss formulations have been extensively used in a variety of supervised deep-learning applications, these methods tend to be less adequate when there is large intra-class variance and low inter-class variance in input data distribut… ▽ More

    Submitted 1 December, 2023; originally announced December 2023.

    Comments: Book Chapter Published In Handbook of Statistics, Special Issue - Deep Learning 48, 59

  3. arXiv:2307.10237  [pdf, other

    cs.CV cs.AI cs.LG

    CoNAN: Conditional Neural Aggregation Network For Unconstrained Face Feature Fusion

    Authors: Bhavin Jawade, Deen Dayal Mohan, Dennis Fedorishin, Srirangaraj Setlur, Venu Govindaraju

    Abstract: Face recognition from image sets acquired under unregulated and uncontrolled settings, such as at large distances, low resolutions, varying viewpoints, illumination, pose, and atmospheric conditions, is challenging. Face feature aggregation, which involves aggregating a set of N feature representations present in a template into a single global representation, plays a pivotal role in such recognit… ▽ More

    Submitted 16 July, 2023; originally announced July 2023.

    Comments: Paper accepted at IJCB 2023

  4. RidgeBase: A Cross-Sensor Multi-Finger Contactless Fingerprint Dataset

    Authors: Bhavin Jawade, Deen Dayal Mohan, Srirangaraj Setlur, Nalini Ratha, Venu Govindaraju

    Abstract: Contactless fingerprint matching using smartphone cameras can alleviate major challenges of traditional fingerprint systems including hygienic acquisition, portability and presentation attacks. However, development of practical and robust contactless fingerprint matching techniques is constrained by the limited availability of large scale real-world datasets. To motivate further advances in contac… ▽ More

    Submitted 9 July, 2023; originally announced July 2023.

    Comments: Paper accepted at IJCB 2022

    Journal ref: 2022 IEEE International Joint Conference on Biometrics (IJCB), Abu Dhabi, United Arab Emirates, 2022, pp. 1-9

  5. arXiv:2211.03019  [pdf, other

    cs.CV

    Hear The Flow: Optical Flow-Based Self-Supervised Visual Sound Source Localization

    Authors: Dennis Fedorishin, Deen Dayal Mohan, Bhavin Jawade, Srirangaraj Setlur, Venu Govindaraju

    Abstract: Learning to localize the sound source in videos without explicit annotations is a novel area of audio-visual research. Existing work in this area focuses on creating attention maps to capture the correlation between the two modalities to localize the source of the sound. In a video, oftentimes, the objects exhibiting movement are the ones generating the sound. In this work, we capture this charact… ▽ More

    Submitted 5 November, 2022; originally announced November 2022.

    Comments: Accepted to WACV 2023