Showing 1–9 of 9 results for author: Hemani, M

  1. arXiv:2505.03394  [pdf, ps, other]

    cs.CV

    EOPose : Exemplar-based object reposing using Generalized Pose Correspondences

    Authors: Sarthak Mehrotra, Rishabh Jain, Mayur Hemani, Balaji Krishnamurthy, Mausoom Sarkar

    Abstract: Reposing objects in images has a myriad of applications, especially for e-commerce where several variants of product images need to be produced quickly. In this work, we leverage the recent advances in unsupervised keypoint correspondence detection between different object images of the same class to propose an end-to-end framework for generic object reposing. Our method, EOPose, takes a target po…

    Submitted 6 May, 2025; originally announced May 2025.

    Comments: Accepted in CVPR 2025 AI4CC workshop

  2. arXiv:2307.04392  [pdf, other]

    cs.CV

    FODVid: Flow-guided Object Discovery in Videos

    Authors: Silky Singh, Shripad Deshmukh, Mausoom Sarkar, Rishabh Jain, Mayur Hemani, Balaji Krishnamurthy

    Abstract: Segmentation of objects in a video is challenging due to the nuances such as motion blurring, parallax, occlusions, changes in illumination, etc. Instead of addressing these nuances separately, we focus on building a generalizable solution that avoids overfitting to the individual intricacies. Such a solution would also help us save enormous resources involved in human annotation of video corpora.…

    Submitted 10 July, 2023; originally announced July 2023.

    Comments: CVPR 2023 (L3D-IVU workshop)

  3. arXiv:2303.15122  [pdf, other]

    cs.CV

    Parameter Efficient Local Implicit Image Function Network for Face Segmentation

    Authors: Mausoom Sarkar, Nikitha SR, Mayur Hemani, Rishabh Jain, Balaji Krishnamurthy

    Abstract: Face parsing is defined as the per-pixel labeling of images containing human faces. The labels are defined to identify key facial regions like eyes, lips, nose, hair, etc. In this work, we make use of the structural consistency of the human face to propose a lightweight face-parsing method using a Local Implicit Function network, FP-LIIF. We propose a simple architecture having a convolutional enc…

    Submitted 27 March, 2023; originally announced March 2023.

    Comments: Accepted at CVPR 2023

  4. arXiv:2303.10431  [pdf, other]

    cs.CV

    DeAR: Debiasing Vision-Language Models with Additive Residuals

    Authors: Ashish Seth, Mayur Hemani, Chirag Agarwal

    Abstract: Large pre-trained vision-language models (VLMs) reduce the time for developing predictive models for various vision-grounded language downstream tasks by providing rich, adaptable image and text representations. However, these models suffer from societal biases owing to the skewed distribution of various identity groups in the training data. These biases manifest as the skewed similarity between t…

    Submitted 18 March, 2023; originally announced March 2023.

    Comments: Accepted to CVPR'23. Codes and dataset will be released soon

  5. arXiv:2211.10157  [pdf, other]

    cs.CV cs.AI

    UMFuse: Unified Multi View Fusion for Human Editing applications

    Authors: Rishabh Jain, Mayur Hemani, Duygu Ceylan, Krishna Kumar Singh, Jingwan Lu, Mausoom Sarkar, Balaji Krishnamurthy

    Abstract: Numerous pose-guided human editing methods have been explored by the vision community due to their extensive practical applications. However, most of these methods still use an image-to-image formulation in which a single image is given as input to produce an edited image as output. This objective becomes ill-defined in cases when the target pose differs significantly from the input pose. Existing…

    Submitted 28 March, 2023; v1 submitted 17 November, 2022; originally announced November 2022.

    Comments: 8 pages, 6 figures

    ACM Class: I.4; I.5

  6. arXiv:2211.08540  [pdf, other]

    cs.CV cs.AI

    VGFlow: Visibility guided Flow Network for Human Reposing

    Authors: Rishabh Jain, Krishna Kumar Singh, Mayur Hemani, Jingwan Lu, Mausoom Sarkar, Duygu Ceylan, Balaji Krishnamurthy

    Abstract: The task of human reposing involves generating a realistic image of a person standing in an arbitrary conceivable pose. There are multiple difficulties in generating perceptually accurate images, and existing methods suffer from limitations in preserving texture, maintaining pattern coherence, respecting cloth boundaries, handling occlusions, manipulating skin generation, etc. These difficulties a…

    Submitted 28 March, 2023; v1 submitted 13 November, 2022; originally announced November 2022.

    Comments: Selected for publication in CVPR2023

    ACM Class: I.4; I.5

  7. arXiv:2109.07001  [pdf, other]

    cs.CV

    ZFlow: Gated Appearance Flow-based Virtual Try-on with 3D Priors

    Authors: Ayush Chopra, Rishabh Jain, Mayur Hemani, Balaji Krishnamurthy

    Abstract: Image-based virtual try-on involves synthesizing perceptually convincing images of a model wearing a particular garment and has garnered significant research interest due to its immense practical applicability. Recent methods involve a two stage process: i) warping of the garment to align with the model ii) texture fusion of the warped garment and target model to generate the try-on output. Issues…

    Submitted 14 September, 2021; originally announced September 2021.

    Comments: Accepted at ICCV 2021

  8. arXiv:2004.15014  [pdf, other]

    cs.CV

    SimPropNet: Improved Similarity Propagation for Few-shot Image Segmentation

    Authors: Siddhartha Gairola, Mayur Hemani, Ayush Chopra, Balaji Krishnamurthy

    Abstract: Few-shot segmentation (FSS) methods perform image segmentation for a particular object class in a target (query) image, using a small set of (support) image-mask pairs. Recent deep neural network based FSS methods leverage high-dimensional feature similarity between the foreground features of the support images and the query image features. In this work, we demonstrate gaps in the utilization of t…

    Submitted 2 May, 2020; v1 submitted 30 April, 2020; originally announced April 2020.

    Comments: An updated version of this work was accepted at IJCAI 2020

  9. arXiv:2001.06265  [pdf, other]

    cs.CV cs.LG eess.IV

    SieveNet: A Unified Framework for Robust Image-Based Virtual Try-On

    Authors: Surgan Jandial, Ayush Chopra, Kumar Ayush, Mayur Hemani, Abhijeet Kumar, Balaji Krishnamurthy

    Abstract: Image-based virtual try-on for fashion has gained considerable attention recently. The task requires trying on a clothing item on a target model image. An efficient framework for this is composed of two stages: (1) warping (transforming) the try-on cloth to align with the pose and shape of the target model, and (2) a texture transfer module to seamlessly integrate the warped try-on cloth onto the…

    Submitted 17 January, 2020; originally announced January 2020.

    Comments: Accepted at IEEE WACV 2020