Showing 1–9 of 9 results for author: Revanur, A

  1. arXiv:2404.12541

    cs.CV

    GenVideo: One-shot Target-image and Shape Aware Video Editing using T2I Diffusion Models

    Authors: Sai Sree Harsha, Ambareesh Revanur, Dhwanit Agarwal, Shradha Agrawal

    Abstract: Video editing methods based on diffusion models that rely solely on a text prompt for the edit are hindered by the limited expressive power of text prompts. Thus, incorporating a reference target image as a visual guide becomes desirable for precise control over edit. Also, most existing methods struggle to accurately edit a video when the shape and size of the object in the target image differ fr…

    Submitted 18 April, 2024; originally announced April 2024.

    Comments: CVPRw 2024

  2. arXiv:2303.05031

    cs.CV

    CoralStyleCLIP: Co-optimized Region and Layer Selection for Image Editing

    Authors: Ambareesh Revanur, Debraj Basu, Shradha Agrawal, Dhwanit Agarwal, Deepak Pai

    Abstract: Edit fidelity is a significant issue in open-world controllable generative image editing. Recently, CLIP-based approaches have traded off simplicity to alleviate these problems by introducing spatial attention in a handpicked layer of a StyleGAN. In this paper, we propose CoralStyleCLIP, which incorporates a multi-layer attention-guided blending strategy in the feature space of StyleGAN2 for obtai…

    Submitted 8 March, 2023; originally announced March 2023.

    Comments: CVPR 2023

  3. arXiv:2202.12368

    cs.CV

    Instantaneous Physiological Estimation using Video Transformers

    Authors: Ambareesh Revanur, Ananyananda Dasari, Conrad S. Tucker, Laszlo A. Jeni

    Abstract: Video-based physiological signal estimation has been limited primarily to predicting episodic scores in windowed intervals. While these intermittent values are useful, they provide an incomplete picture of patients' physiological status and may lead to late detection of critical conditions. We propose a video Transformer for estimating instantaneous heart rate and respiration rate from face videos…

    Submitted 24 February, 2022; originally announced February 2022.

    Comments: 13 pages, 4 figures, AAAI workshop and Springer Studies in Computational Intelligence 2022. For project page see https://github.com/revanurambareesh/instantaneous_transformer

  4. arXiv:2109.10471

    cs.CY cs.CV cs.LG eess.IV

    The First Vision For Vitals (V4V) Challenge for Non-Contact Video-Based Physiological Estimation

    Authors: Ambareesh Revanur, Zhihua Li, Umur A. Ciftci, Lijun Yin, Laszlo A. Jeni

    Abstract: Telehealth has the potential to offset the high demand for help during public health emergencies, such as the COVID-19 pandemic. Remote Photoplethysmography (rPPG) - the problem of non-invasively estimating blood volume variations in the microvascular tissue from video - would be well suited for these situations. Over the past few years a number of research groups have made rapid advances in remot…

    Submitted 21 September, 2021; originally announced September 2021.

    Comments: ICCVw'21. V4V Dataset and Challenge: https://vision4vitals.github.io/

  5. Semi-Supervised Visual Representation Learning for Fashion Compatibility

    Authors: Ambareesh Revanur, Vijay Kumar, Deepthi Sharma

    Abstract: We consider the problem of complementary fashion prediction. Existing approaches focus on learning an embedding space where fashion items from different categories that are visually compatible are closer to each other. However, creating such labeled outfits is intensive and also not feasible to generate all possible outfit combinations, especially with large fashion catalogs. In this work, we prop…

    Submitted 16 September, 2021; originally announced September 2021.

    Comments: ACM RecSys'21 (9 pages) DOI: https://dl.acm.org/doi/10.1145/3460231.3474233

  6. arXiv:2103.11169

    cs.LG cs.CV

    Your Classifier can Secretly Suffice Multi-Source Domain Adaptation

    Authors: Naveen Venkat, Jogendra Nath Kundu, Durgesh Kumar Singh, Ambareesh Revanur, R. Venkatesh Babu

    Abstract: Multi-Source Domain Adaptation (MSDA) deals with the transfer of task knowledge from multiple labeled source domains to an unlabeled target domain, under a domain-shift. Existing methods aim to minimize this domain-shift using auxiliary distribution alignment objectives. In this work, we present a different perspective to MSDA wherein deep models are observed to implicitly align the domains under…

    Submitted 20 March, 2021; originally announced March 2021.

    Comments: NeurIPS 2020. Project page: https://sites.google.com/view/simpal

  7. arXiv:2008.01389

    cs.LG cs.CV stat.ML

    Class-Incremental Domain Adaptation

    Authors: Jogendra Nath Kundu, Rahul Mysore Venkatesh, Naveen Venkat, Ambareesh Revanur, R. Venkatesh Babu

    Abstract: We introduce a practical Domain Adaptation (DA) paradigm called Class-Incremental Domain Adaptation (CIDA). Existing DA methods tackle domain-shift but are unsuitable for learning novel target-domain classes. Meanwhile, class-incremental (CI) methods enable learning of new classes in absence of source training data but fail under a domain-shift without labeled supervision. In this work, we effecti…

    Submitted 4 August, 2020; originally announced August 2020.

    Comments: ECCV 2020

  8. arXiv:2008.01388

    cs.CV

    Unsupervised Cross-Modal Alignment for Multi-Person 3D Pose Estimation

    Authors: Jogendra Nath Kundu, Ambareesh Revanur, Govind Vitthal Waghmare, Rahul Mysore Venkatesh, R. Venkatesh Babu

    Abstract: We present a deployment friendly, fast bottom-up framework for multi-person 3D human pose estimation. We adopt a novel neural representation of multi-person 3D pose which unifies the position of person instances with their corresponding 3D pose representation. This is realized by learning a generative pose embedding which not only ensures plausible 3D pose predictions, but also eliminates the usua…

    Submitted 4 August, 2020; originally announced August 2020.

    Comments: ECCV 2020

  9. arXiv:2004.04388

    cs.CV cs.LG

    Towards Inheritable Models for Open-Set Domain Adaptation

    Authors: Jogendra Nath Kundu, Naveen Venkat, Ambareesh Revanur, Rahul M V, R. Venkatesh Babu

    Abstract: There has been a tremendous progress in Domain Adaptation (DA) for visual recognition tasks. Particularly, open-set DA has gained considerable attention wherein the target domain contains additional unseen categories. Existing open-set DA approaches demand access to a labeled source dataset along with unlabeled target instances. However, this reliance on co-existing source and target data is highl…

    Submitted 9 April, 2020; originally announced April 2020.

    Comments: CVPR 2020 (Oral). Code available at https://github.com/val-iisc/inheritune