Skip to main content

Showing 1–50 of 56 results for author: Radhakrishnan, B

Searching in archive cs. Search in all archives.
.
  1. arXiv:2403.18301  [pdf, other

    cs.LG cs.AI cs.CV stat.ML

    Selective Mixup Fine-Tuning for Optimizing Non-Decomposable Objectives

    Authors: Shrinivas Ramasubramanian, Harsh Rangwani, Sho Takemori, Kunal Samanta, Yuhei Umeda, Venkatesh Babu Radhakrishnan

    Abstract: The rise in internet usage has led to the generation of massive amounts of data, resulting in the adoption of various supervised and semi-supervised machine learning algorithms, which can effectively utilize the colossal amount of data to train models. However, before deploying these models in the real world, these must be strictly evaluated on performance measures like worst-case recall and satis… ▽ More

    Submitted 27 March, 2024; originally announced March 2024.

    Comments: ICLR 2024 SpotLight

  2. arXiv:2401.03661  [pdf, other

    cs.CE

    GrainGNN: A dynamic graph neural network for predicting 3D grain microstructure

    Authors: Yigong Qin, Stephen DeWitt, Balasubramaniam Radhakrishnan, George Biros

    Abstract: We propose GrainGNN, a surrogate model for the evolution of polycrystalline grain structure under rapid solidification conditions in metal additive manufacturing. High fidelity simulations of solidification microstructures are typically performed using multicomponent partial differential equations (PDEs) with moving interfaces. The inherent randomness of the PDE initial conditions (grain seeds) ne… ▽ More

    Submitted 31 January, 2024; v1 submitted 7 January, 2024; originally announced January 2024.

  3. arXiv:2304.14738  [pdf, other

    cs.LG cs.CL cs.CV

    Cost-Sensitive Self-Training for Optimizing Non-Decomposable Metrics

    Authors: Harsh Rangwani, Shrinivas Ramasubramanian, Sho Takemori, Kato Takashi, Yuhei Umeda, Venkatesh Babu Radhakrishnan

    Abstract: Self-training based semi-supervised learning algorithms have enabled the learning of highly accurate deep neural networks, using only a fraction of labeled data. However, the majority of work on self-training has focused on the objective of improving accuracy, whereas practical machine learning systems can have complex goals (e.g. maximizing the minimum of recall across classes, etc.) that are non… ▽ More

    Submitted 28 April, 2023; originally announced April 2023.

    Comments: NeurIPS 2022. Code: https://github.com/val-iisc/CostSensitiveSelfTraining

  4. arXiv:2011.11226  [pdf, other

    cs.LG

    Detection and Classification of mental illnesses on social media using RoBERTa

    Authors: Ankit Murarka, Balaji Radhakrishnan, Sushma Ravichandran

    Abstract: Given the current social distancing regulations across the world, social media has become the primary mode of communication for most people. This has resulted in the isolation of many people suffering from mental illnesses who are unable to receive assistance in person. They have increasingly turned to social media to express themselves and to look for guidance in dealing with their illnesses. Kee… ▽ More

    Submitted 23 November, 2020; originally announced November 2020.

    Comments: 8 pages, 1 figure, 6 tables

  5. arXiv:2005.01939  [pdf, other

    cs.CV

    From Image Collections to Point Clouds with Self-supervised Shape and Pose Networks

    Authors: K L Navaneet, Ansu Mathew, Shashank Kashyap, Wei-Chih Hung, Varun Jampani, R. Venkatesh Babu

    Abstract: Reconstructing 3D models from 2D images is one of the fundamental problems in computer vision. In this work, we propose a deep learning technique for 3D object reconstruction from a single image. Contrary to recent works that either use 3D supervision or multi-view supervision, we use only single view images with no pose information during training as well. This makes our approach more practical r… ▽ More

    Submitted 5 May, 2020; originally announced May 2020.

    Comments: Accepted to CVPR 2020; Codes are available at https://github.com/val-iisc/ssl_3d_recon

  6. arXiv:2004.12771  [pdf, other

    cs.CV

    Adversarial Fooling Beyond "Flipping the Label"

    Authors: Konda Reddy Mopuri, Vaisakh Shaj, R. Venkatesh Babu

    Abstract: Recent advancements in CNNs have shown remarkable achievements in various CV/AI applications. Though CNNs show near human or better than human performance in many critical tasks, they are quite vulnerable to adversarial attacks. These attacks are potentially dangerous in real-life deployments. Though there have been many adversarial attacks proposed in recent years, there is no proper way of quant… ▽ More

    Submitted 27 April, 2020; originally announced April 2020.

    Comments: CVPR-AMLCV-2020

  7. arXiv:2004.04400  [pdf, other

    cs.CV

    Self-Supervised 3D Human Pose Estimation via Part Guided Novel Image Synthesis

    Authors: Jogendra Nath Kundu, Siddharth Seth, Varun Jampani, Mugalodi Rakesh, R. Venkatesh Babu, Anirban Chakraborty

    Abstract: Camera captured human pose is an outcome of several sources of variation. Performance of supervised 3D pose estimation approaches comes at the cost of dispensing with variations, such as shape and appearance, that may be useful for solving other related tasks. As a result, the learned model not only inculcates task-bias but also dataset-bias because of its strong reliance on the annotated samples,… ▽ More

    Submitted 9 April, 2020; originally announced April 2020.

    Comments: CVPR 2020 (Oral)

  8. arXiv:2004.04393  [pdf, other

    cs.CV cs.LG

    Universal Source-Free Domain Adaptation

    Authors: Jogendra Nath Kundu, Naveen Venkat, Rahul M V, R. Venkatesh Babu

    Abstract: There is a strong incentive to develop versatile learning techniques that can transfer the knowledge of class-separability from a labeled source domain to an unlabeled target domain in the presence of a domain-shift. Existing domain adaptation (DA) approaches are not equipped for practical DA scenarios as a result of their reliance on the knowledge of source-target label-set relationship (e.g. Clo… ▽ More

    Submitted 9 April, 2020; originally announced April 2020.

    Comments: CVPR 2020. Code available at https://github.com/val-iisc/usfda

  9. arXiv:2004.04388  [pdf, other

    cs.CV cs.LG

    Towards Inheritable Models for Open-Set Domain Adaptation

    Authors: Jogendra Nath Kundu, Naveen Venkat, Ambareesh Revanur, Rahul M V, R. Venkatesh Babu

    Abstract: There has been a tremendous progress in Domain Adaptation (DA) for visual recognition tasks. Particularly, open-set DA has gained considerable attention wherein the target domain contains additional unseen categories. Existing open-set DA approaches demand access to a labeled source dataset along with unlabeled target instances. However, this reliance on co-existing source and target data is highl… ▽ More

    Submitted 9 April, 2020; originally announced April 2020.

    Comments: CVPR 2020 (Oral). Code available at https://github.com/val-iisc/inheritune

  10. arXiv:2004.00306  [pdf, other

    cs.CV cs.CR cs.LG

    Towards Achieving Adversarial Robustness by Enforcing Feature Consistency Across Bit Planes

    Authors: Sravanti Addepalli, Vivek B. S., Arya Baburaj, Gaurang Sriramanan, R. Venkatesh Babu

    Abstract: As humans, we inherently perceive images based on their predominant features, and ignore noise embedded within lower bit planes. On the contrary, Deep Neural Networks are known to confidently misclassify images corrupted with meticulously crafted perturbations that are nearly imperceptible to the human eye. In this work, we attempt to address this problem by training networks to form coarse impres… ▽ More

    Submitted 1 April, 2020; originally announced April 2020.

    Comments: CVPR 2020

  11. arXiv:1908.03919  [pdf, other

    cs.CV cs.LG

    GAN-Tree: An Incrementally Learned Hierarchical Generative Framework for Multi-Modal Data Distributions

    Authors: Jogendra Nath Kundu, Maharshi Gor, Dakshit Agrawal, R. Venkatesh Babu

    Abstract: Despite the remarkable success of generative adversarial networks, their performance seems less impressive for diverse training sets, requiring learning of discontinuous mapping functions. Though multi-mode prior or multi-generator models have been proposed to alleviate this problem, such approaches may fail depending on the empirically chosen initial mode components. In contrast to such bottom-up… ▽ More

    Submitted 16 September, 2019; v1 submitted 11 August, 2019; originally announced August 2019.

    Comments: ICCV 2019 (code available at https://github.com/val-iisc/GANTree)

  12. arXiv:1908.03884  [pdf, other

    cs.CV

    UM-Adapt: Unsupervised Multi-Task Adaptation Using Adversarial Cross-Task Distillation

    Authors: Jogendra Nath Kundu, Nishank Lakkakula, R. Venkatesh Babu

    Abstract: Aiming towards human-level generalization, there is a need to explore adaptable representation learning methods with greater transferability. Most existing approaches independently address task-transferability and cross-domain adaptation, resulting in limited generalization. In this paper, we propose UM-Adapt - a unified framework to effectively perform unsupervised domain adaptation for spatially… ▽ More

    Submitted 16 September, 2019; v1 submitted 11 August, 2019; originally announced August 2019.

    Comments: ICCV 2019 (Oral)

  13. arXiv:1906.07538  [pdf, other

    cs.CV

    Locate, Size and Count: Accurately Resolving People in Dense Crowds via Detection

    Authors: Deepak Babu Sam, Skand Vishwanath Peri, Mukuntha Narayanan Sundararaman, Amogh Kamath, R. Venkatesh Babu

    Abstract: We introduce a detection framework for dense crowd counting and eliminate the need for the prevalent density regression paradigm. Typical counting models predict crowd density for an image as opposed to detecting every person. These regression methods, in general, fail to localize persons accurate enough for most applications other than counting. Hence, we adopt an architecture that locates every… ▽ More

    Submitted 15 February, 2020; v1 submitted 18 June, 2019; originally announced June 2019.

    Comments: Accepted in T-PAMI, 2020. Code available at : https://github.com/val-iisc/lsc-cnn

  14. arXiv:1905.11522  [pdf, other

    cs.CV

    Enhancing Salient Object Segmentation Through Attention

    Authors: Anuj Pahuja, Avishek Majumder, Anirban Chakraborty, R. Venkatesh Babu

    Abstract: Segmenting salient objects in an image is an important vision task with ubiquitous applications. The problem becomes more challenging in the presence of a cluttered and textured background, low resolution and/or low contrast images. Even though existing algorithms perform well in segmenting most of the object(s) of interest, they often end up segmenting false positives due to resembling salient ob… ▽ More

    Submitted 27 May, 2019; originally announced May 2019.

    Comments: CVPRW - Deep Vision 2019

  15. arXiv:1905.08114  [pdf, other

    cs.LG cs.CV stat.ML

    Zero-Shot Knowledge Distillation in Deep Networks

    Authors: Gaurav Kumar Nayak, Konda Reddy Mopuri, Vaisakh Shaj, R. Venkatesh Babu, Anirban Chakraborty

    Abstract: Knowledge distillation deals with the problem of training a smaller model (Student) from a high capacity source model (Teacher) so as to retain most of its performance. Existing approaches use either the training data or meta-data extracted from it in order to train the Student. However, accessing the dataset on which the Teacher has been trained may not always be feasible if the dataset is very l… ▽ More

    Submitted 20 May, 2019; originally announced May 2019.

    Comments: Accepted in ICML 2019, codes will be available at https://github.com/vcl-iisc/ZSKD

  16. arXiv:1901.08906  [pdf, other

    cs.CV

    Dense 3D Point Cloud Reconstruction Using a Deep Pyramid Network

    Authors: Priyanka Mandikal, R. Venkatesh Babu

    Abstract: Reconstructing a high-resolution 3D model of an object is a challenging task in computer vision. Designing scalable and light-weight architectures is crucial while addressing this problem. Existing point-cloud based reconstruction approaches directly predict the entire point cloud in a single stage. Although this technique can handle low-resolution point clouds, it is not a viable solution for gen… ▽ More

    Submitted 25 January, 2019; originally announced January 2019.

    Comments: WACV 2019

  17. arXiv:1812.02591  [pdf, other

    cs.CV

    BiHMP-GAN: Bidirectional 3D Human Motion Prediction GAN

    Authors: Jogendra Nath Kundu, Maharshi Gor, R. Venkatesh Babu

    Abstract: Human motion prediction model has applications in various fields of computer vision. Without taking into account the inherent stochasticity in the prediction of future pose dynamics, such methods often converges to a deterministic undesired mean of multiple probable outcomes. Devoid of this, we propose a novel probabilistic generative approach called Bidirectional Human motion prediction GAN, or B… ▽ More

    Submitted 6 December, 2018; originally announced December 2018.

    Comments: Accepted at AAAI 2019

  18. arXiv:1811.11731  [pdf, other

    cs.CV

    CAPNet: Continuous Approximation Projection For 3D Point Cloud Reconstruction Using 2D Supervision

    Authors: Navaneet K L, Priyanka Mandikal, Mayank Agarwal, R. Venkatesh Babu

    Abstract: Knowledge of 3D properties of objects is a necessity in order to build effective computer vision systems. However, lack of large scale 3D datasets can be a major constraint for data-driven approaches in learning such properties. We consider the task of single image 3D point cloud reconstruction, and aim to utilize multiple foreground masks as our supervisory data to alleviate the need for large sc… ▽ More

    Submitted 28 November, 2018; originally announced November 2018.

    Comments: Accepted at AAAI-2019; Codes are available at https://github.com/val-iisc/capnet

  19. arXiv:1810.00461  [pdf, other

    cs.CV

    3D-PSRNet: Part Segmented 3D Point Cloud Reconstruction From a Single Image

    Authors: Priyanka Mandikal, Navaneet K L, R. Venkatesh Babu

    Abstract: We propose a mechanism to reconstruct part annotated 3D point clouds of objects given just a single input image. We demonstrate that jointly training for both reconstruction and segmentation leads to improved performance in both the tasks, when compared to training for each task individually. The key idea is to propagate information from each task so as to aid the other during the training procedu… ▽ More

    Submitted 30 September, 2018; originally announced October 2018.

    Comments: Accepted at ECCV Workshop 2018. Codes are available at https://github.com/val-iisc/3d-psrnet

  20. arXiv:1809.00553  [pdf, other

    cs.CV

    Object Pose Estimation from Monocular Image using Multi-View Keypoint Correspondence

    Authors: Jogendra Nath Kundu, Rahul M. V., Aditya Ganeshan, R. Venkatesh Babu

    Abstract: Understanding the geometry and pose of objects in 2D images is a fundamental necessity for a wide range of real world applications. Driven by deep neural networks, recent methods have brought significant improvements to object pose estimation. However, they suffer due to scarcity of keypoint/pose-annotated real images and hence can not exploit the object's 3D structural information effectively. In… ▽ More

    Submitted 3 September, 2018; originally announced September 2018.

    Comments: Accepted in ECCV-W; Code available at this http url: https://github.com/val-iisc/pose_estimation

  21. arXiv:1808.01153  [pdf, other

    cs.CV cs.LG

    Ask, Acquire, and Attack: Data-free UAP Generation using Class Impressions

    Authors: Konda Reddy Mopuri, Phani Krishna Uppala, R. Venkatesh Babu

    Abstract: Deep learning models are susceptible to input specific noise, called adversarial perturbations. Moreover, there exist input-agnostic noise, called Universal Adversarial Perturbations (UAP) that can affect inference of the models over most input samples. Given a model, there exist broadly two approaches to craft UAPs: (i) data-driven: that require data, and (ii) data-free: that do not require data… ▽ More

    Submitted 3 August, 2018; originally announced August 2018.

    Comments: Accepted in ECCV 2018

  22. arXiv:1808.01134  [pdf, other

    cs.CV

    iSPA-Net: Iterative Semantic Pose Alignment Network

    Authors: Jogendra Nath Kundu, Aditya Ganeshan, Rahul M. V., Aditya Prakash, R. Venkatesh Babu

    Abstract: Understanding and extracting 3D information of objects from monocular 2D images is a fundamental problem in computer vision. In the task of 3D object pose estimation, recent data driven deep neural network based approaches suffer from scarcity of real images with 3D keypoint and pose annotations. Drawing inspiration from human cognition, where the annotators use a 3D CAD model as structural refere… ▽ More

    Submitted 3 August, 2018; originally announced August 2018.

    Comments: Accepted at ACMMM 2018. Code available at https://github.com/val-iisc/iSPA-Net

  23. arXiv:1807.07796  [pdf, other

    cs.CV

    3D-LMNet: Latent Embedding Matching for Accurate and Diverse 3D Point Cloud Reconstruction from a Single Image

    Authors: Priyanka Mandikal, K L Navaneet, Mayank Agarwal, R. Venkatesh Babu

    Abstract: 3D reconstruction from single view images is an ill-posed problem. Inferring the hidden regions from self-occluded images is both challenging and ambiguous. We propose a two-pronged approach to address these issues. To better incorporate the data prior and generate meaningful reconstructions, we propose 3D-LMNet, a latent embedding matching approach for 3D reconstruction. We first train a 3D point… ▽ More

    Submitted 26 March, 2019; v1 submitted 20 July, 2018; originally announced July 2018.

    Comments: Accepted at BMVC 2018; Codes are available at https://github.com/val-iisc/3d-lmnet

  24. arXiv:1807.07295  [pdf, other

    cs.CV

    Operator-in-the-Loop Deep Sequential Multi-camera Feature Fusion for Person Re-identification

    Authors: K L Navaneet, Ravi Kiran Sarvadevabhatla, Shashank Shekhar, R. Venkatesh Babu, Anirban Chakraborty

    Abstract: Given a target image as query, person re-identification systems retrieve a ranked list of candidate matches on a per-camera basis. In deployed systems, a human operator scans these lists and labels sighted targets by touch or mouse-based selection. However, classical re-id approaches generate per-camera lists independently. Therefore, target identifications by operator in a subset of cameras canno… ▽ More

    Submitted 5 December, 2019; v1 submitted 19 July, 2018; originally announced July 2018.

    Comments: Accepted at IEEE Transactions on Information Forensics & Security

  25. arXiv:1803.01599  [pdf, other

    cs.CV

    AdaDepth: Unsupervised Content Congruent Adaptation for Depth Estimation

    Authors: Jogendra Nath Kundu, Phani Krishna Uppala, Anuj Pahuja, R. Venkatesh Babu

    Abstract: Supervised deep learning methods have shown promising results for the task of monocular depth estimation; but acquiring ground truth is costly, and prone to noise as well as inaccuracies. While synthetic datasets have been used to circumvent above problems, the resultant models do not generalize well to natural scenes due to the inherent domain shift. Recent adversarial approaches for domain adapt… ▽ More

    Submitted 7 June, 2018; v1 submitted 5 March, 2018; originally announced March 2018.

    Comments: CVPR 2018

  26. arXiv:1801.09356  [pdf, other

    cs.CV cs.AI

    Game of Sketches: Deep Recurrent Models of Pictionary-style Word Guessing

    Authors: Ravi Kiran Sarvadevabhatla, Shiv Surya, Trisha Mittal, Venkatesh Babu Radhakrishnan

    Abstract: The ability of intelligent agents to play games in human-like fashion is popularly considered a benchmark of progress in Artificial Intelligence. Similarly, performance on multi-disciplinary tasks such as Visual Question Answering (VQA) is considered a marker for gauging progress in Computer Vision. In our work, we bring games and VQA together. Specifically, we introduce the first computational mo… ▽ More

    Submitted 28 January, 2018; originally announced January 2018.

    Comments: To be presented at AAAI-2018. Code, pre-trained models and dataset at github.com/val-iisc/sketchguess

  27. arXiv:1801.08092  [pdf, other

    cs.CV cs.AI cs.LG

    Generalizable Data-free Objective for Crafting Universal Adversarial Perturbations

    Authors: Konda Reddy Mopuri, Aditya Ganeshan, R. Venkatesh Babu

    Abstract: Machine learning models are susceptible to adversarial perturbations: small changes to input that can cause large changes in output. It is also demonstrated that there exist input-agnostic perturbations, called universal adversarial perturbations, which can change the inference of target model on most of the data samples. However, existing methods to craft universal perturbations are (i) task spec… ▽ More

    Submitted 24 July, 2018; v1 submitted 24 January, 2018; originally announced January 2018.

    Comments: TPAMI | Repository: https://github.com/val-iisc/GD-UAP

  28. arXiv:1712.07384  [pdf, other

    cs.CV

    DeepFuse: A Deep Unsupervised Approach for Exposure Fusion with Extreme Exposure Image Pairs

    Authors: K. Ram Prabhakar, V. Sai Srikar, R. Venkatesh Babu

    Abstract: We present a novel deep learning architecture for fusing static multi-exposure images. Current multi-exposure fusion (MEF) approaches use hand-crafted features to fuse input sequence. However, the weak hand-crafted representations are not robust to varying input conditions. Moreover, they perform poorly for extreme exposure image pairs. Thus, it is highly desirable to have a method that is robust… ▽ More

    Submitted 20 December, 2017; originally announced December 2017.

    Comments: ICCV 2017

  29. arXiv:1712.03390  [pdf, other

    cs.CV cs.AI cs.LG

    NAG: Network for Adversary Generation

    Authors: Konda Reddy Mopuri, Utkarsh Ojha, Utsav Garg, R. Venkatesh Babu

    Abstract: Adversarial perturbations can pose a serious threat for deploying machine learning systems. Recent works have shown existence of image-agnostic perturbations that can fool classifiers over most natural images. Existing methods present optimization approaches that solve for a fooling objective with an imperceptibility constraint to craft the perturbations. However, for a given classifier, they gene… ▽ More

    Submitted 28 March, 2018; v1 submitted 9 December, 2017; originally announced December 2017.

    Comments: CVPR 2018

  30. arXiv:1709.01295  [pdf, other

    cs.CV cs.GR cs.MM

    SketchParse : Towards Rich Descriptions for Poorly Drawn Sketches using Multi-Task Hierarchical Deep Networks

    Authors: Ravi Kiran Sarvadevabhatla, Isht Dwivedi, Abhijat Biswas, Sahil Manocha, R. Venkatesh Babu

    Abstract: The ability to semantically interpret hand-drawn line sketches, although very challenging, can pave way for novel applications in multimedia. We propose SketchParse, the first deep-network architecture for fully automatic parsing of freehand object sketches. SketchParse is configured as a two-level fully convolutional network. The first level contains shared layers common to all object categories.… ▽ More

    Submitted 5 September, 2017; originally announced September 2017.

    Comments: A shorter version of this submission was accepted at ACM Multimedia (ACMMM) 2017. Code, annotated datasets and pre-trained models available at https://github.com/val-iisc/sketch-parse

  31. arXiv:1708.06670  [pdf, other

    cs.CV

    CNN Fixations: An unraveling approach to visualize the discriminative image regions

    Authors: Konda Reddy Mopuri, Utsav Garg, R. Venkatesh Babu

    Abstract: Deep convolutional neural networks (CNN) have revolutionized various fields of vision research and have seen unprecedented adoption for multiple tasks such as classification, detection, captioning, etc. However, they offer little transparency into their inner workings and are often treated as black boxes that deliver excellent performance. In this work, we aim at alleviating this opaqueness of CNN… ▽ More

    Submitted 12 December, 2018; v1 submitted 22 August, 2017; originally announced August 2017.

    Comments: Accepted in Trans. on Image Processing (TIP) 2018 and Codes are available at https://github.com/utsavgarg/cnn-fixations

  32. arXiv:1708.00159  [pdf, other

    cs.CV

    Image Denoising via CNNs: An Adversarial Approach

    Authors: Nithish Divakar, R. Venkatesh Babu

    Abstract: Is it possible to recover an image from its noisy version using convolutional neural networks? This is an interesting problem as convolutional layers are generally used as feature detectors for tasks like classification, segmentation and object detection. We present a new CNN architecture for blind image denoising which synergically combines three architecture components, a multi-scale feature ext… ▽ More

    Submitted 1 August, 2017; originally announced August 2017.

  33. arXiv:1707.07013  [pdf, other

    cs.CV

    Confidence estimation in Deep Neural networks via density modelling

    Authors: Akshayvarun Subramanya, Suraj Srinivas, R. Venkatesh Babu

    Abstract: State-of-the-art Deep Neural Networks can be easily fooled into providing incorrect high-confidence predictions for images with small amounts of adversarial noise. Does this expose a flaw with deep neural networks, or do we simply need a better way to estimate confidence? In this paper we consider the problem of accurately estimating predictive confidence. We formulate this problem as that of dens… ▽ More

    Submitted 21 July, 2017; originally announced July 2017.

    Comments: ICME 2017

  34. arXiv:1707.05572  [pdf, other

    cs.CV

    Fast Feature Fool: A data independent approach to universal adversarial perturbations

    Authors: Konda Reddy Mopuri, Utsav Garg, R. Venkatesh Babu

    Abstract: State-of-the-art object recognition Convolutional Neural Networks (CNNs) are shown to be fooled by image agnostic perturbations, called universal adversarial perturbations. It is also observed that these perturbations generalize across multiple networks trained on the same target data. However, these algorithms require training data on which the CNNs were trained and compute adversarial perturbati… ▽ More

    Submitted 18 July, 2017; originally announced July 2017.

    Comments: BMVC 2017 and codes are available at https://github.com/utsavgarg/fast-feature-fool

  35. arXiv:1706.02071  [pdf, other

    cs.CV

    DeLiGAN : Generative Adversarial Networks for Diverse and Limited Data

    Authors: Swaminathan Gurumurthy, Ravi Kiran Sarvadevabhatla, Venkatesh Babu Radhakrishnan

    Abstract: A class of recent approaches for generating images, called Generative Adversarial Networks (GAN), have been used to generate impressively realistic images of objects, bedrooms, handwritten digits and a variety of other image modalities. However, typical GAN-based approaches require large amounts of training data to capture the diversity across the image modality. In this paper, we propose DeLiGAN… ▽ More

    Submitted 7 June, 2017; originally announced June 2017.

    Comments: Accepted at CVPR-2017. Code for training the GAN models and computing modified inception-scores can be found at https://github.com/val-iisc/deligan

  36. arXiv:1705.09142  [pdf, other

    cs.CV

    Deep image representations using caption generators

    Authors: Konda Reddy Mopuri, Vishal B. Athreya, R. Venkatesh Babu

    Abstract: Deep learning exploits large volumes of labeled data to learn powerful models. When the target dataset is small, it is a common practice to perform transfer learning using pre-trained models to learn new task specific representations. However, pre-trained CNNs for image recognition are provided with limited information about the image during training, which is label alone. Tasks such as scene retr… ▽ More

    Submitted 25 May, 2017; originally announced May 2017.

    Comments: ICME 2017

  37. Object category understanding via eye fixations on freehand sketches

    Authors: Ravi Kiran Sarvadevabhatla, Sudharshan Suresh, R. Venkatesh Babu

    Abstract: The study of eye gaze fixations on photographic images is an active research area. In contrast, the image subcategory of freehand sketches has not received as much attention for such studies. In this paper, we analyze the results of a free-viewing gaze fixation study conducted on 3904 freehand sketches distributed across 160 object categories. Our analysis shows that fixation sequences exhibit mar… ▽ More

    Submitted 19 March, 2017; originally announced March 2017.

    Comments: Accepted for publication in Transactions on Image Processing (http://ieeexplore.ieee.org/document/7866001/)

  38. arXiv:1611.07703  [pdf, other

    cs.CV

    'Part'ly first among equals: Semantic part-based benchmarking for state-of-the-art object recognition systems

    Authors: Ravi Kiran Sarvadevabhatla, Shanthakumar Venkatraman, R. Venkatesh Babu

    Abstract: An examination of object recognition challenge leaderboards (ILSVRC, PASCAL-VOC) reveals that the top-performing classifiers typically exhibit small differences amongst themselves in terms of error rate/mAP. To better differentiate the top performers, additional criteria are required. Moreover, the (test) images, on which the performance scores are based, predominantly contain fully visible object… ▽ More

    Submitted 24 November, 2016; v1 submitted 23 November, 2016; originally announced November 2016.

    Comments: Extended version of our ACCV-2016 paper. Author formatting modified

  39. arXiv:1611.06791  [pdf, other

    cs.LG cs.AI cs.CV cs.NE

    Generalized Dropout

    Authors: Suraj Srinivas, R. Venkatesh Babu

    Abstract: Deep Neural Networks often require good regularizers to generalize well. Dropout is one such regularizer that is widely used among Deep Learning practitioners. Recent work has shown that Dropout can also be viewed as performing Approximate Bayesian Inference over the network parameters. In this work, we generalize this notion and introduce a rich family of regularizers which we call Generalized Dr… ▽ More

    Submitted 21 November, 2016; originally announced November 2016.

  40. arXiv:1611.06694  [pdf, other

    cs.CV cs.LG

    Training Sparse Neural Networks

    Authors: Suraj Srinivas, Akshayvarun Subramanya, R. Venkatesh Babu

    Abstract: Deep neural networks with lots of parameters are typically used for large-scale computer vision tasks such as image classification. This is a result of using dense matrix multiplications and convolutions. However, sparse computations are known to be much more efficient. In this work, we train and build neural networks which implicitly use sparse computations. We introduce additional gate variables… ▽ More

    Submitted 21 November, 2016; originally announced November 2016.

  41. arXiv:1611.05744  [pdf, other

    cs.CV

    Compensating for Large In-Plane Rotations in Natural Images

    Authors: Lokesh Boominathan, Suraj Srinivas, R. Venkatesh Babu

    Abstract: Rotation invariance has been studied in the computer vision community primarily in the context of small in-plane rotations. This is usually achieved by building invariant image features. However, the problem of achieving invariance for large rotation angles remains largely unexplored. In this work, we tackle this problem by directly compensating for large rotations, as opposed to building invarian… ▽ More

    Submitted 17 November, 2016; originally announced November 2016.

    Comments: Accepted at Indian Conference on Computer Vision, Graphics and Image Processing (ICVGIP) 2016

  42. arXiv:1608.06197  [pdf, other

    cs.CV

    CrowdNet: A Deep Convolutional Network for Dense Crowd Counting

    Authors: Lokesh Boominathan, Srinivas S S Kruthiventi, R. Venkatesh Babu

    Abstract: Our work proposes a novel deep learning framework for estimating crowd density from static images of highly dense crowds. We use a combination of deep and shallow, fully convolutional networks to predict the density map for a given crowd image. Such a combination is used for effectively capturing both the high-level semantic information (face/body detectors) and the low-level features (blob detect… ▽ More

    Submitted 22 August, 2016; originally announced August 2016.

    Comments: Accepted at ACM Multimedia (MM) 2016

  43. arXiv:1608.03369  [pdf, other

    cs.CV

    Enabling My Robot To Play Pictionary : Recurrent Neural Networks For Sketch Recognition

    Authors: Ravi Kiran Sarvadevabhatla, Jogendra Kundu, Babu R. Venkatesh

    Abstract: Freehand sketching is an inherently sequential process. Yet, most approaches for hand-drawn sketch recognition either ignore this sequential aspect or exploit it in an ad-hoc manner. In our work, we propose a recurrent neural network architecture for sketch object recognition which exploits the long-term sequential and structural regularities in stroke data in a scalable manner. Specifically, we i… ▽ More

    Submitted 11 August, 2016; originally announced August 2016.

    Comments: Accepted at ACMMM 2016. Code and models at https://github.com/val-iisc/sketch-object-recognition

  44. arXiv:1607.08764  [pdf, other

    cs.CV

    SwiDeN : Convolutional Neural Networks For Depiction Invariant Object Recognition

    Authors: Ravi Kiran Sarvadevabhatla, Shiv Surya, Srinivas S S Kruthiventi, Venkatesh Babu R

    Abstract: Current state of the art object recognition architectures achieve impressive performance but are typically specialized for a single depictive style (e.g. photos only, sketches only). In this paper, we present SwiDeN : our Convolutional Neural Network (CNN) architecture which recognizes objects regardless of how they are visually depicted (line drawing, realistic shaded drawing, photograph etc.). I… ▽ More

    Submitted 29 July, 2016; originally announced July 2016.

    Comments: Accepted at ACMMM 2016. The first two authors contributed equally. Code and models at https://github.com/val-iisc/swiden

  45. arXiv:1601.06615  [pdf, other

    cs.CV cs.LG cs.MM

    A Taxonomy of Deep Convolutional Neural Nets for Computer Vision

    Authors: Suraj Srinivas, Ravi Kiran Sarvadevabhatla, Konda Reddy Mopuri, Nikita Prabhu, Srinivas S S Kruthiventi, R. Venkatesh Babu

    Abstract: Traditional architectures for solving computer vision problems and the degree of success they enjoyed have been heavily reliant on hand-crafted features. However, of late, deep learning techniques have offered a compelling alternative -- that of automatically learning problem-specific features. With this new paradigm, every problem in computer vision is now being re-examined from a deep learning p… ▽ More

    Submitted 25 January, 2016; originally announced January 2016.

    Comments: Published in Frontiers in Robotics and AI (http://goo.gl/6691Bm)

    Journal ref: Frontiers in Robotics and AI 2(36), January 2016

  46. arXiv:1511.05497  [pdf, other

    cs.LG cs.CV cs.NE

    Learning Neural Network Architectures using Backpropagation

    Authors: Suraj Srinivas, R. Venkatesh Babu

    Abstract: Deep neural networks with millions of parameters are at the heart of many state of the art machine learning models today. However, recent works have shown that models with much smaller number of parameters can also perform just as well. In this work, we introduce the problem of architecture-learning, i.e; learning the architecture of a neural network along with weights. We introduce a new trainabl… ▽ More

    Submitted 2 August, 2016; v1 submitted 17 November, 2015; originally announced November 2015.

    Comments: BMVC 2016 ; Title modified from 'Learning the Architecture of Deep Neural Networks'

  47. arXiv:1510.02927  [pdf, other

    cs.CV

    DeepFix: A Fully Convolutional Neural Network for predicting Human Eye Fixations

    Authors: Srinivas S. S. Kruthiventi, Kumar Ayush, R. Venkatesh Babu

    Abstract: Understanding and predicting the human visual attentional mechanism is an active area of research in the fields of neuroscience and computer vision. In this work, we propose DeepFix, a first-of-its-kind fully convolutional neural network for accurate saliency prediction. Unlike classical works which characterize the saliency map using various hand-crafted features, our model automatically learns f… ▽ More

    Submitted 10 October, 2015; originally announced October 2015.

  48. arXiv:1509.06658  [pdf, other

    cs.CV

    Attribute-Graph: A Graph based approach to Image Ranking

    Authors: Nikita Prabhu, R. Venkatesh Babu

    Abstract: We propose a novel image representation, termed Attribute-Graph, to rank images by their semantic similarity to a given query image. An Attribute-Graph is an undirected fully connected graph, incorporating both local and global image characteristics. The graph nodes characterise objects as well as the overall scene context using mid-level semantic attributes, while the edges capture the object top… ▽ More

    Submitted 8 October, 2015; v1 submitted 22 September, 2015; originally announced September 2015.

    Comments: In IEEE International Conference on Computer Vision (ICCV) 2015

  49. arXiv:1509.04399  [pdf, other

    cs.CV

    Analyzing structural characteristics of object category representations from their semantic-part distributions

    Authors: Ravi Kiran Sarvadevabhatla, Venkatesh Babu R

    Abstract: Studies from neuroscience show that part-mapping computations are employed by human visual system in the process of object recognition. In this work, we present an approach for analyzing semantic-part characteristics of object category representations. For our experiments, we use category-epitome, a recently proposed sketch-based spatial representation for objects. To enable part-importance analys… ▽ More

    Submitted 15 September, 2015; originally announced September 2015.

  50. arXiv:1507.06149  [pdf, other

    cs.CV

    Data-free parameter pruning for Deep Neural Networks

    Authors: Suraj Srinivas, R. Venkatesh Babu

    Abstract: Deep Neural nets (NNs) with millions of parameters are at the heart of many state-of-the-art computer vision systems today. However, recent works have shown that much smaller models can achieve similar levels of performance. In this work, we address the problem of pruning parameters in a trained NN model. Instead of removing individual weights one at a time as done in previous works, we remove one… ▽ More

    Submitted 22 July, 2015; originally announced July 2015.

    Comments: BMVC 2015