Skip to main content

Showing 1–4 of 4 results for author: Palanisamy, K

.
  1. arXiv:2307.03073  [pdf, other

    cs.CV cs.RO

    Proto-CLIP: Vision-Language Prototypical Network for Few-Shot Learning

    Authors: Jishnu Jaykumar P, Kamalesh Palanisamy, Yu-Wei Chao, Xinya Du, Yu Xiang

    Abstract: We propose a novel framework for few-shot learning by leveraging large-scale vision-language models such as CLIP. Motivated by unimodal prototypical networks for few-shot learning, we introduce Proto-CLIP which utilizes image prototypes and text prototypes for few-shot learning. Specifically, Proto-CLIP adapts the image and text encoder embeddings from CLIP in a joint fashion using few-shot exampl… ▽ More

    Submitted 14 July, 2024; v1 submitted 6 July, 2023; originally announced July 2023.

    Comments: Accepted at 2024 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS)

  2. arXiv:2302.03793  [pdf, other

    cs.RO cs.CV cs.LG

    Self-Supervised Unseen Object Instance Segmentation via Long-Term Robot Interaction

    Authors: Yangxiao Lu, Ninad Khargonkar, Zesheng Xu, Charles Averill, Kamalesh Palanisamy, Kaiyu Hang, Yunhui Guo, Nicholas Ruozzi, Yu Xiang

    Abstract: We introduce a novel robotic system for improving unseen object instance segmentation in the real world by leveraging long-term robot interaction with objects. Previous approaches either grasp or push an object and then obtain the segmentation mask of the grasped or pushed object after one action. Instead, our system defers the decision on segmenting objects after a sequence of robot pushing actio… ▽ More

    Submitted 7 February, 2023; originally announced February 2023.

    Comments: 11 pages, 7 figures, 5 tables

  3. arXiv:2011.04232  [pdf, other

    cs.LG cs.CR stat.ML

    SplitEasy: A Practical Approach for Training ML models on Mobile Devices

    Authors: Kamalesh Palanisamy, Vivek Khimani, Moin Hussain Moti, Dimitris Chatzopoulos

    Abstract: Modern mobile devices, although resourceful, cannot train state-of-the-art machine learning models without the assistance of servers, which require access to, potentially, privacy-sensitive user data. Split learning has recently emerged as a promising technique for training complex deep learning (DL) models on low-powered mobile devices. The core idea behind this technique is to train the sensitiv… ▽ More

    Submitted 29 January, 2021; v1 submitted 9 November, 2020; originally announced November 2020.

    Comments: 7 pages, 4 figures, Accepted at the ACM HotMobile workshop

  4. arXiv:2007.11154  [pdf, other

    cs.CV cs.SD eess.AS

    Rethinking CNN Models for Audio Classification

    Authors: Kamalesh Palanisamy, Dipika Singhania, Angela Yao

    Abstract: In this paper, we show that ImageNet-Pretrained standard deep CNN models can be used as strong baseline networks for audio classification. Even though there is a significant difference between audio Spectrogram and standard ImageNet image samples, transfer learning assumptions still hold firmly. To understand what enables the ImageNet pretrained models to learn useful audio representations, we sys… ▽ More

    Submitted 13 November, 2020; v1 submitted 21 July, 2020; originally announced July 2020.

    Comments: 8 pages, 3 figures