Showing 1–8 of 8 results for author: Kamboj, A

Searching in archive cs.
  1. arXiv:2503.15352  [pdf, ps, other]

    cs.LG cs.AI cs.CV eess.SP

    Towards Achieving Perfect Multimodal Alignment

    Authors: Abhi Kamboj, Minh N. Do

    Abstract: Multimodal alignment constructs a joint latent vector space where modalities representing the same concept map to neighboring latent vectors. We formulate this as an inverse problem and show that, under certain conditions, paired data from each modality can map to equivalent latent vectors, which we refer to as perfect alignment. When perfect alignment cannot be achieved, it can be approximated us…

    Submitted 9 June, 2025; v1 submitted 19 March, 2025; originally announced March 2025.

  2. arXiv:2407.16803  [pdf, ps, other]

    cs.CV cs.AI cs.HC cs.LG eess.SP

    C3T: Cross-modal Transfer Through Time for Sensor-based Human Activity Recognition

    Authors: Abhi Kamboj, Anh Duy Nguyen, Minh N. Do

    Abstract: In order to unlock the potential of diverse sensors, we investigate a method to transfer knowledge between time-series modalities using a multimodal temporal representation space for Human Activity Recognition (HAR). Specifically, we explore the setting where the modality used in testing has no labeled data during training, which we refer to as Unsupervised Modality Adaptation (UMA). We c…

    Submitted 9 June, 2025; v1 submitted 23 July, 2024; originally announced July 2024.

  3. arXiv:2406.16784  [pdf, other]

    cs.CV cs.AI

    The Progression of Transformers from Language to Vision to MOT: A Literature Review on Multi-Object Tracking with Transformers

    Authors: Abhi Kamboj

    Abstract: The transformer neural network architecture allows for autoregressive sequence-to-sequence modeling through the use of attention layers. It was originally created with the application of machine translation but has revolutionized natural language processing. Recently, transformers have also been applied across a wide variety of pattern recognition tasks, particularly in computer vision. In this li…

    Submitted 24 June, 2024; originally announced June 2024.

    Comments: This report was written in November 2022, and may not contain more recent works since then

  4. arXiv:2406.11786  [pdf, other]

    cs.RO cs.AI cs.CV

    A Brief Survey on Leveraging Large Scale Vision Models for Enhanced Robot Grasping

    Authors: Abhi Kamboj, Katherine Driggs-Campbell

    Abstract: Robotic grasping presents a difficult motor task in real-world scenarios, constituting a major hurdle to the deployment of capable robots across various industries. Notably, the scarcity of data makes grasping particularly challenging for learned models. Recent advancements in computer vision have witnessed a growth of successful unsupervised training mechanisms predicated on massive amounts of da…

    Submitted 17 June, 2024; originally announced June 2024.

    Comments: This report was written in February 2023, thus does not account for any works since then

  5. arXiv:2403.15444  [pdf, other]

    eess.SP cs.AI cs.CV cs.LG eess.IV

    A Survey of IMU Based Cross-Modal Transfer Learning in Human Activity Recognition

    Authors: Abhi Kamboj, Minh Do

    Abstract: Despite living in a multi-sensory world, most AI models are limited to textual and visual understanding of human motion and behavior. In fact, full situational awareness of human motion could best be understood through a combination of sensors. In this survey we investigate how knowledge can be transferred and utilized amongst modalities for Human Activity/Action Recognition (HAR), i.e. cross-moda…

    Submitted 17 March, 2024; originally announced March 2024.

  6. arXiv:2208.10455  [pdf, other]

    cs.RO cs.CY cs.HC cs.SD eess.AS

    Examining Audio Communication Mechanisms for Supervising Fleets of Agricultural Robots

    Authors: Abhi Kamboj, Tianchen Ji, Katie Driggs-Campbell

    Abstract: Agriculture is facing a labor crisis, leading to increased interest in fleets of small, under-canopy robots (agbots) that can perform precise, targeted actions (e.g., crop scouting, weeding, fertilization), while being supervised by human operators remotely. However, farmers are not necessarily experts in robotics technology and will not adopt technologies that add to their workload or do not prov…

    Submitted 22 August, 2022; originally announced August 2022.

    Comments: Camera ready version for IEEE RO-MAN 2022

  7. Towards Modeling Human Motor Learning Dynamics in High-Dimensional Spaces

    Authors: Ankur Kamboj, Rajiv Ranganathan, Xiaobo Tan, Vaibhav Srivastava

    Abstract: Designing effective rehabilitation strategies for upper extremities, particularly hands and fingers, warrants the need for a computational model of human motor learning. The presence of large degrees of freedom (DoFs) available in these systems makes it difficult to balance the trade-off between learning the full dexterity and accomplishing manipulation goals. The motor learning literature argues…

    Submitted 26 March, 2022; v1 submitted 6 February, 2022; originally announced February 2022.

    Comments: accepted to "American Control Conference 2022"

  8. UESegNet: Context Aware Unconstrained ROI Segmentation Networks for Ear Biometric

    Authors: Aman Kamboj, Rajneesh Rani, Aditya Nigam, Ranjeet Ranjan Jha

    Abstract: Biometric-based personal authentication systems have seen a strong demand mainly due to the increasing concern in various privacy and security applications. Although the use of each biometric trait is problem dependent, the human ear has been found to have enough discriminating characteristics to allow its use as a strong biometric measure. To locate an ear in a 2D side face image is a challenging…

    Submitted 8 October, 2020; originally announced October 2020.