Skip to main content

Showing 1–10 of 10 results for author: Luthra, A

Searching in archive cs. Search in all archives.
.
  1. arXiv:2506.11774  [pdf, ps, other

    cs.CV cs.AI cs.HC

    Real-Time Feedback and Benchmark Dataset for Isometric Pose Evaluation

    Authors: Abhishek Jaiswal, Armeet Singh Luthra, Purav Jangir, Bhavya Garg, Nisheeth Srivastava

    Abstract: Isometric exercises appeal to individuals seeking convenience, privacy, and minimal dependence on equipments. However, such fitness training is often overdependent on unreliable digital media content instead of expert supervision, introducing serious risks, including incorrect posture, injury, and disengagement due to lack of corrective feedback. To address these challenges, we present a real-time… ▽ More

    Submitted 13 June, 2025; originally announced June 2025.

  2. arXiv:2506.04411  [pdf, ps, other

    cs.LG

    Self-Supervised Contrastive Learning is Approximately Supervised Contrastive Learning

    Authors: Achleshwar Luthra, Tianbao Yang, Tomer Galanti

    Abstract: Despite its empirical success, the theoretical foundations of self-supervised contrastive learning (CL) are not yet fully established. In this work, we address this gap by showing that standard CL objectives implicitly approximate a supervised variant we call the negatives-only supervised contrastive loss (NSCL), which excludes same-class contrasts. We prove that the gap between the CL and NSCL lo… ▽ More

    Submitted 4 June, 2025; originally announced June 2025.

  3. arXiv:2505.12495  [pdf, other

    cs.CL

    KG-QAGen: A Knowledge-Graph-Based Framework for Systematic Question Generation and Long-Context LLM Evaluation

    Authors: Nikita Tatarinov, Vidhyakshaya Kannan, Haricharana Srinivasa, Arnav Raj, Harpreet Singh Anand, Varun Singh, Aditya Luthra, Ravij Lade, Agam Shah, Sudheer Chava

    Abstract: The increasing context length of modern language models has created a need for evaluating their ability to retrieve and process information across extensive documents. While existing benchmarks test long-context capabilities, they often lack a structured way to systematically vary question complexity. We introduce KG-QAGen (Knowledge-Graph-based Question-Answer Generation), a framework that (1) ex… ▽ More

    Submitted 18 May, 2025; originally announced May 2025.

  4. arXiv:2405.06786  [pdf, other

    eess.IV cs.CV

    SAM3D: Zero-Shot Semi-Automatic Segmentation in 3D Medical Images with the Segment Anything Model

    Authors: Trevor J. Chan, Aarush Sahni, Yijin Fang, Jie Li, Alisha Luthra, Alison Pouch, Chamith S. Rajapakse

    Abstract: We introduce SAM3D, a new approach to semi-automatic zero-shot segmentation of 3D images building on the existing Segment Anything Model. We achieve fast and accurate segmentations in 3D images with a four-step strategy involving: user prompting with 3D polylines, volume slicing along multiple axes, slice-wide inference with a pretrained model, and recomposition and refinement in 3D. We evaluated… ▽ More

    Submitted 7 August, 2024; v1 submitted 10 May, 2024; originally announced May 2024.

  5. arXiv:2301.05434  [pdf, other

    cs.CV cs.LG eess.IV

    LVRNet: Lightweight Image Restoration for Aerial Images under Low Visibility

    Authors: Esha Pahwa, Achleshwar Luthra, Pratik Narang

    Abstract: Learning to recover clear images from images having a combination of degrading factors is a challenging task. That being said, autonomous surveillance in low visibility conditions caused by high pollution/smoke, poor air quality index, low light, atmospheric scattering, and haze during a blizzard becomes even more important to prevent accidents. It is thus crucial to form a solution that can resul… ▽ More

    Submitted 13 January, 2023; originally announced January 2023.

  6. arXiv:2212.03384  [pdf

    cs.CV

    DroneAttention: Sparse Weighted Temporal Attention for Drone-Camera Based Activity Recognition

    Authors: Santosh Kumar Yadav, Achleshwar Luthra, Esha Pahwa, Kamlesh Tiwari, Heena Rathore, Hari Mohan Pandey, Peter Corcoran

    Abstract: Human activity recognition (HAR) using drone-mounted cameras has attracted considerable interest from the computer vision research community in recent years. A robust and efficient HAR system has a pivotal role in fields like video surveillance, crowd behavior analysis, sports analysis, and human-computer interaction. What makes it challenging are the complex poses, understanding different viewpoi… ▽ More

    Submitted 6 December, 2022; originally announced December 2022.

    Comments: arXiv admin note: substantial text overlap with arXiv:2211.05531

  7. arXiv:2211.05531  [pdf

    cs.CV

    SWTF: Sparse Weighted Temporal Fusion for Drone-Based Activity Recognition

    Authors: Santosh Kumar Yadav, Esha Pahwa, Achleshwar Luthra, Kamlesh Tiwari, Hari Mohan Pandey, Peter Corcoran

    Abstract: Drone-camera based human activity recognition (HAR) has received significant attention from the computer vision research community in the past few years. A robust and efficient HAR system has a pivotal role in fields like video surveillance, crowd behavior analysis, sports analysis, and human-computer interaction. What makes it challenging are the complex poses, understanding different viewpoints,… ▽ More

    Submitted 10 November, 2022; originally announced November 2022.

  8. arXiv:2111.10971  [pdf, other

    cs.CV

    Tracking Grow-Finish Pigs Across Large Pens Using Multiple Cameras

    Authors: Aniket Shirke, Aziz Saifuddin, Achleshwar Luthra, Jiangong Li, Tawni Williams, Xiaodan Hu, Aneesh Kotnana, Okan Kocabalkanli, Narendra Ahuja, Angela Green-Miller, Isabella Condotta, Ryan N. Dilger, Matthew Caesar

    Abstract: Increasing demand for meat products combined with farm labor shortages has resulted in a need to develop new real-time solutions to monitor animals effectively. Significant progress has been made in continuously locating individual pigs using tracking-by-detection methods. However, these methods fail for oblong pens because a single fixed camera does not cover the entire floor at adequate resoluti… ▽ More

    Submitted 21 November, 2021; originally announced November 2021.

    Comments: 6 pages, 4 figures, Accepted at the CVPR 2021 CV4Animals workshop

  9. arXiv:2110.06199  [pdf, other

    cs.CV cs.AI cs.GR

    ABO: Dataset and Benchmarks for Real-World 3D Object Understanding

    Authors: Jasmine Collins, Shubham Goel, Kenan Deng, Achleshwar Luthra, Leon Xu, Erhan Gundogdu, Xi Zhang, Tomas F. Yago Vicente, Thomas Dideriksen, Himanshu Arora, Matthieu Guillaumin, Jitendra Malik

    Abstract: We introduce Amazon Berkeley Objects (ABO), a new large-scale dataset designed to help bridge the gap between real and virtual 3D worlds. ABO contains product catalog images, metadata, and artist-created 3D models with complex geometries and physically-based materials that correspond to real, household objects. We derive challenging benchmarks that exploit the unique properties of ABO and measure… ▽ More

    Submitted 24 June, 2022; v1 submitted 12 October, 2021; originally announced October 2021.

  10. arXiv:2109.08044  [pdf, other

    eess.IV cs.CV

    Eformer: Edge Enhancement based Transformer for Medical Image Denoising

    Authors: Achleshwar Luthra, Harsh Sulakhe, Tanish Mittal, Abhishek Iyer, Santosh Yadav

    Abstract: In this work, we present Eformer - Edge enhancement based transformer, a novel architecture that builds an encoder-decoder network using transformer blocks for medical image denoising. Non-overlapping window-based self-attention is used in the transformer block that reduces computational requirements. This work further incorporates learnable Sobel-Feldman operators to enhance edges in the image an… ▽ More

    Submitted 9 November, 2021; v1 submitted 16 September, 2021; originally announced September 2021.

    Comments: Accepted in ICCVW'2021