Skip to main content

Showing 1–16 of 16 results for author: Nasrollahi, K

Searching in archive cs. Search in all archives.
.
  1. arXiv:2504.14301  [pdf, other

    cs.CV cs.AI cs.MM

    Balancing Privacy and Action Performance: A Penalty-Driven Approach to Image Anonymization

    Authors: Nazia Aslam, Kamal Nasrollahi

    Abstract: The rapid development of video surveillance systems for object detection, tracking, activity recognition, and anomaly detection has revolutionized our day-to-day lives while setting alarms for privacy concerns. It isn't easy to strike a balance between visual privacy and action recognition performance in most computer vision models. Is it possible to safeguard privacy without sacrificing performan… ▽ More

    Submitted 19 April, 2025; originally announced April 2025.

    Comments: Accepted to CVPRW 2025

  2. arXiv:2503.19588  [pdf, ps, other

    cs.CV

    Video Anomaly Detection with Contours -- A Study

    Authors: Mia Siemon, Ivan Nikolov, Thomas B. Moeslund, Kamal Nasrollahi

    Abstract: In Pose-based Video Anomaly Detection prior art is rooted on the assumption that abnormal events can be mostly regarded as a result of uncommon human behavior. Opposed to utilizing skeleton representations of humans, however, we investigate the potential of learning recurrent motion patterns of normal human behavior using 2D contours. Keeping all advantages of pose-based methods, such as increased… ▽ More

    Submitted 25 March, 2025; originally announced March 2025.

  3. arXiv:2503.15166  [pdf, other

    cs.CV cs.AI cs.CL cs.LG cs.MM

    Machine Unlearning in Hyperbolic vs. Euclidean Multimodal Contrastive Learning: Adapting Alignment Calibration to MERU

    Authors: Àlex Pujol Vidal, Sergio Escalera, Kamal Nasrollahi, Thomas B. Moeslund

    Abstract: Machine unlearning methods have become increasingly important for selective concept removal in large pre-trained models. While recent work has explored unlearning in Euclidean contrastive vision-language models, the effectiveness of concept removal in hyperbolic spaces remains unexplored. This paper investigates machine unlearning in hyperbolic contrastive learning by adapting Alignment Calibratio… ▽ More

    Submitted 14 April, 2025; v1 submitted 19 March, 2025; originally announced March 2025.

    Comments: Preprint

  4. YOLO11-JDE: Fast and Accurate Multi-Object Tracking with Self-Supervised Re-ID

    Authors: Iñaki Erregue, Kamal Nasrollahi, Sergio Escalera

    Abstract: We introduce YOLO11-JDE, a fast and accurate multi-object tracking (MOT) solution that combines real-time object detection with self-supervised Re-Identification (Re-ID). By incorporating a dedicated Re-ID branch into YOLO11s, our model performs Joint Detection and Embedding (JDE), generating appearance features for each detection. The Re-ID branch is trained in a fully self-supervised setting whi… ▽ More

    Submitted 23 January, 2025; originally announced January 2025.

    Comments: This paper has been accepted to the 5th Workshop on Real-World Surveillance: Applications and Challenges (WACV 2025)

  5. arXiv:2411.13332  [pdf, other

    cs.LG cs.AI

    Verifying Machine Unlearning with Explainable AI

    Authors: Àlex Pujol Vidal, Anders S. Johansen, Mohammad N. S. Jahromi, Sergio Escalera, Kamal Nasrollahi, Thomas B. Moeslund

    Abstract: We investigate the effectiveness of Explainable AI (XAI) in verifying Machine Unlearning (MU) within the context of harbor front monitoring, focusing on data privacy and regulatory compliance. With the increasing need to adhere to privacy legislation such as the General Data Protection Regulation (GDPR), traditional methods of retraining ML models for data deletions prove impractical due to their… ▽ More

    Submitted 20 November, 2024; originally announced November 2024.

    Comments: ICPRW2024

  6. arXiv:2407.06000  [pdf, other

    cs.CV

    Bounding Boxes and Probabilistic Graphical Models: Video Anomaly Detection Simplified

    Authors: Mia Siemon, Thomas B. Moeslund, Barry Norton, Kamal Nasrollahi

    Abstract: In this study, we formulate the task of Video Anomaly Detection as a probabilistic analysis of object bounding boxes. We hypothesize that the representation of objects via their bounding boxes only, can be sufficient to successfully identify anomalous events in a scene. The implied value of this approach is increased object anonymization, faster model training and fewer computational resources. Th… ▽ More

    Submitted 8 November, 2024; v1 submitted 8 July, 2024; originally announced July 2024.

    Comments: Accepted for publication at GCPR 2024, after peer review. Use of this Accepted Version is subject to the publisher's Accepted Manuscript terms of use https://www.springer-nature.com/gp/open-research/policies/accepted-manuscript-terms. Code available on GitHub: https://github.com/milestonesys-research/VAD-with-PGMs/

  7. arXiv:2404.08088  [pdf, other

    cs.CV

    Visual Context-Aware Person Fall Detection

    Authors: Aleksander Nagaj, Zenjie Li, Dim P. Papadopoulos, Kamal Nasrollahi

    Abstract: As the global population ages, the number of fall-related incidents is on the rise. Effective fall detection systems, specifically in healthcare sector, are crucial to mitigate the risks associated with such events. This study evaluates the role of visual context, including background objects, on the accuracy of fall detection classifiers. We present a segmentation pipeline to semi-automatically s… ▽ More

    Submitted 11 April, 2024; originally announced April 2024.

    Comments: 10 pages, 6 figures, KES IDT-24 conference

  8. arXiv:2308.16572  [pdf, other

    cs.CV cs.AI cs.LG

    CL-MAE: Curriculum-Learned Masked Autoencoders

    Authors: Neelu Madan, Nicolae-Catalin Ristea, Kamal Nasrollahi, Thomas B. Moeslund, Radu Tudor Ionescu

    Abstract: Masked image modeling has been demonstrated as a powerful pretext task for generating robust representations that can be effectively generalized across multiple downstream tasks. Typically, this approach involves randomly masking patches (tokens) in input images, with the masking strategy remaining unchanged during training. In this paper, we propose a curriculum learning approach that updates the… ▽ More

    Submitted 28 February, 2024; v1 submitted 31 August, 2023; originally announced August 2023.

    Comments: Accepted at WACV 2024

  9. Self-Supervised Masked Convolutional Transformer Block for Anomaly Detection

    Authors: Neelu Madan, Nicolae-Catalin Ristea, Radu Tudor Ionescu, Kamal Nasrollahi, Fahad Shahbaz Khan, Thomas B. Moeslund, Mubarak Shah

    Abstract: Anomaly detection has recently gained increasing attention in the field of computer vision, likely due to its broad set of applications ranging from product fault detection on industrial production lines and impending event detection in video surveillance to finding lesions in medical scans. Regardless of the domain, anomaly detection is typically framed as a one-class classification task, where t… ▽ More

    Submitted 5 October, 2023; v1 submitted 25 September, 2022; originally announced September 2022.

    Comments: Accepted in IEEE Transactions on Pattern Analysis and Machine Intelligence

  10. arXiv:2207.08003  [pdf, other

    cs.CV cs.LG

    SSMTL++: Revisiting Self-Supervised Multi-Task Learning for Video Anomaly Detection

    Authors: Antonio Barbalau, Radu Tudor Ionescu, Mariana-Iuliana Georgescu, Jacob Dueholm, Bharathkumar Ramachandra, Kamal Nasrollahi, Fahad Shahbaz Khan, Thomas B. Moeslund, Mubarak Shah

    Abstract: A self-supervised multi-task learning (SSMTL) framework for video anomaly detection was recently introduced in literature. Due to its highly accurate results, the method attracted the attention of many researchers. In this work, we revisit the self-supervised multi-task learning framework, proposing several updates to the original method. First, we study various detection methods, e.g. based on de… ▽ More

    Submitted 12 February, 2023; v1 submitted 16 July, 2022; originally announced July 2022.

    Comments: Accepted in Computer Vision and Image Understanding

  11. Video Transformers: A Survey

    Authors: Javier Selva, Anders S. Johansen, Sergio Escalera, Kamal Nasrollahi, Thomas B. Moeslund, Albert Clapés

    Abstract: Transformer models have shown great success handling long-range interactions, making them a promising tool for modeling video. However, they lack inductive biases and scale quadratically with input length. These limitations are further exacerbated when dealing with the high dimensionality introduced by the temporal dimension. While there are surveys analyzing the advances of Transformers for visio… ▽ More

    Submitted 13 February, 2023; v1 submitted 16 January, 2022; originally announced January 2022.

  12. arXiv:2111.09099  [pdf, other

    cs.CV cs.LG

    Self-Supervised Predictive Convolutional Attentive Block for Anomaly Detection

    Authors: Nicolae-Catalin Ristea, Neelu Madan, Radu Tudor Ionescu, Kamal Nasrollahi, Fahad Shahbaz Khan, Thomas B. Moeslund, Mubarak Shah

    Abstract: Anomaly detection is commonly pursued as a one-class classification problem, where models can only learn from normal training samples, while being evaluated on both normal and abnormal test samples. Among the successful approaches for anomaly detection, a distinguished category of methods relies on predicting masked information (e.g. patches, future frames, etc.) and leveraging the reconstruction… ▽ More

    Submitted 14 March, 2022; v1 submitted 17 November, 2021; originally announced November 2021.

    Comments: Accepted at CVPR 2022. Paper + supplementary (14 pages, 9 figures)

  13. arXiv:2102.03113  [pdf, other

    cs.CV

    Real-World Super-Resolution of Face-Images from Surveillance Cameras

    Authors: Andreas Aakerberg, Kamal Nasrollahi, Thomas B. Moeslund

    Abstract: Most existing face image Super-Resolution (SR) methods assume that the Low-Resolution (LR) images were artificially downsampled from High-Resolution (HR) images with bicubic interpolation. This operation changes the natural image characteristics and reduces noise. Hence, SR methods trained on such data most often fail to produce good results when applied to real LR images. To solve this problem, w… ▽ More

    Submitted 5 February, 2021; originally announced February 2021.

  14. arXiv:2011.13367  [pdf, other

    cs.CV

    SoccerNet-v2: A Dataset and Benchmarks for Holistic Understanding of Broadcast Soccer Videos

    Authors: Adrien Deliège, Anthony Cioppa, Silvio Giancola, Meisam J. Seikavandi, Jacob V. Dueholm, Kamal Nasrollahi, Bernard Ghanem, Thomas B. Moeslund, Marc Van Droogenbroeck

    Abstract: Understanding broadcast videos is a challenging task in computer vision, as it requires generic reasoning capabilities to appreciate the content offered by the video editing. In this work, we propose SoccerNet-v2, a novel large-scale corpus of manual annotations for the SoccerNet video dataset, along with open challenges to encourage more research in soccer understanding and broadcast production.… ▽ More

    Submitted 19 April, 2021; v1 submitted 26 November, 2020; originally announced November 2020.

    Comments: Paper accepted for the CVsports workshop at CVPR2021. This document contains 8 pages + references + supplementary material

  15. arXiv:2004.01382  [pdf, other

    cs.CV cs.LG eess.IV

    Effective Fusion of Deep Multitasking Representations for Robust Visual Tracking

    Authors: Seyed Mojtaba Marvasti-Zadeh, Hossein Ghanei-Yakhdan, Shohreh Kasaei, Kamal Nasrollahi, Thomas B. Moeslund

    Abstract: Visual object tracking remains an active research field in computer vision due to persisting challenges with various problem-specific factors in real-world scenes. Many existing tracking methods based on discriminative correlation filters (DCFs) employ feature extraction networks (FENs) to model the target appearance during the learning process. However, using deep feature maps extracted from FENs… ▽ More

    Submitted 20 September, 2021; v1 submitted 3 April, 2020; originally announced April 2020.

    Comments: To be appeared in The Visual Computer (International Journal of Computer Graphics), Springer, 2021

  16. arXiv:1805.10078  [pdf

    cs.CV

    A Double-Deep Spatio-Angular Learning Framework for Light Field based Face Recognition

    Authors: Alireza Sepas-Moghaddam, Mohammad A. Haque, Paulo Lobato Correia, Kamal Nasrollahi, Thomas B. Moeslund, Fernando Pereira

    Abstract: Face recognition has attracted increasing attention due to its wide range of applications, but it is still challenging when facing large variations in the biometric data characteristics. Lenslet light field cameras have recently come into prominence to capture rich spatio-angular information, thus offering new possibilities for advanced biometric recognition systems. This paper proposes a double-d… ▽ More

    Submitted 24 April, 2019; v1 submitted 25 May, 2018; originally announced May 2018.

    Comments: Submitted to IEEE Transactions on Circuits and Systems for Video Technology