Skip to main content

Showing 1–22 of 22 results for author: Aghaei, M

Searching in archive cs. Search in all archives.
.
  1. arXiv:2505.03155  [pdf, ps, other

    cs.LG

    Rethinking the Global Convergence of Softmax Policy Gradient with Linear Function Approximation

    Authors: Max Qiushi Lin, Jincheng Mei, Matin Aghaei, Michael Lu, Bo Dai, Alekh Agarwal, Dale Schuurmans, Csaba Szepesvari, Sharan Vaswani

    Abstract: Policy gradient (PG) methods have played an essential role in the empirical successes of reinforcement learning. In order to handle large state-action spaces, PG methods are typically used with function approximation. In this setting, the approximation error in modeling problem-dependent quantities is a key notion for characterizing the global convergence of PG methods. We focus on Softmax PG with… ▽ More

    Submitted 6 May, 2025; originally announced May 2025.

    Comments: 75 pages

  2. arXiv:2502.14254  [pdf, ps, other

    cs.RO cs.AI

    Mem2Ego: Empowering Vision-Language Models with Global-to-Ego Memory for Long-Horizon Embodied Navigation

    Authors: Lingfeng Zhang, Yuecheng Liu, Zhanguang Zhang, Matin Aghaei, Yaochen Hu, Hongjian Gu, Mohammad Ali Alomrani, David Gamaliel Arcos Bravo, Raika Karimi, Atia Hamidizadeh, Haoping Xu, Guowei Huang, Zhanpeng Zhang, Tongtong Cao, Weichao Qiu, Xingyue Quan, Jianye Hao, Yuzheng Zhuang, Yingxue Zhang

    Abstract: Recent advancements in Large Language Models (LLMs) and Vision-Language Models (VLMs) have made them powerful tools in embodied navigation, enabling agents to leverage commonsense and spatial reasoning for efficient exploration in unfamiliar environments. Existing LLM-based approaches convert global memory, such as semantic or topological maps, into language descriptions to guide navigation. While… ▽ More

    Submitted 10 June, 2025; v1 submitted 19 February, 2025; originally announced February 2025.

  3. arXiv:2405.13136  [pdf, other

    cs.LG

    Towards Principled, Practical Policy Gradient for Bandits and Tabular MDPs

    Authors: Michael Lu, Matin Aghaei, Anant Raj, Sharan Vaswani

    Abstract: We consider (stochastic) softmax policy gradient (PG) methods for bandits and tabular Markov decision processes (MDPs). While the PG objective is non-concave, recent research has used the objective's smoothness and gradient domination properties to achieve convergence to an optimal policy. However, these theoretical results require setting the algorithm parameters according to unknown problem-depe… ▽ More

    Submitted 30 September, 2024; v1 submitted 21 May, 2024; originally announced May 2024.

    Comments: Accepted at RLC 2024

  4. arXiv:2306.11763  [pdf, other

    cs.CV

    Exploring the Effectiveness of Dataset Synthesis: An application of Apple Detection in Orchards

    Authors: Alexander van Meekeren, Maya Aghaei, Klaas Dijkstra

    Abstract: Deep object detection models have achieved notable successes in recent years, but one major obstacle remains: the requirement for a large amount of training data. Obtaining such data is a tedious process and is mainly time consuming, leading to the exploration of new research avenues like synthetic data generation techniques. In this study, we explore the usability of Stable Diffusion 2.1-base for… ▽ More

    Submitted 20 June, 2023; originally announced June 2023.

  5. arXiv:2306.09762  [pdf, other

    cs.CV

    The Big Data Myth: Using Diffusion Models for Dataset Generation to Train Deep Detection Models

    Authors: Roy Voetman, Maya Aghaei, Klaas Dijkstra

    Abstract: Despite the notable accomplishments of deep object detection models, a major challenge that persists is the requirement for extensive amounts of training data. The process of procuring such real-world data is a laborious undertaking, which has prompted researchers to explore new avenues of research, such as synthetic data generation techniques. This study presents a framework for the generation of… ▽ More

    Submitted 16 June, 2023; originally announced June 2023.

  6. arXiv:2207.01687  [pdf, other

    cs.CV

    Crime scene classification from skeletal trajectory analysis in surveillance settings

    Authors: Alina-Daniela Matei, Estefania Talavera, Maya Aghaei

    Abstract: Video anomaly analysis is a core task actively pursued in the field of computer vision, with applications extending to real-world crime detection in surveillance footage. In this work, we address the task of human-related crime classification. In our proposed approach, the human body in video frames, represented as skeletal joints trajectories, is used as the main source of exploration. First, we… ▽ More

    Submitted 4 July, 2022; originally announced July 2022.

  7. arXiv:2203.12350  [pdf, other

    cs.CV

    Hyper-Spectral Imaging for Overlapping Plastic Flakes Segmentation

    Authors: Guillem Martinez, Maya Aghaei, Martin Dijkstra, Bhalaji Nagarajan, Femke Jaarsma, Jaap van de Loosdrecht, Petia Radeva, Klaas Dijkstra

    Abstract: Given the hyper-spectral imaging unique potentials in grasping the polymer characteristics of different materials, it is commonly used in sorting procedures. In a practical plastic sorting scenario, multiple plastic flakes may overlap which depending on their characteristics, the overlap can be reflected in their spectral signature. In this work, we use hyper-spectral imaging for the segmentation… ▽ More

    Submitted 23 March, 2022; originally announced March 2022.

    Comments: Submitted to ICIP2022

  8. arXiv:2203.11209  [pdf, other

    cs.CV cs.AI cs.LG eess.IV

    On the Effect of Pre-Processing and Model Complexity for Plastic Analysis Using Short-Wave-Infrared Hyper-Spectral Imaging

    Authors: Klaas Dijkstra, Maya Aghaei, Femke Jaarsma, Martin Dijkstra, Rudy Folkersma, Jan Jager, Jaap van de Loosdrecht

    Abstract: The importance of plastic waste recycling is undeniable. In this respect, computer vision and deep learning enable solutions through the automated analysis of short-wave-infrared hyper-spectral images of plastics. In this paper, we offer an exhaustive empirical study to show the importance of efficient model selection for resolving the task of hyper-spectral image segmentation of various plastic f… ▽ More

    Submitted 21 March, 2022; originally announced March 2022.

  9. HR-Crime: Human-Related Anomaly Detection in Surveillance Videos

    Authors: Kayleigh Boekhoudt, Alina Matei, Maya Aghaei, Estefanía Talavera

    Abstract: The automatic detection of anomalies captured by surveillance settings is essential for speeding the otherwise laborious approach. To date, UCF-Crime is the largest available dataset for automatic visual analysis of anomalies and consists of real-world crime scenes of various categories. In this paper, we introduce HR-Crime, a subset of the UCF-Crime dataset suitable for human-related anomaly dete… ▽ More

    Submitted 31 July, 2021; originally announced August 2021.

    Comments: Accepted by CAIP 2021

  10. arXiv:2106.11098  [pdf, other

    cs.CV

    Obstacle Detection for BVLOS Drones

    Authors: Jan Moros Esteban, Jaap van de Loosdrecht, Maya Aghaei

    Abstract: With the introduction of new regulations in the European Union, the future of Beyond Visual Line Of Sight (BVLOS) drones is set to bloom. This led to the creation of the theBEAST project, which aims to create an autonomous security drone, with focus on those regulations and on safety. This technical paper describes the first steps of a module within this project, which revolves around detecting ob… ▽ More

    Submitted 22 June, 2021; v1 submitted 21 June, 2021; originally announced June 2021.

    Comments: 7 pages, 7 figures, Supervisors: Maya Aghaei Gavari and Jaap van de Loosdrecht

  11. arXiv:2011.02018  [pdf, other

    cs.CV

    Single Image Human Proxemics Estimation for Visual Social Distancing

    Authors: Maya Aghaei, Matteo Bustreo, Yiming Wang, Gianluca Bailo, Pietro Morerio, Alessio Del Bue

    Abstract: In this work, we address the problem of estimating the so-called "Social Distancing" given a single uncalibrated image in unconstrained scenarios. Our approach proposes a semi-automatic solution to approximate the homography matrix between the scene ground and image plane. With the estimated homography, we then leverage an off-the-shelf pose detector to detect body poses on the image and to reason… ▽ More

    Submitted 5 November, 2020; v1 submitted 3 November, 2020; originally announced November 2020.

    Comments: Paper accepted at WACV 2021 conference

  12. arXiv:2004.09374  [pdf, other

    cs.CV

    Complex-Object Visual Inspection via Multiple Lighting Configurations

    Authors: Maya Aghaei, Matteo Bustreo, Pietro Morerio, Nicolo Carissimi, Alessio Del Bue, Vittorio Murino

    Abstract: The design of an automatic visual inspection system is usually performed in two stages. While the first stage consists in selecting the most suitable hardware setup for highlighting most effectively the defects on the surface to be inspected, the second stage concerns the development of algorithmic solutions to exploit the potentials offered by the collected data. In this paper, first, we presen… ▽ More

    Submitted 20 April, 2020; originally announced April 2020.

    Comments: 8 pages, 7 figures, submitted to ICPR2020

  13. arXiv:1801.09103  [pdf, other

    cs.CV

    Understanding Deep Architectures by Visual Summaries

    Authors: Marco Carletti, Marco Godi, Maedeh Aghaei, Francesco Giuliari, Marco Cristani

    Abstract: In deep learning, visualization techniques extract the salient patterns exploited by deep networks for image classification, focusing on single images; no effort has been spent in investigating whether these patterns are systematically related to precise semantic entities over multiple images belonging to a same class, thus failing to capture the very understanding of the image class the network h… ▽ More

    Submitted 29 August, 2019; v1 submitted 27 January, 2018; originally announced January 2018.

    Comments: Project page and code available at http://marcocarletti.altervista.org/publications/understanding-visual-summaries/

  14. arXiv:1709.05775  [pdf, other

    cs.CV

    Social Style Characterization from Egocentric Photo-streams

    Authors: Maedeh Aghaei, Mariella Dimiccoli, Cristian Canton Ferrer, Petia Radeva

    Abstract: This paper proposes a system for automatic social pattern characterization using a wearable photo-camera. The proposed pipeline consists of three major steps. First, detection of people with whom the camera wearer interacts and, second, categorization of the detected social interactions into formal and informal. These two steps act at event-level where each potential social event is modeled as a m… ▽ More

    Submitted 18 September, 2017; originally announced September 2017.

    Comments: International Conference on Computer Vision (ICCV). Workshop on Egocentric Percetion, Interaction and Computing

  15. arXiv:1709.01424  [pdf, other

    cs.CV

    Towards social pattern characterization in egocentric photo-streams

    Authors: Maedeh Aghaei, Mariella Dimiccoli, Cristian Canton Ferrer, Petia Radeva

    Abstract: Following the increasingly popular trend of social interaction analysis in egocentric vision, this manuscript presents a comprehensive study for automatic social pattern characterization of a wearable photo-camera user, by relying on the visual analysis of egocentric photo-streams. The proposed framework consists of three major steps. The first step is to detect social interactions of the user whe… ▽ More

    Submitted 9 January, 2018; v1 submitted 5 September, 2017; originally announced September 2017.

    Comments: 42 pages, 14 figures. Submitted to Elsevier, Computer Vision and Image Understanding (Under Review)

  16. arXiv:1704.02809  [pdf, other

    cs.CV

    R-Clustering for Egocentric Video Segmentation

    Authors: Estefania Talavera, Mariella Dimiccoli, Marc Bolaños, Maedeh Aghaei, Petia Radeva

    Abstract: In this paper, we present a new method for egocentric video temporal segmentation based on integrating a statistical mean change detector and agglomerative clustering(AC) within an energy-minimization framework. Given the tendency of most AC methods to oversegment video sequences when clustering their frames, we combine the clustering with a concept drift detection technique (ADWIN) that has rigor… ▽ More

    Submitted 10 April, 2017; originally announced April 2017.

  17. arXiv:1704.02231  [pdf, other

    cs.CV

    Clothing and People - A Social Signal Processing Perspective

    Authors: Maedeh Aghaei, Federico Parezzan, Mariella Dimiccoli, Petia Radeva, Marco Cristani

    Abstract: In our society and century, clothing is not anymore used only as a means for body protection. Our paper builds upon the evidence, studied within the social sciences, that clothing brings a clear communicative message in terms of social signals, influencing the impression and behaviour of others towards a person. In fact, clothing correlates with personality traits, both in terms of self-assessment… ▽ More

    Submitted 7 April, 2017; originally announced April 2017.

    Comments: To appear in the 12th IEEE International Conference on Automatic Face and Gesture Recognition (FG 2017)

  18. arXiv:1703.01790  [pdf, other

    cs.CV

    All the people around me: face discovery in egocentric photo-streams

    Authors: Maedeh Aghaei, Mariella Dimiccoli, Petia Radeva

    Abstract: Given an unconstrained stream of images captured by a wearable photo-camera (2fpm), we propose an unsupervised bottom-up approach for automatic clustering appearing faces into the individual identities present in these data. The problem is challenging since images are acquired under real world conditions; hence the visible appearance of the people in the images undergoes intensive variations. Our… ▽ More

    Submitted 12 May, 2017; v1 submitted 6 March, 2017; originally announced March 2017.

    Comments: 5 pages, 3 figures, accepted in IEEE International Conference on Image Processing (ICIP 2017)

  19. arXiv:1701.05138  [pdf, other

    math.LO cs.LO

    Rejecting inadmissible rules in reduced normal forms in S4

    Authors: Mojtaba Aghaei, Maryam Rostami Giv

    Abstract: Several methods for checking admissibility of rules in the modal logic $S4$ are presented in [1], [15]. These methods determine admissibility of rules in $S4$, but they don't determine or give substitutions rejecting inadmissible rules. In this paper, we investigate some relations between one of the above methods, based on the reduced normal form rules, and sets of substitutions which reject them.… ▽ More

    Submitted 18 January, 2017; originally announced January 2017.

    MSC Class: 03B47; 03D15

  20. arXiv:1605.04129  [pdf, other

    cs.CV

    With Whom Do I Interact? Detecting Social Interactions in Egocentric Photo-streams

    Authors: Maedeh Aghaei, Mariella Dimiccoli, Petia Radeva

    Abstract: Given a user wearing a low frame rate wearable camera during a day, this work aims to automatically detect the moments when the user gets engaged into a social interaction solely by reviewing the automatically captured photos by the worn camera. The proposed method, inspired by the sociological concept of F-formation, exploits distance and orientation of the appearing individuals -with respect to… ▽ More

    Submitted 12 May, 2017; v1 submitted 13 May, 2016; originally announced May 2016.

    Comments: 6 pages, 9 figures, accepted and presented in International Conference on Pattern Recognition (ICPR 2016)

  21. SR-Clustering: Semantic Regularized Clustering for Egocentric Photo Streams Segmentation

    Authors: Mariella Dimiccoli, Marc Bolaños, Estefania Talavera, Maedeh Aghaei, Stavri G. Nikolov, Petia Radeva

    Abstract: While wearable cameras are becoming increasingly popular, locating relevant information in large unstructured collections of egocentric images is still a tedious and time consuming processes. This paper addresses the problem of organizing egocentric photo streams acquired by a wearable camera into semantically meaningful segments. First, contextual and semantic information is extracted for each im… ▽ More

    Submitted 17 October, 2016; v1 submitted 22 December, 2015; originally announced December 2015.

    Comments: 23 pages, 10 figures, 2 tables. In Press in Computer Vision and Image Understanding Journal

  22. Multi-Face Tracking by Extended Bag-of-Tracklets in Egocentric Videos

    Authors: Maedeh Aghaei, Mariella Dimiccoli, Petia Radeva

    Abstract: Wearable cameras offer a hands-free way to record egocentric images of daily experiences, where social events are of special interest. The first step towards detection of social events is to track the appearance of multiple persons involved in it. In this paper, we propose a novel method to find correspondences of multiple faces in low temporal resolution egocentric videos acquired through a weara… ▽ More

    Submitted 13 January, 2016; v1 submitted 16 July, 2015; originally announced July 2015.

    Comments: 27 pages, 18 figures, submitted to computer vision and image understanding journal

    Report number: YCVIU2393