Skip to main content

Showing 1–27 of 27 results for author: Stathaki, T

Searching in archive cs. Search in all archives.
.
  1. arXiv:2502.13936  [pdf, other

    cs.CV cs.LG

    Image compositing is all you need for data augmentation

    Authors: Ang Jia Ning Shermaine, Michalis Lazarou, Tania Stathaki

    Abstract: This paper investigates the impact of various data augmentation techniques on the performance of object detection models. Specifically, we explore classical augmentation methods, image compositing, and advanced generative models such as Stable Diffusion XL and ControlNet. The objective of this work is to enhance model robustness and improve detection accuracy, particularly when working with limite… ▽ More

    Submitted 19 February, 2025; originally announced February 2025.

    Comments: Accepted in VISAPP 2025

  2. arXiv:2410.09474   

    cs.CV cs.AI cs.LG

    Distilling Invariant Representations with Dual Augmentation

    Authors: Nikolaos Giakoumoglou, Tania Stathaki

    Abstract: Knowledge distillation (KD) has been widely used to transfer knowledge from large, accurate models (teachers) to smaller, efficient ones (students). Recent methods have explored enforcing consistency by incorporating causal interpretations to distill invariant representations. In this work, we extend this line of research by introducing a dual augmentation strategy to promote invariant feature lea… ▽ More

    Submitted 20 December, 2024; v1 submitted 12 October, 2024; originally announced October 2024.

    Comments: Not completed work

    MSC Class: 68T07 ACM Class: I.4; I.2

  3. arXiv:2410.02401  [pdf, other

    cs.CV cs.AI

    SynCo: Synthetic Hard Negatives for Contrastive Visual Representation Learning

    Authors: Nikolaos Giakoumoglou, Tania Stathaki

    Abstract: Contrastive learning has become a dominant approach in self-supervised visual representation learning, but efficiently leveraging hard negatives, which are samples closely resembling the anchor, remains challenging. We introduce SynCo (Synthetic negatives in Contrastive learning), a novel approach that improves model performance by generating synthetic hard negatives on the representation space. B… ▽ More

    Submitted 17 February, 2025; v1 submitted 3 October, 2024; originally announced October 2024.

    Comments: Preprint. Code: https://github.com/giakoumoglou/synco, Supplementary: https://giakoumoglou.com/src/synco_suppl.pdf

    MSC Class: I.4; I.2

  4. arXiv:2408.16315   

    cs.HC cs.LG eess.SP

    Passenger hazard perception based on EEG signals for highly automated driving vehicles

    Authors: Ashton Yu Xuan Tan, Yingkai Yang, Xiaofei Zhang, Bowen Li, Xiaorong Gao, Sifa Zheng, Jianqiang Wang, Xinyu Gu, Jun Li, Yang Zhao, Yuxin Zhang, Tania Stathaki

    Abstract: Enhancing the safety of autonomous vehicles is crucial, especially given recent accidents involving automated systems. As passengers in these vehicles, humans' sensory perception and decision-making can be integrated with autonomous systems to improve safety. This study explores neural mechanisms in passenger-vehicle interactions, leading to the development of a Passenger Cognitive Model (PCM) and… ▽ More

    Submitted 27 March, 2025; v1 submitted 29 August, 2024; originally announced August 2024.

    Comments: We have decided to withdraw this submission due to ongoing revisions and further refinements in our research. A revised version may be resubmitted in the future. We appreciate the feedback and interest from the community

  5. arXiv:2408.13646  [pdf, other

    cs.CV

    Mean Height Aided Post-Processing for Pedestrian Detection

    Authors: Jing Yuan, Tania Stathaki, Guangyu Ren

    Abstract: The design of pedestrian detectors seldom considers the unique characteristics of this task and usually follows the common strategies for general object detection. To explore the potential of these characteristics, we take the perspective effect in pedestrian datasets as an example and propose the mean height aided suppression for post-processing. This method rejects predictions that fall at level… ▽ More

    Submitted 24 August, 2024; originally announced August 2024.

  6. arXiv:2408.13639  [pdf, other

    cs.CV

    Size Aware Cross-shape Scribble Supervision for Medical Image Segmentation

    Authors: Jing Yuan, Tania Stathaki

    Abstract: Scribble supervision, a common form of weakly supervised learning, involves annotating pixels using hand-drawn curve lines, which helps reduce the cost of manual labelling. This technique has been widely used in medical image segmentation tasks to fasten network training. However, scribble supervision has limitations in terms of annotation consistency across samples and the availability of compreh… ▽ More

    Submitted 24 August, 2024; originally announced August 2024.

  7. arXiv:2407.12073  [pdf, other

    cs.CV cs.AI

    Relational Representation Distillation

    Authors: Nikolaos Giakoumoglou, Tania Stathaki

    Abstract: Knowledge distillation involves transferring knowledge from large, cumbersome teacher models to more compact student models. The standard approach minimizes the Kullback-Leibler (KL) divergence between the probabilistic outputs of a teacher and student network. However, this approach fails to capture important structural relationships in the teacher's internal representations. Recent advances have… ▽ More

    Submitted 12 May, 2025; v1 submitted 16 July, 2024; originally announced July 2024.

    Comments: Preprint. Code: https://github.com/giakoumoglou/distillers, Supplementary: https://giakoumoglou.com/src/rrd_suppl.pdf

    MSC Class: 68T07 ACM Class: I.4; I.2

  8. arXiv:2407.11802  [pdf, other

    cs.CV cs.AI

    Discriminative and Consistent Representation Distillation

    Authors: Nikolaos Giakoumoglou, Tania Stathaki

    Abstract: Knowledge Distillation (KD) aims to transfer knowledge from a large teacher model to a smaller student model. While contrastive learning has shown promise in self-supervised learning by creating discriminative representations, its application in knowledge distillation remains limited and focuses primarily on discrimination, neglecting the structural relationships captured by the teacher model. To… ▽ More

    Submitted 12 May, 2025; v1 submitted 16 July, 2024; originally announced July 2024.

    Comments: Preprint. Code: https://github.com/giakoumoglou/distillers, Supplementary: https://giakoumoglou.com/src/dcd_suppl.pdf

    MSC Class: 68T07 ACM Class: I.4; I.2

  9. arXiv:2405.17446  [pdf, other

    eess.IV cs.CV

    Comparing ImageNet Pre-training with Digital Pathology Foundation Models for Whole Slide Image-Based Survival Analysis

    Authors: Kleanthis Marios Papadopoulos, Tania Stathaki

    Abstract: The abundance of information present in Whole Slide Images (WSIs) renders them an essential tool for survival analysis. Several Multiple Instance Learning frameworks proposed for this task utilize a ResNet50 backbone pre-trained on natural images. By leveraging recenetly released histopathological foundation models such as UNI and Hibou, the predictive prowess of existing MIL networks can be enhan… ▽ More

    Submitted 6 December, 2024; v1 submitted 20 May, 2024; originally announced May 2024.

  10. arXiv:2405.06828  [pdf, other

    cs.CV

    G-FARS: Gradient-Field-based Auto-Regressive Sampling for 3D Part Grouping

    Authors: Junfeng Cheng, Tania Stathaki

    Abstract: This paper proposes a novel task named "3D part grouping". Suppose there is a mixed set containing scattered parts from various shapes. This task requires algorithms to find out every possible combination among all the parts. To address this challenge, we propose the so called Gradient Field-based Auto-Regressive Sampling framework (G-FARS) tailored specifically for the 3D part grouping task. In o… ▽ More

    Submitted 10 May, 2024; originally announced May 2024.

    Comments: CVPR 2024

  11. arXiv:2405.04969  [pdf, other

    cs.CV cs.AI

    A Review on Discriminative Self-supervised Learning Methods in Computer Vision

    Authors: Nikolaos Giakoumoglou, Tania Stathaki, Athanasios Gkelias

    Abstract: Self-supervised learning (SSL) has rapidly emerged as a transformative approach in computer vision, enabling the extraction of rich feature representations from vast amounts of unlabeled data and reducing reliance on costly manual annotations. This review presents a comprehensive analysis of discriminative SSL methods, which focus on learning representations by solving pretext tasks that do not re… ▽ More

    Submitted 16 May, 2025; v1 submitted 8 May, 2024; originally announced May 2024.

    Comments: Preprint. 97 pages, 12 figures, 16 tables

  12. arXiv:2403.15152  [pdf, other

    cs.CV

    A Multimodal Approach for Cross-Domain Image Retrieval

    Authors: Lucas Iijima, Nikolaos Giakoumoglou, Tania Stathaki

    Abstract: Cross-Domain Image Retrieval (CDIR) is a challenging task in computer vision, aiming to match images across different visual domains such as sketches, paintings, and photographs. Traditional approaches focus on visual image features and rely heavily on supervised learning with labeled data and cross-domain correspondences, which leads to an often struggle with the significant domain gap. This pape… ▽ More

    Submitted 5 October, 2024; v1 submitted 22 March, 2024; originally announced March 2024.

  13. arXiv:2403.13429  [pdf, other

    q-fin.TR cs.CE cs.LG q-fin.CP q-fin.GN

    Detecting and Triaging Spoofing using Temporal Convolutional Networks

    Authors: Kaushalya Kularatnam, Tania Stathaki

    Abstract: As algorithmic trading and electronic markets continue to transform the landscape of financial markets, detecting and deterring rogue agents to maintain a fair and efficient marketplace is crucial. The explosion of large datasets and the continually changing tricks of the trade make it difficult to adapt to new market conditions and detect bad actors. To that end, we propose a framework that can b… ▽ More

    Submitted 20 March, 2024; originally announced March 2024.

    Journal ref: AAAI 2024 Workshop on AI in Finance for Social Impact

  14. arXiv:2401.07028  [pdf, other

    cs.CV

    Image edge enhancement for effective image classification

    Authors: Tianhao Bu, Michalis Lazarou, Tania Stathaki

    Abstract: Image classification has been a popular task due to its feasibility in real-world applications. Training neural networks by feeding them RGB images has demonstrated success over it. Nevertheless, improving the classification accuracy and computational efficiency of this process continues to present challenges that researchers are actively addressing. A widely popular embraced method to improve the… ▽ More

    Submitted 13 January, 2024; originally announced January 2024.

    Comments: Accepted at VISIGRAPP: VISAPP2024

  15. arXiv:2310.19996  [pdf, other

    cs.CV

    Adaptive Anchor Label Propagation for Transductive Few-Shot Learning

    Authors: Michalis Lazarou, Yannis Avrithis, Guangyu Ren, Tania Stathaki

    Abstract: Few-shot learning addresses the issue of classifying images using limited labeled data. Exploiting unlabeled data through the use of transductive inference methods such as label propagation has been shown to improve the performance of few-shot learning significantly. Label propagation infers pseudo-labels for unlabeled data by utilizing a constructed graph that exploits the underlying manifold str… ▽ More

    Submitted 30 October, 2023; originally announced October 2023.

    Comments: published in ICIP 2023

  16. arXiv:2304.14281  [pdf, other

    cs.CV

    Adaptive manifold for imbalanced transductive few-shot learning

    Authors: Michalis Lazarou, Yannis Avrithis, Tania Stathaki

    Abstract: Transductive few-shot learning algorithms have showed substantially superior performance over their inductive counterparts by leveraging the unlabeled queries. However, the vast majority of such methods are evaluated on perfectly class-balanced benchmarks. It has been shown that they undergo remarkable drop in performance under a more realistic, imbalanced setting. To this end, we propose a novel… ▽ More

    Submitted 27 April, 2023; originally announced April 2023.

  17. arXiv:2211.11847  [pdf, other

    cs.CV

    Towards Automated Polyp Segmentation Using Weakly- and Semi-Supervised Learning and Deformable Transformers

    Authors: Guangyu Ren, Michalis Lazarou, Jing Yuan, Tania Stathaki

    Abstract: Polyp segmentation is a crucial step towards computer-aided diagnosis of colorectal cancer. However, most of the polyp segmentation methods require pixel-wise annotated datasets. Annotated datasets are tedious and time-consuming to produce, especially for physicians who must dedicate their time to their patients. We tackle this issue by proposing a novel framework that can be trained using only we… ▽ More

    Submitted 21 November, 2022; originally announced November 2022.

  18. arXiv:2106.09517  [pdf, other

    cs.CV

    Dynamic Knowledge Distillation With Noise Elimination for RGB-D Salient Object Detection

    Authors: Guangyu Ren, Yinxiao Yu, Hengyan Liu, Tania Stathaki

    Abstract: RGB-D salient object detection (SOD) demonstrates its superiority on detecting in complex environments due to the additional depth information introduced in the data. Inevitably, an independent stream is introduced to extract features from depth images, leading to extra computation and parameters. This methodology sacrifices the model size to improve the detection accuracy which may impede the pra… ▽ More

    Submitted 2 June, 2022; v1 submitted 17 June, 2021; originally announced June 2021.

  19. arXiv:2106.05321  [pdf, other

    cs.CV

    Tensor feature hallucination for few-shot learning

    Authors: Michalis Lazarou, Tania Stathaki, Yannis Avrithis

    Abstract: Few-shot learning addresses the challenge of learning how to address novel tasks given not just limited supervision but limited data as well. An attractive solution is synthetic data generation. However, most such methods are overly sophisticated, focusing on high-quality, realistic data in the input space. It is unclear whether adapting them to the few-shot regime and using them for the downstrea… ▽ More

    Submitted 4 January, 2022; v1 submitted 9 June, 2021; originally announced June 2021.

    Comments: Accepted at WACV 2022. arXiv admin note: text overlap with arXiv:2104.09467

  20. arXiv:2106.03941  [pdf, other

    cs.CV

    Progressive Multi-scale Fusion Network for RGB-D Salient Object Detection

    Authors: Guangyu Ren, Yanchu Xie, Tianhong Dai, Tania Stathaki

    Abstract: Salient object detection(SOD) aims at locating the most significant object within a given image. In recent years, great progress has been made in applying SOD on many vision tasks. The depth map could provide additional spatial prior and boundary cues to boost the performance. Combining the depth information with image data obtained from standard visual cameras has been widely used in recent SOD w… ▽ More

    Submitted 7 June, 2021; originally announced June 2021.

  21. arXiv:2104.09467  [pdf, ps, other

    cs.CV

    Few-shot learning via tensor hallucination

    Authors: Michalis Lazarou, Yannis Avrithis, Tania Stathaki

    Abstract: Few-shot classification addresses the challenge of classifying examples given only limited labeled data. A powerful approach is to go beyond data augmentation, towards data synthesis. However, most of data augmentation/synthesis methods for few-shot classification are overly complex and sophisticated, e.g. training a wGAN with multiple regularizers or training a network to transfer latent diversit… ▽ More

    Submitted 19 April, 2021; originally announced April 2021.

    Comments: Accepted as oral at ICLR2021 workshop: "Synthetic Data Generation: Quality, Privacy, Bias"

  22. arXiv:2101.03923  [pdf, other

    cs.CV

    A novel shape matching descriptor for real-time hand gesture recognition

    Authors: Michalis Lazarou, Bo Li, Tania Stathaki

    Abstract: The current state-of-the-art hand gesture recognition methodologies heavily rely in the use of machine learning. However there are scenarios that machine learning cannot be applied successfully, for example in situations where data is scarce. This is the case when one-to-one matching is required between a query and a dataset of hand gestures where each gesture represents a unique class. In situati… ▽ More

    Submitted 10 March, 2021; v1 submitted 11 January, 2021; originally announced January 2021.

  23. arXiv:2012.07962  [pdf, other

    cs.LG cs.AI cs.CV

    Iterative label cleaning for transductive and semi-supervised few-shot learning

    Authors: Michalis Lazarou, Tania Stathaki, Yannis Avrithis

    Abstract: Few-shot learning amounts to learning representations and acquiring knowledge such that novel tasks may be solved with both supervision and data being limited. Improved performance is possible by transductive inference, where the entire test set is available concurrently, and semi-supervised learning, where more unlabeled data is available. Focusing on these two settings, we introduce a new algori… ▽ More

    Submitted 28 March, 2023; v1 submitted 14 December, 2020; originally announced December 2020.

    Comments: published in ICCV 2021

  24. arXiv:2004.14552  [pdf, other

    cs.CV

    Salient Object Detection Combining a Self-attention Module and a Feature Pyramid Network

    Authors: Guangyu Ren, Tianhong Dai, Panagiotis Barmpoutis, Tania Stathaki

    Abstract: Salient object detection has achieved great improvement by using the Fully Convolution Network (FCN). However, the FCN-based U-shape architecture may cause the dilution problem in the high-level semantic information during the up-sample operations in the top-down pathway. Thus, it can weaken the ability of salient object localization and produce degraded boundaries. To this end, in order to overco… ▽ More

    Submitted 29 April, 2020; originally announced April 2020.

  25. arXiv:1912.08661  [pdf, other

    cs.CV

    Coupled Network for Robust Pedestrian Detection with Gated Multi-Layer Feature Extraction and Deformable Occlusion Handling

    Authors: Tianrui Liu, Wenhan Luo, Lin Ma, Jun-Jie Huang, Tania Stathaki, Tianhong Dai

    Abstract: Pedestrian detection methods have been significantly improved with the development of deep convolutional neural networks. Nevertheless, detecting small-scaled pedestrians and occluded pedestrians remains a challenging problem. In this paper, we propose a pedestrian detection method with a couple-network to simultaneously address these two issues. One of the sub-networks, the gated multi-layer feat… ▽ More

    Submitted 18 December, 2019; originally announced December 2019.

    Journal ref: IEEE Transactions on Image Processing, 2020

  26. arXiv:1910.11761  [pdf, other

    cs.CV

    Gated Multi-layer Convolutional Feature Extraction Network for Robust Pedestrian Detection

    Authors: Tianrui Liu, Jun-Jie Huang, Tianhong Dai, Guangyu Ren, Tania Stathaki

    Abstract: Pedestrian detection methods have been significantly improved with the development of deep convolutional neural networks. Nevertheless, robustly detecting pedestrians with a large variant on sizes and with occlusions remains a challenging problem. In this paper, we propose a gated multi-layer convolutional feature extraction method which can adaptively generate discriminative features for candidat… ▽ More

    Submitted 18 December, 2019; v1 submitted 25 October, 2019; originally announced October 2019.

    Report number: Accepted by ICASSP'20

    Journal ref: International Conference on Acoustics, Speech, and Signal Processing (ICASSP) 2020

  27. arXiv:1808.02246  [pdf, other

    cs.CV

    SAM-RCNN: Scale-Aware Multi-Resolution Multi-Channel Pedestrian Detection

    Authors: Tianrui Liu, Mohamed Elmikaty, Tania Stathaki

    Abstract: Convolutional neural networks (CNN) have enabled significant improvements in pedestrian detection owing to the strong representation ability of the CNN features. Recently, aggregating features from multiple layers of a CNN has been considered as an effective approach, however, the same approach regarding feature representation is used for detecting pedestrians of varying scales. Consequently, it i… ▽ More

    Submitted 7 August, 2018; originally announced August 2018.

    Comments: published in British Machine Vision Conference (BMVC) 2018