
Showing 1–22 of 22 results for author: Poullis, C

Searching in archive cs.
  1. arXiv:2506.00774  [pdf, other]

    cs.CV

    Depth-Aware Scoring and Hierarchical Alignment for Multiple Object Tracking

    Authors: Milad Khanchi, Maria Amer, Charalambos Poullis

    Abstract: Current motion-based multiple object tracking (MOT) approaches rely heavily on Intersection-over-Union (IoU) for object association. Without using 3D features, they are ineffective in scenarios with occlusions or visually similar objects. To address this, our paper presents a novel depth-aware framework for MOT. We estimate depth using a zero-shot approach and incorporate it as an independent feat…

    Submitted 31 May, 2025; originally announced June 2025.

    Comments: ICIP 2025

  2. arXiv:2503.04006  [pdf, other]

    cs.CV cs.LG

    DSV-LFS: Unifying LLM-Driven Semantic Cues with Visual Features for Robust Few-Shot Segmentation

    Authors: Amin Karimi, Charalambos Poullis

    Abstract: Few-shot semantic segmentation (FSS) aims to enable models to segment novel/unseen object classes using only a limited number of labeled examples. However, current FSS methods frequently struggle with generalization due to incomplete and biased feature representations, especially when support images do not capture the full appearance variability of the target class. To improve the FSS pipeline, we…

    Submitted 5 March, 2025; originally announced March 2025.

  3. arXiv:2412.07899  [pdf, other]

    cs.CV cs.RO

    Pix2Poly: A Sequence Prediction Method for End-to-end Polygonal Building Footprint Extraction from Remote Sensing Imagery

    Authors: Yeshwanth Kumar Adimoolam, Charalambos Poullis, Melinos Averkiou

    Abstract: Extraction of building footprint polygons from remotely sensed data is essential for several urban understanding tasks such as reconstruction, navigation, and mapping. Despite significant progress in the area, extracting accurate polygonal building footprints remains an open problem. In this paper, we introduce Pix2Poly, an attention-based end-to-end trainable and differentiable deep neural networ…

    Submitted 10 December, 2024; originally announced December 2024.

    Comments: Accepted to WACV 2025. 20 pages, 13 figures, 8 tables

  4. arXiv:2410.14505  [pdf, other]

    cs.CV cs.GR

    Neural Real-Time Recalibration for Infrared Multi-Camera Systems

    Authors: Benyamin Mehmandar, Reza Talakoob, Charalambos Poullis

    Abstract: Currently, there are no learning-free or neural techniques for real-time recalibration of infrared multi-camera systems. In this paper, we address the challenge of real-time, highly-accurate calibration of multi-camera infrared systems, a critical task for time-sensitive applications. Unlike traditional calibration techniques that lack adaptability and struggle with on-the-fly recalibrations, we p…

    Submitted 18 October, 2024; originally announced October 2024.

    Comments: real-time camera calibration, infrared camera, neural calibration

  5. arXiv:2409.15213  [pdf, other]

    cs.CV cs.LG

    HydroVision: LiDAR-Guided Hydrometric Prediction with Vision Transformers and Hybrid Graph Learning

    Authors: Naghmeh Shafiee Roudbari, Ursula Eicker, Charalambos Poullis, Zachary Patterson

    Abstract: Hydrometric forecasting is crucial for managing water resources, flood prediction, and environmental protection. Water stations are interconnected, and this connectivity influences the measurements at other stations. However, the dynamic and implicit nature of water flow paths makes it challenging to extract a priori knowledge of the connectivity structure. We hypothesize that terrain elevation si…

    Submitted 23 September, 2024; originally announced September 2024.

  6. arXiv:2312.05961  [pdf, other]

    cs.LG

    TransGlow: Attention-augmented Transduction model based on Graph Neural Networks for Water Flow Forecasting

    Authors: Naghmeh Shafiee Roudbari, Charalambos Poullis, Zachary Patterson, Ursula Eicker

    Abstract: The hydrometric prediction of water quantity is useful for a variety of applications, including water management, flood forecasting, and flood control. However, the task is difficult due to the dynamic nature and limited data of water systems. Highly interconnected water systems can significantly affect hydrometric forecasting. Consequently, it is crucial to develop models that represent the relat…

    Submitted 10 December, 2023; originally announced December 2023.

  7. arXiv:2311.18480  [pdf, other]

    cs.HC

    ESPiM: Eye-Strain Probation Model, An Eye-Tracking Analysis Measure for Digital Displays

    Authors: Mohsen Parisay, Negar Haghbin, Charalambos Poullis, Marta Kersten-Oertel

    Abstract: Eye-strain is a common issue among computer users due to the prolonged periods they spend working in front of digital displays. This can lead to vision problems, such as irritation and tiredness of the eyes and headaches. We propose the Eye-Strain Probation Model (ESPiM), a computational model based on eye-tracking data that measures eye-strain on digital displays based on the spatial properties o…

    Submitted 30 November, 2023; originally announced November 2023.

  8. arXiv:2304.02296  [pdf, other]

    cs.CV

    Efficient Deduplication and Leakage Detection in Large Scale Image Datasets with a focus on the CrowdAI Mapping Challenge Dataset

    Authors: Yeshwanth Kumar Adimoolam, Bodhiswatta Chatterjee, Charalambos Poullis, Melinos Averkiou

    Abstract: Recent advancements in deep learning and computer vision have led to widespread use of deep neural networks to extract building footprints from remote-sensing imagery. The success of such methods relies on the availability of large databases of high-resolution remote sensing images with high-quality annotations. The CrowdAI Mapping Challenge Dataset is one of these datasets that has been used exte…

    Submitted 5 April, 2023; originally announced April 2023.

    Comments: 9 pages, 2 figures

  9. arXiv:2209.06926  [pdf, other]

    cs.CV

    End-to-End Multi-View Structure-from-Motion with Hypercorrelation Volumes

    Authors: Qiao Chen, Charalambos Poullis

    Abstract: Image-based 3D reconstruction is one of the most important tasks in Computer Vision with many solutions proposed over the last few decades. The objective is to extract metric information, i.e. the geometry of scene objects, directly from images. These can then be used in a wide range of applications such as film, games, virtual reality, etc. Recently, deep learning techniques have been proposed to t…

    Submitted 14 September, 2022; originally announced September 2022.

    Comments: IEEE International Conference on Signal Processing, Sensors, and Intelligent Systems (SPSIS 2022)

  10. arXiv:2209.03858  [pdf, other]

    cs.LG cs.CV

    Simpler is better: Multilevel Abstraction with Graph Convolutional Recurrent Neural Network Cells for Traffic Prediction

    Authors: Naghmeh Shafiee Roudbari, Zachary Patterson, Ursula Eicker, Charalambos Poullis

    Abstract: In recent years, graph neural networks (GNNs) combined with variants of recurrent neural networks (RNNs) have reached state-of-the-art performance in spatiotemporal forecasting tasks. This is particularly the case for traffic forecasting, where GNN models use the graph structure of road networks to account for spatial correlation between links and nodes. Recent solutions are either based on comple…

    Submitted 8 September, 2022; originally announced September 2022.

  11. arXiv:2208.11546  [pdf, other]

    cs.CV

    Unsupervised Structure-Consistent Image-to-Image Translation

    Authors: Shima Shahfar, Charalambos Poullis

    Abstract: The Swapping Autoencoder achieved state-of-the-art performance in deep image manipulation and image-to-image translation. We improve this work by introducing a simple yet effective auxiliary module based on gradient reversal layers. The auxiliary module's loss forces the generator to learn to reconstruct an image with an all-zero texture code, encouraging better disentanglement between the structu…

    Submitted 24 August, 2022; originally announced August 2022.

    Comments: structure-consistent image-to-image translation, style transfer, training class imbalance

  12. arXiv:2206.12464  [pdf, other]

    cs.CV

    Motion Estimation for Large Displacements and Deformations

    Authors: Qiao Chen, Charalambos Poullis

    Abstract: Large displacement optical flow is an integral part of many computer vision tasks. Variational optical flow techniques based on a coarse-to-fine scheme interpolate sparse matches and locally optimize an energy model conditioned on colour, gradient and smoothness, making them sensitive to noise in the sparse matches, deformations, and arbitrarily large displacements. This paper addresses this probl…

    Submitted 24 June, 2022; originally announced June 2022.

  13. arXiv:2206.09480  [pdf, other]

    cs.HC cs.LG

    Predicting Human Performance in Vertical Hierarchical Menu Selection in Immersive AR Using Hand-gesture and Head-gaze

    Authors: Majid Pourmemar, Yashas Joshi, Charalambos Poullis

    Abstract: There are currently limited guidelines on designing user interfaces (UI) for immersive augmented reality (AR) applications. Designers must reflect on their experience designing UI for desktop and mobile applications and conjecture how a UI will influence AR users' performance. In this work, we introduce a predictive model for determining users' performance for a target UI without the subsequent in…

    Submitted 19 June, 2022; originally announced June 2022.

  14. arXiv:2205.15846  [pdf, other]

    cs.GR cs.MM

    SaccadeNet: Towards Real-time Saccade Prediction for Virtual Reality Infinite Walking

    Authors: Yashas Joshi, Charalambos Poullis

    Abstract: Modern Redirected Walking (RDW) techniques significantly outperform classical solutions. Nevertheless, they are often limited by their heavy reliance on eye-tracking hardware embedded within the VR headset to reveal redirection opportunities. We propose a novel RDW technique that leverages the temporary blindness induced by saccades for redirection. However, unlike the state-of-the-art, our…

    Submitted 31 May, 2022; originally announced May 2022.

    Comments: redirected walking, virtual reality

  15. arXiv:2204.06626  [pdf, other]

    cs.CV

    Adaptive Memory Management for Video Object Segmentation

    Authors: Ali Pourganjalikhan, Charalambos Poullis

    Abstract: Matching-based networks have achieved state-of-the-art performance for video object segmentation (VOS) tasks by storing every-k frames in an external memory bank for future inference. Storing the intermediate frames' predictions provides the network with richer cues for segmenting an object in the current frame. However, the size of the memory bank gradually increases with the length of the video,…

    Submitted 13 April, 2022; originally announced April 2022.

    Comments: In Proceedings of the 19th Conference on Robots and Vision (CRV), 2022

  16. arXiv:2202.13017  [pdf, other]

    cs.CV cs.GR

    Multi-view Gradient Consistency for SVBRDF Estimation of Complex Scenes under Natural Illumination

    Authors: Alen Joy, Charalambos Poullis

    Abstract: This paper presents a process for estimating the spatially varying surface reflectance of complex scenes observed under natural illumination. In contrast to previous methods, our process is not limited to scenes viewed under controlled lighting conditions but can handle complex indoor and outdoor scenes viewed under arbitrary illumination conditions. An end-to-end process uses a model of the scene…

    Submitted 25 February, 2022; originally announced February 2022.

  17. arXiv:2105.06820  [pdf, other]

    cs.CV cs.GR

    Predicting Surface Reflectance Properties of Outdoor Scenes Under Unknown Natural Illumination

    Authors: Farhan Rahman Wasee, Alen Joy, Charalambos Poullis

    Abstract: Estimating and modelling the appearance of an object under outdoor illumination conditions is a complex process. Although there have been several studies on illumination estimation and relighting, very few of them focus on estimating the reflectance properties of outdoor objects and scenes. This paper addresses this problem and proposes a complete framework to predict surface reflectance propertie…

    Submitted 14 May, 2021; originally announced May 2021.

  18. EyeTAP: A Novel Technique using Voice Inputs to Address the Midas Touch Problem for Gaze-based Interactions

    Authors: Mohsen Parisay, Charalambos Poullis, Marta Kersten

    Abstract: One of the main challenges of gaze-based interactions is the ability to distinguish normal eye function from a deliberate interaction with the computer system, commonly referred to as 'Midas touch'. In this paper we propose EyeTAP (Eye tracking point-and-select by Targeted Acoustic Pulse), a hands-free interaction method for point-and-select tasks. We evaluated the prototype in two separate user s…

    Submitted 22 March, 2020; v1 submitted 19 February, 2020; originally announced February 2020.

  19. arXiv:1912.09216  [pdf, other]

    cs.CV

    Semantic Segmentation from Remote Sensor Data and the Exploitation of Latent Learning for Classification of Auxiliary Tasks

    Authors: Bodhiswatta Chatterjee, Charalambos Poullis

    Abstract: In this paper we address three different aspects of semantic segmentation from remote sensor data using deep neural networks. Firstly, we focus on the semantic segmentation of buildings from remote sensor data and propose ICT-Net. The proposed network has been tested on the INRIA and AIRS benchmark datasets and is shown to outperform all other state of the art by more than 1.5% and 1.8% on the Jac…

    Submitted 19 December, 2019; originally announced December 2019.

    Comments: 17 pages

  20. arXiv:1911.12327  [pdf, other]

    cs.GR

    Inattentional Blindness for Redirected Walking Using Dynamic Foveated Rendering

    Authors: Yashas Joshi, Charalambos Poullis

    Abstract: Redirected walking is a Virtual Reality (VR) locomotion technique which enables users to navigate virtual environments (VEs) that are spatially larger than the available physical tracked space. In this work we present a novel technique for redirected walking in VR based on the psychological phenomenon of inattentional blindness. Based on the user's visual fixation points we divide the user's view i…

    Submitted 27 November, 2019; originally announced November 2019.

  21. arXiv:1709.07368  [pdf, other]

    cs.CV

    Multi-label Pixelwise Classification for Reconstruction of Large-scale Urban Areas

    Authors: Yuanlie He, Sudhir Mudur, Charalambos Poullis

    Abstract: Object classification is one of the many holy grails in computer vision and as such has resulted in a very large number of algorithms being proposed already. Specifically in recent years there has been considerable progress in this area primarily due to the increased efficiency and accessibility of deep learning techniques. In fact, for single-label object classification [i.e. only one object pres…

    Submitted 23 January, 2018; v1 submitted 21 September, 2017; originally announced September 2017.

  22. arXiv:1406.6595  [pdf, other]

    cs.CV

    3DUNDERWORLD-SLS: An Open-Source Structured-Light Scanning System for Rapid Geometry Acquisition

    Authors: Qing Gu, Kyriakos Herakleous, Charalambos Poullis

    Abstract: Recently, there has been an increase in the demand for virtual 3D objects representing real-life objects. A plethora of methods and systems have already been proposed for the acquisition of the geometry of real-life objects ranging from those which employ active sensor technology, passive sensor technology or a combination of various techniques. In this paper we present the development of a 3D sc…

    Submitted 20 August, 2016; v1 submitted 25 June, 2014; originally announced June 2014.

    Comments: 30 pages describing the 3DUNDERWORLD-SLS open source software by the ICT lab (www.theICTlab.org)

    Report number: ICT-TR-2016-01