Showing 1–50 of 72 results for author: Breckon, T P

  1. arXiv:2506.21630  [pdf, ps, other]

    cs.RO cs.CV cs.LG

    TOMD: A Trail-based Off-road Multimodal Dataset for Traversable Pathway Segmentation under Challenging Illumination Conditions

    Authors: Yixin Sun, Li Li, Wenke E, Amir Atapour-Abarghouei, Toby P. Breckon

    Abstract: Detecting traversable pathways in unstructured outdoor environments remains a significant challenge for autonomous robots, especially in critical applications such as wide-area search and rescue, as well as incident management scenarios like forest fires. Existing datasets and models primarily target urban settings or wide, vehicle-traversable off-road tracks, leaving a substantial gap in addressi…

    Submitted 24 June, 2025; originally announced June 2025.

    Comments: 8 pages, 9 figures, 2025 IJCNN

  2. arXiv:2506.14667  [pdf, ps, other]

    cs.CV

    DDS-NAS: Dynamic Data Selection within Neural Architecture Search via On-line Hard Example Mining applied to Image Classification

    Authors: Matt Poyser, Toby P. Breckon

    Abstract: In order to address the scalability challenge within Neural Architecture Search (NAS), we speed up NAS training via dynamic hard example mining within a curriculum learning framework. By utilizing an autoencoder that enforces an image similarity embedding in latent space, we construct an efficient kd-tree structure to order images by furthest neighbour dissimilarity in a low-dimensional embedding.…

    Submitted 23 June, 2025; v1 submitted 17 June, 2025; originally announced June 2025.

    Comments: 27 single-column pages, 8 figures, to be published in Pattern Recognition

  3. arXiv:2504.18746  [pdf, other]

    cs.CV

    Dream-Box: Object-wise Outlier Generation for Out-of-Distribution Detection

    Authors: Brian K. S. Isaac-Medina, Toby P. Breckon

    Abstract: Deep neural networks have demonstrated great generalization capabilities for tasks whose training and test sets are drawn from the same distribution. Nevertheless, out-of-distribution (OOD) detection remains a challenging task that has received significant attention in recent years. Specifically, OOD detection refers to the detection of instances that do not belong to the training distribution, wh…

    Submitted 25 April, 2025; originally announced April 2025.

    Comments: 9 pages, 6 figures, 2 tables, LatinX in AI CVPR 2025 Workshop

  4. arXiv:2503.13004  [pdf, other]

    cs.CV

    TFDM: Time-Variant Frequency-Based Point Cloud Diffusion with Mamba

    Authors: Jiaxu Liu, Li Li, Hubert P. H. Shum, Toby P. Breckon

    Abstract: Diffusion models currently demonstrate impressive performance over various generative tasks. Recent work on image diffusion highlights the strong capabilities of Mamba (state space models) due to its efficient handling of long-range dependencies and sequential data modeling. Unfortunately, joint consideration of state space models with 3D point cloud generation remains limited. To harness the powe…

    Submitted 17 March, 2025; originally announced March 2025.

  5. arXiv:2503.00675  [pdf, other]

    cs.CV cs.RO

    Dur360BEV: A Real-world 360-degree Single Camera Dataset and Benchmark for Bird-Eye View Mapping in Autonomous Driving

    Authors: Wenke E, Chao Yuan, Li Li, Yixin Sun, Yona Falinie A. Gaus, Amir Atapour-Abarghouei, Toby P. Breckon

    Abstract: We present Dur360BEV, a novel spherical camera autonomous driving dataset equipped with a high-resolution 128-channel 3D LiDAR and an RTK-refined GNSS/INS system, along with a benchmark architecture designed to generate Bird-Eye-View (BEV) maps using only a single spherical camera. This dataset and benchmark address the challenges of BEV generation in autonomous driving, particularly by reducing ha…

    Submitted 6 March, 2025; v1 submitted 1 March, 2025; originally announced March 2025.

  6. arXiv:2412.01596  [pdf, other]

    cs.CV

    FEVER-OOD: Free Energy Vulnerability Elimination for Robust Out-of-Distribution Detection

    Authors: Brian K. S. Isaac-Medina, Mauricio Che, Yona F. A. Gaus, Samet Akcay, Toby P. Breckon

    Abstract: Modern machine learning models, that excel on computer vision tasks such as classification and object detection, are often overconfident in their predictions for Out-of-Distribution (OOD) examples, resulting in unpredictable behaviour for open-set environments. Recent works have demonstrated that the free energy score is an effective measure of uncertainty for OOD detection given its close relatio…

    Submitted 2 December, 2024; originally announced December 2024.

    Comments: 18 pages, 15 figures, 4 tables

  7. arXiv:2408.13902  [pdf, other]

    cs.CV cs.LG cs.RO

    TraIL-Det: Transformation-Invariant Local Feature Networks for 3D LiDAR Object Detection with Unsupervised Pre-Training

    Authors: Li Li, Tanqiu Qiao, Hubert P. H. Shum, Toby P. Breckon

    Abstract: 3D point clouds are essential for perceiving outdoor scenes, especially within the realm of autonomous driving. Recent advances in 3D LiDAR Object Detection focus primarily on the spatial positioning and distribution of points to ensure accurate detection. However, despite their robust performance in variable conditions, these methods are hindered by their sole reliance on coordinates and point in…

    Submitted 25 August, 2024; originally announced August 2024.

    Comments: BMVC 2024; 15 pages, 3 figures, 3 tables; Code at https://github.com/l1997i/rapid_seg

    Journal ref: Brit. Mach. Vis. Conf. (BMVC 2024)

  8. arXiv:2407.15763  [pdf, other]

    cs.CV

    Towards Open-World Object-based Anomaly Detection via Self-Supervised Outlier Synthesis

    Authors: Brian K. S. Isaac-Medina, Yona Falinie A. Gaus, Neelanjan Bhowmik, Toby P. Breckon

    Abstract: Object detection is a pivotal task in computer vision that has received significant attention in previous years. Nonetheless, the capability of a detector to localise objects out of the training distribution remains unexplored. Whilst recent approaches in object-level out-of-distribution (OoD) detection heavily rely on class labels, such approaches contradict truly open-world scenarios where the c…

    Submitted 22 July, 2024; originally announced July 2024.

    Comments: 35 pages, 21 figures, includes supplementary material, accepted at ECCV 2024

  9. RAPiD-Seg: Range-Aware Pointwise Distance Distribution Networks for 3D LiDAR Segmentation

    Authors: Li Li, Hubert P. H. Shum, Toby P. Breckon

    Abstract: 3D point clouds play a pivotal role in outdoor scene perception, especially in the context of autonomous driving. Recent advancements in 3D LiDAR segmentation often focus intensely on the spatial positioning and distribution of points for accurate segmentation. However, these methods, while robust in variable conditions, encounter challenges due to sole reliance on coordinates and point intensity,…

    Submitted 13 September, 2024; v1 submitted 14 July, 2024; originally announced July 2024.

    Comments: ECCV 2024 (Oral); 18 pages, 6 figures, 7 tables; Code at https://github.com/l1997i/rapid_seg

    Journal ref: Eur. Conf. Comput. Vis. (ECCV 2024 ORAL)

  10. DurLAR: A High-fidelity 128-channel LiDAR Dataset with Panoramic Ambient and Reflectivity Imagery for Multi-modal Autonomous Driving Applications

    Authors: Li Li, Khalid N. Ismail, Hubert P. H. Shum, Toby P. Breckon

    Abstract: We present DurLAR, a high-fidelity 128-channel 3D LiDAR dataset with panoramic ambient (near infrared) and reflectivity imagery, as well as a sample benchmark task using depth estimation for autonomous driving applications. Our driving platform is equipped with a high resolution 128 channel LiDAR, a 2MPix stereo camera, a lux meter and a GNSS/INS system. Ambient and reflectivity images are made av…

    Submitted 14 June, 2024; originally announced June 2024.

    Comments: Accepted by 3DV 2021; 13 pages, 14 figures; Dataset at https://github.com/l1997i/durlar

    Journal ref: Proc. Int. Conf. on 3D Vision (3DV 2021)

  11. arXiv:2404.12285  [pdf, other]

    cs.CV

    Performance Evaluation of Segment Anything Model with Variational Prompting for Application to Non-Visible Spectrum Imagery

    Authors: Yona Falinie A. Gaus, Neelanjan Bhowmik, Brian K. S. Isaac-Medina, Toby P. Breckon

    Abstract: The Segment Anything Model (SAM) is a deep neural network foundational model designed to perform instance segmentation which has gained significant popularity given its zero-shot segmentation ability. SAM operates by generating masks based on various input prompts such as text, bounding boxes, points, or masks, introducing a novel methodology to overcome the constraints posed by dataset-specific s…

    Submitted 18 April, 2024; originally announced April 2024.

  12. arXiv:2403.19897  [pdf, other]

    cs.CV cs.LG

    Disentangling Racial Phenotypes: Fine-Grained Control of Race-related Facial Phenotype Characteristics

    Authors: Seyma Yucer, Amir Atapour Abarghouei, Noura Al Moubayed, Toby P. Breckon

    Abstract: Achieving an effective fine-grained appearance variation over 2D facial images, whilst preserving facial identity, is a challenging task due to the high complexity and entanglement of common 2D facial feature encoding spaces. Despite these challenges, such fine-grained control, by way of disentanglement, is a crucial enabler for data-driven racial bias mitigation strategies across multiple automate…

    Submitted 28 March, 2024; originally announced March 2024.

  13. arXiv:2311.06018  [pdf, other]

    cs.CV

    U3DS$^3$: Unsupervised 3D Semantic Scene Segmentation

    Authors: Jiaxu Liu, Zhengdi Yu, Toby P. Breckon, Hubert P. H. Shum

    Abstract: Contemporary point cloud segmentation approaches largely rely on richly annotated 3D training data. However, it is both time-consuming and challenging to obtain consistently accurate annotations for such 3D scene data. Moreover, there is still a lack of investigation into fully unsupervised scene segmentation for point clouds, especially for holistic 3D scenes. This paper presents U3DS$^3$, as a s…

    Submitted 10 November, 2023; originally announced November 2023.

    Comments: 10 Pages, 4 figures, accepted to IEEE/CVF Winter Conference on Applications of Computer Vision (WACV) 2024

  14. arXiv:2310.16435  [pdf, other]

    cs.CV

    On Pixel-level Performance Assessment in Anomaly Detection

    Authors: Mehdi Rafiei, Toby P. Breckon, Alexandros Iosifidis

    Abstract: Anomaly detection methods have demonstrated remarkable success across various applications. However, assessing their performance, particularly at the pixel-level, presents a complex challenge due to the severe imbalance that is most commonly present between normal and abnormal samples. Commonly adopted evaluation metrics designed for pixel-level detection may not effectively capture the nuanced pe…

    Submitted 25 October, 2023; originally announced October 2023.

    Comments: 5 pages, 5 figures, 1 table

  15. arXiv:2308.14152  [pdf, other]

    cs.CV

    Unaligned 2D to 3D Translation with Conditional Vector-Quantized Code Diffusion using Transformers

    Authors: Abril Corona-Figueroa, Sam Bond-Taylor, Neelanjan Bhowmik, Yona Falinie A. Gaus, Toby P. Breckon, Hubert P. H. Shum, Chris G. Willcocks

    Abstract: Generating 3D images of complex objects conditionally from a few 2D views is a difficult synthesis problem, compounded by issues such as domain gap and geometric misalignment. For instance, a unified framework such as Generative Adversarial Networks cannot achieve this unless they explicitly define both a domain-invariant and geometric-invariant joint latent distribution, whereas Neural Radiance F…

    Submitted 27 August, 2023; originally announced August 2023.

    Comments: Camera-ready version for ICCV 2023

  16. arXiv:2305.00817  [pdf, other]

    cs.CV

    Racial Bias within Face Recognition: A Survey

    Authors: Seyma Yucer, Furkan Tektas, Noura Al Moubayed, Toby P. Breckon

    Abstract: Facial recognition is one of the most academically studied and industrially developed areas within computer vision where we readily find associated applications deployed globally. This widespread adoption has uncovered significant performance variation across subjects of different racial profiles leading to focused research attention on racial bias within face recognition spanning both current cau…

    Submitted 1 May, 2023; originally announced May 2023.

  17. arXiv:2303.11203  [pdf, other]

    cs.CV

    Less is More: Reducing Task and Model Complexity for 3D Point Cloud Semantic Segmentation

    Authors: Li Li, Hubert P. H. Shum, Toby P. Breckon

    Abstract: Whilst the availability of 3D LiDAR point cloud data has significantly grown in recent years, annotation remains expensive and time-consuming, leading to a demand for semi-supervised semantic segmentation methods with application domains such as autonomous driving. Existing work very often employs relatively large segmentation backbone networks to improve segmentation accuracy, at the expense of c…

    Submitted 28 March, 2023; v1 submitted 20 March, 2023; originally announced March 2023.

    Comments: Accepted by CVPR 2023; 11 pages, 8 figures; Code at https://github.com/l1997i/lim3d

  18. arXiv:2303.05938  [pdf, other]

    cs.CV

    ACR: Attention Collaboration-based Regressor for Arbitrary Two-Hand Reconstruction

    Authors: Zhengdi Yu, Shaoli Huang, Chen Fang, Toby P. Breckon, Jue Wang

    Abstract: Reconstructing two hands from monocular RGB images is challenging due to frequent occlusion and mutual confusion. Existing methods mainly learn an entangled representation to encode two interacting hands, which are incredibly fragile to impaired interaction, such as truncated hands, separate hands, or external occlusion. This paper presents ACR (Attention Collaboration-based Regressor), which make…

    Submitted 10 March, 2023; originally announced March 2023.

    Comments: Accepted by CVPR 2023; Code at https://github.com/ZhengdiYu/Arbitrary-Hands-3D-Reconstruction

  19. arXiv:2303.03925  [pdf, other]

    cs.LG cs.AI

    Robust Semi-Supervised Anomaly Detection via Adversarially Learned Continuous Noise Corruption

    Authors: Jack W Barker, Neelanjan Bhowmik, Yona Falinie A Gaus, Toby P Breckon

    Abstract: Anomaly detection is the task of recognising novel samples which deviate significantly from pre-established normality. Abnormal classes are not present during training, meaning that models must learn effective representations solely across normal class data samples. Deep Autoencoders (AE) have been widely used for anomaly detection tasks, but suffer from overfitting to a null identity function. To a…

    Submitted 2 March, 2023; originally announced March 2023.

    Comments: 18th International Conference on Computer Vision Theory and Applications

    Journal ref: Volume 4: VISAPP, ISBN 978-989-758-555-5, ISSN 2184-4321, pages 868-876. 2023

  20. arXiv:2211.13508  [pdf, other]

    cs.CV cs.AI cs.LG cs.RO

    1st Workshop on Maritime Computer Vision (MaCVi) 2023: Challenge Results

    Authors: Benjamin Kiefer, Matej Kristan, Janez Perš, Lojze Žust, Fabio Poiesi, Fabio Augusto de Alcantara Andrade, Alexandre Bernardino, Matthew Dawkins, Jenni Raitoharju, Yitong Quan, Adem Atmaca, Timon Höfer, Qiming Zhang, Yufei Xu, Jing Zhang, Dacheng Tao, Lars Sommer, Raphael Spraul, Hangyue Zhao, Hongpu Zhang, Yanyun Zhao, Jan Lukas Augustin, Eui-ik Jeon, Impyeong Lee, Luca Zedda, et al. (48 additional authors not shown)

    Abstract: The 1$^{\text{st}}$ Workshop on Maritime Computer Vision (MaCVi) 2023 focused on maritime computer vision for Unmanned Aerial Vehicles (UAV) and Unmanned Surface Vehicle (USV), and organized several subchallenges in this domain: (i) UAV-based Maritime Object Detection, (ii) UAV-based Maritime Object Tracking, (iii) USV-based Maritime Obstacle Segmentation and (iv) USV-based Maritime Obstacle Detec…

    Submitted 28 November, 2022; v1 submitted 24 November, 2022; originally announced November 2022.

    Comments: MaCVi 2023 was part of WACV 2023. This report (38 pages) discusses the competition as part of MaCVi

  21. arXiv:2211.12285  [pdf, other]

    cs.CV cs.GR

    Exact-NeRF: An Exploration of a Precise Volumetric Parameterization for Neural Radiance Fields

    Authors: Brian K. S. Isaac-Medina, Chris G. Willcocks, Toby P. Breckon

    Abstract: Neural Radiance Fields (NeRF) have attracted significant attention due to their ability to synthesize novel scene views with great accuracy. However, inherent to their underlying formulation, the sampling of points along a ray with zero width may result in ambiguous representations that lead to further rendering artifacts such as aliasing in the final scene. To address this issue, the recent varia…

    Submitted 25 March, 2023; v1 submitted 22 November, 2022; originally announced November 2022.

    Comments: 15 pages, 10 figures

  22. arXiv:2210.16453  [pdf, other]

    cs.CV cs.AI cs.LG

    Joint Sub-component Level Segmentation and Classification for Anomaly Detection within Dual-Energy X-Ray Security Imagery

    Authors: Neelanjan Bhowmik, Toby P. Breckon

    Abstract: X-ray baggage security screening is in widespread use and crucial to maintaining transport security for threat/anomaly detection tasks. The automatic detection of anomalies concealed within cluttered and complex electronics/electrical items using 2D X-ray imagery has been of primary interest in recent years. We address this task by introducing joint object sub-component level segmentation and c…

    Submitted 28 October, 2022; originally announced October 2022.

  23. arXiv:2210.14083  [pdf, other]

    cs.CV

    On Fine-Tuned Deep Features for Unsupervised Domain Adaptation

    Authors: Qian Wang, Toby P. Breckon

    Abstract: Prior feature transformation based approaches to Unsupervised Domain Adaptation (UDA) employ the deep features extracted by pre-trained deep models without fine-tuning them on the specific source or target domain data for a particular domain adaptation task. In contrast, end-to-end learning based approaches optimise the pre-trained backbones and the customised adaptation modules simultaneously to…

    Submitted 25 October, 2022; originally announced October 2022.

  24. arXiv:2208.07613  [pdf, other]

    cs.CV cs.CY cs.LG

    Does lossy image compression affect racial bias within face recognition?

    Authors: Seyma Yucer, Matt Poyser, Noura Al Moubayed, Toby P. Breckon

    Abstract: Yes - This study investigates the impact of commonplace lossy image compression on face recognition algorithms with regard to the racial characteristics of the subject. We adopt a recently proposed racial phenotype-based bias analysis methodology to measure the effect of varying levels of lossy compression across racial phenotype categories. Additionally, we determine the relationship between chro…

    Submitted 16 August, 2022; originally announced August 2022.

  25. arXiv:2206.00432  [pdf, ps, other]

    cs.RO cs.CV cs.LG

    Evaluating Gaussian Grasp Maps for Generative Grasping Models

    Authors: William Prew, Toby P. Breckon, Magnus Bordewich, Ulrik Beierholm

    Abstract: Generalising robotic grasping to previously unseen objects is a key task in general robotic manipulation. The current method for training many antipodal generative grasping models relies on a binary ground truth grasp map generated from the centre thirds of correctly labelled grasp rectangles. However, these binary maps do not accurately reflect the positions in which a robotic arm can correctly gra…

    Submitted 1 June, 2022; originally announced June 2022.

    Comments: 9 pages, 6 figures, to be published in IJCNN 2022

  26. arXiv:2205.08002  [pdf, other]

    cs.CV cs.AI

    Lost in Compression: the Impact of Lossy Image Compression on Variable Size Object Detection within Infrared Imagery

    Authors: Neelanjan Bhowmik, Jack W. Barker, Yona Falinie A. Gaus, Toby P. Breckon

    Abstract: Lossy image compression strategies allow for more efficient storage and transmission of data by encoding data to a reduced form. This is essential to enable training with larger datasets on less storage-equipped environments. However, such compression can cause a severe decline in performance of deep Convolutional Neural Network (CNN) architectures even when mild compression is applied and the resulting…

    Submitted 16 May, 2022; originally announced May 2022.

  27. arXiv:2112.00556  [pdf, other]

    cs.CV eess.IV

    Semi-Supervised Surface Anomaly Detection of Composite Wind Turbine Blades From Drone Imagery

    Authors: Jack W. Barker, Neelanjan Bhowmik, Toby P. Breckon

    Abstract: Within commercial wind energy generation, the monitoring and predictive maintenance of wind turbine blades in-situ is a crucial task, for which remote monitoring via aerial survey from an Unmanned Aerial Vehicle (UAV) is commonplace. Turbine blades are susceptible to both operational and weather-based damage over time, reducing the energy efficiency output of turbines. In this study, we address au…

    Submitted 1 December, 2021; originally announced December 2021.

    Comments: In-proceedings at 2022 17th International Conference on Computer Vision Theory and Applications (VISAPP)

  28. arXiv:2111.12701  [pdf, other]

    cs.CV cs.LG

    Unleashing Transformers: Parallel Token Prediction with Discrete Absorbing Diffusion for Fast High-Resolution Image Generation from Vector-Quantized Codes

    Authors: Sam Bond-Taylor, Peter Hessey, Hiroshi Sasaki, Toby P. Breckon, Chris G. Willcocks

    Abstract: Whilst diffusion probabilistic models can generate high quality image content, key limitations remain in terms of both generating high-resolution imagery and their associated high computational requirements. Recent Vector-Quantized image models have overcome this limitation of image resolution but are prohibitively slow and unidirectional as they generate tokens via element-wise autoregressive sam…

    Submitted 24 November, 2021; originally announced November 2021.

    Comments: 19 pages, 14 figures

    MSC Class: 68T01 (Primary); 68T07 (Secondary)

    ACM Class: I.5.0; I.4.0; G.3

  29. arXiv:2110.12635  [pdf, other]

    cs.CV cs.LG

    Progressively Select and Reject Pseudo-labelled Samples for Open-Set Domain Adaptation

    Authors: Qian Wang, Fanlin Meng, Toby P. Breckon

    Abstract: Domain adaptation solves image classification problems in the target domain by taking advantage of the labelled source data and unlabelled target data. Usually, the source and target domains share the same set of classes. As a special case, Open-Set Domain Adaptation (OSDA) assumes there exist additional classes in the target domain that are not present in the source domain. To solve such a domain adap…

    Submitted 25 October, 2021; originally announced October 2021.

    Comments: 8 pages

  30. Measuring Hidden Bias within Face Recognition via Racial Phenotypes

    Authors: Seyma Yucer, Furkan Tektas, Noura Al Moubayed, Toby P. Breckon

    Abstract: Recent work reports disparate performance for intersectional racial groups across face recognition tasks: face verification and identification. However, the definition of those racial groups has a significant impact on the underlying findings of such racial bias analysis. Previous studies define these groups based on either demographic information (e.g. African, Asian etc.) or skin tone (e.g. ligh…

    Submitted 19 October, 2021; originally announced October 2021.

    Comments: published in IEEE Winter Conference on Applications of Computer Vision, WACV, 2022

  31. arXiv:2110.04906  [pdf, other]

    cs.CV cs.AI cs.LG

    Operationalizing Convolutional Neural Network Architectures for Prohibited Object Detection in X-Ray Imagery

    Authors: Thomas W. Webb, Neelanjan Bhowmik, Yona Falinie A. Gaus, Toby P. Breckon

    Abstract: The recent advancement in deep Convolutional Neural Network (CNN) has brought insight into the automation of X-ray security screening for aviation security and beyond. Here, we explore the viability of two recent end-to-end object detection CNN architectures, Cascade R-CNN and FreeAnchor, for prohibited item detection by balancing processing time and the impact of image data compression from an op…

    Submitted 10 October, 2021; originally announced October 2021.

  32. arXiv:2108.12505  [pdf, other]

    cs.CV cs.LG

    On the impact of using X-ray energy response imagery for object detection via Convolutional Neural Networks

    Authors: Neelanjan Bhowmik, Yona Falinie A. Gaus, Toby P. Breckon

    Abstract: Automatic detection of prohibited items within complex and cluttered X-ray security imagery is essential to maintaining transport security, where prior work on automatic prohibited item detection focuses primarily on pseudo-colour (rgb) X-ray imagery. In this work we study the impact of variant X-ray imagery, i.e., X-ray energy response (high, low) and effective-z compared to rgb, via the use of d…

    Submitted 27 August, 2021; originally announced August 2021.

  33. arXiv:2104.13702  [pdf, ps, other]

    cs.CV

    PANDA : Perceptually Aware Neural Detection of Anomalies

    Authors: Jack W. Barker, Toby P. Breckon

    Abstract: Semi-supervised methods of anomaly detection have seen substantial advancement in recent years. Of particular interest are applications of such methods to diverse, real-world anomaly detection problems where anomalous variations can vary from the visually obvious to the very subtle. In this work, we propose a novel fine-grained VAE-GAN architecture trained in a semi-supervised manner in order to d…

    Submitted 28 April, 2021; originally announced April 2021.

    Comments: In-proceedings at 2021 International Joint Conference on Neural Networks (IJCNN)

  34. arXiv:2104.06219  [pdf, other]

    cs.CV cs.LG cs.RO

    UAV-ReID: A Benchmark on Unmanned Aerial Vehicle Re-identification in Video Imagery

    Authors: Daniel Organisciak, Matthew Poyser, Aishah Alsehaim, Shanfeng Hu, Brian K. S. Isaac-Medina, Toby P. Breckon, Hubert P. H. Shum

    Abstract: As unmanned aerial vehicles (UAVs) become more accessible with a growing range of applications, the potential risk of UAV disruption increases. Recent development in deep learning allows vision-based counter-UAV systems to detect and track UAVs with a single camera. However, the coverage of a single camera is limited, necessitating the need for multicamera configurations to match UAVs across camer…

    Submitted 2 December, 2021; v1 submitted 13 April, 2021; originally announced April 2021.

  35. arXiv:2104.05358  [pdf, other]

    cs.CV cs.LG eess.IV

    UNIT-DDPM: UNpaired Image Translation with Denoising Diffusion Probabilistic Models

    Authors: Hiroshi Sasaki, Chris G. Willcocks, Toby P. Breckon

    Abstract: We propose a novel unpaired image-to-image translation method that uses denoising diffusion probabilistic models without requiring adversarial training. Our method, UNpaired Image Translation with Denoising Diffusion Probabilistic Models (UNIT-DDPM), trains a generative model to infer the joint distribution of images over both domains as a Markov chain by minimising a denoising score matching obje…

    Submitted 12 April, 2021; originally announced April 2021.

    Comments: 10 pages, 8 figures

  36. Unmanned Aerial Vehicle Visual Detection and Tracking using Deep Neural Networks: A Performance Benchmark

    Authors: Brian K. S. Isaac-Medina, Matt Poyser, Daniel Organisciak, Chris G. Willcocks, Toby P. Breckon, Hubert P. H. Shum

    Abstract: Unmanned Aerial Vehicles (UAV) can pose a major risk for aviation safety, due to both negligent and malicious use. For this reason, the automated detection and tracking of UAV is a fundamental task in aerial security systems. Common technologies for UAV detection include visible-band and thermal infrared imaging, radio frequency and radar. Recent advances in deep neural networks (DNNs) for image-b…

    Submitted 18 August, 2021; v1 submitted 25 March, 2021; originally announced March 2021.

  37. arXiv:2012.11753  [pdf, other]

    cs.CV cs.LG eess.IV

    Contraband Materials Detection Within Volumetric 3D Computed Tomography Baggage Security Screening Imagery

    Authors: Qian Wang, Toby P. Breckon

    Abstract: Automatic prohibited object detection within 2D/3D X-ray Computed Tomography (CT) has been studied in literature to enhance the aviation security screening at checkpoints. Deep Convolutional Neural Networks (CNN) have demonstrated superior performance in 2D X-ray imagery. However, there exists very limited proof of how deep neural networks perform in materials detection within volumetric 3D CT bag…

    Submitted 21 December, 2020; originally announced December 2020.

    Comments: 8 pages

  38. arXiv:2012.05320  [pdf, other]

    cs.CV

    Multi-Model Learning for Real-Time Automotive Semantic Foggy Scene Understanding via Domain Adaptation

    Authors: Naif Alshammari, Samet Akcay, Toby P. Breckon

    Abstract: Robust semantic scene segmentation for automotive applications is a challenging problem in two key aspects: (1) labelling every individual scene pixel and (2) performing this task under unstable weather and illumination changes (e.g., foggy weather), which results in poor outdoor scene visibility. Such visibility limitations lead to non-optimal performance of generalised deep convolutional neural…

    Submitted 9 December, 2020; originally announced December 2020.

    Comments: arXiv admin note: substantial text overlap with arXiv:1909.07697

  39. arXiv:2012.05304  [pdf, other]

    cs.CV

    Competitive Simplicity for Multi-Task Learning for Real-Time Foggy Scene Understanding via Domain Adaptation

    Authors: Naif Alshammari, Samet Akcay, Toby P. Breckon

    Abstract: Automotive scene understanding under adverse weather conditions raises a realistic and challenging problem attributable to poor outdoor scene visibility (e.g. foggy weather). However, because most contemporary scene understanding approaches are applied under ideal-weather conditions, such approaches may not provide genuinely optimal performance when compared to established a priori insights on ext…

    Submitted 9 December, 2020; originally announced December 2020.

  40. arXiv:2012.00848  [pdf, other]

    cs.CV cs.LG

    Data Augmentation with norm-VAE for Unsupervised Domain Adaptation

    Authors: Qian Wang, Fanlin Meng, Toby P. Breckon

    Abstract: We address the Unsupervised Domain Adaptation (UDA) problem in image classification from a new perspective. In contrast to most existing works which either align the data distributions or learn domain-invariant features, we directly learn a unified classifier for both domains within a high-dimensional homogeneous feature space without explicit domain adaptation. To this end, we employ the effectiv…

    Submitted 1 December, 2020; originally announced December 2020.

    Comments: 12 pages

  41. arXiv:2010.08833  [pdf, other]

    cs.CV cs.LG

    Efficient and Compact Convolutional Neural Network Architectures for Non-temporal Real-time Fire Detection

    Authors: William Thomson, Neelanjan Bhowmik, Toby P. Breckon

    Abstract: Automatic visual fire detection is used to complement traditional fire detection sensor systems (smoke/heat). In this work, we investigate different Convolutional Neural Network (CNN) architectures and their variants for the non-temporal real-time bounds detection of fire pixel regions in video (or still) imagery. Two reduced complexity compact CNN architectures (NasNet-A-OnFire and ShuffleNetV2-O…

    Submitted 17 October, 2020; originally announced October 2020.

  42. arXiv:2008.06318  [pdf, other]

    cs.CV

    Not 3D Re-ID: a Simple Single Stream 2D Convolution for Robust Video Re-identification

    Authors: Toby P. Breckon, Aishah Alsehaim

    Abstract: Video-based person re-identification has received increasing attention recently, as it plays an important role within surveillance video analysis. Video-based Re-ID is an expansion of earlier image-based re-identification methods by learning features from a video via multiple image frames for each person. Most contemporary video Re-ID methods utilise complex CNN-based network architectures using 3D…

    Submitted 17 August, 2020; v1 submitted 14 August, 2020; originally announced August 2020.

    Comments: Submitted to ICPR 2020 and accepted for presentation and inclusion in the proceedings

  43. arXiv:2008.01218  [pdf, other]

    cs.CV cs.LG eess.IV

    Multi-Class 3D Object Detection Within Volumetric 3D Computed Tomography Baggage Security Screening Imagery

    Authors: Qian Wang, Neelanjan Bhowmik, Toby P. Breckon

    Abstract: Automatic detection of prohibited objects within passenger baggage is important for aviation security. X-ray Computed Tomography (CT) based 3D imaging is widely used in airports for aviation security screening whilst prior work on automatic prohibited item detection focuses primarily on 2D X-ray imagery. These works have proven the possibility of extending deep convolutional neural networks (CNN) ba…

    Submitted 3 August, 2020; originally announced August 2020.

    Comments: Durham University

  44. arXiv:2008.01214  [pdf, other]

    cs.CV cs.LG stat.ML

    Generalized Zero-Shot Domain Adaptation via Coupled Conditional Variational Autoencoders

    Authors: Qian Wang, Toby P. Breckon

    Abstract: Domain adaptation approaches aim to exploit useful information from the source domain where supervised learning examples are easier to obtain to address a learning problem in the target domain where there is no or limited availability of such examples. In classification problems, domain adaptation has been studied under varying supervised, unsupervised and semi-supervised conditions. However, a co…

    Submitted 3 August, 2020; originally announced August 2020.

    Comments: Durham University

  45. arXiv:2007.14314  [pdf, other]

    cs.CV

    On the Impact of Lossy Image and Video Compression on the Performance of Deep Convolutional Neural Network Architectures

    Authors: Matt Poyser, Amir Atapour-Abarghouei, Toby P. Breckon

    Abstract: Recent advances in generalized image understanding have seen a surge in the use of deep convolutional neural networks (CNN) across a broad range of image-based detection, classification and prediction tasks. Whilst the reported performance of these approaches is impressive, this study investigates the hitherto unapproached question of the impact of commonplace image and video compression technique…

    Submitted 28 July, 2020; originally announced July 2020.

    Comments: 8 pages, 21 figures, to be published in ICPR 2020 conference

  46. arXiv:2005.02436  [pdf, other]

    cs.CV cs.LG eess.IV

    Data Augmentation via Mixed Class Interpolation using Cycle-Consistent Generative Adversarial Networks Applied to Cross-Domain Imagery

    Authors: Hiroshi Sasaki, Chris G. Willcocks, Toby P. Breckon

    Abstract: Machine learning driven object detection and classification within non-visible imagery has an important role in many fields such as night vision, all-weather surveillance and aviation security. However, such applications often suffer due to the limited quantity and variety of non-visible spectral domain imagery, in contrast to the high data availability of visible-band imagery that readily enables…

    Submitted 1 January, 2021; v1 submitted 5 May, 2020; originally announced May 2020.

    Comments: 9 pages, 9 figures, accepted at the 25th International Conference on Pattern Recognition (ICPR 2020)

  47. arXiv:2004.12427  [pdf, other]

    cs.LG cs.CV stat.ML

    Cross-Domain Structure Preserving Projection for Heterogeneous Domain Adaptation

    Authors: Qian Wang, Toby P. Breckon

    Abstract: Heterogeneous Domain Adaptation (HDA) addresses the transfer learning problems where data from the source and target domains are of different modalities (e.g., texts and images) or feature dimensions (e.g., features extracted with different methods). It is useful for multi-modal data analysis. Traditional domain adaptation algorithms assume that the representations of source and target samples res…

    Submitted 8 October, 2021; v1 submitted 26 April, 2020; originally announced April 2020.

    Comments: Technical Report

  48. arXiv:2004.08945  [pdf, other]

    cs.CV

    Exploring Racial Bias within Face Recognition via per-subject Adversarially-Enabled Data Augmentation

    Authors: Seyma Yucer, Samet Akçay, Noura Al-Moubayed, Toby P. Breckon

    Abstract: Whilst face recognition applications are becoming increasingly prevalent within our daily lives, leading approaches in the field still suffer from performance bias to the detriment of some racial profiles within society. In this study, we propose a novel adversarial derived data augmentation methodology that aims to enable dataset balance at a per-subject level via the use of image-to-image transf…

    Submitted 19 April, 2020; originally announced April 2020.

    Comments: CVPR 2020 - Fair, Data Efficient and Trusted Computer Vision Workshop

  49. arXiv:2003.12625  [pdf, other]

    cs.CV cs.LG

    On the Evaluation of Prohibited Item Classification and Detection in Volumetric 3D Computed Tomography Baggage Security Screening Imagery

    Authors: Qian Wang, Neelanjan Bhowmik, Toby P. Breckon

    Abstract: X-ray Computed Tomography (CT) based 3D imaging is widely used in airports for aviation security screening whilst prior work on prohibited item detection focuses primarily on 2D X-ray imagery. In this paper, we aim to evaluate the possibility of extending the automatic prohibited item detection from 2D X-ray imagery to volumetric 3D CT baggage security screening imagery. To these ends, we take adv…

    Submitted 27 March, 2020; originally announced March 2020.

    Comments: Accepted to IJCNN 2020

  50. arXiv:2001.05459  [pdf, other]

    cs.CV cs.CR cs.LG eess.IV

    A Reference Architecture for Plausible Threat Image Projection (TIP) Within 3D X-ray Computed Tomography Volumes

    Authors: Qian Wang, Najla Megherbi, Toby P. Breckon

    Abstract: Threat Image Projection (TIP) is a technique used in X-ray security baggage screening systems that superimposes a threat object signature onto a benign X-ray baggage image in a plausible and realistic manner. It has been shown to be highly effective in evaluating the ongoing performance of human operators, improving their vigilance and performance on threat detection. However, with the increasing…

    Submitted 15 January, 2020; originally announced January 2020.

    Comments: Technical Report, Durham University