Skip to main content

Showing 1–50 of 71 results for author: O'Connor, N E

.
  1. arXiv:2505.23872  [pdf, ps, other

    eess.IV cs.CV

    Parameter-Free Bio-Inspired Channel Attention for Enhanced Cardiac MRI Reconstruction

    Authors: Anam Hashmi, Julia Dietlmeier, Kathleen M. Curran, Noel E. O'Connor

    Abstract: Attention is a fundamental component of the human visual recognition system. The inclusion of attention in a convolutional neural network amplifies relevant visual features and suppresses the less important ones. Integrating attention mechanisms into convolutional neural networks enhances model performance and interpretability. Spatial and channel attention mechanisms have shown significant advant… ▽ More

    Submitted 29 May, 2025; originally announced May 2025.

    Comments: presented at the 28th UK Conference on Medical Image Understanding and Analysis - MIUA, 24 - 26 July 2024

  2. arXiv:2505.08561  [pdf, other

    cs.CV

    Reinforcement Learning meets Masked Video Modeling : Trajectory-Guided Adaptive Token Selection

    Authors: Ayush K. Rai, Kyle Min, Tarun Krishna, Feiyan Hu, Alan F. Smeaton, Noel E. O'Connor

    Abstract: Masked video modeling~(MVM) has emerged as a highly effective pre-training strategy for visual foundation models, whereby the model reconstructs masked spatiotemporal tokens using information from visible tokens. However, a key challenge in such approaches lies in selecting an appropriate masking strategy. Previous studies have explored predefined masking techniques, including random and tube-base… ▽ More

    Submitted 13 May, 2025; originally announced May 2025.

  3. arXiv:2412.13966  [pdf, other

    cs.LG physics.data-an

    Comparative Analysis of Machine Learning-Based Imputation Techniques for Air Quality Datasets with High Missing Data Rates

    Authors: Sen Yan, David J. O'Connor, Xiaojun Wang, Noel E. O'Connor, Alan F. Smeaton, Mingming Liu

    Abstract: Urban pollution poses serious health risks, particularly in relation to traffic-related air pollution, which remains a major concern in many cities. Vehicle emissions contribute to respiratory and cardiovascular issues, especially for vulnerable and exposed road users like pedestrians and cyclists. Therefore, accurate air quality monitoring with high spatial resolution is vital for good urban envi… ▽ More

    Submitted 25 December, 2024; v1 submitted 18 December, 2024; originally announced December 2024.

    Comments: Accepted by IEEE CIETES 2025, with 8 pages, 3 figures, and 2 tables

  4. arXiv:2412.09160  [pdf, other

    cs.CV

    Pinpoint Counterfactuals: Reducing social bias in foundation models via localized counterfactual generation

    Authors: Kirill Sirotkin, Marcos Escudero-Viñolo, Pablo Carballeira, Mayug Maniparambil, Catarina Barata, Noel E. O'Connor

    Abstract: Foundation models trained on web-scraped datasets propagate societal biases to downstream tasks. While counterfactual generation enables bias analysis, existing methods introduce artifacts by modifying contextual elements like clothing and background. We present a localized counterfactual generation method that preserves image context by constraining counterfactual modifications to specific attrib… ▽ More

    Submitted 12 December, 2024; originally announced December 2024.

  5. arXiv:2409.19425  [pdf, other

    cs.CV

    Harnessing Frozen Unimodal Encoders for Flexible Multimodal Alignment

    Authors: Mayug Maniparambil, Raiymbek Akshulakov, Yasser Abdelaziz Dahou Djilali, Sanath Narayan, Ankit Singh, Noel E. O'Connor

    Abstract: Recent contrastive multimodal vision-language models like CLIP have demonstrated robust open-world semantic understanding, becoming the standard image backbones for vision-language applications. However, recent findings suggest high semantic similarity between well-trained unimodal encoders, which raises a key question: Is there a plausible way to connect unimodal backbones for vision-language tas… ▽ More

    Submitted 23 March, 2025; v1 submitted 28 September, 2024; originally announced September 2024.

    Comments: Accepted CVPR 2025; First two authors contributed equally;

  6. arXiv:2408.00006  [pdf, other

    cs.DC cs.LG

    Synthetic Time Series for Anomaly Detection in Cloud Microservices

    Authors: Mohamed Allam, Noureddine Boujnah, Noel E. O'Connor, Mingming Liu

    Abstract: This paper proposes a framework for time series generation built to investigate anomaly detection in cloud microservices. In the field of cloud computing, ensuring the reliability of microservices is of paramount concern and yet a remarkably challenging task. Despite the large amount of research in this area, validation of anomaly detection algorithms in realistic environments is difficult to achi… ▽ More

    Submitted 21 July, 2024; originally announced August 2024.

    Comments: The paper has been accepted by the 10th International Conference on Machine Learning, Optimization and Data Science

  7. arXiv:2407.05528  [pdf, other

    cs.CV

    An accurate detection is not all you need to combat label noise in web-noisy datasets

    Authors: Paul Albert, Jack Valmadre, Eric Arazo, Tarun Krishna, Noel E. O'Connor, Kevin McGuinness

    Abstract: Training a classifier on web-crawled data demands learning algorithms that are robust to annotation errors and irrelevant examples. This paper builds upon the recent empirical observation that applying unsupervised contrastive learning to noisy, web-crawled datasets yields a feature representation under which the in-distribution (ID) and out-of-distribution (OOD) samples are linearly separable. We… ▽ More

    Submitted 7 July, 2024; originally announced July 2024.

    Comments: Accepted in the European Conference on Computer Vision (ECCV) 2024

  8. arXiv:2404.06941  [pdf, other

    eess.IV cs.CV

    Accelerating Cardiac MRI Reconstruction with CMRatt: An Attention-Driven Approach

    Authors: Anam Hashmi, Julia Dietlmeier, Kathleen M. Curran, Noel E. O'Connor

    Abstract: Cine cardiac magnetic resonance (CMR) imaging is recognised as the benchmark modality for the comprehensive assessment of cardiac function. Nevertheless, the acquisition process of cine CMR is considered as an impediment due to its prolonged scanning time. One commonly used strategy to expedite the acquisition process is through k-space undersampling, though it comes with a drawback of introducing… ▽ More

    Submitted 10 April, 2024; originally announced April 2024.

    Comments: This paper has been submitted for the 32nd European Signal Processing Conference EUSIPCO 2024 in Lyon

  9. arXiv:2404.06362  [pdf, other

    cs.CV cs.AI

    Test-Time Adaptation with SaLIP: A Cascade of SAM and CLIP for Zero shot Medical Image Segmentation

    Authors: Sidra Aleem, Fangyijie Wang, Mayug Maniparambil, Eric Arazo, Julia Dietlmeier, Guenole Silvestre, Kathleen Curran, Noel E. O'Connor, Suzanne Little

    Abstract: The Segment Anything Model (SAM) and CLIP are remarkable vision foundation models (VFMs). SAM, a prompt driven segmentation model, excels in segmentation tasks across diverse domains, while CLIP is renowned for its zero shot recognition capabilities. However, their unified potential has not yet been explored in medical image segmentation. To adapt SAM to medical imaging, existing methods primarily… ▽ More

    Submitted 30 April, 2024; v1 submitted 9 April, 2024; originally announced April 2024.

  10. arXiv:2401.05224  [pdf, other

    cs.CV cs.AI cs.CL cs.LG

    Do Vision and Language Encoders Represent the World Similarly?

    Authors: Mayug Maniparambil, Raiymbek Akshulakov, Yasser Abdelaziz Dahou Djilali, Sanath Narayan, Mohamed El Amine Seddik, Karttikeya Mangalam, Noel E. O'Connor

    Abstract: Aligned text-image encoders such as CLIP have become the de facto model for vision-language tasks. Furthermore, modality-specific encoders achieve impressive performances in their respective domains. This raises a central question: does an alignment exist between uni-modal vision and language encoders since they fundamentally represent the same physical world? Analyzing the latent spaces structure… ▽ More

    Submitted 22 March, 2024; v1 submitted 10 January, 2024; originally announced January 2024.

    Comments: Accepted CVPR 2024

  11. arXiv:2311.16514  [pdf, other

    cs.CV cs.AI cs.LG

    Video Anomaly Detection via Spatio-Temporal Pseudo-Anomaly Generation : A Unified Approach

    Authors: Ayush K. Rai, Tarun Krishna, Feiyan Hu, Alexandru Drimbarean, Kevin McGuinness, Alan F. Smeaton, Noel E. O'Connor

    Abstract: Video Anomaly Detection (VAD) is an open-set recognition task, which is usually formulated as a one-class classification (OCC) problem, where training data is comprised of videos with normal instances while test data contains both normal and anomalous instances. Recent works have investigated the creation of pseudo-anomalies (PAs) using only the normal data and making strong assumptions about real… ▽ More

    Submitted 7 April, 2024; v1 submitted 27 November, 2023; originally announced November 2023.

    Comments: Accepted in CVPRW 2024 - VAND Workshop

  12. Breathing Green: Maximising Health and Environmental Benefits for Active Transportation Users Leveraging Large Scale Air Quality Data

    Authors: Sen Yan, Shaoshu Zhu, Jaime B. Fernandez, Eric Arazo Sánchez, Yingqi Gu, Noel E. O'Connor, David O'Connor, Mingming Liu

    Abstract: Pollution in urban areas can have significant adverse effects on the health and well-being of citizens, with traffic-related air pollution being a major concern in many cities. Pollutants emitted by vehicles, such as nitrogen oxides, carbon monoxide, and particulate matter, can cause respiratory and cardiovascular problems, particularly for vulnerable road users like pedestrians and cyclists. Furt… ▽ More

    Submitted 18 July, 2024; v1 submitted 28 July, 2023; originally announced July 2023.

    Comments: The manuscript has been accepted by the IEEE ITSC 2023

  13. arXiv:2307.12033  [pdf, other

    cs.CV

    Self-Supervised and Semi-Supervised Polyp Segmentation using Synthetic Data

    Authors: Enric Moreu, Eric Arazo, Kevin McGuinness, Noel E. O'Connor

    Abstract: Early detection of colorectal polyps is of utmost importance for their treatment and for colorectal cancer prevention. Computer vision techniques have the potential to aid professionals in the diagnosis stage, where colonoscopies are manually carried out to examine the entirety of the patient's colon. The main challenge in medical imaging is the lack of data, and a further challenge specific to po… ▽ More

    Submitted 22 July, 2023; originally announced July 2023.

  14. arXiv:2307.11661  [pdf, other

    cs.CV cs.AI cs.CL cs.LG

    Enhancing CLIP with GPT-4: Harnessing Visual Descriptions as Prompts

    Authors: Mayug Maniparambil, Chris Vorster, Derek Molloy, Noel Murphy, Kevin McGuinness, Noel E. O'Connor

    Abstract: Contrastive pretrained large Vision-Language Models (VLMs) like CLIP have revolutionized visual representation learning by providing good performance on downstream datasets. VLMs are 0-shot adapted to a downstream dataset by designing prompts that are relevant to the dataset. Such prompt engineering makes use of domain expertise and a validation dataset. Meanwhile, recent developments in generativ… ▽ More

    Submitted 8 August, 2023; v1 submitted 21 July, 2023; originally announced July 2023.

    Comments: Paper accepted at ICCV-W 2023. V2 contains additional comparisons with concurrent works

  15. Joint one-sided synthetic unpaired image translation and segmentation for colorectal cancer prevention

    Authors: Enric Moreu, Eric Arazo, Kevin McGuinness, Noel E. O'Connor

    Abstract: Deep learning has shown excellent performance in analysing medical images. However, datasets are difficult to obtain due privacy issues, standardization problems, and lack of annotations. We address these problems by producing realistic synthetic images using a combination of 3D technologies and generative adversarial networks. We propose CUT-seg, a joint training where a segmentation model and a… ▽ More

    Submitted 20 July, 2023; originally announced July 2023.

    Comments: arXiv admin note: substantial text overlap with arXiv:2202.08680

  16. Fashion CUT: Unsupervised domain adaptation for visual pattern classification in clothes using synthetic data and pseudo-labels

    Authors: Enric Moreu, Alex Martinelli, Martina Naughton, Philip Kelly, Noel E. O'Connor

    Abstract: Accurate product information is critical for e-commerce stores to allow customers to browse, filter, and search for products. Product data quality is affected by missing or incorrect information resulting in poor customer experience. While machine learning can be used to correct inaccurate or missing information, achieving high performance on fashion image classification tasks requires large amoun… ▽ More

    Submitted 9 May, 2023; originally announced May 2023.

  17. U-Park: A User-Centric Smart Parking Recommendation System for Electric Shared Micromobility Services

    Authors: Sen Yan, Noel E. O'Connor, Mingming Liu

    Abstract: Electric Shared Micromobility Services (ESMS) has become a vital element within the Mobility as a Service framework, contributing to sustainable transportation systems. However, existing ESMS face notable design challenges such as shortcomings in integration, transparency, and user-centred approaches, resulting in increased operational costs and decreased service quality. A key operational issue f… ▽ More

    Submitted 18 July, 2024; v1 submitted 6 March, 2023; originally announced March 2023.

    Comments: The manuscript has been accepted by the IEEE Transactions on Artificial Intelligence. This manuscript includes 15 pages with 11 figures and 6 tables

  18. arXiv:2301.13019  [pdf, other

    cs.RO cs.LG

    Identifying Expert Behavior in Offline Training Datasets Improves Behavioral Cloning of Robotic Manipulation Policies

    Authors: Qiang Wang, Robert McCarthy, David Cordova Bulens, Francisco Roldan Sanchez, Kevin McGuinness, Noel E. O'Connor, Stephen J. Redmond

    Abstract: This paper presents our solution for the Real Robot Challenge (RRC) III, a competition featured in the NeurIPS 2022 Competition Track, aimed at addressing dexterous robotic manipulation tasks through learning from pre-collected offline data. Participants were provided with two types of datasets for each task: expert and mixed datasets with varying skill levels. While the simplest offline policy le… ▽ More

    Submitted 21 September, 2023; v1 submitted 30 January, 2023; originally announced January 2023.

  19. arXiv:2301.11734  [pdf, other

    cs.LG cs.RO

    Improving Behavioural Cloning with Positive Unlabeled Learning

    Authors: Qiang Wang, Robert McCarthy, David Cordova Bulens, Kevin McGuinness, Noel E. O'Connor, Nico Gürtler, Felix Widmaier, Francisco Roldan Sanchez, Stephen J. Redmond

    Abstract: Learning control policies offline from pre-recorded datasets is a promising avenue for solving challenging real-world problems. However, available datasets are typically of mixed quality, with a limited number of the trajectories that we would consider as positive examples; i.e., high-quality demonstrations. Therefore, we propose a novel iterative learning algorithm for identifying expert trajecto… ▽ More

    Submitted 21 September, 2023; v1 submitted 27 January, 2023; originally announced January 2023.

  20. arXiv:2301.09164  [pdf, other

    cs.LG cs.CV

    Unifying Synergies between Self-supervised Learning and Dynamic Computation

    Authors: Tarun Krishna, Ayush K Rai, Alexandru Drimbarean, Eric Arazo, Paul Albert, Alan F Smeaton, Kevin McGuinness, Noel E O'Connor

    Abstract: Computationally expensive training strategies make self-supervised learning (SSL) impractical for resource constrained industrial settings. Techniques like knowledge distillation (KD), dynamic computation (DC), and pruning are often used to obtain a lightweightmodel, which usually involves multiple epochs of fine-tuning (or distilling steps) of a large pre-trained model, making it more computation… ▽ More

    Submitted 9 September, 2023; v1 submitted 22 January, 2023; originally announced January 2023.

    Comments: Accepted in BMVC 2023

  21. arXiv:2210.05574  [pdf, other

    cs.CV cs.AI cs.LG

    Motion Aware Self-Supervision for Generic Event Boundary Detection

    Authors: Ayush K. Rai, Tarun Krishna, Julia Dietlmeier, Kevin McGuinness, Alan F. Smeaton, Noel E. O'Connor

    Abstract: The task of Generic Event Boundary Detection (GEBD) aims to detect moments in videos that are naturally perceived by humans as generic and taxonomy-free event boundaries. Modeling the dynamically evolving temporal and spatial changes in a video makes GEBD a difficult problem to solve. Existing approaches involve very complex and sophisticated pipelines in terms of architectural design choices, hen… ▽ More

    Submitted 12 October, 2022; v1 submitted 11 October, 2022; originally announced October 2022.

    Comments: Accepted in IEEE/CVF Winter Conference on Applications of Computer Vision (WACV) 2023

  22. arXiv:2210.04578  [pdf, other

    cs.CV cs.LG

    Is your noise correction noisy? PLS: Robustness to label noise with two stage detection

    Authors: Paul Albert, Eric Arazo, Tarun Krishna, Noel E. O'Connor, Kevin McGuinness

    Abstract: Designing robust algorithms capable of training accurate neural networks on uncurated datasets from the web has been the subject of much research as it reduces the need for time consuming human labor. The focus of many previous research contributions has been on the detection of different types of label noise; however, this paper proposes to improve the correction accuracy of noisy samples once th… ▽ More

    Submitted 15 October, 2022; v1 submitted 10 October, 2022; originally announced October 2022.

    Comments: 9 pages 4 figures. Accepted at WACV 2023

  23. arXiv:2209.09714  [pdf, other

    eess.IV cs.CV

    Cardiac Segmentation using Transfer Learning under Respiratory Motion Artifacts

    Authors: Carles Garcia-Cabrera, Eric Arazo, Kathleen M. Curran, Noel E. O'Connor, Kevin McGuinness

    Abstract: Methods that are resilient to artifacts in the cardiac magnetic resonance imaging (MRI) while performing ventricle segmentation, are crucial for ensuring quality in structural and functional analysis of those tissues. While there has been significant efforts on improving the quality of the algorithms, few works have tackled the harm that the artifacts generate in the predictions. In this work, we… ▽ More

    Submitted 20 September, 2022; originally announced September 2022.

    Comments: accepted for the STACOM2022 workshop @ MICCAI2022

  24. arXiv:2207.12065  [pdf, other

    cs.CV

    Dynamic Channel Selection in Self-Supervised Learning

    Authors: Tarun Krishna, Ayush K. Rai, Yasser A. D. Djilali, Alan F. Smeaton, Kevin McGuinness, Noel E. O'Connor

    Abstract: Whilst computer vision models built using self-supervised approaches are now commonplace, some important questions remain. Do self-supervised models learn highly redundant channel features? What if a self-supervised network could dynamically select the important channels and get rid of the unnecessary ones? Currently, convnets pre-trained with self-supervision have obtained comparable performance… ▽ More

    Submitted 16 December, 2022; v1 submitted 25 July, 2022; originally announced July 2022.

    Comments: Accepted in Irish Machine Vision and Image Processing Conference 2022

  25. arXiv:2207.01573  [pdf, other

    cs.CV

    Embedding contrastive unsupervised features to cluster in- and out-of-distribution noise in corrupted image datasets

    Authors: Paul Albert, Eric Arazo, Noel E. O'Connor, Kevin McGuinness

    Abstract: Using search engines for web image retrieval is a tempting alternative to manual curation when creating an image dataset, but their main drawback remains the proportion of incorrect (noisy) samples retrieved. These noisy samples have been evidenced by previous works to be a mixture of in-distribution (ID) samples, assigned to the incorrect category but presenting similar visual semantics to other… ▽ More

    Submitted 18 July, 2022; v1 submitted 4 July, 2022; originally announced July 2022.

    Comments: Accepted at ECCV 2022

  26. arXiv:2204.09343  [pdf

    cs.CV

    Utilizing unsupervised learning to improve sward content prediction and herbage mass estimation

    Authors: Paul Albert, Mohamed Saadeldin, Badri Narayanan, Brian Mac Namee, Deirdre Hennessy, Aisling H. O'Connor, Noel E. O'Connor, Kevin McGuinness

    Abstract: Sward species composition estimation is a tedious one. Herbage must be collected in the field, manually separated into components, dried and weighed to estimate species composition. Deep learning approaches using neural networks have been used in previous work to propose faster and more cost efficient alternatives to this process by estimating the biomass information from a picture of an area of p… ▽ More

    Submitted 20 April, 2022; originally announced April 2022.

    Comments: 3 pages. Accepted at the 29th EGF General Meeting 2022

  27. arXiv:2204.08271  [pdf, other

    cs.CV

    Unsupervised domain adaptation and super resolution on drone images for autonomous dry herbage biomass estimation

    Authors: Paul Albert, Mohamed Saadeldin, Badri Narayanan, Jaime Fernandez, Brian Mac Namee, Deirdre Hennessey, Noel E. O'Connor, Kevin McGuinness

    Abstract: Herbage mass yield and composition estimation is an important tool for dairy farmers to ensure an adequate supply of high quality herbage for grazing and subsequently milk production. By accurately estimating herbage mass and composition, targeted nitrogen fertiliser application strategies can be deployed to improve localised regions in a herbage field, effectively reducing the negative impacts of… ▽ More

    Submitted 18 April, 2022; originally announced April 2022.

    Comments: 11 pages, 5 figures. Accepted at the Agriculture-Vision CVPR 2022 Workshop

  28. Parking Behaviour Analysis of Shared E-Bike Users Based on a Real-World Dataset -- A Case Study in Dublin, Ireland

    Authors: Sen Yan, Mingming Liu, Noel E. O'Connor

    Abstract: In recent years, an increasing number of shared E-bikes have been rolling out rapidly in our cities. It therefore becomes important to understand new behaviour patterns of the cyclists in using these E-bikes as a foundation for the novel design of shared micromobility services as part of the realisation for next generation intelligent transportation systems. In this paper, we deeply investigate th… ▽ More

    Submitted 16 March, 2022; originally announced March 2022.

    Comments: The manuscript has been accepted by the IEEE VTC 2022-Spring Conference

  29. arXiv:2202.08680  [pdf, other

    eess.IV cs.CV

    Synthetic data for unsupervised polyp segmentation

    Authors: Enric Moreu, Kevin McGuinness, Noel E. O'Connor

    Abstract: Deep learning has shown excellent performance in analysing medical images. However, datasets are difficult to obtain due privacy issues, standardization problems, and lack of annotations. We address these problems by producing realistic synthetic images using a combination of 3D technologies and generative adversarial networks. We use zero annotations from medical professionals in our pipeline. Ou… ▽ More

    Submitted 17 February, 2022; originally announced February 2022.

  30. arXiv:2202.08670  [pdf, other

    cs.CV cs.AI

    Domain Randomization for Object Counting

    Authors: Enric Moreu, Kevin McGuinness, Diego Ortego, Noel E. O'Connor

    Abstract: Recently, the use of synthetic datasets based on game engines has been shown to improve the performance of several tasks in computer vision. However, these datasets are typically only appropriate for the specific domains depicted in computer games, such as urban scenes involving vehicles and people. In this paper, we present an approach to generate synthetic datasets for object counting for any do… ▽ More

    Submitted 17 February, 2022; originally announced February 2022.

  31. arXiv:2201.10243  [pdf, other

    cs.CV cs.LG

    BERTHA: Video Captioning Evaluation Via Transfer-Learned Human Assessment

    Authors: Luis Lebron, Yvette Graham, Kevin McGuinness, Konstantinos Kouramas, Noel E. O'Connor

    Abstract: Evaluating video captioning systems is a challenging task as there are multiple factors to consider; for instance: the fluency of the caption, multiple actions happening in a single scene, and the human bias of what is considered important. Most metrics try to measure how similar the system generated captions are to a single or a set of human-annotated captions. This paper presents a new method ba… ▽ More

    Submitted 16 May, 2022; v1 submitted 25 January, 2022; originally announced January 2022.

    Comments: In press in Language Resources and Evaluation Conference(LREC) 2022

  32. arXiv:2111.09056  [pdf, other

    cs.CV cs.CY cs.MM

    Improving Person Re-Identification with Temporal Constraints

    Authors: Julia Dietlmeier, Feiyan Hu, Frances Ryan, Noel E. O'Connor, Kevin McGuinness

    Abstract: In this paper we introduce an image-based person re-identification dataset collected across five non-overlapping camera views in the large and busy airport in Dublin, Ireland. Unlike all publicly available image-based datasets, our dataset contains timestamp information in addition to frame number, and camera and person IDs. Also our dataset has been fully anonymized to comply with modern data pri… ▽ More

    Submitted 17 November, 2021; originally announced November 2021.

    Comments: 10 pages, RWS @ WACV2022

  33. arXiv:2110.14283  [pdf, other

    cs.CV

    How Important is Importance Sampling for Deep Budgeted Training?

    Authors: Eric Arazo, Diego Ortego, Paul Albert, Noel E. O'Connor, Kevin McGuinness

    Abstract: Long iterative training processes for Deep Neural Networks (DNNs) are commonly required to achieve state-of-the-art performance in many computer vision tasks. Importance sampling approaches might play a key role in budgeted training regimes, i.e. when limiting the number of training iterations. These approaches aim at dynamically estimating the importance of each sample to focus on the most releva… ▽ More

    Submitted 27 October, 2021; originally announced October 2021.

    Comments: British Machine Vision Conference (BMVC) 2021, oral presentation

  34. arXiv:2106.10090  [pdf, other

    cs.CV cs.AI

    Discerning Generic Event Boundaries in Long-Form Wild Videos

    Authors: Ayush K Rai, Tarun Krishna, Julia Dietlmeier, Kevin McGuinness, Alan F Smeaton, Noel E O'Connor

    Abstract: Detecting generic, taxonomy-free event boundaries invideos represents a major stride forward towards holisticvideo understanding. In this paper we present a technique forgeneric event boundary detection based on a two stream in-flated 3D convolutions architecture, which can learn spatio-temporal features from videos. Our work is inspired from theGeneric Event Boundary Detection Challenge (part of… ▽ More

    Submitted 18 June, 2021; originally announced June 2021.

    Comments: Technical Report for Generic Event Boundary Challenge - LOVEU Challenge (CVPR 2021)

  35. arXiv:2105.09460  [pdf, other

    cs.NI eess.SY

    Optimal Distributed Bandwidth Allocation in NB-IoT Networks

    Authors: Hongde Wu, Zhengyong Chen, Noel E. O'Connor, Mingming Liu

    Abstract: In this paper, we investigate a key problem of Narrowband-Internet of Things (NB-IoT) in the context of 5G with Mobile Edge Computing (MEC). We address the challenge that IoT devices may have different priorities when demanding bandwidth for data transmission in specific applications and services. Due to the scarcity of bandwidth in an MEC enabled IoT network, our objective is to optimize bandwidt… ▽ More

    Submitted 5 March, 2021; originally announced May 2021.

    Comments: The paper has been accepted by the 6th ACM/IEEE Conference on Internet of Things Design and Implementation

  36. arXiv:2104.10644  [pdf, other

    cs.LG eess.SY

    A Comparative Study of Using Spatial-Temporal Graph Convolutional Networks for Predicting Availability in Bike Sharing Schemes

    Authors: Zhengyong Chen, Hongde Wu, Noel E. O'Connor, Mingming Liu

    Abstract: Accurately forecasting transportation demand is crucial for efficient urban traffic guidance, control and management. One solution to enhance the level of prediction accuracy is to leverage graph convolutional networks (GCN), a neural network based modelling approach with the ability to process data contained in graph based structures. As a powerful extension of GCN, a spatial-temporal graph convo… ▽ More

    Submitted 6 July, 2021; v1 submitted 21 April, 2021; originally announced April 2021.

    Comments: This manuscript has been accepted at the IEEE ITSC 2021

  37. arXiv:2104.07614  [pdf, other

    eess.SY

    An ADMM-based Optimal Transmission Frequency Management System for IoT Edge Intelligence

    Authors: Hongde Wu, Noel E. O'Connor, Jennifer Bruton, Mingming Liu

    Abstract: In this paper, we investigate a key problem of Internet of Things (IoT) applications in practice. Our research objective is to optimize the transmission frequencies for a group of IoT edge devices under practical constraints. Our key assumption is that different IoT devices may have different priority levels when transmitting data in a resource-constrained environment and that those priority level… ▽ More

    Submitted 15 April, 2021; originally announced April 2021.

    Comments: The paper has been accepted at the 7th IEEE World Forum on Internet of Things (IEEE WF-IoT)

  38. arXiv:2103.00548  [pdf, other

    eess.SY

    An Intelligent Multi-Speed Advisory System using Improved Whale Optimisation Algorithm

    Authors: Beiran Chen, Mingming Liu, Yi Zhang, Zhengyong Chen, Yingqi Gu, Noel E. O'Connor

    Abstract: An intelligent speed advisory system can be used to recommend speed for vehicles travelling in a given road network in cities. In this paper, we extend our previous work where a distributed speed advisory system has been devised to recommend an optimal consensus speed for a fleet of Internal Combustion Engine Vehicles (ICEVs) in a highway scenario. In particular, we propose a novel optimisation fr… ▽ More

    Submitted 28 February, 2021; originally announced March 2021.

    Comments: This paper has been accepted by IEEE VTC2021-Spring for presentation

  39. arXiv:2102.04993  [pdf, other

    eess.IV cs.CC cs.CV cs.LG cs.MM

    Attention-Based Neural Networks for Chroma Intra Prediction in Video Coding

    Authors: Marc Górriz, Saverio Blasi, Alan F. Smeaton, Noel E. O'Connor, Marta Mrak

    Abstract: Neural networks can be successfully used to improve several modules of advanced video coding schemes. In particular, compression of colour components was shown to greatly benefit from usage of machine learning models, thanks to the design of appropriate attention-based architectures that allow the prediction to exploit specific samples in the reference region. However, such architectures tend to b… ▽ More

    Submitted 9 February, 2021; originally announced February 2021.

    Journal ref: IEEE Journal of Selected Topics in Signal Processing, 2020

  40. arXiv:2101.06451  [pdf, other

    eess.SY

    MPC-CSAS: Multi-Party Computation for Real-time Privacy-preserving Speed Advisory Systems

    Authors: Mingming Liu, Long Cheng, Yingqi Gu, Ying Wang, Qingzhi Liu, Noel E. O'Connor

    Abstract: As a part of Advanced Driver Assistance Systems (ADASs), Consensus-based Speed Advisory Systems (CSAS) have been proposed to recommend a common speed to a group of vehicles for specific application purposes, such as emission control and energy management. With Vehicle-to-Vehicle (V2V), Vehicle-to-Infrastructure (V2I) technologies and advanced control theories in place, state-of-the-art CSAS can be… ▽ More

    Submitted 16 January, 2021; originally announced January 2021.

    Comments: This manuscript has been accepted by the IEEE Transactions on Intelligent Transportation Systems

  41. arXiv:2012.15641  [pdf, other

    cs.MM cs.AI cs.CV

    Investigating Memorability of Dynamic Media

    Authors: Phuc H. Le-Khac, Ayush K. Rai, Graham Healy, Alan F. Smeaton, Noel E. O'Connor

    Abstract: The Predicting Media Memorability task in MediaEval'20 has some challenging aspects compared to previous years. In this paper we identify the high-dynamic content in videos and dataset of limited size as the core challenges for the task, we propose directions to overcome some of these challenges and we present our initial result in these directions.

    Submitted 31 December, 2020; originally announced December 2020.

    Comments: 3 pages, 1 figure. 1 table

    Journal ref: MediaEval Multimedia Benchmark Workshop Working Notes, 14-15 December 2020

  42. arXiv:2012.04462  [pdf, other

    cs.CV

    Multi-Objective Interpolation Training for Robustness to Label Noise

    Authors: Diego Ortego, Eric Arazo, Paul Albert, Noel E. O'Connor, Kevin McGuinness

    Abstract: Deep neural networks trained with standard cross-entropy loss memorize noisy labels, which degrades their performance. Most research to mitigate this memorization proposes new robust classification loss functions. Conversely, we propose a Multi-Objective Interpolation Training (MOIT) approach that jointly exploits contrastive learning and classification to mutually help each other and boost perfor… ▽ More

    Submitted 18 March, 2021; v1 submitted 8 December, 2020; originally announced December 2020.

    Comments: Accepted to CVPR 2021. 10 pages, 1 figure, and 9 tables

  43. arXiv:2011.07616  [pdf, other

    cs.SD cs.LG eess.AS

    Unsupervised Contrastive Learning of Sound Event Representations

    Authors: Eduardo Fonseca, Diego Ortego, Kevin McGuinness, Noel E. O'Connor, Xavier Serra

    Abstract: Self-supervised representation learning can mitigate the limitations in recognition tasks with few manually labeled data but abundant unlabeled data---a common scenario in sound event research. In this work, we explore unsupervised contrastive learning as a way to learn sound event representations. To this end, we propose to use the pretext task of contrasting differently augmented views of sound… ▽ More

    Submitted 15 November, 2020; originally announced November 2020.

    Comments: A 4-page version is submitted to ICASSP 2021

  44. arXiv:2010.06307  [pdf, other

    cs.CV cs.AI cs.LG

    How important are faces for person re-identification?

    Authors: Julia Dietlmeier, Joseph Antony, Kevin McGuinness, Noel E. O'Connor

    Abstract: This paper investigates the dependence of existing state-of-the-art person re-identification models on the presence and visibility of human faces. We apply a face detection and blurring algorithm to create anonymized versions of several popular person re-identification datasets including Market1501, DukeMTMC-reID, CUHK03, Viper, and Airport. Using a cross-section of existing state-of-the-art model… ▽ More

    Submitted 13 October, 2020; originally announced October 2020.

    Comments: 25th International Conference on Pattern Recognition (ICPR2020), Milan, Italy, 10-15 January 2021

  45. arXiv:2008.00106  [pdf, other

    cs.CV

    Utilising Visual Attention Cues for Vehicle Detection and Tracking

    Authors: Feiyan Hu, Venkatesh G M, Noel E. O'Connor, Alan F. Smeaton, Suzanne Little

    Abstract: Advanced Driver-Assistance Systems (ADAS) have been attracting attention from many researchers. Vision-based sensors are the closest way to emulate human driver visual behavior while driving. In this paper, we explore possible ways to use visual attention (saliency) for object detection and tracking. We investigate: 1) How a visual attention map such as a \emph{subjectness} attention or saliency m… ▽ More

    Submitted 31 July, 2020; originally announced August 2020.

    Comments: Accepted in ICPR2020

  46. arXiv:2007.11866  [pdf, other

    cs.CV

    Reliable Label Bootstrapping for Semi-Supervised Learning

    Authors: Paul Albert, Diego Ortego, Eric Arazo, Noel E. O'Connor, Kevin McGuinness

    Abstract: Reducing the amount of labels required to train convolutional neural networks without performance degradation is key to effectively reduce human annotation efforts. We propose Reliable Label Bootstrapping (ReLaB), an unsupervised preprossessing algorithm which improves the performance of semi-supervised algorithms in extremely low supervision settings. Given a dataset with few labeled samples, we… ▽ More

    Submitted 25 February, 2021; v1 submitted 23 July, 2020; originally announced July 2020.

    Comments: 10 pages, 3 figures

  47. arXiv:2006.15349  [pdf, other

    eess.IV cs.CC cs.CV cs.LG cs.MM

    Chroma Intra Prediction with attention-based CNN architectures

    Authors: Marc Górriz, Saverio Blasi, Alan F. Smeaton, Noel E. O'Connor, Marta Mrak

    Abstract: Neural networks can be used in video coding to improve chroma intra-prediction. In particular, usage of fully-connected networks has enabled better cross-component prediction with respect to traditional linear models. Nonetheless, state-of-the-art architectures tend to disregard the location of individual reference samples in the prediction process. This paper proposes a new neural network archite… ▽ More

    Submitted 27 June, 2020; originally announced June 2020.

    Comments: 27th IEEE International Conference on Image Processing, 25-28 Oct 2020, Abu Dhabi, United Arab Emirates

  48. arXiv:2006.06392  [pdf, other

    eess.IV cs.CC cs.CV cs.LG cs.MM

    Interpreting CNN for Low Complexity Learned Sub-pixel Motion Compensation in Video Coding

    Authors: Luka Murn, Saverio Blasi, Alan F. Smeaton, Noel E. O'Connor, Marta Mrak

    Abstract: Deep learning has shown great potential in image and video compression tasks. However, it brings bit savings at the cost of significant increases in coding complexity, which limits its potential for implementation within practical applications. In this paper, a novel neural network-based tool is presented which improves the interpolation of reference samples needed for fractional precision motion… ▽ More

    Submitted 11 June, 2020; originally announced June 2020.

    Comments: 27th IEEE International Conference on Image Processing, 25-28 Oct 2020, Abu Dhabi, United Arab Emirates

    Journal ref: 2020 IEEE International Conference on Image Processing (ICIP), 2020, pp. 798-802

  49. arXiv:2005.00430  [pdf, other

    cs.CV

    Investigating Class-level Difficulty Factors in Multi-label Classification Problems

    Authors: Mark Marsden, Kevin McGuinness, Joseph Antony, Haolin Wei, Milan Redzic, Jian Tang, Zhilan Hu, Alan Smeaton, Noel E O'Connor

    Abstract: This work investigates the use of class-level difficulty factors in multi-label classification problems for the first time. Four class-level difficulty factors are proposed: frequency, visual variation, semantic abstraction, and class co-occurrence. Once computed for a given multi-label classification dataset, these difficulty factors are shown to have several potential applications including the… ▽ More

    Submitted 1 May, 2020; originally announced May 2020.

    Comments: Published in ICME 2020

  50. arXiv:1912.08741  [pdf, other

    cs.CV

    Towards Robust Learning with Different Label Noise Distributions

    Authors: Diego Ortego, Eric Arazo, Paul Albert, Noel E. O'Connor, Kevin McGuinness

    Abstract: Noisy labels are an unavoidable consequence of labeling processes and detecting them is an important step towards preventing performance degradations in Convolutional Neural Networks. Discarding noisy labels avoids a harmful memorization, while the associated image content can still be exploited in a semi-supervised learning (SSL) setup. Clean samples are usually identified using the small loss tr… ▽ More

    Submitted 27 July, 2020; v1 submitted 18 December, 2019; originally announced December 2019.