Skip to main content

Showing 1–25 of 25 results for author: Prugel-Bennett, A

Searching in archive cs. Search in all archives.
.
  1. arXiv:2504.02464  [pdf, other

    cs.CV cs.AI

    CornerPoint3D: Look at the Nearest Corner Instead of the Center

    Authors: Ruixiao Zhang, Runwei Guan, Xiangyu Chen, Adam Prugel-Bennett, Xiaohao Cai

    Abstract: 3D object detection aims to predict object centers, dimensions, and rotations from LiDAR point clouds. Despite its simplicity, LiDAR captures only the near side of objects, making center-based detectors prone to poor localization accuracy in cross-domain tasks with varying point distributions. Meanwhile, existing evaluation metrics designed for single-domain assessment also suffer from overfitting… ▽ More

    Submitted 3 April, 2025; originally announced April 2025.

    Comments: arXiv admin note: substantial text overlap with arXiv:2407.04061

  2. arXiv:2501.19271  [pdf, other

    cs.AI cs.LG

    Concept-Based Explainable Artificial Intelligence: Metrics and Benchmarks

    Authors: Halil Ibrahim Aysel, Xiaohao Cai, Adam Prugel-Bennett

    Abstract: Concept-based explanation methods, such as concept bottleneck models (CBMs), aim to improve the interpretability of machine learning models by linking their decisions to human-understandable concepts, under the critical assumption that such concepts can be accurately attributed to the network's feature space. However, this foundational assumption has not been rigorously validated, mainly because t… ▽ More

    Submitted 31 January, 2025; originally announced January 2025.

    Comments: 17 pages it total, 8 main pages

  3. arXiv:2410.23451  [pdf, other

    cs.LG

    Rethinking Deep Thinking: Stable Learning of Algorithms using Lipschitz Constraints

    Authors: Jay Bear, Adam Prügel-Bennett, Jonathon Hare

    Abstract: Iterative algorithms solve problems by taking steps until a solution is reached. Models in the form of Deep Thinking (DT) networks have been demonstrated to learn iterative algorithms in a way that can scale to different sized problems at inference time using recurrent computation and convolutions. However, they are often unstable during training, and have no guarantees of convergence/termination… ▽ More

    Submitted 30 October, 2024; originally announced October 2024.

    Comments: 10 pages (main body), 26 pages (total), 13 figures, 3 tables, submitted to the 38th Conference on Neural Information Processing Systems (NeurIPS 2024)

  4. arXiv:2408.12708  [pdf, other

    cs.CV

    Revisiting Cross-Domain Problem for LiDAR-based 3D Object Detection

    Authors: Ruixiao Zhang, Juheon Lee, Xiaohao Cai, Adam Prugel-Bennett

    Abstract: Deep learning models such as convolutional neural networks and transformers have been widely applied to solve 3D object detection problems in the domain of autonomous driving. While existing models have achieved outstanding performance on most open benchmarks, the generalization ability of these deep networks is still in doubt. To adapt models to other domains including different cities, countries… ▽ More

    Submitted 22 August, 2024; originally announced August 2024.

    Comments: Accepted by the ICONIP 2024

  5. arXiv:2408.08412  [pdf, other

    cs.CV

    Penny-Wise and Pound-Foolish in Deepfake Detection

    Authors: Yabin Wang, Zhiwu Huang, Su Zhou, Adam Prugel-Bennett, Xiaopeng Hong

    Abstract: The diffusion of deepfake technologies has sparked serious concerns about its potential misuse across various domains, prompting the urgent need for robust detection methods. Despite advancement, many current approaches prioritize short-term gains at expense of long-term effectiveness. This paper critiques the overly specialized approach of fine-tuning pre-trained models solely with a penny-wise o… ▽ More

    Submitted 15 August, 2024; originally announced August 2024.

  6. arXiv:2407.04061  [pdf, other

    cs.CV

    Detect Closer Surfaces that can be Seen: New Modeling and Evaluation in Cross-domain 3D Object Detection

    Authors: Ruixiao Zhang, Yihong Wu, Juheon Lee, Adam Prugel-Bennett, Xiaohao Cai

    Abstract: The performance of domain adaptation technologies has not yet reached an ideal level in the current 3D object detection field for autonomous driving, which is mainly due to significant differences in the size of vehicles, as well as the environments they operate in when applied across domains. These factors together hinder the effective transfer and application of knowledge learned from specific d… ▽ More

    Submitted 12 July, 2024; v1 submitted 4 July, 2024; originally announced July 2024.

    Comments: Accepted by the 27th European Conference on Artificial Intelligence (ECAI 2024)

  7. arXiv:2305.15608  [pdf, other

    cs.CV cs.AI

    Semantic Segmentation by Semantic Proportions

    Authors: Halil Ibrahim Aysel, Xiaohao Cai, Adam Prügel-Bennett

    Abstract: Semantic segmentation is a critical task in computer vision aiming to identify and classify individual pixels in an image, with numerous applications in for example autonomous driving and medical image analysis. However, semantic segmentation can be highly challenging particularly due to the need for large amounts of annotated data. Annotating images is a time-consuming and costly process, often r… ▽ More

    Submitted 14 November, 2024; v1 submitted 24 May, 2023; originally announced May 2023.

  8. arXiv:2202.07350  [pdf, other

    cs.LG

    Generalisation and the Risk--Entropy Curve

    Authors: Dominic Belcher, Antonia Marcu, Adam Prügel-Bennett

    Abstract: In this paper we show that the expected generalisation performance of a learning machine is determined by the distribution of risks or equivalently its logarithm -- a quantity we term the risk entropy -- and the fluctuations in a quantity we call the training ratio. We show that the risk entropy can be empirically inferred for deep neural network models using Markov Chain Monte Carlo techniques. R… ▽ More

    Submitted 15 February, 2022; originally announced February 2022.

  9. arXiv:2202.07052  [pdf, other

    cs.LG

    Orthogonalising gradients to speed up neural network optimisation

    Authors: Mark Tuddenham, Adam Prügel-Bennett, Jonathan Hare

    Abstract: The optimisation of neural networks can be sped up by orthogonalising the gradients before the optimisation step, ensuring the diversification of the learned representations. We orthogonalise the gradients of the layer's components/filters with respect to each other to separate out the intermediate representations. Our method of orthogonalisation allows the weights to be used more flexibly, in con… ▽ More

    Submitted 14 February, 2022; originally announced February 2022.

  10. arXiv:2111.11514  [pdf, other

    cs.LG

    On Data-centric Myths

    Authors: Antonia Marcu, Adam Prügel-Bennett

    Abstract: The community lacks theory-informed guidelines for building good data sets. We analyse theoretical directions relating to what aspects of the data matter and conclude that the intuitions derived from the existing literature are incorrect and misleading. Using empirical counter-examples, we show that 1) data dimension should not necessarily be minimised and 2) when manipulating data, preserving the… ▽ More

    Submitted 22 November, 2021; originally announced November 2021.

    Comments: arXiv admin note: text overlap with arXiv:2110.13968

  11. arXiv:2110.13968  [pdf, other

    cs.LG

    On the Effects of Artificial Data Modification

    Authors: Antonia Marcu, Adam Prügel-Bennett

    Abstract: Data distortion is commonly applied in vision models during both training (e.g methods like MixUp and CutMix) and evaluation (e.g. shape-texture bias and robustness). This data modification can introduce artificial information. It is often assumed that the resulting artefacts are detrimental to training, whilst being negligible when analysing models. We investigate these assumptions and conclude t… ▽ More

    Submitted 6 July, 2022; v1 submitted 26 October, 2021; originally announced October 2021.

  12. GeoCLR: Georeference Contrastive Learning for Efficient Seafloor Image Interpretation

    Authors: Takaki Yamada, Adam Prügel-Bennett, Stefan B. Williams, Oscar Pizarro, Blair Thornton

    Abstract: This paper describes Georeference Contrastive Learning of visual Representation (GeoCLR) for efficient training of deep-learning Convolutional Neural Networks (CNNs). The method leverages georeference information by generating a similar image pair using images taken of nearby locations, and contrasting these with an image pair that is far apart. The underlying assumption is that images gathered wi… ▽ More

    Submitted 26 June, 2022; v1 submitted 13 August, 2021; originally announced August 2021.

    Comments: 30 pages, 9 figures

    Journal ref: Field Robotics 2 (2022) 1134-1155

  13. arXiv:2107.12021  [pdf, other

    cs.CV

    Language Models as Zero-shot Visual Semantic Learners

    Authors: Yue Jiao, Jonathon Hare, Adam Prügel-Bennett

    Abstract: Visual Semantic Embedding (VSE) models, which map images into a rich semantic embedding space, have been a milestone in object recognition and zero-shot learning. Current approaches to VSE heavily rely on static word em-bedding techniques. In this work, we propose a Visual Se-mantic Embedding Probe (VSEP) designed to probe the semantic information of contextualized word embeddings in visual semant… ▽ More

    Submitted 26 July, 2021; originally announced July 2021.

  14. arXiv:2107.11991  [pdf, other

    cs.CV

    What Remains of Visual Semantic Embeddings

    Authors: Yue Jiao, Jonathon Hare, Adam Prügel-Bennett

    Abstract: Zero shot learning (ZSL) has seen a surge in interest over the decade for its tight links with the mechanism making young children recognize novel objects. Although different paradigms of visual semantic embedding models are designed to align visual features and distributed word representations, it is unclear to what extent current ZSL models encode semantic information from distributed word repre… ▽ More

    Submitted 26 July, 2021; originally announced July 2021.

  15. arXiv:2105.02964  [pdf, other

    cs.CV cs.LG

    Object detection for crabs in top-view seabed imagery

    Authors: Vlad Velici, Adam Prügel-Bennett

    Abstract: This report presents the application of object detection on a database of underwater images of different species of crabs, as well as aerial images of sea lions and finally the Pascal VOC dataset. The model is an end-to-end object detection neural network based on a convolutional network base and a Long Short-Term Memory detector.

    Submitted 1 May, 2021; originally announced May 2021.

  16. arXiv:2105.00357  [pdf, other

    cs.LG

    RotLSTM: Rotating Memories in Recurrent Neural Networks

    Authors: Vlad Velici, Adam Prügel-Bennett

    Abstract: Long Short-Term Memory (LSTM) units have the ability to memorise and use long-term dependencies between inputs to generate predictions on time series data. We introduce the concept of modifying the cell state (memory) of LSTMs using rotation matrices parametrised by a new set of trainable weights. This addition shows significant increases of performance on some of the tasks from the bAbI dataset.

    Submitted 1 May, 2021; originally announced May 2021.

  17. arXiv:2012.01938  [pdf, other

    cs.LG

    Quasi-Newton's method in the class gradient defined high-curvature subspace

    Authors: Mark Tuddenham, Adam Prügel-Bennett, Jonathan Hare

    Abstract: Classification problems using deep learning have been shown to have a high-curvature subspace in the loss landscape equal in dimension to the number of classes. Moreover, this subspace corresponds to the subspace spanned by the logit gradients for each class. An obvious strategy to speed up optimisation would be to use Newton's method in the high-curvature subspace and stochastic gradient descent… ▽ More

    Submitted 28 November, 2020; originally announced December 2020.

    Journal ref: OPT2020: 12th Annual Workshop on Optimization for Machine Learning

  18. arXiv:2008.07922  [pdf, other

    cs.LG stat.ML

    Linear Disentangled Representations and Unsupervised Action Estimation

    Authors: Matthew Painter, Jonathon Hare, Adam Prugel-Bennett

    Abstract: Disentangled representation learning has seen a surge in interest over recent times, generally focusing on new models which optimise one of many disparate disentanglement metrics. Symmetry Based Disentangled Representation learning introduced a robust mathematical framework that defined precisely what is meant by a "linear disentangled representation". This framework determined that such represent… ▽ More

    Submitted 15 December, 2020; v1 submitted 18 August, 2020; originally announced August 2020.

    Journal ref: Advances in Neural Information Processing Systems 33 (NeurIPS 2020)

  19. arXiv:2002.12047  [pdf, other

    cs.LG cs.CV cs.IT stat.ML

    FMix: Enhancing Mixed Sample Data Augmentation

    Authors: Ethan Harris, Antonia Marcu, Matthew Painter, Mahesan Niranjan, Adam Prügel-Bennett, Jonathon Hare

    Abstract: Mixed Sample Data Augmentation (MSDA) has received increasing attention in recent years, with many successful variants such as MixUp and CutMix. By studying the mutual information between the function learned by a VAE on the original data and on the augmented data we show that MixUp distorts learned functions in a way that CutMix does not. We further demonstrate this by showing that MixUp acts as… ▽ More

    Submitted 28 February, 2021; v1 submitted 27 February, 2020; originally announced February 2020.

    Comments: Code available at https://github.com/ecs-vlc/FMix

  20. arXiv:1911.04301  [pdf, other

    cs.LG stat.ML

    Rethinking Generalisation

    Authors: Antonia Marcu, Adam Prügel-Bennett

    Abstract: In this paper, a new approach to computing the generalisation performance is presented that assumes the distribution of risks, $ρ(r)$, for a learning scenario is known. From this, the expected error of a learning machine using empirical risk minimisation is computed for both classification and regression problems. A critical quantity in determining the generalisation performance is the power-law b… ▽ More

    Submitted 26 March, 2020; v1 submitted 11 November, 2019; originally announced November 2019.

  21. arXiv:1906.06565  [pdf, other

    cs.LG stat.ML

    Deep Set Prediction Networks

    Authors: Yan Zhang, Jonathon Hare, Adam Prügel-Bennett

    Abstract: Current approaches for predicting sets from feature vectors ignore the unordered nature of sets and suffer from discontinuity issues as a result. We propose a general model for predicting sets that properly respects the structure of sets and avoids this problem. With a single feature vector as input, we show that our model is able to auto-encode point sets, predict the set of bounding boxes of obj… ▽ More

    Submitted 24 April, 2020; v1 submitted 15 June, 2019; originally announced June 2019.

    Comments: Appendix C contains an erratum

    Journal ref: Advances in Neural Information Processing Systems 32 (NeurIPS 2019)

  22. arXiv:1906.02795  [pdf, other

    cs.LG cs.AI stat.ML

    FSPool: Learning Set Representations with Featurewise Sort Pooling

    Authors: Yan Zhang, Jonathon Hare, Adam Prügel-Bennett

    Abstract: Traditional set prediction models can struggle with simple datasets due to an issue we call the responsibility problem. We introduce a pooling method for sets of feature vectors based on sorting features across elements of the set. This can be used to construct a permutation-equivariant auto-encoder that avoids this responsibility problem. On a toy dataset of polygons and a set version of MNIST, w… ▽ More

    Submitted 1 May, 2020; v1 submitted 6 June, 2019; originally announced June 2019.

    Comments: Published at International Conference on Learning Representations (ICLR) 2020

  23. arXiv:1812.03928  [pdf, other

    cs.LG cs.CV stat.ML

    Learning Representations of Sets through Optimized Permutations

    Authors: Yan Zhang, Jonathon Hare, Adam Prügel-Bennett

    Abstract: Representations of sets are challenging to learn because operations on sets should be permutation-invariant. To this end, we propose a Permutation-Optimisation module that learns how to permute a set end-to-end. The permuted set can be further processed to learn a permutation-invariant representation of that set, avoiding a bottleneck in traditional set models. We demonstrate our model's ability t… ▽ More

    Submitted 14 January, 2019; v1 submitted 10 December, 2018; originally announced December 2018.

    Comments: Published in ICLR 2019

  24. arXiv:1802.05766  [pdf, other

    cs.CV cs.CL

    Learning to Count Objects in Natural Images for Visual Question Answering

    Authors: Yan Zhang, Jonathon Hare, Adam Prügel-Bennett

    Abstract: Visual Question Answering (VQA) models have struggled with counting objects in natural images so far. We identify a fundamental problem due to soft attention in these models as a cause. To circumvent this problem, we propose a neural network component that allows robust counting from object proposals. Experiments on a toy task show the effectiveness of this component and we obtain state-of-the-art… ▽ More

    Submitted 15 February, 2018; originally announced February 2018.

    Comments: Published in ICLR 2018

  25. arXiv:1311.5022  [pdf, other

    cs.LG cs.DS

    Extended Formulations for Online Linear Bandit Optimization

    Authors: Shaona Ghosh, Adam Prugel-Bennett

    Abstract: On-line linear optimization on combinatorial action sets (d-dimensional actions) with bandit feedback, is known to have complexity in the order of the dimension of the problem. The exponential weighted strategy achieves the best known regret bound that is of the order of $d^{2}\sqrt{n}$ (where $d$ is the dimension of the problem, $n$ is the time horizon). However, such strategies are provably subo… ▽ More

    Submitted 30 September, 2015; v1 submitted 20 November, 2013; originally announced November 2013.