Showing 1–6 of 6 results for author: Pantazis, O

  1. arXiv:2411.02537  [pdf, other]

    cs.CV cs.AI cs.CL cs.IR

    INQUIRE: A Natural World Text-to-Image Retrieval Benchmark

    Authors: Edward Vendrow, Omiros Pantazis, Alexander Shepard, Gabriel Brostow, Kate E. Jones, Oisin Mac Aodha, Sara Beery, Grant Van Horn

    Abstract: We introduce INQUIRE, a text-to-image retrieval benchmark designed to challenge multimodal vision-language models on expert-level queries. INQUIRE includes iNaturalist 2024 (iNat24), a new dataset of five million natural world images, along with 250 expert-level retrieval queries. These queries are paired with all relevant images comprehensively labeled within iNat24, comprising 33,000 total match…

    Submitted 11 November, 2024; v1 submitted 4 November, 2024; originally announced November 2024.

    Comments: Published in NeurIPS 2024, Datasets and Benchmarks Track

  2. arXiv:2408.14348  [pdf]

    cs.CV

    Deep learning-based ecological analysis of camera trap images is impacted by training data quality and quantity

    Authors: Peggy A. Bevan, Omiros Pantazis, Holly Pringle, Guilherme Braga Ferreira, Daniel J. Ingram, Emily Madsen, Liam Thomas, Dol Raj Thanet, Thakur Silwal, Santosh Rayamajhi, Gabriel Brostow, Oisin Mac Aodha, Kate E. Jones

    Abstract: Large image collections generated from camera traps offer valuable insights into species richness, occupancy, and activity patterns, significantly aiding biodiversity monitoring. However, the manual processing of these datasets is time-consuming, hindering analytical processes. To address this, deep neural networks have been adopted to automate image labelling, but the impact of classification err…

    Submitted 7 May, 2025; v1 submitted 26 August, 2024; originally announced August 2024.

    Comments: *equally contributing authors

  3. arXiv:2210.03794  [pdf, other]

    cs.CV

    SVL-Adapter: Self-Supervised Adapter for Vision-Language Pretrained Models

    Authors: Omiros Pantazis, Gabriel Brostow, Kate Jones, Oisin Mac Aodha

    Abstract: Vision-language models such as CLIP are pretrained on large volumes of internet sourced image and text pairs, and have been shown to sometimes exhibit impressive zero- and low-shot image classification performance. However, due to their size, fine-tuning these models on new datasets can be prohibitively expensive, both in terms of the supervision and compute required. To combat this, a series of l…

    Submitted 7 October, 2022; originally announced October 2022.

    Comments: BMVC 2022

  4. arXiv:2208.07654  [pdf, other]

    cs.CV

    Matching Multiple Perspectives for Efficient Representation Learning

    Authors: Omiros Pantazis, Mathew Salvaris

    Abstract: Representation learning approaches typically rely on images of objects captured from a single perspective that are transformed using affine transformations. Additionally, self-supervised learning, a successful paradigm of representation learning, relies on instance discrimination and self-augmentations which cannot always bridge the gap between observations of the same object viewed from a differe…

    Submitted 16 August, 2022; originally announced August 2022.

    Comments: ECCVW 2022

  5. arXiv:2108.06435  [pdf, other]

    cs.CV cs.LG

    Focus on the Positives: Self-Supervised Learning for Biodiversity Monitoring

    Authors: Omiros Pantazis, Gabriel Brostow, Kate Jones, Oisin Mac Aodha

    Abstract: We address the problem of learning self-supervised representations from unlabeled image collections. Unlike existing approaches that attempt to learn useful features by maximizing similarity between augmented versions of each input image or by speculatively picking negative samples, we instead also make use of the natural variation that occurs in image collections that are captured using static mo…

    Submitted 13 August, 2021; originally announced August 2021.

    Comments: ICCV 2021

  6. arXiv:1910.13872  [pdf, other]

    cs.HC

    The Game Performance Index for Mobile Phones

    Authors: Hesham Dar, James Kwan, Yang Liu, Omiros Pantazis, Robert Sharp

    Abstract: With the recent increase in the quantity of high fidelity games appearing on mobile devices and the recent trend of gaming focused mobile devices, there is a new requirement for a clear and comprehensive measure of the quality of gaming performance on the mobile device platform. This paper proposes a conceptual framework for a user-experience and user-perception based set of performance measures f…

    Submitted 30 October, 2019; originally announced October 2019.

    Comments: 7 pages, 2 figures