Skip to main content

Showing 1–22 of 22 results for author: van Noord, N

Searching in archive cs. Search in all archives.
.
  1. Artifacts of Idiosyncracy in Global Street View Data

    Authors: Tim Alpherts, Sennay Ghebreab, Nanne van Noord

    Abstract: Street view data is increasingly being used in computer vision applications in recent years. Machine learning datasets are collected for these applications using simple sampling techniques. These datasets are assumed to be a systematic representation of cities, especially when densely sampled. Prior works however, show that there are clear gaps in coverage, with certain cities or regions being cov… ▽ More

    Submitted 16 May, 2025; originally announced May 2025.

    Comments: Published at FAccT '25

  2. arXiv:2503.17716  [pdf, other

    cs.CV

    EMPLACE: Self-Supervised Urban Scene Change Detection

    Authors: Tim Alpherts, Sennay Ghebreab, Nanne van Noord

    Abstract: Urban change is a constant process that influences the perception of neighbourhoods and the lives of the people within them. The field of Urban Scene Change Detection (USCD) aims to capture changes in street scenes using computer vision and can help raise awareness of changes that make it possible to better understand the city and its residents. Traditionally, the field of USCD has used supervised… ▽ More

    Submitted 22 March, 2025; originally announced March 2025.

    Comments: 7 pages, 7 figures, published at AAAI 2025

  3. arXiv:2410.12379  [pdf, other

    cs.CV

    Stylistic Multi-Task Analysis of Ukiyo-e Woodblock Prints

    Authors: Selina Khan, Nanne van Noord

    Abstract: In this work we present a large-scale dataset of \textit{Ukiyo-e} woodblock prints. Unlike previous works and datasets in the artistic domain that primarily focus on western art, this paper explores this pre-modern Japanese art form with the aim of broadening the scope for stylistic analysis and to provide a benchmark to evaluate a variety of art focused Computer Vision approaches. Our dataset con… ▽ More

    Submitted 16 October, 2024; originally announced October 2024.

  4. arXiv:2410.12369  [pdf, other

    cs.CV

    Context-Infused Visual Grounding for Art

    Authors: Selina Khan, Nanne van Noord

    Abstract: Many artwork collections contain textual attributes that provide rich and contextualised descriptions of artworks. Visual grounding offers the potential for localising subjects within these descriptions on images, however, existing approaches are trained on natural images and generalise poorly to art. In this paper, we present CIGAr (Context-Infused GroundingDINO for Art), a visual grounding appro… ▽ More

    Submitted 16 October, 2024; originally announced October 2024.

  5. arXiv:2410.10034  [pdf, other

    cs.CV

    TULIP: Token-length Upgraded CLIP

    Authors: Ivona Najdenkoska, Mohammad Mahdi Derakhshani, Yuki M. Asano, Nanne van Noord, Marcel Worring, Cees G. M. Snoek

    Abstract: We address the challenge of representing long captions in vision-language models, such as CLIP. By design these models are limited by fixed, absolute positional encodings, restricting inputs to a maximum of 77 tokens and hindering performance on tasks requiring longer descriptions. Although recent work has attempted to overcome this limit, their proposed approaches struggle to model token relation… ▽ More

    Submitted 28 March, 2025; v1 submitted 13 October, 2024; originally announced October 2024.

  6. arXiv:2404.06486  [pdf, other

    cs.LG cs.CV

    GO4Align: Group Optimization for Multi-Task Alignment

    Authors: Jiayi Shen, Cheems Wang, Zehao Xiao, Nanne Van Noord, Marcel Worring

    Abstract: This paper proposes \textit{GO4Align}, a multi-task optimization approach that tackles task imbalance by explicitly aligning the optimization across tasks. To achieve this, we design an adaptive group risk minimization strategy, comprising two techniques in implementation: (i) dynamical group assignment, which clusters similar tasks based on task interactions; (ii) risk-guided group indicators, wh… ▽ More

    Submitted 29 October, 2024; v1 submitted 9 April, 2024; originally announced April 2024.

  7. Find the Cliffhanger: Multi-Modal Trailerness in Soap Operas

    Authors: Carlo Bretti, Pascal Mettes, Hendrik Vincent Koops, Daan Odijk, Nanne van Noord

    Abstract: Creating a trailer requires carefully picking out and piecing together brief enticing moments out of a longer video, making it a challenging and time-consuming task. This requires selecting moments based on both visual and dialogue information. We introduce a multi-modal method for predicting the trailerness to assist editors in selecting trailer-worthy moments from long-form videos. We present re… ▽ More

    Submitted 29 January, 2024; originally announced January 2024.

    Comments: MMM24

  8. arXiv:2310.06633  [pdf, other

    cs.CV cs.CY

    Blind Dates: Examining the Expression of Temporality in Historical Photographs

    Authors: Alexandra Barancová, Melvin Wevers, Nanne van Noord

    Abstract: This paper explores the capacity of computer vision models to discern temporal information in visual content, focusing specifically on historical photographs. We investigate the dating of images using OpenCLIP, an open-source implementation of CLIP, a multi-modal language and vision model. Our experiment consists of three steps: zero-shot classification, fine-tuning, and analysis of visual content… ▽ More

    Submitted 10 October, 2023; originally announced October 2023.

  9. arXiv:2309.02401  [pdf, other

    cs.CV cs.MM

    Prototype-based Dataset Comparison

    Authors: Nanne van Noord

    Abstract: Dataset summarisation is a fruitful approach to dataset inspection. However, when applied to a single dataset the discovery of visual concepts is restricted to those most prominent. We argue that a comparative approach can expand upon this paradigm to enable richer forms of dataset inspection that go beyond the most prominent concepts. To enable dataset comparison we present a module that learns c… ▽ More

    Submitted 5 September, 2023; originally announced September 2023.

    Comments: To be presented at ICCV 2023

  10. arXiv:2301.00436  [pdf, other

    cs.CV cs.AI cs.LG

    Hierarchical Explanations for Video Action Recognition

    Authors: Sadaf Gulshad, Teng Long, Nanne van Noord

    Abstract: To interpret deep neural networks, one main approach is to dissect the visual input and find the prototypical parts responsible for the classification. However, existing methods often ignore the hierarchical relationship between these prototypes, and thus can not explain semantic concepts at both higher level (e.g., water sports) and lower level (e.g., swimming). In this paper inspired by human co… ▽ More

    Submitted 3 April, 2023; v1 submitted 1 January, 2023; originally announced January 2023.

  11. arXiv:2211.07460  [pdf, ps, other

    cs.CY cs.AI

    An Analytics of Culture: Modeling Subjectivity, Scalability, Contextuality, and Temporality

    Authors: Nanne van Noord, Melvin Wevers, Tobias Blanke, Julia Noordegraaf, Marcel Worring

    Abstract: There is a bidirectional relationship between culture and AI; AI models are increasingly used to analyse culture, thereby shaping our understanding of culture. On the other hand, the models are trained on collections of cultural artifacts thereby implicitly, and not always correctly, encoding expressions of culture. This creates a tension that both limits the use of AI for analysing culture and le… ▽ More

    Submitted 14 November, 2022; originally announced November 2022.

    Comments: To be presented at Cultures in AI/AI in Culture workshop at NeurIPS 2022

  12. arXiv:2203.05898  [pdf, other

    cs.CV

    Hyperbolic Image Segmentation

    Authors: Mina GhadimiAtigh, Julian Schoep, Erman Acar, Nanne van Noord, Pascal Mettes

    Abstract: For image segmentation, the current standard is to perform pixel-level optimization and inference in Euclidean output embedding spaces through linear hyperplanes. In this work, we show that hyperbolic manifolds provide a valuable alternative for image segmentation and propose a tractable formulation of hierarchical pixel-level classification in hyperbolic space. Hyperbolic Image Segmentation opens… ▽ More

    Submitted 11 March, 2022; originally announced March 2022.

    Comments: accepted to CVPR 2022

  13. arXiv:2202.01747  [pdf, other

    cs.CV

    The Met Dataset: Instance-level Recognition for Artworks

    Authors: Nikolaos-Antonios Ypsilantis, Noa Garcia, Guangxing Han, Sarah Ibrahimi, Nanne Van Noord, Giorgos Tolias

    Abstract: This work introduces a dataset for large-scale instance-level recognition in the domain of artworks. The proposed benchmark exhibits a number of different challenges such as large inter-class similarity, long tail distribution, and many classes. We rely on the open access collection of The Met museum to form a large training set of about 224k classes, where each class corresponds to a museum exhib… ▽ More

    Submitted 3 February, 2022; originally announced February 2022.

  14. arXiv:2112.11294  [pdf, other

    cs.IR cs.LG cs.MM

    Extending CLIP for Category-to-image Retrieval in E-commerce

    Authors: Mariya Hendriksen, Maurits Bleeker, Svitlana Vakulenko, Nanne van Noord, Ernst Kuiper, Maarten de Rijke

    Abstract: E-commerce provides rich multimodal data that is barely leveraged in practice. One aspect of this data is a category tree that is being used in search and recommendation. However, in practice, during a user's session there is often a mismatch between a textual and a visual representation of a given category. Motivated by the problem, we introduce the task of category-to-image retrieval in e-commer… ▽ More

    Submitted 4 January, 2022; v1 submitted 21 December, 2021; originally announced December 2021.

    Comments: 15 pages, accepted as a full paper at ECIR 2022

  15. arXiv:2111.13546  [pdf, other

    cs.CV

    Inside Out Visual Place Recognition

    Authors: Sarah Ibrahimi, Nanne van Noord, Tim Alpherts, Marcel Worring

    Abstract: Visual Place Recognition (VPR) is generally concerned with localizing outdoor images. However, localizing indoor scenes that contain part of an outdoor scene can be of large value for a wide range of applications. In this paper, we introduce Inside Out Visual Place Recognition (IOVPR), a task aiming to localize images based on outdoor scenes visible through windows. For this task we present the ne… ▽ More

    Submitted 26 November, 2021; originally announced November 2021.

    Comments: Accepted at British Machine Vision Conference (BMVC) 2021

  16. arXiv:1909.01218  [pdf, other

    cs.CV cs.HC cs.LG cs.SD eess.AS

    Translating Visual Art into Music

    Authors: Maximilian Müller-Eberstein, Nanne van Noord

    Abstract: The Synesthetic Variational Autoencoder (SynVAE) introduced in this research is able to learn a consistent mapping between visual and auditive sensory modalities in the absence of paired datasets. A quantitative evaluation on MNIST as well as the Behance Artistic Media dataset (BAM) shows that SynVAE is capable of retaining sufficient information content during the translation while maintaining cr… ▽ More

    Submitted 3 September, 2019; originally announced September 2019.

    Comments: Accepted for ICCV 2019 Workshop on Fashion, Art and Design

  17. arXiv:1908.02711  [pdf, other

    cs.CV

    I Bet You Are Wrong: Gambling Adversarial Networks for Structured Semantic Segmentation

    Authors: Laurens Samson, Nanne van Noord, Olaf Booij, Michael Hofmann, Efstratios Gavves, Mohsen Ghafoorian

    Abstract: Adversarial training has been recently employed for realizing structured semantic segmentation, in which the aim is to preserve higher-level scene structural consistencies in dense predictions. However, as we show, value-based discrimination between the predictions from the segmentation network and ground-truth annotations can hinder the training process from learning to improve structural qualiti… ▽ More

    Submitted 7 August, 2019; originally announced August 2019.

    Comments: 13 pages, 8 figures

  18. arXiv:1904.03011  [pdf, other

    cs.CV

    Learning Task Relatedness in Multi-Task Learning for Images in Context

    Authors: Gjorgji Strezoski, Nanne van Noord, Marcel Worring

    Abstract: Multimedia applications often require concurrent solutions to multiple tasks. These tasks hold clues to each-others solutions, however as these relations can be complex this remains a rarely utilized property. When task relations are explicitly defined based on domain knowledge multi-task learning (MTL) offers such concurrent solutions, while exploiting relatedness between multiple tasks performed… ▽ More

    Submitted 5 April, 2019; originally announced April 2019.

    Comments: To appear in ICMR 2019 (Oral + Lightning Talk + Poster)

  19. arXiv:1903.12117  [pdf, other

    cs.CV

    Many Task Learning with Task Routing

    Authors: Gjorgji Strezoski, Nanne van Noord, Marcel Worring

    Abstract: Typical multi-task learning (MTL) methods rely on architectural adjustments and a large trainable parameter set to jointly optimize over several tasks. However, when the number of tasks increases so do the complexity of the architectural adjustments and resource requirements. In this paper, we introduce a method which applies a conditional feature-wise transformation over the convolutional activat… ▽ More

    Submitted 28 March, 2019; originally announced March 2019.

    Comments: 8 Pages, 5 Figures, 2 Tables

  20. arXiv:1801.05585  [pdf, other

    cs.CV

    Light-weight pixel context encoders for image inpainting

    Authors: Nanne van Noord, Eric Postma

    Abstract: In this work we propose Pixel Content Encoders (PCE), a light-weight image inpainting model, capable of generating novel con-tent for large missing regions in images. Unlike previously presented convolutional neural network based models, our PCE model has an order of magnitude fewer trainable parameters. Moreover, by incorporating dilated convolutions we are able to preserve fine grained spatial i… ▽ More

    Submitted 17 January, 2018; originally announced January 2018.

  21. arXiv:1602.01255  [pdf, other

    cs.CV

    Learning scale-variant and scale-invariant features for deep image classification

    Authors: Nanne van Noord, Eric Postma

    Abstract: Convolutional Neural Networks (CNNs) require large image corpora to be trained on classification tasks. The variation in image resolutions, sizes of objects and patterns depicted, and image scales, hampers CNN training and performance, because the task-relevant information varies over spatial scales. Previous work attempting to deal with such scale variations focused on encouraging scale-invariant… ▽ More

    Submitted 13 May, 2016; v1 submitted 3 February, 2016; originally announced February 2016.

  22. arXiv:1506.05929  [pdf, other

    cs.CV

    Exploring the influence of scale on artist attribution

    Authors: Nanne van Noord, Eric Postma

    Abstract: Previous work has shown that the artist of an artwork can be identified by use of computational methods that analyse digital images. However, the digitised artworks are often investigated at a coarse scale discarding many of the important details that may define an artist's style. In recent years high resolution images of artworks have become available, which, combined with increased processing po… ▽ More

    Submitted 19 June, 2015; originally announced June 2015.