Skip to main content

Showing 1–8 of 8 results for author: Habel, K

Searching in archive cs. Search in all archives.
.
  1. arXiv:2509.08550  [pdf, ps, other

    cs.CV

    ViewSparsifier: Killing Redundancy in Multi-View Plant Phenotyping

    Authors: Robin-Nico Kampa, Fabian Deuser, Konrad Habel, Norbert Oswald

    Abstract: Plant phenotyping involves analyzing observable characteristics of plants to better understand their growth, health, and development. In the context of deep learning, this analysis is often approached through single-view classification or regression models. However, these methods often fail to capture all information required for accurate estimation of target phenotypic traits, which can adversely… ▽ More

    Submitted 10 September, 2025; originally announced September 2025.

  2. arXiv:2508.19182  [pdf, ps, other

    cs.CV

    SoccerNet 2025 Challenges Results

    Authors: Silvio Giancola, Anthony Cioppa, Marc Gutiérrez-Pérez, Jan Held, Carlos Hinojosa, Victor Joos, Arnaud Leduc, Floriane Magera, Karen Sanchez, Vladimir Somers, Artur Xarles, Antonio Agudo, Alexandre Alahi, Olivier Barnich, Albert Clapés, Christophe De Vleeschouwer, Sergio Escalera, Bernard Ghanem, Thomas B. Moeslund, Marc Van Droogenbroeck, Tomoki Abe, Saad Alotaibi, Faisal Altawijri, Steven Araujo, Xiang Bai , et al. (93 additional authors not shown)

    Abstract: The SoccerNet 2025 Challenges mark the fifth annual edition of the SoccerNet open benchmarking effort, dedicated to advancing computer vision research in football video understanding. This year's challenges span four vision-based tasks: (1) Team Ball Action Spotting, focused on detecting ball-related actions in football broadcasts and assigning actions to teams; (2) Monocular Depth Estimation, tar… ▽ More

    Submitted 26 August, 2025; originally announced August 2025.

  3. arXiv:2409.10587  [pdf, other

    cs.CV

    SoccerNet 2024 Challenges Results

    Authors: Anthony Cioppa, Silvio Giancola, Vladimir Somers, Victor Joos, Floriane Magera, Jan Held, Seyed Abolfazl Ghasemzadeh, Xin Zhou, Karolina Seweryn, Mateusz Kowalczyk, Zuzanna Mróz, Szymon Łukasik, Michał Hałoń, Hassan Mkhallati, Adrien Deliège, Carlos Hinojosa, Karen Sanchez, Amir M. Mansourian, Pierre Miralles, Olivier Barnich, Christophe De Vleeschouwer, Alexandre Alahi, Bernard Ghanem, Marc Van Droogenbroeck, Adam Gorski , et al. (59 additional authors not shown)

    Abstract: The SoccerNet 2024 challenges represent the fourth annual video understanding challenges organized by the SoccerNet team. These challenges aim to advance research across multiple themes in football, including broadcast video understanding, field understanding, and player understanding. This year, the challenges encompass four vision-based tasks. (1) Ball Action Spotting, focusing on precisely loca… ▽ More

    Submitted 16 September, 2024; originally announced September 2024.

    Comments: 7 pages, 1 figure

  4. SoccerNet 2023 Challenges Results

    Authors: Anthony Cioppa, Silvio Giancola, Vladimir Somers, Floriane Magera, Xin Zhou, Hassan Mkhallati, Adrien Deliège, Jan Held, Carlos Hinojosa, Amir M. Mansourian, Pierre Miralles, Olivier Barnich, Christophe De Vleeschouwer, Alexandre Alahi, Bernard Ghanem, Marc Van Droogenbroeck, Abdullah Kamal, Adrien Maglo, Albert Clapés, Amr Abdelaziz, Artur Xarles, Astrid Orcesi, Atom Scott, Bin Liu, Byoungkwon Lim , et al. (77 additional authors not shown)

    Abstract: The SoccerNet 2023 challenges were the third annual video understanding challenges organized by the SoccerNet team. For this third edition, the challenges were composed of seven vision-based tasks split into three main themes. The first theme, broadcast video understanding, is composed of three high-level tasks related to describing events occurring in the video broadcasts: (1) action spotting, fo… ▽ More

    Submitted 12 September, 2023; originally announced September 2023.

  5. arXiv:2308.00982  [pdf, other

    cs.CV

    Orientation-Guided Contrastive Learning for UAV-View Geo-Localisation

    Authors: Fabian Deuser, Konrad Habel, Martin Werner, Norbert Oswald

    Abstract: Retrieving relevant multimedia content is one of the main problems in a world that is increasingly data-driven. With the proliferation of drones, high quality aerial footage is now available to a wide audience for the first time. Integrating this footage into applications can enable GPS-less geo-localisation or location correction. In this paper, we present an orientation-guided training framewo… ▽ More

    Submitted 2 August, 2023; originally announced August 2023.

  6. CLIP-ReIdent: Contrastive Training for Player Re-Identification

    Authors: Konrad Habel, Fabian Deuser, Norbert Oswald

    Abstract: Sports analytics benefits from recent advances in machine learning providing a competitive advantage for teams or individuals. One important task in this context is the performance measurement of individual players to provide reports and log files for subsequent analysis. During sport events like basketball, this involves the re-identification of players during a match either from multiple camera… ▽ More

    Submitted 21 March, 2023; originally announced March 2023.

  7. arXiv:2303.11851  [pdf, other

    cs.CV

    Sample4Geo: Hard Negative Sampling For Cross-View Geo-Localisation

    Authors: Fabian Deuser, Konrad Habel, Norbert Oswald

    Abstract: Cross-View Geo-Localisation is still a challenging task where additional modules, specific pre-processing or zooming strategies are necessary to determine accurate positions of images. Since different views have different geometries, pre-processing like polar transformation helps to merge them. However, this results in distorted images which then have to be rectified. Adding hard negatives to the… ▽ More

    Submitted 29 August, 2023; v1 submitted 21 March, 2023; originally announced March 2023.

  8. arXiv:2206.05281  [pdf, other

    cs.CV cs.CL cs.LG

    Less Is More: Linear Layers on CLIP Features as Powerful VizWiz Model

    Authors: Fabian Deuser, Konrad Habel, Philipp J. Rösch, Norbert Oswald

    Abstract: Current architectures for multi-modality tasks such as visual question answering suffer from their high complexity. As a result, these architectures are difficult to train and require high computational resources. To address these problems we present a CLIP-based architecture that does not require any fine-tuning of the feature extractors. A simple linear classifier is used on the concatenated fea… ▽ More

    Submitted 10 June, 2022; originally announced June 2022.

    Comments: VizWiz Grand Challenge: Describing Images and Videos Taken by Blind People (CVPR Workshop 2022)