Showing 1–16 of 16 results for author: Kooij, J F P

  1. arXiv:2505.10827  [pdf, other]

    cs.CV

    NeuSEditor: From Multi-View Images to Text-Guided Neural Surface Edits

    Authors: Nail Ibrahimli, Julian F. P. Kooij, Liangliang Nan

    Abstract: Implicit surface representations are valued for their compactness and continuity, but they pose significant challenges for editing. Despite recent advancements, existing methods often fail to preserve identity and maintain geometric consistency during editing. To address these challenges, we present NeuSEditor, a novel method for text-guided editing of neural implicit surfaces derived from multi-v…

    Submitted 15 May, 2025; originally announced May 2025.

  2. arXiv:2505.04982  [pdf, other]

    cs.RO eess.SY

    A Vehicle System for Navigating Among Vulnerable Road Users Including Remote Operation

    Authors: Oscar de Groot, Alberto Bertipaglia, Hidde Boekema, Vishrut Jain, Marcell Kegl, Varun Kotian, Ted Lentsch, Yancong Lin, Chrysovalanto Messiou, Emma Schippers, Farzam Tajdari, Shiming Wang, Zimin Xia, Mubariz Zaffar, Ronald Ensing, Mario Garzon, Javier Alonso-Mora, Holger Caesar, Laura Ferranti, Riender Happee, Julian F. P. Kooij, Georgios Papaioannou, Barys Shyrokau, Dariu M. Gavrila

    Abstract: We present a vehicle system capable of navigating safely and efficiently around Vulnerable Road Users (VRUs), such as pedestrians and cyclists. The system comprises key modules for environment perception, localization and mapping, motion planning, and control, integrated into a prototype vehicle. A key innovation is a motion planner based on Topology-driven Model Predictive Control (T-MPC). The gu…

    Submitted 8 May, 2025; originally announced May 2025.

    Comments: Intelligent Vehicles Symposium 2025

  3. arXiv:2505.04950  [pdf, ps, other]

    cs.AI

    Epistemic Artificial Intelligence is Essential for Machine Learning Models to Truly 'Know When They Do Not Know'

    Authors: Shireen Kudukkil Manchingal, Andrew Bradley, Julian F. P. Kooij, Keivan Shariatmadar, Neil Yorke-Smith, Fabio Cuzzolin

    Abstract: Despite AI's impressive achievements, including recent advances in generative and large language models, there remains a significant gap in the ability of AI systems to handle uncertainty and generalize beyond their training data. AI models consistently fail to make robust enough predictions when facing unfamiliar or adversarial data. Traditional machine learning approaches struggle to address thi…

    Submitted 27 June, 2025; v1 submitted 8 May, 2025; originally announced May 2025.

  4. arXiv:2406.00474  [pdf, other]

    cs.CV

    Adapting Fine-Grained Cross-View Localization to Areas without Fine Ground Truth

    Authors: Zimin Xia, Yujiao Shi, Hongdong Li, Julian F. P. Kooij

    Abstract: Given a ground-level query image and a geo-referenced aerial image that covers the query's local surroundings, fine-grained cross-view localization aims to estimate the location of the ground camera inside the aerial image. Recent works have focused on developing advanced networks trained with accurate ground truth (GT) locations of ground images. However, the trained models always suffer a perfor…

    Submitted 1 June, 2024; originally announced June 2024.

  5. arXiv:2404.00546  [pdf, other]

    cs.CV

    On the Estimation of Image-matching Uncertainty in Visual Place Recognition

    Authors: Mubariz Zaffar, Liangliang Nan, Julian F. P. Kooij

    Abstract: In Visual Place Recognition (VPR) the pose of a query image is estimated by comparing the image to a map of reference images with known reference poses. As is typical for image retrieval problems, a feature extractor maps the query and reference images to a feature space, where a nearest neighbor search is then performed. However, till recently little attention has been given to quantifying the co…

    Submitted 30 March, 2024; originally announced April 2024.

    Comments: To appear in the proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) 2024

  6. arXiv:2312.05046  [pdf, other]

    cs.CV

    MuVieCAST: Multi-View Consistent Artistic Style Transfer

    Authors: Nail Ibrahimli, Julian F. P. Kooij, Liangliang Nan

    Abstract: We introduce MuVieCAST, a modular multi-view consistent style transfer network architecture that enables consistent style transfer between multiple viewpoints of the same scene. This network architecture supports both sparse and dense views, making it versatile enough to handle a wide range of multi-view image datasets. The approach consists of three modules that perform specific tasks related to…

    Submitted 8 December, 2023; originally announced December 2023.

  7. arXiv:2309.14516  [pdf, other]

    cs.CV cs.RO

    UniBEV: Multi-modal 3D Object Detection with Uniform BEV Encoders for Robustness against Missing Sensor Modalities

    Authors: Shiming Wang, Holger Caesar, Liangliang Nan, Julian F. P. Kooij

    Abstract: Multi-sensor object detection is an active research topic in automated driving, but the robustness of such detection models against missing sensor input (modality missing), e.g., due to a sudden sensor failure, is a critical problem which remains under-studied. In this work, we propose UniBEV, an end-to-end multi-modal 3D object detection framework designed for robustness against missing modalitie…

    Submitted 8 May, 2024; v1 submitted 25 September, 2023; originally announced September 2023.

    Comments: Accepted by IEEE Intelligent Vehicles Symposium (IV 2024), camera-ready. Code: https://github.com/tudelft-iv/UniBEV

  8. arXiv:2305.05318  [pdf, other]

    cs.LG

    How Informative is the Approximation Error from Tensor Decomposition for Neural Network Compression?

    Authors: Jetze T. Schuurmans, Kim Batselier, Julian F. P. Kooij

    Abstract: Tensor decompositions have been successfully applied to compress neural networks. The compression algorithms using tensor decompositions commonly minimize the approximation error on the weights. Recent work assumes the approximation error on the weights is a proxy for the performance of the model to compress multiple layers and fine-tune the compressed model. Surprisingly, little research has syst…

    Submitted 4 August, 2023; v1 submitted 9 May, 2023; originally announced May 2023.

    Comments: Published as a conference paper at ICLR 2023. Appendix A.5 was added after the conference

  9. CoPR: Towards Accurate Visual Localization With Continuous Place-descriptor Regression

    Authors: Mubariz Zaffar, Liangliang Nan, Julian Francisco Pieter Kooij

    Abstract: Visual Place Recognition (VPR) is an image-based localization method that estimates the camera location of a query image by retrieving the most similar reference image from a map of geo-tagged reference images. In this work, we look into two fundamental bottlenecks for its localization accuracy: reference map sparseness and viewpoint invariance. Firstly, the reference images for VPR are only avail…

    Submitted 14 April, 2023; originally announced April 2023.

    Comments: Published in the IEEE Transactions on Robotics, April 2023

  10. arXiv:2303.05915  [pdf, other]

    cs.CV

    Convolutional Cross-View Pose Estimation

    Authors: Zimin Xia, Olaf Booij, Julian F. P. Kooij

    Abstract: We propose a novel end-to-end method for cross-view pose estimation. Given a ground-level query image and an aerial image that covers the query's local neighborhood, the 3 Degrees-of-Freedom camera pose of the query is estimated by matching its image descriptor to descriptors of local regions within the aerial image. The orientation-aware descriptors are obtained by using a translationally equivar…

    Submitted 22 December, 2023; v1 submitted 9 March, 2023; originally announced March 2023.

  11. arXiv:2211.14651  [pdf, other]

    cs.CV

    SliceMatch: Geometry-guided Aggregation for Cross-View Pose Estimation

    Authors: Ted Lentsch, Zimin Xia, Holger Caesar, Julian F. P. Kooij

    Abstract: This work addresses cross-view camera pose estimation, i.e., determining the 3-Degrees-of-Freedom camera pose of a given ground-level image w.r.t. an aerial image of the local area. We propose SliceMatch, which consists of ground and aerial feature extractors, feature aggregators, and a pose predictor. The feature extractors extract dense features from the ground and aerial images. Given a set of…

    Submitted 28 March, 2023; v1 submitted 26 November, 2022; originally announced November 2022.

  12. arXiv:2211.13309  [pdf, other]

    cs.CV cs.LG

    How do Cross-View and Cross-Modal Alignment Affect Representations in Contrastive Learning?

    Authors: Thomas M. Hehn, Julian F. P. Kooij, Dariu M. Gavrila

    Abstract: Various state-of-the-art self-supervised visual representation learning approaches take advantage of data from multiple sensors by aligning the feature representations across views and/or modalities. In this work, we investigate how aligning representations affects the visual features obtained from cross-view and cross-modal contrastive learning on images and point clouds. On five real-world datas…

    Submitted 23 November, 2022; originally announced November 2022.

  13. arXiv:2208.08519  [pdf, other]

    cs.CV

    Visual Cross-View Metric Localization with Dense Uncertainty Estimates

    Authors: Zimin Xia, Olaf Booij, Marco Manfredi, Julian F. P. Kooij

    Abstract: This work addresses visual cross-view metric localization for outdoor robotics. Given a ground-level color image and a satellite patch that contains the local surroundings, the task is to identify the location of the ground camera within the satellite patch. Related work addressed this task for range-sensors (LiDAR, Radar), but for vision, only as a secondary regression step after an initial cross…

    Submitted 17 August, 2022; originally announced August 2022.

    Comments: ECCV 2022

  14. arXiv:2007.15739  [pdf, other]

    cs.RO cs.LG cs.SD eess.AS

    Hearing What You Cannot See: Acoustic Vehicle Detection Around Corners

    Authors: Yannick Schulz, Avinash Kini Mattar, Thomas M. Hehn, Julian F. P. Kooij

    Abstract: This work proposes to use passive acoustic perception as an additional sensing modality for intelligent vehicles. We demonstrate that approaching vehicles behind blind corners can be detected by sound before such vehicles enter in line-of-sight. We have equipped a research vehicle with a roof-mounted microphone array, and show on data collected with this sensor setup that wall reflections provide…

    Submitted 25 February, 2021; v1 submitted 30 July, 2020; originally announced July 2020.

    Comments: Accepted to IEEE Robotics & Automation Letters (2021), DOI: 10.1109/LRA.2021.3062254. Code, Data & Video: https://github.com/tudelft-iv/occluded_vehicle_acoustic_detection

  15. CNN based Road User Detection using the 3D Radar Cube

    Authors: Andras Palffy, Jiaao Dong, Julian F. P. Kooij, Dariu M. Gavrila

    Abstract: This letter presents a novel radar based, single-frame, multi-class detection method for moving road users (pedestrian, cyclist, car), which utilizes low-level radar cube data. The method provides class information both on the radar target- and object-level. Radar targets are classified individually after extending the target features with a cropped block of the 3D radar cube around their position…

    Submitted 16 July, 2020; v1 submitted 25 April, 2020; originally announced April 2020.

    Journal ref: IEEE Robotics and Automation Letters (RAL), vol. 5, nr. 2, pp. 1263-1270, 2020

  16. arXiv:1810.11641  [pdf, other]

    cs.CV eess.IV

    Cross-Modal Distillation for RGB-Depth Person Re-Identification

    Authors: Frank Hafner, Amran Bhuiyan, Julian F. P. Kooij, Eric Granger

    Abstract: Person re-identification is a key challenge for surveillance across multiple sensors. Prompted by the advent of powerful deep learning models for visual recognition, and inexpensive RGB-D cameras and sensor-rich mobile robotic platforms, e.g. self-driving vehicles, we investigate the relatively unexplored problem of cross-modal re-identification of persons between RGB (color) and depth images. The…

    Submitted 12 February, 2022; v1 submitted 27 October, 2018; originally announced October 2018.

    Journal ref: Computer Vision and Image Understanding, 103352 (2022)