Skip to main content

Showing 1–30 of 30 results for author: Gehrig, D

Searching in archive cs. Search in all archives.
.
  1. arXiv:2505.09780  [pdf, ps, other

    cs.RO

    Neural Inertial Odometry from Lie Events

    Authors: Royina Karegoudra Jayanth, Yinshuang Xu, Evangelos Chatzipantazis, Kostas Daniilidis, Daniel Gehrig

    Abstract: Neural displacement priors (NDP) can reduce the drift in inertial odometry and provide uncertainty estimates that can be readily fused with off-the-shelf filters. However, they fail to generalize to different IMU sampling rates and trajectory profiles, which limits their robustness in diverse settings. To address this challenge, we replace the traditional NDP inputs comprising raw IMU data with Li… ▽ More

    Submitted 14 May, 2025; originally announced May 2025.

    Comments: accepted at RSS 2025

  2. arXiv:2412.00133  [pdf, other

    cs.CV cs.LG cs.RO

    ETAP: Event-based Tracking of Any Point

    Authors: Friedhelm Hamann, Daniel Gehrig, Filbert Febryanto, Kostas Daniilidis, Guillermo Gallego

    Abstract: Tracking any point (TAP) recently shifted the motion estimation paradigm from focusing on individual salient points with local templates to tracking arbitrary points with global image contexts. However, while research has mostly focused on driving the accuracy of models in nominal settings, addressing scenarios with difficult lighting conditions and high-speed motions remains out of reach due to t… ▽ More

    Submitted 24 March, 2025; v1 submitted 28 November, 2024; originally announced December 2024.

    Comments: 17 pages, 15 figures, 8 tables. Project page: https://github.com/tub-rip/ETAP

    Journal ref: IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Nashville, 2025

  3. arXiv:2408.06321  [pdf, other

    cs.RO cs.CV

    EqNIO: Subequivariant Neural Inertial Odometry

    Authors: Royina Karegoudra Jayanth, Yinshuang Xu, Ziyun Wang, Evangelos Chatzipantazis, Daniel Gehrig, Kostas Daniilidis

    Abstract: Neural networks are seeing rapid adoption in purely inertial odometry, where accelerometer and gyroscope measurements from commodity inertial measurement units (IMU) are used to regress displacements and associated uncertainties. They can learn informative displacement priors, which can be directly fused with the raw data with off-the-shelf non-linear filters. Nevertheless, these networks do not c… ▽ More

    Submitted 10 October, 2024; v1 submitted 12 August, 2024; originally announced August 2024.

    Comments: 27 pages

  4. arXiv:2406.15015  [pdf, other

    cs.DB cs.AI cs.CL

    GraLMatch: Matching Groups of Entities with Graphs and Language Models

    Authors: Fernando De Meer Pardo, Claude Lehmann, Dennis Gehrig, Andrea Nagy, Stefano Nicoli, Branka Hadji Misheva, Martin Braschler, Kurt Stockinger

    Abstract: In this paper, we present an end-to-end multi-source Entity Matching problem, which we call entity group matching, where the goal is to assign to the same group, records originating from multiple data sources but representing the same real-world entity. We focus on the effects of transitively matched records, i.e. the records connected by paths in the graph G = (V,E) whose nodes and edges represen… ▽ More

    Submitted 21 June, 2024; originally announced June 2024.

    Comments: 12 pages, 4 figures, accepted as research paper at EDBT 2025

  5. An N-Point Linear Solver for Line and Motion Estimation with Event Cameras

    Authors: Ling Gao, Daniel Gehrig, Hang Su, Davide Scaramuzza, Laurent Kneip

    Abstract: Event cameras respond primarily to edges--formed by strong gradients--and are thus particularly well-suited for line-based motion estimation. Recent work has shown that events generated by a single line each satisfy a polynomial constraint which describes a manifold in the space-time volume. Multiple such constraints can be solved simultaneously to recover the partial linear velocity and line para… ▽ More

    Submitted 31 March, 2024; originally announced April 2024.

    Journal ref: IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), 2024

  6. A 5-Point Minimal Solver for Event Camera Relative Motion Estimation

    Authors: Ling Gao, Hang Su, Daniel Gehrig, Marco Cannici, Davide Scaramuzza, Laurent Kneip

    Abstract: Event-based cameras are ideal for line-based motion estimation, since they predominantly respond to edges in the scene. However, accurately determining the camera displacement based on events continues to be an open problem. This is because line feature extraction and dynamics estimation are tightly coupled when using event cameras, and no precise model is currently available for describing the co… ▽ More

    Submitted 29 September, 2023; originally announced September 2023.

    Journal ref: IEEE/CVF International Conference on Computer Vision (ICCV), 2023

  7. arXiv:2309.09947  [pdf, other

    cs.CV

    Deep Visual Odometry with Events and Frames

    Authors: Roberto Pellerito, Marco Cannici, Daniel Gehrig, Joris Belhadj, Olivier Dubois-Matra, Massimo Casasco, Davide Scaramuzza

    Abstract: Visual Odometry (VO) is crucial for autonomous robotic navigation, especially in GPS-denied environments like planetary terrains. To improve robustness, recent model-based VO systems have begun combining standard and event-based cameras. While event cameras excel in low-light and high-speed motion, standard cameras provide dense and easier-to-track features. However, the field of image- and event-… ▽ More

    Submitted 10 September, 2024; v1 submitted 18 September, 2023; originally announced September 2023.

    Comments: IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS), 2024

  8. E-Calib: A Fast, Robust and Accurate Calibration Toolbox for Event Cameras

    Authors: Mohammed Salah, Abdulla Ayyad, Muhammad Humais, Daniel Gehrig, Abdelqader Abusafieh, Lakmal Seneviratne, Davide Scaramuzza, Yahya Zweiri

    Abstract: Event cameras triggered a paradigm shift in the computer vision community delineated by their asynchronous nature, low latency, and high dynamic range. Calibration of event cameras is always essential to account for the sensor intrinsic parameters and for 3D perception. However, conventional image-based calibration techniques are not applicable due to the asynchronous, binary output of the sensor.… ▽ More

    Submitted 22 November, 2024; v1 submitted 15 June, 2023; originally announced June 2023.

    Comments: IEEE Transactions on Image Processing

    Journal ref: in IEEE Transactions on Image Processing, vol. 33, pp. 3977-3990, 2024

  9. arXiv:2304.13455  [pdf, other

    cs.CV cs.LG

    From Chaos Comes Order: Ordering Event Representations for Object Recognition and Detection

    Authors: Nikola Zubić, Daniel Gehrig, Mathias Gehrig, Davide Scaramuzza

    Abstract: Today, state-of-the-art deep neural networks that process events first convert them into dense, grid-like input representations before using an off-the-shelf network. However, selecting the appropriate representation for the task traditionally requires training a neural network for each representation and selecting the best one based on the validation score, which is very time-consuming. This work… ▽ More

    Submitted 30 August, 2023; v1 submitted 26 April, 2023; originally announced April 2023.

    Comments: 15 pages, 11 figures, 2 tables, ICCV 2023 Camera Ready paper

  10. arXiv:2303.17479  [pdf, other

    cs.RO

    Event-based Agile Object Catching with a Quadrupedal Robot

    Authors: Benedek Forrai, Takahiro Miki, Daniel Gehrig, Marco Hutter, Davide Scaramuzza

    Abstract: Quadrupedal robots are conquering various indoor and outdoor applications due to their ability to navigate challenging uneven terrains. Exteroceptive information greatly enhances this capability since perceiving their surroundings allows them to adapt their controller and thus achieve higher levels of robustness. However, sensors such as LiDARs and RGB cameras do not provide sufficient information… ▽ More

    Submitted 6 April, 2023; v1 submitted 30 March, 2023; originally announced March 2023.

    Journal ref: IEEE International Conference on Robotics and Automation (ICRA) 2023, London

  11. arXiv:2303.14176  [pdf, other

    cs.CV cs.AI

    A Hybrid ANN-SNN Architecture for Low-Power and Low-Latency Visual Perception

    Authors: Asude Aydin, Mathias Gehrig, Daniel Gehrig, Davide Scaramuzza

    Abstract: Spiking Neural Networks (SNN) are a class of bio-inspired neural networks that promise to bring low-power and low-latency inference to edge devices through asynchronous and sparse processing. However, being temporal models, SNNs depend heavily on expressive states to generate predictions on par with classical artificial neural networks (ANNs). These states converge only after long transient period… ▽ More

    Submitted 17 April, 2024; v1 submitted 24 March, 2023; originally announced March 2023.

    Journal ref: IEEE Conference on Computer Vision and Pattern Recognition Workshops (CVPRW), Seattle, 2024

  12. arXiv:2301.05191  [pdf, other

    cs.CV

    A Unified Framework for Event-based Frame Interpolation with Ad-hoc Deblurring in the Wild

    Authors: Lei Sun, Daniel Gehrig, Christos Sakaridis, Mathias Gehrig, Jingyun Liang, Peng Sun, Zhijie Xu, Kaiwei Wang, Luc Van Gool, Davide Scaramuzza

    Abstract: Effective video frame interpolation hinges on the adept handling of motion in the input scene. Prior work acknowledges asynchronous event information for this, but often overlooks whether motion induces blur in the video, limiting its scope to sharp frame interpolation. We instead propose a unified framework for event-based frame interpolation that performs deblurring ad-hoc and thus works both on… ▽ More

    Submitted 18 February, 2025; v1 submitted 12 January, 2023; originally announced January 2023.

    Comments: Accepted to T-PAMI

  13. arXiv:2211.12324  [pdf, other

    cs.CV

    Pushing the Limits of Asynchronous Graph-based Object Detection with Event Cameras

    Authors: Daniel Gehrig, Davide Scaramuzza

    Abstract: State-of-the-art machine-learning methods for event cameras treat events as dense representations and process them with conventional deep neural networks. Thus, they fail to maintain the sparsity and asynchronous nature of event data, thereby imposing significant computation and latency constraints on downstream systems. A recent line of work tackles this issue by modeling events as spatiotemporal… ▽ More

    Submitted 22 November, 2022; originally announced November 2022.

  14. arXiv:2204.05880  [pdf, other

    cs.CV cs.RO

    Exploring Event Camera-based Odometry for Planetary Robots

    Authors: Florian Mahlknecht, Daniel Gehrig, Jeremy Nash, Friedrich M. Rockenbauer, Benjamin Morrell, Jeff Delaune, Davide Scaramuzza

    Abstract: Due to their resilience to motion blur and high robustness in low-light and high dynamic range conditions, event cameras are poised to become enabling sensors for vision-based exploration on future Mars helicopter missions. However, existing event-based visual-inertial odometry (VIO) algorithms either suffer from high tracking errors or are brittle, since they cannot cope with significant depth un… ▽ More

    Submitted 29 July, 2022; v1 submitted 12 April, 2022; originally announced April 2022.

    Journal ref: IEEE Robotics and Automation Letters (RA-L), 2022

  15. arXiv:2203.17191  [pdf, other

    cs.CV

    Time Lens++: Event-based Frame Interpolation with Parametric Non-linear Flow and Multi-scale Fusion

    Authors: Stepan Tulyakov, Alfredo Bochicchio, Daniel Gehrig, Stamatios Georgoulis, Yuanyou Li, Davide Scaramuzza

    Abstract: Recently, video frame interpolation using a combination of frame- and event-based cameras has surpassed traditional image-based methods both in terms of performance and memory efficiency. However, current methods still suffer from (i) brittle image-level fusion of complementary interpolation results, that fails in the presence of artifacts in the fused image, (ii) potentially temporally inconsiste… ▽ More

    Submitted 25 April, 2022; v1 submitted 31 March, 2022; originally announced March 2022.

    Comments: IEEE Conference on Computer Vision and Pattern Recognition (CVPR), New Orleans, 2022

  16. arXiv:2203.17149  [pdf, other

    cs.CV

    AEGNN: Asynchronous Event-based Graph Neural Networks

    Authors: Simon Schaefer, Daniel Gehrig, Davide Scaramuzza

    Abstract: The best performing learning algorithms devised for event cameras work by first converting events into dense representations that are then processed using standard CNNs. However, these steps discard both the sparsity and high temporal resolution of events, leading to high computational burden and latency. For this reason, recent works have adopted Graph Neural Networks (GNNs), which process events… ▽ More

    Submitted 1 November, 2022; v1 submitted 31 March, 2022; originally announced March 2022.

    Comments: IEEE Conference on Computer Vision and Pattern Recognition (CVPR), New Orleans, 2022

  17. arXiv:2203.14672  [pdf, other

    cs.CV

    Are High-Resolution Event Cameras Really Needed?

    Authors: Daniel Gehrig, Davide Scaramuzza

    Abstract: Due to their outstanding properties in challenging conditions, event cameras have become indispensable in a wide range of applications, ranging from automotive, computational photography, and SLAM. However, as further improvements are made to the sensor design, modern event cameras are trending toward higher and higher sensor resolutions, which result in higher bandwidth and computational requirem… ▽ More

    Submitted 28 March, 2022; originally announced March 2022.

  18. arXiv:2203.10016  [pdf, other

    cs.CV cs.LG

    ESS: Learning Event-based Semantic Segmentation from Still Images

    Authors: Zhaoning Sun, Nico Messikommer, Daniel Gehrig, Davide Scaramuzza

    Abstract: Retrieving accurate semantic information in challenging high dynamic range (HDR) and high-speed conditions remains an open challenge for image-based algorithms due to severe image degradations. Event cameras promise to address these challenges since they feature a much higher dynamic range and are resilient to motion blur. Nonetheless, semantic segmentation with event cameras is still in its infan… ▽ More

    Submitted 2 August, 2022; v1 submitted 18 March, 2022; originally announced March 2022.

    Journal ref: European Conference on Computer Vision (ECCV), 2022

  19. arXiv:2203.06622  [pdf, other

    eess.IV cs.CV

    Multi-Bracket High Dynamic Range Imaging with Event Cameras

    Authors: Nico Messikommer, Stamatios Georgoulis, Daniel Gehrig, Stepan Tulyakov, Julius Erbach, Alfredo Bochicchio, Yuanyou Li, Davide Scaramuzza

    Abstract: Modern high dynamic range (HDR) imaging pipelines align and fuse multiple low dynamic range (LDR) images captured at different exposure times. While these methods work well in static scenes, dynamic scenes remain a challenge since the LDR images still suffer from saturation and noise. In such scenarios, event cameras would be a valid complement, thanks to their higher temporal resolution and dynam… ▽ More

    Submitted 28 April, 2022; v1 submitted 13 March, 2022; originally announced March 2022.

    Journal ref: IEEE Conference on Computer Vision and Pattern Recognition Workshop (CVPRW), New Orleans, 2022

  20. Bridging the Gap between Events and Frames through Unsupervised Domain Adaptation

    Authors: Nico Messikommer, Daniel Gehrig, Mathias Gehrig, Davide Scaramuzza

    Abstract: Reliable perception during fast motion maneuvers or in high dynamic range environments is crucial for robotic systems. Since event cameras are robust to these challenging conditions, they have great potential to increase the reliability of robot vision. However, event-based vision has been held back by the shortage of labeled datasets due to the novelty of event cameras. To overcome this drawback,… ▽ More

    Submitted 3 February, 2022; v1 submitted 6 September, 2021; originally announced September 2021.

    Journal ref: IEEE Robotics and Automation Letters (RA-L), 2022

  21. arXiv:2108.10552  [pdf, other

    cs.CV

    E-RAFT: Dense Optical Flow from Event Cameras

    Authors: Mathias Gehrig, Mario Millhäusler, Daniel Gehrig, Davide Scaramuzza

    Abstract: We propose to incorporate feature correlation and sequential processing into dense optical flow estimation from event cameras. Modern frame-based optical flow methods heavily rely on matching costs computed from feature correlation. In contrast, there exists no optical flow method for event cameras that explicitly computes matching costs. Instead, learning-based approaches using events usually res… ▽ More

    Submitted 21 October, 2021; v1 submitted 24 August, 2021; originally announced August 2021.

    Comments: International Conference on 3D Vision (3DV)

  22. arXiv:2106.07286  [pdf, other

    cs.CV

    TimeLens: Event-based Video Frame Interpolation

    Authors: Stepan Tulyakov, Daniel Gehrig, Stamatios Georgoulis, Julius Erbach, Mathias Gehrig, Yuanyou Li, Davide Scaramuzza

    Abstract: State-of-the-art frame interpolation methods generate intermediate frames by inferring object motions in the image from consecutive key-frames. In the absence of additional information, first-order approximations, i.e. optical flow, must be used, but this choice restricts the types of motions that can be modeled, leading to errors in highly dynamic scenarios. Event cameras are novel sensors that a… ▽ More

    Submitted 14 June, 2021; originally announced June 2021.

    Journal ref: IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2021

  23. arXiv:2105.12362  [pdf, other

    cs.CV

    How to Calibrate Your Event Camera

    Authors: Manasi Muglikar, Mathias Gehrig, Daniel Gehrig, Davide Scaramuzza

    Abstract: We propose a generic event camera calibration framework using image reconstruction. Instead of relying on blinking LED patterns or external screens, we show that neural-network-based image reconstruction is well suited for the task of intrinsic and extrinsic calibration of event cameras. The advantage of our proposed approach is that we can use standard calibration patterns that do not rely on act… ▽ More

    Submitted 26 May, 2021; originally announced May 2021.

    Comments: IEEE Conference on Computer Vision and Pattern Recognition Workshops

  24. arXiv:2103.06011  [pdf, other

    cs.CV cs.RO

    DSEC: A Stereo Event Camera Dataset for Driving Scenarios

    Authors: Mathias Gehrig, Willem Aarents, Daniel Gehrig, Davide Scaramuzza

    Abstract: Once an academic venture, autonomous driving has received unparalleled corporate funding in the last decade. Still, the operating conditions of current autonomous cars are mostly restricted to ideal scenarios. This means that driving in challenging illumination conditions such as night, sunrise, and sunset remains an open problem. In these cases, standard cameras are being pushed to their limits i… ▽ More

    Submitted 10 March, 2021; originally announced March 2021.

    Comments: IEEE Robotics and Automation Letters

  25. arXiv:2102.09320  [pdf, other

    cs.CV

    Combining Events and Frames using Recurrent Asynchronous Multimodal Networks for Monocular Depth Prediction

    Authors: Daniel Gehrig, Michelle Rüegg, Mathias Gehrig, Javier Hidalgo Carrio, Davide Scaramuzza

    Abstract: Event cameras are novel vision sensors that report per-pixel brightness changes as a stream of asynchronous "events". They offer significant advantages compared to standard cameras due to their high temporal resolution, high dynamic range and lack of motion blur. However, events only measure the varying component of the visual signal, which limits their ability to encode scene context. By contrast… ▽ More

    Submitted 18 February, 2021; originally announced February 2021.

    Journal ref: IEEE Robotics and Automation Letters (RA-L), 2021

  26. arXiv:2010.08350  [pdf, other

    cs.CV cs.LG

    Learning Monocular Dense Depth from Events

    Authors: Javier Hidalgo-Carrió, Daniel Gehrig, Davide Scaramuzza

    Abstract: Event cameras are novel sensors that output brightness changes in the form of a stream of asynchronous events instead of intensity frames. Compared to conventional image sensors, they offer significant advantages: high temporal resolution, high dynamic range, no motion blur, and much lower bandwidth. Recently, learning-based approaches have been applied to event-based data, thus unlocking their po… ▽ More

    Submitted 22 October, 2020; v1 submitted 16 October, 2020; originally announced October 2020.

    Comments: IEEE International Conference on 3D Vision (3DV), 2020

  27. arXiv:2003.09148  [pdf, other

    cs.CV cs.LG eess.SP

    Event-based Asynchronous Sparse Convolutional Networks

    Authors: Nico Messikommer, Daniel Gehrig, Antonio Loquercio, Davide Scaramuzza

    Abstract: Event cameras are bio-inspired sensors that respond to per-pixel brightness changes in the form of asynchronous and sparse "events". Recently, pattern recognition algorithms, such as learning-based methods, have made significant progress with event cameras by converting events into synchronous dense, image-like representations and applying traditional machine learning methods developed for standar… ▽ More

    Submitted 17 July, 2020; v1 submitted 20 March, 2020; originally announced March 2020.

    Journal ref: European Conference on Computer Vision (ECCV), 2020

  28. arXiv:1912.03095  [pdf, other

    cs.CV

    Video to Events: Recycling Video Datasets for Event Cameras

    Authors: Daniel Gehrig, Mathias Gehrig, Javier Hidalgo-Carrió, Davide Scaramuzza

    Abstract: Event cameras are novel sensors that output brightness changes in the form of a stream of asynchronous "events" instead of intensity frames. They offer significant advantages with respect to conventional cameras: high dynamic range (HDR), high temporal resolution, and no motion blur. Recently, novel learning approaches operating on event data have achieved impressive results. Yet, these methods re… ▽ More

    Submitted 1 April, 2020; v1 submitted 6 December, 2019; originally announced December 2019.

  29. arXiv:1904.08245  [pdf, other

    cs.CV

    End-to-End Learning of Representations for Asynchronous Event-Based Data

    Authors: Daniel Gehrig, Antonio Loquercio, Konstantinos G. Derpanis, Davide Scaramuzza

    Abstract: Event cameras are vision sensors that record asynchronous streams of per-pixel brightness changes, referred to as "events". They have appealing advantages over frame-based cameras for computer vision, including high temporal resolution, high dynamic range, and no motion blur. Due to the sparse, non-uniform spatiotemporal layout of the event signal, pattern recognition algorithms typically aggregat… ▽ More

    Submitted 20 August, 2019; v1 submitted 17 April, 2019; originally announced April 2019.

    Comments: To appear at ICCV 2019

  30. Asynchronous, Photometric Feature Tracking using Events and Frames

    Authors: Daniel Gehrig, Henri Rebecq, Guillermo Gallego, Davide Scaramuzza

    Abstract: We present a method that leverages the complementarity of event cameras and standard cameras to track visual features with low-latency. Event cameras are novel sensors that output pixel-level brightness changes, called "events". They offer significant advantages over standard cameras, namely a very high dynamic range, no motion blur, and a latency in the order of microseconds. However, because the… ▽ More

    Submitted 25 July, 2018; originally announced July 2018.

    Comments: 22 pages, 15 figures, Video: https://youtu.be/A7UfeUnG6c4

    Journal ref: European Conference on Computer Vision (ECCV), Munich, 2018