Search | arXiv e-print repository

A Multimodal Hybrid Late-Cascade Fusion Network for Enhanced 3D Object Detection

Authors: Carlo Sgaravatti, Roberto Basla, Riccardo Pieroni, Matteo Corno, Sergio M. Savaresi, Luca Magri, Giacomo Boracchi

Abstract: We present a new way to detect 3D objects from multimodal inputs, leveraging both LiDAR and RGB cameras in a hybrid late-cascade scheme that combines an RGB detection network and a 3D LiDAR detector. We exploit late fusion principles to reduce LiDAR False Positives, matching LiDAR detections with RGB ones by projecting the LiDAR bounding boxes on the image. We rely on cascade fusion principles to recover LiDAR False Negatives, leveraging epipolar constraints and frustums generated by RGB detections of separate views. Our solution can be plugged on top of any underlying single-modal detectors, enabling a flexible training process that can take advantage of pre-trained LiDAR and RGB detectors, or train the two branches separately. We evaluate our results on the KITTI object detection benchmark, showing significant performance improvements, especially for the detection of Pedestrians and Cyclists.

Submitted 25 April, 2025; originally announced April 2025.
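
The late-fusion step described in the abstract above (projecting LiDAR boxes onto the image and matching them with RGB detections) can be illustrated with a minimal sketch. This is not the authors' implementation: the intersection-over-union (IoU) matching rule, the `iou_threshold` value, and all function names are assumptions made for illustration, and the box corners are assumed to be already expressed in the camera frame.

```python
from typing import List, Sequence, Tuple

Box2D = Tuple[float, float, float, float]  # (x_min, y_min, x_max, y_max) in pixels
Point3D = Tuple[float, float, float]       # (x, y, z) in the camera frame


def project_box(corners: Sequence[Point3D], P: Sequence[Sequence[float]]) -> Box2D:
    """Project 3D box corners with a 3x4 projection matrix P and return
    the axis-aligned image rectangle that encloses the projected points."""
    us, vs = [], []
    for x, y, z in corners:
        u, v, w = (row[0] * x + row[1] * y + row[2] * z + row[3] for row in P)
        if w <= 0:  # corner behind the camera: ignore it
            continue
        us.append(u / w)
        vs.append(v / w)
    if not us:
        raise ValueError("box is entirely behind the camera")
    return (min(us), min(vs), max(us), max(vs))


def iou(a: Box2D, b: Box2D) -> float:
    """Intersection over union of two axis-aligned rectangles."""
    ix = max(0.0, min(a[2], b[2]) - max(a[0], b[0]))
    iy = max(0.0, min(a[3], b[3]) - max(a[1], b[1]))
    inter = ix * iy
    union = (a[2] - a[0]) * (a[3] - a[1]) + (b[2] - b[0]) * (b[3] - b[1]) - inter
    return inter / union if union > 0 else 0.0


def filter_lidar_detections(
    lidar_boxes: Sequence[Sequence[Point3D]],
    rgb_boxes: Sequence[Box2D],
    P: Sequence[Sequence[float]],
    iou_threshold: float = 0.5,  # hypothetical value, not taken from the paper
) -> List[int]:
    """Return indices of LiDAR boxes whose projection overlaps some RGB box.
    Unmatched LiDAR boxes are treated as likely false positives and dropped."""
    kept = []
    for i, corners in enumerate(lidar_boxes):
        try:
            rect = project_box(corners, P)
        except ValueError:
            continue
        if any(iou(rect, r) >= iou_threshold for r in rgb_boxes):
            kept.append(i)
    return kept
```

A LiDAR detection survives only if at least one RGB detection overlaps its projected rectangle. The cascade stage that recovers false negatives from epipolar constraints and frustums is not covered by this sketch.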

arXiv:2501.14502

LiDAR-Based Vehicle Detection and Tracking for Autonomous Racing

Authors: Marcello Cellina, Matteo Corno, Sergio Matteo Savaresi

Abstract: Autonomous racing provides a controlled environment for testing the software and hardware of autonomous vehicles operating at their performance limits. Competitive interactions between multiple autonomous racecars, however, introduce challenging and potentially dangerous scenarios. Accurate and consistent vehicle detection and tracking is crucial for overtaking maneuvers, and low-latency sensor processing is essential to respond quickly to hazardous situations. This paper presents the LiDAR-based perception algorithms deployed on Team PoliMOVE's autonomous racecar, which won multiple competitions in the Indy Autonomous Challenge series. Our Vehicle Detection and Tracking pipeline is composed of a novel fast Point Cloud Segmentation technique and a specific Vehicle Pose Estimation methodology, together with a variable-step Multi-Target Tracking algorithm. Experimental results demonstrate the algorithm's performance, robustness, computational efficiency, and suitability for autonomous racing applications, enabling fully autonomous overtaking maneuvers at velocities exceeding 275 km/h.

Submitted 24 January, 2025; originally announced January 2025.

Comments: 13 pages

arXiv:2403.13452

Mobile Robot Localization: a Modular, Odometry-Improving Approach

Authors: Luca Mozzarelli, Luca Cattaneo, Matteo Corno, Sergio Matteo Savaresi

Abstract: Despite the number of works published in recent years, vehicle localization remains an open, challenging problem. While map-based localization and SLAM algorithms are getting better and better, they remain a single point of failure in typical localization pipelines. This paper proposes a modular localization architecture that fuses sensor measurements with the outputs of off-the-shelf localization algorithms. The fusion filter estimates model uncertainties to improve odometry in case absolute pose measurements are lost entirely. The architecture is validated experimentally on a real robot navigating autonomously, proving a reduction of the position error of more than 90% with respect to the odometrical estimate without uncertainty estimation in a two-minute navigation period without position measurements.

Submitted 20 March, 2024; originally announced March 2024.

Comments: Accepted at IEEE European Control Conference 2024

arXiv:2403.13431

Automatic Navigation Map Generation for Mobile Robots in Urban Environments

Authors: Luca Mozzarelli, Simone Specchia, Matteo Corno, Sergio Matteo Savaresi

Abstract: A fundamental prerequisite for safe and efficient navigation of mobile robots is the availability of reliable navigation maps upon which trajectories can be planned. With the increasing industrial interest in mobile robotics, especially in urban environments, the process of generating navigation maps has become of particular interest, being a labor-intensive step of the deployment process. Automating this step is challenging and becomes even more arduous when the perception capabilities are limited by cost considerations. This paper proposes an algorithm to automatically generate navigation maps using a typical navigation-oriented sensor setup: a single top-mounted 3D LiDAR sensor. The proposed method is designed and validated with the urban environment as the main use case: it is shown to be able to produce accurate maps featuring different terrain types, positive obstacles of different heights as well as negative obstacles. The algorithm is applied to data collected in a typical urban environment with a wheeled inverted pendulum robot, showing its robustness against localization, perception and dynamic uncertainties. The generated map is validated against a human-made map.

Submitted 20 March, 2024; originally announced March 2024.

arXiv:2403.04112

Multi-Object Tracking with Camera-LiDAR Fusion for Autonomous Driving

Authors: Riccardo Pieroni, Simone Specchia, Matteo Corno, Sergio Matteo Savaresi

Abstract: This paper presents a novel multi-modal Multi-Object Tracking (MOT) algorithm for self-driving cars that combines camera and LiDAR data. Camera frames are processed with a state-of-the-art 3D object detector, whereas classical clustering techniques are used to process LiDAR observations. The proposed MOT algorithm comprises a three-step association process, an Extended Kalman filter for estimating the motion of each detected dynamic obstacle, and a track management phase. The EKF motion model requires the current measured relative position and orientation of the observed object and the longitudinal and angular velocities of the ego vehicle as inputs. Unlike most state-of-the-art multi-modal MOT approaches, the proposed algorithm does not rely on maps or knowledge of the ego global pose. Moreover, it uses a 3D detector exclusively for cameras and is agnostic to the type of LiDAR sensor used. The algorithm is validated both in simulation and with real-world data, with satisfactory results.

Submitted 12 May, 2024; v1 submitted 6 March, 2024; originally announced March 2024.

Comments: Published at IEEE European Control Conference 2024

arXiv:2401.10945

Automatic dimensionality reduction of Twin-in-the-Loop Observers

Authors: Giacomo Delcaro, Federico Dettù, Simone Formentin, Sergio Matteo Savaresi

Abstract: State-of-the-art vehicle dynamics estimation techniques usually share one common drawback: each variable to estimate is computed with an independent, simplified filtering module. These modules run in parallel and need to be calibrated separately. To solve this issue, a unified Twin-in-the-Loop (TiL) Observer architecture has recently been proposed: the classical simplified control-oriented vehicle model in the estimators is replaced by a full-fledged vehicle simulator, or digital twin (DT). The states of the DT are corrected in real time with a linear time-invariant output error law. Since the simulator is a black box, no explicit analytical formulation is available, hence classical filter tuning techniques cannot be used. For this reason, Bayesian Optimization is used to solve a data-driven optimization problem to tune the filter. Due to the complexity of the DT, the optimization problem is high-dimensional. This paper aims to find a procedure to tune the high-complexity observer by lowering its dimensionality. In particular, this work analyzes both a supervised and an unsupervised learning approach. The strategies have been validated for speed and yaw-rate estimation on real-world data.

Submitted 18 January, 2024; originally announced January 2024.

arXiv:2310.01230

Vehicle Fuel Consumption Virtual Sensing from GNSS and IMU Measurements

Authors: Marcello Cellina, Silvia Strada, Sergio Matteo Savaresi

Abstract: This paper presents a vehicle-independent, non-intrusive, and light monitoring system for accurately measuring fuel consumption in road vehicles from longitudinal speed and acceleration derived continuously in time from GNSS and IMU sensors mounted inside the vehicle. In parallel to boosting the transition to zero-carbon cars, there is an increasing interest in low-cost instruments for precise measurement of the environmental impact of the many internal combustion engine vehicles still in circulation. The main contribution of this work is the design and comparison of two innovative black-box algorithms, one based on reduced-complexity physics modeling and the other relying on a feedforward neural network, for fuel consumption estimation using only velocity and acceleration measurements. Based on suitable metrics, the developed algorithms outperform the best state-of-the-art approach, both in the instantaneous and in the integral fuel consumption estimation, with errors smaller than 1% with respect to the fuel flow ground truth. The data used for model identification, testing, and experimental validation is composed of GNSS velocity and IMU acceleration measurements collected during several trips using a diesel fuel vehicle on different roads, in different seasons, and with varying numbers of passengers. Compared to built-in vehicle monitoring systems, this methodology is not customized, uses off-the-shelf sensors, and is based on two simple algorithms that have been validated offline and could be easily implemented in a real-time environment.

Submitted 2 October, 2023; originally announced October 2023.

arXiv:1907.09453

Analysis and development of an automatic eCall for motorcycles: a one-class cepstrum approach

Authors: Simone Gelmini, Giulio Panzani, Sergio Savaresi

Abstract: The automatic dial of an emergency call - eCall - in response to a road accident is a feature that is gaining interest in the intelligent vehicle community. It indirectly increases the driving safety of road vehicles, but presents the technical challenge of developing an algorithm which triggers the emergency call only when needed, a non-trivial task for two-wheeled vehicles due to their complex dynamics. In the present work, we propose an eCall algorithm that detects these anomalies in the data time series by means of cepstral analysis. The main advantage of the proposed approach is the direct focus on the data dynamics, overcoming the limits of approaches based on the analysis of the instantaneous value of some combination of signals. The algorithm is calibrated and tested against real driving data of ten different drivers, including seven real crash events, and its performance is compared with known methods.

Submitted 19 July, 2019; originally announced July 2019.
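
For readers unfamiliar with the cepstral analysis mentioned in the abstract above, the following is a minimal sketch of the textbook real cepstrum, defined as the inverse DFT of the log-magnitude spectrum. It is not the authors' eCall detector: the one-class decision logic is not reproduced, and the function names and the `eps` guard are assumptions made for illustration.

```python
import cmath
import math
from typing import List, Sequence


def dft(x: Sequence[complex], inverse: bool = False) -> List[complex]:
    """Naive O(N^2) discrete Fourier transform (adequate for short windows)."""
    n = len(x)
    sign = 1.0 if inverse else -1.0
    out = []
    for k in range(n):
        acc = sum(x[t] * cmath.exp(sign * 2j * math.pi * k * t / n) for t in range(n))
        out.append(acc / n if inverse else acc)
    return out


def real_cepstrum(signal: Sequence[float], eps: float = 1e-12) -> List[float]:
    """Real cepstrum: inverse DFT of the log-magnitude spectrum.

    eps is an assumption of this sketch; it only guards against log(0).
    """
    log_mag = [math.log(abs(X) + eps) for X in dft(signal)]
    return [c.real for c in dft(log_mag, inverse=True)]
```

Because the logarithm turns a gain into an additive constant, scaling the input by a constant changes only the zeroth cepstral coefficient. This is one reason cepstral features describe the dynamics of a signal rather than its instantaneous amplitude.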

arXiv:1903.11849

doi 10.1109/ICCW.2019.8756931

Inertial Sensor Aided mmWave Beam Tracking to Support Cooperative Autonomous Driving

Authors: Mattia Brambilla, Monica Nicoli, Sergio Savaresi, Umberto Spagnolini

Abstract: This paper presents an inertial sensor aided technique for beam alignment and tracking in massive multiple-input multiple-output (MIMO) vehicle-to-vehicle (V2V) communications based on millimeter waves (mmWave). Since directional communications in vehicular scenarios are severely hindered by beam pointing issues, a beam alignment procedure has to be periodically carried out to guarantee the communication reliability. When dealing with massive MIMO links, the beam sweeping approach is known to be time consuming and often unfeasible due to latency constraints. To speed up the process, we propose a method that exploits a-priori information on array dynamics provided by an inertial sensor on transceivers to assist the beam alignment procedure. The proposed inertial sensor aided technique allows a continuous tracking of the beam while transmitting, avoiding frequent realignment phases. Numerical results based on real measurements of on-transceiver accelerometers demonstrate a significant gain in terms of V2V communication throughput with respect to conventional beam alignment protocols.

Submitted 28 March, 2019; originally announced March 2019.

Comments: Accepted at IEEE ICC 2019

arXiv:1804.02960

Analysis and development of a novel algorithm for the in-vehicle hand-usage of a smartphone

Authors: Simone Gelmini, Silvia Strada, Mara Tanelli, Sergio Savaresi, Vincenzo Biase

Abstract: Smartphone usage while driving is unanimously considered to be a really dangerous habit due to its strong correlation with road accidents. In this paper, the problem of detecting whether the driver is using the phone during a trip is addressed. To do this, high-frequency data from the triaxial inertial measurement unit (IMU) integrated in almost all modern phones is processed without relying on external inputs so as to provide a self-contained approach. By resorting to a frequency-domain analysis, it is possible to extract from the raw signals the useful information needed to detect when the driver is using the phone, without being affected by the effects that vehicle motion has on the same signals. The selected features are used to train a Support Vector Machine (SVM) algorithm. The performance of the proposed approach is analyzed and tested on experimental data collected during mixed naturalistic driving scenarios, proving the effectiveness of the proposed approach.

Submitted 30 August, 2018; v1 submitted 6 April, 2018; originally announced April 2018.

Showing 1–10 of 10 results for author: Savaresi, S