-
Accurate Automatic 3D Annotation of Traffic Lights and Signs for Autonomous Driving
Authors:
Sándor Kunsági-Máté,
Levente Pető,
Lehel Seres,
Tamás Matuszka
Abstract:
3D detection of traffic management objects, such as traffic lights and road signs, is vital for self-driving cars, particularly for address-to-address navigation where vehicles encounter numerous intersections with these static objects. This paper introduces a novel method for automatically generating accurate and temporally consistent 3D bounding box annotations for traffic lights and signs, effe…
▽ More
3D detection of traffic management objects, such as traffic lights and road signs, is vital for self-driving cars, particularly for address-to-address navigation where vehicles encounter numerous intersections with these static objects. This paper introduces a novel method for automatically generating accurate and temporally consistent 3D bounding box annotations for traffic lights and signs, effective up to a range of 200 meters. These annotations are suitable for training real-time models used in self-driving cars, which need a large amount of training data. The proposed method relies only on RGB images with 2D bounding boxes of traffic management objects, which can be automatically obtained using an off-the-shelf image-space detector neural network, along with GNSS/INS data, eliminating the need for LiDAR point cloud data.
△ Less
Submitted 19 March, 2025; v1 submitted 19 September, 2024;
originally announced September 2024.
-
aiMotive Dataset: A Multimodal Dataset for Robust Autonomous Driving with Long-Range Perception
Authors:
Tamás Matuszka,
Iván Barton,
Ádám Butykai,
Péter Hajas,
Dávid Kiss,
Domonkos Kovács,
Sándor Kunsági-Máté,
Péter Lengyel,
Gábor Németh,
Levente Pető,
Dezső Ribli,
Dávid Szeghy,
Szabolcs Vajna,
Bálint Varga
Abstract:
Autonomous driving is a popular research area within the computer vision research community. Since autonomous vehicles are highly safety-critical, ensuring robustness is essential for real-world deployment. While several public multimodal datasets are accessible, they mainly comprise two sensor modalities (camera, LiDAR) which are not well suited for adverse weather. In addition, they lack far-ran…
▽ More
Autonomous driving is a popular research area within the computer vision research community. Since autonomous vehicles are highly safety-critical, ensuring robustness is essential for real-world deployment. While several public multimodal datasets are accessible, they mainly comprise two sensor modalities (camera, LiDAR) which are not well suited for adverse weather. In addition, they lack far-range annotations, making it harder to train neural networks that are the base of a highway assistant function of an autonomous vehicle. Therefore, we introduce a multimodal dataset for robust autonomous driving with long-range perception. The dataset consists of 176 scenes with synchronized and calibrated LiDAR, camera, and radar sensors covering a 360-degree field of view. The collected data was captured in highway, urban, and suburban areas during daytime, night, and rain and is annotated with 3D bounding boxes with consistent identifiers across frames. Furthermore, we trained unimodal and multimodal baseline models for 3D object detection. Data are available at \url{https://github.com/aimotive/aimotive_dataset}.
△ Less
Submitted 22 September, 2023; v1 submitted 17 November, 2022;
originally announced November 2022.
-
A Comparison of Two Smoothing Methods for Word Bigram Models
Authors:
Linda Bauman Peto
Abstract:
A COMPARISON OF TWO SMOOTHING METHODS FOR WORD BIGRAM MODELS
Linda Bauman Peto
Department of Computer Science
University of Toronto Abstract Word bigram models estimated from text corpora require smoothing methods to estimate the probabilities of unseen bigrams. The deleted estimation method uses the formula:
Pr(i|j) = lambda f_i + (1-lambda)f_i|j, where f_i and f_i|j are the relative fr…
▽ More
A COMPARISON OF TWO SMOOTHING METHODS FOR WORD BIGRAM MODELS
Linda Bauman Peto
Department of Computer Science
University of Toronto Abstract Word bigram models estimated from text corpora require smoothing methods to estimate the probabilities of unseen bigrams. The deleted estimation method uses the formula:
Pr(i|j) = lambda f_i + (1-lambda)f_i|j, where f_i and f_i|j are the relative frequency of i and the conditional relative frequency of i given j, respectively, and lambda is an optimized parameter. MacKay (1994) proposes a Bayesian approach using Dirichlet priors, which yields a different formula:
Pr(i|j) = (alpha/F_j + alpha) m_i + (1 - alpha/F_j + alpha) f_i|j where F_j is the count of j and alpha and m_i are optimized parameters. This thesis describes an experiment in which the two methods were trained on a two-million-word corpus taken from the Canadian _Hansard_ and compared on the basis of the experimental perplexity that they assigned to a shared test corpus. The methods proved to be about equally accurate, with MacKay's method using fewer resources.
△ Less
Submitted 31 October, 1994;
originally announced October 1994.