AuxDepthNet: Real-Time Monocular 3D Object Detection with Depth-Sensitive Features
Authors:
Ruochen Zhang,
Hyeung-Sik Choi,
Dongwook Jung,
Phan Huy Nam Anh,
Sang-Ki Jeong,
Zihao Zhu
Abstract:
Monocular 3D object detection is a challenging task in autonomous systems due to the lack of explicit depth information in single-view images. Existing methods often depend on external depth estimators or expensive sensors, which increase computational complexity and hinder real-time performance. To overcome these limitations, we propose AuxDepthNet, an efficient framework for real-time monocular…
▽ More
Monocular 3D object detection is a challenging task in autonomous systems due to the lack of explicit depth information in single-view images. Existing methods often depend on external depth estimators or expensive sensors, which increase computational complexity and hinder real-time performance. To overcome these limitations, we propose AuxDepthNet, an efficient framework for real-time monocular 3D object detection that eliminates the reliance on external depth maps or pre-trained depth models. AuxDepthNet introduces two key components: the Auxiliary Depth Feature (ADF) module, which implicitly learns depth-sensitive features to improve spatial reasoning and computational efficiency, and the Depth Position Mapping (DPM) module, which embeds depth positional information directly into the detection process to enable accurate object localization and 3D bounding box regression. Leveraging the DepthFusion Transformer architecture, AuxDepthNet globally integrates visual and depth-sensitive features through depth-guided interactions, ensuring robust and efficient detection. Extensive experiments on the KITTI dataset show that AuxDepthNet achieves state-of-the-art performance, with $\text{AP}_{3D}$ scores of 24.72\% (Easy), 18.63\% (Moderate), and 15.31\% (Hard), and $\text{AP}_{\text{BEV}}$ scores of 34.11\% (Easy), 25.18\% (Moderate), and 21.90\% (Hard) at an IoU threshold of 0.7.
△ Less
Submitted 7 January, 2025;
originally announced January 2025.
A Study on Impacts of RTT Inaccuracy on Dynamic Bandwidth Allocation in PON and Solution
Authors:
Son Nguyen Hong,
Hao Nguyen Anh,
Thua Huynh Trong
Abstract:
The circle travelling delay between OLT (Optical Line Terminal) and ONU (Optical Network Unit) is one of most important items in dynamic bandwidth allocation (DBA) algorithms in PON, called RTT (Round Trip Time). The RTT is taken into account when OLT assigns the start times for upstream bandwidth grants. In most case, RTT is estimated before making bandwidth allocation decisions in dynamic bandwi…
▽ More
The circle travelling delay between OLT (Optical Line Terminal) and ONU (Optical Network Unit) is one of most important items in dynamic bandwidth allocation (DBA) algorithms in PON, called RTT (Round Trip Time). The RTT is taken into account when OLT assigns the start times for upstream bandwidth grants. In most case, RTT is estimated before making bandwidth allocation decisions in dynamic bandwidth allocation algorithms. If the estimated RTT is incorrect, the bandwidth allocation decisions are not matched with bandwidth requests of channels. Thus, performance of PON can get worse by deviation of RTT. There are several reasons that cause the RTT to be varying, such as processing delay, distance of OLT and ONU, changing in fiber refractive index resulting from temperature drift, and degree of accuracy of RTT estimation methods. In this paper, we evaluate the impacts of RTT inaccuracy on performance of DBA and identify levels of collision and waste of bandwidth. By this way, we propose a method to remedy the performance degradation encountered by the situation
△ Less
Submitted 12 October, 2014;
originally announced October 2014.