-
Why-So-Deep: Towards Boosting Previously Trained Models for Visual Place Recognition
Authors:
M. Usman Maqbool Bhutta,
Yuxiang Sun,
Darwin Lau,
Ming Liu
Abstract:
Deep learning-based image retrieval techniques for the loop closure detection demonstrate satisfactory performance. However, it is still challenging to achieve high-level performance based on previously trained models in different geographical regions. This paper addresses the problem of their deployment with simultaneous localization and mapping (SLAM) systems in the new environment. The general…
▽ More
Deep learning-based image retrieval techniques for the loop closure detection demonstrate satisfactory performance. However, it is still challenging to achieve high-level performance based on previously trained models in different geographical regions. This paper addresses the problem of their deployment with simultaneous localization and mapping (SLAM) systems in the new environment. The general baseline approach uses additional information, such as GPS, sequential keyframes tracking, and re-training the whole environment to enhance the recall rate. We propose a novel approach for improving image retrieval based on previously trained models. We present an intelligent method, MAQBOOL, to amplify the power of pre-trained models for better image recall and its application to real-time multiagent SLAM systems. We achieve comparable image retrieval results at a low descriptor dimension (512-D), compared to the high descriptor dimension (4096-D) of state-of-the-art methods. We use spatial information to improve the recall rate in image retrieval on pre-trained models.
△ Less
Submitted 10 January, 2022;
originally announced January 2022.
-
Smart-Inspect: Micro Scale Localization and Classification of Smartphone Glass Defects for Industrial Automation
Authors:
M Usman Maqbool Bhutta,
Shoaib Aslam,
Peng Yun,
Jianhao Jiao,
Ming Liu
Abstract:
The presence of any type of defect on the glass screen of smart devices has a great impact on their quality. We present a robust semi-supervised learning framework for intelligent micro-scaled localization and classification of defects on a 16K pixel image of smartphone glass. Our model features the efficient recognition and labeling of three types of defects: scratches, light leakage due to crack…
▽ More
The presence of any type of defect on the glass screen of smart devices has a great impact on their quality. We present a robust semi-supervised learning framework for intelligent micro-scaled localization and classification of defects on a 16K pixel image of smartphone glass. Our model features the efficient recognition and labeling of three types of defects: scratches, light leakage due to cracks, and pits. Our method also differentiates between the defects and light reflections due to dust particles and sensor regions, which are classified as non-defect areas. We use a partially labeled dataset to achieve high robustness and excellent classification of defect and non-defect areas as compared to principal components analysis (PCA), multi-resolution and information-fusion-based algorithms. In addition, we incorporated two classifiers at different stages of our inspection framework for labeling and refining the unlabeled defects. We successfully enhanced the inspection depth-limit up to 5 microns. The experimental results show that our method outperforms manual inspection in testing the quality of glass screen samples by identifying defects on samples that have been marked as good by human inspection.
△ Less
Submitted 1 October, 2020;
originally announced October 2020.
-
Loop-box: Multi-Agent Direct SLAM Triggered by Single Loop Closure for Large-Scale Mapping
Authors:
M Usman Maqbool Bhutta,
Manohar Kuse,
Rui Fan,
Yanan Liu,
Ming Liu
Abstract:
In this paper, we present a multi-agent framework for real-time large-scale 3D reconstruction applications. In SLAM, researchers usually build and update a 3D map after applying non-linear pose graph optimization techniques. Moreover, many multi-agent systems are prevalently using odometry information from additional sensors. These methods generally involve intensive computer vision algorithms and…
▽ More
In this paper, we present a multi-agent framework for real-time large-scale 3D reconstruction applications. In SLAM, researchers usually build and update a 3D map after applying non-linear pose graph optimization techniques. Moreover, many multi-agent systems are prevalently using odometry information from additional sensors. These methods generally involve intensive computer vision algorithms and are tightly coupled with various sensors. We develop a generic method for the keychallenging scenarios in multi-agent 3D mapping based on different camera systems. The proposed framework performs actively in terms of localizing each agent after the first loop closure between them. It is shown that the proposed system only uses monocular cameras to yield real-time multi-agent large-scale localization and 3D global mapping. Based on the initial matching, our system can calculate the optimal scale difference between multiple 3D maps and then estimate an accurate relative pose transformation for large-scale global mapping.
△ Less
Submitted 29 September, 2020;
originally announced September 2020.
-
PCR-Pro: 3D Sparse and Different Scale Point Clouds Registration and Robust Estimation of Information Matrix For Pose Graph SLAM
Authors:
M. Usman Maqbool Bhutta,
Ming Liu
Abstract:
For both indoor and outdoor environments, we propose an efficient and novel method for different scales and sparse 3D point clouds registration that cannot be handled by the current popular ICP approaches. Our algorithm efficiently detects the scale difference between point clouds and uses the keyframes to estimate the relative pose for calculating the scale difference. The algorithm applies a fil…
▽ More
For both indoor and outdoor environments, we propose an efficient and novel method for different scales and sparse 3D point clouds registration that cannot be handled by the current popular ICP approaches. Our algorithm efficiently detects the scale difference between point clouds and uses the keyframes to estimate the relative pose for calculating the scale difference. The algorithm applies a filter and computes the final transformation which coverages to a global minimum. The good estimation of transform and scale helps in the calculation of the covariance matrix using a closed form solution efficiently. This covariance between point clouds helps in the estimation of information matrix for pose-graph SLAM.
△ Less
Submitted 29 August, 2018;
originally announced August 2018.
-
Multiple Lane Detection Algorithm Based on Optimised Dense Disparity Map Estimation
Authors:
Han Ma,
Yixin Ma,
Jianhao Jiao,
M Usman Maqbool Bhutta,
Mohammud Junaid Bocus,
Lujia Wang,
Ming Liu,
Rui Fan
Abstract:
Lane detection is very important for self-driving vehicles. In recent years, computer stereo vision has been prevalently used to enhance the accuracy of the lane detection systems. This paper mainly presents a multiple lane detection algorithm developed based on optimised dense disparity map estimation, where the disparity information obtained at time t_{n} is utilised to optimise the process of d…
▽ More
Lane detection is very important for self-driving vehicles. In recent years, computer stereo vision has been prevalently used to enhance the accuracy of the lane detection systems. This paper mainly presents a multiple lane detection algorithm developed based on optimised dense disparity map estimation, where the disparity information obtained at time t_{n} is utilised to optimise the process of disparity estimation at time t_{n+1}. This is achieved by estimating the road model at time t_{n} and then controlling the search range for the disparity estimation at time t_{n+1}. The lanes are then detected using our previously published algorithm, where the vanishing point information is used to model the lanes. The experimental results illustrate that the runtime of the disparity estimation is reduced by around 37% and the accuracy of the lane detection is about 99%.
△ Less
Submitted 28 August, 2018;
originally announced August 2018.