3D Object Detection for Autonomous Driving using Deep Learning
Mohamed Khaled Gamil Ismail Hussein;
Abstract
Abstract
3D Object detection is one of the most important perception tasks needed by autonomous vehicles to detect different road agents like other vehicles, cyclists, and pedestrians which is required for driving tasks like collision avoidance and path planning. State of the art 3D Object detection deep learning models take as input point clouds from very expensive LIDAR sensors because it provides accurate depth information. Another alternative is using cameras which are far cheaper but lack accurate depth information which causes a huge gap in average precision from LIDAR models vs camera models. In this thesis, our work is focused on 3D Object Detection for car class from stereo images without LIDAR supervision neither during training nor during inference and the challenging task of running 3D Object Detection on an embedded target Nvidia Jetson TX2 by modifying Stereo R-CNN model and reducing the model size to approximately one third the size of the original model to be more suitable for embedded targets. Experiments on KITTI dataset showed that our model’s inference time is 1.8 seconds and its’ average precision for moderate car class is 17% on the test set. Our model decreases training and inference time by approximately 60% with a 13% drop on the test set which is an expected trade-off when decreasing the number of parameters inside the model.
Keywords: 3D Object Detection, Stereo Vision, Autonomous Driving, Embedded Systems
3D Object detection is one of the most important perception tasks needed by autonomous vehicles to detect different road agents like other vehicles, cyclists, and pedestrians which is required for driving tasks like collision avoidance and path planning. State of the art 3D Object detection deep learning models take as input point clouds from very expensive LIDAR sensors because it provides accurate depth information. Another alternative is using cameras which are far cheaper but lack accurate depth information which causes a huge gap in average precision from LIDAR models vs camera models. In this thesis, our work is focused on 3D Object Detection for car class from stereo images without LIDAR supervision neither during training nor during inference and the challenging task of running 3D Object Detection on an embedded target Nvidia Jetson TX2 by modifying Stereo R-CNN model and reducing the model size to approximately one third the size of the original model to be more suitable for embedded targets. Experiments on KITTI dataset showed that our model’s inference time is 1.8 seconds and its’ average precision for moderate car class is 17% on the test set. Our model decreases training and inference time by approximately 60% with a 13% drop on the test set which is an expected trade-off when decreasing the number of parameters inside the model.
Keywords: 3D Object Detection, Stereo Vision, Autonomous Driving, Embedded Systems
Other data
Title | 3D Object Detection for Autonomous Driving using Deep Learning | Other Titles | اكتشاف كائن ثلاثي الأبعاد للقيادة الذاتية باستخدام التعلم العميق | Authors | Mohamed Khaled Gamil Ismail Hussein | Issue Date | 2022 |
Attached Files
File | Size | Format | |
---|---|---|---|
BB13764.pdf | 384.19 kB | Adobe PDF | View/Open |
Similar Items from Core Recommender Database
Items in Ain Shams Scholar are protected by copyright, with all rights reserved, unless otherwise indicated.