Combination of Computer Vision Detection and Segmentation for Autonomous Driving

Yu-Ho Tseng, Shau-Shiun Jan

Abstract: Most existing deep learning networks for computer vision attempt to improve the performance of either semantic segmentation or object detection. This study develops a unified network architecture that uses both semantic segmentation and object detection to detect people, cars, and roads simultaneously. To achieve this goal, we create an environment in the Unity engine as our dataset. We train our proposed unified network that combines segmentation and detection approaches with the simulation dataset. The proposed network can perform end-to-end prediction and performs well on the tested dataset. The proposed approach is also efficient, processing each image in about 30 ms on an NVIDIA GTX 1070.
Published in: 2018 IEEE/ION Position, Location and Navigation Symposium (PLANS)
April 23 - 26, 2018
Hyatt Regency Hotel
Monterey, CA
Pages: 1047 - 1052
Cite this article: Tseng, Yu-Ho, Jan, Shau-Shiun, "Combination of Computer Vision Detection and Segmentation for Autonomous Driving," 2018 IEEE/ION Position, Location and Navigation Symposium (PLANS), Monterey, CA, April 2018, pp. 1047-1052.
Full Paper: ION Members/Non-Members: 1 Download Credit
Sign In