Massively-parallel column-level segmentation of depth images

https://doi.org/10.1016/j.jocs.2021.101298Get rights and content
Under a Creative Commons license
open access

Abstract

Column-level segmentation of depth images is an energy-efficient strategy to perform 3D perception in autonomous-driving systems. These systems must perform 3D perception in real time through a pipeline of multiple tasks, which benefits from proposals that prioritize low complexity and short execution time over high levels of accuracy. For many years, column-level segmentation of depth images has been solved with the Stixels proposal, which uses an optimization algorithm with O(n2) computational complexity. This manuscript is an extended version of the ICCS paper “GPU-accelerated RDP Algorithm for Data Segmentation” (Cebrian and Moure, 2020). We present an alternative column-level segmentation proposal based on the RDP split-and-merge strategy, which has O(nlogn) computational complexity. The qualitative results obtained with the KITTI and Synthia image datasets evidence that our proposal can generate depth representations with greater compression and accuracy than the Stixels proposal. More importantly, we engineered a massively parallel design optimized for the low-power, GPU-accelerated embedded systems typically used for autonomous driving applications. For the datasets above, our proposal runs on a low-power NVIDIA Volta GPU 22 to 68 times faster than Stixels GPU-accelerated code. Additionally, our code achieves higher performance speedups as the computational capabilities and size of depth images increase.

Keywords

GPU acceleration
Data segmentation
Segmented parallel reduction

Cited by (0)

Pau Cebrian received the BSc degree in Computer Science from Universitat Autonoma de Barcelona (UAB) in 2016, and the MSc on Computer Vision at the UAB, UPC, UPF and UOC in 2017. Currently, he is working towards a PhD on Computer Vision and High Performance Computing supervised by Dr. Juan Carlos Moure. His focus is to improve 3D perception algorithms and adapt them to GPU devices. To this end, he is immersed in the study of depth image segmentation. He is interested in self-driving cars, 3D perception and embedded systems.

Juan C. Moure is an associate professor in the Computer Architecture and Operating Systems Department at the Universitat Autonoma of Barcelona (UAB), where he teaches Computer Architecture, Performance Engineering and Parallel Programming. His current research interests include massive parallel architectures, programming, and algorithms, mainly focused on Computer Vision, Signal Processing and Bioinformatics applications. He is the author of more than 50 papers, and has participated in several European and Spanish projects related to high-performance computing.

The code (and data) in this article has been certified as Reproducible by Code Ocean: (https://codeocean.com/). More information on the Reproducibility Badge Initiative is available at https://www.elsevier.com/physical-sciences-and-engineering/computer-science/journals.