Abstract
Depth map quality is an important factor that affects the quality of synthesized stereoscopic images in stereoscopic visual communication systems using the depth image-based rendering (DIBR) technique. This paper proposes a method using a generative adversarial network (GAN) to denoise depth maps corrupted by several types of distortion. The generative network of the proposed GAN builds on convolutional layers, residual layers, and transposed convolutional layers with symmetric skip connections. The discriminative network of the proposed GAN is designed as a convolutional neural network. The generative network for denoising depth maps is trained with cropped depth maps where distortion is applied. Objective and subjective assessment of denoised depth maps and DIBR-synthesized stereoscopic images demonstrate that the proposed GAN effectively reduces the distortion in the depth maps and improves the quality of DIBR-synthesized stereoscopic images.
Similar content being viewed by others
References
Abadi M, Barham P, Chen J, Chen Z, Davis A, Dean J, Devin M, Ghemawat S, Irving G, Isard M et al (2016) Tensorflow: a system for large-scale machine learning. In: 12th \(\{\)USENIX\(\}\) symposium on operating systems design and implementation (\(\{\)OSDI\(\}\) 16), pp 265–283
Arjovsky M, Chintala S, Bottou L (2017) Wasserstein gan. arXiv preprint arXiv:1701.07875
Fehn C (2004) Depth-image-based rendering (dibr), compression, and transmission for a new approach on 3d-tv. In: Stereoscopic displays and virtual reality systems XI, vol 5291, pp 93–104. International Society for Optics and Photonics
Fleishman S, Drori I, Cohen-Or D (2004) Bilateral mesh denoising. In: ACM SIGGRAPH 2003 papers, pp 950–953 x
Goodfellow I, Pouget-Abadie J, Mirza M, Xu B, Warde-Farley D, Ozair S, Courville A, Bengio Y (2014) Generative adversarial nets. In: Advances in neural information processing systems, pp 2672–2680
Han L, Wu X, Liang W, Hou G, Jia Y (2010) Discriminative human action recognition in the learned hierarchical manifold space. Image Vis Comput 28(5):836–849
Herrera D, Kannala J, Heikkilä J (2012) Joint depth and color camera calibration with distortion correction. IEEE Trans Pattern Anal Mach Intell 34(10):2058–2064
Heusel M, Ramsauer H, Unterthiner T, Nessler B, Hochreiter S (2017) Gans trained by a two time-scale update rule converge to a local nash equilibrium. In: Advances in neural information processing systems, pp 6626–6637
Hu W, Li X, Cheung G, Au O (2013) Depth map denoising using graph-based transform and group sparsity. In: 2013 IEEE 15th international workshop on multimedia signal processing (MMSP), pp 001–006. IEEE
Isola P, Zhu JY, Zhou T, Efros AA (2017) Image-to-image translation with conditional adversarial networks. In: Proceedings of the IEEE conference on computer vision and pattern recognition, pp 1125–1134
Kingma DP, Ba J (2014) Adam: a method for stochastic optimization. arXiv preprint arXiv:1412.6980
Kopf J, Cohen MF, Lischinski D, Uyttendaele M (2007) Joint bilateral upsampling. ACM Trans Graph ToG 26(3):96
Kundu D, Choi LK, Bovik AC, Evans BL (2018) Perceptual quality evaluation of synthetic pictures distorted by compression and transmission. Sig Process Image Commun 61:54–72
Kupyn O, Budzan V, Mykhailych M, Mishkin D, Matas J (2018) Deblurgan: blind motion deblurring using conditional adversarial networks. In: Proceedings of the IEEE conference on computer vision and pattern recognition, pp 8183–8192
Li F, Yu J, Chai J (2008) A hybrid camera for motion deblurring and depth map super-resolution. In: 2008 IEEE conference on computer vision and pattern recognition, pp 1–8. IEEE
Liu A, Lin W, Narwaria M (2011) Image quality assessment based on gradient similarity. IEEE Trans Image Process 21(4):1500–1512
Lu S, Ren X, Liu F (2014) Depth enhancement via low-rank matrix completion. In: Proceedings of the IEEE conference on computer vision and pattern recognition, pp 3390–3397
Mao X, Li Q, Xie H, Lau RY, Wang Z, Paul Smolley S (2017) Least squares generative adversarial networks. In: Proceedings of the IEEE international conference on computer vision, pp 2794–2802
Nah S, Hyun Kim T, Mu Lee K (2017) Deep multi-scale convolutional neural network for dynamic scene deblurring. In: Proceedings of the IEEE conference on computer vision and pattern recognition, pp 3883–3891
Nasr MAS, Al Rahmawy MF, Tolba A (2016) Multi-scale structural similarity index for motion. Structure 147:148
Pan J, Sun D, Pfister H, Yang MH (2016) Blind image deblurring using dark channel prior. In: Proceedings of the IEEE conference on computer vision and pattern recognition, pp 1628–1636
Richardt C, Stoll C, Dodgson NA, Seidel HP, Theobalt C (2012) Coherent spatiotemporal filtering, upsampling and rendering of rgbz videos. In: Computer graphics forum, vol 31, pp 247–256. Wiley Online Library
Roth S, Black MJ (2005) Fields of experts: a framework for learning image priors. In: 2005 IEEE computer society conference on computer vision and pattern recognition (CVPR’05), vol 2, pp 860–867. IEEE
Scharstein D, Hirschmüller H, Kitajima Y, Krathwohl G, Nešić N, Wang X, Westling P (2014) High-resolution stereo datasets with subpixel-accurate ground truth. In: German conference on pattern recognition, pp 31–42. Springer
Song R, Ko H, Kuo CCJ (2014) Mcl-3d: a database for stereoscopic image quality assessment using 2d-image-plus-depth source. J Inform Sci Eng 31
Sterzentsenko V, Saroglou L, Chatzitofis A, Thermos S, Zioulis N, Doumanoglou A, Zarpalas D, Daras P (2019) Self-supervised deep depth denoising. In: Proceedings of the IEEE international conference on computer vision, pp 1242–1251
Tanimoto M, Fujii T, Suzuki K (2009) View synthesis algorithm in view synthesis reference software 2.0 (vsrs2. 0). ISO/IEC JTC1/SC29/WG11 M 16090, 2009
Wang KF, Gou C, Duan YJ, Lin YL, Zheng XH, Wang F (2017) Generative adversarial networks: the state of the art and beyond. Acta Automatica Sinica 43(3):321–332
Wang Z, Bovik AC, Sheikh HR, Simoncelli EP (2004) Image quality assessment: from error visibility to structural similarity. IEEE Trans Image Process 13(4):600–612
Xie J, Feris RS, Yu SS, Sun MT (2015) Joint super resolution and denoising from a single depth image. IEEE Trans Multimed 17(9):1525–1537
Xu L, Zheng S, Jia J (2013) Unnatural \(l_0\) sparse representation for natural image deblurring. In: Proceedings of the IEEE conference on computer vision and pattern recognition, pp 1107–1114
Xue W, Zhang L, Mou X, Bovik AC (2013) Gradient magnitude similarity deviation: a highly efficient perceptual image quality index. IEEE Trans Image Process 23(2):684–695
Yoshizawa S, Belyaev A, Seidel HP (2006) Smoothing by example: mesh denoising by averaging with similarity-based weights. In: IEEE international conference on shape modeling and applications 2006 (SMI’06), p 9. IEEE
Zaeemzadeh A, Rahnavard N, Shah M (2020) Norm-preservation: Why residual networks can become extremely deep? IEEE Trans Pattern Anal Mach Intell
Zhang L, Jieyu Z, Xulun Y et al (2018) Co-operative generative adversarial nets. Acta Automatica Sinica 44(5):804–810
Zhang X, Wu R (2016) Fast depth image denoising and enhancement using a deep convolutional network. In: 2016 IEEE international conference on acoustics, speech and signal processing (ICASSP), pp 2499–2503. IEEE
Zhu JY, Park T, Isola P, Efros AA (2017) Unpaired image-to-image translation using cycle-consistent adversarial networks. In: Proceedings of the IEEE international conference on computer vision, pp 2223–2232
Author information
Authors and Affiliations
Corresponding author
Ethics declarations
Conflict of interest
The authors declare that they have no conflict of interest
Additional information
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
This research was supported by Jiangsu Key Laboratory of Meteorological Observation and Information Processing Open Project (KDXS1805) and by the Priority Academic Program Development of Jiangsu Higher Education Institutions Project.
Rights and permissions
About this article
Cite this article
Zhang, C., Sun, Xw., Xu, J. et al. A Generative Adversarial Network to Denoise Depth Maps for Quality Improvement of DIBR-Synthesized Stereoscopic Images. J. Electr. Eng. Technol. 16, 2201–2210 (2021). https://doi.org/10.1007/s42835-021-00728-2
Received:
Revised:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s42835-021-00728-2