Abstract
Human vision easily identifies the structural degradation in a video and thus perceptual quality improvement is necessary. To adjust the video quality to match with the human vision we proposed a structural similarity (SSIM) based rate control algorithm for the 3D video. We incorporated a structural similarity index (SSIM) as the quality metric in rate control algorithm. The rate-distortion model and Lagrange multiplier are derived considering the structural dissimilarity (dSSIM) as the distortion metric to achieve the rate control with perceptual quality improvement. Furthermore, the optimal joint bit allocation scheme at texture video/depth map level, frame level, and basic unit (BU) level is modified to incorporate dSSIM for measuring distortion. The proposed algorithm is implemented in HEVC compression standard HTM-16.2. The performance of the proposed algorithm is compared using RD curves, BD-Rate comparison, and subjective evaluation. Besides, rate accuracy is also computed to measure the bit rate mismatch. Compared to the original λ-domain rate control algorithm, the proposed algorithm achieves a better SSIM along with a reduction in bit rate.
Similar content being viewed by others
References
Bai Y, Zhang Y, Li Z (2015) 3D Video Coding using Just Noticeable Depth Difference based on H. 265/HEVC. In: 11Th international conference on computational intelligence and security (CIS). IEEE, pp 142–145
Chen X, Gu D (2014) Macroblock Layer Rate Control based on Structural Similarity and Mean Absolute Difference for H. 264. Adv Multimed:5
Cordina M, Debono CJ (2016) A Depth Map Rate Control Algorithm for HEVC Multi-View Video plus Depth. In: IEEE International conference on multimedia & expo workshops (ICMEW), pp 1–6
Cui Z, Gan Z, Zhu X (2011) Structural Similarity Optimal MB Layer Rate Control for H. 264. In: Proceedings of IEEE International Conference on Wireless Communications and Signal Processing (WCSP), pp 1–5
Fehn C (2004) Depth-Image-Based Rendering (DIBR), Compression, and Transmission for a New Approach on 3D-TV. In: Electronic imaging 2004. International society for optics and photonics, pp 93–104
Fujii Laboratory, Nagoya University. Available at: http://www.fujii.nuee.nagoya-u.ac.jp/multiview-data/, [Online; accessed on 29-July-2018]
HM Reference Software HTM-16.2. Available at: https://hevc.hhi.fraunhofer.de/trac/3d-hevc/browser/3DVCSoftware/tags/HTM-16.2, [Online; accessed on 21-September-2018]
Li B, Li H, Li L, Zhang J (2014) Lambda Domain Rate Control Algorithm for High Efficiency Video Coding. IEEE Trans Image Process 23:3841–3854
Morvan Y, Farin D (2007) Joint Depth/Texture Bit-Allocation for Multi-View Video Compression. In: Picture coding symposium (PCS)
Ou TS, Huang YH, Chen HH (2010) A Perceptual-based Approach to Bit Allocation for H. 264 Encoder. In: Visual communications and image processing 2010. International society for optics and photonics, vol 7744, pp 77441b
Shao F, Jiang G, Lin W, Yu M, Dai Q (2013) Joint Bit Allocation and Rate Control for Coding Multi-View Video plus Depth Based 3D Video. IEEE Trans Multimed 15(8):1843–1854
De Silva DVS, Fernando WAC, Nur G, Ekmekcioglu E, Worrall ST (2010) 3D Video Assessment with Just Noticeable Difference in Depth Evaluation. In: IEEE International conference on image processing. IEEE, pp 4013–4016
Tan S, Ma S, Wang S, Wang S, Gao W (2017) Inter-View Dependency-Based Rate Control for 3D-HEVC. IEEE Trans Circ Syst Video Technol 27 (2):337–351. https://doi.org/10.1109/TCSVT.2015.2511878
Tan S, Si J, Ma S, Wang S, Gao W (2014) Adaptive Frame Level Rate Control in 3D-HEVC. In: Proceedings of IEEE visual communications and image processing conf, pp 382–385. https://doi.org/10.1109/VCIP.2014.7051586
Harshalatha Y, Biswas PK (2018) SSIM-based Joint-Bit Allocation for 3D Video Coding. Multimed Tools Appl 77(15):19,051. https://doi.org/10.1007/s11042-017-5327-0
Yang C, An P, Shen L (2016) Adaptive Bit Allocation for 3D Video Coding. Circ Syst Signal Process:1–23
Yeo C, Tan HL, Tan YH (2013) On Rate Distortion Optimization using SSIM. IEEE Trans Circ Syst Video Technol 23(7):1170–1181
Yuan H, Chang Y, Li M, Yang F (2010) Model Based Bit Allocation Between Texture Images and Depth maps. In: International conference on computer and communication technologies in agriculture engineering (CCTAE). IEEE, vol 3, pp 380–383
Zhao H, Xie W, Zhang Y, Yu L, Men A (2013) An SSIM-Motivated LCU-Level Rate Control Algorithm for HEVC. In: Proceedings of Picture coding symp. (PCS), pp 85–88. https://doi.org/10.1109/PCS.2013.6737689
Zitnick CL, Kang SB, Uyttendaele M, Winder S, Szeliski R (2004) High-quality video view interpolation using a layered representation. In: ACM Transactions on graphics (TOG). ACM, vol 23, pp 600–608
Author information
Authors and Affiliations
Corresponding author
Additional information
Publisher’s note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
About this article
Cite this article
Harshalatha, Y., Biswas, P.K. Structural similarity-based rate control algorithm for 3D video. Multimed Tools Appl 80, 25897–25908 (2021). https://doi.org/10.1007/s11042-021-10922-z
Received:
Revised:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s11042-021-10922-z