Abstract
Bitrate reduction with little to no degradation in visual perception is a long-standing challenge in video coding. This paper targets this challenge by adaptively filtering the content prior to video compression and in the preprocessing stage. This is done by applying a bilateral filter where the filter parameters are selected according to regional content complexity and estimated visual importance besides bitrate and quality requirements. A multi-scale metric based on 2D gradient is employed to determine bandwidth requirements of different regions. A random forest regression model is trained to predict distortion and bit requirements for a block, if it is filtered and encoded at a given quality. The predicted distortion and bit requirements are used to select filter parameters considering a cost function. The proposed approach is applied to both H.264 and HEVC encoders, with different GOP structures. The results show up to 60% bitrate reduction in terms of BD-Rate (about 20% on average) for the attempted test cases with little to no noticeable quality degradation.
Similar content being viewed by others
Notes
Python sklearn notations are used for different forest parameters.
Consider all possible combinations of i, j, f in \(B_{i,j}^f\), and also different quality (Q) values and different content types.
Test sequences are from [2] and http://media.xiph.org/video/derf/.
References
Bjontegaard, G.: Calculation of average PSNR differences between RD-curves. In: Proceedings of the ITU-T Video Coding Experts Group, Thirteenth Meeting (2001)
Bossen, F.: Common test conditions and software reference configurations. JCT-VC, Technical report I1100 (2012)
Breiman, L.: Random forests. Mach. Learn. 45(1), 5–32 (2001)
Chen, Z., Liu, H.: JND modeling: approaches and applications. In: International Conference on Digital Signal Processing, pp. 827–830 (2014)
Duvar, R., Akbulut, O., Urhan, O.: Fast inter mode decision exploiting intra-block similarity in HEVC. Signal Process. Image Commun. 78, 503–510 (2019)
Grois, D., Nguyen, T., Marpe, D.: Performance comparison of AV1, JEM, VP9, and HEVC encoders. In: Proceedings of SPIE, 10396, (2018)
Guo, L., Cock, J.D., Aaron, A.: Compression performance comparison of x264, x265, libvpx and aomenc for on-demand adaptive streaming applications. In: Picture Coding Symposium, pp. 26–30 (2018)
Heidari, B., Ramezanpour, M.: Reduction of intra-coding time for HEVC based on temporary direction map. J. Real-Time Image Process. (2018). https://doi.org/10.1007/s11554-018-0815-7
Huang, X., Zhang, Q., Zhao, X., Zhang, W., Zhang, Y., Gan, Y.: Fast inter-prediction mode decision algorithm for HEVC. Signal Image Video Process. 11(1), 33–40 (2017)
Jenab, M., Amer, I., Ivanovic, B., Saeedi, M., Liu, Y., Sines, G., Shirani, S.: Content-adaptive resolution control to improve video coding efficiency. In: IEEE International Conference on Multimedia Expo Workshops, pp. 1–4 (2018)
Karunaratne, P.V., Segall, C.A., Katsaggelos, A.K.: A rate-distortion optimal video pre-processing algorithm. In: International Conference on Image Processing, 1, 481–484 (2001)
Kerofsky, L.J., Vanam, R., Reznik, Y.A.: Improved adaptive video delivery system using a perceptual pre-processing filter. In: Signal and Information Processing, pp. 1058–1062 (2014)
Ki, S., Bae, S., Kim, M., Ko, H.: Learning-based just-noticeable-quantization-distortion modeling for perceptual video coding. IEEE Trans. Image Process. 27(7), 3178–3193 (2018)
Kim, W.J., Yi, J.W., Kim, S.D.: A bit allocation method based on picture activity for still image coding. IEEE Trans. Image Process. 8(7), 974–977 (1999)
Lee, J., Ebrahimi, T.: Perceptual video compression: a survey. IEEE J.Sel. Top. Signal Process. 6(6), 684–697 (2012)
Li, S., Zhang, F., Ma, L., Ngan, K.N.: Image quality assessment by separately evaluating detail losses and additive impairments. IEEE Trans. Multimed. 13(5), 935–949 (2011)
Li, Z., Aaron, A., Katsavounidis, I., Moorthy, A., Manohara, M.: Toward a practical perceptual video quality metric. Netflix Technology Blog (2016)
Li, Z., Bampis, C., Novak, J., Aaron, A., Swanson, K., Moorthy, A., Cock, J.D.: VMAF: The journey continues. Netflix Technology Blog (2018)
Lu, S.P., Zhang, S.R.: Saliency-based fidelity adaptation preprocessing for video coding. J. Comput. Sci. Technol. 26(1), 195–202 (2011)
Najafabadi, N., Ramezanpour, M.: Mass center direction-based decision method for intraprediction in HEVC standard. J. Real-Time Image Process. (2019). https://doi.org/10.1007/s11554-019-00864-z
Nguyen, T., Marpe, D.: Future video coding technologies: a performance evaluation of AV1, JEM, VP9, and HM. In: Picture Coding Symposium, pp. 31–35 (2018)
Oh, H., Kim, W.: Video processing for human perceptual visual quality-oriented video coding. IEEE Trans. Image Process. 22(4), 1526–1535 (2013)
Ramezanpour, M., Zargari, F.: Fast CU size and prediction mode decision method for HEVC encoder based on spatial features. Signal Image Video Process. 10(7), 1233–1240 (2016)
Rassool, R.: VMAF reproducibility: validating a perceptual practical video quality metric. In: IEEE International Symposium on Broadband Multimedia Systems and Broadcasting, pp. 1–2 (2017)
Shaw, M.Q., Allebach, J.P., Delp, E.J.: Color difference weighted adaptive residual preprocessing using perceptual modeling for video compression. Image Commun. 39, 355–368 (2015)
Sheikh, H.R., Bovik, A.C.: Image information and visual quality. IEEE Trans. Image Process. 15(2), 430–444 (2006)
Tomasi, C., Manduchi, R.: Bilateral filtering for gray and color images. In: International Conference on Computer Vision, pp. 839–846 (1998)
Vanam, R., Kerofsky, L.J., Reznik, Y.A.: Perceptual pre-processing filter for adaptive video on demand content delivery. In: IEEE International Conference on Image Processing, pp. 2537–2541 (2014)
Vidal, E., Sturmel, N., Guillemot, C., Corlay, P., Coudoux, F.X.: New adaptive filters as perceptual preprocessing for rate-quality performance optimization of video coding. Signal Process. Image Commun. 52, 124–137 (2017)
Yang, X., Ling, W., Lu, Z., Ong, E.P., Yao, S.: Just noticeable distortion model and its applications in video coding. Signal Process. Image Commun. 20(7), 662–680 (2005)
Zhang, X., Lin, W., Wang, S., Liu, J., Ma, S., Gao, W.: Fine-grained quality assessment for compressed images. IEEE Trans Image Process. 28(3), 1163–1175 (2019)
Author information
Authors and Affiliations
Corresponding author
Additional information
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
About this article
Cite this article
Saeedi, M., Ivanovic, B., Stolarczyk, T. et al. Content adaptive pre-filtering for video compression. SIViP 14, 935–943 (2020). https://doi.org/10.1007/s11760-019-01625-y
Received:
Revised:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s11760-019-01625-y