Joint Optimization for SSIM-Based CTU-Level Bit Allocation and Rate Distortion Optimization,IEEE Transactions on Broadcasting

当前位置： X-MOL 学术 › IEEE Trans. Broadcast. › 论文详情

Our official English website, www.x-mol.net, welcomes your feedback! (Note: you will need to create a separate account there.)

Joint Optimization for SSIM-Based CTU-Level Bit Allocation and Rate Distortion Optimization
IEEE Transactions on Broadcasting ( IF 4.5 ) Pub Date : 2021-04-01 , DOI: 10.1109/tbc.2021.3068871
Yang Li , Xuanqin Mou

Structural similarity (SSIM)-based distortion

$\text {D}_{\text {SSIM}}$

is more consistent with human perception than the traditional mean squared error

$\text {D}_{\text {MSE}}$

. To achieve better video encoding quality, many studies on optimal bit allocation (OBA) used

$\text {D}_{\text {SSIM}}$

as the distortion metric. However, the MSE-based rate distortion optimization (RDO) was still used in these studies. The inconsistency between the optimization goals of OBA and RDO results in a non-optimal SSIM-based encoding performance. To solve this problem, we propose an accurate coding tree unit level

$\text {D}_{\text {SSIM}}-\text {D}_{\text {MSE}}$

model, which enables performing the SSIM-based RDO with simpler R-

$\text {D}_{\text {MSE}}$

cost scaled by the SSIM-based Lagrangian parameter

${\lambda _{\text {SSIM}}}$

. Moreover, based on this model, the R-

$\text {D}_{\text {SSIM}}$

model can be accurately estimated based on the joint relationship of R-

$\text {D}_{\text {SSIM}}-{\lambda _{\text {SSIM}}}$

. With the accurate R-

$\text {D}_{\text {SSIM}}$

model, the SSIM-based OBA problem is then solved. Accordingly, the SSIM-based OBA and SSIM-based RDO are unified together in our scheme, called SOSR. Compared with the HEVC reference encoder HM16.20, SOSR saves 5%, 11%, and 17% bitrate under the same SSIM in the commonly used all-intra, hierarchical and non-hierarchical low-delay-B configurations, which is superior to existing state-of-the-art SSIM-based OBA schemes.

中文翻译：

基于 SSIM 的 CTU 级比特分配和速率失真优化的联合优化

基于结构相似性 (SSIM) 的失真

$\text {D}_{\text {SSIM}}$

比传统的均方误差更符合人类的感知

$\text {D}_{\text {MSE}}$

. 为了获得更好的视频编码质量，许多关于最佳比特分配 (OBA) 的研究使用了

$\text {D}_{\text {SSIM}}$

作为失真度量。然而，这些研究中仍然使用基于 MSE 的速率失真优化 (RDO)。OBA 和 RDO 的优化目标之间的不一致导致基于 SSIM 的编码性能非最佳。为了解决这个问题，我们提出了一个准确的编码树单元级别

$\text {D}_{\text {SSIM}}-\text {D}_{\text {MSE}}$

模型，它可以使用更简单的 R- 执行基于 SSIM 的 RDO

$\text {D}_{\text {MSE}}$

由基于 SSIM 的拉格朗日参数缩放的成本

${\lambda _{\text {SSIM}}}$

. 此外，基于该模型，R-

$\text {D}_{\text {SSIM}}$

基于R-的联合关系可以准确估计模型

$\text {D}_{\text {SSIM}}-{\lambda _{\text {SSIM}}}$

. 用准确的 R-

$\text {D}_{\text {SSIM}}$

模型，然后解决基于 SSIM 的 OBA 问题。因此，基于 SSIM 的 OBA 和基于 SSIM 的 RDO 在我们的方案中统一在一起，称为 SOSR。与HEVC参考编码器HM16.20相比，SOSR在常用的all-intra、hierarchical和non-hierarchical low-delay-B配置下，在相同SSIM下分别节省5%、11%、17%的码率，优于现有最先进的基于 SSIM 的 OBA 方案。

更新日期：2021-06-08

点击分享查看原文

点击收藏

阅读更多本刊最新论文本刊介绍/投稿指南

全部期刊列表>>