当前位置: X-MOL 学术Electronics › 论文详情
Our official English website, www.x-mol.net, welcomes your feedback! (Note: you will need to create a separate account there.)
Fast CU Partition Decision Algorithm for VVC Intra Coding Using an MET-CNN
Electronics ( IF 2.9 ) Pub Date : 2022-09-27 , DOI: 10.3390/electronics11193090
Yanjun Wang, Pu Dai, Jinchao Zhao, Qiuwen Zhang

The newest video coding standard, the versatile video coding standard (VVC/H.266), came into effect in November 2020. Different from the previous generation standard—high-efficiency video coding (HEVC/H.265)—VVC adopts a more flexible block division structure, the quad-tree with nested multi-type tree (QTMT) structure, which improves its coding performance by 24%. However, it also causes a substantial increase in computational complexity. Therefore, this paper first proposes the concept of a stage grid map, which divides the overall division of a 32 × 32 coding unit (CU) into four stages and represents it as a structured output. Second, a multi-stage early termination convolutional neural network (MET-CNN) model is devised to predict the full partition information of a CU with a size of 32 × 32. Finally, a fast CU partition decision algorithm for VVC intra coding based on an MET-CNN is proposed. The algorithm can predict all partition information of a CU with a size of 32 × 32 and its sub-CUs in one run, completely replacing the complex rate-distortion optimization (RDO) process. It also has an early exit mechanism, thereby greatly reducing the encoding time. The experimental results illustrate that the scheme proposed in this paper reduces the encoding time by 49.24% on average, while the Bjøntegaard Delta Bit Rate (BDBR) only increases by 0.97%.

中文翻译:

使用 MET-CNN 的 VVC 帧内编码的快速 CU 分区决策算法

最新的视频编码标准——通用视频编码标准(VVC/H.266)于2020年11月生效。不同于上一代标准——高效视频编码(HEVC/H.265)——VVC采用了更灵活的分块结构,四叉树嵌套多类型树(QTMT)结构,编码性能提升24%。然而,它也导致计算复杂度的大幅增加。因此,本文首先提出了stage grid map的概念,将一个32×32的编码单元(CU)整体划分为四个stage,并将其表示为结构化输出。其次,设计了一个多阶段提前终止卷积神经网络(MET-CNN)模型来预测大小为 32 × 32 的 CU 的完整分区信息。最后,提出了一种基于MET-CNN的VVC帧内编码快速CU划分决策算法。该算法可以在一次运行中预测一个大小为 32×32 的 CU 及其子 CU 的所有分区信息,完全替代了复杂的速率失真优化 (RDO) 过程。它还具有提前退出机制,从而大大减少了编码时间。实验结果表明,本文提出的方案平均减少了 49.24% 的编码时间,而 Bjøntegaard Delta Bit Rate (BDBR) 仅增加了 0.97%。从而大大减少了编码时间。实验结果表明,本文提出的方案平均减少了 49.24% 的编码时间,而 Bjøntegaard Delta Bit Rate (BDBR) 仅增加了 0.97%。从而大大减少了编码时间。实验结果表明,本文提出的方案平均减少了 49.24% 的编码时间,而 Bjøntegaard Delta Bit Rate (BDBR) 仅增加了 0.97%。
更新日期:2022-09-27
down
wechat
bug