论文标题

合奏学习有效的VVC比特阶梯预测

Ensemble Learning for Efficient VVC Bitrate Ladder Prediction

论文作者

Nasiri, Fatemeh, Hamidouche, Wassim, Morin, Luce, Dholland, Nicolas, Aubié, Jean-Yves

论文摘要

更改编码参数,尤其是视频分辨率,是转码之前的常见做法。为此,流和广播平台受益于所谓的比特级梯子,以确定给定比特率的最佳分辨率。但是,确定比特阶梯的任务通常可能具有挑战性,因为一方面,所谓的适合所有静态梯子会浪费带宽,另一方面,在计算复杂性方面,完全专业的梯子通常无法负担得起。在本文中,我们提出了一种基于ML的方案,用于根据视频内容预测比特率梯子。解决方案的基线使用两种不需要编码通行证的组成方法预测比特率梯子。为了进一步提高组成方法的性能,我们集成了一种有条件的集合方法来汇总他们的决策,而编码通过的数量却忽略不计。该实验在VVC标准的优化软件编码器实现(称为VVenc)上进行了显着改善。与静态比特率梯子相比,所提出的方法在BD-BR方面可提供约13%的比特率降低,而额外的计算开销可忽略不计。相反,与完全专业的比特阶梯方法相比,所提出的方法可以提供约86%至92%的复杂性,而成本仅为BD-BR的编码效率下降0.8%至0.9%。

Changing the encoding parameters, in particular the video resolution, is a common practice before transcoding. To this end, streaming and broadcast platforms benefit from so-called bitrate ladders to determine the optimal resolution for given bitrates. However, the task of determining the bitrate ladder can usually be challenging as, on one hand, so-called fit-for-all static ladders would waste bandwidth, and on the other hand, fully specialized ladders are often not affordable in terms of computational complexity. In this paper, we propose an ML-based scheme for predicting the bitrate ladder based on the content of the video. The baseline of our solution predicts the bitrate ladder using two constituent methods, which require no encoding passes. To further enhance the performance of the constituent methods, we integrate a conditional ensemble method to aggregate their decisions, with a negligibly limited number of encoding passes. The experiment, carried out on the optimized software encoder implementation of the VVC standard, called VVenC, shows significant performance improvement. When compared to static bitrate ladder, the proposed method can offer about 13% bitrate reduction in terms of BD-BR with a negligible additional computational overhead. Conversely, when compared to the fully specialized bitrate ladder method, the proposed method can offer about 86% to 92% complexity reduction, at cost the of only 0.8% to 0.9% coding efficiency drop in terms of BD-BR.

扫码加入交流群

加入微信交流群

微信交流群二维码

扫码加入学术交流群,获取更多资源