论文标题
快速多编码以降低视频流的成本
Fast multi-encoding to reduce the cost of video streaming
论文作者
论文摘要
视频互联网流量的增长以及视频属性中的进步(例如帧速率,分辨率和比特深度)的增长增强了设计大规模,高效的视频编码环境的需求。这对于基于HTTP(DASH)基于HTTP(DASH)的内容提供的动态自适应流提供了更为必要的内容,因为它需要编码同一视频内容的大量表示。高效率视频编码(HEVC)是一种标准视频编解码器,可显着提高其前身高级视频编码(AVC)的编码效率。这一改进是以显着增加时间复杂性为代价的,这是内容和服务提供商的挑战。由于各种表示与在不同的比特率或分辨率上编码的视频内容相同,因此可以共享来自已编码的表示的编码分析信息以加速其他表示形式的编码。几种最先进的方案首先编码单个表示,称为参考表示。在此编码过程中,编码器通过诸如SlaceType决策,CU,PU,TU分区和HEVC bitstream本身等信息创建分析元数据。其余的表示为依赖表示,分析上述元数据,然后重用它跳过搜索某些分区,从而降低了计算复杂性。随着基于云的编码服务的出现,视频编码通过利用增加数量的资源(即使用多核CPU)加速,可以并行编码多个表示。本文分别概述了各种多种编码方案的概述,并且分别支持集成到HEVC测试模型(HM)和X265中的机器学习方法。
The growth in video Internet traffic and advancements in video attributes such as framerate, resolution, and bit-depth boost the demand to devise a large-scale, highly efficient video encoding environment. This is even more essential for Dynamic Adaptive Streaming over HTTP (DASH)-based content provisioning as it requires encoding numerous representations of the same video content. High Efficiency Video Coding (HEVC) is one standard video codec that significantly improves encoding efficiency over its predecessor Advanced Video Coding (AVC). This improvement is achieved at the expense of significantly increased time complexity, which is a challenge for content and service providers. As various representations are the same video content encoded at different bitrates or resolutions, the encoding analysis information from the already encoded representations can be shared to accelerate the encoding of other representations. Several state-of-the-art schemes first encode a single representation, called a reference representation. During this encoding, the encoder creates analysis metadata with information such as the slicetype decisions, CU, PU, TU partitioning, and the HEVC bitstream itself. The remaining representations, called dependent representations, analyze the above metadata and then reuse it to skip searching some partitioning, thus, reducing the computational complexity. With the emergence of cloud-based encoding services, video encoding is accelerated by utilizing an increased number of resources, i.e., with multi-core CPUs, multiple representations can be encoded in parallel. This paper presents an overview of a wide range of multi-encoding schemes with and without the support of machine learning approaches integrated into the HEVC Test Model (HM) and x265, respectively.