Joint Reference Frame Synthesis and Post Filter Enhancement for Versatile Video Coding
arxiv(2024)
摘要
This paper presents the joint reference frame synthesis (RFS) and
post-processing filter enhancement (PFE) for Versatile Video Coding (VVC),
aiming to explore the combination of different neural network-based video
coding (NNVC) tools to better utilize the hierarchical bi-directional coding
structure of VVC. Both RFS and PFE utilize the Space-Time Enhancement Network
(STENet), which receives two input frames with artifacts and produces two
enhanced frames with suppressed artifacts, along with an intermediate
synthesized frame. STENet comprises two pipelines, the synthesis pipeline and
the enhancement pipeline, tailored for different purposes. During RFS, two
reconstructed frames are sent into STENet's synthesis pipeline to synthesize a
virtual reference frame, similar to the current to-be-coded frame. The
synthesized frame serves as an additional reference frame inserted into the
reference picture list (RPL). During PFE, two reconstructed frames are fed into
STENet's enhancement pipeline to alleviate their artifacts and distortions,
resulting in enhanced frames with reduced artifacts and distortions. To reduce
inference complexity, we propose joint inference of RFS and PFE (JISE),
achieved through a single execution of STENet. Integrated into the VVC
reference software VTM-15.0, RFS, PFE, and JISE are coordinated within a novel
Space-Time Enhancement Window (STEW) under Random Access (RA) configuration.
The proposed method could achieve -7.34
average for three components under RA configuration.
更多查看译文
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要