Modular Blind Video Quality Assessment
CVPR 2024(2024)
摘要
Blind video quality assessment (BVQA) plays a pivotal role in evaluating and
improving the viewing experience of end-users across a wide range of
video-based platforms and services. Contemporary deep learning-based models
primarily analyze the video content in its aggressively downsampled format,
while being blind to the impact of actual spatial resolution and frame rate on
video quality. In this paper, we propose a modular BVQA model, and a method of
training it to improve its modularity. Specifically, our model comprises a base
quality predictor, a spatial rectifier, and a temporal rectifier, responding to
the visual content and distortion, spatial resolution, and frame rate changes
on video quality, respectively. During training, spatial and temporal
rectifiers are dropped out with some probabilities so as to make the base
quality predictor a standalone BVQA model, which should work better with the
rectifiers. Extensive experiments on both professionally-generated content and
user generated content video databases show that our quality model achieves
superior or comparable performance to current methods. Furthermore, the
modularity of our model offers a great opportunity to analyze existing video
quality databases in terms of their spatial and temporal complexities. Last,
our BVQA model is cost-effective to add other quality-relevant video attributes
such as dynamic range and color gamut as additional rectifiers.
更多查看译文
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要