SSIM Prediction for H.265/HEVC based on Convolutional Neural Networks

2019 IEEE Visual Communications and Image Processing (VCIP)(2019)

引用 5|浏览4
暂无评分
摘要
In signal compression, distortion information is significant for rate distortion optimization. In this paper, we propose a convolutional neural network (CNN) to predict distortion information for H.265/HEVC. With the strong representation power of CNN, structural similarity (SSIM) maps indicating distortion information can be predicted directly in an end-to-end, pixel-to-pixel way. Different from traditional CNNs which focus on learning one-to-one mappings from input to output, we show that our CNN model can predict SSIM maps conditioned on quantization parameters (QPs), realizing one-to-many mappings. To construct our CNN network, QP labels are designed as conditions to feed the CNN model. We also apply symmetrical network architecture and multi-level feature fusion method to ensure our network can utilize both high-level semantic features and low-level structure features. The experiments on MS COCO database demonstrate the effectiveness of our CNN-based method for SSIM prediction.
更多
查看译文
关键词
SSIM,distortion prediction,convolutional neural network,H.265/HEVC,feature fusion,QP label
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要