Task-Aware Encoder Control for Deep Video Compression
arxiv(2024)
摘要
Prior research on deep video compression (DVC) for machine tasks typically
necessitates training a unique codec for each specific task, mandating a
dedicated decoder per task. In contrast, traditional video codecs employ a
flexible encoder controller, enabling the adaptation of a single codec to
different tasks through mechanisms like mode prediction. Drawing inspiration
from this, we introduce an innovative encoder controller for deep video
compression for machines. This controller features a mode prediction and a
Group of Pictures (GoP) selection module. Our approach centralizes control at
the encoding stage, allowing for adaptable encoder adjustments across different
tasks, such as detection and tracking, while maintaining compatibility with a
standard pre-trained DVC decoder. Empirical evidence demonstrates that our
method is applicable across multiple tasks with various existing pre-trained
DVCs. Moreover, extensive experiments demonstrate that our method outperforms
previous DVC by about 25
pre-trained decoder.
更多查看译文
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要