Multi-resolution Rescored ByteTrack for Video Object Detection on Ultra-low-power Embedded Systems
arxiv(2024)
摘要
This paper introduces Multi-Resolution Rescored Byte-Track (MR2-ByteTrack), a
novel video object detection framework for ultra-low-power embedded processors.
This method reduces the average compute load of an off-the-shelf Deep Neural
Network (DNN) based object detector by up to 2.25× by alternating the
processing of high-resolution images (320×320 pixels) with multiple
down-sized frames (192×192 pixels). To tackle the accuracy degradation
due to the reduced image input size, MR2-ByteTrack correlates the output
detections over time using the ByteTrack tracker and corrects potential
misclassification using a novel probabilistic Rescore algorithm. By
interleaving two down-sized images for every high-resolution one as the input
of different state-of-the-art DNN object detectors with our MR2-ByteTrack, we
demonstrate an average accuracy increase of 2.16
43
scheme using exclusively full-resolution images. Code available at:
https://github.com/Bomps4/Multi_Resolution_Rescored_ByteTrack
更多查看译文
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要