Enhancing Visual Place Recognition Using Discrete Cosine Transform and Difference-Based Descriptors

Qieshi Zhang,Zhenyu Xu, Zhiyong Yang,Ziliang Ren, Shuai Yuan,Jun Cheng

IEEE Transactions on Circuits and Systems II: Express Briefs(2024)

引用 0|浏览2
暂无评分
摘要
Visual Place Recognition (VPR) is a popular technique in agent localization, which is used to identify visited places based on their visual appearance. However, visual appearance is easily distorted by environmental factors such as weather and season, causing a decline performance of recent methods. To alleviate the influence of these factors, a Discrete Cosine Transform (DCT)-Mask Net is introduced, exploiting mid-frequency information to enhance the appearance invariance of representations. To further improve, a Difference Net is employed to capture the change between frames and learn difference-based sequential descriptors. Besides, to improve the discriminability of features and prevent overfitting of the model, an improved loss function, namely Decorrelation and Regulation-triplet (DR-triplet), is proposed. Experimental results show our method is more robust to appearance changes than state-of-the-art methods, which demonstrates superior recognition performance across different seasons on the Nordland dataset and desired generalization capabilities under various weather conditions on the Virtual KITTI 2 dataset.
更多
查看译文
关键词
Visual Place Recognition (VPR),deep learning,agent localization,intelligent control,frequency domain
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要