Parallel Data Augmentation for Text-based Person Re-identification

2022 International Joint Conference on Neural Networks (IJCNN)(2022)

引用 0|浏览27
暂无评分
摘要
Given textual descriptions, text-based person re-identification aims at retrieving the matched target person in a large-scale image pool. In contrast to the traditional person re-identification (Re-ID) task, text-based person Re-ID requires extra extracted discriminative textual representations and then aligns two modal features to narrow down the semantic gap between linguistic domain and visual domain. A majority of previous works design complex network structures and concatenate multi-branch features while failing to pay much attention to problems with the dataset, which requires more parameters learning and might lead to over-fitting. Hence, in this paper, we propose a Parallel Data Augmentation method (PDA) to reduce over-fitting and make the model occlusion resistant without increasing the number of training parameters. Specifically, prior to the training, for an image, we randomly choose a rectangular region of variable size and erase the region with a constant value. Similar to image processing, we randomly add a mask of random length words to a sentence, then the processed data is sent to the TIPCB framework for training. Extensive experimentations on the large-scale CUHK-PEDES dataset show the effectiveness of our method and verify that our method exceeds the state-of-the-art methods.
更多
查看译文
关键词
Person re-identification,cross-modality,data augmentation
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要