Synthetic Aerial Dataset for UAV Detection via Text-to-Image Diffusion Models

2023 IEEE Conference on Artificial Intelligence (CAI)（2023）

引用 0|浏览16

暂无评分

摘要

In this work, we present an approach to generate a synthetic aerial dataset for efficient Unmanned Aerial Vehicle (UAV) detection. We propose controlling the output of a text-to-image diffusion model by applying additional input conditions. Specifically, we train a diffusion model that enables conditional inputs, i.e., binary masks that specify all tractable parameters, including quantity, scale, pose, color, background, etc. Diverse photorealistic images with corresponding ground truth bounding boxes are generated automatically in an end-to-end manner. Without any interference, the dataset can be scaled to a large magnitude to facilitate the training process of UAV detection. Experimental results of YOLOv7 trained on the synthetic dataset demonstrate an extensive precision increment on unseen datasets of real images.

查看译文

关键词

UAV detection, Text-to-Image Diffusion, Generative Model

AI 理解论文

溯源树

样例

生成溯源树，研究论文发展脉络

Chat Paper

正在生成论文摘要