MtArtGPT: A Multi-task Art Generation System with Pre-Trained Transformer

Cong Jin, Ruolin Zhu, Zixing Zhu,Lu Yang,Min Yang,Jiebo Luo

IEEE Transactions on Circuits and Systems for Video Technology(2024)

引用 0|浏览15
暂无评分
摘要
Instruction tuning large language models are making rapid advances in the field of artificial intelligence where GPT-4 models have exhibited impressive multi-modal perception capabilities. Such models have been used as the core assistant for many tasks including art generation. However, high-quality art generation relies heavily on human prompt engineering which is in general uncontrollable. To address these issues, we propose a multi-task AI generated content (AIGC) system for art generation. Specifically, a dense representation manager is designed to process multi-modal user queries and generate dense and applicable prompts to GPT. To enhance artistic sophistication of the whole system, we fine-tune the GPT model by a meticulously collected prompt-art dataset. Furthermore, we introduce artistic benchmarks for evaluating the system based on professional knowledge. Experiments demonstrate the advantages of our proposed MtArtGPT system.
更多
查看译文
关键词
dense representation,art generation,prompt engineering,vision-language representation,AI generated content
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要