Guest Editorial Introduction to the Special Section on Transformer Models in Vision

IEEE Transactions on Pattern Analysis and Machine Intelligence (2023)

Abstract
Transformer models have achieved outstanding results on a variety of language tasks, such as text classification, machine translation, and question answering. This success in the field of Natural Language Processing (NLP) has sparked interest in the computer vision community in applying these models to vision and multi-modal learning tasks. However, visual data has a unique structure, which requires rethinking network designs and training methods. As a result, Transformer models and their variants have been successfully used for image recognition, object detection, segmentation, image super-resolution, video understanding, image generation, text-image synthesis, and visual question answering, among other applications.
Keywords
Special issues and sections, Transformers, Text categorization, Machine translation, Natural language processing