[논문 정리] An image is worth 16x16 words : Transformers for image recognition
논문정보 An image is worth 16x16 words :Transformers for image recognition An Image is Worth 16x16 Words: Transformers for Image Recognition at Scale While the Transformer architecture has become the de-facto standard for natural language processing tasks, its applications to computer vision remain limited. In vision, attention is either applied in conjunction with convolutional networks, or used to..