Exploring Plain Vision Transformer Backbones for Object Detection
Investigating the effectiveness of plain Vision Transformers as backbones for object detection and proposing modifications to improve their performance.
Explore machine learning papers and reviews related to transformers. Find insights, analysis, and implementation details.
Investigating the effectiveness of plain Vision Transformers as backbones for object detection and proposing modifications to improve their performance.
Introducing DETR, a novel end-to-end object detection framework that leverages Transformers to directly predict a set of object bounding boxes.
Introducing Vision Transformer (ViT), a pure transformer architecture for image recognition that achieves state-of-the-art results.
A comprehensive survey of techniques for optimizing the inference phase of transformer networks.
Introducing Swin Transformer, a hierarchical Vision Transformer that uses shifted windows to achieve improved efficiency and performance in various vision tasks.
A deep dive into the revolutionary Transformer architecture paper that changed the landscape of deep learning.
A case study on optimizing transformers by focusing on data movement