Abstract: Transformers have intrigued the vision research community with their state-of-the-art performance in natural language processing. With their superior performance, transformers have found ...