Training data-efficient image transformers & distillation through attention

Recently, neural networks purely