Contains checkpoints for a well-tuned ViT S/16 model for ImageNet-1k. It achieves 76.23% top-1 accuracy on ImageNet-1k validation set in 90 epochs.
Configuration file: https://github.com/google-research/big_vision/blob/main/big_vision/configs/vit_s16_i1k.py
Training: https://github.com/sayakpaul/big_vision_experiments/blob/main/setup.md