Are Pre-Trained Convolutions Better Than Pre-Trained Transformers? (2021) arxiv.org 2 points by fzliu 7 hours ago