How did RoBERTa outperform XLNet with no architectural changes to the original BERT?

In 2018, Jacob Devlin et al. (Google AI Language) released BERT, which achieved state-of-the-art results on eleven NLP tasks and inspired other researchers to look for further improvements. Soon, newer models appeared to supersede the original BERT models. At least, that was how it looked. One such example was XLNet by Zhilin…