Other large-scale transformer models include EleutherAI's GPT-J, BAAI's Wu Dao 2.0, Google's Switch Transformer, and NVIDIA and Microsoft's MT-NLG.