Archives for sparse vs dense language model

21 Dec

What Is Sparsity in a Language Model?

GLaM is a sparse language model, which means it activates only part of its network for any given input, whereas a dense model runs every parameter on every input.
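To make "activates only a part" concrete, here is a minimal sketch in plain Python. It assumes sparsity comes from top-k expert routing, where a small gate scores a set of expert sub-networks and only the best k are run. The names `sparse_layer`, `gate_weights`, and `make_expert` are made up for illustration, the experts are toy scaling functions, and none of this is GLaM's actual code.

```python
import math
import random


def softmax(scores):
    # Numerically stable softmax over a list of floats.
    m = max(scores)
    exps = [math.exp(s - m) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def sparse_layer(x, gate_weights, experts, k=2):
    # Score every expert with a dot product between the input and its gate row.
    scores = [sum(xi * wi for xi, wi in zip(x, w)) for w in gate_weights]
    probs = softmax(scores)
    # Keep only the k highest-scoring experts; the rest are never called.
    top = sorted(range(len(experts)), key=lambda i: probs[i], reverse=True)[:k]
    norm = sum(probs[i] for i in top)
    output = [0.0] * len(x)
    for i in top:
        y = experts[i](x)
        weight = probs[i] / norm  # renormalise over the chosen experts
        output = [o + weight * yi for o, yi in zip(output, y)]
    return output, top


def make_expert(scale):
    # Hypothetical stand-in for an expert network: scales its input.
    return lambda x: [scale * xi for xi in x]


random.seed(0)
num_experts, dim = 8, 4
experts = [make_expert(i + 1) for i in range(num_experts)]
gate_weights = [[random.uniform(-1, 1) for _ in range(dim)]
                for _ in range(num_experts)]

x = [0.5, -1.0, 2.0, 0.1]
output, active = sparse_layer(x, gate_weights, experts, k=2)
print(f"active experts: {active} ({len(active)} of {num_experts})")
```

A dense layer would be the same loop run over all eight experts. In this sketch the only difference is the top-k cut, which is why compute per input grows with k and not with the total number of experts.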