Transformers memorising new data
The research presents a simple extension to the transformer, known as kNN-augmented attention, and shows that it can increase a language model's effective context length.
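A minimal sketch of the idea: for each query, retrieve the top-k nearest (key, value) pairs from an external memory of cached activations and attend over them together with the local context. This is an illustrative toy implementation, not the paper's actual architecture; the function name and single-query interface are assumptions for clarity.

```python
import numpy as np

def softmax(x, axis=-1):
    e = np.exp(x - x.max(axis=axis, keepdims=True))
    return e / e.sum(axis=axis, keepdims=True)

def knn_augmented_attention(q, local_k, local_v, mem_k, mem_v, k=4):
    """Attend over local keys plus the top-k nearest memory keys.

    q: (d,) query; local_k, local_v: (L, d) local context;
    mem_k, mem_v: (M, d) external memory of cached key/value pairs.
    (Toy sketch: real systems use an approximate-nearest-neighbour index.)
    """
    # Retrieve the top-k memory entries by dot-product similarity to the query.
    scores = mem_k @ q
    idx = np.argsort(scores)[-k:]
    # Concatenate retrieved memories with the local context.
    keys = np.concatenate([local_k, mem_k[idx]], axis=0)
    vals = np.concatenate([local_v, mem_v[idx]], axis=0)
    # Standard scaled dot-product attention over the combined set.
    d = q.shape[-1]
    w = softmax(keys @ q / np.sqrt(d))
    return w @ vals
```

Because the memory is only read through a nearest-neighbour lookup, it can hold far more tokens than the local attention window, which is how the effective context grows.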

