Language & LLMs

What Is ELECTRA?

ELECTRA is a language model pretraining approach that replaces some input tokens with plausible alternatives and trains a discriminator to detect which tokens were replaced. This replaced-token detection task is more sample-efficient than masked language modeling. It allows strong results with less compute than comparable BERT models.

Further reading

Read more about ELECTRA — articles and blogs from around the web: