Core Concepts

What Is Knowledge Distillation?

Knowledge distillation is a technique in which a smaller “student” model is trained to imitate the outputs of a larger “teacher” model. Because the student is smaller, it is much cheaper to run, yet it can retain much of the teacher’s capability. Distillation is widely used to make powerful models practical on phones and other resource-constrained devices.
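
To make "imitate the outputs" concrete, here is a minimal sketch of one common way to score how closely a student matches a teacher: soften both models' raw scores (logits) with a temperature, then measure the KL divergence between the two distributions. The function names, the example logits, the temperature of 2.0, and the temperature-squared scaling are illustrative assumptions, not details taken from the text above, and real training would minimize this quantity over many examples with a deep learning framework.

```python
import math


def softmax(logits, temperature=1.0):
    """Turn raw scores into probabilities; a higher temperature gives a softer distribution."""
    scaled = [x / temperature for x in logits]
    peak = max(scaled)  # subtract the max for numerical stability
    exps = [math.exp(x - peak) for x in scaled]
    total = sum(exps)
    return [e / total for e in exps]


def distillation_loss(student_logits, teacher_logits, temperature=2.0):
    """KL(teacher || student) on temperature-softened outputs.

    The temperature**2 factor is a commonly used scaling (an assumption here)
    that keeps the loss magnitude comparable across temperatures.
    """
    p_teacher = softmax(teacher_logits, temperature)
    p_student = softmax(student_logits, temperature)
    kl = sum(p * math.log(p / q) for p, q in zip(p_teacher, p_student) if p > 0)
    return kl * temperature ** 2


# Hypothetical logits for a 3-class problem (made up for illustration).
teacher = [4.0, 1.0, 0.5]
close_student = [3.5, 1.2, 0.4]  # ranks the classes the same way as the teacher
far_student = [0.5, 1.0, 4.0]    # prefers a different class entirely

loss_close = distillation_loss(close_student, teacher)
loss_far = distillation_loss(far_student, teacher)
print(f"close student loss: {loss_close:.4f}")
print(f"far student loss:   {loss_far:.4f}")
```

A student whose outputs resemble the teacher's gets a loss near zero, while one that disagrees gets a larger loss, so minimizing this value pushes the student toward the teacher's behavior.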
