Vision & Generative Media
What Is CLIP?
CLIP learns a shared embedding space for images and text by training on image-caption pairs. This lets it match images to text descriptions and perform zero-shot classification.
Further reading
Read more about clip — articles and blogs from around the web: