clip Papers - Abhik | ML & CV Consultant

2021

Learning Transferable Visual Models From Natural Language Supervision

Computer Vision, Natural Language Processing, Deep Learning, Multimodal Learning, CLIP

This paper introduces CLIP, a neural network trained on 400 million image-text pairs to match images with their textual descriptions. Because class names can be written as text prompts and compared against an image, the model can perform zero-shot image classification without task-specific training data.
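To make the zero-shot idea concrete, here is a minimal sketch of the classification step only, assuming the image and the text prompts have already been embedded into a shared vector space. The embeddings, the prompt strings, the function names, and the `logit_scale` value below are illustrative assumptions, not output from the actual CLIP encoders or the paper's code.

```python
import math


def l2_normalize(vector):
    """Scale a vector to unit length so a dot product equals cosine similarity."""
    norm = math.sqrt(sum(x * x for x in vector))
    return [x / norm for x in vector]


def zero_shot_classify(image_emb, text_embs, labels, logit_scale=100.0):
    """Pick the label whose text embedding is most similar to the image embedding.

    logit_scale is an assumed temperature factor applied before the softmax.
    """
    image = l2_normalize(image_emb)
    logits = []
    for text_emb in text_embs:
        text = l2_normalize(text_emb)
        cosine = sum(a * b for a, b in zip(image, text))
        logits.append(logit_scale * cosine)

    # Numerically stable softmax over the candidate labels.
    max_logit = max(logits)
    exps = [math.exp(z - max_logit) for z in logits]
    total = sum(exps)
    probs = [e / total for e in exps]

    best = max(range(len(labels)), key=probs.__getitem__)
    return labels[best], probs


# Toy, hand-written embeddings (hypothetical; real ones come from trained encoders).
labels = ["a photo of a cat", "a photo of a dog"]
text_embs = [[1.0, 0.0, 0.0], [0.0, 1.0, 0.0]]
image_emb = [0.9, 0.1, 0.0]

prediction, probs = zero_shot_classify(image_emb, text_embs, labels)
print(prediction, probs)
```

Adding a new class in this setup only requires a new text prompt and its embedding, which is what makes the approach zero-shot: no classifier weights are retrained.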

