68. Blip: Bootstrapping language-image pre-training for unified vision-language understanding and generation August 12, 2024 Contrastive Learning, Image Captioning, Zero-Shot Learning Download ppt