41. Align before Fuse: Vision and Language Representation Learning with Momentum Distillation August 25, 2023 Foundation Model, Zero-Shot Learning, Contrastive Learning Download ppt