41. Align before Fuse: Vision and Language Representation Learning with Momentum Distillation August 25, 2023 Foundation Model, Zero-Shot Learning