36. MaskFormer: Per-Pixel Classification is Not All You Need for Semantic Segmentation

36. MaskFormer: Per-Pixel Classification is Not All You Need for Semantic Segmentation

Transformer, Segmentation