资讯

Image-only classification Text-only classification Multimodal classification: text and image inputs Attention mechanism visualization Image-only classification with the multimodal model trained on ...
To further test the robustness of the model against background interference, we propose an ImageNet background interference test set, ImageNet-Bg, based on the ImageNet validation set with 48,285 ...