Our paper VALSE: A Task-Independent Benchmark for Vision and Language Models Centered on Linguistic Phenomena was accepted at ACL 2022 Main.
A survey by members of the COST Action Multi3Generation: Multi-task, Multilingual, Multi-modal, to appear in JAIR