Broaden the Vision: Geo-Diverse Visual Commonsense Reasoning

About

intro figure

Research Questions and Aims

1. How well can a model perform on the images which requires geo-diverse commonsense to understand?

2. What are the reasons behind performance disparity on Western and non-Western images?

3. We aim to broaden researchers' vision on a realistic issue existing all over the world, and call upon researchers to consider more inclusive commonsense knowledge and better model transferability on various cultures.

Read the Paper Download the Data & Code

Citations

If you find our work helpful, please cite us.

@inproceedings{yin2021broaden,
  title = {{Broaden the Vision: Geo-Diverse Visual Commonsense Reasoning}},
  author = {Yin, Da and Li, Liunian Harold and Hu, Ziniu and Peng, Nanyun and Chang, Kai-Wei},
  booktitle = {EMNLP},
  url = {https://arxiv.org/abs/2109.06860},
  year = {2021}
}

Contact

For any questions, please contact Da Yin or open a github issue. We appreciate the opensource code for website layout made by Micheal Zhang.