Towards fairer datasets

TLDR

Computer vision is widely used yet its datasets are unrepresentative, leading to misbehavior such as offensive predictions and poorer performance for underrepresented groups, because models are trained on manually annotated image collections whose data and label distributions critically shape model behavior. The authors aim to examine ImageNet to illuminate root causes of bias and to initiate constructive mitigation steps. The study analyzes ImageNet’s person subtree, focusing on its stagnant WordNet vocabulary, exhaustive but uneven image coverage, and unequal representation, to identify how these dataset factors drive downstream bias.

Abstract

Computer vision technology is being used by many but remains representative of only a few. People have reported misbehavior of computer vision models, including offensive prediction results and lower performance for underrepresented groups. Current computer vision models are typically developed using datasets consisting of manually annotated images or videos; the data and label distributions in these datasets are critical to the models' behavior. In this paper, we examine ImageNet, a large-scale ontology of images that has spurred the development of many modern computer vision methods. We consider three key factors within the person subtree of ImageNet that may lead to problematic behavior in downstream computer vision technology: (1) the stagnant concept vocabulary of WordNet, (2) the attempt at exhaustive illustration of all categories with images, and (3) the inequality of representation in the images within concepts. We seek to illuminate the root causes of these concerns and take the first steps to mitigate them constructively.

References

Page 1

	Year	Citations
Deep Residual Learning for Image Recognition Kaiming He, Xiangyu Zhang, Shaoqing Ren, Image ClassificationDeep Neural NetworksMachine VisionImage AnalysisMachine Learning	2016	214.9K
ImageNet: A large-scale hierarchical image database Jia Deng, Wei Dong, Richard Socher, 2009 IEEE Conference on Computer Vision and Pattern Recognition EngineeringMachine LearningImage RetrievalImage DatabaseImage Recognition (Computer Vision)	2009	60.2K
ImageNet Large Scale Visual Recognition Challenge Olga Russakovsky, Jia Deng, Hao Su, International Journal of Computer Vision Image ClassificationConvolutional Neural NetworkMachine VisionImage AnalysisEngineering	2015	39.5K
Fast R-CNN Ross Girshick Image ClassificationConvolutional Neural NetworkImage AnalysisMachine LearningMachine Vision	2015	27.2K
The Pascal Visual Object Classes (VOC) Challenge Mark Everingham, Luc Van Gool, Christopher K. I. Williams, International Journal of Computer Vision Image AnalysisMachine VisionEngineeringObject CategorizationPattern Recognition	2009	19K
Deep Learning Face Attributes in the Wild Ziwei Liu, Ping Luo, Xiaogang Wang, Face DetectionConvolutional Neural NetworkFacial Recognition SystemMachine VisionImage Analysis	2015	7.5K
Show and tell: A neural image caption generator Oriol Vinyals, Alexander Toshev, Samy Bengio, Natural Language ProcessingArtificial IntelligenceLarge Ai ModelMultimodal LlmEngineering	2015	6.2K
Visual Genome: Connecting Language and Vision Using Crowdsourced Dense Image Annotations Ranjay Krishna, Yuke Zhu, Oliver Groth, International Journal of Computer Vision	2017	5.1K
Places: A 10 Million Image Database for Scene Recognition Bolei Zhou, Àgata Lapedriza, Aditya Khosla, IEEE Transactions on Pattern Analysis and Machine Intelligence Convolutional Neural NetworkScene AnalysisEngineeringMachine LearningImage Database	2017	3.9K
Fairness through awareness Cynthia Dwork, Moritz Hardt, Toniann Pitassi, EngineeringDiscriminationFairness Through AwarenessSocial StratificationClassification Task	2012	3.3K

Page 1