Understanding the Effect of Accuracy on Trust in Machine Learning Models

Abstract

We address a relatively under-explored aspect of human-computer interaction: people's abilities to understand the relationship between a machine learning model's stated performance on held-out data and its expected performance post deployment. We conduct large-scale, randomized human-subject experiments to examine whether laypeople's trust in a model, measured in terms of both the frequency with which they revise their predictions to match those of the model and their self-reported levels of trust in the model, varies depending on the model's stated accuracy on held-out data and on its observed accuracy in practice. We find that people's trust in a model is affected by both its stated accuracy and its observed accuracy, and that the effect of stated accuracy can change depending on the observed accuracy. Our work relates to recent research on interpretable machine learning, but moves beyond the typical focus on model internals, exploring a different component of the machine learning pipeline.

References

Page 1

	Year	Citations
"Why Should I Trust You?" Marco Túlio Ribeiro, Sameer Singh, Carlos Guestrin Artificial IntelligenceEngineeringMachine LearningTrust Management ArchitectureVerification	2016	14K
Dermatologist-level classification of skin cancer with deep neural networks Andre Esteva, Brett Kuprel, Roberto A. Novoa, Nature Dermoscopic ImageConvolutional Neural NetworkDeep Neural NetworksEngineeringMachine Learning	2017	13K
Towards A Rigorous Science of Interpretable Machine Learning Finale Doshi‐Velez, Been Kim arXiv (Cornell University) Artificial IntelligenceRigorous EvaluationInterpretable Machine LearningRigorous ScienceEngineering	2017	3.1K
Semantics derived automatically from language corpora contain human-like biases Aylin Caliskan, Joanna J. Bryson, Arvind Narayanan Science	2017	2.7K
Algorithm aversion: People erroneously avoid algorithms after seeing them err. Berkeley J. Dietvorst, Joseph P. Simmons, Cade Massey Journal of Experimental Psychology General Behavioral Decision MakingCognitionJudgmental ForecastingHuman ForecastersSocial Sciences	2014	2.1K
Model Cards for Model Reporting	2019	1.4K
Overcoming Algorithm Aversion: People Will Use Imperfect Algorithms If They Can (Even Slightly) Modify Them Berkeley J. Dietvorst, Joseph P. Simmons, Cade Massey Management Science Artificial IntelligenceBehavioral Decision MakingJudgmental ForecastingForecasting OutcomeSocial Sciences	2016	1K
Gender Differences in Mate Selection: Evidence From a Speed Dating Experiment Raymond Fisman, Sheena S. Iyengar, Emir Kamenica, The Quarterly Journal of Economics	2006	518
Would You Trust a (Faulty) Robot? Maha Salem, Gabriella Lakatos, Farshid Amirabdollahian, Robot AffectHuman-robot Collaborative AssemblyEngineeringSocially Assistive RobotVerification	2015	457
The Mythos of Model Interpretability Zachary C. Lipton Communications of the ACM Artificial IntelligenceCognitive ScienceEngineeringMachine LearningExplanation-based Learning	2018	456

Page 1