Publication | Open Access
Predicting the Difficulty of Multiple Choice Questions in a High-stakes Medical Exam
45
Citations
22
References
2019
Year
Unknown Venue
EngineeringClinical Decision-makingDiagnosisPsycholinguisticsText MiningWord EmbeddingsNatural Language ProcessingMedical Decision MakingInformation RetrievalHigh-stakes Medical ExamLanguage TestingComputational LinguisticsLanguage EngineeringMultiple Choice QuestionsLanguage StudiesConstruct-relevant DifficultyAutomated AssessmentDecision TheoryStatisticsMachine TranslationReliabilityCognitive ScienceQuestion AnsweringNlp TaskEducational TestingMedical Decision AnalysisItem DifficultyRetrieval Augmented GenerationPatient SafetyDecision ScienceLinguisticsMultiple-choice Questions
Predicting the construct-relevant difficulty of Multiple-Choice Questions (MCQs) has the potential to reduce cost while maintaining the quality of high-stakes exams. In this paper, we propose a method for estimating the difficulty of MCQs from a high-stakes medical exam, where all questions were deliberately written to a common reading level. To accomplish this, we extract a large number of linguistic features and embedding types, as well as features quantifying the difficulty of the items for an automatic question-answering system. The results show that the proposed approach outperforms various baselines with a statistically significant difference. Best results were achieved when using the full feature set, where embeddings had the highest predictive power, followed by linguistic features. An ablation study of the various types of linguistic features suggested that information from all levels of linguistic processing contributes to predicting item difficulty, with features related to semantic ambiguity and the psycholinguistic properties of words having a slightly higher importance. Owing to its generic nature, the presented approach has the potential to generalize over other exams containing MCQs.
| Year | Citations | |
|---|---|---|
Page 1
Page 1