Publication | Closed Access
The Generalizability of Scores for a Performance Assessment Scored with a Computer‐Automated Scoring System
39
Citations
27
References
2000
Year
Computer‐automated Scoring SystemEngineeringGeneralizability TheoryEducationExpert ReviewPerformance MeasurementClassical Test TheoryMedical Expert SystemPerformance AssessmentApplied MeasurementBiostatisticsAutomated AssessmentStatisticsReliabilityExpert SystemsOutcomes ResearchGeneralizability AnalysisEducational TestingClinical Decision SupportDecision Support SystemsEducational MeasurementMedical Decision AnalysisPerformance StudiesEvaluation MeasurePerformance MeasurePerformance Assessment Scored
When performance assessments are delivered and scored by computer, the costs of scoring may be substantially lower than those of scoring the same assessment based on expert review of the individual performances. Computerized scoring algorithms also ensure that the scoring rules are implemented precisely and uniformly. Such computerized algorithms represent an effort to encode the scoring policies of experts. This raises the question, would a different group of experts have produced a meaningfully different algorithm? The research reported in this paper uses generalizability theory to assess the impact of using independent, randomly equivalent groups of experts to develop the scoring algorithms for a set of computer‐simulation tasks designed to measure physicians’ patient management skills. The results suggest that the impact of this “expert group” effect may be significant but that it can be controlled with appropriate test development strategies. The appendix presents multivariate generalizability analysis to examine the stability of the assessed proficiency across scores representing the scoring policies of different groups of experts.
| Year | Citations | |
|---|---|---|
Page 1
Page 1