Publication | Closed Access
Sampling Variability of Performance Assessments
335
Citations
16
References
1993
Year
EngineeringScore ComparabilityGeneralizability TheoryEducationPerformance MeasurementPerformance AssessmentApplied MeasurementClassroom AssessmentSurvey MethodologyComplex UniverseStatisticsSampling FrameworkReliabilityEducational TestingValidity TheoryEducational MeasurementStudent AssessmentPerformance MeasurePerformance AssessmentsEducational AssessmentPsychological Measurement
The article frames performance assessments within a sampling framework. The authors use generalizability theory to analyze performance assessments as samples drawn from a complex universe of tasks, occasions, raters, and measurement methods. Task‑sampling variability dominates measurement error, requiring many tasks for reliable elementary math and science scores, and methods do not converge, so scores depend on both task and method.
In this article, performance assessments are cast within a sampling framework. More specifically, a performance assessment is viewed as a sample of student performance drawn from a complex universe defined by a combination of all possible tasks, occasions, raters, and measurement methods. Using generalizability theory, we present evidence bearing on the generalizability and convergent validity of performance assessments sampled from a range of measurement facets and measurement methods. Results at both the individual and school level indicate that task‐sampling variability is the major source ofmeasurment error. Large numbers of tasks are needed to get a reliable measure of mathematics and science achievement at the elementary level. With respect to convergent validity, results suggest that methods do not converge. Students' performance scores, then, are dependent on both the task and method sampled.
| Year | Citations | |
|---|---|---|
Page 1
Page 1