Publication | Open Access
Assessing agreement on classification tasks: the kappa statistic
2.1K
Citations
11
References
1996
Year
EngineeringSubjective JudgmentsPsycholinguisticsCorpus LinguisticsText MiningNatural Language ProcessingApplied LinguisticsClassification MethodData MiningLanguage TestingComputational LinguisticsInterobserver AgreementDiscourse AnalysisLanguage StudiesContent AnalysisStatisticsKappa StatisticInteractional LinguisticsCognitive ScientistsCognitive ScienceAutomatic ClassificationPredictive AnalyticsKnowledge DiscoveryLanguage TechnologyDistributional SemanticsDiscourse StructureLanguage CorpusClassificationLinguisticsSurvey Methodology
Currently, computational linguists and cognitive scientists working in the area of discourse and dialogue argue that their subjective judgments are reliable using several different statistics, none of which are easily interpretable or comparable to each other. Meanwhile, researchers in content analysis have already experienced the same difficulties and come up with a solution in the kappa statistic. We discuss what is wrong with reliability measures as they are currently used for discourse and dialogue work in computational linguistics and cognitive science, and argue that we would be better off as a field adopting techniques from content analysis.
| Year | Citations | |
|---|---|---|
Page 1
Page 1