Publication | Open Access
Dependency Distance as a Metric of Language Comprehension Difficulty
422
Citations
32
References
2008
Year
Syntactic ParsingEngineeringDependency LinguisticsPsycholinguisticsSyntactic StructureLanguage LearningLanguage ProcessingNatural Language ProcessingApplied LinguisticsSecond Language AcquisitionSyntaxComputational LinguisticsLanguage TestingLanguage AcquisitionGrammarLinguistic ComplexityLanguage StudiesDependency GrammarCognitive ScienceTreebanksDependency DistanceLanguage ComprehensionLexical Complexity PredictionLinguistics
Linguistic complexity is a measure of the cognitive difficulty of human language processing. The present paper proposes dependency distance, in the framework of dependency grammar, as an insightful metric of complexity. Three hypotheses are formulated: (1) The human language parser prefers linear orders that minimize the average dependency distance of the recognized sentence (2) There is a threshold that the average dependency distance of most sentences or texts of human languages does not exceed (3) Grammar and cognition combine to keep dependency distance within the threshold. Twenty corpora from different languages with dependency syntactic annotation are used to test these hypotheses. The paper reports the average dependency distance in these corpora and analyzes the factors which influence dependency distance. The findings-that average dependency distance has a tendency to be minimized in human language and that there is a threshold of less than 3 words in average dependency distance and grammar plays an important role in constraining distance-support all three hypotheses, although some questions are still open for further research.
| Year | Citations | |
|---|---|---|
Page 1
Page 1