Publication | Closed Access
Cutting the Gordian Knot: The Moving-Average Type–Token Ratio (MATTR)
Citations: 411
References: 15
Year: 2010
Keywords: Engineering; Gordian Knot; Geometry; Abstract Type–token Ratio; Language Processing; Text Mining; Natural Language Processing; Information Retrieval; Knot Theory; Computational Linguistics; Word Segmentation (Natural Language Processing); Corpus Analysis; Language Studies; Machine Translation; Word Segmentation (Phonological Awareness); Nlp Task; Text Length; Distributional Semantics; Vocabulary Size; Text Normalization; Lexical Complexity Prediction; Text Processing; Linguistics
Abstract
Type–token ratio (TTR), or vocabulary size divided by text length (V/N), is a time-honoured but unsatisfactory measure of lexical diversity. The problem is that the TTR of a text sample is affected by its length. We present an algorithm for rapidly computing TTR through a moving window that is independent of text length, and we demonstrate that this measurement can detect changes within a text as well as differences between texts.
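As a rough illustration of the moving-window idea described in the abstract, the following Python sketch averages the TTR over every window of a fixed number of tokens, updating the type counts incrementally as the window slides. It is not taken from the paper: the function name `mattr`, the default window of 50 tokens, and the fallback to plain TTR for texts shorter than the window are all assumptions made for this example.

```python
from collections import Counter


def mattr(tokens, window=50):
    """Mean TTR over all windows of `window` consecutive tokens.

    Illustrative sketch only. The default window size and the
    short-text fallback are assumptions, not values from the paper.
    """
    if window <= 0:
        raise ValueError("window must be a positive integer")
    n = len(tokens)
    if n == 0:
        return 0.0
    if n < window:
        # Assumption: fall back to plain TTR (V/N) for short texts.
        return len(set(tokens)) / n

    # Type counts for the first window.
    counts = Counter(tokens[:window])
    total = len(counts) / window

    # Slide the window one token at a time, updating counts in place
    # instead of recounting each window from scratch.
    for i in range(window, n):
        outgoing = tokens[i - window]
        counts[outgoing] -= 1
        if counts[outgoing] == 0:
            del counts[outgoing]
        counts[tokens[i]] += 1
        total += len(counts) / window

    # There are n - window + 1 windows in total.
    return total / (n - window + 1)
```

For example, `mattr("a b a b c c".split(), window=3)` averages the TTR of the four windows `a b a`, `b a b`, `a b c`, and `b c c`. Because each window has the same length, the result does not shrink simply because the text is longer, which is the property the abstract attributes to the measure.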