Publication | Closed Access
Word lengths are optimized for efficient communication
674
Citations
34
References
2011
Year
EngineeringNeurolinguisticsPsycholinguisticsSpoken Language ProcessingLexical SemanticsSemanticsCommunicationCorpus LinguisticsNatural Language ProcessingComputational LinguisticsLanguage StudiesLexiconComputational LexicologyLanguage TechnologyWord LengthHuman LexiconsDistributional SemanticsSpeech CommunicationInformation ContentLanguage UseSpeech ProcessingLexical Complexity PredictionText ProcessingWord LengthsLinguistics
We demonstrate a substantial improvement on one of the most celebrated empirical laws in the study of language, Zipf's 75-y-old theory that word length is primarily determined by frequency of use. In accord with rational theories of communication, we show across 10 languages that average information content is a much better predictor of word length than frequency. This indicates that human lexicons are efficiently structured for communication by taking into account interword statistical dependencies. Lexical systems result from an optimization of communicative pressures, coding meanings efficiently given the complex statistics of natural language use.
| Year | Citations | |
|---|---|---|
Page 1
Page 1