Publication | Open Access
Thinking ahead: spontaneous prediction in context as a keystone of language in humans and machines
52
Citations
76
References
2020
Year
Unknown Venue
EngineeringNeurolinguisticsSemantic ProcessingPsycholinguisticsCognitionMultilingual PretrainingLarge Language ModelLanguage LearningWord EmbeddingsApplied LinguisticsNatural Language ProcessingCognitive LinguisticsComputational LinguisticsLanguage AcquisitionLanguage StudiesLanguage ModelsNatural LanguageCognitive SciencePre-trained ModelsDeep LearningPredictive LearningPredictive CodingSpontaneous PredictionDeep Language ModelsLanguage ScienceStatic Semantic EmbeddingsLanguage Modeling (Theoretical Linguistics)Linguistics
Abstract Departing from traditional linguistic models, advances in deep learning have resulted in a new type of predictive (autoregressive) deep language models (DLMs). Using a self-supervised next-word prediction task, these models are trained to generate appropriate linguistic responses in a given context. We provide empirical evidence that the human brain and autoregressive DLMs share three fundamental computational principles as they process natural language: 1) both are engaged in continuous next-word prediction before word-onset; 2) both match their pre-onset predictions to the incoming word to calculate post-onset surprise (i.e., prediction error signals); 3) both represent words as a function of the previous context. In support of these three principles, our findings indicate that: a) the neural activity before word-onset contains context-dependent predictive information about forthcoming words, even hundreds of milliseconds before the words are perceived; b) the neural activity after word-onset reflects the surprise level and prediction error; and c) autoregressive DLM contextual embeddings capture the neural representation of context-specific word meaning better than arbitrary or static semantic embeddings. Together, our findings suggest that autoregressive DLMs provide a novel and biologically feasible computational framework for studying the neural basis of language.
| Year | Citations | |
|---|---|---|
Page 1
Page 1