Publication | Open Access
Emergent Abilities of Large Language Models
1K
Citations
0
References
2022
Year
EngineeringAdditional ScalingPsycholinguisticsMultilingual PretrainingLarge Language ModelLanguage LearningCorpus LinguisticsNatural Language ProcessingLarge Language ModelsLanguage AdaptationComputational LinguisticsLanguage AcquisitionLanguage StudiesLanguage ModelsMachine TranslationLarge Ai ModelNatural LanguageCognitive ScienceEmergent AbilitiesLanguage ScienceLinguistics
Scaling language models consistently improves performance and sample efficiency across many downstream tasks. This paper examines the unpredictable phenomenon of emergent abilities that appear only in large models. Emergent abilities cannot be forecasted by extrapolating smaller models, suggesting further scaling may unlock additional capabilities.
Scaling up language models has been shown to predictably improve performance and sample efficiency on a wide range of downstream tasks. This paper instead discusses an unpredictable phenomenon that we refer to as emergent abilities of large language models. We consider an ability to be emergent if it is not present in smaller models but is present in larger models. Thus, emergent abilities cannot be predicted simply by extrapolating the performance of smaller models. The existence of such emergence implies that additional scaling could further expand the range of capabilities of language models.