Publication | Closed Access
The Unreasonable Effectiveness of Data
1.8K
Citations
12
References
2009
Year
Artificial IntelligenceStructured PredictionEngineeringMachine LearningLarge Language ModelNeat FormulasNatural Language ProcessingData ScienceBiasComputational LinguisticsLanguage EngineeringUnreasonable EffectivenessFair Data PrincipleData ManagementStatisticsMachine TranslationLarge Ai ModelHigh-quality ModelsNlp TaskKnowledge DiscoveryData PrivacyComputer ScienceInformation ManagementData SecurityNatural Language UnderstandingResponsible Data ManagementData TreatmentArtsLinguisticsBig Data
Problems that involve interacting with humans, such as natural language understanding, have not proven to be solvable by concise, neat formulas like F = ma. Instead, the best approach appears to be to embrace the complexity of the domain and address it by harnessing the power of data: if other humans engage in the tasks and generate large amounts of unlabeled, noisy data, new algorithms can be used to build high-quality models from the data.
| Year | Citations | |
|---|---|---|
Page 1
Page 1