Publication | Open Access
Native Language Identification with User Generated Content
23
Citations
30
References
2018
Year
Unknown Venue
EngineeringMultilingualismCommunicationLanguage LearningCorpus LinguisticsText MiningApplied LinguisticsNatural Language ProcessingSecond Language AcquisitionLanguage DocumentationNative Language IdentificationComputational LinguisticsSocial Media OutletLanguage EngineeringLanguage StudiesContent AnalysisMachine TranslationLanguage TechnologyAuthor ProfilingLanguage LocalisationLanguage RecognitionSocial Media ContentSocial Medium DataLinguistics
We address the task of native language identification in the context of social media content, where authors are highly-fluent, advanced nonnative speakers (of English). Using both linguistically-motivated features and the characteristics of the social media outlet, we obtain high accuracy on this challenging task. We provide a detailed analysis of the features that sheds light on differences between native and nonnative speakers, and among nonnative speakers with different backgrounds.
| Year | Citations | |
|---|---|---|
Page 1
Page 1