Publication | Open Access
Gender Inference of Twitter Users in Non-English Contexts
132
Citations
16
References
2013
Year
Unknown Venue
Gendered PerceptionEngineeringCorpus LinguisticsText MiningLatent Attribute InferenceApplied LinguisticsNatural Language ProcessingSocial MediaGender IdentityData ScienceGender StudiesComputational LinguisticsLanguage StudiesContent AnalysisSocial Medium MiningMachine TranslationGendered ContextLanguage TechnologyGender InferenceSocial Medium DataLinguistics
While much work has considered the problem of latent attribute inference for users of social media such as Twitter, little has been done on non-English-based content and users. Here, we conduct the first assessment of latent attribute inference in languages beyond English, focusing on gender inference. We find that the gender inference problem in quite diverse languages can be addressed using existing machinery. Further, accuracy gains can be made by taking language-specific features into account. We identify languages with complex orthography, such as Japanese, as difficult for existing methods, suggesting a valuable direction for future research.
| Year | Citations | |
|---|---|---|
Page 1
Page 1