Publication | Open Access
Big Questions for Social Media Big Data: Representativeness, Validity and Other Methodological Pitfalls
688
Citations
21
References
2014
Year
Other Methodological PitfallsSocial Data AnalysisBig QuestionsEngineeringSocial Medium MonitoringHuman ActivityOnline CommunitiesSocial TechnologiesCommunicationComputational Social ScienceSocial MediaData ScienceLarge-scale DatabasesSocial Aspects Of Data MiningContent AnalysisSocial Network AnalysisSocial Medium MiningSocial NetworksInformation ManagementSocial Data ManagementSocial Media PlatformsSocial Media MiningMedia PoliciesSocial Medium IntelligenceSocial ComputingPolitical CampaignsSocial Medium DataArtsBig Data
Large‑scale social‑media databases have attracted intense research interest, yet the field faces persistent challenges such as over‑reliance on Twitter, sampling biases, vague sampling frames, algorithmic invisibility, and the influence of broader societal events that can confound network‑based analyses. The authors examine methodological and conceptual challenges in social‑media big‑data research, focusing on validity and representativeness. They conclude with a call to action for practical steps to enhance analytic capacity in this rapidly growing field.
Large-scale databases of human activity in social media have captured scientific and policy attention, producing a flood of research and discussion. This paper considers methodological and conceptual challenges for this emergent field, with special attention to the validity and representativeness of social media big data analyses. Persistent issues include the over-emphasis of a single platform, Twitter, sampling biases arising from selection by hashtags, and vague and unrepresentative sampling frames. The socio-cultural complexity of user behavior aimed at algorithmic invisibility (such as subtweeting, mock-retweeting, use of “screen captures” for text, etc.) further complicate interpretation of big data social media. Other challenges include accounting for field effects, i.e. broadly consequential events that do not diffuse only through the network under study but affect the whole society. The application of network methods from other fields to the study of human social activity may not always be appropriate. The paper concludes with a call to action on practical steps to improve our analytic capacity in this promising, rapidly-growing field.
| Year | Citations | |
|---|---|---|
Page 1
Page 1