Publication | Closed Access
Ten-Year Compilation of #SaveKPK Twitter Dataset
Citations: 13
References: 8
Year: 2020
Unknown Venue
Latent Dirichlet Allocation, Engineering, Social Medium Monitoring, Communication, Corruption Eradication, Large-scale Datasets, Corpus Linguistics, Journalism, Text Mining, Natural Language Processing, Computational Social Science, Savekpk Twitter Dataset, Social Media, Data Science, Data Mining, Data Integration, Content Analysis, Data Management, Social Medium Mining, Knowledge Discovery, Topic Model, Social Medium Data, Arts, Big Data
Politics is one of the most popular topics of discussion on social media in Indonesia. One indication is #SaveKPK, a popular movement in support of the Corruption Eradication Commission (KPK), which has been active for ten years and has become a trending topic on Twitter several times. In this research, all tweets containing '#SaveKPK' are crawled and compiled using an alternative algorithm for retrieving historical Twitter data instead of the Twitter API. The results describe the characteristics of the dataset statistically, from the most frequently used words to the most active users. Latent Dirichlet Allocation (LDA), an unsupervised topic-modeling algorithm, was run over the gathered text to discover the most relevant keywords.
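The descriptive statistics mentioned in the abstract (most frequently used words, most active users) could be computed roughly as in the sketch below. This is an illustration only, not the authors' code: the record layout (`user`, `text`), the sample tweets, the tokenizer, and the function names `tokenize`, `top_words`, and `top_users` are all assumptions, since the abstract does not describe the dataset schema or the preprocessing used.

```python
import re
from collections import Counter

# Hypothetical sample records; the real dataset's schema is not given in the abstract.
tweets = [
    {"user": "alice", "text": "Dukung KPK! #SaveKPK"},
    {"user": "budi", "text": "#SaveKPK lawan korupsi"},
    {"user": "alice", "text": "Korupsi harus dilawan #savekpk"},
]

# Matches plain words and hashtags (an optional leading '#').
TOKEN_RE = re.compile(r"#?\w+")


def tokenize(text):
    """Lowercase a tweet and split it into word and hashtag tokens."""
    return TOKEN_RE.findall(text.lower())


def top_words(records, n=3, exclude=("#savekpk",)):
    """Return the n most frequent tokens, skipping the query hashtag itself."""
    counts = Counter(
        tok
        for rec in records
        for tok in tokenize(rec["text"])
        if tok not in exclude
    )
    return counts.most_common(n)


def top_users(records, n=3):
    """Return the n users with the most tweets in the collection."""
    return Counter(rec["user"] for rec in records).most_common(n)


if __name__ == "__main__":
    print(top_words(tweets, n=1))
    print(top_users(tweets, n=1))
```

The query hashtag is excluded from the word counts because every collected tweet contains it by construction. A real pipeline would likely also remove stop words and URLs before counting or topic modeling.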