Publication | Closed Access
Topical Clustering of Tweets
Citations: 152
References: 8
Year: 2011
Unknown Venue
Engineering, Tweets, Social Medium Monitoring, Computational Analysis, Communication, Journalism, Text Mining, Computational Social Science, Social Media, Data Science, Data Mining, Social Aspects Of Data Mining, Content Analysis, Social Medium Mining, Document Clustering, Google News, Knowledge Discovery, Social Media Mining, Supervised Methodology, Topic Model, Topical Clustering, Social Medium Data, Arts
In the emerging field of micro-blogging and social communication services, users post millions of short messages every day. Keeping track of all the messages posted by your friends, and of the conversation as a whole, can become tedious or even impossible. In this paper, we present a study on automatically clustering and classifying Twitter messages, also known as “tweets”, into different categories, inspired by the approaches taken by news aggregating services like Google News. Our results suggest that the clusters produced by traditional unsupervised methods can often be incoherent from a topical perspective, but that a supervised methodology that uses hash-tags as indicators of topics produces surprisingly good results. We also discuss the temporal effects of our methodology and training set size considerations. Lastly, we describe a simple method of finding the most representative tweet in a cluster, and provide an analysis of the results.
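The abstract does not say which classifier or which representativeness measure the authors used, so the following is only a minimal sketch of the general idea, under stated assumptions: hash-tags serve as topic labels for training, a multinomial Naive Bayes model (an assumption, not necessarily the paper's choice) assigns a topic to an untagged tweet, and the "most representative" tweet is taken to be the one with the highest cosine similarity to the cluster's word-count centroid (also an assumption). All function names are hypothetical.

```python
import math
import re
from collections import Counter, defaultdict

HASHTAG = re.compile(r"#(\w+)")
TOKEN = re.compile(r"[a-z0-9']+")


def tokenize(text):
    """Lowercase the text, strip hash-tags, and split into word tokens."""
    return TOKEN.findall(HASHTAG.sub(" ", text.lower()))


def label_from_hashtag(text, topics):
    """Return the first hash-tag that names a known topic, or None."""
    for tag in HASHTAG.findall(text.lower()):
        if tag in topics:
            return tag
    return None


def train(tweets, topics):
    """Collect per-topic word counts from tweets that carry a topic hash-tag."""
    word_counts = defaultdict(Counter)
    doc_counts = Counter()
    for text in tweets:
        label = label_from_hashtag(text, topics)
        if label is not None:
            word_counts[label].update(tokenize(text))
            doc_counts[label] += 1
    return word_counts, doc_counts


def classify(text, word_counts, doc_counts):
    """Assumed classifier: multinomial Naive Bayes with add-one smoothing."""
    vocab = {w for counts in word_counts.values() for w in counts}
    total_docs = sum(doc_counts.values())
    best_label, best_score = None, -math.inf
    for label, counts in word_counts.items():
        score = math.log(doc_counts[label] / total_docs)
        denom = sum(counts.values()) + len(vocab)
        for word in tokenize(text):
            score += math.log((counts[word] + 1) / denom)
        if score > best_score:
            best_label, best_score = label, score
    return best_label


def cosine(a, b):
    """Cosine similarity between two word-count vectors."""
    dot = sum(a[w] * b[w] for w in a if w in b)
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0


def most_representative(cluster):
    """Assumed measure: the tweet closest to the cluster's word-count centroid."""
    centroid = Counter()
    for text in cluster:
        centroid.update(tokenize(text))
    return max(cluster, key=lambda t: cosine(Counter(tokenize(t)), centroid))
```

A caller would first run `train` on a hash-tagged sample with a chosen set of topic tags, then pass the returned counts to `classify` for untagged tweets. Hash-tags are stripped during tokenization so that the label itself does not leak into the features.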