Sentiment strength detection for the social web

TLDR

Sentiment analysis traditionally targets commercial tasks, but its application to the social web—especially Twitter—faces challenges because existing algorithms rely on indirect cues that can be confounded by genre or topic, leading to spurious sentiment detection. This study evaluates an improved version of SentiStrength for detecting sentiment strength on the social web using direct sentiment indicators. The authors test SentiStrength 2, an enhanced algorithm that emphasizes direct sentiment cues rather than indirect indicators, across multiple social media platforms. Across six diverse social‑web datasets, SentiStrength 2 outperforms a baseline in both supervised and unsupervised settings, though it is sometimes outperformed by machine‑learning methods and shows weaker performance for positive sentiment in news‑related discussions, yet remains robust enough for broad application.

Abstract

Abstract Sentiment analysis is concerned with the automatic extraction of sentiment‐related information from text. Although most sentiment analysis addresses commercial tasks, such as extracting opinions from product reviews, there is increasing interest in the affective dimension of the social web, and Twitter in particular. Most sentiment analysis algorithms are not ideally suited to this task because they exploit indirect indicators of sentiment that can reflect genre or topic instead. Hence, such algorithms used to process social web texts can identify spurious sentiment patterns caused by topics rather than affective phenomena. This article assesses an improved version of the algorithm SentiStrength for sentiment strength detection across the social web that primarily uses direct indications of sentiment. The results from six diverse social web data sets (MySpace, Twitter, YouTube, Digg, Runners World, BBC Forums) indicate that SentiStrength 2 is successful in the sense of performing better than a baseline approach for all data sets in both supervised and unsupervised cases. SentiStrength is not always better than machine‐learning approaches that exploit indirect indicators of sentiment, however, and is particularly weaker for positive sentiment in news‐related discussions. Overall, the results suggest that, even unsupervised, SentiStrength is robust enough to be applied to a wide variety of different social web contexts.

References

Page 1

	Year	Citations
Content Analysis: An Introduction to its Methodology. Mack Shelley, Klaus Krippendorff Journal of the American Statistical Association ReliabilityError AnalysisSurvey MethodologyEngineeringData Reliability	1984	24.6K
Opinion Mining and Sentiment Analysis Bo Pang, Lillian Lee Foundations and Trends® in Information Retrieval Natural Language ProcessingCustomer ReviewOpinion MiningEngineeringData Science	2008	6.1K
Advances in kernel methods: support vector learning Bernhard Schölkopf, Christopher J. C. Burges, Alexander J. Smola International Conference on Neural Information Processing Support VectorEngineeringMachine LearningSupport Vector LearningSupport Vector Machine	1999	5.8K
Making Large-Scale SVM Learning Practical Thorsten Joachims Technical reports Artificial IntelligenceMathematical ProgrammingSupport Vector MachineImage AnalysisMachine Learning	2006	4.3K
Thumbs Up or Thumbs Down? Semantic Orientation Applied to Unsupervised Classification of Reviews Peter Peter, Turney Meeting of the Association for Computational Linguistics Semantic Orientation AppliedEngineeringMultimodal Sentiment AnalysisCorpus LinguisticsLanguage Processing	2002	3.7K
Lexicon-Based Methods for Sentiment Analysis Maite Taboada, Julian Brooke, Milan Tofiloski, Computational Linguistics Semantic Orientation CalculatorPublic OpinionMultimodal Sentiment AnalysisCorpus LinguisticsSentiment Analysis	2011	3.2K
SentiWordNet 3.0: An Enhanced Lexical Resource for Sentiment Analysis and Opinion Mining. Stefano Baccianella, Andrea Esuli, Fabrizio Sebastiani	2010	2.7K
From Tweets to Polls: Linking Text Sentiment to Public Opinion Time Series Brendan O’Connor, Ramnath Balasubramanyan, Bryan Routledge, Proceedings of the International AAAI Conference on Web and Social Media Text SentimentSocial MediaOpinion AggregationSocial Medium MonitoringSentiment Analysis	2010	1.9K
Inter-Coder Agreement for Computational Linguistics Ron Artstein, Massimo Poesio Computational Linguistics EngineeringPart-of-speech TaggingAlpha-like CoefficientsCommunicationSemantics	2008	1.5K
Sentiment strength detection in short informal text Mike Thelwall, Kevan Buckley, Di Cai, Journal of the American Society for Information Science and Technology EngineeringMachine LearningSocial Medium MonitoringCommunicationMultimodal Sentiment Analysis	2010	1.4K

Page 1