Sentimental Analysis of COVID-19 Tweets Using Deep Learning Models

TLDR

COVID‑19 remains a global pandemic, and misinformation on Twitter fuels public anxiety. The study aims to analyze Indian tweets during the lockdown to understand public sentiment. Tweets from 23 March to 15 July 2020 were labeled as fear, sad, anger, or joy and classified using BERT, logistic regression, SVM, and LSTM, with accuracy computed per sentiment. BERT achieved 89 % accuracy, outperforming LR (75 %), SVM (74.75 %), and LSTM (65 %), with sentiment accuracies ranging 75.88–87.33 % (median 79.34 %), revealing prevalent keywords and informing public health authorities.

Abstract

The novel coronavirus disease (COVID-19) is an ongoing pandemic with large global attention. However, spreading false news on social media sites like Twitter is creating unnecessary anxiety towards this disease. The motto behind this study is to analyses tweets by Indian netizens during the COVID-19 lockdown. The data included tweets collected on the dates between 23 March 2020 and 15 July 2020 and the text has been labelled as fear, sad, anger, and joy. Data analysis was conducted by Bidirectional Encoder Representations from Transformers (BERT) model, which is a new deep-learning model for text analysis and performance and was compared with three other models such as logistic regression (LR), support vector machines (SVM), and long-short term memory (LSTM). Accuracy for every sentiment was separately calculated. The BERT model produced 89% accuracy and the other three models produced 75%, 74.75%, and 65%, respectively. Each sentiment classification has accuracy ranging from 75.88-87.33% with a median accuracy of 79.34%, which is a relatively considerable value in text mining algorithms. Our findings present the high prevalence of keywords and associated terms among Indian tweets during COVID-19. Further, this work clarifies public opinion on pandemics and lead public health authorities for a better society.

References

Page 1

	Year	Citations

Page 1