Publication | Open Access
Sentimental Analysis of COVID-19 Tweets Using Deep Learning Models
187
Citations
25
References
2021
Year
Novel Coronavirus DiseaseEngineeringIndian TweetsMultimodal Sentiment AnalysisAnalyses TweetsCorpus LinguisticsJournalismSentiment AnalysisLanguage ProcessingText MiningNatural Language ProcessingLarge Language ModelsSentimental AnalysisWord EmbeddingsSocial MediaData ScienceComputational LinguisticsLanguage StudiesLanguage ModelsContent AnalysisSocial Medium MiningLanguage Modeling (Natural Language Processing)Social Medium IntelligenceLanguage Modeling (Theoretical Linguistics)Social Medium DataLinguistics
COVID‑19 remains a global pandemic, and misinformation on Twitter fuels public anxiety. The study aims to analyze Indian tweets during the lockdown to understand public sentiment. Tweets from 23 March to 15 July 2020 were labeled as fear, sad, anger, or joy and classified using BERT, logistic regression, SVM, and LSTM, with accuracy computed per sentiment. BERT achieved 89 % accuracy, outperforming LR (75 %), SVM (74.75 %), and LSTM (65 %), with sentiment accuracies ranging 75.88–87.33 % (median 79.34 %), revealing prevalent keywords and informing public health authorities.
The novel coronavirus disease (COVID-19) is an ongoing pandemic with large global attention. However, spreading false news on social media sites like Twitter is creating unnecessary anxiety towards this disease. The motto behind this study is to analyses tweets by Indian netizens during the COVID-19 lockdown. The data included tweets collected on the dates between 23 March 2020 and 15 July 2020 and the text has been labelled as fear, sad, anger, and joy. Data analysis was conducted by Bidirectional Encoder Representations from Transformers (BERT) model, which is a new deep-learning model for text analysis and performance and was compared with three other models such as logistic regression (LR), support vector machines (SVM), and long-short term memory (LSTM). Accuracy for every sentiment was separately calculated. The BERT model produced 89% accuracy and the other three models produced 75%, 74.75%, and 65%, respectively. Each sentiment classification has accuracy ranging from 75.88-87.33% with a median accuracy of 79.34%, which is a relatively considerable value in text mining algorithms. Our findings present the high prevalence of keywords and associated terms among Indian tweets during COVID-19. Further, this work clarifies public opinion on pandemics and lead public health authorities for a better society.
| Year | Citations | |
|---|---|---|
Page 1
Page 1