Large language models for sentiment analysis of newspaper articles during COVID-19: The Guardian

2405.13056

Published 5/24/2024 by Rohitash Chandra, Baicheng Zhu, Qingying Fang, Eka Shinjikashvili

Abstract

During the COVID-19 pandemic, news media coverage encompassed a wide range of topics, including viral transmission, allocation of medical resources, and government response measures. Studies have applied sentiment analysis to social media platforms during COVID-19 to understand the public response to rising case numbers and to the government strategies implemented to control the spread of the virus. Sentiment analysis can provide a better understanding of changes in societal opinions and emotional trends during the pandemic. Apart from social media, newspapers have played a vital role in the dissemination of information, including information from the government, experts, and the public about various topics. A sentiment analysis of newspaper sources during COVID-19 for selected countries can give an overview of how the media covered the pandemic. In this study, we select The Guardian newspaper and provide a sentiment analysis across various stages of COVID-19, including initial transmission, lockdowns and vaccination. We employ novel large language models (LLMs) and refine them with expert-labelled sentiment analysis data. We also provide an analysis of sentiments expressed pre-pandemic for comparison. The results indicate that during the early pandemic stages, public sentiment prioritised urgent crisis response, later shifting focus to addressing the impact on health and the economy. In comparison with related studies of social media sentiment, we found a discrepancy: The Guardian showed a dominance of negative sentiments (sad, annoyed, anxious and denial), suggesting that social media offers a more diversified emotional reflection. We found a grim narrative in The Guardian, with an overall dominance of negative sentiments both before and during COVID-19 across news sections including Australia, UK, World News, and Opinion.

Overview

  • The study analyzes sentiment in The Guardian newspaper during the COVID-19 pandemic.
  • It uses novel large language models (LLMs) to perform sentiment analysis on articles from different news sections.
  • The study compares sentiment during the pandemic to the pre-pandemic period.
  • The results show a dominance of negative sentiments like sadness, annoyance, and anxiety in The Guardian's coverage.
  • This contrasts with studies of social media sentiment, which found more diverse emotional expression.

Plain English Explanation

This study looked at the feelings and emotions expressed in articles from The Guardian newspaper during the COVID-19 pandemic. The researchers used advanced AI language models to analyze the sentiment in the newspaper's coverage. They compared the sentiment during the pandemic to the pre-pandemic period.

The results showed that The Guardian's coverage was dominated by negative emotions like sadness, annoyance, and anxiety. This was in contrast to studies of social media, which found a more diverse range of emotions being expressed online. The researchers describe a "grim narrative" in The Guardian, both before and during the pandemic. They also report that early coverage prioritised urgent crisis response, while later coverage shifted toward the impacts on health and the economy.

This study provides insights into how the media portrayed the COVID-19 pandemic, and how that differed from the public's own emotional experiences shared on social media platforms. By using advanced sentiment analysis techniques, the researchers were able to get a more nuanced understanding of the media's coverage and the public's response.

Technical Explanation

The researchers in this study conducted a sentiment analysis of articles from The Guardian newspaper during the COVID-19 pandemic. They used novel large language models (LLMs) that were refined with expert-labeled sentiment data to analyze the emotional tone of the coverage.
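To make the idea of assigning several sentiments to one article concrete, here is a minimal sketch of the final decision step of a multi-label classifier. It assumes the refined model outputs one raw score (logit) per sentiment label; the summary does not describe the model's output format, so this is an illustration rather than the authors' pipeline. The label list and the 0.5 threshold are assumptions: the first four label names appear in the summary, and "optimistic" is a hypothetical positive label added for contrast.

```python
import math

# Hypothetical label set: the first four names appear in the summary;
# "optimistic" is an assumed positive label added for illustration only.
LABELS = ["sad", "annoyed", "anxious", "denial", "optimistic"]


def sigmoid(x: float) -> float:
    """Map a raw score to a probability in (0, 1)."""
    return 1.0 / (1.0 + math.exp(-x))


def predict_labels(logits, threshold=0.5):
    """Return every label whose sigmoid probability reaches the threshold.

    Each label is decided independently, so an article can receive
    zero, one, or several sentiments.
    """
    if len(logits) != len(LABELS):
        raise ValueError("expected one logit per label")
    return [label for label, z in zip(LABELS, logits) if sigmoid(z) >= threshold]


# Made-up scores for a single article (not real model output).
example_logits = [2.0, -1.0, 0.3, -3.0, -0.2]
print(predict_labels(example_logits))
```

Treating each label independently, rather than picking a single best label, is what allows an article to be counted as, say, both sad and anxious.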

The analysis spanned different sections of The Guardian, including news from Australia, the UK, and the world, as well as opinion pieces. The researchers compared the sentiment during the pandemic to the pre-pandemic period to understand how the coverage and emotional tone shifted.
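The section-by-period comparison described above can be sketched as a simple aggregation. This is a hypothetical illustration, assuming each article has already been given a section, a period ("pre" or "during"), and a list of predicted sentiment labels; the field names and the toy records are invented for the example and are not data from the study.

```python
from collections import Counter, defaultdict


def label_shares(articles):
    """Compute each sentiment's share of all labels per (section, period)."""
    counts = defaultdict(Counter)
    for art in articles:
        counts[(art["section"], art["period"])].update(art["labels"])

    shares = {}
    for key, counter in counts.items():
        total = sum(counter.values())
        shares[key] = {label: n / total for label, n in counter.items()}
    return shares


# Toy records, invented for illustration only.
toy_articles = [
    {"section": "UK", "period": "pre", "labels": ["sad"]},
    {"section": "UK", "period": "pre", "labels": ["optimistic"]},
    {"section": "UK", "period": "during", "labels": ["sad", "anxious"]},
    {"section": "UK", "period": "during", "labels": ["anxious"]},
]

shares = label_shares(toy_articles)
for key in sorted(shares):
    print(key, shares[key])
```

Normalising counts to shares makes periods with different numbers of articles comparable, which matters when the pre-pandemic and pandemic windows differ in length or volume.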

The results showed a dominance of negative sentiments like sadness, annoyance, anxiety, and denial in The Guardian's pandemic coverage. This contrasted with studies of social media sentiment, which found a more diverse range of emotions being expressed online, including positive sentiments.

The researchers describe a "grim narrative" in The Guardian, with negative sentiments dominating both before and during the pandemic. They report that early coverage prioritised urgent crisis response, with the focus later shifting to the impacts on health and the economy. This points to a discrepancy between the newspaper's portrayal and the more varied emotions that related studies found on social media platforms.

Critical Analysis

The study provides valuable insights into how the media covered the COVID-19 pandemic, but there are a few potential limitations and areas for further research:

  1. The analysis is limited to a single news source, The Guardian, which may not be representative of all media coverage. Expanding the analysis to include other major news outlets could provide a more comprehensive understanding.

  2. The researchers used expert-labeled sentiment data to refine the language models, but the accuracy and reliability of this labeling process are not fully discussed. Validating the sentiment analysis results against human ratings could strengthen confidence in the findings.

  3. The study does not explore the potential reasons for the discrepancy between The Guardian's coverage and the sentiment observed on social media. Further research could investigate the editorial decision-making processes, journalistic practices, or audience preferences that may have contributed to this difference.

  4. While the study highlights the dominance of negative sentiments in The Guardian's coverage, it does not delve into the specific emotional trajectories or thematic patterns within the news articles. A more detailed analysis, for example of the sentiment expressed toward specific political entities, could provide additional insights.

Overall, this study offers a valuable contribution to understanding media representation and public sentiment during the COVID-19 pandemic. However, further research is needed to validate the findings, explore potential explanations, and broaden the scope of analysis.

Conclusion

This study provides a sentiment analysis of The Guardian newspaper's coverage during the COVID-19 pandemic. Using novel large language models and expert-labeled data, the researchers found a dominance of negative sentiments like sadness, annoyance, and anxiety in the newspaper's reporting.

This contrasts with studies of social media sentiment, which have found a more diverse range of emotional expression. The researchers describe a "grim narrative" in The Guardian, with early coverage prioritising urgent crisis response and later coverage shifting to the impacts on health and the economy.

The findings offer insights into how the media portrayed the COVID-19 crisis and how that differed from the public's own emotional experiences shared on social media. This study highlights the value of sentiment analysis in understanding media coverage and public sentiment during significant events like a global pandemic.



This summary was produced with help from an AI and may contain inaccuracies - check out the links to read the original source documents!

Related Papers

Word frequency and sentiment analysis of twitter messages during Coronavirus pandemic

Nikhil Kumar Rajput, Bhavya Ahuja Grover, Vipin Kumar Rathi, Riya Bansal

The COVID-19 epidemic has had a great impact on social media conversation, especially on sites like Twitter, which has emerged as a hub for public reaction and information sharing. This paper analyzes a vast dataset of Twitter messages related to the disease, starting from January 2020. Two approaches were used: a statistical analysis of word frequencies and a sentiment analysis to gauge user attitudes. Word frequencies are modeled using unigrams, bigrams, and trigrams, with a power law distribution as the fitting model. The validity of the model is confirmed through metrics like Sum of Squared Errors (SSE), R-squared ($R^2$), and Root Mean Squared Error (RMSE). High $R^2$ and low SSE/RMSE values indicate a good fit for the model. Sentiment analysis is conducted to understand the general emotional tone of Twitter users' messages. The results reveal that a majority of tweets exhibit neutral sentiment polarity, with only 2.57% expressing negative polarity.

6/4/2024

COVID-19 Twitter Sentiment Classification Using Hybrid Deep Learning Model Based on Grid Search Methodology

Jitendra Tembhurne, Anant Agrawal, Kirtan Lakhotia

In the contemporary era, social media platforms amass an extensive volume of social data contributed by their users. To promptly grasp the opinions and emotional inclinations of individuals regarding a product or event, it becomes imperative to perform sentiment analysis on the user-generated content. Microblog comments often encompass both lengthy and concise text entries, presenting a complex scenario. This complexity is particularly pronounced in extensive textual content due to its rich content and intricate word interrelations compared to shorter text entries. Sentiment analysis of public opinion shared on social networking websites such as Facebook or Twitter has evolved and found diverse applications. However, several challenges remain to be tackled in this field. Hybrid methodologies have emerged as promising models for mitigating sentiment analysis errors, particularly when dealing with progressively intricate training data. In this article, to investigate COVID-19 vaccination hesitancy, we propose eight different hybrid deep learning models for sentiment classification with the aim of improving the overall accuracy of the model. The sentiment prediction is achieved using embedding, a deep learning model and a grid search (GS) algorithm on a Twitter COVID-19 dataset. According to the study, public sentiment towards COVID-19 immunization appears to be improving with time, as evidenced by the gradual decline in vaccine reluctance. Through extensive evaluation, the proposed model reported an accuracy of 98.86%, outperforming other models. Specifically, the combination of BERT, CNN and GS yields the highest accuracy, while the combination of GloVe, BiLSTM, CNN and GS follows closely behind with an accuracy of 98.17%. In addition, the proposed model reports an increase in accuracy in the range of 2.11% to 14.46% in comparison with existing works.

6/18/2024

#EpiTwitter: Public Health Messaging During the COVID-19 Pandemic

Ashwin Rao, Nazanin Sabri, Siyi Guo, Louiqa Raschid, Kristina Lerman

Effective communication during health crises is critical, with social media serving as a key platform for public health experts (PHEs) to engage with the public. However, it also amplifies pseudo-experts promoting contrarian views. Despite its importance, the role of emotional and moral language in PHEs' communication during COVID-19 remains underexplored. This study examines how PHEs and pseudo-experts communicated on Twitter during the pandemic, focusing on emotional and moral language and their engagement with political elites. Analyzing tweets from 489 PHEs and 356 pseudo-experts from January 2020 to January 2021, alongside public responses, we identified key priorities and differences in messaging strategy. PHEs prioritize masking, healthcare, education, and vaccines, using positive emotional language like optimism. In contrast, pseudo-experts discuss therapeutics and lockdowns more frequently, employing negative emotions like pessimism and disgust. Negative emotional and moral language tends to drive engagement, but positive language from PHEs fosters positivity in public responses. PHEs exhibit liberal partisanship, expressing more positivity towards liberals and negativity towards conservative elites, while pseudo-experts show conservative partisanship. These findings shed light on the polarization of COVID-19 discourse and underscore the importance of strategic use of emotional and moral language by experts to mitigate polarization and enhance public trust.

6/12/2024

Global News Synchrony and Diversity During the Start of the COVID-19 Pandemic

Xi Chen, Scott A. Hale, David Jurgens, Mattia Samory, Ethan Zuckerman, Przemyslaw A. Grabowicz

News coverage profoundly affects how countries and individuals behave in international relations. Yet, we have little empirical evidence of how news coverage varies across countries. To enable studies of global news coverage, we develop an efficient computational methodology that comprises three components: (i) a transformer model to estimate multilingual news similarity; (ii) a global event identification system that clusters news based on a similarity network of news articles; and (iii) measures of news synchrony across countries and news diversity within a country, based on country-specific distributions of news coverage of the global events. Each component achieves state-of-the-art performance, scaling seamlessly to massive datasets of millions of news articles. We apply the methodology to 60 million news articles published globally between January 1 and June 30, 2020, across 124 countries and 10 languages, detecting 4357 news events. We identify the factors explaining diversity and synchrony of news coverage across countries. Our study reveals that news media tend to cover a more diverse set of events in countries with larger Internet penetration, more official languages, larger religious diversity, higher economic inequality, and larger populations. Coverage of news events is more synchronized between countries that not only actively participate in commercial and political relations (such as pairs of countries with high bilateral trade volume, and countries that belong to the NATO military alliance or the BRICS group of major emerging economies) but also share certain traits: an official language, high GDP, and high democracy indices.

5/2/2024