CrisisSense-LLM: Instruction Fine-Tuned Large Language Model for Multi-label Social Media Text Classification in Disaster Informatics

Read original: arXiv:2406.15477 - Published 6/26/2024 by Kai Yin, Chengkai Liu, Ali Mostafavi, Xia Hu

💬

Overview

This study explores a novel approach to disaster text classification using a pre-trained Large Language Model (LLM) that has been fine-tuned through instruction-based learning.
The researchers aim to enhance the ability of LLMs to categorize multiple aspects of disaster-related information from social media data, such as the type of event, informativeness, and involvement of human aid.
This approach aims to improve situational awareness and response strategies during emergencies by leveraging the capabilities of LLMs to extract more valuable insights from social media.

Plain English Explanation

When disasters strike, social media can be a valuable source of real-time information to help responders understand what is happening and coordinate relief efforts. However, extracting useful insights from the large volume of social media data can be challenging. This study introduces a new way to address this problem.

The researchers used a powerful language model, which is a type of artificial intelligence that has been trained on massive amounts of text data, and further trained it specifically on disaster-related tweets. This special training helps the model better understand the unique language and information patterns found in social media posts about disasters.

Unlike previous approaches that could only classify a single aspect of a social media post, this new model can simultaneously identify multiple important details, such as the type of disaster, whether the post contains useful information, and whether it mentions human aid efforts. This ability to capture a more comprehensive understanding of disaster-related social media posts can significantly improve the usefulness of this data for situational awareness and response coordination during emergencies.

By leveraging the capabilities of advanced language models, this research paves the way for more effective disaster management tools that can better extract critical information from social media in real-time, ultimately helping to save lives and minimize the impact of disasters.

Technical Explanation

The researchers in this study focused on enhancing the text classification capabilities of pre-trained Large Language Models (LLMs) for the specific domain of disaster-related social media data. Current methods for disaster text classification often rely on single-label models, which fail to capture the nuanced and multifaceted nature of disaster-related information on social media.

To address this, the researchers developed a novel approach that fine-tunes an open-source LLM through instruction-based learning using a comprehensive dataset of disaster-related tweets. This process embeds the LLM with disaster-specific knowledge, enabling it to classify multiple aspects of a social media post simultaneously, such as the type of event, informativeness, and involvement of human aid.

The researchers first created a dataset of disaster-related tweets and annotated them with labels corresponding to these different aspects. They then used this dataset to fine-tune an LLM, a process known as instruction fine-tuning. This approach allows the model to learn how to classify the various elements of disaster-related information within a single tweet, rather than relying on separate models for each task.

The results of this study demonstrate that this fine-tuned LLM outperforms traditional single-label classification models in categorizing critical information from social media posts, making it a valuable tool for improving situational awareness and response strategies during disasters. The researchers highlight the potential for this approach to be further developed into more advanced, adaptable, and robust disaster management systems that leverage the capabilities of LLMs.

Critical Analysis

The researchers acknowledge several limitations and areas for further investigation in their study. While the fine-tuned LLM showed promising results in classifying multiple aspects of disaster-related social media posts, the researchers note that the performance may vary depending on the specific type of disaster and the characteristics of the social media data.

Additionally, the researchers suggest that further research is needed to explore the generalizability of this approach to other languages and cultural contexts, as the majority of the data used in this study was in English. Expanding the capabilities of these models to handle multilingual disaster-related data could significantly broaden their practical application.

Another area for potential improvement is the robustness of the models to handle noisy or incomplete social media data, which can be a common challenge in disaster scenarios. [Exploring techniques like QuakeBERT to enhance the models' ability to accurately classify social media texts in the face of such challenges could further strengthen their utility for real-world disaster response.

Overall, this study presents a promising approach to leveraging the power of LLMs for more comprehensive and effective disaster text classification, paving the way for improved situational awareness and response strategies. However, continued research and refinement will be necessary to address the limitations and expand the capabilities of this technology for monitoring critical infrastructure and other disaster management applications.

Conclusion

This study introduces a novel approach to disaster text classification that enhances a pre-trained Large Language Model (LLM) through instruction fine-tuning. By creating a comprehensive dataset of disaster-related tweets and using it to fine-tune the LLM, the researchers were able to develop a model that can simultaneously classify multiple aspects of disaster-related information from social media posts, such as the type of event, informativeness, and involvement of human aid.

The results demonstrate that this fine-tuned LLM outperforms traditional single-label classification models, significantly improving the utility of social media data for situational awareness during emergencies. This research paves the way for more advanced, adaptable, and robust disaster management tools that can leverage the capabilities of LLMs to extract critical insights from social media in real-time, ultimately enhancing response strategies and saving lives in the face of disasters.

This summary was produced with help from an AI and may contain inaccuracies - check out the links to read the original source documents!

Follow @aimodelsfyi on 𝕏 →

Related Papers

💬

CrisisSense-LLM: Instruction Fine-Tuned Large Language Model for Multi-label Social Media Text Classification in Disaster Informatics

Kai Yin, Chengkai Liu, Ali Mostafavi, Xia Hu

In the field of crisis/disaster informatics, social media is increasingly being used for improving situational awareness to inform response and relief efforts. Efficient and accurate text classification tools have been a focal area of investigation in crisis informatics. However, current methods mostly rely on single-label text classification models, which fails to capture different insights embedded in dynamic and multifaceted disaster-related social media data. This study introduces a novel approach to disaster text classification by enhancing a pre-trained Large Language Model (LLM) through instruction fine-tuning targeted for multi-label classification of disaster-related tweets. Our methodology involves creating a comprehensive instruction dataset from disaster-related tweets, which is then used to fine-tune an open-source LLM, thereby embedding it with disaster-specific knowledge. This fine-tuned model can classify multiple aspects of disaster-related information simultaneously, such as the type of event, informativeness, and involvement of human aid, significantly improving the utility of social media data for situational awareness in disasters. The results demonstrate that this approach enhances the categorization of critical information from social media posts, thereby facilitating a more effective deployment for situational awareness during emergencies. This research paves the way for more advanced, adaptable, and robust disaster management tools, leveraging the capabilities of LLMs to improve real-time situational awareness and response strategies in disaster scenarios.

6/26/2024

Active Learning for Identifying Disaster-Related Tweets: A Comparison with Keyword Filtering and Generic Fine-Tuning

David Hanny, Sebastian Schmidt, Bernd Resch

Information from social media can provide essential information for emergency response during natural disasters in near real-time. However, it is difficult to identify the disaster-related posts among the large amounts of unstructured data available. Previous methods often use keyword filtering, topic modelling or classification-based techniques to identify such posts. Active Learning (AL) presents a promising sub-field of Machine Learning (ML) that has not been used much in the field of text classification of social media content. This study therefore investigates the potential of AL for identifying disaster-related Tweets. We compare a keyword filtering approach, a RoBERTa model fine-tuned with generic data from CrisisLex, a base RoBERTa model trained with AL and a fine-tuned RoBERTa model trained with AL regarding classification performance. For testing, data from CrisisLex and manually labelled data from the 2021 flood in Germany and the 2023 Chile forest fires were considered. The results show that generic fine-tuning combined with 10 rounds of AL outperformed all other approaches. Consequently, a broadly applicable model for the identification of disaster-related Tweets could be trained with very little labelling effort. The model can be applied to use cases beyond this study and provides a useful tool for further research in social media analysis.

8/20/2024

💬

Monitoring Critical Infrastructure Facilities During Disasters Using Large Language Models

Abdul Wahab Ziaullah, Ferda Ofli, Muhammad Imran

Critical Infrastructure Facilities (CIFs), such as healthcare and transportation facilities, are vital for the functioning of a community, especially during large-scale emergencies. In this paper, we explore a potential application of Large Language Models (LLMs) to monitor the status of CIFs affected by natural disasters through information disseminated in social media networks. To this end, we analyze social media data from two disaster events in two different countries to identify reported impacts to CIFs as well as their impact severity and operational status. We employ state-of-the-art open-source LLMs to perform computational tasks including retrieval, classification, and inference, all in a zero-shot setting. Through extensive experimentation, we report the results of these tasks using standard evaluation metrics and reveal insights into the strengths and weaknesses of LLMs. We note that although LLMs perform well in classification tasks, they encounter challenges with inference tasks, especially when the context/prompt is complex and lengthy. Additionally, we outline various potential directions for future exploration that can be beneficial during the initial adoption phase of LLMs for disaster response tasks.

4/24/2024

🏷️

QuakeBERT: Accurate Classification of Social Media Texts for Rapid Earthquake Impact Assessment

Jin Han, Zhe Zheng, Xin-Zheng Lu, Ke-Yin Chen, Jia-Rui Lin

Social media aids disaster response but suffers from noise, hindering accurate impact assessment and decision making for resilient cities, which few studies considered. To address the problem, this study proposes the first domain-specific LLM model and an integrated method for rapid earthquake impact assessment. First, a few categories are introduced to classify and filter microblogs considering their relationship to the physical and social impacts of earthquakes, and a dataset comprising 7282 earthquake-related microblogs from twenty earthquakes in different locations is developed as well. Then, with a systematic analysis of various influential factors, QuakeBERT, a domain-specific large language model (LLM), is developed and fine-tuned for accurate classification and filtering of microblogs. Meanwhile, an integrated method integrating public opinion trend analysis, sentiment analysis, and keyword-based physical impact quantification is introduced to assess both the physical and social impacts of earthquakes based on social media texts. Experiments show that data diversity and data volume dominate the performance of QuakeBERT and increase the macro average F1 score by 27%, while the best classification model QuakeBERT outperforms the CNN- or RNN-based models by improving the macro average F1 score from 60.87% to 84.33%. Finally, the proposed approach is applied to assess two earthquakes with the same magnitude and focal depth. Results show that the proposed approach can effectively enhance the impact assessment process by accurate detection of noisy microblogs, which enables effective post-disaster emergency responses to create more resilient cities.

5/14/2024