ViHateT5: Enhancing Hate Speech Detection in Vietnamese With A Unified Text-to-Text Transformer Model

Read original: arXiv:2405.14141 - Published 6/5/2024 by Luan Thanh Nguyen

Overview

  • This research paper focuses on advancements in hate speech detection (HSD) for the Vietnamese language.
  • The researchers introduce ViHateT5, a T5-based model pre-trained on a large-scale, domain-specific dataset called VOZ-HSD.
  • ViHateT5 can tackle multiple HSD tasks using a unified model and achieve state-of-the-art performance on standard Vietnamese HSD benchmarks.
  • The researchers highlight how strongly the label distribution of the pre-training data affects model efficacy.

Plain English Explanation

Detecting hate speech online is an important task, and it is particularly challenging for languages like Vietnamese. The researchers in this study made progress by developing a new model called ViHateT5. It is based on T5, a text-to-text transformer architecture that frames every task as turning input text into output text, which lets a single model handle several tasks.

To train ViHateT5, the researchers created a large-scale dataset called VOZ-HSD, which is specifically focused on hate speech in the Vietnamese context. This domain-specific data is crucial because hate speech on online platforms can be quite different from the formal language used in resources like Wikipedia, which are often used to train language models.

By pre-training ViHateT5 on the VOZ-HSD dataset, the researchers were able to create a model that performs exceptionally well on a range of standard Vietnamese hate speech detection benchmarks. This is an important advancement, as it means that a single ViHateT5 model can be used to tackle multiple hate speech-related tasks, rather than requiring separate models for each task.

The researchers also found that the distribution of labels (e.g., the balance between hate speech and non-hate speech examples) in the pre-training data plays a crucial role in the model's effectiveness. This suggests that carefully curating the training data is key to developing high-performing hate speech detection systems.

Technical Explanation

The researchers observe that recent progress in Vietnamese hate speech detection (HSD) has come mainly from transformer-based pre-trained language models, particularly those built on BERT. However, the need for a separate, specialized fine-tuned model per task has made building a multitasking HSD system complex and fragmented. They also note that most current methodologies fine-tune general pre-trained models, primarily trained on formal textual datasets like Wikipedia, which may not accurately capture human behavior on online platforms.

To address these challenges, the researchers introduced ViHateT5, a T5-based model pre-trained on their proposed large-scale, domain-specific dataset named VOZ-HSD. By leveraging the text-to-text architecture of the T5 model, ViHateT5 can tackle multiple HSD tasks using a unified model and achieve state-of-the-art performance across all standard Vietnamese HSD benchmarks.
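The unified text-to-text idea can be illustrated with a minimal Python sketch. It shows how examples from different HSD tasks might be converted into (input text, target text) pairs so that one sequence-to-sequence model can be trained on all of them. The task names, prefixes, and label strings here are hypothetical placeholders for illustration; the summary does not state the prompt format ViHateT5 actually uses.

```python
from dataclasses import dataclass

# Hypothetical task prefixes, chosen for illustration only.
# The actual prefixes used by ViHateT5 are not given in this summary.
TASK_PREFIXES = {
    "hate_speech": "hate-speech-detection",
    "toxic_speech": "toxic-speech-detection",
    "hate_spans": "hate-spans-detection",
}


@dataclass(frozen=True)
class Seq2SeqExample:
    source: str  # text fed to the encoder
    target: str  # text the decoder is trained to generate


def to_text_to_text(task: str, comment: str, label: str) -> Seq2SeqExample:
    """Cast one labeled comment into a text-to-text training pair.

    The task prefix tells the shared model which task to perform,
    and the label is emitted as plain text rather than a class index.
    """
    prefix = TASK_PREFIXES[task]
    return Seq2SeqExample(source=f"{prefix}: {comment}", target=label)


# Examples from different tasks end up in one uniform training set.
mixed_batch = [
    to_text_to_text("hate_speech", "<comment A>", "CLEAN"),
    to_text_to_text("toxic_speech", "<comment B>", "TOXIC"),
]

for example in mixed_batch:
    print(example.source, "->", example.target)
```

Because every task shares the same input and output type (text), adding a new task in this sketch only means adding a new prefix, not a new model head.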

The researchers' experiments also show that the label distribution of the pre-training data has a significant effect on the model's efficacy. This finding underscores the importance of carefully curating the training data to ensure that it accurately represents the nuances of hate speech in the Vietnamese context.
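As a rough illustration of what inspecting and adjusting label distribution can look like in practice, the following Python sketch measures the share of each label in a dataset and downsamples the majority class to a chosen ratio. The label names, the toy data, and the downsampling strategy are assumptions made for this example; the summary does not say which distribution worked best or how the authors controlled it.

```python
import random
from collections import Counter


def label_distribution(labels):
    """Return the fraction of examples carrying each label."""
    counts = Counter(labels)
    total = sum(counts.values())
    return {label: n / total for label, n in counts.items()}


def downsample_majority(examples, majority_label, ratio, seed=0):
    """Keep every non-majority example and at most
    ratio * (number of non-majority examples) majority examples.

    `examples` is a list of (text, label) pairs.
    """
    rng = random.Random(seed)  # fixed seed for reproducible sampling
    majority = [ex for ex in examples if ex[1] == majority_label]
    others = [ex for ex in examples if ex[1] != majority_label]
    keep = min(len(majority), int(ratio * len(others)))
    return others + rng.sample(majority, keep)


# Toy data with hypothetical labels: 90 clean comments, 10 hateful ones.
toy = [(f"clean-{i}", "CLEAN") for i in range(90)]
toy += [(f"hate-{i}", "HATE") for i in range(10)]

print(label_distribution(label for _, label in toy))

balanced = downsample_majority(toy, majority_label="CLEAN", ratio=1.0)
print(label_distribution(label for _, label in balanced))
```

Downsampling is only one option; oversampling the minority class or reweighting the loss are common alternatives with different trade-offs.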

To facilitate further research, the researchers have made their experimental materials publicly available, including the VOZ-HSD dataset, the pre-trained ViHateT5 checkpoint, and the related source code on GitHub.

Critical Analysis

While the researchers have made significant progress in developing a powerful and versatile HSD system for the Vietnamese language, there are a few areas that could benefit from further exploration.

One potential limitation is the reliance on pre-training on a single, domain-specific dataset (VOZ-HSD). Although this approach has yielded impressive results, it remains to be seen how ViHateT5 performs when fine-tuned on a more diverse set of datasets, such as those for Vietnamese AI-generated text detection or anti-semitic hate speech detection. Such tests would help assess the model's ability to generalize beyond the specific context of the VOZ-HSD dataset.

Additionally, the researchers could explore the potential of using smaller, more efficient models for hate speech detection in Vietnamese, as this could make the technology more widely accessible and deployable, particularly in resource-constrained environments.

Finally, a more detailed comparison against other state-of-the-art methods for Vietnamese HSD, going beyond headline benchmark scores, could provide a more comprehensive understanding of the strengths and limitations of ViHateT5.

Conclusion

The introduction of ViHateT5, a T5-based model pre-trained on the large-scale, domain-specific VOZ-HSD dataset, represents a significant advancement in hate speech detection for the Vietnamese language. By leveraging a unified, text-to-text architecture, ViHateT5 can tackle multiple HSD tasks with state-of-the-art performance, addressing the complexity and fragmentation that have plagued previous approaches.

The researchers' findings on the importance of label distribution in the pre-training data highlight the need for careful curation of datasets to ensure that hate speech detection models accurately capture the nuances of online discourse. This insight could have broader implications for the development of effective language technology solutions for other languages and tasks.

Overall, this research contributes valuable insights and practical tools to the ongoing effort to combat hate speech and create safer, more inclusive online spaces, particularly for the Vietnamese-speaking community.



This summary was produced with help from an AI and may contain inaccuracies - check out the links to read the original source documents!

Related Papers

ViHateT5: Enhancing Hate Speech Detection in Vietnamese With A Unified Text-to-Text Transformer Model

Luan Thanh Nguyen

Recent advancements in hate speech detection (HSD) in Vietnamese have made significant progress, primarily attributed to the emergence of transformer-based pre-trained language models, particularly those built on the BERT architecture. However, the necessity for specialized fine-tuned models has resulted in the complexity and fragmentation of developing a multitasking HSD system. Moreover, most current methodologies focus on fine-tuning general pre-trained models, primarily trained on formal textual datasets like Wikipedia, which may not accurately capture human behavior on online platforms. In this research, we introduce ViHateT5, a T5-based model pre-trained on our proposed large-scale domain-specific dataset named VOZ-HSD. By harnessing the power of a text-to-text architecture, ViHateT5 can tackle multiple tasks using a unified model and achieve state-of-the-art performance across all standard HSD benchmarks in Vietnamese. Our experiments also underscore the significance of label distribution in pre-training data on model efficacy. We provide our experimental materials for research purposes, including the VOZ-HSD dataset, pre-trained checkpoint, the unified HSD-multitask ViHateT5 model, and related source code on GitHub publicly.

6/5/2024

Exploiting Hatred by Targets for Hate Speech Detection on Vietnamese Social Media Texts

Cuong Nhat Vo, Khanh Bao Huynh, Son T. Luu, Trong-Hop Do

The growth of social networks makes toxic content spread rapidly. Hate speech detection is a task to help decrease the number of harmful comments. With the diversity in the hate speech created by users, it is necessary to interpret the hate speech besides detecting it. Hence, we propose a methodology to construct a system for targeted hate speech detection from online streaming texts from social media. We first introduce the ViTHSD - a targeted hate speech detection dataset for Vietnamese Social Media Texts. The dataset contains 10K comments, each comment is labeled to specific targets with three levels: clean, offensive, and hate. There are 5 targets in the dataset, and each target is labeled with the corresponding level manually by humans with strict annotation guidelines. The inter-annotator agreement obtained from the dataset is 0.45 by Cohen's Kappa index, which is indicated as a moderate level. Then, we construct a baseline for this task by combining the Bi-GRU-LSTM-CNN with the pre-trained language model to leverage the power of text representation of BERTology. Finally, we suggest a methodology to integrate the baseline model for targeted hate speech detection into the online streaming system for practical application in preventing hateful and offensive content on social media.

5/1/2024

Automatic Textual Normalization for Hate Speech Detection

Anh Thi-Hoang Nguyen, Dung Ha Nguyen, Nguyet Thi Nguyen, Khanh Thanh-Duy Ho, Kiet Van Nguyen

Social media data is a valuable resource for research, yet it contains a wide range of non-standard words (NSW). These irregularities hinder the effective operation of NLP tools. Current state-of-the-art methods for the Vietnamese language address this issue as a problem of lexical normalization, involving the creation of manual rules or the implementation of multi-staged deep learning frameworks, which necessitate extensive efforts to craft intricate rules. In contrast, our approach is straightforward, employing solely a sequence-to-sequence (Seq2Seq) model. In this research, we provide a dataset for textual normalization, comprising 2,181 human-annotated comments with an inter-annotator agreement of 0.9014. By leveraging the Seq2Seq model for textual normalization, our results reveal that the accuracy achieved falls slightly short of 70%. Nevertheless, textual normalization enhances the accuracy of the Hate Speech Detection (HSD) task by approximately 2%, demonstrating its potential to improve the performance of complex NLP tasks. Our dataset is accessible for research purposes.

7/26/2024

Vietnamese AI Generated Text Detection

Quang-Dan Tran, Van-Quan Nguyen, Quang-Huy Pham, K. B. Thang Nguyen, Trong-Hop Do

In recent years, Large Language Models (LLMs) have become integrated into our daily lives, serving as invaluable assistants in completing tasks. Widely embraced by users, the abuse of LLMs is inevitable, particularly in using them to generate text content for various purposes, leading to difficulties in distinguishing between text generated by LLMs and that written by humans. In this study, we present a dataset named ViDetect, comprising 6,800 samples of Vietnamese essays, with 3,400 samples authored by humans and the remainder generated by LLMs, serving the purpose of detecting text generated by AI. We conducted evaluations using state-of-the-art methods, including ViT5, BartPho, PhoBERT, mDeberta V3, and mBERT. These results contribute not only to the growing body of research on detecting text generated by AI but also demonstrate the adaptability and effectiveness of different methods in the Vietnamese language context. This research lays the foundation for future advancements in AI-generated text detection and provides valuable insights for researchers in the field of natural language processing.

5/7/2024