SemEval Task 1: Semantic Textual Relatedness for African and Asian Languages

Read original: arXiv:2403.18933 - Published 4/19/2024 by Nedjma Ousidhoum, Shamsuddeen Hassan Muhammad, Mohamed Abdalla, Idris Abdulmumin, Ibrahim Said Ahmad, Sanchit Ahuja, Alham Fikri Aji, Vladimir Araujo, Meriem Beloucif, Christine De Kock and 7 others

Overview

Presents the first shared task on Semantic Textual Relatedness (STR)
Focuses on semantic relatedness, rather than just similarity, across 14 languages
Includes languages from 5 distinct language families, primarily spoken in Africa and Asia
Each dataset instance is a sentence pair with a score representing their semantic relatedness
Participants competed in supervised, unsupervised, and cross-lingual tracks
163 participants, 70 submissions from 51 teams, and 38 system description papers

Plain English Explanation

This paper describes a new competition focused on understanding the relationship between sentences, rather than just how similar they are. It looks at 14 different languages, including ones commonly spoken in Africa and Asia, where there are relatively few language processing resources available.

Each dataset has sentence pairs, and a score that shows how closely related the two sentences are in meaning. Teams competed in three main tracks - one where they could use training data, one where they couldn't, and one where they had to work across different languages.

A large number of teams and participants got involved, submitting a variety of approaches and detailed descriptions of their systems. The paper summarizes the best-performing methods and the most common techniques used across the different competition tracks.

Technical Explanation

The shared task investigated semantic textual relatedness, a broader notion than the semantic similarity that earlier shared tasks focused on. It covered 14 languages: Afrikaans, Algerian Arabic, Amharic, English, Hausa, Hindi, Indonesian, Kinyarwanda, Marathi, Moroccan Arabic, Modern Standard Arabic, Punjabi, Spanish, and Telugu. These languages span 5 distinct language families and are predominantly spoken in Africa and Asia.

For each language, the dataset contained sentence pairs with a score representing the degree of semantic relatedness between them. The competition had three main tracks: supervised, unsupervised, and cross-lingual. Participants were tasked with ranking the sentence pairs based on their relatedness.
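As a rough illustration of what "ranking sentence pairs by relatedness" involves, the sketch below scores each pair with a simple token-overlap (Dice) measure, sorts the pairs by that score, and compares the predicted ordering with gold scores using Spearman rank correlation. Everything in it is an assumption made for illustration: the sentence pairs and gold scores are invented, the function names are hypothetical, the overlap measure is a stand-in rather than any participant's system, and this summary does not state which evaluation metric the organisers used.

```python
from typing import List


def dice_overlap(s1: str, s2: str) -> float:
    """Dice coefficient over lowercased whitespace tokens (0 = nothing shared, 1 = same token set)."""
    a, b = set(s1.lower().split()), set(s2.lower().split())
    if not a or not b:
        return 0.0
    return 2 * len(a & b) / (len(a) + len(b))


def average_ranks(values: List[float]) -> List[float]:
    """Return 1-based ranks; tied values share the mean of their positions."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    ranks = [0.0] * len(values)
    i = 0
    while i < len(order):
        j = i
        # Extend j over the run of values tied with position i.
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        mean_rank = (i + j) / 2 + 1
        for k in range(i, j + 1):
            ranks[order[k]] = mean_rank
        i = j + 1
    return ranks


def spearman(x: List[float], y: List[float]) -> float:
    """Spearman correlation, computed as the Pearson correlation of the rank vectors."""
    rx, ry = average_ranks(x), average_ranks(y)
    n = len(rx)
    mx, my = sum(rx) / n, sum(ry) / n
    cov = sum((a - mx) * (b - my) for a, b in zip(rx, ry))
    vx = sum((a - mx) ** 2 for a in rx)
    vy = sum((b - my) ** 2 for b in ry)
    if vx == 0 or vy == 0:
        return 0.0  # correlation is undefined for a constant vector
    return cov / (vx * vy) ** 0.5


# Invented sentence pairs with invented gold relatedness scores.
pairs = [
    ("the cat sat on the mat", "the cat sat on the rug", 0.9),
    ("the cat sat on the mat", "a dog slept on the mat", 0.6),
    ("the cat sat on the mat", "stock prices fell sharply today", 0.1),
]

predicted = [dice_overlap(a, b) for a, b, _ in pairs]
gold = [g for _, _, g in pairs]

# Indices of the pairs, most related first according to the overlap score.
ranking = sorted(range(len(pairs)), key=lambda i: predicted[i], reverse=True)

print(ranking)
print(spearman(predicted, gold))
```

Because only the ordering of pairs matters for a rank-based comparison, a system's raw scores do not need to be on the same scale as the gold scores.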

The shared task attracted substantial participation: 163 participants, with 70 submissions in total from 51 different teams. The organisers also received 38 system description papers.

Critical Analysis

The paper provides a comprehensive overview of this new shared task on semantic textual relatedness across a diverse set of languages. The inclusion of under-resourced languages from Africa and Asia is a particularly valuable contribution, as it helps advance NLP capabilities in regions that have historically been underserved.

However, the paper does not delve into potential limitations or caveats of the task design or dataset. For example, it's unclear how the relatedness scores were constructed, and whether there were any biases or inconsistencies in the human annotations. Additionally, the paper does not critically examine the generalizability of the top-performing approaches, or discuss how they might fare on real-world applications beyond the specific task.

Further research could investigate the robustness and efficiency of the proposed techniques, particularly in low-resource settings. Analyzing the errors and failure cases of the systems could also yield insights to improve the general flexibility and versatility of semantic relatedness models.

Conclusion

This paper presents the first shared task on Semantic Textual Relatedness, which expands beyond just measuring semantic similarity to investigate the broader concept of how related sentences are in meaning. By including 14 diverse languages, the task advances NLP capabilities in underserved regions of the world.

The large-scale participation and variety of approaches showcased in this work demonstrate the research community's interest in this problem. The summarized best-performing techniques and insights can inform the development of more robust and versatile semantic understanding models, with the potential to benefit a wide range of real-world applications.



Related Papers

SemEval Task 1: Semantic Textual Relatedness for African and Asian Languages

Nedjma Ousidhoum, Shamsuddeen Hassan Muhammad, Mohamed Abdalla, Idris Abdulmumin, Ibrahim Said Ahmad, Sanchit Ahuja, Alham Fikri Aji, Vladimir Araujo, Meriem Beloucif, Christine De Kock, Oumaima Hourrane, Manish Shrivastava, Thamar Solorio, Nirmal Surange, Krishnapriya Vishnubhotla, Seid Muhie Yimam, Saif M. Mohammad

We present the first shared task on Semantic Textual Relatedness (STR). While earlier shared tasks primarily focused on semantic similarity, we instead investigate the broader phenomenon of semantic relatedness across 14 languages: Afrikaans, Algerian Arabic, Amharic, English, Hausa, Hindi, Indonesian, Kinyarwanda, Marathi, Moroccan Arabic, Modern Standard Arabic, Punjabi, Spanish, and Telugu. These languages originate from five distinct language families and are predominantly spoken in Africa and Asia -- regions characterised by the relatively limited availability of NLP resources. Each instance in the datasets is a sentence pair associated with a score that represents the degree of semantic textual relatedness between the two sentences. Participating systems were asked to rank sentence pairs by their closeness in meaning (i.e., their degree of semantic relatedness) in the 14 languages in three main tracks: (a) supervised, (b) unsupervised, and (c) crosslingual. The task attracted 163 participants. We received 70 submissions in total (across all tasks) from 51 different teams, and 38 system description papers. We report on the best-performing systems as well as the most common and the most effective approaches for the three different tracks.

4/19/2024

UMBCLU at SemEval-2024 Task 1A and 1C: Semantic Textual Relatedness with and without machine translation

Shubhashis Roy Dipta, Sai Vallurupalli

The aim of SemEval-2024 Task 1, Semantic Textual Relatedness for African and Asian Languages, is to develop models for identifying semantic textual relatedness (STR) between two sentences using multiple languages (14 African and Asian languages) and settings (supervised, unsupervised, and cross-lingual). Large language models (LLMs) have shown impressive performance on several natural language understanding tasks such as multilingual machine translation (MMT), semantic similarity (STS), and encoding sentence embeddings. Using a combination of LLMs that perform well on these tasks, we developed two STR models, TranSem and FineSem, for the supervised and cross-lingual settings. We explore the effectiveness of several training methods and the usefulness of machine translation. We find that direct fine-tuning on the task is comparable to using sentence embeddings, and that translating to English leads to better performance for some languages. In the supervised setting, our model performance is better than the official baseline for 3 languages with the remaining 4 performing on par. In the cross-lingual setting, our model performance is better than the baseline for 3 languages (leading to 1st place for Afrikaans and 2nd place for Indonesian), is on par for 2 languages and performs poorly on the remaining 7 languages. Our code is publicly available at https://github.com/dipta007/SemEval24-Task8.

4/15/2024

Multilingual Evaluation of Semantic Textual Relatedness

Sharvi Endait, Srushti Sonavane, Ridhima Sinare, Pritika Rohera, Advait Naik, Dipali Kadam

The explosive growth of online content demands robust Natural Language Processing (NLP) techniques that can capture nuanced meanings and cultural context across diverse languages. Semantic Textual Relatedness (STR) goes beyond superficial word overlap, considering linguistic elements and non-linguistic factors like topic, sentiment, and perspective. Despite its pivotal role, prior NLP research has predominantly focused on English, limiting its applicability across languages. Addressing this gap, our paper dives into capturing deeper connections between sentences beyond simple word overlap. Going beyond English-centric NLP research, we explore STR in Marathi, Hindi, Spanish, and English, unlocking the potential for information retrieval, machine translation, and more. Leveraging the SemEval-2024 shared task, we explore various language models across three learning paradigms: supervised, unsupervised, and cross-lingual. Our comprehensive methodology gains promising results, demonstrating the effectiveness of our approach. This work aims to not only showcase our achievements but also inspire further research in multilingual STR, particularly for low-resourced languages.

4/16/2024

SemRel2024: A Collection of Semantic Textual Relatedness Datasets for 13 Languages

Nedjma Ousidhoum, Shamsuddeen Hassan Muhammad, Mohamed Abdalla, Idris Abdulmumin, Ibrahim Said Ahmad, Sanchit Ahuja, Alham Fikri Aji, Vladimir Araujo, Abinew Ali Ayele, Pavan Baswani, Meriem Beloucif, Chris Biemann, Sofia Bourhim, Christine De Kock, Genet Shanko Dekebo, Oumaima Hourrane, Gopichand Kanumolu, Lokesh Madasu, Samuel Rutunda, Manish Shrivastava, Thamar Solorio, Nirmal Surange, Hailegnaw Getaneh Tilaye, Krishnapriya Vishnubhotla, Genta Winata, Seid Muhie Yimam, Saif M. Mohammad

Exploring and quantifying semantic relatedness is central to representing language and holds significant implications across various NLP tasks. While earlier NLP research primarily focused on semantic similarity, often within the English language context, we instead investigate the broader phenomenon of semantic relatedness. In this paper, we present SemRel, a new semantic relatedness dataset collection annotated by native speakers across 13 languages: Afrikaans, Algerian Arabic, Amharic, English, Hausa, Hindi, Indonesian, Kinyarwanda, Marathi, Moroccan Arabic, Modern Standard Arabic, Spanish, and Telugu. These languages originate from five distinct language families and are predominantly spoken in Africa and Asia -- regions characterised by a relatively limited availability of NLP resources. Each instance in the SemRel datasets is a sentence pair associated with a score that represents the degree of semantic textual relatedness between the two sentences. The scores are obtained using a comparative annotation framework. We describe the data collection and annotation processes, challenges when building the datasets, baseline experiments, and their impact and utility in NLP.

6/3/2024