A Survey on Contextualised Semantic Shift Detection

Read original: arXiv:2304.01666 - Published 6/12/2024 by Stefano Montanelli, Francesco Periti

🔎

Overview

This paper discusses the task of Semantic Shift Detection (SSD), which is the process of identifying, interpreting, and assessing changes in the meanings of words over time.
Traditionally, SSD has been a manual and time-consuming process carried out by linguists and social scientists.
In recent years, computational approaches using Natural Language Processing and word embeddings have gained attention as a way to automate the SSD process.
Particularly over the past three years, significant advancements in SSD have been made using contextualized embedding models, which can better handle the multiple usages and meanings of words and capture related semantic shifts.

Plain English Explanation

The paper discusses a task called Semantic Shift Detection (SSD), which is the process of identifying and understanding how the meanings of words change over time. Traditionally, this task has been done manually by linguists and social scientists, which can be very time-consuming.

However, in recent years, researchers have started using [object Object] based on [object Object] and [object Object] to try to automate the SSD process as much as possible.

In particular, over the last three years, there have been significant advancements in SSD using contextualized embedding models. These models can better handle the multiple meanings that words can have and more accurately capture the semantic shifts that occur over time.

Technical Explanation

This paper proposes a classification framework for approaches to Contextual Semantic Shift Detection (CSSDetection), which is the use of contextualized embedding models for the SSD task. The framework is characterized by three key dimensions:

Meaning Representation: How the models represent and capture the different meanings and usages of words.
Time-Awareness: How the models take into account the temporal aspect of semantic shifts.
Learning Modality: Whether the models use supervised, unsupervised, or semi-supervised learning techniques.

The paper uses this framework to:

Review the measures that have been proposed for assessing the degree of semantic shift.
Compare the performance of different CSSDetection approaches.
Discuss the current challenges in terms of scalability, interpretability, and robustness of these approaches.

Critical Analysis

The paper highlights several open challenges and areas for future research in CSSDetection:

Scalability: Current approaches may not scale well to large vocabularies or long time periods due to computational constraints.
Interpretability: It can be difficult to interpret the semantic shifts detected by the models and understand the underlying reasons for the changes.
Robustness: The models may not be robust to factors like domain shifts or noisy data, which could affect the reliability of the semantic shift detection.

The paper also notes that most of the recent advancements in CSSDetection have been based on contextualized embedding models, and there may be opportunities to explore other types of models or approaches that could provide complementary benefits.

Conclusion

This paper provides a comprehensive survey of the state-of-the-art in Contextual Semantic Shift Detection (CSSDetection), which uses advanced natural language processing techniques to automatically identify and analyze changes in word meanings over time. The proposed classification framework offers a structured way to understand and compare the different approaches, while also highlighting key challenges and areas for future research in this important and rapidly evolving field.

This summary was produced with help from an AI and may contain inaccuracies - check out the links to read the original source documents!

Follow @aimodelsfyi on 𝕏 →

Related Papers

🔎

A Survey on Contextualised Semantic Shift Detection

Stefano Montanelli, Francesco Periti

Semantic Shift Detection (SSD) is the task of identifying, interpreting, and assessing the possible change over time in the meanings of a target word. Traditionally, SSD has been addressed by linguists and social scientists through manual and time-consuming activities. In the recent years, computational approaches based on Natural Language Processing and word embeddings gained increasing attention to automate SSD as much as possible. In particular, over the past three years, significant advancements have been made almost exclusively based on word contextualised embedding models, which can handle the multiple usages/meanings of the words and better capture the related semantic shifts. In this paper, we survey the approaches based on contextualised embeddings for SSD (i.e., CSSDetection) and we propose a classification framework characterised by meaning representation, time-awareness, and learning modality dimensions. The framework is exploited i) to review the measures for shift assessment, ii) to compare the approaches on performance, and iii) to discuss the current issues in terms of scalability, interpretability, and robustness. Open challenges and future research directions about CSSDetection are finally outlined.

6/12/2024

Historical Ink: Semantic Shift Detection for 19th Century Spanish

Tony Montes, Laura Manrique-G'omez, Rub'en Manrique

This paper explores the evolution of word meanings in 19th-century Spanish texts, with an emphasis on Latin American Spanish, using computational linguistics techniques. It addresses the Semantic Shift Detection (SSD) task, which is crucial for understanding linguistic evolution, particularly in historical contexts. The study focuses on analyzing a set of Spanish target words. To achieve this, a 19th-century Spanish corpus is constructed, and a customizable pipeline for SSD tasks is developed. This pipeline helps find the senses of a word and measure their semantic change between two corpora using fine-tuned BERT-like models with old Spanish texts for both Latin American and general Spanish cases. The results provide valuable insights into the cultural and societal shifts reflected in language changes over time.

7/22/2024

A Semantic Distance Metric Learning approach for Lexical Semantic Change Detection

Taichi Aida, Danushka Bollegala

Detecting temporal semantic changes of words is an important task for various NLP applications that must make time-sensitive predictions. Lexical Semantic Change Detection (SCD) task involves predicting whether a given target word, $w$, changes its meaning between two different text corpora, $C_1$ and $C_2$. For this purpose, we propose a supervised two-staged SCD method that uses existing Word-in-Context (WiC) datasets. In the first stage, for a target word $w$, we learn two sense-aware encoders that represent the meaning of $w$ in a given sentence selected from a corpus. Next, in the second stage, we learn a sense-aware distance metric that compares the semantic representations of a target word across all of its occurrences in $C_1$ and $C_2$. Experimental results on multiple benchmark datasets for SCD show that our proposed method achieves strong performance in multiple languages. Additionally, our method achieves significant improvements on WiC benchmarks compared to a sense-aware encoder with conventional distance functions. Source code is available at https://github.com/LivNLP/svp-sdml .

6/4/2024

Definition generation for lexical semantic change detection

Mariia Fedorova, Andrey Kutuzov, Yves Scherrer

We use contextualized word definitions generated by large language models as semantic representations in the task of diachronic lexical semantic change detection (LSCD). In short, generated definitions are used as `senses', and the change score of a target word is retrieved by comparing their distributions in two time periods under comparison. On the material of five datasets and three languages, we show that generated definitions are indeed specific and general enough to convey a signal sufficient to rank sets of words by the degree of their semantic change over time. Our approach is on par with or outperforms prior non-supervised sense-based LSCD methods. At the same time, it preserves interpretability and allows to inspect the reasons behind a specific shift in terms of discrete definitions-as-senses. This is another step in the direction of explainable semantic change modeling.

8/1/2024