An Embedded Diachronic Sense Change Model with a Case Study from Ancient Greek

Read original: arXiv:2311.00541 - Published 6/19/2024 by Schyan Zafar, Geoff K. Nicholls

Overview

  • Word meanings change over time, and word senses evolve, emerge, or die out in the process
  • Modeling such changes accurately is challenging, especially for ancient languages with sparse corpora
  • Quantifying uncertainty in sense-change estimates is important in these cases
  • GASC (Genre-Aware Semantic Change) and DiSC (Diachronic Sense Change) are existing generative models that have been used to analyze sense change for target words in an ancient Greek text corpus, using unsupervised learning without pre-training
  • These models represent word senses as distributions over context words and sense prevalence as a distribution over senses
  • The models are fitted using Markov Chain Monte Carlo (MCMC) methods to measure temporal changes in these representations

Plain English Explanation

Words can change in meaning over time, and new word senses can emerge while others disappear. This process of sense change is particularly hard to model accurately for ancient languages, where the available corpora are often small and sparse.

Quantifying the uncertainty in sense-change estimates matters because it shows how reliable the findings are. Two existing models, GASC and DiSC, have been used to analyze sense change in an ancient Greek text corpus. These models represent each sense of a target word, such as "kosmos" (which can mean "decoration," "order," or "world"), as a distribution over the context words that appear around the target word when it is used in that sense. They also represent sense prevalence, meaning how common each sense is, as a distribution over senses.
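The representation described above can be made concrete with a small generative sketch. This is only an illustration under stated assumptions: the context words, the probabilities, the period names, and the function `sample_snippet` are all hypothetical and do not come from the paper or its corpus. The sketch draws a sense from a period's prevalence distribution, then draws context words from that sense's distribution.

```python
import random

# Hypothetical toy setup for the target word "kosmos".
# All probabilities below are invented for illustration only.
SENSES = ["decoration", "order", "world"]

# Each sense is a distribution over context words.
CONTEXT_WORDS = {
    "decoration": {"gold": 0.5, "woman": 0.3, "city": 0.1, "heaven": 0.1},
    "order":      {"gold": 0.1, "woman": 0.1, "city": 0.6, "heaven": 0.2},
    "world":      {"gold": 0.05, "woman": 0.05, "city": 0.2, "heaven": 0.7},
}

# Sense prevalence: one distribution over senses per (hypothetical) time period.
PREVALENCE = {
    "early": [0.6, 0.3, 0.1],
    "late":  [0.1, 0.3, 0.6],
}

def sample_snippet(period, length, rng):
    """Draw a sense from the period's prevalence, then context words from that sense."""
    sense = rng.choices(SENSES, weights=PREVALENCE[period], k=1)[0]
    words = list(CONTEXT_WORDS[sense])
    weights = [CONTEXT_WORDS[sense][w] for w in words]
    return sense, rng.choices(words, weights=weights, k=length)

rng = random.Random(0)
for period in PREVALENCE:
    print(period, sample_snippet(period, 5, rng))
```

In this toy setup the prevalence of "world" rises from the early to the late period, which is the kind of temporal change such models aim to estimate. In practice the direction is reversed: the snippets are observed and the distributions are inferred.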

The researchers fit these models using a statistical technique called Markov Chain Monte Carlo (MCMC), which allows them to measure how these representations of word senses change over time.
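To show how MCMC yields both an estimate and its uncertainty, here is a minimal Gibbs sampler for the sense prevalence in a single time period. It is a sketch under strong assumptions and not the sampler used in the paper: the sense-to-context-word distributions in `PSI` are treated as known, the prior on prevalence is a flat Dirichlet, and all numbers and snippets are invented.

```python
import math
import random

# Hypothetical, fixed sense-to-context-word distributions (invented numbers).
PSI = [
    {"gold": 0.6, "city": 0.3, "heaven": 0.1},   # sense 0
    {"gold": 0.1, "city": 0.3, "heaven": 0.6},   # sense 1
]

def sample_dirichlet(alphas, rng):
    """Sample from a Dirichlet distribution via normalised gamma draws."""
    draws = [rng.gammavariate(a, 1.0) for a in alphas]
    total = sum(draws)
    return [d / total for d in draws]

def gibbs_prevalence(snippets, n_iter, rng, prior=1.0):
    """Alternate between sampling sense labels and sampling sense prevalence."""
    k = len(PSI)
    phi = [1.0 / k] * k
    samples = []
    for _ in range(n_iter):
        counts = [0] * k
        for snippet in snippets:
            # Unnormalised probability of each sense given the snippet's words.
            weights = [phi[s] * math.prod(PSI[s][w] for w in snippet) for s in range(k)]
            z = rng.choices(range(k), weights=weights, k=1)[0]
            counts[z] += 1
        # Conjugate update: Dirichlet posterior given the sampled labels.
        phi = sample_dirichlet([prior + c for c in counts], rng)
        samples.append(phi)
    return samples

rng = random.Random(1)
snippets = [["gold", "gold", "city"]] * 6 + [["heaven", "heaven", "city"]] * 2
draws = gibbs_prevalence(snippets, 2000, rng)[500:]   # discard burn-in
sense0 = sorted(d[0] for d in draws)
mean = sum(sense0) / len(sense0)
low, high = sense0[int(0.025 * len(sense0))], sense0[int(0.975 * len(sense0))]
print(f"prevalence of sense 0: mean={mean:.2f}, 95% interval=({low:.2f}, {high:.2f})")
```

With only eight snippets the interval is wide, which illustrates why uncertainty quantification matters for sparse corpora.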

Technical Explanation

The paper introduces EDiSC, an Embedded DiSC model, which combines word embeddings with the DiSC model. The authors show empirically that, compared with the original DiSC model, EDiSC offers better predictive accuracy, better recovery of ground truth, and better uncertainty quantification. EDiSC also has better sampling efficiency and scalability when fitted with MCMC methods.
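This summary does not say how the embeddings enter the model. One common construction, assumed here purely for illustration, gives each sense a vector in the same space as the context-word embeddings and obtains the sense's context-word distribution as a softmax over inner products. The embeddings, the sense vector, and the function `sense_distribution` below are hypothetical.

```python
import math

# Hypothetical 2-dimensional embeddings for context words (invented values).
WORD_EMBEDDINGS = {
    "gold":   [1.0, 0.0],
    "jewel":  [0.9, 0.1],
    "heaven": [0.0, 1.0],
}

def softmax(scores):
    """Numerically stable softmax."""
    top = max(scores)
    exps = [math.exp(s - top) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]

def sense_distribution(sense_vector):
    """Turn a sense embedding into a distribution over context words."""
    words = list(WORD_EMBEDDINGS)
    scores = [
        sum(a * b for a, b in zip(sense_vector, WORD_EMBEDDINGS[w]))
        for w in words
    ]
    return dict(zip(words, softmax(scores)))

decoration = sense_distribution([2.0, 0.0])   # hypothetical sense embedding
print(decoration)
```

Under this assumption, words with similar embeddings ("gold" and "jewel") receive similar probabilities, so a sense can assign sensible weight to a context word that is rare in the corpus. Each sense also needs only one low-dimensional vector rather than one free parameter per vocabulary word, which is one plausible reason an embedded model could sample more efficiently.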

The authors also discuss the challenges of fitting these models, which are most acute for ancient languages with small, sparse corpora.

Critical Analysis

The paper provides a valuable contribution to the field of lexical semantic change analysis, particularly for ancient languages. The authors acknowledge the inherent challenges in modeling sense changes with sparse data and the importance of quantifying uncertainty in the results.

While the EDiSC model shows promising improvements over the previous DiSC model, the authors could have delved deeper into the limitations and potential issues with their approach. For example, they could have discussed the impact of the specific word embedding technique used, or the sensitivity of the results to the choice of hyperparameters in the MCMC sampling.

Additionally, the authors could have explored the broader implications of their findings, such as how the improved sense-change modeling could inform historical linguistics or enhance our understanding of language evolution.

Conclusion

This paper presents an important advancement in the field of lexical semantic change analysis, particularly for ancient languages with limited text corpora. The introduction of the EDiSC model, which combines word embeddings with the DiSC framework, demonstrates improved performance in terms of predictive accuracy, ground-truth recovery, and uncertainty quantification.

The paper also discusses the challenges of fitting these models, which stem from the difficulty of modeling sense change in small, sparse corpora. The findings are relevant to historical linguistics, to the study of language evolution, and to our understanding of how word meanings change over time.



This summary was produced with help from an AI and may contain inaccuracies - check out the links to read the original source documents!

Related Papers

An Embedded Diachronic Sense Change Model with a Case Study from Ancient Greek

Schyan Zafar, Geoff K. Nicholls

Word meanings change over time, and word senses evolve, emerge or die out in the process. For ancient languages, where the corpora are often small and sparse, modelling such changes accurately proves challenging, and quantifying uncertainty in sense-change estimates consequently becomes important. GASC (Genre-Aware Semantic Change) and DiSC (Diachronic Sense Change) are existing generative models that have been used to analyse sense change for target words from an ancient Greek text corpus, using unsupervised learning without the help of any pre-training. These models represent the senses of a given target word such as "kosmos" (meaning decoration, order or world) as distributions over context words, and sense prevalence as a distribution over senses. The models are fitted using Markov Chain Monte Carlo (MCMC) methods to measure temporal changes in these representations. This paper introduces EDiSC, an Embedded DiSC model, which combines word embeddings with DiSC to provide superior model performance. It is shown empirically that EDiSC offers improved predictive accuracy, ground-truth recovery and uncertainty quantification, as well as better sampling efficiency and scalability properties with MCMC methods. The challenges of fitting these models are also discussed.

6/19/2024

Definition generation for lexical semantic change detection

Mariia Fedorova, Andrey Kutuzov, Yves Scherrer

We use contextualized word definitions generated by large language models as semantic representations in the task of diachronic lexical semantic change detection (LSCD). In short, generated definitions are used as "senses", and the change score of a target word is retrieved by comparing their distributions in the two time periods under comparison. On the material of five datasets and three languages, we show that generated definitions are indeed specific and general enough to convey a signal sufficient to rank sets of words by the degree of their semantic change over time. Our approach is on par with or outperforms prior non-supervised sense-based LSCD methods. At the same time, it preserves interpretability and makes it possible to inspect the reasons behind a specific shift in terms of discrete definitions-as-senses. This is another step in the direction of explainable semantic change modeling.

8/1/2024

A Semantic Distance Metric Learning approach for Lexical Semantic Change Detection

Taichi Aida, Danushka Bollegala

Detecting temporal semantic changes of words is an important task for various NLP applications that must make time-sensitive predictions. The Lexical Semantic Change Detection (SCD) task involves predicting whether a given target word, $w$, changes its meaning between two different text corpora, $C_1$ and $C_2$. For this purpose, we propose a supervised two-staged SCD method that uses existing Word-in-Context (WiC) datasets. In the first stage, for a target word $w$, we learn two sense-aware encoders that represent the meaning of $w$ in a given sentence selected from a corpus. Next, in the second stage, we learn a sense-aware distance metric that compares the semantic representations of a target word across all of its occurrences in $C_1$ and $C_2$. Experimental results on multiple benchmark datasets for SCD show that our proposed method achieves strong performance in multiple languages. Additionally, our method achieves significant improvements on WiC benchmarks compared to a sense-aware encoder with conventional distance functions. Source code is available at https://github.com/LivNLP/svp-sdml.

6/4/2024

Deep-change at AXOLOTL-24: Orchestrating WSD and WSI Models for Semantic Change Modeling

Denis Kokosinskii, Mikhail Kuklin, Nikolay Arefyev

This paper describes our solution to the first subtask of the AXOLOTL-24 shared task on Semantic Change Modeling. The goal of this subtask is to distribute a given set of usages of a polysemous word from a newer time period between senses of this word from an older time period and clusters representing gained senses of this word. We propose and experiment with three new methods for solving this task. Our methods achieve SOTA results according to both official metrics of the first subtask. Additionally, we develop a model that can tell if a given word usage is not described by any of the provided sense definitions. This model serves as a component in one of our methods, but can potentially be useful on its own.

8/12/2024