A corpus-based investigation of pitch contours of monosyllabic words in conversational Taiwan Mandarin

Read original: arXiv:2409.07891 - Published 9/14/2024 by Xiaoyun Jin, Mirjam Ernestus, R. Harald Baayen

Overview

  • Investigates pitch contours of monosyllabic words in conversational Taiwan Mandarin
  • Supported by the European Research Council under Grant SUBLIMINAL (#101054902) awarded to R. Harald Baayen

Plain English Explanation

This research paper examines the pitch patterns, or tones, of single-syllable words in casual Mandarin Chinese conversations in Taiwan. Mandarin Chinese is a tonal language, meaning the pitch of a syllable can change the meaning of the word. The researchers looked at how these tones are actually pronounced in natural speech, rather than in carefully controlled lab settings.

Understanding how tones are realized in natural, conversational speech is important for developing accurate speech recognition and language models for tonal languages like Mandarin. The findings could also shed light on how speakers convey meaning through subtle variations in pitch and tone.

Technical Explanation

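The researchers analyzed the F0 (fundamental frequency) contours of 3824 tokens of 63 monosyllabic word types drawn from a corpus of spontaneous Taiwan Mandarin conversations. They fitted generalized additive (mixed) models, which decompose each observed pitch contour into component contours attributable to separate predictors such as lexical tone, tonal context, word, and word sense. The tones covered are the four lexical tones (T1 to T4) and the neutral tone (T0). Random forests were used as a second check on variable importance.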
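Before pitch contours from words of different durations can be compared, they are usually placed on a common time axis and a perceptually motivated scale. The sketch below shows one common way to do this: converting Hz to semitones and resampling onto normalized time. It is not necessarily the authors' pipeline; the function names, the 100 Hz reference, and the example track are all assumptions made for illustration.

```python
import math

def hz_to_semitones(f0_hz, ref_hz=100.0):
    """Convert F0 values in Hz to semitones relative to ref_hz (assumed reference)."""
    return [12.0 * math.log2(f / ref_hz) for f in f0_hz]

def time_normalize(values, n_points=10):
    """Linearly resample a contour onto n_points equally spaced normalized times in [0, 1]."""
    if len(values) < 2 or n_points < 2:
        raise ValueError("need at least two samples and two output points")
    last = len(values) - 1
    out = []
    for i in range(n_points):
        pos = i / (n_points - 1) * last  # fractional index into the original track
        lo = int(math.floor(pos))
        hi = min(lo + 1, last)
        frac = pos - lo
        out.append(values[lo] * (1.0 - frac) + values[hi] * frac)
    return out

# Hypothetical F0 track (Hz) for a single falling-tone token.
track_hz = [200.0, 190.0, 175.0, 150.0, 125.0, 100.0]
contour = time_normalize(hz_to_semitones(track_hz), n_points=5)
```

After this step every token is represented by the same number of points, so contours can be averaged or modeled as a function of normalized time regardless of how long the word lasted.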

The analysis showed that tonal context substantially modifies a word's canonical tone. Once tonal context is controlled for, T2 and T3 emerge as low flat tones, T1 as a high tone, and T4 as a high-to-mid falling tone; the neutral tone (T0) emerges as a low tone in its own right. Pitch contours also differ systematically between words with the same underlying tone: word identity, and even more so word sense, co-determines the F0 contour. This suggests that tonal realization is gradient and shaped by meaning and context, rather than being a simple reproduction of prototypical tone categories.
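The idea that an observed contour combines a tone's own shape with contributions from context can be illustrated with a toy additive decomposition. The sketch below is far simpler than a generalized additive mixed model: it has no smooth terms and no random effects, and it uses a balanced, noise-free data set. All contour values and factor names are invented for illustration.

```python
from itertools import product

N = 5  # normalized time points per contour

# Hypothetical component contours in semitones (illustrative values only).
TONE = {
    "T1": [4.0, 4.0, 4.0, 4.0, 4.0],       # high level
    "T4": [4.0, 3.0, 2.0, 1.0, 0.0],       # high-to-mid fall
    "T3": [-3.0, -3.0, -3.0, -3.0, -3.0],  # low flat
}
CONTEXT = {
    "after_high": [2.0, 1.0, 0.0, 0.0, 0.0],   # carry-over raises the onset
    "after_low": [-2.0, -1.0, 0.0, 0.0, 0.0],  # carry-over lowers the onset
}

def mean_contour(contours):
    """Point-wise mean of a list of equal-length contours."""
    return [sum(c[t] for c in contours) / len(contours) for t in range(N)]

# Balanced toy data: every tone occurs once in every context.
tokens = [
    (tone, ctx, [TONE[tone][t] + CONTEXT[ctx][t] for t in range(N)])
    for tone, ctx in product(TONE, CONTEXT)
]

grand = mean_contour([c for _, _, c in tokens])

def effect(index, level):
    """Mean contour of one factor level, as a deviation from the grand mean."""
    m = mean_contour([tok[2] for tok in tokens if tok[index] == level])
    return [m[t] - grand[t] for t in range(N)]

tone_effect = {tone: effect(0, tone) for tone in TONE}
context_effect = {ctx: effect(1, ctx) for ctx in CONTEXT}

def reconstruct(tone, ctx):
    """Sum of the component contours for one tone in one context."""
    return [grand[t] + tone_effect[tone][t] + context_effect[ctx][t] for t in range(N)]
```

In this balanced toy case the recovered context component matches the carry-over pattern that was put in, and adding the components back together reproduces each observed contour. Real conversational data are unbalanced and noisy, which is one reason regression models with smooths and speaker-level random effects are used instead of simple averaging.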

Critical Analysis

The corpus-based approach provides valuable ecological validity compared to lab-based studies, as it captures tone realization in natural, conversational speech. However, the analysis is correlational and does not directly test the influence of semantic and pragmatic factors on tone production.

Additionally, the study is limited to monosyllabic words, so the findings may not generalize to multisyllabic words or more complex prosodic structures. Further research is needed to investigate how tone interacts with other prosodic features like stress and phrasing in spontaneous Mandarin speech.

Conclusion

This study demonstrates the value of analyzing tone production in naturalistic speech data, rather than relying solely on idealized pronunciations. The findings challenge simplistic views of tone as a static lexical feature, and instead suggest that tonal realization is a dynamic process shaped by multiple linguistic and contextual factors. This has important implications for developing more accurate and nuanced models of tonal language processing and production.



This summary was produced with help from an AI and may contain inaccuracies - check out the links to read the original source documents!


Related Papers

A corpus-based investigation of pitch contours of monosyllabic words in conversational Taiwan Mandarin

Xiaoyun Jin, Mirjam Ernestus, R. Harald Baayen

In Mandarin, the tonal contours of monosyllabic words produced in isolation or in careful speech are characterized by four lexical tones: a high-level tone (T1), a rising tone (T2), a dipping tone (T3) and a falling tone (T4). However, in spontaneous speech, the actual tonal realization of monosyllabic words can deviate significantly from these canonical tones due to intra-syllabic co-articulation and inter-syllabic co-articulation with adjacent tones. In addition, Chuang et al. (2024) recently reported that the tonal contours of disyllabic Mandarin words with the T2-T4 tone pattern are co-determined by their meanings. Following up on their research, we present a corpus-based investigation of how the pitch contours of monosyllabic words are realized in spontaneous conversational Mandarin, focusing on the effects of contextual predictors on the one hand, and the way in which words' meanings co-determine pitch contours on the other hand. We analyze the F0 contours of 3824 tokens of 63 different word types in a spontaneous Taiwan Mandarin corpus, using the generalized additive (mixed) model to decompose a given observed pitch contour into a set of component pitch contours. We show that the tonal context substantially modifies a word's canonical tone. Once the effect of tonal context is controlled for, T2 and T3 emerge as low flat tones, contrasting with T1 as a high tone, and with T4 as a high-to-mid falling tone. The neutral tone (T0), which in standard descriptions is realized based on the preceding tone, emerges as a low tone in its own right, modified by the other predictors in the same way as the standard tones T1, T2, T3, and T4. We also show that word, and even more so, word sense, co-determine words' F0 contours. Analyses of variable importance using random forests further supported the substantial effect of tonal context and an effect of word sense.
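To give a feel for what a variable-importance analysis measures, the sketch below computes permutation importance: how much prediction error grows when one predictor is shuffled across tokens. A simple lookup table stands in for the random forest used in the paper, and the data, predictor names, and effect sizes are invented for illustration.

```python
import random
from itertools import product

# Toy full-factorial data: (tonal_context, word_sense, filler) -> pitch height.
# Effect sizes are hypothetical; "filler" has no effect by construction.
rows = []
for ctx, sense, filler in product(("high", "low"), ("s1", "s2"), ("a", "b")):
    height = (3.0 if ctx == "high" else -3.0) + (0.5 if sense == "s1" else -0.5)
    rows.extend([((ctx, sense, filler), height)] * 5)

def fit_lookup(data):
    """Stand-in 'model': mean response for each exact predictor combination."""
    sums = {}
    for x, y in data:
        total, n = sums.get(x, (0.0, 0))
        sums[x] = (total + y, n + 1)
    return {x: total / n for x, (total, n) in sums.items()}

def mse(model, data):
    """Mean squared prediction error of the lookup model on data."""
    return sum((model[x] - y) ** 2 for x, y in data) / len(data)

def permutation_importance(model, data, column, rng):
    """Increase in MSE after shuffling one predictor column across tokens."""
    shuffled = [x[column] for x, _ in data]
    rng.shuffle(shuffled)
    permuted = [
        (x[:column] + (value,) + x[column + 1:], y)
        for (x, y), value in zip(data, shuffled)
    ]
    return mse(model, permuted) - mse(model, data)

model = fit_lookup(rows)
rng = random.Random(0)  # fixed seed so the shuffles are repeatable
importance = {
    name: permutation_importance(model, rows, col, rng)
    for col, name in enumerate(("tonal_context", "word_sense", "filler"))
}
```

With these made-up effect sizes, shuffling tonal context hurts prediction most, shuffling word sense hurts less, and shuffling the filler predictor does not hurt at all. That ordering is built into the toy data and says nothing about the magnitudes reported in the paper.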


9/14/2024

Word-specific tonal realizations in Mandarin

Yu-Ying Chuang, Melanie J. Bell, Yu-Hsiang Tseng, R. Harald Baayen

The pitch contours of Mandarin two-character words are generally understood as being shaped by the underlying tones of the constituent single-character words, in interaction with articulatory constraints imposed by factors such as speech rate, co-articulation with adjacent tones, segmental make-up, and predictability. This study shows that tonal realization is also partially determined by words' meanings. We first show, on the basis of a Taiwan corpus of spontaneous conversations, using the generalized additive regression model, and focusing on the rise-fall tone pattern, that after controlling for effects of speaker and context, word type is a stronger predictor of pitch realization than all the previously established word-form related predictors combined. Importantly, the addition of information about meaning in context improves prediction accuracy even further. We then proceed to show, using computational modeling with context-specific word embeddings, that token-specific pitch contours predict word type with 50% accuracy on held-out data, and that context-sensitive, token-specific embeddings can predict the shape of pitch contours with 30% accuracy. These accuracies, which are an order of magnitude above chance level, suggest that the relation between words' pitch contours and their meanings is sufficiently strong to be functional for language users. The theoretical implications of these empirical findings are discussed.


5/14/2024

Form and meaning co-determine the realization of tone in Taiwan Mandarin spontaneous speech: the case of Tone 3 sandhi

Yuxin Lu, Yu-Ying Chuang, R. Harald Baayen

In Standard Chinese, Tone 3 (the dipping tone) becomes Tone 2 (rising tone) when followed by another Tone 3. Previous studies have noted that this sandhi process may be incomplete, in the sense that the assimilated Tone 3 is still distinct from a true Tone 2. While Mandarin Tone 3 sandhi is widely studied using carefully controlled laboratory speech (Xu, 1997) and more formal registers of Beijing Mandarin (Yuan and Chen, 2014), less is known about its realization in spontaneous speech, and about the effect of contextual factors on tonal realization. The present study investigates the pitch contours of two-character words with T2-T3 and T3-T3 tone patterns in spontaneous Taiwan Mandarin conversations. Our analysis makes use of the Generalized Additive Mixed Model (GAMM, Wood, 2017) to examine fundamental frequency (f0) contours as a function of normalized time. We consider various factors known to influence pitch contours, including gender, speaking rate, speaker, neighboring tones, word position, and bigram probability, as well as the novel predictors word and word sense (Chuang et al., 2024). Our analyses revealed that in spontaneous Taiwan Mandarin, T3-T3 words become indistinguishable from T2-T3 words, indicating complete sandhi, once the strong effect of word (or word sense) is taken into account. For our data, the shape of f0 contours is not co-determined by word frequency. In contrast, the effect of word meaning on f0 contours is robust, as strong as the effect of adjacent tones, and is present for both T2-T3 and T3-T3 words.
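The categorical version of the Tone 3 sandhi rule stated at the start of this abstract can be written in a few lines. The sketch below is an illustration only: the function name is hypothetical, tones are encoded as the integers 1 to 4, and each syllable is checked against the underlying tone of the syllable that follows it. It ignores prosodic grouping, which affects how longer runs of Tone 3 are realized, and it says nothing about the gradient phonetic realization that the study actually examines.

```python
def apply_t3_sandhi(tones):
    """Return surface tones: a T3 directly followed by an underlying T3 becomes T2."""
    surface = list(tones)
    for i in range(len(tones) - 1):
        if tones[i] == 3 and tones[i + 1] == 3:
            surface[i] = 2
    return surface

# A T3-T3 word such as ni3 hao3 surfaces with the T2-T3 pattern.
t3_t3_word = apply_t3_sandhi([3, 3])
# An underlying T2-T3 word is left unchanged.
t2_t3_word = apply_t3_sandhi([2, 3])
```

Under this rule a T3-T3 word and a T2-T3 word receive the same surface pattern, which is the categorical counterpart of the question the study asks about their measured f0 contours.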


8/29/2024

Encoding of lexical tone in self-supervised models of spoken language

Gaofei Shen, Michaela Watkins, Afra Alishahi, Arianna Bisazza, Grzegorz Chrupała

Interpretability research has shown that self-supervised Spoken Language Models (SLMs) encode a wide variety of features in human speech from the acoustic, phonetic, phonological, syntactic and semantic levels, to speaker characteristics. The bulk of prior research on representations of phonology has focused on segmental features such as phonemes; the encoding of suprasegmental phonology (such as tone and stress patterns) in SLMs is not yet well understood. Tone is a suprasegmental feature that is present in more than half of the world's languages. This paper aims to analyze the tone encoding capabilities of SLMs, using Mandarin and Vietnamese as case studies. We show that SLMs encode lexical tone to a significant degree even when they are trained on data from non-tonal languages. We further find that SLMs behave similarly to native and non-native human participants in tone and consonant perception studies, but they do not follow the same developmental trajectory.


4/4/2024