Generating Gender Alternatives in Machine Translation

Read original: arXiv:2407.20438 - Published 7/31/2024 by Sarthak Garg, Mozhdeh Gheini, Clara Emmanuel, Tatiana Likhomanenko, Qin Gao, Matthias Paulik


Overview

Explores ways to generate gender alternatives in machine translation
Proposes an entity-level approach to provide multiple gender-inclusive options
Evaluates the approach on a novel dataset of gender-marked sentences

Plain English Explanation

This paper focuses on the issue of gender bias in machine translation (MT) systems. MT models tend to translate gender-ambiguous terms into the gendered form most prevalent in their training data, which often reflects stereotypical gender roles. For example, the English term "the nurse" may be rendered as "enfermera", the Spanish term for a female nurse.

The researchers propose a new approach to address this problem. Instead of committing to a single gendered form, they develop a system that can generate multiple gender alternatives for specific entities mentioned in the text. This allows the output to be more inclusive and representative of diverse gender identities.

The key idea is to identify the entities in the input text (e.g., people or roles) and then generate alternative translations that use different gender markers for those entities. This is done at the entity level, rather than by making broad changes to the overall translation.

The researchers evaluate their approach on a novel dataset of gender-marked sentences covering five language pairs. This allows them to assess how well the system can produce accurate and appropriate gender alternatives compared to the original translations.

By taking this more fine-grained and entity-focused approach, the researchers aim to make MT systems more inclusive and respectful of gender diversity. This could have important implications for applications like multilingual communication, content localization, and language learning.

Technical Explanation

The paper presents an entity-level approach to generating gender alternatives in machine translation. The key steps are:

Entity Identification: The first step is to identify the entities (e.g., people or roles) in the input text that have gender-specific references.
Gender Alternative Generation: For each identified entity, the system generates alternative translations that use different gender markers (e.g., he/she or him/her).
Ranking and Selection: The system then ranks the generated alternatives based on factors like fluency and semantic coherence, and selects the most appropriate options to include in the final translation output.
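
The three steps above can be sketched in a few lines of Python. This is only a toy illustration under stated assumptions, not the authors' implementation: the lexicon GENDERED_FORMS, the slot-based translation template, and all function names are hypothetical, and a real system would rely on an MT model rather than a hand-written table.

```python
from dataclasses import dataclass
from itertools import product

# Hypothetical toy lexicon (assumption): English role noun -> Spanish gendered forms.
GENDERED_FORMS = {
    "nurse": {"feminine": "la enfermera", "masculine": "el enfermero"},
    "doctor": {"feminine": "la doctora", "masculine": "el doctor"},
}


@dataclass(frozen=True)
class Alternative:
    genders: tuple  # one gender label per ambiguous entity, in source order
    text: str       # the translation with those gender choices applied


def identify_entities(source: str) -> list[str]:
    """Step 1: find source words with gendered target forms (toy lexicon lookup)."""
    tokens = [t.strip(".,").lower() for t in source.split()]
    return [t for t in tokens if t in GENDERED_FORMS]


def generate_alternatives(template: str, entities: list[str]) -> list[Alternative]:
    """Step 2: fill each entity slot with every gendered form, independently."""
    options = [sorted(GENDERED_FORMS[e].items()) for e in entities]
    results = []
    for combo in product(*options):
        forms = {entity: form for entity, (_, form) in zip(entities, combo)}
        text = template.format(**forms)
        text = text[0].upper() + text[1:]  # capitalize the sentence start
        results.append(Alternative(tuple(g for g, _ in combo), text))
    return results


def rank_alternatives(alternatives, score):
    """Step 3: order alternatives by a caller-supplied score (higher is better)."""
    return sorted(alternatives, key=score, reverse=True)


if __name__ == "__main__":
    entities = identify_entities("The nurse and the doctor arrived.")
    # The template stands in for an MT output with one slot per entity (assumption).
    for alt in generate_alternatives("{nurse} y {doctor} llegaron.", entities):
        print(alt.genders, alt.text)
```

Because each entity is varied independently, two ambiguous entities yield four alternatives. The score passed to rank_alternatives is left to the caller here; the summary mentions fluency and semantic coherence as ranking factors but does not say how they are computed.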

The dataset used for evaluation consists of gender-marked sentences covering five language pairs, which allows the researchers to assess how well the system can produce accurate and appropriate gender alternatives.
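
One simple way to score generated alternatives against a reference set is exact-match precision and recall over the two sets. The sketch below is an assumption made for illustration: the summary does not state which metric the paper uses, and the function name alternative_set_scores is hypothetical.

```python
def alternative_set_scores(predicted: set[str], reference: set[str]) -> tuple[float, float]:
    """Return (precision, recall) of predicted alternatives against the reference set.

    Exact string match is an illustrative assumption, not the paper's stated metric.
    """
    if not predicted or not reference:
        return 0.0, 0.0
    hits = len(predicted & reference)
    return hits / len(predicted), hits / len(reference)


if __name__ == "__main__":
    reference = {"La enfermera llegó.", "El enfermero llegó."}
    predicted = {"La enfermera llegó."}
    print(alternative_set_scores(predicted, reference))
```

A system that returns only one of two valid alternatives keeps full precision but loses recall, which is the failure mode a single-output MT system exhibits on gender-ambiguous input.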

The experimental results show that the entity-level approach outperforms previous methods that relied on more global gender modifications. This suggests that the fine-grained, entity-focused strategy is an effective way to enhance the gender inclusivity of machine translation systems.

Critical Analysis

The paper acknowledges some limitations of the proposed approach. For example, it may not work as well for languages with more complex gender systems, and the ranking and selection of alternatives could still be improved.

Additionally, the dataset used for evaluation, while novel, may not fully capture the nuances and diversity of gender identities in real-world language use. There could be opportunities to further expand the scope and depth of the evaluation.

Overall, the research represents an important step forward in addressing gender bias in machine translation. However, there is still room for further refinement and exploration of more holistic solutions that consider the broader societal and linguistic implications of this challenge.

Conclusion

This paper presents a novel entity-level approach to generating gender alternatives in machine translation. By focusing on the specific entities mentioned in the text, rather than making global changes, the system is able to produce more accurate and inclusive translations.

The evaluation on a dedicated dataset shows promising results, though the authors also note some limitations that could be addressed in future work. Ultimately, this research contributes to the growing effort to make machine translation systems more respectful and representative of gender diversity, which could have important implications for a wide range of applications and user communities.

This summary was produced with help from an AI and may contain inaccuracies - check out the links to read the original source documents!


Related Papers

Generating Gender Alternatives in Machine Translation

Sarthak Garg, Mozhdeh Gheini, Clara Emmanuel, Tatiana Likhomanenko, Qin Gao, Matthias Paulik

Machine translation (MT) systems often translate terms with ambiguous gender (e.g., English term the nurse) into the gendered form that is most prevalent in the systems' training data (e.g., enfermera, the Spanish term for a female nurse). This often reflects and perpetuates harmful stereotypes present in society. With MT user interfaces in mind that allow for resolving gender ambiguity in a frictionless manner, we study the problem of generating all grammatically correct gendered translation alternatives. We open source train and test datasets for five language pairs and establish benchmarks for this task. Our key technical contribution is a novel semi-supervised solution for generating alternatives that integrates seamlessly with standard MT models and maintains high performance without requiring additional components or increasing inference overhead.

7/31/2024

Building Bridges: A Dataset for Evaluating Gender-Fair Machine Translation into German

Manuel Lardelli, Giuseppe Attanasio, Anne Lauscher

The translation of gender-neutral person-referring terms (e.g., the students) is often non-trivial. Translating from English into German poses an interesting case -- in German, person-referring nouns are usually gender-specific, and if the gender of the referent(s) is unknown or diverse, the generic masculine (die Studenten (m.)) is commonly used. This solution, however, reduces the visibility of other genders, such as women and non-binary people. To counteract gender discrimination, a societal movement towards using gender-fair language exists (e.g., by adopting neosystems). However, gender-fair German is currently barely supported in machine translation (MT), requiring post-editing or manual translations. We address this research gap by studying gender-fair language in English-to-German MT. Concretely, we enrich a community-created gender-fair language dictionary and sample multi-sentence test instances from encyclopedic text and parliamentary speeches. Using these novel resources, we conduct the first benchmark study involving two commercial systems and six neural MT models for translating words in isolation and natural contexts across two domains. Our findings show that most systems produce mainly masculine forms and rarely gender-neutral variants, highlighting the need for future research. We release code and data at https://github.com/g8a9/building-bridges-gender-fair-german-mt.

6/11/2024

Beyond Binary Gender: Evaluating Gender-Inclusive Machine Translation with Ambiguous Attitude Words

Yijie Chen, Yijin Liu, Fandong Meng, Jinan Xu, Yufeng Chen, Jie Zhou

Gender bias has been a focal point in the study of bias in machine translation and language models. Existing machine translation gender bias evaluations are primarily focused on male and female genders, limiting the scope of the evaluation. To assess gender bias accurately, these studies often rely on calculating the accuracy of gender pronouns or the masculine and feminine attributes of grammatical gender via the stereotypes triggered by occupations or sentiment words (i.e., clear positive or negative attitude), which cannot extend to non-binary groups. This study presents a benchmark AmbGIMT (Gender-Inclusive Machine Translation with Ambiguous attitude words), which assesses gender bias beyond binary gender. Meanwhile, we propose a novel process to evaluate gender bias based on the Emotional Attitude Score (EAS), which is used to quantify ambiguous attitude words. In evaluating three recent and effective open-source LLMs and one powerful multilingual translation-specific model, our main observations are: (1) The translation performance within non-binary gender contexts is markedly inferior in terms of translation quality and exhibits more negative attitudes than binary-gender contexts. (2) The analysis experiments indicate that incorporating constraint context in prompts for gender identity terms can substantially reduce translation bias, while the bias remains evident despite the presence of the constraints. The code is publicly available at https://github.com/pppa2019/ambGIMT.

7/24/2024

Enhancing Gender-Inclusive Machine Translation with Neomorphemes and Large Language Models

Andrea Piergentili, Beatrice Savoldi, Matteo Negri, Luisa Bentivogli

Machine translation (MT) models are known to suffer from gender bias, especially when translating into languages with extensive gendered morphology. Accordingly, they still fall short in using gender-inclusive language, also representative of non-binary identities. In this paper, we look at gender-inclusive neomorphemes, neologistic elements that avoid binary gender markings as an approach towards fairer MT. In this direction, we explore prompting techniques with large language models (LLMs) to translate from English into Italian using neomorphemes. So far, this area has been under-explored due to its novelty and the lack of publicly available evaluation resources. We fill this gap by releasing Neo-GATE, a resource designed to evaluate gender-inclusive en-it translation with neomorphemes. With Neo-GATE, we assess four LLMs of different families and sizes and different prompt formats, identifying strengths and weaknesses of each on this novel task for MT.

5/15/2024