A systematic review of bias detection methods for non-English word embeddings and language models
Version
Published
Identifiers
10.1007/s10462-025-11375-8
Date Issued
2025-10-08
Author(s)
Rigotti, Carlotta
Fosch-Villaronga, Eduard
Kharas, Mark W.
Søraa, Roger A.
Type
Article
Language
English
Abstract
Biases in applications of machine learning and artificial intelligence are a major limitation of these applications. Stereotypes of the society are reflected in different types of applications, including image generation, machine translation or CV ranking. This is in particular also the case for language models and word embeddings, encoding human language as mathematical vectors. Research addressing the challenging problem of detection (and mitigation) of the bias in these embeddings is often conducted for the English language. However, the stereotypes encoded can be language dependent and impacted by a cultural environment. Thus, dedicated research efforts for languages other than English are required. In this paper, we conduct a systematic literature review to identify and compare existing bias detection methods for non-English word embeddings and language models. In an interdisciplinary team we examine the technical aspects, as well as the definitions of bias used by researchers in the field. Based on our findings, we outline a research plan for making bias detection in the field of NLP more inclusive for languages other than English.
Publisher DOI
Journal or Serie
Artificial Intelligence Review
Journal or Serie
Artificial Intelligence Review
ISSN
1573-7462
Volume
58
Issue
12
Publisher
Springer Nature
Submitter
Kurpicz-Briki, Mascha
Citation apa
Puttick, A. R., Ikae, C., Rigotti, C., Fosch-Villaronga, E., Kharas, M. W., Søraa, R. A., & Kurpicz-Briki, M. (2025). A systematic review of bias detection methods for non-English word embeddings and language models. In Artificial Intelligence Review (Vol. 58, Issue 12). Springer Nature. https://doi.org/10.24451/arbor.12558
File(s)![Thumbnail Image]()
Loading...
open access
Name
s10462-025-11375-8.pdf
License
Attribution-NonCommercial-NoDerivatives 4.0 International
Version
published
Size
4.36 MB
Format
Adobe PDF
Checksum (MD5)
28089f675a276391454435a36fa0defc
