Detecting Bias and Intersectional Bias in Italian Word Embeddings and Language Models
Version
Published
Date Issued
2025-08-01
Type
Conference Paper
Language
English
Abstract
Bias in Natural Language Processing (NLP) applications has become a critical issue, with many methods developed to measure and mitigate bias in word embeddings and language models. However, most approaches focus on single categories such as gender or ethnicity, neglecting the intersectionality of biases, particularly in non-English languages. This paper addresses these gaps by studying both single-category and intersectional biases in Italian word embeddings and language models. We extend existing bias metrics to Italian, introducing GG-FISE, a novel method for detecting intersectional bias while accounting for grammatical gender. We also adapt the CrowS-Pairs dataset and bias metric to Italian. Through a series of experiments using WEAT, SEAT, and LPBS tests, we identify significant biases along gender and ethnic lines, with particular attention to biases against Romanian and South Asian populations. Our results highlight the need for culturally adapted methods to detect and address biases in multilingual and intersectional contexts.
Publisher DOI
Journal or Serie
Proceedings of the 6th Workshop on Gender Bias in Natural Language Processing (GeBNLP)
Publisher URL
Conference
The 63rd Annual Meeting of the Association for Computational Linguistics
Publisher
Association for Computational Linguistics
Submitter
Kurpicz-Briki, Mascha
Citation apa
Puttick, A. R., & Kurpicz-Briki, M. (2025). Detecting Bias and Intersectional Bias in Italian Word Embeddings and Language Models. In Proceedings of the 6th Workshop on Gender Bias in Natural Language Processing (GeBNLP). The 63rd Annual Meeting of the Association for Computational Linguistics. Association for Computational Linguistics. https://doi.org/10.24451/arbor.12847
File(s)![Thumbnail Image]()
Loading...
open access
Name
2025.gebnlp-1.3v2.pdf
License
Attribution 4.0 International
Version
published
Size
484.79 KB
Format
Adobe PDF
Checksum (MD5)
f0ab3f689012dfb0ea0fb0e1c3884316
