Kurpicz-Briki, Mascha (24 June 2020). Cultural Differences in Bias? Origin and Gender Bias in Pre-Trained German and French Word Embeddings In: 5th SwissText & 16th KONVENS Joint Conference 2020. Zürich. 23-25.06.2020.
|
Text
paper6.pdf - Published Version Available under License Creative Commons: Attribution (CC-BY). Download (671kB) | Preview |
Smart applications often rely on training data in form of text. If there is a bias in that training data, the decision of the applications might not be fair. Common training data has been shown to be biased towards different groups of minorities. However, there is no generic algorithm to determine the fairness of training data. One existing approach is to measure gender bias using word embeddings. Most research in this field has been dedicated to the English language. In this work, we identified that there is a bias towards gender and origin in both German and French word embeddings. In particular, we found that real-world bias and stereotypes from the 18th century are still included in today’s word embeddings. Furthermore, we show that the gender bias in German has a different form from English and there is indication that bias has cultural differences that need to be considered when analyzing texts and word embeddings in different languages.
Item Type: |
Conference or Workshop Item (Paper) |
---|---|
Division/Institute: |
School of Engineering and Computer Science > Institute for Data Applications and Security (IDAS) School of Engineering and Computer Science |
Name: |
Kurpicz-Briki, Mascha |
Subjects: |
H Social Sciences > H Social Sciences (General) Q Science > QA Mathematics > QA75 Electronic computers. Computer science Q Science > QA Mathematics > QA76 Computer software |
ISSN: |
1613-0073 |
Language: |
English |
Submitter: |
Mascha Kurpicz-Briki |
Date Deposited: |
07 Jul 2020 13:17 |
Last Modified: |
07 Jul 2020 13:17 |
Related URLs: |
|
ARBOR DOI: |
10.24451/arbor.11922 |
URI: |
https://arbor.bfh.ch/id/eprint/11922 |