Data Augmentation for Multi-Class Eating Disorders Text Classification
Version
Published
Date Issued
2024
Author(s)
Merhbene, Ghofrane
Editor(s)
Capol, Corsin
Cieliebak, Mark
Weichselbraun, Albert
Musat, Claudiu
Maier, Elisabeth
Zimmermann, Lucas
Type
Conference Paper
Language
English
Abstract
In this study, we tackle the challenge of detecting Eating Disorders (EDs) in German text, a relatively unexplored area in natural language processing (NLP) for mental health. In this project, we developed a manually annotated German dataset from YouTube comments. To address the class distribution imbalance, we employed back translation as a data augmentation technique. This process significantly enhanced the dataset’s utility. Through a comprehensive grid search, we identified a Support Vector Machine (SVM) model as the most effective, achieving an average F1-score of 0.83. Our findings not only contribute to the research field of ED detection in German but also demonstrate the effectiveness of innovative data augmentation techniques in managing class imbalances in natural language processing.
Publisher URL
Conference
SwissText 2024
Publisher
Association of Computational Linguistics ACL
Submitter
Kurpicz-Briki, Mascha
Citation apa
Merhbene, G., & Kurpicz-Briki, M. (2024). Data Augmentation for Multi-Class Eating Disorders Text Classification (C. Capol, M. Cieliebak, A. Weichselbraun, C. Musat, E. Maier, & L. Zimmermann, Eds.). Association of Computational Linguistics ACL. https://doi.org/10.24451/dspace/11529
File(s)![Thumbnail Image]()
Loading...
open access
Name
Merhbene2024.pdf
Description
Version published
License
Attribution 4.0 International
Size
443.61 KB
Format
Adobe PDF
Checksum (MD5)
86b83ad79fc55c5c857ce5e98755e0dc
