Using Natural Language Processing to find Indication for Burnout with Text Classification: From Online Data to Real-World Data
Date Issued
2024-09-22
Author(s)
Type
Working Paper
Language
English
Abstract
Burnout, classified as a syndrome in the ICD-11, arises from chronic workplace stress that has not been effectively managed. It is characterized by exhaustion, cynicism, and reduced professional efficacy, and estimates of its prevalence vary significantly due to inconsistent measurement methods. Recent advancements in Natural Language Processing (NLP) and machine learning offer promising tools for detecting burnout through textual data analysis, with studies demonstrating high predictive accuracy. This paper contributes to burnout detection in German texts by: (a) collecting an anonymous real-world dataset including free-text answers and Oldenburg Burnout Inventory (OLBI) responses; (b) demonstrating the limitations of a GermanBERT-based classifier trained on online data; (c) presenting two versions of a curated BurnoutExpressions dataset, which yielded models that perform well in
real-world applications; and (d) providing qualitative insights from an interdisciplinary focus group on the interpretability of AI models used for burnout detection. Our findings emphasize the need for greater collaboration between AI researchers and clinical experts to refine burnout detection models. Additionally, more real-world data is essential to validate and enhance the effectiveness of current AI methods developed in NLP research, which are often based on data automatically scraped from online sources and not evaluated in a real-world context. This is essential for ensuring AI tools are well suited for practical applications.
real-world applications; and (d) providing qualitative insights from an interdisciplinary focus group on the interpretability of AI models used for burnout detection. Our findings emphasize the need for greater collaboration between AI researchers and clinical experts to refine burnout detection models. Additionally, more real-world data is essential to validate and enhance the effectiveness of current AI methods developed in NLP research, which are often based on data automatically scraped from online sources and not evaluated in a real-world context. This is essential for ensuring AI tools are well suited for practical applications.
Publisher DOI
Publisher URL
Related URL
Publisher
Cornell University
Submitter
Kurpicz-Briki, Mascha
Citation apa
Kurpicz-Briki, M., Merhbene, G., Puttick, A. R., Ben Souissi, S., Bieri, J. S., Müller, T. J., & Golz, C. (2024). Using Natural Language Processing to find Indication for Burnout with
Text Classification: From Online Data to Real-World Data. Cornell University. https://doi.org/10.24451/dspace/11491
File(s)![Thumbnail Image]()
Loading...
open access
Name
arxivKurpicz-Briki2024.pdf
License
Attribution 4.0 International
Size
960.05 KB
Format
Adobe PDF
Checksum (MD5)
09b1761946becb84c8455af2e048bcf6
