Mapping SNOMED CT Codes to Semi-Structured Texts via an NLP Pipeline
Version
Published
Date Issued
2022-06-29
Author(s)
Kunz, Sebastian
Zgraggen, Cyril
Sariyar, Murat
Type
Article
Language
English
Abstract
In the project presented here, we used NLP tools for annotating German medical trainings documents with SNOMED CT codes. Following research question was addressed: Is it possible to automate the annotation of training documents with an NLP pipeline especially designed for this task but requiring translation into English? The goal of our stakeholder, an institution responsible for the continuing education of physicians, was to facilitate the switch between different medical trainings programs by coding the same requirement with the same SNOMED CT code, even if the wording is different. We first describe how we chose the concrete NLP tools, after which the concrete steps for implementing our prototype are outlined: the NLP pipeline construction, the implementation, and the validation. We infer three important lessons from our results: (i) self-supervision is no free lunch and should be based on a sophisticated task, (ii) the translation via DeepL can be too context-dependent for a peculiar use case, and (iii) ontology extraction can increase efficiency as well as accuracy.
Subjects
QA75 Electronic computers. Computer science
ISBN
9781643682907
Publisher DOI
Journal or Serie
Studies in Health Technology and Informatics
Series/Report No.
Studies in Health Technology and Informatics
ISSN
1879-8365
Publisher URL
Volume
295
Publisher
IOS Press
Submitter
Sariyar, Murat
Citation apa
Kunz, S., Zgraggen, C., & Sariyar, M. (2022). Mapping SNOMED CT Codes to Semi-Structured Texts via an NLP Pipeline. In Studies in Health Technology and Informatics (Vol. 295, pp. 390–393). IOS Press. https://doi.org/10.24451/arbor.18475
File(s)![Thumbnail Image]()
Loading...
open access
Name
SHTI-295-SHTI220747.pdf
License
Attribution-NonCommercial 4.0 International
Version
published
Size
232.84 KB
Format
Adobe PDF
Checksum (MD5)
b04350740a636b603af3ef241bc69df6
