Designing an Extensible Domain-Specific Web Corpus for "Layfication" [chapter]

Marina Santini, Arne Jönsson, Wiktor Strandqvist, Gustav Cederblad, Mikael Nyström, Marjan Alirezaie, Leili Lind, Eva Blomqvist, Maria Lindén, Annica Kristoffersson
2019 Advances in Systems Analysis, Software Engineering, and High Performance Computing  
In the era of data-driven science, corpus-based language technology is an essential part of cyber-physical systems. In this chapter, the authors describe the design and development of an extensible domain-specific web corpus to be used in a distributed social application for the care of the elderly at home. The domain of interest is the medical field of chronic diseases. The corpus is conceived as a flexible and extensible textual resource, where additional documents and additional ... will be appended over time. The main purpose of the corpus is to be used for building and training language technology applications for the "layfication" of specialized medical jargon. "Layfication" refers to the automatic identification of more intuitive linguistic expressions that can help laypeople (e.g., patients, family caregivers, and home care aides) understand medical terms, which often appear opaque. Exploratory experiments are presented and discussed.
doi:10.4018/978-1-5225-7879-6.ch006 fatcat:tgaorpe5fvepnhl7j66mkp2taa