A copy of this work was available on the public web and has been preserved in the Wayback Machine. The capture dates from 2022; you can also visit the original URL.
The file type is application/pdf
.
Combining Knowledge about Text Types and Document Structures for Enhanced Content Curation
2021
Conference on Digital Curation Technologies
We present the conceptual design of a language technology (LT) system that enables enhanced document curation and processing of different documents types by providing customized NLP workflows that respond and adapt to the extracted characteristics of the input documents. To optimize document and text understanding, the processing steps will not only incorporate textual features but also layout and document type related features like document structure, and the communicative function of specific
dblp:conf/qurator/ZaczynskaKSR21
fatcat:xjax2mnrobe4xkcg6ighfvwsci