A copy of this work was available on the public web and has been preserved in the Wayback Machine. The capture dates from 2022; you can also visit the original URL.
The file type is application/pdf
.
Extraction of Semantic XML DTDs from Texts Using Data Mining Techniques
2001
International Conference on Knowledge Capture
Although composed of unstructured texts, documents contained in textual archives such as public announcements, patient records and annual reports to shareholders often share an inherent though undocumented structure. In order to facilitate efficient, structure-based search in archives and to enable information integration of text collections with related data sources, this inherent structure should be made explicit as detailed as possible. Inferring a semantic and structured XML document type
dblp:conf/kcap/WinklerS01
fatcat:exsiw477zvej3jt3bjc3t4p7cy