XStruct: Efficient Schema Extraction from Multiple and Large XML Documents
2006
22nd International Conference on Data Engineering Workshops (ICDEW'06)
XML is the de facto standard format for data exchange on the Web. While it is fairly simple to generate XML data, it is a complex task to design a schema and then guarantee that the generated data is valid according to that schema. As a consequence, much XML data does not have a schema or is not accompanied by its schema. In order to gain the benefits of having a schema (efficient querying and storage of XML data, semantic verification, data integration, etc.), this schema must be extracted. In
doi:10.1109/icdew.2006.166
dblp:conf/icde/HegewaldNW06
fatcat:dlzs3og2xzehrebwqawecezd5a