Streaming transformation of XML to RDF using XPath-based mappings

Jyun-Yao Huang, Christoph Lange, Sören Auer
2015 Proceedings of the 11th International Conference on Semantic Systems - SEMANTICS '15  
The Extensible Markup Language (XML) has become a widely adopted data interchange format. With the rise of Linked Data published using the Resource Description Framework (RDF), a number of tools for transforming XML to RDF have been developed. Specifying XML→RDF mappings for these tools often requires skills in programming languages such as XSLT or XQuery. Moreover, these tools rarely have the ability to deal with very large XML inputs. We introduce an XML to RDF transformation approach, which is based on mappings comprising RDF triple templates that employ simple XPath expressions. Due to the restricted XPath expressions, which can be evaluated against a stream of XML data, our implementation can handle extremely large input XML files. To process the XML input efficiently, we employ XML filtering techniques and a strategy for selecting relevant XML nodes for generating RDF triples. We show that the time complexity of our mapping algorithm is linear in the size of the XML input and also prove its practical efficiency with an evaluation on large real-world data.
doi:10.1145/2814864.2814880 dblp:conf/i-semantics/Huang0A15 fatcat:uf4nsi2abzaddjjbeb6lpxsxlq
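
To make the idea in the abstract concrete, the following is a minimal sketch of a streaming XML-to-RDF mapping driven by triple templates. It is not the authors' implementation: it swaps in Python's standard `xml.etree.ElementTree.iterparse` for the XML filtering techniques the paper describes, and it buffers each matched subtree rather than evaluating the expressions purely on the event stream. The mapping layout (`MAPPING`, with `node`, `subject`, and `triples` keys), the function name `stream_triples`, the sample document, and the `http://example.org/` URIs are all assumptions made for illustration. The restricted XPath here is limited to absolute paths made of child steps.

```python
import io
import xml.etree.ElementTree as ET

# Hypothetical mapping format (an assumption, not the paper's syntax):
# one absolute child-step path selecting subject nodes, a subject URI
# template filled from the node's attributes, and (predicate, child) pairs.
MAPPING = {
    "node": "/library/book",
    "subject": "http://example.org/book/{id}",
    "triples": [
        ("http://purl.org/dc/terms/title", "title"),
        ("http://purl.org/dc/terms/creator", "author"),
    ],
}

def stream_triples(source, mapping):
    """Yield (subject, predicate, literal) tuples while parsing incrementally."""
    target = mapping["node"].strip("/").split("/")
    path = []  # tags of the currently open elements, root first
    for event, elem in ET.iterparse(source, events=("start", "end")):
        if event == "start":
            path.append(elem.tag)
            continue
        # "end" event: elem and all of its children are fully parsed.
        if path == target:
            subject = mapping["subject"].format(**elem.attrib)
            for predicate, child in mapping["triples"]:
                for node in elem.findall(child):
                    if node.text:
                        yield (subject, predicate, node.text.strip())
            # Release the matched subtree; an empty element shell
            # stays attached to its parent.
            elem.clear()
        path.pop()

SAMPLE = b"""<library>
  <book id="b1"><title>Dune</title><author>Frank Herbert</author></book>
  <book id="b2"><title>Emma</title><author>Jane Austen</author></book>
</library>"""

triples = list(stream_triples(io.BytesIO(SAMPLE), MAPPING))
for s, p, o in triples:
    print(f'<{s}> <{p}> "{o}" .')
```

Each input event is handled once and each matched subtree is visited once per template, which is in the spirit of the linear-time behaviour claimed in the abstract, although this sketch makes no attempt to reproduce the paper's node selection strategy.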