A copy of this work was available on the public web and has been preserved in the Wayback Machine. The capture dates from 2019; you can also visit the original URL.
The file type is application/pdf
.
Robust Runtime Optimization and Skew-Resistant Execution of Analytical SPARQL Queries on Pig
[chapter]
2012
Lecture Notes in Computer Science
We describe a system that incrementally translates SPARQL queries to Pig Latin and executes them on a Hadoop cluster. This system is designed to work efficiently on complex queries with many self-joins over huge datasets, avoiding job failures even in the case of joins with unexpected high-value skew. To be robust against cost estimation errors, our system interleaves query optimization with query execution, determining the next steps to take based on data samples and statistics gathered during
doi:10.1007/978-3-642-35176-1_16
fatcat:bkrhd4o36jfnnnm7iuiebaqtm4