A copy of this work was available on the public web and has been preserved in the Wayback Machine. The capture dates from 2017; you can also visit the original URL.
The file type is
Commercial analytical database systems suffer from a high "timeto-first-analysis": before data can be processed, it must be modeled and schematized (a human effort), transferred into the database's storage layer, and optionally clustered and indexed (a computational effort). For many types of structured data, this upfront effort is unjustifiable, so the data are processed directly over the file system using the Hadoop framework, despite the cumulative performance benefits of processing thisdoi:10.1145/2452376.2452377 dblp:conf/edbt/AbouziedAS13 fatcat:y6qzgzdlr5bmbn4ihbewzya5um