POST STRATIFICATION SAMPLING AND HORVITZ THOMPSON ESTIMATOR FOR RANGE AGGREGATE QUERIES IN BIG DATA ENVIRONMENTS

S Barkath, Nisha, Latha Priyadharshini
IJRCS-International Journal of Research in Computer Science   unpublished
Big Data is a collection of large datasets and handling of data is challenging in this environment. Fast Range Aggregate Queries (FastRAQ) approach is used to process the range aggregate queries that consist of aggregate function on all tuples within the query ranges. The query result can be generated from the range cardinality query algorithm. The weight of the sample estimate is calculated using the Post Stratification sampling method and to estimate the total and mean of a super population
more » ... a stratified sample, Horvitz Thompson estimator is used. The time complexity is reduced by using the sampling methods.
fatcat:sx2uxlip5va3nfteg6tdvb77rq