Load Shedding using Window Aggregation Queries on Data Streams

S. Senthamilarasu, M. Hemalatha
2012 International Journal of Computer Applications  
Data stream mining is the process of extracting knowledge structures from continuous, rapidly arriving records. The main issue in stream mining is that elements are delivered so rapidly that it is infeasible to store everything in active storage. To overcome this problem of handling voluminous data, we propose a novel load shedding system that uses window-based aggregate functions over the data stream and accepts only those tuples in the stream that meet a criterion. Accepted tuples are passed to another process as a stream, while the remaining tuples are dropped. The proposed model conceptually divides the input data stream into windows and probabilistically decides which tuples to drop based on the window function. The best window aggregate function for dropping tuples is identified using three prediction models from data mining: Decision Tree, Naïve Bayes, and Logistic Regression. The results show that the cumulative distance and density rank functions outperform the remaining methods. Unlike prior methods, our method preserves the consistency of windows throughout a query plan and consistently delivers subsets of the original query answers with negligible loss in result quality.

Keywords: Data stream mining, Window functions, Load Shedding Scheme.

... shedding techniques that assume all queries are equally important, or even give priority to the shedding techniques that are based on dropping input tuples according to the regions' priorities of an input query as a whole. DSMS are often read only once or a small number of times using limited computing and storage capabilities occurrences
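
The windowed, probabilistic dropping described in the abstract can be illustrated with a minimal Python sketch. This is not the authors' implementation: the abstract does not define the cumulative distance or density rank functions, so `distance_from_mean` below is a hypothetical stand-in window function, and the names `tumbling_windows` and `shed_load`, the fixed window size, and the use of non-overlapping (tumbling) windows are all assumptions made for illustration.

```python
import random
from typing import Callable, Iterable, Iterator, List


def tumbling_windows(stream: Iterable[float], size: int) -> Iterator[List[float]]:
    """Split a stream into consecutive, non-overlapping windows of `size` tuples."""
    window: List[float] = []
    for value in stream:
        window.append(value)
        if len(window) == size:
            yield window
            window = []
    if window:  # emit the final, possibly shorter, window
        yield window


def distance_from_mean(value: float, window: List[float]) -> float:
    """Hypothetical window function (NOT from the paper).

    Returns a keep probability in [0, 1] that grows with the tuple's
    distance from the window mean, so outlying tuples are kept more often.
    """
    mean = sum(window) / len(window)
    spread = max(abs(v - mean) for v in window)
    if spread == 0:
        return 1.0  # all tuples identical: nothing distinguishes them, keep all
    return abs(value - mean) / spread


def shed_load(
    stream: Iterable[float],
    window_size: int,
    keep_probability: Callable[[float, List[float]], float],
    rng: random.Random,
) -> Iterator[float]:
    """Yield accepted tuples; tuples that fail the random draw are dropped."""
    for window in tumbling_windows(stream, window_size):
        for value in window:
            # Accept the tuple with the probability given by the window function.
            if rng.random() < keep_probability(value, window):
                yield value


if __name__ == "__main__":
    rng = random.Random(42)  # seeded only to make the demo repeatable
    readings = [rng.gauss(20.0, 5.0) for _ in range(1000)]
    kept = list(shed_load(readings, 50, distance_from_mean, rng))
    print(f"kept {len(kept)} of {len(readings)} tuples")
```

Passing the window function as a parameter keeps the shedding loop independent of the criterion, so a different aggregate could be swapped in and compared without changing the rest of the sketch.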
doi:10.5120/8598-2362 fatcat:ivcneqj7ezbbpkhe6gh7vdhhyi