Database systems research on data mining

Carlos Ordonez, Javier García-García
2010 Proceedings of the 2010 international conference on Management of data - SIGMOD '10  
Data mining remains an important research area in database systems. We present a review of processing alternatives, storage mechanisms, algorithms, data structures and optimizations that enable data mining on large data sets. We focus on the computation of well-known multidimensional statistical and machine learning models. We pay particular attention to SQL and MapReduce as two competing technologies for large scale processing. We conclude with a summary of solved major problems and open
more » ... lems and open research issues.
doi:10.1145/1807167.1807335 dblp:conf/sigmod/OrdonezG10 fatcat:ppgnhhnzorhkpbrpqz7etrz6j4