A Toolbox Approach to Flexible and Efficient Data Mining [chapter]

Ole M. Nielsen⋆, Peter Christen, Markus Hegland, Tatiana Semenova, Timothy Hancock
2001 Lecture Notes in Computer Science  
This paper describes a flexible and efficient toolbox based on the scripting language Python, capable of handling common tasks in data mining. Using either a relational database or flat files the toolbox gives the user a uniform view of a data collection. Two core features of the toolbox are caching of database queries and parallelism within a collection of independent queries. Our toolbox provides a number of routines for basic data mining tasks on top of which the user can add more functions
more » ... mainly domain and data collection dependent -for complex and time consuming data mining tasks.
doi:10.1007/3-540-45357-1_16 fatcat:v3u3byilnfas3cwwi2zaew4bga