Indexing Incomplete Databases [chapter]

Guadalupe Canahuate, Michael Gibas, Hakan Ferhatosmanoglu
2006 Lecture Notes in Computer Science  
Incomplete databases, that is, databases that are missing data, are present in many research domains. It is important to derive techniques to access these databases efficiently. We first show that known indexing techniques for multi-dimensional data search break down in terms of performance when indexed attributes contain missing data. This paper utilizes two popularly employed indexing techniques, bitmaps and quantization, to correctly and efficiently answer queries in the presence of missing
more » ... ata. Query execution and interval evaluation are formalized for the indexing structures based on whether missing data is considered to be a query match or not. The performance of Bitmap indexes and quantization based indexes is evaluated and compared over a variety of analysis parameters for real and synthetic data sets. Insights into the conditions for which to use each technique are provided.
doi:10.1007/11687238_52 fatcat:4xbwlt5pwvgnxcch74f73ugvly