GeoLens: Enabling Interactive Visual Analytics over Large-Scale, Multidimensional Geospatial Datasets

Jared Koontz, Matthew Malensek, Sangmi Pallickara
2014 2014 IEEE/ACM International Symposium on Big Data Computing  
GEOLENS: ENABLING INTERACTIVE VISUAL ANALYTICS OVER LARGE-SCALE, MULTIDIMENSIONAL GEOSPATIAL DATASETS With the rapid increase of scientific data volumes, interactive tools that enable effective visual representation for scientists are needed. This is critical when scientists are manipulating voluminous datasets and especially when they need to explore datasets interactively to develop their hypotheses. In this paper, we present an interactive visual analytics framework, GeoLens. GeoLens
more » ... fast and expressive interactions with voluminous geospatial datasets. We provide an expressive visual query evaluation scheme to support advanced interactive visual analytics technique, such as brushing and linking. To achieve this, we designed and developed the geohash based image tile generation algorithm that automatically adjusts the range of data to access based on the minimum acceptable size of the image tile. In addition, we have also designed an autonomous histogram generation algorithm that generates histograms of user-defined data subsets that do not have pre-computed data properties. Using our approach, applications can generate histograms of datasets containing millions of data points with sub-second latency. The work builds on our visual query coordinating scheme that evaluates geospatial query and orchestrates data aggregation in a distributed storage environment while preserving data locality and minimizing data movements. This paper includes empirical benchmarks of our framework encompassing a billion-file dataset published by the National Climactic Data Center.
doi:10.1109/bdc.2014.12 dblp:conf/bdc/KoontzMP14 fatcat:h7bhew75pvewzi7uritnmyoqwe