Evaluating the Open Source Data Containers for Handling Big Geospatial Raster Data

Fei Hu, Mengchao Xu, Jingchao Yang, Yanshou Liang, Kejin Cui, Michael M. Little, Christopher S. Lynnes, Daniel Q. Duffy, Chaowei Yang
2018 ISPRS International Journal of Geo-Information  
well, whereas Spark and ClimateSpark can handle large volumes of data with stable resource consumption; (3) SciDB and Rasdaman provide mature array-based data operation and analytical functions, while ... Big geospatial raster data pose a grand challenge to data management technologies for effective big data query and processing. ... For example, SciHive extends Hive to implement a scalable, array-based query system to process raw array datasets in parallel with a SQL-like query language [23]. ...
doi:10.3390/ijgi7040144 fatcat:csbbnucfbzd2za4ghkqnyclihm

Translation of Array-Based Loops to Distributed Data-Parallel Programs [article]

Leonidas Fegaras, Md Hasanuzzaman Noor
2020 arXiv pre-print
Scientists, who are typically comfortable with numerical analysis tools but are not familiar with the intricacies of Big Data analytics, must now learn to convert their loop-based programs to distributed ... imperative, loop-based language. ... Sci-Hive [25] is a scalable array-based query system that enables scientists to process raw array datasets in parallel with a SQL-like query language. ...
arXiv:2003.09769v1 fatcat:enwokbtxrravrj32szinem7kc4