805 Hits in 6.0 sec

Lake Data Warehouse Architecture for Big Data Solutions

Emad Saddad, Ali El-Bastawissy, Hoda M., Maryam Hazman
2020 International Journal of Advanced Computer Science and Applications  
The new architecture also needs to handle existing drawbacks, including availability, scalability, and consequently query performance.  ...  Lake Data Warehouse Architecture depends on merging the traditional Data Warehouse architecture with big data technologies, like the Hadoop framework and Apache Spark.  ...  Delta tables also built on Object Storage.  ... 
doi:10.14569/ijacsa.2020.0110854 fatcat:mqcz3xssx5bh3fohmvthcedq6u

Analysis of Big Data Storage Tools for Data Lakes based on Apache Hadoop Platform

Vladimir Belov, Evgeny Nikulchev
2021 International Journal of Advanced Computer Science and Applications  
When developing large data processing systems, the question of data storage arises. One of the modern tools for solving this problem is the so-called data lakes.  ...  Many implementations of data lakes use Apache Hadoop as a basic platform.  ...  Delta Lake is characterized by the following properties:  Support for ACID transactions.  ... 
doi:10.14569/ijacsa.2021.0120864 fatcat:jccyh73rlzc5ziccegetlyhi3a

Terrestrial CDOM in Lakes of Yamal Peninsula: Connection to Lake and Lake Catchment Properties

Yury Dvornikov, Marina Leibman, Birgit Heim, Annett Bartsch, Ulrike Herzschuh, Tatiana Skorospekhova, Irina Fedorova, Artem Khomutov, Barbara Widhalm, Anatoly Gubarkov, Sebastian Rößler
2018 Remote Sensing  
Applying a CDOM algorithm (ratio of green and red band reflectance) for two high spatial resolution multispectral GeoEye-1 and Worldview-2 satellite images, we were able to extrapolate the a(λ) CDOM data  ...  from 18 lakes sampled in the field to 356 lakes in the study area (model R 2 = 0.79).  ...  T.S. performed the laboratory measurements; Y.D. and U.H. performed statistical processing and data analysis; A.B. and B.W. provided digital elevation model and performed a processing of radar satellite  ... 
doi:10.3390/rs10020167 fatcat:evzbjqtopve3xndter4yrsgznq

Size Distribution, Surface Coverage, Water, Carbon, and Metal Storage of Thermokarst Lakes in the Permafrost Zone of the Western Siberia Lowland

Yury Polishchuk, Alexander Bogdanov, Vladimir Polishchuk, Rinat Manasypov, Liudmila Shirokova, Sergey Kirpotin, Oleg Pokrovsky
2017 Water  
We quantified the abundance of thermokarst lakes in the continuous, discontinuous, and sporadic permafrost zones of the western Siberian Lowland (WSL) using Landsat-8 scenes collected over the summers  ...  As such, observations at high spatial resolution (<0.5 ha) are needed to constrain the reservoirs and the mobility of carbon and metals in aquatic systems.  ...  The second objective of this work was to assess the water, carbon, and metal storage in thermokarst lakes.  ... 
doi:10.3390/w9030228 fatcat:7vj4xa52zff6zpl4yhs2t3y2x4

On the Logical Design of a Prototypical Data Lake System for Biological Resources

Haoyang Che, Yucong Duan
2020 Frontiers in Bioengineering and Biotechnology  
As an effective complement to those previous systems, data lakes were devised to store voluminous, varied, and diversely structured or unstructured data in their native formats, for the sake of various  ...  The dual mechanism can ensure the explainability guarantees on the entirety of the data lake system.  ...  Raw data is always discarded or stored in a NAS/SAN/Cloud storage area. TDW and DL transfers data back and forth; sometimes, DL can serve as a staging area for TDW, and vice versa.  ... 
doi:10.3389/fbioe.2020.553904 pmid:33117777 pmcid:PMC7552915 fatcat:fpizpjiahrc7tdkze5mkw6syzi

Regional review: the hydrology of the Okavango Delta, Botswana—processes, data and modelling

Christian Milzow, Lesego Kgotlhang, Peter Bauer-Gottwein, Philipp Meier, Wolfgang Kinzelbach
2009 Hydrogeology Journal  
To predict these impacts, the hydrology of the Delta has to be understood.  ...  The wetlands of the Okavango Delta accommodate a multitude of ecosystems with a large diversity in fauna and flora.  ...  It is, however, remarkable and speaks for the quality of the model that it performs well in Milzow et al. 2008b) simulating the flooding of Lake Ngami, a lake at the southwestern end of the wetlands  ... 
doi:10.1007/s10040-009-0436-0 fatcat:r7whogthxveqnlif7j2x2ajjce

The impact of conifer plantation forestry on the Chydoridae (Cladocera) communities of peatland lakes

T. J. Drinan, C. T. Graham, J. O'Halloran, S. S. C. Harrison
2012 Hydrobiologia  
lake invertebrate communities.  ...  excisa and Alonella exigua, in the plantation forestry-affected lakes, consistent with a shift in lake trophy.  ...  Fishless p High Fish Table 6 . 6 2).  ... 
doi:10.1007/s10750-012-1230-x fatcat:kxu63joxdjdifdpyp37qt5qwty

Fusion insight librA

Le Cai, Jacques Hebert, Kamini Jagtiani, Suzhen Lin, Ye Liu, Demai Ni, Chunfeng Pei, Jason Sun, Yongyan Wang, Li Zhang, Mingyi Zhang, Jianjun Chen (+8 others)
2018 Proceedings of the VLDB Endowment  
In particular, we focus on top four requirements from our customers related to data analytics on the cloud: system availability, auto tuning, query over heterogeneous data models on the cloud, and the  ...  ability to utilize powerful modern hardware for good performance.  ...  First, cloud storage service such as AWS S3, Microsoft Azure Storage, as well as Huawei cloud's OBS (Object Block Storage) have been widely used.  ... 
doi:10.14778/3229863.3229870 fatcat:zwxgz2se5fcehmnvyliwmktoey

Mainlining Databases: Supporting Fast Transactional Workloads on Universal Columnar Data File Formats [article]

Tianyu Li, Matthew Butrovich, Amadou Ngom, Wan Shen Lim, Wes McKinney, Andrew Pavlo
2020 arXiv   pre-print
storage blocks in a universal open-source format.  ...  Our experiments show that our approach achieves comparable performance with dedicated OLTP DBMSs while enabling orders-of-magnitude faster data exports to external data science and machine learning tools  ...  Related to ORC is Databricks' Delta Lake engine [11] that acts as a ACID transactional engine on top of cloud storage.  ... 
arXiv:2004.14471v1 fatcat:cf3fma5wlbamzmxzvvgqg5himi

Mechanisms regulating CO2 and CH4 dynamics in the Azorean volcanic lakes (São Miguel Island, Portugal)

Franco Tassi, Jacopo Cabassi, Cesar Andrade, Cristiana Callieri, Catarina Silva, Fatima Viveiros, Gianluca Corno, Orlando Vaselli, Enrico Selmo, Andrea Gallorini, Andrea Ricci, Luciano Giannini (+1 others)
2018 Journal of Limnology  
The seasonal thermal stratification favored the development of anaerobic hypolimnia, showing relatively high concentrations of NH4+, NO3-, P and other minor species (Fe, Mn, Zn, As) controlled by microbial  ...  activity and minerogenetic processes occurring within the lake sediments.  ...  (Tabl. 2).  ... 
doi:10.4081/jlimnol.2018.1821 fatcat:a5jjb7gbmbag3ixzzwhsgjaoey

Elemental, isotopic, and structural changes in Tagish Lake insoluble organic matter produced by parent body processes

C. M. O'D. Alexander, G. D. Cody, Y. Kebukawa, R. Bowden, M. L. Fogel, A. L. D. Kilcoyne, L. R. Nittler, C. D. K. Herd
2014 Meteoritics and Planetary Science  
Here, we present the results of a multitechnique study of the bulk properties of insoluble organic material (IOM) from the Tagish Lake meteorite, including four lithologies that have undergone different  ...  Such integrations were performed for both the 1 H and the 13 C spectra using the spectral ranges defined above (Table 3) . Table 2 .  ...  Thus, it appears reasonable to assume that the trend in Tagish Lake IOM molecular evolution is from high H/C, high dD, and high aliphatic content to low H/C, low dD, and high aromatic content.  ... 
doi:10.1111/maps.12282 fatcat:3jb4fk47jfg23dmtd4spepcasu

Recent Advances in Data Engineering for Networking

Engin Zeydan, Josep Mangues-Bafalluy
2022 IEEE Access  
Delta Lake † (from Databricks) is an open-source storage layer.  ...  GCP also offers several storage options, which are as follows: Cloud Storage (a service for storing objects, i.e., immutable data), Cloud Spanner (NewSQL database with unlimited scale, strong consistency  ... 
doi:10.1109/access.2022.3162863 fatcat:jqpp6dyk3jf3tnzovplmlkoewu

Data Recovery at Justiceburg Reservior (Lake Alan Henry), Garza and Kent Counties, Texas: Phase III, Season 1

1992 Index of Texas Archaeology Open Access Grey Literature from the Lone Star State  
Phase III data recovery investigations at one historic and three prehistoric sites, augmented by additional survey and off-site geological investigations, were conducted at Lake Alan Henry (formerly Justiceburg  ...  Lacking high-yield resources that could have provided surplus, food storage was probably minimal and stores may have lasted only short times.  ...  AII of these objectives require archeological or paleoenvironmental samples from good geological contexts and high stratigraphic integrity.  ... 
doi:10.21112/ita.1992.1.13 fatcat:xkh3y5rfrfecvejklqw5u4253a

Integration of large-scale data processing systems and traditional parallel database technology

Azza Abouzied, Daniel J. Abadi, Kamil Bajda-Pawlikowski, Avi Silberschatz
2019 Proceedings of the VLDB Endowment  
We built a prototype, HadoopDB, and demonstrated that it can deliver the high SQL query performance and efficiency of parallel database management systems while still providing the scalability, fault tolerance  ...  More recently, Databricks open-sourced Delta [6] , a transactional table storage for Spark built on top of Parquet.  ...  Presto's inherent separation of compute and storage makes Presto well-suited for deployments in the cloud and in cloud-like environments such as Kubernetes.  ... 
doi:10.14778/3352063.3352145 fatcat:qnwfplmf3jgodaw7tsu3kwjsnq


Boyan Kolev, Jose María Zaragoza, Patricio Martinez, Luis Miguel Garcia, Ofer Biran, Kostas Moutselos, Marios Koniaris, Christos Pavlatos, Aggelos Kolaitis, María Ángeles Sanguino, Jorge Montero, Ana Luiza Pontual (+22 others)
2021 Zenodo  
Moreover, a bottom-up approach has been applied, whose objective is to additionally complement and ident [...]  ...  It is the third and last version of the series of deliverables of this task, whose main objective is to specify the use case scenarios, their involved datasets and their relevant user requirements, as  ...  The major ones today are Delta Lake 21 , Apache Iceberg 22 and Apache Hudi.  ... 
doi:10.5281/zenodo.6043823 fatcat:fuf7egsgfvea3ky5it2wpmt4ha
« Previous Showing results 1 — 15 out of 805 results