Parallel Query Support for Multidimensional Data: Inter-object Parallelism [chapter]

Karl Hahn, Bernd Reiner, Gabriele Höfling, Peter Baumann
2002 Lecture Notes in Computer Science  
This paper presents a number of new techniques for parallelizing queries in multidimensional array database management systems.  ...  It discusses their implementation in the RasDaMan DBMS, the first DBMS for generic multidimensional array data.  ...  In order to achieve performance improvements for a query which executes array operations on a single MDD, the concept described here is not suitable, as the granularity of our data parallelism is a complete  ... 
doi:10.1007/3-540-46146-9_81 fatcat:rrc7ebexvjdhrkmr6ka6k7w6tu

The multidimensional database system RasDaMan

P. Baumann, A. Dehmel, P. Furtado, R. Ritsch, N. Widmann
1998 SIGMOD record  
RasDaMan is a universal, i.e., domain-independent, array DBMS for multidimensional arrays of arbitrary size and structure.  ...  RasDaMan is being used in several international projects for the management of geo and healthcare data of various dimensionality.  ...  The operation set is based on RasDaMan Array Algebra [5] which allows for declarative expression of operations up to the complexity of the Discrete Fourier Transform.  ... 
doi:10.1145/276305.276386 fatcat:yyj6ntyogfdchbxhdissz76hgm

SAVIME: An Array DBMS for Simulation Analysis and ML Models Prediction

Hermano L. S. Lustosa, Anderson C. Silva, Daniel N. R. da Silva, Patrick Valduriez, Fabio Porto
2021 Journal of Information and Data Management  
In order to make them benefit from DBMS support, enabling declarative data analysis and visualization over scientific data, we present an in-memory array DBMS system called SAVIME.  ...  Our preliminary evaluation shows how SAVIME, by using a simple storage definition language (SDL), can outperform the state-of-the-art array database system, SciDB, during the process of data ingestion.  ...  ACKNOWLEDGMENT The authors would like to thank Yania Souto, for the ML models used on the experiments, as well as Brian Tsan and Florin Rusu from the University of California, Merced, for contributions  ... 
doi:10.5753/jidm.2020.2021 fatcat:mjxk4axcq5catfye6tincggafy

The multidimensional database system RasDaMan

P. Baumann, A. Dehmel, P. Furtado, R. Ritsch, N. Widmann
1998 Proceedings of the 1998 ACM SIGMOD international conference on Management of data - SIGMOD '98  
RasDaMan is a universal, i.e., domain-independent, array DBMS for multidimensional arrays of arbitrary size and structure.  ...  RasDaMan is being used in several international projects for the management of geo and healthcare data of various dimensionality.  ...  The operation set is based on RasDaMan Array Algebra [5] which allows for declarative expression of operations up to the complexity of the Discrete Fourier Transform.  ... 
doi:10.1145/276304.276386 dblp:conf/sigmod/BaumannDFRW98 fatcat:didyi5rbnvb4xpseu64kdip3ai

Hierarchical Storage Support and Management for Large-Scale Multidimensional Array Database Management Systems [chapter]

Bernd Reiner, Karl Hahn, Gabriele Höfling, Peter Baumann
2002 Lecture Notes in Computer Science  
So tertiary storage is only insufficiently supported for storing or retrieving multidimensional array data.  ...  We introduce concepts for efficient hierarchical storage support and management for large-scale multidimensional array database management systems and their integration into the commercial array database  ...  In section 3 we present a concept for efficient storage of multidimensional data. Section 4 will have a focus on the tertiary storage management and support for large MDDs.  ... 
doi:10.1007/3-540-46146-9_68 fatcat:vxdo3xnd7ngzzgnfvdscyzot64

Database system support of simulation data

Hermano Lustosa, Fabio Porto, Patrick Valduriez, Pablo Blanco
2016 Proceedings of the VLDB Endowment  
In this paper, we investigate techniques for managing such data using an array DBMS.  ...  The results indicate that multidimensional arrays and column-stores are much faster than a traditional row-store system for queries over a larger amount of simulation data.  ...  Valduriez) in the context of the Computational Biology Institute (www.ibc-montpellier.fr) and for (F. Porto, H. Lustosa and P. Blanco) in the context of the INCT-MACC project (http://macc.lncc.br).  ... 
doi:10.14778/3007263.3007271 fatcat:gojmoqcesneajliwxab4i3v624

SAVIME: A Multidimensional System for the Analysis and Visualization of Simulation Data [article]

Hermano Lustosa, Fabio Porto
2019 arXiv   pre-print
In order to make simulation applications benefit from DBMS support, the author proposes the development of a system called SAVIME in the context of his PhD thesis.  ...  SAVIME is an array database system designed to manage numerical simulation data. In this document, the author presents all work conducted so far and the current state of development.  ...  Since that time, a myriad of systems emerged in order to allow for the storage and analysis of data over multidimensional arrays, among them are: • Titan [15] is a parallel DBMS designed to deal with  ... 
arXiv:1903.02949v2 fatcat:d74sgpvysjg2fcueqencijjcna

Multidimensional Arrays for Warehousing Data on Clouds [chapter]

Laurent d'Orazio, Sandro Bimonte
2010 Data Management in Grid and Peer-to-Peer Systems  
In this paper we address the pay-as-you-go rules for warehousing data storage. We propose to use the multidimensional arrays storage techniques for clouds. First experiments validate our proposal.  ...  Cloud computing, driven by ICT majors like Google, Microsoft and Amazon, has recently attracted attention. OLAP querying and data warehousing in such a context is a major issue.  ...  Acknowledgment Thanks to Boussad Mebarki, Ilyas Brahmia, Abdelaziz Merabet, in addition to the APIS team of the LIMOS laboratory and the COPAIN team from the Cemagref for useful discussions on datawarehouses  ... 
doi:10.1007/978-3-642-15108-8_3 dblp:conf/globe/dOrazioB10 fatcat:iu2sa5udbjgobja7btqeco7mq4

Optimized management of large-scale data sets stored on tertiary storage systems

B. Reiner, K. Hahn
2004 IEEE Distributed Systems Online  
As part of the Estedi project, we developed a system that extends RasDaMan (Raster Data Management), the first commercial multidimensional-array database management system (DBMS).  ...  Estedi, an initiative of European database developers, software vendors, and supercomputing centers, seeks to provide a solution for the storage and retrieval of multidimensional HPC array data.  ...  First, performance measurements prove the validity of our concept. On a two-processor machine, speed increased by a factor of up to 1.8, which we consider an extremely good result.  ... 
doi:10.1109/mdso.2004.5 fatcat:7zrsrhpnmra7ff5u6agdn4citm

Managing Large Multidimensional Array Hydrologic Datasets: A Case Study Comparing NetCDF and SciDB

Haicheng Liu, Peter van Oosterom, Chengfang Hu, Wen Wang
2016 Procedia Engineering  
In this research, NetCDF file based solutions and a multidimensional (MD) array database management system (DBMS) applying chunked storage structure are benchmarked to determine the best solution for storing  ...  The research illustrates that for big hydrologic array data management, the properly chunked NetCDF-4 solution without compression is in general more efficient than the SciDB DBMS.  ...  . , the Specialized Research Fund for the Doctoral Program of Higher Education (20130094110007), the 111 Project (B08048) and National Natural Science Foundation of China (Grant No.41301435) are gratefully  ... 
doi:10.1016/j.proeng.2016.07.449 fatcat:tfqpo3cskbbbnnlu4dx5duyahq

Fast UDFs to compute sufficient statistics on large data sets exploiting caching and sampling

Carlos Ordonez, Sasi K. Pitchaimalai
2010 Data & Knowledge Engineering  
We present an aggregate UDF computing multidimensional sufficient statistics that benefit a broad array of statistical models: the linear sum of points and the quadratic sum of crossproducts of point dimensions  ...  A profile of UDF run-time execution shows the UDF is slowed down by I/O when reading from disk.  ...  However, since SQL does not provide advanced manipulation of multidimensional arrays, matrix operations can be difficult to express as efficient SQL queries, especially if joins are required.  ... 
doi:10.1016/j.datak.2009.12.001 fatcat:pzmabknqtnem7o7cmheyqrhgy4

SAVIME: A Database Management System for Simulation Data Analysis and Visualization

Hermano Lustosa, Fábio Porto, Patrick Valduriez
2019 Anais do Simpósio Brasileiro de Banco de Dados (SBBD)  
In order to make scientific applications benefit from DBMS support, enabling declarative data analysis and visualization over scientific data, we present an in-memory array DBMS system called SAVIME.  ...  Our preliminary evaluation shows how SAVIME, by using a simple storage definition language (SDL) can outperform the state-of-the-art array database system, SciDB, during the process of data ingestion.  ...  In addition, results also show that it is possible to retrieve SAVIME data and generate viz files efficiently by using the special purpose visualization operator.  ... 
doi:10.5753/sbbd.2019.8810 dblp:conf/sbbd/LustosaPV19 fatcat:27jltj2iuncvhktfyxivcuysly

Vector and matrix operations programmed with UDFs in a relational DBMS

Carlos Ordonez, Javier García-García
2006 Proceedings of the 15th ACM international conference on Information and knowledge management - CIKM '06  
A UDF allows fast evaluation of arithmetic expressions, memory manipulation, using multidimensional arrays and exploiting all C language control statements.  ...  In this work, we study how to extend a DBMS with basic vector and matrix operators by programming User-Defined Functions (UDFs).  ...  efficient algorithms assuming the data set is in a flat file outside the DBMS.  ... 
doi:10.1145/1183614.1183687 dblp:conf/cikm/OrdonezG06 fatcat:uql2by2an5c3vmgjv6f4arykki

Scientific Analysis by Queries in Extended SPARQL over a Scalable e-Science Data Store

Andrej Andrejev, Salman Toor, Andreas Hellander, Sverker Holmgren, Tore Risch
2013 2013 IEEE 9th International Conference on e-Science  
To address the scalability problem we present an architecture that enables the same SciSPARQL queries to be executed on the RDF dataset whether it is stored in a relational DBMS or mapped over a specialized  ...  Data-intensive applications in e-Science require scalable solutions for storage as well as interactive tools for analysis of scientific data.  ...  ACKNOWLEDGMENT This project is supported by eSSENCE and the Swedish Foundation for Strategic Research under grant RIT08-0041, (U.S.) Department of Energy (DOE) Award No.  ... 
doi:10.1109/escience.2013.19 dblp:conf/eScience/AndrejevTHHR13 fatcat:4mk4u4vyyrcqxiungjbfqnv67y

Cubrick

Pedro Pedreira, Chris Croswhite, Luis Bona
2016 Proceedings of the VLDB Endowment  
This paper describes the architecture and design of Cubrick, a distributed multidimensional in-memory DBMS suited for interactive analytics over highly dynamic datasets.  ...  Cubrick has a strictly multidimensional data model composed of cubes, dimensions and metrics, supporting sub-second OLAP operations such as slice and dice, roll-up and drill-down over terabytes of data  ...  In this paper we present Cubrick, a distributed in-memory multidimensional DBMS we have developed from scratch at Facebook, capable of executing indexed OLAP operations such as slice-n-dice, roll-ups and  ... 
doi:10.14778/3007263.3007269 fatcat:krf5qjcnjrg47m34gvb53apum4