Parallel Query Support for Multidimensional Data: Inter-object Parallelism
[chapter]
2002
Lecture Notes in Computer Science
This paper presents a number of new techniques for parallelizing queries in multidimensional array database management systems. ...
It discusses their implementation in the RasDaMan DBMS, the first DBMS for generic multidimensional array data. ...
In order to achieve performance improvements for a query which executes array operations on a single MDD, the concept described here is not suitable, as the granularity of our data parallelism is a complete ...
doi:10.1007/3-540-46146-9_81
fatcat:rrc7ebexvjdhrkmr6ka6k7w6tu
The multidimensional database system RasDaMan
1998
SIGMOD record
RasDaMan is a universal (i.e., domain-independent) array DBMS for multidimensional arrays of arbitrary size and structure. ...
RasDaMan is being used in several international projects for the management of geo and healthcare data of various dimensionality. ...
The operation set is based on RasDaMan Array Algebra [5] which allows for declarative expression of operations up to the complexity of the Discrete Fourier Transform. ...
doi:10.1145/276305.276386
fatcat:yyj6ntyogfdchbxhdissz76hgm
SAVIME: An Array DBMS for Simulation Analysis and ML Models Prediction
2021
Journal of Information and Data Management
In order to make them benefit from DBMS support, enabling declarative data analysis and visualization over scientific data, we present an in-memory array DBMS system called SAVIME. ...
Our preliminary evaluation shows how SAVIME, by using a simple storage definition language (SDL), can outperform the state-of-the-art array database system, SciDB, during the process of data ingestion. ...
ACKNOWLEDGMENT The authors would like to thank Yania Souto, for the ML models used on the experiments, as well as Brian Tsan and Florin Rusu from the University of California, Merced, for contributions ...
doi:10.5753/jidm.2020.2021
fatcat:mjxk4axcq5catfye6tincggafy
The multidimensional database system RasDaMan
1998
Proceedings of the 1998 ACM SIGMOD international conference on Management of data - SIGMOD '98
RasDaMan is a universal (i.e., domain-independent) array DBMS for multidimensional arrays of arbitrary size and structure. ...
RasDaMan is being used in several international projects for the management of geo and healthcare data of various dimensionality. ...
The operation set is based on RasDaMan Array Algebra [5] which allows for declarative expression of operations up to the complexity of the Discrete Fourier Transform. ...
doi:10.1145/276304.276386
dblp:conf/sigmod/BaumannDFRW98
fatcat:didyi5rbnvb4xpseu64kdip3ai
Hierarchical Storage Support and Management for Large-Scale Multidimensional Array Database Management Systems
[chapter]
2002
Lecture Notes in Computer Science
Tertiary storage is thus only insufficiently supported for storing or retrieving multidimensional array data. ...
We introduce concepts for efficient hierarchical storage support and management for large-scale multidimensional array database management systems and their integration into the commercial array database ...
In section 3 we present a concept for efficient storage of multidimensional data. Section 4 will have a focus on the tertiary storage management and support for large MDDs. ...
doi:10.1007/3-540-46146-9_68
fatcat:vxdo3xnd7ngzzgnfvdscyzot64
Database system support of simulation data
2016
Proceedings of the VLDB Endowment
In this paper, we investigate techniques for managing such data using an array DBMS. ...
The results indicate that multidimensional arrays and column-stores are much faster than a traditional row-store system for queries over a larger amount of simulation data. ...
Valduriez) in the context of the Computational Biology Institute (www.ibc-montpellier.fr) and for (F. Porto, H. Lustosa and P. Blanco) in the context of the INCT-MACC project (http://macc.lncc.br). ...
doi:10.14778/3007263.3007271
fatcat:gojmoqcesneajliwxab4i3v624
SAVIME: A Multidimensional System for the Analysis and Visualization of Simulation Data
[article]
2019
arXiv
pre-print
In order to make simulation applications benefit from DBMS support, the author proposes the development of a system called SAVIME in the context of his PhD thesis. ...
SAVIME is an array database system designed to manage numerical simulation data. In this document, the author presents all work conducted so far and the current state of development. ...
Since that time, a myriad of systems emerged in order to allow for the storage and analysis of data over multidimensional arrays, among them are: • Titan [15] is a parallel DBMS designed to deal with ...
arXiv:1903.02949v2
fatcat:d74sgpvysjg2fcueqencijjcna
Multidimensional Arrays for Warehousing Data on Clouds
[chapter]
2010
Data Management in Grid and Peer-to-Peer Systems
In this paper we address the pay-as-you-go rules for warehousing data storage. We propose using multidimensional array storage techniques for clouds. First experiments validate our proposal. ...
Cloud computing, driven by major ICT companies such as Google, Microsoft and Amazon, has recently attracted considerable attention. OLAP querying and data warehousing in such a context is a major issue. ...
Acknowledgment Thanks to Boussad Mebarki, Ilyas Brahmia, Abdelaziz Merabet, in addition to the APIS team of the LIMOS laboratory and the COPAIN team from the Cemagref for useful discussions on data warehouses ...
doi:10.1007/978-3-642-15108-8_3
dblp:conf/globe/dOrazioB10
fatcat:iu2sa5udbjgobja7btqeco7mq4
Optimized management of large-scale data sets stored on tertiary storage systems
2004
IEEE Distributed Systems Online
As part of the Estedi project, we developed a system that extends RasDaMan (Raster Data Management), the first commercial multidimensional-array database management system (DBMS). ...
Estedi, an initiative of European database developers, software vendors, and supercomputing centers, seeks to provide a solution for the storage and retrieval of multidimensional HPC array data. ...
First, performance measurements prove the validity of our concept. On a two-processor machine, speed increased by a factor of up to 1.8, which we consider an extremely good result. ...
doi:10.1109/mdso.2004.5
fatcat:7zrsrhpnmra7ff5u6agdn4citm
Managing Large Multidimensional Array Hydrologic Datasets: A Case Study Comparing NetCDF and SciDB
2016
Procedia Engineering
In this research, NetCDF file-based solutions and a multidimensional (MD) array database management system (DBMS) applying a chunked storage structure are benchmarked to determine the best solution for storing ...
The research illustrates that for big hydrologic array data management, the properly chunked NetCDF-4 solution without compression is in general more efficient than the SciDB DBMS. ...
... the Specialized Research Fund for the Doctoral Program of Higher Education (20130094110007), the 111 Project (B08048) and National Natural Science Foundation of China (Grant No. 41301435) are gratefully ...
doi:10.1016/j.proeng.2016.07.449
fatcat:tfqpo3cskbbbnnlu4dx5duyahq
Fast UDFs to compute sufficient statistics on large data sets exploiting caching and sampling
2010
Data & Knowledge Engineering
We present an aggregate UDF computing multidimensional sufficient statistics that benefit a broad array of statistical models: the linear sum of points and the quadratic sum of crossproducts of point dimensions ...
A profile of UDF run-time execution shows the UDF is slowed down by I/O when reading from disk. ...
However, since SQL does not provide advanced manipulation of multidimensional arrays, matrix operations can be difficult to express as efficient SQL queries, especially if joins are required. ...
doi:10.1016/j.datak.2009.12.001
fatcat:pzmabknqtnem7o7cmheyqrhgy4
SAVIME: A Database Management System for Simulation Data Analysis and Visualization
2019
Anais do Simpósio Brasileiro de Banco de Dados (SBBD)
In order to make scientific applications benefit from DBMS support, enabling declarative data analysis and visualization over scientific data, we present an in-memory array DBMS system called SAVIME. ...
Our preliminary evaluation shows how SAVIME, by using a simple storage definition language (SDL), can outperform the state-of-the-art array database system, SciDB, during the process of data ingestion. ...
In addition, results also show that it is possible to retrieve SAVIME data and generate viz files efficiently by using the special purpose visualization operator. ...
doi:10.5753/sbbd.2019.8810
dblp:conf/sbbd/LustosaPV19
fatcat:27jltj2iuncvhktfyxivcuysly
Vector and matrix operations programmed with UDFs in a relational DBMS
2006
Proceedings of the 15th ACM international conference on Information and knowledge management - CIKM '06
A UDF allows fast evaluation of arithmetic expressions, memory manipulation, using multidimensional arrays and exploiting all C language control statements. ...
In this work, we study how to extend a DBMS with basic vector and matrix operators by programming User-Defined Functions (UDFs). ...
efficient algorithms assuming the data set is in a flat file outside the DBMS. ...
doi:10.1145/1183614.1183687
dblp:conf/cikm/OrdonezG06
fatcat:uql2by2an5c3vmgjv6f4arykki
Scientific Analysis by Queries in Extended SPARQL over a Scalable e-Science Data Store
2013
2013 IEEE 9th International Conference on e-Science
To address the scalability problem we present an architecture that enables the same SciSPARQL queries to be executed on the RDF dataset whether it is stored in a relational DBMS or mapped over a specialized ...
Data-intensive applications in e-Science require scalable solutions for storage as well as interactive tools for analysis of scientific data. ...
ACKNOWLEDGMENT This project is supported by eSSENCE and the Swedish Foundation for Strategic Research under grant RIT08-0041, (U.S.) Department of Energy (DOE) Award No. ...
doi:10.1109/escience.2013.19
dblp:conf/eScience/AndrejevTHHR13
fatcat:4mk4u4vyyrcqxiungjbfqnv67y
Cubrick
2016
Proceedings of the VLDB Endowment
This paper describes the architecture and design of Cubrick, a distributed multidimensional in-memory DBMS suited for interactive analytics over highly dynamic datasets. ...
Cubrick has a strictly multidimensional data model composed of cubes, dimensions and metrics, supporting sub-second OLAP operations such as slice and dice, roll-up and drill-down over terabytes of data ...
In this paper we present Cubrick, a distributed in-memory multidimensional DBMS we have developed from scratch at Facebook, capable of executing indexed OLAP operations such as slice-n-dice, roll-ups and ...
doi:10.14778/3007263.3007269
fatcat:krf5qjcnjrg47m34gvb53apum4