513 Hits in 4.7 sec

Efficient estimation of joint queries from multiple OLAP databases

Elaheh Pourabbas, Arie Shoshani
2007 ACM Transactions on Database Systems  
Given an OLAP query expressed over multiple source OLAP databases, we study the problem of estimating the result OLAP target database.  ...  The problem arises when it is not possible to derive the result from a single database. The method we use is linear indirect estimation, commonly used for statistical estimation.  ...  They are used to estimate the target database Income(State,Sex).  ... 
doi:10.1145/1206049.1206051 fatcat:lvfoxkhj6vht7hjkfjhpemdat4

Online Aggregation based Approximate Query Processing: A Literature Survey [article]

Pritom Saha Akash, Wei-Cheng Lai, Po-Wen Lin
2022 arXiv   pre-print
An approximate query process (AQP) was proposed to efficiently compute approximate values as close as to the exact answer.  ...  Online aggregation-based AQP progressively generates approximate results with some error estimates (i.e., confidence interval) until the processing of all data is done.  ...  Different from DBL, which learns from queries, an approach called DeepDB [17] was proposed as a data-driven method (not from queries) for the database components learning.  ... 
arXiv:2204.07125v1 fatcat:tdey5uh3szcjhbc3pskwcbf2wa


Chaoqun Zhan, Fang Zheng, Chengliang Chai, Maomeng Su, Chuangxian Wei, Xiaoqiang Peng, Liang Lin, Sheng Wang, Zhe Chen, Feifei Li, Yue Pan
2019 Proceedings of the VLDB Endowment  
With data explosion in scale and variety, OLAP databases play an increasingly important role in serving real-time analysis with low latency (e.g., hundreds of milliseconds), especially when incoming queries  ...  At the same time, it is able to serve 10m+ writes and 100k+ queries per second, while completing complex queries within hundreds of milliseconds. PVLDB Reference Format:  ...  To improve the efficiency of analytical queries, many OLAP databases like Vertica [29] , Teradata DB [10] and Greenplum [5] have been developed.  ... 
doi:10.14778/3352063.3352124 fatcat:u2oa2bbhqbgbfh5iqe5upraf4u

UnifyDR: A Generic Framework for Unifying Data and Replica Placement

Ankita Atrey, Gregory Van Seghbroeck, Higinio Mora, Bruno Volckaert, Filip De Turck
2020 IEEE Access  
The effectiveness and scalability of UnifyDR are showcased by experiments performed on data generated using the TPC-DS benchmark and a trace of the Gowalla OSN for the OLAP queries and OSN service use-case  ...  We establish the generic nature of UnifyDR by portraying its ability to address the CDR problem in two real-world use-cases, that of join-intensive online analytical processing (OLAP) queries and a location-based  ...  Since data warehouses are usually stored in a distributed manner across multiple nodes, successful execution of OLAP queries requires internode transfer of database tables.  ... 
doi:10.1109/access.2020.3041670 fatcat:pz6i555firfobjgm2uedfwzhcu

Spatial Datawarehousing [chapter]

Alejandro A. Vaisman, Esteban Zimányi
2017 Encyclopedia of Database Systems  
techniques for non-standard data, efficient algorithms to compute aggregate queries, and new, application-specific index structures.  ...  in corporate as well as scientific databases.  ...  Currently, OLAP and standard database querying tools provide a-posteriori knowledge on the contents of the database.  ... 
doi:10.1007/978-1-4899-7993-3_80810-1 fatcat:7yo63qvmhfadzn5vwn2x7bizku


Qingwei Lin, Weichen Ke, Jian-Guang Lou, Hongyu Zhang, Kaixin Sui, Yong Xu, Ziyi Zhou, Bo Qiao, Dongmei Zhang
2018 Proceedings of the 24th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining - KDD '18  
In this paper, we present BigIN4, a system for instant, interactive identification of insights from multi-dimensional big data.  ...  To enable interactive identification of insights, a large number of dimension combinations need to be searched and a series of aggregation queries need to be quickly answered.  ...  We also thank all the members of Software Analytics team at MSRA for the discussions.  ... 
doi:10.1145/3219819.3219867 dblp:conf/kdd/LinKLZSXZQZ18 fatcat:ynpplolpuvakdgorajqizh5yuq

A computational algorithm for the risk assessment of developing acute coronary syndromes, using online analytical process methodology

Hara Kostakis, Basilis Boutsinas, Demosthenes B. Panagiotakos, Leo D. Kounis
2009 International Journal of Knowledge Engineering and Soft Data Paradigms  
This paper investigates patterns in cardiovascular risk factors from a large population sample of cardiac patients and their matched controls.  ...  OLAP is a new method that is used to explore the role of several risk factors in cardiovascular disease risk assessment.  ...  A model for estimating the global risk of developing ACS by using OLAP methodology is presented in this study.  ... 
doi:10.1504/ijkesdp.2009.021986 fatcat:pduvbzymnnh7fjk4abloe7abey

Towards a theory for privacy preserving distributed OLAP

Alfredo Cuzzocrea, Elisa Bertino, Domenico Saccà
2012 Proceedings of the 2012 Joint EDBT/ICDT Workshops on - EDBT-ICDT '12  
schemes and query optimization.  ...  Privacy Preserving Distributed OLAP identifies a collection of models, methodologies and algorithms devoted to ensuring the privacy of multidimensional OLAP data cubes in distributed environments.  ...  INTRODUCTION The issue of effectively and efficiently computing and managing privacy preserving OLAP data cubes [10, 8] has attracted the interest from a large community of Database and Data Warehousing  ... 
doi:10.1145/2320765.2320826 dblp:conf/edbt/CuzzocreaBS12 fatcat:fv2snlrnrjgszdbnysam64absy

Wavelet-based histograms for selectivity estimation

Yossi Matias, Jeffrey Scott Vitter, Min Wang
1998 SIGMOD record  
Given a query P , we need to estimate the fraction of records in the database that satisfy P .  ...  Query optimization is an integral part of relational database management systems. One important task in query optimization is selectivity estimation.  ...  Introduction Several important components in a database management system (DBMS) require accurate estimation of the selectivity of a given query.  ... 
doi:10.1145/276305.276344 fatcat:fwsu6vzthnhzpafa5jkhpmcdyy

Wavelet-based histograms for selectivity estimation

Yossi Matias, Jeffrey Scott Vitter, Min Wang
1998 Proceedings of the 1998 ACM SIGMOD international conference on Management of data - SIGMOD '98  
Given a query P , we need to estimate the fraction of records in the database that satisfy P .  ...  Query optimization is an integral part of relational database management systems. One important task in query optimization is selectivity estimation.  ...  Introduction Several important components in a database management system (DBMS) require accurate estimation of the selectivity of a given query.  ... 
doi:10.1145/276304.276344 dblp:conf/sigmod/MatiasVW98 fatcat:fhvxna7ka5goxgx3acoufuocdi

OLAP over uncertain and imprecise data

Doug Burdick, Prasad M. Deshpande, T. S. Jayram, Raghu Ramakrishnan, Shivakumar Vaithyanathan
2006 The VLDB journal  
We extend the OLAP data model to represent data ambiguity, specifically imprecision and uncertainty, and introduce an allocation-based approach to the semantics of aggregation queries over such data.  ...  While there is much work on representing and querying ambiguous data, to our knowledge this is the first paper to handle both imprecision and uncertainty in an OLAP setting.  ...  Given the efficiency and the desiderata aspects, and the small relative error (under reasonable conditions) for the alternative estimate, we propose using this estimate for answering queries.  ... 
doi:10.1007/s00778-006-0033-y fatcat:7oj26fci2vh2vkuukjldcyrc5e

Optimizing Analytical Queries over Semantic Web Sources [article]

Dilshod Ibragimov
2017 PhD series, Technical Faculty of IT and Design, ˜Aalborg=ålborgœ University  
data from multiple sources.  ...  C O is estimated based on queries that do not retrieve data from triple stores such as: SELECT(1 AS ?v){} or ASK{}. Multiple queries are executed to determine an average.  ... 
doi:10.5278/ fatcat:5d2lmnicazgmze2qhe5cikowwm

Mining Uncertain and Probabilistic Data: problems, Challenges, Methods, and Applications

Jian Pei, Ming Hua
2008 Proceeding of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining - KDD 08  
can a ranking query be answered efficiently?  ...  Uncertainty and Confidence • Uncertain data can provide probabilistic answers to aggregate questions -How can we estimate the percentage of married voters supporting Obama from survey data?  ... 
doi:10.1145/1401890.1551568 fatcat:lg2jfeisqfeeda5pbtzugy4jzu

Methods for Evaluating Iceberg Queries

A. Padmapriya, T. Shanmuga Priya
2013 International Journal of Computer Applications  
Iceberg queries are a special case of SQL queries involving GROUP BY and HAVING clauses, wherein the answer set is small relative to the database size.  ...  The iceberg refers to the input, and the tip of it refers to the output. This paper is going to present some of the existing iceberg query processing using data mining.  ...  Queries are performed on the cube to retrieve decision support information. Recently, [14] introduced the CUBE operator for conveniently supporting multiple aggregates in OLAP database.  ... 
doi:10.5120/11605-6971 fatcat:jjzf4we53fa57ajlfdflhitfty

Data Warehouse and Decision Support on Integrated Crop Big Data [article]

V.M. Ngo, N.A. Le-Khac, M.T. Kechadi
2020 arXiv   pre-print
We also evaluate the performance of ADW and present some complex queries to extract and return necessary knowledge about crop management.  ...  This has started a while ago (early 20th century) and it is driven by the low cost of collecting data about everything; from information on fields such as seed, soil, fertiliser, pest, to weather data,  ...  Acknowledgment This research is an extended work of Ngo and et al. (2019) being part of the CONSUS research program.  ... 
arXiv:2003.04470v1 fatcat:isvk4u6w7nfobprnw7j66v65pu
« Previous Showing results 1 — 15 out of 513 results