Filters








3,121 Hits in 1.6 sec

Compressing SQL workloads

Surajit Chaudhuri, Ashish Kumar Gupta, Vivek Narasayya
2002 Proceedings of the 2002 ACM SIGMOD international conference on Management of data - SIGMOD '02  
Often these tasks are presented with large workloads, i.e., a set of SQL DML statements, as input. A key factor affecting the scalability of such tasks is the size of the workload.  ...  Workload compression is a computationally difficult combinatorial optimization problem. In fact, we show via a reduction from the well known Minimum k-Median problem [16]  ...  General Workload Compression Problem For the purposes of this paper, we define a workload as a set of SQL DML statements.  ... 
doi:10.1145/564744.564747 fatcat:cgtraoijhrautkf667edkodlgu

Compressing SQL workloads

Surajit Chaudhuri, Ashish Kumar Gupta, Vivek Narasayya
2002 Proceedings of the 2002 ACM SIGMOD international conference on Management of data - SIGMOD '02  
Often these tasks are presented with large workloads, i.e., a set of SQL DML statements, as input. A key factor affecting the scalability of such tasks is the size of the workload.  ...  In this paper, we present the novel problem of workload compression which helps improve the scalability of such tasks. We present a principled solution to this challenging problem.  ...  General Workload Compression Problem For the purposes of this paper, we define a workload as a set of SQL DML statements.  ... 
doi:10.1145/564691.564747 dblp:conf/sigmod/ChaudhuriGN02 fatcat:54enognxdzcornt53cg3y2nh2q

Query2Vec: An Evaluation of NLP Techniques for Generalized Workload Analytics [article]

Shrainik Jain, Bill Howe, Jiaqi Yan, Thierry Cruanes
2018 arXiv   pre-print
For index recommendation, we cluster the vector representations to compress large workloads with no loss in performance from the recommended index.  ...  We consider vector representations of both raw SQL text and optimized query plans, and evaluate these methods on synthetic and real SQL workloads.  ...  Our work is inspired by the success of these approaches and demonstrates their utility for SQL workload analytics. Compressing SQL Workloads.  ... 
arXiv:1801.05613v2 fatcat:ozdcnfrgpbci5fgjz42rqbhvt4

Columnstore and B+ tree - Are Hybrid Physical Designs Important?

Adam Dziedzic, Jingjing Wang, Sudipto Das, Bolin Ding, Vivek R. Narasayya, Manoj Syamala
2018 Proceedings of the 2018 International Conference on Management of Data - SIGMOD '18  
Commercial DBMSs, such as Microsoft SQL Server, cater to diverse workloads including transaction processing, decision support, and operational analytics.  ...  We extend the Database Engine Tuning Advisor for Microsoft SQL Server to recommend a suitable combination of B+ tree and columnstore indexes for a given workload.  ...  Figure 8 : 8 Example of Run-length encoding used to compress data in columnstores in SQL Server.  ... 
doi:10.1145/3183713.3190660 dblp:conf/sigmod/DziedzicWDDNS18 fatcat:fnrw7ibtlzbqbbsonvx77uo73y

Primitives for Workload Summarization and Implications for SQL [chapter]

Surajit Chaudhuri, Vivek Narasayya, Prasanna Ganesan
2003 Proceedings 2003 VLDB Conference  
These tasks require the ability to summarize large SQL workloads. In this paper, we identify primitives that are important to enable many important workload-summarization tasks.  ...  These primitives also appear to be useful in a variety of practical scenarios besides workload summarization. Today's SQL is inadequate to express these primitives conveniently.  ...  than performing workload compression.  ... 
doi:10.1016/b978-012722442-8/50070-7 dblp:conf/vldb/ChaudhuriGN03 fatcat:5ixryvzqj5hd3kfkrdem3ymxqm

Database Tuning Advisor for Microsoft SQL Server 2005 [chapter]

S AGRAWAL, S CHAUDHURI, L KOLLAR, A MARATHE, V NARASAYYA, M SYAMALA
2004 Proceedings 2004 VLDB Conference  
Third, DTA is able to scale to large databases and workloads using several novel techniques including: (a) workload compression (b) reduced statistics creation and (c) exploiting test server to reduce  ...  The Database Tuning Advisor (DTA) that is part of Microsoft SQL Server 2005 is an automated physical database design tool that significantly advances the state-of-the-art in several ways.  ...  Acknowledgements We are grateful to Alex Boukouvalas, Campbell Fraser, Florian Waas, and Cesar Galindo-Legaria for helping with necessary extensions to Microsoft SQL Server 2005 for DTA.  ... 
doi:10.1016/b978-012088469-8/50097-8 fatcat:urkl7tyvczda7fbifvtwuv6iu4

Database tuning advisor for microsoft SQL server 2005

Sanjay Agrawal, Surajit Chaudhuri, Lubor Kollar, Arun Marathe, Vivek Narasayya, Manoj Syamala
2005 Proceedings of the 2005 ACM SIGMOD international conference on Management of data - SIGMOD '05  
Third, DTA is able to scale to large databases and workloads using several novel techniques including: (a) workload compression (b) reduced statistics creation and (c) exploiting test server to reduce  ...  The Database Tuning Advisor (DTA) that is part of Microsoft SQL Server 2005 is an automated physical database design tool that significantly advances the state-of-the-art in several ways.  ...  Acknowledgements We are grateful to Alex Boukouvalas, Campbell Fraser, Florian Waas, and Cesar Galindo-Legaria for helping with necessary extensions to Microsoft SQL Server 2005 for DTA.  ... 
doi:10.1145/1066157.1066292 dblp:conf/sigmod/AgrawalCKMNS05 fatcat:tvnveldk4ve4phfnwqw2du43qu

Database Tuning Advisor for Microsoft SQL Server 2005 [chapter]

Sanjay Agrawal, Surajit Chaudhuri, Lubor Kollar, Arun Marathe, Vivek Narasayya, Manoj Syamala
2004 Proceedings 2004 VLDB Conference  
Third, DTA is able to scale to large databases and workloads using several novel techniques including: (a) workload compression (b) reduced statistics creation and (c) exploiting test server to reduce  ...  The Database Tuning Advisor (DTA) that is part of Microsoft SQL Server 2005 is an automated physical database design tool that significantly advances the state-of-the-art in several ways.  ...  Acknowledgements We are grateful to Alex Boukouvalas, Campbell Fraser, Florian Waas, and Cesar Galindo-Legaria for helping with necessary extensions to Microsoft SQL Server 2005 for DTA.  ... 
doi:10.1016/b978-012088469-8.50097-8 dblp:conf/vldb/AgrawalCKMNS04 fatcat:nph7jzadd5ejvdlm3ijzdgj4wq

Compression aware physical database design

Hideaki Kimura, Vivek Narasayya, Manoj Syamala
2011 Proceedings of the VLDB Endowment  
In this paper, we study how data compression affects choice of appropriate physical database design, such as indexes, for a given workload.  ...  We have implemented our techniques by modifying Microsoft SQL Server and the Database Engine Tuning Advisor (DTA) physical design tool.  ...  Such tools typically take as input a workload of SQL query and update statements and a storage bound, and produce a configuration (i.e. set of indexes) that optimizes workload performance, while not exceeding  ... 
doi:10.14778/2021017.2021023 fatcat:vjkkcguqsjbj7gafqcbxpmpyfi

Comprehensive and Efficient Workload Compression [article]

Shaleen Deep, Anja Gruenheid, Paraschos Koutris, Jeffrey Naughton, Stratis Viglas
2021 arXiv   pre-print
These metrics capture the intuition that the distribution of features in a compressed workload should match a target distribution, increasing representativity, and include common queries as well as outliers  ...  ., a representative workload, over time. To construct such a workload in a principled manner, we formalize the notions of workload representativity and coverage.  ...  Compressing Workloads. Compressing or summarizing SQL workloads has been studied by Chaudhuri et al. [6, 7] .  ... 
arXiv:2011.05549v2 fatcat:owo3fie7zfaadap3o4bhceqf7q

Accelerating analytics with dynamic in-memory expressions

Aurosish Mishra, Maria Colgan, Jesse Kamp, Niloy Mukherjee, Vineet Marwah, Shasank Chavan, Allison Holloway, Tirthankar Lahiri, Zhen Hua Liu, Sunil Chakkappen, Dennis Lui, Vinita Subramanian (+1 others)
2016 Proceedings of the VLDB Endowment  
Oracle Database In-Memory (DBIM) accelerates analytic workload performance by orders of magnitude through an inmemory columnar format utilizing techniques such as SIMD vector processing, in-memory storage  ...  The DIME mechanism automatically detects frequently occurring expressions in a query workload, and then creates highly optimized, transactionally consistent, in-memory columnar representations of these  ...  ACKNOWLEDGEMENTS We acknowledge the contributions of members in Oracle Data, Space, Transactions, SQL, JSON, Functional Testing and Stress Testing teams involved throughout the product lifecycle.  ... 
doi:10.14778/3007263.3007280 fatcat:nyvytmwh2fezzn7bmmnyxvphuu

On the Performance of Three In-Memory Data Systems for On Line Analytical Processing

Ionut HRUBARU, Marin FOTACHE
2017 Informatică economică  
This paper aims to test the performance of using such type of systems for TPCH analytical workloads.  ...  Performance is analyzed in terms of data loading, memory footprint and execution time of the TPCH query set for three in-memory data systems: Oracle, SQL Server and MemSQL.  ...  analytical workloads;  Use SQL Server 2016 which combines inmemory with columnar storage;  Test performance when increasing the number of concurrent users (e.g. using JMeter);  Test also OLTP workloads  ... 
doi:10.12948/issn14531305/21.1.2017.01 fatcat:xg3qzkh4wzfrxhh4dm2qji3jfm

SQL-on-Hadoop

Avrilia Floratou, Umar Farooq Minhas, Fatma Özcan
2014 Proceedings of the VLDB Endowment  
In this paper, we compare the performance of these two systems by conducting a set of cluster experiments using a TPC-H like benchmark and two TPC-DS inspired workloads.  ...  Among many systems providing some SQL support over Hadoop, Hive is the first native Hadoop system that uses an underlying framework such as MapReduce or Tez to process SQL-like statements.  ...  Impala supports a SQL-like query language which is a subset of SQL as well.  ... 
doi:10.14778/2732977.2733002 fatcat:7onmassrafh33dp3kpo4eod2jy

A Comparison of ORC-Compress Performance with Big Data Workload on Virtualization

Kritwara Rattanaopas, Sureerat Kaewkeerat, Yanapat Chuchuen
2016 Applied Mechanics and Materials  
The results show that ZLIB can compress data up to 87% compared to NONE compressing data. It was better than SNAPPY which has space saving 79%.  ...  It provides a SQL-like interface to Hadoop over Map-Reduce framework. Currently, Big Data solution starts to adopt HiveQL tool to improve execution time of relational information.  ...  Our results show that snappy compression provides slightly better query performance than zlib. Similar result with TPCH workload are present in [4] .  ... 
doi:10.4028/www.scientific.net/amm.855.153 fatcat:vdy2ovsjbzamhly4zkwpiywtgm

Clustered Columnstore Indexes [chapter]

Dmitri Korotkevitch
2014 Pro SQL Server Internals  
(Dictionary Encoding) Columnstore Compression (Value-Based Encoding) COMPRESSION AND DATA SIZE Demo Compression in SQL Server Type Demo Data Size (MB)* Description No Compression 1,633MB  ...  partitioned views in OLTP (when appropriate) Volatile active data in B-Tree indexes -Old static data in columnstore indexes • Nonclustered columnstore indexes could be the better option in the mixed workload  ... 
doi:10.1007/978-1-4302-5963-3_35 fatcat:dcmve47fjrfmhmzpmpzsvldoye
« Previous Showing results 1 — 15 out of 3,121 results