Filters








20 Hits in 1.3 sec

Big Data Benchmark Compendium [chapter]

Todor Ivanov, Tilmann Rabl, Meikel Poess, Anna Queralt, John Poelman, Nicolas Poggi, Jeffrey Buell
2016 Lecture Notes in Computer Science  
The goal is to understand the current state in Big Data benchmarking and guide practitioners in their approaches and use cases.  ...  This makes the investigation and standardization of such systems very difficult.  ...  Acknowledgment This research has been supported by the Research Group of the Standard Performance Evaluation Corporation (SPEC).  ... 
doi:10.1007/978-3-319-31409-9_9 fatcat:n7lwtxainnblpf2xp4c5o2eynq

Profiling the Performance of Virtualized Databases with the TPCx-V Benchmark [chapter]

Andrew Bond, Doug Johnson, Greg Kopczynski, H. Reza Taheri
2016 Lecture Notes in Computer Science  
data centers by augmenting the capabilities of the TPCx-V benchmark kit.  ...  In this paper, we will provide a brief description of the benchmark, discuss the results and the conclusions drawn from the experiments, and propose future directions for analyzing the performance of cloud  ...  of the TPCx-V benchmark.  ... 
doi:10.1007/978-3-319-31409-9_10 fatcat:mjynjzoacrfajgsuurihtqyiki

Adapting Big Data Standards, Maturity Models to Smart Grid Distributed Generation: Critical Review

Aditya Sundararajan, Alexander Scott Hernandez, Arif Sarwat
2020 IET Smart Grid  
This study bridges the gap by analysing the role of big data in smart grids, and explores if and how big data standards and CMMs can be adapted specifically to ten distributed generation (DG) use-cases  ...  Although big data standards and CMMs exist for other domains, no work in the literature considers adapting them to smart grids, which will benefit from both.  ...  The TPCx-HS benchmark focuses on testing commercialised Apache Hadoop systems [54, 55] . The testing covers both the hardware and software aspects of the system, which includes the operating system.  ... 
doi:10.1049/iet-stg.2019.0298 fatcat:nuvgxgkc6jdnvb7uwllmidymzy

Performance Analysis of Two Big Data Technologies on a Cloud Distributed Architecture. Results for Non-Aggregate Queries on Medium-Sized Data

Marin Fotache, Ionuț Hrubaru
2016 Scientific Annals of Economics and Business  
Subsequently a number of models were developed for relating performance on the system and also on various query parameters such as the number of attributes in SELECT and WHERE clause, number of joins,  ...  This paper compares data processing performance of two systems belonging to SQL (PostgreSQL/Postgres XL) and Big Data (Hadoop/Hive) camps on a distributed five-node cluster deployed in cloud.  ...  four categories:  OLTP: TPC-C and TPC-E;  Decision support: TPC-H, TPC-DS and TPC-DI;  Virtualization: TPC-WMS;  Big Data: TPCx-HS.  ... 
doi:10.1515/saeb-2016-0134 fatcat:4etnd72qbjgk5ofwtzzpi3h76i

A Survey on Edge Performance Benchmarking

Blesson Varghese, Nan Wang, David Bermbach, Cheol-Ho Hong, Eyal De Lara, Weisong Shi, Christopher Stewart
2021 ACM Computing Surveys  
This article first reviews articles published over the past three decades to trace the history of performance benchmarking from tightly coupled to loosely coupled systems.  ...  Edge performance benchmarking is a nascent research avenue that has started gaining momentum over the past five years.  ...  The first is the TPCx-V (47) benchmark, which is a virtual machine benchmark for database workloads. The second is the TPCx-HS (48) benchmark, which is a big data benchmark on the cloud.  ... 
doi:10.1145/3444692 fatcat:75kmnweazzeppefekfyrxmemze

Big Data Methodologies, Tools And Infrastructures

Kim Hee, Todor Ivanov, Roberto V. Zicari, Rut Waldenfels, Hevin Özmen, Naveed Mushtaq, Minsung Hong, Tharsis Teoh, Rajendra Akerkar
2018 Zenodo  
In order to tackle the demands and challenges in the transportation domain, an optimal stack of Big Data technologies needs to be selected and designed based on the application requirements.  ...  The transportation industry is a leader in creating the so-called Internet of Everything.  ...  The main challenge is using significantly improved technologies and methods to gather and understand the data in order for business decisions to be informed by better insights.  ... 
doi:10.5281/zenodo.1465539 fatcat:mkad5yu2tnfw7fdi3xqcermac4

A Survey on Edge Performance Benchmarking [article]

Blesson Varghese and Nan Wang and David Bermbach and Cheol-Ho Hong and Eyal de Lara and Weisong Shi and Christopher Stewart
2020 arXiv   pre-print
This article first reviews articles published over the past three decades to trace the history of performance benchmarking from tightly coupled to loosely coupled systems.  ...  Edge performance benchmarking is a nascent research avenue that has started gaining momentum over the past five years.  ...  The first is the TPCx-V (47) benchmark, which is a virtual machine benchmark for database workloads. The second is the TPCx-HS (48) benchmark, which is a big data benchmark on the cloud.  ... 
arXiv:2004.11725v2 fatcat:gyqqgfqf5fe2jk2ntlnd2itmli

SQALPEL: A database performance platform

Martin L. Kersten, Stefan Manegold, Ying Zhang, Panos Kuoutsourakis
2019 Conference on Innovative Data Systems Research  
The approach is based on deriving a domain specific language from a sample complex query to identify and execute a query workload.  ...  Despite their popularity, database benchmarks only highlight a small fraction of the capabilities of any given DBMS.  ...  Acknowledgments This research has received funding from the European Union's Horizon 2020 research and innovation programme under Grant Agreement no. 732366 (ACTiCLOUD).  ... 
dblp:conf/cidr/KerstenMZK19 fatcat:ysnjk3f36fgb3etoo2ku3xjshe

An Efficient Industrial Big-Data Engine

Pablo Basanta-Val
2018 IEEE Transactions on Industrial Informatics  
In this context, the article explores the development of a medium size big-data engine (i.e. implementation) able to improve performance in map-reduce computing by splitting the analytic into different  ...  The number of issues an industrial infrastructure has to face is large and includes challenges such as the definition of different efficient architecture setups for different applications, and the definition  ...  We thank our anonymous reviewers their efforts in improving the quality of the article providing suggesting over 300 changes in the original a1ticle.  ... 
doi:10.1109/tii.2017.2755398 fatcat:5esbkh6z3vfenc5yd3diuaqu6e

BigDataBench-MT: A Benchmark Tool for Generating Realistic Mixed Data Center Workloads [article]

Rui Han, Shulin Zhan, Chenrong Shao, Junwei Wang, Lizy K. John, Jiangtao Xu, Gang Lu, Lei Wang
2015 arXiv   pre-print
Based on this, our demo illustrates the workload customization and generation process using a visual interface.  ...  To fill this gap, we propose a benchmark tool that is a first step towards generating a mix of actual service and data analysis workloads on the basis of real workload traces.  ...  center benchmarks Workload mix Workload implementation logic Algorithms Database operations I/O operations No mix WordCount 1 [26], Grep 1 [9], Sort 1 [19], Terasort 1 [22], HiBench 1 [39], TPCx-HS  ... 
arXiv:1504.02205v3 fatcat:5wln7lm22zdpdiguewi66be7eq

Designing and implementing a Big Data benchmark in a financial context: application to a cash management use case

Lilia Sfaxi, Mohamed Mehdi Ben Aissa
2021 Computing  
On the other hand, we show that the overhead caused by BABEL's integration with the platform at runtime is very negligible.  ...  We highlight the modular design of BABEL, and present an evaluation methodology and best practices for its application on real world systems.  ...  MRBench [9] , HiBench [10] and MRBS [11] are benchmarking suites destined to test MapReduce workloads while TPCx-HS [12] defines a standard produced by the Transaction Processing Performance Council  ... 
doi:10.1007/s00607-021-00933-x fatcat:myqri224vfbsnd4omulyhxr5zy

Introduction to the IEEE Transactions on Big Data

Qiang Yang
2015 IEEE Transactions on Big Data  
Advances in telecommunications technologies and services facilitated the massive exchange of data among client devices, data centers and clouds.  ...  I T is my great pleasure to present this inaugural issue of the IEEE Transactions on Big Data (IEEE TBDATA).  ...  In 2012, he launched the workshop series on Big Data Benchmarking, which has engendered new activity in industry standard benchmarks for Big Data, including creation of the TPCx-HS standard; formation  ... 
doi:10.1109/tbdata.2015.2469391 fatcat:s2b5lshffbglpfyoaexfxw6axm

Host managed contention avoidance storage solutions for Big Data

Pratik Mishra, Arun K. Somani
2017 Journal of Big Data  
Through trace driven simulation based experiments with cloud emulating MapReduce benchmarks, we show the effectiveness of BID-HDD which results in 28-52% lesser time for all I/O requests than the best  ...  Through trace driven simulation based experiments with cloud emulating MapReduce benchmarks, we show the effectiveness of BID-HDD which results in 28-52% lesser time for all I/O requests than the best  ...  He made several editorial comments and suggestions, held valuable discussion and provided feedback on design of experiments on a shorter version of this paper [42] , where he was also a coauthor.  ... 
doi:10.1186/s40537-017-0080-9 fatcat:t2ni3rvbkvdapk2to7skgp445i

Data-Centric Benchmarking [chapter]

Jérôme Darmont
Encyclopedia of Information Science and Technology, Fourth Edition  
We survey benchmarks from three families: transaction benchmarks aimed at On-Line Transaction Processing (OLTP), decision-support benchmarks aimed at On-Line Analysis Processing (OLAP) and big data benchmarks  ...  TPC members include all the major industrial actors from the database field. The aim of this chapter is to present an overview of the major past and present state-of-the-art data-centric benchmarks.  ...  Its metric is the minimum value of the selected benchmark's metric on all VMs. TPCx-HS (TPC, 2015c) focuses on Hadoop and MapReduce-based applications.  ... 
doi:10.4018/978-1-5225-2255-3.ch154 fatcat:u26qfefzebgwpbuewscnnv4rpa

Data-Centric Benchmarking [chapter]

Jérôme Darmont
Advances in Computer and Electrical Engineering  
We survey benchmarks from three families: transaction benchmarks aimed at On-Line Transaction Processing (OLTP), decision-support benchmarks aimed at On-Line Analysis Processing (OLAP) and big data benchmarks  ...  TPC members include all the major industrial actors from the database field. The aim of this chapter is to present an overview of the major past and present state-of-the-art data-centric benchmarks.  ...  Its metric is the minimum value of the selected benchmark's metric on all VMs. TPCx-HS (TPC, 2015c) focuses on Hadoop and MapReduce-based applications.  ... 
doi:10.4018/978-1-5225-7598-6.ch025 fatcat:6zfmlk6bhfdahag5g52xjqhzai
« Previous Showing results 1 — 15 out of 20 results