18 Hits in 3.0 sec


Dominic Battré, Stephan Ewen, Fabian Hueske, Odej Kao, Volker Markl, Daniel Warneke
2010 Proceedings of the 1st ACM symposium on Cloud computing - SoCC '10  
to express many data processing tasks both naturally and efficiently. 2) Map/reduce ties a program to a single fixed execution strategy, which is robust but highly suboptimal for many tasks. 3) Map/reduce  ...  We describe methods to transform a PACT program into a data flow for Nephele, which executes its sequential building blocks in parallel and deals with communication, synchronization and fault tolerance  ...  We also thank Guy Lohman for suggesting the name "PACT" for the contracts, as well as the anonymous reviewers for their constructive comments and suggestions.  ... 
doi:10.1145/1807128.1807148 dblp:conf/cloud/BattreEHKMW10 fatcat:7u6wfwtwjjaslc7nd5mdahlkjq

Big Data Technologies circa 2012

Vinayak R. Borkar, Michael J. Carey
2012 International Conference on Management of Data  
Sawzall [29] and (much later) Tenzing [25] were two systems built by Google using the MapReduce layer as a runtime and parallelizing framework for text-processing and SQL execution, respectively.  ...  -Hyracks -Stratosphere (Nephele/PACTs) Vinayak Borkar is a PhD. candidate and a Research Scientist at the University of California, Irvine in the Computer Science department.  ... 
dblp:conf/comad/BorkarC12 fatcat:mdhkyhd53zefnlfa6z42te4i5y

A Survey on Vertical and Horizontal Scaling Platforms for Big Data Analytics

Ahmed Hussein Ali, ICCI, Informatics Institute for Postgraduate Studies, Baghdad, IRAQ, Mahmood Zaki Abdullah, Department of Computer Engineering, Al-Mustansiriyah University, Baghdad, IRAQ
2019 International Journal of Integrated Engineering  
Special thanks to the anonymous reviewers for their valuable suggestions and constructive comments.  ...  Acknowledgement The authors would like to thank ICCI, Informatics Institute for Postgraduate Studies (IIPS_IRAQ) for their moral support.  ...  Nephele/PACT This is a parallel system for data processing which is made up of a programming platform known as parallelization contracts, and a scalable engine for parallel execution known as Nephele  ... 
doi:10.30880/ijie.2019.11.06.015 fatcat:qbtbeq6ukbe5pmpmgkld3r33fe

MapReduce and PACT - Comparing Data Parallel Programming Models

Alexander Alexandrov, Stephan Ewen, Max Heimel, Fabian Hueske, Odej Kao, Volker Markl, Erik Nijkamp, Daniel Warneke
2011 Datenbanksysteme für Business, Technologie und Web  
The Nephele/PACT system uses a programming model that pushes the idea of MapReduce further.  ...  Web-Scale Analytical Processing is a much investigated topic in current research. Next to parallel databases, new flavors of parallel data processors have recently emerged.  ...  The MapReduce programming model and execution framework [DG04] are among the first approaches for data processing on the scale of several thousand machines.  ... 
dblp:conf/btw/AlexandrovEHHKMNW09 fatcat:ifcvy2bhabes5otlwrgjdzp2y4

Efficient and Parallel Data Processing and Resource Allocation in the Cloud by using Nephele's Data Processing Framework

V. Saranya, S. Ramya, R.G. Suresh Kumar, T. Nalini
2016 International Journal of Grid and Distributed Computing  
A performance comparison with the well known data processing framework hadoop has been done.  ...  In this paper, we introduced Nephele, a data processing framework to exploit dynamic resource provisioning offered by IaaS clouds.  ...  Acknowledgements We are grateful and thankful to the CARD research system of our college.  ... 
doi:10.14257/ijgdc.2016.9.3.05 fatcat:2o72jgi42feqpn4lvavockif5e

Map-Reduce Implementations: Survey and Performance Comparison

Zeba Khanam, Shafali Agarwal
2015 International Journal of Computer Science & Information Technology (IJCSIT)  
This survey intends to explore large scale data processing using MapReduce and its various implementations to facilitate the database, researchers and other communities in developing the technical understanding  ...  Map Reduce has gained remarkable significance as a prominent parallel data processing tool in the research community, academia and industry with the spurt in volume of data that is to be analyzed.  ...  Other runtime platforms, including Nephele/PACTs [4] and Hyracks [6] , have been developed to improve the MapReduce execution model.  ... 
doi:10.5121/ijcsit.2015.7410 fatcat:egex75g65rg3nbialubagvllpi

The Family of MapReduce and Large Scale Data Processing Systems [article]

Sherif Sakr, Anna Liu, Ayman G. Fayoumi
2013 arXiv   pre-print
This article provides a comprehensive survey for a family of approaches and mechanisms of large scale data processing mechanisms that have been implemented based on the original idea of the MapReduce framework  ...  MapReduce is a simple and powerful programming model that enables easy development of scalable parallel applications to process vast amounts of data on large clusters of commodity machines.  ...  Figure 17 : 17 The Nephele/PACT System Architecture [8] .  ... 
arXiv:1302.2966v1 fatcat:ttpqmigyurhepfa2wi2slicelm

Big Data Analytics for Large-scale Wireless Networks

Hong-Ning Dai, Raymond Chi-Wing Wong, Hao Wang, Zibin Zheng, Athanasios V. Vasilakos
2019 ACM Computing Surveys  
In this paper, we present a survey of the state-of-art big data analytics (BDA) approaches for large scale wireless networks.  ...  We then present a detailed survey of the technical solutions to the challenges in BDA for large scale wireless networks according to each stage in the life cycle of BDA.  ...  Hon for his constructive comments.  ... 
doi:10.1145/3337065 fatcat:vjjoymozrzb6pkchdp36dh6lze

"All roads lead to Rome"

Sebastian Schelter, Stephan Ewen, Kostas Tzoumas, Volker Markl
2013 Proceedings of the 22nd ACM international conference on Conference on information & knowledge management - CIKM '13  
Nephele/PACTs: A programming model and execution framework ACM, 33(8):103–111, 1990. for web-scale analytical processing. In SoCC, pp. 119–130, 2010. [33] M. Zaharia, T.  ...  Leiser, and G. Czajkowski. Pregel: A system for large-scale graph leverage memory-efficient probabilistic data structures for storing processing.  ... 
doi:10.1145/2505515.2505753 dblp:conf/cikm/SchelterETM13 fatcat:sng3cz5dc5cwje2psop37wmb4u

ASTERIX: towards a scalable, semistructured data platform for evolving-world models

Alexander Behm, Vinayak R. Borkar, Michael J. Carey, Raman Grover, Chen Li, Nicola Onose, Rares Vernica, Alin Deutsch, Yannis Papakonstantinou, Vassilis J. Tsotras
2011 Distributed and parallel databases  
ASTERIX is a new data-intensive storage and computing platform project spanning UC Irvine, UC Riverside, and UC San Diego.  ...  In this paper we provide an overview of the ASTERIX project, starting with its main goal-the storage and anal-Communicated by:  ...  using distributed XML Acknowledgements This project is supported by NSF IIS awards 0910989, 0910859, 0910820, and 0844574, a grant from the UC Discovery program, and a matching donation from eBay.  ... 
doi:10.1007/s10619-011-7082-y fatcat:aho7n6sazrchvbb3rh3dgp755y

A comprehensive view of Hadoop research—A systematic literature review

Ivanilton Polato, Reginaldo Ré, Alfredo Goldman, Fabio Kon
2014 Journal of Network and Computer Applications  
datasets, but we were able to spot promising areas and suggest topics for future research within the framework.  ...  Context: In recent years, the valuable knowledge that can be retrieved from petabyte scale datasetsknown as Big Dataled to the development of solutions to process information based on parallel and distributed  ...  models and/or simulation.  ... 
doi:10.1016/j.jnca.2014.07.022 fatcat:4xjveqy6mrctzjc4ou7llyy4u4

Big data analytics for manufacturing internet of things: opportunities, challenges and enabling technologies

Hong-Ning Dai, Hao Wang, Guangquan Xu, Jiafu Wan, Muhammad Imran
2019 Enterprise Information Systems  
This paper first starts with a discussion on necessities and challenges of big data analytics in manufacturing data of MIoT.  ...  Then, the enabling technologies of big data analytics of manufacturing data are surveyed and discussed. Moreover, this paper also outlines the future directions in this promising area.  ...  In particular, we consider a pure cloud computing framework and a pure edge computing framework as baseline models.  ... 
doi:10.1080/17517575.2019.1633689 fatcat:75muhzfrdnecdjwp75stikhxwu


Tim Kaldewey, Eugene J. Shekita, Sandeep Tata
2012 Proceedings of the 15th International Conference on Extending Database Technology - EDBT '12  
MapReduce has emerged as a promising architecture for large scale data analytics on commodity clusters.  ...  This demonstrates that MapReduce in general, and Hadoop in particular, is a far more compelling platform for structured data processing than previous results suggest.  ...  -the use of commodity low-cost hardware, fault-tolerance, elasticity, scalability, and a flexible programming model.  ... 
doi:10.1145/2247596.2247600 dblp:conf/edbt/KaldeweyST12 fatcat:lovj3vh7t5ftdg3ksnncfup2gm

Report from Dagstuhl Seminar 11321 Information Management in the Cloud

Anastassia Ailamaki, Michael Carey, Donald Kossmann, Steve Loughran, Volker Markl, Epfl -Lausanne, Ch, Anastassia Ailamaki, Michael Carey, Donald Kossmann, Steve Loughran, Volker Markl (+1 others)
Dagstuhl Reports   unpublished
Cloud architectures strive to massively parallelize complex processing tasks through a computational model motivated by functional programming.  ...  processing, parallelization of large scale data and compute intensive operations as well as implementation techniques for fault tolerance.  ...  The PACT Programming Model is a generalization and extension of the well-known MapReduce Programming Model.  ... 

Tenzing A SQL Implementation On The MapReduce Framework

Biswapesh Chattopadhyay, Liang Lin, Weiran Liu, Sagar Mittal, Prathyusha Aragonda, Vera Lychagina, Younghee Kwon, Michael Wong Mcwong@ Google
Tenzing is a query engine built on top of MapReduce [9] for ad hoc analysis of Google data.  ...  , low latency, support for columnar storage and structured data, and easy extensibility.  ...  Our focus has been on MapReduce [9] , but there are several others such as Nephele/PACT [3] and Dryad [15] . Hadoop [8] is an open source implementation of the MapReduce framework.  ... 
« Previous Showing results 1 — 15 out of 18 results