Filters








83 Hits in 6.4 sec

A gray-box modeling methodology for runtime prediction of Apache Spark jobs

Hani Al-Sayeh, Stefan Hagedorn, Kai-Uwe Sattler
<span title="2020-03-10">2020</span> <i title="Springer Science and Business Media LLC"> <a target="_blank" rel="noopener" href="https://fatcat.wiki/container/of3bmcozzfddjojxsoxwzys55y" style="color: black;">Distributed and parallel databases</a> </i> &nbsp;
In this paper, we present a gray-box modeling methodology for runtime prediction of Apache Spark jobs.  ...  We further show how to use this gray-box approach not only for predicting the runtime of a given job, but also as part of a decision model for reusing intermediate cached results of Spark jobs.  ...  To view a copy of this licence, visit http://creat iveco mmons .org/licen ses/by/4.0/.  ... 
<span class="external-identifiers"> <a target="_blank" rel="external noopener noreferrer" href="https://doi.org/10.1007/s10619-020-07286-y">doi:10.1007/s10619-020-07286-y</a> <a target="_blank" rel="external noopener" href="https://fatcat.wiki/release/yzlcylb52rhvxmfe2qcxnvsofm">fatcat:yzlcylb52rhvxmfe2qcxnvsofm</a> </span>
<a target="_blank" rel="noopener" href="https://web.archive.org/web/20200510034328/https://link.springer.com/content/pdf/10.1007/s10619-020-07286-y.pdf" title="fulltext PDF download" data-goatcounter-click="serp-fulltext" data-goatcounter-title="serp-fulltext"> <button class="ui simple right pointing dropdown compact black labeled icon button serp-button"> <i class="icon ia-icon"></i> Web Archive [PDF] <div class="menu fulltext-thumbnail"> <img src="https://blobs.fatcat.wiki/thumbnail/pdf/c2/71/c2711ccefe083cad9e4e2a6962f8307e196f8eb4.180px.jpg" alt="fulltext thumbnail" loading="lazy"> </div> </button> </a> <a target="_blank" rel="external noopener noreferrer" href="https://doi.org/10.1007/s10619-020-07286-y"> <button class="ui left aligned compact blue labeled icon button serp-button"> <i class="external alternate icon"></i> springer.com </button> </a>

Gray Box Modeling Methodology for Runtime Prediction of Apache Spark Jobs

Hani Al-Sayeh, Kai-Uwe Sattler
<span title="">2019</span> <i title="IEEE"> <a target="_blank" rel="noopener" href="https://fatcat.wiki/container/vuw5ktdyknehrehttg5qm4rfqm" style="color: black;">2019 IEEE 35th International Conference on Data Engineering Workshops (ICDEW)</a> </i> &nbsp;
In this paper, we present a gray-box modeling methodology for runtime prediction of Apache Spark jobs.  ...  We further show how to use this gray-box approach not only for predicting the runtime of a given job, but also as part of a decision model for reusing intermediate cached results of Spark jobs.  ...  To view a copy of this licence, visit http://creat iveco mmons .org/licen ses/by/4.0/.  ... 
<span class="external-identifiers"> <a target="_blank" rel="external noopener noreferrer" href="https://doi.org/10.1109/icdew.2019.00-23">doi:10.1109/icdew.2019.00-23</a> <a target="_blank" rel="external noopener" href="https://dblp.org/rec/conf/icde/Al-SayehS19.html">dblp:conf/icde/Al-SayehS19</a> <a target="_blank" rel="external noopener" href="https://fatcat.wiki/release/p2jaokrf7fdrflvc3d6wvfq3bi">fatcat:p2jaokrf7fdrflvc3d6wvfq3bi</a> </span>
<a target="_blank" rel="noopener" href="https://web.archive.org/web/20210716000806/https://www.db-thueringen.de/servlets/MCRFileNodeServlet/dbt_derivate_00052709/1573-7578_38_2020_4_819-839.pdf" title="fulltext PDF download" data-goatcounter-click="serp-fulltext" data-goatcounter-title="serp-fulltext"> <button class="ui simple right pointing dropdown compact black labeled icon button serp-button"> <i class="icon ia-icon"></i> Web Archive [PDF] <div class="menu fulltext-thumbnail"> <img src="https://blobs.fatcat.wiki/thumbnail/pdf/4a/80/4a80645ebd88cb9e04b71e74072bd0db3164969a.180px.jpg" alt="fulltext thumbnail" loading="lazy"> </div> </button> </a> <a target="_blank" rel="external noopener noreferrer" href="https://doi.org/10.1109/icdew.2019.00-23"> <button class="ui left aligned compact blue labeled icon button serp-button"> <i class="external alternate icon"></i> ieee.com </button> </a>

Introduction to the special issue on Self-managing and Hardware-Optimized Database Systems 2019

Shimin Chen, Panos K. Chrysanthis, Khuzaima Daudjee, Meichun Hsu, Mohammad Sadoghi
<span title="2020-09-27">2020</span> <i title="Springer Science and Business Media LLC"> <a target="_blank" rel="noopener" href="https://fatcat.wiki/container/of3bmcozzfddjojxsoxwzys55y" style="color: black;">Distributed and parallel databases</a> </i> &nbsp;
In "A Gray-Box Modeling Methodology for Runtime Prediction of Apache Spark Jobs", Al-Sayeh et al. studies the challenging problem of predicting Spark job runtimes, which depend on numerous factors such  ...  The authors propose a gray-box modeling methodology that comprises a white-box model for predicting RDD cardinalities, and a black-box model for predicting the runtime of each task for given RDD cardinalities  ...  We are also indebted to the DAPD Journal Editors, editorial office, and the publishing and production teams for their assistance in preparation and publication of this issue.  ... 
<span class="external-identifiers"> <a target="_blank" rel="external noopener noreferrer" href="https://doi.org/10.1007/s10619-020-07313-y">doi:10.1007/s10619-020-07313-y</a> <a target="_blank" rel="external noopener" href="https://fatcat.wiki/release/mlgyadkydnhvrarsoptbtjufei">fatcat:mlgyadkydnhvrarsoptbtjufei</a> </span>
<a target="_blank" rel="noopener" href="https://web.archive.org/web/20201108153535/https://link.springer.com/content/pdf/10.1007/s10619-020-07313-y.pdf" title="fulltext PDF download" data-goatcounter-click="serp-fulltext" data-goatcounter-title="serp-fulltext"> <button class="ui simple right pointing dropdown compact black labeled icon button serp-button"> <i class="icon ia-icon"></i> Web Archive [PDF] <div class="menu fulltext-thumbnail"> <img src="https://blobs.fatcat.wiki/thumbnail/pdf/9a/3d/9a3d346709764ed45b6836d6d2dfa16b6fdc29c7.180px.jpg" alt="fulltext thumbnail" loading="lazy"> </div> </button> </a> <a target="_blank" rel="external noopener noreferrer" href="https://doi.org/10.1007/s10619-020-07313-y"> <button class="ui left aligned compact blue labeled icon button serp-button"> <i class="external alternate icon"></i> springer.com </button> </a>

An Enhanced Parallelisation Model for Performance Prediction of Apache Spark on a Multinode Hadoop Cluster

Nasim Ahmed, Andre L. C. Barczak, Mohammad A. Rashid, Teo Susnjak
<span title="2021-11-05">2021</span> <i title="MDPI AG"> <a target="_blank" rel="noopener" href="https://fatcat.wiki/container/tdvcddxjjzfavkwy267ww7wn5m" style="color: black;">Big Data and Cognitive Computing</a> </i> &nbsp;
Both models use simple equations that allows us to predict the runtime when the size of the job and the number of executables are known.  ...  Apache Spark has been established as one of the most popular big data engines for its efficiency and reliability.  ...  Conflicts of Interest: The authors declare no conflict of interest.  ... 
<span class="external-identifiers"> <a target="_blank" rel="external noopener noreferrer" href="https://doi.org/10.3390/bdcc5040065">doi:10.3390/bdcc5040065</a> <a target="_blank" rel="external noopener" href="https://fatcat.wiki/release/pc4q65uwzfdv5lfmlfhftohaeq">fatcat:pc4q65uwzfdv5lfmlfhftohaeq</a> </span>
<a target="_blank" rel="noopener" href="https://web.archive.org/web/20211113235807/https://mdpi-res.com/d_attachment/BDCC/BDCC-05-00065/article_deploy/BDCC-05-00065.pdf" title="fulltext PDF download" data-goatcounter-click="serp-fulltext" data-goatcounter-title="serp-fulltext"> <button class="ui simple right pointing dropdown compact black labeled icon button serp-button"> <i class="icon ia-icon"></i> Web Archive [PDF] <div class="menu fulltext-thumbnail"> <img src="https://blobs.fatcat.wiki/thumbnail/pdf/ca/f8/caf8abd2af711cf93998c5552859fe5ace9786f9.180px.jpg" alt="fulltext thumbnail" loading="lazy"> </div> </button> </a> <a target="_blank" rel="external noopener noreferrer" href="https://doi.org/10.3390/bdcc5040065"> <button class="ui left aligned compact blue labeled icon button serp-button"> <i class="unlock alternate icon" style="background-color: #fb971f;"></i> mdpi.com </button> </a>

A parallelization model for performance characterization of Spark Big Data jobs on Hadoop clusters

N. Ahmed, Andre L. C. Barczak, Mohammad A. Rashid, Teo Susnjak
<span title="2021-08-14">2021</span> <i title="Springer Science and Business Media LLC"> <a target="_blank" rel="noopener" href="https://fatcat.wiki/container/pkhnkszyprhb3orbf6g7tqmgiu" style="color: black;">Journal of Big Data</a> </i> &nbsp;
A particular runtime pattern emerged when adding more executors to run a job. For some workloads, the runtime was longer with more executors added.  ...  The proposed model can predict the runtime for generic workloads as a function of the number of executors, without necessarily knowing how the algorithms were implemented.  ...  Authorws' contributions NA and ALCB were the main contributors of this work.  ... 
<span class="external-identifiers"> <a target="_blank" rel="external noopener noreferrer" href="https://doi.org/10.1186/s40537-021-00499-7">doi:10.1186/s40537-021-00499-7</a> <a target="_blank" rel="external noopener" href="https://fatcat.wiki/release/nbjm642ixrcldgullh3toqjkhe">fatcat:nbjm642ixrcldgullh3toqjkhe</a> </span>
<a target="_blank" rel="noopener" href="https://web.archive.org/web/20210816024331/https://journalofbigdata.springeropen.com/track/pdf/10.1186/s40537-021-00499-7.pdf" title="fulltext PDF download" data-goatcounter-click="serp-fulltext" data-goatcounter-title="serp-fulltext"> <button class="ui simple right pointing dropdown compact black labeled icon button serp-button"> <i class="icon ia-icon"></i> Web Archive [PDF] <div class="menu fulltext-thumbnail"> <img src="https://blobs.fatcat.wiki/thumbnail/pdf/22/21/2221f2ae62b612ed681cd7aacaf84d89b665a419.180px.jpg" alt="fulltext thumbnail" loading="lazy"> </div> </button> </a> <a target="_blank" rel="external noopener noreferrer" href="https://doi.org/10.1186/s40537-021-00499-7"> <button class="ui left aligned compact blue labeled icon button serp-button"> <i class="unlock alternate icon" style="background-color: #fb971f;"></i> springer.com </button> </a>

A Survey on Automatic Parameter Tuning for Big Data Processing Systems

Herodotos Herodotou, Yuxing Chen, Jiaheng Lu
<span title="2020-04-26">2020</span> <i title="Association for Computing Machinery (ACM)"> <a target="_blank" rel="noopener" href="https://fatcat.wiki/container/eiea26iqqjcatatlgxdpzt637y" style="color: black;">ACM Computing Surveys</a> </i> &nbsp;
for executing jobs in big data processing systems.  ...  To make matters worse, some parameters might affect the performance of different jobs in different ways, while certain groups of parameters may have dependent effects (i.e., a good setting for one parameter  ...  Discussion Cost modeling is a typical white-box technique that uses a set of mathematical formulae for predicting job performance.  ... 
<span class="external-identifiers"> <a target="_blank" rel="external noopener noreferrer" href="https://doi.org/10.1145/3381027">doi:10.1145/3381027</a> <a target="_blank" rel="external noopener" href="https://fatcat.wiki/release/7aglimtuwze25boptuano4ufdy">fatcat:7aglimtuwze25boptuano4ufdy</a> </span>
<a target="_blank" rel="noopener" href="https://web.archive.org/web/20201107060008/https://helda.helsinki.fi/bitstream/handle/10138/318447/3381027.pdf;jsessionid=185F0BCB990D073050D9D87F1B91E97C?sequence=1" title="fulltext PDF download" data-goatcounter-click="serp-fulltext" data-goatcounter-title="serp-fulltext"> <button class="ui simple right pointing dropdown compact black labeled icon button serp-button"> <i class="icon ia-icon"></i> Web Archive [PDF] <div class="menu fulltext-thumbnail"> <img src="https://blobs.fatcat.wiki/thumbnail/pdf/59/81/5981e7d521ec962809b05f1f819308ba86e8c3fb.180px.jpg" alt="fulltext thumbnail" loading="lazy"> </div> </button> </a> <a target="_blank" rel="external noopener noreferrer" href="https://doi.org/10.1145/3381027"> <button class="ui left aligned compact blue labeled icon button serp-button"> <i class="external alternate icon"></i> acm.org </button> </a>

Global Optimization of Data Pipelines in Heterogeneous Cloud Environments [article]

Erica Lin, Luna Xu, Suraj Bramhavar, Marco Montes de Oca, Sean Gorsky, Lingyun Yi, Arianna Groetsema, Jeffrey Chou
<span title="2022-02-11">2022</span> <i > arXiv </i> &nbsp; <span class="release-stage" >pre-print</span>
for the entire workflow with a cost-performance objective.  ...  We propose AGORA, a scheduler that considers both task-level resource allocation and execution for DAG workflows as a whole in heterogeneous cloud environments.  ...  [5] take a gray-box modeling approach, using both a white-box model and a black-box model.  ... 
<span class="external-identifiers"> <a target="_blank" rel="external noopener" href="https://arxiv.org/abs/2202.05711v1">arXiv:2202.05711v1</a> <a target="_blank" rel="external noopener" href="https://fatcat.wiki/release/tf2xjr55ezh3hoaimlkhohsxse">fatcat:tf2xjr55ezh3hoaimlkhohsxse</a> </span>
<a target="_blank" rel="noopener" href="https://web.archive.org/web/20220217075902/https://arxiv.org/pdf/2202.05711v1.pdf" title="fulltext PDF download" data-goatcounter-click="serp-fulltext" data-goatcounter-title="serp-fulltext"> <button class="ui simple right pointing dropdown compact black labeled icon button serp-button"> <i class="icon ia-icon"></i> Web Archive [PDF] <div class="menu fulltext-thumbnail"> <img src="https://blobs.fatcat.wiki/thumbnail/pdf/6c/3a/6c3a30c27f14851eb062bf43969856d8e6e900b8.180px.jpg" alt="fulltext thumbnail" loading="lazy"> </div> </button> </a> <a target="_blank" rel="external noopener" href="https://arxiv.org/abs/2202.05711v1" title="arxiv.org access"> <button class="ui compact blue labeled icon button serp-button"> <i class="file alternate outline icon"></i> arxiv.org </button> </a>

Table of contents

<span title="">2019</span> <i title="IEEE"> 2019 IEEE 35th International Conference on Data Engineering Workshops (ICDEW) </i> &nbsp;
Box Modeling Methodology for Runtime Prediction of Apache Spark Jobs 117 Hani Al-Sayeh (Technische Universität Ilmenau) and Kai-Uwe Sattler (TU Ilmenau) Guided Bayesian Optimization to AutoTune Memory-Based  ...  Smart-Wearables: A Mathematical Modelling Approach 74 Debajyoti Pal (King Mongkut's University of Technology Thonburi), Tuul Triyason (King Mongkut's University of Technology Thonburi), Vijayakumar Varadarajan  ... 
<span class="external-identifiers"> <a target="_blank" rel="external noopener noreferrer" href="https://doi.org/10.1109/icdew.2019.00004">doi:10.1109/icdew.2019.00004</a> <a target="_blank" rel="external noopener" href="https://fatcat.wiki/release/f2qwa6cndnejjkorlwhiiyh34a">fatcat:f2qwa6cndnejjkorlwhiiyh34a</a> </span>
<a target="_blank" rel="noopener" href="https://web.archive.org/web/20201108081321/https://ieeexplore.ieee.org/ielx7/8743499/8750905/08750918.pdf" title="fulltext PDF download" data-goatcounter-click="serp-fulltext" data-goatcounter-title="serp-fulltext"> <button class="ui simple right pointing dropdown compact black labeled icon button serp-button"> <i class="icon ia-icon"></i> Web Archive [PDF] <div class="menu fulltext-thumbnail"> <img src="https://blobs.fatcat.wiki/thumbnail/pdf/9d/a3/9da3d6e88de9d18ae7566a9b4921a5547fca0c19.180px.jpg" alt="fulltext thumbnail" loading="lazy"> </div> </button> </a> <a target="_blank" rel="external noopener noreferrer" href="https://doi.org/10.1109/icdew.2019.00004"> <button class="ui left aligned compact blue labeled icon button serp-button"> <i class="external alternate icon"></i> ieee.com </button> </a>

Big Data Methodologies, Tools And Infrastructures

Kim Hee, Todor Ivanov, Roberto V. Zicari, Rut Waldenfels, Hevin Özmen, Naveed Mushtaq, Minsung Hong, Tharsis Teoh, Rajendra Akerkar
<span title="2018-07-31">2018</span> <i title="Zenodo"> Zenodo </i> &nbsp;
The goal is to create value out of this amount of data, by providing a comprehensive picture of what's happening, using business analytics, leveraging big data tools and predictive analytics, to help transportation  ...  This report, which is a follow up of Deliverable 1.1, offers an in-depth introduction to relevant technologies for Big Data Analytics and Big Data Management.  ...  D1.3: Big Data Methodologies, Tools and Infrastructures, P  ... 
<span class="external-identifiers"> <a target="_blank" rel="external noopener noreferrer" href="https://doi.org/10.5281/zenodo.1465539">doi:10.5281/zenodo.1465539</a> <a target="_blank" rel="external noopener" href="https://fatcat.wiki/release/mkad5yu2tnfw7fdi3xqcermac4">fatcat:mkad5yu2tnfw7fdi3xqcermac4</a> </span>
<a target="_blank" rel="noopener" href="https://web.archive.org/web/20200227232947/https://zenodo.org/record/1465539/files/20180716_D1.3_Big%20data%20methodologies%2C%20tools%20and%20infrastructures_LeMO.pdf" title="fulltext PDF download" data-goatcounter-click="serp-fulltext" data-goatcounter-title="serp-fulltext"> <button class="ui simple right pointing dropdown compact black labeled icon button serp-button"> <i class="icon ia-icon"></i> Web Archive [PDF] <div class="menu fulltext-thumbnail"> <img src="https://blobs.fatcat.wiki/thumbnail/pdf/da/a5/daa593ac42b53996662568f9e88f08b84d6b0790.180px.jpg" alt="fulltext thumbnail" loading="lazy"> </div> </button> </a> <a target="_blank" rel="external noopener noreferrer" href="https://doi.org/10.5281/zenodo.1465539"> <button class="ui left aligned compact blue labeled icon button serp-button"> <i class="unlock alternate icon" style="background-color: #fb971f;"></i> zenodo.org </button> </a>

Big Data Preventive Maintenance for Hard Disk Failure Detection

Su Chuan-Jun, Jorge A. Quan Yon
<span title="">2018</span> <i title="EJournal Publishing"> <a target="_blank" rel="noopener" href="https://fatcat.wiki/container/e6is5iwmnbcmbnglc3l4nytwou" style="color: black;">International Journal of Information and Education Technology</a> </i> &nbsp;
Finally, we use random forest algorithm to construct the predictive model.  ...  However, service interruption is the most important factor to consider for every data center, affecting the user experience, or causing loss in a business.  ...  Discovering rules from disk events and performing black-box model to improve the accuracy of disk-drive failure prediction. E.  ... 
<span class="external-identifiers"> <a target="_blank" rel="external noopener noreferrer" href="https://doi.org/10.18178/ijiet.2018.8.7.1085">doi:10.18178/ijiet.2018.8.7.1085</a> <a target="_blank" rel="external noopener" href="https://fatcat.wiki/release/ca4fodutwbcxxaqqe2q6sg543a">fatcat:ca4fodutwbcxxaqqe2q6sg543a</a> </span>
<a target="_blank" rel="noopener" href="https://web.archive.org/web/20180721134752/http://www.ijiet.org/vol8/1085-JR273.pdf" title="fulltext PDF download" data-goatcounter-click="serp-fulltext" data-goatcounter-title="serp-fulltext"> <button class="ui simple right pointing dropdown compact black labeled icon button serp-button"> <i class="icon ia-icon"></i> Web Archive [PDF] <div class="menu fulltext-thumbnail"> <img src="https://blobs.fatcat.wiki/thumbnail/pdf/08/8c/088c025db8ccfbac47bc19ef4bdb04340ca8d491.180px.jpg" alt="fulltext thumbnail" loading="lazy"> </div> </button> </a> <a target="_blank" rel="external noopener noreferrer" href="https://doi.org/10.18178/ijiet.2018.8.7.1085"> <button class="ui left aligned compact blue labeled icon button serp-button"> <i class="unlock alternate icon" style="background-color: #fb971f;"></i> Publisher / doi.org </button> </a>

Towards automatic parameter tuning of stream processing systems

Muhammad Bilal, Marco Canini
<span title="">2017</span> <i title="ACM Press"> <a target="_blank" rel="noopener" href="https://fatcat.wiki/container/eitdfnn7k5fohgz7jhhim3f4bm" style="color: black;">Proceedings of the 2017 Symposium on Cloud Computing - SoCC &#39;17</a> </i> &nbsp;
Our framework supports standard black-box optimization algorithms as well as a novel gray-box optimization algorithm.  ...  In this paper, we present a framework for automating parameter tuning for stream-processing systems.  ...  We thank the anonymous reviewers and our shepherd, Avrilia Floratou, for their useful feedback.  ... 
<span class="external-identifiers"> <a target="_blank" rel="external noopener noreferrer" href="https://doi.org/10.1145/3127479.3127492">doi:10.1145/3127479.3127492</a> <a target="_blank" rel="external noopener" href="https://dblp.org/rec/conf/cloud/BilalC17.html">dblp:conf/cloud/BilalC17</a> <a target="_blank" rel="external noopener" href="https://fatcat.wiki/release/lhpdfc7xfzbibljltgdgf4spjm">fatcat:lhpdfc7xfzbibljltgdgf4spjm</a> </span>
<a target="_blank" rel="noopener" href="https://web.archive.org/web/20180723140154/https://repository.kaust.edu.sa/bitstream/handle/10754/626125/p189-bilal.pdf;jsessionid=205ED47C4C8FB825A2E354567BC3A9EA?sequence=1" title="fulltext PDF download" data-goatcounter-click="serp-fulltext" data-goatcounter-title="serp-fulltext"> <button class="ui simple right pointing dropdown compact black labeled icon button serp-button"> <i class="icon ia-icon"></i> Web Archive [PDF] <div class="menu fulltext-thumbnail"> <img src="https://blobs.fatcat.wiki/thumbnail/pdf/b7/dc/b7dc7a2a3b5655cff46ef1ddfa7dbe8771e69ea5.180px.jpg" alt="fulltext thumbnail" loading="lazy"> </div> </button> </a> <a target="_blank" rel="external noopener noreferrer" href="https://doi.org/10.1145/3127479.3127492"> <button class="ui left aligned compact blue labeled icon button serp-button"> <i class="external alternate icon"></i> acm.org </button> </a>

Pico: A Domain-Specific Language For Data Analytics Pipelines

Claudia Misale, Marco Aldinucci, Guy Tremblay
<span title="2017-05-11">2017</span> <i title="Zenodo"> Zenodo </i> &nbsp;
This analysis can be considered as a first step toward a formal model to be exploited in the design of a (new) framework for Big Data analytics.  ...  For this reason, we use the Dataflow model as a starting point to build a programming environment with a simplified programming model implemented as a Domain-Specific Language, that is on top of a stack  ...  Acknowledgements Funding This work has been partially supported by the Italian Ministry of Education and Research (MIUR), by the EU-H2020 RIA project "Toreador" (no. 688797), the EU-H2020 RIA project  ... 
<span class="external-identifiers"> <a target="_blank" rel="external noopener noreferrer" href="https://doi.org/10.5281/zenodo.579753">doi:10.5281/zenodo.579753</a> <a target="_blank" rel="external noopener" href="https://fatcat.wiki/release/aadje57qh5hk3ijmqn4j7vkhpm">fatcat:aadje57qh5hk3ijmqn4j7vkhpm</a> </span>
<a target="_blank" rel="noopener" href="https://web.archive.org/web/20200112165921/https://zenodo.org/record/579753/files/Misale_thesis.pdf" title="fulltext PDF download" data-goatcounter-click="serp-fulltext" data-goatcounter-title="serp-fulltext"> <button class="ui simple right pointing dropdown compact black labeled icon button serp-button"> <i class="icon ia-icon"></i> Web Archive [PDF] <div class="menu fulltext-thumbnail"> <img src="https://blobs.fatcat.wiki/thumbnail/pdf/8d/c8/8dc8704007be7d17a23df8ca1bb2d8393f16729f.180px.jpg" alt="fulltext thumbnail" loading="lazy"> </div> </button> </a> <a target="_blank" rel="external noopener noreferrer" href="https://doi.org/10.5281/zenodo.579753"> <button class="ui left aligned compact blue labeled icon button serp-button"> <i class="unlock alternate icon" style="background-color: #fb971f;"></i> zenodo.org </button> </a>

A Comprehensive Survey on Parallelization and Elasticity in Stream Processing

Henriette Röger, Ruben Mayer
<span title="2019-04-30">2019</span> <i title="Association for Computing Machinery (ACM)"> <a target="_blank" rel="noopener" href="https://fatcat.wiki/container/eiea26iqqjcatatlgxdpzt637y" style="color: black;">ACM Computing Surveys</a> </i> &nbsp;
Again, they di er in their optimization objectives and assumptions about the operator parallelization model employed, the target system architecture, state management as well as timing and methodology.  ...  Hence, there is an urgent need for a broad investigation, classi cation and comparison of the state of the art in methods for SP parallelization and elasticity.  ...  Beyond approaches to adapt a streaming model in MapReduce, Apache Flink [20] , Apache Spark [154] and AJIRA [137] support batch and stream processing.  ... 
<span class="external-identifiers"> <a target="_blank" rel="external noopener noreferrer" href="https://doi.org/10.1145/3303849">doi:10.1145/3303849</a> <a target="_blank" rel="external noopener" href="https://fatcat.wiki/release/hq3byyhqvjg2dpryb2thwz4vfe">fatcat:hq3byyhqvjg2dpryb2thwz4vfe</a> </span>
<a target="_blank" rel="noopener" href="https://web.archive.org/web/20210715094633/https://www2.informatik.uni-stuttgart.de/bibliothek/ftp/ncstrl.ustuttgart_fi/ART-2019-20/ART-2019-20.pdf" title="fulltext PDF download" data-goatcounter-click="serp-fulltext" data-goatcounter-title="serp-fulltext"> <button class="ui simple right pointing dropdown compact black labeled icon button serp-button"> <i class="icon ia-icon"></i> Web Archive [PDF] <div class="menu fulltext-thumbnail"> <img src="https://blobs.fatcat.wiki/thumbnail/pdf/25/19/2519285458ff2bc902b505adf72872c35c36b8d6.180px.jpg" alt="fulltext thumbnail" loading="lazy"> </div> </button> </a> <a target="_blank" rel="external noopener noreferrer" href="https://doi.org/10.1145/3303849"> <button class="ui left aligned compact blue labeled icon button serp-button"> <i class="external alternate icon"></i> acm.org </button> </a>

A Comprehensive Survey on Parallelization and Elasticity in Stream Processing [article]

Henriette Röger, Ruben Mayer
<span title="2019-01-29">2019</span> <i > arXiv </i> &nbsp; <span class="release-stage" >pre-print</span>
Therefore, in this survey, we study the literature and develop a classification of current methods for both parallelization and elasticity in SP systems.  ...  The current research landscape provides a broad spectrum of methods for parallelization and elasticity in SP. Each method makes specific assumptions and focuses on particular aspects of the problem.  ...  Beyond approaches to adapt a streaming model in MapReduce, Apache Flink [20] , Apache Spark [154] and AJIRA [137] support batch and stream processing.  ... 
<span class="external-identifiers"> <a target="_blank" rel="external noopener" href="https://arxiv.org/abs/1901.09716v2">arXiv:1901.09716v2</a> <a target="_blank" rel="external noopener" href="https://fatcat.wiki/release/pgx7y4zdjrhznnewlgvr7nfc6m">fatcat:pgx7y4zdjrhznnewlgvr7nfc6m</a> </span>
<a target="_blank" rel="noopener" href="https://web.archive.org/web/20200824170121/https://arxiv.org/pdf/1901.09716v2.pdf" title="fulltext PDF download" data-goatcounter-click="serp-fulltext" data-goatcounter-title="serp-fulltext"> <button class="ui simple right pointing dropdown compact black labeled icon button serp-button"> <i class="icon ia-icon"></i> Web Archive [PDF] <div class="menu fulltext-thumbnail"> <img src="https://blobs.fatcat.wiki/thumbnail/pdf/46/4e/464e165f20eaff4ee0eb281f2718d4d04aea7344.180px.jpg" alt="fulltext thumbnail" loading="lazy"> </div> </button> </a> <a target="_blank" rel="external noopener" href="https://arxiv.org/abs/1901.09716v2" title="arxiv.org access"> <button class="ui compact blue labeled icon button serp-button"> <i class="file alternate outline icon"></i> arxiv.org </button> </a>

Big Data Semantics

Paolo Ceravolo, Antonia Azzini, Marco Angelini, Tiziana Catarci, Philippe Cudré-Mauroux, Ernesto Damiani, Alexandra Mazak, Maurice Van Keulen, Mustafa Jarrar, Giuseppe Santucci, Kai-Uwe Sattler, Monica Scannapieco (+3 others)
<span title="2018-05-23">2018</span> <i title="Springer Nature"> <a target="_blank" rel="noopener" href="https://fatcat.wiki/container/rj2j22rgnzbc3g374eiteuwo4y" style="color: black;">Journal on Data Semantics</a> </i> &nbsp;
In this paper, the third of its kind co-authored by members of IFIP WG 2.6 on Data Semantics, we propose a review of the literature addressing these topics and discuss relevant challenges for future research  ...  Indeed, multiple components and procedures must be coordinated to ensure a high level of data quality and accessibility for the application layers, e.g., data analytics and reporting.  ...  HDFS provides a foundation for several MapReducelike data processing frameworks such as Hadoop MapReduce, Apache Spark, or Flink [137] .  ... 
<span class="external-identifiers"> <a target="_blank" rel="external noopener noreferrer" href="https://doi.org/10.1007/s13740-018-0086-2">doi:10.1007/s13740-018-0086-2</a> <a target="_blank" rel="external noopener" href="https://fatcat.wiki/release/bhbeyntbtzdkvf5t3dcko42jpy">fatcat:bhbeyntbtzdkvf5t3dcko42jpy</a> </span>
<a target="_blank" rel="noopener" href="https://web.archive.org/web/20190502010443/http://www.jarrar.info/publications/BigDataSemantics.pdf" title="fulltext PDF download" data-goatcounter-click="serp-fulltext" data-goatcounter-title="serp-fulltext"> <button class="ui simple right pointing dropdown compact black labeled icon button serp-button"> <i class="icon ia-icon"></i> Web Archive [PDF] <div class="menu fulltext-thumbnail"> <img src="https://blobs.fatcat.wiki/thumbnail/pdf/f1/58/f158cad7a51f46be0d8831a00a2869ee97524048.180px.jpg" alt="fulltext thumbnail" loading="lazy"> </div> </button> </a> <a target="_blank" rel="external noopener noreferrer" href="https://doi.org/10.1007/s13740-018-0086-2"> <button class="ui left aligned compact blue labeled icon button serp-button"> <i class="external alternate icon"></i> springer.com </button> </a>
&laquo; Previous Showing results 1 &mdash; 15 out of 83 results