A copy of this work was available on the public web and has been preserved in the Wayback Machine. The capture dates from 2019; you can also visit <a rel="external noopener" href="http://pdfs.semanticscholar.org/f02d/23c0a550c7813cc444fbb4cedfe861d4360d.pdf">the original URL</a>. The file type is <code>application/pdf</code>.
Towards Distributed Model Analytics with Apache Spark
<span title="">2018</span>
<i title="SCITEPRESS - Science and Technology Publications">
<a target="_blank" rel="noopener" href="https://fatcat.wiki/container/lcvyca6jirfxxantcpxh4xhblq" style="color: black;">Proceedings of the 6th International Conference on Model-Driven Engineering and Software Development</a>
</i>
The growing number of models and other related artefacts in model-driven engineering has recently led to the emergence of approaches and tools for analyzing and managing them on a large scale. The framework SAMOS applies techniques inspired by information retrieval and data mining to analyze large sets of models. As the data size and analysis complexity goes up, however, further scalability is needed. In this paper we extend SAMOS to operate on Apache Spark, a popular engine for distributed Big
<span class="external-identifiers">
<a target="_blank" rel="external noopener noreferrer" href="https://doi.org/10.5220/0006735407670772">doi:10.5220/0006735407670772</a>
<a target="_blank" rel="external noopener" href="https://dblp.org/rec/conf/modelsward/BaburCB18.html">dblp:conf/modelsward/BaburCB18</a>
<a target="_blank" rel="external noopener" href="https://fatcat.wiki/release/kq4punadorcp3k3m6rpw5rwsqm">fatcat:kq4punadorcp3k3m6rpw5rwsqm</a>
</span>
more »
... Data processing, by partitioning the data and parallelizing the comparison and analysis phase. We present preliminary studies using a cluster infrastructure and report the results for two datasets: one with 250 Ecore metamodels where we detail the performance gain with various settings, and a larger one of 7.3k metamodels with nearly one million model elements for further demonstrating scalability. Babur, Ö., Cleophas, L. and Brand, M. Towards Distributed Model Analytics with Apache Spark.
<a target="_blank" rel="noopener" href="https://web.archive.org/web/20190308232703/http://pdfs.semanticscholar.org/f02d/23c0a550c7813cc444fbb4cedfe861d4360d.pdf" title="fulltext PDF download" data-goatcounter-click="serp-fulltext" data-goatcounter-title="serp-fulltext">
<button class="ui simple right pointing dropdown compact black labeled icon button serp-button">
<i class="icon ia-icon"></i>
Web Archive
[PDF]
<div class="menu fulltext-thumbnail">
<img src="https://blobs.fatcat.wiki/thumbnail/pdf/f0/2d/f02d23c0a550c7813cc444fbb4cedfe861d4360d.180px.jpg" alt="fulltext thumbnail" loading="lazy">
</div>
</button>
</a>
<a target="_blank" rel="external noopener noreferrer" href="https://doi.org/10.5220/0006735407670772">
<button class="ui left aligned compact blue labeled icon button serp-button">
<i class="external alternate icon"></i>
Publisher / doi.org
</button>
</a>