Optimization framework for DFG-based automated process discovery approaches

Adriano Augusto, Marlon Dumas, Marcello La Rosa, Sander J. J. Leemans, Seppe K. L. M. vanden Broucke
<span title="2021-02-27">2021</span> <i title="Springer Science and Business Media LLC"> <a target="_blank" rel="noopener" href="https://fatcat.wiki/container/dfdth37zyfetvchsdjk5e2osrq" style="color: black;">Journal of Software and Systems Modeling</a> </i> &nbsp;
AbstractThe problem of automatically discovering business process models from event logs has been intensely investigated in the past two decades, leading to a wide range of approaches that strike various trade-offs between accuracy, model complexity, and execution time. A few studies have suggested that the accuracy of automated process discovery approaches can be enhanced by means of metaheuristic optimization techniques. However, these studies have remained at the level of proposals without
more &raquo; ... lidation on real-life datasets or they have only considered one metaheuristic in isolation. This article presents a metaheuristic optimization framework for automated process discovery. The key idea of the framework is to construct a directly-follows graph (DFG) from the event log, to perturb this DFG so as to generate new candidate solutions, and to apply a DFG-based automated process discovery approach in order to derive a process model from each DFG. The framework can be instantiated by linking it to an automated process discovery approach, an optimization metaheuristic, and the quality measure to be optimized (e.g., fitness, precision, F-score). The article considers several instantiations of the framework corresponding to four optimization metaheuristics, three automated process discovery approaches (Inductive Miner—directly-follows, Fodina, and Split Miner), and one accuracy measure (Markovian F-score). These framework instances are compared using a set of 20 real-life event logs. The evaluation shows that metaheuristic optimization consistently yields visible improvements in F-score for all the three automated process discovery approaches, at the cost of execution times in the order of minutes, versus seconds for the baseline approaches.
<span class="external-identifiers"> <a target="_blank" rel="external noopener noreferrer" href="https://doi.org/10.1007/s10270-020-00846-x">doi:10.1007/s10270-020-00846-x</a> <a target="_blank" rel="external noopener" href="https://fatcat.wiki/release/qzo2akk6yfdvfegngujna2kcoi">fatcat:qzo2akk6yfdvfegngujna2kcoi</a> </span>
<a target="_blank" rel="noopener" href="https://web.archive.org/web/20210718094832/https://link.springer.com/content/pdf/10.1007/s10270-020-00846-x.pdf?error=cookies_not_supported&amp;code=1f2a3e70-9b89-4b12-9038-682f519bc8fb" title="fulltext PDF download" data-goatcounter-click="serp-fulltext" data-goatcounter-title="serp-fulltext"> <button class="ui simple right pointing dropdown compact black labeled icon button serp-button"> <i class="icon ia-icon"></i> Web Archive [PDF] <div class="menu fulltext-thumbnail"> <img src="https://blobs.fatcat.wiki/thumbnail/pdf/32/c7/32c72d33991f9b8a4cd5d9dbd0c27ec761940873.180px.jpg" alt="fulltext thumbnail" loading="lazy"> </div> </button> </a> <a target="_blank" rel="external noopener noreferrer" href="https://doi.org/10.1007/s10270-020-00846-x"> <button class="ui left aligned compact blue labeled icon button serp-button"> <i class="unlock alternate icon" style="background-color: #fb971f;"></i> springer.com </button> </a>