Discovering Domain Orders through Order Dependencies [article]

Reza Karegar, Melicaalsadat Mirsafian, Parke Godfrey, Lukasz Golab, Mehdi Kargar, Divesh Srivastava, Jaroslaw Szlichta
2021 (2021-09-07), arXiv, pre-print
Much real-world data come with explicitly defined domain orders; e.g., lexicographic order for strings, numeric for integers, and chronological for time. Our goal is to discover implicit domain orders that we do not already know; for instance, that the order of months in the Chinese Lunar calendar is Corner < Apricot < Peach. To do so, we enhance data profiling methods by discovering implicit domain orders in data through order dependencies. We enumerate tractable special cases and proceed towards the most general case, which we prove is NP-complete. We show that the general case nevertheless can be effectively handled by a SAT solver. We also devise an interestingness measure to rank the discovered implicit domain orders, which we validate with a user study. Based on an extensive suite of experiments with real-world data, we establish the efficacy of our algorithms, and the utility of the domain orders discovered by demonstrating significant added value in three applications (data profiling, query optimization, and data mining).
arXiv:2005.14068v4 (https://arxiv.org/abs/2005.14068v4); fatcat:mz5mxrngyzcvfowwn6rljrz76i (https://fatcat.wiki/release/mz5mxrngyzcvfowwn6rljrz76i)
Web Archive [PDF] (fulltext, not primary version): https://web.archive.org/web/20201020225916/https://arxiv.org/pdf/2005.14068v3.pdf

Profiles of thesis students in the field of computer science and their link with UM research projects

Marisa Daniela Panizzi, Iris Sattolo, Javier Lafont, Nicolas Armilla
2020 (2020-02-13), Figshare
[EP34] Shivangi Chopra, Hannah Gautreau, Abeer Khan, Melicaalsadat Mirsafian, & Lukasz Golab. (2018). Gender Differences in Undergraduate Engineering Applicants: A Text Mining Approach.  ... 
doi:10.6084/m9.figshare.11852637.v1 (https://doi.org/10.6084/m9.figshare.11852637.v1); fatcat:4uvfkyp3srfyzflrnboq5c4tuu (https://fatcat.wiki/release/4uvfkyp3srfyzflrnboq5c4tuu)
Web Archive [PDF] (fulltext): https://web.archive.org/web/20200214063358/https://s3-eu-west-1.amazonaws.com/pfigshare-u-files/21724389/AppendixSpringer2020.pdf