A copy of this work was available on the public web and has been preserved in the Wayback Machine. The capture dates from 2019; you can also visit <a rel="external noopener" href="https://static.aminer.org/pdf/20170130/pdfs/sigmod/jmqhznk34pmdswectr08v1gufyjz7yqs.pdf">the original URL</a>. The file type is <code>application/pdf</code>.
Filters
DataXFormer
<span title="">2015</span>
<i title="ACM Press">
<a target="_blank" rel="noopener" href="https://fatcat.wiki/container/vxrc3vebzzachiwy3nopwi3h5u" style="color: black;">Proceedings of the 2015 ACM SIGMOD International Conference on Management of Data - SIGMOD '15</a>
</i>
In this demonstration, we present the user-interaction with DataXFormer and show scenarios on how it can be used to transform data and explore the effectiveness and efficiency of several approaches for ...
a look-up in some reference data. ...
In particular, we trace the intermediate steps of DataXFormer to find and wrap Web forms in an interactive manner. An initial version of DataXFormer is already available at http://dataxformer.org. ...
<span class="external-identifiers">
<a target="_blank" rel="external noopener noreferrer" href="https://doi.org/10.1145/2723372.2735366">doi:10.1145/2723372.2735366</a>
<a target="_blank" rel="external noopener" href="https://dblp.org/rec/conf/sigmod/MorcosAIOPS15.html">dblp:conf/sigmod/MorcosAIOPS15</a>
<a target="_blank" rel="external noopener" href="https://fatcat.wiki/release/5uumnyhwzbfcdaldtghnv4ahti">fatcat:5uumnyhwzbfcdaldtghnv4ahti</a>
</span>
<a target="_blank" rel="noopener" href="https://web.archive.org/web/20190218125316/https://static.aminer.org/pdf/20170130/pdfs/sigmod/jmqhznk34pmdswectr08v1gufyjz7yqs.pdf" title="fulltext PDF download" data-goatcounter-click="serp-fulltext" data-goatcounter-title="serp-fulltext">
<button class="ui simple right pointing dropdown compact black labeled icon button serp-button">
<i class="icon ia-icon"></i>
Web Archive
[PDF]
<div class="menu fulltext-thumbnail">
<img src="https://blobs.fatcat.wiki/thumbnail/pdf/98/75/98752dbb0699bfb44fbd352851cf817d13d71a94.180px.jpg" alt="fulltext thumbnail" loading="lazy">
</div>
</button>
</a>
<a target="_blank" rel="external noopener noreferrer" href="https://doi.org/10.1145/2723372.2735366">
<button class="ui left aligned compact blue labeled icon button serp-button">
<i class="external alternate icon"></i>
acm.org
</button>
</a>
TrendQuery
<span title="">2016</span>
<i title="ACM Press">
<a target="_blank" rel="noopener" href="https://fatcat.wiki/container/vxrc3vebzzachiwy3nopwi3h5u" style="color: black;">Proceedings of the Workshop on Human-In-the-Loop Data Analytics - HILDA '16</a>
</i>
Thus, it is necessary for an expert human-in-the-loop to be involved in the process of trend analysis. ...
The surfacing of trends from data collections such as usergenerated content streams and news articles is a popular and important data analysis activity, used in applications such as business intelligence ...
While some other tools like Bellman [6] help users understand the quality and structure of the database, others such as Toped++ [24] transform the data as well. ...
<span class="external-identifiers">
<a target="_blank" rel="external noopener noreferrer" href="https://doi.org/10.1145/2939502.2939514">doi:10.1145/2939502.2939514</a>
<a target="_blank" rel="external noopener" href="https://dblp.org/rec/conf/sigmod/KamatWN16.html">dblp:conf/sigmod/KamatWN16</a>
<a target="_blank" rel="external noopener" href="https://fatcat.wiki/release/6sl3563v75evtmpxbzf7pp6fua">fatcat:6sl3563v75evtmpxbzf7pp6fua</a>
</span>
<a target="_blank" rel="noopener" href="https://web.archive.org/web/20190218131320/https://static.aminer.org/pdf/20170130/pdfs/sigmod/rexdvsbngqwovoyfiqzajll6upsk9m0t.pdf" title="fulltext PDF download" data-goatcounter-click="serp-fulltext" data-goatcounter-title="serp-fulltext">
<button class="ui simple right pointing dropdown compact black labeled icon button serp-button">
<i class="icon ia-icon"></i>
Web Archive
[PDF]
<div class="menu fulltext-thumbnail">
<img src="https://blobs.fatcat.wiki/thumbnail/pdf/03/db/03db9cea73c5283f1bea12a34126f5f512af0c7b.180px.jpg" alt="fulltext thumbnail" loading="lazy">
</div>
</button>
</a>
<a target="_blank" rel="external noopener noreferrer" href="https://doi.org/10.1145/2939502.2939514">
<button class="ui left aligned compact blue labeled icon button serp-button">
<i class="external alternate icon"></i>
acm.org
</button>
</a>
WebRelate: integrating web data with spreadsheets using examples
<span title="2017-12-27">2017</span>
<i title="Association for Computing Machinery (ACM)">
<a target="_blank" rel="noopener" href="https://fatcat.wiki/container/dqy7qc7jkzal5bz3gueys3siz4" style="color: black;">Proceedings of the ACM on Programming Languages</a>
</i>
WebRelate achieves this by learning a string transformation program using a few example URLs. ...
Data integration between web sources and relational data is a key challenge faced by data scientists and spreadsheet users. ...
We would also like to thank the members of the Microsoft Excel and Power BI teams for their helpful feedback on various versions of the WebRelate system and the real-world web data integration scenarios ...
<span class="external-identifiers">
<a target="_blank" rel="external noopener noreferrer" href="https://doi.org/10.1145/3158090">doi:10.1145/3158090</a>
<a target="_blank" rel="external noopener" href="https://dblp.org/rec/journals/pacmpl/InalaS18.html">dblp:journals/pacmpl/InalaS18</a>
<a target="_blank" rel="external noopener" href="https://fatcat.wiki/release/vvwbdsxljze3ppscuctrvrfxju">fatcat:vvwbdsxljze3ppscuctrvrfxju</a>
</span>
<a target="_blank" rel="noopener" href="https://web.archive.org/web/20200322063514/https://www.microsoft.com/en-us/research/wp-content/uploads/2017/12/webrelate_popl18.pdf" title="fulltext PDF download" data-goatcounter-click="serp-fulltext" data-goatcounter-title="serp-fulltext">
<button class="ui simple right pointing dropdown compact black labeled icon button serp-button">
<i class="icon ia-icon"></i>
Web Archive
[PDF]
<div class="menu fulltext-thumbnail">
<img src="https://blobs.fatcat.wiki/thumbnail/pdf/4b/7a/4b7a993c8d5638caa6047bd57e1061b9a62a50c2.180px.jpg" alt="fulltext thumbnail" loading="lazy">
</div>
</button>
</a>
<a target="_blank" rel="external noopener noreferrer" href="https://doi.org/10.1145/3158090">
<button class="ui left aligned compact blue labeled icon button serp-button">
<i class="external alternate icon"></i>
acm.org
</button>
</a>
WebRelate: Integrating Web Data with Spreadsheets using Examples
[article]
<span title="2017-11-15">2017</span>
<i >
arXiv
</i>
<span class="release-stage" >pre-print</span>
WebRelate achieves this by learning a string transformation program using a few example URLs. ...
Data integration between web sources and relational data is a key challenge faced by data scientists and spreadsheet users. ...
We would also like to thank the members of the Microsoft Excel and Power BI teams for their helpful feedback on various versions of the WebRelate system and the real-world web data integration scenarios ...
<span class="external-identifiers">
<a target="_blank" rel="external noopener" href="https://arxiv.org/abs/1711.05787v1">arXiv:1711.05787v1</a>
<a target="_blank" rel="external noopener" href="https://fatcat.wiki/release/khsrkruw5fh5hkli4ctqnbxa5e">fatcat:khsrkruw5fh5hkli4ctqnbxa5e</a>
</span>
<a target="_blank" rel="noopener" href="https://web.archive.org/web/20200930014255/https://arxiv.org/pdf/1711.05787v1.pdf" title="fulltext PDF download" data-goatcounter-click="serp-fulltext" data-goatcounter-title="serp-fulltext">
<button class="ui simple right pointing dropdown compact black labeled icon button serp-button">
<i class="icon ia-icon"></i>
Web Archive
[PDF]
<div class="menu fulltext-thumbnail">
<img src="https://blobs.fatcat.wiki/thumbnail/pdf/b0/bd/b0bdd152196fe1b9299470a6a1682aeb6928e4f2.180px.jpg" alt="fulltext thumbnail" loading="lazy">
</div>
</button>
</a>
<a target="_blank" rel="external noopener" href="https://arxiv.org/abs/1711.05787v1" title="arxiv.org access">
<button class="ui compact blue labeled icon button serp-button">
<i class="file alternate outline icon"></i>
arxiv.org
</button>
</a>
Detecting data errors
<span title="2016-08-01">2016</span>
<i title="VLDB Endowment">
<a target="_blank" rel="noopener" href="https://fatcat.wiki/container/p6rqwwpkkjbcldejepcehaalby" style="color: black;">Proceedings of the VLDB Endowment</a>
</i>
Since different types of errors may coexist in the same data set, we often need to run more than one kind of tool. ...
To answer these two questions, we obtained multiple data cleaning tools that utilize a variety of error detection techniques. ...
For example the attributes journal title and journal abbreviation, which suffer from missing values, can be enriched through tools, such as DataXFormer [3] , by looking for semantic transformations of ...
<span class="external-identifiers">
<a target="_blank" rel="external noopener noreferrer" href="https://doi.org/10.14778/2994509.2994518">doi:10.14778/2994509.2994518</a>
<a target="_blank" rel="external noopener" href="https://fatcat.wiki/release/zx4yulp3d5gzbbntm6jnmmqxgy">fatcat:zx4yulp3d5gzbbntm6jnmmqxgy</a>
</span>
<a target="_blank" rel="noopener" href="https://web.archive.org/web/20190214175526/http://www.vldb.org:80/pvldb/vol9/p993-abedjan.pdf" title="fulltext PDF download" data-goatcounter-click="serp-fulltext" data-goatcounter-title="serp-fulltext">
<button class="ui simple right pointing dropdown compact black labeled icon button serp-button">
<i class="icon ia-icon"></i>
Web Archive
[PDF]
<div class="menu fulltext-thumbnail">
<img src="https://blobs.fatcat.wiki/thumbnail/pdf/d7/d4/d7d4c94843952109131383d1ee4ba8b3ead09526.180px.jpg" alt="fulltext thumbnail" loading="lazy">
</div>
</button>
</a>
<a target="_blank" rel="external noopener noreferrer" href="https://doi.org/10.14778/2994509.2994518">
<button class="ui left aligned compact blue labeled icon button serp-button">
<i class="external alternate icon"></i>
Publisher / doi.org
</button>
</a>
Foofah
<span title="">2017</span>
<i title="ACM Press">
<a target="_blank" rel="noopener" href="https://fatcat.wiki/container/vxrc3vebzzachiwy3nopwi3h5u" style="color: black;">Proceedings of the 2017 ACM International Conference on Management of Data - SIGMOD '17</a>
</i>
This data transformation task is tedious, time-consuming, and often requires programming skills beyond the expertise of data analysts. ...
Data transformation is a critical first step in modern data analysis: before any analysis can be done, data from a variety of sources must be wrangled into a uniform format that is amenable to the intended ...
Manually transforming the data record-by-record would be tedious and error-prone, so he uses the interactive data cleaning tool Wrangler [22] . ...
<span class="external-identifiers">
<a target="_blank" rel="external noopener noreferrer" href="https://doi.org/10.1145/3035918.3064034">doi:10.1145/3035918.3064034</a>
<a target="_blank" rel="external noopener" href="https://dblp.org/rec/conf/sigmod/JinACJ17.html">dblp:conf/sigmod/JinACJ17</a>
<a target="_blank" rel="external noopener" href="https://fatcat.wiki/release/3lsdcqv2nnamre7z6xxge54ktu">fatcat:3lsdcqv2nnamre7z6xxge54ktu</a>
</span>
<a target="_blank" rel="noopener" href="https://web.archive.org/web/20190218144727/https://static.aminer.org/pdf/20170130/pdfs/sigmod/m2365vghjkcn7jteuoyen8vicalfpo0r.pdf" title="fulltext PDF download" data-goatcounter-click="serp-fulltext" data-goatcounter-title="serp-fulltext">
<button class="ui simple right pointing dropdown compact black labeled icon button serp-button">
<i class="icon ia-icon"></i>
Web Archive
[PDF]
<div class="menu fulltext-thumbnail">
<img src="https://blobs.fatcat.wiki/thumbnail/pdf/7b/b3/7bb31b56812fb45a05c97246241e94507667ea09.180px.jpg" alt="fulltext thumbnail" loading="lazy">
</div>
</button>
</a>
<a target="_blank" rel="external noopener noreferrer" href="https://doi.org/10.1145/3035918.3064034">
<button class="ui left aligned compact blue labeled icon button serp-button">
<i class="external alternate icon"></i>
acm.org
</button>
</a>
PRESISTANT: Learning based assistant for data pre-processing
[article]
<span title="2018-03-02">2018</span>
<i >
arXiv
</i>
<span class="release-stage" >pre-print</span>
A given data pre-processing operator (e.g., transformation) can have positive, negative or zero impact on the final result of the analysis. ...
Data pre-processing is one of the most time consuming and relevant steps in a data analysis process (e.g., classification task). ...
interacting with crowds. ...
<span class="external-identifiers">
<a target="_blank" rel="external noopener" href="https://arxiv.org/abs/1803.01024v1">arXiv:1803.01024v1</a>
<a target="_blank" rel="external noopener" href="https://fatcat.wiki/release/cxlogv22qvg4bmr5omznlff3am">fatcat:cxlogv22qvg4bmr5omznlff3am</a>
</span>
<a target="_blank" rel="noopener" href="https://web.archive.org/web/20200823160444/https://arxiv.org/pdf/1803.01024v1.pdf" title="fulltext PDF download" data-goatcounter-click="serp-fulltext" data-goatcounter-title="serp-fulltext">
<button class="ui simple right pointing dropdown compact black labeled icon button serp-button">
<i class="icon ia-icon"></i>
Web Archive
[PDF]
<div class="menu fulltext-thumbnail">
<img src="https://blobs.fatcat.wiki/thumbnail/pdf/56/a1/56a1414b337d46e2683c66c777760a4a62af29ee.180px.jpg" alt="fulltext thumbnail" loading="lazy">
</div>
</button>
</a>
<a target="_blank" rel="external noopener" href="https://arxiv.org/abs/1803.01024v1" title="arxiv.org access">
<button class="ui compact blue labeled icon button serp-button">
<i class="file alternate outline icon"></i>
arxiv.org
</button>
</a>
AI Enabling Technologies: A Survey
[article]
<span title="2019-05-08">2019</span>
<i >
arXiv
</i>
<span class="release-stage" >pre-print</span>
These pieces include data collection, data conditioning, algorithms, computing, robust artificial intelligence, and human-machine teaming. ...
This article is meant to highlight many of these technologies that are involved in an end-to-end AI system. ...
The main objective for this subcomponent is to transform data into information. ...
<span class="external-identifiers">
<a target="_blank" rel="external noopener" href="https://arxiv.org/abs/1905.03592v1">arXiv:1905.03592v1</a>
<a target="_blank" rel="external noopener" href="https://fatcat.wiki/release/dui76274qvb5bie7pok2gpek6u">fatcat:dui76274qvb5bie7pok2gpek6u</a>
</span>
<a target="_blank" rel="noopener" href="https://web.archive.org/web/20200827175509/https://arxiv.org/ftp/arxiv/papers/1905/1905.03592.pdf" title="fulltext PDF download" data-goatcounter-click="serp-fulltext" data-goatcounter-title="serp-fulltext">
<button class="ui simple right pointing dropdown compact black labeled icon button serp-button">
<i class="icon ia-icon"></i>
Web Archive
[PDF]
<div class="menu fulltext-thumbnail">
<img src="https://blobs.fatcat.wiki/thumbnail/pdf/e1/0e/e10efe62e2c9da8da728974ba34714d2705d8dec.180px.jpg" alt="fulltext thumbnail" loading="lazy">
</div>
</button>
</a>
<a target="_blank" rel="external noopener" href="https://arxiv.org/abs/1905.03592v1" title="arxiv.org access">
<button class="ui compact blue labeled icon button serp-button">
<i class="file alternate outline icon"></i>
arxiv.org
</button>
</a>
Just-in-time Analytics Over Heterogeneous Data and Hardware
<span title="2017-11-28">2017</span>
Transformations & Term Validation. Semantic transformations involve an equi-join or a similarity join with auxiliary data. ...
Existing data cleaning approaches can be classified into two main categories: The first category includes interactive tools through which a user specifies constraints for the columns of a tabular dataset ...
For instance, an H 2 TAP engine could configure its scheduler to move unused CPU cores from task-to data-parallel archipelago, and use them for running analytical queries under light OLTP workloads. ...
<span class="external-identifiers">
<a target="_blank" rel="external noopener noreferrer" href="https://doi.org/10.5075/epfl-thesis-8077">doi:10.5075/epfl-thesis-8077</a>
<a target="_blank" rel="external noopener" href="https://fatcat.wiki/release/y4a3i5zgcfewjbaarrkejthcgi">fatcat:y4a3i5zgcfewjbaarrkejthcgi</a>
</span>
<a target="_blank" rel="noopener" href="https://web.archive.org/web/20190307093346/https://infoscience.epfl.ch/record/232585/files/EPFL_TH8077.pdf" title="fulltext PDF download" data-goatcounter-click="serp-fulltext" data-goatcounter-title="serp-fulltext">
<button class="ui simple right pointing dropdown compact black labeled icon button serp-button">
<i class="icon ia-icon"></i>
Web Archive
[PDF]
<div class="menu fulltext-thumbnail">
<img src="https://blobs.fatcat.wiki/thumbnail/pdf/9f/29/9f291d59b903630ab33c992b43139d6c97d46b04.180px.jpg" alt="fulltext thumbnail" loading="lazy">
</div>
</button>
</a>
<a target="_blank" rel="external noopener noreferrer" href="https://doi.org/10.5075/epfl-thesis-8077">
<button class="ui left aligned compact blue labeled icon button serp-button">
<i class="external alternate icon"></i>
Publisher / doi.org
</button>
</a>