Filters








9 Hits in 1.8 sec

DataXFormer

John Morcos, Ziawasch Abedjan, Ihab Francis Ilyas, Mourad Ouzzani, Paolo Papotti, Michael Stonebraker
<span title="">2015</span> <i title="ACM Press"> <a target="_blank" rel="noopener" href="https://fatcat.wiki/container/vxrc3vebzzachiwy3nopwi3h5u" style="color: black;">Proceedings of the 2015 ACM SIGMOD International Conference on Management of Data - SIGMOD &#39;15</a> </i> &nbsp;
In this demonstration, we present the user-interaction with DataXFormer and show scenarios on how it can be used to transform data and explore the effectiveness and efficiency of several approaches for  ...  a look-up in some reference data.  ...  In particular, we trace the intermediate steps of DataXFormer to find and wrap Web forms in an interactive manner. An initial version of DataXFormer is already available at http://dataxformer.org.  ... 
<span class="external-identifiers"> <a target="_blank" rel="external noopener noreferrer" href="https://doi.org/10.1145/2723372.2735366">doi:10.1145/2723372.2735366</a> <a target="_blank" rel="external noopener" href="https://dblp.org/rec/conf/sigmod/MorcosAIOPS15.html">dblp:conf/sigmod/MorcosAIOPS15</a> <a target="_blank" rel="external noopener" href="https://fatcat.wiki/release/5uumnyhwzbfcdaldtghnv4ahti">fatcat:5uumnyhwzbfcdaldtghnv4ahti</a> </span>
<a target="_blank" rel="noopener" href="https://web.archive.org/web/20190218125316/https://static.aminer.org/pdf/20170130/pdfs/sigmod/jmqhznk34pmdswectr08v1gufyjz7yqs.pdf" title="fulltext PDF download" data-goatcounter-click="serp-fulltext" data-goatcounter-title="serp-fulltext"> <button class="ui simple right pointing dropdown compact black labeled icon button serp-button"> <i class="icon ia-icon"></i> Web Archive [PDF] <div class="menu fulltext-thumbnail"> <img src="https://blobs.fatcat.wiki/thumbnail/pdf/98/75/98752dbb0699bfb44fbd352851cf817d13d71a94.180px.jpg" alt="fulltext thumbnail" loading="lazy"> </div> </button> </a> <a target="_blank" rel="external noopener noreferrer" href="https://doi.org/10.1145/2723372.2735366"> <button class="ui left aligned compact blue labeled icon button serp-button"> <i class="external alternate icon"></i> acm.org </button> </a>

TrendQuery

Niranjan Kamat, Eugene Wu, Arnab Nandi
<span title="">2016</span> <i title="ACM Press"> <a target="_blank" rel="noopener" href="https://fatcat.wiki/container/vxrc3vebzzachiwy3nopwi3h5u" style="color: black;">Proceedings of the Workshop on Human-In-the-Loop Data Analytics - HILDA &#39;16</a> </i> &nbsp;
Thus, it is necessary for an expert human-in-the-loop to be involved in the process of trend analysis.  ...  The surfacing of trends from data collections such as usergenerated content streams and news articles is a popular and important data analysis activity, used in applications such as business intelligence  ...  While some other tools like Bellman [6] help users understand the quality and structure of the database, others such as Toped++ [24] transform the data as well.  ... 
<span class="external-identifiers"> <a target="_blank" rel="external noopener noreferrer" href="https://doi.org/10.1145/2939502.2939514">doi:10.1145/2939502.2939514</a> <a target="_blank" rel="external noopener" href="https://dblp.org/rec/conf/sigmod/KamatWN16.html">dblp:conf/sigmod/KamatWN16</a> <a target="_blank" rel="external noopener" href="https://fatcat.wiki/release/6sl3563v75evtmpxbzf7pp6fua">fatcat:6sl3563v75evtmpxbzf7pp6fua</a> </span>
<a target="_blank" rel="noopener" href="https://web.archive.org/web/20190218131320/https://static.aminer.org/pdf/20170130/pdfs/sigmod/rexdvsbngqwovoyfiqzajll6upsk9m0t.pdf" title="fulltext PDF download" data-goatcounter-click="serp-fulltext" data-goatcounter-title="serp-fulltext"> <button class="ui simple right pointing dropdown compact black labeled icon button serp-button"> <i class="icon ia-icon"></i> Web Archive [PDF] <div class="menu fulltext-thumbnail"> <img src="https://blobs.fatcat.wiki/thumbnail/pdf/03/db/03db9cea73c5283f1bea12a34126f5f512af0c7b.180px.jpg" alt="fulltext thumbnail" loading="lazy"> </div> </button> </a> <a target="_blank" rel="external noopener noreferrer" href="https://doi.org/10.1145/2939502.2939514"> <button class="ui left aligned compact blue labeled icon button serp-button"> <i class="external alternate icon"></i> acm.org </button> </a>

WebRelate: integrating web data with spreadsheets using examples

Jeevana Priya Inala, Rishabh Singh
<span title="2017-12-27">2017</span> <i title="Association for Computing Machinery (ACM)"> <a target="_blank" rel="noopener" href="https://fatcat.wiki/container/dqy7qc7jkzal5bz3gueys3siz4" style="color: black;">Proceedings of the ACM on Programming Languages</a> </i> &nbsp;
WebRelate achieves this by learning a string transformation program using a few example URLs.  ...  Data integration between web sources and relational data is a key challenge faced by data scientists and spreadsheet users.  ...  We would also like to thank the members of the Microsoft Excel and Power BI teams for their helpful feedback on various versions of the WebRelate system and the real-world web data integration scenarios  ... 
<span class="external-identifiers"> <a target="_blank" rel="external noopener noreferrer" href="https://doi.org/10.1145/3158090">doi:10.1145/3158090</a> <a target="_blank" rel="external noopener" href="https://dblp.org/rec/journals/pacmpl/InalaS18.html">dblp:journals/pacmpl/InalaS18</a> <a target="_blank" rel="external noopener" href="https://fatcat.wiki/release/vvwbdsxljze3ppscuctrvrfxju">fatcat:vvwbdsxljze3ppscuctrvrfxju</a> </span>
<a target="_blank" rel="noopener" href="https://web.archive.org/web/20200322063514/https://www.microsoft.com/en-us/research/wp-content/uploads/2017/12/webrelate_popl18.pdf" title="fulltext PDF download" data-goatcounter-click="serp-fulltext" data-goatcounter-title="serp-fulltext"> <button class="ui simple right pointing dropdown compact black labeled icon button serp-button"> <i class="icon ia-icon"></i> Web Archive [PDF] <div class="menu fulltext-thumbnail"> <img src="https://blobs.fatcat.wiki/thumbnail/pdf/4b/7a/4b7a993c8d5638caa6047bd57e1061b9a62a50c2.180px.jpg" alt="fulltext thumbnail" loading="lazy"> </div> </button> </a> <a target="_blank" rel="external noopener noreferrer" href="https://doi.org/10.1145/3158090"> <button class="ui left aligned compact blue labeled icon button serp-button"> <i class="external alternate icon"></i> acm.org </button> </a>

WebRelate: Integrating Web Data with Spreadsheets using Examples [article]

Jeevana Priya Inala, Rishabh Singh
<span title="2017-11-15">2017</span> <i > arXiv </i> &nbsp; <span class="release-stage" >pre-print</span>
WebRelate achieves this by learning a string transformation program using a few example URLs.  ...  Data integration between web sources and relational data is a key challenge faced by data scientists and spreadsheet users.  ...  We would also like to thank the members of the Microsoft Excel and Power BI teams for their helpful feedback on various versions of the WebRelate system and the real-world web data integration scenarios  ... 
<span class="external-identifiers"> <a target="_blank" rel="external noopener" href="https://arxiv.org/abs/1711.05787v1">arXiv:1711.05787v1</a> <a target="_blank" rel="external noopener" href="https://fatcat.wiki/release/khsrkruw5fh5hkli4ctqnbxa5e">fatcat:khsrkruw5fh5hkli4ctqnbxa5e</a> </span>
<a target="_blank" rel="noopener" href="https://web.archive.org/web/20200930014255/https://arxiv.org/pdf/1711.05787v1.pdf" title="fulltext PDF download" data-goatcounter-click="serp-fulltext" data-goatcounter-title="serp-fulltext"> <button class="ui simple right pointing dropdown compact black labeled icon button serp-button"> <i class="icon ia-icon"></i> Web Archive [PDF] <div class="menu fulltext-thumbnail"> <img src="https://blobs.fatcat.wiki/thumbnail/pdf/b0/bd/b0bdd152196fe1b9299470a6a1682aeb6928e4f2.180px.jpg" alt="fulltext thumbnail" loading="lazy"> </div> </button> </a> <a target="_blank" rel="external noopener" href="https://arxiv.org/abs/1711.05787v1" title="arxiv.org access"> <button class="ui compact blue labeled icon button serp-button"> <i class="file alternate outline icon"></i> arxiv.org </button> </a>

Detecting data errors

Ziawasch Abedjan, Xu Chu, Dong Deng, Raul Castro Fernandez, Ihab F. Ilyas, Mourad Ouzzani, Paolo Papotti, Michael Stonebraker, Nan Tang
<span title="2016-08-01">2016</span> <i title="VLDB Endowment"> <a target="_blank" rel="noopener" href="https://fatcat.wiki/container/p6rqwwpkkjbcldejepcehaalby" style="color: black;">Proceedings of the VLDB Endowment</a> </i> &nbsp;
Since different types of errors may coexist in the same data set, we often need to run more than one kind of tool.  ...  To answer these two questions, we obtained multiple data cleaning tools that utilize a variety of error detection techniques.  ...  For example the attributes journal title and journal abbreviation, which suffer from missing values, can be enriched through tools, such as DataXFormer [3] , by looking for semantic transformations of  ... 
<span class="external-identifiers"> <a target="_blank" rel="external noopener noreferrer" href="https://doi.org/10.14778/2994509.2994518">doi:10.14778/2994509.2994518</a> <a target="_blank" rel="external noopener" href="https://fatcat.wiki/release/zx4yulp3d5gzbbntm6jnmmqxgy">fatcat:zx4yulp3d5gzbbntm6jnmmqxgy</a> </span>
<a target="_blank" rel="noopener" href="https://web.archive.org/web/20190214175526/http://www.vldb.org:80/pvldb/vol9/p993-abedjan.pdf" title="fulltext PDF download" data-goatcounter-click="serp-fulltext" data-goatcounter-title="serp-fulltext"> <button class="ui simple right pointing dropdown compact black labeled icon button serp-button"> <i class="icon ia-icon"></i> Web Archive [PDF] <div class="menu fulltext-thumbnail"> <img src="https://blobs.fatcat.wiki/thumbnail/pdf/d7/d4/d7d4c94843952109131383d1ee4ba8b3ead09526.180px.jpg" alt="fulltext thumbnail" loading="lazy"> </div> </button> </a> <a target="_blank" rel="external noopener noreferrer" href="https://doi.org/10.14778/2994509.2994518"> <button class="ui left aligned compact blue labeled icon button serp-button"> <i class="external alternate icon"></i> Publisher / doi.org </button> </a>

Foofah

Zhongjun Jin, Michael R. Anderson, Michael Cafarella, H. V. Jagadish
<span title="">2017</span> <i title="ACM Press"> <a target="_blank" rel="noopener" href="https://fatcat.wiki/container/vxrc3vebzzachiwy3nopwi3h5u" style="color: black;">Proceedings of the 2017 ACM International Conference on Management of Data - SIGMOD &#39;17</a> </i> &nbsp;
This data transformation task is tedious, time-consuming, and often requires programming skills beyond the expertise of data analysts.  ...  Data transformation is a critical first step in modern data analysis: before any analysis can be done, data from a variety of sources must be wrangled into a uniform format that is amenable to the intended  ...  Manually transforming the data record-by-record would be tedious and error-prone, so he uses the interactive data cleaning tool Wrangler [22] .  ... 
<span class="external-identifiers"> <a target="_blank" rel="external noopener noreferrer" href="https://doi.org/10.1145/3035918.3064034">doi:10.1145/3035918.3064034</a> <a target="_blank" rel="external noopener" href="https://dblp.org/rec/conf/sigmod/JinACJ17.html">dblp:conf/sigmod/JinACJ17</a> <a target="_blank" rel="external noopener" href="https://fatcat.wiki/release/3lsdcqv2nnamre7z6xxge54ktu">fatcat:3lsdcqv2nnamre7z6xxge54ktu</a> </span>
<a target="_blank" rel="noopener" href="https://web.archive.org/web/20190218144727/https://static.aminer.org/pdf/20170130/pdfs/sigmod/m2365vghjkcn7jteuoyen8vicalfpo0r.pdf" title="fulltext PDF download" data-goatcounter-click="serp-fulltext" data-goatcounter-title="serp-fulltext"> <button class="ui simple right pointing dropdown compact black labeled icon button serp-button"> <i class="icon ia-icon"></i> Web Archive [PDF] <div class="menu fulltext-thumbnail"> <img src="https://blobs.fatcat.wiki/thumbnail/pdf/7b/b3/7bb31b56812fb45a05c97246241e94507667ea09.180px.jpg" alt="fulltext thumbnail" loading="lazy"> </div> </button> </a> <a target="_blank" rel="external noopener noreferrer" href="https://doi.org/10.1145/3035918.3064034"> <button class="ui left aligned compact blue labeled icon button serp-button"> <i class="external alternate icon"></i> acm.org </button> </a>

PRESISTANT: Learning based assistant for data pre-processing [article]

Besim Bilalli and Alberto Abelló and Tomàs Aluja-Banet and Robert Wrembel
<span title="2018-03-02">2018</span> <i > arXiv </i> &nbsp; <span class="release-stage" >pre-print</span>
A given data pre-processing operator (e.g., transformation) can have positive, negative or zero impact on the final result of the analysis.  ...  Data pre-processing is one of the most time consuming and relevant steps in a data analysis process (e.g., classification task).  ...  interacting with crowds.  ... 
<span class="external-identifiers"> <a target="_blank" rel="external noopener" href="https://arxiv.org/abs/1803.01024v1">arXiv:1803.01024v1</a> <a target="_blank" rel="external noopener" href="https://fatcat.wiki/release/cxlogv22qvg4bmr5omznlff3am">fatcat:cxlogv22qvg4bmr5omznlff3am</a> </span>
<a target="_blank" rel="noopener" href="https://web.archive.org/web/20200823160444/https://arxiv.org/pdf/1803.01024v1.pdf" title="fulltext PDF download" data-goatcounter-click="serp-fulltext" data-goatcounter-title="serp-fulltext"> <button class="ui simple right pointing dropdown compact black labeled icon button serp-button"> <i class="icon ia-icon"></i> Web Archive [PDF] <div class="menu fulltext-thumbnail"> <img src="https://blobs.fatcat.wiki/thumbnail/pdf/56/a1/56a1414b337d46e2683c66c777760a4a62af29ee.180px.jpg" alt="fulltext thumbnail" loading="lazy"> </div> </button> </a> <a target="_blank" rel="external noopener" href="https://arxiv.org/abs/1803.01024v1" title="arxiv.org access"> <button class="ui compact blue labeled icon button serp-button"> <i class="file alternate outline icon"></i> arxiv.org </button> </a>

AI Enabling Technologies: A Survey [article]

Vijay Gadepally, Justin Goodwin, Jeremy Kepner, Albert Reuther, Hayley Reynolds, Siddharth Samsi, Jonathan Su, David Martinez
<span title="2019-05-08">2019</span> <i > arXiv </i> &nbsp; <span class="release-stage" >pre-print</span>
These pieces include data collection, data conditioning, algorithms, computing, robust artificial intelligence, and human-machine teaming.  ...  This article is meant to highlight many of these technologies that are involved in an end-to-end AI system.  ...  The main objective for this subcomponent is to transform data into information.  ... 
<span class="external-identifiers"> <a target="_blank" rel="external noopener" href="https://arxiv.org/abs/1905.03592v1">arXiv:1905.03592v1</a> <a target="_blank" rel="external noopener" href="https://fatcat.wiki/release/dui76274qvb5bie7pok2gpek6u">fatcat:dui76274qvb5bie7pok2gpek6u</a> </span>
<a target="_blank" rel="noopener" href="https://web.archive.org/web/20200827175509/https://arxiv.org/ftp/arxiv/papers/1905/1905.03592.pdf" title="fulltext PDF download" data-goatcounter-click="serp-fulltext" data-goatcounter-title="serp-fulltext"> <button class="ui simple right pointing dropdown compact black labeled icon button serp-button"> <i class="icon ia-icon"></i> Web Archive [PDF] <div class="menu fulltext-thumbnail"> <img src="https://blobs.fatcat.wiki/thumbnail/pdf/e1/0e/e10efe62e2c9da8da728974ba34714d2705d8dec.180px.jpg" alt="fulltext thumbnail" loading="lazy"> </div> </button> </a> <a target="_blank" rel="external noopener" href="https://arxiv.org/abs/1905.03592v1" title="arxiv.org access"> <button class="ui compact blue labeled icon button serp-button"> <i class="file alternate outline icon"></i> arxiv.org </button> </a>

Just-in-time Analytics Over Heterogeneous Data and Hardware

Manolis Karpathiotakis
<span title="2017-11-28">2017</span>
Transformations & Term Validation. Semantic transformations involve an equi-join or a similarity join with auxiliary data.  ...  Existing data cleaning approaches can be classified into two main categories: The first category includes interactive tools through which a user specifies constraints for the columns of a tabular dataset  ...  For instance, an H 2 TAP engine could configure its scheduler to move unused CPU cores from task-to data-parallel archipelago, and use them for running analytical queries under light OLTP workloads.  ... 
<span class="external-identifiers"> <a target="_blank" rel="external noopener noreferrer" href="https://doi.org/10.5075/epfl-thesis-8077">doi:10.5075/epfl-thesis-8077</a> <a target="_blank" rel="external noopener" href="https://fatcat.wiki/release/y4a3i5zgcfewjbaarrkejthcgi">fatcat:y4a3i5zgcfewjbaarrkejthcgi</a> </span>
<a target="_blank" rel="noopener" href="https://web.archive.org/web/20190307093346/https://infoscience.epfl.ch/record/232585/files/EPFL_TH8077.pdf" title="fulltext PDF download" data-goatcounter-click="serp-fulltext" data-goatcounter-title="serp-fulltext"> <button class="ui simple right pointing dropdown compact black labeled icon button serp-button"> <i class="icon ia-icon"></i> Web Archive [PDF] <div class="menu fulltext-thumbnail"> <img src="https://blobs.fatcat.wiki/thumbnail/pdf/9f/29/9f291d59b903630ab33c992b43139d6c97d46b04.180px.jpg" alt="fulltext thumbnail" loading="lazy"> </div> </button> </a> <a target="_blank" rel="external noopener noreferrer" href="https://doi.org/10.5075/epfl-thesis-8077"> <button class="ui left aligned compact blue labeled icon button serp-button"> <i class="external alternate icon"></i> Publisher / doi.org </button> </a>