Searching Web Data using MinHash LSH

BiChen Rao, Erkang Zhu
2016 Proceedings of the 2016 International Conference on Management of Data - SIGMOD '16  
In addition, we describe an on-line demo for the index with real Web data.  ...  In this extended abstract, we explore the use of MinHash Locality Sensitive Hashing (MinHash LSH) to address the problem of indexing and searching Web data.  ...  Applying the statistical tuning strategy to LSH Ensemble is another potential direction of research. SIGMOD '16, June 26 - July 01, 2016, San Francisco, CA, USA. © 2016 Copyright held by the owner/author(s).  ...
doi:10.1145/2882903.2914838 dblp:conf/sigmod/RaoZ16 fatcat:fzxf4wp2sfgwxmzvqzk6p44h6u

Graph Summarization for Geo-correlated Trends Detection in Social Networks

Colin Biafore, Faisal Nawab
2016 Proceedings of the 2016 International Conference on Management of Data - SIGMOD '16  
These models are pre-defined and rigid, which creates the need to expose the social network graph to data scientists to introduce the human element in trends detection.  ...  We tackle this problem by providing effective graph summarizations aimed at the application of geo-correlated trends detection in social networks.  ...  ACM ISBN 978-1-4503-3531-7/16/06.  ...
doi:10.1145/2882903.2914832 dblp:conf/sigmod/BiaforeN16 fatcat:qhfpkxzh6bd4zitctgwhlcpb44

Research Contribution as a Measure of Influence

Lais M.A. Rocha, Mirella M. Moro
2016 Proceedings of the 2016 International Conference on Management of Data - SIGMOD '16  
We propose the 3c-index that measures the influence degree of researchers by evaluating the links they establish between communities. We evaluate its performance against well-known metrics.  ...  The results show 3c-index outperforms them in most cases and can be employed as a complementary metric to assess researchers' productivity.  ...  A complete version of this work is available at [8].  ...
doi:10.1145/2882903.2914834 dblp:conf/sigmod/RochaM16 fatcat:u35txl7tfncgtfqykcstpcuqyy

Adaptive Data Skipping in Main-Memory Systems

Wilson Qin, Stratos Idreos
2016 Proceedings of the 2016 International Conference on Management of Data - SIGMOD '16  
Scans benefit from data skipping when the data order is sorted, semi-sorted, or comprised of clustered values. However, data skipping loses effectiveness over arbitrary data distributions.  ...  Applying data skipping techniques over non-sorted data can significantly decrease query performance since the extra cost of metadata reads results in no corresponding scan performance gains.  ...
doi:10.1145/2882903.2914836 dblp:conf/sigmod/QinI16 fatcat:clyllh6lx5ayhhonrb3fuc6cvm

Main Memory Adaptive Denormalization

Zezhou Liu, Stratos Idreos
2016 Proceedings of the 2016 International Conference on Management of Data - SIGMOD '16  
the minimized loading, update, and storage costs of a normalized schema.  ...  We replace the traditional join operations with efficient scans over the relevant partial universal tables without incurring the prohibitive costs of full denormalization.  ...
doi:10.1145/2882903.2914835 dblp:conf/sigmod/LiuI16 fatcat:uw7gvnftfvam3jhch5gubdlaxq

Exploring Visualization of Data Transforms

Larry Xu
2016 Proceedings of the 2016 International Conference on Management of Data - SIGMOD '16  
Through a series of user studies, we evaluate tweening as an effective method of understanding the changes that result from data transformations.  ...  We present the concept of "tweening" of resultsets as a method of incrementally visualizing data transformations, and explore approaches towards generating these resultset tweens.  ...
doi:10.1145/2882903.2914837 dblp:conf/sigmod/Xu16 fatcat:b3gfntjnmjam7cihyqp5owlzwm

Big Graph Analytics Systems

Da Yan, Yingyi Bu, Yuanyuan Tian, Amol Deshpande, James Cheng
2016 Proceedings of the 2016 International Conference on Management of Data - SIGMOD '16  
The topics covered in this tutorial include programming models and algorithm design, computation models, communication mechanisms, out-of-core support, fault tolerance, dynamic graph support, and so on  ...  We also highlight future research opportunities on Big Graph analytics.  ...  Amol Deshpande is supported by NSF under grant IIS-1319432, and an IBM Collaborative Research Award.  ...
doi:10.1145/2882903.2912566 dblp:conf/sigmod/YanBTDC16 fatcat:cgmkhia4gnbbtnyhrnvz3ea6fy

Automatic Entity Recognition and Typing in Massive Text Data

Xiang Ren, Ahmed El-Kishky, Heng Ji, Jiawei Han
2016 Proceedings of the 2016 International Conference on Management of Data - SIGMOD '16  
Since these methods do not rely on annotated data, predefined typing schema or hand-crafted features, they can be quickly adapted to a new domain, genre and language.  ...  In this tutorial, we introduce data-driven methods to recognize typed entities of interest in massive, domain-specific text corpora.  ...
doi:10.1145/2882903.2912567 dblp:conf/sigmod/RenEJH16 fatcat:ycuyy6wdt5ffjjucikqbamvbhq

Minimizing Average Regret Ratio in Database

Sepanta Zeighami, Raymond Chi-Wing Wong
2016 Proceedings of the 2016 International Conference on Management of Data - SIGMOD '16  
While assuming the existence of some utility functions for the users, in contrast to the top-k query, it does not require a user to input his or her utility function but instead depends on the probability  ...  We propose "average regret ratio" as a metric to measure users' satisfaction after a user sees k selected points of a database, instead of all of the points in the database.  ...  Figure 1: Results on the House-6d dataset, varying k.  ...
doi:10.1145/2882903.2914831 dblp:conf/sigmod/ZeighamiW16 fatcat:idkg57xw4nb27dg3xn7cv6znvi

Vectorizing an In Situ Query Engine

Panagiotis Sioulas, Anastasia Ailamaki
2016 Proceedings of the 2016 International Conference on Management of Data - SIGMOD '16  
On the other hand, performing analysis over raw data entails numerous overheads because of the potentially inefficient data representations.  ...  In this paper, we investigate the effect of vector processing on raw data querying. We enhance the operators of a query engine to use SIMD operations.  ...  Figure 3: Misprediction sensitivity  ...
doi:10.1145/2882903.2914829 dblp:conf/sigmod/SioulasA16 fatcat:kikxqopcuzdmdmoq3wamezhesy

Provenance

Melanie Herschel, Marcel Hlawatsch
2016 Proceedings of the 2016 International Conference on Management of Data - SIGMOD '16  
The second part of this tutorial therefore focuses on enabling users to leverage provenance through adapted visualizations.  ...  In the past, different types of provenance meta-data have been proposed, each with a different scope.  ...  The authors thank the German Research Foundation (DFG) for financial support within projects B01 and D03 of SFB/Transregio 161.  ... 
doi:10.1145/2882903.2912568 dblp:conf/sigmod/HerschelH16 fatcat:j56hczlu4ffzhcfndzugmy6p2m

CLAMS

Mina Farid, Alexandra Roatis, Ihab F. Ilyas, Hella-Franziska Hoffmann, Xu Chu
2016 Proceedings of the 2016 International Conference on Management of Data - SIGMOD '16  
"load-first" paradigm, the new environment presents serious data management challenges.  ...  Among them, the assessment of data quality and cleaning large volumes of heterogeneous data sources become essential tasks in unveiling the value of big data.  ...
doi:10.1145/2882903.2899391 dblp:conf/sigmod/FaridRIHC16 fatcat:re5aay3od5d3vmeutzva6chrse

Web-based Benchmarks for Forecasting Systems

Robert Ulbricht, Claudio Hartmann, Martin Hahmann, Hilko Donker, Wolfgang Lehner
2016 Proceedings of the 2016 International Conference on Management of Data - SIGMOD '16  
We propose the ECAST online platform in order to solve that problem. The system's capability is demonstrated on a real-world use case by comparing the performance of different prediction tools.  ...  The role of precise forecasts in the energy domain has changed dramatically.  ...  We gratefully acknowledge the contributions of Lucas Bruenings and Johannes Wilke.  ... 
doi:10.1145/2882903.2899399 dblp:conf/sigmod/UlbrichtHHDL16 fatcat:xjzqa5p2lncahaeufyjvughbf4

Emma in Action

Alexander Alexandrov, Andreas Salzmann, Georgi Krastev, Asterios Katsifodimos, Volker Markl
2016 Proceedings of the 2016 International Conference on Management of Data - SIGMOD '16  
To retain a sufficient level of abstraction and lower the barrier of entry for data scientists, projects like Spark and Flink currently offer domain-specific APIs on top of their parallel collection abstractions  ...  The proposed design promises increased programmer productivity due to avoiding an impedance mismatch, thereby reducing the lag times and cost of data analysis.  ...
doi:10.1145/2882903.2899396 dblp:conf/sigmod/AlexandrovSKKM16 fatcat:hre3crgnj5dp5mjiq22etivfpu

The Challenges of Global-scale Data Management

Faisal Nawab, Divyakant Agrawal, Amr El Abbadi
2016 Proceedings of the 2016 International Conference on Management of Data - SIGMOD '16  
This has led to the emergence of global-scale data management and event processing.  ...  of data management systems.  ...  Acknowledgment: This work is supported by NSF grants IIS 1018637, 1528178, and 1442966 and is partially funded by a gift grant from NEC Labs America.  ...
doi:10.1145/2882903.2912571 dblp:conf/sigmod/NawabAA16 fatcat:4ja2phoh7zajxnckcupa25o65y