Filters








60,824 Hits in 6.0 sec

Time Adaptive Sketches (Ada-Sketches) for Summarizing Data Streams

Anshumali Shrivastava, Arnd Christian Konig, Mikhail Bilenko
<span title="">2016</span> <i title="ACM Press"> <a target="_blank" rel="noopener" href="https://fatcat.wiki/container/vxrc3vebzzachiwy3nopwi3h5u" style="color: black;">Proceedings of the 2016 International Conference on Management of Data - SIGMOD &#39;16</a> </i> &nbsp;
In this work, we describe a new method, Time-adaptive Sketches, (Ada-sketch), that overcomes these limitations, while extending and providing a strict generalization of several popular sketching algorithms  ...  The simplicity of the procedure and the method's generalization of classic sketching techniques give hope for wide applicability of Ada-sketches in practice.  ...  Hokusai uses a set of Count-Min sketches for different time intervals, to estimate the counts of any item for a given time or interval.  ... 
<span class="external-identifiers"> <a target="_blank" rel="external noopener noreferrer" href="https://doi.org/10.1145/2882903.2882946">doi:10.1145/2882903.2882946</a> <a target="_blank" rel="external noopener" href="https://dblp.org/rec/conf/sigmod/ShrivastavaKB16.html">dblp:conf/sigmod/ShrivastavaKB16</a> <a target="_blank" rel="external noopener" href="https://fatcat.wiki/release/3astgrdxcjeqzlhpqucdu2kgfe">fatcat:3astgrdxcjeqzlhpqucdu2kgfe</a> </span>
<a target="_blank" rel="noopener" href="https://web.archive.org/web/20160910195945/http://www.cs.rice.edu:80/~as143/Papers/16-ada-sketches.pdf" title="fulltext PDF download" data-goatcounter-click="serp-fulltext" data-goatcounter-title="serp-fulltext"> <button class="ui simple right pointing dropdown compact black labeled icon button serp-button"> <i class="icon ia-icon"></i> Web Archive [PDF] <div class="menu fulltext-thumbnail"> <img src="https://blobs.fatcat.wiki/thumbnail/pdf/03/b9/03b91a1e49805406f041ca3399be730729c62338.180px.jpg" alt="fulltext thumbnail" loading="lazy"> </div> </button> </a> <a target="_blank" rel="external noopener noreferrer" href="https://doi.org/10.1145/2882903.2882946"> <button class="ui left aligned compact blue labeled icon button serp-button"> <i class="external alternate icon"></i> acm.org </button> </a>

Approximation techniques for spatial data

Abhinandan Das, Johannes Gehrke, Mirek Riedewald
<span title="">2004</span> <i title="ACM Press"> <a target="_blank" rel="noopener" href="https://fatcat.wiki/container/vxrc3vebzzachiwy3nopwi3h5u" style="color: black;">Proceedings of the 2004 ACM SIGMOD international conference on Management of data - SIGMOD &#39;04</a> </i> &nbsp;
Our techniques can be constructed in a single scan over the input, handle inserts and deletes to the database incrementally, and hence they can also be used for processing of streaming spatial data.  ...  We present a detailed analysis and experimentally demonstrate the efficacy of the proposed techniques.  ...  Note that our techniques easily generalize to multidimensional data spaces with different domains for the dimensions. In Section 5.1 we discuss how to handle real-valued domains. Definition 1.  ... 
<span class="external-identifiers"> <a target="_blank" rel="external noopener noreferrer" href="https://doi.org/10.1145/1007568.1007646">doi:10.1145/1007568.1007646</a> <a target="_blank" rel="external noopener" href="https://dblp.org/rec/conf/sigmod/DasGR04.html">dblp:conf/sigmod/DasGR04</a> <a target="_blank" rel="external noopener" href="https://fatcat.wiki/release/34btkjeoivadzcc5qh4rdvvth4">fatcat:34btkjeoivadzcc5qh4rdvvth4</a> </span>
<a target="_blank" rel="noopener" href="https://web.archive.org/web/20170706110710/http://www.cs.cornell.edu/johannes/papers/2004/sigmod2004-spatial.pdf" title="fulltext PDF download" data-goatcounter-click="serp-fulltext" data-goatcounter-title="serp-fulltext"> <button class="ui simple right pointing dropdown compact black labeled icon button serp-button"> <i class="icon ia-icon"></i> Web Archive [PDF] <div class="menu fulltext-thumbnail"> <img src="https://blobs.fatcat.wiki/thumbnail/pdf/4b/7d/4b7de76e9aff445bb5f384080580c9d85ff1a590.180px.jpg" alt="fulltext thumbnail" loading="lazy"> </div> </button> </a> <a target="_blank" rel="external noopener noreferrer" href="https://doi.org/10.1145/1007568.1007646"> <button class="ui left aligned compact blue labeled icon button serp-button"> <i class="external alternate icon"></i> acm.org </button> </a>

On Efficient Query Processing of Stream Counts on the Cell Processor

Dina Thomas, Rajesh Bordawekar, Charu C. Aggarwal, Philip S. Yu
<span title="">2009</span> <i title="IEEE"> <a target="_blank" rel="noopener" href="https://fatcat.wiki/container/6nlwwu55hrbtxnvepn5gtiqug4" style="color: black;">Proceedings / International Conference on Data Engineering</a> </i> &nbsp;
To address these concerns, we implement a sketch-based counting algorithm, FCM, that is specifically adapted for the Cell processor architecture.  ...  In recent years, the sketch-based technique has been presented as an effective method for counting stream items on processors with limited storage and processing capabilities, such as the network processors  ...  Therefore, our first step was to examine the different sketch-based algorithms for their suitability for adaptation to the Cell processor.  ... 
<span class="external-identifiers"> <a target="_blank" rel="external noopener noreferrer" href="https://doi.org/10.1109/icde.2009.35">doi:10.1109/icde.2009.35</a> <a target="_blank" rel="external noopener" href="https://dblp.org/rec/conf/icde/ThomasBAY09.html">dblp:conf/icde/ThomasBAY09</a> <a target="_blank" rel="external noopener" href="https://fatcat.wiki/release/x3x3rezez5dzdcr62fri3wrwmm">fatcat:x3x3rezez5dzdcr62fri3wrwmm</a> </span>
<a target="_blank" rel="noopener" href="https://web.archive.org/web/20121019130128/http://charuaggarwal.net/cell-icde.pdf" title="fulltext PDF download" data-goatcounter-click="serp-fulltext" data-goatcounter-title="serp-fulltext"> <button class="ui simple right pointing dropdown compact black labeled icon button serp-button"> <i class="icon ia-icon"></i> Web Archive [PDF] <div class="menu fulltext-thumbnail"> <img src="https://blobs.fatcat.wiki/thumbnail/pdf/21/41/2141575db0fd11bdc5326fcdec4cc065f17b8767.180px.jpg" alt="fulltext thumbnail" loading="lazy"> </div> </button> </a> <a target="_blank" rel="external noopener noreferrer" href="https://doi.org/10.1109/icde.2009.35"> <button class="ui left aligned compact blue labeled icon button serp-button"> <i class="external alternate icon"></i> ieee.com </button> </a>

Survey on Query Estimation in Data Streams

Sudhanshu Gupta, Deepak Garg
<span title="">2009</span> <i title="IEEE"> 2009 IEEE International Advance Computing Conference </i> &nbsp;
Sampling, Histograms, Wavelets, Sketches, Discrete In this paper we will provide general view of the query cosine series etc. are used to store data distribution estimation, along with its related work  ...  So it becomes challenging to determine the challenging in case of fast, continuous, online data best set of synopses for a given combination of datasets, streams.  ...  In order for such techniques to be useful tehe g estimation techniques should be effective for different priori knowledge.  ... 
<span class="external-identifiers"> <a target="_blank" rel="external noopener noreferrer" href="https://doi.org/10.1109/iadcc.2009.4809224">doi:10.1109/iadcc.2009.4809224</a> <a target="_blank" rel="external noopener" href="https://fatcat.wiki/release/ptjvk56acjczbhhurdc2m2vt5m">fatcat:ptjvk56acjczbhhurdc2m2vt5m</a> </span>
<a target="_blank" rel="noopener" href="https://web.archive.org/web/20160806183955/http://gdeepak.com/pubs/survey%20on%20query%20estimation%20in%20data%20streams.pdf" title="fulltext PDF download" data-goatcounter-click="serp-fulltext" data-goatcounter-title="serp-fulltext"> <button class="ui simple right pointing dropdown compact black labeled icon button serp-button"> <i class="icon ia-icon"></i> Web Archive [PDF] <div class="menu fulltext-thumbnail"> <img src="https://blobs.fatcat.wiki/thumbnail/pdf/b1/60/b160d0a89f755e38aa81f57eb7c8f1a4b3a8c63e.180px.jpg" alt="fulltext thumbnail" loading="lazy"> </div> </button> </a> <a target="_blank" rel="external noopener noreferrer" href="https://doi.org/10.1109/iadcc.2009.4809224"> <button class="ui left aligned compact blue labeled icon button serp-button"> <i class="external alternate icon"></i> ieee.com </button> </a>

Approximate Colored Range Queries [chapter]

Ying Kit Lai, Chung Keung Poon, Benyun Shi
<span title="">2005</span> <i title="Springer Berlin Heidelberg"> <a target="_blank" rel="noopener" href="https://fatcat.wiki/container/2w3awgokqne6te4nvlofavy5a4" style="color: black;">Lecture Notes in Computer Science</a> </i> &nbsp;
In this paper, we formulate a class of colored range query problems to model the multi-dimensional range queries in the presence of categorical information.  ...  By applying appropriate sketching techniques on our framework, we obtained efficient data structures that provide approximate solutions to these problems.  ...  In terms of techniques, they adapted the intuition and ideas from [12, 11] and extended them to work for higher dimensions.  ... 
<span class="external-identifiers"> <a target="_blank" rel="external noopener noreferrer" href="https://doi.org/10.1007/11602613_37">doi:10.1007/11602613_37</a> <a target="_blank" rel="external noopener" href="https://fatcat.wiki/release/hq6iofpfrva6xckd3pz7pre43e">fatcat:hq6iofpfrva6xckd3pz7pre43e</a> </span>
<a target="_blank" rel="noopener" href="https://web.archive.org/web/20090625185515/http://www.cs.cityu.edu.hk/~ckpoon/research/colored.pdf" title="fulltext PDF download" data-goatcounter-click="serp-fulltext" data-goatcounter-title="serp-fulltext"> <button class="ui simple right pointing dropdown compact black labeled icon button serp-button"> <i class="icon ia-icon"></i> Web Archive [PDF] <div class="menu fulltext-thumbnail"> <img src="https://blobs.fatcat.wiki/thumbnail/pdf/23/ad/23adb01da91b577d057b3062d547792998812b8c.180px.jpg" alt="fulltext thumbnail" loading="lazy"> </div> </button> </a> <a target="_blank" rel="external noopener noreferrer" href="https://doi.org/10.1007/11602613_37"> <button class="ui left aligned compact blue labeled icon button serp-button"> <i class="external alternate icon"></i> springer.com </button> </a>

Approximate colored range and point enclosure queries

Ying Kit Lai, Chung Keung Poon, Benyun Shi
<span title="">2008</span> <i title="Elsevier BV"> <a target="_blank" rel="noopener" href="https://fatcat.wiki/container/usw2n4yaarcurchx7i4on2iqea" style="color: black;">Journal of Discrete Algorithms</a> </i> &nbsp;
Based on a new framework of combining sketching techniques and traditional data structures, we obtain two sets of results in solving the problems approximately and efficiently.  ...  Many of these problems are difficult to solve using traditional data structural techniques.  ...  Acknowledgement The authors thank the anonymous referees for their helpful comments on improving the presentation of the paper.  ... 
<span class="external-identifiers"> <a target="_blank" rel="external noopener noreferrer" href="https://doi.org/10.1016/j.jda.2007.10.001">doi:10.1016/j.jda.2007.10.001</a> <a target="_blank" rel="external noopener" href="https://fatcat.wiki/release/4vb3zathmbbxllxqwtiw5letda">fatcat:4vb3zathmbbxllxqwtiw5letda</a> </span>
<a target="_blank" rel="noopener" href="https://web.archive.org/web/20190319232242/https://core.ac.uk/download/pdf/82474884.pdf" title="fulltext PDF download" data-goatcounter-click="serp-fulltext" data-goatcounter-title="serp-fulltext"> <button class="ui simple right pointing dropdown compact black labeled icon button serp-button"> <i class="icon ia-icon"></i> Web Archive [PDF] <div class="menu fulltext-thumbnail"> <img src="https://blobs.fatcat.wiki/thumbnail/pdf/62/95/6295e960e0fb4131572c9a324425978f217ae33a.180px.jpg" alt="fulltext thumbnail" loading="lazy"> </div> </button> </a> <a target="_blank" rel="external noopener noreferrer" href="https://doi.org/10.1016/j.jda.2007.10.001"> <button class="ui left aligned compact blue labeled icon button serp-button"> <i class="external alternate icon"></i> elsevier.com </button> </a>

Privacy Preserving Gate Counting with Collaborative Bluetooth Scanners [chapter]

Nelson Gonçalves, Rui José, Carlos Baquero
<span title="">2011</span> <i title="Springer Berlin Heidelberg"> <a target="_blank" rel="noopener" href="https://fatcat.wiki/container/2w3awgokqne6te4nvlofavy5a4" style="color: black;">Lecture Notes in Computer Science</a> </i> &nbsp;
To this end, we present an analysis of several stochastic counting techniques that not only provide an accurate count for the number of unique devices, but offer privacy guarantees as well.  ...  This paper shows how Bluetooth scanning can be used in gate counting scenarios, where the main goal is to provide an accurate count for the number of unique devices sighted.  ...  and a general purpose platform for massive sensing and actuation in urban spaces.  ... 
<span class="external-identifiers"> <a target="_blank" rel="external noopener noreferrer" href="https://doi.org/10.1007/978-3-642-25126-9_65">doi:10.1007/978-3-642-25126-9_65</a> <a target="_blank" rel="external noopener" href="https://fatcat.wiki/release/bzx7ygorjjh2payiubyvqndi2m">fatcat:bzx7ygorjjh2payiubyvqndi2m</a> </span>
<a target="_blank" rel="noopener" href="https://web.archive.org/web/20180721175928/http://gsd.di.uminho.pt/members/cbm/ps/cr.pdf" title="fulltext PDF download" data-goatcounter-click="serp-fulltext" data-goatcounter-title="serp-fulltext"> <button class="ui simple right pointing dropdown compact black labeled icon button serp-button"> <i class="icon ia-icon"></i> Web Archive [PDF] <div class="menu fulltext-thumbnail"> <img src="https://blobs.fatcat.wiki/thumbnail/pdf/24/bf/24bfe9c5b24fa705ef72e4e541ecb807482fb9ee.180px.jpg" alt="fulltext thumbnail" loading="lazy"> </div> </button> </a> <a target="_blank" rel="external noopener noreferrer" href="https://doi.org/10.1007/978-3-642-25126-9_65"> <button class="ui left aligned compact blue labeled icon button serp-button"> <i class="external alternate icon"></i> springer.com </button> </a>

An Adaptive Grid and Incentive Mechanism for Personalized Differentially Private Location Data in the Local Setting

Kangsoo Jung, Seog Park, Peter Brida
<span title="2020-12-30">2020</span> <i title="Hindawi Limited"> <a target="_blank" rel="noopener" href="https://fatcat.wiki/container/khqygtfojnby3jyh5obuwd7ko4" style="color: black;">Mobile Information Systems</a> </i> &nbsp;
The proposed incentive mechanism has two models, and both models pay the incentive differently according to the user's safe region size to motivate to set a more precise safe region.  ...  In this paper, we propose a local differential privacy scheme in an environment where there is no trusted third party to implement privacy protection techniques and incentive mechanisms to motivate users  ...  Our intuition is that the count-min sketch is suitable because location data is generally skewed in a specific area.  ... 
<span class="external-identifiers"> <a target="_blank" rel="external noopener noreferrer" href="https://doi.org/10.1155/2020/8898223">doi:10.1155/2020/8898223</a> <a target="_blank" rel="external noopener" href="https://fatcat.wiki/release/ned6kcz5qrd3bhzhnkkqfbei2m">fatcat:ned6kcz5qrd3bhzhnkkqfbei2m</a> </span>
<a target="_blank" rel="noopener" href="https://web.archive.org/web/20210217100911/https://downloads.hindawi.com/journals/misy/2020/8898223.pdf" title="fulltext PDF download" data-goatcounter-click="serp-fulltext" data-goatcounter-title="serp-fulltext"> <button class="ui simple right pointing dropdown compact black labeled icon button serp-button"> <i class="icon ia-icon"></i> Web Archive [PDF] <div class="menu fulltext-thumbnail"> <img src="https://blobs.fatcat.wiki/thumbnail/pdf/0f/0f/0f0fdc648226ace989a7017219d44d3cdaa9db9e.180px.jpg" alt="fulltext thumbnail" loading="lazy"> </div> </button> </a> <a target="_blank" rel="external noopener noreferrer" href="https://doi.org/10.1155/2020/8898223"> <button class="ui left aligned compact blue labeled icon button serp-button"> <i class="unlock alternate icon" style="background-color: #fb971f;"></i> hindawi.com </button> </a>

Signature-File-Based Approach for Query Answering Over Wireless Sensor Networks

Mo Li, Lei Chen, Jizhong Zhao, Qian Zhang, Yunhao Liu
<span title="">2008</span> <i title="Institute of Electrical and Electronics Engineers (IEEE)"> <a target="_blank" rel="noopener" href="https://fatcat.wiki/container/fmebo7gm4vhg7ljs3x2bbrazk4" style="color: black;">IEEE Transactions on Vehicular Technology</a> </i> &nbsp;
Because sensor nodes are generally battery powered, to prolong network lifetime, energy conservation becomes a major concern in answering queries over sensor networks.  ...  We propose a signature-file-based approach to approximately answer queries over WSNs.  ...  Current environment monitoring is typically manually conducted and in a sparse way, due to the lack of corresponding techniques for constructing a large-scale sensing system, which conforms to the practical  ... 
<span class="external-identifiers"> <a target="_blank" rel="external noopener noreferrer" href="https://doi.org/10.1109/tvt.2007.912340">doi:10.1109/tvt.2007.912340</a> <a target="_blank" rel="external noopener" href="https://fatcat.wiki/release/kl7jhan5t5bufjaskhj5zuvmoe">fatcat:kl7jhan5t5bufjaskhj5zuvmoe</a> </span>
<a target="_blank" rel="noopener" href="https://web.archive.org/web/20171025200555/https://core.ac.uk/download/pdf/21751894.pdf" title="fulltext PDF download" data-goatcounter-click="serp-fulltext" data-goatcounter-title="serp-fulltext"> <button class="ui simple right pointing dropdown compact black labeled icon button serp-button"> <i class="icon ia-icon"></i> Web Archive [PDF] <div class="menu fulltext-thumbnail"> <img src="https://blobs.fatcat.wiki/thumbnail/pdf/1e/99/1e9983e6b1e61d0e6c8301dfa7b3592102df6526.180px.jpg" alt="fulltext thumbnail" loading="lazy"> </div> </button> </a> <a target="_blank" rel="external noopener noreferrer" href="https://doi.org/10.1109/tvt.2007.912340"> <button class="ui left aligned compact blue labeled icon button serp-button"> <i class="external alternate icon"></i> ieee.com </button> </a>

A survey of sketches in traffic measurement: Design, Optimization, Application and Implementation [article]

Shangsen Li, Lailong Luo, Deke Guo, Qianzhen Zhang, Pengtao Fu
<span title="2021-07-20">2021</span> <i > arXiv </i> &nbsp; <span class="release-stage" >pre-print</span>
Furthermore, we conduct a comprehensive analysis and qualitative/quantitative comparison of the sketch designs.  ...  Currently, tremendous redesigns and optimizations have been proposed to improve the sketches for better network measurement performance.  ...  However, CML Sketch sacrifices the ability to support deletions for a more extensive counting range.  ... 
<span class="external-identifiers"> <a target="_blank" rel="external noopener" href="https://arxiv.org/abs/2012.07214v2">arXiv:2012.07214v2</a> <a target="_blank" rel="external noopener" href="https://fatcat.wiki/release/lme2ghsshje3tag2m5q3xgvcna">fatcat:lme2ghsshje3tag2m5q3xgvcna</a> </span>
<a target="_blank" rel="noopener" href="https://web.archive.org/web/20210722112128/https://arxiv.org/pdf/2012.07214v2.pdf" title="fulltext PDF download" data-goatcounter-click="serp-fulltext" data-goatcounter-title="serp-fulltext"> <button class="ui simple right pointing dropdown compact black labeled icon button serp-button"> <i class="icon ia-icon"></i> Web Archive [PDF] <div class="menu fulltext-thumbnail"> <img src="https://blobs.fatcat.wiki/thumbnail/pdf/21/c0/21c04d5cb3e486a19c95ba68f9e8787aa06ce695.180px.jpg" alt="fulltext thumbnail" loading="lazy"> </div> </button> </a> <a target="_blank" rel="external noopener" href="https://arxiv.org/abs/2012.07214v2" title="arxiv.org access"> <button class="ui compact blue labeled icon button serp-button"> <i class="file alternate outline icon"></i> arxiv.org </button> </a>

Distributed Data Streams [chapter]

Minos Garofalakis
<span title="">2016</span> <i title="Springer New York"> Encyclopedia of Database Systems </i> &nbsp;
To summarize, the focus is on techniques for processing queries over collections of remote data streams.  ...  Such techniques have to work in a distributed setting (i.e., over a communication network), support one-shot or continuous query answers, and be space, time, and communication efficient.  ...  Combined with intelligent sketching techniques and methods for bounding the overall query error, such approaches can be used to track a large class of complex, holistic queries, only requiring concise  ... 
<span class="external-identifiers"> <a target="_blank" rel="external noopener noreferrer" href="https://doi.org/10.1007/978-1-4899-7993-3_137-2">doi:10.1007/978-1-4899-7993-3_137-2</a> <a target="_blank" rel="external noopener" href="https://fatcat.wiki/release/fnr32foqjbb7hlhko5ob7532g4">fatcat:fnr32foqjbb7hlhko5ob7532g4</a> </span>
<a target="_blank" rel="noopener" href="https://web.archive.org/web/20170829041523/http://www.dblab.ntua.gr/~gtsat/collection/data%20streams/eds09dstreams.pdf" title="fulltext PDF download" data-goatcounter-click="serp-fulltext" data-goatcounter-title="serp-fulltext"> <button class="ui simple right pointing dropdown compact black labeled icon button serp-button"> <i class="icon ia-icon"></i> Web Archive [PDF] <div class="menu fulltext-thumbnail"> <img src="https://blobs.fatcat.wiki/thumbnail/pdf/c0/68/c068c359a52ca0e360091c5bbf88302abce746c8.180px.jpg" alt="fulltext thumbnail" loading="lazy"> </div> </button> </a> <a target="_blank" rel="external noopener noreferrer" href="https://doi.org/10.1007/978-1-4899-7993-3_137-2"> <button class="ui left aligned compact blue labeled icon button serp-button"> <i class="external alternate icon"></i> springer.com </button> </a>

Stream Clustering using Probabilistic Data Structures [article]

Andrei Sorin Sabau
<span title="2016-12-08">2016</span> <i > arXiv </i> &nbsp; <span class="release-stage" >pre-print</span>
A count-min sketch using a damped window model estimates stream density. Bloom filters employing a variation of active-active buffering estimate cluster membership.  ...  This paper proposes a novel alternative to the traditional two phase stream clustering scheme, introducing sketch-based data structures for assessing both stream density and cluster membership with probabilistic  ...  Sketch based summarization techniques Sketch based techniques employ noncryptographic hash functions for approximating frequency counts over the data stream.  ... 
<span class="external-identifiers"> <a target="_blank" rel="external noopener" href="https://arxiv.org/abs/1612.02701v1">arXiv:1612.02701v1</a> <a target="_blank" rel="external noopener" href="https://fatcat.wiki/release/vkrjbcx5dzhu7jpvp34hpkcz3m">fatcat:vkrjbcx5dzhu7jpvp34hpkcz3m</a> </span>
<a target="_blank" rel="noopener" href="https://web.archive.org/web/20200826020222/https://arxiv.org/ftp/arxiv/papers/1612/1612.02701.pdf" title="fulltext PDF download" data-goatcounter-click="serp-fulltext" data-goatcounter-title="serp-fulltext"> <button class="ui simple right pointing dropdown compact black labeled icon button serp-button"> <i class="icon ia-icon"></i> Web Archive [PDF] <div class="menu fulltext-thumbnail"> <img src="https://blobs.fatcat.wiki/thumbnail/pdf/85/82/8582eda990f5467203018a6d7ede5d36fb42ea8a.180px.jpg" alt="fulltext thumbnail" loading="lazy"> </div> </button> </a> <a target="_blank" rel="external noopener" href="https://arxiv.org/abs/1612.02701v1" title="arxiv.org access"> <button class="ui compact blue labeled icon button serp-button"> <i class="file alternate outline icon"></i> arxiv.org </button> </a>

Approximate Data Mining Using Sketches for Massive Data

Parul Gupta, Swati Agnihotri, Suman Saha
<span title="">2013</span> <i title="Elsevier BV"> <a target="_blank" rel="noopener" href="https://fatcat.wiki/container/kcayf4mk7zbrbhvdwizgrxaruu" style="color: black;">Procedia Technology - Elsevier</a> </i> &nbsp;
Random Sketch is used to reduce the dimensions of the dataset.  ...  With the popularity of the Web and Internet, massive data is generated.However, this enormous datasets present the challenge to apply data mining techniques in order to extract useful information.  ...  Step 2: Generate a random vector equal to number of attributes in the input file(Sketch matrix).  ... 
<span class="external-identifiers"> <a target="_blank" rel="external noopener noreferrer" href="https://doi.org/10.1016/j.protcy.2013.12.422">doi:10.1016/j.protcy.2013.12.422</a> <a target="_blank" rel="external noopener" href="https://fatcat.wiki/release/uzlxbsewpvf3famrs2tnrkqwyq">fatcat:uzlxbsewpvf3famrs2tnrkqwyq</a> </span>
<a target="_blank" rel="noopener" href="https://web.archive.org/web/20190307173717/https://core.ac.uk/download/pdf/82534611.pdf" title="fulltext PDF download" data-goatcounter-click="serp-fulltext" data-goatcounter-title="serp-fulltext"> <button class="ui simple right pointing dropdown compact black labeled icon button serp-button"> <i class="icon ia-icon"></i> Web Archive [PDF] <div class="menu fulltext-thumbnail"> <img src="https://blobs.fatcat.wiki/thumbnail/pdf/2f/97/2f979ef6bff5c1fbd90091b894dbb02fdb4ac5cf.180px.jpg" alt="fulltext thumbnail" loading="lazy"> </div> </button> </a> <a target="_blank" rel="external noopener noreferrer" href="https://doi.org/10.1016/j.protcy.2013.12.422"> <button class="ui left aligned compact blue labeled icon button serp-button"> <i class="unlock alternate icon" style="background-color: #fb971f;"></i> elsevier.com </button> </a>

Holistic UDAFs at streaming speeds

Graham Cormode, Theodore Johnson, Flip Korn, S. Muthukrishnan, Oliver Spatscheck, Divesh Srivastava
<span title="">2004</span> <i title="ACM Press"> <a target="_blank" rel="noopener" href="https://fatcat.wiki/container/vxrc3vebzzachiwy3nopwi3h5u" style="color: black;">Proceedings of the 2004 ACM SIGMOD international conference on Management of data - SIGMOD &#39;04</a> </i> &nbsp;
In this paper, we study the performance implications of using user-defined aggregate functions (UDAFs) to incorporate selectionbased and sketch-based algorithms for holistic aggregates into a data stream  ...  However, little work has been done to explore what techniques are required to incorporate these algorithms in a data stream query processor, and to make them useful in practice.  ...  To find items with the highest counts, we keep 6 sketches P w $ 4 l ) 4 ) v 6 $ of items at 6 different levels of granularity (e.g., a sketch for ¢ s, sketch for b ¢ © Ad f , for b ¢ © W« Af etc).  ... 
<span class="external-identifiers"> <a target="_blank" rel="external noopener noreferrer" href="https://doi.org/10.1145/1007568.1007575">doi:10.1145/1007568.1007575</a> <a target="_blank" rel="external noopener" href="https://dblp.org/rec/conf/sigmod/JohnsonCKMSS04.html">dblp:conf/sigmod/JohnsonCKMSS04</a> <a target="_blank" rel="external noopener" href="https://fatcat.wiki/release/v7nx5yqiurc3znqliz4mnnhnoe">fatcat:v7nx5yqiurc3znqliz4mnnhnoe</a> </span>
<a target="_blank" rel="noopener" href="https://web.archive.org/web/20170810165415/http://dimacs.rutgers.edu/~graham/pubs/papers/cjkmss-streamudaf.pdf" title="fulltext PDF download" data-goatcounter-click="serp-fulltext" data-goatcounter-title="serp-fulltext"> <button class="ui simple right pointing dropdown compact black labeled icon button serp-button"> <i class="icon ia-icon"></i> Web Archive [PDF] <div class="menu fulltext-thumbnail"> <img src="https://blobs.fatcat.wiki/thumbnail/pdf/02/3d/023db6fa5c87852885dcee9531a1c843063c2347.180px.jpg" alt="fulltext thumbnail" loading="lazy"> </div> </button> </a> <a target="_blank" rel="external noopener noreferrer" href="https://doi.org/10.1145/1007568.1007575"> <button class="ui left aligned compact blue labeled icon button serp-button"> <i class="external alternate icon"></i> acm.org </button> </a>

Persistent Data Sketching

Zhewei Wei, Ge Luo, Ke Yi, Xiaoyong Du, Ji-Rong Wen
<span title="">2015</span> <i title="ACM Press"> <a target="_blank" rel="noopener" href="https://fatcat.wiki/container/vxrc3vebzzachiwy3nopwi3h5u" style="color: black;">Proceedings of the 2015 ACM SIGMOD International Conference on Management of Data - SIGMOD &#39;15</a> </i> &nbsp;
In many of today's big data applications, in particular for high-speed streaming data, the volume and velocity of the data are so high that we cannot afford to store everything.  ...  All streaming algorithms work by maintaining a small data structure in memory, which is usually called a sketch, summary, or synopsis.  ...  To generalize our technique, we will prove the following space bound for the persistent Count-Min Sketch under the random turnstile model: THEOREM 3.3.  ... 
<span class="external-identifiers"> <a target="_blank" rel="external noopener noreferrer" href="https://doi.org/10.1145/2723372.2749443">doi:10.1145/2723372.2749443</a> <a target="_blank" rel="external noopener" href="https://dblp.org/rec/conf/sigmod/WeiLYDW15.html">dblp:conf/sigmod/WeiLYDW15</a> <a target="_blank" rel="external noopener" href="https://fatcat.wiki/release/eyf6bkbajrgyhkt3nxt6woc7wq">fatcat:eyf6bkbajrgyhkt3nxt6woc7wq</a> </span>
<a target="_blank" rel="noopener" href="https://web.archive.org/web/20170829212431/http://www.cse.ust.hk/~yike/sigmod15.pdf" title="fulltext PDF download" data-goatcounter-click="serp-fulltext" data-goatcounter-title="serp-fulltext"> <button class="ui simple right pointing dropdown compact black labeled icon button serp-button"> <i class="icon ia-icon"></i> Web Archive [PDF] <div class="menu fulltext-thumbnail"> <img src="https://blobs.fatcat.wiki/thumbnail/pdf/8a/bc/8abc855e181c4d0ecad14db87c88166c64f8688d.180px.jpg" alt="fulltext thumbnail" loading="lazy"> </div> </button> </a> <a target="_blank" rel="external noopener noreferrer" href="https://doi.org/10.1145/2723372.2749443"> <button class="ui left aligned compact blue labeled icon button serp-button"> <i class="external alternate icon"></i> acm.org </button> </a>
&laquo; Previous Showing results 1 &mdash; 15 out of 60,824 results