Efficient query processing in distributed search engines

Simon Jonassen
2012 SIGIR Forum  
Our first approach combines the advantage of pipelined and traditional (non-pipelined) query processing.  ...  Subsequently, we present several skipping extensions to pipelined query processing, which as we show can improve the query processing performance and/or the quality of results.  ...  Rocha-Junior for the useful advices and comments on the paper. Acknowledgments. This work was done while the second author was an intern at Yahoo!  ... 
doi:10.1145/2492189.2492201 fatcat:uwasxhngrfgntemkhawyv3te64

Adaptive Query Processing

Amol Deshpande, Zachary Ives, Vijayshankar Raman
2007 Foundations and Trends in Databases  
We focus primarily on intra-query adaptivity of long-running, but not full-fledged streaming, queries. We conclude with a discussion of open research problems that are of high importance.  ...  adaptive query processing in general.  ... 
doi:10.1561/1900000001 fatcat:oqg667kvabfqfd5kvydxmkcej4

Parallel computing in information retrieval – an updated review

A. Macfarlane, S.E. Robertson, J.A. Mccann
1997 Journal of Documentation  
In particular we stress the importance of the motivation in using parallel computing for Text Retrieval.  ...  Reuters use this system for their Text Retrieval purposes. DapText has been implemented on both the 500 and 600 series of the DAP.  ...  ACKNOWLEDGEMENTS This research is supported by the Department for Education and Employment, grant number IS96/4197.  ... 
doi:10.1108/eum0000000007201 fatcat:2zuwtehixbd6xk33hwb3j43nse

Accessing very high dimensional spaces in parallel

F. J. Artigas-Fuentes, J. M. Badía
2016 Journal of Supercomputing  
A thorough experimental analysis on different datasets shows that our method can process efficiently large flows of queries, compete with other parallel algorithms and obtain at the same time very high  ...  Access methods are a fundamental tool in Information Retrieval.  ...  Another classifying criterion differentiates inter-query parallelism, where different queries are executed concurrently, from intra-query parallelism, where different parts of the same query are executed in parallel  ... 
doi:10.1007/s11227-016-1673-3 fatcat:syppxcky3ng75lr2q6z75axf2i

On Improving User Response Times in Tableau

Pawel Terlecki, Fei Xu, Marianne Shaw, Valeri Kim, Richard Wesley
2015 Proceedings of the 2015 ACM SIGMOD International Conference on Management of Data - SIGMOD '15  
Query processing usually dominates the cost of visualization generation.  ...  In this paper we discuss key data processing components in Tableau: the query processor, query caches, Tableau Data Engine [1, 2] and Data Server.  ...  Rendering a dashboard requires retrieving necessary data and post-processing it for visualization.  ... 
doi:10.1145/2723372.2742799 dblp:conf/sigmod/TerleckiXSKW15 fatcat:vd6xnnluavf6hlud5gullo2hgu

Web Query Processing Approaches A Survey and Comparison

M. Manikantan, S. Duraisamy
2014 International Journal of Computer Applications  
techniques in web query processing.  ...  We can classify the evolutionary development of web query processing from database query processing and SQL optimizations as Learning and Adaptive query processing, Web query through HTML and web search  ...  Information Retrieval is the activity of obtaining information resources through searches over metadata or on full text indexing.  ... 
doi:10.5120/14893-3362 fatcat:qlux6724hfgpnof2k43dqrc5gq

Approximate Similarity Search for Online Multimedia Services on Distributed CPU-GPU Platforms [article]

George Teodoro, Eduardo Valle, Nathan Mariano, Ricardo Torres, Wagner Meira Jr, Joel H. Saltz
2012 arXiv   pre-print
In order to address these challenges, we introduce hypercurves, a flexible framework for answering approximate k-nearest neighbor (kNN) queries for very large multimedia databases, aiming at online content-based  ...  Recently, there has been an increased interest in similarity search for online content-based multimedia services.  ...  Moreover, classical distributed algorithms tend to ignore the response time of processing each individual query, and to concentrate on providing maximum throughput for batches of queries.  ... 
arXiv:1209.0410v1 fatcat:xhcm6doprzeszgofjttuy3dvj4

Approximate similarity search for online multimedia services on distributed CPU–GPU platforms

George Teodoro, Eduardo Valle, Nathan Mariano, Ricardo Torres, Wagner Meira, Joel H. Saltz
2013 The VLDB journal  
In this work, we address these challenges with Hypercurves, a flexible framework for answering approximate k-nearest neighbor (kNN) queries for very large multimedia databases.  ...  Similarity search in high-dimensional spaces is a pivotal operation for several database applications, including online content-based multimedia services.  ...  Acknowledgements We would like to express our gratitude to the reviewers for their valuable comments, which helped us to improve our work both in terms of content and presentation. E. Valle  ... 
doi:10.1007/s00778-013-0329-7 fatcat:rpcxwmr5hvdwll73wvvnuxirke

Data partitioning enables the use of standard SOAP Web Services in genome-scale workflows

Pawel Sztromwasser, Pål Puntervoll, Kjell Petersen
2011 Journal of Integrative Bioinformatics  
We evaluated the data-partitioning strategy by comparing it with typical communication patterns on an example pipeline for genomic sequence annotation.  ...  Combining these resources is a common practice in bioinformatics, but integration of heterogeneous and often distributed tools and datasets can be challenging.  ...  Acknowledgements We would like to thank Inge Jonassen for his feedback regarding this work and the manuscript.  ... 
doi:10.2390/biecoll-jib-2011-163 pmid:21788681 fatcat:655h4a4rdrgxnfd2q7xinw6kmq

Data partitioning enables the use of standard SOAP Web Services in genome-scale workflows

Paweł Sztromwasser, Kjell Petersen, Pál Puntervoll
2011 Journal of Integrative Bioinformatics  
We evaluated the data-partitioning strategy by comparing it with typical communication patterns on an example pipeline for genomic sequence annotation.  ...  Combining these resources is a common practice in bioinformatics, but integration of heterogeneous and often distributed tools and datasets can be challenging.  ...  Acknowledgements We would like to thank Inge Jonassen for his feedback regarding this work and the manuscript.  ... 
doi:10.1515/jib-2011-163 fatcat:nz7lt3nl35fh5eh2if2jpkv7iq

Imagine This! Scripts to Compositions to Videos [article]

Tanmay Gupta, Dustin Schwenk, Ali Farhadi, Derek Hoiem, Aniruddha Kembhavi
2018 arXiv   pre-print
Our contributions include sequential training of components of CRAFT while jointly modeling layout and appearances, and losses that encourage learning compositional representations for retrieval.  ...  For a glimpse of videos generated by CRAFT, see https://youtu.be/688Vv86n0z8.  ...  Similar to Entity Retriever, the Background Retriever produces a query embedding for the desired scene from text and retrieves the closest background video from the target database.  ... 
arXiv:1804.03608v1 fatcat:bma2filzibaynmvhu2fwkzthhu

D4.1 REUSABLE MODEL & ANALYTICAL TOOLS: DESIGN AND OPEN SPECIFICATION 1

Ofer Biran, Oshrit Feder, Sandra Ebro, Alejandro Ramiro, María Ángeles Sanguino, Jorge Montero, Argyro Mavrogiorgou, Thanos Kiourtis, George Manias, Nikitas Sgouros, Kostas Nasias
2020 Zenodo  
Specification and design of the built-in analytics tools for Situational Knowledge, Opinion Mining & Sentiment Analysis, Social Dynamics & Behavioral Data analysis.  ...  Internal architecture of the Integrated Acquisition and Analytics Layer, responsible for the integration of analytical tools in extensible manner, registration of new data sources and applying the required  ...  The latter allows for inter and intra query and operation processing, thus making it possible for a query statement to be executed in a distributed manner.  ... 
doi:10.5281/zenodo.4081335 fatcat:ei5tghz6drgdnc4akwwrbl7ole

Lost but not forgotten: finding pages on the unarchived web

Hugo C. Huurdeman, Jaap Kamps, Thaer Samar, Arjen P. de Vries, Anat Ben-David, Richard A. Rogers
2015 International Journal on Digital Libraries  
on average, with host-level representations leading to further improvement of the retrieval effectiveness for websites.  ...  Second, the link and anchor text have a highly skewed distribution: popular pages such as home pages have more links pointing to them and more terms in the anchor text, but the richness tapers off quickly  ...  Open Access This article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution  ... 
doi:10.1007/s00799-015-0153-3 fatcat:f5yhxhrdxjduznbnxamlcjvacm

PolicyCLOUD D4.3 Reusable Model & Analytical Tools: Design and Open Specification 2

Ofer Biran, Oshrit Feder, Yosef Moatti, Pavlos Kranas, Javier Pereira, Alejandro Ramiro, María Ángeles Sanguino, Tomás Pariente, Argyro Mavrogiorgou, Thanos Kiourtis, George Manias, Nikitas Sgouros (+4 others)
2022 Zenodo  
In essence, the Data Acquisition and Analytics Layer provides an extensible framework for data source and analytic tools, controlling the full data pass from the data sources through filtering, transformation  ...  This document is the second architecture deliverable for WP4.  ...  The latter allows for inter and intra query and operation processing, thus making it possible for a query statement to be executed in a distributed manner.  ... 
doi:10.5281/zenodo.5971046 fatcat:peu3vjd7gve3tgl43zpnwhdkae

Parallelism in relational database management systems

C. Mohan, H. Pirahesh, W. G. Tang, Y. Wang
1994 IBM Systems Journal  
That is, we employ full DP and full PP and name this approach parallel asynchronous pipelines.  ...  DB2 will initiate multiple concurrent I/O requests for a single-user query and perform parallel I/O processing on multiple data partitions.  ... 
doi:10.1147/sj.332.0349 fatcat:wrpn7ytdqzd5lernxnnru4qnt4