
Three-Level Caching for Efficient Query Processing in Large Web Search Engines

Xiaohui Long, Torsten Suel
2006 World wide web (Bussum)  
Large web search engines have to answer thousands of queries per second with interactive response times.  ...  Our experimental evaluation based on a large web crawl and real search engine query log shows significant performance gains for the best schemes, both in isolation and in combination with the other caching  ...  CONCLUDING REMARKS In this paper, we have proposed a new three-level caching architecture for web search engines that can improve query throughput.  ... 
doi:10.1007/s11280-006-0221-0 fatcat:tmer4oaw7zfwlcvqojhtfk5zjm

Three-level caching for efficient query processing in large Web search engines

Xiaohui Long, Torsten Suel
2005 Proceedings of the 14th international conference on World Wide Web - WWW '05  
Large web search engines have to answer thousands of queries per second with interactive response times.  ...  Our experimental evaluation based on a large web crawl and real search engine query log shows significant performance gains for the best schemes, both in isolation and in combination with the other caching  ...  CONCLUDING REMARKS In this paper, we have proposed a new three-level caching architecture for web search engines that can improve query throughput.  ... 
doi:10.1145/1060745.1060785 dblp:conf/www/LongS05 fatcat:xbodm26ml5fk5lsrqzhxe7wmti

Modeling Static Caching in Web Search Engines [chapter]

Ricardo Baeza-Yates, Simon Jonassen
2012 Lecture Notes in Computer Science  
In this paper we model a two-level cache of a Web search engine, such that given memory resources, we find the optimal split fraction to allocate for each cache, results and index.  ...  The final result is very simple and implies to compute just five parameters that depend on the input data and the performance of the search engine.  ...  Baeza-Yates and Saint-Jean propose a three level index organization for Web search engines [5] , similar to the one used in current architectures.  ... 
doi:10.1007/978-3-642-28997-2_37 fatcat:7oen45tyhjh35o2g2fg4t46pfq

An Intersection Cache Based on Frequent Itemset Mining in Large Scale Search Engines

Wanwan Zhou, Ruixuan Li, Xinhua Dong, Zhiyong Xu, Weijun Xiao
2015 2015 Third IEEE Workshop on Hot Topics in Web Systems and Technologies (HotWeb)  
In this paper, we analyze the characteristics of query term intersections in typical search engines, and present a novel three-level cache architecture, called TLMCA, which combines the intersection cache  ...  Caching is an effective optimization in large scale web search engines, which is to reduce the underlying I/O burden of storage systems as far as possible by leveraging cache localities.  ...  Section III analyzes the characteristics of dataset and query log in large scale search engines. Section IV describes the system design of TLMCA, the three-level cache architecture.  ... 
doi:10.1109/hotweb.2015.17 dblp:conf/hotweb/ZhouLDXX15 fatcat:j7tgqlqflfbfjmshlpt53dfjsy

A comprehensive framework for the semantic cache systems

Mohammad Ahmed Alghobiri, Hikmat Ullah Khan, Tahir Afzal, Saqib Iqbal
2016 International Journal of Advanced and Applied Sciences  
Semantic Cache deals in both semantic descriptions as well as the results of previous queries providing improved efficiency.  ...  The proposed framework, described at the detailed level, may present guidance for future works on semantic cache systems.  ...  Abdul Qadir, an eminent professor and expert in the field of semantic web and semantic cache, is Head of the Centre for Distributed and Semantic Computing and Dean, Faculty of Computing, Capital University  ... 
doi:10.21833/ijaas.2016.10.012 fatcat:v6sl6jbolneufeo4m5rwbvsiyi

Scalability and Efficiency Challenges in Large-Scale Web Search Engines

B. Barla Cambazoglu, Ricardo Baeza-Yates
2015 Proceedings of the Eighth ACM International Conference on Web Search and Data Mining - WSDM '15  
In particular, the tutorial provides an in-depth architectural overview of a web search engine, mainly focusing on the web crawling, indexing, and query processing components.  ...  In this tutorial, we focus on the last two goals, providing a fairly comprehensive overview of the scalability and efficiency challenges in large-scale web search engines.  ...  In addition to the state-of-the-art ranking techniques employed in web search engines, our tutorial covers a variety of problems including query processing on multi-core architectures, early exit optimizations  ... 
doi:10.1145/2684822.2697039 dblp:conf/wsdm/CambazogluB15 fatcat:kxoaikbpu5cbvbpgvljm54h2sy

Scalability and efficiency challenges in large-scale web search engines

Ricardo Baeza-Yates, B. Barla Cambazoglu
2014 Proceedings of the 23rd International Conference on World Wide Web - WWW '14 Companion  
In particular, the tutorial provides an in-depth architectural overview of a web search engine, mainly focusing on the web crawling, indexing, and query processing components.  ...  In this tutorial, we focus on the last two goals, providing a fairly comprehensive overview of the scalability and efficiency challenges in large-scale web search engines.  ...  In addition to the state-of-the-art ranking techniques employed in web search engines, our tutorial covers a variety of problems including query processing on multi-core architectures, early exit optimizations  ... 
doi:10.1145/2567948.2577271 dblp:conf/www/Baeza-YatesC14 fatcat:ygasqlbobjbrzipvtd2kj4babq

Scalability and efficiency challenges in large-scale web search engines

B. Barla Cambazoglu, Ricardo Baeza-Yates
2014 Proceedings of the 37th international ACM SIGIR conference on Research & development in information retrieval - SIGIR '14  
In particular, the tutorial provides an in-depth architectural overview of a web search engine, mainly focusing on the web crawling, indexing, and query processing components.  ...  In this tutorial, we focus on the last two goals, providing a fairly comprehensive overview of the scalability and efficiency challenges in large-scale web search engines.  ...  In addition to the state-of-the-art ranking techniques employed in web search engines, our tutorial covers a variety of problems including query processing on multi-core architectures, early exit optimizations  ... 
doi:10.1145/2600428.2602291 dblp:conf/sigir/CambazogluB14 fatcat:ldl4l4nas5hcvcclhvdq5755zu

Scalability and Efficiency Challenges in Large-Scale Web Search Engines

B. Barla Cambazoglu, Ricardo Baeza-Yates
2016 Proceedings of the 39th International ACM SIGIR conference on Research and Development in Information Retrieval - SIGIR '16  
In particular, the tutorial provides an in-depth architectural overview of a web search engine, mainly focusing on the web crawling, indexing, and query processing components.  ...  In this tutorial, we focus on the last two goals, providing a fairly comprehensive overview of the scalability and efficiency challenges in large-scale web search engines.  ...  In addition to the state-of-the-art ranking techniques employed in web search engines, our tutorial covers a variety of problems including query processing on multi-core architectures, early exit optimizations  ... 
doi:10.1145/2911451.2914808 dblp:conf/sigir/CambazogluB16 fatcat:4xs6z35h2jhcbf5pratjl5gbeu

A five-level static cache architecture for web search engines

Rifat Ozcan, I. Sengor Altingovde, B. Barla Cambazoglu, Flavio P. Junqueira, Özgür Ulusoy
2012 Information Processing & Management  
Caching is a crucial performance component of large-scale web search engines, as it greatly helps reducing average query response times and query processing workloads on backend search clusters.  ...  In this paper, we describe a multi-level static cache architecture that stores five different item types: query results, precomputed scores, posting lists, precomputed intersections of posting lists, and  ...  Query processing overview Web search engines are composed of multiple replicas of large search clusters.  ... 
doi:10.1016/j.ipm.2010.12.007 fatcat:ozvshcjalzbq7jxxnkydw3fthu

Efficient query processing in distributed search engines

Simon Jonassen
2012 SIGIR Forum  
Third, we elaborate on caching in Web search engines in two independent contributions. First, we present an analytical model that finds the optimal split in a static memory-based two-level cache.  ...  Second, we present several strategies for selecting, ordering and scheduling prefetch queries and demonstrate that these can improve the efficiency and effectiveness of Web search engines.  ...  Rocha-Junior for the useful advices and comments on the paper. Acknowledgments. This work was done while the second author was an intern at Yahoo!  ... 
doi:10.1145/2492189.2492201 fatcat:uwasxhngrfgntemkhawyv3te64

Batch query processing for web search engines

Shuai Ding, Josh Attenberg, Ricardo Baeza-Yates, Torsten Suel
2011 Proceedings of the fourth ACM international conference on Web search and data mining - WSDM '11  
Large web search engines are now processing billions of queries per day. Most of these queries are interactive in nature, requiring a response in fractions of a second.  ...  Our conclusion is that significant cost reductions are possible by using specialized mechanisms for executing batch queries in Web search engines.  ...  It has been studied extensively in search engines on three different levels: Result caching [25, 22, 21, 30, 34] , which deals with the case where identical queries are issued repeatedly by keeping a  ... 
doi:10.1145/1935826.1935858 dblp:conf/wsdm/DingABS11 fatcat:3sjgnc5bwvdsjdelcqycp46k7u

A New Replacement Algorithm of Web Search Engine Cache based on User Behavior

Zhang Yong-Heng, Zhang Feng, You Fei
2014 Applied Mathematics & Information Sciences  
By analyzing the documents and user query logs of a real search engine based on Web caching, and through extensive statistical analysis of user behavior, we found that the search engine query  ...  The efficiency of the retrieval system is crucial for large-scale information retrieval systems.  ...  The locations of all cache levels are shown in Figure 1.  ... 
doi:10.12785/amis/080645 fatcat:4swkdjltqzcihcvyuhzlvv5spy

An Efficient SSD-based Hybrid Storage Architecture for Large-Scale Search Engines

Ruixuan Li, Chengzhou Li, Weijun Xiao, Hai Jin, Heng He, Xiwu Gu, Kunmei Wen, Zhiyong Xu
2012 2012 41st International Conference on Parallel Processing  
Large-scale search engines use hard disk drives (HDD) to store the mass index data for their capacity, whose performances are limited by the relatively low I/O performance of HDD.  ...  In this paper, we adopt a solid state disk (SSD) based storage architecture, which uses SSD as a secondary cache for memory.  ...  Large search engines need to process hundreds of queries per second on collections of millions of documents.  ... 
doi:10.1109/icpp.2012.17 dblp:conf/icpp/LiLXJHGWX12 fatcat:yqw4exfejbbhrakjnko725dkfi

The Anatomy of a Multi-domain Search Infrastructure [chapter]

Stefano Ceri, Alessandro Bozzon, Marco Brambilla
2011 Lecture Notes in Computer Science  
While searching the Web is the preferred method for accessing information in everyday practice, users expect that search systems will soon be capable of mastering complex queries.  ...  Current search engines do not support queries that require a complex combination of information.  ...  Conclusions This paper presented our vision for a novel class of search systems, advocating that a new generation of search infrastructures with a modular software organization is required for addressing  ... 
doi:10.1007/978-3-642-22233-7_1 fatcat:ap6mpev4yzdtrgj7iiawyfrwwu