Maguro, a system for indexing and searching over very large text collections

Knut Magne Risvik, Trishul Chilimbi, Henry Tan, Karthik Kalyanaraman, Chris Anderson
2013 Proceedings of the sixth ACM international conference on Web search and data mining - WSDM '13  
Maguro is a system for efficiently searching very large collections of text content of up to 1 trillion documents at low cost. Search engines span content ranging from the very dynamic and heavily metadata-augmented to the tail content of the web. A long-tail distribution of content calls for different trade-offs in the design space to achieve good efficiency across the entire index range. Maguro is designed for the long tail of content with less dynamics and less metadata, but very good cost
... ncy. Maguro is part of the serving stack in Bing and allows us to scale the index significantly better.
doi:10.1145/2433396.2433486 dblp:conf/wsdm/RisvikCTKA13 fatcat:d2uz2xu7hvetlo4mjwdcsq63i4