Content-Aware DataGuides: Interleaving IR and DB Indexing Techniques for Efficient Retrieval of Textual XML Data [chapter]

Felix Weigel, Holger Meuss, François Bry, Klaus U. Schulz
2004 Lecture Notes in Computer Science  
XML is well-suited for modelling structured data with textual content.  ...  To this end, the Content-Aware DataGuide (CADG) enhances the well-known DataGuide with (1) simultaneous keyword and path matching and (2) a precomputed content/structure join.  ...  The authors thank Tim Furche for providing the query generator used in the experiments.  ... 
doi:10.1007/978-3-540-24752-4_28 fatcat:u2lnf5kl5bhavlcqwdxn34x6t4

Visual exploration and retrieval of XML document collections with the generic system X2

Holger Meuss, Klaus U. Schulz, Felix Weigel, Simone Leonardi, François Bry
2005 International Journal on Digital Libraries  
Another salient characteristic of X2 which distinguishes it from other visual query systems for XML is that it supports various degrees of detailedness in the presentation of answers, as well as techniques  ...  bridging the gap between these three views on the data to be retrieved.  ...  We also thank the anonymous referees for their detailed and helpful comments on a preliminary version of this paper.  ... 
doi:10.1007/s00799-004-0109-5 fatcat:k7ivb6mdzbeizedhcj26lhafuu

TopX: efficient and versatile top-k query processing for semistructured data

Martin Theobald, Holger Bast, Debapriyo Majumdar, Ralf Schenkel, Gerhard Weikum
2007 The VLDB Journal  
Keywords: Efficient XML full-text search · Content- and structure-aware ranking · Top-k query processing · Cost-based index access scheduling · Probabilistic candidate pruning · Dynamic query expansion ·  ...  TopX is a top-k retrieval engine for text and semistructured data.  ...  We believe that the integration of DB and IR functionalities and system architectures will remain a strategically important and rewarding research field, and we hope to make further contributions to this  ... 
doi:10.1007/s00778-007-0072-z fatcat:pwqyugrna5cypncgn52jofsnem

TopX : efficient and versatile top-k query processing for text, structured, and semistructured data [article]

Martin Theobald, Universität des Saarlandes
pursue an IR-style "andish" ranked retrieval for XML data.  ...  XRANK Among the most prominent IR approaches for ranked retrieval of XML data is XRANK [GSBS03].  ...  We claim that (A) a packing for this instance of KNAPSACK has capacity ≤ C and utility ≥ U if and only if (B) the corresponding SAS instance has a scan of total depth C and score decrease of ≥ U.  ... 
doi:10.22028/d291-23776 fatcat:wsdfpjnpsjgbjn4x437dn53gdm