Filters








26,987 Hits in 5.8 sec

Structural Selectivity Estimation for XML Documents

Damien K. Fisher, Sebastian Maneth
2007 2007 IEEE 23rd International Conference on Data Engineering  
A new synopsis for XML documents is introduced which can be effectively used to estimate the selectivity of complex path queries.  ...  In terms of XML databases, the problem of selectivity estimation of queries presents new challenges: many evaluation operators are possible, such as simple navigation, structural joins, or twig joins,  ...  Acknowledgment The authors gratefully thank Neoklis Polyzotis for providing us with an implementation of his TreeSketch estimation framework.  ... 
doi:10.1109/icde.2007.367908 dblp:conf/icde/FisherM07 fatcat:3j6vt7se5vebddf3qtd2mjuspa

Synopsis Data Structures for XML Databases: Models, Issues, and Research Perspectives

Angela Bonifati, Alfredo Cuzzocrea
2007 18th International Conference on Database and Expert Systems Applications (DEXA 2007)  
Specifically, these data structures are very useful for both selectivity estimation and approximate query answering purposes.  ...  databases while ensuring low computational overhead and high accuracy for many XML processing tasks.  ...  Synopsis data structures allow to achieve two main goals. The first goal is to enable selectivity estimation for XML queries (e.g., [1] ).  ... 
doi:10.1109/dexa.2007.100 dblp:conf/dexaw/BonifatiC07 fatcat:squr6j3spfepflniynyxprvnoq

Synopsis Data Structures for XML Databases: Models, Issues, and Research Perspectives

Angela Bonifati, Alfredo Cuzzocrea
2007 Database and Expert Systems Applications  
Specifically, these data structures are very useful for both selectivity estimation and approximate query answering purposes.  ...  databases while ensuring low computational overhead and high accuracy for many XML processing tasks.  ...  Synopsis data structures allow to achieve two main goals. The first goal is to enable selectivity estimation for XML queries (e.g., [1] ).  ... 
doi:10.1109/dexa.2007.4312849 fatcat:m67tg7g5zvarbn6ye3iolaz37a

Effective pruning for XML structural match queries

Yefei Xin, Zhen He, Jinli Cao
2010 Data & Knowledge Engineering  
Extensible Markup Language (XML) is becoming the de facto standard for exchanging information over the Internet, which results in the proliferation of XML documents.  ...  One of the main challenges is processing large collections of XML documents efficiently.  ...  They propose a selectivity estimation technique for XML documents which estimates selectivity for all XPath axes, gives a guaranteed range which the actual selectivity lies in and allows incremental updates  ... 
doi:10.1016/j.datak.2010.02.004 fatcat:kgw4tlioezeotl6tpt2smet34i

XML Information Retrieval [chapter]

Mounia Lalmas
2009 Encyclopedia of Library and Information Sciences, Third Edition  
Nowadays, increasingly, documents are marked-up using XML, the format standard for structured documents.  ...  In contrast to HTML, which is mainly layoutoriented, XML follows the fundamental concept of separating the logical structure of a document from its layout.  ...  Acknowledgments This article is based on two other articles on XML information retrieval co-written by the author, a book chapter on "Structured Text Retrieval" to appear in the second edition of [8]  ... 
doi:10.1081/e-elis3-120043691 fatcat:kjsqvk2s6bgtpjrca4aef5u2mi

XML Information Retrieval [chapter]

Mounia Lalmas
2011 Understanding Information Retrieval Systems  
Nowadays, increasingly, documents are marked-up using XML, the format standard for structured documents.  ...  In contrast to HTML, which is mainly layoutoriented, XML follows the fundamental concept of separating the logical structure of a document from its layout.  ...  Acknowledgments This article is based on two other articles on XML information retrieval co-written by the author, a book chapter on "Structured Text Retrieval" to appear in the second edition of [8]  ... 
doi:10.1201/b11499-29 fatcat:4y2gxbhponcxrbw633dmk5gx5e

On the use of query-driven XML auto-indexing

Karsten Schmidt, Theo Harder
2010 2010 IEEE 26th International Conference on Data Engineering Workshops (ICDEW 2010)  
) manual index selection-for rapid autonomic reactions and self-tuning options by the DBMS.  ...  Autonomous index management in native XML DBMSs has to address XML's flexibility and storage mapping features, which provide a rich set of indexing options.  ...  In addition to clustering and structure-specific compression, tailor-made XML storage mappings [11] provide a variety of options for XML-specific indexing.  ... 
doi:10.1109/icdew.2010.5452741 dblp:conf/icde/SchmidtH10 fatcat:uea375wvarbillrtnzpmtkhw3y

Cost-based optimization in DB2 XML

A. Balmin, T. Eliaz, J. Hornibrook, L. Lim, G. M. Lohman, D. Simmen, M. Wang, C. Zhang
2006 IBM Systems Journal  
DB2 XML augments DB2t UDB with a native XML store, XML indexes, and query processing capabilities for both XQuery and SQL/XML that are integrated with those of SQL.  ...  This paper presents the extensions made to the DB2 UDB compiler, and especially its costbased query optimizer, to support XQuery and SQL/XML queries, using much of the same infrastructure developed for  ...  ACKNOWLEDGMENT We would like to thank the entire DB2 XML team for their support of this work.  ... 
doi:10.1147/sj.452.0299 fatcat:ac232lw4z5bnjibsqfjua55xv4

StatiX

Juliana Freire, Jayant R. Haritsa, Maya Ramanath, Prasan Roy, Jérôme Siméon
2002 Proceedings of the 2002 ACM SIGMOD international conference on Management of data - SIGMOD '02  
StatiX leverages standard XML technology for gathering statistics, notably XML Schema validators, and it uses histograms to summarize both the structure and values in an XML document.  ...  Finally, the proposals involve either usage of specialized data structures, or expensive processes for system initialization, or costly maintenance for document updates.  ...  Acknowledgements We thank Zhiyuan Chen and Divesh Srivastava for generously providing the Twigs software and helping us with its installation and use.  ... 
doi:10.1145/564712.564713 fatcat:xq2eyolhz5h63ag2r5axk2tiui

Tree-Pattern Similarity Estimation for Scalable Content-based Routing

Raphael Chand, Pascal Felber, Minos Garofalakis
2007 2007 IEEE 23rd International Conference on Data Engineering  
In this paper, we propose a general framework and algorithmic tools for estimating different tree-pattern similarity metrics over continuous streams of XML documents.  ...  In a nutshell, our approach relies on continuously maintaining a novel, concise synopsis structure over the observed document stream that allows us to accurately estimate the fraction of documents satisfying  ...  We have generated sets of XML documents with IBM's XML Generator [13] tool, using a uniform distribution for selecting element tag names.  ... 
doi:10.1109/icde.2007.368960 dblp:conf/icde/ChandFG07 fatcat:2irrcncubbgstjdc273wk5vasu

A General Framework for Estimating XML Query Cardinality [chapter]

Carlo Sartiani
2004 Lecture Notes in Computer Science  
This paper presents a framework for estimating XML query cardinality.  ...  summaries for XML data.  ...  Acknowledgments The author would like to thank Dan Suciu for his help during the revision of the paper.  ... 
doi:10.1007/978-3-540-24607-7_16 fatcat:aph4vjl6xzantpljlypxuej3tm

Estimating the Selectivity of XML Path Expressions for Internet Scale Applications

Ashraf Aboulnaga, Alaa R. Alameldeen, Jeffrey F. Naughton
2001 Very Large Data Bases Conference  
Both techniques work by summarizing the structure of the XML data in a small amount of memory and using this summary for selectivity estimation.  ...  In this paper, we propose two techniques for estimating the selectivity of simple XML path expressions over complex large-scale XML data as would be handled by Internet-scale applications: path trees and  ...  Acknowledgements We thank Zhiyuan Chen and Divesh Srivastava for providing us with the code for pruned suffix trees and helping us with this code, and for providing us with a real XML data set that we  ... 
dblp:conf/vldb/AboulnagaAN01 fatcat:cac57iypmnajzcxxzmak6lw2my

Building XML statistics for the hidden web

Ashraf Aboulnaga, Jeffrey F. Naughton
2003 Proceedings of the twelfth international conference on Information and knowledge management - CIKM '03  
We describe an on-line statistics structure that stores such annotated path expressions and information about their selectivity for use in estimating the selectivity of future XPath queries.  ...  In this paper, we assume that queries to a hidden Web data source are XPath selections from a virtual XML document that represents all the data at this source.  ...  XPath is the standard path expression language for selecting parts of an XML document based on structure and content.  ... 
doi:10.1145/956863.956930 dblp:conf/cikm/AboulnagaN03 fatcat:gdw2iby4ijhzfe2ddvvh5qibju

Building XML statistics for the hidden web

Ashraf Aboulnaga, Jeffrey F. Naughton
2003 Proceedings of the twelfth international conference on Information and knowledge management - CIKM '03  
We describe an on-line statistics structure that stores such annotated path expressions and information about their selectivity for use in estimating the selectivity of future XPath queries.  ...  In this paper, we assume that queries to a hidden Web data source are XPath selections from a virtual XML document that represents all the data at this source.  ...  XPath is the standard path expression language for selecting parts of an XML document based on structure and content.  ... 
doi:10.1145/956927.956930 fatcat:3tcu4rlaovcmzdchmavil3jvlu

StatiX

Juliana Freire, Jayant R. Haritsa, Maya Ramanath, Prasan Roy, Jérôme Siméon
2002 Proceedings of the 2002 ACM SIGMOD international conference on Management of data - SIGMOD '02  
StatiX leverages standard XML technology for gathering statistics, notably XML Schema validators, and it uses histograms to summarize both the structure and values in an XML document.  ...  The availability of summary data for XML documents has many applications, from providing users with quick feedback about their queries, to cost-based storage design and query optimization.  ...  Acknowledgements We thank Zhiyuan Chen and Divesh Srivastava for generously providing the Twigs software and helping us with its installation and use.  ... 
doi:10.1145/564691.564713 dblp:conf/sigmod/FreireHRRS02 fatcat:i4zw3bgvlvan7gfbxfustfhtsa
« Previous Showing results 1 — 15 out of 26,987 results