Filters








7,444 Hits in 4.5 sec

Compiler support for efficient processing of XML datasets

Xiaogang Li, Renato Ferreira, Gagan Agrawal
2003 Proceedings of the 17th annual international conference on Supercomputing - ICS '03  
In this paper, we report our experiences in compiling a recently developed XML Query Language, XQuery for applications that process scientific datasets.  ...  By implementing the techniques in a compiler and generating code for a runtime system called Active Data Repository (ADR), we are able to achieve efficient processing of disk-resident datasets and parallelization  ...  In addition, we will like to thank the anonymous reviewers and our shepherd, Robert A. van Engelen, for their constructive comments.  ... 
doi:10.1145/782814.782823 dblp:conf/ics/LiFA03 fatcat:jnrw44rmabgcbjfgwpxsa7f4ci

Supporting XML Based High-Level Abstractions on HDF5 Datasets: A Case Study in Automatic Data Virtualization [chapter]

Swarup Kumar Sahoo, Gagan Agrawal
2005 Lecture Notes in Computer Science  
We describe how our compiler can generate efficient code to access and process HDF5 datasets using the above information.  ...  The goal is to enable automatic creation of efficient data services to support a high-level or virtual view of the data.  ...  Our compiler generates C code which uses HDF5 library for efficient processing of large disk-resident datasets.  ... 
doi:10.1007/11532378_22 fatcat:hp5ngp3onfewrnhnhcvpfnmoyi

Supporting High-Level Abstractions through XML Technology [chapter]

Xiaogang Li, Gagan Agrawal
2004 Lecture Notes in Computer Science  
In this paper, we describe the use of XML technologies for supporting high-level programming methodologies for processing scientific datasets.  ...  We show how XML Schemas can be used to give a high-level abstraction of a dataset to an application developer.  ...  In this paper, we describe the use of XML technologies for supporting high-level programming methodologies for processing scientific datasets.  ... 
doi:10.1007/978-3-540-24644-2_9 fatcat:ayqbayhctraqxplybcuhxuzgtq

High Level Programming Methodologies for Data Intensive Computations [chapter]

Gagan Agrawal, Ruoming Jin, Renato Ferreira, Joel Saltz
2000 Lecture Notes in Computer Science  
Summary In this paper, we have addressed the problem of expressing data intensive computations in a high-level languages and then compiling such codes to efficiently manage data storage, retrieval and  ...  We have developed data-parallel extensions to Java for expressing this important class of applications.  ...  The objective of loop planning is to determine a schedule to efficiently process a loop based on the amount of available resources in the parallel machine.  ... 
doi:10.1007/3-540-40889-4_3 fatcat:x5dzgy63kzcsjlphnrj7xti5ey

Data-centric transformations on non-integer iteration spaces

S.K. Sahoo, G. Agrawal
2005 14th International Conference on Parallel Architectures and Compilation Techniques (PACT'05)  
Our algorithms have been implemented as part of a compiler for the XML query language XQuery, which supports processing over virtual XML. Our system also parallelizes the processing.  ...  Then, we show how in many cases, we can exploit the repetitive nature of the dataset to avoid the overhead associated with such a table.  ...  Our algorithms have been implemented as part of a compiler for the XML query language XQuery, which supports processing over virtual XML. Our system also parallelizes the processing.  ... 
doi:10.1109/pact.2005.15 dblp:conf/IEEEpact/SahooA05 fatcat:knqqwgiw3jh7xluuxdfeciy4pu

A Run-time System for Efficient Execution of Scientific Workflows on Distributed Environments

George Teodoro, Tulio Tavares, Renato Ferreira, Tahsin Kurc, Wagner Meira, Dorgival Guedes, Tony Pan, Joel Saltz
2008 International journal of parallel programming  
Our system is optimized for data-intensive workflows, in which efficient management and retrieval of data, coordination of data processing and data movement, and checkpointing of intermediate results are  ...  Scientific workflow systems have been introduced in response to the demand of researchers from several domains of science who need to process and analyze increasingly larger datasets.  ...  Challenges in designing and implementing support for efficient data analysis are many, mainly due to characteristics of scientific applications that generate and reference very large datasets.  ... 
doi:10.1007/s10766-007-0068-8 pmid:22582009 pmcid:PMC3348585 fatcat:cmkw7szgibd4loss4sbmsdbtpm

PXML-Miner: A Projection-Based Interesting XML Rule Mining Technique

D Sasikala, K Premalatha
2014 Data Science Journal  
Experiments are conducted using XML databases available in the XML data repository. The results illustrate that the proposed method is efficient in terms of computation time and memory usage.  ...  The node scheme modified with UID encoding speeds up the mining process. A significance measure is used to identify the important rules found in the XML database.  ...  of space used for processing the datasets under a given support value.  ... 
doi:10.2481/dsj.13-017 fatcat:evekemcpmvcbzpxltguiievswy

XML Goes Native: Run-Time Representations for Xtatic [chapter]

Vladimir Gapeyev, Michael Y. Levin, Benjamin C. Pierce, Alan Schmitt
2005 Lecture Notes in Computer Science  
Xtatic is a lightweight extension of C offering native support for statically typed XML processing.  ...  In particular, it must provide representations for XML tags, trees, and textual data that use memory efficiently, support efficient pattern matching, allow maximal sharing of common substructures, and  ...  We also thank Haruo Hosoya, Alain Frisch, Christian Kirkegaard, and Xavier Franc for discussing various aspects of this work.  ... 
doi:10.1007/978-3-540-31985-6_4 fatcat:iegjshirirhk5m4mb3mhmyxr6e

GEOSPATIAL DATA EXCHANGE USING BINARY DATA SERIALIZATION APPROACHES

P. Mooney, M. Minghini
2022 The International Archives of the Photogrammetry, Remote Sensing and Spatial Information Sciences  
inefficient for an increasingly higher number of applications due to their inflated volumes of data, low speed and the high computational cost for parsing and processing.  ...  Using a number of experiments, we illustrate the advantages and disadvantages of both approaches for common workflows that make use of geospatial data encodings such as GeoPackage and GeoJSON.  ...  ACKNOWLEDGEMENTS The authors acknowledge the support of the European Commission -Joint Research Centre (JRC) through contract number CT-EX2014D166355-104 entitled "Evaluation of Novel approaches for governing  ... 
doi:10.5194/isprs-archives-xlviii-4-w1-2022-307-2022 fatcat:bspmbsjsmzgijjrxigedkmlfie

Unleashing XQuery for Data-Independent Programming

Sebastian Bächle, Caetano Sauer
2014 Datenbank-Spektrum  
The remaining aspects of the language are processed in a standard fashion, yet can be compiled anytime to more efficient native operations of the actual runtime environment.  ...  Apart from standard XML processing, however, advanced language features make it hard to efficiently implement the complete language for large data volumes.  ...  We took the dataset of the relational decision support benchmark TPC-H and the SQL queries Q2 and Q6 of the benchmark.  ... 
doi:10.1007/s13222-014-0160-3 fatcat:fovgm2e7zfdebjvh4euqapaoxy

Mapping Physical Formats to Logical Models to Extract Data and Metadata: The Defuddle Parsing Engine [chapter]

Tara D. Talbott, Karen L. Schuchardt, Eric G. Stephan, James D. Myers
2006 Lecture Notes in Computer Science  
In this paper, we describe a complementary approach -a general capability for articulating the format of arbitrary textual and binary data using a logical data model, expressed in XML-Schema, which can  ...  Scientists, motivated by the desire for systems-level understanding of phenomena, increasingly need to share their results across multiple disciplines.  ...  UPA provides an accessible user interface and batch processing capabilities for handling large datasets [12] .  ... 
doi:10.1007/11890850_9 fatcat:uddrdxg6tnfebjvptk4yg5vuqq

XML Compaction Improvements Based on Binary String Encodings

Ramez Alkhatib
2017 International Journal of Database Management Systems  
Therefore, it is important to provide a repository for XML documents, which supports efficient management and storage of XML data.  ...  space requirements and to improve the efficiency of XML query processing using labeling schemes.  ...  While this technique has the potential for compact storage, it also supports efficient querying and update processing of the compacted XML documents by taking advantage of the ORDPATH labeling scheme.  ... 
doi:10.5121/ijdms.2017.9201 fatcat:ywjxubs335c7zeh4kqvl4siyii

Boosting XML Filtering with a Scalable FPGA-based Architecture [article]

Abhishek Mitra, Petko Bakalov
2009 arXiv   pre-print
By introducing stacks within the FPGA we are able to express and process a wide range of path queries very efficiently, on a scalable environment.  ...  The input in such systems typically consists of a stream of XML documents and a set of user subscriptions expressed as XML queries.  ...  Vieira's work has been funded by a CAPES (Brazilian Federal Agency for Post-Graduate Education)/Fulbright Ph.D. fellowship.  ... 
arXiv:0909.1781v1 fatcat:ux2il75jbnfkdd2gtxc6m47rcy

Complex pattern matching in complex structures: The XSeq approach

Kai Zeng, Mohan Yang, B. Mozafari, C. Zaniolo
2013 2013 IEEE 29th International Conference on Data Engineering (ICDE)  
There is much current interest in applications of complex event processing over data streams and of complex pattern matching over stored sequences.  ...  While some applications use streams of flat records, XML and various semi-structured information formats are preferred by many others-in particular, applications that deal with domain science, social networks  ...  However, experience with processing XML streams has revealed that they present several limitations, both in terms of expressive power and amenability to efficient implementation, which can be effectively  ... 
doi:10.1109/icde.2013.6544936 dblp:conf/icde/ZengYMZ13 fatcat:u2avgztxhncvbpu5akla6fagc4

High-performance complex event processing over XML streams

Barzan Mozafari, Kai Zeng, Carlo Zaniolo
2012 Proceedings of the 2012 international conference on Management of Data - SIGMOD '12  
Second, it allows for efficient stream processing algorithms.  ...  XSeq extends XPath with powerful constructs that support (i) the specification of and search for complex sequential patterns over XML, and (ii) efficient implementation using the Kleene-* optimization  ...  We have described the design and implementation of XSeq, a query language for XML streams that adds powerful extensions to XPath while remaining very amenable to optimization and efficient implementation  ... 
doi:10.1145/2213836.2213866 dblp:conf/sigmod/MozafariZZ12 fatcat:jxfthfm3srdrtkczao7cdsbxia
« Previous Showing results 1 — 15 out of 7,444 results