Enhancing Content-And-Structure Information Retrieval using a Native XML Database
[article]
2005
arXiv
pre-print
The final results of our experiments show that when the XML retrieval task focusses on highly relevant elements our hybrid XML retrieval system with the Coherent Retrieval Elements module is 1.8 times ...
Three approaches to content-and-structure XML retrieval are analysed in this paper: first by using Zettair, a full-text information retrieval system; second by using eXist, a native XML database, and third ...
their support with using Zettair and to the anonymous reviewers for their useful suggestions. ...
arXiv:cs/0508017v1
fatcat:g77widajljc5nhkfhnizpap33q
Hybrid XML Retrieval: Combining Information Retrieval and a Native XML Database
2005
Information retrieval (Boston)
and a very effective XML retrieval. ...
This paper investigates the impact of three approaches to XML retrieval: using Zettair, a full-text information retrieval system; using eXist, a native XML database; and using a hybrid system that takes ...
Acknowledgements We thank Wolfgang Meier for his assistance with using eXist, and Nick Lester, Falk Scholer and other members of the Search Engine Group at RMIT for their support with using Zettair. ...
doi:10.1007/s10791-005-0748-1
fatcat:x4rwxiu3rndnrlozun2q6rcptm
Combining Indexing Schemes to Accelerate Querying XML on Content and Structure
2004
Twente Data Management Workshop
This talk focuses on retrieval models and methods for the document-centric view on XML and their evaluation within the INEX initiative. ...
INEX has studied two types of tasks so far: 1. content-only (CO) queries are standard IR queries, where the system should retrieve the most specific elements answering the query; ...
their support with using Zettair and to the anonymous reviewers for their useful suggestions. ...
dblp:conf/tdm/RamirezV04
fatcat:72gp6v5gcbgcfnyo2dguq6a5ey
TREC 14 Enterprise Track at CSIRO and ANU
2005
Text Retrieval Conference
For both tasks, we used the PADRE retrieval system [1], in which the Okapi BM25 relevance function was implemented. ...
We parsed the HTML pages in the original collection into an XML format (the DTD is shown in the appendix), and removed non-email pages. ...
According to PADRE's query language, this query would be transformed internally to: retrieve the email that contains one or more terms from "official introduction to 'Dan Connolly'" with connolly@hal.com ...
dblp:conf/trec/WuHT05
fatcat:gqlb3lnrk5cfth6p4riybfcdjy
Routing of XML and XPath Queries in Data Dissemination Networks
2008
2008 The 28th International Conference on Distributed Computing Systems
XML-based data dissemination networks are rapidly gaining momentum. ...
In these networks XML content is routed from data producers to data consumers throughout an overlay network of content-based routers. ...
Prior to receiving XML documents, consumers must have expressed interest in receiving XML documents by registering XPEs with the network. ...
doi:10.1109/icdcs.2008.31
dblp:conf/icdcs/LiHJ08
fatcat:r42gs7mn3nhz5nv4dwkbduqyh4
Decentralized Execution of Event-Driven Scientific Workflows
2006
2006 IEEE Services Computing Workshops
PADRES has been developed with features inspired by the requirements of SWF management. ...
, composite subscription processing support, a rule-based matching and routing mechanism, a query-based historic data access mechanism, and support for the decentralized execution of SWFs specified in XML ...
We would also like to thank the members of the PADRES team including Alex Cheung, Alex Wun, Eli Fidler, Pengcheng Wan, Shuang Hou, Gerald Chan, David Matheson, and Matt Medland. ...
doi:10.1109/scw.2006.10
dblp:conf/scw/LiMJM06
fatcat:u5bn3gxjobd7xn7y77pxrqw43q
TREC 2002 Interactive Track Report
2002
Text Retrieval Conference
Results could be obtained in XML format by sending a query via CGI, e.g., trec.panopticsearch.com/gov/padre-sw_xml.cgi?collection=gov&query=bush and getting back a padre_results packet. ...
In particular, for the collection of documents from the .gov domain, they used the second-level domain name to categorise the retrieved documents. ...
dblp:conf/trec/Hersh02
fatcat:uogum2w4hvcgjda3aodk7tujni
Boosting Web Retrieval Through Query Operations
[chapter]
2005
Lecture Notes in Computer Science
We explore the use of phrase and proximity terms in the context of web retrieval, which is different from traditional ad-hoc retrieval both in document structure and in query characteristics. ...
We also analyze why phrase and proximity terms are far more effective for web retrieval than for ad-hoc retrieval. ...
Additionally, we will apply our current results to additional corpora where, similarly to web documents, multiple representations of documents exist: such corpora are XML documents [15] and biomedical ...
doi:10.1007/978-3-540-31865-1_36
fatcat:sna7jjdfrjh6ritpvjesn3onjq
GeBioToolkit: Automatic Extraction of Gender-Balanced Multilingual Corpus of Wikipedia Biographies
[article]
2019
arXiv
pre-print
We introduce GeBioToolkit, a tool for extracting multilingual parallel corpora at sentence level, with document and gender information from Wikipedia biographies. ...
File Restructure Module Finally, the extracted parallel sentences are written as an xml file using a document-level mark-up. ...
In this case, and in addition to the automatic tags of each document (ID, language and gender), each document is tagged with an occupation category. ...
arXiv:1912.04778v1
fatcat:mf27cv7shzalta4yy432tcnfdy
CSIRO's Participation in INEX 2006
[chapter]
Lecture Notes in Computer Science
We split the documents in subdocuments according to the elements that we need to retrieve and indexed the files with PADRE. In a first step we extracted query elements from the INEX topics. ...
Then the query processor generated PADRE queries and post-processed the results according to specifications for each run. ...
In order for PADRE to retrieve sub-document parts like XML elements, the original XML files had to be split into smaller documents according to the XML elements relevant for retrieval. ...
doi:10.1007/978-3-540-73888-6_8
fatcat:ebvrrhhhszao3jyqbbq4ovgei4
Towards an extensible efficient event processing kernel
2012
Proceedings of the on SIGMOD/PODS 2012 PhD Symposium - PhD '12
Finally, we conduct a comprehensive evaluation to demonstrate the superiority of our proposed techniques in comparison with state-of-the-art algorithms designed for event processing. ...
... (e.g., user profiles and preferences) modeled as events using attribute-value pairs, XML documents, or relational tuples. ...
these applications are predefined set of patterns (e.g., investment strategies and attack specifications) modeled as subscriptions and streams of incoming data (e.g., XML documents, data packets, stock ...
doi:10.1145/2213598.2213602
dblp:conf/sigmod/Sadoghi12
fatcat:dqbmivxweva77nth3vkepgs2iy
Automatic Generation of Human-like Route Descriptions: A Corpus-driven Approach
2013
Journal of Emerging Technologies in Web Intelligence
retrieved from Google search engine. ...
Siga pela Avenida Padre Cacique 19. Continue pela Avenida Padre Cacique passando por Armazém do Sabor 20. Chegue no destino que fica próximo a Zé Pneus na Avenida Padre Cacique
TABLE III . ...
doi:10.4304/jetwi.5.4.413-423
fatcat:cx6iuus5r5b4vduzxik37jq5nq
Efficient and scalable filtering of graph-based metadata
2005
Journal of Web Semantics
G-ToPSS is particularly well suited for applications that deal with large-volume content distribution from diverse sources. ...
Current RSS feed aggregators follow a pull-based architecture model, which is not going to scale with the increasing number of RSS feeds becoming available on the Web. ...
The filtering service forwards the document to the interested clients which could be other XML-RPC clients as well as other Drupal modules. ...
doi:10.1016/j.websem.2005.09.006
fatcat:7mtlpldlu5bmzlagsieyqvhg6y
Efficient and Scalable Filtering of Graph-Based Metadata
2005
Social Science Research Network
G-ToPSS is particularly well suited for applications that deal with large-volume content distribution from diverse sources. ...
Current RSS feed aggregators follow a pull-based architecture model, which is not going to scale with the increasing number of RSS feeds becoming available on the Web. ...
The filtering service forwards the document to the interested clients which could be other XML-RPC clients as well as other Drupal modules. ...
doi:10.2139/ssrn.3199262
fatcat:huvjilas6jbjlkfbyhopykjlii
An Interoperable Electronic Health Record System for Clinical Cardiology
2022
Informatics
All documents have been given as Health Level 7 (HL7) clinical document architecture and messages are sent as HL7-Version 2 (V2) and/or HL7 Fast Healthcare Interoperability Resources (FHIR). ...
The system has been used for more than three years with a good level of satisfaction by the users. ...
Figure 7. Pathway for search and retrieval of the diagnostic report list. ...
doi:10.3390/informatics9020047
fatcat:ytjtiazmhzcadhtc2klsxc6kmu