
Enhancing Content-And-Structure Information Retrieval using a Native XML Database [article]

Jovan Pehcevski, Anne-Marie Vercoustre
2005 arXiv   pre-print
The final results of our experiments show that when the XML retrieval task focusses on highly relevant elements our hybrid XML retrieval system with the Coherent Retrieval Elements module is 1.8 times  ...  Three approaches to content-and-structure XML retrieval are analysed in this paper: first by using Zettair, a full-text information retrieval system; second by using eXist, a native XML database, and third  ...  their support with using Zettair and to the anonymous reviewers for their useful suggestions.  ... 
arXiv:cs/0508017v1 fatcat:g77widajljc5nhkfhnizpap33q

Hybrid XML Retrieval: Combining Information Retrieval and a Native XML Database

Jovan Pehcevski, James A. Thom, Anne-Marie Vercoustre
2005 Information retrieval (Boston)  
and a very effective XML retrieval.  ...  This paper investigates the impact of three approaches to XML retrieval: using Zettair, a full-text information retrieval system; using eXist, a native XML database; and using a hybrid system that takes  ...  Acknowledgements We thank Wolfgang Meier for his assistance with using eXist, and Nick Lester, Falk Scholer and other members of the Search Engine Group at RMIT for their support with using Zettair.  ... 
doi:10.1007/s10791-005-0748-1 fatcat:x4rwxiu3rndnrlozun2q6rcptm

Combining Indexing Schemes to Accelerate Querying XML on Content and Structure

Georgina Ramírez, Arjen P. de Vries
2004 Twente Data Management Workshop  
This talk focuses on retrieval models and methods for the document-centric view on XML and their evaluation within the INEX initiative.  ...  INEX has studied two types of tasks so far: 1. content-only (CO) queries are standard IR queries, where the system should retrieve the most specific elements answering the query;  ...  their support with using Zettair and to the anonymous reviewers for their useful suggestions.  ... 
dblp:conf/tdm/RamirezV04 fatcat:72gp6v5gcbgcfnyo2dguq6a5ey

TREC 14 Enterprise Track at CSIRO and ANU

Mingfang Wu, David Hawking, Paul Thomas
2005 Text Retrieval Conference  
For both tasks, we used the PADRE retrieval system [1], in which the Okapi BM25 relevance function was implemented.  ...  We parsed the HTML pages in the original collection into an XML format (the DTD is shown in the appendix), and removed non-email pages.  ...  According to PADRE's query language, this query would be transformed internally to: retrieve the email that contains one or more terms from "official introduction to 'Dan Connolly'" with connolly@hal.com  ... 
dblp:conf/trec/WuHT05 fatcat:gqlb3lnrk5cfth6p4riybfcdjy

Routing of XML and XPath Queries in Data Dissemination Networks

Guoli Li, Shuang Hou, Hans-Arno Jacobsen
2008 2008 The 28th International Conference on Distributed Computing Systems  
XML-based data dissemination networks are rapidly gaining momentum.  ...  In these networks XML content is routed from data producers to data consumers throughout an overlay network of content-based routers.  ...  Prior to receiving XML documents, consumers must have expressed interest in receiving XML documents by registering XPEs with the network.  ... 
doi:10.1109/icdcs.2008.31 dblp:conf/icdcs/LiHJ08 fatcat:r42gs7mn3nhz5nv4dwkbduqyh4

Decentralized Execution of Event-Driven Scientific Workflows

Guoli Li, Vinod Muthusamy, Hans-Arno Jacobsen, Serge Mankovski
2006 2006 IEEE Services Computing Workshops  
PADRES has been developed with features inspired by the requirements of SWF management.  ...  , composite subscription processing support, a rule-based matching and routing mechanism, a query-based historic data access mechanism, and support for the decentralized execution of SWFs specified in XML  ...  We would also like to thank the members of the PADRES team including Alex Cheung, Alex Wun, Eli Fidler, Pengcheng Wan, Shuang Hou, Gerald Chan, David Matheson, and Matt Medland.  ... 
doi:10.1109/scw.2006.10 dblp:conf/scw/LiMJM06 fatcat:u5bn3gxjobd7xn7y77pxrqw43q

TREC 2002 Interactive Track Report

William R. Hersh
2002 Text Retrieval Conference  
Results could be obtained in XML format by sending a query via CGI, e.g., trec.panopticsearch.com/gov/padre-sw_xml.cgi?collection=gov&query=bush and getting back a padre_results packet.  ...  Particularly, for the collection of documents from the .gov domain, they used the level two domain name to categorise the retrieved documents.  ... 
dblp:conf/trec/Hersh02 fatcat:uogum2w4hvcgjda3aodk7tujni

Boosting Web Retrieval Through Query Operations [chapter]

Gilad Mishne, Maarten de Rijke
2005 Lecture Notes in Computer Science  
We explore the use of phrase and proximity terms in the context of web retrieval, which is different from traditional ad-hoc retrieval both in document structure and in query characteristics.  ...  We also analyze why phrase and proximity terms are far more effective for web retrieval than for ad-hoc retrieval.  ...  Additionally, we will apply our current results to additional corpora where, similarly to web documents, multiple representations of documents exist: such corpora are XML documents [15] and biomedical  ... 
doi:10.1007/978-3-540-31865-1_36 fatcat:sna7jjdfrjh6ritpvjesn3onjq

GeBioToolkit: Automatic Extraction of Gender-Balanced Multilingual Corpus of Wikipedia Biographies [article]

Marta R. Costa-jussà, Pau Li Lin, Cristina España-Bonet
2019 arXiv   pre-print
We introduce GeBioToolkit, a tool for extracting multilingual parallel corpora at sentence level, with document and gender information from Wikipedia biographies.  ...  File Restructure Module Finally, the extracted parallel sentences are written as an xml file using a document-level mark-up.  ...  In this case, and in addition to the automatic tags of each document (ID, language and gender), each document is tagged with an occupation category.  ... 
arXiv:1912.04778v1 fatcat:mf27cv7shzalta4yy432tcnfdy

CSIRO's Participation in INEX 2006 [chapter]

Alexander Krumpholz, David Hawking
Lecture Notes in Computer Science  
We split the documents in subdocuments according to the elements that we need to retrieve and indexed the files with PADRE. In a first step we extracted query elements from the INEX topics.  ...  Then the query processor generated PADRE queries and post-processed the results according to specifications for each run.  ...  In order for PADRE to retrieve sub-document parts like XML elements, the original XML files had to be split into smaller documents according to the XML elements relevant for retrieval.  ... 
doi:10.1007/978-3-540-73888-6_8 fatcat:ebvrrhhhszao3jyqbbq4ovgei4

Towards an extensible efficient event processing kernel

Mohammad Sadoghi
2012 Proceedings of the SIGMOD/PODS 2012 PhD Symposium - PhD '12  
Finally, we conduct a comprehensive evaluation to demonstrate the superiority of our proposed techniques in comparison with state-of-the-art algorithms designed for event processing.  ...  ., user profiles and preferences) modeled as events using attribute-value pairs, XML document, or relational tuples.  ...  these applications are predefined set of patterns (e.g., investment strategies and attack specifications) modeled as subscriptions and streams of incoming data (e.g., XML documents, data packets, stock  ... 
doi:10.1145/2213598.2213602 dblp:conf/sigmod/Sadoghi12 fatcat:dqbmivxweva77nth3vkepgs2iy

Automatic Generation of Human-like Route Descriptions: A Corpus-driven Approach

Rafael Teles, Bruno Barroso, Adolfo Guimaraes, Hendrik Macedo
2013 Journal of Emerging Technologies in Web Intelligence  
retrieved from Google search engine.  ...  Siga pela Avenida Padre Cacique 19. Continue pela Avenida Padre Cacique passando por Armazém do Sabor 20. Chegue no destino que fica próximo a Zé Pneus na Avenida Padre Cacique  ... 
doi:10.4304/jetwi.5.4.413-423 fatcat:cx6iuus5r5b4vduzxik37jq5nq

Efficient and scalable filtering of graph-based metadata

Haifeng Liu, Milenko Petrovic, Hans-Arno Jacobsen
2005 Journal of Web Semantics  
G-ToPSS is particularly well suited for applications that deal with large-volume content distribution from diverse sources.  ...  Current RSS feed aggregators follow a pull-based architecture model, which is not going to scale with the increasing number of RSS feeds becoming available on the Web.  ...  The filtering service forwards the document to the interested clients which could be other XML-RPC clients as well as other Drupal modules.  ... 
doi:10.1016/j.websem.2005.09.006 fatcat:7mtlpldlu5bmzlagsieyqvhg6y

Efficient and Scalable Filtering of Graph-Based Metadata

Haifeng Liu, Milenko Petrovic, Hans-Arno Jacobsen
2005 Social Science Research Network  
G-ToPSS is particularly well suited for applications that deal with large-volume content distribution from diverse sources.  ...  Current RSS feed aggregators follow a pull-based architecture model, which is not going to scale with the increasing number of RSS feeds becoming available on the Web.  ...  The filtering service forwards the document to the interested clients which could be other XML-RPC clients as well as other Drupal modules.  ... 
doi:10.2139/ssrn.3199262 fatcat:huvjilas6jbjlkfbyhopykjlii

An Interoperable Electronic Health Record System for Clinical Cardiology

Elena Lazarova, Sara Mora, Norbert Maggi, Carmelina Ruggiero, Alessandro Cosolito Vitale, Paolo Rubartelli, Mauro Giacomini
2022 Informatics  
All documents have been given as Health Level 7 (HL7) clinical document architecture and messages are sent as HL7-Version 2 (V2) and/or HL7 Fast Healthcare Interoperability Resources (FHIR).  ...  The system has been used for more than three years with a good level of satisfaction by the users.  ...  Figure 7. Pathway for search and retrieval of the diagnostic report list.  ... 
doi:10.3390/informatics9020047 fatcat:ytjtiazmhzcadhtc2klsxc6kmu