3,700 Hits in 4.5 sec

A Fusion Approach to XML Structured Document Retrieval

Ray R. Larson
2005 Information retrieval (Boston)  
The basic notion of "data fusion" or "meta-search" approaches to IR is quite simple and intuitively appealing.  ...  In the research reported here, we examine the application of data fusion methods to the XML retrieval problem.  ...  In this paper we have examined the fusion of different algorithms and document components in content-oriented and content and structure XML retrieval.  ... 
doi:10.1007/s10791-005-0749-0 fatcat:jrs55vnjzrgpdmybzzab5mr5ia

Combining Image and Structured Text Retrieval [chapter]

D. N. F. Awang Iskandar, Jovan Pehcevski, James A. Thom, S. M. M. Tahaghoghi
2006 Lecture Notes in Computer Science  
In this paper we present our approach of combining evidence from a content-oriented XML retrieval system and a content-based image retrieval system using a linear evidence combination approach as part of  ...  Two common approaches to retrieving images from a collection are retrieval by text keywords, and retrieval by visual content.  ...  We acknowledge Jonathan Yu for his assistance in proposing and assessing a multimedia topic.  ... 
doi:10.1007/978-3-540-34963-1_40 fatcat:rwhe2fyr7rctvf3w4jgatlbdh4

Kinship contextualization

Muhammad A. Norozi, Paavo Arvola
2013 Proceedings of the 36th international ACM SIGIR conference on Research and development in information retrieval - SIGIR '13  
In this study we hypothesize that the context of an XML-element originated from its preceding and following elements in the sequential ordering of a document improves the quality of retrieval.  ...  In the tree form of the document's structure, kinship contextualization means, contextualization based on the horizontal and vertical elements in the kinship tree, or elements in closer to a wider structural  ...  XML documents are used as a sample case of semi-structured documents, these documents have hierarchical structure, which is often represented in a form of tree.  ... 
doi:10.1145/2484028.2484111 dblp:conf/sigir/NoroziA13 fatcat:iwkcznwt2fbdrjqomwlpm77i2a

Contextualization using hyperlinks and internal hierarchical structure of Wikipedia documents

Muhammad Ali Norozi, Paavo Arvola, Arjen P. de Vries
2012 Proceedings of the 21st ACM international conference on Information and knowledge management - CIKM '12  
We hypothesize that documents in a good context (having strong contextual evidence) should be good candidates to be relevant to the posed query, and vice versa.  ...  semi-structured documents.  ...  The tree-structure of the XML document is considered as a graph. Myriad of random surfers traverse the XML graphs.  ... 
doi:10.1145/2396761.2396855 dblp:conf/cikm/NoroziAV12 fatcat:gmpjpuxrm5b5hljmsgdp6sf7di

Information Retrieval of Sequential Data in Heterogeneous XML Databases [chapter]

Eugen Popovici, Pierre-François Marteau, Gildas Ménier
2006 Lecture Notes in Computer Science  
In this article we introduce a retrieval scheme designed to manage sequential data in an XML context based on two levels of approximation: on the structural localization/organization of the sequential  ...  The XML documents evolved from plain structured text representations, to documents having complex and heterogeneous structures and contents: video descriptions, mathematical formulas, time series or sequences  ...  A general opinion [8] and also our belief is that using similarity operators adapted to the document content types and to the XML structure in the retrieval process will improve the precision of the  ... 
doi:10.1007/11670834_19 fatcat:kktag6u3xjgt5lvbmoh77pndgm

Integrating Text Retrieval and Image Retrieval in XML Document Searching [chapter]

D. Tjondronegoro, J. Zhang, J. Gu, A. Nguyen, S. Geva
2006 Lecture Notes in Computer Science  
Two search engines, an XML document search engine using both content and structure based on text, and a content-based image search engine were used at the same time.  ...  We test this hypothesis by doing a series of experiments using the Lonely Planet XML document collection.  ...  Fig. 2. Database Structure for text- and image-based XML Document Retrieval.  ... 
doi:10.1007/978-3-540-34963-1_39 fatcat:zyzf5xslmvckfg5bi6j5gn47wq

Combining structured and unstructured information in a retrieval model for accessing legislation

Marie-Francine Moens
2005 Proceedings of the 10th international conference on Artificial intelligence and law - ICAIL '05  
Such an approach is very promising for the retrieval of legal documents. This is illustrated with two retrieval models specifically designed for the retrieval of legislation.  ...  Legal documents typically combine structured and unstructured information, the former being tagged with markup languages such as XML (Extensible Markup Language).  ...  With regard to information retrieval from document-centric XML data, the research community has exhibited a large interest in XML retrieval models.  ... 
doi:10.1145/1165485.1165507 dblp:conf/icail/Moens05 fatcat:73tf7wv755ggdcdlbgb7kyqxuy

A Novel Framework for Data Extraction from Multiple Repositories and Generation of Ontologies using Inverted Indexing Technique

Sudeepthi Govathoti, M. Surendra Prasad Babu
2017 International Journal of Database Theory and Application  
It is a fact that information retrieval and data extraction are difficult tasks in handling the large collection of web documents.  ...  Semantic web is a new technology used to handle the massive raw data to transform it into knowledgeable representation.  ...  [16] proposed a System for retrieving and organizing the educational materials using the semantic based approach, a frame based approach for representing conceptual entities for handling large text  ... 
doi:10.14257/ijdta.2017.10.7.07 fatcat:pv7dlqcpzvf3vhu33dopu7puu4

Finding Relevant Passages in Scientific Articles: Fusion of Automatic Approaches vs. an Interactive Team Effort

Dina Demner-Fushman, Susanne M. Humphrey, Nicholas C. Ide, Russell F. Loane, Patrick Ruch, Miguel E. Ruiz, Lawrence H. Smith, Lorraine K. Tanabe, W. John Wilbur, Alan R. Aronson
2006 Text Retrieval Conference  
To continue using our TREC 2005 fusion approach, we needed a common representation for the full text biomedical articles to be shared by the four base systems (Essie, SMART, EasyIR and Theme).  ...  This paper presents our approach to retargeting the information retrieval systems designed and/or optimized for retrieval of MEDLINE citations to the task of finding relevant passages in the text of scientific  ...  This latter strategy has been shown to be very effective but relies on a structured XML document format that is not always available.  ... 
dblp:conf/trec/Demner-FushmanHILSTWADRR06 fatcat:m7gfptuajvfzpgxvl7b2flulbi

When is the Structural Context Effective?

Muhammad Ali Norozi, Paavo Arvola
2013 Dutch-Belgian Workshop on Information Retrieval  
Document parts, referred to as elements, have both a hierarchical and a sequential relationship with each other.  ...  In focused retrieval, the use of context is a driving force to alleviate or "un-bias" the retrieval of items with varying length.  ...  The tree-structure of the XML document (Figure 1) is assumed to be a graph.  ... 
dblp:conf/dir/NoroziA13 fatcat:laana746jzg5taj2z5cpv3xasi

RSLIS at INEX 2011: Social Book Search Track [chapter]

Toine Bogers, Kirstine Wilfred Christensen, Birger Larsen
2012 Lecture Notes in Computer Science  
We investigate the contribution of different types of document metadata, both social and controlled, and examine the effectiveness of re-ranking retrieval results using social features.  ...  We find that the best results are obtained using all available document fields and topic representations.  ...  We combine this weighting method with the three fusion methods CombMAX, CombSUM, and CombMNZ to arrive at a weighted fusion approach.  ... 
doi:10.1007/978-3-642-35734-3_3 fatcat:utoizgy26vdthiybkrpaeyodw4
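
The three fusion methods named in the snippet above are standard score-combination rules, and a minimal sketch may help make them concrete. It assumes each retrieval run is a dictionary mapping a document identifier to a score already normalized to a comparable range; the function name `fuse` and the input layout are illustrative assumptions, and the per-run weighting the authors mention is not reproduced here.

```python
from collections import defaultdict


def fuse(runs, method="CombSUM"):
    """Fuse several runs ({doc_id: score}) into one ranked list.

    Hypothetical helper for illustration only; scores are assumed to be
    normalized across runs before fusion.
    """
    # Collect every score each document received across the runs.
    per_doc = defaultdict(list)
    for run in runs:
        for doc_id, score in run.items():
            per_doc[doc_id].append(score)

    fused = {}
    for doc_id, scores in per_doc.items():
        if method == "CombMAX":
            # Keep the single highest score.
            fused[doc_id] = max(scores)
        elif method == "CombSUM":
            # Add up the scores from all runs that returned the document.
            fused[doc_id] = sum(scores)
        elif method == "CombMNZ":
            # CombSUM multiplied by the number of runs with a nonzero score.
            nonzero = sum(1 for s in scores if s != 0)
            fused[doc_id] = sum(scores) * nonzero
        else:
            raise ValueError(f"unknown fusion method: {method}")

    # Highest fused score first; ties broken by document id for stability.
    return sorted(fused.items(), key=lambda kv: (-kv[1], kv[0]))


runs = [{"a": 0.5, "b": 0.25}, {"a": 0.25, "c": 0.75}]
ranking = fuse(runs, "CombMNZ")
```

CombMNZ rewards documents returned by more than one run, which is why "a" moves ahead of "c" in this toy input even though their CombSUM scores are equal.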

Combining content and structure similarity for XML document classification using composite SVM kernels

Saptarshi Ghosh, Pabitra Mitra
2008 Pattern Recognition (ICPR), Proceedings of the International Conference on  
Combination of structure and content features is necessary for effective retrieval and classification of XML documents.  ...  Composite kernels provide a way for fusion of content and structure information.  ...  The recent trend in XML retrieval and classification, as exemplified by the INEX 2006 challenge [2], is to utilise both structure and content information.  ... 
doi:10.1109/icpr.2008.4761539 dblp:conf/icpr/GhoshM08 fatcat:5ltulzed2naobjyaucdp2pn2k4

UTD HLTRI at TREC 2017: Precision Medicine Track

Travis R. Goodwin, Michael A. Skinner, Sanda M. Harabagiu
2017 Text Retrieval Conference  
retrieved documents be within the domain of precision medicine and that retrieved documents have a focus on treatment.  ...  Our experiments reveal that the aspect-based approach leads to improved quality of retrieved scientific articles and clinical trials.  ...  Aspect Fusion When documents are retrieved using the Aspect Retrieval strategy, it is necessary to combine the ranked list of documents obtained for each aspect to produce a single ranked list of documents  ... 
dblp:conf/trec/GoodwinSH17 fatcat:kfkl65zgh5euzne6yzqvdoutru

TRECVID 2010 Known-item Search (KIS) Task by I2R

Lekha Chaisorn, Kong-Wah Wan, Yan-Tao Zheng, Yongwei Zhu, Tian-Shiang Kok, Hui Li Tan, Zixiang Fu, Susanna Bolling
2010 TREC Video Retrieval Evaluation  
Locating the unique video for a query, however, poses new challenges over existing information retrieval approaches.  ...  By collecting a number of relevant videos, the searchers can perform relevance feedback to refine the retrieval and continue the search.  ...  documents (XML) and the queries).  ... 
dblp:conf/trecvid/ChaisornWZZKTFB10 fatcat:orpcfvvcmnbk3dxg5cwdwsmijy

Finding an application-appropriate model for XML data warehouses

Franck Ravat, Olivier Teste, Ronan Tournier, Gilles Zurfluh
2010 Information Systems  
Two formalisms may be used by XML documents to describe their own structure: DTD (Document Type Definition) and XSchema [94,95].  ...  XML documents are less structured at first glance.  ...  The idea is to query the document storage space and to retrieve a particular set of documents.  ... 
doi:10.1016/ fatcat:2qeuwx3cx5a5fjxttaobsx2bdy