Filters








2,570 Hits in 8.6 sec

Applying the Divergence from Randomness Approach for Content-Only Search in XML Documents [chapter]

Mohammad Abolhassani, Norbert Fuhr
2004 Lecture Notes in Computer Science  
In this paper, we investigate the application of a specific language model for this task, namely Amati's approach of divergence from randomness.  ...  Content-only retrieval of XML documents deals with the problem of locating the smallest XML elements that satisfy the query.  ...  By adopting ideas from the successful augmentation approach, we have extended Amati's model by a third normalisation component which takes into account the hierarchical structure of XML documents.  ... 
doi:10.1007/978-3-540-24752-4_30 fatcat:dbwx643sovhghgnyfdekf4c3gq

Document Clustering Evaluation: Divergence from a Random Baseline [article]

Christopher M. De Vries, Shlomo Geva, Andrew Trotman
2012 arXiv   pre-print
Divergence from a random baseline is a technique for the evaluation of document clustering.  ...  The divergence from a random baseline approach is able to differentiate ineffective clusterings encountered in the INEX XML Mining track.  ...  Section 8 analyses the application of divergence from a random baseline using the INEX 2010 XML mining track. The paper is concluded in Section 9.  ... 
arXiv:1208.5654v2 fatcat:ct7lmr5hx5ejjbpa3ux7kdoo5e

The Impact of Linked Documents and Graph Analysis on Information Retrieval Methods for Book Recommendation

Chahinez Benkoussas, Patrice Bellot, Anais Ollagnier
2015 2015 IEEE/WIC/ACM International Conference on Web Intelligence and Intelligent Agent Technology (WI-IAT)  
We considered the application of a graph based algorithm in a new retrieval approach to related document network comprised of social links.  ...  We used different theoretical retrieval models: probabilistic as InL2 (Divergence From Randomness model) and language models and tested their interpolated combination.  ...  Abolhassani and Fuhr have investigated several possibilities for applying Amati's DFR model [6] for content-only search in XML documents. [10] .  ... 
doi:10.1109/wi-iat.2015.200 dblp:conf/webi/BenkoussasBO15 fatcat:2id5cguiovdo3imyznwf6baxn4

Information Retrieval and Graph Analysis Approaches for Book Recommendation

Chahinez Benkoussas, Patrice Bellot
2015 The Scientific World Journal  
Specifically, this work tackles the problem of book recommendation in the context of INEX (Initiative for the Evaluation of XML retrieval) Social Book Search track.  ...  We consider the application of this algorithm in a new retrieval approach to related document network comprised of social links.  ...  Acknowledgment This work was supported by the French program "Investissements d'Avenir-Développement de l'Economie Numérique" under Project Inter-Textes no. O14751-408983.  ... 
doi:10.1155/2015/926418 pmid:26504899 pmcid:PMC4609525 fatcat:sm2cvhatorhtthcxnpiyhs6uam

Cross-Document Search Engine For Book Recommendation

Chahinez Benkoussas, Patrice Bellot
2015 ACM Conference on Recommender Systems  
We considered the application of a graph based algorithm in a new retrieval approach to related document network comprised of social links.  ...  We used different theoretical retrieval models: probabilistic as InL2 (Divergence From Randomness model) and language models and tested their interpolated combination.  ...  This work was supported by the French program Investissements d'Avenir FSN and the French Région PACA under the projects InterTextes and Agoraweb.  ... 
dblp:conf/recsys/BenkoussasB15 fatcat:wcoq6p3dmnbrtcyhjwkyblhyoa

Efficient tree pattern queries on encrypted XML documents

Jianneng Cao, Fang-Yu Rao, Mehmet Kuzu, Elisa Bertino, Murat Kantarcioglu
2013 Proceedings of the Joint EDBT/ICDT 2013 Workshops on - EDBT '13  
Past approaches on this topic either leak structural information or fail to support searching that has constraints on XML node content.  ...  By assigning each node in the hierarchy a position, we create for each document a vector, which encodes both the structural and textual information about the document.  ...  Finally, the user prunes all the false positives by a post-processing step. Such an approach can potentially be applied to search XML documents.  ... 
doi:10.1145/2457317.2457338 dblp:conf/edbt/CaoRKBK13 fatcat:unzc6qvh4veo7oal3fumlmmoxm

Collaborative Filtering for Book Recommandation

Chahinez Benkoussas, Hussam Hamdan, Shereen Albitar, Anaïs Ollagnier, Patrice Bellot
2014 Conference and Labs of the Evaluation Forum  
In this paper, we present our contribution in INEX 2014 Social Book Search Track.  ...  In our experiments we used dierent methods, one of our submissions which uses INL2 got the second rank w.r.t nDCG@10 measure, the ocial measure for this task.  ...  Retrieval Model InL2 We used InL2 model implemented in Terrier. InL2 is DFR-based model (Divergence From Randomness).  ... 
dblp:conf/clef/BenkoussasHAOB14 fatcat:voxf3xsgjna6bjtwmzcxqv7c6a

Concept-Based Search on Semi-structured Data Exploiting Mined Semantic Relations [chapter]

Jens Graupmann
2004 Lecture Notes in Computer Science  
In this paper we show the current state of the ongoing research concerning our prototype for a search engine on semi-structured data incorporating rules mined on extracted structured data.  ...  We illuminate some ideas from the research field of data mining and how to apply them to the retrieval process. Additionally, we show technical aspects and features of our search engine.  ...  (see also [9, 10] for keyword search in an XML context).  ... 
doi:10.1007/978-3-540-30192-9_4 fatcat:fb667y76gvaapag3fimvd3hpce

Unstructured Content Analysis & Classification System for the IRS

R.Palson Kennedy
2010 International Journal of Computer Applications  
The proposed XML Schema Model for Unstructured Content Personalization shown in Figure 1 .  ...  From the legacy design efforts for CSDL to the myriad of approaches to XML schema development including the development of XIRQL , Hybrid XML retrieval and XML queries , the adoption of advanced techniques  ...  This permits full text searching, enabling retrieval on the basis of any words in the document. In others, a digitized image of the document is stored, usually on a write-once optical disc.  ... 
doi:10.5120/105-216 fatcat:f4r4ahs7vfhihp5vftr5cw54uy

Conference Mining via Generalized Topic Modeling [chapter]

Ali Daud, Juanzi Li, Lizhu Zhou, Faqir Muhammad
2009 Lecture Notes in Computer Science  
Previous approaches mined conferences by using network connectivity or by using semantics-based intrinsic structure of the words present between documents (modeling from document level (DL)), while ignored  ...  In this paper, we address this problem by considering semantics-based intrinsic structure of the words present in conferences (richer semantics) by modeling from conference level (CL).  ...  For this purpose we apply equation 4 only on the word tokens in the new conference each time temporarily updating the count matrices of (word by topic) and (topic by conference).  ... 
doi:10.1007/978-3-642-04180-8_33 fatcat:zk3cds6pwvf5fkz3vmlz3grixm

Book Recommendation Using Information Retrieval Methods and Graph Analysis

Chahinez Benkoussas, Anaïs Ollagnier, Patrice Bellot
2015 Conference and Labs of the Evaluation Forum  
In this paper, we present our contribution in INEX 2015 Social Book Search Track.  ...  We integrated tools from natural language processing (NLP) and approaches based on graph analysis to improve the recommendation performances.  ...  Acknowledgements This work was supported by the French program "Investissements d'Avenir -Développement de l'Economie Numérique" under the project Inter-Textes #O14751-408983.  ... 
dblp:conf/clef/BenkoussasOB15 fatcat:xbqfjh5w2fgb5c5dfse56hlppe

Real World Evaluation of Approaches to Research Paper Recommendation [article]

Siddharth Dinesh
2018 arXiv   pre-print
We find that a term based similarity search performs better than keyword based approaches. These results are a good starting point in finding performance improvements for related document searches.  ...  In this work, we have identified the need for choosing baseline approaches for research-paper recommendation systems.  ...  Random The approach randomly picks the set of documents to recommend to the user. We experiment with this approach by randomly choosing to apply a language filter 50% of the time.  ... 
arXiv:1802.06892v1 fatcat:5e2jgjicxrg65ihagbxwjzvmai

An overview on XML similarity: Background, current trends and future directions

Joe Tekli, Richard Chbeir, Kokou Yetongnon
2009 Computer Science Review  
Owing to an unparalleled increasing use of the XML standard, developing efficient techniques for comparing XML-based documents becomes essential in the database and information retrieval communities.  ...  In recent years, XML has been established as a major means for information management, and has been broadly utilized for complex data representation (e.g. multimedia objects).  ...  ) model [24] , the DFR (Divergence From Randomness) model [3] , etc.  ... 
doi:10.1016/j.cosrev.2009.03.001 fatcat:c3mvd7her5ae3ohbip25c753b4

Search on the Semantic Web

Li Ding, T. Finin, A. Joshi, Yun Peng, Rong Pan, P. Reddivari
2005 Computer  
Their details diverge, however, due to differences in the distribution of SWDs and the semantics of their content.  ...  Google's simple random surfer model is not appropriate for these paths. For example, an agent reasoning over the content found in an SWD should access and process all of the ontologies it imports.  ... 
doi:10.1109/mc.2005.350 fatcat:mwldp5vsw5gt7eakeau7lshmu4

Two Statistical Summarizers at INEX 2012 Tweet Contextualization Track

Juan-Manuel Torres-Moreno, Patricia Velázquez-Morales
2012 Conference and Labs of the Evaluation Forum  
This context should take the form of a readable (and short) summary, composed of passages from [...] Wikipedia." We present summarizers Cortex and KL-summ applied to the INEX 2012 task.  ...  Cortex summarizer uses several sentence selection metrics and an optimal decision module to score sentences from a document source.  ...  High values mean a less divergence of summary from source document. In other words, lower divergences (High Fresa scores) shows a more quantity of content of summary.  ... 
dblp:conf/clef/MorenoV12 fatcat:xn2bl25uxbfttdrophubnxhxwm
« Previous Showing results 1 — 15 out of 2,570 results