Filters








75 Hits in 5.3 sec

Overview of the INEX 2009 XML Mining Track: Clustering and Classification of XML Documents [chapter]

Richi Nayak, Christopher M. De Vries, Sangeetha Kutty, Shlomo Geva, Ludovic Denoyer, Patrick Gallinari
2010 Lecture Notes in Computer Science  
This report explains the objectives, datasets and evaluation criteria of both the clustering and classification tasks set in the INEX 2009 XML Mining track.  ...  The report also describes the approaches and results obtained by the different participants.  ...  Acknowledgments We would like to thank all the participants for their efforts and hard work. 6.  ... 
doi:10.1007/978-3-642-14556-8_36 fatcat:2gyfgsqdfngpbeichjycrgmtau

Report on INEX 2009

T. Beckers, S. Geva, W.-C. Huang, T. Iofciu, J. Kamps, G. Kazai, M. Koolen, S. Kutty, M. Landoni, M. Lehtonen, V. Moriceau, P. Bellot (+17 others)
2010 SIGIR Forum  
This paper reports on the INEX 2009 evaluation campaign, which consisted of a wide range of tracks: Ad hoc, Book, Efficiency, Entity Ranking, Interactive, QA, Link the Wiki, and XML Mining.  ...  XML-Mining Track Investigating structured document mining, especially the classification and clustering of semi-structured documents.  ...  Overview The Efficiency Track was run for the second time in 2009, with its first incarnation at INEX 2008 [17] .  ... 
doi:10.1145/1842890.1842897 fatcat:46evgkszirdm3grr6fqrtuyqtm

Report on INEX 2008

Gianluca Demartin, Gabriella Kazai, Marijn Koolen, Monica Landoni, Ragnar Nordlie, Nils Pharo, Ralf Schenkel, Martin Theobald, Andrew Trotman, Arjen P. de Vries, Alan Woodley, Ludovic Denoye (+8 others)
2009 SIGIR Forum  
This paper reports on the INEX 2008 evaluation campaign, which consisted of a wide range of tracks: Ad hoc, Book, Efficiency, Entity Ranking, Interactive, QA, Link the Wiki, and XML Mining. • Link-the-Wiki  ...  Track Investigating link discovery between Wikipedia documents, both at the file level and at the element level. • XML-Mining Track Investigating structured document mining, especially the classification  ...  This question will be examined by the track in 2009. XML Mining Track In this section, we briefly discuss the XML Mining track; a detailed discussion is in [4] .  ... 
doi:10.1145/1670598.1670603 fatcat:zoxbecrybrf63fg54g4kt7w7na

Overview of the INEX 2008 Ad Hoc Track [chapter]

Jaap Kamps, Shlomo Geva, Andrew Trotman, Alan Woodley, Marijn Koolen
2009 Lecture Notes in Computer Science  
This paper gives an overview of the INEX 2007 Ad Hoc Track.  ...  The main purpose of the Ad Hoc Track was to investigate the value of the internal document structure (as provided by the XML markup) for retrieving relevant information.  ...  Acknowledgments Eternal thanks to Benjamin Piwowarski for completely updating the X-RAI tools to ensure that all passage offsets can be mapped exactly.  ... 
doi:10.1007/978-3-642-03761-0_1 fatcat:exrtt2h6gzdjxmoqiodqhrhmhy

Clustering XML Documents Using Frequent Subtrees [chapter]

Sangeetha Kutty, Tien Tran, Richi Nayak, Yuefeng Li
2009 Lecture Notes in Computer Science  
This paper presents an experimental study conducted over the INEX 2008 Document Mining Challenge corpus using both the structure and the content of XML documents for clustering them.  ...  In spite of the large number of documents in the INEX 2008 Wikipedia dataset, the proposed frequent subtree-based clustering approach was successful in clustering the documents.  ...  Table 1 summarises the clustering results for INEX Wikipedia XML Mining Track 2008. Table 1.  ... 
doi:10.1007/978-3-642-03761-0_45 fatcat:3jopojjmmnhapkqyixqezs75ie

Report on INEX 2010

D. Alexander, J. Kamps, G. Kazai, M. Koolen, S. Kutty, M. Landoni, V. Moriceau, R. Nayak, R. Nordlie, N. Pharo, E. SanJuan, P. Arvola (+15 others)
2011 SIGIR Forum  
XML-Mining Track Investigating structured document mining, especially the classification and clustering of semi-structured documents.  ...  and XML Mining.  ...  INEX 2011 will proudly continue pushing research boundaries, with a wide range of new tasks including Social Search, Faceted Retrieval, and Snippet Retrieval.  ... 
doi:10.1145/1988852.1988854 fatcat:mpw23y6ywjdt7myyopp3olz37y

Overview of the INEX 2009 Ad Hoc Track [chapter]

Shlomo Geva, Jaap Kamps, Miro Lethonen, Ralf Schenkel, James A. Thom, Andrew Trotman
2010 Lecture Notes in Computer Science  
This paper gives an overview of the INEX 2007 Ad Hoc Track.  ...  The main purpose of the Ad Hoc Track was to investigate the value of the internal document structure (as provided by the XML markup) for retrieving relevant information.  ...  Jaap Kamps was supported by the Netherlands Organization for Scientific Research (NWO, grants # 612.066.513, 639.072.601, and 640.001.501), and by the E.U.'  ... 
doi:10.1007/978-3-642-14556-8_4 fatcat:bdnyqr63bzdzxkxmbkma4mjpqm

Overview of the INEX 2010 XML Mining Track: Clustering and Classification of XML Documents [chapter]

Christopher M. De Vries, Richi Nayak, Sangeetha Kutty, Shlomo Geva, Andrea Tagarelli
2011 Lecture Notes in Computer Science  
This report explains the objectives, datasets and evaluation criteria of both the clustering and classification tasks set in the INEX 2010 XML Mining track.  ...  The report also describes the approaches and results obtained by participants.  ...  This track has run for six editions during INEX 2005 INEX , 2006 INEX , 2007 INEX , 2008 INEX , 2009 and 2010.  ... 
doi:10.1007/978-3-642-23577-1_35 fatcat:q6rrg5zgczgl3mnxy5cznbz7qu

Overview of the INEX 2011 Snippet Retrieval Track [chapter]

Matthew Trappett, Shlomo Geva, Andrew Trotman, Falk Scholer, Mark Sanderson
2012 Lecture Notes in Computer Science  
This paper gives an overview of the INEX 2011 Snippet Retrieval Track.  ...  We discuss the setup of the track, and the evaluation results.  ...  Test Collection The Snippet Retrieval Track uses the INEX Wikipedia collection introduced in 2009 an XML version of the English Wikipedia, based on a dump taken on 8 October 2008, and semantically annotated  ... 
doi:10.1007/978-3-642-35734-3_27 fatcat:4pxcehyejvfxdppv52dlyo6wgq

Overview of the INEX 2008 Link the Wiki Track [chapter]

Wei Che Huang, Shlomo Geva, Andrew Trotman
2009 Lecture Notes in Computer Science  
The Link the Wiki track at INEX 2008 offered two tasks, file-to-file link discovery and anchor-to-BEP link discovery. In the former 6600 topics were used and in the latter 50 were used.  ...  Manual assessment of the anchor-to-BEP runs was performed using a tool developed for the purpose.  ...  Introduction Trotman & Geva [1] introduced the Link the Wiki task in 2006. It ran at INEX for the first time in 2007 [2] . This contribution discusses the track as it was run in 2008.  ... 
doi:10.1007/978-3-642-03761-0_32 fatcat:rmosxjznpbervl64uslgiehkjy

Link-the-Wiki: Performance Evaluation Based on Frequent Phrases [chapter]

Mao-Lung Chen, Richi Nayak, Shlomo Geva
2009 Lecture Notes in Computer Science  
In this paper, we discuss our participation to the INEX 2008 Linkthe-Wiki track. We utilized a sliding window based algorithm to extract the frequent terms and phrases.  ...  Using the extracted phrases and term as descriptive vectors, the anchors and relevant links (both incoming and outgoing) are recognized efficiently.  ...  One of the research tracks organized by INEX is Link-the-Wiki, which was introduced on 2006 [1] . The objective of this track is to automatically discover the hyperlinks among Wikipedia web pages.  ... 
doi:10.1007/978-3-642-03761-0_33 fatcat:vqqho5kc7rdehamvybwraepely

Report on INEX 2012

P. Bellot, M. Marx, A. Mishra, V. Moriceau, J. Mothe, M. Preminger, G. Ramírez, M. Sanderson, E. Sanjuan, F. Scholer, A. Schuh, T. Chappell (+12 others)
2012 SIGIR Forum  
This paper reports on the INEX'12 evaluation campaign, which consisted of a five tracks: Linked Data, Relevance Feedback, Snippet Retrieval, Social Book Search, and Tweet Contextualization.  ...  INEX'12 was an exciting year for INEX in which we joined forces with CLEF and for the first time ran our workshop as part of the CLEF labs in order to facilitate knowledge transfer between the evaluation  ...  Collection The Snippet Retrieval Track uses the INEX Wikipedia collection introduced in 2009-an XML version of the English Wikipedia, based on a dump taken on 8 October 2008, and semantically annotated  ... 
doi:10.1145/2422256.2422264 fatcat:eueapb4nanhcxbruxpppgb5pda

Overview of the INEX 2010 Link the Wiki Track [chapter]

Andrew Trotman, David Alexander, Shlomo Geva
2011 Lecture Notes in Computer Science  
The Link the Wiki track at INEX 2008 offered two tasks, file-to-file link discovery and anchor-to-BEP link discovery. In the former 6600 topics were used and in the latter 50 were used.  ...  Manual assessment of the anchor-to-BEP runs was performed using a tool developed for the purpose.  ...  Introduction Trotman & Geva [1] introduced the Link the Wiki task in 2006. It ran at INEX for the first time in 2007 [2] . This contribution discusses the track as it was run in 2008.  ... 
doi:10.1007/978-3-642-23577-1_22 fatcat:shyvfv46wbhtpi4tbeg4llh5ue

Overview of the INEX 2009 Link the Wiki Track [chapter]

Wei Che Huang, Shlomo Geva, Andrew Trotman
2010 Lecture Notes in Computer Science  
The Link the Wiki track at INEX 2008 offered two tasks, file-to-file link discovery and anchor-to-BEP link discovery. In the former 6600 topics were used and in the latter 50 were used.  ...  Manual assessment of the anchor-to-BEP runs was performed using a tool developed for the purpose.  ...  Introduction Trotman & Geva [1] introduced the Link the Wiki task in 2006. It ran at INEX for the first time in 2007 [2] . This contribution discusses the track as it was run in 2008.  ... 
doi:10.1007/978-3-642-14556-8_31 fatcat:r7mw2jxjyvfyzletpir5ou2yq4

XCFS

Sangeetha Kutty, Richi Nayak, Yuefeng Li
2009 Proceeding of the 18th ACM conference on Information and knowledge management - CIKM '09  
An XML clustering algorithm should process both structural and content information of XML documents in order to improve the accuracy and meaning of the clustering solution.  ...  This paper introduces a novel approach that first determines structural similarity in the form of frequent subtrees and then uses these frequent subtrees to represent the constrained content of the XML  ...  Report on the XML mining track at INEX 2005 and INEX 2006: categorization and clustering of XML documents. ACM SIGIR Forum. 41(1): 79-90.  ... 
doi:10.1145/1645953.1646216 dblp:conf/cikm/KuttyNL09 fatcat:abj64ezia5ayhbrfd4mgteqmci
« Previous Showing results 1 — 15 out of 75 results