RMIT University at the TREC 2007 Enterprise Track

Mingfang Wu, Falk Scholer, Milad Shokouhi, Simon J. Puglisi, Halil Ali
2007 Text Retrieval Conference  
At TREC 2007, RMIT University participated in the document search task of the Enterprise Track. Our goals were to investigate: 1. Which sources of external evidence (anchor text, PageRank and Indegree) are useful for improving a document-based ranking scheme for a key page finding task?

Discussion

In the entry page finding task, the use of anchor text has significantly improved search results over using the content collection only; a similar result was not achieved here for the key document search task. Interpolating Indegree or PageRank with the content run provides a small but not significant improvement in precision. We observed that web pages in the CSIRO collection follow a common template: a global navigation bar at the top of the page, a local navigation bar on the left, a related-link area on the right, a copyright bar at the bottom, and a content area in the middle. We also observed that most links come from the non-content areas. This may explain why anchor text, Indegree and PageRank are more helpful for the entry page or named page finding tasks than for the key document finding task, which may require authored links from the content and/or related-link areas. We are conducting a post-hoc analysis of this issue.
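The interpolation of link evidence with the content run can be illustrated with a minimal sketch. The abstract does not give the formula, so everything here is an assumption made for illustration: the linear mixing form, the weight `lam`, the min-max normalisation, the log scaling of Indegree, and the function names `min_max` and `interpolate`.

```python
import math


def min_max(scores):
    """Rescale a {doc_id: score} mapping to the range [0, 1]."""
    if not scores:
        return {}
    lo, hi = min(scores.values()), max(scores.values())
    if hi == lo:
        # All scores are equal, so they carry no ranking information.
        return {doc: 0.0 for doc in scores}
    return {doc: (s - lo) / (hi - lo) for doc, s in scores.items()}


def interpolate(content_scores, indegree, lam=0.8):
    """Rank documents by lam * content + (1 - lam) * link evidence.

    content_scores: {doc_id: retrieval score from the content run}
    indegree:       {doc_id: number of incoming links}; missing means 0
    lam:            hypothetical mixing weight, not a value from the paper
    """
    content = min_max(content_scores)
    # log1p damps the heavy-tailed distribution of in-link counts (assumption).
    link = min_max(
        {doc: math.log1p(indegree.get(doc, 0)) for doc in content_scores}
    )
    combined = {
        doc: lam * content[doc] + (1 - lam) * link[doc]
        for doc in content_scores
    }
    return sorted(combined, key=combined.get, reverse=True)
```

Under these assumptions, a large `lam` keeps the content-run ordering while a small `lam` lets heavily linked pages rise. If most in-links come from navigation bars, as observed for the CSIRO pages, the link component would mostly reward template structure rather than authored endorsement.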
dblp:conf/trec/WuSSPA07 fatcat:dinx364b5bda3ggbbcade4abai