Effective collection metasearch in a hierarchical environment

Jack G. Conrad, Changwen Yang, Joanne S. Claussen
2002 Proceedings of the 25th annual international ACM SIGIR conference on Research and development in information retrieval - SIGIR '02  
We compare standard global IR searching with user-centric localized techniques to address the database selection problem. We conduct a series of experiments to compare the retrieval effectiveness of three separate search modes applied to a hierarchically structured data environment of textual database representations. The data environment is represented as a tree-like directory containing over 15,000 unique databases and over 100,000 total leaf nodes. Our search modes consist of varying degrees
more » ... of browse and search, from a global search at the root node to a refined search at a subnode using dynamically-calculated inverse document frequencies (idf s) to score candidate databases for probable relevance. Our findings indicate that a browse and search approach that relies upon localized searching from sub-nodes is capable of producing the most effective results.
doi:10.1145/564376.564449 dblp:conf/sigir/ConradYC02 fatcat:sjoyrq23ifg77m3u3y3wgctkci