Filters








42,989 Hits in 7.1 sec

Mobile information retrieval with search results clustering: Prototypes and evaluations

Claudio Carpineto, Stefano Mizzaro, Giovanni Romano, Matteo Snidero
2009 Journal of the American Society for Information Science and Technology  
We measure the effectiveness of their clustered results compared to a ranked list of results on a subtopic retrieval task, by means of the device-independent notion of subtopic reach time together with  ...  Next, we evaluate the retrieval performance of the three prototype systems.  ...  We also are indebted to Andrea Della Pietra, Luca Di Gaspero, and Annalisa Filardo for their help with preparation of the experiments.  ... 
doi:10.1002/asi.21036 fatcat:qctnkhps7bgb5boo3zyfutuata

Evaluating subtopic retrieval methods: Clustering versus diversification of search results

Claudio Carpineto, Massimiliano D'Amico, Giovanni Romano
2012 Information Processing & Management  
The main finding of our experiments is that diversification of top hits is more useful for quick coverage of distinct subtopics whereas clustering is better for full retrieval of single subtopics, with  ...  well on queries with low divergence subtopics, mainly due to the difficulty of generating discriminative cluster labels.  ...  Acknowledgments We would like to thank Stanislaw Osiński and Dawid Weiss for running Lingo and Lingo3G on the AMBIENT and ODP-239 test collections, and providing us with the results.  ... 
doi:10.1016/j.ipm.2011.08.004 fatcat:dszwmvo2mjginbvmem4e4raq6u

Flexible intrinsic evaluation of hierarchical clustering for TDT

James Allan, Ao Feng, Alvaro Bolivar
2003 Proceedings of the twelfth international conference on Information and knowledge management - CIKM '03  
We demonstrate that some obvious evaluation techniques fail for degenerate cases. For a few others we attempt to develop an intuitive sense of what the evaluation numbers mean.  ...  The Topic Detection and Tracking (TDT) evaluation program has included a "cluster detection" task since its inception in 1996.  ...  Any opinions, findings and conclusions or recommendations expressed in this material are the authors' and do not necessarily reflect those of the sponsor.  ... 
doi:10.1145/956863.956914 dblp:conf/cikm/AllanFB03 fatcat:vtkqmm6rqfclreiguu2uj7pcuy

Flexible intrinsic evaluation of hierarchical clustering for TDT

James Allan, Ao Feng, Alvaro Bolivar
2003 Proceedings of the twelfth international conference on Information and knowledge management - CIKM '03  
We demonstrate that some obvious evaluation techniques fail for degenerate cases. For a few others we attempt to develop an intuitive sense of what the evaluation numbers mean.  ...  The Topic Detection and Tracking (TDT) evaluation program has included a "cluster detection" task since its inception in 1996.  ...  Any opinions, findings and conclusions or recommendations expressed in this material are the authors' and do not necessarily reflect those of the sponsor.  ... 
doi:10.1145/956912.956914 fatcat:lg3f3etplnevvhjhvd6nyf4dfu

Mobile Clustering Engine [chapter]

Claudio Carpineto, Andrea Della Pietra, Stefano Mizzaro, Giovanni Romano
2006 Lecture Notes in Computer Science  
An experimental evaluation, besides confirming that finding information is more difficult on a PDA than on a desktop computer, suggests that mobile clustering engine is more effective than mobile search  ...  Although mobile information retrieval is seen as the next frontier of the search market, the rendering of results on mobile devices is still unsatisfactory.  ...  For topics 1 and 4, the clusters produced by the system were pretty good, which explains the good performance of clustering on PDA.  ... 
doi:10.1007/11735106_15 fatcat:6b4b66ms5fejni6b6ktyzjigcq

Reexamining the cluster hypothesis

Marti A. Hearst, Jan O. Pedersen
1996 Proceedings of the 19th annual international ACM SIGIR conference on Research and development in information retrieval - SIGIR '96  
We present Scatter/Gather, a cluster-based document browsing method, as an alternative to ranked titles for the organization and viewing of retrieval results.  ...  We systematically evaluate Scatter/Gather in this context and find significant improvements over similarity search ranking alone.  ...  but was not able to find an improvement using clustering with this strategy.  ... 
doi:10.1145/243199.243216 dblp:conf/sigir/HearstP96 fatcat:r3xc77iwdjgq7krnpwwojuqaly

Incremental cluster-based retrieval using compressed cluster-skipping inverted files

Ismail Sengor Altingovde, Engin Demir, Fazli Can, Özgür Ulusoy
2008 ACM Transactions on Information Systems  
We propose a unique cluster-based retrieval (CBR) strategy using a new cluster-skipping inverted file for improving query processing efficiency.  ...  Our experiments with various collections show that the incremental-CBR strategy using a compressed cluster-skipping inverted file significantly improves CPU time efficiency, regardless of query length.  ...  CONCLUSIONS AND FUTURE WORK Cluster-based retrieval (CBR) is a long-studied research area for improving efficiency and effectiveness of document retrieval.  ... 
doi:10.1145/1361684.1361688 fatcat:3iuznbyiobdzpp3m4ih7qvquuu

Evaluating exploratory visualization systems: A user study on how clustering-based visualization systems support information seeking from large document collections

Yujie Liu, Scott Barlowe, Yaqin Feng, Jing Yang, Min Jiang
2012 Information Visualization  
Our approach is built upon cognitive load theory that takes the users as well as the system as the foci of evaluation.  ...  Although many EVSs have been developed recently, there is a lack of general guidance on how to evaluate such systems.  ...  In the follow- Find as many distinct topics from the dataset as possible. Describe each of them using a few sentences.  ... 
doi:10.1177/1473871612459995 fatcat:mkxs5grpuveirf325ilehx3mzy

Phrase based Clustering Scheme of Suffix Tree Document Clustering Model

Anoop KumarJain, Satyam Maheshwari
2013 International Journal of Computer Applications  
Document clustering arises from information retrieval domains, and "It finds grouping for a set of documents belonging to the same cluster are similar and documents belongs to the different cluster are  ...  Information retrieval finds the file contents and identifies their similarity. It measures the performance of the documents by using the precision and recall.  ...  In order to evaluate the quality of the clustering, there are many different quality measures to access the cluster effectiveness.  ... 
doi:10.5120/10504-5273 fatcat:7n2urtkhdfa7hf5562rbntjxlu

Replicator Graph Clustering

Michael Donoser
2013 Procedings of the British Machine Vision Conference 2013  
In this paper we introduce an efficient, effective and scalable clustering method denoted as Replicator Graph Clustering.  ...  Individual steps have low computational complexity which leads to an efficient clustering method, scaling well with an increasing number of data points.  ...  In this field, manifold analysis is frequently used to improve the retrieval performance by re-evaluating the similarities between elements in the context of the entire database.  ... 
doi:10.5244/c.27.38 dblp:conf/bmvc/Donoser13 fatcat:xsbyxqkdfneyrey4mfcfls7qwm

The cluster hypothesis in information retrieval

Oren Kurland
2013 Proceedings of the 36th international ACM SIGIR conference on Research and development in information retrieval - SIGIR '13  
The cluster hypothesis • Historical view of the effect of the hypothesis on work on ad hoc information retrieval • Testing the cluster hypothesis • Cluster-based document retrieval • Using topic models  ...  , fusion, federated search, query expansion, microblog retrieval, relevance feedback, adversarial search • Concluding notes The ad hoc retrieval task • Ranking the documents in a corpus by their relevance  ...  list of documents) is relevant to a query • As it turns out, quite a few QPP and cluster ranking methods are based on the exact same principles • The geometric mean of retrieval scores in a result list  ... 
doi:10.1145/2484028.2484192 dblp:conf/sigir/Kurland13 fatcat:3lw6kkfzs5gsvmsvkpwlynck2u

Interactive Steering of Hierarchical Clustering

Weikai Yang, Xiting Wang, Jie Lu, Wenwen Dou, Shixia Liu
2020 IEEE Transactions on Visualization and Computer Graphics  
The quantitative evaluation and case study demonstrate that the proposed approach facilitates the building of customized clustering trees in an efficient and effective manner.  ...  the interactive steering of clustering through a visual interface (user-driven).  ...  The quantitative evaluation and case study demonstrate that the proposed approach facilitates the building of customized clustering trees in an efficient and effective manner.  ... 
doi:10.1109/tvcg.2020.2995100 pmid:32746252 fatcat:fdwsn4qkzffrjgn2hrjhb47bqe

Decentralized Probabilistic Text Clustering

Odysseas Papapetrou, Wolf Siberski, Norbert Fuhr
2012 IEEE Transactions on Knowledge and Data Engineering  
Extensive experimental evaluation with up to 1 million peers and 1 million documents demonstrates the scalability and effectiveness of the algorithm.  ...  It enables a peer to compare each of its documents only with very few selected clusters, without significant loss of clustering quality.  ...  A separate research stream is the usage of clustering to improve query routing efficiency. Peers are clustered by topic, and queries are primarily routed to members of the right cluster.  ... 
doi:10.1109/tkde.2011.120 fatcat:joxcz4hk65h3rbir5ceqqwaujq

A survey of Web clustering engines

Claudio Carpineto, Stanislaw Osiński, Giovanni Romano, Dawid Weiss
2009 ACM Computing Surveys  
A Survey of Web Clustering Engines 17:5 Section 7 is devoted to retrieval performance evaluation.  ...  We highlight the main characteristics of a number of existing Web clustering engines and also discuss how to evaluate their retrieval performance.  ...  EVALUATION OF RETRIEVAL PERFORMANCE In this section we address the issue of evaluating the retrieval performance of a clustering engine.  ... 
doi:10.1145/1541880.1541884 fatcat:e3ndkaq6ovhe3ep6kkamrtol3e

Semantic based Document Clustering: A Detailed Review

Neepa Shah, Sunita Mahajan
2012 International Journal of Computer Applications  
Following are few applications of document clustering [12] .  Finding Similar Documents: To find similar documents matching with the search result document.  ...  Thus, it is important for improving clustering efficiency and effectiveness.  ... 
doi:10.5120/8202-1598 fatcat:mb5hph2d6vhofmyxuyib7srgqq
« Previous Showing results 1 — 15 out of 42,989 results