Non-hierarchical document clustering using the ICL distribution array processor

E. Rasmussen, P. Willett
1987 Proceedings of the 10th annual international ACM SIGIR conference on Research and development in information retrieval - SIGIR '87  
This paper considers the suitability and efficiency of a highly parallel computer, the ICL Distributed Array Processor (DAP), for document clustering. Algorithms are described for the implementation of the single-pass and reallocation clustering methods on the DAP and on a conventional mainframe computer. These methods are used to classify the Cranfield, Vaswani and UKCIS document test collections. The results suggest that the parallel architecture of the DAP is not well suited to the
more » ... ength records which characterise bibliographic data.
doi:10.1145/42005.42020 dblp:conf/sigir/RasmussenW87 fatcat:pbc666jdtzbibkjphxfbiksrgi