A Framework for Comparing Groups of Documents

Arun Maiya
2015 Proceedings of the 2015 Conference on Empirical Methods in Natural Language Processing  
We present a general framework for comparing multiple groups of documents. A bipartite graph model is proposed where document groups are represented as one node set and the comparison criteria are represented as the other node set. Using this model, we present basic algorithms to extract insights into similarities and differences among the document groups. Finally, we demonstrate the versatility of our framework through an analysis of NSF funding programs for basic research.
doi:10.18653/v1/d15-1100 dblp:conf/emnlp/Maiya15 fatcat:twb2prwjibaerd7tj46nnkqv2u