Information-content based sentence extraction for text summarization

D. Mallett, J. Elding, M.A. Nascimento
2004 International Conference on Information Technology: Coding and Computing, 2004. Proceedings. ITCC 2004.  
This paper proposes the FULL-COVERAGE summarizer: an efficient, information-retrieval-oriented method for extracting non-redundant sentences from text for summarization purposes. Our method leverages existing information retrieval technology by extracting key sentences on the premise that the relevance of a sentence is proportional to its similarity to the whole document. We show that our method can produce sentence-based summaries that are up to 78% smaller than the original text with only a 3% loss in retrieval performance.
doi:10.1109/itcc.2004.1286634 dblp:conf/itcc/MallettEN04 fatcat:kxfcsjbfunf3phbw3ugeyt3qcu
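
The abstract's premise, that a sentence's relevance is proportional to its similarity to the whole document, can be illustrated with a minimal sketch. This is not the paper's FULL-COVERAGE algorithm, whose details are not given in this record. It assumes raw term-frequency vectors, cosine similarity, and a greedy redundancy filter. The names `tokenize`, `cosine`, `summarize`, `max_sentences`, and `redundancy_threshold` are hypothetical and chosen only for illustration.

```python
import math
import re
from collections import Counter


def tokenize(text):
    """Lowercase the text and return its alphabetic tokens."""
    return re.findall(r"[a-z]+", text.lower())


def cosine(a, b):
    """Cosine similarity between two term-frequency Counters."""
    dot = sum(a[t] * b[t] for t in a if t in b)
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    if norm_a == 0 or norm_b == 0:
        return 0.0
    return dot / (norm_a * norm_b)


def summarize(sentences, max_sentences=2, redundancy_threshold=0.5):
    """Pick sentences most similar to the whole document, skipping near-duplicates.

    Assumption: both the scoring and the redundancy test are illustrative
    stand-ins, not the method described in the paper.
    """
    doc_vec = Counter(tokenize(" ".join(sentences)))
    sent_vecs = [Counter(tokenize(s)) for s in sentences]

    # Rank sentence indices by similarity to the whole document, best first.
    ranked = sorted(
        range(len(sentences)),
        key=lambda i: cosine(sent_vecs[i], doc_vec),
        reverse=True,
    )

    selected = []
    for i in ranked:
        if len(selected) >= max_sentences:
            break
        # Keep a sentence only if it is not too similar to any already chosen.
        if all(
            cosine(sent_vecs[i], sent_vecs[j]) < redundancy_threshold
            for j in selected
        ):
            selected.append(i)

    # Return the chosen sentences in their original document order.
    return [sentences[i] for i in sorted(selected)]


if __name__ == "__main__":
    doc = ["Cats chase mice.", "Cats chase mice.", "Dogs bark loudly."]
    print(summarize(doc))
```

On the three-sentence toy input, the duplicate sentence is dropped as redundant, so the summary keeps the first and third sentences. A realistic setup would likely use TF-IDF weighting and stop-word removal, which this sketch omits for brevity.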