Clustering of Deep Contextualized Representations for Summarization of Biomedical Texts [article]

Milad Moradi, Matthias Samwald
2019 arXiv   pre-print
In recent years, summarizers that incorporate domain knowledge into the process of text summarization have outperformed generic methods, especially for summarization of biomedical texts. However, construction and maintenance of domain knowledge bases are resource-intense tasks requiring significant manual annotation. In this paper, we demonstrate that contextualized representations extracted from the pre-trained deep language model BERT, can be effectively used to measure the similarity between
more » ... sentences and to quantify the informative content. The results show that our BERT-based summarizer can improve the performance of biomedical summarization. Although the summarizer does not use any sources of domain knowledge, it can capture the context of sentences more accurately than the comparison methods. The source code and data are available at https://github.com/BioTextSumm/BERT-based-Summ.
arXiv:1908.02286v2 fatcat:3pa62vuvi5hnrg5fkvvm7fsjmq