Multi-topical Discussion Summarization Using Structured Lexical Chains and Cue Words [chapter]

Jun Hatori, Akiko Murakami, Jun'ichi Tsujii
2011 Lecture Notes in Computer Science  
We propose a method for automatically summarizing threaded, multi-topical texts, particularly online discussions and e-mail conversations. These corpora have a so-called reply-to structure among the posts, in which multiple topics are discussed simultaneously with a certain level of continuity, although each post is typically short. We focus specifically on the multi-topical aspect of the corpora and propose the use of two linguistically motivated features: lexical chains and cue words, which ... the topics and topic structure. In particular, we introduce the structured lexical chain, which combines traditional lexical chains with the thread structure. In experiments, we show the effectiveness of these features on the Innovation Jam 2008 Corpus and the BC3 Mailing List Corpus in two task settings: key-sentence extraction and keyword extraction. We also present a detailed analysis of the results with some intuitive examples.
doi:10.1007/978-3-642-19437-5_26 fatcat:3onxjgc6ivfmxi2lwbex4jfcce