A copy of this work was available on the public web and has been preserved in the Wayback Machine. The capture dates from 2017; you can also visit the original URL.
The file type is application/pdf
.
Clustering Abstracts Instead of Full Texts
[chapter]
2004
Lecture Notes in Computer Science
Accessibility of digital libraries and other web-based repositories has caused the illusion of accessibility of the full texts of scientific papers. However, in the majority of cases such an access (at least free access) is limited only to abstracts having no more then 50-100 words. Traditional keyword-based approach for clustering this type of documents gives unstable and imprecise results. We show that they can be easy improved with more adequate keyword selection and document similarity
doi:10.1007/978-3-540-30120-2_17
fatcat:dhpdcbkzqfgvfa36r3ejwo7m5e