Graph of Words
2017
Proceedings of the 26th International Conference on World Wide Web Companion - WWW '17 Companion
The bag-of-words model has been the dominant approach in IR and text mining for many years. It assumes word independence and uses term frequencies as the main feature for feature selection and for query-to-document similarity. Despite its long and successful use, bag-of-words ignores word order and distance within the document, which weakens the expressive power of the distance metrics. We propose graph-of-word, an alternative approach that capitalizes on a graph representation of documents
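To make the contrast in the abstract concrete, here is a minimal sketch of one way a document could be turned into a graph. The abstract does not specify the construction, so everything below is an assumption for illustration: the sliding-window co-occurrence rule, the directed edges, the `window` parameter, and the function names `bag_of_words` and `graph_of_words` are all hypothetical and not taken from this record.

```python
from collections import Counter


def bag_of_words(tokens):
    """Term-frequency view: word order and distance are discarded."""
    return Counter(tokens)


def graph_of_words(tokens, window=3):
    """Assumed construction: directed co-occurrence graph.

    An edge (u, v) is added when v appears after u within a span of
    `window` consecutive tokens (u included). The edge weight counts
    how often that happens. Self-loops are skipped.
    """
    edges = Counter()
    for i, u in enumerate(tokens):
        # Look ahead at the next (window - 1) tokens.
        for v in tokens[i + 1 : i + window]:
            if u != v:
                edges[(u, v)] += 1
    return edges


# Two toy documents with the same words in a different order.
doc_a = "graph of words".split()
doc_b = "words of graph".split()

same_bag = bag_of_words(doc_a) == bag_of_words(doc_b)
same_graph = graph_of_words(doc_a, window=2) == graph_of_words(doc_b, window=2)
print(same_bag, same_graph)
```

Under these assumptions the two toy documents have identical bags of words but different edge sets, which is the kind of order and distance information the abstract says bag-of-words loses.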
doi:10.1145/3041021.3055362
dblp:conf/www/Vazirgiannis17
fatcat:vlplcnl3abd2foyqnirsbp75wa