A copy of this work was available on the public web and has been preserved in the Wayback Machine. The capture dates from 2022; you can also visit the original URL.
The file type is application/pdf
.
Document Clustering based on Phrase and Single Term Similarity using Neo4j
2020
VOLUME-8 ISSUE-10, AUGUST 2019, REGULAR ISSUE
Document similarity generally rely on single term similarity such as cosine similarity. To achieve better document similarity, along with single term phrase- more informative feature can be used. To find out shared phrases across the corpus the Document Index graph (DIG) representation model is used. Document representation - DIG model incrementally construct the graph and simultaneously finds the shared phrase between current document and previously inserted documents from the graph. The
doi:10.35940/ijitee.c9050.109320
fatcat:t4a24pa3argxdecpi66t5sqchi