Automatic External Plagiarism Detection Using Passage Similarities - Lab Report for PAN at CLEF 2010

Clara Vania, Mirna Adriani
2010 Conference and Labs of the Evaluation Forum  
In this paper, we report our approach to detecting external plagiarism. In the pre-processing stage, we identify non-English documents and translate them into English using an online translation tool. We then index the documents and retrieve the top documents most similar to each suspicious document. We divide the retrieved documents into passages of twenty sentences each. Plagiarism is detected by counting the words that overlap between suspicious and source passages.
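
The passage-splitting and word-overlap steps described in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: the sentence splitter, the tokenizer, the use of unique-word sets, the function names, and the `min_overlap` threshold are all assumptions made for illustration. Only the passage size of twenty sentences comes from the abstract.

```python
import re

PASSAGE_SIZE = 20  # sentences per passage, as stated in the abstract


def split_sentences(text):
    # Naive splitter (assumption): break after '.', '!' or '?' followed by whitespace.
    return [s.strip() for s in re.split(r"(?<=[.!?])\s+", text) if s.strip()]


def make_passages(text, size=PASSAGE_SIZE):
    # Group consecutive sentences into passages of `size` sentences;
    # the last passage may be shorter.
    sentences = split_sentences(text)
    return [" ".join(sentences[i:i + size]) for i in range(0, len(sentences), size)]


def words(passage):
    # Lowercased set of alphanumeric tokens (assumption: unique words, no stemming).
    return set(re.findall(r"[a-z0-9]+", passage.lower()))


def overlap_count(suspicious_passage, source_passage):
    # Number of distinct words shared by the two passages.
    return len(words(suspicious_passage) & words(source_passage))


def find_candidates(suspicious_text, source_text, min_overlap=5, size=PASSAGE_SIZE):
    # Compare every suspicious passage with every source passage and keep
    # pairs whose overlap reaches the hypothetical threshold `min_overlap`.
    hits = []
    source_passages = make_passages(source_text, size)
    for i, susp in enumerate(make_passages(suspicious_text, size)):
        for j, src in enumerate(source_passages):
            n = overlap_count(susp, src)
            if n >= min_overlap:
                hits.append((i, j, n))
    return hits
```

Each tuple returned by `find_candidates` holds the suspicious passage index, the source passage index, and the shared-word count. In practice the threshold would need tuning, and common function words would likely need filtering before counting.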
dblp:conf/clef/VaniaA10 fatcat:fms5msivujaijlijlxp4btlfbm