A Framework for Plagiarism Detection in Arabic Documents

Imtiaz Hussain Khan, Muazzam Ahmed Siddiqui, Kamal Mansoor Jambi, Abobakr Ahmed Bagais
2015 Computer Science & Information Technology ( CS & IT )   unpublished
We are developing a web-based plagiarism detection system to detect plagiarism in written Arabic documents. This paper describes the proposed framework of our plagiarism detection system. The proposed plagiarism detection framework comprises of two main components, one global and the other local. The global component is heuristics-based, in which a potentially plagiarized given document is used to construct a set of representative queries by using different best performing heuristics. These
more » ... ies are then submitted to Google via Google's search API to retrieve candidate source documents from the Web. The local component carries out detailed similarity computations by combining different similarity computation techniques to check which parts of the given document are plagiarised and from which source documents retrieved from the Web. Since this is an ongoing research project, the quality of overall system is not evaluated yet.
doi:10.5121/csit.2015.50201 fatcat:e5g3nshdy5dmjomwbd7r45ccmu