Source Retrieval and Text Alignment Corpus Construction for Plagiarism Detection

Leilei Kong, Zhimao Lu, Yong Han, Haoliang Qi, Zhongyuan Han, Qibo Wang, Zhenyuan Hao, Jing Zhang
2015 Conference and Labs of the Evaluation Forum  
For the task of source retrieval, we focus on the process of Download Filtering. For the process from chunking to search control, we aim at high recall, and for the process of download filtering, we devote to improve precision. A vote-based approach and a classification-based approach are incorporated to filter the searching results to get the plagiarism sources. For the task of text alignment corpus construction, we describe the methods we use to construct the Chinese plagiarism cases. At
more » ... we report the statistics of text alignment dataset submissions.
dblp:conf/clef/KongLHQHWHZ15 fatcat:hdjgh7knw5f2vlyef6t5aggeqm