The implementation of web service based text preprocessing to measure Indonesian student thesis similarity level

Yan Watequlis Syaifudin, Pramana Yoga Saputra, Dwi Puspitasari, Ade Gafar Abdullah, Asep Bayu Dani Nandiyanto
2018 MATEC Web of Conferences  
The plagiarism of scientific work, especially undergraduate thesis, mostly happened in the college. In this research we used text mining, a new method which can be used to do the checking procedure, to obtain specific pattern of the document. After obtaining the document pattern, we compare the pattern with another document pattern. If the level of pattern similarity is high, it can be suspected as plagiarism. This paper will explain the development of the text preprocessing, a part of text
more » ... a part of text mining. We choosed Nazief and Adriani Algorithm as a text preprocessing algorithm for this research. This research will result a text preprocessing web service. The web service is expected to be used for further development of text mining.
doi:10.1051/matecconf/201819703019 fatcat:d3ud6ky5gzhqtk5q2vmznkaj5e