Filters








32 Hits in 4.9 sec

A Plagiarism Detection Approach Based on SVM for Persian Texts

Fezeh Esteki, Faramarz Safi Esfahani
2016 Forum for Information Retrieval Evaluation  
Numerous methods have been proposed to detect plagiarism in different languages; however, not a lot has been accomplished in Persian.  ...  The present study has utilized statistical and semantic features to determine the functionality of Support Vector Machines (SVMs) in detecting acts of plagiarism in Persian.  ...  METHODOLOGY The present study has proposed a new method based on SVM to detect plagiarism in Persian texts. Statistical attributes were used to train and apply the SVM.  ... 
dblp:conf/fire/EstekiE16 fatcat:tlxdq4xxkfcjhcq3hwqisppih4

A Text Alignment Algorithm Based on Prediction of Obfuscation Types Using SVM Neural Network

Fatemeh Mashhadirajab, Mehrnoush Shamsfard
2016 Forum for Information Retrieval Evaluation  
Then, we set the parameter values in our text alignment algorithm based on the detected type of obfuscation.  ...  For this purpose, we use SVM neural network for classification of documents according to the type of obfuscation strategy used in a document pair.  ...  Persian Plagdet 2016 competition 2 which is a subtask of PAN Fire 2016 competition 3 is held for Persian language. It means that the text alignment algorithms are evaluated on a Persian corpus.  ... 
dblp:conf/fire/MashhadirajabS16 fatcat:g5lybkuohzbmvge2sqnfu6lg2u

Algorithms and Corpora for Persian Plagiarism Detection [chapter]

Habibollah Asghari, Salar Mohtaj, Omid Fatemi, Heshaam Faili, Paolo Rosso, Martin Potthast
2018 Lecture Notes in Computer Science  
We organized the Persian PlagDet shared task at PAN 2016 in an effort to promote the comparative assessment of NLP techniques for plagiarism detection with a special focus on plagiarism that appears in  ...  a Persian text corpus.  ...  Our special thanks go to the renowned experts who served on the organizing committee for their contributions and devoted work to make this shared task possible.  ... 
doi:10.1007/978-3-319-73606-8_5 fatcat:76h7b4p7lfgbvo3pzip7bdhfaq

Scalable and language-independent embedding-based approach for plagiarism detection considering obfuscation type: no training phase

Erfaneh Gharavi, Hadi Veisi, Paolo Rosso
2019 Neural computing & applications (Print)  
Obfuscation type was also taken into account while comparing the text parts. In the proposed method, we employed two approaches for filtering the detected plagiarism cases.  ...  In this paper, we employ text embedding vectors to compare similarity among documents to detect plagiarism. Word vectors are combined by a simple aggregation function to represent a text document.  ...  Acknowledgement: The work of Paolo Rosso was partially funded by the Spanish MICINN under the research project MISMIS-FAKEnHATE on Misinformation and Miscommunication in social media: FAKE news and HATE  ... 
doi:10.1007/s00521-019-04594-y fatcat:tilsahe55vdhbl6pajcdtenogi

Academic Plagiarism Detection

Tomáš Foltýnek, Norman Meuschke, Bela Gipp
2019 ACM Computing Surveys  
Since we seek to cover the most influential papers on academic plagiarism detection, we consider a relevance ranking based on citation counts as an advantage rather than a disadvantage.  ...  Academic Plagiarism Detection: A Systematic Literature Review 112:3 a. Did researchers propose conceptually new approaches for this task? b.  ...  In 2015, the PAN organizers also introduced a shared task on plagiarism detection for Arabic texts [32] , followed by a shared task for Persian texts one year later [22] .  ... 
doi:10.1145/3345317 fatcat:yk6f5xl2kvdxlhvsolem6zfdsu

Plagiarism Detection Techniques for Arabic Script Languages: A Literature Review

Ribwar Ibrahim, Soran Saeed, Karzan Wakil
2017 Kurdistan Journal of Applied Research  
In this paper we investigate and review the plagiarism detection techniques and algorithms which have been developed for Arabic Script Languages (ASL), and providing a literature review of the utilized  ...  There are numerous of plagiarism detection techniques have been developed for various natural languages, mainly English.  ...  [37] Literal Text 2016 Persia n An extrinsic SVM- based the functionality and performance of SVM method to detect plagiarism in Persian texts was evaluated.  ... 
doi:10.24017/science.2017.3.1 fatcat:zwrivu4rlbajzdletr3nzdxze4

Source Retrieval and Text Alignment Corpus Construction for Plagiarism Detection

Leilei Kong, Zhimao Lu, Yong Han, Haoliang Qi, Zhongyuan Han, Qibo Wang, Zhenyuan Hao, Jing Zhang
2015 Conference and Labs of the Evaluation Forum  
A vote-based approach and a classification-based approach are incorporated to filter the searching results to get the plagiarism sources.  ...  For the task of source retrieval, we focus on the process of Download Filtering.  ...  A supervised learning method based on LDA(Linear Discriminant Analysis) was used to learn a classification model to decide which candidate plagiarism source was the positive detections before downloading  ... 
dblp:conf/clef/KongLHQHWHZ15 fatcat:hdjgh7knw5f2vlyef6t5aggeqm

Plagiarism Detection Based on Citing Sentences [chapter]

Sidik Soleman, Atsushi Fujii
2017 Lecture Notes in Computer Science  
Applying search engines is one of the plagiarism detection techniques that converts plagiarism patterns to search queries.  ...  In the Persian language, statements of suspicious documents are examined compared to an exact search approach.  ...  -Divides the text into sections of 5 sentences -Selection of the ten best phrases in each section based on the keywords extraction method BM25 and weighted tf.idf -A rating model based on SVM ranking for  ... 
doi:10.1007/978-3-319-67008-9_38 fatcat:vjf67csttbg4fch7rra427xh5u

Automatic Detection of Plagiarism in Writing

Mahshad Davoodifard
2022 Studies in Applied Linguistics & TESOL  
This paper reports on preliminary steps to create an external plagiarism detection tool.  ...  While extending the algorithm based on the suggested pipeline would allow for a more accurate evaluation of the model, manual comparison of sample documents provided some validity of the model developed  ...  of plagiarized texts in Persian.  ... 
doi:10.52214/salt.v21i2.9058 fatcat:tynajccqv5autgsxnfbaopebsy

Analyzing Non-Textual Content Elements to Detect Academic Plagiarism

Norman Meuschke, Bela Gipp, Harald Reiterer, Michael L. Nelson
2021 Zenodo  
The study presents the weaknesses of current detection approaches for identifying strongly disguised plagiarism.  ...  To demonstrate the benefit of combining non-textual and text-based detection methods, the thesis describes the first plagiarism detection system that integrates th [...]  ...  Section 5.1.1, p. 146) has provided approaches that could prove valuable for the plagiarism detection use case. The research on Math-based Plagiarism Detection is infant.  ... 
doi:10.5281/zenodo.4913344 fatcat:xmpaahvwuva53l5l5i2gaidvi4

Overview of the PAN/CLEF 2015 Evaluation Lab [chapter]

Efstathios Stamatatos, Martin Potthast, Francisco Rangel, Paolo Rosso, Benno Stein
2015 Lecture Notes in Computer Science  
In plagiarism detection, community-driven corpus construction is introduced as a new way of developing evaluation resources with diversity.  ...  A new corpus was built for this challenging, yet realistic, task covering four languages.  ...  Starting in 2012, we have completely overhauled our evaluation approach to plagiarism detection based on the insights gained from the previous years [42] .  ... 
doi:10.1007/978-3-319-24027-5_49 fatcat:fcpf2p7nujet5ez4zswoiscatq

Evaluating the effects of textual features on authorship attribution accuracy

Reza Ramezani, Navid Sheydaei, Mohsen Kahani
2013 ICCKE 2013  
This task is based on this assumption that the author of an unseen text can be discriminated by comparing some textual features extracted from that unseen text with those of texts with known authors.  ...  In this paper the effects of 29 different textual features on the accuracy of author identification on Persian corpora in 30 different scenarios are evaluated.  ...  AA is based on text representation.  ... 
doi:10.1109/iccke.2013.6682828 fatcat:zvsowujpffglhpcl3mhyieyo4u

Author Identification with Machine Learning Algorithms

İbrahim Yülüce, Feriştah Dalkılıç
2022 International journal of multidisciplinary studies and innovative technologies  
In this study, we conducted an experiment for the identification of the author of a Turkish language text by using classical machine learning methods including Support Vector Machines (SVM), Gaussian Naive  ...  Author identification is one of the application areas of text mining.  ...  The need to identify the content creator on the internet, detect plagiarism and prevent copyright infringement has increased the interest in authorship identification.  ... 
doi:10.36287/ijmsit.6.1.45 fatcat:4trjejori5frfaq2uhtet6xqji

Corpus-Based Paraphrase Detection Experiments and Review

Tedo Vrbanec, Ana Meštrović
2020 Information  
Paraphrase detection is important for a number of applications, including plagiarism detection, authorship attribution, question answering, text summarization, text mining in general, etc.  ...  Through a great number of experiments, we decided on the most appropriate approaches for text pre-processing: hyper-parameters, sub-model selection—where they exist (e.g., Skipgram vs.  ...  These features are then the input to a logistic classifier for PI. Gharavi et al. (2016) [27] proposed a "deep learning based method to detect plagiarism" in the Persian language.  ... 
doi:10.3390/info11050241 fatcat:ndgh7pl5graefgmzkzeppq7s4e

Network motifs for translator stylometry identification

Heba El-Fiqi, Eleni Petraki, Hussein A. Abbass, Diego Raphael Amancio
2019 PLoS ONE  
These results demonstrate that classic tools based on lexical features can be used for identifying translator stylometry if they get augmented with appropriate non-parametric scaling.  ...  In a two stage process, this paper first evaluates the use of existing lexical measures for the translator stylometry problem.  ...  For SVM, we used weka.classifiers.functions.SMO; which is based on Sequential Minimal Optimization algorithm for support vector machine [75] [76] .  ... 
doi:10.1371/journal.pone.0211809 pmid:30735512 pmcid:PMC6368295 fatcat:avcyogpbtbharbrpyiulpyvlza
« Previous Showing results 1 — 15 out of 32 results