Filters








50 Hits in 4.3 sec

Using Local Text Similarity in Pairwise Document Analysis for Monolingual Plagiarism Detection [chapter]

Nava Ehsan, Azadeh Shakery
2018 Lecture Notes in Computer Science  
We will report our monolingual plagiarism detection system which is used to process the Persian plagiarism corpus for the task of pairwise document similarity.  ...  The task of plagiarism detection entails two main steps, suspicious candidate retrieval and pairwise document similarity analysis also called detailed analysis.  ...  We introduce a pairwise document analysis approach for Persian language. An approach based on a vector space model is described for computing pairwise document similarity.  ... 
doi:10.1007/978-3-319-73606-8_9 fatcat:k63odg25fvawxfbptss73bqvte

Detecting Cross-Lingual Plagiarism Using Simulated Word Embeddings [article]

Victor Thompson
2018 arXiv   pre-print
One of the most common methods for detecting CLP requires online machine translators (such as Google or Microsoft translate) which are not always available, and given that plagiarism detection typically  ...  involves large document comparison, the amount of translations required would overwhelm an online machine translator, especially when detecting plagiarism over the web.  ...  Extensive studies have been carried out on monolingual plagiarism analysis which involves searching for plagiarism in documents of the same language, but CLP still remains a challenge.  ... 
arXiv:1712.10190v2 fatcat:fd3ukosck5fv7fqs3o5why7vpq

Reducing computational effort for plagiarism detection by using citation characteristics to limit retrieval space

Norman Meuschke, Bela Gipp
2014 IEEE/ACM Joint Conference on Digital Libraries  
This paper proposes a hybrid approach to plagiarism detection in academic documents that integrates detection methods using citations, semantic argument structure, and semantic word similarity with character-based  ...  methods to achieve a higher detection performance for disguised plagiarism forms.  ...  ACKNOWLEDGMENTS The authors thank the German Academic Exchange Service (DAAD) for its support.  ... 
doi:10.1109/jcdl.2014.6970168 dblp:conf/jcdl/MeuschkeG14 fatcat:vie6oxpm2zfzpgt3jlmxrz6iom

Academic Plagiarism Detection

Tomáš Foltýnek, Norman Meuschke, Bela Gipp
2019 ACM Computing Surveys  
Academic Plagiarism Detection: A Systematic Literature Review 112:3 a. Did researchers propose conceptually new approaches for this task? b.  ...  What are the major developments in the research on computational methods for plagiarism detection in academic documents since our last literature review in 2013?  ...  Translation with monolingual analysis is a widely used approach.  ... 
doi:10.1145/3345317 fatcat:yk6f5xl2kvdxlhvsolem6zfdsu

A Resource-Light Method for Cross-Lingual Semantic Textual Similarity [article]

Goran Glavaš, Marc Franco-Salvador, Simone Paolo Ponzetto, Paolo Rosso
2018 arXiv   pre-print
In contrast, we propose an unsupervised and a very resource-light approach for measuring semantic similarity between texts in different languages.  ...  Requiring only a limited-size set of word translation pairs between the languages, the proposed approach is applicable to virtually any pair of languages for which there exists a sufficiently large corpus  ...  Table 7 : 7 Performance analysis on the task of cross-lingual plagiarism detection (in terms of R@k, where k = {1, 5, 10, 20}).  ... 
arXiv:1801.06436v1 fatcat:hftd7zwksjgnjdsp2s5csbybbu

A resource-light method for cross-lingual semantic textual similarity

Goran Glavaš, Marc Franco-Salvador, Simone P. Ponzetto, Paolo Rosso
2018 Knowledge-Based Systems  
In contrast, we propose an unsupervised and a very resource-light approach for measuring semantic similarity between texts in different languages.  ...  Requiring only a limited-size set of word translation pairs between the languages, the proposed approach is applicable to virtually any pair of languages for which there exists a sufficiently large corpus  ...  Table 7 : 7 Performance analysis on the task of cross-lingual plagiarism detection (in terms of R@k, where k = {1, 5, 10, 20}).  ... 
doi:10.1016/j.knosys.2017.11.041 fatcat:3ii72eswyfaetdv6gkzfisdi6i

Understanding Plagiarism Linguistic Patterns, Textual Features, and Detection Methods

Salha M. Alzahrani, Naomie Salim, Ajith Abraham
2012 IEEE Transactions on Systems Man and Cybernetics Part C (Applications and Reviews)  
Our study corroborates that existing systems for plagiarism detection focus on copying text but fail to detect intelligent plagiarism when ideas are presented in different words.  ...  Systematic frameworks and methods of monolingual, extrinsic, intrinsic, and cross-lingual plagiarism detection are surveyed and correlated with plagiarism types, which are listed in the taxonomy.  ...  [15] used semantic features for similarity analysis and obfuscated plagiarism detection.  ... 
doi:10.1109/tsmcc.2011.2134847 fatcat:umjzayni2bdobiwpkdtu7upir4

Meta-Analysis of Cross-Language Plagiarism and Self-Plagiarism Detection Methods for Russian-English Language Pair

Alina Tlitova, Alexander Toschev, Max Talanov, Vitaliy Kurnosov
2020 Frontiers in Computer Science  
Citation: Tlitova A, Toschev A, Talanov M and Kurnosov V (2020) Meta-Analysis of Cross-Language Plagiarism and Self-Plagiarism Detection Methods for Russian-English Language Pair. Front. Comput.  ...  Identification of translated plagiarism is a complex task, and there are almost no such tools for this purpose on the Russian market now.  ...  CONCLUSION We conducted a meta-analysis of approaches used for detection of cross-language plagiarism and studied the methods for identifying cross-language plagiarism during the course of this research  ... 
doi:10.3389/fcomp.2020.523053 fatcat:2eqhh655mbb3jedkai74igfk7q

Cross-Language High Similarity Search: Why No Sub-linear Time Bound Can Be Expected [chapter]

Maik Anderka, Benno Stein, Martin Potthast
2010 Lecture Notes in Computer Science  
Use cases for this task include cross-language plagiarism detection and translation search.  ...  Given a collection D of documents and a query q in a language different from the language of D, the task is to retrieve highly similar documents with respect to q.  ...  This is illustrated by an empirical analysis of different fingerprinting approaches in a monolingual high similarity search scenario, see Figure 2 .  ... 
doi:10.1007/978-3-642-12275-0_66 fatcat:pqn4y4dzbbb4dgbj5yisdj6wgq

Graph transformer for cross-lingual plagiarism detection

Oumaima Hourrane, El Habib Benlahmar
2022 IAES International Journal of Artificial Intelligence (IJ-AI)  
Current cross-lingual plagiarism detection approaches usually employ syntactic and lexical properties, external machine translation systems, or finding similarities within a multilingual set of text documents  ...  detection approaches with and without paraphrasing cases, and provides further insights on the use of knowledge graphs on a language-independent model.  ...  EXPERIMENTS We evaluate and compare our CL-GTA for plagiarism detection model with several state-of-the-art approaches in the task of cross-lingual plagiarism analysis.  ... 
doi:10.11591/ijai.v11.i3.pp905-915 fatcat:ikrbf7buwjconfxwxvtctyiqze

Analyzing Non-Textual Content Elements to Detect Academic Plagiarism

Norman Meuschke, Bela Gipp, Harald Reiterer, Michael L. Nelson
2021 Zenodo  
The study presents the weaknesses of current detection approaches for identifying strongly disguised plagiarism.  ...  The thesis addresses this problem by proposing plagiarism detection approaches that implement a different concept—analyzing non-textual content in academic documents, such as citations, images, and mathematical  ...  In intrinsic plagiarism detection, a combined analysis of stylometric features is the standard approach [484] .  ... 
doi:10.5281/zenodo.4913344 fatcat:xmpaahvwuva53l5l5i2gaidvi4

Analyzing Mathematical Content to Detect Academic Plagiarism

Norman Meuschke, Moritz Schubotz, Felix Hamborg, Tomas Skopal, Bela Gipp
2017 Proceedings of the 2017 ACM on Conference on Information and Knowledge Management - CIKM '17  
From this investigation, we derive possible feature selection and feature comparison strategies for developing math-based detection approaches and a ground truth for our experiments.  ...  Third, we develop a first math-based detection approach by implementing and evaluating different feature comparison approaches using an open source parallel data processing pipeline built using the Apache  ...  To detect disguised forms of academic plagiarism, researchers have proposed a variety of monolingual approaches employing semantic and syntactic feature analysis, crosslingual IR methods, and language  ... 
doi:10.1145/3132847.3133144 dblp:conf/cikm/MeuschkeSHSG17 fatcat:ds63cywbxrcvlku7ojnwcod7ca

Citation-based plagiarism detection: Practicability on a large-scale scientific corpus

Bela Gipp, Norman Meuschke, Corinna Breitinger
2014 Journal of the Association for Information Science and Technology  
for document verification.  ...  A recently proposed language-independent approach to plagiarism detection, Citation-based Plagiarism Detection (CbPD), allows the detection of semantic similarity even in the absence of text overlap by  ...  Acknowledgments We acknowledge Mario Lipinski, André Gernandt, Leif Timm, Markus Bruns, Markus Föllmer, and Rebecca Böttche for their contributions to improving the CitePlag prototype.  ... 
doi:10.1002/asi.23228 fatcat:4dgqulgwobhkbeafedqeov26ai

Evaluation of Different Plagiarism Detection Methods: A Fuzzy MCDM Perspective

Kamal Mansour Jambi, Imtiaz Hussain Khan, Muazzam Ahmed Siddiqui
2022 Applied Sciences  
As a result, the study serves as a "blueprint" for constructing the next generation of plagiarism-checking tools.  ...  Plagiarism-checking software programs are useful for detecting plagiarism in examinations, projects, publications, and academic research.  ...  Acknowledgments: The authors thank Science and Technology Unit, King Abdulaziz University for the technical support. Conflicts of Interest: The authors declare no conflict of interest.  ... 
doi:10.3390/app12094580 fatcat:ahqjb3djwvgyln76fwe4xopfza

Cross-Language High Similarity Search Using a Conceptual Thesaurus [chapter]

Parth Gupta, Alberto Barrón-Cedeño, Paolo Rosso
2012 Lecture Notes in Computer Science  
This work addresses the issue of cross-language high similarity and near-duplicates search, where, for the given document, a highly similar one is to be identified from a large cross-language collection  ...  We propose a concept-based similarity model for the problem which is very light in computation and memory.  ...  , document clustering, plagiarism detection and retrieval by example.  ... 
doi:10.1007/978-3-642-33247-0_8 fatcat:5ts5ddoakjaffdaugfdwl6fyqm
« Previous Showing results 1 — 15 out of 50 results