Experiments on Document Chunking and Query Formation for Plagiarism Source Retrieval

Amit Prakash, Sujan Kumar Saha
2014 Conference and Labs of the Evaluation Forum  
Our work is focused on intelligent chunking of suspicious documents and a hybrid approach of query formation.  ...  The queries are then submitted to the ChatNoir search API to retrieve documents that are likely to be the sources of plagiarism.  ...  Query Formation and search control Forming query for ChatNoir search engine is a challenging task due to the fact that ChatNoir allows maximum 10 words per query to retrieve the sources.  ... 
dblp:conf/clef/PrakashS14 fatcat:r67boygidfgzzn245ecfyyjs6e

Approaches for candidate document retrieval

Simon Suchomel, Michal Brandejs
2014 2014 5th International Conference on Information and Communication Systems (ICICS)  
This paper describes an architecture and concepts of a realworld document retrieval system, which is a part of a general anti-plagiarism software.  ...  Plagiarism has become a serious problem mainly because of the electronically available documents. An online document retrieval is weighty part of a modern anti-plagiarism tool.  ...  ACKNOWLEDGMENT The authors would like to thank to the Information System of Masaryk University for creating an opportunity to improve the plagiarism issue in Europe.  ... 
doi:10.1109/iacs.2014.6841959 fatcat:whu7atptfzhyrgmb3pce4675ie

Source Retrieval for Plagiarism Detection

Šimon Suchomel, Michal Brandejs
2015 Journal of Advances in Information Technology  
Up to date systems for plagiarism detection are discussed from the source retrieval perspective. The key approaches of source retrieval are compared.  ...  Proper usage of such systems contributes to the gradual improvement of the quality of student theses.  Index Terms-plagiarism detection, plagiarism, source document retrieval, candidate document retrieval  ...  ACKNOWLEDGMENT The authors would like to thank to the Information System of Masaryk University for creating an opportunity to improve the plagiarism issue in Europe.  ... 
doi:10.12720/jait.6.1.18-26 fatcat:hhkvfizjyrdllfqxmnzhdsglfu

Overview of the 4th International Competition on Plagiarism Detection

Martin Potthast, Tim Gollub, Matthias Hagen, Johannes Kiesel, Maximilian Michel, Arnd Oberländer, Martin Tippmann, Alberto Barrón-Cedeño, Parth Gupta, Paolo Rosso, Benno Stein
2012 Conference and Labs of the Evaluation Forum  
We report on their performances for two sub-tasks of external plagiarism detection: candidate document retrieval and detailed document comparison.  ...  Furthermore, we introduce the PAN plagiarism corpus 2012, the TIRA experimentation platform, and the ChatNoir search engine for the ClueWeb.  ...  Acknowledgements We thank the participants of PAN for their dedicated work, and for being patient with our ever changing idea of how plagiarism detectors should be evaluated.  ... 
dblp:conf/clef/PotthastGHKMOTBGRS12 fatcat:uuppmo3hofbxbgfdilxxdsta3a

Plagiarism Detection Based on Citing Sentences [chapter]

Sidik Soleman, Atsushi Fujii
2017 Lecture Notes in Computer Science  
Applying search engines is one of the plagiarism detection techniques that converts plagiarism patterns to search queries.  ...  Generating suitable queries is the heart of this technique, and the existing methods suffer from the lack of producing accurate queries, Precision, and Speed of retrieved results.  ...  Experiment-3 studies the relation between the number of retrieved documents and Precision and Recall parameters for each query.  ... 
doi:10.1007/978-3-319-67008-9_38 fatcat:vjf67csttbg4fch7rra427xh5u

Overview of the 5th International Competition on Plagiarism Detection

Martin Potthast, Matthias Hagen, Tim Gollub, Martin Tippmann, Johannes Kiesel, Paolo Rosso, Efstathios Stamatatos, Benno Stein
2013 Conference and Labs of the Evaluation Forum  
We report on their performances for the two tasks source retrieval and text alignment of external plagiarism detection.  ...  This paper overviews 18 plagiarism detectors that have been evaluated within the fifth international competition on plagiarism detection at PAN 2013.  ...  Acknowledgements We thank the participating teams of this task for their devoted work.  ... 
dblp:conf/clef/PotthastHGTKRSS13 fatcat:p6avlw7pdzgc3moj74bmvebyay

Plagiarism Detection for Indonesian Texts

Lucia D. Krisnawati, Klaus U. Schulz
2013 Proceedings of International Conference on Information Integration and Web-based Applications & Services - IIWAS '13  
Evaluation Corpus We plan to enlarge our corpus by increasing the number of both source and test documents.  ...  the preliminary experiments and for building our evaluation corpus.  ...  • How does one formulate effective queries for retrieving all of these source documents?  ... 
doi:10.1145/2539150.2539213 dblp:conf/iiwas/KrisnawatiS13 fatcat:r6p2h4oiq5fi3mhlazokatknrq

Evaluating text reuse discovery on the web

Stanford Chiu, Ibrahim Uysal, W. Bruce Croft
2010 Proceeding of the third symposium on Information interaction in context - IIiX '10  
In this work, we 1) introduce a novel text reuse searching interface for the web, based on a previously proposed architecture, 2) evaluate its feasibility for hasty users, and 3) investigate techniques  ...  to improve both effectiveness and efficiency.  ...  Retrieval Methods Document Level Retrieval To create the initial document set we applied the iterative chunking (IC) method proposed in [1], and a simpler method based on the query n-grams (QN).  ... 
doi:10.1145/1840784.1840829 dblp:conf/iiix/ChiuUC10 fatcat:6mx7doamcnfxzbikocx355ee5u

Analyzing Non-Textual Content Elements to Detect Academic Plagiarism

Norman Meuschke, Bela Gipp, Harald Reiterer, Michael L. Nelson
2021 Zenodo  
Identifying academic plagiarism is a pressing problem, among others, for research institutions, publishers, and funding organizations.  ...  Subsequently, the thesis summarizes work that initiated the research on analyzing non-textual content elements to detect academic plagiarism by studying citation patterns in academic documents.  ...  The main differences between MathPD and mathematical document retrieval are query formulation and query processing.  ... 
doi:10.5281/zenodo.4913344 fatcat:xmpaahvwuva53l5l5i2gaidvi4

Study on Extrinsic Text Plagiarism Detection Techniques and Tools

K. Vani (Department of Computer Science & Engineering, Amrita School of Engineering, Amrita University, Amrita Vishwa Vidyapeetham, Bangalore, India), Deepa Gupta (Department of Mathematics, Amrita School of Engineering, Amrita University, Amrita Vishwa Vidyapeetham, Bangalore, India)
2016 Journal of Engineering Science and Technology Review  
In this paper, a study on plagiarism is done with the focus on extrinsic text plagiarism detection, which is a fast emerging research area in this domain.  ...  The paper also throws light on the popular PAN competition, which is conducted yearly since 2009 in plagiarism domain and the major tasks involved in it.  ...  Different levels of document chunking, viz., line chunks, word chunks, sentence chunks or some combination of them are employed for retrieving near duplicate sources.  ... 
doi:10.25103/jestr.095.02 fatcat:bdkkhdxfonbwnfztri4npt3vz4

Study on Extrinsic Text Plagiarism Detection Techniques and Tools

K. Vani, Deepa Gupta
2016 Journal of Engineering Science and Technology Review  
In this paper, a study on plagiarism is done with the focus on extrinsic text plagiarism detection, which is a fast emerging research area in this domain.  ...  The paper also throws light on the popular PAN competition, which is conducted yearly since 2009 in plagiarism domain and the major tasks involved in it.  ...  Different levels of document chunking, viz., line chunks, word chunks, sentence chunks or some combination of them are employed for retrieving near duplicate sources.  ... 
doi:10.25103/jestr.094.23 fatcat:2dpsg74yl5cnvdua5knu6bkhdy

Understanding Plagiarism Linguistic Patterns, Textual Features, and Detection Methods

Salha M. Alzahrani, Naomie Salim, Ajith Abraham
2012 IEEE Transactions on Systems Man and Cybernetics Part C (Applications and Reviews)  
Our study corroborates that existing systems for plagiarism detection focus on copying text but fail to detect intelligent plagiarism when ideas are presented in different words.  ...  The taxonomy supports deep understanding of different linguistic patterns in committing plagiarism, for example, changing texts into semantically equivalent but with different words and organization, shortening  ...  chunks with 30-character overlap, and retrieval of documents that share at least one fingerprint with d_q [4].  ... 
doi:10.1109/tsmcc.2011.2134847 fatcat:umjzayni2bdobiwpkdtu7upir4

State of the Art in Detecting Academic Plagiarism

Norman Meuschke, Bela Gipp
2013 Zenodo  
Proposed approaches for this task include intrinsic, cross-lingual and citation-based plagiarism detection.  ...  Each method offers unique strengths and weaknesses; however, none is currently mature enough for practical use.  ...  The sources of plagiarism are available on the internet, except for one document, which originated from a DVD encyclopedia.  ... 
doi:10.5281/zenodo.3482941 fatcat:e4bl72bt3nboxnjig5nvkpciv4

Detection of Plagiarism in Arabic Documents

Mohamed El Bachir Menai
2012 International Journal of Information Technology and Computer Science  
Many language-sensitive tools for detecting plagiarism in natural language documents have been developed, particularly for English.  ...  We evaluate its performance in terms of precision and recall on a large data set of Arabic documents, and show its capability in identifying direct and sophisticated copying, such as sentence reordering  ...  Acknowledgments This paper is an extended version of a conference paper presented at the 6th International Conference on Computer Science & Education (ICCSE 2011) [26] .  ... 
doi:10.5815/ijitcs.2012.10.10 fatcat:uc3sxp3ctrgvfemkcw3tx7dagy

Plagiarism Detection Technique using www and Wordnet

Kamlesh Sharma, Nidhi Garg, Arun Pandey, Daksh Yadav, Nikhil
2021 Indian Journal of Artificial Intelligence and Neural Networking  
This paper primarily focuses to detect the plagiarism in the suspicious document based on the meaning and linguistic variation of the content.  ...  The techniques used for this context is based on Natural language processing. In this Paper, we present how the semantic analysis and syntactic driven Parsing can be used to detect the plagiarism.  ...  This document contains the String to be checked against the Plagiarism over the internet. The file format of this document is usually like .txt, .doc, .pdf and another document format.  ... 
doi:10.35940/ijainn.b1015.061321 fatcat:lntiaulz2jexnma6tk3b3t63ni