Authorship Verification based on Lucene architecture

Zhihao Liao, Yong Han, Leilei Kong, Zhuopeng Hong, Zijian Li, Guiyuan Liang, Zhenwei Mo, Zhixian Li, Zhongyuan Han
2021 Conference and Labs of the Evaluation Forum  
Authorship verification is the task of deciding whether two texts have been written by the same author based on comparing the texts' writing styles.  ...  Then a weighted score is computed to decide whether two documents come from the same author. Using this simple-minded approach, we achieved 0.3032 on the overall score.  ...  Acknowledgments This work is supported by the National Natural Science Foundation of China (No.61806075 and No.61772177), and the Social Science Foundation of Guangdong Province (No. GD20CTS02).  ... 
dblp:conf/clef/LiaoHKH0LMLH21 fatcat:4btfbhb7zjdijciwycwyagrgqy

Overview of the Style Change Detection Task at PAN 2021

Eva Zangerle, Maximilian Mayerl, Martin Potthast, Benno Stein
2021 Conference and Labs of the Evaluation Forum  
In this year's PAN style change detection task, we asked the participants to answer the following questions: (1) Given a document, was it written by a single or by multiple authors?  ...  ., assign all paragraphs of a text uniquely to some author, given the list of authors assumed for the multi-author document.  ...  The core of the approach hence is an authorship verification model which the authors use to determine whether two given paragraphs are written by the same author.  ... 
dblp:conf/clef/ZangerleMP021 fatcat:qhcotsy2erdlton6dchtqua5m4

Authorship Identification Using Correlations of Frequent Features Notebook for PAN at CLEF 2013

Timo Petmanson
2013 Conference and Labs of the Evaluation Forum  
By hypothesizing that the distances of different documents written by the same author belong to the same distribution, we can provide educated guesses for the authorship identification.  ...  In this work, we explore how well we can perform the PAN'13 authorship identification task by using correlation of frequent significant features found in documents written by a known author.  ...  We use Student's t-test to determine if the distances come from the same distribution, hence the same author.  ... 
dblp:conf/clef/Petmanson13 fatcat:3l2mgqpgsfdltl6kg7o4wc7sea

Style Change Detection with Feed-forward Neural Networks

Chaoyuan Zuo, Yu Zhao, Ritwik Banerjee
2019 Conference and Labs of the Evaluation Forum  
This scenario, however, is not applicable to documents written by more than one author.  ...  To this end, we present a system consisting of two modules, one for distinguishing the single-author documents from the multiauthor documents and the other for determining the exact number of authors in  ...  Over 40k segment pairs are selected from the documents. For half of the pairs, the two segments are written by the same author and we treat them as one category.  ... 
dblp:conf/clef/ZuoZB19 fatcat:r2ynesgverak3afyqqanwunuye

Authorship Verification, combining Linguistic Features and Different Similarity Functions

Daniel Castro-Castro, Yaritza Adame Arcia, María Peláez Brioso, Rafael Muñoz
2015 Conference and Labs of the Evaluation Forum  
if it exceeds the average of similarity obtained between texts written by him and if it got the major value comparing the average similarity with the rest of the authors.  ...  Using this idea, a text that was not written by an author, would not exceed the average of similarity with known texts and a text of unknown authorship would be considered as written by the author, only  ...  Acknowledgements This research has been partially funded by the Spanish Ministry of Science and Innovation (TIN2012-38536-C03-03)  ... 
dblp:conf/clef/CastroAPM15 fatcat:qdfxkbhu7febbf2surik7bndti

UniNE at PAN-CLEF 2021: Authorship Verification

Catherine Ikae
2021 Conference and Labs of the Evaluation Forum  
The task is to verify if a given pair of text is written by the same or different authors. The training sample contains verification cases from previously unseen authors and topics.  ...  Transforming this question into a similarity problem, we can determine whether one or two authors have written a given text pair.  ...  Feature Selection To determine whether two text chunks have been written by the same author, we need to determine a text representation that can characterize the stylistic idiosyncrasies of each possible  ... 
dblp:conf/clef/Ikae21a fatcat:f76e53472revxclcyhg7ez5wm4

A Language Independent Author Verifier Using Fuzzy C-Means Clustering

Pashutan Modaresi, Philipp Gross
2014 Conference and Labs of the Evaluation Forum  
The author verification task presents participants with a set of problems where each problem consists of a set of documents written by the same author and a questioned document with an unknown author.  ...  The task is then to decide whether the questioned document has the same author as the other documents or not.  ...  Given a set of documents written by a single author, the task here is to determine whether a questioned document is written by the same author or not.  ... 
dblp:conf/clef/ModaresiG14 fatcat:ptvdgxdc6bhgpcqzvlq22j23mq

Writing Style Change Detection on Multi-Author Documents

Rhia Singh, Janith Weerasinghe, Rachel Greenstadt
2021 Conference and Labs of the Evaluation Forum  
classifier, together with the labels indicating if the two paragraphs were written by the same author or not.  ...  We extracted stylometric features from each paragraph in each document and used the absolute differences between the feature vectors corresponding to pairs of paragraphs as input to a Logistic Regression  ...  Our work was supported by the National Science Foundation under grant 1931005 and the McNulty Foundation.  ... 
dblp:conf/clef/SinghWG21 fatcat:zougfjfg5rfbjbizro2ublhjhm

Dynamic Similarity Threshold in Authorship Verification: Evidence from Classical Arabic

Hossam Ahmed
2017 Procedia Computer Science  
Documents that score below that threshold are rejected as not written by the same author. Current definitions of θ rely on both negative and positive training input.  ...  The authors propose that, compared to texts written by different authors, long texts written by the same author differ in only a small number of shared features.  ...  The goal of this step is to determine what similarity value is a threshold θ that constitutes a cutoff point for accepting a suspect document as being written by the same author.  ... 
doi:10.1016/j.procs.2017.10.103 fatcat:yuyltuvixbdafag4jd5e7qgagi

A Novel Approach for Authorship Verification using Similarity Measure

2020 International journal for innovative engineering and management research  
Authorship verification is a task of identifying whether two text documents are written by the same author or not by evaluating the veracity and authenticity of writings.  ...  Based on the threshold value of similarity score, the author of a test document is verified whether the test document is written by the suspected author or not.  ...  The Authorship Verification task is comparing two documents and automatically determines if they are written by the same author or not.  ... 
doi:10.48047/ijiemr/v09/i12/83 fatcat:fri65pdlkfaulog7xftoryzg3i

Authorship Verification with Entity Coherence and Other Rich Linguistic Features Notebook for PAN at CLEF 2013

Vanessa Wei Feng, Graeme Hirst
2013 Conference and Labs of the Evaluation Forum  
For texts written in languages other than English, some stylometric features are unavailable due to the lack of appropriate NLP tools, and their coherence features are derived from their translations produced  ...  accuracy of 68.2%, and roughly the same performance across three languages.  ...  Introduction Authorship verification, a sub-task of authorship identification, deals with the demand of identifying whether two documents are written by the same author or not.  ... 
dblp:conf/clef/FengH13 fatcat:hgx6xgb7cjgk3lyz53iwyod3cy

Unsupervised Decomposition of a Multi-Author Document Based on Naive-Bayesian Model

Khaled Aldebei, Xiangjian He, Jie Yang
2015 Proceedings of the 53rd Annual Meeting of the Association for Computational Linguistics and the 7th International Joint Conference on Natural Language Processing (Volume 2: Short Papers)  
We assume that we do not know anything about the document and the authors, except the number of the authors of that document.  ...  Experimental results show that the proposed method outperforms two state-of-the-art methods.  ...  It is obvious that our method achieves high accuracies even when the documents are written by more than two authors.  ... 
doi:10.3115/v1/p15-2082 dblp:conf/acl/AldebeiH015 fatcat:wnsjlvgqezae7o7wtwsip7gwxa

Author clustering with the Aid of a Simple Distance Measure

Houda Alberts
2017 Conference and Labs of the Evaluation Forum  
A simple distance measure has been applied to the author clustering problem to determine which documents are written by the same author.  ...  The top most frequent features k, where k is chosen to be 300, determine the distribution where punctuation is present.  ...  Two parts are present during this task: the establishment of links between documents (denoting that two documents are written by same author) and the clustering task.  ... 
dblp:conf/clef/Alberts17 fatcat:3fimuvm7yfgedjiotmdeurd2vm

Automatically Identifying Pseudepigraphic Texts

Moshe Koppel, Shachar Seidman
2013 Conference on Empirical Methods in Natural Language Processing  
The identification of pseudepigraphic texts (texts not written by the authors to which they are attributed) has important historical, forensic and commercial applications.  ...  The idea is to identify textual outliers in a corpus based on the pairwise similarities of all documents in the corpus.  ...  al. 2011), in which we need to determine if two, possibly short, documents were written by the same author.  ... 
dblp:conf/emnlp/KoppelS13 fatcat:qoossojsp5d2jjuwghywhn46jy

Language Bias in the Google Scholar Ranking Algorithm

Cristòfol Rovira, Lluís Codina, Carlos Lopezosa
2021 Future Internet  
already present and which are taken into account by the algorithm.  ...  This study seeks to advance our knowledge in this line of research by determining whether the language in which a document is published is a positioning factor in the Google Scholar relevance ranking algorithm  ...  If the documents are written in the same language, this factor is overridden.  ... 
doi:10.3390/fi13020031 fatcat:k53ozqjjljacxojrl555hpyku4