
Authorship Attribution Using Text Distortion

Efstathios Stamatatos
2017 Proceedings of the 15th Conference of the European Chapter of the Association for Computational Linguistics: Volume 1, Long Papers  
Authorship attribution is associated with important applications in forensics and humanities research.  ...  Based on experiments on two main tasks in authorship attribution, closed-set attribution and authorship verification, we demonstrate that the proposed approach can enhance existing methods especially under  ...  In authorship attribution it is not always realistic to assume that the texts of known authorship and the texts under investigation belong in the same genre and are in the same thematic area.  ... 
doi:10.18653/v1/e17-1107 dblp:conf/eacl/Stamatatos17 fatcat:wwbnsijdtjch5ickkct55zbequ

Forensic Authorship Analysis of Microblogging Texts Using N-Grams and Stylometric Features [article]

Nicole Mariah Sharon Belvisi, Naveed Muhammad, Fernando Alonso-Fernandez
2020 arXiv   pre-print
We use for our experiments a self-captured database of 40 users, with 120 to 200 tweets per user.  ...  In recent years, messages and text posted on the Internet are used in criminal investigations. Unfortunately, the authorship of many of them remains unknown.  ...  METHODS FOR WRITER IDENTIFICATION Here, we employ the most popular features for authorship attribution: n-grams and stylometric features.  ... 
arXiv:2003.11545v1 fatcat:45yqhsl6f5ai7dvke3gmyitaf4

Author gender identification from text

Na Cheng, R. Chandramouli, K.P. Subbalakshmi
2011 Digital Investigation. The International Journal of Digital Forensics and Incident Response  
Note that this is different from the authorship attribution problem.  ...  Extensive experiments on large text corpora (Reuters Corpus Volume 1 newsgroup data and Enron e-mail data) indicate an accuracy up to 85.1% in identifying the gender.  ...  Although authorship identification methods have achieved some degree of success in many literary and forensic applications as mentioned above, very limited studies have been undertaken specifically for  ... 
doi:10.1016/j.diin.2011.04.002 fatcat:7gx7y2gd6ndcxi6eurhn7uxu34

Authorship attribution of texts: a review

M.B. Malyutov
2005 Electronic Notes in Discrete Mathematics  
literary works.  ...  We study the authorship attribution of documents given some prior stylistic characteristics of the author's writing extracted from a corpus of known works, e.g., authentication of disputed documents or  ...  The methods that are now developing are promising and could also very well apply in other similar problems of authorship attribution, some of which might even have significant security applications.  ... 
doi:10.1016/j.endm.2005.07.064 fatcat:7csyaqars5ayjd4f3kqn3yaj7y

Authorship Attribution of Texts: A Review [chapter]

M. B. Malyutov
2006 Lecture Notes in Computer Science  
literary works.  ...  We study the authorship attribution of documents given some prior stylistic characteristics of the author's writing extracted from a corpus of known works, e.g., authentication of disputed documents or  ...  The methods that are now developing are promising and could also very well apply in other similar problems of authorship attribution, some of which might even have significant security applications.  ... 
doi:10.1007/11889342_20 fatcat:z7o47qjygnh2po3sal2es762ta

Authorship identification from unstructured texts

Chunxia Zhang, Xindong Wu, Zhendong Niu, Wei Ding
2014 Knowledge-Based Systems  
It has been applied to more and more practical applications including literary works, intelligence, criminal law, civil law, and computer forensics.  ...  The increasingly large volumes of anonymous texts on the Internet enhance the great yet urgent necessity for authorship identification.  ...  Authorship identification has been applied to more and more applications including literary works, intelligence, criminal law, civil law, and computer forensics [1] [2] [3] .  ... 
doi:10.1016/j.knosys.2014.04.025 fatcat:d4d6ksofw5ha7aju4rsc4jcnpu

Generalizing Unmasking for Short Texts

Janek Bevendorff, Benno Stein, Matthias Hagen, Martin Potthast
2019 Proceedings of the 2019 Conference of the North  
The new approach is on par with other state-of-the-art techniques that are optimized for texts of this length: it achieves accuracies of 75-80%, while also allowing for easy adjustment to forensic scenarios  ...  Authorship verification is the problem of inferring whether two texts were written by the same author.  ...  Although mostly the narrower task of authorship attribution has been considered, where texts are attributed to a set of given authors, recently, authorship verification has been proposed as a more fundamental  ... 
doi:10.18653/v1/n19-1068 dblp:conf/naacl/BevendorffSHP19 fatcat:ku2e73t6rzbqxlfigv3kama4gi

Writer Identification Using Microblogging Texts for Social Media Forensics [article]

Fernando Alonso-Fernandez, Nicole Mariah Sharon Belvisi, Kevin Hernandez-Diaz, Naveed Muhammad, Josef Bigun
2021 arXiv   pre-print
In such cases, automatic attribution can provide significant time savings to experts in suspect search. For completeness, we report verification results.  ...  Establishing authorship of online texts is fundamental to combat cybercrimes. Unfortunately, text length is limited on some platforms, making the challenge harder.  ...  In this context, it is of huge interest the development of methods for authorship attribution to aid in forensic investigations of cybercrimes [1] .  ... 
arXiv:2008.01533v2 fatcat:jaqu34fv3zdlhjh6wmsg75gn54

Authorship Attribution of Social Media and Literary Russian-Language Texts Using Machine Learning Methods and Feature Selection

Anastasia Fedotova, Aleksandr Romanov, Anna Kurtukova, Alexander Shelupanov
2021 Future Internet  
The average accuracy for literary texts was 80.4% using SVM combined with GA, 82.3% using deep NNs, and 82.1% using fastText.  ...  Authorship attribution is one of the important fields of natural language processing (NLP).  ...  Acknowledgments: The authors express their gratitude to the editor and reviewers for their work and valuable comments on the article.  ... 
doi:10.3390/fi14010004 fatcat:u2aqr6jurrabjlr6bizziibxhi

MedLatinEpi and MedLatinLit: Two Datasets for the Computational Authorship Analysis of Medieval Latin Texts [article]

Silvia Corbara, Alejandro Moreo, Fabrizio Sebastiani, Mirko Tavoni
2021 arXiv   pre-print
MedLatinEpi and MedLatinLit consist of 294 and 30 curated texts, respectively, labelled by author; MedLatinEpi texts are of epistolary nature, while MedLatinLit texts consist of literary comments and treatises  ...  We present and make available MedLatinEpi and MedLatinLit, two datasets of medieval Latin texts to be used in research on computational authorship analysis.  ...  ACKNOWLEDGMENTS We would like to thank Gabriella Albanese and Paolo Pontari for helping us to identify the medieval Latin texts that we have incorporated into our datasets; Patrick Juola, Moshe Koppel,  ... 
arXiv:2006.12289v2 fatcat:2spkdiiuxbca3eq3tddec2v24y

Authorship Identification of a Russian-Language Text Using Support Vector Machine and Deep Neural Networks

Aleksandr Romanov, Anna Kurtukova, Alexander Shelupanov, Anastasia Fedotova, Valery Goncharov
2020 Future Internet  
Text authorship methods are particularly useful for information security and forensics.  ...  For example, such methods can be used to identify authors of suicide notes, and other texts are subjected to forensic examinations. Another area of application is plagiarism detection.  ...  Acknowledgments: The authors express their gratitude to the editor and reviewers for their work and valuable comments on the article.  ... 
doi:10.3390/fi13010003 fatcat:ls6odobiwvartmn6a4zj2oe7qa

Stylometry with R: A Package for Computational Text Analysis

Maciej Eder, Jan Rybicki, Mike Kestemont
2016 The R Journal  
In this paper we introduce the possibilities of stylo for computational text analysis, via a number of dummy case studies from English and French literature.  ...  Stylometry (computational stylistics) is concerned with the quantitative study of writing style, e.g. authorship verification, an application which has considerable potential in forensic contexts, as well  ...  Acknowledgments We would like to thank the users of stylo for the valuable feedback and feature requests which we have received over the past years.  ... 
doi:10.32614/rj-2016-007 fatcat:mvkvz45gbfdt5p6reu4rueplyi

Bigrams of Syntactic Labels for Authorship Discrimination of Short Texts

G. Hirst, O. Feiguina
2007 Literary and Linguistic Computing  
We present a method for authorship discrimination that is based on the frequency of bigrams of syntactic labels that arise from partial parsing of the text.  ...  Moreover, high accuracies are achieved even on fragments of text little more than 200 words long.  ...  We are grateful to Neil Graham for his assistance and for the use of his code for the lexical features that were used in his work.  ... 
doi:10.1093/llc/fqm023 fatcat:rwlkxehekbdvdevjqsf7farn4a

Naïve Bayes classifiers for authorship attribution of Arabic texts

Alaa Saleh Altheneyan, Mohamed El Bachir Menai
2014 Journal of King Saud University: Computer and Information Sciences  
Comparison results with related methods indicate that MBNB and MNB are appropriate for authorship attribution.  ...  Authorship attribution is the process of assigning an author to an anonymous text based on writing characteristics.  ...  For each experiment, five authors with 20 texts for each were used.  ... 
doi:10.1016/j.jksuci.2014.06.006 fatcat:2xjtcojmrzh63pkjplq3iea6yi

Surveying the Development of Authorship Identification of Text Messages

Abdulaziz Altamimi, Saud Alotaibi, Abdulrahman Alruban
2019 International Journal of Intelligent Computing Research  
People typically use multiple messaging systems and send text messages concurrently by different messaging systems such as SMS, email, Twitter, and Facebook.  ...  Identification of suspects has become crucial for law enforcement due in particular to the anonymity that the internet and associated services provide and identify the ownership of messages.  ...  The performance of authorship attribution systems on short texts can be affected by several factors.  ... 
doi:10.20533/ijicr.2042.4655.2019.0116 fatcat:vy2c26prjfcmjjlul3qtvio4mi