A copy of this work was available on the public web and has been preserved in the Wayback Machine. The capture dates from 2017; you can also visit the original URL.
The file type is application/pdf
.
Filters
Authorship Attribution Using Text Distortion
2017
Proceedings of the 15th Conference of the European Chapter of the Association for Computational Linguistics: Volume 1, Long Papers
Authorship attribution is associated with important applications in forensics and humanities research. ...
Based on experiments on two main tasks in authorship attribution, closed-set attribution and authorship verification, we demonstrate that the proposed approach can enhance existing methods especially under ...
In authorship attribution it is not always realistic to assume that the texts of known authorship and the texts under investigation belong in the same genre and are in the same thematic area. ...
doi:10.18653/v1/e17-1107
dblp:conf/eacl/Stamatatos17
fatcat:wwbnsijdtjch5ickkct55zbequ
Forensic Authorship Analysis of Microblogging Texts Using N-Grams and Stylometric Features
[article]
2020
arXiv
pre-print
We use for our experiments a self-captured database of 40 users, with 120 to 200 tweets per user. ...
In recent years, messages and text posted on the Internet are used in criminal investigations. Unfortunately, the authorship of many of them remains unknown. ...
METHODS FOR WRITER IDENTIFICATION Here, we employ the most popular features for authorship attribution: n-grams and stylometric features. ...
arXiv:2003.11545v1
fatcat:45yqhsl6f5ai7dvke3gmyitaf4
Author gender identification from text
2011
Digital Investigation. The International Journal of Digital Forensics and Incident Response
Note that this is different from the authorship attribution problem. ...
Extensive experiments on large text corpora (Reuters Corpus Volume 1 newsgroup data and Enron e-mail data) indicate an accuracy up to 85.1% in identifying the gender. ...
Although authorship identification methods have achieved some degree of success in many literary and forensic applications as mentioned above, very limited studies have been undertaken specifically for ...
doi:10.1016/j.diin.2011.04.002
fatcat:7gx7y2gd6ndcxi6eurhn7uxu34
Authorship attribution of texts: a review
2005
Electronic Notes in Discrete Mathematics
literary works. ...
We study the authorship attribution of documents given some prior stylistic characteristics of the author's writing extracted from a corpus of known works, e.g., authentication of disputed documents or ...
The methods that are now developing are promising and could also very well apply in other similar problems of authorship attribution, some of which might even have significant security applications. ...
doi:10.1016/j.endm.2005.07.064
fatcat:7csyaqars5ayjd4f3kqn3yaj7y
Authorship Attribution of Texts: A Review
[chapter]
2006
Lecture Notes in Computer Science
literary works. ...
We study the authorship attribution of documents given some prior stylistic characteristics of the author's writing extracted from a corpus of known works, e.g., authentication of disputed documents or ...
The methods that are now developing are promising and could also very well apply in other similar problems of authorship attribution, some of which might even have significant security applications. ...
doi:10.1007/11889342_20
fatcat:z7o47qjygnh2po3sal2es762ta
Authorship identification from unstructured texts
2014
Knowledge-Based Systems
It has been applied to more and more practical applications including literary works, intelligence, criminal law, civil law, and computer forensics. ...
The increasingly large volumes of anonymous texts on the Internet enhance the great yet urgent necessity for authorship identification. ...
Authorship identification has been applied to more and more applications including literary works, intelligence, criminal law, civil law, and computer forensics [1] [2] [3] . ...
doi:10.1016/j.knosys.2014.04.025
fatcat:d4d6ksofw5ha7aju4rsc4jcnpu
Generalizing Unmasking for Short Texts
2019
Proceedings of the 2019 Conference of the North
The new approach is on par with other state-ofthe-art techniques that are optimized for texts of this length: it achieves accuracies of 75-80 %, while also allowing for easy adjustment to forensic scenarios ...
Authorship verification is the problem of inferring whether two texts were written by the same author. ...
Although mostly the narrower task of authorship attribution has been considered, where texts are attributed to a set of given authors, recently, authorship verification has been proposed as a more fundamental ...
doi:10.18653/v1/n19-1068
dblp:conf/naacl/BevendorffSHP19
fatcat:ku2e73t6rzbqxlfigv3kama4gi
Writer Identification Using Microblogging Texts for Social Media Forensics
[article]
2021
arXiv
pre-print
In such cases, automatic attribution can provide significant time savings to experts in suspect search. For completeness, we report verification results. ...
Establishing authorship of online texts is fundamental to combat cybercrimes. Unfortunately, text length is limited on some platforms, making the challenge harder. ...
In this context, it is of huge interest the development of methods for authorship attribution to aid in forensic investigations of cybercrimes [1] . ...
arXiv:2008.01533v2
fatcat:jaqu34fv3zdlhjh6wmsg75gn54
Authorship Attribution of Social Media and Literary Russian-Language Texts Using Machine Learning Methods and Feature Selection
2021
Future Internet
The average accuracy for literary texts was 80.4% using SVM combined with GA, 82.3% using deep NNs, and 82.1% using fastText. ...
Authorship attribution is one of the important fields of natural language processing (NLP). ...
Acknowledgments: The authors express their gratitude to the editor and reviewers for their work and valuable comments on the article. ...
doi:10.3390/fi14010004
fatcat:u2aqr6jurrabjlr6bizziibxhi
MedLatinEpi and MedLatinLit: Two Datasets for the Computational Authorship Analysis of Medieval Latin Texts
[article]
2021
arXiv
pre-print
MedLatinEpi and MedLatinLit consist of 294 and 30 curated texts, respectively, labelled by author; MedLatinEpi texts are of epistolary nature, while MedLatinLit texts consist of literary comments and treatises ...
We present and make available MedLatinEpi and MedLatinLit, two datasets of medieval Latin texts to be used in research on computational authorship analysis. ...
ACKNOWLEDGMENTS We would like to thank Gabriella Albanese and Paolo Pontari for helping us to identify the medieval Latin texts that we have incorporated into our datasets; Patrick Juola, Moshe Koppel, ...
arXiv:2006.12289v2
fatcat:2spkdiiuxbca3eq3tddec2v24y
Authorship Identification of a Russian-Language Text Using Support Vector Machine and Deep Neural Networks
2020
Future Internet
Text authorship methods are particularly useful for information security and forensics. ...
For example, such methods can be used to identify authors of suicide notes, and other texts are subjected to forensic examinations. Another area of application is plagiarism detection. ...
Acknowledgments: The authors express their gratitude to the editor and reviewers for their work and valuable comments on the article. ...
doi:10.3390/fi13010003
fatcat:ls6odobiwvartmn6a4zj2oe7qa
Stylometry with R: A Package for Computational Text Analysis
2016
The R Journal
In this paper we introduce the possibilities of stylo for computational text analysis, via a number of dummy case studies from English and French literature. ...
Stylometry (computational stylistics) is concerned with the quantitative study of writing style, e.g. authorship verification, an application which has considerable potential in forensic contexts, as well ...
Acknowledgments We would like to thank the users of stylo for the valuable feedback and feature requests which we have received over the past years. ...
doi:10.32614/rj-2016-007
fatcat:mvkvz45gbfdt5p6reu4rueplyi
Bigrams of Syntactic Labels for Authorship Discrimination of Short Texts
2007
Literary and Linguistic Computing
We present a method for authorship discrimination that is based on the frequency of bigrams of syntactic labels that arise from partial parsing of the text. ...
Moreover, high accuracies are achieved even on fragments of text little more than 200 words long. ...
We are grateful to Neil Graham for his assistance and for the use of his code for the lexical features that were used in his work. ...
doi:10.1093/llc/fqm023
fatcat:rwlkxehekbdvdevjqsf7farn4a
Naïve Bayes classifiers for authorship attribution of Arabic texts
2014
Journal of King Saud University: Computer and Information Sciences
Comparison results with related methods indicate that MBNB and MNB are appropriate for authorship attribution. ...
Authorship attribution is the process of assigning an author to an anonymous text based on writing characteristics. ...
For each experiment, five authors with 20 texts for each were used. ...
doi:10.1016/j.jksuci.2014.06.006
fatcat:2xjtcojmrzh63pkjplq3iea6yi
Surveying the Development of Authorship Identification of Text Messages
2019
International Journal of Intelligent Computing Research
People typically use multiple messaging systems and send text messages concurrently by different messaging systems such as SMS, email, Twitter, and Facebook. ...
Identification of suspects has become crucial for law enforcement due in particularly to the anonymity that the internet and associated services provide and identify the ownership of messages. ...
The performance of authorship attribution systems on short texts can be affected by several factors. ...
doi:10.20533/ijicr.2042.4655.2019.0116
fatcat:vy2c26prjfcmjjlul3qtvio4mi
« Previous
Showing results 1 — 15 out of 939 results