Filters








997 Hits in 5.1 sec

Overview of PAN'17 [chapter]

Martin Potthast, Francisco Rangel, Michael Tschuggnall, Efstathios Stamatatos, Paolo Rosso, Benno Stein
2017 Lecture Notes in Computer Science  
This paper gives a high-level overview of each of the three shared tasks organized this year, namely author identification, author profiling, and author obfuscation.  ...  For each task, we give a brief summary of the evaluation data, performance measures, and results obtained.  ...  Acknowledgements Our special thanks go to all of PAN's participants, to Symanto Group 2 for sponsoring PAN and to MeaningCloud 3 for sponsoring the author profiling shared task award.  ... 
doi:10.1007/978-3-319-65813-1_25 fatcat:mwcxd74zmvf6jdf243ur2t7a4y

Overview of PAN'16 [chapter]

Paolo Rosso, Francisco Rangel, Martin Potthast, Efstathios Stamatatos, Michael Tschuggnall, Benno Stein
2016 Lecture Notes in Computer Science  
PAN 2016 comprises three shared tasks: (i) author identification, addressing author clustering and diarization (or intrinsic plagiarism detection); (ii) author profiling, addressing age and gender prediction  ...  from a crossgenre perspective; and (iii) author obfuscation, addressing author masking and obfuscation evaluation.  ...  Acknowledgements We thank the organizing committees of PAN's shared tasks Ben Verhoeven, Walter Daelemans, Patrick Juola. Our special thanks go to all of PAN's participants, to Adobe 12  ... 
doi:10.1007/978-3-319-44564-9_28 fatcat:qyopkvia5jalvg5lazy2x4ugqu

Overview of PAN 2018 [chapter]

Efstathios Stamatatos, Francisco Rangel, Michael Tschuggnall, Benno Stein, Mike Kestemont, Paolo Rosso, Martin Potthast
2018 Lecture Notes in Computer Science  
Finally, the author obfuscation task studies how a text by a certain author can be paraphrased so that existing author identification tools are confused and cannot recognize the similarity with other texts  ...  In addition, a shared task in multimodal author profiling examines, for the first time, a combination of information from both texts and images posted by social media users to estimate their gender.  ...  Acknowledgments Our special thanks go to all of PAN's participants, to Symanto Group 7 for sponsoring PAN and to MeaningCloud 8 for sponsoring the author profiling shared task award.  ... 
doi:10.1007/978-3-319-98932-7_25 fatcat:7kjolzjqzncvfbv3mcxyhfchyu

Shared Tasks on Authorship Analysis at PAN 2020 [chapter]

Janek Bevendorff, Bilal Ghanem, Anastasia Giachanou, Mike Kestemont, Enrique Manjavacas, Martin Potthast, Francisco Rangel, Paolo Rosso, Günther Specht, Efstathios Stamatatos, Benno Stein, Matti Wiegmann (+1 others)
2020 Lecture Notes in Computer Science  
The tasks include author profiling, celebrity profiling, crossdomain author verification, and style change detection, seeking to advance the state of the art and to evaluate it on new benchmark datasets  ...  The paper gives a brief overview of the four shared tasks that are to be organized at the PAN 2020 lab on digital text forensics and stylometry, hosted at CLEF conference.  ...  At different editions of PAN (since 2007), author identification has been studied in multiple incarnations: AUTHORSHIP ATTRI-BUTION: given a document and a set of candidate authors, determine which of  ... 
doi:10.1007/978-3-030-45442-5_66 fatcat:c7yewwmji5hwjgmda5ouhloia4

Overview of the RUSProfiling PAN at FIRE Track on Cross-genre Gender Identification in Russian

Tatiana Litvinova, Francisco M. Rangel Pardo, Paolo Rosso, Pavel Seredin, Olga Litvinova
2017 Forum for Information Retrieval Evaluation  
After addressing at PAN@CLEF 1 mainly age and gender identification, in this RusProfiling PAN@FIRE track we have addressed the problem of predicting author's gender in Russian from a cross-genre perspective  ...  Author profiling consists of predicting some author's traits (e.g. age, gender, personality) from her writing.  ...  author profiling standpoint and have never been addressed at PAN.  ... 
dblp:conf/fire/LitvinovaPRSL17 fatcat:2c2bfpn6wrh4lgm6hjxehvm2pm

Overview of the PAN/CLEF 2015 Evaluation Lab [chapter]

Efstathios Stamatatos, Martin Potthast, Francisco Rangel, Paolo Rosso, Benno Stein
2015 Lecture Notes in Computer Science  
PAN 2015 comprises three tasks: plagiarism detection, author identification and author profiling studying important variations of these problems.  ...  This paper presents an overview of the PAN/CLEF evaluation lab.  ...  Acknowledgements We thank the organizing committees of PAN's shared tasks Fabio Celli, Walter Daelemans, Ben Verhoeven, Patrick Juola, and Aurelio López-López.  ... 
doi:10.1007/978-3-319-24027-5_49 fatcat:fcpf2p7nujet5ez4zswoiscatq

Academic Plagiarism Detection

Tomáš Foltýnek, Norman Meuschke, Bela Gipp
2019 ACM Computing Surveys  
Over the period we review, the field has seen major advances regarding the automated detection of strongly obfuscated and thus hard-to-identify forms of academic plagiarism.  ...  The high intensity and rapid pace of research on academic plagiarism detection make it difficult for researchers to get an overview of the field.  ...  We recommend readers interested in related tasks to refer to the overview paper of PAN'17 [200] .  ... 
doi:10.1145/3345317 fatcat:yk6f5xl2kvdxlhvsolem6zfdsu

Fine-grained analysis of language varieties and demographics

Francisco Rangel, Paolo Rosso, Wajdi Zaghouani, Anis Charfi
2020 Natural Language Engineering  
We compared the performance of this method with the best performing teams in the Author Profiling task at PAN 2017.  ...  We obtained an average accuracy of 92.08% versus 91.84% for the best performing team at PAN 2017. We also analyse the relationship of the language variety identification with the authors' gender.  ...  The statements made herein are solely the responsibility of the authors. References  ... 
doi:10.1017/s1351324920000108 fatcat:mdk2yxafbjffnhm5te3b7lscxe

Improving the Reproducibility of PAN's Shared Tasks: [chapter]

Martin Potthast, Tim Gollub, Francisco Rangel, Paolo Rosso, Efstathios Stamatatos, Benno Stein
2014 Lecture Notes in Computer Science  
This paper reports on the PAN 2014 evaluation lab which hosts three shared tasks on plagiarism detection, author identification, and author profiling.  ...  Unlike many other labs, PAN asks participants to submit running softwares instead of their run output.  ...  Moreover, we thank our student assistants Anna Beyer and Matthias Busse for helping with maintaining TIRA. Our special thanks go to all of PAN's participants.  ... 
doi:10.1007/978-3-319-11382-1_22 fatcat:anztewljlbgznjdipotoxcdp2q

Author Profiling Tracks at FIRE

Paolo Rosso, Francisco Rangel
2020 SN Computer Science  
In this chapter, we will focus on the description of three author profiling tracks, on their data creation as well as the result analysis.  ...  Benchmarking activities are vital for fostering research and addressing new challenging problems.  ...  Regarding age and gender identification, the best performing team in the three first editions of the author profiling shared task at PAN@CLEF used a second-order representation which relates documents  ... 
doi:10.1007/s42979-020-0073-1 fatcat:lnpfmywpkbc4tnhagaocvpwjky

STEREO: Scientific Text Reuse in Open Access Publications [article]

Lukas Gienapp, Wolfgang Kircheis, Bjarne Sievers, Benno Stein, Martin Potthast
2021 arXiv   pre-print
Featuring a high coverage of scientific disciplines and varieties of reuse, as well as comprehensive metadata to contextualize each case, our dataset addresses the most salient shortcomings of previous  ...  Webis-STEREO-21 allows for tackling a wide range of research questions from different scientific backgrounds, facilitating both qualitative and quantitative analysis of the phenomenon as well as a first-time  ...  The heavily obfuscated test sets studied at PAN were dedicated to study extreme cases of plagiarism, where an author expends much effort to hide the fact via severe forms of paraphrasing.  ... 
arXiv:2112.11800v1 fatcat:d74wspv4hrdajkkziyor6273gq

Bots and Gender Profiling using Masking Techniques

Victor Jimenez-Villar, Javier Sánchez-Junquera, Manuel Montes-y-Gómez, Luis Villaseñor Pineda, Simone Paolo Ponzetto
2019 Conference and Labs of the Evaluation Forum  
This work describes our proposed solution for the author profiling shared task at PAN 2019.  ...  The task consists in identifying whether the author of a Twitter feed is a bot or a human, and, in case of a human, in determining if the author is male or female.  ...  This work was partially supported by CONACYT-Mexico under the scholarship 868585 and project grant CB-2015-01-257383.  ... 
dblp:conf/clef/Jimenez-VillarS19 fatcat:5s3yphwhjzextafzrv3lla65ly

Multilingual Author Profiling using LSTMs: Notebook for PAN at CLEF 2018

Roy Khristopher Bayot, Teresa Gonçalves
2018 Conference and Labs of the Evaluation Forum  
This paper shows one approach of the Universidade de Évora for author profiling for PAN 2018. The approach mainly consists of using word vectors and LSTMs for gender classification.  ...  Using the PAN 2018 dataset, we achieved an accuracy of 67.60% for Arabic, 77.16% for English, and 68.73% for Spanish gender classification.  ...  PAN Editions PAN is one of the initiatives at CLEF that has various tasks related to author analysis. It has author identification, obfuscation, and profiling.  ... 
dblp:conf/clef/BayotG18 fatcat:7gjohpuljvbbjnfc75vuj7f5u4

Recent Trends in Digital Text Forensics and Its Evaluation [chapter]

Tim Gollub, Martin Potthast, Anna Beyer, Matthias Busse, Francisco Rangel, Paolo Rosso, Efstathios Stamatatos, Benno Stein
2013 Lecture Notes in Computer Science  
, and author profiling.  ...  This paper outlines the concepts and achievements of our evaluation lab on digital text forensics, PAN 13, which called for original research and development on plagiarism detection, author identification  ...  identification, and author profiling respectively.  ... 
doi:10.1007/978-3-642-40802-1_28 fatcat:3sceedp6zzgipemglyvmuookkm

On Textual Analysis and Machine Learning for Cyberstalking Detection

Ingo Frommholz, Haider M. al-Khateeb, Martin Potthast, Zinnar Ghasem, Mitul Shukla, Emma Short
2016 Datenbank-Spektrum  
We then discuss PAN, a network and evaluation initiative that focusses on digital text forensics, in particular author identification.  ...  We present a framework for the detection of text-based cyberstalking and the role and challenges of some core techniques such as author identification, text classification and personalisation.  ...  Figure 2 gives an overview of its four major components, namely attribution, verification, profiling, and reuse detection.  ... 
doi:10.1007/s13222-016-0221-x pmid:29368749 pmcid:PMC5750836 fatcat:h4nn4ufsqnbypbkid7na77z55i
« Previous Showing results 1 — 15 out of 997 results