Author Profiling on Social Media: An Ensemble Learning Approach using Various Features

Youngjun Joo, Inchon Hwang
2019 Conference and Labs of the Evaluation Forum  
We describe our participation in the PAN 2019 shared task on author profiling, determine whether a tweet's author is a bot or a human, and in case of human, identify author's gender for English and Spanish datasets. In this paper, we investigate the complementarities of both stylometry methods and contentbased methods, putting forward various techniques for building flexible features. Acting as a complement to these methods, we investigate an ensemble learning method paves the way to improve
more » ... performance of AP tasks. Experimental results demonstrate that the ensemble method by the combination of the stylometry methods and content-based methods can more accurately capture the author profiles than traditional methods. Our proposed model obtained 0.9333 and 0.8352 of accuracy in the bot and gender identification tasks for English test dataset respectively.
dblp:conf/clef/JooH19 fatcat:7ceztesujncmrpml5b55cxl2jq