Classification Of Twitter's Data To Get Gender Identification

Waqas Ali, Malik Tahir Hassan, Syed Fawad Raza, Usman Fiaz
2018 VAWKUM Transactions on Computer Sciences  
This paper describes the accuracy of various algorithms for classification of text on the basis of gender identification. We examined the knowledge extracted from corpus of twitter's online social media in term of gender identity. By comparing algorithms on different feature sets, we established a feature set of 20 distinct arguments which increase the correctness of gender identification on all over the twitter. We reported accuracies of three algorithms obtained by using two approaches
more » ... on two classes of gender i.e. male and female; a model where a lot of features are reduced using powerset transformation.
doi:10.21015/vtcs.v16i1.545 fatcat:mj6jmz4sr5edbicl544daaxbwu