Using Bag-of-Words and Psycho-Linguistic features for MAPonSMS

Asmara Safdar, Osama Akhter, Osama Inayat, Abdullah Khalid
2018 Forum for Information Retrieval Evaluation  
This paper presents the use of Bag-of-Words (BoW) and Psycho-Linguistic (P-L) approaches, based upon demographic trends, in modeling multilingual (Roman-Urdu and English) SMS (Short Message Service) text for gender and age prediction. The data set was provided as the standard source for the multilingual author profiling task in the FIRE'18-MAPonSMS contest. Compared with the baseline results, the proposed approaches adequately classify the test set by age and by gender, with each attribute predicted separately.
dblp:conf/fire/SafdarAIK18 fatcat:llklelhjofb2nlr53uvbtpwxam