Scientific VS Non-Scientific Citation Annotational Complexity Analysis using Machine Learning Classifiers

Hassan Raza, M. Faizan, Naeem Akhtar, Ayesha Abbas, Naveed-Ul-Hassan
2020 International Journal of Advanced Computer Science and Applications  
This paper evaluates the citation sentences' annotation complexity of both scientific as well as non-scientific text related articles to find out major complexity reasons by performing sentiment analysis of scientific and non-scientific domain articles using our own developed corpora of these domains separately. For this research, we selected different data sources to prepare our corpora in order to perform sentimental analysis. After that, we have performed a manual annotation procedure to
more » ... gn polarities using our defined annotation guidelines. We developed a classification system to check the quality of annotation work for both domains. From results, we have found that the scientific domain gave us more accurate results than the non-scientific domain. We have also explored the reasons for less accurate results and concluded that non-scientific text especially linguistics is of complex nature that leads to poor understanding and incorrect annotation.
doi:10.14569/ijacsa.2020.0110228 fatcat:jm2hjnjv55h3pcrfnupnowosre