A Study on Mining User-Aware Uncommon Consecutive Topic Patterns in Report Streams

Ankit Gururaj
2016 Asia-pacific Journal of Convergent Research Interchange  
Textual documents created and distributed on the Internet are constantly changing in different structures. The greater part of existing works is given to subject demonstrating and the development of individual topics, while sequential relations of topics in progressive reports distributed by a particular client are disregarded. In this paper, keeping in mind the end goal to describe and identify customized and anomalous practices of Internet clients, we propose Sequential Topic Patterns (STPs)
more » ... nd figure the issue of mining User-mindful Rare Sequential Topic Patterns (URSTPs) in archive streams on the Internet. They are uncommon all in all however moderately visit for particular clients, so can be connected in some genuine situations, for example, real-time monitoring on abnormal user behaviors. We show a gathering of calculations to tackle this inventive mining issue through three stages: preprocessing to remove probabilistic themes and recognize sessions for various clients, producing all the STP applicants with (expected) bolster values for every client by example development, and selecting URSTPs by making client mindful irregularity investigation on determined STPs. Probes both genuine (Twitter) and manufactured datasets demonstrate that our approach can without a doubt find extraordinary clients and interpretable URSTPs successfully and effectively, which significantly reflect users' characteristics.
doi:10.21742/apjcri.2016.12.03 fatcat:77rxv3mmkvd3hpsj24emsyngfa