Hierarchy Website Fingerprint Using N-gram Byte Distribution

Mohammed Aldarwbi, Essa Shahra
2017 Transactions on Networks and Communications  
According to www.internetlivestats.com, there are over one billion websites on the world wide web (WWW) today while in 1991, there were only one single website. Websites classification based on traffic analysis has become a difficult problem due to the large number of websites within the internet. All the proposed approaches in the literature could not classify more than 100 websites which is a very trivial number compared to the total number of websites over the internet. In this paper, a
more » ... evel websites' classification technique is proposed. At the first level, the traffic is classified to a general category such as sports, news, social, healthy, education, etc. Then, for further information the packet could be classified within the same category to identify from which websites the packet came.
doi:10.14738/tnc.56.3767 fatcat:hgx7c5fbmnhpfmtae4wi5tgmhy