FP-outlier: Frequent pattern based outlier detection

Zengyou He, Xiaofei Xu, Zhexue Huang, Shengchun Deng
2005 Computer Science and Information Systems  
An outlier in a dataset is an observation or a point that is considerably dissimilar to or inconsistent with the remainder of the data. Detection of such outliers is important for many applications and has recently attracted much attention in the data mining research community. In this paper, we present a new method to detect outliers by discovering frequent patterns (or frequent itemsets) from the data set. The outliers are defined as the data transactions that contain less frequent patterns
more » ... their itemsets. We define a measure called FPOF (Frequen Pa ern Outlier Factor) to detect the outlier transactions and propose the FindFPOF algorithm to discover outliers. The experimental results have shown that our approach outperformed the existing methods on identifying interesting outliers. t tt
doi:10.2298/csis0501103h fatcat:yvnwv3zelfelbmixxvlpmv5a7y