An Optimized Density-based Algorithm for Anomaly Detection in High Dimensional Datasets

Adeel Shiraz Hashmi, Mohammad Najmud Doja, Tanvir Ahmad
2018 Scalable Computing: Practice and Experience
This study proposes an optimized density-based algorithm for anomaly detection, with a focus on high-dimensional datasets. The algorithm's input parameters are tuned using the firefly meta-heuristic. Several similarity measures, including the L1 and L2 norms, are compared to identify the most efficient one for high-dimensional datasets. The algorithm is further optimized for speed and scalability by running it on the Apache Spark big data platform. Experiments were conducted on publicly available datasets, and the results were evaluated on performance metrics such as execution time, accuracy, sensitivity, and specificity.
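To make the role of the distance norm and the input parameters concrete, here is a minimal sketch of a generic density-based anomaly rule. The abstract does not name the underlying algorithm, so this is not the authors' method: the neighbourhood-count rule, the function names `minkowski` and `density_anomalies`, and the parameters `eps` and `min_pts` are all assumptions chosen for illustration. The firefly parameter search and the Apache Spark parallelization are omitted.

```python
from typing import List, Sequence


def minkowski(a: Sequence[float], b: Sequence[float], p: int) -> float:
    """Minkowski distance; p=1 is the L1 norm, p=2 is the L2 norm."""
    return sum(abs(x - y) ** p for x, y in zip(a, b)) ** (1.0 / p)


def density_anomalies(
    points: Sequence[Sequence[float]],
    eps: float,       # hypothetical neighbourhood radius
    min_pts: int,     # hypothetical minimum neighbour count
    p: int = 2,       # order of the norm used as similarity measure
) -> List[bool]:
    """Flag a point as anomalous when fewer than min_pts other points
    lie within distance eps of it (a simple density criterion)."""
    flags = []
    for i, a in enumerate(points):
        neighbours = sum(
            1
            for j, b in enumerate(points)
            if j != i and minkowski(a, b, p) <= eps
        )
        flags.append(neighbours < min_pts)
    return flags


# Toy data: a tight cluster of four points plus one isolated point.
data = [(0.0, 0.0), (0.1, 0.0), (0.0, 0.1), (0.1, 0.1), (5.0, 5.0)]
flags_l2 = density_anomalies(data, eps=0.5, min_pts=2, p=2)
flags_l1 = density_anomalies(data, eps=0.5, min_pts=2, p=1)
```

In a rule of this kind, `eps` and `min_pts` are the sort of input parameters a meta-heuristic search could tune, and `p` selects the norm being compared. The pairwise loop is quadratic in the number of points, which is one reason a distributed platform becomes attractive at scale.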
doi:10.12694/scpe.v19i1.1394 fatcat:36otfgyk6zcj7khzsv5ggjpww4