Using AUC and accuracy in evaluating learning algorithms

Jin Huang, C.X. Ling
2005 IEEE Transactions on Knowledge and Data Engineering  
The area under the ROC (Receiver Operating Characteristics) curve, or simply AUC, has been recently proposed as an alternative single-number measure for evaluating the predictive ability of learning algorithms. However, no formal arguments were given as to why AUC should be preferred over accuracy. In this paper, we establish formal criteria for comparing two different measures for learning algorithms, and we show theoretically and empirically that AUC is, in general, a better measure (defined
more » ... recisely) than accuracy. We then reevaluate well-established claims in machine learning based on accuracy using AUC, and obtain interesting and surprising new results. We also show that AUC is more directly associated with the net profit than accuracy in direct marketing, suggesting that learning algorithms should optimize AUC instead of accuracy in real-world applications.
doi:10.1109/tkde.2005.50 fatcat:f2qdpgcxs5e3npeqwrmekpdqla