A visual tool for Bayesian data analysis

Giorgio Maria Di Nunzio, Alessandro Sordoni
2012 Proceedings of the 35th international ACM SIGIR conference on Research and development in information retrieval - SIGIR '12  
Naïve Bayes (NB) classifiers are simple probabilistic classifiers that are still widely used in supervised learning because they offer a good tradeoff between efficient model training and empirical performance. One drawback of these classifiers is that under data sparsity (i.e., when the training set is small), the maximum likelihood estimate of the probability of an unseen feature is zero, which causes arithmetic anomalies. To prevent this undesirable behavior, a number of smoothing techniques have been proposed [4]. Among these, the Bayesian approach incorporates smoothing as prior knowledge about the parameters of the model, governed by what are usually called hyper-parameters. Our research question is: can a visualization tool help researchers quickly assess how well NB classifiers perform by setting optimal smoothing parameters?
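The zero-probability problem and the Bayesian remedy described in the abstract can be illustrated with a minimal sketch. It assumes a Bernoulli feature model with a Beta(alpha, beta) prior, which is one common choice and not necessarily the exact formulation used in the paper; the function names, the counts, and the default hyper-parameter values are hypothetical.

```python
from fractions import Fraction


def mle_estimate(count, total):
    """Maximum likelihood estimate of P(feature | class): count / total."""
    return Fraction(count, total)


def beta_smoothed_estimate(count, total, alpha=1, beta=1):
    """Posterior mean of P(feature | class) under a Beta(alpha, beta) prior.

    alpha and beta are the hyper-parameters (assumed values here);
    alpha = beta = 1 corresponds to Laplace (add-one) smoothing.
    """
    return Fraction(count + alpha, total + alpha + beta)


# Hypothetical counts: the feature occurs in 0 of 4 training documents.
unseen_mle = mle_estimate(0, 4)                 # zero probability
unseen_smoothed = beta_smoothed_estimate(0, 4)  # strictly positive

print("MLE:", unseen_mle)
print("Smoothed:", unseen_smoothed)
```

With the maximum likelihood estimate, a single unseen feature drives the whole class likelihood to zero (and its logarithm is undefined), whereas the smoothed estimate stays strictly positive. Changing alpha and beta changes how strongly the prior pulls the estimate, which is the kind of parameter choice the abstract proposes to explore visually.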
doi:10.1145/2348283.2348427 dblp:conf/sigir/NunzioS12 fatcat:nyr4tspjdzcutnae3g3hhwzc74