Implicit Filter Sparsification In Convolutional Neural Networks [article]

Dushyant Mehta, Kwang In Kim, Christian Theobalt
2019 arXiv pre-print
We show that implicit filter-level sparsity manifests in convolutional neural networks (CNNs) that employ Batch Normalization and ReLU activation and are trained with adaptive gradient descent techniques and L2 regularization or weight decay. Through an extensive empirical study (Mehta et al., 2019), we hypothesize the mechanism behind the sparsification process and find surprising links to certain filter sparsification heuristics proposed in the literature. The emergence of, and the subsequent pruning of, selective features is observed to be one of the contributing mechanisms, leading to feature sparsity on par with or better than certain explicit sparsification / pruning approaches. In this workshop article we summarize our findings and point out corollaries of selective-feature penalization which could also be employed as heuristics for filter pruning.
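To make "filter-level sparsity" concrete, the following is a minimal sketch of one way such sparsity could be quantified. The abstract does not state the measurement used, so the criterion here is an assumption: a filter is counted as inactive when the magnitude of its Batch Normalization scale is below a small threshold. The function name, the threshold value, and the example numbers are all hypothetical.

```python
from typing import Dict, List


def sparse_filter_fraction(
    bn_scales: Dict[str, List[float]],
    threshold: float = 1e-3,  # assumed cutoff, not taken from the abstract
) -> Dict[str, float]:
    """Return, per layer, the fraction of filters whose |scale| < threshold.

    bn_scales maps a layer name to the learned Batch Normalization scale
    of each filter in that layer (one value per output channel).
    """
    fractions = {}
    for layer, scales in bn_scales.items():
        if not scales:
            # An empty layer has no filters to count; report zero sparsity.
            fractions[layer] = 0.0
            continue
        inactive = sum(1 for s in scales if abs(s) < threshold)
        fractions[layer] = inactive / len(scales)
    return fractions


# Hypothetical scale values for two layers, for illustration only.
example_scales = {
    "conv1": [0.8, 1.2e-5, 0.5, -3.0e-4],
    "conv2": [0.9, 0.7, -1.1, 0.4],
}
fractions = sparse_filter_fraction(example_scales)
print(fractions)
```

In a real network the scale values would be read from the trained model's normalization layers; filters flagged this way are natural candidates for removal, in the spirit of the pruning heuristics the abstract mentions.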
arXiv:1905.04967v1 fatcat:2zqa53evrrgiblqbqxx24fvaea