Improving Deep Neural Network Sparsity through Decorrelation Regularization

Xiaotian Zhu, Wengang Zhou, Houqiang Li
2018 Proceedings of the Twenty-Seventh International Joint Conference on Artificial Intelligence  
Modern deep learning models usually suffer high complexity in model size and computation when transplanted to resource constrained platforms. To this end, many works are dedicated to compressing deep neural networks. Adding group LASSO regularization is one of the most effective model compression methods since it generates structured sparse networks. We investigate the deep neural networks trained by group LASSO constraint and observe that even with strong sparsity regularization imposed, there
more » ... still exists substantial filter correlation among the convolution filters, which is undesired for a compact neural network. We propose to suppress such correlation with a new kind of constraint called decorrelation regularization, which explicitly forces the network to learn a set of less correlated filters. The experiments on CIFAR10/100 and ILSVRC2012 datasets show that when combined our decorrelation regularization with group LASSO, the correlation between filters could be effectively weakened, which increases the sparsity of the resulting model and leads to better compressing performance.
doi:10.24963/ijcai.2018/453 dblp:conf/ijcai/ZhuZL18 fatcat:nvkuvyjuwbfkveytejuurrg5hq