Joint Optimization of Quantization and Structured Sparsity for Compressed Deep Neural Networks

Gaurav Srivastava, Deepak Kadetotad, Shihui Yin, Visar Berisha, Chaitali Chakrabarti, Jae-sun Seo
2019 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP 2019)
Deep neural networks (DNNs) have shown tremendous success in various cognitive tasks, such as image classification and speech recognition. However, their use on resource-constrained edge devices has been limited by their high computation and large memory requirements. To overcome these challenges, recent works have extensively investigated model compression techniques such as element-wise sparsity, structured sparsity, and quantization. While most of these works have applied these compression
doi:10.1109/icassp.2019.8682791 dblp:conf/icassp/SrivastavaKYBCS19 fatcat:bzs4w7gcgvdzpjyidb54kojkyi