Effective and High Computing Algorithms for Convolution Neural Networks

P Syamala Rao, Dr G.P. Saradhi Varma, Rajasekhar Mutukuri
2018 International Journal of Engineering & Technology  
Training deep convolutional neural networks on large datasets is a time-consuming process that takes days of GPU time. Self-driving cars require very low latency for pedestrian detection. Image recognition on mobile phones is constrained by limited processing resources. In these situations, the success of convolutional neural networks is determined by how fast they can be computed. Conventional Fast Fourier Transform (FFT) based convolution is fast for large filters, yet state-of-the-art convolutional neural networks use small, 3 × 3 filters. We introduce a new class of fast algorithms for convolutional neural networks using Winograd's minimal filtering algorithms. The algorithms compute minimal-complexity convolution over small tiles, which increases computing speed with small filters and small batch sizes. We benchmark a GPU implementation of our algorithm with the VGG network and show state-of-the-art throughput at batch sizes from 1 to 64.
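To illustrate the minimal filtering idea the abstract refers to, here is a minimal sketch of the one-dimensional Winograd case usually written F(2, 3): two outputs of a 3-tap filter computed with 4 multiplications instead of the 6 used by the direct method. This is not the paper's GPU implementation; the function names (`direct_f23`, `winograd_f23`, `winograd_conv1d`) are hypothetical and chosen only for this example, and the tiling loop assumes the number of outputs is even.

```python
def direct_f23(d, g):
    """Direct method: 2 outputs of a 3-tap filter over a 4-element tile (6 multiplications)."""
    return [
        d[0] * g[0] + d[1] * g[1] + d[2] * g[2],
        d[1] * g[0] + d[2] * g[1] + d[3] * g[2],
    ]


def winograd_f23(d, g):
    """Winograd F(2, 3): the same 2 outputs using 4 multiplications.

    The filter-side terms (g[0] + g[1] + g[2]) / 2 and (g[0] - g[1] + g[2]) / 2
    depend only on the filter, so in practice they can be precomputed once.
    """
    m1 = (d[0] - d[2]) * g[0]
    m2 = (d[1] + d[2]) * ((g[0] + g[1] + g[2]) / 2)
    m3 = (d[2] - d[1]) * ((g[0] - g[1] + g[2]) / 2)
    m4 = (d[1] - d[3]) * g[2]
    return [m1 + m2 + m3, m2 - m3 - m4]


def winograd_conv1d(signal, g):
    """Apply F(2, 3) over a longer signal using overlapping 4-element tiles (stride 2).

    Assumption for this sketch: len(signal) - 2 (the number of outputs) is even.
    """
    n_out = len(signal) - 2
    if n_out <= 0 or n_out % 2 != 0:
        raise ValueError("sketch requires an even, positive number of outputs")
    out = []
    for i in range(0, n_out, 2):
        out.extend(winograd_f23(signal[i:i + 4], g))
    return out


if __name__ == "__main__":
    signal = [1.0, 2.0, 3.0, 4.0, 5.0, 6.0]
    taps = [1.0, 2.0, 3.0]
    print(winograd_conv1d(signal, taps))
```

Each tile produces two outputs, so a tile costs 4 multiplications rather than 6. The two-dimensional case for 3 × 3 filters is obtained by nesting the one-dimensional transform, which is where the larger savings for convolutional layers come from.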
doi:10.14419/ijet.v7i3.31.18203 fatcat:24m3ebi5dbccjg5l7j6pdb6e7q