Special Aspects of Matrix Operation Implementations for Low-Precision Neural Network Model on the Elbrus Platform

E.E. Limonova, Federal Research Center ", M.I. Neiman-zade, V.L. Arlazarov, Computer Science and Control", of the Russian Academy of Sciences, Smart Engines Service LLC, JSC "MCST", Federal Research Center "Computer Science and Control" of the Russian Academy of Sciences
2020 Bulletin of the South Ural State University Series Mathematical Modelling Programming and Computer Software  
This paper investigates the possibility of effective implementation of calculations in lowprecision neural network models on the Elbrus platform with the VLIW architecture. Such models are widely used in practice to increase the computational efficiency of recognition and well suit computers with the x86 and ARM architectures. In this paper, we consider an 8-bit neural network model, in which matrix multiplication is the most resource-intensive part of the implementation. This paper presents an
more » ... effective implementation of matrix multiplication that takes into account the features of the Elbrus architecture: the presence of several computational channels with various arithmetic and logic devices, an array prefetch buffer, and its own SIMD extension. We carry out theoretical and experimental comparisons of the computational efficiency of low-precision and classical neural network models, which show that Elbrus processors have much more capabilities for performing fast floating point calculations and require the development of new approaches to increase the computational efficiency of neural network models.
doi:10.14529/mmp200109 fatcat:7txfuvur35dovkvkuzsdmixxrm