A copy of this work was available on the public web and has been preserved in the Wayback Machine. The capture dates from 2020; you can also visit the original URL.
The file type is application/pdf
.
Ring-Mesh: A Scalable and High-Performance Approach for Manycore Accelerators
[article]
2019
arXiv
pre-print
There is an increasing number of works addressing the design challenge of fast, scalable solutions for the growing machine learning based application domain. Recently, most of the solutions aimed at improving processing element capabilities to speed up the execution of deep learning (DL) application. However, only a few works focused on the interconnection subsystem as a potential source of performance improvement. Wrapping many cores together offer excellent parallelism, but it comes with
arXiv:1904.03428v1
fatcat:avwhcxwgkzfc5ma5uwcimiex34