Profiling and mapping of parallel workloads on network processors

Ning Weng, Tilman Wolf
2005 Proceedings of the 2005 ACM symposium on Applied computing - SAC '05  
Network processors are embedded system-on-a-chip multiprocessors that are optimized to perform simple packet processing tasks at data rates of several Gigabits per second. To meet the performance demands of increasing link speeds and more complex network applications, network processors are implemented with several dozens of processor cores and execute multiple packet processing applications in parallel. The complexity of such systems makes it increasingly difficult for application developers
more » ... map applications to the various system resources and achieve optimal performance. We propose an automated profiling and mapping methodology for these highly parallel, embedded systems that starts out with a simple uniprocessor implementation of the networking application. An architecture independent representation of the runtime behavior of the application is used to map and schedule different processing steps to the underlying hardware. An analytic performance model is used in the process to estimate system performance and to find an near-optimal solution through iteration.
doi:10.1145/1066677.1066879 dblp:conf/sac/WengW05 fatcat:r2xa4y77l5erninwi247dhmxuy