Subword permutation instructions for two-dimensional multimedia processing in MicroSIMD architectures

R.B. Lee
Proceedings IEEE International Conference on Application-Specific Systems, Architectures, and Processors
MicroSIMD architectures incorporating subword parallelism are very efficient for application-specific media processors as well as for fast multimedia information processing in general-purpose processors. This paper addresses the unsolved problem of the need to permute the subwords packed in registers for maximum parallelism performance, especially for two-dimensional (2-D) multimedia algorithms. We propose a new systematic approach for identifying the fundamental data rearrangement needs in current and future 2-D pixel processing programs based on the hierarchical decomposition of frames and objects into atomic 2-D structures. We define new subword permutation instructions, Check, Excheck, Exchange, and Permset, that achieve these data rearrangements across multiple registers. We also define an alphabet of subword permutation primitives, including these new instructions and the Mix instruction defined for PA-RISC MAX-2 and IA-64, which supports the data rearrangement needs of 2-D frames and objects. We show the sufficiency and efficiency of this alphabet for achieving all possible permutations of hierarchical 2-D blocks.
0-7695-0716-6/00 $10.00 © 2000 IEEE
doi:10.1109/asap.2000.862373 dblp:conf/asap/Lee00 fatcat:4obc7vz5pzcdddrjildnepdp5m
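
The abstract refers to the Mix instruction as one primitive for rearranging subwords across registers. As a rough illustration only, the sketch below models a register as a Python list of subwords and shows how a mix-left / mix-right pair could transpose the 2x2 blocks held in two registers. The function names (mix_left, mix_right, transpose_2x2_blocks) and the list-based register model are assumptions made for this example; they are not taken from the paper, and the new Check, Excheck, Exchange, and Permset instructions are not modeled here.

```python
# Illustrative model only: a "register" is a list of subwords,
# index 0 being the leftmost subword. All names are hypothetical.

def mix_left(a, b):
    """Interleave the even-indexed subwords of a and b: a0 b0 a2 b2 ..."""
    if len(a) != len(b) or len(a) % 2 != 0:
        raise ValueError("registers must have the same even number of subwords")
    out = []
    for i in range(0, len(a), 2):
        out.extend([a[i], b[i]])
    return out


def mix_right(a, b):
    """Interleave the odd-indexed subwords of a and b: a1 b1 a3 b3 ..."""
    if len(a) != len(b) or len(a) % 2 != 0:
        raise ValueError("registers must have the same even number of subwords")
    out = []
    for i in range(1, len(a), 2):
        out.extend([a[i], b[i]])
    return out


def transpose_2x2_blocks(row0, row1):
    """Transpose every 2x2 block formed by two registers holding adjacent rows."""
    return mix_left(row0, row1), mix_right(row0, row1)


if __name__ == "__main__":
    # Two registers holding two rows of four pixels each.
    top, bottom = transpose_2x2_blocks([1, 2, 3, 4], [5, 6, 7, 8])
    print(top)
    print(bottom)
```

In this model, the two rows are treated as two side-by-side 2x2 blocks, and each block is transposed in place by one mix-left and one mix-right operation.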