Although FIG. 5 depicts pixel blocks 20 as square and of equal size, the division may be performed in any other manner that is efficient with respect to a particular application (e.g. in light of such factors as instruction set architecture, datapath width, and storage or tiling format).