"Run-time thread sorting to expose data-level parallelism."

Tirath Ramdas et al. (2008)
a service of Schloss Dagstuhl - Leibniz Center for Informatics