"Mesh-TensorFlow: Deep Learning for Supercomputers."

Noam Shazeer et al. (2018)
a service of Schloss Dagstuhl - Leibniz Center for Informatics