"A Visual Attention Grounding Neural Model for Multimodal Machine Translation."

Mingyang Zhou et al. (2018)