"Layer-wise enhanced transformer with multi-modal fusion for image caption."

Jingdan Li, Yi Wang, Dexin Zhao (2023)

Details and statistics

DOI: 10.1007/S00530-022-01036-Z

access: closed

type: Journal Article

metadata version: 2023-06-04

a service of  Schloss Dagstuhl - Leibniz Center for Informatics