"Vision-Text Cross-Modal Fusion for Accurate Video Captioning."

Kaouther Ouenniche, Ruxandra Tapu, Titus B. Zaharia (2023)

DOI: 10.1109/ACCESS.2023.3324052

access: open

type: Journal Article

metadata version: 2023-11-09

a service of Schloss Dagstuhl - Leibniz Center for Informatics