"Multi-Modal Structure-Embedding Graph Transformer for Visual Commonsense ..."

Jian Zhu, Hanli Wang, Bin He (2024)

Details and statistics

DOI: 10.1109/TMM.2023.3279691

access: closed

type: Journal Article

metadata version: 2024-02-10

a service of  Schloss Dagstuhl - Leibniz Center for Informatics