"From Pixels to Voice: A Simple and Efficient End-to-End Spoken Image ..."

Chung Tran, Sakriani Sakti (2025)

Details and statistics

DOI: 10.1109/ICASSP49660.2025.10890285

access: closed

type: Conference or Workshop Paper

metadata version: 2025-07-07