ZSE-Cap: A Zero-Shot Ensemble for Image Retrieval and Prompt-Guided Captioning
A zero-shot ensemble for image retrieval and prompt-guided captioning, ranked Top-4 on the private test set of the EVENTA Challenge at ACM Multimedia 2025.
EVENTA Challenge @ ACM Multimedia 2025 July 1, 2025