KTVIC: A Vietnamese Image Captioning Dataset on the Life Domain
Introduces KTVIC, a life-domain Vietnamese image captioning dataset for benchmarking multimodal models.
Anh-Cuong Pham
Introduces KTVIC, a life-domain Vietnamese image captioning dataset for benchmarking multimodal models.
Defines a visual abductive reasoning benchmark for hazard prediction with a newly collected dataset.
Transfers video coding priors into deep enhancement networks for improved restoration quality.