Article

Introduces KTVIC, a Vietnamese image captioning dataset covering the life domain, for benchmarking multimodal models.

Anh-Cuong Pham • May 1, 2024 • 1 min read

Defines a visual abductive reasoning benchmark for hazard prediction with a newly collected dataset.

Korawat Charoenpitaks • Nov 1, 2023 • 1 min read

Transfers video coding priors into deep enhancement networks for improved restoration quality.

Thong Bach • Jul 1, 2023 • 1 min read