GRIT: Faster and Better Image Captioning Transformer Using Dual Visual FeaturesOct 1, 2022·Van-Quang Nguyen,Masanori Suganuma,Takayuki Okatani· 0 min readTypeConference paperPublicationEuropean Conference on Computer Vision (ECCV) 2022Last updated on Oct 1, 2022 AuthorsVan-Quang NguyenPostdoc Researcher, RIKEN AIP ← Leveraging Video Coding Knowledge for Deep Video Enhancement Jul 1, 2023Look Wide and Interpret Twice: Improving Performance on Interactive Instruction-following Tasks Aug 1, 2021 →