Model and data for ReflectiVA: Augmenting Multimodal LLMs with Self-Reflective Tokens for Knowledge-based Visual Question Answering [CVPR 2025]
Federico Cocchi
fede97
AI & ML interests
Multimodal LLM - Computer Vision
Recent Activity
Updated a model about 2 months ago: aimagelab/LLaVA_MORE-gemma_2_2b-finetuning
Published a model about 2 months ago: aimagelab/LLaVA_MORE-gemma_2_2b-finetuning
Updated a model 7 months ago: aimagelab/LLaVA_MORE-gemma_2_9b-dinov2-finetuning