TWIN Datasets and models from the paper "Same or Not? Enhancing Visual Perception in Vision-Language Models" glab-caltech/TWIN-Qwen2.5-VL-3B Image-Text-to-Text • 4B • Updated about 2 hours ago • 21 glab-caltech/TWIN-InternVL3_5-1B Image-Text-to-Text • 1B • Updated about 1 hour ago • 4 • 1 glab-caltech/FGVQA Viewer • Updated about 8 hours ago • 12k • 16 glab-caltech/TWIN Viewer • Updated about 8 hours ago • 562k • 24 • 2
VALOR Models from the paper "No Labels, No Problem: Training Visual Reasoners with Multimodal Verifiers" glab-caltech/VALOR-8B 8B • Updated 19 days ago • 61 glab-caltech/VALOR-GroundingDINO Object Detection • Updated 19 days ago
TWIN Datasets and models from the paper "Same or Not? Enhancing Visual Perception in Vision-Language Models" glab-caltech/TWIN-Qwen2.5-VL-3B Image-Text-to-Text • 4B • Updated about 2 hours ago • 21 glab-caltech/TWIN-InternVL3_5-1B Image-Text-to-Text • 1B • Updated about 1 hour ago • 4 • 1 glab-caltech/FGVQA Viewer • Updated about 8 hours ago • 12k • 16 glab-caltech/TWIN Viewer • Updated about 8 hours ago • 562k • 24 • 2
VALOR Models from the paper "No Labels, No Problem: Training Visual Reasoners with Multimodal Verifiers" glab-caltech/VALOR-8B 8B • Updated 19 days ago • 61 glab-caltech/VALOR-GroundingDINO Object Detection • Updated 19 days ago