facebook/dinov3-vits16-pretrain-lvd1689m • Image Feature Extraction • 21.6M • Updated Aug 19, 2025 • 171k • 61
nvidia/NVIDIA-Nemotron-3-Nano-30B-A3B-Base-BF16 • Text Generation • 32B • Updated 8 days ago • 12.3k • 90
VST Collection • A comprehensive framework designed to cultivate VLMs with human-like visuospatial abilities. • 5 items • Updated Nov 12, 2025 • 6
MolmoAct Data Mixture Collection • All datasets for the MolmoAct (Multimodal Open Language Model for Action) release. • 4 items • Updated 21 days ago • 18
MolmoAct Collection • All models for the MolmoAct (Multimodal Open Language Model for Action) release. • 10 items • Updated 21 days ago • 33