view article Article NVIDIA Releases 3 Million Sample Dataset for OCR, Visual Question Answering, and Captioning Tasks Aug 11, 2025 • 75
D-FINE Collection State-of-the-art real-time object detection model with Apache 2.0 licence • 15 items • Updated May 5, 2025 • 56
SmolVLM: Redefining small and efficient multimodal models Paper • 2504.05299 • Published Apr 7, 2025 • 203
Portuguese LLM Leaderboard best models ❤️🔥 Collection A daily uploaded list of models with best evaluations on the PT-LLM leaderboard: • 18 items • Updated 31 minutes ago • 41
view article Article π0 and π0-FAST: Vision-Language-Action Models for General Robot Control +2 Feb 4, 2025 • 186
GuardReasoner: Towards Reasoning-based LLM Safeguards Paper • 2501.18492 • Published Jan 30, 2025 • 88
ViTucano-v1 Collection ViTucano is our first attempt at creating a vision assistant natively pretrained in Portuguese. ViTucano is built on top of the Tucano series. • 5 items • Updated May 31, 2025 • 1
view article Article KV Caching Explained: Optimizing Transformer Inference Efficiency Jan 30, 2025 • 213