Deepseek v3.2 Speciale Collection Distilled models and datasets for Deepseek v3.2 Speciale. • 11 items • Updated 5 days ago • 1
Gemini 3 Pro Collection Distilled models and datasets for Gemini 3 Pro. • 9 items • Updated 2 days ago • 1
view article Article Unlocking the conversion of Web Screenshots into HTML Code with the WebSight Dataset +1 Mar 15, 2024 • 13
DeepSeek-V3.2: Pushing the Frontier of Open Large Language Models Paper • 2512.02556 • Published 14 days ago • 212
GPT-4 generated datasets Collection Collection of some GPT-4 generated datasets. It may be useful for those looking for the best-quality datasets to train competitive LLMs. • 18 items • Updated Apr 16, 2024 • 10
DINOv3 Collection DINOv3: foundation models producing excellent dense features, outperforming SotA w/o fine-tuning - https://arxiv.org/abs/2508.10104 • 13 items • Updated Aug 21 • 410
view article Article PaliGemma – Google's Cutting-Edge Open Vision Language Model +1 May 14, 2024 • 278
Cut and Learn for Unsupervised Object Detection and Instance Segmentation Paper • 2301.11320 • Published Jan 26, 2023 • 1
ReVision: High-Quality, Low-Cost Video Generation with Explicit 3D Physics Modeling for Complex Motion and Interaction Paper • 2504.21855 • Published Apr 30 • 13
CodeFusion: A Pre-trained Diffusion Model for Code Generation Paper • 2310.17680 • Published Oct 26, 2023 • 73
TESS: Text-to-Text Self-Conditioned Simplex Diffusion Paper • 2305.08379 • Published May 15, 2023 • 3