In-Video Instructions: Visual Signals as Generative Control Paper • 2511.19401 • Published 12 days ago • 29
BlenderFusion: 3D-Grounded Visual Editing and Generative Compositing Paper • 2506.17450 • Published Jun 20 • 64
LlamaFactory: Unified Efficient Fine-Tuning of 100+ Language Models Paper • 2403.13372 • Published Mar 20, 2024 • 172
Gamba: Marry Gaussian Splatting with Mamba for single view 3D reconstruction Paper • 2403.18795 • Published Mar 27, 2024 • 20
MVGamba: Unify 3D Content Generation as State Space Sequence Modeling Paper • 2406.06367 • Published Jun 10, 2024
Poison-splat: Computation Cost Attack on 3D Gaussian Splatting Paper • 2410.08190 • Published Oct 10, 2024 • 1
1000+ FPS 4D Gaussian Splatting for Dynamic Scene Rendering Paper • 2503.16422 • Published Mar 20 • 14
Discrete Diffusion in Large Language and Multimodal Models: A Survey Paper • 2506.13759 • Published Jun 16 • 43
Improve Vision Language Model Chain-of-thought Reasoning Paper • 2410.16198 • Published Oct 21, 2024 • 26
DeepSeekMath: Pushing the Limits of Mathematical Reasoning in Open Language Models Paper • 2402.03300 • Published Feb 5, 2024 • 137
Vista3D: Unravel the 3D Darkside of a Single Image Paper • 2409.12193 • Published Sep 18, 2024 • 10
Vista3D: Unravel the 3D Darkside of a Single Image Paper • 2409.12193 • Published Sep 18, 2024 • 10
Vista3D: Unravel the 3D Darkside of a Single Image Paper • 2409.12193 • Published Sep 18, 2024 • 10 • 2