PointNet: Deep Learning on Point Sets for 3D Classification and Segmentation Paper • 1612.00593 • Published Dec 2, 2016
COPILOT: Human-Environment Collision Prediction and Localization from Egocentric Videos Paper • 2210.01781 • Published Oct 4, 2022
SceneHGN: Hierarchical Graph Networks for 3D Indoor Scene Generation with Fine-Grained Geometry Paper • 2302.10237 • Published Feb 16, 2023
PartNet: A Large-scale Benchmark for Fine-grained and Hierarchical Part-level 3D Object Understanding Paper • 1812.02713 • Published Dec 6, 2018
World Simulation with Video Foundation Models for Physical AI Paper • 2511.00062 • Published Oct 28, 2025 • 40
Multi3DRefer: Grounding Text Description to Multiple 3D Objects Paper • 2309.05251 • Published Sep 11, 2023
Duoduo CLIP: Efficient 3D Understanding with Multi-View Images Paper • 2406.11579 • Published Jun 17, 2024
Optimizing Test-Time Compute via Meta Reinforcement Fine-Tuning Paper • 2503.07572 • Published Mar 10, 2025 • 47
An Object is Worth 64x64 Pixels: Generating 3D Object via Image Diffusion Paper • 2408.03178 • Published Aug 6, 2024 • 40
Jack of All Trades, Master of Some, a Multi-Purpose Transformer Agent Paper • 2402.09844 • Published Feb 15, 2024 • 21
OBELICS: An Open Web-Scale Filtered Dataset of Interleaved Image-Text Documents Paper • 2306.16527 • Published Jun 21, 2023 • 46