TREX: Automating LLM Fine-tuning via Agent-Driven Tree-based Exploration Paper • 2604.14116 • Published 1 day ago • 5
The Curse and Blessing of Mean Bias in FP4-Quantized LLM Training Paper • 2603.10444 • Published Mar 11 • 11
End-to-End Video Character Replacement without Structural Guidance Paper • 2601.08587 • Published Jan 13 • 8
view post Post 331 QwenLong-L1.5: Post-Training Recipe for Long-Context Reasoning and Memory Management See translation 👍 1 1 + Reply
BAPO: Stabilizing Off-Policy Reinforcement Learning for LLMs via Balanced Policy Optimization with Adaptive Clipping Paper • 2510.18927 • Published Oct 21, 2025 • 85
RoboChallenge: Large-scale Real-robot Evaluation of Embodied Policies Paper • 2510.17950 • Published Oct 20, 2025 • 9
Achieving Sample and Computational Efficient Reinforcement Learning by Action Space Reduction via Grouping Paper • 2306.12981 • Published Jun 22, 2023
Towards Language-Driven Video Inpainting via Multimodal Large Language Models Paper • 2401.10226 • Published Jan 18, 2024 • 2
OMG-Seg: Is One Model Good Enough For All Segmentation? Paper • 2401.10229 • Published Jan 18, 2024 • 1
InternLM-XComposer2-4KHD: A Pioneering Large Vision-Language Model Handling Resolutions from 336 Pixels to 4K HD Paper • 2404.06512 • Published Apr 9, 2024 • 30
MG-LLaVA: Towards Multi-Granularity Visual Instruction Tuning Paper • 2406.17770 • Published Jun 25, 2024 • 19
InternLM-XComposer-2.5: A Versatile Large Vision Language Model Supporting Long-Contextual Input and Output Paper • 2407.03320 • Published Jul 3, 2024 • 94
InternLM-Law: An Open Source Chinese Legal Large Language Model Paper • 2406.14887 • Published Jun 21, 2024
RTMW: Real-Time Multi-Person 2D and 3D Whole-body Pose Estimation Paper • 2407.08634 • Published Jul 11, 2024
An Open and Comprehensive Pipeline for Unified Object Grounding and Detection Paper • 2401.02361 • Published Jan 4, 2024
RTMPose: Real-Time Multi-Person Pose Estimation based on MMPose Paper • 2303.07399 • Published Mar 13, 2023
MIG: Automatic Data Selection for Instruction Tuning by Maximizing Information Gain in Semantic Space Paper • 2504.13835 • Published Apr 18, 2025 • 38