Peking University

AI & ML interests

None defined yet.

Recent Activity

zooblastlbz submitted a paper about 10 hours ago

Semantic Routing: Exploring Multi-Layer LLM Feature Weighting for Diffusion Transformers

zooblastlbz submitted a paper about 1 month ago

GRAN-TED: Generating Robust, Aligned, and Nuanced Text Embedding for Diffusion Models

jinlujia authored a paper 4 months ago

Universal Image Restoration Pre-training via Degradation Classification

Papers

Diff4Splat: Controllable 4D Scene Generation with Latent Dynamic Reconstruction Models

Detecting Data Contamination from Reinforcement Learning Post-training for Large Language Models

thkelper authored a paper 3 days ago

SSL: Sweet Spot Learning for Differentiated Guidance in Agentic Optimization

Paper • 2601.22491 • Published 7 days ago • 12

xianbao authored a paper 3 months ago

RoboChallenge: Large-scale Real-robot Evaluation of Embodied Policies

Paper • 2510.17950 • Published Oct 20, 2025 • 8

jinlujia authored 2 papers 4 months ago

Universal Image Restoration Pre-training via Degradation Classification

Paper • 2501.15510 • Published Jan 26, 2025 • 1

Universal Image Restoration Pre-training via Masked Degradation Classification

Paper • 2510.13282 • Published Oct 15, 2025 • 11

Jinfa authored 8 papers 7 months ago

Video-Text as Game Players: Hierarchical Banzhaf Interaction for Cross-Modal Representation Learning

Paper • 2303.14369 • Published Mar 25, 2023

Text-Video Retrieval with Disentangled Conceptualization and Set-to-Set Alignment

Paper • 2305.12218 • Published May 20, 2023

A Survey of Large Language Models in Medicine: Principles, Applications, and Challenges

Paper • 2311.05112 • Published Nov 9, 2023 • 1

LOOK-M: Look-Once Optimization in KV Cache for Efficient Multimodal Long-Context Inference

Paper • 2406.18139 • Published Jun 26, 2024 • 2

Autoregressive Models in Vision: A Survey

Paper • 2411.05902 • Published Nov 8, 2024 • 19

Video-RAG: Visually-aligned Retrieval-Augmented Long Video Comprehension

Paper • 2411.13093 • Published Nov 20, 2024 • 2

QuoTA: Query-oriented Token Assignment via CoT Query Decouple for Long Video Comprehension

Paper • 2503.08689 • Published Mar 11, 2025 • 4

A Survey on Latent Reasoning

Paper • 2507.06203 • Published Jul 8, 2025 • 93

Jinfa authored a paper 8 months ago

OpenS2V-Nexus: A Detailed Benchmark and Million-Scale Dataset for Subject-to-Video Generation

Paper • 2505.20292 • Published May 26, 2025 • 52

huangjy-pku authored a paper 9 months ago

Unveiling the Mist over 3D Vision-Language Understanding: Object-centric Evaluation with Chain-of-Analysis

Paper • 2503.22420 • Published Mar 28, 2025

hkc20 authored 2 papers 11 months ago

JARVIS-VLA: Post-Training Large-Scale Vision Language Models to Play Visual Games with Keyboards and Mouse

Paper • 2503.16365 • Published Mar 20, 2025 • 41

qubvel-hf updated 4 models 12 months ago

PekingU/rtdetr_v2_r101vd

Object Detection • 76.8M • Updated Feb 6, 2025 • 4.71k • 13

PekingU/rtdetr_v2_r50vd

Object Detection • 43M • Updated Feb 6, 2025 • 17.9k • 26

PekingU/rtdetr_v2_r34vd

Object Detection • 31.5M • Updated Feb 6, 2025 • 12k • 7

PekingU/rtdetr_v2_r18vd

Object Detection • 20.2M • Updated Feb 6, 2025 • 137k • 5