6 38 7

Shangqing Tu

tsq2000

AI & ML interests

None yet

Recent Activity

upvoted a paper 3 days ago

Qwen3-VL Technical Report

upvoted a paper 4 days ago

DeepSeek-V3.2: Pushing the Frontier of Open Large Language Models

upvoted a paper 7 days ago

DR Tulu: Reinforcement Learning with Evolving Rubrics for Deep Research

View all activity

Organizations

upvoted a paper 3 days ago

Qwen3-VL Technical Report

Paper • 2511.21631 • Published 11 days ago • 107

upvoted a paper 4 days ago

DeepSeek-V3.2: Pushing the Frontier of Open Large Language Models

Paper • 2512.02556 • Published 5 days ago • 172

upvoted a paper 7 days ago

DR Tulu: Reinforcement Learning with Evolving Rubrics for Deep Research

Paper • 2511.19399 • Published 13 days ago • 54

upvoted a paper about 1 month ago

Omni-Reward: Towards Generalist Omni-Modal Reward Modeling with Free-Form Preferences

Paper • 2510.23451 • Published Oct 27 • 26

upvoted 2 papers about 2 months ago

Glyph: Scaling Context Windows via Visual-Text Compression

Paper • 2510.17800 • Published Oct 20 • 67

Boundary-Guided Policy Optimization for Memory-efficient RL of Diffusion Large Language Models

Paper • 2510.11683 • Published Oct 13 • 13

upvoted a collection about 2 months ago

LLaDA-8B-BGPO

Collection

Boundary-Guided Policy Optimization for Memory-Efficient RL of Diffusion Large Language Models • 4 items • Updated Oct 11 • 4

New activity in THU-KEG/DeepPrune-Judge-4B about 2 months ago

Update license metadata and add paper abstract

#1 opened about 2 months ago by

nielsr

updated a collection about 2 months ago

DeepPrune

Collection

Parallel Scaling without Inter-trace Redundancy • 3 items • Updated Oct 10 • 1

upvoted a paper about 2 months ago

DeepPrune: Parallel Scaling without Inter-trace Redundancy

Paper • 2510.08483 • Published Oct 9 • 24

commented a paper about 2 months ago

DeepPrune: Parallel Scaling without Inter-trace Redundancy

Paper • 2510.08483 • Published Oct 9 • 24 •

updated a dataset about 2 months ago

THU-KEG/DeepPrune

Preview • Updated Oct 10 • 36 • 1

updated a collection about 2 months ago

DeepPrune

Collection

Parallel Scaling without Inter-trace Redundancy • 3 items • Updated Oct 10 • 1

updated a model about 2 months ago

THU-KEG/DeepPrune-Judge-4B

Text Classification • Updated Oct 11 • 9 • 1

published a model 2 months ago

THU-KEG/DeepPrune-Judge-4B

Text Classification • Updated Oct 11 • 9 • 1

published a dataset 2 months ago

THU-KEG/DeepPrune

Preview • Updated Oct 10 • 36 • 1

upvoted 2 papers 2 months ago

StockBench: Can LLM Agents Trade Stocks Profitably In Real-world Markets?

Paper • 2510.02209 • Published Oct 2 • 52

SIRI: Scaling Iterative Reinforcement Learning with Interleaved Compression

Paper • 2509.25176 • Published Sep 29 • 13

authored a paper 4 months ago

GLM-4.5: Agentic, Reasoning, and Coding (ARC) Foundation Models

Paper • 2508.06471 • Published Aug 8 • 192

Shangqing Tu

AI & ML interests

Recent Activity

Organizations

tsq2000's activity

Update license metadata and add paper abstract