Tianbao Xie PRO

tianbaoxiexxx

https://tianbaoxie.com

AI & ML interests

NLP, AI, RL, Robotics

Recent Activity

updated a dataset 14 days ago

xlangai/ubuntu_osworld_verified_trajs

upvoted a paper about 2 months ago

VideoAgentTrek: Computer Use Pretraining from Unlabeled Videos

upvoted a paper about 2 months ago

Attention Illuminates LLM Reasoning: The Preplan-and-Anchor Rhythm Enables Fine-Grained Policy Optimization

upvoted 3 papers about 2 months ago

VideoAgentTrek: Computer Use Pretraining from Unlabeled Videos

Paper • 2510.19488 • Published Oct 22 • 19

Attention Illuminates LLM Reasoning: The Preplan-and-Anchor Rhythm Enables Fine-Grained Policy Optimization

Paper • 2510.13554 • Published Oct 15 • 57

AutoPR: Let's Automate Your Academic Promotion!

Paper • 2510.09558 • Published Oct 10 • 51

upvoted a collection 3 months ago

Qwen3-Coder

5 items • Updated Jul 31 • 135

upvoted 4 papers 4 months ago

OpenCUA: Open Foundations for Computer-Use Agents

Paper • 2508.09123 • Published Aug 12 • 31

Qwen-Image Technical Report

Paper • 2508.02324 • Published Aug 4 • 263

CoAct-1: Computer-using Agents with Coding as Actions

Paper • 2508.03923 • Published Aug 5 • 14

A Survey of Self-Evolving Agents: On Path to Artificial Super Intelligence

Paper • 2507.21046 • Published Jul 28 • 82

upvoted a collection 5 months ago

OpenCUA: Open Foundations for Computer-Use Agents

These are the official versions of the OpenCUA models and AgentNet datasets. Website: https://opencua.xlang.ai/ • 8 items • Updated 7 days ago • 21

upvoted an article 6 months ago

GRPO for GUI Grounding Done Right

Article • Jun 11 • 35

upvoted 5 papers 6 months ago

Beyond the 80/20 Rule: High-Entropy Minority Tokens Drive Effective Reinforcement Learning for LLM Reasoning

Paper • 2506.01939 • Published Jun 2 • 187

ProRL: Prolonged Reinforcement Learning Expands Reasoning Boundaries in Large Language Models

Paper • 2505.24864 • Published May 30 • 142

One-shot Entropy Minimization

Paper • 2505.20282 • Published May 26 • 6

ZeroGUI: Automating Online GUI Learning at Zero Human Cost

Paper • 2505.23762 • Published May 29 • 45

ScienceBoard: Evaluating Multimodal Autonomous Agents in Realistic Scientific Workflows

Paper • 2505.19897 • Published May 26 • 104

upvoted 3 papers 7 months ago

Fractured Chain-of-Thought Reasoning

Paper • 2505.12992 • Published May 19 • 23

Scaling Computer-Use Grounding via User Interface Decomposition and Synthesis

Paper • 2505.13227 • Published May 19 • 45

Scalable Chain of Thoughts via Elastic Reasoning

Paper • 2505.05315 • Published May 8 • 26

upvoted 2 papers 9 months ago

VisualPRM: An Effective Process Reward Model for Multimodal Reasoning

Paper • 2503.10291 • Published Mar 13 • 36

START: Self-taught Reasoner with Tools

Paper • 2503.04625 • Published Mar 6 • 113