Weida Wang

weidawang

https://davidweidawang.github.io/

davidweida

AI & ML interests

None yet

Recent Activity

upvoted a paper 20 days ago

P1: Mastering Physics Olympiads with Reinforcement Learning

Paper • 2511.13612 • Published 21 days ago • 132

upvoted a paper 28 days ago

Scaling Agent Learning via Experience Synthesis

Paper • 2511.03773 • Published Nov 5 • 80

upvoted a paper about 1 month ago

Supervised Reinforcement Learning: From Expert Trajectories to Step-wise Reasoning

Paper • 2510.25992 • Published Oct 29 • 44

upvoted 5 papers about 2 months ago

Chem-R: Learning to Reason as a Chemist

Paper • 2510.16880 • Published Oct 19 • 52

Glyph: Scaling Context Windows via Visual-Text Compression

Paper • 2510.17800 • Published Oct 20 • 67

DeepAnalyze: Agentic Large Language Models for Autonomous Data Science

Paper • 2510.16872 • Published Oct 19 • 104

A Survey of Reinforcement Learning for Large Reasoning Models

Paper • 2509.08827 • Published Sep 10 • 189

From What to Why: A Multi-Agent System for Evidence-based Chemical Reaction Condition Reasoning

Paper • 2509.23768 • Published Sep 28 • 48

upvoted 3 papers 2 months ago

OffTopicEval: When Large Language Models Enter the Wrong Chat, Almost Always!

Paper • 2509.26495 • Published Sep 30 • 10

MCPMark: A Benchmark for Stress-Testing Realistic and Comprehensive MCP Use

Paper • 2509.24002 • Published Sep 28 • 173

MMR1: Enhancing Multimodal Reasoning with Variance-Aware Sampling and Open Resources

Paper • 2509.21268 • Published Sep 25 • 103

upvoted 2 papers 3 months ago

HiPhO: How Far Are (M)LLMs from Humans in the Latest High School Physics Olympiad Benchmark?

Paper • 2509.07894 • Published Sep 9 • 31

CMPhysBench: A Benchmark for Evaluating Large Language Models in Condensed Matter Physics

Paper • 2508.18124 • Published Aug 25 • 48

upvoted 3 papers 4 months ago

SSRL: Self-Search Reinforcement Learning

Paper • 2508.10874 • Published Aug 14 • 97

IAG: Input-aware Backdoor Attack on VLMs for Visual Grounding

Paper • 2508.09456 • Published Aug 13 • 8

Mol-R1: Towards Explicit Long-CoT Reasoning in Molecule Discovery

Paper • 2508.08401 • Published Aug 11 • 42

upvoted 3 papers 5 months ago

Critic-V: VLM Critics Help Catch VLM Errors in Multimodal Reasoning

Paper • 2411.18203 • Published Nov 27, 2024 • 41

Control-R: Towards controllable test-time scaling

Paper • 2506.00189 • Published May 30 • 6

Consistent Time-of-Flight Depth Denoising via Graph-Informed Geometric Attention

Paper • 2506.23542 • Published Jun 30 • 14

upvoted a paper 6 months ago

VIKI-R: Coordinating Embodied Multi-Agent Cooperation via Reinforcement Learning

Paper • 2506.09049 • Published Jun 10 • 37