Motoki Wu's picture

Motoki Wu

tokestermw

·

https://motoki.co

AI & ML interests

None yet

Recent Activity

liked a model 29 days ago

nvidia/NVIDIA-Nemotron-3-Super-120B-A12B-FP8

liked a model about 2 months ago

Qwen/Qwen3.5-35B-A3B

liked a model about 2 months ago

Qwen/Qwen3.5-27B

View all activity

Organizations

upvoted a paper about 2 months ago

Experiential Reinforcement Learning

Paper • 2602.13949 • Published Feb 15 • 72

upvoted a paper 3 months ago

Agentic-R: Learning to Retrieve for Agentic Search

Paper • 2601.11888 • Published Jan 17 • 19

upvoted 2 collections 3 months ago

NVIDIA Nemotron v3

Open, Production-ready Enterprise Models • 15 items • Updated 4 days ago • 269

GLM-4.7

3 items • Updated Jan 19 • 65

upvoted a paper 4 months ago

Reinforcement Learning for Self-Improving Agent with Skill Library

Paper • 2512.17102 • Published Dec 18, 2025 • 42

upvoted 3 articles 4 months ago

Article

Nemotron 3 Nano \- A new Standard for Efficient, Open, and Intelligent Agentic Models

Dec 15, 2025

•

111

Article

Apriel-1.6-15b-Thinker: Cost-efficient Frontier Multimodal Performance

Dec 9, 2025

•

84

Article

How We Use Claude Code Skills to Run 1,000+ ML Experiments a Day

Dec 8, 2025

•

57

upvoted an article 5 months ago

Article

Open ASR Leaderboard: Trends and Insights with New Multilingual & Long-Form Tracks

+2

Nov 21, 2025

•

26

upvoted a collection 6 months ago

PromptMII

Prompt-MII: Meta-Learning Instruction Induction for LLMs. Link to paper: https://arxiv.org/abs/2510.16932 • 4 items • Updated Oct 21, 2025 • 2

upvoted a paper 6 months ago

Group Sequence Policy Optimization

Paper • 2507.18071 • Published Jul 24, 2025 • 320

upvoted an article 6 months ago

Article

mem-agent: Equipping LLM Agents with Memory Using RL

Oct 9, 2025

•

33

upvoted a paper 6 months ago

Agentic Context Engineering: Evolving Contexts for Self-Improving Language Models

Paper • 2510.04618 • Published Oct 6, 2025 • 131

upvoted a collection 7 months ago

Qwen3-Omni

6 items • Updated Dec 31, 2025 • 197

upvoted 2 papers 7 months ago

Why Language Models Hallucinate

Paper • 2509.04664 • Published Sep 4, 2025 • 199

The Landscape of Agentic Reinforcement Learning for LLMs: A Survey

Paper • 2509.02547 • Published Sep 2, 2025 • 238

upvoted 3 papers 8 months ago

R-4B: Incentivizing General-Purpose Auto-Thinking Capability in MLLMs via Bi-Mode Annealing and Reinforce Learning

Paper • 2508.21113 • Published Aug 28, 2025 • 110

Breaking the Exploration Bottleneck: Rubric-Scaffolded Reinforcement Learning for General LLM Reasoning

Paper • 2508.16949 • Published Aug 23, 2025 • 24

AgentFly: Fine-tuning LLM Agents without Fine-tuning LLMs

Paper • 2508.16153 • Published Aug 22, 2025 • 162

upvoted a collection 8 months ago

NVIDIA Nemotron V2

Open, Production-ready Enterprise Models. Nvidia Open Model license. • 9 items • Updated 4 days ago • 104