Kaixin Li

likaixin

https://likaixin2000.github.io/

likaixin2000

AI & ML interests

None yet

Recent Activity

upvoted a paper about 3 hours ago

OPUS: Towards Efficient and Principled Data Selection in Large Language Model Pre-training in Every Iteration

authored a paper 4 days ago

Grounding and Enhancing Informativeness and Utility in Dataset Distillation

upvoted a paper 4 days ago

Grounding and Enhancing Informativeness and Utility in Dataset Distillation

upvoted a paper about 3 hours ago

OPUS: Towards Efficient and Principled Data Selection in Large Language Model Pre-training in Every Iteration

Paper • 2602.05400 • Published 5 days ago • 32

upvoted a paper 4 days ago

Grounding and Enhancing Informativeness and Utility in Dataset Distillation

Paper • 2601.21296 • Published 12 days ago • 18

upvoted a paper about 2 months ago

Step-GUI Technical Report

Paper • 2512.15431 • Published Dec 17, 2025 • 132

upvoted 2 papers 2 months ago

Qwen3-VL Technical Report

Paper • 2511.21631 • Published Nov 26, 2025 • 153

MM-CRITIC: A Holistic Evaluation of Large Multimodal Models as Multimodal Critique

Paper • 2511.09067 • Published Nov 12, 2025 • 2

upvoted 2 papers 3 months ago

Grounding Computer Use Agents on Human Demonstrations

Paper • 2511.07332 • Published Nov 10, 2025 • 106

VCode: a Multimodal Coding Benchmark with SVG as Symbolic Visual Representation

Paper • 2511.02778 • Published Nov 4, 2025 • 101

upvoted a collection 4 months ago

Qwen3-VL

Collection

37 items • Updated Dec 31, 2025 • 621

upvoted 3 papers 4 months ago

BigCodeArena: Unveiling More Reliable Human Preferences in Code Generation via Execution

Paper • 2510.08697 • Published Oct 9, 2025 • 39

MMCode: Evaluating Multi-Modal Code Large Language Models with Visually Rich Programming Problems

Paper • 2404.09486 • Published Apr 15, 2024 • 2

AdamMeme: Adaptively Probe the Reasoning Capacity of Multimodal Large Language Models on Harmfulness

Paper • 2507.01702 • Published Jul 2, 2025 • 4

upvoted an article 4 months ago

Article

BigCodeArena: Judging code generations end to end with code executions

Oct 7, 2025

upvoted a paper 4 months ago

MCPMark: A Benchmark for Stress-Testing Realistic and Comprehensive MCP Use

Paper • 2509.24002 • Published Sep 28, 2025 • 175

upvoted a paper 6 months ago

MCP-Universe: Benchmarking Large Language Models with Real-World Model Context Protocol Servers

Paper • 2508.14704 • Published Aug 20, 2025 • 43

upvoted 2 articles 8 months ago

Article

🤔👀🎬🖥️📖 Kimi-VL-A3B-Thinking-2506: A Quick Navigation

Jun 21, 2025

Article

GRPO for GUI Grounding Done Right

Jun 11, 2025

upvoted a paper 8 months ago

ScreenSpot-Pro: GUI Grounding for Professional High-Resolution Computer Use

Paper • 2504.07981 • Published Apr 4, 2025 • 4

upvoted 2 papers 9 months ago

GraphOmni: A Comprehensive and Extendable Benchmark Framework for Large Language Models on Graph-theoretic Tasks

Paper • 2504.12764 • Published Apr 17, 2025 • 42

Paper2Poster: Towards Multimodal Poster Automation from Scientific Papers

Paper • 2505.21497 • Published May 27, 2025 • 109

upvoted an article about 1 year ago

Article

✴️ ScreenSpot-Pro: GUI Grounding for Professional High-Resolution Computer Use

Jan 3, 2025
