Agent0-VL: Exploring Self-Evolving Agent for Tool-Integrated Vision-Language Reasoning Paper • 2511.19900 • Published 13 days ago • 46
Mimicking the Physicist's Eye: A VLM-centric Approach for Physics Formula Discovery Paper • 2508.17380 • Published Aug 24 • 6
Position: The Hidden Costs and Measurement Gaps of Reinforcement Learning with Verifiable Rewards Paper • 2509.21882 • Published Sep 26
Agent0: Unleashing Self-Evolving Agents from Zero Data via Tool-Integrated Reasoning Paper • 2511.16043 • Published 18 days ago • 104
WebWatcher: Breaking New Frontier of Vision-Language Deep Research Agent Paper • 2508.05748 • Published Aug 7 • 140
Agent KB: Leveraging Cross-Domain Experience for Agentic Problem Solving Paper • 2507.06229 • Published Jul 8 • 75
MMedAgent-RL: Optimizing Multi-Agent Collaboration for Multimodal Medical Reasoning Paper • 2506.00555 • Published May 31 • 1
PhysUniBench: An Undergraduate-Level Physics Reasoning Benchmark for Multimodal Models Paper • 2506.17667 • Published Jun 21 • 4
Thinking with Images for Multimodal Reasoning: Foundations, Methods, and Future Frontiers Paper • 2506.23918 • Published Jun 30 • 89
MMedPO: Aligning Medical Vision-Language Models with Clinical-Aware Multimodal Preference Optimization Paper • 2412.06141 • Published Dec 9, 2024
MJ-VIDEO: Fine-Grained Benchmarking and Rewarding Video Preferences in Video Generation Paper • 2502.01719 • Published Feb 3
MDocAgent: A Multi-Modal Multi-Agent Framework for Document Understanding Paper • 2503.13964 • Published Mar 18 • 20
MMed-RAG: Versatile Multimodal RAG System for Medical Vision Language Models Paper • 2410.13085 • Published Oct 16, 2024 • 24
MMIE: Massive Multimodal Interleaved Comprehension Benchmark for Large Vision-Language Models Paper • 2410.10139 • Published Oct 14, 2024 • 51