Chain-of-Visual-Thought: Teaching VLMs to See and Think Better with Continuous Visual Tokens Paper • 2511.19418 • Published 12 days ago • 26
ReasonFlux-Coder Collection Coding LLMs excel at both writing code and generating unit tests. • 9 items • Updated May 26 • 11
ReasonFlux Series Collection A series of released reasoning models based on ReasonFlux • 9 items • Updated Oct 31 • 3
LaSeR: Reinforcement Learning with Last-Token Self-Rewarding Paper • 2510.14943 • Published Oct 16 • 39
Information Gain-based Policy Optimization: A Simple and Effective Approach for Multi-Turn LLM Agents Paper • 2510.14967 • Published Oct 16 • 33
From Pixels to Words -- Towards Native Vision-Language Primitives at Scale Paper • 2510.14979 • Published Oct 16 • 65