SO-Bench: A Structural Output Evaluation of Multimodal LLMs Paper • 2511.21750 • Published 17 days ago • 5