OpenGVL - Benchmarking Visual Temporal Progress for Data Curation Paper • 2509.17321 • Published Sep 22 • 3
OpenGVL - Benchmarking Visual Temporal Progress for Data Curation Paper • 2509.17321 • Published Sep 22 • 3
OpenGVL - Benchmarking Visual Temporal Progress for Data Curation Paper • 2509.17321 • Published Sep 22 • 3 • 2
Behind Closed Words: Creating and Investigating the forePLay Annotated Dataset for Polish Erotic Discourse Paper • 2412.17533 • Published Dec 23, 2024
Beyond Recognition: Evaluating Visual Perspective Taking in Vision Language Models Paper • 2505.03821 • Published May 3 • 25
BAN-PL: a Novel Polish Dataset of Banned Harmful and Offensive Content from Wykop.pl web service Paper • 2308.10592 • Published Aug 21, 2023
What Matters in Hierarchical Search for Combinatorial Reasoning Problems? Paper • 2406.03361 • Published Jun 5, 2024 • 1
What Matters in Hierarchical Search for Combinatorial Reasoning Problems? Paper • 2406.03361 • Published Jun 5, 2024 • 1
Seeing Through Their Eyes: Evaluating Visual Perspective Taking in Vision Language Models Paper • 2409.12969 • Published Sep 2, 2024 • 1
When All Options Are Wrong: Evaluating Large Language Model Robustness with Incorrect Multiple-Choice Options Paper • 2409.00113 • Published Aug 27, 2024 • 2
When All Options Are Wrong: Evaluating Large Language Model Robustness with Incorrect Multiple-Choice Options Paper • 2409.00113 • Published Aug 27, 2024 • 2