GutenOCR: A Grounded Vision-Language Front-End for Documents Paper • 2601.14490 • Published 19 days ago • 37 • 7
CommonForms: A Large, Diverse Dataset for Form Field Detection Paper • 2509.16506 • Published Sep 20, 2025 • 22
Large Language Models for Page Stream Segmentation Paper • 2408.11981 • Published Aug 21, 2024 • 3
OCR Collection Data and models for optical character recognition • 6 items • Updated 17 days ago • 5