Recovered in Translation: Efficient Pipeline for Automated Translation of Benchmarks and Datasets Paper • 2602.22207 • Published 5 days ago • 35
Lapa v0.1.2 Release Collection Release of SOTA Ukrainian LLM and Datasets • 18 items • Updated Nov 13, 2025 • 28
OmniGEC Collection This is a collection of multilingual silver-standard datasets and models for the task of Grammatical Error Correction (GEC). • 9 items • Updated Sep 19, 2025 • 8
Article Announcing MamayLM, an efficient state-of-the-art Ukrainian LLM Apr 23, 2025 • 62
Ukrainian Text-to-Speech datasets Collection Five voices: Mykyta, Oleksa, Lada, Kateryna, and Tetiana • 6 items • Updated Feb 26, 2025 • 4