Spaces:
Running
Running
Commit History
Upload from GitHub Actions: updated disclaimer on frontend bbb82e8 verified
Upload from GitHub Actions: Merge pull request #9 from datenlabor-bmz/jn-dev 7c06aef verified
Upload from GitHub Actions: fixed Type Error 71ab1e9 verified
Upload from GitHub Actions: Merge pull request #8 from datenlabor-bmz/jn-dev 3665390 verified
Upload from GitHub Actions: Merge pull request #5 from datenlabor-bmz/jn-dev abd65a6 verified
Upload from GitHub Actions: Fix crashes when searching low-resource languages fe700d4 verified
Upload from GitHub Actions: Exclude TruthfulQA from proficiency score 3fbff09 verified
Upload from GitHub Actions: Get more results, compute average based on all tasks 98c6811 verified
Upload from GitHub Actions: Correlation plot b0aa389 verified
Upload from GitHub Actions: added some transparency to model contribution info 0044d85 verified
Upload from GitHub Actions: Fix linter problems in frontend e8341d2 verified
Upload from GitHub Actions: More models and languages a73f888 verified
Upload from GitHub Actions: Improve UX and style 70582ce verified
Upload from GitHub Actions: Improve UX and style 53d2039 verified
Upload from GitHub Actions: Merge remote changes with local frontend updates 760c6c6 verified
Upload from GitHub Actions: adjusted wording 2367ef4 verified
Upload from GitHub Actions: Merge remote changes and apply terminology updates: Commercial->closed-source, Open->open-source ebaf279 verified
Upload from GitHub Actions: Use task subset for average score b1e5b40 verified
Upload from GitHub Actions: Eavaluate on 40 languages 941d5c5 verified
Upload from GitHub Actions: Make community links work, add CONTRIBUTING 3f60023 verified
Upload from GitHub Actions: Add math benchmarks 549360a verified
Upload from GitHub Actions: Quick fixes 9c2c019 verified
Upload from GitHub Actions: Display N/A scores as such 1e8952a verified
Add GH action for pushing to HF f9431d1
David Pomerenke commited on
Add symbols for progress plot 68e918f
David Pomerenke commited on
Display more language names de40d0a
David Pomerenke commited on
Run on 40 languages, additional models 260c1a3
David Pomerenke commited on
Add scores to world map hover title 3680a5f
David Pomerenke commited on
Change frontend text f046407
David Pomerenke commited on
Fix response when no evals data is available c856043
David Pomerenke commited on
Remove unnecessary function a5cf2d9
David Pomerenke commited on
Add WIP disclaimer 37ec45a
David Pomerenke commited on
Fix: sort copy, not in place 2eeba23
David Pomerenke commited on
Change title and add blurb 58de179
David Pomerenke commited on
Improve plots and dataset table a9e6b9b
David Pomerenke commited on
Add model history plot f52ec6e
David Pomerenke commited on
Add nice cumulative language population plot b54f543
David Pomerenke commited on
Implement MMLU task a683732
David Pomerenke commited on
Add dataset metadata about human/machine translation d8f2dee
David Pomerenke commited on
Refactor score columns 4106f13
David Pomerenke commited on
Translation both from and to 731eddd
David Pomerenke commited on
Add OpenRouter metadata to models 9002fc2
David Pomerenke commited on
Run on 100 languages, adjust display 8274634
David Pomerenke commited on
Dataset table grouping 9051509
David Pomerenke commited on
Adjust font sizes 51cb38c
David Pomerenke commited on
Add Dockerfile 4d13673
David Pomerenke commited on
Fix world map and apply filters for it 92d8154
David Pomerenke commited on
Add logo as PNG 73c776c
David Pomerenke commited on
More concise title 140e08c
David Pomerenke commited on