Also, please consider using dclm-edu instead of dclm-baseline.
Khietsly Tristan
khtsly
Recent Activity
commented on an article 2 days ago: The Optimal Architecture for Small Language Models
new activity 2 days ago: codelion/dhara-70m:1024 in max_position_embeddings
