phonemetransformers/IPA-BabyLM
A GPT-2 model trained on the BabyLM 2024 training set using a character-based tokenizer.
Model trained for the paper "From Babble to Words: Pre-Training Language Models on Continuous Streams of Phonemes".
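To illustrate what a character-based tokenizer does in general, here is a minimal sketch in plain Python. It is not the tokenizer shipped with this model: the function names, the `<unk>` token, and the toy corpus are all hypothetical, and the real vocabulary and special tokens may differ.

```python
# Minimal sketch of character-level tokenization (illustrative only;
# not the tokenizer distributed with this model).

def build_vocab(texts):
    """Map each distinct character to an integer id, reserving id 0 for unknowns."""
    chars = sorted({ch for text in texts for ch in text})
    vocab = {"<unk>": 0}  # hypothetical unknown-character token
    for ch in chars:
        vocab[ch] = len(vocab)
    return vocab

def encode(text, vocab):
    """Turn a string into one id per character; unseen characters map to <unk>."""
    return [vocab.get(ch, vocab["<unk>"]) for ch in text]

def decode(ids, vocab):
    """Invert encode() by looking each id up in the reversed vocabulary."""
    inverse = {i: ch for ch, i in vocab.items()}
    return "".join(inverse[i] for i in ids)

# Toy corpus, made up for this example.
corpus = ["the cat sat", "a dog ran"]
vocab = build_vocab(corpus)
ids = encode("the dog", vocab)
```

Each character becomes exactly one token, so the sequence length equals the string length and the vocabulary stays very small compared with a subword tokenizer.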
Base model
openai-community/gpt2