
Suchir Salhan

suchirsalhan
https://www.suchirsalhan.com/
  • suchirsalhan
  • suchirsalhan
  • ssalhan

AI & ML interests

Multilinguality and Cognitively-Inspired AI. Tokenization, Pretraining, Interpretability & Alignment.

Recent Activity

updated a model 31 minutes ago
Beetle-HumanScale/beetlelm_nld_L1-eng_L2_B1_per_language
published a model 31 minutes ago
Beetle-HumanScale/beetlelm_nld_L1-eng_L2_B1_per_language
updated a model about 5 hours ago
Beetle-HumanScale/beetlelm_nld_L1-eng_L2_balanced

Organizations

  • SomosNLP
  • CLIMB
  • ALTA
  • CLIMB-MAO
  • Pico Language Model
  • ADA-LM
  • Looking to Learn
  • Cambridge-KAIST
  • Cambridge-KAIST2
  • BabyLM Challenge
  • ByteSpan Tokenisers
  • BabyLM Sequence Length
  • ContingentChat
  • Multilingual UnigramLM
  • Beetles
  • RA at ALTA
  • BrainAlign
  • Beetle-Data
  • Beetle-HumanScale
  • Beetle-FineWeb

suchirsalhan's datasets (9)

suchirsalhan/kidalign-llama-filterable

Viewer • Updated 3 days ago • 97.6k • 31

suchirsalhan/kidalign-llama-3.1-8B-Instruct

Updated 3 days ago • 2.39k

suchirsalhan/babylm-detox

Viewer • Updated 9 days ago • 11.6M • 46

suchirsalhan/gptbert-tokenised

Updated Jul 24, 2025 • 5

suchirsalhan/Phonemized-UD

Viewer • Updated May 30, 2025 • 1.19M • 78

suchirsalhan/BabyLM-Pretokenised

Viewer • Updated Jan 31, 2025 • 1.64M • 6

suchirsalhan/MAO-CHILDES

Viewer • Updated Apr 11, 2024 • 3.81M • 5

suchirsalhan/CLiMP

Preview • Updated Apr 2, 2024 • 33 • 1

suchirsalhan/SLING

Viewer • Updated Apr 2, 2024 • 40k • 32