Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Buckets new
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
Omar Kamali's picture
Open to Collab
15 5 11

Omar Kamali PRO

omarkamali
anderbogia's profile picture stellaray777's profile picture zbliss's profile picture
Β·
https://omarkama.li
  • omarkamali
  • omarkamali
  • omar-kamali

AI & ML interests

NLP & LLMs for low resource languages.

Recent Activity

liked a dataset about 7 hours ago
omneity-labs/ipa-dict
repliedto their post 4 days ago
I just might have cracked tokenizer-free LLMs. No vocab, no softmax. I'm training a 22M params LLM rn to test this "thing" and it's able to formulate coherent sentences 🀯 Bear in mind, this is a completely new, tokenizer-free LLM architecture with built-in language universality. Check the explainer video to understand what's happening. Feedback welcome on this approach!
liked a Space 5 days ago
omneity-labs/lid-benchmark
View all activity

Organizations

Masakhane NLP's profile picture Blog-explorers's profile picture Tamazight NLP's profile picture Sawalni AI's profile picture Omneity Labs's profile picture ml-fw-prerelease's profile picture DangGang's profile picture Residuals's profile picture menaio's profile picture wikiwonka's profile picture Wikilangs's profile picture

upvoted a collection 7 days ago

OLDI and friends

Collection
This collection groups the datasets that have been featured as part of WMT’s Open Language Data Initiative shared task. β€’ 5 items β€’ Updated 7 days ago β€’ 5
upvoted a collection 8 months ago

Text Datasets

Collection
25 items β€’ Updated Feb 8 β€’ 3
upvoted an article over 1 year ago
view article
Article

Finding Moroccan Arabic (Darija) in Fineweb 2

Dec 8, 2024
β€’
23
Company
TOS Privacy About Careers
Website
Models Datasets Spaces Pricing Docs