Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
deepseek-ai 's Collections
DeepSeek-V3.2
DeepSeek-V3.1
DeepSeek-R1
DeepSeek-V3
DeepSeek-Math
DeepSeek-VL2
Janus
DeepSeek-Prover
DeepSeek-V2
DeepSeekCoder-V2
ESFT
DeepSeek-VL
DeepSeek-Coder
DeepSeek-LLM
DeepSeek-V2.5
DeepSeek-MoE

DeepSeek-MoE

updated 9 days ago

DeepSeek MoE series

Upvote
22

  • deepseek-ai/deepseek-moe-16b-base

    Text Generation • 16B • Updated Jan 12, 2024 • 17.4k • 131

  • deepseek-ai/deepseek-moe-16b-chat

    Text Generation • 16B • Updated Feb 5, 2024 • 14.4k • 150

  • DeepSeekMoE: Towards Ultimate Expert Specialization in Mixture-of-Experts Language Models

    Paper • 2401.06066 • Published Jan 11, 2024 • 58
Upvote
22
  • Collection guide
  • Browse collections
Company
TOS Privacy About Jobs
Website
Models Datasets Spaces Pricing Docs