AI & ML interests

LLM

Recent Activity

sinwang updated a dataset about 3 hours ago
OpenMOSS-Team/OmniAction
lkdhy new activity about 13 hours ago
OpenMOSS-Team/SciJudge-30B:upload-model-v1
lkdhy new activity about 13 hours ago
OpenMOSS-Team/SciJudge-4B:upload-model-v1

OpenMOSS-Team's collections (18)

MHA2MLA
The MHA2MLA model published in the paper "Towards Economical Inference: Enabling DeepSeek's Multi-Head Latent Attention in Any Transformer-Based LLMs"
MOSS Embodied Planner
MHA2MLA-refactor
The MHA2MLA model published in the paper "Towards Economical Inference: Enabling DeepSeek's Multi-Head Latent Attention in Any Transformer-Based LLMs"