Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
jingyaogong
/
MiniMind2-Pytorch
like
8
arxiv:
2405.04434
arxiv:
2402.14905
arxiv:
2401.04088
Model card
Files
Files and versions
xet
Community
fd7b5f7
MiniMind2-Pytorch
3.93 GB
1 contributor
History:
5 commits
jingyaogong
Upload 14 files
fd7b5f7
verified
about 1 year ago
images
Upload 14 files
about 1 year ago
.gitattributes
Safe
2.29 kB
Upload 14 files
about 1 year ago
README.md
Safe
91.1 kB
Upload 2 files
about 1 year ago
README_en.md
Safe
101 kB
Upload 2 files
about 1 year ago
full_sft_512.pth
103 MB
xet
Upload 12 files
about 1 year ago
full_sft_512_zero.pth
103 MB
xet
Upload 12 files
about 1 year ago
full_sft_640_moe.pth
580 MB
xet
Upload 12 files
about 1 year ago
full_sft_768.pth
416 MB
xet
Upload 12 files
about 1 year ago
pretrain_512.pth
103 MB
xet
Upload 12 files
about 1 year ago
pretrain_640_moe.pth
580 MB
xet
Upload 12 files
about 1 year ago
pretrain_768.pth
416 MB
xet
Upload 12 files
about 1 year ago
reason_512.pth
103 MB
xet
Upload 12 files
about 1 year ago
reason_768.pth
416 MB
xet
Upload 12 files
about 1 year ago
rlhf_512.pth
Safe
pickle
Detected Pickle imports (3)
"torch._utils._rebuild_tensor_v2"
,
"torch.FloatStorage"
,
"collections.OrderedDict"
What is a pickle import?
103 MB
xet
Upload 12 files
about 1 year ago
rlhf_640_moe.pth
580 MB
xet
Upload 12 files
about 1 year ago
rlhf_768.pth
Safe
pickle
Detected Pickle imports (3)
"torch._utils._rebuild_tensor_v2"
,
"torch.FloatStorage"
,
"collections.OrderedDict"
What is a pickle import?
416 MB
xet
Upload 12 files
about 1 year ago